Abstract
Intracranial aneurysms (IAs) are characterized by localized dilation or ballooning of a cerebral artery. When IAs rupture, blood leaks into the space around the brain to create a subarachnoid hemorrhage. The latter is associated with a higher risk of disability and mortality. The aims of this study were to gain greater insight into the pathogenesis of ruptured IAs, and to clarify whether identified hub genes represent potential biological markers for assessing the likelihood of IA progression and rupture. Briefly, the GSE36791 and GSE73378 datasets from the National Center of Biotechnology Information Gene Expression Omnibus database were reanalyzed and subjected to a weighted gene co-expression network analysis to test the association between gene sets and clinical features. The clinical significance of these genes as potential biomarkers was also examined, with their expression validated by quantitative real-time PCR. A total of 14 co-expression modules and 238 hub genes were identified. In particular, three modules (labeled turquoise, blue, and brown) were found to highly correlate with IA rupture events. Additionally, six potential biomarkers were identified (BASP1, CEBPB, ECHDC2, GZMK, KLHL3, and SLC2A3), which are strongly associated with the progression and rupture of IAs. Taken together, these findings provide novel insights into potential molecular mechanisms responsible for IAs and they highlight the potential for these particular genes to serve as biomarkers for monitoring IA rupture.
1. Introduction
Intracranial aneurysms (IAs) represent a cerebrovascular disorder which affects between 3% and 5% of individuals. There is a potential risk that an IA will rupture, and this risk is higher in the posterior circulation [1–3]. With advances in intracranial imaging technologies, IAs have been detected more frequently. Consequently, clinicians are increasingly confronted with a dilemma regarding the choice of clinical management. Namely, whether preventive treatments (e.g., endovascular or surgical aneurysm repair) which are associated with inherent complication risks, or conservative management with or without follow-up imaging which leaves patients with a small, yet definite, risk of aneurysm rupture, should be applied [3, 4]. When an IA ruptures, a subarachnoid hemorrhage (SAH) develops. This life-threatening clinical condition has an acute mortality rate of approximately 50% [5, 6]. Despite considerable advances in therapy for IAs, SAH remains a highly lethal condition which is associated with a high socioeconomic burden [7–9]. Thus, an ability to identify IAs which have a high risk of rupture and provide timely preventive treatment may be key to the successful management of IAs.
To predict IA rupture, researchers have studied aneurysmal hemodynamics [10, 11], aneurysmal morphology and location [8], genetics [12], and other factors (e.g., cigarette smoking, hypertension, and positive family history for SAH) [13]. Only inflow angle was identified as a significant predictor of rupture according to morphological parameters [8]. However, aneurysm wall inflammation has also been shown to play a pivotal role in aneurysm growth and rupture [2, 3, 14]. Two scoring systems have been established to evaluate the risk of rupture and to guide treatment. These include the PHASES (population, hypertension, age, size of aneurysm, earlier SAH from another aneurysm, and site of aneurysm) system and the UIATS (unruptured IA treatment score) system [8, 15, 16]. However, standardized management of unruptured IAs remains controversial, as risks of prophylactic treatment must be weighed against possible risk of rupture for individual aneurysms [17].
Despite the vast efforts made to date to prevent the rupture of IAs, the mechanisms mediating the pathology of IAs remains largely unknown. In particular, suitable biomarkers to predict IA rupture remain unavailable. Therefore, the aims of the present study were to gain greater insight into the pathogenesis of ruptured IAs and to clarify whether identified hub genes can be used as potential biological markers to assess the likelihood of IA progression and rupture (Fig 1). For these aims, the GSE36791 [18] and GSE73378 [19] datasets from the National Center of Biotechnology Information (NCBI) Gene Expression Omnibus (GEO, http://www.ncbi.nlm.nih.gov/geo/) were reanalyzed. A weighted gene co-expression network analysis (WGCNA) [20] was performed to test possible correlations between gene sets and clinical features of IAs. Clinically significant genes were identified and their expression levels were validated in patients with ruptured versus unruptured IAs by quantitative real-time (qRT)-PCR.
2. Materials and methods
2.1 Microarray dataset
Gene expression profile data from the GSE36791 dataset were used as a training set to construct co-expression networks and to identify hub genes. This dataset was generated from peripheral blood samples collected from 43 patients with SAH due to ruptured IAs, and from 18 individuals with headaches (the reference group) [18]. To verify the results obtained, the GSE73378 dataset was used as a test set. The GSE73378 dataset was generated from peripheral blood samples collected from 103 patients who developed an aneurysmal SAH at least two years prior, and from 107 individuals used as a reference group [19].
2.2 Data processing and WGCNA
The GSE36791 expression data were already normalized with a BeadArray package [21] (quantile normalization). Probe sets were then mapped to gene symbols according to the GPL10558 platform. After filtering the probes without a corresponding gene symbol, the average value of the gene symbols with multiple probes was calculated [22]. Outlier samples were identified with hierarchical cluster analysis (i.e., distance matrices constructed with Pearson's correlation matrices and a hierarchical agglomerative method to adopt average linkages) by using the hclust function in WGCNA [20]. Based on these results, three samples (GSM901111, GSM901112, and GSM901161) in GSE36791 were removed from subsequent analyses.
In the present study, the co-expression analysis performed was based on a WGCNA, which is a systems biology method for describing correlation patterns among genes across microarray samples [20]. Currently, this method is widely used in the international biomedical field. This method also helps identify clusters (modules) of highly correlated genes across samples. To identify ruptured IA-associated co-expression modules and their key constituents, preprocessed data of GES36791 were analyzed. Briefly, Pearson's correlation matrices were generated (average linkage method) for all pair-wise genes. Next, co-expression similarities were transformed into a weighted adjacency matrix of connection strengths by using a power adjacency function (cutoff height of 0.85). This adjacency matrix was then transformed into a topological overlap matrix to measure relative gene interconnectedness and proximity. Finally, gene co-expression modules were identified based on clustering (hierarchical average linkage) according to topology overlap.
2.3 Identification of modules with clinical significance
In order to identify modules related to the clinical traits of IA (group [rupture IA/ reference] and gender), two different approaches were used [20, 23]. The first approach was to determine gene significance (GS) from a linear regression analysis of gene expression data and clinical traits [20]. GS was defined based on a log10 transformation of P-values. Thus, a higher absolute value of GSi corresponds with greater biological significance of gene i [20]. In addition, module significance (MS) is defined as the average GS value for all of the genes in a module, which implies that the higher the absolute value of MSj, the more biologically significant is module j. The second approach was to define module eigengenes (MEs) as a major component in the principal component analysis for each module. Thus, MEs can be considered representative of the gene expression profiles in a module [20]. Correlations between MEs and clinical traits were calculated to identify relevant modules. Modules with a P-value greater than 0.005 were considered to significantly correlate with certain clinical traits (ruptured IA).
2.4 Identification of hub genes and functional enrichment analysis
Hub genes were defined according to module connectivity [absolute value of Pearson's correlation (cor.geneModuleMembership > 0.8)] and clinical trait relationship [absolute value of Pearson's correlation (cor.geneTraitSignificance > 0.5)] [23]. Hub genes were highly interconnected with genes in the module with potential significance. To functionally characterize the identified hub genes, Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis were performed by using clusterProfiler [24]. The terms obtained from the KEGG pathway and GO analyses (including molecular functions, biological processes, and cellular components) which had false discovery rates (FDRs) < 0.05 were considered to be significantly enriched in the hub genes. Common hub genes were also screened from key genes identified in our previous study [22] which were regarded as “candidate biomarkers” highly associated with IA rupture.
2.5 Analysis of the GSE73378 dataset
GSE73378 expression data were already normalized (quantile normalization) by the surrogate variable analysis (SVA) package [19]. Differentially expressed genes (DEGs) (ruptured IAs vs. references) were then screened by using the limma package [25]. The Benjamini and Hochberg FDR [26] was used to adjust for multiple testing. Threshold values of FDR < 0.005 and absolute fold-change > 1.5 (|log2FC| > 0.585) were applied. Hierarchical clustering analyses (i.e., distance matrices constructed with Pearson's correlation matrices and a hierarchical agglomerative method to adopt average linkages) were also performed to validate the expression of hub genes regarded to be candidate biomarkers for IA rupture. Clustering heat maps were generated by using the pheatmap package [27] in R software (version 3.4.2).
2.6 Collection of blood samples
The expression profiles of six common genes considered to be possible biomarkers were validated in peripheral blood samples collected from 30 patients recruited from the Department of Neurosurgery, The First Hospital of Jilin University, China by qRT-PCR. Patient characteristics are provided in Table 1. Written consent was obtained from all participating individuals prior to sample collection and collection was approved by the Medical Ethics Committee of The First Hospital of Jilin University, in accordance with the Declaration of Helsinki.
Table 1. Clinical characteristics of the patient groups who provided blood samples for qRT-PCR analysis.
Reference | Ruptured IA | PaSAH | |
---|---|---|---|
Number | 10 | 10 | 10 |
Age (mean ± SD), years | 54.00 ± 6.88 | 51.70 ± 12.95 | 54.80 ± 10.76 |
Females (%) | 50 | 60 | 60 |
History of smoking (%) | 30 | 40 | 10 |
History of hypertension (%) | 30 | 30 | 30 |
Excessive drinking (%) | 20 | 20 | 0 |
Reference group: patients with unruptured IAs; Ruptured IA group: patients with ruptured IA; PaSAH group: patients with aneurysmal SAH within at least the prior year.
2.7 Validation of hub genes by qRT-PCR
Briefly, whole blood samples were homogenized in Trizol reagent (Invitrogen, Carlsbad, CA, USA) before total cellular RNA was extracted and transcribed to cDNA with a PrimeScript RT Reagent kit (Invitrogen). Quantitative PCR was subsequently performed by using HieffTM qPCR SYBR Green Master Mix (Low Rox Plus; Yisheng Biotech, Shanghai, China) with the MxPro-Mx3005 P Real-time PCR system (Agilent Technologies, Palo Alto, CA, USA). All primer oligos were synthesized by Sangon Biotechnology (Shanghai, China; listed in Table 2). Levels of mRNA expression were standardized to the mRNA level of glyceraldeyhyde 3-phosphate deydrogenase (GAPDH). Relative quantification was subsequently performed according to the comparative threshold cycle (2-ΔΔCT) method.
Table 2. Sequences of the qRT-PCR primers used in this study.
Gene | Direction | Sequence (5′-3′) |
---|---|---|
BASP1 | Forward | GGAAGCGCCTAGTTCCACAC |
Reverse | TTGGTCGGAATTAGCTGCCG | |
CEBPB | Forward | AAGACCGTGGACAAGCACAG |
Reverse | AACAAGTTCCGCAGGGTGC | |
ECHDC2 | Forward | CTTCGTCAGTGAGCTGCTGG |
Reverse | CACGCCCTTCACTCCACTTC | |
GZMK | Forward | ACCCTGCGAGAAGTCACTGT |
Reverse | CTGGCCTTTGGCATCTCCTG | |
KLHL3 | Forward | GAGCAGTACAACCCAGCGAC |
Reverse | GTCCGCTAAGCACTCCAACC | |
SLC2A3 | Forward | TTCAATGCTGATTGTCAACCTG |
Reverse | GCATTTCAACCGACTTAGCTAC | |
GAPDH | Forward | CCAAGGTCATCCATGACAACT |
Reverse | CAGGGATGATGTTCTGGAGAG |
2.8 Statistical analysis
Experimental qRT-PCR data are presented as the mean ± standard error of the mean (SEM). R software (version 3.4.2) was used to perform statistical analyses. Differences between experimental groups (i.e., ruptured IA vs. reference group; past aneurysmal SAH (PaSHA) vs. reference group) were determined by applying unpaired Student’s t-tests.
3. Results
3.1 Weighted co-expression network construction
The distribution of intensity values for the normalized expression data from all of the samples is shown in the box plot of Fig 2A. These results confirm the reliability of the data used. After preprocessing of the data, a total of 31,335 genes were obtained and 58/61 samples with clinical data were included in the co-expression analysis (Fig 2B and 2C). In the WGCNA [20], a soft threshold parameter, β, of the power function is used to ensure that the co-expression network (adjacency matrix) best approximates a scale-free topology [28]. In the present study, a β value of 6 (scale free R2 = 0.85) was used to ensure a scale-free network (Fig 2D). To identify a final set of hub genes, we first screened the top 25% of the 31,335 genes obtained according to variance to establish a co-expression network by using the “WGCNA” package [20] in R software. The genes which exhibited similar expression patterns were grouped into modules by applying average linkage hierarchical clustering. A total of 14 modules were identified and constructed. These modules are represented with different colors (Fig 2E).
3.2 Identification of key modules
First, MS was used to test correlations between each module and IA rupture. As a result, five modules (out of 14 total modules) were found to exhibit markedly higher correlations with IA rupture [Fig 3A; turquoise module (R2 = 0.35), blue module (R2 = 0.31), brown module (R2 = 0.28), green module (R2 = 0.27), and cyan module (R2 = 0.25)]. The second method employing MEs showed that the turquoise, blue, and brown modules exhibited a significant correlation with IA rupture (P < 0.005, Fig 3B). Therefore, the turquoise, blue, and brown modules were identified as key modules and were subjected to further analysis.
3.3 Identification of hub genes and their functional annotations
Hub genes are highly connected in a module and may be essential for certain biological processes. According to the module connectivity and clinical trait relationships evaluated in the present study (see Methods), a total of 238 genes exhibited high connectivity among three modules (190 genes in the turquoise module; 38 genes in the blue module; and 10 genes in the brown module). Consequently, these 238 genes were identified as hub genes (Fig 4A–4C). When all of these hub genes were subsequently subjected to a GO analysis, the most enriched GO terms were found to involve ribosomes and related pathways. Regarding the biological process (BP) terms for the hub genes, these were mainly enriched in rRNA processing, ribonucleoprotein complex biogenesis, rRNA metabolic process, and ribosome biogenesis; for the cellular compartment (CC) terms, the hub genes were mainly enriched in cytosolic ribosome, ribosomal subunit, ribosome, and focal adhesion (S1 and S2 Figs). Similarly, the KEGG pathway analysis also found the ribosome pathway to be enriched for the hub genes (Fig 4D). The top 30 enrichment GO/KEGG terms (listed according to FDRs and category) are presented in Table 3. In our previous study [22], we identified 396 DEGs in the GSE36791 dataset. Nine of these DEGs are considered to be key genes associated with IA rupture: BASP1, CD74, CEBPB, ECHDC2, GZMK, HLA-DRB3, KLHL3, SLC2A3, and SOCS3 [22]. In the present study, the two approaches used to analyze the GSE36791 dataset identified 108 common genes among the hub genes, and also among the DEGs. In particular, six genes (BASP1, CEBPB, ECHDC2, GZMK, KLHL3, and SLC2A3) were included in both our previously published key genes (n = 9) [22] and in the 108 common genes identified from the GSE36971 dataset (Fig 4E). Therefore, these six genes were identified as potential candidate biomarkers and were subjected to further validation.
Table 3. The top 30 functional enrichment terms enriched for the hub genes identified in this study.
Category | ID | Term | Count | P-value | FDR |
---|---|---|---|---|---|
BP | GO:0006364 | rRNA processing | 21 | 1.23E-09 | 1.05E-09 |
BP | GO:0022613 | Ribonucleoprotein complex biogenesis | 26 | 3.75E-09 | 3.21E-09 |
BP | GO:0016072 | rRNA metabolic process | 21 | 3.75E-09 | 3.21E-09 |
BP | GO:0042254 | Ribosome biogenesis | 22 | 3.75E-09 | 3.21E-09 |
BP | GO:0034470 | ncRNA processing | 24 | 3.75E-09 | 3.21E-09 |
BP | GO:0006614 | SRP-dependent cotranslational protein targeting to membrane | 13 | 6.54E-09 | 5.59E-09 |
BP | GO:0006613 | Cotranslational protein targeting to membrane | 13 | 1.13E-08 | 9.64E-09 |
BP | GO:0045047 | Protein targeting to ER | 13 | 1.67E-08 | 1.43E-08 |
BP | GO:0072599 | Establishment of protein localization to endoplasmic reticulum | 13 | 2.47E-08 | 2.11E-08 |
BP | GO:0000956 | Nuclear-transcribed mRNA catabolic process | 15 | 1.51E-06 | 1.29E-06 |
BP | GO:0090150 | Establishment of protein localization to membrane | 16 | 7.43E-06 | 6.35E-06 |
BP | GO:0006401 | RNA catabolic process | 18 | 1.77E-05 | 1.51E-05 |
BP | GO:0050852 | T cell receptor signaling pathway | 12 | 5.27E-05 | 4.51E-05 |
BP | GO:0042110 | T cell activation | 19 | 8.12E-05 | 6.95E-05 |
CC | GO:0022626 | Cytosolic ribosome | 12 | 7.34E-07 | 5.92E-07 |
CC | GO:0044391 | Ribosomal subunit | 13 | 1.18E-05 | 9.49E-06 |
CC | GO:0005840 | Ribosome | 14 | 1.90E-05 | 1.53E-05 |
CC | GO:0005925 | Focal adhesion | 17 | 5.83E-05 | 4.69E-05 |
CC | GO:0005924 | Cell-substrate adherens junction | 17 | 5.83E-05 | 4.69E-05 |
CC | GO:0030055 | Cell-substrate junction | 17 | 5.83E-05 | 4.69E-05 |
CC | GO:0044445 | Cytosolic part | 13 | 6.70E-05 | 5.40E-05 |
CC | GO:0022625 | Cytosolic large ribosomal subunit | 7 | 1.36E-04 | 1.10E-04 |
CC | GO:0001772 | Immunological synapse | 5 | 8.29E-04 | 6.68E-04 |
CC | GO:0015934 | Large ribosomal subunit | 8 | 9.12E-04 | 7.35E-04 |
CC | GO:0008180 | COP9 signalosome | 5 | 9.12E-04 | 7.35E-04 |
CC | GO:0042101 | T cell receptor complex | 4 | 1.12E-03 | 9.06E-04 |
CC | GO:0022627 | Cytosolic small ribosomal subunit | 5 | 2.67E-03 | 2.15E-03 |
CC | GO:1904813 | Ficolin-1-rich granule lumen | 7 | 7.77E-03 | 6.26E-03 |
MF | GO:0003735 | Structural constituent of ribosome | 11 | 6.27E-04 | 5.28E-04 |
KEGG | hsa03010 | Ribosome | 12 | 3.09E-04 | 3.08E-04 |
BP: biological process; CC: cellular component; MF: molecular function; KEGG: Kyoto Encyclopedia of Genes and Genomes; FDR: false discovery rate.
3.4 Validation with the GSE73378 dataset
The six candidate biomarkers were examined with normalized data of the GSE73378 dataset from the GEO database (Table 4). These data derived from 103 blood samples collected from patients with aneurysmal SAH at least two years prior, as well as from 107 blood samples collected from individuals included as a reference group. After data preprocessing, none of the genes exhibited a significant difference in expression (i.e., a FDR < 0.005 and an absolute fold-change > 1.5), consistent with the results of previous studies [19]. Consequently, hierarchical clustering analysis was performed for the six candidate biomarkers among samples of the GSE73378 dataset. No significant systematic variations in gene expression were found in the GSE73378 dataset, in contrast with the GSE36791 dataset (Fig 5A and 5B).
Table 4. Comparison of six hub genes between two datasets.
Hub Gene | GSE36791 Dataset | GSE73378 Dataset | |||
---|---|---|---|---|---|
Module | log2 FC | FDR | log2 FC | FDR | |
BASP1 | Blue | 0.741 | 3.24E-06 | 0.088 | 0.20 |
CEBPB | Blue | 0.608 | 3.88E-05 | 0.078 | 0.18 |
ECHDC2 | Turquoise | -0.796 | 1.12E-05 | -0.058 | 0.20 |
GZMK | Blue | -0.980 | 2.23E-06 | -0.112 | 0.14 |
KLHL3 | Turquoise | -0.618 | 7.02E-08 | 0.025 | 0.33 |
SLC2A3 | Blue | 0.677 | 3.02E-05 | 0.035 | 0.52 |
FC: fold-change; FDR: false discovery rate.
3.5 Validation of candidate biomarkers by qRT-PCR
To validate the bioinformatics analysis results from the GSE36791 and GSE73378 datasets which indicated that six genes (BASP1, CEBPB, ECHDC2, GZMK, KLHL3, and SLC2A3) are extremely associated with IA rupture, three groups of patients were selected for further analysis of these genes. These groups included patients with IA rupture, patients with a previous aneurysmal SAH (PaSAH), and reference patients. As for reference group, it refers to the patient with no IAs in GSE36791 and GSE73378 datasets. To provide a more reliable validation of these six genes which were differentially expressed between the ruptured and unruptured IA groups [22], we selected the unruptured IA patients to serve as the reference group. When the six biomarkers were assayed by using qRT-PCR, significant differences in expression for all six genes were observed between the ruptured IA group and the reference group (Fig 5C). Specifically, BASP1, CEBPB, and SLC2A3 were found to be upregulated, while ECHDC2, GZMK, and KLHL3 were downregulated. In contrast, the biomarkers had no validated differences between the PaSAH group and the reference group, except for "GZMK" (Fig 5D). These results are consistent with those from the microarray data analysis performed for these selected six genes as described above.
4. Discussion
Over the past decade, systems biology approaches have been applied to gene expression datasets with the goal of extracting meaningful information to accompany lists of differentially expressed genes. WGCNA is a widely used data mining method which is especially useful for examining pairwise correlations between variables such as co-expressed genes to provide valuable insights into biological networks [20]. Highly co-expressed genes are connected in a network and can be grouped into modules. In this study, 14 co-expression modules were identified with WGCNA, and then these modules were used to test possible correlations between IAs and clinical traits (i.e., rupture and gender). Within the top three significant modules, the most central and connected genes (the hub genes) were further identified. In a functional analysis of these hub genes, ribosome and related pathways were found to be involved in the rupture of IAs. In our previous study [22], we identified nine key genes associated with IA rupture by screening DEGs of the GSE36791 dataset. In the present study, we established a co-expression network to identify the biological relevance of hub genes by using WGCNA. Six common genes were identified by two methods and were selected from the same dataset. We hypothesize that these six genes which were identified from biologically-relevant hub genes represent valuable biomarkers for the detection and/or treatment of IA rupture.
Among the genes that were found to be upregulated, Brain acid-soluble protein 1 (BASP1) was initially identified as an abundant membrane-bound protein in the brain. Accumulating evidence suggests BASP1 may have roles in neuronal plasticity and axon regeneration [29–31], as well as roles in apoptosis, differentiation, and transcriptional regulation [32, 33]. CCAAT enhancer binding protein beta (CEBPB) has been identified as an epigenetic regulator of a mesenchymal signature. CEBPB also has important roles in inflammation, cell differentiation, and cell proliferation [34, 35]. In a subset of tumors, abnormal expression and/or methylation of BASP1 and CEBPB have been closely related to occurrence and prognosis. For example, downregulation of BASP1 expression via promoter methylation may potentially be used to diagnose hepatocellular carcinoma in its early stages [36]. Conversely, BASP1 is upregulated in cervical cancers and promotes tumorigenicity, thereby identifying it as a novel prognostic factor [29]. In addition, CEBPB is highly expressed in glioblastoma stem cells and has been shown to promote glioma progression by regulating cyclin D1 and inducing distinct resistance to chemotherapy [37, 38]. However, there is little known about possible roles and underlying mechanisms for BASP1 and CEBPB in ruptured IAs.
Solute-carrier family 2A3 (SLC2A3) encodes the protein, GLUT-3, which is a major GLUT subtype for macrophages. GLUT proteins constitute a family of passive glucose transporters which facilitate the uptake of glucose [39, 40]. In abdominal aortic aneurysms (AAAs), Tsuruda et al. [40] demonstrated that GLUT-3, as well as GLUT-1, appear to be the most important isoforms mediating accumulation of 18 fluoro-deoxyglucose (18F-FDG). In addition, GLUT-1 and GLUT-3 immunoreactivity were observed in macrophages in the aneurysmal wall. GLUT-3 expression has also been associated with metalloproteinase (MMP)-9 activity, which is mainly produced by macrophages in aneurysmal tissues. Correspondingly, high tissue MMP-9 activity has been associated with the progression and rupture of AAAs [41]. Thus, it would be of interest to explore whether there is a similar mechanism for IAs.
Among the genes found to be downregulated, enoyl-coA hydratase domain containing 2 (ECHDC2) encodes a mitochondrial protein which contributes to myocardial injury and cell death [42]. Meanwhile, kelch-like3 (KLHL3) promotes substrate ubiquitination of bound proteins via interactions of the BTB domain with the cullin 3 component of a cullin-RING E3 ubiquitin ligase complex [43]. Yoshida et al. [44] further demonstrated that mice which present a pseudohypoaldosteronism type II (PHA II) phenotype (i.e., a hereditary hypertensive disease) exhibit a significant decrease in KLHL3 expression in both kidney and brain tissues, thereby indicating that KLHL3 may contribute to hereditary hypertension. Since hypertension is an important risk factor for aneurysm rupture, abnormal expression of KLHL3 may be associated with the onset and rupture of IAs. Granzymes comprise a subgroup of granule-associated serine proteases among a family of immune defense proteases. These granzymes have important roles in both innate and adaptive immunity [45, 46]. In humans, there are five granzyme genes (GZMA, GZMB, GZMH, GZMK, and GZMM). Similar to granzyme A, granzyme K (GZMK) is a pro-apoptotic serine protease which is secreted from granules. It has been demonstrated that GZMK exhibits pro-inflammatory potential by inducing the secretion of multiple inflammatory factors such as interleukin (IL)-1β, IL-6, IL-8, and monocyte chemoattractant protein-1 (MCP-1). In addition, GZMK activates protease-activated receptor-1 to induce cell proliferation [46–48].
Gene expression profiles of the six genes of interest were validated by using the GSE73378 dataset. There were no significant differences in gene expression levels between the PaSHA and the reference groups, and this was confirmed with qRT-PCR. Overall, the results obtained indicate that BASP1, CEBPB, and SLC2A3 are selectively upregulated, while ECHDC2, GZMK, and KLHL3 are downregulated, in patients with ruptured IAs compared to the reference group. Furthermore, all of the genes, except GZMK, exhibited the same expression pattern in the PaSHA and reference samples. Based on these results, we have demonstrated that these six genes undergo dynamic changes in IAs during the transition from an unruptured stage to a recovery stage (Fig 5E). Correspondingly, we detected increases or decreases in gene expression among these six gene targets during the IA rupture stage, and it is predicted that these levels return to “normal” during the subsequent recovery stage. Therefore, the potential for these six genes to serve as biomarkers for predicting IA rupture may have important clinical implications. Moreover, whether there is a threshold interval of gene expression which can distinguish unruptured IAs from ruptured IAs is also important to consider. Thus, monitoring of these biomarkers in peripheral blood samples from patients with unruptured IAs could provide a guide for clinical treatment.
There were several potential limitations associated with the present study. First, our datasets were downloaded from the GEO database, and thus, some clinical information is absent (e.g., patient age, cigarette smoking status, hypertension, and size and location of IAs). Second, a relatively small sample size was validated by qRT-PCR. Based on these two potential limitations, a prospective observational cohort study will be conducted to demonstrate the potential prediction function of these six biomarkers, with qRT-PCR (mRNA) and detection of proteins in plasma used to characterize the biomarkers. Briefly, we will serially and longitudinally track patients who progress from an unruptured IA to a ruptured IA, and then post-rupture. Ideally, appropriate genetic data, as well as clinical status data, will be collected for a large number of patients.
In conclusion, we used bioinformatics to analyze datasets generated from patients with ruptured IAs in order to identify hub genes which are associated with the progression and rupture of IAs. A total of 238 hub genes exhibited high connectivity with IA rupture. In particular, BASP1, CEBPB, SLC2A3, ECHDC2, GZMK, and KLHL3 were identified and validated by qRT-PCR. Based on these results, we hypothesize that these six genes potentially represent valuable biomarkers for predicting IA rupture, and their validation would improve both clinical management and treatment outcome of patients with unruptured IAs.
Supporting information
Data Availability
The datasets (GSE36791 and GSE73378) generated and analyzed during the present study are available from the GEO database.(http://www.ncbi.nlm.nih.gov/geo/). Other relevant data are within the manuscript.
Funding Statement
This study was supported by the Young Scientists Fund, National Natural Science Foundation of China (Grant no. 81701124).
References
- 1.Brisman JL, Song JK, Newell DW. Cerebral aneurysms. N Engl J Med. 2006;355(9):928–39. 10.1056/NEJMra052760 [DOI] [PubMed] [Google Scholar]
- 2.Chalouhi N, Hoh BL, Hasan D. Review of cerebral aneurysm formation, growth, and rupture. Stroke. 2013;44(12):3613–22. 10.1161/STROKEAHA.113.002390 [DOI] [PubMed] [Google Scholar]
- 3.Etminan N, Rinkel GJ. Unruptured intracranial aneurysms: development, rupture and preventive management. Nat Rev Neurol. 2016;12(12):699–713. 10.1038/nrneurol.2016.150 [DOI] [PubMed] [Google Scholar]
- 4.Gabriel RA, Kim H, Sidney S, McCulloch CE, Singh V, Johnston SC, et al. Ten-year detection rate of brain arteriovenous malformations in a large, multiethnic, defined population. Stroke. 2010;41(1):21–6. 10.1161/STROKEAHA.109.566018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Bederson JB, Connolly ES Jr., Batjer HH, Dacey RG, Dion JE, Diringer MN, et al. Guidelines for the management of aneurysmal subarachnoid hemorrhage: a statement for healthcare professionals from a special writing group of the Stroke Council, American Heart Association. Stroke. 2009;40(3):994–1025. 10.1161/STROKEAHA.108.191395 [DOI] [PubMed] [Google Scholar]
- 6.Li P, Zhang Q, Wu X, Yang X, Zhang Y, Li Y, et al. Circulating microRNAs serve as novel biological markers for intracranial aneurysms. J Am Heart Assoc. 2014;3(5):e000972 10.1161/JAHA.114.000972 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Rivero-Arias O, Gray A, Wolstenholme J. Burden of disease and costs of aneurysmal subarachnoid haemorrhage (aSAH) in the United Kingdom. Cost Eff Resour Alloc: C/E. 2010;8:6 10.1186/1478-7547-8-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Skodvin TO, Evju O, Sorteberg A, Isaksen JG. Prerupture Intracranial Aneurysm Morphology in Predicting Risk of Rupture: A Matched Case-Control Study. Neurosurgery. 2019;84(1):132–40. 10.1093/neuros/nyy010 [DOI] [PubMed] [Google Scholar]
- 9.Johnston SC, Selvin S, Gress DR. The burden, trends, and demographics of mortality from subarachnoid hemorrhage. Neurology. 1998;50(5):1413–8. 10.1212/wnl.50.5.1413 [DOI] [PubMed] [Google Scholar]
- 10.Can A, Du R. Association of Hemodynamic Factors With Intracranial Aneurysm Formation and Rupture: Systematic Review and Meta-analysis. Neurosurgery. 2016;78(4):510–20. 10.1227/NEU.0000000000001083 [DOI] [PubMed] [Google Scholar]
- 11.Brinjikji W, Chung BJ, Jimenez C, Putman C, Kallmes DF, Cebral JR. Hemodynamic differences between unstable and stable unruptured aneurysms independent of size and location: a pilot study. J Neurointerv Surg. 2017;9(4):376–80. 10.1136/neurintsurg-2016-012327 [DOI] [PubMed] [Google Scholar]
- 12.Alg VS, Sofat R, Houlden H, Werring DJ. Genetic risk factors for intracranial aneurysms: a meta-analysis in more than 116,000 individuals. Neurology. 2013;80(23):2154–65. 10.1212/WNL.0b013e318295d751 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Giordan E, Sorenson TJ, Brinjikji W, Vine R, Lanzino G. Risk factors for growth of conservatively managed unruptured intracranial aneurysms. Acta Neurochir (Wien). 2018;160(12):2419–23. [DOI] [PubMed] [Google Scholar]
- 14.Hasan D, Chalouhi N, Jabbour P, Dumont AS, Kung DK, Magnotta VA, et al. Early change in ferumoxytol-enhanced magnetic resonance imaging signal suggests unstable human cerebral aneurysm: a pilot study. Stroke. 2012;43(12):3258–65. 10.1161/STROKEAHA.112.673400 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Greving JP, Wermer MJ, Brown RD Jr., Morita A, Juvela S, Yonekura M, et al. Development of the PHASES score for prediction of risk of rupture of intracranial aneurysms: a pooled analysis of six prospective cohort studies. Lancet Neurol. 2014;13(1):59–66. 10.1016/S1474-4422(13)70263-1 [DOI] [PubMed] [Google Scholar]
- 16.Etminan N, Brown RD Jr., Beseoglu K, Juvela S, Raymond J, Morita A, et al. The unruptured intracranial aneurysm treatment score: a multidisciplinary consensus. Neurology. 2015;85(10):881–9. 10.1212/WNL.0000000000001891 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Brown RD Jr., Broderick JP. Unruptured intracranial aneurysms: epidemiology, natural history, management options, and familial screening. Lancet Neurol. 2014;13(4):393–404. 10.1016/S1474-4422(14)70015-8 [DOI] [PubMed] [Google Scholar]
- 18.Pera J, Korostynski M, Golda S, Piechota M, Dzbek J, Krzyszkowski T, et al. Gene expression profiling of blood in ruptured intracranial aneurysms: in search of biomarkers. J Cereb Blood Flow Metab. 2013;33(7):1025–31. 10.1038/jcbfm.2013.37 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.van 't Hof FN, Ruigrok YM, Medic J, Sanjabi B, van der Vlies P, Rinkel GJ, et al. Whole Blood Gene Expression Profiles of Patients with a Past Aneurysmal Subarachnoid Hemorrhage. PloS one. 2015;10(10):e0139352 10.1371/journal.pone.0139352 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC bioinformatics. 2008;9:559 10.1186/1471-2105-9-559 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Dunning MJ, Smith ML, Ritchie ME, Tavare S. beadarray: R classes and methods for Illumina bead-based data. Bioinformatics (Oxford, England). 2007;23(16):2183–4. [DOI] [PubMed] [Google Scholar]
- 22.Wang Q, Chen X, Yi D, Song Y, Zhao YH, Luo Q. Expression profile analysis of differentially expressed genes in ruptured intracranial aneurysms: In search of biomarkers. Biochem Biophys Res Commun. 2018;506(3):548–56. 10.1016/j.bbrc.2018.10.117 [DOI] [PubMed] [Google Scholar]
- 23.Yuan L, Chen L, Qian K, Qian G, Wu CL, Wang X, et al. Co-expression network analysis identified six hub genes in association with progression and prognosis in human clear cell renal cell carcinoma (ccRCC). Genom Data. 2017;14:132–40. 10.1016/j.gdata.2017.10.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. Omics: a journal of integrative biology. 2012;16(5):284–7. 10.1089/omi.2011.0118 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43(7):e47 10.1093/nar/gkv007 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B (Methodol). 1995; 57(1):289–300. [Google Scholar]
- 27.Kolde R (2012) pheatmap: Pretty Heatmaps. R package version 1.0.12.
- 28.Zhang B, Horvath S. A general framework for weighted gene co-expression network analysis. Stat Appl Genet Mol Biol. 2005;4:Article17. [DOI] [PubMed] [Google Scholar]
- 29.Tang H, Wang Y, Zhang B, Xiong S, Liu L, Chen W, et al. High brain acid soluble protein 1(BASP1) is a poor prognostic factor for cervical cancer and promotes tumor growth. Cancer Cell Int. 2017;17:97 10.1186/s12935-017-0452-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zhou L, Fu L, Lv N, Liu J, Li Y, Chen X, et al. Methylation-associated silencing of BASP1 contributes to leukemogenesis in t(8;21) acute myeloid leukemia. Exp Mol Med. 2018;50(4):44 10.1038/s12276-018-0067-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Korshunova I, Caroni P, Kolkova K, Berezin V, Bock E, Walmod PS. Characterization of BASP1-mediated neurite outgrowth. J Neurosci Res. 2008;86(10):2201–13. 10.1002/jnr.21678 [DOI] [PubMed] [Google Scholar]
- 32.Green LM, Wagner KJ, Campbell HA, Addison K, Roberts SG. Dynamic interaction between WT1 and BASP1 in transcriptional regulation during differentiation. Nucleic acids Res. 2009;37(2):431–40. 10.1093/nar/gkn955 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Sanchez-Nino MD, Sanz AB, Lorz C, Gnirke A, Rastaldi MP, Nair V, et al. BASP1 promotes apoptosis in diabetic nephropathy. J Am Soc Nephrol: JASN. 2010;21(4):610–21. 10.1681/ASN.2009020227 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Carro MS, Lim WK, Alvarez MJ, Bollo RJ, Zhao X, Snyder EY, et al. The transcriptional network for mesenchymal transformation of brain tumours. Nature. 2010;463(7279):318–25. 10.1038/nature08712 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Deng Y, Zhu W, Zhou X. Immune Regulatory Genes Are Major Genetic Factors to Behcet Disease: Systematic Review. Open Rheumatol J. 2018;12:70–85. 10.2174/1874312901812010070 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Moribe T, Iizuka N, Miura T, Stark M, Tamatsukuri S, Ishitsuka H, et al. Identification of novel aberrant methylation of BASP1 and SRD5A2 for early diagnosis of hepatocellular carcinoma by genome-wide search. Int J Oncol. 2008;33(5):949–58. [PubMed] [Google Scholar]
- 37.Luedi MM, Singh SK, Mosley JC, Hatami M, Gumin J, Sulman EP, et al. A Dexamethasone-regulated Gene Signature Is Prognostic for Poor Survival in Glioblastoma Patients. J Neurosurg Anesthesiol. 2017;29(1):46–58. 10.1097/ANA.0000000000000368 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Du C, Pan P, Jiang Y, Zhang Q, Bao J, Liu C. Microarray data analysis to identify crucial genes regulated by CEBPB in human SNB19 glioma cells. World J Surg Oncol. 2016;14(1):258 10.1186/s12957-016-0997-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Yan N. A Glimpse of Membrane Transport through Structures-Advances in the Structural Biology of the GLUT Glucose Transporters. J Mol Biol. 2017;429(17):2710–25. 10.1016/j.jmb.2017.07.009 [DOI] [PubMed] [Google Scholar]
- 40.Tsuruda T, Hatakeyama K, Nagamachi S, Sekita Y, Sakamoto S, Endo GJ, et al. Inhibition of development of abdominal aortic aneurysm by glycolysis restriction. Arterioscler Thromb, Vasc Biol. 2012;32(6):1410–7. [DOI] [PubMed] [Google Scholar]
- 41.Wilson WR, Anderton M, Schwalbe EC, Jones JL, Furness PN, Bell PR, et al. Matrix metalloproteinase-8 and -9 are increased at the site of abdominal aortic aneurysm rupture. Circulation. 2006;113(3):438–45. 10.1161/CIRCULATIONAHA.105.551572 [DOI] [PubMed] [Google Scholar]
- 42.Du J, Li Z, Li QZ, Guan T, Yang Q, Xu H, et al. Enoyl coenzyme a hydratase domain-containing 2, a potential novel regulator of myocardial ischemia injury. J Am Heart Assoc. 2013;2(5):e000233 10.1161/JAHA.113.000233 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Ferdaus MZ, McCormick JA. The CUL3/KLHL3-WNK-SPAK/OSR1 pathway as a target for antihypertensive therapy. Am J Physiol Renal Physiol. 2016;310(11):F1389–96. 10.1152/ajprenal.00132.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Yoshida S, Araki Y, Mori T, Sasaki E, Kasagi Y, Isobe K, et al. Decreased KLHL3 expression is involved in the pathogenesis of pseudohypoaldosteronism type II caused by cullin 3 mutation in vivo. Clin Exp Nephrol. 2018;22(6):1251–7. 10.1007/s10157-018-1593-z [DOI] [PubMed] [Google Scholar]
- 45.Ahmad J, Bird PI, Kaiserman D. Analysis of the evolution of granule associated serine proteases of immune defence (GASPIDs) suggests a revised nomenclature. Biol Chem. 2014;395(10):1253–62. 10.1515/hsz-2014-0174 [DOI] [PubMed] [Google Scholar]
- 46.Joeckel LT, Allison CC, Pellegrini M, Bird CH, Bird PI. Granzyme K-deficient mice show no evidence of impaired antiviral immunity. Immunol Cell Biol. 2017;95(8):676–83. 10.1038/icb.2017.35 [DOI] [PubMed] [Google Scholar]
- 47.Joeckel LT, Wallich R, Martin P, Sanchez-Martinez D, Weber FC, Martin SF, et al. Mouse granzyme K has pro-inflammatory potential. Cell Death Differ. 2011;18(7):1112–9. 10.1038/cdd.2011.5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Cooper DM, Pechkovsky DV, Hackett TL, Knight DA, Granville DJ. Granzyme K activates protease-activated receptor-1. PloS one. 2011;6(6):e21484 10.1371/journal.pone.0021484 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets (GSE36791 and GSE73378) generated and analyzed during the present study are available from the GEO database.(http://www.ncbi.nlm.nih.gov/geo/). Other relevant data are within the manuscript.