Skip to main content
Computational and Mathematical Methods in Medicine logoLink to Computational and Mathematical Methods in Medicine
. 2014 Jun 15;2014:891761. doi: 10.1155/2014/891761

Dynamic Regulatory Network Reconstruction for Alzheimer's Disease Based on Matrix Decomposition Techniques

Wei Kong 1,*, Xiaoyang Mou 2, Xing Zhi 1, Xin Zhang 1, Yang Yang 3
PMCID: PMC4082865  PMID: 25024739

Abstract

Alzheimer's disease (AD) is the most common form of dementia and leads to irreversible neurodegenerative damage of the brain. Finding the dynamic responses of genes, signaling proteins, transcription factor (TF) activities, and regulatory networks of the progressively deteriorative progress of AD would represent a significant advance in discovering the pathogenesis of AD. However, the high throughput technologies of measuring TF activities are not yet available on a genome-wide scale. In this study, based on DNA microarray gene expression data and a priori information of TFs, network component analysis (NCA) algorithm is applied to determining the TF activities and regulatory influences on TGs of incipient, moderate, and severe AD. Based on that, the dynamical gene regulatory networks of the deteriorative courses of AD were reconstructed. To select significant genes which are differentially expressed in different courses of AD, independent component analysis (ICA), which is better than the traditional clustering methods and can successfully group one gene in different meaningful biological processes, was used. The molecular biological analysis showed that the changes of TF activities and interactions of signaling proteins in mitosis, cell cycle, immune response, and inflammation play an important role in the deterioration of AD.

1. Introduction

Alzheimer's disease (AD) is a neurodegenerative disease with an insidious onset and progressive stage that inevitably leads to death. The disease progression of AD is slow, and it may take several years from onset of cognitive decline to diagnosis. Although several hypotheses have been proposed and many putative AD susceptibility genes have been witnessed in the past decades, the genetics mechanism and pathogenesis of AD are still unclear. Discovering the changes of gene expressions, transcriptional factors (TFs), and the transcriptional regulatory mechanism, which maps out the coordinated dynamic response of TFs and TGs, would provide a significant advance in genome-wide analysis of AD.

The characteristic pathology change in AD is fibrin deposition in cerebral cortex, and it is the deposition of beta-amyloid (Aβ) in cell space and poly-Tau protein in cell. In pathomorphism, the expressions are senile plaques (SP), neurofibrillary tangles (NFT), cerebrovascular amyloid, dystrophic neuritis, and loss of synaptic connections. Neuroinflammation, as well as dysregulation of lipid metabolism and mitochondrial dysfunction, can also be observed [1]. Currently, the main-stream theory regarding the disease mechanism is the amyloid cascade hypothesis [2]. Recent fast development of high throughput technologies such as DNA microarray technology and statistically computational tools [3] enables large-scale measurements of biological signals to discover critical genes, coregulated gene groups, and transcriptional regulatory network for AD. Ray et al. found 18 signaling proteins, which can efficiently classify AD and control subjects, in blood. Biological analysis of the 18 proteins points to systemic dysregulation of hematopoiesis, immune responses, apoptosis, and neuronal support in presymptomatic AD [4]. Ray et al. identified 6 coexpressed gene modules, each of which represented a biological process perturbed in AD [5]. By combining array analysis and quantitative trait loci (QTL) mapping to characterize the genetic variation and genetic regulatory network, Wang et al. identified many AD-related genes coregulating with App including Gsk3b, Falz, Mef2a, Tlk2, Rtn, and Prkca [6]. Zhang et al. found regulators of tmem59 and reconstructed gene regulatory networks of mouse neural stem cells. 16 out of 36 predicted genes, including Ace, aqp1, arrdc3, cd14, cd59a, cds1, cldn1, cox8b, defb11, folr1, gdi2, mmp3, mgp, myrip, Ripk4, rnd3, and sncg in their constructed network, have been reported to be AD related [7].

Furthermore, to overcome the underlying shortcomings of microarray technology such as small sample size, measurement error, and information insufficiency, some other high throughput technologies, such as protein-protein interaction (PPI), transcriptional factors (TFs), and microRNA knowledge, have been integrated and become more informative and powerful for discovering AD's mechanism. Liang et al. integrated gene expression profiles with prior protein-protein interaction (PPI) network information to reveal significantly perturbed subnetwork in 6 brain regions [8]. Panigrahi and Singh identified novel gene variants in various biological processes, new information for network motif, groups of transcriptional factors (TFs), and miRNA targets to demonstrate that there are extensive links between AD and ageing (AG) [9].

As a progressively neurodegenerative disease in the brain, finding out what kinds of genes crucially change and how TFs dynamically regulate their TGs in the degenerative progress of AD would provide a great achievement in discovering pathogenesis of AD. Since the high throughput technologies of measuring TF activities on the genome-scale are much more limited, the challenge now is to construct the dynamic transcriptional regulatory networks with the help of statistically computational tools. In our study, network component analysis (NCA), a method for determining both the activities and regulatory influence on a set of TFs with their known TGs [10], has been used on degenerative progress of AD samples from incipient to moderate and severe AD. NCA models the expression of a gene as a linear combination of the activities of TFs and regulatory strength from a set of gene expression data and a priori given connectivity information between TFs and their target genes. It has been successfully applied to identifying previously unnoticed oscillatory activity patterns in the yeast cell cycle [10], building network to lipopolysaccharide (LPS) in humans [11], generating a predicted activation time stage of catabolite repressor protein in Escherichia coli [12], predicting activities of important transcription factors in a mouse knockout model of human glycerol kinase deficiency, and so on [1317].

Two inputs: gene expression profiles and a predefined regulatory influence matrix which qualitatively provides the initial estimates of the influence of each TF on the TGs, are required by NCA model. In our study, independent component analysis (ICA) is adopted to extract significant genes from biologically meaningful patterns from different stages of AD gene expression data. Compared with the traditional clustering methods, such as k-mean, self-organizing maps (SOM), and hierarchical clustering, which can group each gene in only one class based on the global similarities of the expression profiles, ICA is a biclustering method which can successfully group one gene in different meaningful biological processes and has been successfully applied to microarray data for feature extraction [18, 19], clustering and the classification on yeast cells' cycle [20], and cancer data such as ovarian cancer [21], breast cancer [22], endometrial cancer [23], colon and prostate cancer [24, 25], and acute myeloid leukemia [26]. By combining a priori information between TFs and TGs with significant genes selected by ICA, the selected gene expression profiles and the predefined regulatory influence matrix can be prepared. Finally, we calculate the activities of 10 TFs and regulatory influence on 34 TGs with 71 interactions, using NCA for control, incipient, moderate, and severe AD gene expression data. The dynamic regulatory networks were then successfully built for degenerative progress of AD. Molecular biological analysis demonstrated that the changes of TF activities and their influence on TGs played an important role in AD's onset and deterioration, and the results added additional insights into discovering pathogenesis of AD.

2. Methods

2.1. Independent Component Analysis

Let the n × m matrix X denote the microarray gene expression data with m genes under n samples or conditions. x ij in X is the expression level of the jth gene in the ith sample. In gene expression datasets, the number of genes m is much larger than that of the samples n, mn. Suppose that the data have been preprocessed and normalized; that is, each sample has zero mean and standard deviation, and then the ICA model for gene expression data can be expressed as

X=AS. (1)

In the ICA model of microarray data, the columns of A = [a 1, a 2, …, a n] are the n × n latent vectors of the gene microarray data, S denotes the n × m gene signature matrix or expression mode, in which, the rows of S are statistically independent on each other, and the gene profiles in X are considered to be linear mixture of statistically independent components S combined by an unknown mixing matrix A. To obtain S and A, the demixing model can be expressed as

Y=WX, (2)

where W is an n × n demixing matrix.

The gene expression data provided by microarray technology are considered linear combination of some independent components of specific biological interpretations. The nth row matrix A contained the weights with which the expression levels of the m genes contribute to the nth observed expression profile. Therefore, the assignment for the observed expression profiles with different classes is valid for the rows of A, and each column of A can be associated with one specific expression mode. Since the nth column of A contains the weights with which s n contributes to all observations, this column should show large or small entries according to the class labels. After such characteristically latent variables have been obtained, the corresponding elementary modes can be identified to yield useful information for classification. In addition, the distribution of gene expression levels generally features a small number of significantly overexpressed or underexpressed genes that form very biologically coherent groups and may be interpreted in terms of regulatory pathways.

2.2. Network Component Analysis

NCA is a tool for analyzing gene expression data of dynamic transcriptional networks. It models the expression of a gene as a linear combination of the activity of each TF that controls the expression of the gene [10]. NCA uses transcription network connectivity to deduce transcription factor activities (TFAs) and TF-gene regulation control strengths (CS) from gene expression data. The following transcription regulation model is used:

Ei(t)Ei(0)=(TFAj(t)TFAj(0))CSij, (3)

where E i(t) is the expression level of gene i, TFAj(t), j = 1, …, L, is the activity level of TF j, CSij represents the control strength of TF j on gene i, and (t) and (0) designate condition t and reference condition 0. Log-linear transformation as a standard tool is used to approximate this nonlinear system. The matrix form of (3) after taking the logarithm is shown as follows:

[E]=[C][P]+Γ. (4)

Here [E]  (N × M) is a gene expression matrix of N genes in M samples, matrix [C]  (N × L) represents the connectivity strength of each transcriptional factor (TF) j on target gene i, matrix [P]  (L × M) denotes TFAs on M samples, N is the number of genes, L is the numbers of TFs, M is the number of experiments, and Γ is the residual of the model. The element c ij in matrix [C] is set to 0 if there is no evidence to suggest regulation of gene i by TFj; otherwise, it is set to a nonzero number as an initial value.

As the decomposition of [E] into component matrices is inherently nonunique, Liao et al. proved that if [C] and [P] satisfy the uniqueness criteria, NCA can guarantee a unique solution up to a scaling factor for any given residual Γ. This criterion clearly links NCA results to the biological system and makes interpretation straightforward.

To find the best solution of (4), the least-square algorithm is performed:

min||[E][C][P]||2,s.t.  CZ0, (5)

where Z 0 is the topology induced by the network connectivity pattern. Then the actual estimation of [C] and [P] is performed by a two-step alternating least-squares algorithm which exploits the biconvexity properties of linear decompositions. The least-square constrain is equivalent to a maximum-likelihood procedure in the presence of Gaussian noise with independent and identically distributed component. For details see [10] by Liao et al. (2003).

To perform NCA, two inputs are needed. One is gene expression profiles of TGs, matrix [E]. The other is matrix [C 0], a predefined regulatory influence matrix which provides the initial estimates of the influence of each TF on its TGs. The original biologically qualitative regulatory influence of TFs together with a set of known TGs is obtained from the TRANSFAC public dataset of BIOBASE (http://www.gene-regulation.com). There are 67375 regulatory interactions including 1972 TFs and 6561 TGs without overlap in this dataset. To define key TFs and regulatory network of AD, this TF-TG regulatory interaction dataset is matched with the significant genes extracted by ICA and differentially expressed in all incipient, moderate, and severe AD gene datasets. 10 TFs with their TGs and in total 71 interactions are finally chosen for NCA model. Then, matrix [E] is the gene expression of the TGs in different stages of AD samples, and the initial value of matrix [C 0] is set to 1 if there is connectivity strength of TGi by TFj; otherwise, it is set to 0.

3. Results and Discussion

3.1. ICA Results

To evaluate ICA applied to AD DNA gene expression data, we use the dataset of hippocampal gene expression of control and AD samples from GEO DataSets, series GSE1297, offered by Blalock et al. [27]. The hippocampal specimens they used are obtained through the Brain Bank of the Alzheimer's Disease Research Center at the University of Kentucky. The human Gene Chips (HG-U133A) of Affymetrix and Microarray Suite 5 are used to analyze the microarray data. The procedures for total RNA isolation, labeling, and microarray are described in [27, 28]. There are in total 9 control, 7 incipient, 8 moderate, and 7 severe AD samples included in this dataset with 22283 gene expressions in each sample.

According to Kaissi and his colleagues [3], significance analysis of microarrays (SAM) is better than fold change (FC) and t-test in terms of sensitivity and specificity. In our study, to reduce significant noises, SAM was applied to all 31 samples as a preprocessing method. Three groups were built for ICA feature gene extraction, respectively: control-incipient AD (C-I), control-moderate AD (C-M), and control-severe AD (C-S) samples. After SAM preprocessing, around 4500–5000 genes were reserved for each group. FastICA, presented by Hyvärinen and Oja [29], as the fastest ICA method, was applied to 3 groups to discover significant genes between control and different courses of AD, respectively.

In FastICA algorithm, nonlinear function g(u) = tanh(a1∗u), where a1 was a constant, was used as the probability density distribution of the outputs u during the iteration. As the FastICA algorithm relied on random initializations for its maximization and had the problem of convergence to local optima, we iterated FastICA 50 times to alleviate the instability of the slightly different results in each iteration. For each IC in each time, significant genes were not the same, and we selected top hundreds of genes as significant genes by calculating the number of times for 50 times. FastICA identified 245, 268, and 324 significant genes for C-I, C-M, and C-S group, respectively. By integrating the genes of these 3 different stages of AD and excluding overlapping part, we finally extracted 740 genes as significant genes for the whole AD dataset.

From the biological analysis of FastICA results we found that the significant genes and their relevant pathways were related in immunoreactions, metal protein, membrane protein, lipoprotein, neuropeptide, cytoskeleton protein, binding protein, and ribosomal protein, playing prominent roles in AD and connecting the activation patterns with AD phenotypes. FastICA also found that many oncogenes and phosphoric proteins were significantly lowly expressed in all the 3 courses of AD.

To be compared with ICA results, here we present our previous experimental results of nonnegative matrix factorization (NMF) to analyze the efficiency of different biclustering methods on AD gene expression data [30]. NMF is a matrix factorization method, which assumes that the given gene expression data are combined of positive metagenes with meaningful local biological representation, and represents the original data as a linear combination into reduced sets based on nonnegative constrain. By performing NMF, more than 1500 significant genes were extracted. Lots of them were related to metal metabolism and inflammation; for instance, many upregulated genes were found to be in conjunction with zinc and calcium (Ca (2+)). Furthermore, NMF also identified the genes related to cell growth, cell cycle, apoptosis, cellular fission, and cell repair. The shortcomings of NMF method were that the number of significant genes was hard to reduce, and the method focused on meaningful biological processes due to the unapparent difference among the metagenes.

Another matrix decomposition method and principle component analysis (PCA) on the control and severe samples of the same AD microarray dataset in our previous research [31] were presented here. In the experiment, control samples were modeled by PCA and represented each gene as a linear combination of the dominant principal components (PCs). Then the severe AD samples were projected on the PCA model and the scores can be extracted. The 100 most different genes between the scores obtained by control data and severe AD data were selected for further biological analysis. The results showed that they contained genes in immunoreactions, metal protein, membrane protein, lipoprotein, neuropeptide, cytoskeleton protein, binding protein, ribosomal protein, and phosphoric protein. The limitation of PCA model is that the number of significant genes, which can be extracted, is much fewer than that of ICA.

In summary, the molecular biological analysis of the significant gene extraction confirmed the added value of ICA over NMF and PCA methods in identifying known and novel genes in biological processes. Our results indicate that ICA enables researchers to extract potentially relevant gene expression information from microarray gene expression data and map closer to AD pathways. Moreover, all these 3 methods are based purely on statistical constraints and they do not use any biological knowledge or any transcriptional regulatory structures; therefore, their results cannot contain biologically transcriptional regulatory networks.

3.2. NCA Results

The second stage in this study was to find the activities of TFs and regulatory influence on TGs for 3 courses of AD by NCA. We used the biologically qualitative regulatory influence of TFs and genes from the TRANSFAC public dataset of BIOBASE (http://www.gene-regulation.com) to find the TF genes in ICA results. 23 TFs with their TGs were found in the selected significant genes. Then top 10 TFs, regulating TGs with the number exceeding 13, were selected. These 10 TFs with 34 TGs without overlap and 71 interactions were provided for NCA model. Table 1 shows these 10 TFs with their TGs, chromosome locations, and theirpromoter ID, which will add further information to discovering pathogenesis of AD. Two inputs were prepared for NCA model: one was matrix [E], which presented the gene expression profiles of TGs provided by the original gene expression data of AD, and the other was predefined initial matrix [C 0] which reflected the relative contribution of the TFs on TGs. Then NCA was applied to control, incipient, moderate, and severe AD datasets, respectively. Take the incipient AD samples as an example, in which [E] was a 34 × 7 matrix which denoted the microarray expression profiles of 34 TGs in 7 incipient AD samples. Matrix [C 0] (34 × 10) represented the predefined connectivity strength by the 71 interactions obtained above, and the element c ij in matrix [C 0] was set to 0 if there was no connectivity strength of TG i by TFj; otherwise, it was set to 1 as an initial value. Matrix [P] (10 × 7) contained the TFAs of 10 TFs on 7 samples. After performing NCA, the TF activities (matrix [P]) and the control strength (matrix [C]) were quantitatively obtained for different courses of AD.

Table 1.

Selected TFs and the target genes.

Selected TFs Description Chromosome locations and promoter ID Target genes Number of TGs
ZNF207 Zinc finger protein 207 17q12,  
promoter ID: 17074
GNB1, CLIP1, SFRS18, CSNK1D, SFPQ, CSNK1A1, ASNS, ZBTB20, TUBB, TM2D1, TIMM17A, ENO1, ASH1L, NONO, KPNA2, REEP5, NSF, HSPD1, UBE2D3, DICER1, and G3BP1 21

BUB3 BUB3 budding uninhibited by benzimidazoles 3 homolog 10q26 
promoter ID: 5046
TARDBP, ATP1B1, SFPQ, GNB1, CLIP1, SET, ARPC5L, NCOA1, NOL5A, PPIA, CEP27, REEP5, ENO1, ASNS, ZBTB20, HSPD1, PSMB1, NONO, and KPNA2 19

PTBP1 Polypyrimidine tract binding protein 1 19p13.3 
promoter ID: 20018
PABPN1, UBL3, REEP5, KIAA0232, SET, G3BP1, ENO1, GTF2I, SMARCA4, ASH1L, WDR1, CSNK1D, HSP90AB1, TUBB, FKBP1A, YWHAZ, HSPD1, VAMP4, and NONO 19

ZBTB20 Zinc finger and BTB domain containing 20 3q13.2 
promoter ID: 119577
POGZ, NFIB, CIRBP, TIMM17A, CEP27, SNRPN, ARHGAP5, TUBB, RUFY3, EZH1, NCOA1, GSTA4, KTN1, RPL35A, GPRASP1, PDE4C, SLC35E1, RABGEF1, and ZNF160 19

COPA Coatomer protein complex, subunit alpha 1q23–25 
promoter ID: 2753
G3BP1, TM2D1, GNB1, GTF2I, RIPK5, DCTN4, SCAMP1, HSPA8, RAB6A, CANX, WDR1, ACTB, CSNK1A1, HSP90AB1, and YWHAZ 15

E2F4 E2F transcription factor 4, p107/p130 binding 16q21-q22 
promoter ID: 15321
RUFY3, NTRK2, GABRB2, ENO1, BAIAP2, KTN1, DPP6, PDE4C, EZH1, ITFG1, NECAP1, JAK1, NLRP1, and CHERP 14

POLR2E Polymerase (RNA) II (DNA directed) polypeptide E, 25 kDa 19p13.3 
promoter ID: 22868
SMARCA4, ASH1L, RIPK5, HSBP1, UBE2D3, GNB1, HSPA8, NONO, GSTO1, PGK1, ENO1, WDR1, FKBP1A, and HSP90AB1 14

STIP1 Stress-induced phosphoprotein 1 11q13 
promoter ID: 6654
SFPQ, HSBP1, GTF2I, FAM108B1, GNB1, G3BP1, RAB6A, WDR1, HSPA8, ENO1, NONO, YWHAZ, HSPD1, and HSP90AB1 14

ANAPC5 Anaphase promoting complex subunit 5 12q24.31 
promoter ID: 9688
CEP27, FAM108B1, G3BP1, GTF2I, HSBP1, HSP90AB1, HSPA8, NONO, PGK1, PSMA1, RAB6A, RIPK5, and YWHAZ 13

RNF38 Ring finger protein 38 9p13-12 
promoter ID: 42818
REEP5, SSX2IP, DCTN4, DCUN1D4, HSPA8, C1orf43, SFPQ, ASH1L, ITFG1, HSP90AB1, YWHAZ, MGEA5, and FOXJ3 13

Transcription of genes is controlled by a small number of TFs whose activation via posttranslational modification or ligand binding is the determining factor of gene expression. In general, TFAs are not always correlated with their gene expression profiles. Figure 1 gives the comparison of the estimated TFAs and their gene expression profiles in microarray gene expression profiles. Each subfigure shows the activities and gene expression of TF in control, incipient, moderate, and severe AD. We found that most of TFs represent a novel approach to control cellular processes. Figure 1 showed that the activities of COPA, POLR2E, and ZBTB20 were gradually depressed, which meant that they were deactivated over the deterioration of AD. E2F4, as well as ZNF207, was activated within the deterioration of AD. Moreover, some TFAs were neither monotonic decreasing nor increasing. For example, ANAPC5, BUB3, and RNF38 were deactivated in incipient AD, then greatly activated in moderate AD, and finally decreased in severe AD. PTBP1 was activated in incipient AD and repressed in moderate and severe AD gradually. STIP1 was repressed in incipient and moderate AD and then activated in severe AD. The molecular biological analysis of the functions of TFs and TGs in AD was discussed in the next section.

Figure 1.

Figure 1

10 transcription factor activities (TFAs) compared to their gene expression profiles. Blue solid lines with “○” and green solid lines with triangle denote TFAs and their gene expression profiles, respectively, numbers 1~4 on x-axis denote 4 stages of control, incipient, moderate, and severe AD, and y-axis is the activities or gene expression values of TF.

3.3. Dynamic Regulatory Network of AD

Based on the quantitative results of TFAs (matrix [P]), control strength on the TGs (matrix [C]), and the original microarray gene expression, our original goal of reconstructing the dynamic regulatory networks for different courses of AD was done in Figure 2. 10 TFs were noted with diamond in the middle of each course, and target genes which were activated or repressed by TFs were noted in circles with control strength lines. According to Figure 2, we discovered many additional insights in biological regulatory pathways of different courses of AD. For example, ANAPC5, which was greatly activated in the moderate AD, is a subunit of APC, and APC is essential for cells to progress through anaphase, exit from mitosis, and prevent a premature entry into S phase [32].

Figure 2.

Figure 2

Dynamic transcriptional regulatory network for AD. Transcriptional regulatory network of (a) control, (b) incipient, (c) moderate, and (d) severe AD samples. Diamond in the middle denoted TFs with activity values in different colors, circles with different colors presented the gene expression of target genes, and the lines between TFs and target genes with different colors noted the control strength.

BUB3 was deactivated in incipient AD. It is a crucial component in the formation of the mitotic spindle assembly complex, which forms a complex with other important proteins [33]. ZNF207, which is subunit of ZNF, was activated in the whole AD. Zinc finger transcription factors (ZNF) are transcription factors composed of a zinc finger binding domain and any of a variety of transcription-factor effector domains which exert their modulatory effect in the vicinity of any sequence to which the protein domain binds modulating gene expression directly at the DNA level to be engineered to target virtually any gene [34]. Reference [35] showed that ZBTB20 can be regulated by ZNF207. In our experiment, it showed that some TFs were related to cell cycle and regulated the genes of cell cycle obviously; for example, ANAPC5, BUB3, and ZBTB20 regulated gene expression of CEP27. The CEP family protein is the active component of centrosome and plays a vital role in centriole biogenesis and cell cycle progression control [36]. Furthermore, ZBTB20 could inhibit IκBα gene transcription, govern IκBα protein expression, and then promote NF-κB activation [37].

G3BP1 mediated effect on proliferation in lung cancer and breast cancer [38, 39]. Interestingly, in AD research, our data showed G3BP1 was regulated by ANAPC5, COPA, PTBP1, STIP1, and ZNF206. The result suggested that G3BP1 had biological significance in AD process. Our data also exhibited that the stress-inducible protein-1 (STPI1) regulated HSP90AB1 and HSPD1. STIP1 is an adaptor protein that coordinates the functions of HSP70 and HSP90 in protein folding. It is thought to assist in the transfer of proteins from HSP70 to HSP90 by binding both HSP90 and substrate-bound HSP70. STPI1 was upregulated in the ischemic brains from humans and rodents. The increase in STPI1 expression in vivo was not cell-type specific, as it was found in neurons, glia, and endothelial cells [40]. STIP1 also stimulates ATPase activity of HSP70 and inhibits ATPase, activity of HSP90, which suggests that it regulates both the conformations and ATPase cycles of these chaperones [4143].

COPA coatomer is a cytosolic protein complex that binds to dilysine motifs and reversibly associates with Golgi non-clathrin-coated vesicles, which further mediates biosynthetic protein transport from endoplasmic reticulum (ER) via the Golgi up to the trans-Golgi network [44]. From Figure 1 we can see that the activities of COPA were progressively inhibited in the whole course of AD.

TFII (GTF2I) was classified as a general transcription factor when it was first identified. TFII-I (GTF2I) identified and validated novel neuronal targets which affected the PI3 K and TGFβ signaling pathways in vivo, and it played a major role in the neurodevelopmental features of Williams-Beuren syndrome (WBS) [45]. TFII (GTF2I) was the main cause of the neurocognitive profile [46]. Our results exhibited that, in normal (control) stage, COPA, POLR2E, and STIP1 negatively regulated TFII (GTF2I). In other 3 stages, COPA, POLR2E, and STIP1 positively regulated TFII (GTF2I). In incipient AD, COPA and STIP1 positively regulated TFII (GTF2I); in moderate AD, ANAPC5 and COPA positively regulated TFII (GTF2I); in severe AD, only COPA positively regulated TFII (GTF2I), but STIP1 and PTBP1 negatively regulated TFII (GTF2I). It hints that, in different courses of AD, the TFs play a different role in regulating even the same TGs.

From the original gene expression profiles we can see that ZFP207, as a TF, was increased gradually in the process of the deterioration of AD; by contrast, the gene expression of BUB3, COPA, POLR2E, and ZBZB20 was decreased gradually. The target Gene NONO (p54nrb) protein is a RNA-binding protein, and the gene that interacted with RNA could be selectively modulated by phosphorylation during mitosis [47]. As Table 1 showed, NONO (p54nrb) can be regulated by many kinds of TFs, such as ZNF207, BUB3, PTBP1, POLR2E, STIP1, and ANAPC5. Many of them were related to mitosis and cell growth. ZNF207 (BuGZ) is a gene associated with spindle microtubules and it can regulate chromosome alignment. ZNF207 (BuGZ) directly binds and stabilizes Bub3 and uses its microtubule-binding domain to enhance the loading of Bub3 to kinetochores that have assumed initial interactions with microtubules in prometaphase [48].

Moreover, many target genes in our dynamic regulatory networks, such as CK1alphaLS, CLIP1, TUBB, ENO1, CSNK1D, BUB3 CSNK1A1, and SFPQ, were also related to cell growth [49, 50]. Our data indicated that SFPQ was regulated by ZNF207, BUB3, STIP1, and RNF38. SFPQ pathology may progress together with the tau pathology in AD [51]. ENO1 was regulated by ZNF207, BUB3, PTBP1, E2F4, POLR2E, and STIP1. The gene ENO1 was related to cell growth and migration [52]. Furthermore, as a neuronal expressed tubulin gene, the expression profiles of TUBB which is associated with a spectrum of disorders affecting cerebral cortex formation were decreased gradually in deteriorative processes of AD [50]. Our dynamic regulatory network showed that many TFs and TGs were closely related to the mitosis and cell growth. The results illustrated that regulating the genes related to mitosis and cell growth would become an important way for AD treatment.

Recently, some similar studies on AD gene expression data have been proposed and obtained some meaningful results. For example, Panigrahi and Singh applied an integrative systems biology approach to AD pathway and regulatory network analysis [9]. They identified 26 novel genes and variants associated with AD and ageing and some coexpression networks. They grouped TFs according to the TFBS classes into 10 groups and illustrated that they were related to immune system, DNA binding domain, central nervous system, and so on. Furthermore, they also found some novel information of network motifs and unique miRNA targets as a regulatory process for AD. By contrast, our studies focused on finding the dynamic responses of genes, signaling proteins, and activities of TFs on TGs in the progressively degenerative progress of AD, which would represent a deep understanding of the underlying transcription regulatory process for pathogenesis of AD.

The characteristic pathological change in the deteriorative process of AD is that the cerebral cortex begins to shrink as more and more neurons stop working and die. This phenomenon with the growth of neurons to repair and cell growth has close relation to the environment. In our results, some causes of AD were discovered by the comprehensively biological analysis of the dynamical changes of TFs on TGs, which were related to mitosis, cell growth, immune response, and inflammation. And our future study will also focus on the mitosis, cell growth, immune response, and inflammation of AD to find its real pathogenesis.

4. Conclusions

Microarray technologies enable the simultaneous measurement of all mRNA transcripts and therefore make it possible to reconstruct the gene regulatory network. The traditional models for gene network analysis such as ICA did not use any transcription regulatory information and were purely based on mathematic and statistical properties of the regulatory signals; thus they cannot correctly reconstruct the regulatory network. In this study, by combining biochemical information, that is, the connective strength between TFs and their regulated genes with gene expression profiles, NCA was utilized to deduce the dynamic regulatory network of AD. ICA was firstly applied to identifying significant genes from various biological processes of different courses of AD. Then NCA was applied to inferring regulatory actions of TF activities from gene microarray profiles and partial TF-TG connectivity information.

From the molecular biological analysis of the reconstructed dynamic regulatory network of AD, we found some transcriptional regulatory pattern in more biological insight for AD. We found that some TFs were activated during the whole AD deterioration or activated in the beginning of AD, such as ANAPC5, E2F4, PTBP, and ZNF207, and were proved to play an important role in centriole biogenesis and cell cycle progression control. Another gene, GTF2I, which was important in neurodevelopment and neurocognition, was found dynamically regulated from normal to AD stage. Some TFs, such as COPA, PTBP1 and STIP1, were inhibited in AD. Gene G3BP1 coregulated by these TFs was found low expressed in binding protein in AD. However it always proliferated in cancers. It hints that some biological relationship or signal regulatory pathway should be coordinated between AD and cancer. In general, the dynamically characteristic analysis of TFs on TGs of deteriorative courses of AD helps us focus the future study on mitosis, cell growth, immune response, and inflammatory reaction for the pathogenesis of AD.

Acknowledgment

This work was supported by the National Natural Science Foundation of China (nos. 61271446 and 61003093).

Conflict of Interests

The authors declare that there is no conflict of interests regarding the publication of this paper.

References

  • 1.Meyer-Luehmann M, Spires-Jones TL, Prada C, et al. Rapid appearance and local toxicity of amyloid-β plaques in a mouse model of Alzheimer’s disease. Nature. 2008;451(7179):720–724. doi: 10.1038/nature06616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Lei H. Amyloid and Alzheimer’s disease. Protein and Cell. 2010;1(4):312–314. doi: 10.1007/s13238-010-0046-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Kaissi O, Nimpaye E, Singh TR. Genes selection comparative study in microarray data analysis. Bioinformation. 2013;9(20):1019–1022. doi: 10.6026/97320630091019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Ray S, Britschgi M, Herbert C, et al. Classification and prediction of clinical Alzheimer’s diagnosis based on plasma signaling proteins. Nature Medicine. 2007;13(11):1359–1362. doi: 10.1038/nm1653. [DOI] [PubMed] [Google Scholar]
  • 5.Ray M, Ruan J, Zhang W. Variations in the transcriptome of Alzheimer’s disease reveal molecular networks involved in cardiovascular diseases. Genome Biology. 2008;9(10, article R148) doi: 10.1186/gb-2008-9-10-r148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Wang X, Chen Y, Wang X, Lu L. Genetic regulatory network analysis for app based on genetical genomics approach. Experimental Aging Research. 2010;36(1):79–93. doi: 10.1080/03610730903418729. [DOI] [PubMed] [Google Scholar]
  • 7.Zhang L, Ju X, Cheng Y, Guo X, Wen T. Identifying Tmem59 related gene regulatory network of mouse neural stem cell from a compendium of expression profiles. BMC Systems Biology. 2011;5, article 152 doi: 10.1186/1752-0509-5-152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Liang D, Han G, Feng X, Sun J, Duan Y, Lei H. Concerted perturbation observed in a hub network in Alzheimer’s disease. PLoS ONE. 2012;7(7) doi: 10.1371/journal.pone.0040498.e40498 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Panigrahi PP, Singh TR. Computational studies on Alzheimer’s disease associated pathways and regulatory patterns using microarray gene expression and network data: revealed association with aging and other diseases. Journal of Theoretical Biology. 2013;334:109–121. doi: 10.1016/j.jtbi.2013.06.013. [DOI] [PubMed] [Google Scholar]
  • 10.Liao JC, Boscolo R, Yang Y-L, Tran LM, Sabatti C, Roychowdhury VP. Network component analysis: reconstruction of regulatory signals in biological systems. Proceedings of the National Academy of Sciences of the United States of America. 2003;100(26):15522–15527. doi: 10.1073/pnas.2136632100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Seok J, Xiao W, Moldawer LL, Davis RW, Covert MW. A dynamic network of transcription in LPS-treated human subjects. BMC Systems Biology. 2009;3, article 78 doi: 10.1186/1752-0509-3-78. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Kao KC, Yang Y-L, Boscolo R, Sabatti C, Roychowdhury V, Liao JC. Transcriptome-based determination of multiple transcription regulator activities in Escherichia coli by using network component analysis. Proceedings of the National Academy of Sciences of the United States of America. 2004;101(2):641–646. doi: 10.1073/pnas.0305287101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.MacLennan NK, Rahib L, Shin C, et al. Targeted disruption of glycerol kinase gene in mice: expression analysis in liver shows alterations in network partners related to glycerol kinase activity. Human Molecular Genetics. 2006;15(3):405–415. doi: 10.1093/hmg/ddi457. [DOI] [PubMed] [Google Scholar]
  • 14.Rahib L, MacLennan NK, Horvath S, Liao JC, Dipple KM. Glycerol kinase deficiency alters expression of genes involved in lipid metabolism, carbohydrate metabolism, and insulin signaling. European Journal of Human Genetics. 2007;15(6):646–657. doi: 10.1038/sj.ejhg.5201801. [DOI] [PubMed] [Google Scholar]
  • 15.Galbraith SJ, Tran LM, Liao JC. Transcriptome network component analysis with limited microarray data. Bioinformatics. 2006;22(15):1886–1894. doi: 10.1093/bioinformatics/btl279. [DOI] [PubMed] [Google Scholar]
  • 16.Tran LM, Brynildsen MP, Kao KC, Suen JK, Liao JC. gNCA: a framework for determining transcription factor activity based on transcriptome: identifiability and numerical implementation. Metabolic Engineering. 2005;7(2):128–141. doi: 10.1016/j.ymben.2004.12.001. [DOI] [PubMed] [Google Scholar]
  • 17.Boscolo R, Sabatti C, Liao JC, Roychowdhury VP. A generalized framework for network component analysis. IEEE/ACM Transactions on Computational Biology and Bioinformatics. 2005;2(4):289–301. doi: 10.1109/TCBB.2005.47. [DOI] [PubMed] [Google Scholar]
  • 18.Hori G, Inoue M, Nishimura S, et al. Blind gene classification—an application of a signal separation method. Proceedings of the Genome Informatics Workshop (GIW '01); 2001; Tokyo, Japan. pp. 255–256. [Google Scholar]
  • 19.Liebermeister W. Linear modes of gene expression determined by independent coponent analysis. Bioinformatics. 2002;18(1):51–60. doi: 10.1093/bioinformatics/18.1.51. [DOI] [PubMed] [Google Scholar]
  • 20.Lu X-G, Lin Y-P, Yue W, Wang H-J, Zhou S-W. ICA based supervised gene classification of microarray data in yeast functional genome. Proceedings of the 8th International Conference on High-Performance Computing in Asia-Pacific Region; December 2005; Beijing, China. pp. 633–638. [Google Scholar]
  • 21.Martoglio A-M, Miskin JW, Smith SK, MacKay DJC. A decomposition model to track gene expression signatures: preview on observer-independent classification of ovarian cancer. Bioinformatics. 2002;18(12):1617–1624. doi: 10.1093/bioinformatics/18.12.1617. [DOI] [PubMed] [Google Scholar]
  • 22.Eguizabal A, Laughney AM, García-Allende PB, et al. Direct identification of breast cancer pathologies using blind separation of label-free localized reflectance measurements. Biomedical Optics Express. 2013;4(7):1104–1118. doi: 10.1364/BOE.4.001104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Saidi SA, Holland CM, Kreil DP, et al. Independent component analysis of microarray data in the study of endometrial cancer. Oncogene. 2004;23(39):6677–6683. doi: 10.1038/sj.onc.1207562. [DOI] [PubMed] [Google Scholar]
  • 24.Zhu L, Tang C. Microarray sample clustering using independent component analysis. Proceedings of the IEEE/SMC International Conference on System of Systems Engineering; April 2006; Los Angeles, Calif, USA. pp. 112–117. [Google Scholar]
  • 25.Zhang XW, Yap YL, Wei D, Chen F, Danchin A. Molecular diagnosis of human cancer type by gene expression profiles and independent component analysis. European Journal of Human Genetics. 2005;13(12):1303–1311. doi: 10.1038/sj.ejhg.5201495. [DOI] [PubMed] [Google Scholar]
  • 26.Frigyesi A, Veerla S, Lindgren D, Höglund M. Independent component analysis reveals new and biologically significant structures in micro array data. BMC Bioinformatics. 2006;7, article 290 doi: 10.1186/1471-2105-7-290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Blalock EM, Geddes JW, Chen KC, Porter NM, Markesbery WR, Landfield PW. Incipient Alzheimer’s disease: microarray correlation analyses reveal major transcriptional and tumor suppressor responses. Proceedings of the National Academy of Sciences of the United States of America. 2004;101(7):2173–2178. doi: 10.1073/pnas.0308512100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Blalock EM, Chen K-C, Sharrow K, et al. Gene microarrays in hippocampal aging: statistical profiling identifies novel processes correlated with cognitive impairment. The Journal of Neuroscience. 2003;23(9):3807–3819. doi: 10.1523/JNEUROSCI.23-09-03807.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Hyvärinen A, Oja E. A fast fixed-point algorithm for independent component analysis. Neural Computation. 1997;9(7):1483–1492. [Google Scholar]
  • 30.Kong W, Mou X, Hu X. Exploring matrix factorization techniques for significant genes identification of Alzheimer’s disease microarray gene expression data. BMC Bioinformatics. 2011;12(5, article S7) doi: 10.1186/1471-2105-12-S5-S7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Kong W, Mou X, Liu Q, et al. Independent component analysis of Alzheimer’s DNA microarray gene expression data. Molecular Neurodegeneration. 2009;4(1, article 4):p. 5. doi: 10.1186/1750-1326-4-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Latchman DS. Transcription factors: an overview. International Journal of Experimental Pathology. 1993;74(5):417–422. [PMC free article] [PubMed] [Google Scholar]
  • 33.Kumar A, Rajendran V, Sethumadhavan R, Purohit R. CEP proteins: the knights of centrosome dynasty. Protoplasma. 2013;250(5):965–983. doi: 10.1007/s00709-013-0488-9. [DOI] [PubMed] [Google Scholar]
  • 34.Gommans WM, Haisma HJ, Rots MG. Engineering zinc finger protein transcription factors: the therapeutic relevance of switching endogenous gene expression on or off at command. Journal of Molecular Biology. 2005;354(3):507–519. doi: 10.1016/j.jmb.2005.06.082. [DOI] [PubMed] [Google Scholar]
  • 35.Harboe TL, Tümer Z, Hansen C, Jensen NA, Tommerup N. Assignment of the human zinc finger gene, ZNF288, to chromosome 3 band q13.2 by radiation hybrid mapping and fluorescence in situ hybridisation. Cytogenetics and Cell Genetics. 2000;89(3-4):156–157. doi: 10.1159/000015600. [DOI] [PubMed] [Google Scholar]
  • 36.Shim J-H, Su Z-Y, Chae J-I, et al. Epigallocatechin gallate suppresses lung cancer cell growth through Ras-GTPase-activating protein SH3 domain-binding protein 1. Cancer Prevention Research. 2010;3(5):670–679. doi: 10.1158/1940-6207.CAPR-09-0185. [DOI] [PubMed] [Google Scholar]
  • 37.Liu X, Zhang P, Bao Y, et al. Zinc finger protein ZBTB20 promotes toll-like receptor-triggered innate immune responses by repressing IκBα gene transcription. Proceedings of the National Academy of Sciences of the United States of America. 2013;110(27):11097–11102. doi: 10.1073/pnas.1301257110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Winslow S, Leandersson K, Larsson C. Regulation of PMP22 mRNA by G3BP1 affects cell proliferation in breast cancer cells. Molecular Cancer. 2013;12(1):p. 156. doi: 10.1186/1476-4598-12-156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Soncini C, Berdo I, Draetta G. Ras-GAP SH3 domain binding protein (G3BP) is a modulator of USP10, a novel human ubiquitin specific protease. Oncogene. 2001;20(29):3869–3879. doi: 10.1038/sj.onc.1204553. [DOI] [PubMed] [Google Scholar]
  • 40.Lee S-D, Lai TW, Lin S-Z, et al. Role of stress-inducible protein-1 in recruitment of bone marrow derived cells into the ischemic brains. EMBO Molecular Medicine. 2013;5(8):1227–1246. doi: 10.1002/emmm.201202258. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Becker T, Hartl FU, Wieland F. CD40, an extracellular receptor for binding and uptake of Hsp70-peptide complexes. Journal of Cell Biology. 2002;158(7):1277–1285. doi: 10.1083/jcb.200208083. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Boltze C, Mundschenk J, Unger N, et al. Expression profile of the telomeric complex discriminates between benign and malignant pheochromocytoma. Journal of Clinical Endocrinology and Metabolism. 2003;88(9):4280–4286. doi: 10.1210/jc.2002-021299. [DOI] [PubMed] [Google Scholar]
  • 43.Song Y, Masison DC. Independent regulation of Hsp70 and Hsp90 chaperones by Hsp70/Hsp90-organizing protein Sti1 (Hop1) Journal of Biological Chemistry. 2005;280(40):34178–34185. doi: 10.1074/jbc.M505420200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Quek HH, Chow VTK. Genomic organization and mapping of the human HEP-COP gene (COPA) to 1q. Cytogenetics and Cell Genetics. 1997;76(3-4):139–143. doi: 10.1159/000134532. [DOI] [PubMed] [Google Scholar]
  • 45.Segura-Puimedon M, Borralleras C, Pérez-Jurado LA, Campuzano V. TFII-I regulates target genes in the PI-3K and TGF-β signaling pathways through a novel DNA binding motif. Gene. 2013;527(2):529–536. doi: 10.1016/j.gene.2013.06.050. [DOI] [PubMed] [Google Scholar]
  • 46.Antonell A, del Campo M, Magano LF, et al. Partial 7q11.23 deletions further implicate GTF2I and GTF2IRD1 as the main genes responsible for the Williamse-Beuren syndrome neurocognitive profile. Journal of Medical Genetics. 2010;47(5):312–320. doi: 10.1136/jmg.2009.071712. [DOI] [PubMed] [Google Scholar]
  • 47.Bruelle C, Bédard M, Blier S, Gauthier M, Traish AM, Vincent M. The mitotic phosphorylation of p54nrb modulates its RNA binding activity. Biochemistry and Cell Biology. 2011;89(4):423–433. doi: 10.1139/o11-030. [DOI] [PubMed] [Google Scholar]
  • 48.Jiang H, He X, Wang S, et al. A microtubule-associated zinc finger protein, BuGZ, regulates mitotic chromosome alignment by ensuring Bub3 stability and kinetochore targeting. Developmental Cell. 2014;28(3):268–281. doi: 10.1016/j.devcel.2013.12.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Panchenko MP, Siddiquee Z, Dombkowski DM, et al. Protein kinase CK1αLS promotes vascular cell proliferation and intimal hyperplasia. American Journal of Pathology. 2010;177(3):1562–1572. doi: 10.2353/ajpath.2010.100327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Cushion TD, Paciorkowski AR, Pilz DT, et al. De novo mutations in the beta-tubulin gene TUBB2A cause simplified gyral patterning and infantile-onset epilepsy. The American Journal of Human Genetics. 2014;94(4):634–641. doi: 10.1016/j.ajhg.2014.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Ke Y, Dramiga J, Schütz U, et al. Tau-mediated nuclear depletion and cytoplasmic accumulation of SFPQ in Alzheimer’s and Pick’s disease. PloS ONE. 2012;7(4) doi: 10.1371/journal.pone.0035678.e35678 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Song Y, Luo Q, Long H, et al. Alpha-enolase as a potential cancer prognostic marker promotes cell growth, migration, and invasion in glioma. Molecular Cancer. 2014;21:13–65. doi: 10.1186/1476-4598-13-65. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Computational and Mathematical Methods in Medicine are provided here courtesy of Wiley

RESOURCES