Abstract
Background
Cell cycle is an important clue to unravel the mechanism of cancer cells. Recently, expression profiles of cDNA microarray data of Cancer cell cycle are available for the information of dynamic interactions among Cancer cell cycle related genes. Therefore, it is more appealing to construct a dynamic model for gene regulatory network of Cancer cell cycle to gain more insight into the infrastructure of gene regulatory mechanism of cancer cell via microarray data.
Results
Based on the gene regulatory dynamic model and microarray data, we construct the whole dynamic gene regulatory network of Cancer cell cycle. In this study, we trace back upstream regulatory genes of a target gene to infer the regulatory pathways of the gene network by maximum likelihood estimation method. Finally, based on the dynamic regulatory network, we analyze the regulatory abilities and sensitivities of regulatory genes to clarify their roles in the mechanism of Cancer cell cycle.
Conclusions
Our study presents a systematically iterative approach to discern and characterize the transcriptional regulatory network in Hela cell cycle from the raw expression profiles. The transcription regulatory network in Hela cell cycle can also be confirmed by some experimental reviews. Based on our study and some literature reviews, we can predict and clarify the E2F target genes in G1/S phase, which are crucial for regulating cell cycle progression and tumorigenesis. From the results of the network construction and literature confirmation, we infer that MCM4, MCM5, CDC6, CDC25A, UNG and E2F2 are E2F target genes in Hela cell cycle.
Introduction
The losses of cellular regulation give rise to most case of cancer. In cells, intricate genetic control systems regulate the balance between cell survival and death in response to growth signals, growth-inhibiting signals, and death signals. When some errors occur in the control systems causing cells to proliferate continuously, tumor just comes into being (Badawi et al. 2005).
The proliferation of cancer cell into two individual cells must go through cell cycle process (Whitfield et al. 2002; Valsesia-Wittmann et al. 2004). Cell cycle entails an ordered series of macromolecular events that lead to cell division and the production of two daughter cells. So that cell cycle is meaningful to the proliferation of cancer cell.
Expression levels of thousands of genes fluctuate throughout the cancer cell cycle (Cho et al. 2001; Ishida et al. 2001; Whitfield et al. 2002). Functional genes show periodic transcription to reflect cell growth, DNA synthesis, spindle pole body duplication and migration through the cell cycle (Cho et al. 1998). These processes and their regulation have been extensively investigated at the molecular level (Stillman, 1996; Nurse, 2000; Shah and Cleveland, 2000; Hinchcliffe and Sluder, 2001; Chen et al. 2004). Systems biology can be described as “integrative biology” with the ultimate goal of being able to predict de novo biological outcomes given the list of the components involved (Liu, 2005). Hence it is the coordinated study by (1) investigating the components of cellular networks and their interactions, (2) applying experimental high-throughput and whole-genome techniques, and (3) integrating computational methods with experimental effort (Klipp et al. 2005). In this situation, characterization of the genome-wide transcriptional program of the cell division cycle in mammalian cells is a critical step toward understanding the basic cell cycle processes and their roles in cancer. Therefore, it is worth investigating how these periodic patterns are regulated in the gene regulatory network of the cancer cell cycle from the systems biology perspective.
Gene expression data of Hela cell cycle have been collected (Whitfield et al. 2002) and analyzed with many clustering methods to organize which gene is associated with the cell cycle (Whitfield et al. 2002; Cho et al. 2001). Theoretically, it is possible to engineer the cell cycle network reversely, if we take cDNA expression levels as the output of gene expression networks, and collect cDNA expression levels of transcription factors as input. In order to realize how genes are regulated by transcription factors, we must also understand the interactions between target genes and their transcription factors (which transcription factor binds to which promoters). With all these information and interactive dynamic model, we get some clues to piece up the gene expression regulatory network in Hela cell cycle.
In this study, we attempt to devise an interactive dynamic model to characterize transcriptional regulatory network of the Hela cell cycle from the cDNA expression data of the human cell cycle in tumour (Whitfield et al. 2002). Based on our dynamic regulatory network, we not only predict the upstream regulators but also characterize the significance of the regulators depending on quantifying their regulatory abilities based on the corresponding biochemical kinetic parameters. At first, we construct a discrete-time dynamic model system and calculate the system kinetic parameters as the regulatory ability by using the expression data (Whitfield et al. 2002) and the system identification method (Johansson, 1993). Second, based on the interactive dynamic model, we detect the transcriptional regulatory function of target genes by the maximum likelihood parameter estimation method. Third, we trace back a group of upstream genes that play a role of transcriptional regulators of target genes in the Hela cell cycle of Homo sapiens via deducing the interactive relationship between the expression profiles of regulators and the detected transcription regulation of specific target genes. The pathway kinetic parameters of transcriptional regulatory network of Hela cell cycle are also estimated by the cDNA expression profiles of target genes and their upstream regulatory genes. Finally, these upstream regulatory genes are considered as the target genes. By a similar method, we construct their upstream regulatory genes one by one. Iteratively, we can construct the whole gene regulatory network of Hela cell cycle.
We applied our method to a publicly available data set of HeLa cell with microarray experiment on cell cycle (Whitfield et al. 2002) to identify the transcriptional regulators of cell cycle and to characterize their regulatory abilities on specific target genes. By means of the quantitative system analysis of the transcriptional regulatory network from Hela cell cycle genes, several transcription factors were identified and their regulatory abilities were determined. Further, some genes that may be suspected of regulators of Hela cell cycle are predicted here to be synergistic in harmonizing gene expression. Our proposed algorithm provides a novel approach to gain insight into the gene regulatory network of Hela cell by its gene expression data using system identification technique and discrete-time dynamic model. Furthermore, we combine the constructed Hela cell cycle dynamic network, some experimental reviews and E2F binding site’s research (Ren et al. 2002), we not only confirm the reliability of the dynamic network but also find the E2F target genes in Hela cell cycle progression. Finally, from the results of this study, we infer that E2F directly regulate MCM4, MCM5, CDC6, CDC25A, UNG and E2F2 in Hela cell cycle progression.
Our approach is so different from the statistical clustering method that it not only provides a suitable interactive dynamic model to decipher the complex signal transduction pathway that regulates gene expressions in Hela cell cycle but also predicts some potential regulators that have not been found. We can also quantify the regulatory abilities of the transcription factors by the corresponding kinetic activities to their target genes in the Hela cell cycle regulatory network. We could construct the cell cycle regulatory network in Hela cells quantitatively and discuss the sensitivity of regulatory genes to the gene regulatory network from the system analysis perspective.
System Model and Network Identification
The construction of the transcriptional regulatory network of Hela cell cycle can be divided into two steps. First, the transcriptional regulation should be extracted from the gene expression data by the dynamic discrete-time model. Second, the upstream regulators will be traced back by correlating the transcriptional regulation of target genes with the expression profiles of possible regulators of Hela cell cycle. In this study, 64 transcription regulators, as shown in Table 1, are used as candidates of upstream regulators to each target gene. Finally, the kinetic parameters of gene transcriptional regulatory network of Hela cell are estimated by the cDNA expression profiles of target genes and their upstream regulatory genes.
Table 1. CloneIDs (Whitfield et al. 2002) of 64 transcription factors in human cell cycle are selected as candidates to regulate downstream target genes.
IMAGE:173309(BCLAF1) | IMAGE:564803(FOXM1) IEA |
IMAGE:590774(MAPK13) Knebel et al. |
IMAGE:202704(SCML1) IEA |
IMAGE:241474(BRCA1) IEA |
IMAGE:149809(GATA2) IEA |
IMAGE:1536451 (MHC2TA) | IMAGE:782622 (SP1) |
IMAGE:815287(BRD8) Monden et al. |
IMAGE:135688(GATA2) IEA |
IMAGE:809731(MNT) Meroni et al. |
IMAGE:80318 (SP1) |
IMAGE:825210(C14orf106) IEA |
IMAGE:780958 (GTF3C4) | IMAGE:197520(NCOA3) IEA |
IMAGE:840636 (SRF) |
IMAGE:726588(C14orf106) IEA |
IMAGE:291827 (GTF3C4) | IMAGE:884438(NFE2L2) IEA |
IMAGE:545503 (STAT1) |
IMAGE:1915416(CDK7) Shiekhattar et al. |
IMAGE:344049(HCFC1) Wysocka et al. |
IMAGE:1455463 (NFIC) | IMAGE:134120 (STAT5B) |
IMAGE:130242(CDK7) Shiekhattar et al. |
IMAGE:897806 (HIF1A) | IMAGE:265874 (NFIC) | IMAGE:712840 (STAT5B) |
IMAGE:268652(CDKN1A) IEA |
IMAGE:878184(HMG20B) IEA |
IMAGE:271198(NR3C1) Takahashi et al. |
IMAGE:132857 (STAT5B) |
IMAGE:240367(CTCF) Filippova et al. |
IMAGE:1842250(HMGB2) Shirakawa et al. |
IMAGE:245517(NR5A2) IEA |
IMAGE:272192 (TCERG1) |
IMAGE:490728 (DMTF1) | IMAGE:363103(HMGB2) Shirakawa et al. |
IMAGE:43229(PCNA) IEA |
IMAGE:137387 (TFAP2A) |
IMAGE:487797 (DR1) | IMAGE:242952(ILF2) Reichman et al. |
IMAGE:789182(PCNA) IEA |
IMAGE:868630(TGFB1I4) IEA |
IMAGE:566760 (DR1) | IMAGE:838829(JARID1B) IEA |
IMAGE:30114 (PHTF2) | IMAGE:366414(UHRF1) IEA |
IMAGE:884462(DSCR1) Fuentes et al. |
IMAGE:345056(KIAA1404) IEA |
IMAGE:1947972(PKNOX1) Chen et al. |
IMAGE:1550739(UHRF1) IEA |
IMAGE:236142 (E2F1) | IMAGE:510381(KLF6) Ratziu et al. |
IMAGE:2018976(PTTG1) Dominguez et al. |
IMAGE:246869(ZNF207) Pahl et al. |
IMAGE:768260 (E2F1) | IMAGE:302549(KLF9) Imataka et al. |
IMAGE:781089(PTTG1) Dominguez et al. |
IMAGE:296429(ZNF24) IEA |
IMAGE:293331 (E2F2) | IMAGE:35147(LASS6) Banerjee-Basu et al. |
IMAGE:845502(RBPSUH) Hsieh et al. |
IMAGE:280750(ZNF281) Law et al. |
Dynamic signaling regulatory model
The second-order difference equation is used in the description of dynamic system evolved from the causality of gene regulatory function. Let Xi(k) denote the expression profile of the i-th gene at time point k. The following second-order difference equation is proposed to model the cDNA expression level of the i-th gene,
(1) |
where Gi(k) is the upstream transcriptional regulatory function from regulatory genes to influence the expression profile Xi(k) of the i-th gene while ai and bi are the parameters that characterize the dynamic inherent property of the gene like degradation and oscillation, and ɛi (k) is the random noise of current microarray data or the residue of the model. In general, the second-order difference equation has been widely used to model dynamic discrete-time systems to efficiently characterize the dynamic properties of damping and resonance of systems in physics and engineering. The reason is that the roots of the characteristic equation of second-order dynamic equation may be a real double root, two real roots or conjugate roots, which could easily describe a system with undamping, overdamping, critical damping, under-damping or oscillation, dependent on the specification of their coefficients (Kreszig, 1993). These characteristics can not be easily described by the first-order dynamic equation. Therefore the second order stochastic equation is employed to characterize the biochemical processing of the gene expression.
Evidently, the transcriptional regulatory function Gi(k) controls the synthetic rate of cDNA and the clue of upstream regulatory pathway is involved in Gi(k). Therefore we emphasize on how to detect the upstream regulatory function Gi(k) from expression data Xi(k) and our dynamic model equation in equation (1). In general, it is not easy to extract transcriptional regulatory function Gi(k). In order to extract the input regulatory function Gi(k), we apply Fourier decomposition method to decompose Gi(k) to generate some harmonic sinusoid functions. When the extraction problem is reduced to a simple parameter estimation problem, Gi(k) can be decomposed by the following Fourier series.
(2) |
Then we need to estimate the parameters of Fourier series, αn and βn, that are the magnitudes of different harmonics of sinusoid functions (cos(nt) and sin(nt)) for n = 0, …, N. Fourier series is a good tool to synthesize functions with finite energy by harmonic functions in respect of engineering.
Extraction of the transcriptional regulatory function Gi(k)
Since Gi(k) has been decomposed, we combine equations (1) and (2) to get the following dynamic model equation for the expression profile of the i-th gene,
(3) |
In the above dynamic model equation, the parameters ai, bi, αn, and βn should be estimated by the time profile of expression data of the i-th gene in linear scale, i.e. these parameters should be specified so that the simulating output Xi(k) of the dynamic model in equation (3) should match the expression profile of the i-th gene. The maximum likelihood estimation method is employed to estimate these parameters ai, bi, αn, and βn in equation (3) from the expression profile Xi(k) in the section Methods.
After the parameters αn and βn of the regulatory function Gi(k) have been estimated in the section Methods, we can present the regulation detection Ĝi(k)as follows,
(4) |
where α̂n and β̂n are the estimates of αn and βn, respectively.
We know that the input transcriptional regulatory function Gi(k) of the target gene of Hela cell cycle is often relative to the bindings of transcription factors or some interactions from the upstream regulators. In the next step, we will trace back the corresponding regulatory genes from the input regulatory function Ĝi(k) of the target gene.
Iterative algorithm for constructing gene regulatory network
In biology, the specific biochemical reactions are usually relative to the concentration of specific products. For this purpose, we describe the regulatory function as the following sigmoid function to describe the binding and nonbinding of transcription factors to motif binding sites (Chen et al. 2004; Klipp et al. 2005)
(5) |
where γ is the transition rate and Mj is the mean expression of the j-th regulatory gene’s profile Xj(k).
We determine Ri regulatory genes whose regulatory signals Xj(k) j = 1, … , R are the most correlative to the target gene profile Xi(k) of the i-th target gene. Then, we could reconstruct the gene regulatory network by tracing back the upstream regulators from the extracted regulatory function Ĝi (k), which are contributed by Ri regulatory genes, via the following biochemical kinetic relationship,
(6) |
where cij is the pathway kinetic parameters from the regulatory gene j to target gene i, Ri represents the number of the searched upstream regulatory genes selected by the absolute value of correlation coefficient between the target gene expression profile and the regulatory gene expression profile which is more than 0.8 based on the 95% confidence of normalized correlation coefficients of expression profiles of total cell cycle-related genes in Whitfield et al. 2002, the constant ci0 is the basal level denoting the regulatory function other than upstream regulatory genes, for example, due to post-transcriptional regulation, and ei(k) is the error or the noise of the network model.
Using the maximum likelihood algorithm in Method to estimate the parameters ci0 and cij from Ĝi(k) and Xi(k), the regulation from the upstream regulators is identified as
(7) |
By combining equation (1) and the above equation, the dynamics of transcriptional regulatory network of the Hela cell cycle can be represented by the following identified difference equation,
(8) |
where i = 1, 2, … for all profile target genes in Hela cell cycle.
In fact, equation (8) contains much information for exploring the regulatory network of each specific target gene of the Hela cell cycle. The regulatory genes, which belong to a specific set, Ri, represent the potential upstream regulators for target gene i. The estimated chemical kinetic parameter, ĉij, characterizes the type and intensity of the influence of the jth regulatory gene on the ith target gene, in which positive sign indicates activation and a negative sign indicates repression, and the magnitude is defined as the regulatory ability. After the regulatory pathways of the ith target gene is constructed by tracing back their upstream regulatory genes, these Ri upstream regulators are considered as target genes again to trace back their upstream regulatory genes. Iteratively, we can construct the whole gene regulatory network of the Hela cell cycle globally. The goal of reverse engineering gene regulatory network is to deduce the possible set of regulators and to identify their associated regulation abilities by the available data set from the dynamic system perspective. For this purpose, we devise a novel algorithm based on the dynamic gene expression model for searching possible upstream regulators and then identifying the relevant regulatory abilities ĉij according to equation (8).
Results
Data processing and analysis
Data were extracted by superimposing a grid over each array using GenePix 3.0 software (Axon Instruments). Spots of poor quality, determined by visual inspection, were removed from further analysis. Data of HeLa cell collected for each array were stored in the Stanford Microarray Database (SMD) and are available from SMD at http://genome-www5.stanford.edu/ (Sherlock et al. 2001; Whitfield et al. 2002).
We combine 775 cell cycle-related genes from the Human expression of Hela cell cycle-regulated genes according to the classification by Whitfield et al. (2002) with Human expression of cell cycle-regulated genes according to the traditional classification as the target genes. After that, we select 64 transcription factors (Table 1) from the 775 cell cycle genes (Whitfield et al. 2002).
The raw data were transformed into a linear scale from the original log ratio and applied to our approach. Following the dynamic model in equation (8), the parameters which characterize the dynamic regulatory mechanism are estimated successfully for each target gene in the pathway. Fig. 1 compares the simulation results of the dynamic expression model in equation (8) with the experimental expression profiles for some important cell cycle-related genes regulated by E2F family (Bracken et al. 2004), such as CDC25A (Stanelle et al. 2002; Muller and Helin, 2000; Ren et al. 2002), MCM6 (Ren et al. 2002; Polager et al. 2002; Ishida et al. 2001), E2F1 (Bracken, 2004), CDC6 (Stanelle et al. 2002; Ren et al. 2002), E2F2 (Ren et al. 2002; Muller et al. 2001), MCM5 (Ren et al. 2002; Ishida et al. 2001), MCM4 (Ren et al. 2002; Ishida et al. 2001), PCNA (Muller and Helin, 2000; Ren et al. 2002; Polager et al. 2002; Ishida et al. 2001), RFC4 (Ren et al. 2002; Polager et al. 2002), and DHFR (Ishida et al. 2001). The extracted regulatory functions, Ĝi(k) of these genes which are estimated by the maximum likelihood algorithm in Methods from their expression profiles are shown in Fig. 2. The extracted regulatory function Ĝi(k) in Fig. 2 are employed to estimate the kinetic parameters of gene regulatory network in equation (8) by the parameter estimation scheme in Methods. Our iterative algorithm can find the most likely regulatory genes that may participate in the expression program of Hela cell cycle genes.
Inference of the regulatory pathway
For illustrations, the inferring strategy is applied to the E2F target genes (Bracken et al. 2004) in Hela cell cycle pathways to recognize their upstream regulatory genes. E2F transcription factors are well studied owing to their importance in both cell cycle (Muller and Helin, 2000; Nevins, 2001). Their regulatory abilities are shown in the upstream regulatory functions Ĝi(k) of dynamic equation in Table 2. Parameters of regulatory functions Ĝi(k) in Table 2 represent the regulatory abilities and sensitivities of the relative transcription factors. It is very exciting that E2F1 and E2F2 are found to be active regulators in most E2F target genes listed in Table 2, which agree very well with the previous results (Cam and Dynlacht, 2003; Ivey-Hoyle et al. 1993; Lang et al. 2001). The regulatory abilities of the related regulators implying different degrees of influence are converted to the red-colored lines as positive regulations (activations) and the black-colored lines as negative regulations (inhibitions) for each target gene. Then, based on the dynamic regulatory equations in Table 2 (see detail in Supplementary Table S1), the pathways of E2F target gene in Hela cell cycle regulatory system are described in Fig. 3. The coefficients of these dynamic regulatory equations represent the kinetic activities of regulatory genes. If a regulatory gene is with a large kinetic parameter in the dynamic regulatory equation, it will play an important role in Hela cell cycle and is more sensitive to the gene expression of target gene.
Table 2. Upstream regulatory TFs and their regulatory function Ĝ(k) on E2F target genes in cancer cell cycle.
Based on the dynamic regulatory modeling, the 152 E2F target genes are found and shown in Table 3. In these 152 E2F target genes, 6 genes match the E2F target genes found by Elkon et al. (2003) from 124 E2F target genes edited by Ren et al. (2002) and 17 genes match the E2F target genes found by Elkon et al. (2003) from 872 periodic genes edited by Whitfield et al. (2002). Ren et al. (2002) has found 124 E2F target genes with E2F binding promoter. The comparisons of these results are shown in Fig. 9.
Table 3. The 152 E2F target genes in the gene regulatory network based on the dynamic regulatory modeling.
BAIAP2 | TOP3A | ACYP1 | BIVM | ADCK2 | OGT | C20orf55 |
SSR3 | MED31 | CDC6*# | FLJ10154 | FANCG | PILRB | MET |
SLBP# | NASP | ADCY6 | MGC15716 | APEX2 | FLJ11021 | NR5A2 |
MNT | CDC25A*# | ANKRD10 | ABCA7 | LOC197336 | ASF1B | CDKN2D |
LOC221955 | UNG*# | BARD1 | FLJ13912 | RBBP8 | SFRS5# | CENPE |
LOC90110 | MGC3207 | E2F2* | Pfs2 | MCM8 | C22orf18 | USP16 |
FBXL20 | UBQLN2 | MCM5*# | EGFL5 | C20orf111 | LOC58486 | ARHGAP8 |
ANKRD25 | RAMP | CCRK | NKTR | ABCC5 | MAP3K2 | PSEN1 |
KIAA1529 | MCM6*# | TREX1 | KIAA0092 | MLF1IP | EZH2 | C6 |
PTOP | DMTF1# | RAB23# | LOC200895 | FANCA | CENTB5 | USP6NL |
TRIM26 | KIAA0738 | FLJ10618 | HSPB8 | DNAJC6 | HIST3H2A | LOC51334 |
ZMYND19 | PNN | OACT1 | C14orf130 | RAD18 | BRCA1 | AKAP13 |
CDCA7 | MAP2K6 | ORC1L# | KIAA1586 | SP1 | FLJ35740 | AGPAT3 |
CASP8AP2# | GMNN | MCM4# | HELLS | USP1 | HSPC150 | ASAM |
PLCXD1 | LOC283596 | MSH2 | CLSPN | MYCBP2 | ORC3L | RAB3A |
ZNF367 | CASP2 | LCHN | RFC4 | TOPBP1# | DNAJB4 | HIST2H2BE |
RNP | C9orf42 | FLJ20280 | DONSON | DHFR | RRM1 | HIST2H2AA |
USP53 | E2F1# | PCNA | AP4B1 | DKFZP434B168# | RHOBTB3 | H2AFY |
SKP2 | CCNE2# | MBD4 | MGC2610 | DLNB14 | KIAA0841 | H3F3A |
PDXP | SCML1 | TLOC1 | ATAD2 | RFC2 | NFE2L2 | HIST1H3D |
FLJ20530 | SERPINB3 | SDC1 | RPA2# | NSUN3 | BBS2 | |
PANK2 | UHRF1 | PRPS2 | FEN1 | FKBP6 | FANCD2 |
E2F target genes found by Elkon et al. (2003) from 124 E2F target genes edited by Ren et al. (2002).
E2F target genes found by Elkon et al. (2003) from 872 periodic genes edited by Whitfield et al. (2002).
After finishing the construction of the first layer network of E2F target genes (Bracken et al. 2004), we take these upstream regulators as target genes. By using similar method, we construct the upstream regulatory genes of these target genes. In the second layer network, the regulatory abilities implying different degrees of influence are converted into pink-colored lines as positive regulations (activations) and blue-colored lines as negative regulations (inhibitions). Then, we combine the first layer network and the second layer network together to form a more complete network to E2F target genes in HeLa cell cycle as described in Fig. 4. Iteratively, we can construct the higher layer network to complete the gene regulatory network of Hela cell cycle.
Discussion
The losses of cellular regulation give rise to most cases of cancer. The transcription factors are crucial for regulating cell cycle progression and may be related to the development of a cancer. Therefore, to understand these gene regulatory processes, we need to unravel the regulatory mechanisms of these transcription factors in cell cycle. Our study presents a systematically iterative approach to discern and characterize the transcriptional regulatory network of 775 cell cycle-related genes from the raw expression profiles of Hela cell (Whitfield et al. 2002). Because the transcriptional regulatory network of 775 cell cycle-related genes is very complicated, two miniature gene regulatory networks of E2F family during G1/S phase in Fig. 4 and the other family during G2/M in Fig. 8 are given to illustrate the regulatory mechanism of Hela cell.
Our approach also offers the following advantages. First, based on the dynamic regulatory model, a gene regulatory network of cancer cell could be constructed by the extracted upstream regulatory function through microarray data. Then, the identified regulatory ability for each specific regulator could evaluate the contribution of this regulator; the positive sign stands for activation and the negative sign stands for repression, and the magnitude represents the significance. These advantages of the proposed approach will improve the analysis to cope with rapidly growing microarray data of human. BRCA1 is one of the important cell cycle-related genes to play as a transcriptional repressor in cell cycle progress (Kennedy et al. 2005). This finding matches the gene regulatory network constructed by dynamic regulatory model (shown in Fig. 4 and Fig. 5). It is clear that E2F regulates the expression of a host of factors that function during G1/S transition and S phase even the whole cell cycle (Bracken et al. 2004). E2F is best known for its role in regulating the transcription of genes that positively affect cell cycle progression (Ortega et al. 2002; Sherr and Roberts, 1999; Di et al. 2003; Stott et al. 1998; Trimarchi and Lees, 2002; Hayashi et al. 2006).
Our results almost match this finding (shown in Fig. 4 and Fig. 5). Hayashi with his collaborator found that E2F1 activates human MCM8. In our study, we found that E2F1 activates human MCM5 and MCM6 (shown in Fig. 4 and Fig. 5). Elkon et al. (2003) and Ren et al. (2002) have found that human MCM5 and MCM6 are two E2F target genes (shown in Table 3). As a result of these studies, we infer that human MCM5 and MCM6 may be positively regulated directly by E2F1 in Hela cell cycle. Accumulating evidence indicates that cdc25A possesses oncogenic properties. Recently, overexpression of cdc25A was found in many breast, head and neck cancers (Wu et al. 1998). CDC6 plays a critical role in the regulation of the onset of DNA replication in eukaryotic cells and Cdc6 expression is down-regulated in prostate cancer (Robles et al. 2002). From the results of Table 3, we could infer that E2F directly regulates MCM4, MCM5, CDC6, CDC25A, UNG and E2F2 in Hela cell cycle progression. In Fig. 5, we represent some E2F target genes and show the importance of E2F transcription factor in Hela cell cycle progression. In Fig. 7, we also predict the probable transcription regulations in some target genes, which express in G2/M phase of human cell cycle as shown in the regulatory network of Fig. 8. Further, we can construct not only E2F related regulatory network but also the whole Hela cell cycle network if we have genome-wide microarray data and CHIP data. However, at present, we still need to get enough evidence and CHIP experiments in the same cellular system to confirm our gene regulatory network of Hela cell cycle (Bracken et al. 2004).
Based on the dynamic regulatory modeling, the 152 E2F target genes are found and shown in Table 3. These target genes may be regulated by E2F directly or indirectly. In these 152 E2F target genes, 6 genes match the E2F target genes found by Elkon et al. (2003) from 124 E2F target genes with E2F promoter edited by Ren et al. (2002) and 17 genes match the E2F target genes found by Elkon et al. (2003) from 872 periodic genes in Hela cell cycle edited by Whitfield et al. (2002).
Finally, in order to validate the proposed approach, an independent validation is also given by randomly reshuffling the time order of micro-array experiment but with the same choices of target gene and regulatory genes, to confirm the reliability of the proposed method as shown in Fig. 6. As previous statement, BRCA1 plays as a transcriptional repressor in cell cycle progress (Kennedy et al. 2005) (shown in Fig. 5). From the shuffling results shown in Fig. 6, BRCA1 becomes a transcriptional activator. It is clearly seen that the proposed Hela cell cycle regulatory pathway in Fig. 5 is destroyed by reshuffling the experimental data.
Methods
Maximum likelihood Estimation of ai, bi, αn and βn
The dynamic equation (3) must match the expression profile at all time points and then is arranged in a vector difference form. Consequently, the vector dynamic form of this equation is applied to m time points of expression profile in order to make the dynamic model work.
(M.1) |
where
Next, we translate equation (M.1) into a matrix form,
(M.2) |
where Φi = [ai bi α0 β0 ··· αN βN]T and Ei = ɛim are in vector forms, and Ai = [−Xim−1 − X im−2 cos(0) sin(0) ··· cos (N) sin (N)]T is a matrix.
Then we use the maximum likelihood estimation to derive the optimal parameters estimation of Φ̂i (Johansson, 1993).
We assume that each element in the error vectors, ɛi(k),k = {3, ... , m}, is an independent random variable with a normal distribution with zero mean and variance σ2, and we will estimate the parameter Φ̂i , by the maximum likelihood method.
(M.3) |
The log-likelihood function for given m data points is then described by–
(M.4) |
Here we expect the log-likelihood function to have the maximum at Φ = Φ̂ and σ2 = σ̂ 2. The necessary condition for maximum likelihood estimates Φ̂ and σ̂2 as follows (Johansson, 1993).
(M.5) |
The estimated parameters Φ̂ and σ̂2 are shown below.
(M.6) |
(M.7) |
Theoretically, Ei is just the noise of the gene expression profile of the microarray chips, but some modeling errors and approximation errors in equation (2) are also involved in Ei. So that taking the modeling error and approximation error in our consideration makes our dynamic model equation more approach the actual situation. The number of Fourier series N is determined by tradeoff between the computational complexity of parameter estimation in equation (M.6) and the accuracy of approximation in equation (2). According to our expression data, we choose N = 16 to make the synthesis of these harmonics be the best approximation to the expression data.
Parameter Estimation of ci0 and cij
To estimate the pathway kinetic parameters cij in equation (6) by Ĝi(k) and X̃i(k) with m time points of upstream regulatory expression profile, equation (6) is represented with algebraic form as follows,
(M.8) |
where
We assume that each element in the error vectors, ei(k), k = {3,..., m}, is an independent random variable with a normal distribution with zero mean and variance σei2. Then by the similar procedure, the estimated parameters Ω̂ andσei2 are shown below.
(M.9) |
(M.10) |
Supplementary Material
Acknowledgments
We thank the National Science Council, Taiwan for grants NSC 95-2627-B-007-011.
References
- Badawi RA, Birns J, Watson T, et al. Growth factors and their relationship to neoplastic and paraneoplastic disease. Eur. J. Intern. Med. 2005;16:83–94. doi: 10.1016/j.ejim.2004.10.011. [DOI] [PubMed] [Google Scholar]
- Bracken AP, Ciro M, Cocito A, et al. E2F target genes: unraveling the biology. Trends Biochem. Sci. 2004;29:409–17. doi: 10.1016/j.tibs.2004.06.006. [DOI] [PubMed] [Google Scholar]
- Banerjee-Basu S, Baxevanis AD. Molecular evolution of the homeodomain family of transcription factors. Nucleic Acids Res. 2001;29(15):3258–69. doi: 10.1093/nar/29.15.3258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cam H, Dynlacht BD. Emerging roles for E2F: beyond the G1/S transition and DNA replication. Cancer Cell. 2003;3:311–6. doi: 10.1016/s1535-6108(03)00080-1. [DOI] [PubMed] [Google Scholar]
- Chen HC, Lee HC, Lin TY, et al. Quantitative characterization of the transcriptional regulatory network in the yeast cell cycle. Bioinformatics. 2004;20:1914–27. doi: 10.1093/bioinformatics/bth178. [DOI] [PubMed] [Google Scholar]
- Chen H, Rossier C, Nakamura Y, et al. Cloning of a novel homeobox-containing gene, PKNOX1, and mapping to human chromosome 21q22.3. Genomics. 1997;41(2):193–200. doi: 10.1006/geno.1997.4632. [DOI] [PubMed] [Google Scholar]
- Cho RJ, Campbell MJ, Winzeler EA, et al. A genome-wide transcriptional analysis of the mitotic cell cycle. Mol. Cell. 1998;2:65–73. doi: 10.1016/s1097-2765(00)80114-8. [DOI] [PubMed] [Google Scholar]
- Cho RJ, Huang M, Campbell MJ, et al. Transcriptional regulation and function during the human cell cycle. Nat. Genet. 2001;27:48–54. doi: 10.1038/83751. [DOI] [PubMed] [Google Scholar]
- Di Stefano L, Jensen MR, Helin K. E2F7, a novel E2F featuring DP-independent repression of a subset of E2F-regulated genes. EMBO. J. 2003;22:6289–98. doi: 10.1093/emboj/cdg613. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dominguez A, Ramos-Morales F, Romero F, et al. hpttg, a human homologue of rat pttg, is overexpressed in hematopoietic neoplasms. Evidence for a transcriptional activation function of hPTTG. Oncogene. 1998;17(17):2187–93. doi: 10.1038/sj.onc.1202140. [DOI] [PubMed] [Google Scholar]
- Filippova GN, Fagerlie S, Klenova EM, et al. An exceptionally conserved transcriptional repressor, CTCF, employs different combinations of zinc fingers to bind diverged promoter sequences of avian and mammalian c-myc oncogenes. Mol. Cell. Biol. 1996;16(6):2802–13. doi: 10.1128/mcb.16.6.2802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fuentes JJ, Pritchard MA, Planas AM, et al. A new human gene from the Down syndrome critical region encodes a proline-rich protein highly expressed in fetal brain and heart. Hum. Mol. Genet. 1995;4(10):1935–44. doi: 10.1093/hmg/4.10.1935. [DOI] [PubMed] [Google Scholar]
- Hayashi R, Goto Y, Haga A, et al. Comparative genomics on MCM8 orthologous genes reveals the transcriptional regulation by transcription factor E2F. Gene. 2006;367:126–34. doi: 10.1016/j.gene.2005.10.002. [DOI] [PubMed] [Google Scholar]
- Hinchcliffe EH, Sluder G. “It takes two to tango”: understanding how centrosome duplication is regulated throughout the cell cycle. Genes Dev. 2001;15:1167–81. doi: 10.1101/gad.894001. [DOI] [PubMed] [Google Scholar]
- Hsieh JJ, Zhou S, Chen L, Young DB, Hayward SD. CIR, a corepressor linking the DNA binding factor CBF1 to the histone deacetylase complex. Proc. Natl. Acad. Sci. U.S.A. 1999;96(1):23–8. 0. doi: 10.1073/pnas.96.1.23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Imataka H, Sogawa K, Yasumoto K, et al. Two regulatory proteins that bind to the basic transcription element (BTE), a GC box sequence in the promoter region of the rat P-4501A1 gene. EMBO. J. 1992;11(10):3663–71. doi: 10.1002/j.1460-2075.1992.tb05451.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ishida S, Huang E, Zuzan H, et al. Role for E2F in control of both DNA replication and mitotic functions as revealed from DNA microarray analysis. Mol. Cell. Biol. 2001;21:4684–99. doi: 10.1128/MCB.21.14.4684-4699.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ivey-Hoyle M, Conroy R, Huber HE, et al. Cloning and characterization of E2F-2, a novel protein with the biochemical properties of transcription factor E2F. Mol. Cell. Biol. 1993;13:7802–12. doi: 10.1128/mcb.13.12.7802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johansson R. System modeling and identification . Prentice-Hall; 1993. pp. 113–9. [Google Scholar]
- Kennedy RD, Gorski JJ, Quinn JE, et al. BRCA1 and c-Myc associate to transcriptionally repress psoriasin, a DNA damage-inducible gene. Cancer Res. 2005;65(22):10265–72. doi: 10.1158/0008-5472.CAN-05-1841. [DOI] [PubMed] [Google Scholar]
- Klipp E, Herwig R, Kowald A, et al. Wiley-VCH; 2005. Systems Biology in Practice: Concepts, Implementation and Application . [Google Scholar]
- Kreyszig R. Advanced Engineering Mathematics. 7th ed . John Wiley; New York: 1993. [Google Scholar]
- Knebel A, Morrice N, Cohen P. A novel method to identify protein kinase substrates: eEF2 kinase is phosphorylated and inhibited by SAPK4/p38delta. EMBO. J. 2001;20(16):4360–9. doi: 10.1093/emboj/20.16.4360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lang SE, McMahon SB, Cole MD, et al. E2F transcriptional activation requires TRRAP and GCN5 cofactors. J. Biol. Chem. 2001;276:32627–34. doi: 10.1074/jbc.M102067200. [DOI] [PubMed] [Google Scholar]
- Law DJ, Du M, Law GL, et al. ZBP-99 defines a conserved family of transcription factors and regulates ornithine decarboxylase gene expression. Biochem. Biophys. Res. Commun. 1999;262(1):113–20. doi: 10.1006/bbrc.1999.1180. [DOI] [PubMed] [Google Scholar]
- Liu ET. Systems Biology, Integrative Biology, Predictive Biology. Cell. 2005;121:505–6. doi: 10.1016/j.cell.2005.04.021. [DOI] [PubMed] [Google Scholar]
- Meroni G, Reymond A, Alcalay M, et al. Rox, a novel bHLHZip protein expressed in quiescent cells that heterodimerizes with Max, binds a non-canonical E box and acts as a transcriptional repressor. EMBO J. 1997;16(10):2892–906. doi: 10.1093/emboj/16.10.2892. Erratum in: EMBO. J 16(19):6055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Monden T, Wondisford FE, Hollenberg AN. Isolation and characterization of a novel ligand-dependent thyroid hormone receptor-coactivating protein. J. Biol. Chem. 1997;272(47):29834–41. doi: 10.1074/jbc.272.47.29834. [DOI] [PubMed] [Google Scholar]
- Muller H, Helin K. The E2F transcription factors: key regulators of cell proliferation. Biochim. Biophys. Acta. 2000;1470:M1–12. doi: 10.1016/s0304-419x(99)00030-x. [DOI] [PubMed] [Google Scholar]
- Muller H, Bracken AP, Vernell R, et al. E2Fs regulate the expression of genes involved in differentiation, development, proliferation, and apoptosis. Genes Dev. 2001;15:267–85. doi: 10.1101/gad.864201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nevins JR. The Rb/E2F pathway and cancer. Hum. Mol. Genet. 2001;10:699–703. doi: 10.1093/hmg/10.7.699. [DOI] [PubMed] [Google Scholar]
- Nurse P. A long twentieth century of the cell cycle and beyond. Cell. 2000;100:71–8. doi: 10.1016/s0092-8674(00)81684-0. [DOI] [PubMed] [Google Scholar]
- Ortega S, Malumbres M, Barbacid M. Cyclin D-dependent kinases, INK4 inhibitors and cancer. Biochim. Biophys. Acta. 2002;1602:73–87. doi: 10.1016/s0304-419x(02)00037-9. [DOI] [PubMed] [Google Scholar]
- Pahl PM, Hodges YK, Meltesen L, et al. ZNF207, a ubiquitously expressed zinc finger gene on chromosome 6p21.3. Genomics. 1998;53(3):410–2. doi: 10.1006/geno.1998.5442. [DOI] [PubMed] [Google Scholar]
- Polager S, Kalma Y, Berkovich E, et al. E2Fs up-regulate expression of genes involved in DNA replication, DNA repair and mitosis. Oncogene. 2002;21:437–446. doi: 10.1038/sj.onc.1205102. [DOI] [PubMed] [Google Scholar]
- Ratziu V, Lalazar A, Wong L, et al. Zf9, a Kruppel-like transcription factor up-regulated in vivo during early hepatic fibrosis. Proc. Natl. Acad. Sci. U.S.A. 1998;95(16):9500–5. doi: 10.1073/pnas.95.16.9500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reichman TW, Muniz LC, Mathews MB. The RNA binding protein nuclear factor 90 functions as both a positive and negative regulator of gene expression in mammalian cells. Mol. Cell. Biol. 2002;22(1):343–56. doi: 10.1128/MCB.22.1.343-356.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ren B, Cam H, Takahashi Y, et al. E2F integrates cell cycle progression with DNA repair, replication, and G(2)/M checkpoints. Genes Dev. 2002;16:245–56. doi: 10.1101/gad.949802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robles LD, Frost AR, Davila M, et al. Down-regulation of Cdc6, a cell cycle regulatory gene, in prostate cancer. J. Biol. Chem. 2002;277:25431–8. doi: 10.1074/jbc.M201199200. [DOI] [PubMed] [Google Scholar]
- Shah JV, Cleveland DW. Waiting for anaphase: Mad2 and the spindle assembly checkpoint. Cell. 2000;103:997–1000. doi: 10.1016/s0092-8674(00)00202-6. [DOI] [PubMed] [Google Scholar]
- Sherlock G, Hernandez-Boussard T, Kasarskis A, et al. The Stanford Microarray Database. Nucleic Acids Res. 2001;29:152–5. doi: 10.1093/nar/29.1.152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sherr CJ, Roberts JM. CDK inhibitors: positive and negative regulators of G1-phase progression. Genes Dev. 1999;13:1501–12. doi: 10.1101/gad.13.12.1501. [DOI] [PubMed] [Google Scholar]
- Shiekhattar R, Mermelstein F, Fisher RP, et al. Cdk-activating kinase complex is a component of human transcription factor TFIIH. Nature. 1995;374(6519):283–7. doi: 10.1038/374283a0. [DOI] [PubMed] [Google Scholar]
- Shirakawa H, Yoshida M. Structure of a gene coding for human HMG2 protein. J. Biol. Chem. 1992;267(10):6641–5. [PubMed] [Google Scholar]
- Stanelle J, Stiewe T, Theseling CC, et al. Gene expression changes in response to E2F1 activation. Nucleic Acids Res. 2002;30:1859–67. doi: 10.1093/nar/30.8.1859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stillman B. Cell cycle control of DNA replication. Science. 1996;274:1659–1664. doi: 10.1126/science.274.5293.1659. [DOI] [PubMed] [Google Scholar]
- Stott FJ, Bates S, James MC, et al. The alternative product from the human CDKN2A locus, p14(ARF), participates in a regulatory feedback loop with p53 and MDM2. EMBO. J. 1998;17:5001–5014. doi: 10.1093/emboj/17.17.5001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takahashi S, Wakui H, Gustafsson JA, et al. Functional interaction of the immunosuppressant mizoribine with the 14-3-3 protein. Biochem. Biophys. Res. Commun. 2000;274(1):87–92. doi: 10.1006/bbrc.2000.3104. [DOI] [PubMed] [Google Scholar]
- Trimarchi JM, Lees JA. Sibling rivalry in the E2F family. Nat. Rev. Mol. Cell. Biol. 2002;3(1):11–20. doi: 10.1038/nrm714. [DOI] [PubMed] [Google Scholar]
- Valsesia-Wittmann S, Magdeleine M, Dupasquier S, et al. Oncogenic cooperation between H-Twist and N-Myc overrides failsafe programs in cancer cells. Cancer Cell. 2004;6:625–30. doi: 10.1016/j.ccr.2004.09.033. [DOI] [PubMed] [Google Scholar]
- Whitfield ML, Sherlock G, Saldanha AJ, et al. Identification of genes periodically expressed in the human cell cycle and their expression in tumors. Mol. Biol. Cell. 2002;13:1977–2000. doi: 10.1091/mbc.02-02-0030.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu W, Fan YH, Kemp BL, et al. Overexpression of cdc25A and cdc25B is frequent in primary non-small cell lung cancer but is not associated with overexpression of c-myc. Cancer Res. 1998;58:4082–5. [PubMed] [Google Scholar]
- Wysocka J, Myers MP, Laherty CD, et al. Human Sin3 deacetylase and trithorax-related Set1/Ash2 histone H3–K4 methyl-transferase are tethered together selectively by the cell-proliferation factor HCF-1. Genes Dev. 2003;17(7):896–911. doi: 10.1101/gad.252103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yuh CH, Davidson EH. Modular Cis-Regulatory Organization of Endo16, a Gut-Specific Gene of the Sea Urchin Embryo. Development. 1996;122:1069–1082. doi: 10.1242/dev.122.4.1069. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.