Abstract
The cell cycle coordinates core functions such as replication and cell division. However, cell-cycle-regulated transcription in the control of non-core functions, such as cell identity maintenance through specific transcription factors (TFs) and signalling pathways remains unclear. Here, we provide a resource consisting of mapped transcriptomes in unsynchronized HeLa and U2OS cancer cells sorted for cell cycle phase by Fucci reporter expression. We developed a novel algorithm for data analysis that enables efficient visualization and data comparisons and identified cell cycle synchronization of Notch signalling and TFs associated with development. Furthermore, the cell cycle synchronizes with the circadian clock, providing a possible link between developmental transcriptional networks and the cell cycle. In conclusion we find that cell cycle synchronized transcriptional patterns are temporally compartmentalized and more complex than previously anticipated, involving genes, which control cell identity and development.
Introduction
The cell cycle coordinates a series of changes that result in the initiation of specific core functions at different cell cycle stages, supporting, for example, DNA replication, quality control and cell division. One level of control in this process is exercised by feed-forward and feedback loops of posttranslational modifications and protein degradation. Another level of control is maintained via regulated transcription. The transcriptional changes that occur during the cell cycle in mammalian cells are associated with cell cycle transition points: G1-to-S, G2-to-M and M-to-G1[1]. By far, the most well studied point of transcriptional control is the G1-to-S transition, where S phase transcription is activated by E2F1-3, members of the E2F family of transcription factors (TFs), after they are released from the hyperphosphorylated RB protein [1–3] and Cyclin/CDK complexes. This activation is followed by transcriptional repression later in the S phase by E2F4-8 and pocket proteins p107 and p130 in late S phase [1, 4].
In addition to controlling basic functions such as replication and cell division, the cell cycle also affects the maintenance of and changes in cell identity and specification during development. For example, in human embryonic stem cells, Cyclin Ds control TGF-β signalling and the activation status of SMAD2/3, rendering the cell susceptible to differentiation along different lineages in early versus late G1 phase [5]. Likewise, in neural progenitor cells, Notch signalling integrates with the cell cycle to impact neural lineage specification [6, 7]. Further adding to this complex view of the cell cycle, WNT/Frizzled/β-catenin signalling shows a peak in G2 in some cells [8, 9]. TFs that oscillate during the cell cycle may interact with each other in transcriptional networks that can facilitate a series of successive activation or repression steps [1, 10, 11]. This might result in a domino-like effect, feeding information forward through the cell cycle, potentially with some degree of independence. Indeed, studies in yeast cells have shown that up to 20% of the TFs that oscillate with the cell cycle are non-essential TFs [12], and these genes may become self-sustained in their oscillatory behaviour in the absence of the cyclin/CDK-based cell cycle [13]. Such TF networks may thus constitute semi-autonomous transcriptional networks that propagate information through the cell cycle. Adding another layer of control, the circadian molecular clock is an independent transcriptional oscillator that reciprocally synchronizes with the cell cycle [14–17].
To map both core and cell-type-specific cell cycle-dependent transcription, we examined the HeLa cell cycle transcriptome and compared it to that of U2OS cells using RNA sequencing of cells sorted according to cell cycle without chemical synchronization. To efficiently map cell cycle oscillators and categorize distinct cell cycle phase patterns, we developed a novel algorithm, TriComp. This algorithm calculates two comprehensible variables that describe complex relationships between three quantitative variables, i.e., levels of expression in three cell cycle phases. Moreover, it allows for straightforward comparisons between different types of data sets. Patterns identified by the TriComp algorithm were superimposed onto established TF networks to reveal the potential temporal succession of gene expression, reflecting activation/repression hierarchies involved in cell cycle oscillation.
Results
Fucci-based separation of cell cycle phases and deep RNA sequencing
Previous analyses of cell cycle-dependent transcription have in most cases used chemically synchronized cells [18, 19]. Live cell cycle phase reporters such as the Fucci system [20] provide an alternative method that has been explored in recent analyses of the cell cycle in, for example, embryonic stem cells [5, 21, 22]. To characterize the relationship of the Fucci probes to the mitotic cell cycle, we analyzed chromatin content and DNA replication status and visualized it overlaid on Fucci reporter fluorescence, Panel A in S1 Fig. This experiment establishes that the majority of actively replicating cells are mAG-hGem+, mKO2-hCdt1+. We then used Fucci reporters to sort live, unsynchronized cells into discrete populations corresponding to non-replicating 1X-chromatin cells (G1-phase), actively replicating cells (S-phase) and non-replicating 2X-chromatin cells (G2/M-phase) by flow cytometry (Fig 1A and 1B, S1 Table). Control samples of sorted populations were fixed and analysed for cell cycle phase based on DNA content, demonstrating the correlation between Fucci-based and DNA content-based cell cycle phase determination (Panels A,B in S2 Fig). Three cell cycle phase populations from two cell lines, HeLa-Fucci and U2OS-Fucci, were analysed for gene expression by RNA sequencing. The data were processed by an ANOVA-like differential gene expression approach from the edgeR package [23]. Stringency cut-offs were used for the calculated false discovery rate (FDR), average gene expression (logCPM) and maximal fold change (FC) for each gene (Panels C,D in S2 Fig, S2 Table). In this report, we focused on transcript levels and disregarded splice variants. However, the information on splice variants is available for reuse in GEO:ID(PLACEHOLDER: To be submitted). Of the transcripts with measurable mRNA levels, approximately one-third significantly oscillated in synchrony with the cell cycle (FDR≤0.001) (Fig 1C, Panel E in S2 Fig), possibly representing known cell cycle regulated genes as well as unidentified genes expressed in a cell cycle-dependent manner.
A new algorithm for comparisons between three groups of data
To classify and visualize the expression patterns of genes that are regulated over the three sorted cell cycle phases, we developed a novel algorithm for simultaneous comparisons between three groups of data, TriComp. The algorithm generates an angular coordinate for each gene (θ) that reveals the relationship pattern between the three temporal stages and a radial coordinate (r) that reflects the extent of the relationship pattern based on the FC between the highest and lowest expression values in the cell cycle phases (Fig 1D, S2 Table).
Every possible relationship pattern corresponds to a specific θ-value, but the θ-value does not store any information about the intensity of such a relationship (this is stored in the r-value). For example, the relationship between the three expression values (1 | 2 | 4) and (1 | 10 | 100) which both share the relationship (group two being logarithmically exactly halfway between group 1 and 3) share the same θ-value (240°) with the latter having a much higher r-value. The same principle holds true with the two groups of values; (1 | 1 | 1.01), and (1 | 1 | 14682). Both have a θ-value of 300° but widely different r-values. This highlights the importance of including filtering both on statistical significance and biological significance when analyzing data using θ-values. Using these two coordinates reduces the complexity of the three-group comparisons, enables meaningful categorization and provides informative visualization. Moreover, the θ-value (after filtering based on applied statistics and r-value) can be used for comparisons between experiments, between cell lines, and between different data sets.
To categorize genes with similar expression patterns, the spectrum of relationships was divided into 6 categories based on θ-value (Fig 1D, Panel G in S2 Fig). Each θ-category corresponds to a binary classification of gene expression over the three phases relative to each other. Categories 1, 3 and 5 represent patterns of gene expression that peak in a single cell cycle phase: G1, S or G2/M. Categories 2, 4 and 6 represent a composite pattern where gene expression is high in two consecutive cell cycle phases and low in the third phase, i.e., high in G1+S, S+G2/M and G2/M+G1.
The distribution of transcript expression patterns was visualized by projecting the θ and r coordinates of 1132 transcripts in the HeLa-Fucci transcriptome (FDR≤0.001, logCPM>1, FC≥1.5) in a circular graph (Fig 1E). To inspect this distribution more closely, we plotted the θ-values for all significantly oscillating transcripts in HeLa cells (Fig 1F). This uncovered a pattern of genes with high transcript levels in categories 2 (G1+S) and 5 (G2/M) and low transcript levels in categories 3 (S phase) and 6 (G2/M+G1 phase). These results suggest that the expression of a large number of transcripts is controlled throughout the cell cycle.
We validated the TriComp analysis of our sequencing data by correlating the θ-values with Gene Ontology terms associated with the cell cycle (S3 Table). GO terms associated with G1/S transition were found in θ-categories 2 and 3, GO terms involved in DNA replication were restricted to θ-categories 2 and 3, and GO terms related to G2/M transition and mitosis contained genes from θ-categories 5 and 6 (Fig 1G). A visual alignment of θ-values for recognized cell cycle phase markers verified the expected order of expression, in which the transcripts for Cyclin D1/2/3 and Cyclin E1/2 were succeeded by those for Cyclin A1/A2, B1/B2, Aurora kinase A and Polo-like kinase 1 (Fig 1H). These results show that the TriComp algorithm enables straightforward comparisons between the three cell cycle phases and reveals the temporal succession of cell cycle regulated gene expression.
Validation of TriComp analyses and data comparisons
To further validate the TriComp-based analysis, we compared genes oscillating in a cell cycle-dependent manner shared by the HeLa-Fucci and U2OS-Fucci data sets by plotting the θ-value for all significantly oscillating transcripts. The direct comparison showed a clear correlation between the two data sets, as 90% of the genes had a θ-value difference lower than 60 degrees (Fig 2A). Interestingly, the transcriptional pattern observed in the HeLa-Fucci data set (Fig 1F, Fig 2B) was also observed in the U2OS-Fucci data set (Fig 2C), indicating that regardless of cell type, similar transcriptional regulation occurs in a time-specific manner.
The θ-value derived from the TriComp algorithm can be readily used for comparisons between data sets where the temporal descriptor is of another character. To externally validate our data set, the HeLa-Fucci cell cycle data were compared to two previously published cell cycle data sets [18, 19]. These studies used chemically synchronized HeLa cells and sinusoidal curve fitting and pattern matching from multiple microarray time-point analyses to map the oscillations of the transcriptome. After mapping the Whitfield et al. probe results to our transcripts using Gene IDs and BioMart, we compared the peak position of the sinusoidal curve fitting (Arctan2) to the TriComp θ-value π-values (Fig 2D). Out of 723 single-gene unique probes in the Whitfield et al. study, 505 probes were identified to oscillate in our data set as well (FDR≤0.001, FC≥1.1.) (Panel A in S3 Fig).
We next compared our data to Dominguez et al., and of the 1182 genes identified to oscillate by their study, a total of 1078 genes matched gene names in our data set. 589 of these genes significantly oscillated in our data set (at FDR≤0.001, FC≥1.1). Dominguez et al. allocated oscillating genes into 6 categories, and plotting these categories against the TriComp θ-values from our data set showed a similar distribution pattern over the cell cycle (Panel B in S3 Fig). In addition, all of the 67 genes defined by Dominguez et al. as the cell cycle "core set" were found to be statistically significantly oscillating in our data set and the oscillation patterns correspond to our data (Fig 2E).
Together, comparisons to other cell cycle transcriptome experiments showed similar results for transcripts that could be matched in corresponding data sets. Our RNA sequencing identified over 5000 transcripts (including noncoding transcripts) that exhibited cell cycle-dependent oscillatory behaviour (Panel E in S2 Fig), providing a potential opportunity to identify genes with as yet uncovered cell cycle regulatory effects.
Patterns of TF expression during the cell cycle
The pattern observed in Fig 1F indicated extensive transcriptional regulation during the cell cycle. Therefore, we examined the expression patterns of TFs, as regulators of transcription, more closely. To search for potential patterns in TF expression correlating with cell cycle progression, we mapped transcripts coding for TFs in both the HeLa and U2OS data sets by filtering the transcriptome data using a list of known TFs (Animal TFDB (Ref: PMID 22080564)), comprising 1591 genes in total. We identified 390 TFs in HeLa cells and 82 TFs in U2OS cells that significantly oscillated (FDR≤0.001) above the 1.1 FC level (S4 Table). Mapping the θ-values of significantly oscillating TFs in HeLa (Fig 3A) and U2OS (Fig 3B) cells showed a pattern similar to that of all significantly oscillating transcripts in Fig 1F. We next subdivided the transcriptome data into discrete TF subfamilies and analysed the distribution of the θ-values in HeLa (Fig 3C) and U2OS (Fig 3D) cells. Interestingly, apart from the core cell cycle machinery that included TFs like E2F family, other TF families with a high number of oscillating genes included developmental patterning and cell identity genes. The E2F TFs clustered in θ-categories spanning the S phase, while developmental TF families such as Homeobox, Forkhead and zf-C2H2 clustered predominantly in θ-categories 1–2 or 5–6, but not in category 3 (S phase). A STRING[24] analysis of the oscillating TFs in HeLa cells (Panel C in S3 Fig) suggested extensive network integration of a large number of TFs, with nodes such as MYC, TP53, NCOR1, PPARA and SMAD3 appearing to link the cell cycle to TFs associated with neurodevelopment (such as PAX6, ISL1, LHX4 and DLX1) and the circadian clock (ARNTL).
To analyse the phase distribution of the TFs shared between HeLa and U2OS cells, respective θ-values were plotted, showing a high degree of correlation (all shared TFs exhibited a θ-value within 75 degrees of each other) (Fig 3E). At the level of FC≥1.1 and logCPM≥1, we found 39 TFs with significantly oscillating expression that were shared in both data sets. According to θ-values, we could identify six consecutive groups, I-VI, throughout the cell cycle (Fig 3E), indicating that transcriptional regulation follows a pattern of temporal succession. Illustrating this idea, the θ-values for the E2F1 and E2F2 transcripts peaked earlier than those for the E2F7 and E2F8 transcripts (the former clustering with group II and the latter in group IV), creating a known activator/repressor loop (Fig 3F) [1]. Further analysis of E2F1 and its known target genes (using a list of 130 E2F targets from Bracken et al. 2004) identified 46 transcripts (out of 110 gene names found in our data sets) that significantly oscillated in both the HeLa and U2OS data sets. The distribution of E2F targets and non-E2F targets in HeLa or U2OS cells was visualized by plotting the respective θ-value (Fig 4A and 4B) as well as a number of genes in each θ-category (Fig 4C and 4D). E2F target transcripts were present in TriComp θ-categories 2–5, which span the G1/S transition up to G2, and absent in θ-categories 1 (G1) and 6 (G2/M+G1) (except for one gene in the U2OS data). By contrast, numerous non-E2F targets were found in θ-categories 1 (G1) and 6 (G2/M+G1) (Fig 4C and 4D). This difference was statistically significant (p<0.001) and likely reflects G2/M-G1-specific transcriptional control, resulting in temporal compartmentalization and separation of E2F- and non-E2F-dependent transcription.
We further investigated the protein expression of a set of TFs to determine if their oscillatory mRNA behaviour was maintained after translation. Protein expression was examined by immunostaining, with the cell cycle phase determined by Fucci reporters, and by high-throughput image analysis of individual cells. This analysis showed that corresponding protein levels peaked in the same phase or one or two phases later than the mRNA. For example, the mRNA and protein expression of MYBL1 and TGIF1 peaked in the same phase (Fig 4E and 4F), whereas the mRNA level of HMGB2 peaked in G2/M, while its protein level peaked in G1 (Fig 4G). Likewise, the mRNA expression of KDM5B peaked in G2/M, but its protein level was highest in the G1 and S phases (Fig 4H). These results indicate that there is variability in translational control, which is nevertheless maintained in accordance with cell cycle oscillation.
Mapping our data onto known hierarchical relationships of TFs verified that the temporal component of such relationships could be readily identified with TriComp. For example (Fig 4I), the demethylase KDM5B, an upstream activator of E2F1 expression [25], peaked in G1 at the mRNA level and in G1 and S phases at the protein level, similar to the mRNA expression of its target gene E2F1 (Fig 3E). Likewise, the mRNA expression of TFs known to be E2F targets, such as MYBL2 [26] and HMGB2 [27], peaked in S and G2/M phases (Fig 4E and 4G). In summary, oscillations in TF mRNAs may be transformed into oscillations in protein expression and further into the expression of downstream target genes. Therefore, our TriComp analyses can be used as a starting point to construct networks of temporal causality.
Oscillating developmental TFs assemble into transcriptional networks outside of S phase
The analysis of TF distribution in Fig 3C and 3D showed that development-associated TF families such as Homeobox, Forkhead, HMG and zf-Z2H2 had varying patterns different from those of, e.g., the E2F and MYB families (Fig 3C and 3D). Oscillating TFs shared between HeLa and U2OS cells included SOX4, GLI1 and FOXO6 (peaking in G1) and KLF9, KDM5B and PRDM1 (peaking in G2/M-G1) (Fig 3E). These TFs are associated not only with development but also with cancer [28–36]. This observation prompted us to analyse TFs that are known to regulate development, focusing on TFs that were specific to either HeLa or U2OS cells. To this end, individual TFs from a number of TF families that control cell identity and morphogenesis (Forkhead; Homeobox; PAX; bHLH; HMG; CUT) were visualized as polar coordinates (FDR≤0.001, logCPM>1, FC≥1.5), showing a preference for θ-categories 1 (G1) and 2 (G1+S) and a complete absence from θ-category 3 (S) (Fig 5A and 5B). GO enrichment analysis (S5 Table) identified enrichment of neurodevelopmental TFs in HeLa cells (a human cervical carcinoma cell line [37]) (Fig 5C) and mesodermal TFs in U2OS cells (Fig 5D), reflecting the osteosarcoma origin of U2OS [38].
Candidate oscillating developmental programmes were next analysed in more detail by superimposing known hierarchical relationships of the identified TFs. In the HeLa transcriptome, PAX6, a regulator of neurodevelopment, was one of the TFs with the highest FC during the cell cycle, with a peak in G2/M (Fig 5A), which was also confirmed at the protein level (S4 Fig). A known activating TF upstream of PAX6, MEIS1 [39], had an expression profile similar to PAX6 (Fig 5E). Expanding the putative network around PAX6, the mRNA expression of the PAX6 repressor LHX2 peaked in G1+S, i.e., in an anti-phase manner with PAX6 (Fig 5F), reflecting its repressor function. Other oscillating repressors of PAX6 were identified in SHH signalling [40, 41]; both PTCH1 and GLI1 exhibited the highest mRNA expression in G1 phase in HeLa cells (Fig 5F). These data support the idea that hierarchical temporal relationships may be identified and assembled into networks (Fig 5G).
In U2OS cells, cell type-specific TFs synchronized with the cell cycle peaked in G1 (θ-category 1) and G1+S (θ-category 2) (Fig 5B). To map potential network connections, these TFs were overlaid with information on well-established interactions. This analysis revealed connections between SOX6, SOX9, PITX2 and ID2/ID4 (Fig 5H), supporting the association between the U2OS transcriptome and, for example, cardiac development [42, 43], as suggested by the GO associations (Fig 5D).
In conclusion, mapping cell-cycle-synchronized expression by TriComp in our data sets revealed networks of transcriptional regulators that involved more gene categories than has been generally believed.
Cell-cell signalling via Notch peaks in G2/M-G1
Interestingly, PAX6 and FOXN4, which were found to oscillate over the cell cycle in HeLa cells and peaked in S+G2/M (θ-category 4) (Fig 5A), upregulate Notch signalling [44, 45]. FOXN4 directly controls DLL4, which peaked in G1 (θ-category 1) (S2 Table). Furthermore, Notch signalling targets were found to oscillate in our data, with HES7 and HES1 peaking in G1 phase in HeLa and U2OS cells, respectively (Fig 5A and 5B). Plotting the mRNA expression of NOTCH1-3 over the cell cycle confirmed that NOTCH2 expression peaked in G2/M in HeLa cells (θ-category 5), with a similar trend observed in the U2OS data set (Fig 5I). The less expressed NOTCH3 significantly oscillated and peaked in G1 in both HeLa and U2OS cells (S2 Table). The peak expression of full-length NOTCH2 in G2/M was also verified at the protein level in sorted HeLa-Fucci cells by western blotting using two different antibodies (Fig 5J and 5K, Panel A in S5 Fig). Analysing the potential oscillation of Notch target genes showed that NRARP and LFNG peaked in G1, similar to HES7 (Fig 5L). These genes act as a negative feedback loop to repress Notch gene expression and constitute a transcriptional circuit that plays a fundamental role in development during, for example, somitogenesis [46–49]. When exploring our data for receptors other than Notch, members of several classes of receptors were found to oscillate in HeLa and U2OS cells (Panel B in S4 Fig). Interestingly, among these were additional components of the somitogenesis network, such as WNT (FZD1/4/7/8/9) and FGF (FGFR3) signalling [46] and SHH (GLI1, PTCH, SMO) signalling [50] in HeLa cells (S5 Fig). Thus, visualizations based on TriComp analysis in combination with the superimposition of TF networks revealed that TF networks synchronize with the cell cycle, reproducing the hierarchy associated with development.
The circadian molecular clock is synchronized with the cell cycle
The Notch and WNT pathways are known to be regulated by the circadian clock [51, 52], PAX6 is a direct target of the circadian protein CLOCK [53], and SOX4 is controlled by RORα, which is part of the circadian clock [54] (Fig 3E). The findings imply a coupling between the circadian clock and the cell cycle. Indeed, the circadian clock has previously been found to synchronize with the cell cycle in U2OS cells [15] and NIH3T3 cells [14]. Therefore, we examined if circadian clock constituents also oscillated in our data sets.
The intrinsic cellular circadian molecular clock consists of a transcriptional feedback loop, in which the BMAL1 (encoded by ARNTL) and CLOCK genes represent the main activating component and control the feedback repressors CRY1-2 and PER1-3 (overviewed in Fig 6A). Additional repressors are REV-ERBα and REV-ERBβ (encoded by NR1D1 and NR1D2), and additional activators are RORα, RORβ and RORγ (encoded by RORA, RORB and RORC). In HeLa cells, genes from both the activating and repressing sides of the loop were synchronized with the cell cycle, but with a phase shift (Fig 6B). In U2OS cells, only the repressors were cell-cycle-synchronized (Panel A in S6 Fig).
We analysed the protein expression of BMAL1, CLOCK and REV-ERBα during the cell cycle by immunostaining of HeLa-Fucci cells and high-throughput image analysis using Fucci reporters (CLOCK and REV-ERBα) or DNA content (BMAL1) as indicators of the cell cycle phase. The protein levels of CLOCK and BMAL1 were cell-cycle-synchronized, peaking in G2/M, while REV-ERBα peaked at G1 (Fig 6C). Our observations thus support previous reports of synchronization between the cell cycle and the circadian clock [14, 15].
To extend the analyses to genes downstream of the circadian clock, the transcriptomes of HeLa and U2OS cells were compared to a published mouse liver circadian transcriptome data set [55]. Importantly, the liver data set provides information regarding circadian oscillations in predominantly non-proliferative cells, thus reducing the mis-attribution of cell cycle genes to the circadian network. A Venn analysis showed that 35% of the circadian-controlled genes in the liver significantly oscillated in HeLa cells (Fig 6D). Plotting the θ-values of our HeLa data set against the circadian clock variable Circadian Time (CT) for each gene in the circadian transcriptional feedback loop from Yoshitane et al. 2004, verified the temporal segregation of the activating and repressing parts of the molecular clock loop (Fig 6E). Interestingly, a number of TFs regulating development were identified to be circadian oscillating in the Yoshitane et al. data, suggesting that the circadian clock contributes to synchronize developmental transcriptional regulators with the cell cycle (Fig 6F).
To examine the association between the cell cycle and circadian transcriptomes in proliferating cells, we compared the HeLa and U2OS data sets to a transcriptome of oscillating clock genes in NIH/3T3 cells [56]. Fifty percent of the circadian-oscillating genes varied over the cell cycle in HeLa cells, and 13% in U2OS cells (Panel B in S6 Fig). Notably, 27 transcripts were shared between the circadian clock and the cell cycle in both HeLa and U2OS cells, including E2F8, Cyclin B2 (CCNB2) and CDK1 (Panel B in S6 Fig), likely reflecting synchronization between the circadian clock and the cell cycle.
In conclusion, our data suggest that the circadian clock might be an intermediary step in the synchronization between the cell cycle and the networks of TFs associated with development, oscillating in synchrony with the cell cycle (Fig 6F).
Discussion
In this study, we identified that an unexpectedly large number of developmentally associated TFs assemble into networks that are synchronized with the cell cycle in cancer cells. This results in a temporal compartmentalization, analogous to spatial patterning during embryogenesis. The circadian clock was similarly found to be synchronized with the cell cycle, indicating a link between the cell cycle and some of the identified TF networks.
The identification of TF networks was greatly aided by a novel algorithm, TriComp, that allows for easy visualization of expression patterns over the cell cycle, providing more information than mere identification of a single point of peak expression. In addition, the algorithm enables comparisons of data sets even in cases where the underlying temporal variables are calculated differently.
Reactivation of developmental genes has attracted increased interest in the cancer field [57–59], particularly in cancer stem cells. Examples of TFs that control morphogenesis and play a role in cancer that were identified to oscillate in our data include PAX6 [60] and GLI1 [41]. Notably, PAX6 was previously found to be dynamically expressed during the cell cycle in neural progenitor cells, affecting proliferation as well as fate choice during differentiation [61]. In our study, knowledge from TF networks controlling embryo development was used to decipher relationships between TFs that might assemble into similar networks over the cell cycle. Our analyses supported the idea that cell-cycle-synchronized expression of developmental genes might be a consequence of feed-forward transcriptional activation or repression by “non-core” TFs.
In addition to transcriptional feed-forward coordination, synchronized transcription may also be a consequence of cell cycle-governed cell signalling. If cell surface receptors are differentially expressed during the cell cycle, the influence of external factors may occur in a cell cycle-dependent manner, suggesting another dynamic dimension of regulation. Such phase dependent signalling has been established for the WNT/Frizzled/β-catenin pathway [8, 9]. An example of temporally restricted signalling in our data was Notch signalling, which is a central factor in development, controlling cell identity and proliferation [62]. Activated NOTCH2-ICD protein expression peaked in G2/M and was followed by expression of Notch target genes in G1 in HeLa cells. In somitogenesis, Notch interacts with the WNT/Frizzled and FGFR signalling pathways (reviewed in [46, 63, 64]) through a critical negative feedback loop mediated by HES1 and HES7 [49, 65]. This feedback function appeared to be present in the HeLa and U2OS cell cycles as well, as HES1 and HES7 oscillated in our data and peaked in G1, where NOTCH2 expression itself was downregulated.
One unresolved question is what factor(s) synchronize “non-core” transcription and signalling with the cell cycle. A candidate is the circadian clock. Our data show that the circadian clock oscillates in synchrony with the cell cycle in HeLa cells, as was previously shown for U2OS [15] and NIH/3T3 [56] cells. Signalling via the Notch and the WNT/Frizzled pathways has previously been shown to be controlled by the circadian clock [51, 52], and several of the TFs identified in our data (e.g., PAX6) may be targets of the circadian clock [53]. Further supporting such a conclusion is that the circadian TFs CLOCK and BMAL1 peaked in G2, i.e., the same phase as NOTCH2 in our data. However, the reciprocal interaction and synchronization between the cell cycle and the circadian clock, as well as other networks suggested by our data, pose a challenge when trying to identify which oscillatory system is in control of the expression profile of a particular gene.
Non-core cell cycle transcriptional networks such as the circadian clock resemble the non-essential transcription factor networks that oscillate in a self-sustained manner in yeast [12, 13]. One outcome from interactions between cellular oscillators is the establishment of temporally constrained signalling and gene expression, i.e., temporal compartmentalization. Such compartmentalization may provide opportunities for unique combinatorial effects when unique sets of oscillating TFs are temporally co-expressed. This resembles the processes whereby spatial compartmentalization and patterning is established in development [66, 67]. An additional interpretation of the data is that cell identity oscillates during the cell cycle, and this oscillation might determine the onset and direction of differentiation. Altogether, these perspectives reveal unexpected dynamics in transcriptional states, between which a cancer cell fluctuates when progressing through the cell cycle. Understanding these discrete states will have implications in target selection when developing cancer cell therapies, as expression levels as well as the cellular context of the target may change over the cell cycle. As the circadian rhythm appears to be a component in controlling the phase of these oscillations, our data suggest that the time of drug administration can be chosen to match the peak expression of the target, allowing for a higher therapeutic efficacy.
Experimental procedures
Cell culture
HeLa.S-Fucci and U2OS-Fucci are variants of the human cervical carcinoma cell line HeLa.S and the osteosarcoma cell line U2OS that express mKo2-hCdt1(30–120) and mAg-hGem(1–110)[20]. The HeLa.S-Fucci cell line was obtained from Riken Cell Bank, and the U2OS-Fucci cell line was a kind gift from Dr. Masai at the Tokyo Metropolitan Institute of Medical Science, Japan.
HeLa.S-Fucci cells were grown in Dulbecco's modified Eagle's medium (DMEM), and U2OS-Fucci cells were grown in McCoy’s modified medium, both supplemented with 10% foetal bovine serum (FBS), 5% Glutamax and 100 U/ml penicillin-streptomycin (all from Gibco, Invitrogen), at 37°C in a 5% CO2 humidified atmosphere. The cells were dissociated by trypsinization (TrypLE Express 1x) (Gibco, Invitrogen).
Flow cytometry
Cells were grown to ~80% confluency, collected by trypsinization, washed with complete DMEM/McCoy’s with 10% FBS, 5% Glutamax and 100U/ml penicillin-streptomycin, spun down by centrifugation at 200 g for 5 minutes and resuspended in culture medium. The cells were sorted based on different laser emission wavelengths simultaneously by defining three regions for sorting: G1, S and G2/M phases. Additionally, a control group of all three phases were collected. See S1 Table for Minimal Information of a Flow Cytometry experiment (MIFlowCyt).
For western blot 500 000 cells per phase were collected in PBS, spun down and resuspended in 100 μl of lysis buffer (1% SDS, 10% glycerol, 100 mM Tris/HCl pH 7.5). For each phase, post-sorting control was performed (see Panel B in S2 Fig).
RNA extraction, RNA sequencing and data analysis
Total RNA was extracted from sorted cells using an RNeasy Micro Kit (Qiagen) following the manufacturer’s instructions. The RNA concentration and quality were assessed using the Qubit RNA assay kit (Invitrogen). We used 300 ng of total RNA to prepare the TruSeq library, for which we used the Illumina Low-Throughput TruSeq RNA Sample Preparation Kit protocol, resulting in barcoded cDNA. Next, 50 ng of barcoded TruSeq products were used for Illumina RNA sequencing on an Illumina HiSeq 2000 sequencer to generate single-end 50 or 51-nucleotide reads according to the manufacturer’s protocol and as previously described [68]. The expression levels of each sample were normalized as Reads Per Kilobase Per Million (RPKM) by dividing the read count of each transcript model with its length and scaling the total per sample to one million.
Transcriptome data analysis
Differential gene expression analysis was performed on the mapped read counts using an ANOVA-like approach in the edgeR package in R [23], correcting for common, trended and tagwise disparity. For the HeLa cells (three batches), the GLM model was simply (Expression ~ Cell cycle phase). For the U2OS cells (two batches), the samples were corrected for batch-specific effects by including the batch in the GLM model (Expression ~ Cell cycle phase + Batch). This analysis supplied p-values, logFC differences between groups, and FDR-values.
A “Highest-logFC” variable was created by finding the largest logFC difference between any two cell cycle phases.
TriComp: Converting logarithmic FCs into polar relationship descriptor variables
Starting with the log2FC differences between groups 2 and 1, “logFCGr2overGr1”, and groups 3 and 1, “logFCGr3overGr1”, the TriComp algorithm consists of four equations. First, it creates two variables (a and b) used for plotting the circular graphs, in which direction from the centre denotes the relationship between the three groups and distance from the centre denotes the intensity of the respective relationship:
(1) |
(2) |
Then, TriComp calculates the polar coordinates (r, θ) with (a = 0, b = 0) used for (r = 0) and θ starting at 0 straight up and going clockwise. Calculating θ requires four different equations depending on what quadrant it belongs to.
(3) |
(4) |
Calculating which group each gene belongs to according to Fig 2A is performed using the following equation:
(5) |
In the graph using the Euclidean coordinates a and b, θ and r are the corresponding polar coordinates.
Immunofluorescence
For antibodies generated within the Human Protein Atlas (MYBL2, TGIF1, HMGB2, KDM5B), immunostaining was performed on cells grown on glass bottom plates (Whatman Inc.) coated with 50 μg of 12.5 μg/ml human fibronectin (VWR). Approximately 8 000 cells were seeded in each well and incubated at 37°C for 24 hours. After washing with PBS, the cells were fixed with 40 μl of 4% ice cold PFA (Sigma Aldrich) dissolved in growth medium supplemented with 10% serum for 15 minutes and permeabilized with 40 μl of 0.1% Triton X-100 (Sigma Aldrich) in PBS for 3x5 minutes. Primary antibodies (HPA055416, MYBL2; HPA062160, TGIF1; HPA053314, HMGB2; HPA053723, KDM5B) were diluted to 2.5 μg/ml in blocking buffer (PBS + 4% FBS) containing 3.3 μg/ml mouse anti-tubulin (Abcam, ab7291, Cambridge, UK). After washing with PBS, diluted primary antibodies were added (40 μl/well), and the plates were incubated overnight (ON) at 4°C. After the ON incubation, all wells were washed with PBS for 3x10 minutes. Goat anti-mouse Alexa Fluor 405-conjugated (A31553, ThermoFisher) and goat anti-rabbit Alexa Fluor 647-conjugated (A21245, ThermoFisher) secondary antibodies diluted to 2.5 μg/ml in blocking buffer were added, and the plates were incubated for 90 minutes at room temperature (RT). After washing with PBS, all wells were mounted with PBS containing 76.5% glycerol. Image acquisition was performed with a Leica SP5 confocal microscope equipped with a 63-x/1.4 NA oil immersion objective with the support of LAS AF matrix software. Six images were acquired per sample at RT in three sequential steps with the following scanning settings: pinhole 1 Airy unit, 16-bit acquisition and a pixel size of 80x80 nm. The z focus-level was manually adjusted to represent the best visualization of the target protein. The detector gain remained constant for the Fucci markers and microtubules across all samples.
For immunostaining of circadian clock proteins and PAX6, cells were seeded in 96-well glass bottom microplates (BD Falcon) at a density 2.5x103 cells/well ON. The next day, the cells were fixed in 4% paraformaldehyde (PFA) in PBS for 15 minutes at RT. Fixed cells were washed with PBS and permeabilized with 0.3% Triton X-100 in PBS (PBST). After blocking (1% BSA in PBST) for 1 hour at RT, the cells were incubated with a primary antibody diluted in 1% BSA in PBST ON at 4°C. On day 3, the cells were washed 3X with PBST. Before incubation with a fluorescence-labelled secondary antibody, the cells were blocked with 1% BSA in PBST for 30 minutes at RT. To remove cellular RNA, cells were incubated with RNase (20 μg/ml) in PBS for 1 hour at 4°C. Nuclei were counterstained with DAPI (5 μg/ml) in staining buffer (100 mM Tris, pH 7.8; 150 mM NaCl; 1 mM KCl; 0.5 mM MgCl2; 0.1% Nonidet P-40) for 30 minutes at RT.
The primary antibodies used were as follows: Pax6 (1:300; Biolegend), BMAL1 (1:100; NB100-2288, Novus Biological), CLOCK (1:100; PA1-520, Pierce), and REV-ERBα (1:200; H00009572-M02, Abnova). The secondary antibody used was an Alexa Fluor 647-conjugated anti-rabbit antibody (1:1000, Invitrogen).
For EdU incorporation analysis, the cells were grown, fixed and permeabilized as above, but incubated with 10μM EdU 30 min before fixation, then after permeabilization incubated 30min at room temperature with 2μM Alexa 647-conjugated Azide, 10 mM ascorbic acid, 2.5 mM CuSo4 in PBS. DAPI staining was performed by a 1h incubation at 4°C with10μg/mL RNAse A followed by 10 minutes of 1 μg/mL DAPI in PBS at room temperature.
All plates were imaged on an ImageXpress imaging system (Molecular Devices) at 10X magnification, and segmentation of nuclei and measurement of fluorescence intensities were performed in CellProfiler [69]. Per-object measurements were categorized and intensities were analysed in R.
Western blot
Protein lysates of sorted cells were separated by SDS-PAGE on 8% acrylamide gels and blotted onto PVDF membranes (Millipore). The membranes were incubated with primary rabbit anti-NOTCH2 antibodies N4913 (Sigma Aldrich) (Fig 6F and 6G) or D67C8 (Cell Signaling Technology) (Fig 6G, Panel A in S4 Fig) or a mouse anti-β-ACTIN (Abcam, ab8226) antibody ON at 4°C and with HRP-conjugated secondary antibodies (1:1000 Sigma Aldrich) at RT for 1 h. The membranes were developed using an ECL detection reagent (GE Healthcare) and a charge-coupled device camera (Bio-Rad, US). Non-saturated immunoblots were analysed by ImageJ, and protein abundances were normalized to β-ACTIN.
Statistical tools, visualization tools and bioinformatics
R[70] with attached packages ggplot2 [71], dplyr [72], mapplots [73] and XNomial [74] was used for statistical analysis, processing of immunofluorescence data, and visualization. GraphPad PRISM 6 was used for statistical analyses and data visualization. DAVID [75] (https://david.ncifcrf.gov) was used for gene ontology data analyses.
Data availability
The raw read counts, RPKM values and statistical data from EdgeR have been made available as a GEO submission (#GSE104736)
Supporting information
Acknowledgments
We thank Dr. Masai at the Tokyo Metropolitan Institute of Medical Science, Japan for providing the U2OS-Fucci cell line. We thank Arne Lindqvist and Igor Adameyko for reading the manuscript and providing valuable comments.
The study was supported by grants from the Swedish Research Council, Swedish Cancer Foundation, Swedish Childhood Cancer Foundation, the Hållsten Research Foundation, Swedish Society for Medical Research, The Swedish Brain Foundation, Knut & Alice Wallenberg Foundation, Grant Agency of the Czech Republic (GA15-20818S, GA15-21789S), CEITEC 2020 (LQ1601) project with a financial contribution made by the Ministry of Education, Youths and Sports of the Czech Republic (KI-MU: CZ.1.07/2.3.00/20.0180) with special financial support from the National Programme for Sustainability II, grant 15-11707S from the Czech Science Foundation, grants 15-28628A and 15-33999A from the Grant Agency for Health Research of the Czech Republic, by project LQ1605 from the National Program of Sustainability II (MEYS CR), and by the project ICRC-ERA-Human Bridge (No. 316345) funded by the 7th Framework Programme of the European Union.
Data Availability
The raw read data files, Read Counts and RPKM values are available as a GEO submission (https://www.ncbi.nlm.nih.gov/geo/, #GSE104736). EdgeR results and TriComp analysis results are available within the Supporting Information files.
Funding Statement
The study was supported by grants from the Swedish Research Council (M.An), Swedish Cancer Foundation (P.S, M.An), Swedish Childhood Cancer Foundation (M.An), the Hållsten Research Foundation (M.Al), Swedish Society for Medical Research (C.M), The Swedish Brain Foundation (C.M), Knut & Alice Wallenberg Foundation (E.L), Grant Agency of the Czech Republic (GA15-20818S (M.An), GA15-21789S (V.B)), CEITEC 2020 (LQ1601) project with a financial contribution made by the Ministry of Education, Youths and Sports of the Czech Republic (KI-MU: CZ.1.07/2.3.00/20.0180 (V.B) with special financial support from the National Programme for Sustainability II, grant 15-11707S from the Czech Science Foundation (K.S), grants 15-28628A and 15-33999A from the Grant Agency for Health Research of the Czech Republic (K.S), by project LQ1605 from the National Program of Sustainability II (MEYS CR (K.S)), and by the project ICRC-ERA-Human Bridge (No. 316345) funded by the 7th Framework Programme of the European Union (K.S). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Bertoli C, Skotheim JM, de Bruin RA. Control of cell cycle transcription during G1 and S phases. Nat Rev Mol Cell Biol. 2013;14(8):518–28. doi: 10.1038/nrm3629 ; PubMed Central PMCID: PMCPMC4569015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Matsui T, Nieto-Estevez V, Kyrychenko S, Schneider JW, Hsieh J. RB controls growth, survival, and neuronal migration in human cerebral organoids. Development. 2017. doi: 10.1242/dev.143636 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Naser R, Vandenbosch R, Omais S, Hayek D, Jaafar C, Al Lafi S, et al. Role of the Retinoblastoma protein, Rb, during adult neurogenesis in the olfactory bulb. Sci Rep. 2016;6:20230 doi: 10.1038/srep20230 ; PubMed Central PMCID: PMCPMC4742828. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Sadasivam S, DeCaprio JA. The DREAM complex: master coordinator of cell cycle-dependent gene expression. Nat Rev Cancer. 2013;13(8):585–95. doi: 10.1038/nrc3556 ; PubMed Central PMCID: PMCPMC3986830. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Pauklin S, Vallier L. The cell-cycle state of stem cells determines cell fate propensity. Cell. 2013;155(1):135–47. doi: 10.1016/j.cell.2013.08.031 ; PubMed Central PMCID: PMCPMC3898746. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Imayoshi I, Isomura A, Harima Y, Kawaguchi K, Kori H, Miyachi H, et al. Oscillatory control of factors determining multipotency and fate in mouse neural progenitors. Science. 2013;342(6163):1203–8. doi: 10.1126/science.1242366 . [DOI] [PubMed] [Google Scholar]
- 7.Shimojo H, Ohtsuka T, Kageyama R. Oscillations in notch signaling regulate maintenance of neural progenitors. Neuron. 2008;58(1):52–64. doi: 10.1016/j.neuron.2008.02.014 . [DOI] [PubMed] [Google Scholar]
- 8.Davidson G, Shen J, Huang YL, Su Y, Karaulanov E, Bartscherer K, et al. Cell cycle control of wnt receptor activation. Dev Cell. 2009;17(6):788–99. doi: 10.1016/j.devcel.2009.11.006 . [DOI] [PubMed] [Google Scholar]
- 9.Olmeda D, Castel S, Vilaro S, Cano A. Beta-catenin regulation during the cell cycle: implications in G2/M and apoptosis. Mol Biol Cell. 2003;14(7):2844–60. doi: 10.1091/mbc.E03-01-0865 ; PubMed Central PMCID: PMCPMC165681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Joshi KK, Berge M, Radhakrishnan SK, Viollier PH, Chien P. An Adaptor Hierarchy Regulates Proteolysis during a Bacterial Cell Cycle. Cell. 2015;163(2):419–31. doi: 10.1016/j.cell.2015.09.030 ; PubMed Central PMCID: PMCPMC4600535. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Palmisano A, Zamborszky J, Oguz C, Csikasz-Nagy A. Molecular Network Dynamics of Cell Cycle Control: Periodicity of Start and Finish. Methods Mol Biol. 2017;1524:331–49. doi: 10.1007/978-1-4939-6603-5_21 . [DOI] [PubMed] [Google Scholar]
- 12.White MA, Riles L, Cohen BA. A systematic screen for transcriptional regulators of the yeast cell cycle. Genetics. 2009;181(2):435–46. doi: 10.1534/genetics.108.098145 ; PubMed Central PMCID: PMCPMC2644938. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lu Y, Cross FR. Periodic cyclin-Cdk activity entrains an autonomous Cdc14 release oscillator. Cell. 2010;141(2):268–79. doi: 10.1016/j.cell.2010.03.021 ; PubMed Central PMCID: PMCPMC4549071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Bieler J, Cannavo R, Gustafson K, Gobet C, Gatfield D, Naef F. Robust synchronization of coupled circadian and cell cycle oscillators in single mammalian cells. Mol Syst Biol. 2014;10:739 doi: 10.15252/msb.20145218 ; PubMed Central PMCID: PMCPMC4299496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hoffmann J, Symul L, Shostak A, Fischer T, Naef F, Brunner M. Non-circadian expression masking clock-driven weak transcription rhythms in U2OS cells. PLoS One. 2014;9(7):e102238 doi: 10.1371/journal.pone.0102238 ; PubMed Central PMCID: PMCPMC4090172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Matsuo T, Yamaguchi S, Mitsui S, Emi A, Shimoda F, Okamura H. Control mechanism of the circadian clock for timing of cell division in vivo. Science. 2003;302(5643):255–9. doi: 10.1126/science.1086271 . [DOI] [PubMed] [Google Scholar]
- 17.Karpowicz P, Zhang Y, Hogenesch JB, Emery P, Perrimon N. The circadian clock gates the intestinal stem cell regenerative state. Cell Rep. 2013;3(4):996–1004. doi: 10.1016/j.celrep.2013.03.016 ; PubMed Central PMCID: PMCPMC3982394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Dominguez D, Tsai YH, Gomez N, Jha DK, Davis I, Wang Z. A high-resolution transcriptome map of cell cycle reveals novel connections between periodic genes and cancer. Cell Res. 2016;26(8):946–62. doi: 10.1038/cr.2016.84 ; PubMed Central PMCID: PMCPMC4973334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Whitfield ML, Sherlock G, Saldanha AJ, Murray JI, Ball CA, Alexander KE, et al. Identification of genes periodically expressed in the human cell cycle and their expression in tumors. Mol Biol Cell. 2002;13(6):1977–2000. doi: 10.1091/mbc.02-02-0030. ; PubMed Central PMCID: PMCPMC117619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Sakaue-Sawano A, Kurokawa H, Morimura T, Hanyu A, Hama H, Osawa H, et al. Visualizing spatiotemporal dynamics of multicellular cell-cycle progression. Cell. 2008;132(3):487–98. doi: 10.1016/j.cell.2007.12.033 . [DOI] [PubMed] [Google Scholar]
- 21.Mort RL, Ford MJ, Sakaue-Sawano A, Lindstrom NO, Casadio A, Douglas AT, et al. Fucci2a: a bicistronic cell cycle reporter that allows Cre mediated tissue specific expression in mice. Cell Cycle. 2014;13(17):2681–96. doi: 10.4161/15384101.2015.945381 ; PubMed Central PMCID: PMCPMC4613862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Daynac M, Morizur L, Kortulewski T, Gauthier LR, Ruat M, Mouthon MA, et al. Cell Sorting of Neural Stem and Progenitor Cells from the Adult Mouse Subventricular Zone and Live-imaging of their Cell Cycle Dynamics. J Vis Exp. 2015;(103). doi: 10.3791/53247 ; PubMed Central PMCID: PMCPMC4692602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40. doi: 10.1093/bioinformatics/btp616 ; PubMed Central PMCID: PMCPMC2796818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43(Database issue): D447–52. doi: 10.1093/nar/gku1003 ; PubMed Central PMCID: PMCPMC4383874. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Hayami S, Yoshimatsu M, Veerakumarasivam A, Unoki M, Iwai Y, Tsunoda T, et al. Overexpression of the JmjC histone demethylase KDM5B in human carcinogenesis: involvement in the proliferation of cancer cells through the E2F/RB pathway. Mol Cancer. 2010;9:59 doi: 10.1186/1476-4598-9-59 ; PubMed Central PMCID: PMCPMC2848192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.DeGregori J, Kowalik T, Nevins JR. Cellular targets for activation by the E2F1 transcription factor include DNA synthesis- and G1/S-regulatory genes. Mol Cell Biol. 1995;15(8):4215–24. ; PubMed Central PMCID: PMCPMC230660. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Chong JL, Wenzel PL, Saenz-Robles MT, Nair V, Ferrey A, Hagan JP, et al. E2f1-3 switch from activators in progenitor cells to repressors in differentiating cells. Nature. 2009;462(7275):930–4. doi: 10.1038/nature08677 ; PubMed Central PMCID: PMCPMC2806193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Penzo-Mendez AI. Critical roles for SoxC transcription factors in development and cancer. Int J Biochem Cell Biol. 2010;42(3):425–8. doi: 10.1016/j.biocel.2009.07.018 ; PubMed Central PMCID: PMCPMC2862366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Shi S, Cao X, Gu M, You B, Shan Y, You Y. Upregulated Expression of SOX4 Is Associated with Tumor Growth and Metastasis in Nasopharyngeal Carcinoma. Dis Markers. 2015;2015:658141 doi: 10.1155/2015/658141 ; PubMed Central PMCID: PMCPMC4633550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zhao L, Arsenault M, Ng ET, Longmuss E, Chau TC, Hartwig S, et al. SOX4 regulates gonad morphogenesis and promotes male germ cell differentiation in mice. Dev Biol. 2017. doi: 10.1016/j.ydbio.2017.01.013 . [DOI] [PubMed] [Google Scholar]
- 31.Liu Y, Cui L, Huang J, Ji EH, Chen W, Messadi D, et al. SOX4 Promotes Progression in OLP-Associated Squamous Cell Carcinoma. J Cancer. 2016;7(11):1534–40. doi: 10.7150/jca.15689 ; PubMed Central PMCID: PMCPMC4964137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Pal K, Mukhopadhyay S. Primary cilium and sonic hedgehog signaling during neural tube patterning: role of GPCRs and second messengers. Dev Neurobiol. 2015;75(4):337–48. doi: 10.1002/dneu.22193 . [DOI] [PubMed] [Google Scholar]
- 33.Shi X, Wang Q, Gu J, Xuan Z, Wu JI. SMARCA4/Brg1 coordinates genetic and epigenetic networks underlying Shh-type medulloblastoma development. Oncogene. 2016;35(44):5746–58. doi: 10.1038/onc.2016.108 . [DOI] [PubMed] [Google Scholar]
- 34.Qinyu L, Long C, Zhen-dong D, Min-min S, Wei-ze W, Wei-ping Y, et al. FOXO6 promotes gastric cancer cell tumorigenicity via upregulation of C-myc. FEBS Lett. 2013;587(14):2105–11. doi: 10.1016/j.febslet.2013.05.027 . [DOI] [PubMed] [Google Scholar]
- 35.Han M, Xu W, Cheng P, Jin H, Wang X. Histone demethylase lysine demethylase 5B in development and cancer. Oncotarget. 2017;8(5):8980–91. doi: 10.18632/oncotarget.13858 . [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hohenauer T, Moore AW. The Prdm family: expanding roles in stem cells and development. Development. 2012;139(13):2267–82. doi: 10.1242/dev.070110 . [DOI] [PubMed] [Google Scholar]
- 37.Jones HW Jr., McKusick VA, Harper PS, Wuu KD. George Otto Gey. (1899–1970). The HeLa cell and a reappraisal of its origin. Obstet Gynecol. 1971;38(6):945–9. . [PubMed] [Google Scholar]
- 38.Ponten J, Saksela E. Two established in vitro cell lines from human mesenchymal tumours. Int J Cancer. 1967;2(5):434–47. . [DOI] [PubMed] [Google Scholar]
- 39.Zhang X, Friedman A, Heaney S, Purcell P, Maas RL. Meis homeoproteins directly regulate Pax6 during vertebrate lens morphogenesis. Genes Dev. 2002;16(16):2097–107. doi: 10.1101/gad.1007602 ; PubMed Central PMCID: PMCPMC186446. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Briscoe J, Pierani A, Jessell TM, Ericson J. A homeodomain protein code specifies progenitor cell identity and neuronal fate in the ventral neural tube. Cell. 2000;101(4):435–45. . [DOI] [PubMed] [Google Scholar]
- 41.Shahi MH, Afzal M, Sinha S, Eberhart CG, Rey JA, Fan X, et al. Regulation of sonic hedgehog-GLI1 downstream target genes PTCH1, Cyclin D2, Plakoglobin, PAX6 and NKX2.2 and their epigenetic status in medulloblastoma and astrocytoma. BMC Cancer. 2010;10:614 doi: 10.1186/1471-2407-10-614 ; PubMed Central PMCID: PMCPMC2989332. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Garside VC, Cullum R, Alder O, Lu DY, Vander Werff R, Bilenky M, et al. SOX9 modulates the expression of key transcription factors required for heart valve development. Development. 2015;142(24):4340–50. doi: 10.1242/dev.125252 . [DOI] [PubMed] [Google Scholar]
- 43.Brent AE, Braun T, Tabin CJ. Genetic analysis of interactions between the somitic muscle, cartilage and tendon cell lineages during mouse development. Development. 2005;132(3):515–28. doi: 10.1242/dev.01605 . [DOI] [PubMed] [Google Scholar]
- 44.Thakurela S, Tiwari N, Schick S, Garding A, Ivanek R, Berninger B, et al. Mapping gene regulatory circuitry of Pax6 during neurogenesis. Cell Discov. 2016;2:15045 doi: 10.1038/celldisc.2015.45 ; PubMed Central PMCID: PMCPMC4860964. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Luo H, Jin K, Xie Z, Qiu F, Li S, Zou M, et al. Forkhead box N4 (Foxn4) activates Dll4-Notch signaling to suppress photoreceptor cell fates of early retinal progenitors. Proc Natl Acad Sci U S A. 2012;109(9):E553–62. doi: 10.1073/pnas.1115767109 ; PubMed Central PMCID: PMCPMC3295323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Dequeant ML, Pourquie O. Segmental patterning of the vertebrate embryonic axis. Nat Rev Genet. 2008;9(5):370–82. doi: 10.1038/nrg2320 . [DOI] [PubMed] [Google Scholar]
- 47.Carrieri FA, Dale JK. Turn It Down a Notch. Front Cell Dev Biol. 2016;4:151 doi: 10.3389/fcell.2016.00151 ; PubMed Central PMCID: PMCPMC5241280. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Soza-Ried C, Ozturk E, Ish-Horowicz D, Lewis J. Pulses of Notch activation synchronise oscillating somite cells and entrain the zebrafish segmentation clock. Development. 2014;141(8):1780–8. doi: 10.1242/dev.102111 . [DOI] [PubMed] [Google Scholar]
- 49.Williams DR, Shifley ET, Braunreiter KM, Cole SE. Disruption of somitogenesis by a novel dominant allele of Lfng suggests important roles for protein processing and secretion. Development. 2016;143(5):822–30. doi: 10.1242/dev.128538 . [DOI] [PubMed] [Google Scholar]
- 50.Resende TP, Ferreira M, Teillet MA, Tavares AT, Andrade RP, Palmeirim I. Sonic hedgehog in temporal control of somite formation. Proc Natl Acad Sci U S A. 2010;107(29):12907–12. doi: 10.1073/pnas.1000979107 ; PubMed Central PMCID: PMCPMC2919945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Janich P, Pascual G, Merlos-Suarez A, Batlle E, Ripperger J, Albrecht U, et al. The circadian molecular clock creates epidermal stem cell heterogeneity. Nature. 2011;480(7376):209–14. doi: 10.1038/nature10649 . [DOI] [PubMed] [Google Scholar]
- 52.Jensen LD, Cao Z, Nakamura M, Yang Y, Brautigam L, Andersson P, et al. Opposing effects of circadian clock genes bmal1 and period2 in regulation of VEGF-dependent angiogenesis in developing zebrafish. Cell Rep. 2012;2(2):231–41. doi: 10.1016/j.celrep.2012.07.005 . [DOI] [PubMed] [Google Scholar]
- 53.Morgan R. Pax6 is a direct, positively regulated target of the circadian gene Clock. Dev Dyn. 2004;230(4):643–50. doi: 10.1002/dvdy.20097 . [DOI] [PubMed] [Google Scholar]
- 54.Wang Y, Solt LA, Kojetin DJ, Burris TP. Regulation of p53 stability and apoptosis by a ROR agonist. PLoS One. 2012;7(4):e34921 doi: 10.1371/journal.pone.0034921 ; PubMed Central PMCID: PMCPMC3324542. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Yoshitane H, Ozaki H, Terajima H, Du NH, Suzuki Y, Fujimori T, et al. CLOCK-controlled polyphonic regulation of circadian rhythms through canonical and noncanonical E-boxes. Mol Cell Biol. 2014;34(10):1776–87. doi: 10.1128/MCB.01465-13 ; PubMed Central PMCID: PMCPMC4019033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Schick S, Becker K, Thakurela S, Fournier D, Hampel MH, Legewie S, et al. Identifying Novel Transcriptional Regulators with Circadian Expression. Mol Cell Biol. 2015;36(4):545–58. doi: 10.1128/MCB.00701-15 ; PubMed Central PMCID: PMCPMC4751687. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Ben-Porath I, Thomson MW, Carey VJ, Ge R, Bell GW, Regev A, et al. An embryonic stem cell-like gene expression signature in poorly differentiated aggressive human tumors. Nat Genet. 2008;40(5):499–507. doi: 10.1038/ng.127 ; PubMed Central PMCID: PMCPMC2912221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Huang S. On the intrinsic inevitability of cancer: from foetal to fatal attraction. Semin Cancer Biol. 2011;21(3):183–99. doi: 10.1016/j.semcancer.2011.05.003 . [DOI] [PubMed] [Google Scholar]
- 59.Naxerova K, Bult CJ, Peaston A, Fancher K, Knowles BB, Kasif S, et al. Analysis of gene expression in a developmental context emphasizes distinct biological leitmotifs in human cancers. Genome Biol. 2008;9(7):R108 doi: 10.1186/gb-2008-9-7-r108 ; PubMed Central PMCID: PMCPMC2530866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Muratovska A, Zhou C, He S, Goodyer P, Eccles MR. Paired-Box genes are frequently expressed in cancer and often required for cancer cell survival. Oncogene. 2003;22(39):7989–97. doi: 10.1038/sj.onc.1206766 . [DOI] [PubMed] [Google Scholar]
- 61.Hsieh YW, Yang XJ. Dynamic Pax6 expression during the neurogenic cell cycle influences proliferation and cell fate choices of retinal progenitors. Neural Dev. 2009;4:32 doi: 10.1186/1749-8104-4-32 ; PubMed Central PMCID: PMCPMC2741438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Andersson ER, Sandberg R, Lendahl U. Notch signaling: simplicity in design, versatility in function. Development. 2011;138(17):3593–612. doi: 10.1242/dev.063610 . [DOI] [PubMed] [Google Scholar]
- 63.Meinhardt H. Models for patterning primary embryonic body axes: The role of space and time. Semin Cell Dev Biol. 2015;42:103–17. doi: 10.1016/j.semcdb.2015.06.005 . [DOI] [PubMed] [Google Scholar]
- 64.Hubaud A, Pourquie O. Signalling dynamics in vertebrate segmentation. Nat Rev Mol Cell Biol. 2014;15(11):709–21. doi: 10.1038/nrm3891 . [DOI] [PubMed] [Google Scholar]
- 65.Bessho Y, Hirata H, Masamizu Y, Kageyama R. Periodic repression by the bHLH factor Hes7 is an essential mechanism for the somite segmentation clock. Genes Dev. 2003;17(12):1451–6. doi: 10.1101/gad.1092303 ; PubMed Central PMCID: PMCPMC196074. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Briscoe J, Small S. Morphogen rules: design principles of gradient-mediated embryo patterning. Development. 2015;142(23):3996–4009. doi: 10.1242/dev.129452 ; PubMed Central PMCID: PMCPMC4712844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Perrimon N, Pitsouli C, Shilo BZ. Signaling mechanisms controlling cell fate and embryonic patterning. Cold Spring Harb Perspect Biol. 2012;4(8):a005975 doi: 10.1101/cshperspect.a005975 ; PubMed Central PMCID: PMCPMC3405863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Islam S, Zeisel A, Joost S, La Manno G, Zajac P, Kasper M, et al. Quantitative single-cell RNA-seq with unique molecular identifiers. Nat Methods. 2014;11(2):163–6. doi: 10.1038/nmeth.2772 . [DOI] [PubMed] [Google Scholar]
- 69.Carpenter AE, Jones TR, Lamprecht MR, Clarke C, Kang IH, Friman O, et al. CellProfiler: image analysis software for identifying and quantifying cell phenotypes. Genome Biol. 2006;7(10):R100 doi: 10.1186/gb-2006-7-10-r100 ; PubMed Central PMCID: PMCPMC1794559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: the R Foundation for Statistical Computing; 2011. Available from: http://www.R-project.org/. [Google Scholar]
- 71.Wickham H. ggplot2: Elegant Graphics for Data Analysis. New York, USA: Springer-Verlag; 2009. [Google Scholar]
- 72.Wickham H, Francois R. dplyr: A Grammar of Data Manipulation. R package version 0.5.0.2016. Available from: https://CRAN.R-project.org/package=dplyr.
- 73.Gerritsen H. mapplots: Data Visualisation on Maps. R package version 1.5. 2014. Available from: https://CRAN.R-project.org/package=mapplots.
- 74.Engels B. XNomial: Exact Goodness-of-Fit Test for Multinomial Data with Fixed Probabilities. R package version 1.0.4.2015. Available from: https://CRAN.R-project.org/package=XNomial.
- 75.Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4(1):44–57. doi: 10.1038/nprot.2008.211 . [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw read data files, Read Counts and RPKM values are available as a GEO submission (https://www.ncbi.nlm.nih.gov/geo/, #GSE104736). EdgeR results and TriComp analysis results are available within the Supporting Information files.
The raw read counts, RPKM values and statistical data from EdgeR have been made available as a GEO submission (#GSE104736)