Human ORC/MCM density is low in active genes and correlates with replication time but does not delimit initiation zones

Nina Kirstein; Alexander Buschle; Xia Wu; Stefan Krebs; Helmut Blum; Elisabeth Kremmer; Ina M Vorberg; Wolfgang Hammerschmidt; Laurent Lacroix; Olivier Hyrien; Benjamin Audit; Aloys Schepers

doi:10.7554/eLife.62161

. 2021 Mar 8;10:e62161. doi: 10.7554/eLife.62161

Human ORC/MCM density is low in active genes and correlates with replication time but does not delimit initiation zones

Nina Kirstein ^1,^†, Alexander Buschle ², Xia Wu ^3,^‡, Stefan Krebs ⁴, Helmut Blum ⁴, Elisabeth Kremmer ⁵, Ina M Vorberg ^6,⁷, Wolfgang Hammerschmidt ², Laurent Lacroix ³, Olivier Hyrien ^3,^✉, Benjamin Audit ^8,^✉, Aloys Schepers ^1,^§,^✉

Editors: Bruce Stillman⁹, Kevin Struhl¹⁰

PMCID: PMC7993996 PMID: 33683199

Abstract

Eukaryotic DNA replication initiates during S phase from origins that have been licensed in the preceding G1 phase. Here, we compare ChIP-seq profiles of the licensing factors Orc2, Orc3, Mcm3, and Mcm7 with gene expression, replication timing, and fork directionality profiles obtained by RNA-seq, Repli-seq, and OK-seq. Both, the origin recognition complex (ORC) and the minichromosome maintenance complex (MCM) are significantly and homogeneously depleted from transcribed genes, enriched at gene promoters, and more abundant in early- than in late-replicating domains. Surprisingly, after controlling these variables, no difference in ORC/MCM density is detected between initiation zones, termination zones, unidirectionally replicating regions, and randomly replicating regions. Therefore, ORC/MCM density correlates with replication timing but does not solely regulate the probability of replication initiation. Interestingly, H4K20me3, a histone modification proposed to facilitate late origin licensing, was enriched in late-replicating initiation zones and gene deserts of stochastic replication fork direction. We discuss potential mechanisms specifying when and where replication initiates in human cells.

Research organism: Human, Mouse

Introduction

In human cells, DNA replication initiates from 20,000 to 50,000 replication origins selected from a five- to tenfold excess of potential or ‘licensed’ origins (Moiseeva and Bakkenist, 2018; Papior et al., 2012). Origin licensing, also called pre-replicative complex (pre-RC) formation, occurs in late mitosis and during the G1 phase of the cell cycle. During this step, the origin recognition complex (ORC) binds DNA and, together with Cdt1 and Cdc6, loads minichromosome maintenance complexes (MCM), the core motor of the replicative helicase, as inactive head-to-head double hexamers (MCM-DHs) around double-stranded DNA (Bell and Kaguni, 2013; Evrin et al., 2009; Remus and Diffley, 2009). A single ORC reiteratively loads multiple MCM-DHs. However, once MCM-DHs have been assembled, ORC does not maintain contact with the MCM-DH and neither ORC, nor Cdc6, nor Cdt1 are required for origin activation (Fragkos et al., 2015; Hyrien, 2016; Powell et al., 2015; Remus et al., 2009; Rowles et al., 1999; Sun et al., 2014; Yeeles et al., 2015). During S phase, CDK2 and CDC7 kinase activities in conjunction with other origin-firing factors convert some MCM-DHs into pairs of active CDC45-MCM-GINS helicases that nucleate bidirectional replisome establishment (Douglas et al., 2018; Moiseeva and Bakkenist, 2018). MCM-DHs that do not initiate replication are dislodged from DNA during replication.

In Saccharomyces cerevisiae, origins are genetically defined by specific DNA sequences (Marahrens and Stillman, 1992). In multicellular organisms, no consensus sequence for origin activity has been identified and replication initiates from flexible locations. Although mammalian origins fire at different times through S phase, neighboring origins tend to fire at similar times, partitioning the genome into ~5,000 replication timing domains (RTDs) (Rivera-Mulia and Gilbert, 2016a). RTDs replicate in a reproducible order through S phase (Pope et al., 2014; Zhao et al., 2017). One model for this temporal regulation suggests that RTDs are first selected for initiation, followed by stochastic origin firing within domains (Boulos et al., 2015; Pope et al., 2014; Rhind and Gilbert, 2013; Rivera-Mulia and Gilbert, 2016b). A cascade or domino model suggests that replication first initiates at the most efficient (master) origins and then spreads to less efficient origins within an RTD (Boos and Ferreira, 2019; Guilbaud et al., 2011). Various processes and factors contribute to origin specification such as transcription, DNA sequences, histone variants, histone modifications, and nucleosome dynamics (Akerman et al., 2020; Cayrou et al., 2015; Long et al., 2020; Petryk et al., 2016; Prioleau and MacAlpine, 2016; Smith and Aladjem, 2014). For example, we proposed H4K20me3 to support the licensing of a subset of late-replicating origins in heterochromatin (Brustel et al., 2017). Recently, the histone variant H2A.Z has been implicated in ORC recruitment at early origins through deposition of H4K20me2 by histone methyltransferase SUV420H1 (Long et al., 2020). Furthermore, binding sites for the origin-firing factor Treslin-MTBP often feature a nucleosome-free gap adjacent to H3K4me2 (Kumagai and Dunphy, 2020).

Different approaches have been developed to characterize mammalian origins. Origins have been mapped at the single-molecule level by optical methods or at the cell-population level by sequencing various purified replication intermediates, such as short nascent strands, replication bubbles, and Okazaki fragments (Hulke et al., 2020). Strand-oriented sequencing of Okazaki fragments (OK-seq) reveals the population-averaged replication fork direction (RFD) allowing to map initiation and termination (Chen et al., 2019; McGuffee et al., 2013; Petryk et al., 2016; Smith and Whitehouse, 2012). Bubble-seq (Mesner et al., 2013), single-molecule analyses (Demczuk et al., 2012; Lebofsky et al., 2006; Norio et al., 2005), and OK-seq (Petryk et al., 2016; Wu et al., 2018) studies of human cells all suggest that replication initiates in broad but circumscribed zones consisting of multiple, individually inefficient sites. OK-seq revealed both, early-firing initiation zones (IZs), which are precisely flanked on one or both sides by actively transcribed genes, and late-firing IZs distantly located from active genes (Petryk et al., 2016). Recently, an excellent agreement was observed between early-firing IZs determined by OK-seq and by EdUseq-HU, which identifies nascent DNA synthesized in early S phase cells in the presence of EdU and hydroxyurea (Tubbs et al., 2018). Furthermore, high-resolution Repli-seq identified both early and late IZs consistent with OK-seq IZs (Zhao et al., 2020).

Chromatin immunoprecipitation followed by sequencing (ChIP-seq) was used to map ORC and MCM chromatin binding. In Drosophila, ORC often binds next to open chromatin marks found at transcription start sites (TSSs) (MacAlpine et al., 2010), but MCMs, initially loaded next to ORC, are more abundantly loaded and widely redistributed when cyclin E/CDK2 activity rises in late G1 (Powell et al., 2015). In human cells, ChIP-seq of single ORC subunits identified from 13,000 to 101,000 ORC potential binding sites (Dellino et al., 2013; Long et al., 2020; Miotto et al., 2016). These studies consistently demonstrated a correlation of ORC-DNA binding with TSSs, open chromatin regions, and early replication timing (RT). ChIP-seq of Mcm7 in HeLa cells suggested that MCM-DHs bind regardless of the chromatin environment, but are preferentially activated upstream of active TSSs (Sugimoto et al., 2018). We and others previously used the Epstein–Barr virus (EBV), whose replication in latency is entirely dependent on the human licensing machinery, to compare ORC and MCM binding and replication initiation sites (Chaudhuri et al., 2001; Dhar et al., 2001; Papior et al., 2012; Ritzi et al., 2003; Schepers et al., 2001). A five- to tenfold excess of potential origins were licensed per genome with respect to 1–3 mapped initiation event(s) (Norio, 2001; Norio and Schildkraut, 2004; Papior et al., 2012). These findings support the model that human replication initiates in zones, which comprise multiple, individually inefficient sites.

Here, we present the first comparative survey of four different pre-RC components, replication initiation events, transcription activity, and RT in the genome of the human lymphoblastoid Raji cell line by combining ChIP-seq, OK-seq, RNA-seq, and previously published Repli-seq data (Sima et al., 2018). We find that, in pre-replicative (G1) chromatin, ORC and MCM are broadly distributed over the genome with high ORC density better correlating with early RT than MCM density. ORC/MCM are depleted from actively transcribed gene bodies and enriched at active gene promoters. ORC/MCM density is homogeneous over non-transcribed genes and intergenic regions of comparable RT. Furthermore, regions of similar RT show a similar ORC/MCM density, be they IZs, termination zones, undirectionally replicating regions (presumably lacking initiation events), or randomly replicating regions. These findings suggest that ORC/MCM densities do not solely determine IZs and that a specific contribution of the local chromatin environment is required. Indeed, we previously showed that IZs are enriched in open chromatin marks typical of active or poised enhancers (Petryk et al., 2016). We further show that a subset of non-genic late IZs is enriched in H4K20me3, confirming previous finding that H4K20me3 enhances origin activity in certain chromatin environments (Brustel et al., 2017; Shoaib et al., 2018).

These findings support the cascade model for replication initiation: the entire genome (except transcribed genes) is licensed for replication initiation. Additional process and factors like adjacent active transcription and epigenetic marks are required to specify master zones of higher replication initiation efficiency. The distributed licensing pattern allows the stochastic activation of secondary origins, possibly triggered by approaching replication forks.

Results

Moderate averaging is a suitable approach for ORC and MCM-DH distribution analysis

We used centrifugal elutriation to obtain a G1-enriched, pre-replicative population of human lymphoblastoid Raji cells (Papior et al., 2012). Propidium iodide staining followed by FACS (Figure 1—figure supplement 1a) and western blot analyses of cyclins A, B, and H3S10 phosphorylation (Figure 1—figure supplement 1b) confirmed the cell cycle stages of elutriated fractions. To ensure unbiased mapping of ORC and MCM, we simultaneously targeted Orc2, Orc3, Mcm3, and Mcm7 using validated ChIP-grade antibodies (validated in Papior et al., 2012; Ritzi et al., 2003; Schepers et al., 2001). ChIP efficiencies and qualities were measured using the EBV latent origin oriP as reference (Figure 1—figure supplement 1c). Raji cells contain 50–60 EBV episomes, allowing an easy detection of ORC/MCM at oriP (Adams et al., 1973). The viral protein EBNA1 recruits ORC to oriP’s dyad symmetry element, followed by MCM-DH loading. We detected both ORC and MCM at the dyad symmetry element in G1, whereas a population containing S-G2-M-phased cells depict a reduction in MCM levels, as expected (Figure 1—figure supplement 1c; Papior et al., 2012; Ritzi et al., 2003).

ChIP-seq of two replicates for ORC subunits (Orc2, Orc3) and of three replicates for MCM proteins (Mcm3, Mcm7) resulted in reproducible, but dispersed, ChIP-seq signals as exemplified by the well-characterized replication origin Mcm4/PRKDC (Figure 1a; Ladenburger et al., 2002; Schaarschmidt et al., 2002). We employed the MACS2 peak-calling program (Feng et al., 2012; Zhang et al., 2008), but found that the obtained results were too dependent on the chosen program settings and that ORC and MCM distributions were too dispersed to be efficiently captured by peak calling (data not shown), requiring an alternative approach.

Figure 1. — (a) Sequencing profile visualization in UCSC Genome Browser (hg19) at the Mcm4/PRKDC origin after reads per genomic content normalization: two samples of Orc2 and Orc3, and three samples of Mcm3 and Mcm7, are plotted against the input in three replicates. The profiles are shown in a 10 kb window (chr8: 48,868,314–48,878,313); the mapped position of the origin is indicated as green line. (b) The profile of ORC/MCM ChIP-seq after 1 kb binning at the same locus. The reads of replicates were summed and normalized by the total genome-wide ChIP read frequency followed by input division. Y-axis represents the resulting relative read frequency. (c) Correlation plot between Orc2 and Orc3 relative read frequencies in 1 kb bins. (d) Correlation plot between Mcm3 and Mcm7 relative read frequencies in 1 kb bins. (e) Heatmap of Pearson correlation coefficients r between all ChIP relative read frequencies in 1 kb bins. Column and line order were determined by complete linkage hierarchical clustering using the correlation distance (d = 1 r). Refer to Figure 1—figure supplement 3 for data representation without input division.

Figure 1—figure supplement 1. — (a) Sequencing profile visualization in UCSC Genome Browser (hg19) at the Mcm4/PRKDC origin after reads per genomic content normalization: two samples of Orc2 and Orc3, and three samples of Mcm3 and Mcm7, are plotted against the input in three replicates. The profiles are shown in a 10 kb window (chr8: 48,868,314–48,878,313); the mapped position of the origin is indicated as green line. (b) The profile of ORC/MCM ChIP-seq after 1 kb binning at the same locus. The reads of replicates were summed and normalized by the total genome-wide ChIP read frequency followed by input division. Y-axis represents the resulting relative read frequency. (c) Correlation plot between Orc2 and Orc3 relative read frequencies in 1 kb bins. (d) Correlation plot between Mcm3 and Mcm7 relative read frequencies in 1 kb bins. (e) Heatmap of Pearson correlation coefficients r between all ChIP relative read frequencies in 1 kb bins. Column and line order were determined by complete linkage hierarchical clustering using the correlation distance (d = 1 r). Refer to Figure 1—figure supplement 3 for data representation without input division.

Consequently, we summed up the reads of the ChIP replicates at different binning sizes and normalized the signals against the mean read frequencies of each ChIP sample and against input, as is standard in most ChIP-seq analyses. We computed the Pearson correlation coefficients between ORC/MCM ChIPs and obtained good correlations at 1 kb bin size and only marginal improvement at larger sizes (Figure 1—figure supplement 2a). When working in 1 kb bins, we still detected the enrichment of ORC/MCM at the MCM/PRKDC origin (Figure 1b), indicating that we do not lose local, biologically relevant signals.

In line with a previous report (Teytelman et al., 2009), the input control was significantly underrepresented in DNase hypersensitive (HS) regions, at TSSs, and at early RTDs (Figure 1—figure supplement 2b–d). As sonication-hypersensitive regions correlate with DNase HS regions (Schwartz et al., 2005), we carefully compared our results obtained with and without input normalization. For example, we still detect enrichment of ORC/MCM at the MCM/PRKDC origin when we omit input normalization (Figure 1—figure supplement 3). As will become apparent, similar conclusions were obtained in further analyses performed with or without input normalization.

The reliability and reproducibility of our ChIP experiments is reflected by the high Pearson correlation coefficients of the relative read frequencies of Orc2/Orc3 (r = 0.866, Figure 1c) and Mcm3/Mcm7 (r = 0.879, Figure 1d). The correlations between ORC and MCM were only slightly lower (Mcm3/Orc2/3: r = 0.775/0.757; Mcm7/Orc2/3: r = 0.821/0.800, Figure 1e). Hierarchical clustering based on Pearson correlation of ChIP profiles clustered ORC and MCM profiles together. Similar results were obtained using non input-normalized data (Figure 1—figure supplement 3b–d). To compare our ChIP-seq data to previously published Orc2 ChIP-seq from asynchronously cycling K562 cells (GSE70165; Miotto et al., 2016), we calculated the relative read frequencies of our ORC ChIPs around an aggregate of K562 Orc2 peaks (>1 kb) and found substantial enrichment (Figure 1—figure supplement 3e). Miotto et al., 2016 reported that Orc2 co-localizes with DNase HS sites present at active promoters and enhancers. In line with these observations, we found a significant enrichment of ORC at DNase HS regions > 1 kb, compared to regions deprived of DNase HS sites, with or without input normalization (Figure 1—figure supplement 3f, g). These results further validate our data.

ORC/MCM are enriched in IZs dependent on transcription

We next compared the relative read frequencies of ORC/MCM to active replication initiation units. Using OK-seq in Raji cells (Wu et al., 2018), we calculated the RFD (see Materials and Methods) and delineated preferential replication IZs as ascending segments (ASs) of the RFD profile. RFD profiles present upshifts that define origins to kilobase resolution in yeast (McGuffee et al., 2013), but in mammalian cells these transitions are more gradual, extending over 10–100 kb (Chen et al., 2019; McGuffee et al., 2013; Petryk et al., 2016; Tubbs et al., 2018; Wu et al., 2018). We analyzed ASs > 20 kb, allowing to assess ChIP signals up to 10 kb within ASs (see Materials and Methods). Using the RFD shift across the ASs (ΔRFD) as a measure of replication initiation efficiency, we further required ΔRFD > 0.5 to select the most efficient IZs. In total, we selected 2957 ASs, with an average size of 52.3 kb, which covered 4.9% (155 Mb) of the genome (Figure 2a, green bars, Table 1). In total, 2451 (83%) of all AS located close to genic regions (ASs extended by 20 kb on both sides overlapped with at least one annotated gene). Performing RNA-seq in asynchronously cycling Raji cells, we determined that 673 ASs (22.8% of all ASs) were flanked by actively transcribed genes (transcripts per kilobase per million [TPM] >3) on both sides (type 1 AS), with less than 20 kb between AS borders and the closest transcribed gene. In total, 1026 ASs (34.7%) had only one border associated to a transcribed gene (type 2 AS; TPM >3). Also, 506 ASs (17.1%) were devoid of proximal genes (non-genic AS) (Table 1). The slope did not change considerably between the different AS types, although type 1 ASs were on average slightly more efficient, followed by type 2 ASs, then non-genic ASs (Figure 2—figure supplement 1a). Type 1 and 2 ASs located within early RTDs, while non-genic ASs were predominantly late replicating (Figure 2—figure supplement 1b), as previously observed in GM06990 and HeLa (Petryk et al., 2016).

Figure 2. — (a) Top panel: example of an replication fork direction (RFD) profile on chr1: 178,400,000–182,800,000, covering 4 Mb. Detected ASs are labeled by green rectangles (irrespective of length and RFD shift). Middle and bottom panels: representative Mcm3 (blue) and Orc2 (red) chromatin immunoprecipitation followed by sequencing (ChIP-seq) profiles after binning for the same region. (**b–e**) Average input-normalized relative ChIP read frequencies of Orc2, Orc3, Mcm3, and Mcm7 at AS borders of (b) all AS (L > 20 kb and ΔRFD >0.5; n = 2957), (c) type 1 ASs with transcribed genes at both AS borders (n = 673), (d) type 2 ASs oriented with their AS border associated to transcribed genes at the right (n = 1026), and (e) non-genic ASs in gene-deprived regions (n = 506). The mean of ORC and MCM relative read frequencies is shown ±2 × SEM (lighter shadows). The dashed grey horizontal line indicates relative read frequency 1.0 for reference. For type 1 and 2 ASs, yellow bars mark the AS borders associated to transcribed genes. Refer to Figure 2—figure supplement 2 for analysis without input division.

Figure 2—figure supplement 1. — (a) Top panel: example of an replication fork direction (RFD) profile on chr1: 178,400,000–182,800,000, covering 4 Mb. Detected ASs are labeled by green rectangles (irrespective of length and RFD shift). Middle and bottom panels: representative Mcm3 (blue) and Orc2 (red) chromatin immunoprecipitation followed by sequencing (ChIP-seq) profiles after binning for the same region. (**b–e**) Average input-normalized relative ChIP read frequencies of Orc2, Orc3, Mcm3, and Mcm7 at AS borders of (b) all AS (L > 20 kb and ΔRFD >0.5; n = 2957), (c) type 1 ASs with transcribed genes at both AS borders (n = 673), (d) type 2 ASs oriented with their AS border associated to transcribed genes at the right (n = 1026), and (e) non-genic ASs in gene-deprived regions (n = 506). The mean of ORC and MCM relative read frequencies is shown ±2 × SEM (lighter shadows). The dashed grey horizontal line indicates relative read frequency 1.0 for reference. For type 1 and 2 ASs, yellow bars mark the AS borders associated to transcribed genes. Refer to Figure 2—figure supplement 2 for analysis without input division.

Table 1. Characterization of different AS subtypes.

	Number	Genome coverage (%)	Average length (kb)
All AS	2957	4.9	52.3
Genic AS	2451	4.1	52.3
Type 1 AS	673	1.1	50.7
Type 2 AS	1026	5.2	50.2
Non-genic AS	506	0.8	50.7
Only AS ≥20 kb with ΔRFD > 0.5 were considered. Genic ASs: ASs extended 20 kb on both sides is overlapped by genic region(s) irrespective of transcriptional activity; type 1 and type 2 AS: ASs flanked by expressed genes (TPM ≥3) within 20 kb on both sides (type 1) or one side (type 2); non-genic: no annotated gene ±20 kb of AS borders; AS: ascending segment; RFD: replication fork direction; TPM: transcripts per kilobase per million.

Open in a new tab

To study the relationship between ORC/MCM densities and replication initiation, we computed the relative read frequencies of ORC/MCM around all AS aggregate borders. Both ORC and MCM were, on average, enriched within ASs compared to flanking regions (Figure 2b, Figure 2—figure supplement 2a without input division). To resolve the impact of transcriptional activity, we repeated this calculation for the different AS types (Figure 2c-e; non-input-normalized data in Figure 2—figure supplement 2b–d). Transcriptional activity in AS flanking regions was associated with increased ORC/MCM levels inside ASs (compare Figure 2b, c) and a prominent MCM depletion from transcribed regions (Figure 2c, d, right border). In contrast, in type 2 ASs, ORC/MCM levels remained elevated at non-transcribed AS borders (Figure 2d, left border). No ORC/MCM enrichment was detected within non-genic ASs (Figure 2e).

AS borders associated with transcriptional activity were locally enriched in ORC/MCM (Figure 2c, d, both and right borders respectively). This is in line with previously detected Orc1 accumulation at AS borders (Petryk et al., 2016). Reciprocally, non-genic AS borders only showed a local dip in ORC/MCM levels (Figure 2d, left border; Figure 2e, both borders), but the biological significance of this observation remains unclear. A sequence analysis revealed biased distributions of homopolymeric repeat sequences at AS borders (data not shown). Such sequences may affect nucleosome formation and ORC binding, but may also bias Okazaki fragment/AS border detection at small scales (Figure 2—figure supplement 1a) as well as mappability.

ORC and MCM are depleted from transcribed gene bodies and enriched at TSSs

Consistent with previous OK-seq studies (Chen et al., 2019; Petryk et al., 2016), the average RFD profile of active genes revealed strong ASs upstream of TSSs and downstream of transcriptional termination site (TTSs), and descending RFD segments (DSs) across the active gene bodies (Figure 3—figure supplement 1a). This behavior depended on transcriptional activity as silent genes displayed an overall flat RFD profile (Figure 3—figure supplement 1a). When setting our ORC/MCM ChIP-seq data in relation to transcription, we observed that the ORC relative read distribution was significantly enriched at active TSSs, as already demonstrated in Drosophila (MacAlpine et al., 2010) and human cells (Dellino et al., 2013; Miotto et al., 2016). Thereby, ORC relative read distribution was moderately but significantly higher upstream of TSSs and downstream of TTSs than within active genes (Figure 3a). These observations were independent of input normalization (compare Figure 3a with Figure 3—figure supplement 1b). The depletion of ORC from gene bodies was statistically significant for approximately 45% of actively transcribed genes (Table 1). Compared to ORC, Mcm3 and Mcm7 enrichments at TSSs were less prominent, but depletions from gene bodies were more pronounced (Figure 3a, Figure 3—figure supplement 1b), with 75% and 58% of investigated transcribed gene bodies significantly depleted from Mcm3 and Mcm7, respectively (Table 1). Depletion was strictly homogeneous from TSS to TTS, strongly suggesting that transcription itself displaces ORC and MCM-DH complexes (Figure 3a). In contrast, at silent genes, ORC/MCM were hardly enriched at TSSs and were not depleted from gene bodies (Figure 3b, Figure 3—figure supplement 1c). Increasing transcriptional activity (classified as low: 3–10 TPM; mid: 10–40 TPM; high:>40 TPM) did not have any major impact on ORC/MCM enrichments at TSSs (Figure 3c, Figure 3—figure supplement 1d). ORC/MCM depletion within gene bodies was slightly more pronounced with increasing transcription levels when normalized for input (Figure 3d), but this was less convincing without input normalization (Figure 3—figure supplement 1e). Basal ORC/MCM levels upstream of TSSs and downstream of TTSs were identical, indicating that the local ORC enrichment at TSSs did not result in more MCM loading upstream than downstream of active genes.

Figure 3. — (**a, b**) ORC/MCM relative read frequencies around TSSs or transcriptional termination sites (TTSs) for (a) active genes (transcripts per kilobase per million [TPM] >3) and (b) inactive genes (TPM <3). Only genes larger than 30 kb without any adjacent gene within 15 kb were considered. Distances from TSSs or TTSs are indicated in kb. Means of ORC and MCM frequencies are shown ±2 × SEM (lighter shadows). The dashed grey horizontal line indicates relative read frequency 1.0 for reference. (c) ORC/MCM relative read frequencies at TSSs dependent on transcriptional activity (±2 × SEM). (d) ORC/MCM relative read frequencies upstream of TSSs and within the gene body dependent on transcriptional activity (±2 × SEM; TSSs ± 3 kb removed from analysis). Transcriptional activity was classified as no (TPM <3), low (TPM 3–10), mid (TPM 10–40), and high (TPM >40). Statistics were performed by one-way ANOVA followed by Tukey’s post-hoc test. p-Values are indicated always comparing to the previous transcriptional level. *p<0.05, **p<0.01, ***p<0.001. Refer to Figure 3—figure supplement 1 for analyses without input division.

Figure 3—figure supplement 1. — (**a, b**) ORC/MCM relative read frequencies around TSSs or transcriptional termination sites (TTSs) for (a) active genes (transcripts per kilobase per million [TPM] >3) and (b) inactive genes (TPM <3). Only genes larger than 30 kb without any adjacent gene within 15 kb were considered. Distances from TSSs or TTSs are indicated in kb. Means of ORC and MCM frequencies are shown ±2 × SEM (lighter shadows). The dashed grey horizontal line indicates relative read frequency 1.0 for reference. (c) ORC/MCM relative read frequencies at TSSs dependent on transcriptional activity (±2 × SEM). (d) ORC/MCM relative read frequencies upstream of TSSs and within the gene body dependent on transcriptional activity (±2 × SEM; TSSs ± 3 kb removed from analysis). Transcriptional activity was classified as no (TPM <3), low (TPM 3–10), mid (TPM 10–40), and high (TPM >40). Statistics were performed by one-way ANOVA followed by Tukey’s post-hoc test. p-Values are indicated always comparing to the previous transcriptional level. *p<0.05, **p<0.01, ***p<0.001. Refer to Figure 3—figure supplement 1 for analyses without input division.

Our observation that Mcm3 and Mcm7 are significantly depleted from transcribed gene bodies is consistent with their active displacement by transcription in G1, as previously proposed in Drosophila (Powell et al., 2015) and human cells (Macheret and Halazonetis, 2018). This depletion process contributes to delineating IZs flanked by active genes. In contrast, ORC/MCM density remained constant across non-genic AS borders (Figure 2), suggesting that ORC/MCM are not sufficient to delimit non-genic replication IZs.

ORC/MCM genomic distributions are broad and correlate with RT but not IZs

RT is a crucial aspect of genome stability that is correlated with gene expression and chromatin structure, which coordinate the selection of origins and timing of origin firing (Knott et al., 2009). In yeast, it has been reported that the number of MCM-DHs loaded at origins correlates with RT, suggesting how RT profiles can emerge from stochastic origin firing (Das et al., 2015; Yang et al., 2010). In human cells, ORC binding data have also been used to predict RT profiles (Miotto et al., 2016). To clarify the relationships between IZ location, IZ firing time, and ORC/MCM density in human cells, we used Raji Early/Late Repli-seq data from Sima et al., 2018 and related RT to ORC/MCM relative read frequencies and RFD slope (Sima et al., 2018).

We analyzed four different types of RFD pattern categories (exemplified in Figure 4—figure supplement 1a, b) as previously defined in Petryk et al., 2016: (i) ascending RFD segments (ASs), that is, predominant-IZs; (ii) descending RFD segments (DSs), that is, predominant-termination zones (TZ); (iii) flat segments of high |RFD| (|RFD| > 0.8 over >300 kb), that is, unidirectionally replicating regions (URRs), where replication forks always migrate in the same direction, implying a lack of initiation events; and (iv) flat segments of null RFD regions (NRRs; |RFD| < 0.15 over >500 kb), presumably replicating by random initiation and termination, mainly observed in late-replicating gene deserts (Figure 4—figure supplement 1c).

We calculated relative Orc2 and Mcm3 (Figure 4, Figure 4—figure supplement 1d, e for Orc3 and Mcm7) read frequencies in 10 kb bins against RT in intergenic regions (left column), silent gene bodies (TPM <3, middle column), or active gene bodies (TPM >3, right column). We considered either all bins (top row) or bins corresponding to ASs, DSs, URRs, and NRRs (following rows in descending order). Histograms were normalized by column, that is, each column is the probability density function of ChIP frequency at a given RT bin.

Consistently with Figure 3a, b, expressed genes showed lower ORC/MCM densities than silent genes and intergenic regions (Figure 4, Figure 4—figure supplement 1d, e). This was particularly significant in early- and mid-replicating regions, as demonstrated by Kolmogorov–Smirnov statistics (Figure 4—figure supplement 2a, red circles). The depletion was more pronounced for MCM than ORC, as already noted in Figure 3a. In contrast, the difference between silent genes and intergenic regions was at best marginally significant (blue circles). In all cases, ORC/MCM densities monotonously decreased from early to late RT windows, but this RT dependency was much attenuated in expressed genes, particularly for MCM, as expected if transcription removes this complex from both early- and late-replicating genes (Figure 4b, Figure 4—figure supplement 1e).

Strikingly, our analysis did not reveal any clear differences in the levels of ORC/MCM between intergenic ASs, DSs, URRs, and NRRs when bins of similar RT were compared (Figure 4—figure supplement 2a). A similar behavior was also apparent for ASs, DSs, URRs, and NRRs in silent genes and for DSs and URRs in active genes. Note that the few (579) bins corresponding to ASs in active genes are probably misleading as they are mainly attributable to annotation errors: the annotated gene overlapped the AS but the RNA-seq signal was confined outside the AS (Figure 4; Figure 4—figure supplement 1d, e; Petryk et al., 2016). In summary, the densities of ORC/MCM across genomic segments were related to RT and gene expression but were not predictive of any RFD pattern.

Strictly speaking, the slope of an RFD segment is proportional to the difference between the density of initiation and termination events within the segment (Audit et al., 2013). Therefore, we cannot exclude delocalized initiation events within DSs, which would explain why DSs were not significantly depleted in ORC/MCM compared to ASs (Figure 4—figure supplement 2a). In contrast, we can almost certainly exclude initiation events within URRs, but their ORC/MCM densities were not significantly lower than in ASs. This suggests that specific mechanisms repress potential origins in URRs and/or activate them in ASs.

Taken together, these results suggest that the density of ORC/MCM is not a reliable predictor of initiation probability, even though ORC density (and to a lesser extent MCM density) well correlated with RT. Thus, potential origins are widespread through the genome, but additional genetic or epigenetic factors are regulating whether and when they fire.

Cell cycle dynamics of ORC and MCM binding

The results above revealed a gradient of ORC/MCM densities according to RT. To confirm this observation, we extracted early and late RTDs employing a threshold of log₂(Early/Late) > 1.6 for early RTDs and <−2.0 for late RTDs, which resulted in 302 early RTDs covering 642.8 Mb and 287 late RTDs covering 617.4 Mb of the genome. Restricting the analysis to intergenic regions, we calculated the mean ORC/MCM relative read frequencies of pre-replicative G1-phased chromatin in early compared to late RTDs. ORC was 1.4 times enriched in early RTDs compared to late RTDs (Figure 5a, Figure 4—figure supplement 2b, Table 2). Mcm3 and Mcm7 levels, although showing the same tendencies, were less contrasted than ORC.

Figure 5. — (a) Origin recognition complex/minichromosome maintenance complex (ORC/MCM) G1 chromatin relative read frequencies (±2 × standard error of the mean [SEM]) in early or late replication timing domains (RTDs). Early RTDs were defined as log₂(Early/Late) > 1.6; late RTDs < −2.0. The analysis was performed in 10 kb bins. Any gene ±10 kb was removed from the analysis. Statistics were performed using one-sided t-test. (b) ORC/MCM relative read frequencies (±2 × SEM) obtained from S-G2-M chromatin in early or late RTDs using the same settings as in (a). (c) Average ORC/MCM relative read frequencies at H4K20me3 peaks (>1 kb). (d) H4K20me3 relative read frequencies at AS borders of the different AS types. Type 2 ASs are oriented with their AS borders associated to transcribed genes at the right. Means of H4K20me3 relative read frequencies are shown ±2 × SEM (lighter shadows). (e) Boxplot representation of H4K20me3 relative read frequencies within the different AS types. Boxplot represents the mean (circle), median (thick line), first and third quartile (box), and first and ninth decile (whiskers) of the relative read frequencies in each AS type. Statistics were performed by one-way ANOVA followed by Tukey’s post-hoc test. (f) Histogram representation of mean ±2 × SEM of ORC/MCM relative read frequencies in G1 at 242 H4K20me3-low non-genic ASs and 154 H4K20me3-high non-genic ASs. Statistics were performed using one-sided t-test. ***p<0.001. Refer to Figure 5—figure supplement 1 for validation of H4K20me3 chromatin immunoprecipitation.

Figure 5—figure supplement 1. — (a) Origin recognition complex/minichromosome maintenance complex (ORC/MCM) G1 chromatin relative read frequencies (±2 × standard error of the mean [SEM]) in early or late replication timing domains (RTDs). Early RTDs were defined as log₂(Early/Late) > 1.6; late RTDs < −2.0. The analysis was performed in 10 kb bins. Any gene ±10 kb was removed from the analysis. Statistics were performed using one-sided t-test. (b) ORC/MCM relative read frequencies (±2 × SEM) obtained from S-G2-M chromatin in early or late RTDs using the same settings as in (a). (c) Average ORC/MCM relative read frequencies at H4K20me3 peaks (>1 kb). (d) H4K20me3 relative read frequencies at AS borders of the different AS types. Type 2 ASs are oriented with their AS borders associated to transcribed genes at the right. Means of H4K20me3 relative read frequencies are shown ±2 × SEM (lighter shadows). (e) Boxplot representation of H4K20me3 relative read frequencies within the different AS types. Boxplot represents the mean (circle), median (thick line), first and third quartile (box), and first and ninth decile (whiskers) of the relative read frequencies in each AS type. Statistics were performed by one-way ANOVA followed by Tukey’s post-hoc test. (f) Histogram representation of mean ±2 × SEM of ORC/MCM relative read frequencies in G1 at 242 H4K20me3-low non-genic ASs and 154 H4K20me3-high non-genic ASs. Statistics were performed using one-sided t-test. ***p<0.001. Refer to Figure 5—figure supplement 1 for validation of H4K20me3 chromatin immunoprecipitation.

Table 2. Ratio of chromatin immunoprecipitation (ChIP) mean relative read frequencies in early versus late replication timing domains and G1 versus S-G2-M samples.

	Mean relative read frequency ratio (early/late)		Mean relative read frequency ratio (G1/S-G2-M)
	G1	S-G2-M	Early	Late
Orc2	1.40	1.18	1.11	0.93
Orc3	1.47	1.24	1.10	0.93
Mcm3	1.15	0.93	1.16	0.93
Mcm7	1.19	1.02	1.11	0.96
Calculated in 10 kb bins. All annotated genic regions were removed ± 10 kb.

Open in a new tab

To confirm the biological relevance of this finding, we repeated this analysis using chromatin from late S-G2-M chromatin (elutriation fraction 80 ml/min; Figure 1—figure supplement 1a, b), when replication has displaced most of the MCMs, as exemplified by qPCR at oriP’s dyad symmetry element (Figure 1—figure supplement 1c). In S-G2-M chromatin, ORC was still significantly enriched in early RTDs, compared to late RTDs, but MCMs were not, consistent with completed replication of early but not late RTDs (Figure 5b, Figure 4—figure supplement 2c, Table 2). These results demonstrate that the MCM signal is dynamic through the cell cycle as expected. These results also show that the preferential binding of ORC to early replicating (open) chromatin is not dependent on cell cycle stage.

Given that Orc1 in human cells is degraded at the G1-S transition and in early S phase (Kreitz et al., 2001; Méndez et al., 2002; Ohta et al., 2003), it might appear surprising that we detect Orc2 and Orc3 binding to S-G2-M chromatin. ChIP-seq only allows monitoring the relative distribution of chromatin-bound proteins along the genome and not their absolute levels. We therefore do not exclude that Orc2 and Orc3 binding to chromatin is globally and origin-specifically decreased after G1-S entry (Gerhardt et al., 2006; Siddiqui and Stillman, 2007). In human cells, Orc1 reappears as cells enter mitosis and is the first ORC subunit to bind to mitotic chromosomes, but other ORC subunits seem to join only in daughter G1 cells (Kara et al., 2015). Nevertheless, GFP‐tagged Orc1 was found to associate with chromatin throughout mitosis in living Chinese hamster cells and to co-localize with Orc4 in metaphase spreads (Okuno et al., 2001). The binding of Orc2 and Orc3 we detect in S-G2-M may either occur independently of Orc1 or reflect the binding of the entire complex in late mitotic cells.

Late-replicating non-genic ASs and NRRs are characterized by H4K20me3

We and others recently demonstrated that H4K20me3 is involved in licensing a subset of late-replicating regions (Benetti et al., 2007; Brustel et al., 2017; Pannetier et al., 2008). Here, we looked further into the relation between this chromatin mark, ORC/MCM, and replication initiation. We performed ChIP for H4K20me3 and H4K20me1 in three replicates from G1-phased cells and validated them by qPCR (Figure 5—figure supplement 1a, b). An exemplary H4K20me3 profile is shown along ORC/MCM profiles in Figure 5—figure supplement 1d. We performed MACS2 broad peak detection, keeping only peaks overlapping in all three samples (16,852 peaks for H4K20me3 and 12,264 peaks for H4K20me1, ranging in size from 200 bp to 105 kb and 183 kb, respectively; Table 2, Figure 5—figure supplement 1c). We calculated ORC/MCM relative read frequencies binned at 1 kb resolution at H4K20me3/H4K20me1 peaks > 1 kb (12,251 and 6277 peaks, respectively). ORC and, to a lower extent, MCM were enriched at H4K20me3, but not H4K20me1 peaks (Figure 5c, Figure 5—figure supplement 1e).

H4K20me3 coverage at the different AS types depicts an increased H4K20me3 signal only in non-genic ASs, disclosing the first histone modification specific for late-replicating non-genic ASs (Figure 5d, e). Starting from 506 non-genic ASs, we extracted two subsets of 154 and 242 non-genic ASs, where H4K20me3 relative read frequencies were above the mean genome value by more than 1.5× standard deviation, or below the genome mean value, respectively. ORC/MCM were enriched at the H4K20me3-high subgroup compared to the H4K20me3-low subgroup (Figure 5f). These results suggest that the presence of H4K20me3 at transcriptionally independent, non-genic ASs may contribute at the origin-licensing step to specifying these regions as highly efficient ‘master' initiation zones (Ma-IZs) in late-replicating DNA. The difference of ORC/MCM densities between H4K20me3-high and -low subgroups was less pronounced in chromatin derived from S-G2-M-phased cells. The dynamic differences between the cell cycle fractions confirm the biological relevance of this finding (Figure 5—figure supplement 1f).

To further explore the links between H4K20me3 and replication initiation, we analyzed the density of this modification in genome segments of various RT, gene activity, and RFD patterns (Figure 6a). Several interesting observations emerged from this analysis. First, the H4K20me3 level was weakly but systematically more abundant in early than in late-replicating chromatin, suggesting that H4K20me3 is not exclusively present in late-replicating heterochromatin. Second, the H4K20me3 level was slightly lower in transcribed genes than in the non-transcribed rest of the genome (Figure 6a, Figure 5—figure supplement 1g). Third, AS and DS bins showed comparable distributions of H4K20me3 levels at comparable RT and gene expression status (Figure 6a, Figure 5—figure supplement 1g). Interestingly, NRRs showed a specific, broader distribution of H4K20me3 levels, including a higher proportion of highly enriched windows, especially compared to URRs (compare boxplots in Figure 6a, Figure 5—figure supplement 1g). Locally, high densities of H4K20me3 are therefore detected not only in late, non-genic AS segments but also in late-replicating gene deserts of null RFD, which presumably replicate by widespread, spatially random initiation. This result led us to repeat the analysis of ORC/MCM enrichment at H4K20me3-high and -low 10 kb intergenic bins in NRRs. Again, ORC/MCM were more abundant at H4K20me3-high than -low bins (Figure 6b). These findings support the hypothesis that H4K20me3 facilitates origin licensing specifically in these heterochromatic segments (Brustel et al., 2017).

Figure 6. — (a) 3 × 5 panel of 2D histograms of H4K20me3 chromatin immunoprecipitation relative read frequencies versus replication timing (RT) (average log₂(Early/Late) over 100 kb binned according to the decile of RT distribution). The analysis was performed in 10 kb bins. Histograms are normalized by column and displayed for different bin categories (columns: intergenic regions, silent genes, expressed genes; rows: all bins, ascending segment [AS], descending segment [DS], unidirectionally replicating region [URR], null replication fork directionality region [NRR] bins) as for origin recognition complex/minichromosome maintenance complex (ORC/MCM) in Figure 4. The number of bins per histogram is indicated in each panel. Superimposed boxplots represent the mean (circle), median (thick line), first and third quartile (box), and first and ninth decile (whiskers) of the relative read frequencies in each timing bins. Refer to Figure 5—figure supplement 1g for statistical comparisons. (b) Histogram representation of mean ±2 × SEM of ORC/MCM relative read frequencies at 3986 H4K20me3-low NRR 10 kb bins and 504 H4K20me3-high NRR 10 kb bins. Statistics were performed using one-sided t-test. ***p<0.001.

Discussion

The study presented here provides a novel, comprehensive genome-wide analysis of multiple pre-RC proteins compared with RFD, transcription, and RT profiles in human cells. We find a widespread presence of ORC/MCM throughout the genome, with variations that only depend on RT or active transcription. ORC/MCM are depleted from transcribed genes and enriched at TSSs. ORC/MCM are more abundant in early than in late RTDs. The even distribution of ORC/MCM observed within IZs is consistent with OK-seq results, suggesting that initiation probability is fairly homogeneous within IZs. However, when RT and transcriptional effects are controlled, no significant differences in ORC/MCM densities are detected between regions supporting either preferential replication initiation (ASs) or termination (DSs), or random replication (NRRs), or unidirectional, passive replication (URRs). We consequently propose that potential origins, defined by loaded MCM-DHs, are widespread through the genome and that preferential initiation sites are selected for activation in S phase based on additional genetic and/or epigenetic factors. We further show that subsets of non-genic ASs and randomly replicating gene deserts are enriched in H4K20me3, which helps recruiting ORC/MCM to these late-replicating segments.

Our data suggest that transcription has both positive and negative effects on origin activity. Actively transcribed gene bodies are depleted of ORC/MCM (Figure 2, Figure 3a). As reported in Drosophila (Powell et al., 2015), we propose that active transcription removes ORC/MCM from transcribed gene bodies, which reduces their replication initiation capacity. This mechanism is consistent with previous studies reporting that replication does not initiate within transcribed genes (Hamlin et al., 2010; Hyrien et al., 1995; Knott et al., 2009; Macheret and Halazonetis, 2018; Martin et al., 2011; Sasaki et al., 2006).

By contrast, ORC and, to a lesser degree, MCM are enriched at active TSSs (Figure 3). Active TSSs are regions of open chromatin structure characterized by DNase- or MNase hypersensitivity. Such hypersensitivity is also a hallmark of Ma-IZs (Boulos et al., 2015; Papior et al., 2012) and preferred ORC binding sites (Miotto et al., 2016; Petryk et al., 2016). However, this increased ORC binding does not necessarily increase local initiation efficiency since the most efficient initiation sites identified by EdUseq-HU within early IZs are associated with poly(dA:dT) tracts, but not TSSs (Tubbs et al., 2018). Our finding that MCMs are less enriched at TSSs than ORC also argues against highly preferential origin licensing at TSSs. Furthermore, since MCMs are distributed fairly evenly upstream and downstream of transcribed gene bodies (Figure 3a), the preferred binding of ORC at TSSs does not result in increased MCM-DH loading specifically upstream of genes.

We previously reported that IZs are enriched in the histone variant H2A.Z and in open chromatin marks (H3K4me1, H3K27ac, DNAse HS sites) typical of active or poised enhancers (Petryk et al., 2016), which could potentially explain why IZs are more accessible to firing factors than flanking segments with comparable MCM-DH density. Recently, it was reported that H2A.Z recruits Suv420H1, which induces H4K20 dimethylation (Long et al., 2020). H4K20me2 interacts with the BAH domain of ORC1 (Beck et al., 2012a; Kuo et al., 2012; Vermeulen et al., 2010). The H2A.Z–H4K20me2–ORC1 axis therefore supports a role for H2A.Z in ORC recruitment and origin licensing (Long et al., 2020; Petryk et al., 2016). Furthermore, H3.3/H2A.Z double variant–containing nucleosomes present at active promoters and other regulatory regions constitute a less stable nucleosome that is more easily displaced, resulting in nucleosome-free gaps (Jin et al., 2009). Interestingly, nucleosome-free gaps associated with H3K4me2 are found at most binding sites for the origin-firing factor Treslin/MTBP, which may form looping interactions with distantly located MCM-DH to promote dispersed initiation within broad zones (Kumagai and Dunphy, 2020). These results provide novel insight into how multiple open chromatin marks previously detected within IZs may promote not only origin licensing, but also origin firing.

H4K20 methylation has multiple functions in ensuring genome integrity, such as DNA replication (Beck et al., 2012b; Long et al., 2020; Picard et al., 2014; Tardat et al., 2010), DNA repair, and chromatin compaction (Jørgensen et al., 2013; Nakamura et al., 2019; Shoaib et al., 2018), suggesting that the different functions are context-dependent and executed with different players. However, it is important to discriminate between H4K20me2, which is the most abundant H4K20 methylation state, and H4K20me3, which is more restricted (Jørgensen et al., 2013). We previously demonstrated that H4K20me3 provides a platform to enhance licensing in late-replicating heterochromatin (Brustel et al., 2017). We functionally link replication licensing to H4K20me3 in a specific subset of late-replicating domains as we detect both elevated ORC and MCM levels when selecting for H4K20me3-enriched non-genic ASs and NRRs (Figures 5f and 6b). Whether H4K20me3 and/or additional chromatin modifications may also promote the origin-firing step remains to be investigated.

In higher eukaryotes, it has been proposed that RT could simply result from the spatial distribution of potential origins upon S phase entry. The latter distribution has been derived from ORC (Dellino et al., 2013; Miotto et al., 2016) or MCM-DH (Das et al., 2015; Hyrien, 2016) abundance, as well as from epigenetic mark profiles Gindin et al., 2014. For example, Miotto et al., 2016 performed computational simulations where stochastic initiations at experimentally mapped ORC binding sites allow to reproduce human RT profiles. Our data also indicate a convincing correlation of ORC density with RT. However, we observed a weaker correlation of MCM-DH density with RT, and a lack of correlation with RFD slope, suggesting that origin-firing probability, and therefore RT, is not solely regulated by MCM-DH density (Figure 4b, Figure 4—figure supplement 1e, Figure 5a). The resolution of RT profiles is much less than RFD profiles, and it is not clear if models that predict RT would still correctly predict IZs. In fact, it was recently observed that all chromatin marks associated to open chromatin allowed very good predictions of RT profiles Gindin et al., 2014. Since open chromatin independently facilitates ORC binding in G1 phase and access of firing factors to MCM-DHs in S phase, open chromatin marks and ORC density may both predict RT without implying a direct causal link between RT and ORC binding. Probably, only the location of MCM-DHs associated with appropriate open chromatin marks to recruit firing factors is causative of RT.

The spatio-temporal replication program can change during cellular differentiation (Marchal et al., 2019). Comparison with chromatin conformation capture (Hi-C) data has shown that early and late RTDs correspond to the more and less accessible compartments of the genome, respectively (Ryba et al., 2010). Recently, Sima et al. used the CRISPR-Cas9 technology to identify three separate, cis-acting elements that together control the early replication time of the pluripotency-associated Dppa2/4 domain in mouse embryonic stem cells (mESCs) Sima et al., 2019. Strikingly, these early replication control elements (ERCEs) are enriched in CTCF-independent Hi-C interactions and active epigenetic marks (DNase1 HS, p300, H3K27ac, H3K4me1, H3K4me3) previously observed at OK-seq IZs (Petryk et al., 2018; Petryk et al., 2016). By mining mESC OK-seq data (Petryk et al., 2018), we found that the three ERCEs of the Dppa2/4 domain indeed fall within IZs (Figure 7—figure supplement 1a). Furthermore, the aggregate 1835 ERCEs predicted genome-wide by Sima et al., from epigenetic profiles of mESCs, shows a significant, positive RFD shift indicative of efficient replication initiation (Figure 7—figure supplement 1b). This finding is confirmed in proliferating PHA-stimulated primary splenic B cells (Figure 7—figure supplement 1c), attesting to the general validity of these observations. Since our data suggest that a higher ORC/MCM density is not a distinguishing feature of IZs from the rest of the genome, IZ specification cannot solely occur at the origin-licensing step. Open and dynamic chromatin structures found at Ma-IZs and ERCEs (Petryk et al., 2016; Sima et al., 2019) might not only facilitate origin licensing in G1 but also promote chromatin binding of limiting firing factors during S phase (Boos and Ferreira, 2019; Krude et al., 1997; Kumagai and Dunphy, 2020).

Conclusion

Our mapping of ORC and MCM complexes shows that in human cells most of the genome, except transcribed genes, is licensed for replication during the G1 phase of the cell cycle. ORC/MCM are more enriched in early than in late RTDs (Figure 7a) but only a fraction of MCM-DHs is selected for initiation during S phase. Open chromatin marks define efficient Ma-IZs, often but not always circumscribed by active genes (Figure 7b). Such marks may favor origin licensing in G1 but also binding of origin firing factors in S phase. In addition, H4K20me3 facilitates origin licensing in late-replicating regions (Figure 7c). Once forks emanate from Ma-IZs within an RTD, the omnipresence of MCM-DHs allows a cascade of replication initiation to take place dispersively between IZs (Figure 7b). The identification of ERCEs supports the hypothesis that open chromatin facilitates early origin activation. The links between chromatin structure and transcription, on the one hand, and origin licensing and activation, on the other hand, facilitate the timely activation of appropriate origins during programmed development.

Figure 7. — (a) Replication is organized in large segments of constant replication timing (early replication timing domain [RTD], dark grey; late RTD, light grey) (Marchal et al., 2019). While we observe the ubiquitous presence of the origin recognition complex (ORC; orange) and the minichromosome maintenance complex (MCM; blue) throughout the genome, the enrichment levels of ORC/MCM were higher in early RTDs compared to late RTDs. (b) Early RTDs are among other characterized by active transcription. ORC/MCM are locally highly enriched at active transcription start site (TSS). However, actively transcribed gene bodies (black) are deprived of ORC/MCM, often correlating with replication termination (blue). Besides TSSs, we find ORC/MCM stochastically distributed along intergenic regions. We hypothesize that traveling replication forks trigger activation of replication in a cascade (red arrows). (c) In gene-deprived and transcriptionally silent late-replicating heterochromatin, we detected homogeneous ORC/MCM distribution at generally lower levels. H4K20me3 is present at late-replicating non-genic ascending segments (ASs) and null RFD regions (NRRs) and leads to enhanced ORC/MCM binding, linking this histone mark to replication activation in heterochromatin.

Figure 7—figure supplement 1. — (a) Replication is organized in large segments of constant replication timing (early replication timing domain [RTD], dark grey; late RTD, light grey) (Marchal et al., 2019). While we observe the ubiquitous presence of the origin recognition complex (ORC; orange) and the minichromosome maintenance complex (MCM; blue) throughout the genome, the enrichment levels of ORC/MCM were higher in early RTDs compared to late RTDs. (b) Early RTDs are among other characterized by active transcription. ORC/MCM are locally highly enriched at active transcription start site (TSS). However, actively transcribed gene bodies (black) are deprived of ORC/MCM, often correlating with replication termination (blue). Besides TSSs, we find ORC/MCM stochastically distributed along intergenic regions. We hypothesize that traveling replication forks trigger activation of replication in a cascade (red arrows). (c) In gene-deprived and transcriptionally silent late-replicating heterochromatin, we detected homogeneous ORC/MCM distribution at generally lower levels. H4K20me3 is present at late-replicating non-genic ascending segments (ASs) and null RFD regions (NRRs) and leads to enhanced ORC/MCM binding, linking this histone mark to replication activation in heterochromatin.

Materials and methods

Key resources table.

Reagent type (species) or resource	Designation	Source or reference	Identifiers	Additional information
Cell line (Homo sapiens)	Raji (lymphoblast)	ATCC/DZMZ	ATCC: CCL-86 DZMZ: ACC 319 RRID:CVCL_0511	B-lymphocyte; Burkitt’slymphoma Received from:https://www.dsmz.de/collection/catalogue/details/culture/AC C-319 Tested mycoplasma negative
Antibody	Cyclin A1/A2 (rabbit monoclonal)	Abcam	ab185619	WB: 1:1000
Antibody	Cyclin B1 (mouse monoclonal)	Abcam	ab72, RRID:AB_305751	WB: 1:1000
Antibody	H3S10P (rabbit monoclonal)	Cell Signaling	Clone D2C8, cat. no. 3377, RRID:AB_1549592	WB: 1:1000
Antibody	GAPDH (rat monoclonal)	This paper	Clone GAPDH3 10F4; Rat IgG2c	WB: 1:50
Antibody	H4K20me1 (mouse monoclonal)	Diagenode	MAb-147-100	ChIP: 2.5 µg
Antibody	H4K20me3 (rabbit polyclonal)	Diagenode	pAb-057-050, RRID:AB_2617145	ChIP: 2.5 µg
Antibody	Rabbit-IgG (polyclonal)	Sigma	R2004, RRID:AB_261311	ChIP: 10 µg
Antibody	Orc2 (rabbit polyclonal)	Ritzi et al., 2003	SA93	Whole serum; ChIP: 15 µl
Antibody	Orc3 (rabbit polyclonal)	Ritzi et al., 2003	SA7976	Whole serum; ChIP: 15 µl
Antibody	Mcm3 (rabbit polyclonal)	Ritzi et al., 2003	SA8413	Whole serum; ChIP: 15 µl
Antibody	Mcm7 (rabbit polyclonal)	Ritzi et al., 2003	SA8496	Whole serum; ChIP: 15 µl
Peptide, recombinant protein	Protein A Sepharose 4 Fast Flow	GE Healthcare	GE17-5280-11
Peptide, recombinant protein	Protein G Sepharose 4 Fast Flow	GE Healthcare	GE17-0618-06
Sequence-based reagent	oriP DS_fw	This paper	qPCR primers	5′-AGTTCACTGCCCGCTCCT-3′
Sequence-based reagent	oriP DS_rv	This paper	qPCR primers	5′-CAGGATTCCACGAGGGTAGT-3′
Sequence-based reagent	H4K20me1positive_fw	Eric Julien, personal communication	qPCR primers	5′-ATGCCTTCTTGCCTCTTGTC-3′
Sequence-based reagent	H4K20me1positive_rv	Eric Julien, personal communication	qPCR primers	5′-AGTTAAAAGCAGCCCTGGTG-3′
Sequence-based reagent	H4K20me3positive_fw	Eric Julien, personal communication	qPCR primers	5′-TCTGAGCAGGGTTGCAAGTAC-3′
Sequence-based reagent	H4K20me3positive_rv	Eric Julien, personal communication	qPCR primers	5′-AAGGAAATGATGCCCAGCTG-3′
Chemical compound, drug	Formaldehyde	Thermo Scientific	Prod# 28908	16% formaldehyde solution; methanol-free
Chemical compound, drug	Proteinase K	Roche	Cat. no. 03 115 852 001	1 mg/ml; 8 mg
Chemical compound, drug	RNase	Roche	Cat. no. 11 119 915 001	0.5 µg/µl; 2 µg
Commercial assay or kit	NucleoSpin Extract II Kit	Macherey-Nagel	Cat. no. 740609.50
Commercial assay or kit	Accel-NGS 1S Plus DNA Library Kit for Illumina	Swift Biosciences	Cat. no. 10096
Commercial assay or kit	Direct-zol^TM RNA MiniPrep kit	Zymo Research	Cat. no. R2051
Commercial assay or kit	Encore Complete RNA-Seq Library Systems kit	NuGEN	Cat. no. 0333-32
Software, algorithm	Tophat2	Kim et al., 2013	RRID:SCR_013035
Software, algorithm	HTSeq-count	Anders et al., 2015	RRID:SCR_011867
Software, algorithm	BWA (v.0.7.4)	Li and Durbin, 2009	RRID:SCR_010910	OK-seq mapping, default parameters
Software, algorithm	bowtie (v.1.1.1)	Langmead et al., 2009	RRID:SCR_005476	ChIP-seq mapping, bowtie -m one index file.fastq
Software, algorithm	deepTools (v.3.3.1)	Ramírez et al., 2016	RRID:SCR_016366
Software, algorithm	MACS2 (v.2.2.5)	Zhang et al., 2008	RRID:SCR_013291	Default settings, `--broad`
Software, algorithm	R (v.3.2.3)	R Development Core Team, 2018	RRID:SCR_001905
Software, algorithm	dplyr (v.0.8.5)	Wickham et al., 2020	RRID:SCR_016708	R package
Software, algorithm	ggplot2 (v.3.1.0)	Wickham, 2016	RRID:SCR_014601	R package
Software, algorithm	gplots (v.3.0.3)	Warnes et al., 2020		R package
Software, algorithm	Python 2.7 and Phyton 3	van Rossum, 1995	RRID:SCR_008394
Software, algorithm	numpy (v.1.18.5)	Harris et al., 2020	RRID:SCR_008633	Python library
Software, algorithm	matplotlib (v.3.2.3)	Hunter, 2007	RRID:SCR_008624	Python library
Software, algorithm	SciPy (v.1.5.0)	Virtanen et al., 2020	RRID:SCR_008058	Python library

Open in a new tab

Cell culture

Raji cells (ATCC: CCL-86; DZMZ: ACC 319) were directly obtained from DZMS and tested mycoplasma negative. Raji cells were cultured at 37°C and 5% CO₂ in RPMI 1640 (Gibco, Thermo Fisher, USA) supplemented with 8% FCS (Lot BS225160.5, Bio and SELL, Germany), 100 Units/ml penicillin, 100 µg/ml streptomycin (Gibco, Thermo Fisher, USA), 1× MEM non-essential amino acids (Gibco, Thermo Fisher, USA), 2 mM L-glutamine (Gibco, Thermo Fisher, USA), and 1 mM sodium pyruvate (Gibco, Thermo Fisher, USA).

RNA extraction, sequencing, and TPM calculation

RNA-seq was performed in three independent replicates. RNA was extracted from 3 × 10⁵ Raji cells using Direct-zol^TM RNA MiniPrep kit (Zymo Research) according to manufacturer’s instructions. RNA quality was confirmed by Bioanalyzer RNA integrity numbers between 9.8 and 10 followed by library preparation (Encore Complete RNA-Seq Library Systems kit [NuGEN]). Single-end 100 bp sequencing was performed by Illumina HiSeq 1500 to a sequencing depth of 25 million reads. The reads were mapped to hg19 genome using Tophat2 and assigned to annotated genes (HTSeq-count) (Anders et al., 2015; Kim et al., 2013). TPM values were calculated for each sample ( $T P M_{j} = 10^{6} \frac{n_{j}}{l_{j}} / \sum_{i} \frac{n_{i}}{l_{i}}$ , where n_i is the number of reads that map to gene i whose total exon length expressed in kb is l_i) as previously described (Wagner et al., 2012).

Replication fork directionality profiling using OK-seq method in Raji

Raji OK-seq was recently published and is available from the European Nucleotide Archive under accession number PRJEB25180 (see Data access section) (Wu et al., 2018). Reads > 10 nt were aligned to the human reference genome (hg19) using the BWA (version 0.7.4) software with default parameters (Li and Durbin, 2009). We considered uniquely mapped reads only and counted identical alignments (same site and strand) as 1 to remove PCR duplicate reads. Five replicates were sequenced providing a total number of 193.1 million filtered reads (between 19.1 and 114.1 million reads per replicate). RFD was computed as $R F D = \frac{(R - F)}{(R + F)}$ , where ‘R’ (resp. ‘F’) is the number of reads mapped to the reverse (resp. forward) strand of the considered regions. RFD profiles from replicates were highly correlated, with Pearson correlation computed in 50 kb non-overlapping windows with >100 mapped reads (R + F) ranging from 0.962 to 0.993. Reads from the five replicate experiments were pooled together for further analyses.

Determining regions of ascending, descending, and constant RFD

RFD profiling of two human cell lines revealed that replication primarily initiates stochastically within broad (up to 150 kb) zones and terminates dispersedly between them (Petryk et al., 2016). These IZs correspond to quasi-linear ASs) of varying size and slope within the RFD profiles. As previously described for mean RT profiles analysis (Audit et al., 2013; Baker et al., 2012), we determined the smoothed RFD profile convexity from the convolution with the second derivative of the Gaussian function of standard deviation 32 kb. In total, 4891 ASs were delineated as the regions between positive and negative convexity extrema of large amplitude. The amplitude threshold was set in a conservative manner in order to mainly detect the most prominent IZs as described and to avoid false positives Petryk et al., 2016. Descending segments (DSs) were detected symmetrically to ASs as regions between negative and positive convexity extrema using the same threshold. Noting pos_5’ and pos_3’ the location of the start and end position of an AS or DS segment, each segment was associated to its size pos_3’-pos_5’ and the RFD shift across its length: ΔRFD = RFD (pos_3’) – RFD (pos_5’). DS segments were less numerous (2477 versus 4891) and on average larger (126 kb versus 38.8 kb) than AS segments, as expected, and presented a smaller average RFD shift (|ΔRFD| = 0.69 versus 0.83).

Initial RFD profiling in human also revealed regions of unidirectional fork progression and regions of null RFD where replication is bidirectional. URRs were delineated as regions where |ΔRFD| > 0.8 homogeneously over at least 300 kb (401 regions of mean length 442 kb covering 177 Mb). NRRs were delineated as regions where |ΔRFD| < 0.15 homogeneously over at least 500 kb (127 regions of mean length 862 kb covering 110 Mb). Thresholds were set in a conservative manner to avoid false positive, particularly not to confuse RFD zero-crossing segments with NRR.

Centrifugal elutriation and flow cytometry

For centrifugal elutriation, 5 × 10⁹ exponentially growing Raji cells were harvested, washed with PBS, and resuspended in 50 ml RPMI 1680, 8% FCS, 1 mM EDTA, 0.25 U/ml DNaseI (Roche, Germany). Concentrated cell suspension was passed through 40 µm cell strainer and injected in a Beckman JE-5.0 rotor with a large separation chamber turning at 1500 rpm and a flow rate of 30 ml/min controlled by a Cole-Parmer Masterflex pump. While rotor speed was kept constant, 400 ml fractions were collected at increasing flow rates (40, 45, 50, 60, and 80 ml/min). Individual fractions were quantified, 5 × 10⁶ cells washed in PBS, ethanol fixed, RNase treated and stained with 0.5 mg/ml Propidium Iodide. DNA content was measured using the FL2 channel of FACSCalibur (BD Biosciences, Germany). Remaining cells were subjected to chromatin cross-linking.

Generation of GAPDH monoclonal antibody

Rat monoclonal antibody GAPDH3 10F4 were generated by immunization with a peptide comprising amino acids RLEKPAKYDDIKKVVK of human GAPDH (aa246-263) coupled to OVA. Animals were injected subcutaneously and intraperitoneally with a mixture of 50 μg peptide, 5 nmol CpG (Tib Molbiol, Berlin, Germany), and an equal volume of incomplete Freund’s adjuvant. Six weeks later a booster injection was performed without Freund’s adjuvant. Three days later, spleen cells were fused with P3X63Ag8.653 myeloma cells using standard procedures. Hybridoma supernatants were screened in a solid-phase enzyme-linked immunosorbent assay for binding to GAPDH antigen. Positive supernatants were further assayed for western blotting. Hybridoma cells were subcloned twice by limiting dilution to obtain the monoclonal cell line stably producing antibody GAPDH3 10F4 (rat IgG2c).

Chromatin cross-linking with formaldehyde

Raji cells were washed twice with PBS, resuspended in PBS to a concentration of 2 × 10⁷ cells/ml, and passed through 100 µm cell strainer (Corning Inc, USA). Fixation for 5 min at room temperature was performed by adding an equal volume of PBS 2% methanol-free formaldehyde (Thermo Scientific, USA, final concentration: 1% formaldehyde) and stopped by the addition of glycine (125 mM final concentration). After washing once with PBS and once with PBS 0.5% NP-40, cells were resuspended in PBS containing 10% glycerol, pelleted, and snap frozen in liquid nitrogen.

Cyclin western blot

Cross-linked samples were thawed on ice, then resuspended in LB3+ sonication buffer (see below) containing protease inhibitor and 10 mM MG132. After sonicating 3 × 5 min (30 s on, 30 s off) using Bioruptor in the presence of 212–300 µm glass beads, samples were treated with 50 U Benzonase for 15 min at room temperature and centrifuged 15 min at maximum speed. Also, 50 µg protein lysates were loaded on 10% SDS-polyacrylamide gel (cyclin A1/A2, cyclin B1), or 12.5–15% gradient gel (H3S10P). Cyclin A1/A2 (Abcam, ab185619), cyclin B1 (Abcam, ab72), and H3S10P (Cell Signaling, D2C8) antibodies were used in 1:1000 dilutions, and GAPDH (clone GAPDH3 10F4, rat IgG2c; this study) was diluted 1:50. HRP-coupled secondary antibodies were used in 1:10,000 dilutions. Detection was done using ECL on CEA Blue Sensitive X-ray films.

Chromatin sonication

Cross-linked cell pellets were thawed on ice, then resuspended in LB3(+) buffer (25 mM HEPES [pH 7.5], 140 mM NaCl, 1 mM EDTA, 0.5 mM EGTA, 0.5% sarkosyl, 0.1% DOC, 0.5% Triton-X-100, 1× protease inhibitor complete [Roche, Germany]) to a final concentration of 2 × 10⁷ cells/ml. Sonication was performed in AFA Fiber and Cap tubes (12 × 12 mm, Covaris, Great Britain) at an average temperature of 5°C at 100 W, 150 cycles/burst, 10% duty cycle, 20 min (S-G2-M fraction: 17 min) using the Covaris S220 (Covaris Inc, UK), resulting in DNA fragments of 100–300 bp on average.

Chromatin immunoprecipitation and qPCR quality control

Sheared chromatin was pre-cleared with 50 µl protein A Sepharose 4 Fast Flow beads (GE Healthcare, Germany) per 500 µg chromatin for 2 hr. Then, 500 µg chromatin (or 250 µg for histone methylation) were incubated with rabbit anti-Orc2, anti-Orc3, anti-Mcm3, anti-Mcm7 (Papior et al., 2012), mouse anti-H4K20me1 (Diagenode, MAb-147-100), rabbit anti-H4K20me3 (Diagenode, pAb-057-050), or IgG isotype controls for 16 hr at 4°C. BSA-blocked protein A beads (0.5 mg/ml BSA, 30 µg/ml salmon sperm, 1× protease inhibitor complete, 0.1% Triton-X-100 in LB3(-) buffer [without detergents]) were added (50 µl/500 µg chromatin) and incubated for at least 4 hr on an orbital shaker at 4°C. Sequential washing steps with RIPA-150mM NaCl (0.1% SDS, 0.5% DOC, 1% NP-40, 50 mM Tris [pH 8.0], 1 mM EDTA), RIPA-300 mM NaCl, RIPA-250 mM LiCl buffer, and twice in TE (pH 8.0) buffer were performed. Immunoprecipitated chromatin fragments were eluted from the beads by shaking twice at 1200 rpm for 10 min at 65°C in 100 µl TE 1% SDS. The elution was treated with 80 µg RNAse A for 2 hr at 37°C and with 8 µg proteinase K at 65°C for 16 hr. DNA was purified using the NucleoSpin Extract II Kit. Quantitative PCR analysis of the EBV oriP Dyad Symmetry element (for pre-RC ChIP) or H4K20me1 and -me3 positive loci were performed using the SYBR Green I Master Mix (Roche) and the Roche LightCycler 480 System. Oligo sequences for qPCR were DS_fw: AGTTCACTGCCCGCTCCT, DS_rv: CAGGATTCCACGAGGGTAGT, H4K20me1positive_fw: ATGCCTTCTTGCCTCTTGTC, H4K20me1positive_rv: AGTTAAAAGCAGCCCTGGTG, H4K20me3positive_fw: TCTGAGCAGGGTTGCAAGTAC, H4K20me3positive_rv: AAGGAAATGATGCCCAGCTG. Chromatin fragment sizes were verified by loading 1–2 µg chromatin on a 1.5% agarose gel. Samples were quantified using Qubit HS dsDNA assay.

ChIP-sample sequencing

ChIP sample library preparations from >4 ng of ChIP-DNA was performed using Accel-NGS 1S Plus DNA Library Kit for Illumina (Swift Biosciences). A 50 bp single-end sequencing was done with the Illumina HiSEQ 1500 sequencer to a sequencing depth of ~70 million reads. Fastq-files were mapped against the human genome (hg19, GRCh37, version 2009), extended for the EBV genome (NC007605) using bowtie (v1.1.1) (Langmead et al., 2009). Sequencing profiles were generated using deepTools’ bamCoverage function using reads extension to 200 bp and reads per genomic content normalization (Ramírez et al., 2016). Visualization was performed in UCSC Genome Browser (http://genome.ucsc.edu).

For H4K20me1 and -me3 ChIP-seq data, MACS2 peak-calling (Zhang et al., 2008) was performed using the broad setting and overlapping peaks in three replicates were retained for further analyses.

Binning approach and normalization

All data processing and analysis steps were performed in R (v.3.2.3) and numpy (v.1.18.5) python library, and visualizations were done using the ggplot2 (v3.1.0) package (R Development Core Team, 2018) and matplotlib (v.3.2.3) python library. The numbers of reads were calculated in non-overlapping 1 or 10 kb bins and saved in bed files for further analysis. To combine replicates, their sum per bin was calculated (=read frequency). To adjust for sequencing depth, the mean frequency per bin was calculated for the whole sequenced genome and all bins’ counts were divided by this mean value, resulting in the normalized read frequency. To account for variations in the input sample, we additionally removed bins without reads in the input from all samples and divided by the normalized read frequency of the input, resulting in the relative read frequency. When aggregating different loci, input normalization was performed after averaging. This resulted in relative read frequency ranging from 0 to ~30. Pairwise Pearson correlations of ORC/MCM samples were clustered by hierarchical clustering using complete linkage clustering.

Relation of ChIP relative read frequencies to Orc2 (K562) and DNase hypersensitivity

Orc2 ChIP-seq data in asynchronously cycling K562 cells was retrieved from GSE70165 (Miotto et al., 2016). Peak calling using default MACS2 settings resulted in 16,767 detected peaks overlapping from two replicates.

The ENCODE ‘DNase clusters’ track wgEncodeRegDnaseClusteredV3.bed.gz (December 3, 2017) containing DNase hypersensitive sites from 125 cell lines were retrieved from Thurman et al., 2012. Bins overlapping or not with HS sites larger than 1 kb were defined and the respective ChIP read frequency assigned for comparison.

Comparison of ChIP relative read frequencies to replication data

ASs were aligned on their left (5') and right (3') borders. Mean and standard error of the mean (SEM) of relative read frequencies of aligned 1 kb bins were then computed to assess the average ChIP signal around the considered AS borders 50 kb away from the AS to 10 kb within the AS as this was sufficient to visualize the full increase of ORC/MCM coverage when entering ASs – the ORC/MCM relative read frequency plateaus inside ASs in Figure 2b-d are clearly seen. To make sure bins within the ASs were closer to the considered AS border than to the opposite border, only ASs of size >20 kb were used (3247/4891). We also limited this analysis to ASs corresponding to efficient IZs by requiring ΔRFD > 0.5, filtering out a further 290 lowly efficient ASs, leaving 2957 ASs for the analyses (Table 1).

In order to interrogate the relationship between ASs and transcription, we compared the results obtained for different AS groups: 506 ASs were classified as non-genic AS when the AS locus extended 20 kb at both ends did not overlap any annotated gene; the remaining 2451 ASs were classified as genic ASs. From the latter group, 673 ASs were classified as type 1 ASs when both AS borders were flanked by at least one actively transcribed gene (distance of both AS borders to the closest transcribed [TPM >3] gene body was <20 kb), and 1026 ASs were classified as type 2 ASs when only one AS border was associated to a transcribed gene (Table 1).

In order to assess the role of H4H20me3 mark on AS specification, we also classified non-genic ASs depending on their input-normed H4K20me3 relative read frequency. We grouped the non-genic ASs where the H4K20me3 relative read frequency was above the genome mean value by more than 1.5 standard deviation (estimated over the whole genome) and the non-genic ASs where the H4K20me3 relative read frequency was below the genome mean value. This resulted in 154 non-genic ASs with H4K20me3 signal significantly higher than genome average and 242 non-genic ASs with H4K20me3 signal lower than genome average.

A similar selection was performed on fully intergenic 10 kb windows within NRRs (as done above using the mean and standard deviation of H4K20me3 relative read frequency estimated on all fully intergenic 10 kb windows). This resulted in 504 and 3986 windows with high and low H4K20me3 signal, respectively.

Comparison of ChIP relative read frequencies to transcription data

Gene-containing bins were determined and overlapping genes removed from the analysis. For cumulative analysis, we only worked with genes larger 30 kb and assigned the gene expression levels in TPM accordingly. Genes were either aligned at their TSS or their TTS, and the corresponding average ChIP read frequency windows were calculated in a 30 kb window centered on the alignment site.

Comparison of ChIP relative read frequencies to RT

For identification of RTDs in Raji cells, we used the early- to late-RT ratio determined by Repli-seq (Sima et al., 2018). We directly worked from the pre-computed early to late log ratio from supplementary file GSE102522_Raji_log2_hg19.txt downloaded from GEO (accession number GSE102522). The timing of every non-overlapping 10 kb bin was calculated as the averaged log₂(Early/Late) ratio within the surrounding 100 kb window. Early RTDs were defined as regions where the average log ratio >1.6 and late RTDs as regions where the average log ratio <−2.0. These thresholds resulted in 1648 early RTDs, ranging from 10 to 8940 kb in size, with a mean size of 591 kb, while we detected 2046 late RTDs in sizes from 10 to 8860 kb, averaging at 470 kb. These RTDs were used to classify ChIP read relative frequencies calculated in 10 kb bins as early or late RT. Bins overlapping any gene extended by 10 kb on both sides were removed from the analysis to avoid effects of gene activity on ChIP signals.

Comparison of ChIP relative read frequencies distributions at different RT depending on transcriptional and replicative status

All non-overlapping 10 kb windows were classified as intergenic if closest genes were more than 5 kb away, as belonging to a silent (resp. expressed) gene body if the window was inside a gene with TPM <3 (resp. TPM >3) and at more than 3 kb of gene borders, otherwise windows were disregarded. This made sure that specific ChIP signal at gene TSS and TTS was not considered in the analysis. Using the three window categories, we computed the 2D histograms of ChIP relative read frequencies versus RT in intergenic, silent and expressed gene bodies. We used 10 timing bins corresponding to the deciles of the whole genome timing distribution. For each timing bin, the histogram counts were normalized so as to obtain an estimate of the probability distribution function of the ChIP signal at the considered RT. The analysis was reproduced after restricting for windows fully in (i) AS segments (size >20 kb, ΔRFD > 0.5), (ii) DS segments (size >20 kb, ΔRFD < −0.5), (iii) URRs, and (iv) NRRs.

Statistics

Statistical analyses were performed in R using one-sided t-test with Welch correction and 95% confidence interval or one-way ANOVA followed by Tukey’s multiple comparisons of means with 95% family-wise confidence level, if appropriate. Comparison between ChIP signal distribution observed in two situations was performed computing the two-sample Kolmogorov–Smirnov statistics D_KS using SciPy (v.1.5.0) statistical library and correcting for sample sizes by reporting $Z_{K S} = D_{K S} \sqrt{\frac{n m}{n + m}}$ , where n and m are the sizes of the two samples, respectively.

ERCE RFD profiles

The positions of the three genetically identified ERCEs in the mESC Dppa2/4 locus and of the 1835 predicted mESC ERCEs were downloaded from Sima et al., 2019. The mESC OK-seq data were downloaded from Petryk et al., 2018 (SRR7535256) and mapped to mm10 genome (Petryk et al., 2018). OK-seq data from cycling mouse B cells were downloaded from Tubbs et al., 2018 (GSE116319). The RFD profile was computed as in Hennion et al., 2020 with 10 kb binning steps. Predicted ERCE shuffling was performed using a homemade function keeping the number of ERCE constant for each chromosome and avoiding unmapped genome sequences (genome regions with >20 consecutive Ns). Aggregated average RFD profiles were centered on the ERCE. The profile's envelopes represent the 95% confidence interval based on the mean and standard deviation at each position.

Acknowledgements

We thank Tobias Straub for initial help with bioinformatical analyses, Torsten Krude for critical comments on the manuscript, and Hadi Kabalane for help with Raji RFD data.

AS was supported by the Deutsche Forschungsgemeinschaft (SFB 1064 TP05), SPP1230, and the HELENA graduate school of the Helmholtz Zentrum München. BA and OH were supported by the Agence Nationale de la Recherche (ANR-15-CE12-0011, ANR-18-CE45-0002, ANR-19-CE12-0028) and the Fondation pour la Recherche Médicale (FRM DEI201512344404), and the Cancéropôle Ile-de-France and the INCa (PL-BIO16-302). OH was supported by the Ligue Nationale Contre le Cancer (Comité de Paris; RS19/75-75), the Association pour la Recherche sur le Cancer (PJA 20171206387), and the program ‘Investissements d'Avenir’ launched by the French Government and implemented by the ANR (ANR-10-IDEX-0001-02 PSL*Research University). WH was supported by the Deutsche Forschungsgemeinschaft (SFB1064/TP A13, SFB-TR36/TP A04), Deutsche Krebshilfe (grant number 70112875), and National Cancer Institute (grant number CA70723). Publication costs were covered by the SFB 1064.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Olivier Hyrien, Email: hyrien@biologie.ens.fr.

Benjamin Audit, Email: benjamin.audit@ens-lyon.fr.

Aloys Schepers, Email: schepers@helmholtz-muenchen.de.

Bruce Stillman, Cold Spring Harbor Laboratory, United States.

Kevin Struhl, Harvard Medical School, United States.

Funding Information

This paper was supported by the following grants:

Helmholtz Zentrum München (GmbH) to Nina Kirstein, Alexander Buschle, Wolfgang Hammerschmidt, Aloys Schepers.
Deutsche Forschungsgemeinschaft SFB 1064/TP05 to Aloys Schepers.
Agence Nationale de la Recherche ANR-15-CE12-0011 to Olivier Hyrien, Benjamin Audit.
Fondation pour la Recherche Médicale FRM DEI201512344404 to Olivier Hyrien, Benjamin Audit.
Cancéropôle Île-de-France PL-BIO16-302 to Olivier Hyrien, Benjamin Audit.
Ligue Contre le Cancer RS19/75-75 to Olivier Hyrien.
Association pour la Recherche sur le Cancer PJA 20171206387 to Olivier Hyrien.
Deutsche Krebshilfe 70112875 to Wolfgang Hammerschmidt.
National Cancer Institute CA70723 to Wolfgang Hammerschmidt.
Agence Nationale de la Recherche ANR-18-CE45-0002 to Olivier Hyrien, Benjamin Audit.
Agence Nationale de la Recherche ANR-19-CE12-0028 to Olivier Hyrien, Benjamin Audit.
Agence Nationale de la Recherche ANR-10-IDEX-0001-02 to Olivier Hyrien.
Deutsche Forschungsgemeinschaft SFB1064/TP A13 to Wolfgang Hammerschmidt.
Deutsche Forschungsgemeinschaft SFB-TR36/TP A04 to Wolfgang Hammerschmidt.
Deutsche Forschungsgemeinschaft SPP1230 to Aloys Schepers.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Data curation, Formal analysis, Investigation, Visualization, Methodology, Writing - original draft, Writing - review and editing.

Data curation, Formal analysis, Investigation.

Data curation, Investigation.

Data curation.

Resources, Supervision.

Resources.

antibody generation and alidation.

Conceptualization, Supervision, Funding acquisition.

Formal analysis, Visualization.

Conceptualization, Supervision, Funding acquisition, Validation, Writing - original draft, Writing - review and editing.

Conceptualization, Software, Formal analysis, Supervision, Funding acquisition, Validation, Investigation, Writing - original draft, Project administration, Writing - review and editing.

Conceptualization, Formal analysis, Supervision, Writing - original draft, Project administration, Writing - review and editing.

Additional files

Transparent reporting form

elife-62161-transrepform.docx^{(251.3KB, docx)}

Data availability

Sequencing data have been deposited in the European Nucleotide Archive (ENA) and NCBI Gene Expression Omnibus as indicated — ChIP-Seq: PRJEB32855, RNA-seq Raji: PRJEB31867, OK-seq Raji: PRJEB25180, Repli-seq Raji: GSE102522, OK-seq mESC: SRR7535256, OK-seq mouse B-cells: GSE116319. All data generated or analysed during this study are included in the manuscript and supporting files. Source data files have been provided for Figures 1–6, Figure 1–Supplements 1–3, Figure 2–Supplements 1,2; Figure 3–Supplement 1; Figure 4–Supplements 1,2; Figure 5–Supplement 1, and Figure 7–Supplement 1.

The following dataset was generated:

Kirstein N, Buschle A, Wu X, Krebs S, Blum H, Hammerschmidt W, Lacroix L, Hyrien O, Audit B, Schepers A. 2020. Human ORC/MCM density is low in active genes and correlates with replication time but does not delimit initiation zones. European Nucleotide Archive (ENA) PRJEB32855

The following previously published datasets were used:

Buschle A, Mrozek-Gorska P, Krebs S, Blum H, Cernilogar FM, Schotta G, Pich D, Straub T, Hammerschmidt W. 2019. RNA-seq in Raji cells with inducible BZLF1 prior to and after induction of EBV's lytic cycle by doxycycline. European Nucleotide Archive (ENA) PRJEB31867

Wu X, Kabalane H, Kahli M, Petryk N, Laperrousaz B, Jaszczyszyn Y, Drillon G, Nicolini FE, Perot G, Robert A, Fund C, Chibon F, Xia R, Wiels J, Argoul F, Maguer-Satta V, Arneodo A, Audit B, Hyrien O. 2018. Developmental and cancer-associated plasticity of DNA replication preferentially targets GC-poor, lowly expressed and late-replicating regions. European Nucleotide Archive (ENA) PRJEB25180

Sima J, Bartlett DA, Gordon MR, Gilbert DM. 2018. Bacterial artificial chromosomes establish replication timing and sub-nuclear compartment de novo as extra-chromosomal vectors [repli-seq] NCBI Gene Expression Omnibus. GSE102522

Petryk N, Dalby M, Wenger A, Stromme CB, Strandsby A, Andersson R, Groth A. 2018. MCM2 promotes symmetric inheritance of modified histones during DNA replication. European Nucleotide Archive (ENA) SRR7535256

Tubbs A, Sridharan S, van Wietmarschen N, Maman Y, Callen E, Stanlie A, Wu W, Wu X, Day A, Wong N, Yin M, Canela A, Fu H, Redon C, Pruitt SC, Jaszczyszyn Y, Aladjem MI, Aplan PD, Hyrien O, Nussenzweig A. 2018. OK-seq profile from cycling (S) phase untreated B cells. NCBI Gene Expression Omnibus. GSE116319

References

Adams A, Lindahl T, Klein G. Linear association between cellular DNA and Epstein-Barr virus DNA in a human lymphoblastoid cell line. PNAS. 1973;70:2888–2892. doi: 10.1073/pnas.70.10.2888. [DOI] [PMC free article] [PubMed] [Google Scholar]
Akerman I, Kasaai B, Bazarova A, Sang PB, Peiffer I, Artufel M, Derelle R, Smith G, Rodriguez-Martinez M, Romano M, Kinet S, Tino P, Theillet C, Taylor N, Ballester B, Méchali M. A predictable conserved DNA base composition signature defines human core DNA replication origins. Nature Communications. 2020;11:18527. doi: 10.1038/s41467-020-18527-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Anders S, Pyl PT, Huber W. HTSeq--a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
Audit B, Baker A, Chen CL, Rappailles A, Guilbaud G, Julienne H, Goldar A, d'Aubenton-Carafa Y, Hyrien O, Thermes C, Arneodo A. Multiscale analysis of genome-wide replication timing profiles using a wavelet-based signal-processing algorithm. Nature Protocols. 2013;8:98–110. doi: 10.1038/nprot.2012.145. [DOI] [PubMed] [Google Scholar]
Baker A, Audit B, Chen CL, Moindrot B, Leleu A, Guilbaud G, Rappailles A, Vaillant C, Goldar A, Mongelard F, d'Aubenton-Carafa Y, Hyrien O, Thermes C, Arneodo A. Replication fork polarity gradients revealed by megabase-sized U-shaped replication timing domains in human cell lines. PLOS Computational Biology. 2012;8:e1002443. doi: 10.1371/journal.pcbi.1002443. [DOI] [PMC free article] [PubMed] [Google Scholar]
Beck DB, Burton A, Oda H, Ziegler-Birling C, Torres-Padilla ME, Reinberg D. The role of PR-Set7 in replication licensing depends on Suv4-20h. Genes & Development. 2012a;26:2580–2589. doi: 10.1101/gad.195636.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
Beck DB, Oda H, Shen SS, Reinberg D. PR-Set7 and H4K20me1: at the crossroads of genome integrity, cell cycle, chromosome condensation, and transcription. Genes & Development. 2012b;26:325–337. doi: 10.1101/gad.177444.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bell SP, Kaguni JM. Helicase loading at chromosomal origins of replication. Cold Spring Harbor Perspectives in Biology. 2013;5:a010124. doi: 10.1101/cshperspect.a010124. [DOI] [PMC free article] [PubMed] [Google Scholar]
Benetti R, Gonzalo S, Jaco I, Schotta G, Klatt P, Jenuwein T, Blasco MA. Suv4-20h deficiency results in telomere elongation and derepression of telomere recombination. Journal of Cell Biology. 2007;178:925–936. doi: 10.1083/jcb.200703081. [DOI] [PMC free article] [PubMed] [Google Scholar]
Boos D, Ferreira P. Origin firing regulations to control genome replication timing. Genes. 2019;10:199. doi: 10.3390/genes10030199. [DOI] [PMC free article] [PubMed] [Google Scholar]
Boulos RE, Drillon G, Argoul F, Arneodo A, Audit B. Structural organization of human replication timing domains. FEBS Letters. 2015;589:2944–2957. doi: 10.1016/j.febslet.2015.04.015. [DOI] [PubMed] [Google Scholar]
Brustel J, Kirstein N, Izard F, Grimaud C, Prorok P, Cayrou C, Schotta G, Abdelsamie AF, Déjardin J, Méchali M, Baldacci G, Sardet C, Cadoret JC, Schepers A, Julien E. Histone H4K20 tri-methylation at late-firing origins ensures timely heterochromatin replication. The EMBO Journal. 2017;36:2726–2741. doi: 10.15252/embj.201796541. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cayrou C, Ballester B, Peiffer I, Fenouil R, Coulombe P, Andrau JC, van Helden J, Méchali M. The chromatin environment shapes DNA replication origin organization and defines origin classes. Genome Research. 2015;25:1873–1885. doi: 10.1101/gr.192799.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chaudhuri B, Xu H, Todorov I, Dutta A, Yates JL. Human DNA replication initiation factors, ORC and MCM, associate with oriP of Epstein-Barr virus. PNAS. 2001;98:10085–10089. doi: 10.1073/pnas.181347998. [DOI] [PMC free article] [PubMed] [Google Scholar]
Chen YH, Keegan S, Kahli M, Tonzi P, Fenyö D, Huang TT, Smith DJ. Transcription shapes DNA replication initiation and termination in human cells. Nature Structural & Molecular Biology. 2019;26:67–77. doi: 10.1038/s41594-018-0171-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Das SP, Borrman T, Liu VW, Yang SC, Bechhoefer J, Rhind N. Replication timing is regulated by the number of MCMs loaded at origins. Genome Research. 2015;25:1886–1892. doi: 10.1101/gr.195305.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
Dellino GI, Cittaro D, Piccioni R, Luzi L, Banfi S, Segalla S, Cesaroni M, Mendoza-Maldonado R, Giacca M, Pelicci PG. Genome-wide mapping of human DNA-replication origins: levels of transcription at ORC1 sites regulate origin selection and replication timing. Genome Research. 2013;23:1–11. doi: 10.1101/gr.142331.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
Demczuk A, Gauthier MG, Veras I, Kosiyatrakul S, Schildkraut CL, Busslinger M, Bechhoefer J, Norio P. Regulation of DNA replication within the immunoglobulin heavy-chain locus during B cell commitment. PLOS Biology. 2012;10:e1001360. doi: 10.1371/journal.pbio.1001360. [DOI] [PMC free article] [PubMed] [Google Scholar]
Dhar SK, Yoshida K, Machida Y, Khaira P, Chaudhuri B, Wohlschlegel JA, Leffak M, Yates J, Dutta A. Replication from oriP of Epstein-Barr virus requires human ORC and is inhibited by geminin. Cell. 2001;106:287–296. doi: 10.1016/S0092-8674(01)00458-5. [DOI] [PubMed] [Google Scholar]
Douglas ME, Ali FA, Costa A, Diffley JFX. The mechanism of eukaryotic CMG helicase activation. Nature. 2018;555:265–268. doi: 10.1038/nature25787. [DOI] [PMC free article] [PubMed] [Google Scholar]
Evrin C, Clarke P, Zech J, Lurz R, Sun J, Uhle S, Li H, Stillman B, Speck C. A double-hexameric MCM2-7 complex is loaded onto origin DNA during licensing of eukaryotic DNA replication. PNAS. 2009;106:20240–20245. doi: 10.1073/pnas.0911500106. [DOI] [PMC free article] [PubMed] [Google Scholar]
Feng J, Liu T, Qin B, Zhang Y, Liu XS. Identifying ChIP-seq enrichment using MACS. Nature Protocols. 2012;7:1728–1740. doi: 10.1038/nprot.2012.101. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fragkos M, Ganier O, Coulombe P, Méchali M. DNA replication origin activation in space and time. Nature Reviews Molecular Cell Biology. 2015;16:360–374. doi: 10.1038/nrm4002. [DOI] [PubMed] [Google Scholar]
Gerhardt J, Jafar S, Spindler MP, Ott E, Schepers A. Identification of new human origins of DNA replication by an origin-trapping assay. Molecular and Cellular Biology. 2006;26:7731–7746. doi: 10.1128/MCB.01392-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gindin Y, Valenzuela MS, Aladjem MI, Meltzer PS, Bilke S. A chromatin structure-based model accurately predicts DNA replication timing in human cells. Molecular Systems Biology. 2014;10:722. doi: 10.1002/msb.134859. [DOI] [PMC free article] [PubMed] [Google Scholar]
Guilbaud G, Rappailles A, Baker A, Chen CL, Arneodo A, Goldar A, d'Aubenton-Carafa Y, Thermes C, Audit B, Hyrien O. Evidence for sequential and increasing activation of replication origins along replication timing gradients in the human genome. PLOS Computational Biology. 2011;7:e1002322. doi: 10.1371/journal.pcbi.1002322. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hamlin JL, Mesner LD, Dijkwel PA. A winding road to origin discovery. Chromosome Research. 2010;18:45–61. doi: 10.1007/s10577-009-9089-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
Harris CR, Millman KJ, van der Walt SJ, Gommers R, Virtanen P, Cournapeau D, Wieser E, Taylor J, Berg S, Smith NJ, Kern R, Picus M, Hoyer S, van Kerkwijk MH, Brett M, Haldane A, Del Río JF, Wiebe M, Peterson P, Gérard-Marchant P, Sheppard K, Reddy T, Weckesser W, Abbasi H, Gohlke C, Oliphant TE. Array programming with NumPy. Nature. 2020;585:357–362. doi: 10.1038/s41586-020-2649-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hennion M, Arbona JM, Lacroix L, Cruaud C, Theulot B, Tallec BL, Proux F, Wu X, Novikova E, Engelen S, Lemainque A, Audit B, Hyrien O. FORK-seq: replication landscape of the Saccharomyces cerevisiae genome by nanopore sequencing. Genome Biology. 2020;21:125. doi: 10.1186/s13059-020-02013-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hulke ML, Massey DJ, Koren A. Genomic methods for measuring DNA replication dynamics. Chromosome Research. 2020;28:49–67. doi: 10.1007/s10577-019-09624-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hunter JD. Matplotlib: a 2D graphics environment. Computing in Science & Engineering. 2007;9:90–95. doi: 10.1109/MCSE.2007.55. [DOI] [Google Scholar]
Hyrien O, Maric C, Méchali M. Transition in specification of embryonic metazoan DNA replication origins. Science. 1995;270:994–997. doi: 10.1126/science.270.5238.994. [DOI] [PubMed] [Google Scholar]
Hyrien O. How MCM loading and spreading specify eukaryotic DNA replication initiation sites. F1000Research. 2016;5:2063. doi: 10.12688/f1000research.9008.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jin C, Zang C, Wei G, Cui K, Peng W, Zhao K, Felsenfeld G. H3.3/H2A.Z double variant–containing nucleosomes mark 'nucleosome-free regions' of active promoters and other regulatory regions. Nature Genetics. 2009;41:941–945. doi: 10.1038/ng.409. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jørgensen S, Schotta G, Sørensen CS. Histone H4 lysine 20 methylation: key player in epigenetic regulation of genomic integrity. Nucleic Acids Research. 2013;41:2797–2806. doi: 10.1093/nar/gkt012. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kara N, Hossain M, Prasanth SG, Stillman B. Orc1 binding to mitotic chromosomes precedes spatial patterning during G1 phase and assembly of the origin recognition complex in human cells. Journal of Biological Chemistry. 2015;290:12355–12369. doi: 10.1074/jbc.M114.625012. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biology. 2013;14:R36. doi: 10.1186/gb-2013-14-4-r36. [DOI] [PMC free article] [PubMed] [Google Scholar]
Knott SR, Viggiani CJ, Aparicio OM. To promote and protect: coordinating DNA replication and transcription for genome stability. Epigenetics. 2009;4:362–365. doi: 10.4161/epi.4.6.9712. [DOI] [PubMed] [Google Scholar]
Kreitz S, Ritzi M, Baack M, Knippers R. The human origin recognition complex protein 1 dissociates from chromatin during S phase in HeLa cells. Journal of Biological Chemistry. 2001;276:6337–6342. doi: 10.1074/jbc.M009473200. [DOI] [PubMed] [Google Scholar]
Krude T, Jackman M, Pines J, Laskey RA. Cyclin/Cdk-dependent initiation of DNA replication in a human cell-free system. Cell. 1997;88:109–119. doi: 10.1016/S0092-8674(00)81863-2. [DOI] [PubMed] [Google Scholar]
Kumagai A, Dunphy WG. Binding of the Treslin-MTBP complex to specific regions of the human genome promotes the initiation of DNA replication. Cell Reports. 2020;32:108178. doi: 10.1016/j.celrep.2020.108178. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kuo AJ, Song J, Cheung P, Ishibe-Murakami S, Yamazoe S, Chen JK, Patel DJ, Gozani O. The BAH domain of ORC1 links H4K20me2 to DNA replication licensing and Meier-Gorlin syndrome. Nature. 2012;484:115–119. doi: 10.1038/nature10956. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ladenburger EM, Keller C, Knippers R. Identification of a binding region for human origin recognition complex proteins 1 and 2 that coincides with an origin of DNA replication. Molecular and Cellular Biology. 2002;22:1036–1048. doi: 10.1128/MCB.22.4.1036-1048.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lebofsky R, Heilig R, Sonnleitner M, Weissenbach J, Bensimon A. DNA replication origin interference increases the spacing between initiation events in human cells. Molecular Biology of the Cell. 2006;17:5337–5345. doi: 10.1091/mbc.e06-04-0298. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
Long H, Zhang L, Lv M, Wen Z, Zhang W, Chen X, Zhang P, Li T, Chang L, Jin C, Wu G, Wang X, Yang F, Pei J, Chen P, Margueron R, Deng H, Zhu M, Li G. H2A.Z facilitates licensing and activation of early replication origins. Nature. 2020;577:576–581. doi: 10.1038/s41586-019-1877-9. [DOI] [PubMed] [Google Scholar]
MacAlpine HK, Gordân R, Powell SK, Hartemink AJ, MacAlpine DM. Drosophila ORC localizes to open chromatin and marks sites of cohesin complex loading. Genome Research. 2010;20:201–211. doi: 10.1101/gr.097873.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
Macheret M, Halazonetis TD. Intragenic origins due to short G1 phases underlie oncogene-induced DNA replication stress. Nature. 2018;555:112–116. doi: 10.1038/nature25507. [DOI] [PMC free article] [PubMed] [Google Scholar]
Marahrens Y, Stillman B. A yeast chromosomal origin of DNA replication defined by multiple functional elements. Science. 1992;255:817–823. doi: 10.1126/science.1536007. [DOI] [PubMed] [Google Scholar]
Marchal C, Sima J, Gilbert DM. Control of DNA replication timing in the 3D genome. Nature Reviews Molecular Cell Biology. 2019;20:721–737. doi: 10.1038/s41580-019-0162-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
Martin MM, Ryan M, Kim R, Zakas AL, Fu H, Lin CM, Reinhold WC, Davis SR, Bilke S, Liu H, Doroshow JH, Reimers MA, Valenzuela MS, Pommier Y, Meltzer PS, Aladjem MI. Genome-wide depletion of replication initiation events in highly transcribed regions. Genome Research. 2011;21:1822–1832. doi: 10.1101/gr.124644.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
McGuffee SR, Smith DJ, Whitehouse I. Quantitative, genome-wide analysis of eukaryotic replication initiation and termination. Molecular Cell. 2013;50:123–135. doi: 10.1016/j.molcel.2013.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
Méndez J, Zou-Yang XH, Kim SY, Hidaka M, Tansey WP, Stillman B. Human origin recognition complex large subunit is degraded by ubiquitin-mediated proteolysis after initiation of DNA replication. Molecular Cell. 2002;9:481–491. doi: 10.1016/S1097-2765(02)00467-7. [DOI] [PubMed] [Google Scholar]
Mesner LD, Valsakumar V, Cieslik M, Pickin R, Hamlin JL, Bekiranov S. Bubble-seq analysis of the human genome reveals distinct chromatin-mediated mechanisms for regulating early- and late-firing origins. Genome Research. 2013;23:1774–1788. doi: 10.1101/gr.155218.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
Miotto B, Ji Z, Struhl K. Selectivity of ORC binding sites and the relation to replication timing, fragile sites, and deletions in cancers. PNAS. 2016;113:E4810–E4819. doi: 10.1073/pnas.1609060113. [DOI] [PMC free article] [PubMed] [Google Scholar]
Moiseeva TN, Bakkenist CJ. Regulation of the initiation of DNA replication in human cells. DNA Repair. 2018;72:99–106. doi: 10.1016/j.dnarep.2018.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
Nakamura K, Saredi G, Becker JR, Foster BM, Nguyen NV, Beyer TE, Cesa LC, Faull PA, Lukauskas S, Frimurer T, Chapman JR, Bartke T, Groth A. H4K20me0 recognition by BRCA1-BARD1 directs homologous recombination to sister chromatids. Nature Cell Biology. 2019;21:311–318. doi: 10.1038/s41556-019-0282-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Norio P. Visualization of DNA replication on individual Epstein-Barr virus episomes. Science. 2001;294:2361–2364. doi: 10.1126/science.1064603. [DOI] [PubMed] [Google Scholar]
Norio P, Kosiyatrakul S, Yang Q, Guan Z, Brown NM, Thomas S, Riblet R, Schildkraut CL. Progressive activation of DNA replication initiation in large domains of the immunoglobulin heavy chain locus during B cell development. Molecular Cell. 2005;20:575–587. doi: 10.1016/j.molcel.2005.10.029. [DOI] [PubMed] [Google Scholar]
Norio P, Schildkraut CL. Plasticity of DNA replication initiation in Epstein-Barr virus episomes. PLOS Biology. 2004;2:e152. doi: 10.1371/journal.pbio.0020152. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ohta S, Tatsumi Y, Fujita M, Tsurimoto T, Obuse C. The ORC1 cycle in human cells: ii. dynamic changes in the human ORC complex during the cell cycle. The Journal of Biological Chemistry. 2003;278:41535–41540. doi: 10.1074/jbc.M307535200. [DOI] [PubMed] [Google Scholar]
Okuno Y, McNairn AJ, den Elzen N, Pines J, Gilbert DM. Stability, chromatin association and functional activity of mammalian pre-replication complex proteins during the cell cycle. The EMBO Journal. 2001;20:4263–4277. doi: 10.1093/emboj/20.15.4263. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pannetier M, Julien E, Schotta G, Tardat M, Sardet C, Jenuwein T, Feil R. PR-SET7 and SUV4-20H regulate H4 lysine-20 methylation at imprinting control regions in the mouse. EMBO Reports. 2008;9:998–1005. doi: 10.1038/embor.2008.147. [DOI] [PMC free article] [PubMed] [Google Scholar]
Papior P, Arteaga-Salas JM, Günther T, Grundhoff A, Schepers A. Open chromatin structures regulate the efficiencies of pre-RC formation and replication initiation in Epstein-Barr virus. Journal of Cell Biology. 2012;198:509–528. doi: 10.1083/jcb.201109105. [DOI] [PMC free article] [PubMed] [Google Scholar]
Petryk N, Kahli M, d'Aubenton-Carafa Y, Jaszczyszyn Y, Shen Y, Silvain M, Thermes C, Chen CL, Hyrien O. Replication landscape of the human genome. Nature Communications. 2016;7:10208. doi: 10.1038/ncomms10208. [DOI] [PMC free article] [PubMed] [Google Scholar]
Petryk N, Dalby M, Wenger A, Stromme CB, Strandsby A, Andersson R, Groth A. MCM2 promotes symmetric inheritance of modified histones during DNA replication. Science. 2018;361:1389–1392. doi: 10.1126/science.aau0294. [DOI] [PubMed] [Google Scholar]
Picard F, Cadoret JC, Audit B, Arneodo A, Alberti A, Battail C, Duret L, Prioleau MN. The spatiotemporal program of DNA replication is associated with specific combinations of chromatin marks in human cells. PLOS Genetics. 2014;10:e1004282. doi: 10.1371/journal.pgen.1004282. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pope BD, Ryba T, Dileep V, Yue F, Wu W, Denas O, Vera DL, Wang Y, Hansen RS, Canfield TK, Thurman RE, Cheng Y, Gülsoy G, Dennis JH, Snyder MP, Stamatoyannopoulos JA, Taylor J, Hardison RC, Kahveci T, Ren B, Gilbert DM. Topologically associating domains are stable units of replication-timing regulation. Nature. 2014;515:402–405. doi: 10.1038/nature13986. [DOI] [PMC free article] [PubMed] [Google Scholar]
Powell SK, MacAlpine HK, Prinz JA, Li Y, Belsky JA, MacAlpine DM. Dynamic loading and redistribution of the Mcm2-7 helicase complex through the cell cycle. The EMBO Journal. 2015;34:531–543. doi: 10.15252/embj.201488307. [DOI] [PMC free article] [PubMed] [Google Scholar]
Prioleau MN, MacAlpine DM. DNA replication origins-where do we begin? Genes & Development. 2016;30:1683–1697. doi: 10.1101/gad.285114.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
R Development Core Team . Vienna, Austria: R Foundation for Statistical Computing; 2018. https://www.R-project.org [Google Scholar]
Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Research. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
Remus D, Beuron F, Tolun G, Griffith JD, Morris EP, Diffley JF. Concerted loading of Mcm2-7 double hexamers around DNA during DNA replication origin licensing. Cell. 2009;139:719–730. doi: 10.1016/j.cell.2009.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
Remus D, Diffley JFX. Eukaryotic DNA replication control: lock and load, then fire. Current Opinion in Cell Biology. 2009;21:771–777. doi: 10.1016/j.ceb.2009.08.002. [DOI] [PubMed] [Google Scholar]
Rhind N, Gilbert DM. DNA replication timing. Cold Spring Harbor Perspectives in Biology. 2013;5:a010132. doi: 10.1101/cshperspect.a010132. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ritzi M, Tillack K, Gerhardt J, Ott E, Humme S, Kremmer E, Hammerschmidt W, Schepers A. Complex protein-DNA dynamics at the latent origin of DNA replication of Epstein-Barr virus. Journal of Cell Science. 2003;116:3971–3984. doi: 10.1242/jcs.00708. [DOI] [PubMed] [Google Scholar]
Rivera-Mulia JC, Gilbert DM. Replicating large genomes: divide and conquer. Molecular Cell. 2016a;62:756–765. doi: 10.1016/j.molcel.2016.05.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rivera-Mulia JC, Gilbert DM. Replication timing and transcriptional control: beyond cause and effect-part III. Current Opinion in Cell Biology. 2016b;40:168–178. doi: 10.1016/j.ceb.2016.03.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rowles A, Tada S, Blow JJ. Changes in association of the Xenopus origin recognition complex with chromatin on licensing of replication origins. Journal of Cell Science. 1999;112:2011–2018. doi: 10.1242/jcs.112.12.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ryba T, Hiratani I, Lu J, Itoh M, Kulik M, Zhang J, Schulz TC, Robins AJ, Dalton S, Gilbert DM. Evolutionarily conserved replication timing profiles predict long-range chromatin interactions and distinguish closely related cell types. Genome Research. 2010;20:761–770. doi: 10.1101/gr.099655.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sasaki T, Ramanathan S, Okuno Y, Kumagai C, Shaikh SS, Gilbert DM. The chinese hamster dihydrofolate reductase replication origin decision point follows activation of transcription and suppresses initiation of replication within transcription units. Molecular and Cellular Biology. 2006;26:1051–1062. doi: 10.1128/MCB.26.3.1051-1062.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
Schaarschmidt D, Ladenburger EM, Keller C, Knippers R. Human mcm proteins at a replication origin during the G1 to S phase transition. Nucleic Acids Research. 2002;30:4176–4185. doi: 10.1093/nar/gkf532. [DOI] [PMC free article] [PubMed] [Google Scholar]
Schepers A, Ritzi M, Bousset K, Kremmer E, Yates JL, Harwood J, Diffley JF, Hammerschmidt W. Human origin recognition complex binds to the region of the latent origin of DNA replication of Epstein-Barr virus. The EMBO Journal. 2001;20:4588–4602. doi: 10.1093/emboj/20.16.4588. [DOI] [PMC free article] [PubMed] [Google Scholar]
Schwartz YB, Kahn TG, Pirrotta V. Characteristic low density and shear sensitivity of cross-linked chromatin containing polycomb complexes. Molecular and Cellular Biology. 2005;25:432–439. doi: 10.1128/MCB.25.1.432-439.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
Shoaib M, Walter D, Gillespie PJ, Izard F, Fahrenkrog B, Lleres D, Lerdrup M, Johansen JV, Hansen K, Julien E, Blow JJ, Sørensen CS. Histone H4K20 methylation mediated chromatin compaction threshold ensures genome integrity by limiting DNA replication licensing. Nature Communications. 2018;9:3704. doi: 10.1038/s41467-018-06066-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Siddiqui K, Stillman B. ATP-dependent assembly of the human origin recognition complex. Journal of Biological Chemistry. 2007;282:32370–32383. doi: 10.1074/jbc.M705905200. [DOI] [PubMed] [Google Scholar]
Sima J, Bartlett DA, Gordon MR, Gilbert DM. Bacterial artificial chromosomes establish replication timing and sub-nuclear compartment de novo as extra-chromosomal vectors. Nucleic Acids Research. 2018;46:1810–1820. doi: 10.1093/nar/gkx1265. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sima J, Chakraborty A, Dileep V, Michalski M, Klein KN, Holcomb NP, Turner JL, Paulsen MT, Rivera-Mulia JC, Trevilla-Garcia C, Bartlett DA, Zhao PA, Washburn BK, Nora EP, Kraft K, Mundlos S, Bruneau BG, Ljungman M, Fraser P, Ay F, Gilbert DM. Identifying Cis elements for spatiotemporal control of mammalian DNA replication. Cell. 2019;176:816–830. doi: 10.1016/j.cell.2018.11.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
Smith OK, Aladjem MI. Chromatin structure and replication origins: determinants of chromosome replication and nuclear organization. Journal of Molecular Biology. 2014;426:3330–3341. doi: 10.1016/j.jmb.2014.05.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
Smith DJ, Whitehouse I. Intrinsic coupling of lagging-strand synthesis to chromatin assembly. Nature. 2012;483:434–438. doi: 10.1038/nature10895. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sugimoto N, Maehara K, Yoshida K, Ohkawa Y, Fujita M. Genome-wide analysis of the spatiotemporal regulation of firing and dormant replication origins in human cells. Nucleic Acids Research. 2018;46:6683–6696. doi: 10.1093/nar/gky476. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sun J, Fernandez-Cid A, Riera A, Tognetti S, Yuan Z, Stillman B, Speck C, Li H. Structural and mechanistic insights into Mcm2-7 double-hexamer assembly and function. Genes & Development. 2014;28:2291–2303. doi: 10.1101/gad.242313.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tardat M, Brustel J, Kirsh O, Lefevbre C, Callanan M, Sardet C, Julien E. The histone H4 lys 20 methyltransferase PR-Set7 regulates replication origins in mammalian cells. Nature Cell Biology. 2010;12:1086–1093. doi: 10.1038/ncb2113. [DOI] [PubMed] [Google Scholar]
Teytelman L, Ozaydin B, Zill O, Lefrançois P, Snyder M, Rine J, Eisen MB. Impact of chromatin structures on DNA processing for genomic analyses. PLOS ONE. 2009;4:e6700. doi: 10.1371/journal.pone.0006700. [DOI] [PMC free article] [PubMed] [Google Scholar]
Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, Sheffield NC, Stergachis AB, Wang H, Vernot B, Garg K, John S, Sandstrom R, Bates D, Boatman L, Canfield TK, Diegel M, Dunn D, Ebersol AK, Frum T, Giste E, Johnson AK, Johnson EM, Kutyavin T, Lajoie B, Lee BK, Lee K, London D, Lotakis D, Neph S, Neri F, Nguyen ED, Qu H, Reynolds AP, Roach V, Safi A, Sanchez ME, Sanyal A, Shafer A, Simon JM, Song L, Vong S, Weaver M, Yan Y, Zhang Z, Zhang Z, Lenhard B, Tewari M, Dorschner MO, Hansen RS, Navas PA, Stamatoyannopoulos G, Iyer VR, Lieb JD, Sunyaev SR, Akey JM, Sabo PJ, Kaul R, Furey TS, Dekker J, Crawford GE, Stamatoyannopoulos JA. The accessible chromatin landscape of the human genome. Nature. 2012;489:75–82. doi: 10.1038/nature11232. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tubbs A, Sridharan S, van Wietmarschen N, Maman Y, Callen E, Stanlie A, Wu W, Wu X, Day A, Wong N, Yin M, Canela A, Fu H, Redon C, Pruitt SC, Jaszczyszyn Y, Aladjem MI, Aplan PD, Hyrien O, Nussenzweig A. Dual roles of poly(dA:dt) Tracts in replication initiation and fork collapse. Cell. 2018;174:1127–1142. doi: 10.1016/j.cell.2018.07.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
van Rossum G. Phyton Tutorial. Centrum Voor Wiskunde en Informatica, Department of Algorithmics and Architecture 1995
Vermeulen M, Eberl HC, Matarese F, Marks H, Denissov S, Butter F, Lee KK, Olsen JV, Hyman AA, Stunnenberg HG, Mann M. Quantitative interaction proteomics and genome-wide profiling of epigenetic histone marks and their readers. Cell. 2010;142:967–980. doi: 10.1016/j.cell.2010.08.020. [DOI] [PubMed] [Google Scholar]
Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, Burovski E, Peterson P, Weckesser W, Bright J, van der Walt SJ, Brett M, Wilson J, Millman KJ, Mayorov N, Nelson ARJ, Jones E, Kern R, Larson E, Carey CJ, Polat İ, Feng Y, Moore EW, VanderPlas J, Laxalde D, Perktold J, Cimrman R, Henriksen I, Quintero EA, Harris CR, Archibald AM, Ribeiro AH, Pedregosa F, van Mulbregt P, SciPy 1.0 Contributors SciPy 1.0: fundamental algorithms for scientific computing in Python. Nature Methods. 2020;17:261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wagner GP, Kin K, Lynch VJ. Measurement of mRNA abundance using RNA-seq data: rpkm measure is inconsistent among samples. Theory in Biosciences. 2012;131:281–285. doi: 10.1007/s12064-012-0162-3. [DOI] [PubMed] [Google Scholar]
Warnes GR, Bolker B, Bonebakker L, Gentleman R, Huber W, Liaw A, Lumley T. Gplots: Various R Programming Tools for Plotting Data. 2020 https://rdrr.io/cran/gplots/
Wickham H. Ggplot: 2 Elegant Graphics for Data Analysis. Berlin, Germany: Springer; 2016. [Google Scholar]
Wickham H, Francois R, Henry L, Müller K. Dplyr: A Grammar of Data Manipulation. 2020 https://github.com/tidyverse/dplyr
Wu X, Kabalane H, Kahli M, Petryk N, Laperrousaz B, Jaszczyszyn Y, Drillon G, Nicolini FE, Perot G, Robert A, Fund C, Chibon F, Xia R, Wiels J, Argoul F, Maguer-Satta V, Arneodo A, Audit B, Hyrien O. Developmental and cancer-associated plasticity of DNA replication preferentially targets GC-poor, lowly expressed and late-replicating regions. Nucleic Acids Research. 2018;46:10157–10172. doi: 10.1093/nar/gky797. [DOI] [PMC free article] [PubMed] [Google Scholar]
Yang SC, Rhind N, Bechhoefer J. Modeling genome-wide replication kinetics reveals a mechanism for regulation of replication timing. Molecular Systems Biology. 2010;6:404. doi: 10.1038/msb.2010.61. [DOI] [PMC free article] [PubMed] [Google Scholar]
Yeeles JT, Deegan TD, Janska A, Early A, Diffley JF. Regulated eukaryotic DNA replication origin firing with purified proteins. Nature. 2015;519:431–435. doi: 10.1038/nature14285. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, Liu XS. Model-based analysis of ChIP-Seq (MACS) Genome Biology. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhao PA, Rivera-Mulia JC, Gilbert DM. Replication domains: genome compartmentalization into functional replication units. Advances in Experimental Medicine and Biology. 2017;1042:229–257. doi: 10.1007/978-981-10-6955-0_11. [DOI] [PubMed] [Google Scholar]
Zhao PA, Sasaki T, Gilbert DM. High-resolution Repli-Seq defines the temporal choreography of initiation, elongation and termination of replication in mammalian cells. Genome Biology. 2020;21:76. doi: 10.1186/s13059-020-01983-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

eLife. doi: 10.7554/eLife.62161.sa1

Decision letter

Editor: Bruce Stillman¹

Reviewed by: Bruce Stillman²

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

The manuscript characterizes the chromatin binding of two ORC subunits and two subunits of the MCM2-7 hexamer that are required for the initiation of DNA replication in different domains of the genome that are defined by the timing of DNA replication during S phase, early versus late. The binding of ORC and MCM subunits was compared with replication fork direction, transcription and replication timing profiles in human cells and subtle changes in the densities of these proteins in the different chromatin regions were observed. The distribution of these subunits, however, does not determine replication timing. The distribution in the genome of the histone H4K20me3 modification was also examined, indicating that it facilitates origin licensing in late-replicating regions. The authors suggest that factors other than the density and distribution of pre-Replicative Complexes determine the timing of the initiation of DNA replication during S phase.

Decision letter after peer review:

Thank you for submitting your article "Human ORC/MCM density is low in active genes and correlates with replication time but does not delimit initiation zones" for consideration by eLife. Your article has been reviewed by three peer reviewers, including Bruce Stillman as the Reviewing Editor and Reviewer #1, and the evaluation has been overseen by Kevin Struhl as the Senior Editor.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

As the editors have judged that your manuscript is of interest, but as described below that additional experiments and analysis are required before it will be re-considered for publication.

Summary:

The authors have performed an extensive analysis of the binding or ORC2 and ORC3 subunits of the Origin Recognition Complex (ORC) and the MCM3 and MCM7 subunits of the MCM2-7 helicase subunits by chromatin immunoprecipitation and then compared the data to replicating timing patterns throughout the genome of human Raji cells. Based on previous studies, some of which derive from the authors' labs, they have identified different regions of the genome that replicate at different times and these are associated or not with transcription start sites (TSS; early replicating), with non-transcribed DNA and regions that replicate unidirectionally or have no preference in a population of cells. Now the authors correlate the MCM2-7 and ORC binding with the replication timing and the various categories of chromatin.

They conclude that ORC and MCM binding correlates with each other in G1 phase, which is not surprising, and then that early replicating regions are more associated with domains enriched in ORC/MCM binding, and uniquely, that there are large regions of homogeneous distributions of ORC/MCM. They conclude that ORC/MCM correlates with replication timing but not the probability of replication initiation. They also demonstrate that histone H4K20me3 location is localized with ORC/MCM and that a certain late replicating DNA has higher H4K20me3 binding.

The paper contains a great deal of work and is of interest to those in the DNA replication field, adding to what is already known. There are some papers that are either not discussed well or not even mentioned and this should be corrected. However, before the paper can be re-considered for publication, the authors need to address some major concerns:

Essential revisions

1) One major issue relates to the small enrichments of ORC/MCM they observe in early versus late replicating DNA and the possibility that this represents a DNA isolation bias rather than a real correlation or cause (see specific points 4, 5 and 6 below). The paper does not address this issue which, given the small differences in ORC/MCM between early and late replicating regions (1.4 fold), this needs to be addressed. It is understood that genome-wide mapping of pre-RC components in mammalian cells is challenging. In all of the studies to date, the ChIP enrichment is very modest and not confined to tight peaks that are typical of transcription factor binding. The weak and broad patterns of localization and their enrichment at hypersensitive and transcription start sites may be a technical artifact or is reflective of the underlying biology. While the signals they are observing are likely biological, it is still very difficult (as the authors allude to in the Discussion) to disentangle causation and correlation with the observed patterns and enrichment at transcription start sites and DNase HS sites. What would make this story much stronger is to demonstrate that the MCM2-7 signal is dynamic – that is that the enrichment patterns they observe in G1 should be very different from the patterns in late S-phase or G2 when replication forks have displaced most of the MCMs. The authors need to perform ChIP-seq on the cells elutriated at 80 ml/min. The ORC profile may also change due to the dynamic nature of ORC in human cells, but the MCM definitely should only be enriched in late replicating regions of the genome in late S phase and this comparison is needed.

2) As shown in Figure 1A, stochastic variation can be observed for MCM3/7 and ORC2/3 ChIP-seq replicates, it's reasonable to speculate that the input signal can also fluctuate randomly among replicates. Moreover, most of the conclusions in this manuscript are based on the input normalized signals. However, as shown by Figure 1A and the record for ENA PRJEB32855, no replication is performed for the input. Thus, we suggest that the authors provide replicates for the input, and normalize the ChIP signal to pooled input signals.

3) As shown in Figure 1—figure supplement 2A, the input signals at "DNase hypersensitive (HS)" is lower than those at "no HS"; and in Figure 1—figure supplement 3E, the ChIP signals of MCM3/7 and ORC2/3 at "HS" are higher than those in "no HS". Thus, when the ChIP signals of MCM3/7 and ORC2/3 are normalized to the input, will the difference of ChIP signals of McCM3/7 or ORC2/3 at "HS" and "no HS" be amplified artificially?

Additional comments related to the above major comments described above:

1) In Figure 1, the authors show a convincing correlation between the ChIP-seq profiles of ORC2 and ORC3, as well as between MCM3 and MCM7. Miotto et al. had published ORC2 ChIP-seq data using asynchronous K562 human erythroid cells. The data in the Kirstein et al. paper reports ChIP data of ORC2 form Raji lymphoblastoid cells. Although the cell types and cell cycle stage are different, it would be valuable to show a Pearson correlation between the two different ORC2 sets. This should be shown as a Supplement to Figure 1. The authors could also comment on the data of ORC1 ChIP from Dellino et al., 2013 and Long et al., 2019 (see point 8 below) whether these ORC patterns correlate with the other ORC ChIP data.

2) Figure 5A and Figure 4—figure supplement 2B. The results show that ORC is 1.4 times more frequently found in early versus late replicating regions. It is possible that the chromatin in early replicating regions is more accessible to the ChIP procedure than late replicating regions, which are likely more compact and hence difficult to access using antibodies. How have the authors excluded the possibility that extraction of DNA fragments in early versus late replicating regions could explain the difference in ORC binding? It should be noted that the authors previous papers and the Discussion in this paper claims accessibility to chromatin by replication factors may explain replication timing, yet they have assumed that the sonication and antibodies used for ChIP analysis are equally accessible. It is known that heterochromatic regions of the genome form phase transitions that may behave completely differently than actively transcribed and "accessible" regions of the genome.

3) Figure 5E. The same concern outlined in comment 2 above could explain the small, albeit statistically significant difference between H4K20me3 high and low regions of the genome.

4) Figure 6A and related text. The very slight differences in H4K20me3 levels could also be explained by extraction artIfact.

5) “However, potential origins are defined by assembled MCM-DHs, not by ORC”. This statement that potential origins are determined by the MCM2-7 DH and not by ORC is not logical because MCM2-7 DH is loaded by ORC and other factors. The idea that it correlates with ORC is dismissed a few lines later, but none of these statements are justified. What is the evidence that access of firing factors to MCM2-7 DH are regulated by chromatin access?

6) In a significant paper describing ORC ChIP and replication initiation, it was shown that ORC binding correlates with histone H2AZ and this could explain early replication origin activity. This paper is not even cited, much less discussed, and it should be (see Long et al., 2019.

7) The authors have dismissed the replication timing model proposed by Miottto et al., 2016, but it is not clear why. This model should be discussed in relationship to the model in Figure 7.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

The reviewers have discussed your response to the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission that addresses one issue.

We would like to draw your attention to changes in our revision policy that we have made in response to COVID-19 (https://elifesciences.org/articles/57162). Specifically, we are asking editors to accept without delay manuscripts, like yours, that they judge can stand as eLife papers without additional data, even if they feel that they would make the manuscript stronger. Thus the revisions requested below only address clarity and presentation.

The revised paper has incorporated new data that compares the abundance of ORC2, ORC3, MCM3 and MCM7 protein at two difference stages of the cell division cycle and raises some interesting observations. The authors have extensively addressed all of the original reviewer comments and provide new analysis. The differences in MCM and ORC levels at the different classes of gene expression, except for the gene bodies, are very modest but nonetheless statistically significant.

The general conclusion is that ORC is wide spread on the genome in G1 phase and MCM localizes with sites of initiation of DNA replication, and that histone H3K4me3 is correlated with late origin firing. At all locations, the exact site of initiation of DNA replication is stochastic.

One paradox needs explaining that arises from the new data presented in the revised paper compared to data in the literature.

1) The data in Figure 4—figure supplement 2C show that MCM3 and MCM7 levels are reduced in S-G2-M cells compared to G1 cells, but there remains a difference between early versus late replication timing domains in both cell cycle stages. In contrast, ORC is high in early RFDs and low in late RFDs at both stages. Perhaps the authors should discuss the significance of this result, in light of the fact that ORC1 is degraded in human cells at the G1-S transition and should not be present later in the cell cycle util it is re-synthesized. Does this mean ORC2 and ORC3 remain chromatin bound during the cell cycle and what does this mean.

We suggest that the authors address this issue in the Discussion of a revised manuscript which can then proceed.

eLife. 2021 Mar 8;10:e62161. doi: 10.7554/eLife.62161.sa2

Author response

Essential revisions

1) One major issue relates to the small enrichments of ORC/MCM they observe in early versus late replicating DNA and the possibility that this represents a DNA isolation bias rather than a real correlation or cause (see specific points 4, 5 and 6 below). The paper does not address this issue which, given the small differences in ORC/MCM between early and late replicating regions (1.4 fold), this needs to be addressed. It is understood that genome-wide mapping of pre-RC components in mammalian cells is challenging. In all of the studies to date, the ChIP enrichment is very modest and not confined to tight peaks that are typical of transcription factor binding. The weak and broad patterns of localization and their enrichment at hypersensitive and transcription start sites may be a technical artifact or is reflective of the underlying biology. While the signals they are observing are likely biological, it is still very difficult (as the authors allude to in the Discussion) to disentangle causation and correlation with the observed patterns and enrichment at transcription start sites and DNase HS sites. What would make this story much stronger is to demonstrate that the MCM2-7 signal is dynamic – that is that the enrichment patterns they observe in G1 should be very different from the patterns in late S-phase or G2 when replication forks have displaced most of the MCMs. The authors need to perform ChIP-seq on the cells elutriated at 80 ml/min. The ORC profile may also change due to the dynamic nature of ORC in human cells, but the MCM definitely should only be enriched in late replicating regions of the genome in late S phase and this comparison is needed.

We thank the reviewers for this well-argued comment. We performed ORC and MCM ChIP-seq from elutriation fraction 80 ml/min, hereafter referred to as late S-G2-M (as determined in Figure 1—figure supplement 1) and analyzed their distribution with respect to replication timing.

In G1-derived chromatin, we previously observed both higher ORC and MCM binding in early RTDs compared to late RTDs (Figure 4, Figure 4—figure supplement 1D, E, Figure 5A and B, Figure 4—figure supplement 2B). In S-G2-M samples, ORC was still enriched in early compared to late RTDs. However, MCM signals were reduced in early compared to late RTDs, especially Mcm3. The early/late and G1/S-G2-M ratios ORC/MCM ChIP relative read frequencies are presented in Table 2. Early RTDs bind more ORC and MCM than late RTDs in G1, but this tendency is reduced or abolished in S-G2-M. Statistical testing validates that both ORC and MCM signals are dynamic between G1 and S-G2-M as predicted by the reviewers.

The cell-cycle dependent dynamic binding of ORC and MCM clearly suggests that the ORC/MCM enrichments observed in G1 in early compared late RTDs are of biological nature and not a technical artifact. We included the S-G2-M chromatin analyses in updated Figure 5B and the data without input normalization in updated Figure 4—figure supplement 2C.

2) As shown in Figure 1A, stochastic variation can be observed for MCM3/7 and ORC2/3 ChIP-seq replicates, it's reasonable to speculate that the input signal can also fluctuate randomly among replicates. Moreover, most of the conclusions in this manuscript are based on the input normalized signals. However, as shown by Figure 1A and the record for ENA PRJEB32855, no replication is performed for the input. Thus, we suggest that the authors provide replicates for the input, and normalize the ChIP signal to pooled input signals.

The reviewers are correct, we indeed did not show the input in replicates, as we were working with pooled input to obtain a higher read coverage. However, we performed input sequencing in three replicates and now show input replicates in Figure 1A. We uploaded the according input files to ENA PRJEB32855.

3) As shown in Figure 1—figure supplement 2A, the input signals at "DNase hypersensitive (HS)" is lower than those at "no HS"; and in Figure 1—figure supplement 3E, the ChIP signals of MCM3/7 and ORC2/3 at "HS" are higher than those in "no HS". Thus, when the ChIP signals of MCM3/7 and ORC2/3 are normalized to the input, will the difference of ChIP signals of McCM3/7 or ORC2/3 at "HS" and "no HS" be amplified artificially?

We indeed observed differential normalized read frequencies for the input when comparing “HS” to “no HS” (manuscript Figure 1—figure supplement 2B, Figure 1—figure supplement 3G).

When analyzing ORC and MCM normalized read frequencies without input division, we still observe a significant enrichment of ORC and MCM at HS sites. However, the enrichment is clearly more prominent for ORC, which is in line with previously reported Orc2 ChIP (Miotto et al., 2016). We do not necessarily expect the same observation for MCM due to their more disperse distribution.

We added DNase HS data without input normalization as new Figure 1—figure supplement 3G to the manuscript.

Input division is a widely used approach for ChIP-seq data analysis, because it also corrects for chromatin solubilization efficiencies between experiments. However, in our manuscript, we were aware of a potential bias introduced by the input. This is of relevance if the signal heights are low, as is the case for ORC and MCM. To visualize the impact of input normalization on the ORC/MCM profiles, we additionally included all analyses without input normalization in the figure supplements. When comparing both analysis approaches, our conclusions do not change qualitatively, although we do observe minor quantitative differences. We explain these important controls in the Results and mention figures comparing data with or without input normalization throughout the text.

Additional comments related to the above major comments described above:

1) In Figure 1, the authors show a convincing correlation between the ChIP-seq profiles of ORC2 and ORC3, as well as between MCM3 and MCM7. Miotto et al. had published ORC2 ChIP-seq data using asynchronous K562 human erythroid cells. The data in the Kirstein et al. paper reports ChIP data of ORC2 form Raji lymphoblastoid cells. Although the cell types and cell cycle stage are different, it would be valuable to show a Pearson correlation between the two different ORC2 sets. This should be shown as a Supplement to Figure 1. The authors could also comment on the data of ORC1 ChIP from Dellino et al., 2013 and Long et al., 2019 (see point 8 below) whether these ORC patterns correlate with the other ORC ChIP data.

Our ORC/MCM ChIP-seq was established using optimized cross-linking time and mild sonication settings to avoid disruption of complexes on chromatin. The Covaris ultrasonicator S220 focuses the energy on the sample, allowing a considerably reduced applied power. These carefully established settings and the use of different antibodies to IP ORC- and MCM-proteins were presumably also the reason why we observed the dispersed ORC/MCM profile described in the manuscript. The homogenous ORC/MCM distribution over the genome was addressed by our binning approach at 1 kb resolution.

Dellino et al. performed Orc1 ChIP from low-density chromatin of HeLa cells (Dellino et al., 2013). Thereby, they introduced a bias towards early replicating and transcriptionally active euchromatic regions. Indeed, when binning Orc1 data at 1kb, Orc1 was also enriched at TSS (Author response image 1A). Interestingly, low density input was also mildly enriched at TSS.

Author response image 1. — a) HeLa Orc1 and b) K562 Orc2 normalized read frequencies around TSSs or TTSs (independent of gene activity). Only genes larger than 30 kb without any adjacent gene within 15 kb were considered. Distances from TSSs or TTSs are indicated in kb. Means of Orc1 and Orc2 frequencies are shown ± 2 x SEM (lighter shadows). The dashed grey horizontal line indicates relative read frequency 1.0 for reference. c) Heat map of Pearson correlation coefficients R of our ORC ChIPs and the sum of two Orc2 replicates in K562 (Miotto et al.) and Orc1 in HeLa (Dellino et al., 2013) at 1 kb resolution. Column and line order were determined by complete linkage hierarchical clustering using the correlation distance (d = 1-r). d) Average ORC relative read frequencies at Orc2 (Miotto et al., 2016) peaks (>1 kb).

Orc2 ChIPs were performed by Miotto et al. in asynchronously cycling K562 cells using unfractionated chromatin and found to majorly depend on chromatin accessibility (Miotto et al., 2016). They detected Orc2 in promoters and regions enriched for active chromatin marks, hence also reporting a link between Orc2 and transcription. This was confirmed by binning their data at 1kb and analyzing their distribution across genes (Author response image 1B).

Both Orc1 and Orc2 were found at TSS, in line with our data (Figure 3A). However, both approaches did not detect the depletion from the gene body. This observation may either result from G1 cell cycle enrichment, or technical differences, or a combination of both. TSS also seem to be robust “storage sites” of ORC, independent of the cell cycle. We also assume that rigorous sonication reduces “weak” and disperse binding throughout the genome, thus mainly detecting majorly robust ORC binding at accessible sites. Technical differences, as well as the different cell lines and cell cycle stages may be the reason why we observe a poor correlation between Orc1 (Dellino), Orc2 (Miotto) and our ORC data at 1 kb resolution (Author response image 1C).

However, when we performed MACS2 peak calling on Orc2 in K562 (using default settings and cutoffs results in 16,767 detected peaks overlapping from two replicates) and calculated the average profile of our Orc2 or Orc3 at those peaks, we observe substantial co-enrichment (Author response image 1D).

We included Author response image 1D in the manuscript as Figure 1—figure supplement 3e and discussed our Orc2/3 enrichments at K562 Orc2 peaks together with the relation to DNase HS sites.

2) Figure 5A and Figure 4—figure supplement 2B. The results show that ORC is 1.4 times more frequently found in early versus late replicating regions. It is possible that the chromatin in early replicating regions is more accessible to the ChIP procedure than late replicating regions, which are likely more compact and hence difficult to access using antibodies. How have the authors excluded the possibility that extraction of DNA fragments in early versus late replicating regions could explain the difference in ORC binding? It should be noted that the authors previous papers and the Discussion in this paper claims accessibility to chromatin by replication factors may explain replication timing, yet they have assumed that the sonication and antibodies used for ChIP analysis are equally accessible. It is known that heterochromatic regions of the genome form phase transitions that may behave completely differently than actively transcribed and "accessible" regions of the genome.

The reviewers are addressing a very important issue that is often neglected when discussing ChIP results and is also often leading to misunderstanding. Late replicating regions of the genome are mainly heterochromatic and therefore often described as more compact and less accessible. This describes the situation within a cell. As a consequence, this chromatin is often more difficult to solubilize by sonification and/or MNase digest and therefore underrepresented in ChIP samples. In the ChIP test tube, however, the solubilized heterochromatin is equally accessible to antibodies as euchromatin. Furthermore, in our experiments, early replicating, “accessible” (eu)chromatin is in fact underrepresented in the input compared to late replicating (hetero)chromatin (Figure 4—figure supplement 2B). This is a consequence of the ultrasonication step: Less compact euchromatin requires less power to be fragmented to smaller fragments than compact heterochromatin. These euchromatic fragments are often too small to be captured resulting in an underrepresentation of euchromatin in ChIP samples. Therefore, input normalization enhances, rather than negates, the observed ORC enrichment in early replicating regions. In conclusion, neither an easier chromatin sonication nor an easier access of solubilized chromatin to antibodies can explain the stronger binding of ORC to early replicating regions.

3) Figure 5E. The same concern outlined in comment 2 above could explain the small, albeit statistically significant difference between H4K20me3 high and low regions of the genome.

We do not have any information about the relative accessibility of H4K20me3-high/low regions to provide a direct answer to this point. However, both H4K20me3-low and -high data sets are extracted from late replicating domains, as we are exclusively investigating late-replicating, non-genic AS and flat, null RFD (NRRs) that are only found in late-replicating DNA. Therefore, we do not expect a differential cellular accessibility and solubility unless uncorrrelated to replication timing.

Furthermore, analyses of ORC/MCM enrichments at H4K20me3-low and H4K20me3-high sites in G1 and S-G2-M chromatin clearly show the dynamics of ORC/MCM before and after replication. In particular ORC/MCM binding is reduced in S-G2-M at the high-H4K20me3 windows, confirming the expected dynamic behavior at potential replication origins.

We have integrated this argument in our manuscript and the S-G2-M data set as Fiugre 5F and Figure 5—figure supplement 1F.

4) Figure 6A and related text. The very slight differences in H4K20me3 levels could also be explained by extraction artIfact.

We are working with up to three biological replicates and target 6 different proteins (Orc2, Orc3, Mcm3, Mcm7, H4K20me3, H4K20me1). Extraction artefacts specific to early versus late replicating chromatin would appear in all samples, disregarding the targeted protein. In the case of H4K20me3, we observe a specific enrichment of high H4K20me3 levels in late replicating NRRs (Figure 6A, bottom row) that is not observed for any ORC/MCM, hence we can almost certainly exclude that our observation relies on extraction artefacts.

5) “However, potential origins are defined by assembled MCM-DHs, not by ORC”. This statement that potential origins are determined by the MCM2-7 DH and not by ORC is not logical because MCM2-7 DH is loaded by ORC and other factors. The idea that it correlates with ORC is dismissed a few lines later, but none of these statements are justified.

We agree that MCM-DH is loaded by ORC. However, it is the MCM-DH helicase that eventually nucleates replisome assembly following duplex melting (Yeeles et al., 2015). Therefore, the sites of replication initiation are ultimately dictated by the location of activated MCM-DH, not ORC. Accordingly, it has been shown several times that ORC is dispensable for replication initiation once MCM-DH have been loaded (Gros et al., 2015; Hua and Newport, 1998; Rowles et al., 1999). Furthermore, once loaded onto DNA by ORC, the MCM-DH helicase is free to either diffuse away from ORC (Evrin et al., 2009; Remus and Diffley, 2009) or be actively displaced by transcription or other chromatin-related processes (Edwards et al., 2002; Gros et al., 2015; Powell et al., 2015; Ritzi et al., 1998) Therefore, we expect that the location of initiation sites is defined by activated MCM-DH, whose location can differ from ORC.

What is the evidence that access of firing factors to MCM2-7 DH are regulated by chromatin access?

A large number of studies correlate genome-wide DNA accessibility (as measured by nuclease, methylase or transposase assays) with early replication and/or sites of replication initiation (Bell et al., 2010; Gilbert et al., 2004). We assume that the reviewers do not question this correlation. The question is therefore whether this correlation reflects causality.

On the one hand, we see no reason to assume that limiting origin firing factors would diffuse in chromatin very differently from the proteins used in DNA accessibility assays. We therefore expect them to encounter MCM-DH in accessible chromatin faster than in less accessible chromatin, so that even a genome with a theoretical strictly uniform density of MCM-DH would still show early- and late-replicating regions. Thus, replication timing would be at least in part the inevitable consequence of the uneven accessibility of the genome.

On the other hand, it has been proposed that budding yeast replication timing profiles can simply be accounted by a "multiple initiator model" in which all MCM-DH are equally accessible to limiting firing factors but individual origins differ by the number of bound MCM-DH so that origins with more MCM-DH fire earlier on average than those with less (Yang et al., 2010). However, Das et al. have reported that the correlation between MCM ChIP-seq signal and timing, although significant, is < 0.5 and that other factors must contribute as well (Das et al., 2015). For example, large numbers of MCM-DHs are loaded at late-replicating telomeres, suggesting that heterochromatin can somehow delay the firing of loaded MCM-DHs.

Histone modifications are known to modulate chromatin accessibility with effects on origin firing time. In budding yeast, moving a given origin from a late to an early replicating region of the chromosome can advance its firing time (Ferguson and Fangman, 1992). Targeting histone acetylases or deacetylases to specific chromosomal sites can increase or decrease origin efficiency both in yeast (Vogelauer et al., 2002) and human cells (Goren et al., 2008). Deleting the yeast Sin3-Rpd3 deacetylase causes origins that normally fire late to fire earlier in S phase, coincident with increased acetylation and accelerated loading of Cdc45, a limiting origin firing factor (Aparicio et al., 2004; Vogelauer et al., 2002). Thus, the local chromatin environment has the ability to potentially permit or restrict an origin from firing.

One potentially important difference between protein samples used in accessibility assays and diffusible origin firing factors is that the latter may experience specific physical interactions that modulate the effect of chromatin accessibility. For example Swi6, a fission yeast HP1 homologue, interacts with DDK, a rate-limiting origin firing kinase, to activate origins in early S phase, specifically at pericentromeric and silent mating-type heterochromatin loci (Hayashi et al., 2009). Mutations that diminish this interaction but do not affect Swi6/HP1 localization result in retardation of replication specifically at these heterochromatin loci. Therefore, this specific interaction can overcome the suppressive effect of “closed” chromatin created by Swi6/HP1. Similarly, a prevailing hypothesis is that chromatin acetylation stimulates DNA replication by opening chromatin structure (Gindin et al., 2014). However, recent results suggest that the acetyl-histone binding proteins BRD2 and BRD4 physically interact with the rate limiting origin firing factor TICRR/TRESLIN and that abrogation of this interaction disrupts the normal replication program (Sansam et al., 2018).

In summary, it is clear that the replication program reflects both, heterogeneity in MCM-DH loading and regulation of MCM-DH activation by chromatin structure. Whether specific histone modifications act by facilitating diffusion of firing factors through chromatin, by providing specific docking platforms for such factors, or by a combination of both mechanisms is not completely elucidated. At any rate, however, heterogeneity in MCM-DH loading alone appears insufficient to explain our results. An additional, significant role for chromatin structure in regulating MCM activation after MCM loading by ORC therefore appears likely. We have restructured the corresponding paragraphs in the Discussion, cited selected relevant studies, and tried to integrate these complex mechanisms in a rephrased Conclusion and the graphical model (Figure 7).

6) In a significant paper describing ORC ChIP and replication initiation, it was shown that ORC binding correlates with histone H2AZ and this could explain early replication origin activity. This paper is not even cited, much less discussed, and it should be (see Long et al., 2019.

We apologize for omitting this important paper. We have now discussed this publication in our manuscript. Its findings are in line with our previous paper that showed H2A.Z enrichment within AS (Petryk et al., 2016).

7) The authors have dismissed the replication timing model proposed by Miottto et al. 2016, but it is not clear why. This model should be discussed in relationship to the model in Figure 7.

We had no intention of dismissing the replication timing model proposed by Miotto et al. (Miotto et al., 2016). The authors simulated in this study replication timing based on their detected ~52,000 Orc2 peaks and assuming stochastic origin firing. They obtained a good correlation between simulated and Repli-seq determined replication timing profiles, and the correlation improved when they also assumed dispersed origin firing from “non-specific” Orc2 sites, that escaped experimental detection in their setup. Consistent with Miotto et al., we observed global differences of ORC levels when comparing early against late RTDs (Figure 5A, Figure 4—figure supplement 2B). In addition, we detected ORC in regions where Miotto et al. suggested its presence but did not detect it.

However, in an independent modelling approach (Gindin et al., 2014), all chromatin marks associated to open chromatin also allowed very good predictions of replication timing based on a large diversity of chromatin marks associated to open chromatin. Hence, the question remains of what is causally responsible for early replication / replication initiation. Furthermore, the current resolution of RT profiles (50-100 kb) is not sufficient to map replication IZs, whereas the resolution of RFD profiles (1-5 kb) does. Whether the models of Miotto et al. or Gindin et al. remain valid up to replication initiation sites also remains to be addressed.

Our work, which for the first time compares the location of ORC and MCM to high-resolution RFD profiles, suggests that neither ORC nor MCM density suffices to predict replication initiation at this resolution. We therefore conclude that the replication program reflects not only heterogeneity in ORC/MCM-DH density but also regulation of MCM-DH activation by chromatin structure (see additional comment 5 above). In other words, our model is consistent with the model of Miotto et al., but with an additional layer of regulation. We added a paragraph in the Discussion to explicit these points.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

The revised paper has incorporated new data that compares the abundance of ORC2, ORC3, MCM3 and MCM7 protein at two difference stages of the cell division cycle and raises some interesting observations. The authors have extensively addressed all of the original reviewer comments and provide new analysis. The differences in MCM and ORC levels at the different classes of gene expression, except for the gene bodies, are very modest but nonetheless statistically significant.

The general conclusion is that ORC is wide spread on the genome in G1 phase and MCM localizes with sites of initiation of DNA replication, and that histone H3K4me3 is correlated with late origin firing. At all locations, the exact site of initiation of DNA replication is stochastic.

One paradox needs explaining that arises from the new data presented in the revised paper compared to data in the literature.

1) The data in Figure 4—figure supplement 2C show that MCM3 and MCM7 levels are reduced in S-G2-M cells compared to G1 cells, but there remains a difference between early versus late replication timing domains in both cell cycle stages. In contrast, ORC is high in early RFDs and low in late RFDs at both stages. Perhaps the authors should discuss the significance of this result, in light of the fact that ORC1 is degraded in human cells at the G1-S transition and should not be present later in the cell cycle util it is re-synthesized. Does this mean ORC2 and ORC3 remain chromatin bound during the cell cycle and what does this mean.

We suggest that the authors address this issue in the Discussion of a revised manuscript which can then proceed.

We thank the reviewer for raising this interesting question. It is well described that Orc1 in human cells is degraded at the G1-S transition and in early S phase and is re-synthesized at in mitosis. However, the chromatin binding of the remaining ORC subunits is controversially discussed. In our study, we detect Orc2 and Orc3 binding to S-G2-M chromatin. However, ChIP-seq, which we used to detect chromatin-bound ORC only allows monitoring the relative distribution of chromatin-bound proteins along the genome and not their absolute levels. Therefore, we can only be speculative and do not exclude that Orc2 and Orc3 binding to chromatin is globally decreased after replication. The binding of Orc2 and Orc3 we detect in S-G2-M may either occur independently of Orc1 or reflect the binding of the entire complex in late mitotic cells. We have added a corresponding discussion in our manuscript.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

Kirstein N, Buschle A, Wu X, Krebs S, Blum H, Hammerschmidt W, Lacroix L, Hyrien O, Audit B, Schepers A. 2020. Human ORC/MCM density is low in active genes and correlates with replication time but does not delimit initiation zones. European Nucleotide Archive (ENA) PRJEB32855 [DOI] [PMC free article] [PubMed]
Buschle A, Mrozek-Gorska P, Krebs S, Blum H, Cernilogar FM, Schotta G, Pich D, Straub T, Hammerschmidt W. 2019. RNA-seq in Raji cells with inducible BZLF1 prior to and after induction of EBV's lytic cycle by doxycycline. European Nucleotide Archive (ENA) PRJEB31867
Wu X, Kabalane H, Kahli M, Petryk N, Laperrousaz B, Jaszczyszyn Y, Drillon G, Nicolini FE, Perot G, Robert A, Fund C, Chibon F, Xia R, Wiels J, Argoul F, Maguer-Satta V, Arneodo A, Audit B, Hyrien O. 2018. Developmental and cancer-associated plasticity of DNA replication preferentially targets GC-poor, lowly expressed and late-replicating regions. European Nucleotide Archive (ENA) PRJEB25180
Sima J, Bartlett DA, Gordon MR, Gilbert DM. 2018. Bacterial artificial chromosomes establish replication timing and sub-nuclear compartment de novo as extra-chromosomal vectors [repli-seq] NCBI Gene Expression Omnibus. GSE102522 [DOI] [PMC free article] [PubMed]
Petryk N, Dalby M, Wenger A, Stromme CB, Strandsby A, Andersson R, Groth A. 2018. MCM2 promotes symmetric inheritance of modified histones during DNA replication. European Nucleotide Archive (ENA) SRR7535256 [DOI] [PubMed]
Tubbs A, Sridharan S, van Wietmarschen N, Maman Y, Callen E, Stanlie A, Wu W, Wu X, Day A, Wong N, Yin M, Canela A, Fu H, Redon C, Pruitt SC, Jaszczyszyn Y, Aladjem MI, Aplan PD, Hyrien O, Nussenzweig A. 2018. OK-seq profile from cycling (S) phase untreated B cells. NCBI Gene Expression Omnibus. GSE116319

Supplementary Materials

Transparent reporting form

elife-62161-transrepform.docx^{(251.3KB, docx)}

Data Availability Statement

The following dataset was generated:

The following previously published datasets were used:

[bib1] Adams A, Lindahl T, Klein G. Linear association between cellular DNA and Epstein-Barr virus DNA in a human lymphoblastoid cell line. PNAS. 1973;70:2888–2892. doi: 10.1073/pnas.70.10.2888. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib2] Akerman I, Kasaai B, Bazarova A, Sang PB, Peiffer I, Artufel M, Derelle R, Smith G, Rodriguez-Martinez M, Romano M, Kinet S, Tino P, Theillet C, Taylor N, Ballester B, Méchali M. A predictable conserved DNA base composition signature defines human core DNA replication origins. Nature Communications. 2020;11:18527. doi: 10.1038/s41467-020-18527-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib3] Anders S, Pyl PT, Huber W. HTSeq--a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] Audit B, Baker A, Chen CL, Rappailles A, Guilbaud G, Julienne H, Goldar A, d'Aubenton-Carafa Y, Hyrien O, Thermes C, Arneodo A. Multiscale analysis of genome-wide replication timing profiles using a wavelet-based signal-processing algorithm. Nature Protocols. 2013;8:98–110. doi: 10.1038/nprot.2012.145. [DOI] [PubMed] [Google Scholar]

[bib5] Baker A, Audit B, Chen CL, Moindrot B, Leleu A, Guilbaud G, Rappailles A, Vaillant C, Goldar A, Mongelard F, d'Aubenton-Carafa Y, Hyrien O, Thermes C, Arneodo A. Replication fork polarity gradients revealed by megabase-sized U-shaped replication timing domains in human cell lines. PLOS Computational Biology. 2012;8:e1002443. doi: 10.1371/journal.pcbi.1002443. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] Beck DB, Burton A, Oda H, Ziegler-Birling C, Torres-Padilla ME, Reinberg D. The role of PR-Set7 in replication licensing depends on Suv4-20h. Genes & Development. 2012a;26:2580–2589. doi: 10.1101/gad.195636.112. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib7] Beck DB, Oda H, Shen SS, Reinberg D. PR-Set7 and H4K20me1: at the crossroads of genome integrity, cell cycle, chromosome condensation, and transcription. Genes & Development. 2012b;26:325–337. doi: 10.1101/gad.177444.111. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] Bell SP, Kaguni JM. Helicase loading at chromosomal origins of replication. Cold Spring Harbor Perspectives in Biology. 2013;5:a010124. doi: 10.1101/cshperspect.a010124. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib9] Benetti R, Gonzalo S, Jaco I, Schotta G, Klatt P, Jenuwein T, Blasco MA. Suv4-20h deficiency results in telomere elongation and derepression of telomere recombination. Journal of Cell Biology. 2007;178:925–936. doi: 10.1083/jcb.200703081. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] Boos D, Ferreira P. Origin firing regulations to control genome replication timing. Genes. 2019;10:199. doi: 10.3390/genes10030199. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] Boulos RE, Drillon G, Argoul F, Arneodo A, Audit B. Structural organization of human replication timing domains. FEBS Letters. 2015;589:2944–2957. doi: 10.1016/j.febslet.2015.04.015. [DOI] [PubMed] [Google Scholar]

[bib12] Brustel J, Kirstein N, Izard F, Grimaud C, Prorok P, Cayrou C, Schotta G, Abdelsamie AF, Déjardin J, Méchali M, Baldacci G, Sardet C, Cadoret JC, Schepers A, Julien E. Histone H4K20 tri-methylation at late-firing origins ensures timely heterochromatin replication. The EMBO Journal. 2017;36:2726–2741. doi: 10.15252/embj.201796541. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] Cayrou C, Ballester B, Peiffer I, Fenouil R, Coulombe P, Andrau JC, van Helden J, Méchali M. The chromatin environment shapes DNA replication origin organization and defines origin classes. Genome Research. 2015;25:1873–1885. doi: 10.1101/gr.192799.115. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] Chaudhuri B, Xu H, Todorov I, Dutta A, Yates JL. Human DNA replication initiation factors, ORC and MCM, associate with oriP of Epstein-Barr virus. PNAS. 2001;98:10085–10089. doi: 10.1073/pnas.181347998. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib15] Chen YH, Keegan S, Kahli M, Tonzi P, Fenyö D, Huang TT, Smith DJ. Transcription shapes DNA replication initiation and termination in human cells. Nature Structural & Molecular Biology. 2019;26:67–77. doi: 10.1038/s41594-018-0171-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] Das SP, Borrman T, Liu VW, Yang SC, Bechhoefer J, Rhind N. Replication timing is regulated by the number of MCMs loaded at origins. Genome Research. 2015;25:1886–1892. doi: 10.1101/gr.195305.115. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib17] Dellino GI, Cittaro D, Piccioni R, Luzi L, Banfi S, Segalla S, Cesaroni M, Mendoza-Maldonado R, Giacca M, Pelicci PG. Genome-wide mapping of human DNA-replication origins: levels of transcription at ORC1 sites regulate origin selection and replication timing. Genome Research. 2013;23:1–11. doi: 10.1101/gr.142331.112. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib18] Demczuk A, Gauthier MG, Veras I, Kosiyatrakul S, Schildkraut CL, Busslinger M, Bechhoefer J, Norio P. Regulation of DNA replication within the immunoglobulin heavy-chain locus during B cell commitment. PLOS Biology. 2012;10:e1001360. doi: 10.1371/journal.pbio.1001360. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib19] Dhar SK, Yoshida K, Machida Y, Khaira P, Chaudhuri B, Wohlschlegel JA, Leffak M, Yates J, Dutta A. Replication from oriP of Epstein-Barr virus requires human ORC and is inhibited by geminin. Cell. 2001;106:287–296. doi: 10.1016/S0092-8674(01)00458-5. [DOI] [PubMed] [Google Scholar]

[bib20] Douglas ME, Ali FA, Costa A, Diffley JFX. The mechanism of eukaryotic CMG helicase activation. Nature. 2018;555:265–268. doi: 10.1038/nature25787. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] Evrin C, Clarke P, Zech J, Lurz R, Sun J, Uhle S, Li H, Stillman B, Speck C. A double-hexameric MCM2-7 complex is loaded onto origin DNA during licensing of eukaryotic DNA replication. PNAS. 2009;106:20240–20245. doi: 10.1073/pnas.0911500106. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib22] Feng J, Liu T, Qin B, Zhang Y, Liu XS. Identifying ChIP-seq enrichment using MACS. Nature Protocols. 2012;7:1728–1740. doi: 10.1038/nprot.2012.101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib23] Fragkos M, Ganier O, Coulombe P, Méchali M. DNA replication origin activation in space and time. Nature Reviews Molecular Cell Biology. 2015;16:360–374. doi: 10.1038/nrm4002. [DOI] [PubMed] [Google Scholar]

[bib24] Gerhardt J, Jafar S, Spindler MP, Ott E, Schepers A. Identification of new human origins of DNA replication by an origin-trapping assay. Molecular and Cellular Biology. 2006;26:7731–7746. doi: 10.1128/MCB.01392-06. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib25] Gindin Y, Valenzuela MS, Aladjem MI, Meltzer PS, Bilke S. A chromatin structure-based model accurately predicts DNA replication timing in human cells. Molecular Systems Biology. 2014;10:722. doi: 10.1002/msb.134859. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] Guilbaud G, Rappailles A, Baker A, Chen CL, Arneodo A, Goldar A, d'Aubenton-Carafa Y, Thermes C, Audit B, Hyrien O. Evidence for sequential and increasing activation of replication origins along replication timing gradients in the human genome. PLOS Computational Biology. 2011;7:e1002322. doi: 10.1371/journal.pcbi.1002322. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib27] Hamlin JL, Mesner LD, Dijkwel PA. A winding road to origin discovery. Chromosome Research. 2010;18:45–61. doi: 10.1007/s10577-009-9089-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib28] Harris CR, Millman KJ, van der Walt SJ, Gommers R, Virtanen P, Cournapeau D, Wieser E, Taylor J, Berg S, Smith NJ, Kern R, Picus M, Hoyer S, van Kerkwijk MH, Brett M, Haldane A, Del Río JF, Wiebe M, Peterson P, Gérard-Marchant P, Sheppard K, Reddy T, Weckesser W, Abbasi H, Gohlke C, Oliphant TE. Array programming with NumPy. Nature. 2020;585:357–362. doi: 10.1038/s41586-020-2649-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib29] Hennion M, Arbona JM, Lacroix L, Cruaud C, Theulot B, Tallec BL, Proux F, Wu X, Novikova E, Engelen S, Lemainque A, Audit B, Hyrien O. FORK-seq: replication landscape of the Saccharomyces cerevisiae genome by nanopore sequencing. Genome Biology. 2020;21:125. doi: 10.1186/s13059-020-02013-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib30] Hulke ML, Massey DJ, Koren A. Genomic methods for measuring DNA replication dynamics. Chromosome Research. 2020;28:49–67. doi: 10.1007/s10577-019-09624-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib31] Hunter JD. Matplotlib: a 2D graphics environment. Computing in Science & Engineering. 2007;9:90–95. doi: 10.1109/MCSE.2007.55. [DOI] [Google Scholar]

[bib32] Hyrien O, Maric C, Méchali M. Transition in specification of embryonic metazoan DNA replication origins. Science. 1995;270:994–997. doi: 10.1126/science.270.5238.994. [DOI] [PubMed] [Google Scholar]

[bib33] Hyrien O. How MCM loading and spreading specify eukaryotic DNA replication initiation sites. F1000Research. 2016;5:2063. doi: 10.12688/f1000research.9008.1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib34] Jin C, Zang C, Wei G, Cui K, Peng W, Zhao K, Felsenfeld G. H3.3/H2A.Z double variant–containing nucleosomes mark 'nucleosome-free regions' of active promoters and other regulatory regions. Nature Genetics. 2009;41:941–945. doi: 10.1038/ng.409. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib35] Jørgensen S, Schotta G, Sørensen CS. Histone H4 lysine 20 methylation: key player in epigenetic regulation of genomic integrity. Nucleic Acids Research. 2013;41:2797–2806. doi: 10.1093/nar/gkt012. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] Kara N, Hossain M, Prasanth SG, Stillman B. Orc1 binding to mitotic chromosomes precedes spatial patterning during G1 phase and assembly of the origin recognition complex in human cells. Journal of Biological Chemistry. 2015;290:12355–12369. doi: 10.1074/jbc.M114.625012. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib37] Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biology. 2013;14:R36. doi: 10.1186/gb-2013-14-4-r36. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib38] Knott SR, Viggiani CJ, Aparicio OM. To promote and protect: coordinating DNA replication and transcription for genome stability. Epigenetics. 2009;4:362–365. doi: 10.4161/epi.4.6.9712. [DOI] [PubMed] [Google Scholar]

[bib39] Kreitz S, Ritzi M, Baack M, Knippers R. The human origin recognition complex protein 1 dissociates from chromatin during S phase in HeLa cells. Journal of Biological Chemistry. 2001;276:6337–6342. doi: 10.1074/jbc.M009473200. [DOI] [PubMed] [Google Scholar]

[bib40] Krude T, Jackman M, Pines J, Laskey RA. Cyclin/Cdk-dependent initiation of DNA replication in a human cell-free system. Cell. 1997;88:109–119. doi: 10.1016/S0092-8674(00)81863-2. [DOI] [PubMed] [Google Scholar]

[bib41] Kumagai A, Dunphy WG. Binding of the Treslin-MTBP complex to specific regions of the human genome promotes the initiation of DNA replication. Cell Reports. 2020;32:108178. doi: 10.1016/j.celrep.2020.108178. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib42] Kuo AJ, Song J, Cheung P, Ishibe-Murakami S, Yamazoe S, Chen JK, Patel DJ, Gozani O. The BAH domain of ORC1 links H4K20me2 to DNA replication licensing and Meier-Gorlin syndrome. Nature. 2012;484:115–119. doi: 10.1038/nature10956. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib43] Ladenburger EM, Keller C, Knippers R. Identification of a binding region for human origin recognition complex proteins 1 and 2 that coincides with an origin of DNA replication. Molecular and Cellular Biology. 2002;22:1036–1048. doi: 10.1128/MCB.22.4.1036-1048.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib44] Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib45] Lebofsky R, Heilig R, Sonnleitner M, Weissenbach J, Bensimon A. DNA replication origin interference increases the spacing between initiation events in human cells. Molecular Biology of the Cell. 2006;17:5337–5345. doi: 10.1091/mbc.e06-04-0298. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib46] Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib47] Long H, Zhang L, Lv M, Wen Z, Zhang W, Chen X, Zhang P, Li T, Chang L, Jin C, Wu G, Wang X, Yang F, Pei J, Chen P, Margueron R, Deng H, Zhu M, Li G. H2A.Z facilitates licensing and activation of early replication origins. Nature. 2020;577:576–581. doi: 10.1038/s41586-019-1877-9. [DOI] [PubMed] [Google Scholar]

[bib48] MacAlpine HK, Gordân R, Powell SK, Hartemink AJ, MacAlpine DM. Drosophila ORC localizes to open chromatin and marks sites of cohesin complex loading. Genome Research. 2010;20:201–211. doi: 10.1101/gr.097873.109. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib49] Macheret M, Halazonetis TD. Intragenic origins due to short G1 phases underlie oncogene-induced DNA replication stress. Nature. 2018;555:112–116. doi: 10.1038/nature25507. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib50] Marahrens Y, Stillman B. A yeast chromosomal origin of DNA replication defined by multiple functional elements. Science. 1992;255:817–823. doi: 10.1126/science.1536007. [DOI] [PubMed] [Google Scholar]

[bib51] Marchal C, Sima J, Gilbert DM. Control of DNA replication timing in the 3D genome. Nature Reviews Molecular Cell Biology. 2019;20:721–737. doi: 10.1038/s41580-019-0162-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib52] Martin MM, Ryan M, Kim R, Zakas AL, Fu H, Lin CM, Reinhold WC, Davis SR, Bilke S, Liu H, Doroshow JH, Reimers MA, Valenzuela MS, Pommier Y, Meltzer PS, Aladjem MI. Genome-wide depletion of replication initiation events in highly transcribed regions. Genome Research. 2011;21:1822–1832. doi: 10.1101/gr.124644.111. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib53] McGuffee SR, Smith DJ, Whitehouse I. Quantitative, genome-wide analysis of eukaryotic replication initiation and termination. Molecular Cell. 2013;50:123–135. doi: 10.1016/j.molcel.2013.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib54] Méndez J, Zou-Yang XH, Kim SY, Hidaka M, Tansey WP, Stillman B. Human origin recognition complex large subunit is degraded by ubiquitin-mediated proteolysis after initiation of DNA replication. Molecular Cell. 2002;9:481–491. doi: 10.1016/S1097-2765(02)00467-7. [DOI] [PubMed] [Google Scholar]

[bib55] Mesner LD, Valsakumar V, Cieslik M, Pickin R, Hamlin JL, Bekiranov S. Bubble-seq analysis of the human genome reveals distinct chromatin-mediated mechanisms for regulating early- and late-firing origins. Genome Research. 2013;23:1774–1788. doi: 10.1101/gr.155218.113. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib56] Miotto B, Ji Z, Struhl K. Selectivity of ORC binding sites and the relation to replication timing, fragile sites, and deletions in cancers. PNAS. 2016;113:E4810–E4819. doi: 10.1073/pnas.1609060113. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib57] Moiseeva TN, Bakkenist CJ. Regulation of the initiation of DNA replication in human cells. DNA Repair. 2018;72:99–106. doi: 10.1016/j.dnarep.2018.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib58] Nakamura K, Saredi G, Becker JR, Foster BM, Nguyen NV, Beyer TE, Cesa LC, Faull PA, Lukauskas S, Frimurer T, Chapman JR, Bartke T, Groth A. H4K20me0 recognition by BRCA1-BARD1 directs homologous recombination to sister chromatids. Nature Cell Biology. 2019;21:311–318. doi: 10.1038/s41556-019-0282-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib59] Norio P. Visualization of DNA replication on individual Epstein-Barr virus episomes. Science. 2001;294:2361–2364. doi: 10.1126/science.1064603. [DOI] [PubMed] [Google Scholar]

[bib60] Norio P, Kosiyatrakul S, Yang Q, Guan Z, Brown NM, Thomas S, Riblet R, Schildkraut CL. Progressive activation of DNA replication initiation in large domains of the immunoglobulin heavy chain locus during B cell development. Molecular Cell. 2005;20:575–587. doi: 10.1016/j.molcel.2005.10.029. [DOI] [PubMed] [Google Scholar]

[bib61] Norio P, Schildkraut CL. Plasticity of DNA replication initiation in Epstein-Barr virus episomes. PLOS Biology. 2004;2:e152. doi: 10.1371/journal.pbio.0020152. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib62] Ohta S, Tatsumi Y, Fujita M, Tsurimoto T, Obuse C. The ORC1 cycle in human cells: ii. dynamic changes in the human ORC complex during the cell cycle. The Journal of Biological Chemistry. 2003;278:41535–41540. doi: 10.1074/jbc.M307535200. [DOI] [PubMed] [Google Scholar]

[bib63] Okuno Y, McNairn AJ, den Elzen N, Pines J, Gilbert DM. Stability, chromatin association and functional activity of mammalian pre-replication complex proteins during the cell cycle. The EMBO Journal. 2001;20:4263–4277. doi: 10.1093/emboj/20.15.4263. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib64] Pannetier M, Julien E, Schotta G, Tardat M, Sardet C, Jenuwein T, Feil R. PR-SET7 and SUV4-20H regulate H4 lysine-20 methylation at imprinting control regions in the mouse. EMBO Reports. 2008;9:998–1005. doi: 10.1038/embor.2008.147. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib65] Papior P, Arteaga-Salas JM, Günther T, Grundhoff A, Schepers A. Open chromatin structures regulate the efficiencies of pre-RC formation and replication initiation in Epstein-Barr virus. Journal of Cell Biology. 2012;198:509–528. doi: 10.1083/jcb.201109105. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib66] Petryk N, Kahli M, d'Aubenton-Carafa Y, Jaszczyszyn Y, Shen Y, Silvain M, Thermes C, Chen CL, Hyrien O. Replication landscape of the human genome. Nature Communications. 2016;7:10208. doi: 10.1038/ncomms10208. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib67] Petryk N, Dalby M, Wenger A, Stromme CB, Strandsby A, Andersson R, Groth A. MCM2 promotes symmetric inheritance of modified histones during DNA replication. Science. 2018;361:1389–1392. doi: 10.1126/science.aau0294. [DOI] [PubMed] [Google Scholar]

[bib68] Picard F, Cadoret JC, Audit B, Arneodo A, Alberti A, Battail C, Duret L, Prioleau MN. The spatiotemporal program of DNA replication is associated with specific combinations of chromatin marks in human cells. PLOS Genetics. 2014;10:e1004282. doi: 10.1371/journal.pgen.1004282. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib69] Pope BD, Ryba T, Dileep V, Yue F, Wu W, Denas O, Vera DL, Wang Y, Hansen RS, Canfield TK, Thurman RE, Cheng Y, Gülsoy G, Dennis JH, Snyder MP, Stamatoyannopoulos JA, Taylor J, Hardison RC, Kahveci T, Ren B, Gilbert DM. Topologically associating domains are stable units of replication-timing regulation. Nature. 2014;515:402–405. doi: 10.1038/nature13986. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib70] Powell SK, MacAlpine HK, Prinz JA, Li Y, Belsky JA, MacAlpine DM. Dynamic loading and redistribution of the Mcm2-7 helicase complex through the cell cycle. The EMBO Journal. 2015;34:531–543. doi: 10.15252/embj.201488307. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib71] Prioleau MN, MacAlpine DM. DNA replication origins-where do we begin? Genes & Development. 2016;30:1683–1697. doi: 10.1101/gad.285114.116. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib72] R Development Core Team . Vienna, Austria: R Foundation for Statistical Computing; 2018. https://www.R-project.org [Google Scholar]

[bib73] Ramírez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Research. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib74] Remus D, Beuron F, Tolun G, Griffith JD, Morris EP, Diffley JF. Concerted loading of Mcm2-7 double hexamers around DNA during DNA replication origin licensing. Cell. 2009;139:719–730. doi: 10.1016/j.cell.2009.10.015. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib75] Remus D, Diffley JFX. Eukaryotic DNA replication control: lock and load, then fire. Current Opinion in Cell Biology. 2009;21:771–777. doi: 10.1016/j.ceb.2009.08.002. [DOI] [PubMed] [Google Scholar]

[bib76] Rhind N, Gilbert DM. DNA replication timing. Cold Spring Harbor Perspectives in Biology. 2013;5:a010132. doi: 10.1101/cshperspect.a010132. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib77] Ritzi M, Tillack K, Gerhardt J, Ott E, Humme S, Kremmer E, Hammerschmidt W, Schepers A. Complex protein-DNA dynamics at the latent origin of DNA replication of Epstein-Barr virus. Journal of Cell Science. 2003;116:3971–3984. doi: 10.1242/jcs.00708. [DOI] [PubMed] [Google Scholar]

[bib78] Rivera-Mulia JC, Gilbert DM. Replicating large genomes: divide and conquer. Molecular Cell. 2016a;62:756–765. doi: 10.1016/j.molcel.2016.05.007. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib79] Rivera-Mulia JC, Gilbert DM. Replication timing and transcriptional control: beyond cause and effect-part III. Current Opinion in Cell Biology. 2016b;40:168–178. doi: 10.1016/j.ceb.2016.03.022. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib80] Rowles A, Tada S, Blow JJ. Changes in association of the Xenopus origin recognition complex with chromatin on licensing of replication origins. Journal of Cell Science. 1999;112:2011–2018. doi: 10.1242/jcs.112.12.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib81] Ryba T, Hiratani I, Lu J, Itoh M, Kulik M, Zhang J, Schulz TC, Robins AJ, Dalton S, Gilbert DM. Evolutionarily conserved replication timing profiles predict long-range chromatin interactions and distinguish closely related cell types. Genome Research. 2010;20:761–770. doi: 10.1101/gr.099655.109. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib82] Sasaki T, Ramanathan S, Okuno Y, Kumagai C, Shaikh SS, Gilbert DM. The chinese hamster dihydrofolate reductase replication origin decision point follows activation of transcription and suppresses initiation of replication within transcription units. Molecular and Cellular Biology. 2006;26:1051–1062. doi: 10.1128/MCB.26.3.1051-1062.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib83] Schaarschmidt D, Ladenburger EM, Keller C, Knippers R. Human mcm proteins at a replication origin during the G1 to S phase transition. Nucleic Acids Research. 2002;30:4176–4185. doi: 10.1093/nar/gkf532. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib84] Schepers A, Ritzi M, Bousset K, Kremmer E, Yates JL, Harwood J, Diffley JF, Hammerschmidt W. Human origin recognition complex binds to the region of the latent origin of DNA replication of Epstein-Barr virus. The EMBO Journal. 2001;20:4588–4602. doi: 10.1093/emboj/20.16.4588. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib85] Schwartz YB, Kahn TG, Pirrotta V. Characteristic low density and shear sensitivity of cross-linked chromatin containing polycomb complexes. Molecular and Cellular Biology. 2005;25:432–439. doi: 10.1128/MCB.25.1.432-439.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib86] Shoaib M, Walter D, Gillespie PJ, Izard F, Fahrenkrog B, Lleres D, Lerdrup M, Johansen JV, Hansen K, Julien E, Blow JJ, Sørensen CS. Histone H4K20 methylation mediated chromatin compaction threshold ensures genome integrity by limiting DNA replication licensing. Nature Communications. 2018;9:3704. doi: 10.1038/s41467-018-06066-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib87] Siddiqui K, Stillman B. ATP-dependent assembly of the human origin recognition complex. Journal of Biological Chemistry. 2007;282:32370–32383. doi: 10.1074/jbc.M705905200. [DOI] [PubMed] [Google Scholar]

[bib88] Sima J, Bartlett DA, Gordon MR, Gilbert DM. Bacterial artificial chromosomes establish replication timing and sub-nuclear compartment de novo as extra-chromosomal vectors. Nucleic Acids Research. 2018;46:1810–1820. doi: 10.1093/nar/gkx1265. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib89] Sima J, Chakraborty A, Dileep V, Michalski M, Klein KN, Holcomb NP, Turner JL, Paulsen MT, Rivera-Mulia JC, Trevilla-Garcia C, Bartlett DA, Zhao PA, Washburn BK, Nora EP, Kraft K, Mundlos S, Bruneau BG, Ljungman M, Fraser P, Ay F, Gilbert DM. Identifying Cis elements for spatiotemporal control of mammalian DNA replication. Cell. 2019;176:816–830. doi: 10.1016/j.cell.2018.11.036. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib90] Smith OK, Aladjem MI. Chromatin structure and replication origins: determinants of chromosome replication and nuclear organization. Journal of Molecular Biology. 2014;426:3330–3341. doi: 10.1016/j.jmb.2014.05.027. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib91] Smith DJ, Whitehouse I. Intrinsic coupling of lagging-strand synthesis to chromatin assembly. Nature. 2012;483:434–438. doi: 10.1038/nature10895. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib92] Sugimoto N, Maehara K, Yoshida K, Ohkawa Y, Fujita M. Genome-wide analysis of the spatiotemporal regulation of firing and dormant replication origins in human cells. Nucleic Acids Research. 2018;46:6683–6696. doi: 10.1093/nar/gky476. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib93] Sun J, Fernandez-Cid A, Riera A, Tognetti S, Yuan Z, Stillman B, Speck C, Li H. Structural and mechanistic insights into Mcm2-7 double-hexamer assembly and function. Genes & Development. 2014;28:2291–2303. doi: 10.1101/gad.242313.114. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib94] Tardat M, Brustel J, Kirsh O, Lefevbre C, Callanan M, Sardet C, Julien E. The histone H4 lys 20 methyltransferase PR-Set7 regulates replication origins in mammalian cells. Nature Cell Biology. 2010;12:1086–1093. doi: 10.1038/ncb2113. [DOI] [PubMed] [Google Scholar]

[bib95] Teytelman L, Ozaydin B, Zill O, Lefrançois P, Snyder M, Rine J, Eisen MB. Impact of chromatin structures on DNA processing for genomic analyses. PLOS ONE. 2009;4:e6700. doi: 10.1371/journal.pone.0006700. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib96] Thurman RE, Rynes E, Humbert R, Vierstra J, Maurano MT, Haugen E, Sheffield NC, Stergachis AB, Wang H, Vernot B, Garg K, John S, Sandstrom R, Bates D, Boatman L, Canfield TK, Diegel M, Dunn D, Ebersol AK, Frum T, Giste E, Johnson AK, Johnson EM, Kutyavin T, Lajoie B, Lee BK, Lee K, London D, Lotakis D, Neph S, Neri F, Nguyen ED, Qu H, Reynolds AP, Roach V, Safi A, Sanchez ME, Sanyal A, Shafer A, Simon JM, Song L, Vong S, Weaver M, Yan Y, Zhang Z, Zhang Z, Lenhard B, Tewari M, Dorschner MO, Hansen RS, Navas PA, Stamatoyannopoulos G, Iyer VR, Lieb JD, Sunyaev SR, Akey JM, Sabo PJ, Kaul R, Furey TS, Dekker J, Crawford GE, Stamatoyannopoulos JA. The accessible chromatin landscape of the human genome. Nature. 2012;489:75–82. doi: 10.1038/nature11232. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib97] Tubbs A, Sridharan S, van Wietmarschen N, Maman Y, Callen E, Stanlie A, Wu W, Wu X, Day A, Wong N, Yin M, Canela A, Fu H, Redon C, Pruitt SC, Jaszczyszyn Y, Aladjem MI, Aplan PD, Hyrien O, Nussenzweig A. Dual roles of poly(dA:dt) Tracts in replication initiation and fork collapse. Cell. 2018;174:1127–1142. doi: 10.1016/j.cell.2018.07.011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib98] van Rossum G. Phyton Tutorial. Centrum Voor Wiskunde en Informatica, Department of Algorithmics and Architecture 1995

[bib99] Vermeulen M, Eberl HC, Matarese F, Marks H, Denissov S, Butter F, Lee KK, Olsen JV, Hyman AA, Stunnenberg HG, Mann M. Quantitative interaction proteomics and genome-wide profiling of epigenetic histone marks and their readers. Cell. 2010;142:967–980. doi: 10.1016/j.cell.2010.08.020. [DOI] [PubMed] [Google Scholar]

[bib100] Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, Burovski E, Peterson P, Weckesser W, Bright J, van der Walt SJ, Brett M, Wilson J, Millman KJ, Mayorov N, Nelson ARJ, Jones E, Kern R, Larson E, Carey CJ, Polat İ, Feng Y, Moore EW, VanderPlas J, Laxalde D, Perktold J, Cimrman R, Henriksen I, Quintero EA, Harris CR, Archibald AM, Ribeiro AH, Pedregosa F, van Mulbregt P, SciPy 1.0 Contributors SciPy 1.0: fundamental algorithms for scientific computing in Python. Nature Methods. 2020;17:261–272. doi: 10.1038/s41592-019-0686-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib101] Wagner GP, Kin K, Lynch VJ. Measurement of mRNA abundance using RNA-seq data: rpkm measure is inconsistent among samples. Theory in Biosciences. 2012;131:281–285. doi: 10.1007/s12064-012-0162-3. [DOI] [PubMed] [Google Scholar]

[bib102] Warnes GR, Bolker B, Bonebakker L, Gentleman R, Huber W, Liaw A, Lumley T. Gplots: Various R Programming Tools for Plotting Data. 2020 https://rdrr.io/cran/gplots/

[bib103] Wickham H. Ggplot: 2 Elegant Graphics for Data Analysis. Berlin, Germany: Springer; 2016. [Google Scholar]

[bib104] Wickham H, Francois R, Henry L, Müller K. Dplyr: A Grammar of Data Manipulation. 2020 https://github.com/tidyverse/dplyr

[bib105] Wu X, Kabalane H, Kahli M, Petryk N, Laperrousaz B, Jaszczyszyn Y, Drillon G, Nicolini FE, Perot G, Robert A, Fund C, Chibon F, Xia R, Wiels J, Argoul F, Maguer-Satta V, Arneodo A, Audit B, Hyrien O. Developmental and cancer-associated plasticity of DNA replication preferentially targets GC-poor, lowly expressed and late-replicating regions. Nucleic Acids Research. 2018;46:10157–10172. doi: 10.1093/nar/gky797. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib106] Yang SC, Rhind N, Bechhoefer J. Modeling genome-wide replication kinetics reveals a mechanism for regulation of replication timing. Molecular Systems Biology. 2010;6:404. doi: 10.1038/msb.2010.61. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib107] Yeeles JT, Deegan TD, Janska A, Early A, Diffley JF. Regulated eukaryotic DNA replication origin firing with purified proteins. Nature. 2015;519:431–435. doi: 10.1038/nature14285. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib108] Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, Liu XS. Model-based analysis of ChIP-Seq (MACS) Genome Biology. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib109] Zhao PA, Rivera-Mulia JC, Gilbert DM. Replication domains: genome compartmentalization into functional replication units. Advances in Experimental Medicine and Biology. 2017;1042:229–257. doi: 10.1007/978-981-10-6955-0_11. [DOI] [PubMed] [Google Scholar]

[bib110] Zhao PA, Sasaki T, Gilbert DM. High-resolution Repli-Seq defines the temporal choreography of initiation, elongation and termination of replication in mammalian cells. Genome Biology. 2020;21:76. doi: 10.1186/s13059-020-01983-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Human ORC/MCM density is low in active genes and correlates with replication time but does not delimit initiation zones

Nina Kirstein

Alexander Buschle

Xia Wu

Stefan Krebs

Helmut Blum

Elisabeth Kremmer

Ina M Vorberg

Wolfgang Hammerschmidt

Laurent Lacroix

Olivier Hyrien

Benjamin Audit

Aloys Schepers

Roles

Abstract

Introduction

Results

Moderate averaging is a suitable approach for ORC and MCM-DH distribution analysis

Figure 1. Moderate averaging represents a valid approach for origin recognition complex/minichromosome maintenance complex (ORC/MCM) chromatin immunoprecipitation followed by sequencing (ChIP-seq) analysis.

Figure 1—figure supplement 1. Experimental validation of cell cycle fractionation and origin recognition complex and minichromosome maintenance complex (ORC/MCM) chromatin immunoprecipitation followed by sequencing quality.

Figure 1—figure supplement 2. The input sequencing control is differentially represented in regions of biological function.

Figure 1—figure supplement 3. Origin recognition complex/minichromosome maintenance complex (ORC/MCM) enrichments at the MCM4/PRKDC origin persists without input normalization.

ORC/MCM are enriched in IZs dependent on transcription

Figure 2. Origin recognition complex/minichromosome maintenance complex (ORC/MCM) enrichment within ascending segments (ASs) depends on active transcription.

Figure 2—figure supplement 1. Characterization of different ascending segment (AS) types.

Figure 2—figure supplement 2. Origin recognition complex/minichromosome maintenance complex (ORC/MCM) enrichments within ascending segments (ASs) without input normalization.

Table 1. Characterization of different AS subtypes.

ORC and MCM are depleted from transcribed gene bodies and enriched at TSSs

Figure 3. Origin recognition complex (ORC) is enriched at active transcription start sites (TSSs) while minichromosome maintenance complex (MCM) is depleted from actively transcribed genes.

Figure 3—figure supplement 1. Replication fork direction (RFD) and origin recognition complex/minichromosome maintenance complex (ORC/MCM) profiles without input normalization at gene extremities.

ORC/MCM genomic distributions are broad and correlate with RT but not IZs

Figure 4. Origin recognition complex/minichromosome maintenance complex (ORC/MCM) levels correlate with replication timing (RT) and transcriptional activity but are otherwise homogeneously distributed along the genome and uncorrelated to replication fork direction (RFD) patterns.

Figure 4—figure supplement 2. Kolmogorov–Smirnov statistics between the origin recognition complex/minichromosome maintenance complex (ORC/MCM).

Cell cycle dynamics of ORC and MCM binding

Figure 5. H4K20me3 selectively marks a subset of late-replicating non-genic ascending segments (ASs).

Figure 5—figure supplement 1. Origin recognition complex/minichromosome maintenance complex (ORC/MCM) is enriched in late-replicating, H4K20me3-high non-genic ascending segment (AS) and null RFD region (NRR) windows.

Table 2. Ratio of chromatin immunoprecipitation (ChIP) mean relative read frequencies in early versus late replication timing domains and G1 versus S-G2-M samples.

Late-replicating non-genic ASs and NRRs are characterized by H4K20me3

Figure 6. H4K20me3 is enriched in late-replicating null RFD regions (NRR).

Discussion

Conclusion

Figure 7. Model for replication organization in higher eukaryotes.

Figure 7—figure supplement 1. Early replication control elements (ERCEs) correlate with replication initiation.

Materials and methods

Key resources table.

Cell culture

RNA extraction, sequencing, and TPM calculation

Replication fork directionality profiling using OK-seq method in Raji

Determining regions of ascending, descending, and constant RFD

Centrifugal elutriation and flow cytometry

Generation of GAPDH monoclonal antibody

Chromatin cross-linking with formaldehyde

Cyclin western blot

Chromatin sonication

Chromatin immunoprecipitation and qPCR quality control

ChIP-sample sequencing

Binning approach and normalization

Relation of ChIP relative read frequencies to Orc2 (K562) and DNase hypersensitivity

Comparison of ChIP relative read frequencies to replication data

Comparison of ChIP relative read frequencies to transcription data

Comparison of ChIP relative read frequencies to RT

Comparison of ChIP relative read frequencies distributions at different RT depending on transcriptional and replicative status

Statistics

ERCE RFD profiles

Acknowledgements

Funding Statement

Contributor Information

Funding Information

Additional information

Competing interests

Author contributions

Additional files

Data availability

References

Decision letter

Roles

Author response

Author response image 1. Comparison of our ORC/MCM ChIPs with previous Orc1 and Orc2 ChIP-seq data.

Associated Data