Abstract
Chromatin assembled with the histone H3 variant CENP-A is the heritable epigenetic determinant of human centromere identity. Using genome-wide mapping and reference models for 23 human centromeres, CENP-A binding sites are identified within the megabase-long, repetitive α-satellite DNAs at each centromere. CENP-A is shown in early G1 to be assembled into nucleosomes within each centromere and onto 11,390 transcriptionally active sites on the chromosome arms. DNA replication is demonstrated to remove ectopically loaded, non-centromeric CENP-A. In contrast, tethering of centromeric CENP-A to the sites of DNA replication through the constitutive centromere associated network (CCAN) is shown to enable precise reloading of centromere-bound CENP-A onto the same DNA sequences as in its initial pre-replication loading. Thus, DNA replication acts as an error correction mechanism for maintaining centromere identity through its removal of non-centromeric CENP-A coupled with CCAN-mediated retention and precise reloading of centromeric CENP-A.
Introduction
Correct chromosome segregation relies on a unique chromatin domain known as the centromere. Human centromeres are located on megabase long1 chromosomal regions and are comprised of tandemly repeated arrays of a ~171 bp element, termed α-satellite DNA2–4. CENP-A is a histone H3 variant5,6 that replaces histone H3 in chromatin assembled onto ~3% of α-satellite DNA repeats7,8, and is flanked by pericentric heterochromatin containing H3K9me2/39. Nevertheless, α-satellite DNA sequences are neither sufficient nor essential for centromere identity2,10, as demonstrated by several measures including identification of multiple examples of acquisition of a new centromere (referred to as a neocentromere) at a new location coupled with inactivation of the original centromere11.
This has led to a consensus view that mammalian centromeres are defined by an epigenetic mark2. Use of gene replacement in human cells and fission yeast has identified the mark to be CENP-A-containing chromatin12, which maintains and propagates centromere function indefinitely by recruiting CENP-C and the 16 subunit constitutive centromere associated network (CCAN)13–16. We8 and others17 have shown that the overwhelming majority of human CENP-A chromatin particles are octameric nucleosomes containing two molecules of CENP-A at all cell cycle points and with heterotypic CENP-A/histone H3-containing nucleosomes comprising at most 2% of CENP-A-containing chromatin8.
During DNA replication, initially bound CENP-A is quantitatively redistributed to each daughter centromere18, while incorporation of new CENP-A into chromatin occurs only after exit from mitosis18,19 when its loading chaperone HJURP20,21 is active22. Temporal separation of centromeric DNA replication from new CENP-A chromatin assembly raises the important question of how is the centromeric epigenetic mark maintained across the cell cycle when it would be expected to be dislodged by DNA replication and diluted at each centromere as no new CENP-A is assembled until the next G118. Moreover, endogenous CENP-A comprises only ~0.1% of the total histone H3 variants. Recognizing that a proportion of CENP-A is assembled at the centromeres with the remainder loaded onto sites on the chromosome arms7,8,23, long-term maintenance of centromere identity and function requires limiting accumulation of non-centromeric CENP-A. Indeed, artificially increasing CENP-A expression in human cells increases ectopic deposition at non-centromeric sites, accompanied by chromosome segregation aberrations23–26.
Using CENP-A chromatin immunoprecipitation and mapping onto centromere reference models for the centromere of each human chromosome, we now identify that DNA synthesis acts as an error correction mechanism to maintain epigenetically defined centromere identity by mediating precise reassembly of centromere-bound CENP-A chromatin, while removing ectopically loaded CENP-A found within transcriptionally active chromatin outside of the centromeres.
Results
CENP-A binding at 23 human centromere reference models
We produced HeLa cells expressing either i) CENP-ALAP, a CENP-A variant carboxy-terminally fused to a localization [EYFP] and affinity [His] purification tag27 at one endogenous CENP-A allele (Supplementary Fig. 1a, Fig. 1a) or ii) stably expressing an elevated level (4.5 times the level of CENP-A in parental cells) of CENP-ATAP, a CENP-A fusion with carboxy-terminal tandem affinity purification (S protein and protein A) tags separated by a TEV protease cleavage site (Supplementary Fig. 1b, Fig. 1a). Both CENP-A variants localize to centromeres (Supplementary Fig. 1c,d) and support long-term centromere identity and mediate high fidelity chromosome segregation in the absence of wild type CENP-A7,8. Importantly, while the HeLa cells we use have acquired an aneuploid genome, they are chromosomally stable, with high fidelity centromere function that has maintained the same karyotype over nearly two decades28.
To immunopurify CENP-A-containing chromatin at G1 or G2, chromatin was isolated from synchronized cells (Fig. 1a; Supplementary Fig. 1e). Nuclease digestion was used to produce mono-nucleosomes from chromatin isolated at G1 and G2, yielding the expected 147 bp of protected DNA length for nucleosomes assembled with histone H3 (Fig. 1a,c - upper panel). In parallel, chromatin was also isolated from randomly cycling cells stably expressing TAP tagged histone H3.1 (H3.1TAP – Supplementary Fig. 1b, Fig. 1a)13. CENP-ALAP, CENP-ATAP or H3.1TAP containing chromatin was then affinity purified and eluted under mild conditions with PreScission or TEV protease cleavage (Fig. 1a).
α-satellite DNA sequences were enriched 30–35 fold in DNA isolated from CENP-ATAP or CENP-A+/LAP cells (Fig. 1b), the expected enrichment since α-satellite DNA comprises ~3% of the genome8,29. While micro-capillary electrophoresis of bulk input chromatin produced the expected 147 bp of protected DNA length for nucleosomes assembled with histone H3 (Fig. 1c - upper panel), isolated CENP-ALAP chromatin expressed at endogenous CENP-A levels produced DNA lengths centered on 133 bp, before and after DNA replication (Fig. 1c, lower panel), as previously reported for octameric CENP-A-containing nucleosomes with DNA unwinding at entry and exit8,30.
Affinity purified CENP-ALAP, CENP-ATAP, and H3.1TAP-bound DNAs were sequenced and mapped (Fig. 1a,d; Supplementary Table 1) onto the centromere reference model of the human X chromosome31 and centromere models (incorporated into the HuRef genome hg38) for each human autosome that include the observed variation in α-satellite Higher Order Repeat (HOR) array sequences32,33. Sequences bound by CENP-A were identified (Fig. 1d,e; Supplementary Fig. 2) using algorithm-based scripts [SICER and MACS34,35]. CENP-ALAP expressed at endogenous levels mapped with high reproducibility across the centromeric regions of all 23 reference centromeres (see Fig. 1e for chromosome 18 and Figure S2 for the other 22). Sequences bound were largely unaffected by increasing CENP-A levels by 4.5-fold in CENP-ATAP cells (Figs. 1e and 2a,b).
Centromeric α-satellite arrays varied widely in CENP-A binding, from 10.5-fold enrichment for array D3Z1 in cen3 to 213-fold for array GJ211930.1 in cen10, and with most enriched 20–40-fold relative to input DNAs (Supplementary Table 2). For the 6 (of 17) centromeres which contain more than one α-satellite array, only one array actively bound CENP-A (Supplementary Table 2). Multiple α-satellite arrays in 11 centromeres (Supplementary Table 2) showed enriched CENP-A binding in two or more arrays, consistent with CENP-A binding to a different array in each homologue, as previously shown for cen17 in two diploid human cell lines36. The increased levels of CENP-A in CENP-ATAP cells did not increase the number of centromeric binding peaks (Fig. 1d,e), but elevated CENP-A occupancy at some divergent monomeric α-satellite repeats (Supplementary Fig. 1f) or within HORs (Supplementary Fig. 1g), with both examples occurring in regions with few CENP-B boxes.
CENP-A nucleosomes are retained at centromeric loading sites after DNA replication
Despite the known redistribution of initially centromere-bound CENP-A onto each of the new daughter centromeres without addition of new CENP-A18, comparison of the sequences bound by CENP-A in G1 with those bound in G2 revealed that for all 23 centromeres, at both normal (CENP-A+/LAP) and elevated (CENP-ATAP) levels, CENP-A was bound to indistinguishable α-satellite sequences before and after DNA replication (Fig. 1e and Supplementary Fig. 2). Indeed, almost all (87%) of α−satellite binding peaks algorithmically identified for CENP-ATAP during G1 remained at G2 (Supplementary Fig. 1h, top). A similarly high (89%) retention of CENP-A peaks found in G1 remained at G2 in CENP-ALAP cells with CENP-A expressed at endogenous levels (Supplementary Fig. 1h, bottom). After filtering out multi-mapping reads, 96 single copy, centromeric CENP-A binding sites were identified within the HORs of the 23 reference centromeres. Remarkably, examination of these before and after DNA replication in CENP-ATAP cells revealed quantitative retention of CENP-A in G2 in almost all (93 of 96) of these unique centromeric sites and with the remaining 3 peaks only slightly diminished (Fig. 2; Supplementary Fig. 1i).
Ectopic CENP-A assembled onto chromosome arms in early G1 is removed by G2
In addition to the striking enrichment at centromeric α-satellites, genome-wide mapping of CENP-A-bound DNAs revealed preferential and highly reproducible incorporation into unique sequences, non-α-satellite sites on the arms of all 23 chromosomes (Figs. 3a,b and 4). At endogenous CENP-A levels, 11,390 ectopic sites were identified, 620 of which were loaded ≥5 fold over background. Sites enriched for bound CENP-A were essentially identical in DNAs from randomly cycling cells or G1 cells (Fig. 3a,b). While a 4.5-fold increase in CENP-A levels in CENP-ATAP cells did not increase the binding peaks (Fig. 1d) or the number of unique single copy sites within centromeric HORs (Supplementary Fig. 1i, bottom), it drove an increased number of sites of CENP-A incorporation on the arms, producing 40,279 non-centromeric sites (Fig. 3d), 12,550 of which were loaded ≥5-fold over background (Spplementary Fig. 3a).
Remarkably, for all 23 human chromosomes and for CENP-A accumulated to endogenous (CENP-ALAP) or increased (CENP-ATAP) expression levels, passage from G1 to G2 almost eliminated enrichment of CENP-A binding to specific sites on the chromosome arms, while leaving α-satellite bound sequences unaffected (Figs. 1d, 3, 4). Loss by G2 of CENP-A binding at specific arm sites was highly reproducible (see experimental replicas in Fig. 3b). Scoring peak binding sites with thresholds of ≥ 5-, 10- or 100-fold of CENP-A binding over background, at least 90% of sites bound on chromosome arms in G1 in CENP-ATAP cells were removed by early G2 (Spplementary Fig. 3a) and all of those still identified in G2 were substantially reduced in peak height.
Ectopic CENP-A removal after DNA replication was confirmed using CENP-A chromatin immunoprecipitation following synchronization in G1 or mitosis (Supplementary Fig. 3b) in a second, nearly diploid human cell line (DLD-1) in which the two CENP-A alleles were modified to encode a degron tagged, auxin-inducible degradable CENP-AAID or a doxycycline-inducible CENP-AWT 37. Levels of CENP-A loaded at each of four ectopic sites before (G1) and after DNA replication revealed that almost all (85–90%) of ectopic CENP-A loaded in G1 was removed by mitotic entry (Fig. 3e).
Neocentromeres are not at sites of ectopic CENP-A loading
Recognizing that ectopic loading of CENP-A on chromosomes could be one component of neocentromere formation, we tested if the positions of known human neocentromeres38 are sites of preferential ectopic CENP-A loading. Despite cytogenetic positioning of many reported neocentromeres11, only two have been precisely mapped39. The first (PDNC4) spans 300 kb on chromosome 440 (87.278 to 87.578 Mb in hg38). In CENP-ATAP cells, only four sites of elevated CENP-A were present in G1 within the genomic region of this neocentromere; only two of which had CENP-A loading >5-fold over the background. Importantly, all sites were removed in G2 derived chromatin (Fig. 5a). An additional neocentromere (IMS13q) maps to a 100 kb region on chromosome 1341 (97.047 to 97.147 Mb in hg38 assembly39). CENP-A binding to this region in CENP-ATAP cells in G1 did not differ in density of peaks or peak heights with many similarly loaded sites scattered across the long arm of chromosome 13 (Supplementary Fig. 3c). Passage from G1 to G2 of CENP-ATAP cells stripped almost all CENP-A bound corresponding to the region of the IMS13q neocentromere. While we cannot exclude the possibility that other cell types have different epigenetic landscapes that affect the sites to which CENP-A binds at non-centromeric regions, our examination of these two best defined neocentromeres offers no support for neocentromere formation arising at the site of an inherent hotspot of ectopic CENP-A loading.
CENP-A is ectopically loaded at early G1 into open/active chromatin
The sites on the chromosome arms into which CENP-A was assembled in G1 in CENP-ATAP cells were enriched 2-fold (compared to levels expected by chance) at promoters or enhancers of expressed genes, with a 2.5-fold enrichment at sites bound by the transcriptional repressor CTCF (Supplementary Fig. 3e), trends similar to previous reports for cells with increased CENP-A23. More than 80% of CENP-ATAP binding sites on chromosome arms with peak heights ≥ 5-fold over background (Fig. 5b–d) overlapped with DNase I hypersensitive, accessible chromatin sites identified by the ENCODE project and which are functionally related to transcriptional activity. Similarly, CENP-A expressed at endogenous levels was enriched 3-fold at DNase I hypersensitive sites (Fig. 5e) and promoters (Supplementary Fig. 3g).
Most (80%) of non-centromeric CENP-ATAP binding peaks overlapped with H3k4me1 and H3k4me2 sites found in active and primed enhancers and at transcription factor binding sites identified by ENCODE, with a similar trend for CENP-ALAP (Fig. 5d,e). Ectopic CENP-ATAP or CENP-ALAP (Supplementary Fig. 4a,b) peaks also showed a significant overlap with other marks of active transcription, including H2A.Z, H3K4me3, H3K27ac, H3K36me3 and H3K9ac. Conversely, both CENP-ATAP and CENP-ALAP were not enriched at H3k27me3 sites tightly associated with inactive gene promoters and facultatively repressed genes (Supplementary Fig. 3d,f and Supplementary Fig. 4a,b). CENP-A binding peaks showed a mild (30–40%) overlap with histone modifications H4K20me1 and H3K79me2 (active transcription marks) or the H3K9me3 mark of transcription repression (Supplementary Fig. 4a,b). Overall, most (65% and 93%, respectively) of ectopic CENP-A sites in cells with endogenous or elevated CENP-A were associated with any active transcription mark (Supplementary Fig. 4a,b), consistent with ectopic CENP-A preferentially bound to open, active chromatin.
Ectopic CENP-A in G1 in either CENP-ATAP or CENP-ALAP cells bound to ‘High Occupancy Target’ (HOT) regions defined by highly expressed regions of the genome (and that show binding of unrelated transcription factors without underlying sequence specificity42) were almost quantitatively removed in cells that had progressed through DNA replication (Supplementary Fig. 3h,i), demonstrating that enrichment of CENP-A at such highly expressed regions cannot be a consequence of non-specific binding to “hyper-ChIPable regions”43.
Analysis of published CENP-A ChIP-seq datasets for HT1080 cells44, a human epithelial fibrosarcoma cell line expressing FLAG-tagged CENP-A at ~3-fold the level of parental CENP-A, revealed similar trends: ectopic sites were enriched at DNase I hypersensitive sites and at transcription activation marks (H2A.Z and H3K4me1/2) but were not enriched at the transcription repression mark H3K27me3 (Supplementary Fig. 4c). Similarly, CENP-A ChIP-sequencing datasets from the HuRef human lymphoblastoid cell line45 again revealed that the majority (51%) of ectopic CENP-A accumulated to its endogenous level was found at marks associated with transcription activation (including H2A.Z, H3K36me3 and H4K20me1). CENP-A was not enriched at the transcription repression mark H3K27me3 (Supplementary Fig. 4d), although in all three cell lines analyzed ectopic CENP-A was enriched at H3K9me3 sites associated with heterochromatin of constitutively repressed genes (Supplementary Fig. 4a–d).
Ectopic, but not centromeric, CENP-A is removed by replication fork progression
We next tested whether removal by G2 of CENP-A assembled into nucleosomes at unique sites on the chromosome arms is mediated by the direct action of the DNA replication machinery. CENP-ATAP was affinity purified from mid S phase cells and CENP-A-bound DNAs were sequenced and mapped (Fig. 6a; Supplementary Fig. 5a). In parallel, newly synthesized DNA in synchronized cells was labeled by addition of bromodeoxyuridine (BrdU) for 1 hour at early (S0-S1), mid (S3-S4), and late S phase (S6-S7) (Fig. 6a; Supplementary Fig. 5a). Genomic DNA from each time point was sonicated (Supplementary Fig. 4b) and immunoprecipitated with a BrdU antibody (Fig. 6a). Eluted DNA was then sequenced and mapped to the genome [an approach known as Repli-seq46], yielding regions of early, mid, and late replicating chromatin (an example from a region of an arm of chromosome 20 is shown in Fig. 6b). Early replication timing was validated (Supplementary Fig. 5c) for two genes (MRGPRE and MMP15) previously reported to be early replicating (ENCODE Repli-seq46). Similarly, a gene (HBE1) and a centromeric region (Sat2) previously reported to be late replicating (ENCODE Repli-seq46) were confirmed to be replicated late (Supplementary Fig. 5d).
CENP-A chromatin immunoprecipitated from early and mid S phase cells yielded levels of α-satellite DNA enrichment (Supplementary Fig. 5e) similar to those achieved at G1 phase (Fig. 1b). Furthermore, nucleosomal CENP-A chromatin produced by micrococcal nuclease digestion protected 133 bp of DNA at early and mid S phase (Supplementary Fig. 5f) just as it did in G1 and G2 [Fig. 1c; see also ref8], with no evidence for a structural change from hemisomes to nucleosomes and back to hemisomes during S phase as previously claimed47. Mapping of CENP-A binding sites on chromosome arms, combined with Repli-seq, revealed that almost all (91%) ectopic G1 CENP-A binding was found in early or mid S replicating regions (Fig. 6b,c). While alphoid DNA sequences have been reported to replicate mid-to-late48 S phase, in our cells α-satellite containing DNAs in all 23 centromeres were found almost exclusively to be late replicating (Fig. 6d).
Remarkably, throughout S phase, centromere bound CENP-A found in G1 was completely retained across each reference centromere with the same sequence binding preferences (Fig. 6e; Supplementary Fig. 5g). Retention of CENP-A binding during DNA replication was also observed at the unique sequence binding sites within the HORs of each centromere: all 96 CENP-ATAP G1 peaks at single copy variants within α-satellite HORs remained bound by CENP-A (Fig. 6f; Supplementary Fig. 5h). In contrast, early replicating ectopic CENP-A binding sites were nearly quantitatively removed during or quickly after their replication and were no longer visible at mid-S phase (Fig. 6g,h). Similarly, ectopic CENP-A binding sites found in mid-S replicating regions remained at mid-S but were removed quickly after that and were absent by late S/G2 (Fig. 6i,j). For the 10% of ectopic CENP-A G1 peaks in late-S replicating regions (Fig. 6c), almost all (85%) were removed by G2 (Fig. 6k,l), while late replicating centromeric CENP-A peaks were retained including the single copy variants within the α-satellite HORs (Fig. 6d–f; Fig. 2; Supplementary Fig. 5h). Thus, ectopic, but not centromeric, CENP-A binding sites are removed as DNA replication progresses.
CENP-C/CCAN remain centromeric CENP-A-associated during DNA replication
To comprehensively determine the components which associate with CENP-A chromatin during replication in late S, we used mass spectrometry following affinity purification of CENP-A nucleosomes (Supplementary Fig. 5i, left panel). A structural link that normally bridges multiple centromeric CENP-A nucleosomes and nucleates full kinetochore assembly before mitotic entry is the 16-subunit Constitutive Centromere Associated Network (CCAN)49–52. This complex is anchored to CENP-A primarily through CENP-C50,53,54 and sustained by CENP-B binding to CENP-B box sequences within α-satellite DNAs55. Remarkably, mass spectrometry identified that all 16 CCAN components13,15 remained associated with mono-nucleosomal CENP-A chromatin affinity purified from late S/G2 (Table 1). Stable association with CENP-A was also seen for HJURP, multiple chromatin remodeling factors and nuclear chaperones, histones, centromere and kinetochore components, and other DNA replication proteins (Supplementary Fig. 5j–n). The continuing interaction during DNA replication of CCAN proteins with CENP-A and which is maintained even on mono-nucleosomes provides strong experimental support that the CCAN complex tethers CCAN-bound centromeric CENP-A at or near the centromeric DNA replication forks, thereby enabling its efficient reincorporation after replication fork passage.
Table 1. Mass spectrometry of CENP-A individual nucleosomes reveal all the CCAN network components co-precipitated with CENP-A at late S/G2.
To test this further, the composition of CENP-A-containing nucleosomal complexes from G1 to late S/G2 was determined following affinity purification (via the TAP tag) of chromatin-bound CENP-ATAP from a predominantly mononucleosome pool (Supplementary Fig. 5hi, right panel). We initially focused on the Chromatin Assembly Factor 1 (CAF1) complex, which is required for de novo chromatin assembly following DNA replication56. Its p48 subunit (also known as CAF1 subunit c, RbAp48, or RBBP4) binds histone H457, is a binding partner in a CENP-A pre-nucleosomal complex with HJURP and nucleophosmin (NPM1)21, and maintains the deacetylated state of histones in the central core of centromeres after deposition58. Remarkably, CAF1 p48 co-immunopurified with CENP-A from G1 through late S/G2 (Fig. 7a). In striking contrast, the two other CAF1 subunits (CAF1 p150 and CAF1 p60) that are essential for de novo chromatin assembly in vitro59, remained much more strongly associated with CENP-A nucleosomes in late S/G2 compared to mid-S (Fig. 7a). Additionally, MCM2, a core subunit of the DNA replicative helicase MCM2–7 complex that recycles old histones as the replication fork advances60, was robustly co-purified with CENP-A only in late S phase derived chromatin, with no association detected in mid-S (Fig. 7a).
CENP-C is essential for the maintenance of centromeric CENP-A during DNA replication
The stable association only in late S phase (when all centromeric, but only a small minority of ectopically loaded CENP-A, is replicated) of CENP-A with MCM2 and the CAF1 subunits necessary for chromatin reassembly suggested that CENP-C and its CCAN complex tethered centromeric CENP-A to near the replication forks and stabilized CENP-A binding to MCM2 and CAF1, thereby enabling CENP-A reassembly onto the daughter centromeres after DNA replication. We tested this possibililty by rapidly depleting CENP-C just after S phase entry in a human cell line (CENP-CAE/AE) in which both CENP-C alleles were genome-engineered to produce CENP-C fused to both an auxin-inducible degron and EYFP55. Thymidine was used to synchronize these CENP-CAE/AE cells at the G1/S boundary (Fig. 7b–c). CENP-C degradation was induced just after S phase entry in order to test CENP-C’s role specifically during centromeric DNA replication in late S phase (Fig. 7b), but without affecting deposition of new CENP-A that occurs in early G1.
Auxin addition 2 hours after release from thymidine block resulted in polyubiquitination and degradation of almost all CENP-CAE within 15 minutes, as was evident by loss of fluorescence of the EYFP in CENP-CAE (Fig. 7b–d and video S1). DNA replication was then allowed to continue without CENP-C and the CCAN complex it nucleates53,55. At the end of DNA replication, chromatin-bound CENP-A was immunoprecipitated and the enrichment of α-satellite containing DNA was determined. In randomly cycling cells this resulted in 30-fold enrichment of alphoid DNA (Fig. 7e). At the end of DNA replication and distribution of CENP-A to the two daughter centromeres in CENP-C containing cells, alphoid DNA enrichment was reduced by half, as expected from doubling of centromeric DNA without addition of new CENP-A. However, degradation of CENP-C early in DNA replication led to loss by the end of S phase of most (73%) of CENP-A initially bound to α-satellite DNA (Fig. 7e).
CENP-C-dependent retention of centromeric CENP-A late in S phase was confirmed by examination of two specific α-satellite variants found within the HORs of the centromeres of chromosomes 8 (Fig. 7f) and 15 (Fig. 7g). Each of these satellite variants is represented only once in the human genome and each shows precise retention in G2 of CENP-A bound in G1 (Fig. 2). At both variants, α-satellite DNA was enriched 50–60-fold following CENP-A ChIP from randomly cycling cells, which was reduced to half as much after DNA replication. Following CENP-C degradation in early S phase, however, CENP-A was not retained at either site during DNA replication (Fig. 7f,g). Taken together, these results demonstrate that depletion of CENP-C (and CCAN bound to CENP-A53,55) prior to centromere DNA replication results in loss of centromeric CENP-A by the end of S phase.
Discussion
Using reference models for 23 human centromeres, we have identified that during DNA replication CENP-A nucleosomes initially assembled onto centromeric α-satellite repeats are reassembled onto the same spectrum of α-satellite repeat sequences of each daughter centromere as are bound prior to DNA replication. Additionally, genome-wide mapping of sites of CENP-A assembly has identified that when CENP-A is expressed at endogenous levels the selectivity of the histone chaperone HJURP’s loading in early G1 of new CENP-A at or near existing sites of centromeric CENP-A-containing chromatin is insufficient to prevent its loading onto >11,000 sites along the chromosome arms (Fig. 3d). We also show that the number of ectopic sites increases as CENP-A expression levels increase, as has been reported in multiple human cancers39,61,62. Sites of ectopic CENP-A are replicated in early and mid-S (Fig. 6c) and are nearly quantitatively removed as DNA replication progresses (Fig. 6g–l).
Taken together, our evidence identifies that DNA replication functions not only to duplicate centromeric DNA but also as an error correction mechanism to maintain epigenetically-defined centromere position and identity by coupling centromeric CENP-A retention with its removal from assembly sites on the chromosome arms (Fig. 7h). Indeed, our data reveal that CENP-A loaded onto unique, single copy sites within α−satellite DNAs of the 23 reference centromeres is precisely maintained at these sites during and after DNA replication, offering direct support that (at least for each of these single copy sites) the replication machine re-loads CENP-A back onto the exact same centromeric DNA site (Figs. 2, 6f,g).
DNA replication produces a very different situation for CENP-A initially assembled into nucleosomes on the chromosome arms. Sites of ectopically loaded CENP-A are nearly quantitatively stripped during DNA replication (Figs. 3, 4, 6h–m), therefore precluding pre-mitotic acquisition of CENP-A-dependent centromere function at non-centromeric sites and reinforcing centromere position and identity (Fig. 7h). Without such correction, ectopically loaded sites would be maintained cell cycle after cell cycle, potentially recruiting CENP-C and assembly of the CCAN complex13–16. Arm-associated CENP-A/CCAN would present a major problem for faithful assembly and function of a single centromere/kinetochore per chromosome, both by acquisition of partial centromere function and by competition with the authentic centromeres for the pool of available CCAN components. Indeed, high levels of CENP-A expression 1) lead to recruitment of detectable levels of 3 of 16 CCAN components (CENP-C, CENP-N and Mis18) assembled onto the arms23,24,63, 2) are associated with ongoing chromosome segregation errors25, and 3) have been reported in several cancers where it has been proposed to be associated with increased invasiveness and poor prognosis26,61,62.
As to the mechanism for retention during DNA replication of centromeric but not ectopically loaded CENP-A, our mass spectrometry analysis identifies a strong association of HJURP with CENP-A mono-nucleosomes at late S phase, comparable to the association identified in G1 (Supplementary Fig. 5l), supporting a probable role for HJURP in CENP-A retention, perhaps through interaction with MCM2–7 complex, as has previously been suggested21. This is consistent with evidence that HJURP can associate with MCM2 in a histone-independent manner60, consistent with a possible co-chaperone relationship for CENP-A. Moreover, degradation of HJURP in early S reduces centromeric CENP-A retention through S phase64.
Most importantly, our evidence demonstrates that the local reassembly of CENP-A within centromeric domains requires the continuing centromeric CENP-A association with CCAN complexes (Fig. 7) which act to tether disassembled CENP-A/H4 near to the sites of centromere DNA replication. This local CENP-C/CCAN-dependent retention of CENP-A, coupled with the actions of the MCM2 replicative helicase, HJURP, and CAF1, enables CENP-A’s precise reassembly into chromatin within each daughter centromere, thereby maintaining epigenetically defined centromere identity.
Supplementary Material
Acknowledgments
We thank A. Desai, P. Ly and C. Eissler for critical discussion, L.E.T. Jansen (Gulbenkian Institute, Portugal) for reagents, D-H. Kim for productive discussions and technical help. This work was supported by grants (R01 GM-074150 and R35 GM-122476) from the NIH to D.W.C., who receives salary support from Ludwig Cancer Research.
Footnotes
Conflict of Interests
The authors declare that they have no conflict of interest.
References
- 1.Wevrick R & Willard HF Long-range organization of tandem arrays of alpha satellite DNA at the centromeres of human chromosomes: high-frequency array-length polymorphism and meiotic stability. Proc Natl Acad Sci U S A 86, 9394–8 (1989). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Cleveland DW, Mao Y & Sullivan KF Centromeres and kinetochores: from epigenetics to mitotic checkpoint signaling. Cell 112, 407–21 (2003). [DOI] [PubMed] [Google Scholar]
- 3.Willard HF Chromosome-specific organization of human alpha satellite DNA. American journal of human genetics 37, 524–32 (1985). [PMC free article] [PubMed] [Google Scholar]
- 4.Manuelidis L & Wu JC Homology between human and simian repeated DNA. Nature 276, 92–4 (1978). [DOI] [PubMed] [Google Scholar]
- 5.Earnshaw WC & Rothfield N Identification of a family of human centromere proteins using autoimmune sera from patients with scleroderma. Chromosoma 91, 313–21 (1985). [DOI] [PubMed] [Google Scholar]
- 6.Palmer DK, O’Day K, Wener MH, Andrews BS & Margolis RLA 17-kD centromere protein (CENP-A) copurifies with nucleosome core particles and with histones. J Cell Biol 104, 805–15 (1987). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bodor DL et al. The quantitative architecture of centromeric chromatin. eLife 3, e02137 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Nechemia-Arbely Y et al. Human centromeric CENP-A chromatin is a homotypic, octameric nucleosome at all cell cycle points. J Cell Biol 216, 607–621 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Sullivan BA & Karpen GH Centromeric chromatin exhibits a histone modification pattern that is distinct from both euchromatin and heterochromatin. Nature structural & molecular biology 11, 1076–83 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Stimpson KM & Sullivan BA Epigenomics of centromere assembly and function. Current opinion in cell biology 22, 772–80 (2010). [DOI] [PubMed] [Google Scholar]
- 11.Marshall OJ, Chueh AC, Wong LH & Choo KH Neocentromeres: new insights into centromere structure, disease development, and karyotype evolution. Am J Hum Genet 82, 261–82 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Fachinetti D et al. A two-step mechanism for epigenetic specification of centromere identity and function. Nature cell biology 15, 1056–66 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Foltz DR et al. The human CENP-A centromeric nucleosome-associated complex. Nature cell biology 8, 458–69 (2006). [DOI] [PubMed] [Google Scholar]
- 14.Okada M et al. The CENP-H-I complex is required for the efficient incorporation of newly synthesized CENP-A into centromeres. Nat Cell Biol 8, 446–57 (2006). [DOI] [PubMed] [Google Scholar]
- 15.Hori T et al. CCAN makes multiple contacts with centromeric DNA to provide distinct pathways to the outer kinetochore. Cell 135, 1039–52 (2008). [DOI] [PubMed] [Google Scholar]
- 16.Hori T, Shang WH, Takeuchi K & Fukagawa T The CCAN recruits CENP-A to the centromere and forms the structural core for kinetochore assembly. The Journal of cell biology 200, 45–60 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Padeganeh A et al. Octameric CENP-A nucleosomes are present at human centromeres throughout the cell cycle. Current biology: CB 23, 764–9 (2013). [DOI] [PubMed] [Google Scholar]
- 18.Jansen LE, Black BE, Foltz DR & Cleveland DW Propagation of centromeric chromatin requires exit from mitosis. The Journal of cell biology 176, 795–805 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Nechemia-Arbely Y, Fachinetti D & Cleveland DW Replicating centromeric chromatin: spatial and temporal control of CENP-A assembly. Experimental cell research 318, 1353–60 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Foltz DR et al. Centromere-specific assembly of CENP-a nucleosomes is mediated by HJURP. Cell 137, 472–84 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Dunleavy EM et al. HJURP is a cell-cycle-dependent maintenance and deposition factor of CENP-A at centromeres. Cell 137, 485–97 (2009). [DOI] [PubMed] [Google Scholar]
- 22.Silva MC et al. Cdk activity couples epigenetic centromere inheritance to cell cycle progression. Developmental Cell 22, 52–63 (2012). [DOI] [PubMed] [Google Scholar]
- 23.Lacoste N et al. Mislocalization of the centromeric histone variant CenH3/CENP-A in human cells depends on the chaperone DAXX. Molecular cell 53, 631–44 (2014). [DOI] [PubMed] [Google Scholar]
- 24.Van Hooser AA et al. Specification of kinetochore-forming chromatin by the histone H3 variant CENP-A. Journal of cell science 114, 3529–42 (2001). [DOI] [PubMed] [Google Scholar]
- 25.Shrestha RL et al. Mislocalization of centromeric histone H3 variant CENP-A contributes to chromosomal instability (CIN) in human cells. Oncotarget 8, 46781–46800 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Filipescu D et al. Essential role for centromeric factors following p53 loss and oncogenic transformation. Genes Dev 31, 463–480 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Mata JF, Lopes T, Gardner R & Jansen LE A rapid FACS-based strategy to isolate human gene knockin and knockout clones. PloS one 7, e32646 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Landry JJ et al. The genomic and transcriptomic landscape of a HeLa cell line. G3 3, 1213–24 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Hayden KE et al. Sequences associated with centromere competency in the human genome. Molecular and cellular biology 33, 763–72 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Conde e Silva N et al. CENP-A-containing nucleosomes: easier disassembly versus exclusive centromeric localization. Journal of molecular biology 370, 555–73 (2007). [DOI] [PubMed] [Google Scholar]
- 31.Miga KH et al. Centromere reference models for human chromosomes X and Y satellite arrays. Genome research 24, 697–707 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Schneider VA et al. Evaluation of GRCh38 and de novo haploid genome assemblies demonstrates the enduring quality of the reference assembly. Genome Res 27, 849–864 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Levy S et al. The diploid genome sequence of an individual human. PLoS biology 5, e254 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Zhang Y et al. Model-based analysis of ChIP-Seq (MACS). Genome biology 9, R137 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Zang C et al. A clustering approach for identification of enriched domains from histone modification ChIP-Seq data. Bioinformatics 25, 1952–8 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Maloney KA et al. Functional epialleles at an endogenous human centromere. Proc Natl Acad Sci U S A 109, 13704–9 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Ly P et al. Selective Y centromere inactivation triggers chromosome shattering in micronuclei and repair by non-homologous end joining. Nat Cell Biol 19, 68–75 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Amor DJ & Choo KH Neocentromeres: role in human disease, evolution, and centromere study. American journal of human genetics 71, 695–714 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hasson D et al. The octamer is the major form of CENP-A nucleosomes at human centromeres. Nature structural & molecular biology 20, 687–95 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Amor DJ et al. Human centromere repositioning “in progress”. Proceedings of the National Academy of Sciences of the United States of America 101, 6542–7 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Alonso A et al. Co-localization of CENP-C and CENP-H to discontinuous domains of CENP-A chromatin at human neocentromeres. Genome biology 8, R148 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Li H et al. Functional annotation of HOT regions in the human genome: implications for human disease and cancer. Sci Rep 5, 11633 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Teytelman L, Thurtle DM, Rine J & van Oudenaarden A Highly expressed loci are vulnerable to misleading ChIP localization of multiple unrelated proteins. Proc Natl Acad Sci U S A 110, 18602–7 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Thakur J & Henikoff S Unexpected conformational variations of the human centromeric chromatin complex. Genes Dev 32, 20–25 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Henikoff JG, Thakur J, Kasinathan S & Henikoff S A unique chromatin complex occupies young alpha-satellite arrays of human centromeres. Sci Adv 1(2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Hansen RS et al. Sequencing newly replicated DNA reveals widespread plasticity in human replication timing. Proc Natl Acad Sci U S A 107, 139–44 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Bui M et al. Cell-cycle-dependent structural transitions in the human CENP-A nucleosome in vivo. Cell 150, 317–26 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Erliandri I et al. Replication of alpha-satellite DNA arrays in endogenous human centromeric regions and in human artificial chromosome. Nucleic Acids Res 42, 11502–16 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Guse A, Carroll CW, Moree B, Fuller CJ & Straight AF In vitro centromere and kinetochore assembly on defined chromatin templates. Nature 477, 354–8 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Carroll CW, Milks KJ & Straight AF Dual recognition of CENP-A nucleosomes is required for centromere assembly. J Cell Biol 189, 1143–55 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Petrovic A et al. Structure of the MIS12 Complex and Molecular Basis of Its Interaction with CENP-C at Human Kinetochores. Cell 167, 1028–1040 e15 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Rago F, Gascoigne KE & Cheeseman IM Distinct organization and regulation of the outer kinetochore KMN network downstream of CENP-C and CENP-T. Curr Biol 25, 671–7 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Klare K et al. CENP-C is a blueprint for constitutive centromere-associated network assembly within human kinetochores. J Cell Biol 210, 11–22 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Weir JR et al. Insights from biochemical reconstitution into the architecture of human kinetochores. Nature 537, 249–253 (2016). [DOI] [PubMed] [Google Scholar]
- 55.Hoffmann S et al. CENP-A Is Dispensable for Mitotic Centromere Function after Initial Centromere/Kinetochore Assembly. Cell Rep 17, 2394–2404 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Smith S & Stillman B Stepwise assembly of chromatin during DNA replication in vitro. EMBO J 10, 971–80 (1991). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Verreault A, Kaufman PD, Kobayashi R & Stillman B Nucleosome assembly by a complex of CAF-1 and acetylated histones H3/H4. Cell 87, 95–104 (1996). [DOI] [PubMed] [Google Scholar]
- 58.Hayashi T et al. Mis16 and Mis18 are required for CENP-A loading and histone deacetylation at centromeres. Cell 118, 715–29 (2004). [DOI] [PubMed] [Google Scholar]
- 59.Kaufman PD, Kobayashi R, Kessler N & Stillman B The p150 and p60 subunits of chromatin assembly factor I: a molecular link between newly synthesized histones and DNA replication. Cell 81, 1105–14 (1995). [DOI] [PubMed] [Google Scholar]
- 60.Huang H et al. A unique binding mode enables MCM2 to chaperone histones H3-H4 at replication forks. Nat Struct Mol Biol 22, 618–26 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Zhang W et al. Centromere and kinetochore gene misexpression predicts cancer patient survival and response to radiotherapy and chemotherapy. Nature communications 7, 12619 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Sun X et al. Elevated expression of the centromere protein-A(CENP-A)-encoding gene as a prognostic and predictive biomarker in human cancers. Int J Cancer 139, 899–907 (2016). [DOI] [PubMed] [Google Scholar]
- 63.Gascoigne KE et al. Induced ectopic kinetochore assembly bypasses the requirement for CENP-A nucleosomes. Cell 145, 410–22 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Zasadzinska E et al. Inheritance of CENP-A Nucleosomes during DNA Replication Requires HJURP. Dev Cell 47, 348–362 e7 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.