Abstract
Centromeres are the chromosomal loci at which spindle microtubules attach to mediate chromosome segregation during mitosis and meiosis. In most eukaryotes, centromeres are made up of highly repetitive DNA sequences (satellite DNA) interspersed with middle repetitive DNA sequences (transposable elements). Despite the efforts to establish complete genomic sequences of eukaryotic organisms, the so-called ‘finished’ genomes are not actually complete because the centromeres have not been assembled due to the intrinsic difficulties in constructing both physical maps and complete sequence assemblies of long stretches of tandemly repetitive DNA. Here we show the first molecular structure of an endogenous Drosophila centromere and the ability of the C-rich dodeca satellite strand to form dimeric i-motifs. The finding of i-motif structures in simple and complex centromeric satellite DNAs leads us to suggest that these centromeric sequences may have been selected not by their primary sequence but by their ability to form noncanonical secondary structures.
Centromere sequences evolve rapidly due to inevitable recombination processes undergone by tandemly repeated sequences but many centromere proteins are conserved1. This paradox could be explained by the presence of a conserved sequence-independent structural motif, rather than a particular sequence motif2. The maintenance of a centromere-specific motif through evolution could be driven by molecular co-evolution of centromeric DNA and centromeric proteins. Importantly, in support of this evolutionary hypothesis, it has been shown that the centromere-specific histone H3 variant CENP-A (CID in Drosophila) has evolved in concert with centromeric satellite DNAs3,4.
Although a centromeric structural motif might be sufficient to direct the formation of centromeric chromatin on its own, the episodic occurrence of centromere activity associated with noncentromeric sequences, neocentromeres5,6, and the frequent inactivation/reactivation of centromeres7,8,9,10,11,12 indicates that centromere specification involves both genomic competency and epigenetic mechanisms13. It is now recognized that CENP-A-containing nucleosomes provide the epigenetic mark to establish the centromere-specific chromatin14,15,16, and that the centromeric chromatin contains blocks of CENP-A nucleosomes interspersed with blocks of canonical histone H3 nucleosomes17. However, the folding of this centromeric chromatin is still not clear and is of much controversy18,19.
Although there is evidence to suggest that each Drosophila melanogaster endogenous centromere is made up of different simple and complex satellite DNAs20,21,22,23,24,25, their molecular structure has yet to be determined. The centromeric region of chromosome 3 of D. melanogaster, as well as the centromeric region of chromosomes 2 and 3 of the sibling species Drosophila simulans and Drosophila mauritiana, contains dodeca satellite 11/12 bp tandem repeats (CCCGTACTGGT/CCCGTACTCGGT) showing asymmetric distribution of guanine and cytosine residues such that one strand is predominantly G-rich and the other C-rich25,26,27.
In order to fully understand the structural and functional aspects of centromeres, it is important to elucidate the types of secondary DNA structures that can be formed by their constituent repeat units. Hence, we determined that the G-rich dodeca satellite strand is able to fold into very stable intramolecular hairpin structures that are stabilized by the formation of noncanonical G:A pairs28, and recently we have shown that not only the type B monomer of the human centromeric alpha-satellite28 but also the type A are able to form dimeric i-motif structures29. The i-motif is a four-stranded intercalated structure formed by the association of two parallel duplexes combined in an antiparallel fashion by forming intercalated hemi-protonated C:C+ base pairs30,31,32,33. As i-motif formation requires protonation of cytosines these structures are more stable at acidic pH, although, depending on particular C-rich sequences, they can fold at neutral pH34,35. I-motifs can also exist at neutral pH under molecular crowded conditions36 and under transcriptionally induced negative superhelicity37.
Since centromere specification may rely on centromeric structural motifs under the control of epigenetic mechanisms, knowledge of the fine structure of the endogenous D. melanogaster centromeres is required to elucidate the potential formation of noncanonical centromeric DNA structures. However, due to the repetitive nature of centric heterochromatin, the endogenous centromeres remain poorly represented in the new D. melanogaster Release 6 reference genome sequence38.
The heterochromatin of D. melanogaster has been subdivided into 61 distinct cytological regions39, and the primary constriction of the third chromosome localizes asymmetrically within the h53 region39. Previous work has shown that the dodeca satellite DNA hybridized very close to the primary constriction of the third chromosome, but extending to the right arm26, and that, occasionally, two hybridization signals can be seen in prometaphase chromosomes27. Moreover, the cytological analysis of the free chromosome arm F(3R)1 has shown that the amount of dodeca satellite on the right arm can be reduced without compromising chromosome segregation26. Initial studies of the long-range structure of the dodeca satellite DNA were reported by27. Common-cutting restriction enzymes that do not cut within dodeca satellite DNA revealed that most of the dodeca satellite was organized in two major blocks. In addition, the distribution of restriction sites in the long-range map suggested that the region between the blocks consists of complex DNA sequences, while the flanking region of one of the blocks is likely to contain another putative block of repetitive DNA yet undescribed.
Results and Discussion
Since a complete physical map across the centromere should extend from chromosome arm 3L to chromosome arm 3R, we set out to isolate bacterial artificial chromosome (BAC) clones that contain dodeca satellite, and to construct a comprehensive map around the dodeca satellite blocks using 20 restriction enzymes that do not occur within the dodeca satellite. Single and double genomic digests were size-fractionated by pulsed-field gel electrophoresis (PFGE) using a “Waltzer” apparatus40, which gives sharp resolution up to 2 Mb (representative digests are shown in Supplementary Fig. S1). In order to obtain dodeca satellite clones, three D. melanogaster BAC libraries were screened: the RPCI-98 library generated by cloning EcoRI-digested genomic DNA and the CHORI-221 and CHORI-223 libraries generated from sheared genomic DNA. BAC end sequencing and fingerprinting data of the stronger clones were used to construct contigs, and five clones were chosen for complete sequencing: CH221-29J09 (containing rDNA intergenic spacer (IGS) related sequences, cIGS), CH211-27P10 (containing Akap200 related sequences) and BACR19P07, BACR16A01 and BACR12I02 that were also positive for the retrotransposon Circe. The presence of Circe sequences in the centromeric region h53 had previously been reported41. By combining data from the sequence of these BACs and the sequence of whole genome shotgun scaffolds containing dodeca satellite with the results of an accurate restriction site mapping of genomic DNA, it has been possible to determine the position and orientation of the first eight scaffolds in the Release 6 assembly of the chromosome arm 3R (Fig. 1a).
The dodeca satellite sequences at this endogenous centromere are organized as two adjacent major blocks, block I and block II, plus several minor blocks (Fig. 1a). Interestingly, a detailed analysis of the sequence of the blocks has shown that block I has more undeca than dodeca repeats (Supplementary Fig. S2) and that the sequence of both repeat units are highly conserved (Supplementary Fig. S3). Moreover, this centromeric region contains transposable elements and two segmental duplications: one results from a duplication of a fragment of Akap200 (chromosome arm 2L at 29C) and subsequent amplification, and the other, located at one edge of block I, results from a duplication of IGS sequences at the nucleolus organizer region (NOR) (Fig. 1a). Nevertheless, FISH to mitotic chromosomes under low-stringency conditions with a cIGS-specific probe has not detected IGS-related sequences in the centromeric region of chromosome 2, although clear cross-hybridization signals occur at the NORs (Supplementary Fig. S4).
To elucidate whether CID interacts with dodeca satellite sequences, immunofluorescence-FISH experiments were performed. Thus, by using SuUR Su(var)3–9 double mutants to suppress the normal under-replication of Drosophila heterochromatin during the process of polytenization, we showed that CID co-localizes on polytene chromosomes with dodeca satellite sequences42. To corroborate this interaction, we increased the resolution performing immunofluorescence-FISH on extended chromatin fibers from cultured S2 cells and found that anti-CID antibody and dodeca satellite signals co-localize in approximately one fourth of the CID-positive fibers (n = 46), consistent with dodeca satellite being present only in the centromeric heterochromatin of chromosome 3, and not in the other four Drosophila melanogaster chromosomes (Fig. 1b). Some of the fibers that were positive for dodeca did not show co-staining for CID. This could indicate that dodeca satellite, most likely block II and minor blocks, extends beyond the centromeric chromatin, which contains CID nucleosomes. A similar scenario has been observed in the centromeres of Arabidopsis thaliana, in which part of the 178 bp satellite repeats extend beyond the centromeric chromatin into pericentromeric regions. This pericentromeric 178-bp satellite associates with H3.1-containing chromatin, while the centromeric 178-bp repeats associate with CENH3-containing chromatine43,44. Although the assembly of Droshophila melanogaster centromere 3 presented here is not completed at a base pair level, our results strongly suggest that dodeca-satellite block I is a good candidate for the centromere of chromosome 3.
During our effort to identify the putative block of simple sequence DNA in the flanking region of the dodeca satellite block 1, we repeated the cytological mapping of the 10 bp satellite (AATAACATAG)n using a fluorescent probe, which improves sensitivity and resolution with respect to results obtained with tritiated probes22. The 10 bp satellite had been mapped by22 to region h37 on the second chromosome (contiguous to the centromeric region h38) and to region h48 on the third chromosome (far away from the centromeric region h53). Unexpectedly, FISH experiments with dodeca and 10 bp satellite probes revealed no additional sites for the 10 bp satellite, but showed a change in its location on the third chromosome from h48 to h52p, a position which is very close to dodeca satellite (Fig. 2a–e). To investigate further the possibility that the flanking satellite DNA corresponds to the 10 bp satellite, we asked whether the 1.2 Mb BssHII fragment containing both dodeca satellite and flanking satellite sequences would hybridize with the 10 bp satellite probe. To this end, genomic DNA was digested with BssHII, size-fractionated by PFGE, transferred to a nylon membrane, hybridized with the dodeca satellite probe and then stripped and re-hybridized with the 10 bp satellite probe. As can be seen in Fig. 2f, there is a 1.2 Mb fragment (labeled with an asterisk) that hybridizes with both probes. Finally, the junction between 10 bp satellite and dodeca satellite sequences was found by searching the Trace Archive database (Fig. 2g). This result indicates that the 10 bp satellite DNA is physically linked to the dodeca satellite DNA. Here, it is important to remember that PROD, a protein required for centromere condensation45 and that specifically recognizes the 10 bp satellite45, is located near but not in the CID-containing chromatin46. Therefore, the physical map constructed comprises two adjacent chromatin domains with distinct functions. Although we have not completed the assembly at a base pair level, the sequence obtained from the five newly sequenced BAC clones, together with pre-existing contigs, with the identification of large blocks of several DNA satellites, and with the correct re-location of the 10 bp satellite from h48 to h52p, represent the most comprehensive physical map and assembly across the centromere of chromosome 3 of Drosophila melanogaster.
To determine the structural behavior of the dodeca satellite DNA, several oligonucleotides containing the dodeca repeat and, its main variant, the undeca repeat were studied by NMR, circular dichroism (CD) and mass spectrometry (MS). The G-rich and the C-rich strands were analyzed under different experimental conditions (Supplementary Fig. S5 and S6). In all cases, the NMR spectra of the G-rich oligonucleotides indicate the formation of G:C base pairs, and no formation of G-tetrads is observed even at high K+ concentrations (Supplementary Fig. S5). This is in agreement with the formation of intramolecular hairpins previously reported47. However, under acidic conditions the NMR spectra of the oligonucleotides corresponding to the C-rich strand of the dodeca and undeca repeats exhibit sharp imino signals around 15–16 ppm, characteristic of i-motif formation (Fig. 3a,b and Supplementary Fig. S6,8). I-motif formation is confirmed by CD spectra, which show the characteristic strong positive band at 285 nm48 (Fig. 3c,d). CD melting experiments show that these structures are quite stable at pH 4.0, with melting temperatures around 42 °C for dodeca and 45 °C for undeca (Supplementary Fig. S7e,f). Mass spectrometry data clearly indicate the formation of dimeric structures at acidic pH (Fig. 3e,f). The peaks corresponding to the dimeric species are not present at neutral pH and become very intense at pH 4 (Supplementary Fig. S7a–d). This pH dependence is consistent with i-motif structures. Interestingly, no tetrameric species are observed in mass spectrometry experiments, indicating that the structures formed by these oligonucleotides are the result of the self-association of two hairpins; similar dimeric structures as those observed in the A and B box of the human alpha satellite28,29. To further explore this similarity, we carried out two-dimensional NMR experiments of the dodeca oligonucleotide. Although a full structural determination is beyond the scope of this study, some interesting information can be readily spotted from the NOESY spectra. Each of the six cytosine imino signals (14.5–16.0 ppm) exhibit NOE cross-peaks with two amino protons (Fig. 4a). The presence of only two cross-peaks with cytosine amino protons instead of four (see Fig. 4c) denotes the formation of C:C+ base pairs between equivalent residues in each subunit. Other NOEs characteristic of i-motifs are also observed, such as imino-imino cross-peaks between adjacent C:C+ base pairs (Fig. 4a) or H1’-H1’ contacts between deoxyribose protons (Fig. 4b). According to these experimental data, we suggest a plausible structural model in which two hairpins self-associate in a head-to-head orientation through the formation of six intermolecular C:C+ base pairs (Fig. 4d).
In order to explore the existence of these structures under conditions that better represent a physiological context, we performed CD experiments at pH 6 and pH 7, and we explored the influence of the crowding agent PEG4000 on the stability of the structure. The CD spectra of both sequences at pH 7 show a maximum of ellipticity around 275 nm and a minimum around 240 nm (Supplementary Fig. S9). These CD spectra are different than those observed in single stranded DNA, and are considered as indicative of “i-motif-like” structures49. At pH 6, the maximum ellipticities of both sequences appear at around 285 nm, which is characteristic of i-motif structures. Importantly, the addition of 20% w/w PEG4000 produces a notable increase of the intensity at 285 nm (Supplementary Fig. S9), which indicates that the crowding conditions favour the formation of the i-motif at this pH. These results are totally consistent with previous observations on the stabilizing effect of crowding agents in other i-motif structures36, and suggest that the crowding environment present in the centromeric nucleosome might favour i-motif formation.
In conclusion, we have shown that the C-rich strand of the dodeca satellite (from both its 11 bp and 12 bp repeat units) is able to form dimeric i-motif structures in vitro. This experimental evidence, together with recent findings of similar i-motif structures in the human centromeric alpha-satellite29 lead us to suggest that we may be observing a structural motif common to centromeric sequences. Interestingly, preliminary results on the 359 bp satellite DNA (centromeric DNA from the Drosophila X chromosome)22 point in this direction, since the 359 bp C-rich region (found at internucleosomal linkers50) can also fold into i-motif structures (Supplementary Fig. S10).
These in vitro results stimulate the study of non-canonical DNA structures in vivo. The recent observation of G-quadruplex structures in vivo -20 years after their discovery in vitro- in telomeres and in gene promoters demonstrates that non-canonical DNA structures, different than the double helix, play significant roles in biological processes51. Much effort is still ahead to probe whether i-motifs exist in vivo or not, although recent findings on gene inhibition by small molecules that bind i-motif structures in promoter regions, suggest they might form in vivo52,53. If the occurrence of these structures were demonstrated in a centromeric nucleosomal context, they could be a structural signature of centromeric regions. Moreover, this would provide further support for our hypothesis that centromeric sequences are selected not by their primary sequence but by their ability to form noncanonical secondary structures54.
Materials and Methods
Drosophila strains and cell lines
Oregon R was used as wild-type strain. The isogenic red e strain was used for the construction of the physical map. Standard culture conditions and media were used. Drosophila S2 cells were grown and maintained as described46.
DNA analysis, sequencing and probes
High molecular weight DNA from 0–12 h Drosophila embryos was prepared in agarose plugs as previously described by Ref. 25,55,56. Restriction enzyme digestions were performed following the suppliers’ recommendations. DNA was analyzed by pulsed-field gel electrophoresis using a “Waltzer” apparatus40, and transferred to Hybond N+ nylon filters (Amersham) in 0.4 M NaOH.
The dodeca satellite probe was pBK6E21825. The 10 bp satellite oligo probe was 5′-AATAACATAGAATAACATAGAATAACATAGAATAACATAGAATAACATAG-3′. The centromeric IGS (cIGS) probe (5.6 kb fragment) was obtained from BACR31J03 using the primers: cIGS-Fw: 5′-TGGCAGCGTTTTAAGGGATG-3′ and cIGS-Rv: 5′-TAAGACGCCTGCAGAGAACG-3′. The PCR was carried out as described by57. The PCR product was cloned in vector pGEM-T (Promega). Plasmid probes were 32P-labeled by random-priming and oligonucleotide probes were 32P-labeled with T4 polynucleotide kinase. The BAC clones were sequenced at The Wellcome Trust Sanger Institute by the standard shotgun sequencing and directed finishing approach. The GenBank accession number for the sequence of BAC19P07, BAC16A01, BAC12I02, CH221-29J09 and CH221-27P10 are CU311183, CR942806, CR942807, CU463787 and CU313318, respectively.
Fluorescence in situ hybridization to mitotic chromosomes
Larval neuroblast chromosomes from Oregon R were prepared as described previously58. Chromosomes were counterstained with 4′,6-diamino-2-phenylindole (DAPI). The dodeca satellite oligo probe 5′-CCCGTACTGGTCCCGTACTGGTCCCGTACTCGGTCCCGTACTCGGT-3′ and the 10 bp satellite oligo probe 5′-AATAACATAGAATAACATAGAATAACATAGAATAACATAGAATAACATAG-3′ were chemically synthesized and labeled at the 5′ end with Cy3 or at the 3′ end with fluorescein (New England Biolabs). DNA probes derived from clones or PCR products were labeled by nick translation with digoxygenin-11-dUTP (Roche) using the DIG-Nick Translation Mix (Roche). Digoxygenin labeled probes were detected with Anti-Digoxigenin-Rhodamine, Fab fragments (Roche) in a 1:200 dilution, following supplier recommendations. Digital images were obtained using a Zeiss Axiover 200 microscope equipped with a cooled Charge-Coupled Device camera. The fluorescent signals were recorded separately as grey-scale digital images and then pseudo-colored and merged using Adobe Photoshop software.
Immunofluorescence-FISH on extended chromatin fibers
Extended chromatin fibers were prepared from S2 cells by centrifuging 5 × 104 cells onto slides at 800 rpm for 4 min in a Cytospin 4 (Thermo Shandon, Pittsburgh, PA), and then slides were dipped into salt detergent lysis buffer (25 mM Tris, pH 7.5, 500 mM NaCl, and 1% Triton X-100) for 25 minutes, slowly and steadily removed using an in-house made device consisting of a modified EasyDip™ Slide Staining System connected to a peristaltic pump, and subsequently fixed in 4% paraformaldehyde (PFA) for 5 minutes. Slides were incubated in 1×PBST (1×PBS ± 0.05% Tween-20) for 15 minutes. Slides were dipped again in the former lysis dilution for 15 minutes, after which they were slowly and steadily removed. Slides were blocked in 1×PBS, 0.1% Triton X-100, 4% formaldehyde for 10 minutes at room temperature and washed for 5 minutes in 1x PBS before proceeding to immunostaining. Slides were blocked in goat serum (Zymed Laboratories) for 30 minutes and incubated overnight at 4 °C with a chicken anti-CID antibody46, diluted to 1:100 in blocking buffer. Slides were washed 3 times for 5 minutes in 1×PBST and incubated for 1 hour at 37 °C in Alexa 488 anti-chicken secondary antibody (Molecular probes). Slides were then washed 3 times in 1×PBST and 3 times in 1×PBS. After immunofluorescence with CID antibodies, slides were re-fixed in 4% formaldehyde for 15 minutes and then hybridized to the dodeca probe. For each slide, around 250 ng of dodeca probe were precipitated with 3 M Sodium Acetate and absolute ethanol, re-suspended in hybridization solution (50% formamide, 10% dextran sulfate, 2×SSC) and denatured for 10 min at 80 °C. Slides were incubated at 37 °C for 24 h.
DNA sample preparation for NMR and MS experiments
Oligonucleotides were purchased from Integrated DNA Technologies, IDT, Coralville, IA, USA. Samples for NMR experiments were dissolved in 9:1 H2O/D2O. Buffer conditions: 25 mM sodium phosphate, 100 mM NaCl pH 4.0 for C rich sequences and 25 mM potassium phosphate, 100 mM KCl pH 7.0 for G rich sequences. The latter were previously annealed by heating at 90 °C for 5 minutes and cooling down to room temperature overnight.
Samples for MS experiments were dissolved at 100 μM in 100 mM NH4OAc buffer at pH 7 and pH 4. pH was adjusted by adding acetic acid and NH3 aliquots.
NMR experiments
All NMR spectra were acquired in Bruker spectrometers operating at 600 and 800 MHz, equipped with cryoprobes and processed with the TOPSPIN software. A jump-and-return pulse sequence59 was employed to observe the rapidly exchanging protons in 1D H2O experiments. In most of the experiments in H2O, water suppression was achieved by including a WATERGATE module in the pulse sequence prior to acquisition. NOESY experiments were recorded at 5 °C, in 25 mM NaPi, pH 4.0 (9:1 H2O/D2O) buffer and with mixing time of 100 ms.
Circular Dichroism spectroscopy
Circular dichroism spectra at different temperatures were recorded on a Jasco J-810 spectropolarimeter fitted with a thermostated cell holder. CD spectra were recorded in 25 mM sodium phosphate buffer, pH 4, with 100 mM NaCl (100 μM oligo concentration). CD melting curves were recorded at the wavelength of the larger positive band, 285 nm with a heating rate of 0.5 °C.min−1.
Experiments with PEG4000 were performed by preparing the sample in buffer 25 mM NaPi, 100 mM NaCl containing 20% w/w PEG4000. A volume of the sample is weighted and an amount of 20% of the measured weight is added as PEG4000 (Sigma-Aldrich) to the sample. Then pH is fixed by adding HCl and NaCl aliquots.
Mass spectrometry
All ESI-MS experiments were carried out in the negative ion mode on an Exactive ESI-Orbitrap mass spectrometer (Thermo Scientific, Bremen, Germany). The ESI spray voltage and capillary voltage used were −2.75 kV and −20 V, respectively. The capillary temperature was set to 150 °C. Tube lens and skimmer voltage were fixed to 180 V and −10 V, respectively. Samples were injected at a flow rate of 4 μL min−1.
Additional Information
How to cite this article: Garavís, M. et al. The structure of an endogenous Drosophila centromere reveals the prevalence of tandemly repeated sequences able to form i-motifs. Sci. Rep. 5, 13307; doi: 10.1038/srep13307 (2015).
Supplementary Material
Acknowledgments
We dedicate this work in memory of our dear colleague and friend, Alfredo Villasante, who died during the revision of this manuscript. We gratefully acknowledge D. V. Laurents for revision of the manuscript. We thank G. H. Karpen for instructing M. Mendez-Lago in the preparation of extended chromatin fibers and for critical comments on the manuscript. We thank the Structural Biophysical Chemistry platform of the IECB (CNRS UMS3033/Inserm US001) for the access to the mass spectrometry facility and Dr. F. Rosu for his kind assistance. We also thank A. Losada for the initial long-range restriction analysis. We acknowledge financial support from MICINN (BFU2014-52864-R to C.G.; BFU2011-30295-C02-01 to A.V.), the Inserm (ATIP-Avenir Grant no. R12086GS to V.G.), the Conseil Régional Aquitaine (Grant no. 20121304005 to V.G.), the EU (FP7-PEOPLE-2012-CIG-333611 to V.G.), the Wellcome Trus, and the institutional grant from the Fundación Ramón Areces to the Centro de Biología Molecular “Severo Ochoa”. M.G. was supported by the FPI-fellowship BES-2009-027909.
Footnotes
Author Contributions A.V., C.G., S.L.W. and V.G. designed and M.G. and M.M.L. performed the experiments. M.G. and C.G. performed and analysed the NMR and CD experiments; M.G. and V.G., the mass spectrometry; and M.M.L. and A.V., the fluorescence and FISH experiments. S.L.W. performed the BAC clone sequencing. A.V. supervised the project and wrote the manuscript with contributions from all authors. C.G. revised the final version of the manuscript.
References
- Henikoff S., Ahmad K. & Malik H. S. The centromere paradox: stable inheritance with rapidly evolving DNA. Science 293, 1098–1102 (2001). [DOI] [PubMed] [Google Scholar]
- Abad J. P. & Villasante A. Searching for a common centromeric structural motif: Drosophila centromeric satellite DNAs show propensity to form telomeric-like unusual DNA structures. Genetica 109, 71–75 (2000). [DOI] [PubMed] [Google Scholar]
- Malik H. S. & Henikoff S. Adaptive evolution of Cid, a centromere-specific histone in Drosophila. Genetics 157, 1293–1298 (2001). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schueler M. G., Swanson W., Thomas P. J. & Green E. D. Adaptive evolution of foundation kinetochore proteins in primates. Mol. Biol. Evol. 27, 1585–1597 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Du Sart D. et al. A functional neo-centromere formed through activation of a latent human centromere and consisting of non-alpha-satellite DNA. Nat. Genet. 16, 144–153 (1997). [DOI] [PubMed] [Google Scholar]
- Williams B. C., Murphy T. D., Goldberg M. L. & Karpen G. H. Neocentromere activity of structurally acentric mini-chromosomes in Drosophila. Nat. Genet. 18, 30–38 (1998). [DOI] [PubMed] [Google Scholar]
- Rocchi M., Archidiacono N., Schempp W., Capozzi O. & Stanyon R. Centromere repositioning in mammals. Heredity 108, 59–67 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Han F., Gao Z. & Birchler J. A. Reactivation of an inactive centromere reveals epigenetic and structural components for centromere specification in maize. Plant Cell 21, 1929–1939 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Agudo M. et al. A dicentric chromosome of Drosophila melanogaster showing alternate centromere inactivation. Chromosoma 109, 190–196 (2000). [DOI] [PubMed] [Google Scholar]
- Sullivan B. A. & Willard H. F. Stable dicentric X chromosomes with two functional centromeres. Nat. Genet. 20, 227–228 (1998). [DOI] [PubMed] [Google Scholar]
- Fisher A. M. et al. Centromeric inactivation in a dicentric human Y; 21 translocation chromosome. Chromosoma 106, 199–206 (1997). [DOI] [PubMed] [Google Scholar]
- Steiner N. C. & Clarke L. A novel epigenetic effect can alter centromere function in fission yeast. Cell 79, 865–874 (1994). [DOI] [PubMed] [Google Scholar]
- Hayden K. E. et al. Sequences associated with centromere competency in the human genome. Mol. Cell. Biol. 33, 763–772 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allshire R. C. & Karpen G. H. Epigenetic regulation of centromeric chromatin: old dogs, new tricks? Nat. Rev. Genet. 9, 923–937 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Black B. E. & Cleveland D. W. Epigenetic centromere propagation and the nature of CENP-a nucleosomes. Cell 144, 471–479 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fachinetti D. et al. A two-step mechanism for epigenetic specification of centromere identity and function. Nat. Cell Biol. 15, 1056–1066 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blower M. D., Sullivan B. A. & Karpen G. H. Conserved organization of centromeric chromatin in flies and humans. Dev. Cell 2, 319–330 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ribeiro S. A. et al. A super-resolution map of the vertebrate kinetochore. Proc. Natl. Acad. Sci. USA. 107, 10484–10489 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sullivan B. A. & Karpen G. H. Centromeric chromatin exhibits a histone modification pattern that is distinct from both euchromatin and heterochromatin. Nat. Struct. Mol. Biol. 11, 1076–1083 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Abad J. et al. Pericentromeric regions containing 1.688 satellite DNA sequences show anti-kinetochore antibody staining in prometaphase chromosomes of Drosophila melanogaster. Mol. Gen. Genet. 264, 371–377 (2000). [DOI] [PubMed] [Google Scholar]
- Agudo M. et al. Centromeres from telomeres? The centromeric region of the Y chromosome of Drosophila melanogaster contains a tandem array of telomeric HeT-A-and TART-related sequences. Nucleic Acids Res. 27, 3318–3324 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lohe A. R., Hilliker A. & Roberts P. Mapping simple repeated DNA sequences in heterochromatin of Drosophila melanogaster. Genetics 134, 1149 (1993). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mendez-Lago M. et al. Novel sequencing strategy for repetitive DNA in a Drosophila BAC clone reveals that the centromeric region of the Y chromosome evolved from a telomere. Nucleic Acids Res. 37, 2264–2273 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun D. et al. Inhibition of human telomerase by a G-quadruplex-interactive compound. J. Med. Chem. 40, 2113–2116 (1997). [DOI] [PubMed] [Google Scholar]
- Abad J. P. et al. Dodeca satellite: a conserved G+C-rich satellite from the centromeric heterochromatin of Drosophila melanogaster. Proc. Natl. Acad. Sci. USA. 89, 4663–4667 (1992). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carmena M., Abad J. P., Villasante A. & Gonzalez C. The Drosophila melanogaster dodecasatellite sequence is closely linked to the centromere and can form connections between sister chromatids during mitosis. J. Cell Sci. 105, 41–50 (1993). [DOI] [PubMed] [Google Scholar]
- Losada A., Abad J., Agudo M. & Villasante A. Long-range analysis of the centromeric region of Drosophila melanogaster chromosome 3. Chromosome Res. 8, 651–653 (2000). [DOI] [PubMed] [Google Scholar]
- Gallego J., Chou S. H. & Reid B. R. Centromeric pyrimidine strands fold into an intercalated motif by forming a double hairpin with a novel T:G:G:T tetrad: solution structure of the d(TCCCGTTTCCA) dimer. J. Mol. Biol. 273, 840–856 (1997). [DOI] [PubMed] [Google Scholar]
- Garavís M., Escaja N., Gabelica V., Villasante A. & González C. Centromeric alpha-satellite DNA adopts dimeric i-motif structures capped by AT Hoogsteen base pairs. Chem. Eur. J. 21, 9816–9824 (2015). [DOI] [PubMed] [Google Scholar]
- Benabou S., Avino A., Eritja R., Gonzalez C. & Gargallo R. Fundamental aspects of the nucleic acid i-motif structures. RSC Adv. 4, 26956–26980 (2014). [Google Scholar]
- Day H. A., Pavlou P. & Waller Z. A. E. i-Motif DNA: Structure, stability and targeting with ligands. Bioorg. Med. Chem. 22, 4407–4418 (2014). [DOI] [PubMed] [Google Scholar]
- Gehring K., Leroy J.-L. & Gueron M. A tetrameric DNA structure with protonated cytosine-cytosine base pairs. Nature 363, 561–565 (1993). [DOI] [PubMed] [Google Scholar]
- Leroy J. L., Gehring K., Kettani A. & Gueron M. Acid multimers of oligodeoxycytidine strands: stoichiometry, base-pair characterization, and proton exchange properties. Biochemistry 32, 6019–6031 (1993). [DOI] [PubMed] [Google Scholar]
- Brazier J. A., Shah A. & Brown G. D. I-Motif formation in gene promoters: unusually stable formation in sequences complementary to known G-quadruplexes. Chem. Commun. 48, 10739–10741 (2012). [DOI] [PubMed] [Google Scholar]
- Leroy J. L., Guéron M., Mergny J. L. & Hélène C. Intramolecular folding of a fragment of the cytosine-rich strand of telomeric DNA into an i-motif. Nucleic Acids Res. 22, 1600–1606 (1994). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cui J., Waltman P., Le V. H. & Lewis E. A. The effect of molecular crowding on the stability of human c-MYC promoter sequence i-motif at neutral pH. Molecules 18, 12751–12767 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sun D. & Hurley L. H. The importance of negative superhelicity in inducing the formation of G-quadruplex and i-motif structures in the c-Myc promoter: implications for drug targeting and control of gene expression. J. Med. Chem. 52, 2863–2874 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoskins R. A. et al. The Release 6 reference sequence of the Drosophila melanogaster genome. Genome Res. 25, 445–458 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gatti M. & Pimpinelli S. Functional elements in Drosophila melanogaster heterochromatin. Annu. Rev. Genet. 26, 239–276 (1992). [DOI] [PubMed] [Google Scholar]
- Southern E., Anand R., Brown W. & Fletcher D. A model for the separation of large DNA molecules by crossed field gel electrophoresis. Nucleic Acids Res. 15, 5925–5943 (1987). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Losada A., Abad J. P., Agudo M. & Villasante A. The analysis of Circe, an LTR retrotransposon of Drosophila melanogaster, suggests that an insertion of non-LTR retrotransposons into LTR elements can create chimeric retroelements. Mol. Biol. Evol. 16, 1341–1346 (1999). [DOI] [PubMed] [Google Scholar]
- Andreyeva E. N. et al. High-resolution analysis of Drosophila heterochromatin organization using SuUR Su(var)3-9 double mutants. Proc. Natl. Acad. Sci. USA. 104, 12819–12824 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vaquero-Sedas M. I. & Vega-Palas M. A. Differential association of Arabidopsis telomeres and centromeres with Histone H3 variants. Sci. Rep. 3, (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang W., Lee H. R., Koo D. H. & Jiang J. Epigenetic modification of centromeric chromatin: hypomethylation of DNA sequences in the CENH3-associated chromatin in Arabidopsis thaliana and maize. Plant Cell 20, 25–34 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Török T., Gorjánácz M., Bryant P. J. & Kiss I. Prod is a novel DNA-binding protein that binds to the 1.686 g/cm3 10 bp satellite repeat of Drosophila melanogaster. Nucleic Acids Res. 28, 3551–3557 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blower M. D. & Karpen G. H. The role of Drosophila CID in kinetochore formation, cell-cycle progression and heterochromatin interactions. Nat. Cell Biol. 3, 730–739 (2001). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ferrer N., Azorin F., Villasante A., Gutierrez C. & Abad J. Centromeric dodeca-satellite DNA sequences form fold-back structures. J. Mol. Biol. 245, 8–21 (1995). [DOI] [PubMed] [Google Scholar]
- Kypr J., Kejnovská I., Renčiuk D. & Vorlíčková M. Circular dichroism and conformational polymorphism of DNA. Nucleic Acids Res. 37, 1713–1725 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dettler J. M. et al. Biophysical characterization of an ensemble of intramolecular i-motifs formed by the human c-MYC NHE III1 P1 promoter mutant sequence. Biophys. J. 99, 561–567 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Käs E. & Laemmli U. K. In vivo topoisomerase II cleavage of the Drosophila histone and satellite III repeats: DNA sequence and structural characteristics. EMBO J. 11, 705 (1992). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Biffi G., Tannahill D., McCafferty J. & Balasubramanian S. Quantitative visualization of DNA G-quadruplex structures in human cells. Nat. Chem. 5, 182–186 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kendrick S. et al. The dynamic character of the BCL2 promoter i-motif provides a mechanism for modulation of gene expression by compounds that bind selectively to the alternative DNA hairpin structure. J. Am. Chem. Soc. 136, 4161–4171 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kang H. J., Kendrick S., Hecht S. M. & Hurley L. H. The transcriptional complex between the BCL2 i-motif and hnRNP LL is a molecular switch for control of gene expression that can be modulated by small molecules. J. Am. Chem. Soc. 136, 4172–4185 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garavís M., González C. & Villasante A. On the origin of the eukaryotic chromosome: the role of noncanonical DNA structures in telomere evolution. Genome Biol. Evol. 5, 1142–1150 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karpen G. H. & Spradling A. C. Analysis of subtelomeric heterochromatin in the Drosophila minichromosome Dp1187 by single P element insertional mutagenesis. Genetics 132, 737–753 (1992). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karpen G. H. & Spradling A. C. Reduced DNA Polytenization of a Minichromosome Region Undergoing Position-Effect Variegation in Drosophila. Cell 63, 97–107 (1990). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Losada A., Abad J. P. & Villasante A. Organization of DNA sequences near the centromere of the Drosophila melanogaster Y chromosome. Chromosoma 106, 503–512 (1997). [DOI] [PubMed] [Google Scholar]
- Gatti M., Bonaccorsi S. & Pimpinelli S. Looking at Drosophila mitotic chromosomes. Methods Cell Biol. 44, 371–391 (1994). [DOI] [PubMed] [Google Scholar]
- Plateau P. & Gueron M. Exchangeable proton NMR without base-line distorsion, using new strong-pulse sequences. J. Am. Chem. Soc. 104, 7310–7311 (1982). [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.