Abstract
Nucleosomal DNA sequences generally follow a well-known pattern with ∼10-bp periodic WW (where W is A or T) dinucleotides that oscillate in phase with each other and out of phase with SS (where S is G or C) dinucleotides. However, nucleosomes with other DNA patterns have not been systematically analyzed. Here, we focus on an opposite pattern, namely anti-WW/SS pattern, in which WW dinucleotides preferentially occur at DNA sites that bend into major grooves and SS (where S is G or C) dinucleotides are often found at sites that bend into minor grooves. Nucleosomes with the anti-WW/SS pattern are widespread and exhibit a species- and context-specific distribution in eukaryotic genomes. Unlike non-mammals (yeast, nematode and fly), there is a positive correlation between the enrichment of anti-WW/SS nucleosomes and RNA Pol II transcriptional levels in mammals (mouse and human). Interestingly, such enrichment is not due to underlying DNA sequence. In addition, chromatin remodeling complexes have an impact on the abundance but not on the distribution of anti-WW/SS nucleosomes in yeast. Our data reveal distinct roles of cis- and trans-acting factors in the rotational positioning of nucleosomes between non-mammals and mammals. Implications of the anti-WW/SS sequence pattern for RNA Pol II transcription are discussed.
INTRODUCTION
The eukaryotic genome is organized into chromatin in which the access of protein to DNA is tightly regulated to facilitate genomic functions such as transcription, replication and repair. The basic repeating unit of chromatin is the nucleosome core particle (NCP) that consists of a histone octamer, around which 147 base pairs (bp) of DNA is wrapped in ∼1.7 turns (1). One of the most intriguing questions in chromosome biology is to understand where nucleosomes are positioned and how specific positioning of nucleosomes affects transcription factor (TF) binding and gene expression in the context of chromatin. Experimentally, genome-wide nucleosome positions are usually mapped by two types of methods. The first type involves treating chromatin with high concentrations of micrococcal nuclease (MNase) followed by massively parallel sequencing (denoted as MNase-Seq). However, MNase has strong sequence preferences: it cuts predominantly within AT-rich sequences in free DNA (2,3) and in linker DNA between nucleosomes (4,5). This sequence specificity can be overcome by a method based on site-directed hydroxyl radicals (denoted as chemical method) (6). The chemical method allows the mapping of nucleosome dyad positions at a single base-pair resolution (7–9) and recently was applied to the yeast (10–12) and mouse (13) genomes.
Historically, nucleosome positioning is usually characterized by two parameters: rotational positioning, defined by the side of the DNA helix that faces the histones, and translational positioning, defined by the nucleosome midpoint (or dyad) with regard to the DNA sequence (14). Several cis and trans determinants of the translational positioning of nucleosomes have been reviewed in literature (15).
With regard to the rotational positioning, it is well established that DNA sequence is a critical determinant (16) and various sequence patterns have been proposed (17–26). One of the most well-known patterns is the WW/SS pattern (where W is A or T and S is G or C), which was first described by Travers and colleagues (27). Specifically, WW dinucleotides tend to occur at the sites where nucleosomal DNA bends into minor grooves (i.e. minor-groove bending sites or minor-GBS) facing toward the histone core, while SS dinucleotides are often positioned at the sites where nucleosomal DNA bends into major grooves (i.e. major-groove bending sites or major-GBS) facing toward the histone core. This pattern has been successfully used to predict the rotational positioning of nucleosomes (28). An anti-WW/SS pattern was described in which WW runs inverse to SS, and it was found that promoter nucleosomes in yeast favor this pattern (29). That is, the number of nucleosomes following the anti-WW/SS pattern exceeds or equals to the number of nucleosomes following the well-known WW/SS pattern in yeast promoters. Note that other sequence patterns are also described (29), including RR/YY and anti-RR/YY patterns (where R is A or G and Y is C or T), and these patterns are beyond the scope of the present study.
Structurally, highly conserved ‘sprocket’ arginine residues (30,31) insert into the minor-GBS (32). Short poly(dA:dT) stretches help to narrow DNA minor grooves, potentially enhancing electrostatic interactions between DNA phosphate backbone and ‘sprocket’ arginine residues (33–35). These arginine-DNA contacts that occur every ∼10 bp within NCP provides the structural basis for the rotational positioning of nucleosomes (1,33–35). In this light, anti-WW/SS nucleosomes seem to have unfavorable DNA-histone interactions, representing a relatively unstable structure. However, many questions of how anti-WW/SS nucleosomes are distributed across eukaryotic genomes and what factors influence their abundance remain poorly understood.
Here, we have quantified nucleosomes with or without the WW/SS sequence pattern, and systemically analyzed the fraction and the distribution of anti-WW/SS nucleosomes across five eukaryotic genomes under different cellular and growth conditions. We found that anti-WW/SS nucleosomes are widespread in the genomes regardless of the mapping methods and distributed in a species- and context-specific manner. In non-mammals including yeast, anti-WW/SS nucleosomes are not enriched in promoters. However, in mammals (e.g. mice and humans), this type of nucleosomes is enriched in promoter and genic regions but not in repetitive DNA elements. The enrichment of anti-WW/SS nucleosomes is positively correlated with RNA Pol II transcriptional levels, but negatively correlated with the presence of the periodic WW (or SS) pattern. Moreover, we found that chromatin remodelers have an impact on the number but not the distribution of anti-WW/SS nucleosomes in yeast. Our results indicate that cis- and trans-acting factors play distinct roles in controlling the rotational positioning of nucleosomes between non-mammals and mammals.
MATERIALS AND METHODS
Definition of Minor- and major-groove bending sites in a nucleosomal DNA fragment
Minor- and major-groove bending sites (GBS) in a 147-bp nucleosomal DNA fragment were defined previously (36). Briefly, minor-GBS are located at the superhelical locations (SHL) ±0.5, ±1.5, …, ±6.5, while major-GBS are located at SHL ±1, ±2, …, ±6. Each site is 3 or 4 bp in length. In total, there are 14 minor-GBS and 12 major-GBS along a 147-bp NCP fragment. Note that the minor-GBS at SHL ±0.5 (shown in gray in Supplementary Figure S1) are not included for analysis because out-of-phase WW/SS peaks were observed at these two sites (27). As a result, 12 minor-GBS covering 48 bp, and 12 major-GBS covering 44 bp are used in our analysis (Supplementary Table S1). Since each minor-GBS (4 bp in length) holds three unique dinucleotides, the maximum number of dinucleotides in 12 minor-GBS is 36 ( = 3 × 6 × 2). Similarly, the maximum number of dinucleotides in 12 major-GBS is 32 ( = (3 × 4 + 2 × 2) × 2). To account for this difference, we assign a coefficient of 1.125 ( = 36/32) to the number of WW or SS dinucleotides occurring at major-GBS (Table 1).
Table 1.
Classification of nucleosomal DNA sequence patterns
| Sequence Type | WW | SS |
|---|---|---|
| Type 1 | minor WW ≥ Major WW * coef† | minor SS ≤ Major SS * coef |
| Type 2 | minor WW ≥ Major WW * coef | minor SS > Major SS * coef |
| Type 3 | minor WW < Major WW * coef | minor SS ≤ Major SS * coef |
| Type 4 | minor WW < Major WW * coef | minor SS > Major SS * coef |
†coef: the coefficient is a constant of 1.125 ( =36/32). It refers to the ratio between 36 dinucleotide positions in minor-groove bending sites and 32 dinucleotide positions in major-groove bending sites (see Supplementary Table S1).
Classification of nucleosomes with different NPS patterns
Nucleosomal DNA sequences are divided into four types based on relative abundance of WW and SS dinucleotides in minor- and major-GBS (Table 1). Type 1 nucleosomes have the well-known WW/SS pattern (i.e. WW are more abundant in minor-GBS than in major-GBS, whereas SS are more abundant in major-GBS than in minor-GBS). Specifically, for a given 147-bp NCP fragment, if the total number of WW dinucleotides in the 12 minor-GBS is greater than its counterpart in the 12 major-GBS, and the total number of SS dinucleotides in the 12 major-GBS is greater than its counterpart in the 12 minor-GBS, this fragment is classified as Type 1 nucleosomal DNA (Table 1 and Figure 1A).
Figure 1.
Four sequence patterns of nucleosomal DNA in yeast. Shown are frequencies of the combined AA, TT, AT and TA dinucleotides (denoted as WW, shown in blue) and GG, CC, GC and CG dinucleotides (denoted as SS, shown in red) at each nucleosomal position, which were ‘symmetrized’ with respect to the dyad (dashed lines). Three base-pair running averages of the WW and SS frequencies were calculated and plotted: Type 1 (A), Type 2 (B), Type 3 (C) and Type 4 (D). The blue and red shade areas cover the range of minor- and major-GBS, respectively (see ‘Materials and Methods’ section). The Type 1 pattern refers to the WW/SS pattern, whereas the Type 4 pattern refers to the anti-WW/SS pattern.
Type 4 nucleosomes are characterized with an anti-WW/SS pattern. That is, if a 147-bp NCP fragment has more WW dinucleotides in 12 major-GBS than in minor-GBS, and has more SS dinucleotides in minor-GBS than in major-GBS, this fragment is classified as Type 4 nucleosomal DNA (Table 1 and Figure 1D). Type 2 and 3 nucleosomes have ‘mixed’ patterns, in which both WW and SS are abundant in minor-GBS (Type 2, Figure 1B) or in major-GBS (Type 3, Figure 1C). Specifically, a Type-2 NCP fragment has more WW and SS dinucleotides in minor-GBS than in major-GBS, while a Type-3 NCP fragment has more WW and SS dinucleotides in major-GBS than in minor-GBS. For a given genomic region, the difference between Type 1 and Type 4 nucleosomes in percentage (%) is denoted as ΔNPS. That is, ΔNPS = Type 1 (%) – Type 4 (%).
High-resolution nucleosome datasets in yeast and higher eukaryotes
A total of 18 high-resolution nucleosomal DNA datasets from various eukaryotes were used in this study (Supplementary Table S2), in which nucleosome positions were mapped by the MNase-Seq method or the chemical method. Two datasets, one from yeast (10) and the other from mouse (13), were generated by the chemical method, and the dyad positions were precisely mapped at the single base-pair resolution. These positions were obtained from the published papers (10,13). Other 16 datasets were produced by paired-end MNase-Seq. Because this sequencing technique generates short reads on both ends of a nucleosomal DNA fragment, after mapping them to the reference genome, the length of the fragment can be precisely determined. Only the fragments of 147 bp in length were analyzed in this study (Supplementary Tables S3 and S4). This is because that in this case, minor- and major-GBS can be determined in an unambiguous manner based on a high-resolution nucleosome crystal structure with a 147-bp DNA fragment (36, also see Supplementary Figure S1).
Paired-end nucleosome sequence data from yeast include wild-type (WT) cells YPE458 (37), and MCY3647 (38) and mutant (MT) cells (37,38) including single mutants (isw1Δ, isw2Δ, chd1Δ, rsc8Δ), double mutants (isw1Δ isw2Δ, isw1Δ chd1Δ, isw2Δ chd1Δ), and triple mutants (isw1Δ isw2Δ chd1Δ). The paired reads were aligned to the Saccharomyces cerevisiae genome using ELAND (Illumina). Only the reads uniquely aligned to the genome with no mismatch were selected. Note that all yeast strains (WT or mutants) were grown to mid-log phase in synthetic complete (SC) medium. All of the WT strains (YPE458 and MCY3647) should be equivalent for most purposes, since they grew in the same SC medium.
Paired-end MNase-seq data were also taken from Caenorhabditis elegans embryos and sperms (39), Drosophila S2 cells (40,41), mouse embryonic stem cells (mESC) (13) and human lymphoblastoid cell lines 18508 and 19238 (42). BAM files were either downloaded from NCBI GEO database (37,38,42) or obtained by mapping raw reads to the corresponding genomes (13,39–41) using the default setting of Bowtie 2 (43), i.e. –sensitive, -I 0, -X 500.
Genome-wide gene expression (RNA-seq) data in yeast and higher eukaryotes
The RPKM (Reads Per Kilobase of transcript per Million) values in RNA-seq data of C. elegans embryos (GSM1652723) and sperms (GSM3188165), Drosophila S2 cells (GSM410195), mouse mESCs (GSM2183915) and human lymphoblastoids (GSE121926) were used to divide genes into four groups based on transcriptional frequencies (Supplementary Table S5).
The yeast RNA-seq data for the SC condition were used for analysis (44). Raw reads were mapped to the yeast genome by the default setting of Bowtie (43). The RPKM values of the yeast data were calculated following the same method as the human RNA-seq data (45).
Repeat elements in the mammalian genomes
Mouse (mm10) and human (hg19) repetitive region positions were downloaded from the UCSC Genome Browser. The repeat elements were identified using RepeatMasker (v3.2.7) and Repbase Update (9.11). The major types of repeat elements were selected for analysis, including short interspersed nuclear elements (SINE) (Alu, MIR), long interspersed nuclear elements (LINE) (CR1, L1, L2 and RTE), long terminal repeats (LTR) (ERV1, ERVK, ERVL and Gypsy), Simple Repeat ((TG)n, (TCG)n, (CACTC)n, (GAGTG)n and (TATATG)n). The remaining repeat elements were included into the ‘Other’ category including MuDR, PiggyBac, TcMar-Mariner, hAT-Charlie. Only the elements with >150 bp in length (i.e. approximately the size of one nucleosome) were selected for analysis.
Since human nucleosomes were mapped to the genome assembly hg18 (42), the UCSC LiftOver utility was used along with the hg18-hg19 chain file to convert the hg19 repeat elements to their corresponding positions in human genome assembly hg18. Nucleosomes overlapped with these repeat families were summarized in Supplementary Table S10.
Distance auto- and cross-correlation function
The distance auto-correlation (DAC) and distance cross-correlation (DCC) functions have been discussed previously (46,47). In the present study, DAC function was used to calculate the correlation between WW or SS dinucleotides. That is, if a WW dinucleotide (AA, TT, AT or TA) is separated from another WW dinucleotide by a distance d, one occurrence is counted for that distance in the DAC function. For the DCC function, we counted how many times a WW dinucleotide is separated from a SS dinucleotide (GG, CC, GC or CG) by a distance d, where d is between 1 and 150 bp.
RESULTS
Anti-WW/SS nucleosomes are widespread in eukaryotes
NCP fragments of 147 bp in length were divided into four sequence patterns, Type 1–4 (Table 1), based on the relative occurrence of WW and SS dinucleotides in 12 minor- and 12 major-GBS (Supplementary Table S1 and Figure S1). The Type 1 pattern represents the well-known WW/SS sequence pattern in which WW and SS preferentially occur in minor-GBS and major-GBS, respectively (Figure 1A). Type 2 nucleosomes follow a ‘mixed’ pattern, in which both WW and SS dinucleotides are more abundant in minor-GBS than major-GBS (Figure 1B). The Type 3 pattern is opposite to the Type 2 pattern in which both WW and SS are more abundant in major-GBS than minor-GBS (Figure 1C). The Type 4 pattern is inverse to Type 1 with WW and SS dinucleotides preferentially occurring in major- and minor-GBS respectively (Figure 1D). For the sake of simplicity, nucleosomes with the Type 1 pattern are denoted as WW/SS nucleosomes, whereas those with the Type 4 pattern are denoted as anti-WW/SS nucleosomes.
We have identified all four types of sequence patterns in datasets (Supplementary Tables S2 and S3) from yeast, fruit flies, nematodes, mice and humans (Figure 1 and Supplementary Figures S2–7). Our analysis has led to several interesting observations. First, since the datasets were produced by different methods (i.e. the MNase-Seq method or the chemical method), the presence of anti-WW/SS nucleosomes is independent of nucleosome mapping methods. Second, anti-WW/SS nucleosomes occur in all chromosomes (Supplementary Table S4), indicating that they are widespread in the genomes. Third, except for the yeast dataset obtained by chemical mapping, the fraction of WW/SS nucleosomes is less than 50% of all nucleosomes (Figure 2), indicating that the well-known WW/SS sequence pattern is not predominant. Fourth, the fractions of Type 1 and Type 4 nucleosomes vary substantially. For instance, the fractions of the four types of nucleosomes are 36–57% (Type 1), 20–22% (Type 2), 8–13% (Type 3) and 13–31% (Type 4), respectively (Supplementary Table S4). Since Type 2 and Type 3 nucleosomes have relatively small changes among datasets, we were prompted to develop a measure, ΔNPS ( = Type 1 (%) – Type 4 (%)), to gauge the difference between Type 1 and Type 4 nucleosomes (Supplementary Figure S8). A ΔNPS value represents the relative abundance of WW/SS versus anti-WW/SS nucleosomes in a given genomic region.
Figure 2.
Comparison of the fractions of nucleosomal DNA patterns in eukaryotes. Shown is the fractions of four sequence patterns in nucleosomal DNA from yeast, nematode, fruit flies, mice and humans. For chemical mapping data, the ‘unique’ maps of nuleosomal dyad positions were taken from literature (10,13) and the corresponding 147-bp NCP fragments were used in this study. For the paired-end MNase mapping data, the two biological replicates or relevant datasets were taken from literature and 147-bp NCP fragments in these datasets were used for analysis. The fractions of sequence patterns were calculated for each dataset (Supplementary Table S4).
Taken together, anti-WW/SS nucleosomes accounting for 13–31% of total nucleosomes exist in all eukaryotes examined and are widespread across the genomes. We next sought to understand how anti-WW/SS nucleosomes are distributed in the genomes.
Anti-WW/SS nucleosomes are enriched in mammalian genes and associated with transcription
We started with genomic regions surrounding the transcriptional start sites (TSS) of genes that were separated into quartiles by transcriptional levels (Supplementary Table S5). As expected, nucleosome occupancy profiles in yeast reveal a well-established genomic pattern (48,49): the presence of an ∼200-bp nucleosome-depleted region (NDR), flanked by phased nucleosomes, which form a highly regular array extending into gene bodies (Figure 3A and B). The phased nucleosomes, named as nucleosome -1, +1, +2, +3, +4 and +5, are organized relative to TSS (50,51) (Supplementary Table S6).
Figure 3.
Nucleosome occupancy profiles around TSS of yeast (A and B), fly (B), nematode (C), mouse (D and E) and human (F) genes. There are two separate nucleosome datasets for yeast and mice, derived from chemical mapping (A and D) and MNase-Seq mapping (B and E), respectively. Nucleosome occupancy signals ±1 kb of verified TSSs are separated into quartiles based on transcriptional levels (Supplementary Table S5). The first 25%-ile represents the least active genes whereas the fourth 25%-ile represents the most active genes. The average nucleosome profile of all genes is shown in black. Nucleosomes −1 to +5 are demarcated by dashed lines and arrows, following the methods used in previous studies (77). These definable zones relative to the TSS (position 0) to which a nucleosome midpoint may be assigned are: −1, +1, +2, +3, +4 and +5 (see Supplementary Table S6 for nucleosome ranges). Note the nucleosome occupancy of the nematode dataset (D) looks noisier than other datasets probably due to low read coverage.
Overall, the regularity of the nucleosome arrays depends on species, transcription levels and nucleosome mapping methods (Figure 3). First, nucleosome arrays in yeast are more regular than their counterparts in other species. Second, as transcriptional frequencies go higher, the nucleosome arrays tend to be more regular (compare red lines with blue lines). Third, nucleosomes mapped by the chemical method tend to more regularly positioned than those mapped by the MNase-Seq method (compare Figure 3A and B with Figure 3E and F), presumably because the hydroxyl-cleavage-based chemical method overcomes the sequence specificity of MNase (2,3) and is able to map nucleosomes at a higher resolution (6).
Comparison of nucleosomal ΔNPS value for all genes across the species reveals some interesting patterns. First, in the non-mammalian genes, the ΔNPS values appear close to their genomic ΔNPS (see black lines and dashed lines in Figure 4A–D), suggesting that the relative abundance of Type 1 and Type 4 nucleosomes has little change. This pattern is confirmed by additional datasets in flies (Supplementary Figure S9A and B) and nematodes (Supplementary Figure S10A and B). Consistently, quantitative analysis reveals a small difference between nucleosomal ΔNPS (averaged over nucleosome −1 to +5) and genomic ΔNPS: <1% for yeast and 1–3% for flies and nematodes (Supplementary Table S7). In yeast, it has been shown that the fraction of Type 1 nucleosomes is significantly more than that of Type 4 nucleosomes with the difference ranging from ∼20% (MNase-seq set) to ∼45% (chemical set). This difference varies little between promoters and other regions in the yeast genome. Note that our results differ from a previous study by Ioshikhes and colleagues (29), in which a small set (<10 000) of single-end H2A.Z nucleosomal DNA sequences, compared to >4 million fragments in the present study (Supplementary Table S3), were used to analyze the positive or negative correlation with the WW/SS pattern. They found that the number of promotor nucleosomes following the anti-WW/SS pattern was higher or close to the number of nucleosomes following the well-known WW/SS pattern. Because the single-end sequencing method is unable to provide accurate estimations of nucleosome dyads, in our view, the observed discrepancies are likely due to the different methods, as well as different quality and size of datasets used in both studies.
Figure 4.
Nucleosome ΔNPS values for genes separated into quartiles by transcriptional frequencies. The first 25%-ile represents the least active genes whereas the fourth 25%-ile represents the most active genes. The ΔNPS values for all genes are shown in black. The genomic ΔNPS values are denoted by dashed lines.
Second, in mammalian genes, the ΔNPS values of nucleosome −1 to +5 tend to be lower than the genomic ΔNPS, with the lowest point at nucleosome +1 (black lines in Figure 4E-G), indicating that the relative abundance of Type 1 and Type 4 nucleosomes have changed. The averaged nucleosomal ΔNPS values exhibit a substantial difference, 5–11%, from the genomic ΔNPS value, which is much higher than that of non-mammals, 0–3% (Supplementary Table S7). Detailed analysis on the ΔNPS values of nucleosome +1 in the mouse chemical dataset shows that the fraction of Type 1 nucleosomes is decreased from 47 to 41%, whereas the fraction of Type 4 nucleosomes is increased from 19 to 24% (Supplementary Table S8). A similar trend is seen in the mouse and human MNase datasets (Supplementary Table S8), clearly illustrating that the fraction of Type 1 nucleosomes is decreased whereas the fraction of Type 4 nucleosome is increased in nucleosome +1. This mammalian pattern (Figure 4E–G) is in marked contrast to the non-mammalian one (Figure 4A–D), and an analysis of statistical errors of both patterns shows that anti-WW/SS nucleosomes are indeed enriched in mammalian genes (Supplementary Figure S11). Note that the observed difference is not due to the selected 147-bp NCP fragments because this difference still holds in a separate study that includes all NCP fragments (Supplementary Figure S12).
Third, the change in ΔNPS values is more correlated with transcriptional levels in mammalian genes compared to non-mammalian ones (Figure 4). For example, in yeast, no clear change is seen for the nucleosomal ΔNPS values in highly (red line) and lowly (blue line) transcribed genes (Figure 4A–D). By contrast, in mammals, lowly transcribed genes tend to have higher ΔNPS values whereas highly transcribed genes tend to have lower ΔNPS values (Figure 4E–G). Again, the nucleosome +1 was used for illustration (Supplementary Table S8): the fraction of Type 1 nucleosomes is decreased from lowly transcribed genes (Group 1) to highly transcribed genes (Group 4). By contrast, the fraction of Type 4 nucleosomes is increased from lowly transcribed genes to highly transcribed genes. These results have established a clear link between RNA Pol II transcription levels and the rotational settings of nucleosomes in mammals, that is, nucleosomes are more likely to follow the well-known WW/SS pattern in lowly transcribed genes and the anti-WW/SS pattern in highly transcribed genes.
Overall, we have seen a drastic difference in the distribution of anti-WW/SS nucleosomes in non-mammalian versus mammalian genes. That is, anti-WW/SS nucleosomes are relatively increased in genes of the mammals as compared to the non-mammalian species. Moreover, the local increase of anti-WW/SS nucleosomes seems to be associated with RNA Pol II transcription.
Periodic DNA patterns are missing in mammalian genes
To check whether the observed local increase of anti-WW/SS is caused by underlying DNA patterns, we calculated DAC functions (46,47) of WW dinucleotides in the TSS-surrounding regions (i.e. between −500 bp and +1000 bp relative to TSS). Inspection of the DAC functions in non-mammals reveals a very regular structure, with peaks separated by ∼10 bp (Figure 5A–C). The same ∼10-bp periodicity is observed in the DAC functions of SS dinucleotides (Supplementary Figure S13A–C) and the DCC functions between WW and SS dinucleotides (Supplementary Figure S14A–C). Note that the DAC functions have the peaks at the distance of ∼10 × n bp (where n = 1, 2, …), whereas the DCC functions have the peaks at the distance of ∼10 × n + 5 bp (where n = 1, 2, …), indicating that WW (or SS) are in phase with each other, whereas WW and SS dinucleotides are out of phase. Our results suggest stable nucleosomes are likely to form with an optimal rotational setting in non-mammalian genes, which may explain small variations of the ΔNPS values observed for nucleosome −1 to +5 (see black lines in Figure 4A–D).
Figure 5.
DAC function profiles for WW dinucleotides in yeast (A), fly (B), nematode (C), mouse (D) and human (E) DNA. Genomic fragments [−500 bp, +1000 bp] relative to verified TSSs (position 0) were used for analysis. Both raw (thin lines) and 3-bp running average (thick lines) values were plotted.
By contrast, both DAC and DCC functions in mouse and human genes exhibit no recognizable peaks (Figure 5D and E; Supplementary Figures S13D-E and S14D-E), indicating that the ∼10-bp periodic WW and SS patterns are missing in mammals, consistent with earlier studies (52). This lack of sequence periodicity in mammals is remarkable because it suggests that the local increase of anti-WW/SS is not due to intrinsic DNA sequence patterns but to trans-acting factors such as RNA Pol II. RNA transcription may facilitate the formation of anti-WW/SS nucleosomes that are intrinsically unstable, which in turn promotes gene transcription (see ‘Discussion’ section).
Taken together, we have observed a significant difference in DNA sequence patterns between mammalian and non-mammalian genes. The non-mammalian DNA exhibits an in-phase pattern of WW (or SS) dinucleotides and an out-of-phase pattern between WW and SS dinucleotides. By contrast, no periodic pattern is observed in mammals. Our data suggest that DNA sequence plays a distinctive role in the rotational settings of nucleosomes in non-mammals versus mammals around TSS.
Anti-WW/SS nucleosomes are not over-represented in mammalian repeats
To examine if the enrichment of anti-WW/SS nucleosomes in mammals is context dependent, we focused on repetitive DNA elements. It is because that up to 40–50% of a mammalian genome contain repeat sequences derived from transposable elements (53), including elements with LTR, SINE and LINE. Many repetitive sequences occur in heterochromatic regions (54) that are characterized with high levels of condensation throughout the cell cycle (55), low rates of meiotic recombination (56) and the ability to silence gene expression (57). We therefore hypothesize that the fraction of anti-WW/SS nucleosomes in repeats would differ from that in genic regions (Supplementary Figure S8).
We first checked the fraction of repeat families in the human genome (Figure 6A; Supplementary Tables S9 and S10) and compared with the fractions of nucleosomes residing in these families (Figure 6B). Although 18 and 6% of human repeats are LTR and simple repeat elements, only 8 and 1% of NCP fragments are found in these elements, respectively, indicating that nucleosomes are depleted in these families. By contrast, no apparent depletion of nucleosomes is observed in LINE and SINE elements (41 versus 40% for LINE and 29 versus 24% for SINE) (Figure 6A and B).
Figure 6.
Fractions of DNA sequence patterns and ΔNPS values in human repeat families. (A) Fraction of repeat families in human genome. The fractions were taken from literature (78). (B) Fraction of 147-bp human nucleosomes residing in each repeat family. (C) Fraction of four types of nucleosomes in each repeat family. (D) Nucleosome ΔNPS values in genic and repetitive DNA regions. The genomic ΔNPS value is indicated by dashed lines.
Then we calculated the fractions of four sequence types and the ΔNPS value across repeat families (Figure 6C and D) and found that ΔNPS values in all families are higher than that in genic region. In particular, SINE- and LINE-bearing nucleosomes have ΔNPS values close to the genomic value (Figure 6D). Similar observations are seen in mouse repeats (Supplementary Table S11 and Supplementary Figure S15), indicating that anti-WW/SS nucleosomes are not over-represented in mammalian repeats. Examination of the DAC functions of WW dinucleotides in human LINE and SINE elements reveals an ∼10-bp periodicity (Supplementary Figure S16), which is absent in genic regions (Figure 5D and E), suggesting that repetitive DNA is organized differently from DNA in genic regions.
Chromatin remodelers influence the fraction of anti-WW/SS nucleosomes
At last, we investigated if other trans-acting factors such as chromatin remodeling complexes can influence the abundance of anti-WW/SS nucleosomes. To this end, yeast mutant strains in which single or multiple chromatin remodeling complexes have been deleted were used for analysis. These mutant strains include four single mutants (isw1Δ, isw2Δ, chd1Δ and rsc8Δ), three double mutants (isw1Δ isw2Δ, isw1Δ chd1Δ and isw2Δ chd1Δ), and one triple mutant (isw1Δ isw2Δ chd1Δ) (37,38). From the fraction of four sequence types (Supplementary Figure S17) and genomic ΔNPS value (Supplementary Table S12) of each mutant, it is clear that all mutants except isw2Δ have significantly different ΔNPS values from the WT strain (Student's t-test P < 10−5) (Figure 7), with the average deviation being 1.85% (Supplementary Table S12).
Figure 7.
Genomic ΔNPS values in yeast WT and mutant strains. ΔNPS values were calculated by chromosomes and shown in a box-whisker plot, with the mean representing the genomic ΔNPS. The genomic ΔNPS values of WT strains are used as reference points to compare with those of mutant strains (Supplementary Table S12) in which one or more chromatin remodeler genes are knocked out (50).
The double mutant strain isw1Δ chd1Δ was used for detailed analysis because nucleosome phasing is disrupted (Supplementary Figure S18A), as shown before (37). The genomic ΔNPS value of this strain is 16.84%, which is 3.3% smaller than that of the WT strain, 20.14% (Supplementary Table S12), suggesting that the deletion of both ISW1 and CHD1 leads to an increase of anti-WW/SS nucleosomes. However, the nucleosome ΔNPS profile around TSS is very similar to that of WT strain (Supplementary Figure S18B and Figure 4A), indicating that anti-WW/SS nucleosomes are not enriched in the genic regions, although the overall fraction of anti-WW/SS nucleosomes is increased. Similar patterns are seen in other mutants, e.g., chd1Δ isw1Δ chd1Δ, isw2Δ chd1Δ and isw1Δ isw2Δ chd1Δ (Figure 8). Our data suggest that chromatin remodeler(s) have an impact on the fraction but not on the distribution of anti-WW/SS nucleosomes in non-mammals. However, it remains to be determined if this observation holds for mammals.
Figure 8.
ΔNPS values of promoter and downstream nucleosomes in yeast mutant strains. For the sake of comparison, the ranges of nucleosomes are the same for both the WT and mutant strains (Supplementary Table S6). Other notations follow Figure 3.
DISCUSSIONS
A model for cis and trans determinants of rotational positioning of nucleosomes
We have conducted a comprehensive analysis of WW/SS-based sequence patterns for nucleosomes in five eukaryotes. Our work demonstrates that nucleosomes with the anti-WW/SS pattern are widespread in eukaryotic genomes. Interestingly, these nucleosomes are distributed differently in non-mammals (e.g. yeast, nematodes and fruit flies) versus in mammals (e.g. mice and humans). Specifically, anti-WW/SS nucleosomes are enriched in the promoter and genic regions of mammalian genomes but not of non-mammalian genomes (Figure 9A). We have further analyzed the impact of various cis- and trans-acting factors on the fraction and the distribution of anti-WW/SS nucleosomes. In light of these findings, we propose a model for the distinct role of cis and trans factors in the rotational positioning of nucleosomes between non-mammals and mammals (Figure 9B).
Figure 9.

Models for the roles of cis and trans factors in rotational positioning of nucleosomes. (A) Species-specific distribution of anti-WW/SS nucleosomes. In mammals, the fraction of anti-WW/SS nucleosomes in nucleosomes −1 to +5 is increased and the fraction of WW/SS nucleosomes is decreased, suggesting that these nucleosomes are unstable (represented by fuzziness of nucleosome positioning). This change is not seen in non-mammalian genes. Nucleosomes −1 to +5 in mammals are depicted in dark brown and other nucleosomes (including those residing in mammalian repetitive DNA) are depicted in light brown. (B) Distinct role of cis and trans factors in rotational positioning of nucleosomes. Both cis-acting factors (e.g. DNA sequences) and trans-acting factors (e.g. chromatin remodeling complexes and RNA Pol II) affect rotational settings of nucleosomes. In non-mammals, DNA sequence plays a more important role than RNA Pol II because (i) the ∼10-bp periodic WW (or SS) patterns are pronounced; (ii) no clear change in the ΔNPS values is seen between highly and lowly transcribed genes. By contrast, in mammalian genes, RNA Pol II plays a more important role because (i) the ∼10-bp periodicity is diminished, and (ii) a clear change in ΔNPS values is seen between highly and lowly transcribed genes. Chromatin remodelers also have an impact on the rotational setting of nucleosomes in non-mammals, but this effect is unclear in mammals.
Based on our model, the rotational positioning of nucleosomes can be influenced by cis-acting factors (DNA sequence patterns) and trans-acting factors (chromatin remodeling complexes and RNA Pol II) in both non-mammals and mammals. However, the relative importance of these factors differ. In non-mammals, the cis-acting factors play a more important role than trans-acting factors because (i) genomic DNA exhibits sequence patterns with an ∼10-bp periodicity (Figure 5A–C; Supplementary Figures S13A–C and S14A–C), and (ii) the fraction of anti-WW/SS nucleosomes varies little across the genomes (Figures 4A–D and 8). By contrast, the trans-acting factors such as RNA Pol II appears to be more important in mammals, due to a lack of periodic sequence patterns (Figure 5D and E; Supplementary Figures S13D-E and S14D-E), and the local increase of anti-WW/SS nucleosomes in mammalian promoters and genic regions (Figure 4E–G).
It is worth noting that overall, periodic WW (or SS) patterns appear to be very strong in yeast, become weaker in nematodes and flies and completely disappear in mice and humans (Figure 5 and Supplementary Figures S13 and S14). This implies that the importance of cis-acting factors in the rotational positioning of nucleosomes decreases in evolution, especially in promoter and genic regions. However, the periodic sequence patterns are preserved in mammalian repetitive DNA elements (Supplementary Figure S16) and anti-WW/SS nucleosomes are not over-represented (Figure 6D). These data suggest that molecular mechanisms governing the rotational settings of nucleosomes in mammals are context dependent.
Recenlty, Tompitak et al. have found a large difference in the intrinsically encoded nucleosome affinity at promoter regions of unicellular versus multicellular eukaryotes (58). Unlike the NDR-encoded in unicellular eukaryotes (e.g. yeast), promoters in multicellular eukaryotes (e.g. humans) encode nucleosome-attracting regions (NAR), which is a universal feature of multicellular life. Two interesting hypotheses have been proposed to account for the presence of NAR in humans: one is to suppress gene transcription and facilitate cell differentiation, whereas the other is to retain nucleosomes in the germline to pass on epigenetic information. Together with our findings, we hypothesize that two different codes for nucleosome positioning evolve independently: the code for translational positioning (represented by intrinsic nucleosome affinity) is weak in yeast and becomes increasingly strong late in evolution, whereas the code for rotational positioning (represented by WW/SS patterns) is strong in yeast and gradually disappears late in evolution.
Implications of anti-WW/SS nucleosomes for RNA Pol II transcription
The interactions between RNA Pol II transcription and nucleosomes have been extensively studied since the discovery of nucleosomal barrier (59,60). Early work has established that the nucleosome presents a strong barrier to transcription in vitro, and the nature of this barrier has been studies by structural, biochemical and biophysical studies (61,62). However, the questions of how the Pol II machinery efficiently overcomes nucleosomes and transcribes through gene body in vivo remain unclear. Various trans factors modulating the nucleosomal barrier have been proposed (60), including histone modifications, histone variant replacement, histone chaperones and nucleosome remodelers. Our work suggests that cis factors such as DNA sequence also contribute to modulation of nucleosomal barrier, especially in mammals (see below).
As shown above, anti-WW/SS nucleosomes are enriched in promoters and genic regions of mammals, and this enrichment is increased with the transcriptional levels. One possible explanation is that because of the intrinsic instability of nucleosomes with respect to rotational settings, RNA Pol II transcription is promoted. However, the formation of anti-WW/SS nucleosomes is not energetically favorable because GC-containing dinucleotides in the minor-GBS do not have favorable interactions with the ‘sprocket’ arginine residues in histones (see ‘Introduction’ section). In addition, the enrichment of this type of nucleosomes is not seen in highly transcribed genes in non-mammals (Figure 4A–D). An alternative explanation is that RNA Pol II transcription helps to position anti-WW/SS nucleosomes that are thermodynamically unstable. In these nucleosomes, the presence of SS dinucleotides in the minor-GBS weakens DNA-histone interactions, which may facilitate the unwrapping of DNA from nucleosomes (63), thereby promoting RNA Pol II transcription. This tendency may also help explain histone eviction observed in highly transcribed genes (64,65).
Rotational positioning and binding of transcriptional factors in mammalian repeats
Repetitive DNA regions in human are characterized with different chromatin organizations. For example, LTR/ERV elements are highly enriched in open chromatin regions (66), whereas Alu elements that belong to the SINE superfamily harbor two nucleosomes (67–69). Our analysis of the human nucleosome data, which were derived from lymphoblastoid cell lines (42), is in general agreement with these observations (Figure 6A and B).
A group of TFs known as pioneer factors (e.g., p53, Oct4, Sox2, Klf4 etc.) is able to interact with nucleosomal DNA (70–72) and many of them preferentially bind to repetitive DNA elements (73). We and other groups have shown that the rotational positioning of nucleosomes is critical for TF binding including p53 (70,74). We have found that an Alu-residing nucleosome, if taking the optimal rotational setting, help to expose the putative binding sites of p53 (75). The present study has demonstrated that nucleosomes are likely to take the optimal rotational setting in human repeats including SINE/Alu elements (Figure 6D), suggesting that many p53 binding sites in Alu elements are likely to be exposed. This is also in agreement with our recent findings that a large amount (∼40%) of p53 binding sites occurs in various repeats including Alu (76). Further analyses are needed to address the biological roles of anti-WW/SS nucleosomes in repeats to see if, in these cases, TF binding sites are buried inside of nucleosomes and inhibitory for TF binding, as we suggested for p53 (74). Understanding the rotational positioning of nucleosomes in human repeats will shed new light on mechanisms regulating retrotransposon activities in a normal or diseased cellular context.
Supplementary Material
Notes
Present address: Gregory M. Wright, Department of Pharmacology, School of Medicine, University of Colorado (Anschutz Medical Campus), Aurora, CO 80045, USA.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
NIH Grant [GM116102 to F.C.] and RIT Faculty of Development (FEAD) Funds (to F.C.). Funding for open access charge: NIH Grant [GM116102].
Conflict of interest statement. None declared.
REFERENCES
- 1. Luger K., Mader A.W., Richmond R.K., Sargent D.F., Rochmond T.J.. Crystal structure of the nucleosome core particle at 2.8Å resolution. Nature. 1997; 389:251–260. [DOI] [PubMed] [Google Scholar]
- 2. Dingwall C., Lomonossoff G.P., Laskey R.A.. High sequence specificity of micrococcal nuclease. Nucleic Acids Res. 1981; 9:2659–2673. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Horz W., Altenburger W.. Sequence specific cleavage of DNA by micrococcal nuclease. Nucleic Acids Res. 1981; 9:2643–2658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Field Y., Kaplan N., Fondufe-Mittendorf Y., Moore I., Sharon E., Lubling Y., Widom J., Segal E.. Distinct modes of regulation by chromatin encoded through nucleosome positioning signals. PLoS Comput. Biol. 2008; 4:e1000175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Cui F., Zhurkin V.B.. Distinctive sequence patterns in metazoan and yeast nucleosomes: implications for linker histone binding to AT-rich and methylated DNA. Nucleic Acids Res. 2009; 37:2818–2829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Flaus A., Luger K., Tan S., Richmond T.J.. Mapping nucleosome position at single base-pair resolution by using site-directed hydroxyl radicals. Proc. Natl. Acad. Sci. U.S.A. 1996; 93:1370–1375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Panetta G., Buttinelli M., Flaus A., Richmond T.J., Rhodes D.. Differential nucleosome positioning on Xenopus oocyte and somatic 5S RNA genes determines both TFIIIA and H1 binding: a mechanism for selective H1 repression. J. Mol. Biol. 1998; 282:683–697. [DOI] [PubMed] [Google Scholar]
- 8. Flaus A., Richmond T.J.. Positioning and stability of nucleosomes on MMTV 3′LTR sequences. J. Mol. Biol. 1998; 275:427–441. [DOI] [PubMed] [Google Scholar]
- 9. Kassabov S.R., Henry N.M., Zofall M., Tsukiyama T., Bartholomew B.. High-resolution mapping of changes in histone-DNA contacts of nucleosomes remodeled by ISW2. Mol. Cell Biol. 2002; 22:7524–7534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Brogaard K., Xi L., Wang J.P., Widom J.. A map of nucleosome positions in yeast at base-pair resolution. Nature. 2012; 486:496–501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Moyle-Heyrman G., Zaichuk T., Xi L., Zhang Q., Uhlenbeck O.C., Holmgren R., Widom J., Wang J.P.. Chemical map of Schizosaccharomyces pombe reveals species-specific features in nucleosome positioning. Proc. Natl. Acad. Sci. U.S.A. 2013; 110:20158–20163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Chereji R.V., Ramachandran S., Bryson T.D., Henikoff S.. Precise genome-wide mapping of single nucleosomes and linkers in vivo. Genome Biol. 2018; 19:19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Voong L.N., Xi L., Sebeson A.C., Xiong B., Wang J.P., Wang X.. Insights into nucleosome organization in mouse embryonic stem cells through chemical mapping. Cell. 2016; 167:1555–1570. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Lu Q., Wallrath L.L., Elgin S.C.. Nucleosome positioning and gene regulation. J. Cell Biochem. 1994; 55:83–92. [DOI] [PubMed] [Google Scholar]
- 15. Struhl K., Segal E.. Determinants of nucleosome positioning. Nat. Struct. Mol. Biol. 2013; 20:267–273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Trifonov E.N. Thirty years of multiple sequence codes. Genomics Proteomics Bioinformatics. 2011; 9:1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Mengeritsky G., Trifonov E.N.. Nucleotide sequence-directed mapping of the nucleosomes. Nucleic Acids Res. 1983; 11:3833–3851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Zhurkin V.B. Specific alignment of nucleosomes on DNA correlates with periodic distribution of purine-pyrimidine and pyrimidine-purine dimers. FEBS Lett. 1983; 158:293–297. [DOI] [PubMed] [Google Scholar]
- 19. Drew H.R., Travers A.A.. DNA bending and its relation to nucleosome positioning. J. Mol. Biol. 1985; 186:773–790. [DOI] [PubMed] [Google Scholar]
- 20. Calladine C.R., Drew H.R.. Principles of sequence-dependent flexure of DNA. J. Mol. Biol. 1986; 192:907–918. [DOI] [PubMed] [Google Scholar]
- 21. Uberbacher E.C., Harp J.M., Bunick G.J.. DNA sequence patterns in precisely positioned nucleosome. J. Biomol. Struct. Dyn. 1988; 6:105–120. [DOI] [PubMed] [Google Scholar]
- 22. Ioshikhes I., Bolshoy A., Trifonov E.N.. Preferred positions of AA and TT dinucleotides in aligned nucleosome DNA sequences. J. Biomol. Struct. Dyn. 1992; 9:1111–1117. [DOI] [PubMed] [Google Scholar]
- 23. Baldi P., Brunak S., Chauvin Y., Krogh A.. Naturally occurring nucleosome positioning signals in human exons and introns. J. Mol. Biol. 1996; 263:503–510. [DOI] [PubMed] [Google Scholar]
- 24. Ioshikhes I., Bolshoy A., Derenshteyn K., Borodovsky M., Trifonov E.N.. Nucleosome DNA sequence pattern revealed by multiple alignment of experimentally mapped sequences. J. Mol. Biol. 1996; 262:129–139. [DOI] [PubMed] [Google Scholar]
- 25. Lowary P.T., Widom J.. New DNA sequence rules for high affinity binding to histone octamer and sequence-directed nucleosome positioning. J. Mol. Biol. 1998; 276:19–42. [DOI] [PubMed] [Google Scholar]
- 26. Kogan S.B., Kato M., Kiyama R., Trifonov E.N.. Sequence structure of human nucleosome DNA. J. Biomol. Struct. Dyn. 2006; 24:43–48. [DOI] [PubMed] [Google Scholar]
- 27. Satchwell S.C., Drew H.R., Travers A.A.. Sequence periodicities in chicken nucleosome core DNA. J. Mol. Biol. 1986; 191:659–675. [DOI] [PubMed] [Google Scholar]
- 28. Cui F., Chen L., LoVerso P.R., Zhurkin V.B.. Prediction of nucleosome rotational positioning in yeast and human genomes based on sequence-dependent DNA anisotropy. BMC Bioinformatics. 2014; 15:313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Ioshikhes I., Hosid S., Pugh B.F.. Variety of genomic DNA patterns for nucleosome positioning. Genome Res. 2011; 21:1863–1871. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Muthurajan U.M., Bao Y., Forsberg L.J., Edayathumangalam R.K., Suto R.K., Chakravarthy S., Dyer P.N., Luger K.. Structure and dynamics of nucleosomal DNA. Biopolymers. 2003; 68:547–556. [DOI] [PubMed] [Google Scholar]
- 31. Sullivan S.A., Landsman D.. Characterization of sequence variability in nucleosome core histone folds. Proteins. 2003; 52:454–465. [DOI] [PubMed] [Google Scholar]
- 32. Davey C.A., Sargent D.F., Luger K., Maeder A.W., Richmond T.J.. Solvent mediated interactions in the structure of the nucleosome core particle at 1.9Å resolution. J. Mol. Biol. 2002; 391:1097–1113. [DOI] [PubMed] [Google Scholar]
- 33. Rohs R., West S.M., Sosinsky A., Liu P., Mann R.S., Honig B.. The role of DNA shape in protein-DNA recognition. Nature. 2009; 461:1248–1253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Wang D., Ulyanov N.B., Zhurkin V.B.. Sequence-dependent Kink-and-Slide deformations of nucleosomal DNA facilitated by histone arginines bound in the minor groove. J. Biomol. Struct. Dyn. 2010; 27:843–859. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. West S.M., Rohs R., Mann R.S., Honig B.. Electrostatic interactions between arginines and the minor groove in the nucleosomes. J. Biomol. Struct. Dyn. 2010; 27:861–866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Cui F., Zhurkin V.B.. Structure-based analysis of DNA sequence patterns guiding nucleosome positioning in vitro. J. Biomol. Struct. Dyn. 2010; 27:821–841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37. Ocampo J., Chereji R.V., Eriksson P.R., Clark D.J.. The ISW1 and CHD1 ATP-dependent chromatin remodelers compete to set nucleosome spacing in vivo. Nucleic Acids Res. 2016; 44:4625–4635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Ganguli D., Chereji R.V., Iben J.R., Cole H.A., Clark D.J.. RSC-dependent constructive and destructive interference between opposing arrays of phased nucleosomes in yeast. Genome Res. 2014; 24:1637–1649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Tabuchi T.M., Rechtsteiner A., Jeffers T.E., Egelhofer T.A., Murphy C.T., Strome S.. Caenorhabditis elegans sperm carry a histone-based epigenetic memory of both spermatogenesis and oogenesis. Nat. Commun. 2018; 9:4310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Chereji R.V., Kan T.W., Grudniewska M.K., Romashchenko A.V., Berezikov E., Zhimulev I.F., Guryev V., Morozov A.V., Moshkin Y.M.. Genome-wide profiling of nucleosome sensitivity and chromatin accessibility in Drosophila melanogaster. Nucleic Acids Res. 2016; 44:1036–1051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Fuda N.J., Guertin M.J., Sharma S., Danko C.G., Martins A.L., Siepel A., Lis J.T.. GATA factor maintains nucleosome-free regions and has a role in RNA polymerase II recruitment to promoters. PLos Genet. 2015; 11:e1005108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Gaffney D.J., McVicker G., Pai A.A., Fondufe-Mittendorf Y.N., Lewellen N., Michelini K., Widom J., Gilad Y., Pritchard J.K.. Controls of nucleosome positioning in the human genome. PLos Genet. 2012; 8:e1003036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Langmead B., Salzberg S.L.. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012; 9:357–359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Waern K., Snyder M.. Extensive transcript diversity and novel upstream open reading frame regulation in yeast. G3 (Bethesda). 2013; 3:343–352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Valouev A., Johnson S.M., Boyd S.D., Smith C.L., Fire A.Z., Sidow A.. Determinants of nucleosome organization in primary human cells. Nature. 2011; 474:516–520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Zhurkin V.B. Periodicity in DNA primary structure is defined by secondary structure of the coded protein. Nucleic Acids Res. 1981; 9:1963–1971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Cui F., Cole H.A., Clark D.J., Zhurkin V.B.. Transcriptional activation of yeast genes disrupt intragenic nucleosome phasing. Nucleic Acids Res. 2012; 40:10753–10764. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48. Sekinger E.A., Moqtaderi Z., Struhl K.. Intrinsic histone-DNA interactions and low nucleosome density are important for preferential accessibility of promoter regions in yeast. Mol. Cell. 2005; 18:736–748. [DOI] [PubMed] [Google Scholar]
- 49. Yuan G., Liu Y., Dion M.F., Slack M.D., Wu L.F., Altschuler S.J., Rando O.J.. Genome-scale identification of nucleosome positions in S. cerevisiae. Science. 2005; 309:626–630. [DOI] [PubMed] [Google Scholar]
- 50. Mavrich T.N., Ioshikhes I.P., Venters B.J., Jiang C., Tomsho L.P., Qi J., Schuster S.C., Albert I., Pugh B.F.. A barrier nucleosome model for statistical positioning of nucleosomes throughout the yeast genome. Genome Res. 2008; 18:1073–1083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Shivaswamy S., Bhinge A., Zhao Y., Jones S., Hirst M., Iyer V.R.. Dynamic remodeling of individual nucleosomes across a eukaryotic genome in response to transcriptional perturbation. PLoS Biol. 2008; 6:e65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Tolstorukov M.Y., Kharchenko P.V., Goldman J.A., Kingston R.E., Park P.J.. Comparative analysis of H2A.Z nucleosome organization in the human and yeast genomes. Genome Res. 2009; 19:967–977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53. Consortium Mouse Genome Sequencing, Waterston RH, Lindblad-Toh K., Birney E., Rogers J., Abril J.F., Agarwal P., Agarwala R., Ainscough R., Alexandersson M. et al.. Initial sequencing and comparative analysis of the mouse genome. Nature. 2002; 420:520–562. [DOI] [PubMed] [Google Scholar]
- 54. Grewal S.I.S., Jia S.. Heterochromatin revisited. Nat. Rev. Genet. 2007; 8:35–46. [DOI] [PubMed] [Google Scholar]
- 55. Francastel C., Schubeler D., Martin D.I., Groudine M.. Nuclear compartmentalization and gene activity. Nat. Rev. Mol. Cell Biol. 2000; 1:137–143. [DOI] [PubMed] [Google Scholar]
- 56. Szauter P. An analysis of regional constrains on exchange in Drosophila melanogaster using recombination-defective meiotic mutants. Genetics. 1984; 106:45–71. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Eissenberg J.C., Elgin S.C.. The HP1 protein family: getting a grip on chromatin. Curr. Opin. Genet. Dev. 2000; 10:204–210. [DOI] [PubMed] [Google Scholar]
- 58. Tompitak M., Vaillant C., Schiessel H.. Genomes of multicellular organisms have evolved to attract nucleosomes to promoter regions. Biophys. J. 2017; 112:505–511. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Studitsky V.M., Clark D.J., Felsenfeld G.. Overcoming a nucleosomal barrier to transcription. Cell. 1995; 83:19–27. [DOI] [PubMed] [Google Scholar]
- 60. Teves S.S., Weber C.M., Henikoff S.. Transcribing through the nucleosome. Trends Biochem. Sci. 2014; 39:577–586. [DOI] [PubMed] [Google Scholar]
- 61. Gaykalova D.A., Kulaeva O.I., Volokh O., Shaytan A.K., Hsieh F.-K., Kirpichnikov M.P., Sokolova O.S., Studitsky V.M.. Structural analysis of nucleosomal barrier to transcription. Proc. Natl. Acad. Sci. U.S.A. 2015; 112:E5787–E5795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Hall M.A., Shundrovsky A., Bai L., Fulbright R.M., Lis J.T., Wang M.D.. High-resolution dynamic mapping of histone-DNA interactions in a nucleosome. Nat. Struct. Mol. Biol. 2009; 16:124–129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Ngo T.T.M., Zhang Q., Zhou R., Yodh J.G., Ha T.. Asymmetric unwrapping of nucleosomes under tension directed by DNA local flexibility. Cell. 2016; 160:1135–1144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Kristjuhan A., Svejstrup J.Q.. Evidence for distinct mechanisms facilitating transcription elongation through chromatin in vivo. EMBO J. 2004; 23:4243–4252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65. Schwabish M.A., Struhl K.. Evidence for eviction and rapid deposition of histones upon transcriptional elongation by RNA polymerase II. Mol. Cell Biol. 2004; 24:10111–10117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66. Jacques P.E., Jeyakani J., Bourque G.. The majority of primate-specific regulatory sequences are derived from transposable elements. PLos Genet. 2013; 9:e1003504. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67. Englander E.W., Howard B.H.. Nucleosome positioning by human Alu elements in chromatin. J. Biol. Chem. 1995; 270:10091–10096. [DOI] [PubMed] [Google Scholar]
- 68. Englander E.W., Wolffe A.P., Howard B.H.. Nucleosome interactions with a human Alu element. J. Biol. Chem. 1993; 268:19565–19573. [PubMed] [Google Scholar]
- 69. Tanaka Y., Yamashita R., Suzuki Y., Nakai K.. Effects of Alu elements on global nucleosome positioning in the human genome. BMC Genomics. 2010; 11:309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70. Sahu G., Wang D., Chen C.B., Zhurkin V.B., Harrington R.E., Appella E., Hager G.L., Nagaich A.K.. p53 binding to nucleosomal DNA depends on the rotational positioning of DNA response element. J. Biol. Chem. 2009; 285:1321–1332. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71. Laptenko O., Beckerman R., Freulich E., Prives C.. p53 binding to nucleosomes within the p21 promoter in vivo leads to nucleosome loss and transcriptional activation. Proc. Natl. Acad. Sci. U.S.A. 2011; 108:10385–10390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72. Soufi A., Garcia M.F., Jaroszewicz A., Osman N., Pellegrini M., Zaret K.S.. Pioneer transcription factors target partial DNA motifs on nucleosomes to initiate reprogramming. Cell. 2015; 161:555–568. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73. Kunarso G., Chia N.Y., Jeyakani J., Hwang C., Lu X., Chan Y.S., Ng H.H., Bourque G.. Transposable elements have rewired the core regulatory network of human embryonic stem cells. Nat. Genet. 2010; 42:631–634. [DOI] [PubMed] [Google Scholar]
- 74. Cui F., Zhurkin V.B.. Rotational positioning of nucleosomes facilitates selective binding of p53 to response elements associated with cell cycle arrest. Nucleic Acids Res. 2014; 42:836–847. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75. Cui F., Sirotin M.V., Zhurkin V.B.. Impact of Alu repeats on the evolution of human p53 binding sites. Biol. Direct. 2011; 6:2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76. Bao F., LoVerso P.R., Fisk J.N., Zhurkin V.B., Cui F.. P53 binding sites in normal and cancer cells are characterized by distinct chromatin context. Cell Cycle. 2017; 16:2073–2085. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77. Jiang C., Pugh B.F.. A compiled and systematic reference map of nucleosome positions across the Saccharomyces cerevisiae genome. Genome Biol. 2009; 10:R109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78. Treangen T.J., Salzberg S.L.. Repetitive DNA and next-generation sequencing: computational challenges and solutions. Nat. Rev. Genet. 2012; 13:36–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.








