Abstract
Spike-in normalization is a powerful approach to assess global changes in data obtained from genomic mapping of DNA-associated proteins by methods such as ChIP-sequencing (ChIP-seq)1,2 or CUT&RUN3. While multiple spike-in methods provide detailed documentation, the implementation of these approaches often omit critical quality control steps and veer from the established procedures. Spike-in normalization typically makes use of a single scalar to normalize genome-wide data, making the approach particularly vulnerable to errors in implementation. Here, we show that proper application of spike-in normalization can increase quantification accuracy across a spectrum of conditions and outline how misuse of spike-in approaches can create erroneous biological interpretations. We conclude by providing guidelines to minimize pitfalls when applying this approach to normalize data from protein-DNA interaction results.
Spike-in normalization emerged to accurately assess global changes in signal, yet misuse of the approach can skew results
Spike-in normalization was developed as a means to correctly quantify protein-DNA interactions in cases where the overall concentration of target DNA-associated proteins changes significantly between samples1,2,4. The spike-in of exogenous chromatin from another species to each sample prior to immunoprecipitation (IP) serves as an internal control, with the assumption that the epitope of interest does not vary in the added exogenous material. In a somewhat analogous manner to the use of the ERCC RNA standards5, this normalization strategy can reduce variability between replicates2 and capture changes in genome-wide signal intensity that would otherwise be obscured by standard read-depth normalization1. Multiple spike-in normalization approaches have been tailored for specific applications; as such they differ by the source of the exogenous chromatin (biological or synthetic), whether a separate antibody is used for the sample and the exogenous epitope, the ratio between the exogenous chromatin and the sample chromatin, and the downstream computational analysis pipeline (Table 1).
Table 1. Overview of spike-in methods and examples of misuse that create uncertainty in interpretation.
Left column summarizes commonly used spike-in normalization methods. Variations on background choice, normalization method, or spike-in species are highlighted. The most common deviations from published protocols involve separate alignment to the target and spike-in genomes and lack of sufficient/consistent input samples. Hs: Homo sapiens, Dm: Drosophila melanogaster, Mm: Mus musculus.
| Normalization tool | Normalization Model | Key limitations | Examples of Misuse |
|---|---|---|---|
| ChIP-Rx: Common antibody for the sample and spike-in chromatin1 | , = normalization constant = Spike-in reads |
The model assumes linear behavior of signal to epitope abundance. Normalization metric applied linearly across signal and background and does not employ the input to account for spike-in variation. | Inappropriate alignment to spike-in and target genome separately10 |
| Bonhoure et al.: A method using a common antibody for the sample and spike-in chromatin2 |
= genomic segment = sample counts = estimated from input, probability distribution of nonspecific reads (noise) = specific tag counts corresponding to protein occupancy scores (signal) = experimental errors |
Significant overlap between genomes. The model assumes linear behavior of signal to epitope abundance. Spike-in normalization: assumes the background-adjusted counts should be invariant between samples. Normalization only to “reliable signal” regions of the spike-in data |
Spike-in reads vary by ~10 fold and are too low for accurate quantification. Incomplete draft of the spike-in reference genome (D. iulia) assembly. Background regions are assumed, but not confirmed to be invariant11. |
| Epicypher ICeChIP: Synthetic nucleosome spike-in12 | Calculate the % Input for each gene locus of interest (or genome-wide for ChIP-seq). Calculate the % Input for the on-target SNAP-ChIP spike-in |
Limited to study of histone marks and common epitope tags. Synthetic histones must be purchased for each modification of interest. | Ratios of spike-in and sample input chromatin varies by ~9x between replicates and low spike-in read depth13. |
| Parallel ChIP: Spike-in specific antibody as well as antibodies are common for sample and spike-in chromatin14 |
Spike-in specific antibody: adjusted spike-in peaks to have distribution centered at zero (unchanged), applied same normalization factor to sample. Parallel ChIP: antibody for DNA-binding protein whose genomic distribution and intensities are invariant. Plot control peaks separately, adjust center distribution of peaks to be unchanged. |
Applied spike-in normalization in two ways, first with a spike-in specific antibody for H2Av, histone variant only in . Second approach normalized to a separate control antibody (Parallel ChIP). Use of a spike-in specific antibody assumes experimental procedures will not affect spike-in IP differently than the target IP. | Inappropriate alignment to spike-in and target genome separately. The close evolutionary distance between spike-in and target results in overlap of reads that align to both genomes15. |
| Egan et al.: A method using a spike-in specific antibody 16 | The spike-in normalization is performed by counting reads in each sample and using those read counts to generate correction factors; calculated as control/treated reads. | Spike-in specific antibody assumes experimental procedures will not affect spike-in IP differently than the target IP. No requirement for including inputs. | Inappropriate alignment to spike-in and target genome separately. Some input samples missing17. |
| SNP-ChIP: Use SNPs to enable spike-in of different S. cerevisiae strains18 |
= ratio of reads in target/spike-in species |
Normalization factor can only be derived from SNP regions18,19. Limited in application depending on target genome. | None identified in our survey. |
| Active Motif Spike-in Normalization Kit: (Cat #61686 and #53083) adapted from Egan et al16 | Select IP with lowest # of spike-in reads, divide by spike-in reads for that sample to get normalization factor. Down sample target reads by normalization factor. | No use of inputs in normalization to account for variable spike-in/target chromatin ratio. Separate spike-in specific antibody. | Inappropriate alignment to separate spike-in and target genomes. No input samples available20. |
Importantly, the implementation of spike-in normalization has several potential pitfalls. The basic assumption is that the ratio between the spike-in and the sample chromatin is identical between conditions, providing a constant signal to normalize against. Yet trusting this assumption without incorporating proper quality control (QC) steps may lead to erroneous normalization factors in the analysis (e.g., cases where initial spike-in to sample chromatin ratios vary). In addition to experimental error, key assumptions such as the expectation that ChIP enrichment is linearly dependent on the relative epitope abundance over a range of different spike-in concentrations and antibodies, as well as other experimental parameters, may not hold. Further, in contrast to other widespread approaches (e.g., quantile normalization) that incorporate the genome-wide distribution of ChIP enrichment to draw conclusions, most spike-in normalization techniques rely only on a single scaling factor to carry out the genome-wide transformation (Table 1). Thus, this single scaling factor may highly influence the quantitative results and biological interpretation obtained from such genome-wide assays.
Many original spike-in normalization methods address these vulnerabilities in their work; however, a major concern emanates from the improper use of spike-in normalization by studies that deviate from the original protocols cited. Such misuses include: (1) a lack of critical QC steps leading to large variability between ratios of spike-in to sample chromatin or unsuccessful ChIP of the spike-in; (2) deviations from the recommendations in the original method (e.g., alternative alignment strategies); and (3) absence of true biological replicates that would otherwise reveal unexpected variation. These misusage cases call into question the validity of some of the post-normalization conclusions.
While the discussion outlined above refers to usage of spike-in chromatin to normalize ChIP-seq experiments, attempts to employ similar normalization procedures have also been applied to other genome-wide mapping methods such as CUT&RUN and CUT&Tag3,6. However, in some of these cases the term “spike-in” indicates the addition of exogenous naked DNA instead of chromatin. For instance, either spike-in of Drosophila DNA or carryover E.coli DNA from MNase purification are added to CUT&RUN as a means for normalization of library preparation3,6,7. We note that the use of the terms “spike-in calibration” or “normalization” in these cases does not account for variation from antibody efficiency and thus caution the reader to take that into consideration when conducting analysis and data interpretation. Importantly, CUT&RUN was shown to benefit from the addition of exogenous cells8 or synthetic nucleosomes to the samples9. In general, we recommend choosing the spike-in methods that account for as many potential sources of experimental variation as possible. Ideally, this means a spike-in containing the epitope of interest from biological material resembling the sample (e.g., cells or chromatin) as this strategy provides the most quality controls and assessments of antibody efficiency, sample handling and processing, etc. In this correspondence, we focus on methods and datasets that employ exogenous spike-in chromatin to normalize ChIP-seq data.
Here, we first provide examples to demonstrate how proper application of spike-in normalization can increase quantification accuracy. We then summarize our survey of misuse cases from the literature and discuss how the inappropriate employment of spike-in normalization could affect the resulting biological observations. Lastly, we present an example to convey the importance of using the appropriate computational approach for correcting variations in spike-in normalization and conclude by providing guidelines to minimize pitfalls when using this approach.
Proper application of spike-in normalization can increase quantification accuracy across signal ranges
To evaluate the ability of spike-in normalization to correctly quantify variations in the abundance of DNA-associated proteins we focused on experimental conditions with pre-defined ground truth. We initially reanalyzed the data from one of the original spike-in methods1, which includes titration of H3K79me2 levels over a 10-fold range (Supplemental Fig. 1a) by mixing known ratios of cells treated and untreated by the DOT1L inhibitor. This data, as was previously shown1, provides a clear demonstration of the ability of the spike-in normalization approach to improve upon read-depth normalization and correctly quantify enrichment over the range of signal intensity (Fig. 1a).
Fig. 1. Demonstration of the ability of spike-in normalization to accurately capture signal variation over wide and narrow dynamic ranges.

(a) Reanalysis of data from Orlando et al.1, where untreated cells (high H3K79me2 signal) were mixed with DOT1 inhibited Jurkat cells (low H3K79me2 signal) in 5 ratios of treated/untreated (0/100, 25/75, 50/50, 75/25, 100/0). Dynamic range of H3K79me2 signal was measured by using Western blot results from Orlando et al.1 with ImageJ; the change in H3K79me2 between 0% treated and 100% treated samples was approximately 10-fold (Supplemental Fig. 1a). Maximum signal (10X) and minimum signal (1X) are labeled on x axes. The H3K79me2 ChIP-seq signal was quantified and plotted against the line of expected signal either by using standard read depth normalization (left) or spike-in normalization (right). Accuracy of the fit of the expected line was determined by R squared (the value is reported in top right of each plot). (b) The results of a similar titration experiment generated to focus on a narrower dynamic range. Mitotic-arrested cells were generated by treating with thymidine then S-trityl-cysteine (STC), hereafter termed “mitotic”. We estimate approximately 85-90% of cells arrested in prometaphase by this method. From previous mass-spectrometry data21, the approximate fold change between mitotic H3K9ac (low H3K9ac signal) and interphase H3K9ac (high H3K9ac signal) was 3x, labeled on x axes (Supplemental Fig. 1b). We used interphase cells (high H3K9ac) mixed with mitotic-arrested cells (low H3K9ac) in six ratios (100/0, 95/5, 75/25, 50/50, 25/75, 0/100). To each of the samples, we spiked-in both D. melanogaster and S. cerevisiae chromatin. Quantification of the H3K9ac signal after read-depth normalization (left) or spike-in normalization using Drosophila (right) was plotted as in (a). The normalization using S. cerevisiae provided similar results (Supplemental Fig. 3). Within each plot, H3K79me3 or H3K9ac signal was min-max normalized according to the following equation: . Here, is the average minimum signal (100% treated cells in (a) or 100% mitotic cells in (b)), is the average maximum signal (0% treated cells in (a) or 0% mitotic cells in (b)). is the signal for each sample and is the minmax normalized signal for each sample. QCs for the dataset are in Supplemental Fig. 2-6 and Supplemental Table 3.
Next, building on the observations by Orlando et al.1, we conducted additional titration experiments to evaluate the spectrum of conditions where spike-in normalization can operate. Our previous mass-spectrometry results showed that mitotic cells have about 3-fold reduction in H3K9ac compared to interphase (Supplemental Fig. 1b)21. Using mitotic and interphase cells, we conducted a titration experiment focused on this narrow range of acetylation levels (from 1x to 3x; Fig. 1b; Supplemental Fig. 2). Importantly, the narrow signal range provides a powerful demonstration of the inability of standard read-depth normalization to capture the expected trend (Fig. 1b, left). On the other hand, spike-in normalization effectively separates the samples based on their expected signal at this dynamic range (Fig. 1b right; Supplemental Fig. 3).
The impact of misuse of spike-in normalization on downstream results
We identified in the literature several common scenarios of inappropriate employment of spike-in normalization (Fig. 2). We first consider each of these scenarios and the potential effect of such spike-in normalization misuse on the resulting biological conclusions. Next, we provide an example for the widespread inappropriate implementation by reanalysis of public datasets using this normalization approach.
Figure 2. A schematic depicting the impact of misuse of spike-in normalization on downstream results.

Left, “True Enrichment” shows a hypothetical scenario of two conditions with variable histone acetylation profiles. Panels a-e show various scenarios observed in literature and their effect on both local signal (depicted as schematics of individual ChIP-seq peaks) and global signal (shown as a scatterplot of log-normalized signal for all peaks for each condition). The data represents 3 replicates, while for simplicity, the peaks and scatter plots show only the average of the replicates. Of note – these scenarios are not mutually exclusive, and combinations of these misuse of spike-in normalization can occur.
(a) Traditional ChIP-seq with no spike-in added – the signal appears identical between the two conditions, not capturing the true patterns.
(b) Properly performed normalization with spike-in ChIP (blue) yields results that accurately follow the original ground truth.
(c) Variable proportion of spike-in added to the samples, this skews the analysis and condition #2 appears to wrongly have higher signal.
(d) Low yield of spike-in data, either from input chromatin levels or technical issues. This precludes any QC of the spike-in. Also, the low number of reads that align to the spike-in genome could highly skew the results one way or the other (depicted as overlaying normalized peaks and two plots in both directions).
(e) Exogenous chromatin from a species which is phylogenetically close to the sample species. This could skew the results in both directions, depending on the percent of misassigned reads in each sample (depicted as in (d)).
No spike-in.
For ChIP-seq without spike-ins, global changes are lost after read-depth normalization, and the samples appear identical1,2 (Fig. 2a).
Proper, constant spike-in/sample ratio.
In contrast, proper implementation of the spike-in normalization approach that includes a constant ratio of exogenous/target chromatin accurately captures the ground truth of relative signal between samples (Fig. 2b, green checkmark).
Variable spike-in/sample ratio.
Variation between exogenous/target chromatin ratios between ChIP-seq conditions (Fig. 2c). In this example, the normalization distorts the underlying biology in a manner that is dependent on the ratio, resulting in the normalized enrichment of the target epitope deviating from the ground truth. Deceptively, a QC of the spike-in by read-depth normalization will obscure the differences in total epitope amount, in a similar manner to the ChIP-seq of “no spike-in” samples in Fig. 2a.
One of the most important QC steps for the processing of spike-in experiments is to analyze the non-IP input (control) for all samples. Input samples contain the sheared chromatin that was not used for ChIP and serve as a control to identify non-specific signal. The input can also be used to directly measure the sample/spike-in chromatin ratio that was initially present before the ChIP step is performed, which should be consistent across all samples.
To evaluate the breadth of studies that fail to conduct this QC, we surveyed both GEO datasets and Google Scholar results. We first included 4 studies that initially developed spike-in normalization methods1,2,14,16 and experimental data we generated (Fig. 2b). The remainder of datasets were chosen more agnostically, by surveying GEO and Google Scholar results for the phrase “spike-in ChIP”, then sorting the results according to best match/relevance, and lastly filtering out irrelevant datasets (no ChIP-seq data, or lack of spike-ins as previously defined in this correspondence). We aimed to include a diversity of species used (spike-in and target) and epitope types (histone mark vs transcription factor), arriving at a final set of 53 datasets in our survey, expecting that these data make up a sizable proportion of all ChIP-seq spike-in studies. Out of these 53, only 27 included the appropriate input samples for each condition (Fig. 3 insert)1,10,11,13-15,22-42. In 12 of the datasets2,17,42-51, a subset lacked inputs for some conditions or treatments, making it impossible to QC the spike-in/target chromatin ratio. The last 14 datasets16,20,52-63 lacked inputs altogether (Fig. 3 insert).
Figure 3. Variations in the ratios of spike-in to sample chromatin in public datasets.

We examined 53 datasets. 51 of these were obtained from our survey. We additionally included the data we generated for Fig. 1b and Supplemental Fig. 2 (n=2). Of these only 27 had sufficient input samples for each condition (52.9%, pie chart; insert). One additional dataset plotted had > 1 input but was still missing inputs for some conditions. We aligned the inputs to a concatenated spike-in and target genomes (Methods). For each dataset, the input with lowest ratio of spike-in/sample reads was scaled down to 1, to capture the variation between input samples in the same dataset. Y-axis is shown on log2 scale to capture the diversity in variation within datasets. Alignment information for each individual sample is available in Supplemental Table 1, GEO accessions for each dataset are in Supplemental Table 5.
Of the datasets with > 1 input sample, the median variation of the spike-in/target chromatin ratio is 1.4-fold, with the extreme variation being 12.4-fold (Fig. 3; Supplemental Table 1)11. Thus, the assumption that equal amounts of exogenous chromatin were incorporated into all samples should be questioned. Further, this highlights the importance of using the input samples for every sample and condition for QC, as the inputs may provide an opportunity to correct variations in the spike-in/target chromatin ratios as is recommended previously2,53. Otherwise, an unaccounted-for variation in the spike-in/sample ranges may result in erroneous scaling that can lead to invalid results in a similar manner to the scenario in Fig. 2c.
Low spike-in/sample ratio.
In cases where the ratio of exogenous chromatin added is extremely low, it can be impossible to conduct QC for the ChIP-seq of the spike-in chromatin. This may also increase the noise due to the higher variability associated with the potentially low read counts originating from the exogenous chromatin (Fig. 2d). We conducted a literature survey to evaluate the abundance of such cases, finding that the genome coverage of the spike-in samples ranged between 0.00012X to 6.5X. In 15 of the studies the genome coverage of the spike-in samples was low (< 0.1X)2,10,11,14,23,26-29,31,33,34,38,41,43. In an extreme example11, where the spike-in was designed to be at a ratio of ~1:10,000 of the sample, the coverage of the spike-in genome was 0.00012X to 0.000375X11.
Alternative alignment strategies.
Original spike-in protocols indicate that the alignment is ideally conducted by using a concatenated genome from both the exogenous and target species, where multimapping reads are discarded1,2. While aligning the sequencing data separately to the two genomes may provide a benefit, and for applications where the expected enrichment occurs in repeat regions it could potentially improve accuracy, such an alignment approach requires careful post processing. For instance, reads that align to both genomes may be easily double counted, thus potentially rendering the analysis inaccurate (Fig. 2e, Supplemental Fig. 7 and Supplemental Table 2). This risk increases when using closely related spike-in and target genomes such as H. sapiens and M. musculus (as observed in 15 datasets out of the 53 we surveyed)2,14,15,25,28,29,41,43,44,48,52,53,56,59,61.
Accounting for variation in the spike-in/sample ratio may increase quantification accuracy across a wide range
Many of the public datasets we surveyed had variability in the spike-in/sample ratio across conditions (Fig. 3, above). To evaluate the ability of correcting for such deviations between samples, we generated a dataset where we only varied the ratio of spike-in to target chromatin and quantified differences in enrichment. We used mitotic human cells (synchronized HeLa-S3 with low H3K9ac) that were treated with the pan-HDAC inhibitor TSA which increases the global levels of histone acetylation64 or DMSO as a control. Using the same chromatin preparation from either TSA- or DMSO-treated cells, we varied the ratio of the spike-in to target chromatin over 5 orders of magnitude (from 0.00025x to 2.5x; Fig. 4; Supplemental Fig. 8; Supplemental Table 4). Using this data, we compared three commonly used normalization approaches: (1) standard read-depth normalization (Fig. 4a); (2) normalization of relative IP ratios of spike-in and target under the assumption that the spike-in is constant (in a similar manner to previous studies1,16, Fig. 4b); and (3) using the non-IP input controls to detect the ratios of spike-in to target chromatin present in each experiment to calibrate IP normalization (Fig. 4c).
Figure 4. An example of the ability of proper normalization strategy to correct for variations in the spike-in/sample ratio.

Mitotically arrested HeLa-S3 cells were treated with either the pan-HDAC inhibitor TSA (to increase global acetylation) or DMSO as a control. For each treatment, using the same chromatin preparation the amount of added spike-in was varied over five orders of magnitude (0.00025 to 2.5x spike-in/target ratio). The data were normalized by three approaches: (a) read-depth normalization – the change in the signal between the conditions cannot be observed; (b) normalization of relative IP ratios of spike-in and target – most of the change in the signal is not observable and the amount of spike-in added impacts the general signal; and (c) account for the ratio of spike-in/target by using the non-IP input controls and then normalize as in (b) using the relative spike-in/target IP – the known differences between the conditions can be detected in all cases but the one where the amount of spike-in is higher than the target. QCs for the dataset are in Supplemental Fig. 8-14 and Supplemental Table 4.
These data show that both read depth normalization (Fig. 4a) or ignoring the true ratio of the spike-in to target chromatin (Fig. 4b) is prone to fail in capturing the ground truth. When we account for the ratio of spike-in/target by using the input and then normalize using the spike-in IP, the change in signal between conditions can be detected across most of the tested spectrum of spike-in/target ratios (Fig. 4c). Importantly, following this normalization approach, the target signal, which should remain invariant, fluctuated by a ratio of 0.74-2.33x when we increased the spike-in/target ratio by 10x in a stepwise manner (Fig. 4c). Thus, while some level of variation in the spike-in/target ratio can potentially be corrected for by using the inputs as recommended previously2, it is critical to minimize the variability in this ratio between conditions. Lastly, this observation also demonstrates that the ability of spike-in normalization to quantify true signal can be impacted by the amount of spike-in chromatin added.
Recommendations and Future Directions
Given the risk of misusing spike-in normalization, and the high number of occurrences of such instances, we provide several guidelines for avoiding pitfalls when implementing spike-in normalization for ChIP-seq experiments.
The QC steps of the spike-in are required. This includes both measuring the spike-in/target ratio for each sample by isolating and sequencing the unenriched input sample. It is also critical to analyze the ChIP-seq signal for the spike-in itself – i.e., by visual inspection on a genome browser such as UCSC65 or IGV66 as well as peak calling and metagene analysis.
The spike-in species should have a well annotated and complete genome assembly, ideally a model species such as D. melanogaster.
-
The proper experimental implementation of spike-in chromatin should be guided by multiple considerations: (1) the raw spike-in chromatin material must be sufficient for verifying a successful IP when analyzing the data (measured as detailed in Methods); (2) the amount of target chromatin relative to spike-in should allow sufficient sequencing of the mixed species while remaining within practical sequencing depths; and (3) the depth of sequencing for the mixed species should account for the added genome of the spike-in samples and follow the ENCODE guidelines69 to provide sufficient coverage.
The above parameters will be affected by the size of the spike-in genome, the epitope of interest (and its prevalence in spike-in and target samples), as well as the overall experimental design. As an example, based on our experience and studies by others1, when using spike-in chromatin from species such as D. melanogaster or S. cerevisiae to detect changes in histone modifications in H. sapiens, we recommend aiming for a genome coverage of about 0.1x for D. melanogaster (~100-150k reads) and a genome coverage of about 0.5x for S. cerevisiae (~50-100k reads), given the high gene density of the S. cerevisiae genome67.
To decrease variability in spike-in/target chromatin ratios, we quantify DNA before combining chromatin from each species. Additionally, we caution that for some applications, the exogenous chromatin may take up more reads in the IP than expected, and titrations of the amounts may be useful to identify optimal conditions (detailed in Methods).
Each experiment should include 3–4 biological replicates to ensure reproducibility. Additionally, confidence intervals of spike-in/sample signal should be plotted to enable estimation of the variability.
To determine the acceptable level of variation between conditions for the ChIP signal of the exogenous chromatin, we recommend applying the Irreproducible Discovery Rate (IDR) calculation, as suggested by the ENCODE’s guidelines for assessing variability between replicates69.
The variation of spike-in/target unenriched inputs should not exceed the variation of spike-in/target ChIP reads (this variation is measured as ChIP reads aligning to spike-in genome divided by ChIP reads aligning to target genome, see Methods). Otherwise, rather than the change in true ChIP signal between spike-in and target, the technical variation in spike-in quantities will have an outsized impact on the normalization factors.
Alignment to a merged genome that combines both the target and spike-in species and stringent filtering (keeping only primary alignments and MAPQ>10) decreases the chances of ambiguous alignment.
Lastly, for robust and reproducible results, the conclusions of the experiment should be validated by an orthogonal assay such as quantitative Western blots of the chromatin fraction, immunofluorescence assays, mass spectrometry, or other genetic or biochemical evidence depending on the context of the study.
Further, spike-in normalization methods are under active development and improvements that may address current key limitations of the strategy. Here, we discuss some of the main future directions:
Most spike-in normalization methods employ a single scaling factor to transform the genome-wide data resulting from ChIP-seq. This global transformation of the entire dataset may not accurately reflect the true values in all regions, particularly when considering ChIP-enriched peaks versus nonenriched background regions. Recent ChIP-seq normalization methods attempt to address this limitation by scaling signal and background separately2,70. An additional way to improve the robustness of the applied scalar is to curate the regions where spike-in reads are quantified to derive the normalization factor (e.g., limiting the analysis to regions surrounding the TSS when analyzing active histone marks).
One strategy to increase confidence in the spike-in normalization factor is to employ two exogenous spike-ins as we demonstrated here (Fig. 1b and Supplemental Fig. 3). Each spike-in species is used to generate its own normalization factor. Applying both normalization factors reduces the effect of noise inherent from using each alone and the two species enhance the QC.
Rather than deriving a normalization factor based on the simple sum of all spike-in reads, it may be more prudent to control for the effect of outliers in a similar manner to DESeq271 by using the middle quartiles of the spike-in peaks to improve the robustness of the normalization constant calculation.
Altogether, the proper use of spike-in normalization improves the sensitivity for detecting genome-wide changes between conditions. Given the increase in the use of ChIP-seq and similar genomic approaches to identify the impact of perturbations with widespread effects, we envision that spike-in normalization along with the advancement of this method, will be critical to identify new biological observations.
Methods:
Public Data Analysis
Detailed code is available at the Github repository link in the Additional Data section, however the basic procedure for calculating alignment statistics is shown below:
-
All FASTQ files downloaded using sra-toolkit fasterq-dump:
$ fasterq-dump --skip-technical --split-3 SRR#####
-
FASTQ files aligned with Bowtie2 default parameters to a concatenated reference genome containing spike-in and target genomes. Spike-in chromosomes labeled with suffix of genome version to distinguish from target chromosomes.
$ bowtie2 -p 24 -x target_spike-Bowtie2Index/target_spike SRR#####.fastq -S SRR######.dualalign.sam
-
Aligned file sorted, then split into species-specific .bam files with bamutil splitChromosome
$ samtools sort SRR######.sam > SRR######.sorted.bam
$ bam splitChromosome --in SRR######.sorted.bam –out SRR#####.
-
Unmapped, non-primary alignments removed, and reads with MAPQ<10 removed with HOMER makeTagDirectories. Fragment length explicitly set to 150.
$ makeTagDirectories SRR######-tagdir/ -genome genome -fragLength 150 SRR######.genome.bam
*NOTE*: The number of aligned reads for paired end Read 1 may not equal Read 2, therefore “total tags” may be reported in intervals of 0.5.
-
5.
Total read counts tallied in Supplementary Table 1.
Genome versions:
H. sapiens: hg38
D. melanogaster: dm6
D. pseudoobscura: dp3
M. musculus: mm10
D. iulia: GCA_019049465.1
Synthetic nucleosome sequences from Epicypher: Kmet_stat panel
Experimental Methods
Cell Culture
Hela-S3 Cell Media and Culture:
HeLa-S3 cells were grown in Dulbecco’s Modified Eagle’s Medium (DMEM) supplemented with 10% fetal bovine serum (FBS), 1x penicillin-streptomycin, L-glutamine, and 110mg/L sodium pyruvate. Cells were incubated at 37°C and 5% CO2.
S2 Cell Culture:
Drosophila S2 cells were grown in Schneiders Drosophila Medium at 28°C in the dark.
Yeast Cell Culture:
Yeast Cell Culture: From frozen stocks, starter cultures of strain LPY6494 were inoculated in 3mL of Gibco liquid YPD media (Cat #A1374501), grown to log phase at 30°C at 220rpm, then expanded to 50mL cultures at an OD of 0.08 and grown for 16 hours at 30°C and 220rpm. OD was estimated with Nanodrop. Yeast culture and fixation performed as previously described72.
Hela-S3 Mitotic Synchronization
Cells were synchronized in G1/S by adding 2 mM thymidine (Sigma T1895-1G) for 16 hr, washed with PBS, released for 3 hr in fresh medium, and pro-metaphase arrested with 10uM S-trityl-L-cysteine (STC) (Sigma 164739) for 12 hr then harvested. For simplicity, we term these cells as “mitotic”, even though according to our estimates approximately 85-90% of cells are arrested in prometaphase. When specified, cells are treated with DMSO or 150nM TSA (Selleck Chemicals S1045) 9 hours before harvesting.
ChIP-seq
ChIP-seq was performed as previously described73. Briefly, HeLa-S3 cells were double crosslinked on plates with 2 mM DSG for 30 minutes (Fisher Scientific NC1736108) and 1% formaldehyde (Pierce ThermoFisher PI28906) for 10 minutes. Crosslinking was quenched with 1/20th volume of 2.625 M glycine (EMD Millipore 357002) and 0.5% BSA (NEB B9000S), then cells were washed 3 times with 0.5% BSA/PBS + Protease inhibitor (PIC, Roche 11697498001), spinning down at 1200g for 5 minutes at 4C. Washed pellets were snap frozen then stored in −80C. Cells were lysed with LB3 lysis buffer + PIC, then chromatin was sheared on the PIXUL sonicator for 108 minutes (Active Motif 53130). S2 cells were crosslinked in 1% formaldehyde for 10 minutes at room temperature, then quenched with glycine and 0.5% BSA, lysed in LB3, and sonicated on the PIXUL for 60 minutes. Yeast cultures in log phase were crosslinked with 0.86% formaldehyde for 30 minutes. The yeast cell wall was digested with Zymolase at 37°C for 60 minutes, cells were washed 3x in PBS + PI, then resuspended in NP-S buffer + PI. Yeast were sonicated on the PIXUL for 120 minutes. All PIXUL sonication was done with settings Pulse (N) = 50, PRF (kHz) = 1.00, Burst Rate (Hz) = 20, only Process Time was varied.
Sheared chromatin from each species were combined and then inputs were taken before immunoprecipitation for each condition to assess the relative ratios of chromatin. For the experiment detailed in Fig. 1b, where H3K9ac was titrated in mitotic and interphase HelaS3 cells, chromatin from approximately 600k HeLaS3 cells was combined with chromatin from 500k S. cerevisiae and 100k D. melanogaster S2 cells in each IP. For the experiment detailed in Fig. 4, where we titrated S. cerevisiae spike-in amounts, chromatin from S. cerevisiae was added in ratios ranging from 0.00025-2.5x relative to HeLaS3 cells.
Notes on minimizing variability between spike-in/target ratios:
Since there are multiple opportunities to introduce variability between cell counting and combining of material from each species, it is difficult to maintain constant spike-in/target chromatin ratios. To overcome this challenge, rather than relying on cell count estimates, we measure sheared DNA quantities as means to estimate chromatin amounts prior to combining material from each species.
Thus, following shearing by PIXUL, a subset of chromatin was de-crosslinked, the RNA was removed with RNAse A (Roche-SIGMA 11119915001) and protein was removed with Proteinase K (Invitrogen 25530049). The resulting DNA is quantified with Qubit HS DNA kit (Invitrogen Q32854). Then the samples can be calibrated such that equal amounts of spike-in and target chromatin are added for every condition. Except for the experiment where yeast chromatin was deliberately varied, the estimated number of yeast cells added per IP is no more than 1x the number of human cells. As the S. cerevisiae genome is gene dense and the epitope of interest is a histone acetylation mark, S. cerevisiae makes up a significant proportion of reads in the IP (Supplemental Table 3). In a sample with minimal target IP signal, e.g. 100% STC-treated (mitotic) cells, S. cerevisiae accounts for ~0.8% of the input reads and ~30% of IP reads from the same chromatin pool (Supplemental Table 3, Supplemental Fig. 2).
To each IP, 10uL of Protein A beads (Thermo Fisher Scientific 10002D) and 4uL of H3K9ac (CST 9649) were added, 1% Triton X-100 and 1x PIC were added. Samples were incubated overnight on rotator at 4C. IP Washes were performed as follows: 3x with 180uL of WBI, 3x with 180uL of WBIII, and 2x with 185uL TET, then eluted in 25uL of cold TT Buffer (buffer components below). Inputs were incubated with Speedbeads (Cytiva 65152105050250), in 12% PEG (Millipore Sigma P5413) and 1M NaCl for 10 minutes at room temperature, then washed twice with 80% EtOH, and eluted in 25uL cold TT Buffer. From inputs and IPs, libraries were prepared with the NEB Ultra II Library Prep Kit (NEB E7645L). Barcoded adapters were from BioO NEXTflex ChIP-seq Adapters (NOVA-514104 & NOVA-514151). Libraries were de-crosslinked at 55C 1hr, then 65C overnight, RNA removed with RNAse A (Roche-SIGMA 11119915001) and protein removed with Proteinase K (Invitrogen 25530049), and cleaned with Speedbeads, 8.6% PEG and 0.8M NaCl. Next libraries were PCR amplified with 1uM Solexa 1GA and 1GB primers and NEB Ultra II Q5 Mastermix (PCR protocol: 98C for 30’’, then 12 cycles of 98C for 10’’, 60C for 15’’, 72C for 30’’, then hold at 72C for 2’). A final cleanup was done with Speedbeads, 8.5% PEG and 1M NaCl, washed twice with 80% EtOH, then the final libraries were eluted in 20uL TT. Library size was estimated on a 2% agarose gel and concentration quantified with Qubit HS dsDNA kit (Invitrogen Q32854). Libraries were pooled and sequenced on Illumina NextSeq 550.
Buffer components:
NP-S Buffer: 10mM Tris-HCl pH 7.4, 0.5mM EDTA, 50mM NaCl, 5mM MgCl2, 1mM CaCl2, 0.075% NP-40, 1mM 2-mercaptoethanol
LB3: 10mM Tris-HCl pH 7.5, 100mM NaCl, 1mM EDTA, 0.1% deoxycholate, 0.5% Sarkosyl
WBI: 20mM Tris-HCl pH 7.5, 150mM NaCl, 1% Triton X-100, 2mM EDTA, 1% SDS
WBIII: 10mM Tris-HCL pH 7.5, 250mM LiCl, 1% Triton X-100, 1mM EDTA, 0.7% deoxycholate
TET: 10mM Tris-HCl pH 8.0, 1mM EDTA, 0.2% Tween 20
TT Buffer: 10 mM Tris-HCl pH 8.0, 0.05% Tween 20
Bioinformatic Analysis
An overview of data processing and analysis steps is shown in Supplemental Fig. 15, and details that vary between datasets are described below.
Data Processing and Initial Analysis
Samples were demultiplexed with Illumina’s bcl2fastq program, adapters were removed with Trimmomatic, and FASTQC was run for sequencing quality control. Reads were aligned with BWA MEM to a concatenated genome containing hg38 + spike-in species (dm6, sacCer3, or both). The aligned sam file was sorted and split by chromosome using bamUtil splitChromosome. Spike-in chromosomes were pre-labeled with a suffix (_sac3 or _dm6), samtools merge was used to recombine alignments for each species. Duplicate reads were removed with samtools markdup. For easier downstream processing, HOMER tag directories were made; by default read counts for each species were filtered for primary alignments with MAPQ > 10. Signal was quantified at peaks with HOMER annotatePeaks. Peak finding for each individual sample was done with HOMER findPeaks, with - style histone, -size 1000 -minDist 2500 with the corresponding inputs as is recommended for variable length peaks: http://homer.ucsd.edu/homer/ngs/peaks.html; FRIP score for each sample was estimated in output file of peak finding.
For the yeast spike-in titration dataset, scatterplots of peak counts were generated to compare spike-in titration conditions in Supplemental Fig. 14. Sample IPs were combined into one tag directory, and all inputs combined similarly, then HOMER findPeaks was done with the recommended variable length peak parameters on this “mega-sample” using the “mega-input” as the input control. The increased read-depth of both sample and input files will increase the tag threshold for peak finding, resulting in more significant peaks. Read-depth normalized signal at each peak was quantified with HOMER annotatePeaks using mega-sample peaks to determine quantified regions, with additional parameters -size 1000. For each treatment condition (TSA or DMSO), signal in each increasing yeast ratio (0.0025-2.5x) were plotted against the sample with the minimum yeast ratio (0.00025x).
Visualization
Bigwig files were created with HOMER makeBigWig and viewed on the UCSC Genome Browser. Heatmaps were generated using Deeptools computeMatrix and plotHeatmap with the union set of all peaks called for each experiment, with additional parameters: reference-point --referencePoint center -b 2000 -a 2000. Scatterplots were generated from HOMER annotatePeaks quantification of signal at peak or tss regions as specified in figure captions. Plots generated with R, ggPlot2, and cowplot.
Normalization
ChIP-Rx titration of H3K79me2:
ChIP-Rx dataset from Orlando et al. was normalized according to the previously published method, with additional steps outlined below. The experiment was performed in human Jurkat cells, with Drosophila S2 cells as a spike-in. Therefore, all previously described alignment steps were followed with D. melanogaster and H. sapiens alignments, with Drosophila chromosomes labeled with the “_dm6” suffix. The initial ChIP-Rx method determines the normalization factor as , where = (number of reads aligning to the Drosophila genome/1000000)1. We additionally applied an adjustment factor accounting for the variability in spike-in/target chromatin present in the input samples. The ratio of spike-in/target input reads is defined by the equation . was scaled as in Fig. 3 by so that the sample with the lowest is set to 1. The final normalized signal is therefore calculated as As H3K79me2 is predominantly located on promoters, we quantified signal at TSS using similar parameters as Orlando et al., measuring 5kb upstream and downstream of the TSS with HOMER annotatePeaks.pl tss hg38 - size 10000 -hist 25. TSS histograms were made first with traditional read-normalization (the default behavior of HOMER annotatePeaks). Spike-in normalization was done by multiplying read-normalized histogram signal by the spike-in normalization factor “IP_input_factor” for each sample.
Titration of H3K9ac:
The dataset containing a titration of mitotic and interphase HelaS3 cells includes spike-in chromatin from two separate species, D. melanogaster and S. cerevisiae. Alignment was done against the concatenated genome of hg38 + dm6 + sacCer3. After splitting alignment files into each species, removing PCR duplicates, and filtering reads by making HOMER tag directories, reads aligning to each species were quantified as shown in Supplemental Table 3. Normalization factors for each spike-in species were determined as follows: norm_factor = (spike-in IP reads/target IP reads) * (target input reads/spike-in input reads). Normalization factors for each spike-in species, named “fly norm factor”, and “yeast norm factor” are listed in Supplemental Table 3. H3K9ac is present on both promoters and enhancers, so we sought to quantify signal at a set of H3K9ac peaks across all samples in the experiment. Sample IPs were combined into one tag directory, and all inputs combined similarly, then HOMER findPeaks was done with the recommended variable length peak parameters on this “mega-sample” using the “mega-input” as the input control. The increased read-depth of both sample and input files will increase the tag threshold for peak finding, resulting in more significant peaks. H3K9ac signal was quantified at these peaks with HOMER annotatePeaks, using the mega-sample peak file as first argument to define quantified regions, then additional parameters hg38 -size 4000 -hist 25. Spike-in normalization was done by dividing read-normalized histogram signal by the spike-in normalization factor “fly norm factor” or “yeast norm factor” for each sample.
Titration of S. cerevisiae spike-in:
The spike-in titration dataset, comparing TSA and DMSO treated mitotic Hela-S3 cells contained S. cerevisiae chromatin, added in ratios ranging from 0.00025-2.5x that of the target chromatin. Alignment was done against the concatenated genome of hg38 + sacCer3. After splitting alignment files into each species, removing PCR duplicates, and filtering reads by making HOMER tag directories, reads aligning to each species were quantified as shown in Supplemental Table 4. Spike-in normalization was performed according to two commonly used strategies: 1) normalizing to spike-in IP alone, and 2) normalizing to both spike-in IP and input chromatin.
The normalization factor was calculated as the ratio of yeast/human reads multiplied by a scaling factor of 200 to account for relative genome size and is reported in Supplemental Table 4 as “yeast/human reads*200”. When normalizing to spike-in IP alone, data were normalized by dividing by the normalization factor, “yeast/human reads*200” column for each IP sample. Signal normalized to spike-in IP is calculated as .
When normalizing to spike-in IP and input, the normalization factor “yeast/human reads*200” for each IP was divided by the corresponding input value in the same column. The resulting normalization factor accounting for IP and input is reported in Supplemental Table 4 as the “spike_IP_input_norm”. Spike-in normalization was done by multiplying read-normalized histogram signal by the spike-in normalization factor “IP_input_factor” for each sample. Spike-in normalization was done by dividing read-normalized histogram signal by the spike-in normalization factor “IP_input_factor” for each sample. The final normalized signal is calculated as
Quantification of Normalized Signal
ChIP-Rx titration of H3K79me2 & Titration of H3K9ac:
Total signal was estimated by calculating area-under-the-curve for both read-depth normalized and spike-in normalized histograms (see Github). Total signal for each sample was then min-max normalized by the equation . Here, is the average minimum signal (H3K79me2 titration: 100% treated cells, H3K9ac titration: 100% mitotic cells), max(x) is the average maximum signal (H3K79me2 titration: 0% treated cells, H3K9ac titration: 0% mitotic cells). is the signal for each sample and is the minmax normalized signal for each sample. The resulting data is scaled such that the average minimum signal is set at 0, and the average maximum signal is set at 1. This allows more intuitive comparison across datasets and normalization methods. The accuracy of each normalization method is quantified by calculating the R2 value of data against the line of expected signal (diagonal line shown in plots of Fig. 1). For the minmax normalized data, the line of expected signal is simply the diagonal line connecting maximum (y=1) and minimum (y=0) signal. R2 is calculated by the equation where and . R2 values are reported in Fig. 1 as inserts in plots.
Titration of S. cerevisiae spike-in:
While there is existing evidence of the effect of TSA on increasing histone acetylation in mitotic HelaS3 cells, this dataset contains no known maximum/minimum values with which to construct a standard curve. Therefore, H3K9ac signal was quantified at every peak with HOMER annotatePeaks -size 1000, and the distribution of signal across peaks were shown as violin plots, comparing read-normalized (Fig. 4a), spike-in IP normalized (Fig. 4b), and spike-in IP and input normalized (Fig. 4c) peak counts. Additionally, the ratio of TSA/DMSO counts in each spike-in titration condition were plotted for each normalization method (Supplemental Fig. 14).
Supplementary Material
Supplemental Table 2: Examples of how multi-mappers may affect alignment statistics.
Supplemental Table 3: Sequencing statistics for H3K9ac ChIP in Interphase/Mitotic HeLa-S3 titration experiment.
Supplemental Table 4: Sequencing statistics for H3K9ac ChIP-seq in TSA/DMSO treated Mitotic HeLa-S3 cells.
Supplemental Table 5: GEO Accession numbers for Public Datasets in Supplemental Table 1.
Supplemental Table 1: Metadata and alignment statistics for public datasets analyzed.
Acknowledgments
Research reported in this publication was supported in part by NIH/NIMH grant R01MH127077 (A.G. and C.B.) and NSF grant 2003358 (A.G.). We thank Dr. Itamar Simon for providing conceptual guidance and feedback on the manuscript. We thank R. Wachs for helping with illustrations. We thank the Pillus lab at UCSD for providing S. cerevisiae cells and guidance on yeast culture and ChIP-seq.
Footnotes
Data and Code Availability
Data generated in this study are at GSE273915 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE273915). Public data used to generate Fig. 1a are at GSE60104. UCSC Genome Browser sessions available (H3K9ac titration data, spike-in titration human data, spike-in titration yeast data). Code is on Github (https://github.com/lapatel22/spike_in_correspondence_2024/).
References
- (1).Orlando DA; Chen MW; Brown VE; Solanki S; Choi YJ; Olson ER; Fritz CC; Bradner JE; Guenther MG Quantitative ChIP-Seq Normalization Reveals Global Modulation of the Epigenome. Cell Reports 2014, 9 (3), 1163–1170. 10.1016/j.celrep.2014.10.018. [DOI] [PubMed] [Google Scholar]
- (2).Bonhoure N; Bounova G; Bernasconi D; Praz V; Lammers F; Canella D; Willis IM; Herr W; Hernandez N; Delorenzi M; Hernandez N; Delorenzi M; Deplancke B; Desvergne B; Guex N; Herr W; Naef F; Rougemont J; Schibler U; Andersin T; Cousin P; Gilardi F; Gos P; Lammers F; Raghav S; Villeneuve D; Fabbretti R; Vlegel V; Xenarios I; Migliavacca E; Praz V; David F; Jarosz Y; Kuznetsov D; Liechti R; Martin O; Delafontaine J; Cajan J; Gustafson K; Krier I; Leleu M; Molina N; Naldi A; Rib L; Symul L; Bounova G Quantifying ChIP-Seq Data: A Spiking Method Providing an Internal Reference for Sample-to-Sample Normalization. Genome Res 2014, 24 (7), 1157–1168. 10.1101/gr.168260.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (3).Meers MP; Bryson TD; Henikoff JG; Henikoff S Improved CUT&RUN Chromatin Profiling Tools. eLife 2019, 8, e46314. 10.7554/eLife.46314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (4).Chen K; Hu Z; Xia Z; Zhao D; Li W; Tyler JK The Overlooked Fact: Fundamental Need for Spike-In Control for Virtually All Genome-Wide Analyses. Molecular and Cellular Biology 2016, 36 (5), 662–667. 10.1128/MCB.00970-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (5).Jiang L; Schlesinger F; Davis CA; Zhang Y; Li R; Salit M; Gingeras TR; Oliver B Synthetic Spike-in Standards for RNA-Seq Experiments. Genome Res. 2011, 21 (9), 1543–1551. 10.1101/gr.121095.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (6).Showpnil IA; Selich-Anderson J; Taslim C; Boone MA; Crow JC; Theisen ER; Lessnick SL EWS/FLI Mediated Reprogramming of 3D Chromatin Promotes an Altered Transcriptional State in Ewing Sarcoma. Nucleic Acids Research 2022, 50 (17), 9814–9837. 10.1093/nar/gkac747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (7).Skene PJ; Henikoff S An Efficient Targeted Nuclease Strategy for High-Resolution Mapping of DNA Binding Sites. eLife 2017, 6, e21856. 10.7554/eLife.21856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (8).Wooten M; Takushi B; Ahmad K; Henikoff S Aclarubicin Stimulates RNA Polymerase II Elongation at Closely Spaced Divergent Promoters. Science Advances 2023, 9 (24), eadg3257. 10.1126/sciadv.adg3257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (9).Terekhanova NV; Karpova A; Liang W-W; Strzalkowski A; Chen S; Li Y; Southard-Smith AN; Iglesia MD; Wendl MC; Jayasinghe RG; Liu J; Song Y; Cao S; Houston A; Liu X; Wyczalkowski MA; Lu RJ-H; Caravan W; Shinkle A; Naser Al Deen N; Herndon JM; Mudd J; Ma C; Sarkar H; Sato K; Ibrahim OM; Mo C-K; Chasnoff SE; Porta-Pardo E; Held JM; Pachynski R; Schwarz JK; Gillanders WE; Kim AH; Vij R; DiPersio JF; Puram SV; Chheda MG; Fuh KC; DeNardo DG; Fields RC; Chen F; Raphael BJ; Ding L Epigenetic Regulation during Cancer Transitions across 11 Tumour Types. Nature 2023, 623 (7986), 432–441. 10.1038/s41586-023-06682-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (10).Ma Z; Wang H; Cai Y; Wang H; Niu K; Wu X; Ma H; Yang Y; Tong W; Liu F; Liu Z; Zhang Y; Liu R; Zhu Z-J; Liu N Epigenetic Drift of H3K27me3 in Aging Links Glycolysis to Healthy Longevity in Drosophila. eLife 2018, 7, e35368. 10.7554/eLife.35368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (11).Wang Z; Chivu AG; Choate LA; Rice EJ; Miller DC; Chu T; Chou S-P; Kingsley NB; Petersen JL; Finno CJ; Bellone RR; Antczak DF; Lis JT; Danko CG Prediction of Histone Post-Translational Modification Patterns Based on Nascent Transcription Data. Nat Genet 2022, 54 (3), 295–305. 10.1038/s41588-022-01026-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (12).Grzybowski AT; Chen Z; Ruthenburg AJ Calibrating ChIP-Seq with Nucleosomal Internal Standards to Measure Histone Modification Density Genome Wide. Molecular Cell 2015, 58 (5), 886–899. 10.1016/j.molcel.2015.04.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (13).Yano S; Ishiuchi T; Abe S; Namekawa SH; Huang G; Ogawa Y; Sasaki H Histone H3K36me2 and H3K36me3 Form a Chromatin Platform Essential for DNMT3A-Dependent DNA Methylation in Mouse Oocytes. Nat Commun 2022, 13 (1), 4440. 10.1038/s41467-022-32141-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (14).Guertin MJ; Cullen AE; Markowetz F; Holding AN Parallel Factor ChIP Provides Essential Internal Control for Quantitative Differential ChIP-Seq. Nucleic Acids Research 2018, 46 (12), e75. 10.1093/nar/gky252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (15).Nakato R; Sakata T; Wang J; Nagai LAE; Nagaoka Y; Oba GM; Bando M; Shirahige K Context-Dependent Perturbations in Chromatin Folding and the Transcriptome by Cohesin and Related Factors. Nat Commun 2023, 14 (1), 5647. 10.1038/s41467-023-41316-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (16).Egan B; Yuan C-C; Craske ML; Labhart P; Guler GD; Arnott D; Maile TM; Busby J; Henry C; Kelly TK; Tindell CA; Jhunjhunwala S; Zhao F; Hatton C; Bryant BM; Classon M; Trojer P An Alternative Approach to ChIP-Seq Normalization Enables Detection of Genome-Wide Changes in Histone H3 Lysine 27 Trimethylation upon EZH2 Inhibition. PLOS ONE 2016, 11 (11), e0166438. 10.1371/journal.pone.0166438. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (17).Greulich F; Wierer M; Mechtidou A; Gonzalez-Garcia O; Uhlenhaut NH The Glucocorticoid Receptor Recruits the COMPASS Complex to Regulate Inflammatory Transcription at Macrophage Enhancers. Cell Reports 2021, 34 (6), 108742. 10.1016/j.celrep.2021.108742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (18).Vale-Silva LA; Markowitz TE; Hochwagen A SNP-ChIP: A Versatile and Tag-Free Method to Quantify Changes in Protein Binding across the Genome. BMC Genomics 2019, 20 (1), 54. 10.1186/s12864-018-5368-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (19).Milano CR; Ur SN; Gu Y; Zhang J; Allison R; Brown G; Neale MJ; Tromer EC; Corbett KD; Hochwagen A Chromatin Binding by HORMAD Proteins Regulates Meiotic Recombination Initiation. EMBO J 2024, 43 (5), 836–867. 10.1038/s44318-024-00034-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (20).Okabe A; Huang KK; Matsusaka K; Fukuyo M; Xing M; Ong X; Hoshii T; Usui G; Seki M; Mano Y; Rahmutulla B; Kanda T; Suzuki T; Rha SY; Ushiku T; Fukayama M; Tan P; Kaneda A Cross-Species Chromatin Interactions Drive Transcriptional Rewiring in Epstein–Barr Virus–Positive Gastric Adenocarcinoma. Nat Genet 2020, 52 (9), 919–930. 10.1038/s41588-020-0665-7. [DOI] [PubMed] [Google Scholar]
- (21).Javasky E; Shamir I; Gandhi S; Egri S; Sandler O; Rothbart SB; Kaplan N; Jaffe JD; Goren A; Simon I Study of Mitotic Chromatin Supports a Model of Bookmarking by Histone Modifications and Reveals Nucleosome Deposition Patterns. Genome Res. 2018, 28 (10), 1455–1466. 10.1101/gr.230300.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (22).Laurent M; Cordeddu L; Zahedi Y; Ekwall K LEO1 Is Required for Efficient Entry into Quiescence, Control of H3K9 Methylation and Gene Expression in Human Fibroblasts. Biomolecules 2023, 13 (11), 1662. 10.3390/biom13111662. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (23).Yan X; Liu Z PAX5 Promotes Heterocharomatin Formation. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE165035. [Google Scholar]
- (24).Shirane K; Miura F; Ito T; Lorincz MC NSD1-Deposited H3K36me2 Directs de Novo Methylation in the Mouse Male Germline and Counteracts Polycomb-Associated Silencing. Nat Genet 2020, 52 (10), 1088–1098. 10.1038/s41588-020-0689-z. [DOI] [PubMed] [Google Scholar]
- (25).Bressin A; Jasnovidova O; Arnold M; Altendorfer E; Trajkovski F; Kratz TA; Handzlik JE; Hnisz D; Mayer A High-Sensitive Nascent Transcript Sequencing Reveals BRD4-Specific Control of Widespread Enhancer and Target Gene Transcription. Nat Commun 2023, 14 (1), 4971. 10.1038/s41467-023-40633-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (26).Ramasamy S; Aljahani A; Karpinska MA; Cao TBN; Velychko T; Cruz JN; Lidschreiber M; Oudelaar AM The Mediator Complex Regulates Enhancer-Promoter Interactions. Nat Struct Mol Biol 2023, 30 (7), 991–1000. 10.1038/s41594-023-01027-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (27).Glancy E; Wang C; Tuck E; Healy E; Amato S; Neikes HK; Mariani A; Mucha M; Vermeulen M; Pasini D; Bracken AP PRC2.1- and PRC2.2-Specific Accessory Proteins Drive Recruitment of Different Forms of Canonical PRC1. Mol Cell 2023, 83 (9), 1393–1411.e7. 10.1016/j.molcel.2023.03.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (28).Lambuta RA; Nanni L; Liu Y; Diaz-Miyar J; Iyer A; Tavernari D; Katanayeva N; Ciriello G; Oricchio E Whole-Genome Doubling Drives Oncogenic Loss of Chromatin Segregation. Nature 2023, 615 (7954), 925–933. 10.1038/s41586-023-05794-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (29).Hughes AL; Szczurek AT; Kelley JR; Lastuvkova A; Turberfield AH; Dimitrova E; Blackledge NP; Klose RJ A CpG Island-Encoded Mechanism Protects Genes from Premature Transcription Termination. Nat Commun 2023, 14, 726. 10.1038/s41467-023-36236-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (30).Lund PJ; Gates LA; Leboeuf M; Smith SA; Chau L; Lopes M; Friedman ES; Saiman Y; Kim MS; Shoffler CA; Petucci C; Allis CD; Wu GD; Garcia BA Stable Isotope Tracing in Vivo Reveals a Metabolic Bridge Linking the Microbiota to Host Histone Acetylation. Cell Rep 2022, 41 (11), 111809. 10.1016/j.celrep.2022.111809. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (31).Terrone S; Valat J; Fontrodona N; Giraud G; Claude J-B; Combe E; Lapendry A; Polvèche H; Ameur LB; Duvermy A; Modolo L; Bernard P; Mortreux F; Auboeuf D; Bourgeois CF RNA Helicase-Dependent Gene Looping Impacts Messenger RNA Processing. Nucleic Acids Res 2022, 50 (16), 9226–9246. 10.1093/nar/gkac717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (32).Marasco LE; Dujardin G; Sousa-Luís R; Liu YH; Stigliano JN; Nomakuchi T; Proudfoot NJ; Krainer AR; Kornblihtt AR Counteracting Chromatin Effects of a Splicing-Correcting Antisense Oligonucleotide Improves Its Therapeutic Efficacy in Spinal Muscular Atrophy. Cell 2022, 185 (12), 2057–2070.e15. 10.1016/j.cell.2022.04.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (33).Yasuhara T; Xing Y-H; Bauer NC; Lee L; Dong R; Yadav T; Soberman RJ; Rivera MN; Zou L Condensates Induced by Transcription Inhibition Localize Active Chromatin to Nucleoli. Mol Cell 2022, 82 (15), 2738–2753.e6. 10.1016/j.molcel.2022.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (34).Chu L; Qu Y; An Y; Hou L; Li J; Li W; Fan G; Song B-L; Li E; Zhang L; Qi W Induction of Senescence-Associated Secretory Phenotype Underlies the Therapeutic Efficacy of PRC2 Inhibition in Cancer. Cell Death Dis 2022, 13 (2), 155. 10.1038/s41419-022-04601-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (35).Hsu JY; Danis EP; Nance S; O’Brien JH; Gustafson AL; Wessells VM; Goodspeed AE; Talbot JC; Amacher SL; Jedlicka P; Black JC; Costello JC; Durbin AD; Artinger KB; Ford HL SIX1 Reprograms Myogenic Transcription Factors to Maintain the Rhabdomyosarcoma Undifferentiated State. Cell Rep 2022, 38 (5), 110323. 10.1016/j.celrep.2022.110323. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (36).Li X-Y; Harrison MM; Villalta JE; Kaplan T; Eisen MB Establishment of Regions of Genomic Activity during the Drosophila Maternal to Zygotic Transition. eLife 2014, 3, e03737. 10.7554/eLife.03737. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (37).Zahedi Y; Zeng S; Ekwall K An Essential Role for the Ino80 Chromatin Remodeling Complex in Regulation of Gene Expression during Cellular Quiescence. Chromosome Res 2023, 31 (2), 14. 10.1007/s10577-023-09723-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (38).Poplineau M; Platet N; Mazuel A; Hérault L; Koide S; Kuribayashi W; Carbuccia N; N’Guyen L; Vernerey J; Oshima M; Birnbaum D; Lachaud C; Iwama A; Duprez E Single Cell Analyses Reveal a Non-Canonical EZH2 Activity As a Main Driver of Retinoic Acid Resistance in PLZF/Rara Leukemia. Blood 2021, 138 (Supplement 1), 2205. 10.1182/blood-2021-146777. [DOI] [Google Scholar]
- (39).Cottone L; Ligammari L; Lee H-M; Knowles HJ; Henderson S; Bianco S; Davies C; Strauss S; Amary F; Leite AP; Tirabosco R; Haendler K; Schultze JL; Herrero J; O’Donnell P; Grigoriadis AE; Salomoni P; Flanagan AM Aberrant Paracrine Signalling for Bone Remodelling Underlies the Mutant Histone-Driven Giant Cell Tumour of Bone. Cell Death Differ 2022, 29 (12), 2459–2471. 10.1038/s41418-022-01031-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (40).Mattingly M; Seidel C; Muñoz S; Hao Y; Zhang Y; Wen Z; Florens L; Uhlmann F; Gerton JL Mediator Recruits the Cohesin Loader Scc2 to RNA Pol II-Transcribed Genes and Promotes Sister Chromatid Cohesion. Curr Biol 2022, 32 (13), 2884–2896.e6. 10.1016/j.cub.2022.05.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (41).Dimitrova E; Feldmann A; van der Weide RH; Flach KD; Lastuvkova A; de Wit E; Klose RJ Distinct Roles for CKM–Mediator in Controlling Polycomb-Dependent Chromosomal Interactions and Priming Genes for Induction. Nat Struct Mol Biol 2022, 29 (10), 1000–1010. 10.1038/s41594-022-00840-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (42).Simmler P; Cortijo C; Koch LM; Galliker P; Angori S; Bolck HA; Mueller C; Vukolic A; Mirtschink P; Christinat Y; Davidson NR; Lehmann K-V; Pellegrini G; Pauli C; Lenggenhager D; Guccini I; Ringel T; Hirt C; Marquart KF; Schaefer M; Rätsch G; Peter M; Moch H; Stoffel M; Schwank G SF3B1 Facilitates HIF1-Signaling and Promotes Malignancy in Pancreatic Cancer. Cell Reports 2022, 40 (8). 10.1016/j.celrep.2022.111266. [DOI] [PubMed] [Google Scholar]
- (43).Zhang Q; Gail E; Flanigan S; Healy E; Jones N; Ng X; Uckelmann M; Levina V; Davidovich C Inseparable RNA Binding and Chromatin Modification Activities of a Nucleosome-Interacting Surface in EZH2. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE239445. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (44).Wang R; Xu Q; Wang C; Tian K; Wang H; Ji X Multiomic Analysis of Cohesin Reveals That ZBTB Transcription Factors Contribute to Chromatin Interactions. Nucleic Acids Res 2023, 51 (13), 6784–6805. 10.1093/nar/gkad401. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (45).Carrol J; Ament S; Shetty A; Pearl J; Cantle J ChIP-Seq of Histone Modifications and EZH2 in the Striatum of Four-Month-Old HttQ111/+ and Wildtype Mice. [Google Scholar]
- (46).Bondra ER; Rine J Context-Dependent Function of the Transcriptional Regulator Rap1 in Gene Silencing and Activation in Saccharomyces Cerevisiae. Proceedings of the National Academy of Sciences 2023, 120 (40), e2304343120. 10.1073/pnas.2304343120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (47).Mohan DR; Borges KS; Finco I; LaPensee CR; Rege J; Solon AL; Little DW; Else T; Almeida MQ; Dang D; Haggerty-Skeans J; Apfelbaum AA; Vinco M; Wakamatsu A; Mariani BMP; Amorim LC; Latronico AC; Mendonca BB; Zerbini MCN; Lawlor ER; Ohi R; Auchus RJ; Rainey WE; Marie SKN; Giordano TJ; Venneti S; Fragoso MCBV; Breault DT; Lerario AM; Hammer GD β-Catenin-Driven Differentiation Is a Tissue-Specific Epigenetic Vulnerability in Adrenal Cancer. Cancer Res 2023, 83 (13), 2123–2141. 10.1158/0008-5472.CAN-22-2712. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (48).Bayer M; Boller S; Ramamoothy S; Zolotarev N; Cauchy P; Iwanami N; Mittler G; Boehm T; Grosschedl R Tnpo3 Enables EBF1 Function in Conditions of Antagonistic Notch Signaling. Genes Dev 2022, 36 (15-16), 901–915. 10.1101/gad.349696.122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (49).Li D; Yu X; Kottur J; Gong W; Zhang Z; Storey AJ; Tsai Y-H; Uryu H; Shen Y; Byrum SD; Edmondson RD; Mackintosh SG; Cai L; Liu Z; Aggarwal AK; Tackett AJ; Liu J; Jin J; Wang GG Discovery of a Dual WDR5 and Ikaros PROTAC Degrader as an Anti-Cancer Therapeutic. Oncogene 2022, 41 (24), 3328–3340. 10.1038/s41388-022-02340-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (50).Barral A; Pozo G; Ducrot L; Papadopoulos GL; Sauzet S; Oldfield AJ; Cavalli G; Déjardin J SETDB1/NSD-Dependent H3K9me3/H3K36me3 Dual Heterochromatin Maintains Gene Expression Profiles by Bookmarking Poised Enhancers. Molecular Cell 2022, 82 (4), 816–832.e12. 10.1016/j.molcel.2021.12.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (51).Wang J; Yu X; Gong W; Liu X; Park K-S; Ma A; Tsai Y-H; Shen Y; Onikubo T; Pi W-C; Allison DF; Liu J; Chen W-Y; Cai L; Roeder RG; Jin J; Wang GG EZH2 Noncanonically Binds cMyc and P300 through a Cryptic Transactivation Domain to Mediate Gene Activation and Promote Oncogenesis. Nat Cell Biol 2022, 24 (3), 384–399. 10.1038/s41556-022-00850-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (52).Santana JF; Spector BM; Suarez GA; Luse DS; Price DH NELF Focuses Sites of Initiation and Maintains Promoter Architecture. Nucleic Acids Research 2024, gkad1253. 10.1093/nar/gkad1253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (53).Georgiades E; Harrold CL; Roberts N; Kassouf M; Riva SG; Sanders E; Francis HS; Blayney J; Oudelaar AM; Milne TA; Higgs DR; Hughes J Active Regulatory Elements Recruit Cohesin to Establish Cell-Specific Chromatin Domains. bioRxiv October 17, 2023, p 2023.10.13.562171. 10.1101/2023.10.13.562171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (54).Ramirez J; Ferrari R; Sanz RT; Valverde-Santiago M; Nacht AS; Castillo D; Dily FL; Neguembor MV; Malatesta M; Marti-Renom MA; Beato M; Vicent GP The Hippo Kinase LATS1 Controls CTCF Chromatin Occupancy and the Hormonal Response of Three-Dimensionally Grown Breast Cancer Cells. bioRxiv November 21, 2023, p 2023.11.20.566232. 10.1101/2023.11.20.566232. [DOI] [Google Scholar]
- (55).Ali S; Mann-Nuttel R; Scheu S ChIP-Seq of Wild Type pDC Transcriptomes. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE171868. [Google Scholar]
- (56).Braceros AK; Schertzer MD; Omer A; Trotman JB; Davis ES; Dowen JM; Phanstiel DH; Aiden EL; Calabrese JM Proximity-Dependent Recruitment of Polycomb Repressive Complexes by the lncRNA Airn. Cell Reports 2023, 42 (7), 112803. 10.1016/j.celrep.2023.112803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (57).Wang H; Fan Z; Shliaha PV; Miele M; Hendrickson RC; Jiang X; Helin K H3K4me3 Regulates RNA Polymerase II Promoter-Proximal Pause-Release. Nature 2023, 615 (7951), 339–348. 10.1038/s41586-023-05780-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (58).Esnault C; Magat T; Zine El Aabidine A; Garcia-Oliver E; Cucchiarini A; Bouchouika S; Lleres D; Goerke L; Luo Y; Verga D; Lacroix L; Feil R; Spicuglia S; Mergny J-L; Andrau J-C G4access Identifies G-Quadruplexes and Their Associations with Open Chromatin and Imprinting Control Regions. Nat Genet 2023, 55 (8), 1359–1369. 10.1038/s41588-023-01437-4. [DOI] [PubMed] [Google Scholar]
- (59).Li Y; Huang J; Bao L; Zhu J; Duan W; Zheng H; Wang H; Jiang Y; Liu W; Zhang M; Yu Y; Yi C; Ji X RNA Pol II Preferentially Regulates Ribosomal Protein Expression by Trapping Disassociated Subunits. Molecular Cell 2023, 83 (8), 1280–1297.e11. 10.1016/j.molcel.2023.02.028. [DOI] [PubMed] [Google Scholar]
- (60).Pundhir S; Su J; Tapia M; Hansen AM; Haile JS; Hansen K; Porse BT The Impact of SWI/SNF and NuRD Inactivation on Gene Expression Is Tightly Coupled with Levels of RNA Polymerase II Occupancy at Promoters. Genome Res 2023, 33 (3), 332–345. 10.1101/gr.277089.122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (61).Wille CK; Zhang X; Haws SA; Denu JM; Sridharan R DOT1L Is a Barrier to Histone Acetylation during Reprogramming to Pluripotency. Science Advances 2023, 9 (46), eadf3980. 10.1126/sciadv.adf3980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (62).Vanderkruk B; Maeshima N; Pasula DJ; An M; McDonald CL; Suresh P; Luciani DS; Lynn FC; Hoffman BG Methylation of Histone H3 Lysine 4 Is Required for Maintenance of Beta Cell Function in Adult Mice. Diabetologia 2023, 66 (6), 1097–1115. 10.1007/s00125-023-05896-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (63).Li Y; Huang J; Zhu J; Bao L; Wang H; Jiang Y; Tian K; Wang R; Zheng H; Duan W; Lai W; Yi X; Zhu Y; Guo T; Ji X Targeted Protein Degradation Reveals RNA Pol II Heterogeneity and Functional Diversity. Molecular Cell 2022, 82 (20), 3943–3959.e11. 10.1016/j.molcel.2022.08.023. [DOI] [PubMed] [Google Scholar]
- (64).Vigushin DM; Ali S; Pace PE; Mirsaidi N; Ito K; Adcock I; Coombes RC Trichostatin A Is a Histone Deacetylase Inhibitor with Potent Antitumor Activity against Breast Cancer in Vivo. Clin Cancer Res 2001, 7 (4), 971–976. [PubMed] [Google Scholar]
- (65).Kent WJ; Sugnet CW; Furey TS; Roskin KM; Pringle TH; Zahler AM; Haussler D The Human Genome Browser at UCSC. Genome Res. 2002, 12 (6), 996–1006. 10.1101/gr.229102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (66).Robinson JT; Thorvaldsdóttir H; Winckler W; Guttman M; Lander ES; Getz G; Mesirov JP Integrative Genomics Viewer. Nat Biotechnol 2011, 29 (1), 24–26. 10.1038/nbt.1754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (67).Cherry JM; Adler C; Ball C; Chervitz SA; Dwight SS; Hester ET; Jia Y; Juvik G; Roe T; Schroeder M; Weng S; Botstein D SGD: Saccharomyces Genome Database. Nucleic Acids Research 1998, 26 (1), 73–79. 10.1093/nar/26.1.73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (68).Heinz S; Benner C; Spann N; Bertolino E; Lin YC; Laslo P; Cheng JX; Murre C; Singh H; Glass CK Simple Combinations of Lineage-Determining Transcription Factors Prime Cis-Regulatory Elements Required for Macrophage and B Cell Identities. Mol Cell 2010, 38 (4), 576–589. 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (69).Landt SG; Marinov GK; Kundaje A; Kheradpour P; Pauli F; Batzoglou S; Bernstein BE; Bickel P; Brown JB; Cayting P; Chen Y; DeSalvo G; Epstein C; Fisher-Aylor KI; Euskirchen G; Gerstein M; Gertz J; Hartemink AJ; Hoffman MM; Iyer VR; Jung YL; Karmakar S; Kellis M; Kharchenko PV; Li Q; Liu T; Liu XS; Ma L; Milosavljevic A; Myers RM; Park PJ; Pazin MJ; Perry MD; Raha D; Reddy TE; Rozowsky J; Shoresh N; Sidow A; Slattery M; Stamatoyannopoulos JA; Tolstorukov MY; White KP; Xi S; Farnham PJ; Lieb JD; Wold BJ; Snyder M ChIP-Seq Guidelines and Practices of the ENCODE and modENCODE Consortia. Genome Res 2012, 22 (9), 1813–1831. 10.1101/gr.136184.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (70).Blanco E; Di Croce L; Aranda S SpikChIP: A Novel Computational Methodology to Compare Multiple ChIP-Seq Using Spike-in Chromatin. NAR Genomics and Bioinformatics 2021, 3 (3), lqab064. 10.1093/nargab/lqab064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (71).Love MI; Huber W; Anders S Moderated Estimation of Fold Change and Dispersion for RNA-Seq Data with DESeq2. Genome Biology 2014, 15 (12), 550. 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (72).Sherman F. Getting Started with Yeast. In Methods in Enzymology; Guthrie C, Fink GR, Eds.; Guide to Yeast Genetics and Molecular and Cell Biology - Part B; Academic Press, 2002; Vol. 350, pp 3–41. 10.1016/S0076-6879(02)50954-X. [DOI] [PubMed] [Google Scholar]
- (73).Texari L; Spann NJ; Troutman TD; Sakai M; Seidman JS; Heinz S An Optimized Protocol for Rapid, Sensitive and Robust on-Bead ChIP-Seq from Primary Cells. STAR Protoc 2021, 2 (1), 100358. 10.1016/j.xpro.2021.100358. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplemental Table 2: Examples of how multi-mappers may affect alignment statistics.
Supplemental Table 3: Sequencing statistics for H3K9ac ChIP in Interphase/Mitotic HeLa-S3 titration experiment.
Supplemental Table 4: Sequencing statistics for H3K9ac ChIP-seq in TSA/DMSO treated Mitotic HeLa-S3 cells.
Supplemental Table 5: GEO Accession numbers for Public Datasets in Supplemental Table 1.
Supplemental Table 1: Metadata and alignment statistics for public datasets analyzed.
