Skip to main content
Life Science Alliance logoLink to Life Science Alliance
. 2019 Jul 22;2(4):e201900318. doi: 10.26508/lsa.201900318

TAF-ChIP: an ultra-low input approach for genome-wide chromatin immunoprecipitation assay

Junaid Akhtar 1,, Piyush More 2,*, Steffen Albrecht 5,*, Federico Marini 3,4, Waldemar Kaiser 1, Apurva Kulkarni 1, Leszek Wojnowski 2, Jean-Fred Fontaine 5, Miguel A Andrade-Navarro 5, Marion Silies 1, Christian Berger 1,
PMCID: PMC6653780  PMID: 31331983

The authors present a novel method for obtaining chromatin profiles from low cell numbers without prior nuclei isolation. The method is successfully implemented in generating epigenetic profile from 100 cells with high signal-to-noise ratio.

Abstract

Chromatin immunoprecipitation (ChIP) followed by next generation sequencing (ChIP-Seq) is a powerful technique to study transcriptional regulation. However, the requirement of millions of cells to generate results with high signal-to-noise ratio precludes it in the study of small cell populations. Here, we present a tagmentation-assisted fragmentation ChIP (TAF-ChIP) and sequencing method to generate high-quality histone profiles from low cell numbers. The data obtained from the TAF-ChIP approach are amenable to standard tools for ChIP-Seq analysis, owing to its high signal-to-noise ratio. The epigenetic profiles from TAF-ChIP approach showed high agreement with conventional ChIP-Seq datasets, thereby underlining the utility of this approach.

Introduction

Chromatin immunoprecipitation coupled with next generation sequencing (ChIP-Seq) is a powerful and unbiased approach to study genome-wide DNA–protein interactions and epigenetic modifications (1). However, the prerequisite of huge starting material (millions of cells) limits its utility in studying rare cell types (2). First, sonication, the by far most popular method for fragmentation in ChIP-Seq experiments, can destroy the epitope used for immunoprecipitation, especially when the material is limited (3). The alternative approach of micrococcal nuclease-based digestion (MNase) is hard to control in its efficacy and saturation, and it also shows some degree of sequence-dependent biases (4, 5, 6). Second, the addition of sequencing adaptors for the generation of final libraries involves steps where the limitation of ligation and loss of material during purification steps can result in libraries with low complexity.

Recently, there have been several attempts to adapt ChIP-Seq protocols to address these limitations to apply them to samples with low number of cells (7, 8). One such method, called FARP-ChIP, used nontarget cells for protection during sonication. To prevent the loss of DNA during library preparation, a biotinylated synthetic DNA (biotin-DNA) is used as a carrier DNA. The approach was successfully implemented to obtain the epigenetic profile from samples of 500 mouse embryonic stem cells. However, it required deep sequencing runs (∼100 million reads), and the number of reads mapping to the DNA of the target cell type was low (∼16%), which makes this method less feasible for many applications and also cost-intensive. Some other recent methods used prior ligation of barcoded adaptors to the chromatin digested by MNase, followed by a computational demultiplexing strategy to obtain profiles from samples of low cell numbers (9). The barcoding strategy was shown to dramatically reduce the number of cells required for each profile and can also remove the biases arising from different chromatin preparations. However, the method still initially requires samples of 10,000–100,000 cells as the starting material. Another approach, microscale μChIP-Seq, was used to generate the profile from samples of 500 cells. However, the method is a scaled-down version of the conventional ChIP-Seq approach with samples subdivided at the level of immunoprecipitation (10). The method ChIPmentation uses Tn5 transposon-mediated tagmentation for preparation of libraries as an alternative to the ligation-based library preparation methods (11). This reduces the hands-on time for library preparation and input requirements. However, this approach uses sonication for fragmenting the chromatin before immunoprecipitation. Moreover, this method still uses a large batch preparation of chromatin and uses subsequent splitting of the sample to generate the profile from samples of 10,000 cells. Recently, the CUT&RUN approach was implemented to generate profiles from samples of 100 cells using antibody-targeted micrococcal nuclease (12). The released and captured DNA was used to generate Illumina-compatible libraries.

Here, we describe an alternative approach for ChIP that uses tagmentation-assisted fragmentation of chromatin (TAF-ChIP) with hyperactive Tn5 transposase from Illumina. The method uses limited sonication power only for nuclear lysis and Tn5 activity for chromatin fragmentation. We have used this approach to generate high-quality datasets from as few as 100 human and 1,000 Drosophila cells. This approach has minimal hands-on time and does not involve labor-intensive library preparation workflow. Furthermore, it could be easily implemented to any type of cells. Comparisons of TAF-ChIP results with ENCODE datasets, CUT&RUN, and conventional ChIP-Seq performed in identical cell types demonstrate the utility of this approach. We expect our approach to be especially useful in conditions where the amount of sample is the limiting factor, such as material isolated from animals and clinical samples.

Results

Method overview

There are two challenging steps in generating high-quality ChIP-Seq datasets from samples with a very low number of cells. First, the fragmentation of chromatin without compromising the integrity of the associated proteins; second, the generation of Illumina-compatible sequencing libraries, which requires the purified DNA to undergo multiple manipulation steps, namely, end-repair, ligation of the sequencing adaptors, and PCR amplification. These steps also require bead-based purification of nonamplified DNA, where any potential loss of DNA can severely compromise the completion of successful libraries, especially when the starting amount of DNA is low. Furthermore, the intermediate steps can also be the source of variability.

To overcome these limitations, we used tagmentation as a tool to fragment the DNA. Tn5-mediated tagmentation had been previously used for the addition of sequencing adaptors on immunoprecipitated material, when preparing ChIP libraries and genomic DNA libraries.

Here, we instead used Tn5 activity to fragment the intact chromatin during immunoprecipitation. This approach has two major advantages. First, there is no need to fragment the chromatin before immunoprecipitation. Therefore, this strategy prevents potential loss of DNA–protein interactions during fragmentation, especially when compared with sonication. Furthermore, sonication is extremely variable between different machines, even if they are of the same specifications. Second, our tagmentation reactions use the hyperactive Tn5 transposomes that are preloaded with sequencing adaptors (13). Thus, after proteinase K inactivation, the immunoprecipitated material can be directly PCR-amplified. This results in a one-step DNA library generation, which overcomes the limitation in efficiency of ligation and also avoids intermediate purification steps, thereby preventing loss of material (14). After PCR amplification, the amplified libraries are bead-purified.

Application of the TAF-ChIP approach on sorted Drosophila NSCs and human K562 cells

For TAF-ChIP samples, the cells were directly sorted into immunopreciptation buffer owing to the low FACS sheath fluid volume and directly preceded to nuclear lysis with low energy sonication. The low-energy sonication used here did not result in any visible fragmentation of chromatin. The nonfragmented chromatin was subjected to immunoprecipitation and tagmentation. After tagmentation, enzymes and background regions were washed away with subsequent high-stringency washes. DNA was purified and PCR-amplified to generate Illumina-compatible DNA libraries (see the Materials and Methods section for further details) (Figs 1 and S1A). For conventional ChIP-Seq samples, cells from Drosophila larval brain were sorted, pelleted, and resuspended in lysis buffer, as described earlier (15). Upon immunoprecipitation with specific antibodies, the DNA was extracted and converted into DNA libraries (Fig S1B). For the purpose of this study, we used two different types of starting materials: type II neural stem cells (NSCs) from Drosophila larval brain and human K562 cells, a human immortalized myelogenous leukemia line. We used formaldehyde to fix freshly dissected Drosophila larval brains or harvested K562 cells. The dissected Drosophila larval brains expressed a GFP-tagged deadpan (Dpn) protein under the control of its endogenous enhancer, which is a transcription factor only present in NSCs in the brain. This GFP was used to sort NSCs from wild-type larvae, as described earlier (16).

Figure 1. Schematic overview of TAF-ChIP approach.

Figure 1.

(1) Formaldehyde fixed cells were directly sorted into radio immunoprecipitation (RIPA) buffer (see the Materials and Methods section for details). (2) The cells were briefly sonicated at low intensity to break open the nuclei. (3) Antibodies were coupled to magnetic beads in the presence of blocking reagents. (4) Antibody-coupled beads were added to the cell lysate and incubated overnight at 4°C. (5) The tagmentation reaction was performed after initial washes with low salt IP buffer and homemade tagmentation buffer. (6) The tagmentation reaction and the background regions (not anchored by antibody interaction) were washed away with subsequent high-stringency washes. (7) The proteinase K was heat-inactivated and the material was PCR-amplified without purification.

Figure S1. Comparison of TAF-ChIP with ENCODE, conventional ChIP-Seq, and CUT&RUN.

Figure S1.

(A) Bioanalyzer profile before and after tagmentation with indicated cell numbers. The amount of Tn5 tagmentase was kept constant in all conditions. (B) Bioanalyzer profile of a representative TAF-ChIP library showing the size distribution of final library fragments. (C) Genome browser track example of H3K27Me3 with TAF-ChIP approach and recently published CUT&RUN method with different cell numbers, as indicated in the labels. The label below the tracks shows the gene model, and the y-axis represents normalized read density in reads per million. (D) Clusterogram of TAF-ChIP H3K9Me3 (highlighted with a rectangular box) and indicated datasets derived from the signal in the peak file. H3K4Me3 and H3K36Me3 were included as controls. Note that the TAF-ChIP replicates for H3K9Me3 cluster together with the equivalent datasets from ENCODE. The legend in the left indicates the distance based on Jaccard index. (E) Clusterogram of TAF-ChIP H3K27Me3 datasets (highlighted by a rectangular box) with equivalent datasets from ENCODE, CUT&RUN datasets (from 100, 3,000, and 6,000 cells), and nonrelated histone modification datasets used as control. (F) Clusterogram of TAF-ChIP H3K9Me3 datasets (highlighted by a rectangular box) with conventional ChIP-Seq dataset from Drosophila NSCs.

FACS-sorting of wild-type NSCs is not applicable to obtain the ∼1 million cells necessary to generate a conventional ChIP-Seq dataset, as one Drosophila brain consists of approximately 400 NSCs only. Thus, to compare the TAF-ChIP with the conventional ChIP-Seq protocol, we used the Gal4/UAS binary expression system to express a constitutively active Notch protein (Notchintra) in all type II NSCs (UAS/GAL4 system; wor-Gal4; ase-Gal80 fly line), also expressing UAS-CD8-GFP (17). The expression of constitutively active Notchintra protein results in a massive over-proliferation of cells with the properties of type II NSCs amenable to cell sorting for conventional ChIP-Seq (18).

We sorted type II NSCs from this line with identical settings as above, for TAF-ChIP (1,000 cells) as well as for conventional ChIP-Seq (1.2 million cells). For obtaining 100 K562 cells, we stained the cells with Hoechst dye and used FACS for collecting samples with the precise number of cells. To benchmark our TAF-ChIP datasets from K562 cells, we used publicly available datasets from the ENCODE project (19, 20).

The Tn5 tagmentation is preferably carried out in the open chromatin region because of higher accessibility (which is the basis of the ATAC-Seq approach), and thus, these regions can get over-represented (21). To distinguish from this scenario and to get a better estimate of background signal, we also performed TAF-ChIP experiments with histone H3.

Detailed evaluation of TAF-ChIP

To investigate in detail the performance of TAF-ChIP against both the conventional ChIP-Seq and the recently described CUT&RUN low amount method, we used receiver-operating characteristic (ROC) curves and precision-recall (PRC) curves (7). Towards this goal, we compared the peaks in K562 cells for the TAF-ChIP datasets, conventional ENCODE datasets, and CUT&RUN datasets for 100, 3,000, and 6,000 cells at various false discovery rate (FDR) cutoffs and using the replicated peaks of the conventional ENCODE dataset as reference (12, 19). K652 curves were calculated by mapping peaks to 5 kb non-overlapping genomics windows. Similarly, we also compared peaks for TAF-ChIP and for conventional ChIP-Seq datasets from Drosophila UAS-derived NSCs at various FDR cutoffs and using the first replicate of the conventional ChIP-Seq dataset as reference. NSC curves were calculated by mapping peaks to 1-kb non-overlapping genomic windows. The peaks were always obtained with MACS2 peak calling algorithm using either input (conventional ChIP-Seq) or H3 datasets (TAF-ChIP) as controls.

For human K562 cells, both the ROC curves and the precision-recall curves showed that the 100-cell TAF-ChIP dataset was comparable with the reference ENCODE replicate, as well as to 3,000 and 6,000 cells CUT&RUN datasets, outperforming the 100-cell CUT&RUN dataset (Fig 2A and B). Only ∼500 peaks were called at 5% FDR for the 100-cell CUT&RUN dataset. This could be due to high occurrence of noise in the 100-cell CUT&RUN dataset, which can be observed in the genome browser profile (Fig S1C). The CUT&RUN method on 100 cells was not able to recall more than 75% of the reference even though the peak calling parameters had no restrictions.

Figure 2. Comparison of TAF-ChIP with conventional ChIP-Seq and with the CUT&RUN low amount method.

Figure 2.

(A) ROC curves of TAF-ChIP and CUT&RUN for H3K27Me3 in K562 cells. The ROC curves were plotted using as reference replicated peaks of the conventional ChIP-Seq ENCODE dataset selected at 5% FDR cutoff (downloaded from the ENCODE database). No FDR cutoff was used to define peaks for TAF-ChIP replicates and the CUT&RUN datasets with MACS2. Peaks were mapped to 5 kb non-overlapping genomic windows to calculate true-positive rate or recall, false-positive rate and precision for a changing P-Value threshold. Area under the curve (AUC) is indicated in the legend in decreasing order, and the * indicates the failure to faithfully calculate the AUC. (B) Precision-recall curve for TAF-ChIP and CUT&RUN datasets for H3K27Me3 in K562 cells. (C, D) ROC curves of TAF-ChIP and conventional ChIP-Seq in Drosophila NSCs. The ROC curves for H3K4Me3 (C) and H3K9me3 (D) were plotted using as reference peaks of the first conventional ChIP-Seq replicate selected at 5% FDR cutoff. No FDR cutoff was used to define peaks for TAF-ChIP replicates and the second conventional ChIP-Seq replicate. Peaks were mapped to 1 kb non-overlapping genomic windows to calculate true-positive rate or recall, false-positive rate, and precision. AUC is indicated in the legend in decreasing order. (E, F) Precision-recall curve for TAF-ChIP and conventional ChIP-Seq in Drosophila NSCs. Using same references and data as above, precision-recall curves were plotted for H3K4Me3 (E) and H3k9Me3 (F). (G) Comparison of the genomic window sets for Drosophila brain-derived wt NSCs analyzed for H3K4Me3 binding by TAF-ChIP (TAF_Wt), and Drosophila tumor-derived NSCs analyzed by TAF-ChIP or conventional ChIP-Seq (TAF_Tum and Conv). The TAF-ChIP samples are highlighted by a red rectangular box. The heat map indicates pairwise similarity according to the Jaccard index. Axes show results of hierarchical clustering. (H) The Jaccard index and hierarchical clustering, as described in (G), to compare H3K27Me3 binding in K562 cells. The comparison was performed for 100 cells TAF-ChIP samples (highlighted with a red rectangular box), to CUT&RUN method with 100, 3,000, and 6,000-cell samples, and to conventional ChIP-Seq (ENCODE) (12, 19).

For Drosophila NSCs, the ROC and PRC curves showed that our TAF-ChIP approach has a comparable performance to the inter-replicate results of conventional ChIP-Seq (Fig 2C–F).

We next compared the datasets by hierarchical clustering using a similarity measure based on the Jaccard index calculated on sets of genomic windows from peaks defined at 5% FDR. The conventional H3K4Me3 ChIP-Seq datasets from Drosophila NSCs (tumor derived) clustered together with H3K4Me3 TAF-ChIP datasets from tumor NSCs rather than with wild-type NSCs (Fig 2G). For K562 cells, the H3K27Me3 TAF-ChIP datasets clustered together with the corresponding ENCODE dataset and with CUT&RUN datasets from higher cell numbers (Fig 2H). Consistent with our ROC curve and PRC curve analysis, the 100-cell CUT&RUN dataset showed lower similarity to the rest of the datasets.

We also plotted the hierarchical clustering for H3K9Me3 and H3K27Me3 with other histone ChIP-Seq datasets included as control. The TAF-ChIP datasets always clustered together with their corresponding ENCODE datasets rather than with unrelated histone ChIP-Seq (Fig S1D and E). The TAF-ChIP dataset for H3K9Me3 from Drosophila NSCs (tumor) also clustered together with conventional ChIP-Seq performed in identical NSCs (Fig S1F).

Comparison of 100-cell TAF-ChIP with ENCODE dataset

To further test the applicability of TAF-ChIP, we next used corresponding conventional ChIP-Seq datasets from the ENCODE project for benchmarking.

The H3K27Me3 TAF-ChIP and H3K9Me3 TAF-ChIP from samples of 100 cells showed similar profiles when compared with the corresponding ENCODE datasets, as visualized through genome browser tracks (Fig 3A and B), and also had good agreement between the replicates when Pearson’s correlation coefficient was calculated using average signal in each 2-kb non-overlapping genomic window (Fig S2A and B). The metagene profile for H3K27Me3 and H3K9Me3 showed decrease at the transcription start sites (TSSs) and higher signal on the gene body, similar to the profile obtained with the ENCODE dataset (Fig 3C and D). We used the MACS2 peaks calling algorithm for identifying the peaks in both TAF-ChIP and ENCODE datasets, with identical parameters. The corresponding input samples, fragmented input control for ENCODE and H3 TAF for TAF-ChIP, were used as controls for peak calling. The annotation of peaks identified in the TAF-ChIP dataset and in the corresponding one from ENCODE showed similarity in distribution of overlapping genomic features, for both H3K27Me3 and H3K9Me3 datasets (Fig 3E). The overlap between the peaks called for ENCODE and TAF-ChIP was 50% and 56% for H3K27Me3 and H3K9Me3, respectively (Fig 3F). Next, we divided the peaks into 10 different quantiles according to FDR, with quantile 1 associated with the lowest FDR peaks and quantile 10 associated with the highest. The FDR quantile recovery analysis for H3K27Me3 and H3K9Me3 peaks compared with replicated peaks of ENCODE was higher for lower FDR quantiles, at around 60 and 70%, respectively (Fig S2C and D). The fraction of reads in peaks called with TAF-ChIP had also similar distribution profile when compared with the ENCODE ChIP-Seq. However, the level of this enrichment was smaller for TAF-ChIP (Fig 3G). Nonetheless, the heat maps generated for all the peaks identified in ENCODE ChIP-Seq datasets and sorted according to the intensity in the ENCODE ChIP-Seq, showed profiles similar and comparable with TAF-ChIP datasets from 100 K562 cells (Fig 3H).

Figure 3. TAF-ChIP results from 100 K562 cells are comparable with conventional Encode ChIP-Seq datasets.

Figure 3.

(A, B) Genome browser track example of H3K27Me3 and H3K9Me3 (A and B, respectively) ChIP performed in 100 FACS sorted K562 cells with TAF-ChIP approach and corresponding K562 conventional ChIP-Seq datasets from the ENCODE project in duplicates, as indicated in the labels. The label below the tracks shows the gene model and the y-axis represents normalized read density in reads per million. The enriched regions are highlighted with shaded box. (C, D) Metagene profiles of H3K27Me3 and H3K9Me3 (C and D, respectively) with standard error to the mean for all the genes, −1,000 bp upstream of TSS and +1,000 bp downstream of transcription end sites (TES). Read counts per million of mapped reads is shown on the y-axis, whereas the x-axis depicts genomic coordinates. (E) Genomic distribution of annotated peaks obtained from the ENCODE datasets and TAF-ChIP (100 K562 cells), for indicated histone marks. Note the majority of H3K27Me3 and H3K9Me3 peaks are at the intergenic regions, consistent with the expectation. (F) Overlap between the peaks identified from the ENCODE and TAF-ChIP datasets, for the indicated histone modifications (see the Materials and Methods section for further details). (G) Average profile of TAF-ChIP and corresponding ENCODE ChIP-Seq centered at the peaks for the indicated histone modifications. The y-axis depicts average per base value into the peaks, whereas x-axis depicts genomic coordinates centered at the peaks. (H) Distributions of reads at gene locations of indicated histone modifications from ENCODE ChIP-Seq and TAF-ChIP method, centered at the peaks (−1 kb to +1 kb). Rows indicate all the peaks and are sorted by decreasing affinities in the ENCODE ChIP-Seq datasets. The color labels to the right indicate the level of enrichment.

Figure S2. TAF-ChIP libraries evaluated in terms of replicate variability and peaks recovery, from 100 K562 cells.

Figure S2.

(A, B) Pearson correlation between the indicated replicates of TAF-ChIP samples from 100 K562 cells, across equal sized bins of 2000 bp. (C, D) Percentage recovery of peaks in TAF-ChIP dataset compared with a corresponding ENCODE ChIP-Seq dataset. The quantiles were sorted according to increasing FDR (provided by MACS2). Recovery of peaks of one of the replicates of the ENCODE ChIP-Seq dataset when the other is used as reference is shown for comparison. For H3K27Me3 (C) and H3K9Me3 (D), histone modifications in K562 cells.

TAF-ChIP performed on Drosophila NSCs shows high agreement with conventional ChIP-Seq

To compare TAF-ChIP with conventional ChIP-Seq, we analyzed both H3K4Me3 and H3K9Me3 histone marks, from identical cell types, as described above. The TAF-ChIP generated datasets showed similar signal-to-noise ratio when compared with corresponding conventional ChIP-Seq datasets, as visualized through genome browser tracks (Fig 4A and B). The TAF-ChIP data also showed high degree of mappability and low level of sequence duplication. The uniquely mapped reads for H3K4Me3 samples were at ∼80%. The unique mapping rate for H3K9Me3 was lower at ∼60%, yet this can be expected because of prevalence of this mark at repeat elements and transposons (Fig S3A). The replicates also showed good concordance between themselves when Pearson’s correlation coefficient was calculated using average signal in each 500-bp non-overlapping genomic windows (Fig S3B and C). The metagene profile for H3K4Me3 normalized to H3 and IgG control showed higher signal at the TSSs, consistent with the higher enrichment of this mark at the promoters (Fig 4C). On the other hand, the metagene profile for H3K9Me3 showed higher enrichment over gene body (data not shown). Furthermore, the qPCR analysis of TAF-ChIP and conventional ChIP showed comparable enrichment for enriched loci and similar level of background at nonenriched locus (Fig S3D). Next, we used the MACS2 software (21) to identify peaks in the TAF-ChIP and in conventional ChIP-Seq datasets. The fragmented input control and H3 TAF-ChIP datasets were used as input control for conventional ChIP-Seq and TAF-ChIP datasets for peak calling, respectively. The deposition of H3K9Me3 was mostly on intergenic regions; therefore, we used peak coordinates to generate the normalized metagene profile (Fig 4D). The annotation of peaks obtained from TAF-ChIP and ChIP-Seq showed a higher degree of similarity for the H3K4Me3 mark than for the H3K9Me3 mark, the latter displaying more overlap to promoters and less to intergenic regions in conventional ChIP-Seq (Fig 4E). Nevertheless, consistent with the expectation, the large fraction of H3K4Me3 peaks was at the promoters, whereas the most H3K9Me3 peaks were at the distal intergenic regions. Next, we calculated the overlap between the peaks called for conventional ChIP-Seq and TAF-ChIP datasets using the ChIPpeakAnno package from Bioconductor (22). The peaks called for H3K4Me3 showed 85% overlap between the conventional and TAF-ChIP approaches at 5% FDR. The peaks called at 5% FDR for H3K9Me3 had 68% of overlap between the conventional and TAF-ChIP approach (Fig 4F).

Figure 4. TAF-ChIP results from low number of NSCs are comparable with conventional ChIP-Seq (Conv-ChIP).

Figure 4.

(A, B) Genome browser track example of H3K4Me3 and H3K9Me3 ChIP (panel A and B, respectively) performed in FACS-sorted NSCs with conventional ChIP-Seq (1.2 million cells) and TAF-ChIP (1,000 cells), as indicated by the labels. The label below the tracks shows the gene model and the y-axis represents normalized read density in reads per million (rpm). (C) Metagene profiles of H3K4Me3 with standard error to the mean for all genes, −1,000 bp upstream of TSS and +1,000 bp downstream of transcription end sites (TESs). Log2-fold changes against input controls are shown on the y-axis, whereas the x-axis depicts genomic coordinates. (D) Metagene profiles of H3K9Me3 with standard error to the mean for enriched regions, −1,000 bp upstream and +1,000 bp downstream of peaks. Log2-fold changes against input control are shown on the y-axis, whereas the x-axis depicts genomic coordinates. (E) Distribution of annotated peaks obtained from conventional ChIP-Seq and TAF-ChIP, for indicated histone marks. Note that most H3K4Me3 and H3K9Me3 peaks are at the promoters and at the intergenic regions, respectively, consistent with the expectation. (F) Overlap between the peaks identified from conventional ChIP-Seq and TAF-ChIP datasets, for the indicated histone modifications. MACS2 software with identical parameters (see the Materials and Methods section for details) was used to identify the peaks against the respective input controls, and those present in both replicates were considered for the comparison. (G) Average profile of TAF-ChIP and conventional ChIP-Seq centered at the peaks for the indicated histone modifications. The y-axis depicts average per base value into the peaks, whereas the x-axis depicts genomic coordinates centered at the peaks. (H) Distributions of reads at gene locations of indicated histone modifications from conventional ChIP-Seq and TAF-ChIP. Rows indicate all the peaks and are sorted by decreasing affinities in the conventional ChIP-Seq datasets. The color labels to the right indicate the level of enrichment.

Figure S3. TAF-ChIP libraries evaluated in terms of replicate variability and peaks recovery, from 1,000 Drosophila NSCs.

Figure S3.

(A) Table showing the percentage of mapped reads from TAF-ChIP experiments. The read duplicates were quantified using Picard Tools. (B, C) Pearson correlation between the indicated replicates of TAF-ChIP samples from Drosophila 1,000 cells, across equal-sized bins of 500 bp. (D) ChIP-qPCR analysis of enrichment at the tested loci in TAF-ChIP and conventional approach against respective input controls, fragmented DNA in conventional and H3 for TAF. The tested loci Mapk, Dab, Rca1, and Dock showed enrichment in conventional and TAF-ChIP-Seq results, whereas Mapk distal did not show any enrichment in either approach. (E, F) Percentage recovery of peaks in TAF-ChIP datasets compared with conventional ChIP-Seq datasets. The quantiles were sorted according to increasing FDR (provided by MACS2). Recovery of peaks of one of the replicates of conventional ChIP-Seq using the other as reference is shown for comparison. For H3K4Me3 (E) and H3K9Me3 (F), histone modifications in Drosophila NSCs. (G, H) Fraction of peaks recovered for H3K4Me3 (G) and H3K9Me3 (H) TAF-ChIP samples from Drosophila, at various sequencing depths.

Next, we performed the peak recovery in different FDR quantiles, as explained before for K562 datasets. Using one H3K4me3 conventional ChIP-Seq replicate as reference, TAF-ChIP recalled ∼99% of the peaks until quantile 6, and was comparable with the other replicate of the conventional ChIP-Seq (Fig S3E). The relationship between recall and FDR was very weak for H3K9Me3; however, it was still similar to conventional ChIP-Seq (Fig S3F). The read distribution at the peaks still showed enrichment for TAF-ChIP, albeit to a lower level when compared with conventional ChIP-Seq datasets (Fig 4G). The analysis for saturation of peak recall showed higher recall of peaks for H3K4Me3 at shallow sequencing depth, whereas for H3K9Me3, the number of recalled peaks continued to increase with increasing sequencing depths (Fig S3G and H). This was consistent with the observed tendencies for point-source histone modifications (such as H3K4Me3) and histone modifications with broad domains of enrichments (such as H3K9Me3) (20). The distributions of reads at genomic locations generated for TAF-ChIP and conventional ChIP-Seq datasets and sorted according to the intensity in the conventional ChIP-Seq resulted in similar and comparable profiles (Fig 4H).

TAF-ChIP gave consistent results with variable numbers of cells used as starting material

After establishing TAF-ChIP using low number of cells and its subsequent benchmarking against conventional ChIP-Seq performed in identical cells, we next assayed whether TAF-ChIP can give comparable results with similar resolution, when variable numbers of cells are used as the starting material. Towards this goal, we resorted to use wild-type NSCs from Drosophila brains. We sorted two samples containing 1,000 and 5,000 NSCs, respectively. The TAF-ChIP generated datasets from 1,000 and 5,000 NSCs resulting in nearly identical profiles, as visualized through genome browser tracks (Fig S4A). The distributions of reads at genomic locations generated from 1,000 NSCs and 5,000 NSCs also showed comparable profiles (Fig S4B). The read distribution in the peaks for samples with 1,000 NSCs and 5,000 NSCs were also comparable with each other (Fig S4C). Altogether, these results suggest that TAF-ChIP is amenable to conditions when starting material is variable to few folds and would produce similar results.

Figure S4. TAF-ChIP results from different amounts of starting material are identical and H3 TAF-ChIP do not show any visible biases for open chromatin.

Figure S4.

(A) Genome browser track example of H3K4Me3 ChIP performed for two replicates in 1,000 and 5,000 FACS-sorted Drosophila NSCs along with H3 controls, as indicated in the labels. The label below the tracks shows the gene model, and the y-axis represents normalized read density in reads per million (rpm). (B) Heat maps of TAF-ChIP datasets for H3K4Me3 from 1,000 and 5,000 sorted Drosophila NSCs. Rows indicate all the peaks and are sorted by decreasing affinities in the conventional ChIP-Seq datasets. The color labels to the right indicate the level of enrichment. (C) Average profile of TAF-ChIP data from 1,000 and 5,000 NSCs, centered at the peaks for the H3K4Me3 modification. The y-axis depicts average per base peak signal, whereas the x-axis depicts genomic coordinates centered at the peaks. (D) Metagene profiles of H3 and H3K4Me3 from 1,000 Drosophila NSCs with standard error to the mean for all genes, −1,000 bp upstream of transcription start sites and +1,000 bp downstream of transcription end sites (TESs). Read counts per million of mapped reads is shown on the y-axis, whereas the x-axis depicts genomic coordinates. (E) Metagene profiles of indicated datasets with standard error to the mean for all genes in K562 cells, −1,000 bp upstream of TSSs and +1,000 bp downstream of TESs. Read counts per million of mapped reads is shown on the y-axis, whereas the x-axis depicts genomic coordinates. The ATAC-Seq and DNAse-Seq datasets are derived from PMID: 26280331.

Discussion

Here, we present an easy, TAF-ChIP and sequencing method to generate high-quality datasets from samples with low cell numbers. The workflow of TAF-ChIP contains fewer steps than conventional ChIP-Seq with minimum hands-on time during library preparation, preventing loss of material and potential user introduced variability. Because of tagmentation during immunoprecipitation, the cells can be directly sorted into the IP/lysis buffer. This eliminates the centrifugation step to collect the cells, which can also lead to potential loss of material. Also, unlike ATAC-Seq where intact nuclei are tagmented and partial tagmentation is used to study chromatin accessibility, our approach tagments after nuclear lysis (21). The metagene profile of H3 TAF-ChIP dataset did not show any enrichment for TSS, suggesting our method resulted in tagmentation without any visible biases for open chromatin regions (Fig S4D and E). Furthermore, we also showed the application of TAF-ChIP for both open chromatin marks such as H3K4Me3 as well as for repressive marks such as H3K9Me3 and H3K27Me3. TAF-ChIP is easier to implement than MNase-based approaches that leads to overdigestion of chromatin, and results in one-step generation of Illumina-compatible libraries. The TAF-ChIP approach is suitable for assaying factors for which the chromatin association might be dependent on RNA, as Tn5 does not perturb the RNA intermediate. Also, the tagmentation does not show any sequence-dependent biases, in contrast to other restriction-based protocols (6, 13). Furthermore, the approach does not need any specialized equipment and, thus, can be implemented in a standard molecular biology laboratory.

We have used here the Tn5 transposase from Nextera XT DNA library kit; however, TAF-ChIP could be easily implemented with Tn5 loaded with different unique molecular indices (13). This could be easily implemented in massively parallel TAF-ChIP-Seq applications and may even further decrease the required starting material. Moreover, as this approach can be used for various cell types, it could be also combined with a nontarget cell type used as “spike-in” and DNA carrier.

We showed that TAF-ChIP datasets have signal-to-noise ratios that are comparable with conventional ChIP-Seq datasets and, thus, are amenable to standard bioinformatics pipelines for ChIP-Seq analysis. Our evaluation of TAF-ChIP datasets showed results comparable with conventional ChIP-Seq and better than CUT&RUN, a comparable low amount method. For histone marks, we demonstrated the use of H3 TAF-ChIP as an input control for better background estimation. However, in some conditions and perturbation experiments, the distribution of H3 might have weak to strong biases. An alternative control for TAF-ChIP could be also immunoprecipitation with IgG from similar species (Fig 4C). Although the genome browser profiles obtained from a sample of 100 K562 cells showed slightly inferior signal-to-noise ratio compared with the conventional datasets from the ENCODE project (Fig 2A), the peaks identified were mostly overlapping, especially at lower FDRs. The peaks that were unique to either TAF-ChIP or to the conventional method still showed higher read coverage compared with randomly selected regions of comparable size (Fig S5A–D). This suggests that thresholding (based on FDR) implemented by the peak caller software might have hindered their identification in one of the datasets. Furthermore, we suspect that the signal-to-noise ratio can be improved by pooling the samples tagmented with different indices before washes, and using the demultiplexing strategy to obtain the data.

Figure S5. Coverage of reads over the overlapping and unique peaks identified in the TAF-ChIP and conventional datasets.

Figure S5.

(A–D) Boxplots showing coverage of raw reads over the peaks identified either uniquely in TAF (TAF Unique) or conventional ChIP-Seq approach (ENCODE Unique or Conv. Unique), or common to both of the approaches (TAF ENCODE or TAF Conv.). Shown also is the coverage of reads over randomly selected regions of comparable size (Random). From K562 cells for H3K27Me3 (A) and H3K9Me3 (B), and from Drosophila NSCs for H3K4Me3 (C) and H3K9Me3 (D).

We have also been able to generate the profile of a RNA-modifying enzyme, recently shown in vertebrates to associate with chromatin, by performing TAF-ChIP using 1,000 cells from transgenically tagged Drosophila and antibody directed against the tag (unpublished results). We conclude that the only limiting factors defining the low cell number sample providing biologically meaningful TAF-ChIP results are the availability of a good antibody and a reasonable number of binding sites in the genome. We have shown that TAF-ChIP provides reliable datasets from samples of as low as 100 isolated cells without requiring prior isolation of nuclei and with an extremely easy and straightforward workflow; therefore, we expect that TAF-ChIP will be very useful when access to higher numbers of cells is limited.

Materials and Methods

Antibodies

The following antibodies were used in this study. For H3K4Me3 ChIP, antibody from Abcam (Cat. no. ab8580) was used. H3K9Me3 ChIP was performed using an antibody from Active Motif (Cat. no. 39161), H3K27Me3 with Active Motif (Cat. no. 39155), and H3 (ab1791; Abcam).

Fixation and cell sorting from Drosophila larval brain

Briefly, required number of larval brains after 48 h of larval hatching were dissected in PBS. After dissection, larval brains were fixed with 1% formaldehyde in PBS for 10 min at room temperature, followed by quenching of the fix with 125 mM glycine. The larval brains were dissociated and resuspended according to the previously established method (16). The cells were then sorted on BD FACSAria according to the strength of GFP and size of the NSCs, resulting in a pure population of type II NSCs.

Fixation and cell sorting of K562 cells

K562 cells cultured in RPMI medium (supplemented with 10% Fetal Bovine Serum), at 37°C and 5% CO2, were fixed for 10 min at room temperature with 1% formaldehyde. The crosslink was quenched with 125 mM glycine, and sorted on BD FACSAria cell sorter using Hoechst stain. A total of 100 K562 cells were directly sorted in RIPA 140 mM (10 mM Tris-Cl, pH 8.0, 140 mM NaCl, 0.5 mM EDTA, pH 8.0, 1% Triton X-100, and 0.1% SDS).

Conventional ChIP-Seq and library preparation

Fixed cells (1.2 million FACS sorted NSCs per replicate) were resuspended in 140 mM RIPA (10 mM Tris-Cl pH 8.0, 140 mM NaCl, 0.1 mM EDTA pH 8.0, 1% Triton X-100, and 0.1% SDS) and subjected to 14 cycles of sonication on a bioruptor (Diagnode), with 30 secs “ON”/“OFF” at high settings. After sonication, the samples were centrifuged at 14,000 g for 10 min at 4°C and the supernatant was transferred to a fresh tube. The extracts were incubated overnight with 2 μg of specific antibody at 4°C with head-over-tail rotations. After overnight incubations, 20 μl of blocked Protein A and G Dynabeads were added to the tubes and further incubated for 3 h to capture the antibodies. The beads were separated with a magnetic rack and were washed as following: once with 140 mM RIPA (10 mM Tris-Cl, pH 8.0, 140 mM NaCl, 0.1 mM EDTA, pH 8.0, 1% Triton X-100, and 0.1% SDS), four times with 250 mM RIPA (10 mM Tris-Cl, pH 8.0, 250 mM NaCl, 0.1 mM EDTA, pH 8.0, 1% Triton X-100, and 0.1% SDS) and twice with TE buffer, pH 8.0 (10 mM Tris-Cl, pH 8.0 and 0.1 mM EDTA pH 8.0). After the immunoprecipitation, samples were RNase-treated (NEB) and subjected to Proteinase K treatment for reversal of cross-links, 12 h at 37°C and at least 6 h at 65°C. The samples after proteinase K treatment were subjected to phenol–chloroform extraction. After precipitating and pelleting, the DNA was dissolved in 30 μl of TE buffer, pH 8.0. The recovered DNA was converted into libraries using NebNext Ultra II DNA library preparation kit, following the manufacturer’s protocol.

TAF-ChIP and library preparation

Cells were fixed for 10 min at room temperature with 1% formaldehyde in PBS. The formaldehyde was quenched with 125 mM glycine, for 5 min at room temperature. The cells were washed once with ice-cold PBS and directly sorted in 240 μl of 140 mM RIPA (10 mM Tris-Cl, pH 8.0, 140 mM NaCl, 0.1 mM EDTA, pH 8.0, 1% Triton X-100, and 0.1% SDS) and sonicated with three cycles at low power settings for breaking the nuclei. Sorting small number of cells (100 cells normally elute in <2 μl) will have no effect on buffer composition owing to small volume. In the meantime, 15 μl of Protein A and G Dynabeads were coupled to 1 μg of specific antibody in the blocking buffer (RIPA 140 mM supplemented with 0.2 mg/ml BSA, 0.05 mg/ml of glycogen, and 0.2 mg/ml of yeast tRNA) for 2-3 h at 4°C. The partially fragmented chromatin was centrifuged at 2,000 g for 10 min at 4°C, and the supernatant was transferred to the tube with blocked and antibody-coupled beads. The centrifugation step is optional and the samples can be also directly added to the coupled beads. The samples were incubated at 4°C overnight with head-over-tail rotations. The samples were then washed twice briefly with 300 μl of homemade tagmentation buffer (20 mM Tris(hydroxymethyl) aminomethane, pH 7.6; 10 mM MgCl2; and 20% [vol/vol] dimethylformamide) using magnetic rack for beads separation. The washed beads were resupended in 20 μl of 1× tagmentation DNA buffer (Nextera XT Kit) containing 1 μl of Nextera DNA tagmentation enzyme and incubated at 37°C for 40 min with constant shaking in a thermoblock at 500 rpm (Eppendorf thermomixer compact). After the tagmentation, the beads were washed as following: once with 140 mM RIPA (10 mM Tris-Cl, pH 8.0, 140 mM NaCl, 0.1 mM EDTA, pH 8.0, 1% Triton X-100, and 0.1% SDS), four times with 250 mM RIPA (10 mM Tris-Cl, pH 8.0, 250 mM NaCl, 0.1 mM EDTA, pH 8.0, 1% Triton X-100, and 0.1% SDS), and twice with TE buffer, pH 8.0 (10 mM Tris-Cl, pH 8.0 and 0.1 mM EDTA, pH 8.0). The samples were subjected to proteinase K treatment in 50 μl of TE buffer, pH 8.0, with 5 μl of 20 mg/ml of proteinase K for overnight at 60°C in thermoblock with shaking at 500 rpm (Eppendorf thermomixer compact). The samples can be then either phenol–chloroform extracted (A) or can be directly PCR-amplified after deactivating proteinase K (B). Phenol–chloroform extraction (A): The volume of the aqueous phase was brought to 100–200 μl by adding TE buffer, and 300 μl of phenol:chloroform (pH 7.7–8.3) was added to the tube. After vortexing, the tubes were spun at 20,000 g for 5 min at room temperature, and the aqueous phase was transferred to a fresh tube. The DNA was precipitated by adding 1/10th volume of 3 M sodium acetate (pH 5.2), 5 μl of glycogen (20 mg/ml), and 700 μl of 100% ethanol. The precipitation mix was incubated overnight at −80°C followed by centrifugation at 18,800 g for 30 min at 4°C. The pellet was washed once with 75% ethanol and resuspended in 30 μl of TE buffer. The DNA was amplified in 100 μl reaction with 1X NEBNext High-Fidelity PCR Mix with 0.4 μM of primers containing molecular indices (listed in Table 1) with the following program: 72°C for 3 min (98°C for 10 s, 63°C for 30 s, and 72°C for 30 s) for 12 cycles, 72°C for 5 min, and hold at 4°C. Deactivating proteinase K (B): the proteinase K was heat-inactivated for 95°C for 5 min and directly amplified in a 100-μl reaction with 1× NEBNext High-Fidelity PCR Mix with 0.4 μM primers containing molecular indices (listed in Table 1) with the following program: 72°C for 3 min {98°C for 10 s, 63°C for 30 s, and 72°C for 30 s} for 12 cycles, 72°C for 5 min, and hold at 4°C.

Table 1.

Primers used in qPCR and TAF-ChIP library preparation.

Mapk distal Fwd ATCGGGACCTTAAGCCAAGT
Mapk distal Rev AAACGCTTTTACTGCTGATGG
Dock Fwd GCTCCGGCAAAATCATTAAA
Dock Rev CGCGATTGAAAAACACACAA
Dab Fwd CCCCACAACGCCTTAAAGTA
Dab Rev TTTGCGTCTTCCGTCTCTTT
Rca1 Fwd GGTCACACTGATCCGTACCC
Rca1 Rev CTCCAACTCGAAGGATGACC
Primer-Neg Fwd CCATTAATCGAGGGCTGAAA
Primer-Neg Rev TTGGGGCATAAACAGAGGAC
fw ATAC-seq primer, general, no index AATGATACGGCGACCACCGAGATCTACACTCGTCGGCAGCGTCAGATGT*G
rev ATAC-seq primer, Truseq, index 34 CATGGC CAAGCAGAAGACGGCATACGAGATGCCATGGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 48 TCGGCA CAAGCAGAAGACGGCATACGAGATTGCCGAGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 22 CGTACG CAAGCAGAAGACGGCATACGAGATCGTACGGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 15 ATGTCA CAAGCAGAAGACGGCATACGAGATTGACATGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 46 TCCCGA CAAGCAGAAGACGGCATACGAGATTCGGGAGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 45 TCATTC CAAGCAGAAGACGGCATACGAGATGAATGAGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 40 CTCAGA CAAGCAGAAGACGGCATACGAGATTCTGAGGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 39 CTATAC CAAGCAGAAGACGGCATACGAGATGTATAGGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 38 CTAGCT CAAGCAGAAGACGGCATACGAGATAGCTAGGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 37 CGGAAT CAAGCAGAAGACGGCATACGAGATATTCCGGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 36 CCAACA CAAGCAGAAGACGGCATACGAGATTGTTGGGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 35 CATTTT CAAGCAGAAGACGGCATACGAGATAAAATGGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 25 ACTGAT CAAGCAGAAGACGGCATACGAGATATCAGTGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 26 ATGAGC CAAGCAGAAGACGGCATACGAGATGCTCATGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 27 ATTCCT CAAGCAGAAGACGGCATACGAGATAGGAATGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 28 CAAAAG CAAGCAGAAGACGGCATACGAGATCTTTTGGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 23 GAGTGG CAAGCAGAAGACGGCATACGAGATCCACTCGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 24 GGTAGC CAAGCAGAAGACGGCATACGAGATGCTACCGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 42 TAATCG CAAGCAGAAGACGGCATACGAGATCGATTAGTCTCGTGGGCTCGGAGATG*T
rev ATAC-seq primer, Truseq index, 41 GACGAC CAAGCAGAAGACGGCATACGAGATGTCGTCGTCTCGTGGGCTCGGAGATG*T

The PCR reaction was purified with bead-based size selection to remove fragments larger than 1,000 bp. Ampure Xp beads were added to the PCR reaction in a ratio of 0.2× ratio to bind larger fragments. The magnetic beads were separated with the help of magnetic rack and the supernatant was transferred to a fresh tube. Ampure Xp beads were added to the PCR reaction in a ratio of 0.8× to bind the target library. After PCR purification, the libraries were analyzed on Agilent Bioanalyzer for size distribution and the concentration was measured using a Qubit fluorometer. The finished libraries were pooled in equimolar amounts and sequenced on Illumina NextSeq 500. The step-by-step protocol is also provided as Supplemental Data S1 (22.3KB, docx) .

Supplemental Data S1.

LSA-2019-00318_Supplemental_Data_1.docx (22.3KB, docx)

TAF-ChIP and conventional ChIP-qPCR

2 μl of TAF as well as conventional ChIP library was used for checking enrichment with various primer pairs (listed in Table 1) on Applied Biosystem ViiA 7 real-time machine using SYBR green reagent (Cat. No. 4367659; Life Technologies).

Demultiplexing and mapping

Demultiplexing and fastq file conversion were performed using blc2fastq (v.1.8.4). Reads from ChIP-Seq libraries were mapped using bowtie2 (v. 2.2.8) (23) and filtered for uniquely mapped reads. The genome build and annotation used for all Drosophila samples was BDGP6 (ENSEMBL release 84). The genome build and annotation used for the K562 samples was hg38 (ENSEMBL release 84).

Normalization, peak calling, and overlaps

The mapped BAM files were normalized to RPKMs using deepTools, and bigwig coverage files were generated. Peak calling was performed using MACS2 (v 2.1.1-20160309) (24). The peaks were called with the following settings: (i) for Drosophila H3K4Me3, macs2 callpeak -t ChIP.bam -c Control.bam -f BAMPE -g dm –q 0.05; (ii) for Drosophila H3K9Me3, macs2 callpeak -t ChIP.bam -c Control.bam -f BAMPE -g dm --broad --broad-cutoff 0.05; and (iii) for K562 H3K9Me3 and H3K27Me3, macs2 callpeak -t ChIP.bam -c Control.bam -f BAMPE -g hs --broad --broad-cutoff 0.05. The resulting peaks were annotated with the ChIPseeker package from Bioconductor, using nearest gene to peak summit as assignment criteria (25). The individual peaks of corresponding modification and approach were merged using bedTools, except ENCODE peak files (26). For ENCODE, owing to high variability between the replicates, we used replicated peaks provided by ENCODE database. After merging, the overlaps were calculated with ChIPpeakAnno package with following commands: (i) for H3K4Me3 in Drosophila NSCs, overlap = findOverlapsOfPeaks(ConvK4Me3, TAFK4Me3, maxgap = 100); (ii) for H3K9Me3 in Drosophila, overlap = findOverlapsOfPeaks(ConvK9Me3, TAFK9Me3, maxgap = 200); (iii) for H3K27Me3 in K562, overlap = findOverlapsOfPeaks(ENCODEK27Me3, TAFK27Me3, maxgap = 4000); and (iv) for H3K9Me3 in K562; overlap = findOverlapsOfPeaks(ENCODEK9Me3, TAFK9Me3, maxgap = 4000).

Computational scripts

All the parameters used for computational analysis and detailed scripts are provided in a separate Supplemental Data S2 (16.2KB, txt) . The heat maps were generated using deepTools (v 3.5.1) (26).

Supplemental Data S2.

LSA-2019-00318_Supplemental_Data_2.txt (16.2KB, txt)

Accession number

All the ChIP-Seq data generated in this study are submitted to the GEO database (GSE112633).

Supplementary Material

Reviewer comments

Acknowledgements

We thank the Bloomington Drosophila Stock Center for the fly lines used in this study. The EMBL Genomics Core Facility (Heidelberg, Germany), especially Vladimir Benes for all the sequencing runs. The IMB FACS facility for the help with sorting of the neuroblast and K562 cells. We thank Jean-Yves Roignant, Guillaume Junion, Joachim Urban, and Prasad Chitke for critically reading the manuscript. This work was supported by the funding Deutsche Forschungsgemeinschaft (DFG) BE 4728 1-1 and 3-1 to C Berger, and grant SI 1991/1-1 to M Silies. The open access funding for publication was provided by University of Mainz. P More, S Albrecht, and A Kulkarni thank the International PhD Programme of the Institute of Molecular Biology, Mainz, for supporting the PhD. The work of F Marini is supported by the German Federal Ministry of Education and Research (BMBF 01EO1003).

Author Contributions

  • J Akhtar: conceptualization, formal analysis, investigation, methodology, project administration, and writing—original draft.

  • P More: formal analysis and writing—review and editing.

  • S Albrecht: data curation, formal analysis, and writing—review and editing.

  • F Marini: formal analysis and writing—review and editing.

  • W Kaiser: dissecting larval brain.

  • A Kulkarni: dissecting larval brain.

  • J-F Fontaine: supervision and writing—review and editing.

  • L Wojnowski: writing—review and editing.

  • MA Andrade-Navarro: writing—review and editing.

  • M Silies: writing—original draft.

  • C Berger: funding acquisition and writing—review and editing.

Conflict of Interest Statement

The authors declare that they have no conflict of interest.

References

  • 1.Orlando V. (2000) Mapping chromosomal proteins in vivo by formaldehyde-crosslinked-chromatin immunoprecipitation. Trends Biochem Sci 25: 99–104. 10.1016/s0968-0004(99)01535-2 [DOI] [PubMed] [Google Scholar]
  • 2.Mundade R, Ozer HG, Wei H, Prabhu L, Lu T (2014) Role of ChIP-seq in the discovery of transcription factor binding sites, differential gene regulation mechanism, epigenetic marks and beyond. Cell Cycle 13: 2847–2852. 10.4161/15384101.2014.949201 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Stathopulos PB, Scholz GA, Hwang YM, Rumfeldt JA, Lepock JR, Meiering EM (2004) Sonication of proteins causes formation of aggregates that resemble amyloid. Protein Sci 13: 3017–3027. 10.1110/ps.04831804 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Mieczkowski J, Cook A, Bowman SK, Mueller B, Alver BH, Kundu S, Deaton AM, Urban JA, Larschan E, Park PJ, et al. (2016) MNase titration reveals differences between nucleosome occupancy and chromatin accessibility. Nat Commun 7: 11485 10.1038/ncomms11485 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Gutierrez G, Milan-Zambrano G, Medina DA, Jordan-Pla A, Perez-Ortin JE, Penate X, Chavez S (2017) Subtracting the sequence bias from partially digested MNase-seq data reveals a general contribution of TFIIS to nucleosome positioning. Epigenetics Chromatin 10: 58 10.1186/s13072-017-0165-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Dingwall C, Lomonossoff GP, Laskey RA (1981) High sequence specificity of micrococcal nuclease. Nucleic Acids Res 9: 2659–2673. 10.1093/nar/9.12.2659 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Zheng X, Yue S, Chen H, Weber B, Jia J, Zheng Y (2015) Low-cell-number epigenome profiling aids the study of lens aging and hematopoiesis. Cell Rep 13: 1505–1518. 10.1016/j.celrep.2015.10.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Adli M, Zhu J, Bernstein BE (2010) Genome-wide chromatin maps derived from limited numbers of hematopoietic progenitors. Nat Methods 7: 615–618. 10.1038/nmeth.1478 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.van Galen P, Viny AD, Ram O, Ryan RJ, Cotton MJ, Donohue L, Sievers C, Drier Y, Liau BB, Gillespie SM, et al. (2016) A multiplexed system for quantitative comparisons of chromatin landscapes. Mol Cell 61: 170–180. 10.1016/j.molcel.2015.11.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Ross-Innes CS, Stark R, Teschendorff AE, Holmes KA, Ali HR, Dunning MJ, Brown GD, Gojis O, Ellis IO, Green AR, et al. (2012) Differential oestrogen receptor binding is associated with clinical outcome in breast cancer. Nature 481: 389–393. 10.1038/nature10730 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Schmidl C, Rendeiro AF, Sheffield NC, Bock C (2015) ChIPmentation: Fast, robust, low-input ChIP-seq for histones and transcription factors. Nat Methods 12: 963–965. 10.1038/nmeth.3542 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Skene PJ, Henikoff JG, Henikoff S (2018) Targeted in situ genome-wide profiling with high efficiency for low cell numbers. Nat Protoc 13: 1006–1019. 10.1038/nprot.2018.015 [DOI] [PubMed] [Google Scholar]
  • 13.Picelli S, Bjorklund AK, Reinius B, Sagasser S, Winberg G, Sandberg R (2014) Tn5 transposase and tagmentation procedures for massively scaled sequencing projects. Genome Res 24: 2033–2040. 10.1101/gr.177881.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Seguin-Orlando A, Schubert M, Clary J, Stagegaard J, Alberdi MT, Prado JL, Prieto A, Willerslev E, Orlando L (2013) Ligation bias in illumina next-generation DNA libraries: Implications for sequencing ancient genomes. PLoS One 8: e78575 10.1371/journal.pone.0078575 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Ghavi-Helm Y, Zhao B, Furlong EE (2016) Chromatin immunoprecipitation for analyzing transcription factor binding and histone modifications in Drosophila. Methods Mol Biol 1478: 263–277. 10.1007/978-1-4939-6371-3_16 [DOI] [PubMed] [Google Scholar]
  • 16.Berger C, Harzer H, Burkard TR, Steinmann J, van der Horst S, Laurenson AS, Novatchkova M, Reichert H, Knoblich JA (2012) FACS purification and transcriptome analysis of drosophila neural stem cells reveals a role for Klumpfuss in self-renewal. Cell Rep 2: 407–418. 10.1016/j.celrep.2012.07.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Bowman SK, Rolland V, Betschinger J, Kinsey KA, Emery G, Knoblich JA (2008) The tumor suppressors Brat and Numb regulate transit-amplifying neuroblast lineages in Drosophila. Dev Cell 14: 535–546. 10.1016/j.devcel.2008.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Wang H, Somers GW, Bashirullah A, Heberlein U, Yu F, Chia W (2006) Aurora-A acts as a tumor suppressor and regulates self-renewal of Drosophila neuroblasts. Genes Dev 20: 3453–3463. 10.1101/gad.1487506 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Consortium EP. (2012) An integrated encyclopedia of DNA elements in the human genome. Nature 489: 57–74. 10.1038/nature11247 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Landt SG, Marinov GK, Kundaje A, Kheradpour P, Pauli F, Batzoglou S, Bernstein BE, Bickel P, Brown JB, Cayting P, et al. (2012) ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res 22: 1813–1831. 10.1101/gr.136184.111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Buenrostro JD, Giresi PG, Zaba LC, Chang HY, Greenleaf WJ (2013) Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat Methods 10: 1213–1218. 10.1038/nmeth.2688 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK (2010) Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell 38: 576–589. 10.1016/j.molcel.2010.05.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Langmead B. (2010) Aligning short sequencing reads with Bowtie. Curr Protoc Bioinformatics 10.1002/0471250953.bi1107s32Chapter 11. Unit 11 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, et al. (2008) Model-based analysis of ChIP-seq (MACS). Genome Biol 9: R137 10.1186/gb-2008-9-9-r137 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Yu G, Wang LG, He QY (2015) ChIPseeker: An R/bioconductor package for ChIP peak annotation, comparison and visualization. Bioinformatics 31: 2382–2383. 10.1093/bioinformatics/btv145 [DOI] [PubMed] [Google Scholar]
  • 26.Ramirez F, Ryan DP, Grüning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dündar F, Manke T (2016) deepTools2: A next generation web server for deep-sequencing data analysis. Nucleic Acids Res 44: W160–W165. 10.1093/nar/gkw257 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Data S1.

LSA-2019-00318_Supplemental_Data_1.docx (22.3KB, docx)

Supplemental Data S2.

LSA-2019-00318_Supplemental_Data_2.txt (16.2KB, txt)

Reviewer comments

Articles from Life Science Alliance are provided here courtesy of Life Science Alliance LLC

RESOURCES