Skip to main content
Frontiers in Genetics logoLink to Frontiers in Genetics
. 2021 Mar 26;12:649959. doi: 10.3389/fgene.2021.649959

“Adopt-a-Tissue” Initiative Advances Efforts to Identify Tissue-Specific Histone Marks in the Mare

N B Kingsley 1,2, Natasha A Hamilton 3, Gabriella Lindgren 4,5, Ludovic Orlando 6, Ernie Bailey 7, Samantha Brooks 8, Molly McCue 9, T S Kalbfleisch 7, James N MacLeod 7, Jessica L Petersen 10, Carrie J Finno 2, Rebecca R Bellone 1,2,*
PMCID: PMC8033197  PMID: 33841506

Introduction

The equine genetics and genomics research community has a long history of synergistic collaborations for developing tools and resources to advance equine biology. Starting in 1995 with the first International Equine Gene Mapping Workshop supported by the Dorothy Russell Havemeyer Foundation Inc. (Bailey, 2010), researchers collaborated to build comprehensive equine linkage maps (Guérin et al., 1999, 2003; Penedo et al., 2005; Swinburne et al., 2006), radiation hybrid and comparative maps (Caetano et al., 1999; Chowdhary et al., 2002), physical marker and BAC contig maps (Raudsepp et al., 2004, 2008; Leeb et al., 2006), reference genomes for the horse (Wade et al., 2009; Kalbfleisch et al., 2018), and genotyping arrays to economically map and study traits of interest for horse owners and breeders (McCue et al., 2012; McCoy and McCue, 2014; Schaefer et al., 2017). Continuing the legacy of community-based advancements, a new collective effort began in 2015 to functionally annotate DNA elements in the horse as part of the international Functional Annotation of ANimal Genomes (FAANG) Consortium (Andersson et al., 2015; Tuggle et al., 2016; Burns et al., 2018).

Reminiscent of the ENCODE project in humans and mice (Dunham et al., 2012), the ultimate goal of the FAANG consortium is to annotate the major functional elements in the genomes of domesticated animal species (Andersson et al., 2015). In particular, four histone modifications were chosen by the consortium to characterize the genomic locations of enhancers (H3K4me1), promoters and transcription start sites (H3K4me3), open chromatin with active regulatory elements (H3K27ac), and facultative heterochromatin with inaccessible or repressed regulatory elements (H3K27me3) (Andersson et al., 2015; Giuffra and Tuggle, 2019). The initial equine FAANG efforts identified putative regulatory regions in eight prioritized tissues of interest (TOI) by performing Chromatin Immuno-Precipitation Sequencing (ChIP-Seq) for the four target histone marks (Kingsley et al., 2020). In that investigation, more than one million putative regulatory sites were characterized across the equine genome. With more than 80 tissues, cell lines, and body fluids stored in the equine biobank (Burns et al., 2018), further opportunities to expand the scope of the annotation work exist. To leverage the benefits of the biobank, a collaborative sponsorship program titled “Adopt-a-Tissue” was created to enable researchers from across the globe to select and support annotation of a tissue by the equine FAANG group. Through this effort, four additional “Adopted” tissues— spleen, metacarpal 3 (MC3), sesamoid, and full thickness skin— were assayed by histone mark ChIP-Seq to expand the tissue-specific annotation resources for the entire equine research community.

Methods

All ChIP-Seq assays were performed by Diagenode ChIP-Seq Profiling Service (Diagenode, Cat# G02010000, Liège, Belgium). Summarized experimental procedures are available in more detail at the FAANG FTP portal hosted by EBI (ftp://ftp.faang.ebi.ac.uk/faang/ftp/protocols/assays/ and ftp://ftp.faang.ebi.ac.uk/faang/ftp/protocols/experiments/). Spleen samples were processed following the assay procedures outlined in UCD_SOP_ChIP-Seq_for_Histone_Marks_20191101.pdf. Skin and both bone tissues were processed following the experimental protocols outlined in UCD_SOP_ChIP-seq_for_Histone_Marks_Skin_20201218.pdf and UCD_SOP_ChIP-seq_for_Histone_Marks_Bone_20201218.pdf, respectively. “Adopted” tissues, as summarized in Supplementary Table 1, were collected from two Thoroughbred mares (denoted as ECA_UCD_AH1 for SAMEA104728862 and ECA_UCD_AH2 for SAMEA104728877) as part of the FAANG equine biobank (Burns et al., 2018) following protocols approved by the University of California, Davis Institutional Animal Care and Use Committee (Protocol #19037).

Chromatin was isolated from the two bone tissues using the TrueMicro ChIP-Seq kit (Diagenode Cat# C01010140) and from spleen and skin using the iDeal ChIP-Seq kit for Histones (Diagenode Cat# C01010059). Starting amounts for each replicate varied by tissue with ~100 mg for spleen, 375–770 mg for MC3, 445–650 mg for sesamoid, and ~125 mg for skin. After homogenization, fixed samples were sheared with the Bioruptor® Pico (Diagenode Cat# B01060001) for 12 (spleen), 10–12 (MC3 and sesamoid), and 8 (skin) cycles of 30 s on and 30 s off. The amount of chromatin yield and thus chromatin per IP varied by tissue. Spleen and skin had the greatest amounts (1.5 μg and 600 ng, respectively) per IP and MC3 and sesamoid had the least (350 ng each). The following antibody concentrations were used for MC3, sesamoid, and skin: 0.5 μg for H3K4me1, 0.5 μg for H3K4me3, 1 μg for H3K27ac, and 1 μg for H3K27me3. To account for the greater amount of chromatin from spleen, twice the amount of antibody was used for each mark compared to the other three tissues. For all tissues, 10% of the total chromatin from each replicate was saved for the input.

Libraries were prepared with the IP-Star® Compact Automated System (Diagenode Cat# B03000002) using the MicroPlex Library Preparation Kit v2 (Diagenode Cat# C05010013). Spleen, MC3, and sesamoid were sequenced as 50 base pair single-end (SE) reads on the HiSeq 4000 platform (Illumina, San Diego, CA, USA). For these tissues, the broad mark (H3K27me3) was sequenced to a minimum of 50M raw reads while the remaining marks (H3Kme1, H3K4me3, and H3K27ac) and the input were sequenced to a minimum depth of 30 M raw reads. Due to advancements in sequencing technology, skin tissue was sequenced as 50 base pair paired-end (PE) reads on the NovaSeq 6000 (Illumina, San Diego, CA, USA). For skin, the broad mark (H3K27me3) was sequenced to a minimum of 100 M raw fragments while the remaining marks (H3Kme1, H3K4me3, and H3K27ac) and the input were sequenced to a minimum depth of 40 M raw fragments.

Methods for analyzing SE reads followed the procedures described previously (Kingsley et al., 2020) and modifications were made to the SE analysis methods to accommodate PE data generated from skin. After trimming with Trim-Galore version 0.4.0 (Martin, 2011; Andrews et al., 2012), reads were aligned to EquCab3.0 (Kalbfleisch et al., 2018) with BWA-MEM version 0.7.9a (Li and Durbin, 2009). Alignments in BAM format were filtered using SAMtools version 1.9 (Li et al., 2009). Reads were removed if they did not map, had secondary alignments (including split hits), failed platform/vendor quality tests, were identified as optical duplicates, or had an alignment quality score <30. PE reads were also removed if the mates did not map. PCR duplicates were marked with PicardTools version 2.7.1 (Picard toolkit, 2019) and removed with SAMtools. For peak-calling, MACS2 version 2.1.1.20160309 (Zhang et al., 2008) was used to call peaks for all marks with PE data denoted by a PE flag (-f BAMPE). SICERpy version 0.1.1 was also used to call peaks for H3K27me3 as it specializes in broad peak calling (SICERpy, SICERpy, GitHub Repository; Zang et al., 2009). To use SICERpy with the PE data, the second read in each pair was removed and data were processed as SE based on recommendations from the software developers. Peak-calls were combined by identifying overlapping regions of enrichment in both biological replicates where at least one replicate was significantly enriched for a given mark. Heatmaps and quality metrics were generated using deepTools 2.4.2 (Ramírez et al., 2016), SPP 1.13 (Kharchenko et al., 2008), and custom scripts. Detailed bioinformatic workflows are available at ftp://ftp.faang.ebi.ac.uk/faang/ftp/protocols/analysis/.

Quality Assessment

Library Complexity

Data were assessed for library complexity with metrics established by ENCODE and endorsed by FAANG, including nonredundant fraction (NRF), PCR bottleneck coefficient 1 (PBC1), and PCR bottleneck coefficient 2 (PBC2) (Landt et al., 2012; Kingsley et al., 2020). All of the libraries prepared surpassed the quality threshold for the PBC2 metric (PBC2 > 1), however, several marks and tissues fell below the quality threshold for NRF and PBC1 (Table 1). For example, three of the four marks for spleen passed all library complexity measures while the H3K27me3 data from both biological replicates failed NRF and PBC1. Additionally, both replicates for sesamoid and MC3 passed all three metrics for H3K4me1 and H3K27me3 but fell below threshold for H3K4me3 and H3K27ac. All skin libraries passed NRF and PBC1 thresholds with three exceptions: both replicates for H3K4me3 and ECA_UCD_AH2 replicate for H3K4me1.

Table 1.

Quality metrics and peak-calling summary for each biological replicate.

Mark Tissue Replicate NRF PBC1 PBC2 NSC RSC JSD FRiP Peak calls
Threshold: (>0.5) (>0.5) (>1) (>1.05) (>0.8) (>0.05) (>0.01)
K4me1 Spleen AH1 0.65 0.64 2.82 1.03 0.98 0.39 0.20 84,146
K4me1 Spleen AH2 0.74 0.74 3.80 1.04 0.96 0.39 0.25 113,447
K4me3 Spleen AH1 0.62 0.64 2.98 2.67 1.44 0.62 0.57 28,735
K4me3 Spleen AH2 0.57 0.60 2.67 2.53 1.34 0.62 0.57 31,198
K27ac Spleen AH1 0.65 0.66 2.96 1.34 1.54 0.46 0.29 50,977
K27ac Spleen AH2 0.70 0.70 3.40 1.31 1.46 0.45 0.29 60,281
K27me3 OriginalSpleen AH1 0.43 0.42 1.79 1.01 0.64 0.02 0.05 1,136
K27me3 RepeatSpleen AH1 0.13 0.28 2.94 1.03 0.33 0.20 0.02 164
K27me3 MergedSpleen AH1 0.32 0.40 1.86 1.01 0.59 0.50 0.05 6,297
K27me3 OriginalSpleen AH2 0.44 0.43 1.82 1.01 0.67 0.05 0.08 6,647
K27me3 RepeatSpleen AH2 0.66 0.67 3.02 1.02 0.74 0.06 0.10 32,492
K27me3 MergedSpleen AH2 0.53 0.55 2.40 1.01 0.83 0.06 0.11 37,629
K4me1 Sesamoid AH1 0.63 0.63 2.64 1.02 0.63 0.31 0.08 43,397
K4me1 Sesamoid AH2 0.84 0.85 6.46 1.01 0.49 0.21 0.00 4
K4me3 Sesamoid AH1 0.37 0.39 1.79 2.33 1.26 0.57 0.48 19,617
K4me3 Sesamoid AH2 0.39 0.40 1.78 1.40 1.20 0.41 0.21 16,524
K27ac Sesamoid AH1 0.42 0.42 1.81 1.16 1.19 0.40 0.19 34,223
K27ac Sesamoid AH2 0.34 0.34 1.67 1.02 0.60 0.31 0.02 5,013
K27me3 Sesamoid AH1 0.68 0.68 3.11 1.01 0.48 0.25 0.06 1,840
K27me3 Sesamoid AH2 0.75 0.75 3.97 1.01 0.56 0.18 0.00 0
K4me1 MC3 AH1 0.74 0.74 3.76 1.02 0.68 0.12 0.09 56,238
K4me1 MC3 AH2 0.77 0.77 4.28 1.02 0.63 0.13 0.07 47,452
K4me3 MC3 AH1 0.14 0.27 2.91 2.73 1.22 0.48 0.50 19,209
K4me3 MC3 AH2 0.32 0.34 1.71 2.56 1.20 0.48 0.50 21,339
K27ac MC3 AH1 0.27 0.29 1.65 1.25 1.19 0.30 0.23 36,022
K27ac MC3 AH2 0.09 0.26 4.56 1.38 0.95 0.32 0.19 16,638
K27me3 MC3 AH1 0.57 0.58 2.36 1.01 0.55 0.25 0.10 17,001
K27me3 MC3 AH2 0.65 0.65 2.88 1.01 0.51 0.22 0.08 13,790
K4me1 Skin AH1 0.43 0.47 2.14 1.15 2.85 0.29 0.34 115,470
K4me1 Skin AH2 0.53 0.55 2.39 1.13 3.06 0.25 0.29 109,322
K4me3 Skin AH1 0.41 0.46 2.12 3.14 1.27 0.60 0.69 24,442
K4me3 Skin AH2 0.32 0.40 2.12 3.19 1.30 0.60 0.68 23,584
K27ac Skin AH1 0.50 0.53 2.30 1.51 1.45 0.41 0.47 58,278
K27ac Skin AH2 0.58 0.59 2.57 1.47 1.46 0.40 0.47 57,737
K27me3 Skin AH1 0.50 0.53 2.36 1.06 3.20 0.14 0.24 95,788
K27me3 Skin AH2 0.50 0.54 2.40 1.06 4.09 0.11 0.21 77,151

The summary includes six quality metrics—NRF, Non-Redundant Fraction; PBC1 and PBC2, PCR Bottleneck Coefficient 1 and 2; NSC, Normalized Strand Cross-Correlation Coefficient; RSC, Relative Strand Cross-Correlation Coefficient; FRiP, Fraction of Reads in Peaks– and thresholds originally established by ENCODE, and the Jensen Shannon Distance (JSD). Samples include all of the original spleen IP, the repeated spleen IP for H3K27me3, and the merged (original + repeated) spleen IPs for H3K27me3. Peaks used to determine FRiP and peak numbers for H3K27me3 were called with SICER. All other peaks were generated with MACS2. Biological replicates are denoted as AH1 for SAMEA104728862 and AH2 for SAMEA104728877. Red highlighting indicates values below the quality thresholds.

In addition to quality metrics, sequencing data were evaluated at several processing stages of the analysis including alignment and PCR deduplication. All datasets generated high mapping quality scores (>35) and exceeded the minimum sequencing targets as described in the methods (Supplementary Table 2). Skin and spleen tissues retained a high number of reads for H3K4me1, H3K4me3, and H3K27ac after alignment, filtering, and deduplication (>20 M reads per replicate). Although all three activating marks were sequenced to the same target for both bone tissues, H3K4me1 retained more than 20 M reads per replicate while H3K4me3 and H3K27ac fell below 20 M processed reads per replicate with the majority of reads removed by deduplication. More than 40 M reads remained for each H3K27me3 replicate after processing with the exception of ECA_UCD_AH2 for sesamoid.

IP Enrichment

Data were also evaluated for IP enrichment using a variety of metrics to determine signal quality. Using normalized strand cross correlation (NSC) and relative strand cross correlation (RSC) assessments established by ENCODE (Landt et al., 2012), all marks for skin tissue exceeded the minimum quality threshold (Table 1). Additionally, the biological replicates for H3K4me3 and H3K27ac from spleen and MC3, as well as the H3K4me3 replicates for sesamoid, passed both cross-correlation measures. Similar to the library complexity metrics, several tissues fell below the quality thresholds (NCS > 1.05 and RSC > 0.8) including H3K4me1 from sesamoid and MC3; H3K27ac from ECA_UCD_AH2 sesamoid; and H3K27me3 from spleen, sesamoid, and MC3. Alignments were also assessed using the Jensen Shannon distance (JSD) to compare the distribution of reads with that of the background (input). Using JSD, H3K27me3 from both spleen replicates had values below 0.05, which is indicative of insufficient IP enrichment.

The final measure of IP enrichment evaluated the fraction of reads in peaks (FRiP) by comparing the peak calls with the read distribution for each sample. All tissues produced a high proportion of aligned reads within peaks for H3K4me3, ranging from 0.21 for sesamoid to 0.69 for skin. Similarly, MC3, skin, and spleen generated high FRiP scores for H3K27ac (0.47–0.19), and peaks from skin and spleen also scored well for H3K4me1 (0.47–0.29). Although lower than the values from skin and spleen, FRiP scores from MC3 indicated sufficient enrichment was obtained for H3K4me1 (0.07–0.09). For sesamoid tissue, the ECA_UCD_AH2 replicate generated peaks with comparable enrichment for H3K4me1, H3K27ac, and H3K27me3, while the ECA_UCD_AH2 replicate scored below threshold for both H3K4me1 and H3K27me3 (0.0005 and 0.0043, respectively). Further, H3K27me3 peaks from skin generated a substantially higher fraction of reads compared with MC3 and spleen (0.21–0.24 vs. 0.05–0.10), although all three of these tissues obtained sufficient enrichment based on this assessment.

Replicate Comparison

In addition to quality assessments for the read alignments, peaks called from the biological replicates were compared. For most of the marks, the percentage of genome covered by peaks was consistent with previously reported values for the TOI (Table 1). For sesamoid tissue, at least one replicate for H3K4me1, H3K27ac, and H3K27me3 generated fewer peak calls than expected based on results from the other replicate and the MC3 replicates. Additionally, the initial data for H3K27me3 from both spleen replicates yielded fewer peaks in accordance with the low complexity and enrichment scores for those libraries. The Jaccard similarity coefficient identified the highest correlation between the biological replicates for H3K4me3 across all “adopted” tissues, ranging from 0.65 to 0.84 (Table 2), and data from skin also showed high correlation for all marks (0.44–0.84). Replicates for spleen and MC3 had moderate levels of similarity for H3K4me1 and H3K27ac (0.32–0.58), while the biological replicates for H3K4me1 and H3K27me3 from sesamoid had no identity detected, consistent with the low-scoring quality assessments.

Table 2.

Summary of the combined peak calls and replicate comparison.

Mark Tissue Combined peak number % Covered Jaccard similarity coefficient
H3K4me1 Spleen 73,528 2.98 0.44
H3K4me3 Spleen 28,661 1.56 0.80
H3K27ac Spleen 51,427 1.82 0.58
H3K27me3 MACS2 Spleen 7,349 0.09 0.01
H3K27me3 SICER Spleen 449 0.22 0.03
H3K4me1 MC3 46,511 1.16 0.32
H3K4me3 MC3 20,556 1.10 0.75
H3K27ac MC3 31,547 1.08 0.38
H3K27me3 MACS2 MC3 15,304 0.40 0.28
H3K27me3 SICER MC3 5,628 2.57 0.28
H3K4me1 Sesamoid 750 0.01 0.00
H3K4me3 Sesamoid 17,361 1.07 0.65
H3K27ac Sesamoid 13,160 0.67 0.08
H3K27me3 MACS2 Sesamoid 390 0.01 0.00
H3K27me3 SICER Sesamoid 703 0.26 0.00
H3K4me1 Skin 92,971 4.56 0.50
H3K4me3 Skin 24,353 1.60 0.84
H3K27ac Skin 54,946 3.38 0.67
H3K27me3 MACS2 Skin 51,480 6.02 0.44
H3K27me3 SICER Skin 11,764 6.28 0.44

The summary includes the combined number of peaks and the percentage of the genome covered by those peaks. The Jaccard Similarity Coefficient compares the two biological replicates with 1 being perfectly concordant and 0 being entirely discordant. Peaks for H3K4me1, H3K4me3, and H3K27ac were called with MACS2.

Additional Data Collection

Due to insufficient enrichment and replicate identity, IP and sequencing were repeated for H3K27me3 from both spleen replicates. Unfortunately, the repeated ECA_UCD_AH1 data had low library complexity and IP enrichment (Table 1 and Supplementary Table 2). To achieve sufficient data for accurate peak calling from spleen tissue, the first round of IP and sequencing from ECA_UCD_AH1 for H3K27me3 and both rounds from ECA_UCD_AH2 were used for combined peak calling. Reads from the two input files for ECA_UCD_AH2 were also merged. The number of combined peaks increased from 4,955 covering 1.98% from the first round of sequencing to 5,267 covering 2.18% of the genome when data were merged (Table 2). Similar issues with enrichment prevented sufficient signal for peak calling in sesamoid for three of the four marks, and therefore, a second round of IP and quality evaluation of ECA_UCD_AH2 sesamoid is underway for H3K4me1, H3K27ac, and H3K27me3.

Data Metrics

After combining replicates, the number of retained peaks for each mark from the SE data ranged from 4,933 to 73,528 for spleen and from 5,628 to 46,511 for MC3 (Table 2). For both tissues, H3K4me1— the mark indicative of enhancers— was found to have the highest number of peaks while the repressive mark was found to have the lowest. This pattern is also consistent with the TOI data (Kingsley et al., 2020). For PE skin data, the number of combined peaks varied from 24,353 to 92,971 regions, and H3K4me3, which denotes promoters, was the mark with the lowest number of peaks. Additionally, the amount of the genome covered by H3K27me3 peaks was substantially higher for skin compared to the other equine FAANG tissues analyzed to date (6.28 vs. 2.94%), while the number of reads retained for H3K27me3 from the PE data after filtering (42.8%) was comparable to the average retained for all of the equine H3K27me3 SE data (41.3%, PRJEB42315 and PRJEB35307).

Evaluating general enrichment patterns revealed that the “adopted” tissues detected mark distributions for the activating marks that were consistent with those identified previously for the TOI (Supplementary Figures 13). Data for H3K27me3 from skin, however, generated strong enrichment around the TSS and upstream of an average gene, while still maintaining a similar level of relative enrichment for H3K27me3 distributed throughout the rest of the gene body and downstream as seen for other tissues (Supplementary Figure 4). Evaluation of the spleen datasets detected the strongest H3K27me3 enrichment when combining the original ECA_UCD_AH1 dataset and the merged ECA_UCD_AH2 dataset (denoted as “spleen” on Supplementary Figure 4). While enrichment distributions for sesamoid tissue detected consistent patterns for H3K4me1, H3K27ac, and H3K27me3, the relative level of enrichment is lower than expected based on the other tissues. In addition to genome-wide evaluations, the replicate-combined peak calls were also manually evaluated across a small number of well-characterized regions. Consistent with expectations, activating marks were detected at the TSS and upstream of ubiquitously expressed genes such as ACTB for all tissues (Supplementary Figures 5A,B). Additionally, all “adopted” tissues lacked peaks indicative of active transcription for a liver-specific gene known as CYP2E1 (Supplementary Figures 5C,D).

Discussion

The ENCODE project profoundly impacted scientific understanding of genome function in humans by enabling researchers to explore previously impossible challenges, such as charting genomic landscape shifts during development and uncovering enhancer networks associated with disease (Nord et al., 2013; Rhie et al., 2016). The advancements made by ENCODE paved a path for the FAANG consortium to characterize genomic function in numerous agricultural species (Andersson et al., 2015; Tuggle et al., 2016; Giuffra and Tuggle, 2019), which will expand research opportunities across diverse genera. As a part of the larger consortium, the equine FAANG group established a community-based initiative to “adopt” additional tissues for annotation. As a result of that expansive collaborative effort, characterization of putative regulatory regions was performed in spleen, sesamoid, MC3, and skin. The four additional tissues are of major importance for equine health and traits of economic impact. Specifically, research on catastrophic fracture involving sesamoid and MC3 can benefit from bone-specific annotations as recent advances in treatment have focused on transgenically modified stem cell therapeutics (Ball et al., 2019). Similarly, many diseases and traits under artificial selection in horses, such as melanoma, insect bite hypersensitivity, and coat colors including Appaloosa spotting among others, involve skin tissue (Rieder et al., 2000, 2001; Bellone et al., 2008, 2013; Rosengren Pielberg et al., 2008; Curik et al., 2013; Lanz et al., 2017). Several of these characterized phenotypes have been associated with mutations affecting gene expression (Rieder et al., 2000; Rosengren Pielberg et al., 2008; Bellone et al., 2013), making regulatory regions identified from whole skin a valuable resource for equine researchers. The “Adopt-a-Tissue” effort fits into a broader legacy of collaborative resource development that has historically led to rapid advancements for equine genomics and will continue to push equine science toward new frontiers. In concordance with past community efforts, the high quality data generated from the “Adopted” tissues are publicly available to benefit all investigators and lead to further progress in equine research.

Using quality metrics first standardized by ENCODE (Dunham et al., 2012), we identified low IP enrichment for the broad mark in spleen, sesamoid, and MC3 tissues. Unlike the SE datasets, the skin replicates sequenced with PE reads generated a higher enrichment signal for H3K27me3 as determined by quality metrics and enrichment topology plots. In particular, enrichment near the TSS was more strongly detected for skin than for any of the TOI or the other “adopted” tissues, suggesting that PE reads may better evaluate the broad repressive mark than SE datasets. With only one tissue evaluated as PE, we cannot exclude the possibility that this enrichment pattern may be skin-specific rather than evidence of a better method for detecting H3K27me3. Although enrichment difficulties have been previously recognized for the broad domains like those of H3K27me3 (Landt et al., 2012; Carelli et al., 2017), investigation of specific ChIP methods for broad histone marks appear to be rare. O'Geen et al. (2011) used both short and long sonication periods to account for the different rates of shearing efficiency for compact versus. open chromatin. They found that the larger DNA fragments after sonication were more enriched for broad repressive histone marks while smaller fragments were more likely to contain active chromatin modifications (O'Geen et al., 2011). Their work suggests that shorter sonication times and stringent size selection may bias ChIP samples toward higher enrichment of regions containing narrow marks at the expense of more condensed areas with broad marks, yet current ChIP-Seq standards do not encourage separate protocols for the different mark topologies (Landt et al., 2012; ENCODE Guidelines for Experiments Generating ChIP-seq Data, 2017). Instead, advances in ChIP-Seq methods have focused on analysis and software development to accommodate the different enrichment levels expected from broad and narrow domains assayed with the same protocol (Zhang et al., 2008; Zang et al., 2009). Future investigations involving H3K27me3 and other broad histone modifications may benefit from developing bench protocols, including sequencing parameters, that are specific for broad marks.

To account for insufficient H3K27me3 signal from spleen tissue, IP and sequencing were repeated for both biological replicates. By combining the reads from both sets of data for ECA_UCD_AH2, we were able to obtain sufficient enrichment for peak identification. These data support that combining results from different IPs performed on the same tissue sample can be a useful approach to obtain the enrichment needed for annotation purposes. Study of the best means for combining information from biological and technical replicates for differential enrichment analyses suggests that combining ChIP datasets without accounting for enrichment levels may lead to more false negatives (Bao et al., 2013). Although our data may not have captured all possible peaks, combining data enabled detection of more H3K27me3 peak calls with higher consistency than possible with the first dataset alone. Therefore, the current peak calls can serve as the starting point for spleen-specific annotations, which can be improved upon with characterization of heterochromatin regions from additional equine spleen samples.

The low quality metrics for three of the four marks from ECA_UCD_AH1 sesamoid tissue indicated there was low IP enrichment. To the best of the authors' knowledge, the MC3 and sesamoid data generated here represent the first histone mark peak calls from healthy, whole bone tissue. The overall lower quality metrics for bone tissues support the difficulty of working with these tissues, however, one of the two replicates for sesamoid showed sufficient quality for all four marks, suggesting the issue may be sample specific. To determine if any issues arose during chromatin extraction or IP, further evaluation of H3K4me1, H3K27ac, and H3K27me3 marks in sesamoid tissue from ECA_UCD_AH1 is warranted. Additional data generated from ECA_UCD_AH1 sesamoid tissue will be added to PRJEB42315 when available.

Previous equine annotations were developed based on homology and transcriptomics, leaving much of the genome, especially noncoding regions, uncharacterized (Hestand et al., 2015; Aken et al., 2016; Mansour et al., 2017). While valuable, annotation of regulatory regions based solely on homology with other species is not expected to be sufficient given the evolutionary role of these elements within and among species (Schmidt et al., 2010; McLean et al., 2011; Shibata et al., 2012; Lowdon et al., 2016). With the first publication of the equine FAANG data from eight prioritized tissues (Kingsley et al., 2020) and the four “adopted” tissues presented in this manuscript, researchers can begin to interrogate the role of regulatory regions in equine traits, such as the recent investigation of a novel 16 KB deletion associated with an ocular disorder known as distichiasis (Hisey et al., 2020). Future annotations for the horse will include maps of regulatory states characteristic of healthy tissue, making it a vital resource to compare against disease states. The histone ChIP-Seq data from the horse have already been integrated into a useable annotation resource by a new project known as FAANGMine (FAANGMine, FAANGMine). Similar to FlyMine (Lyne et al., 2007), the project aims to combine the results from all of the genomic assays used by the FAANG consortium into a single resource for easier use. Thanks to these integration effort, additional equine FAANG datasets including the “adopted” tissue peak calls will open up opportunities for variant investigations in previously uncharacterized noncoding regions and expand research opportunities in equine omics.

Data Availability Statement

Data were submitted to the European Nucleotide Archive following the best practices established by the FAANG Metadata and Data Sharing Committee and the FAANG Data Coordination Centre (Harrison et al., 2018). All of the new data referenced in the article were submitted under project ID PRJEB42315. The following files types were submitted for all high quality data: raw fastq for each mark and input from both replicates (38 files), processed BAM files for each mark and input from both replicates (34 files), bed files with peak calls per replicate including both SICERpy and MACS2 calls for H3K27me3 (32 files), and bed files with combined peak calls including those from both SICERpy and MACS2 for H3K27me3 (16 files). All files and metadata can be accessed from the FAANG Data Portal (https://data.faang.org/home). Previously published FAANG data used in the comparisons are also available from the FAANG data portal under project ID PRJEB35307 (Kingsley et al., 2020).

Ethics Statement

The animal study was reviewed and approved by University of California, Davis Institutional Animal Care and Use Committee.

Author Contributions

CF, RB, JP, TK, JM, NH, GL, LO, SB, MM, and EB: conceptualization. NK, RB, CF, and JP: methodology. NK: formal analysis, investigation, data curation, and visualization. CF, RB, and JP: resources. NK and RB: writing—original draft preparation. CF, JP, JM, TK, NH, GL, LO, SB, MM, and EB: writing—review and editing. RB: supervision. CF, RB, and JP: project administration. CF, RB, JP, TK, JM, NH, GL, LO, EB, SB, and MM: funding acquisition. All authors have read and agreed to the published version of the manuscript.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors would like to acknowledge Drs. Catherine Creppe, Paola Genevini, and the other members of the Diagenode Epigenomics Services Team for their high-quality histone mark profiling service. The authors would also like to thank Drs. Colin Kern, Huaijun Zhou, Pablo Ross, Ying Wang, and the other members of the UC Davis FAANG group for providing technical expertise and feedback. Additionally, the authors would like to recognize the contributions from horse owners that made this work possible.

Footnotes

Funding. Funding for experimental materials, and other resources was provided by the Grayson Jockey Club Foundation, United States Department of Agriculture (USDA) NRSP-8 equine species coordinator funds, a Priority Partnership Collaboration Award from the University of Sydney and University of California, Davis, and the Center for Equine Health (CEH) at UC Davis with funds provided by the State of California pari-mutuel fund and contributions by private donors. Support for NK was provided by the Grayson Jockey Club Foundation, USDA (2018-06530), Morris Animal Foundation (D16-EQ-028), and a CEH fellowship. Support for CF was provided by the National Institutes of Health (L40 TR001136). Publication fees supplied by UC Davis Open Access Publication Fund.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2021.649959/full#supplementary-material

References

  1. Aken B. L., Ayling S., Barrell D., Clarke L., Curwen V., Fairley S., et al. (2016). The Ensembl gene annotation system. Database. 2016, 1–19. 10.1093/database/baw093 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Andersson L., Archibald A. L., Bottema C. D., Brauning R., Burgess S. C., Burt D. W., et al. (2015). Coordinated international action to accelerate genome-to-phenome with FAANG, the Functional Annotation of Animal Genomes project. Genome Biol. 16, 4–9. 10.1186/s13059-015-0622-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Andrews S., Krueger F., Segonds-Pichon A., Biggins L., Krueger C., Wingett S. (2012). Trim Galore. Available online at: https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/ (accessed May 6, 2020).
  4. Bailey E. (2010). Horse genomics and the Dorothy Russell Havemeyer Foundation. Anim. Genet. 41:1. 10.1111/j.1365-2052.2010.02136.x [DOI] [PubMed] [Google Scholar]
  5. Ball A. N., Phillips J. N., McIlwraith C. W., Kawcak C. E., Samulski R. J., Goodrich L. R. (2019). Genetic modification of scAAV-equine-BMP-2 transduced bone-marrow-derived mesenchymal stem cells before and after cryopreservation: An “off-the-shelf” option for fracture repair. J. Orthop. Res. 37, 1310–1317. 10.1002/jor.24209 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bao Y., Vinciotti V., Wit E., 't Hoen P. A. C. (2013). Accounting for immunoprecipitation efficiencies in the statistical analysis of ChIP-seq data. BMC Bioinformatics 14:169. 10.1186/1471-2105-14-169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bellone R. R., Brooks S. A., Sandmeyer L., Murphy B. A., Forsyth G., Archer S., et al. (2008). Differential gene expression of TRPM1, the potential cause of congenital stationary night blindness and coat spotting patterns (LP) in the appaloosa horse (Equus caballus). Genetics 179, 1861–1870. 10.1534/genetics.108.088807 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bellone R. R., Holl H., Setaluri V., Devi S., Maddodi N., Archer S., et al. (2013). Evidence for a retroviral insertion in TRPM1 as the cause of congenital stationary night blindness and leopard complex spotting in the horse. PLoS ONE 8, 1–14. 10.1371/journal.pone.0078280 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Burns E. N., Bordbari M. H., Mienaltowski M. J., Affolter V. K., Barro M. V., Gianino F., et al. (2018). Generation of an equine biobank to be used for functional annotation of animal genomes project. Anim. Genet. 49, 564–570. 10.1111/age.12717 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Caetano A. R., Shiue Y. L., Lyons L. A., O'Brien S. J., Laughlin T. F., Bowling A. T., et al. (1999). A comparative gene map of the horse (Equus caballus). Genome Res. 9, 1239–1249. 10.1101/gr.9.12.1239 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Carelli F. N., Sharma G., Ahringer J. (2017). Broad chromatin domains: an important facet of genome regulation. BioEssays 39, 1–7. 10.1002/bies.201700124 [DOI] [PubMed] [Google Scholar]
  12. Chowdhary B. P., Raudsepp T., Honeycutt D., Owens E. K., Piumi F., Guérin G., et al. (2002). Construction of a 5000rad whole-genome radiation hybrid panel in the horse and generation of a comprehensive and comparative map for ECA11. Mamm. Genome 13, 89–94. 10.1007/s00335-001-2089-8 [DOI] [PubMed] [Google Scholar]
  13. Curik I., Druml T., Seltenhammer M., Sundström E., Pielberg G. R., Andersson L., et al. (2013). Complex inheritance of melanoma and pigmentation of coat and skin in grey horses. PLoS Genet. 9:e1003248. 10.1371/journal.pgen.1003248 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Dunham I., Kundaje A., Aldred S. F., Collins P. J., Davis C. A., Doyle F., et al. (2012). An integrated encyclopedia of DNA elements in the human genome. Nature 489, 57–74. 10.1038/nature11247 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. ENCODE Guidelines for Experiments Generating ChIP-seq Data (2017). Available online at: https://www.encodeproject.org/about/experiment-guidelines/ (accessed June 30, 2020).
  16. FAANGMine . Available online at: http://faangmine.org (accessed July 16, 2020).
  17. Giuffra E., Tuggle C. K. (2019). Functional annotation of animal genomes (FAANG): current achievements and roadmap. Annu. Rev. Anim. Biosci. 7, 65–88. 10.1146/annurev-animal-020518-114913 [DOI] [PubMed] [Google Scholar]
  18. Guérin G., Bailey E., Bernoco D., Anderson I., Antczak D. F., Bell K., et al. (1999). Report of the international equine gene mapping workshop: male linkage map. Anim. Genet. 30, 341–354. 10.1046/j.1365-2052.1999.00510.x [DOI] [PubMed] [Google Scholar]
  19. Guérin G., Bailey E., Bernoco D., Anderson I., Antczak D. F., Bell K., et al. (2003). The second generation of the International Equine Gene Mapping Workshop half-sibling linkage map. Anim. Genet. 34, 161–168. 10.1046/j.1365-2052.2003.00973.x [DOI] [PubMed] [Google Scholar]
  20. Harrison P. W., Fan J., Richardson D., Clarke L., Zerbino D., Cochrane G., et al. (2018). FAANG, establishing metadata standards, validation and best practices for the farmed and companion animal community. Anim. Genet. 49, 520–526. 10.1111/age.12736 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Hestand M. S., Kalbfleisch T. S., Coleman S. J., Zeng Z., Liu J., Orlando L., et al. (2015). Annotation of the protein coding regions of the equine genome. PLoS ONE 10, 1–13. 10.1371/journal.pone.0124375 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hisey E. A., Hermans H., Lounsberry Z. T., Avila F., Grahn R. A., Knickelbein K. E., et al. (2020). Whole genome sequencing identified a 16 kilobase deletion on ECA13 associated with distichiasis in Friesian horses. BMC Genomics 21, 1–13. 10.1186/s12864-020-07265-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Kalbfleisch T. S., Rice E. S., DePriest M. S., Walenz B. P., Hestand M. S., Vermeesch J. R., et al. (2018). Improved reference genome for the domestic horse increases assembly contiguity and composition. Commun. Biol. 1, 1–8. 10.1038/s42003-018-0199-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Kharchenko P. V., Tolstorukov M. Y., Park P. J. (2008). Design and analysis of ChIP-seq experiments for DNA-binding proteins. Nat. Biotechnol. 26, 1351–1359. 10.1038/nbt.1508 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kingsley N. B., Kern C., Creppe C., Hales E. N., Zhou H., Kalbfleisch T. S., et al. (2020). Functionally annotating regulatory elements in the equine genome using Histone Mark ChIP-Seq. Genes 11:3. 10.3390/genes11010003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Landt S. G., Marinov G. K., Kundaje A., Kheradpour P., Pauli F., Batzoglou S., et al. (2012). ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res. 22, 1813–1831. 10.1101/gr.136184.111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Lanz S., Brunner A., Graubner C., Marti E., Gerber V. (2017). Insect bite hypersensitivity in horses is associated with airway hyperreactivity. J. Vet. Intern. Med. 31, 1877–1883. 10.1111/jvim.14817 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Leeb T., Vogl C., Zhu B., de Jong P. J., Binns M. M., Chowdhary B. P., et al. (2006). A human-horse comparative map based on equine BAC end sequences. Genomics 87, 772–776. 10.1016/j.ygeno.2006.03.002 [DOI] [PubMed] [Google Scholar]
  29. Li H., Durbin R. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760. 10.1093/bioinformatics/btp324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Li H., Handsaker B., Wysoker A., Fennell T., Ruan J., Homer N., et al. (2009). The sequence alignment/map format and SAMtools. Bioinformatics 25, 2078–2079. 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Lowdon R. F., Jang H. S., Wang T. (2016). Evolution of epigenetic regulation in vertebrate genomes. Trends Genet. 32, 269–283. 10.1016/j.tig.2016.03.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Lyne R., Smith R., Rutherford K., Wakeling M., Varley A., Guillier F., et al. (2007). FlyMine: an integrated database for drosophila and anopheles genomics. Genome Biol. 8:R129. 10.1186/gb-2007-8-7-r129 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Mansour T. A., Scott E. Y., Finno C. J., Bellone R. R., Mienaltowski M. J., Penedo M. C., et al. (2017). Tissue resolved, gene structure refined equine transcriptome. BMC Genomics 18, 1–12. 10.1186/s12864-016-3451-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Martin M. (2011). Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 17, 10–12. 10.14806/ej.17.1.200 [DOI] [Google Scholar]
  35. McCoy A. M., McCue M. E. (2014). Validation of imputation between equine genotyping arrays. Anim. Genet. 45:153. 10.1111/age.12093 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. McCue M. E., Bannasch D. L., Petersen J. L., Gurr J., Bailey E., Binns M. M., et al. (2012). A high density SNP array for the domestic horse and extant Perissodactyla: utility for association mapping, genetic diversity, and phylogeny studies. PLoS Genet. 8:e1002451. 10.1371/journal.pgen.1002451 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. McLean C. Y., Reno P. L., Pollen A. A., Bassan A. I., Capellini T. D., Guenther C., et al. (2011). Human-specific loss of regulatory DNA and the evolution of human-specific traits. Nature 471, 216–219. 10.1038/nature09774 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Nord A. S., Blow M. J., Attanasio C., Akiyama J. A., Holt A., Hosseini R., et al. (2013). Rapid and pervasive changes in genome-wide enhancer usage during mammalian development. Cell 155, 1521–1531. 10.1016/j.cell.2013.11.033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. O'Geen H., Echipare L., Farnham P. J. (2011). Using ChIP-seq technology to generate high-resolution profiles of histone modifications. Methods Mol. Biol. 791, 265–286. 10.1007/978-1-61779-316-5_20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Penedo M. C. T., Millon L. V., Bernoco D., Bailey E., Binns M., Cholewinski G., et al. (2005). International equine gene mapping workshop report: a comprehensive linkage map constructed with data from new markers and by merging four mapping resources. Cytogenet. Genome Res. 111, 5–15. 10.1159/000085664 [DOI] [PubMed] [Google Scholar]
  41. Picard toolkit (2019). Broad Institute, GitHub Repos. Available online at: http://broadinstitute.github.io/picard/ (accessed July 8, 2020).
  42. Ramírez F., Ryan D. P., Grüning B., Bhardwaj V., Kilpert F., Richter A. S., et al. (2016). deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 44, W160–W165. 10.1093/nar/gkw257 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Raudsepp T., Gustafson-Seabury A., Durkin K., Wagner M. L., Goh G., Seabury C. M., et al. (2008). A 4,103 marker integrated physical and comparative map of the horse genome. Cytogenet. Genome Res. 122, 28–36. 10.1159/000151313 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Raudsepp T., Santani A., Wallner B., Kata S. R., Ren C., Zhang H., et al. (2004). A detailed physical map of the horse Y chromosome. Proc. Natl. Acad. Sci. U. S. A. 101, 9321–9326. 10.1073/pnas.0403011101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Rhie S. K., Guo Y., Tak Y. G., Yao L., Shen H., Coetzee G. A., et al. (2016). Identification of activated enhancers and linked transcription factors in breast, prostate, and kidney tumors by tracing enhancer networks using epigenetic traits. Epigenet. Chromatin 9, 1–17. 10.1186/s13072-016-0102-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Rieder S., Stricker C., Joerg H., Dummer R., Stranzinger G. (2000). A comparative genetic approach for the investigation of ageing grey horse melanoma. J. Anim. Breed. Genet. 117, 73–82. 10.1111/j.1439-0388.2000x.00245.x [DOI] [Google Scholar]
  47. Rieder S., Taourit S., Mariat D., Langlois B., Guérin G. (2001). Mutations in the agouti (ASIP), the extension (MCIR), and the brown (TYRP1) loci and their association to coat color phenotypes in horses (Equus caballus). Mamm. Genome 12, 450–455. 10.1007/s003350020017 [DOI] [PubMed] [Google Scholar]
  48. Rosengren Pielberg G., Golovko A., Sundström E., Curik I., Lennartsson J., Seltenhammer M. H., et al. (2008). A cis-acting regulatory mutation causes premature hair graying and susceptibility to melanoma in the horse. Nat. Genet. 40, 1004–1009. 10.1038/ng.185 [DOI] [PubMed] [Google Scholar]
  49. Schaefer R. J., Schubert M., Bailey E., Bannasch D. L., Barrey E., Bar-Gal G. K., et al. (2017). Developing a 670k genotyping array to tag ~2M SNPs across 24 horse breeds. BMC Genomics 18, 1–18. 10.1186/s12864-017-3943-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Schmidt D., Wilson M. D., Ballester B., Schwalie P. C., Brown G. D., Marshall A., et al. (2010). Five-vertebrate ChIP-seq reveals the evolutionary dynamics of transcription factor binding. Science 328, 1036–1040. 10.1126/science.1186176 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Shibata Y., Sheffield N. C., Fedrigo O., Babbitt C. C., Wortham M., Tewari A. K., et al. (2012). Extensive evolutionary changes in regulatory element activity during human origins are associated with altered gene expression and positive selection. PLoS Genet. 8:e1002789. 10.1371/journal.pgen.1002789 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. SICERpy . GitHub Repository. Available online at: https://github.com/dariober/SICERpy (accessed July 8, 2020).
  53. Swinburne J. E., Boursnell M., Hill G., Pettitt L., Allen T., Chowdhary B., et al. (2006). Single linkage group per chromosome genetic linkage map for the horse, based on two three-generation, full-sibling, crossbred horse reference families. Genomics 87, 1–29. 10.1016/j.ygeno.2005.09.001 [DOI] [PubMed] [Google Scholar]
  54. Tuggle C. K., Giuffra E., White S. N., Clarke L., Zhou H., Ross P. J., et al. (2016). GO-FAANG meeting: a gathering on functional annotation of animal genomes. Anim. Genet. 47, 528–533. 10.1111/age.12466 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Wade C. M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F., et al. (2009). Genome sequence, comparative analysis, and population genetics of the domestic horse. Science. 326, 865–867. 10.1126/science.1178158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Zang C., Schones D. E., Zeng C., Cui K., Zhao K., Peng W. (2009). A clustering approach for identification of enriched domains from histone modification ChIP-Seq data. Bioinformatics 25, 1952–1958. 10.1093/bioinformatics/btp340 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Zhang Y., Liu T., Meyer C. A., Eeckhoute J., Johnson D. S., Bernstein B. E., et al. (2008). Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9:R137. 10.1186/gb-2008-9-9-r137 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

Data were submitted to the European Nucleotide Archive following the best practices established by the FAANG Metadata and Data Sharing Committee and the FAANG Data Coordination Centre (Harrison et al., 2018). All of the new data referenced in the article were submitted under project ID PRJEB42315. The following files types were submitted for all high quality data: raw fastq for each mark and input from both replicates (38 files), processed BAM files for each mark and input from both replicates (34 files), bed files with peak calls per replicate including both SICERpy and MACS2 calls for H3K27me3 (32 files), and bed files with combined peak calls including those from both SICERpy and MACS2 for H3K27me3 (16 files). All files and metadata can be accessed from the FAANG Data Portal (https://data.faang.org/home). Previously published FAANG data used in the comparisons are also available from the FAANG data portal under project ID PRJEB35307 (Kingsley et al., 2020).


Articles from Frontiers in Genetics are provided here courtesy of Frontiers Media SA

RESOURCES