Skip to main content
. 2022 Aug 31;11:e76500. doi: 10.7554/eLife.76500

Figure 6. Machine learning prediction of AP-1 binding sites genome-wide.

(A–B) Area under ROC and P-R curves for gkm-SVM comparison of Fos peaks in the C57BL/6 J genome (positive set) and length-matched, random genomic regions (negative set). Shown are AUC values for different lengths of input DNA sequence, ranging from 10 to 300 bp, indicated in black. The same analysis was repeated after masking all instances of core AP-1 motifs (TGASTCA; n=4000 randomly selected loci), indicated in green. (C) Frequency of consensus, human DNase footprints (from Vierstra et al., 2020) containing an extended AP-1 k-mer (VTGACTCAB) at positions relative to DNase-seq summits (n=164,705 footprints). (D) Width of DNase footprints that contain an extended AP-1 k-mer (VTGACTCAB). (E) Number of additional DNase footprints within 100 bp of DNase-seq summits at DNase peaks with a VTGACTCAB-containing footprint. (F) Distance between VTGACTCAB-containing footprints and nearest neighboring DNase footprint.

Figure 6.

Figure 6—figure supplement 1. gkm-SVM analysis of Fos, Tead1, and CTCF peaks.

Figure 6—figure supplement 1.

(A–C) ROC and P-R curves for gkm-SVM comparison of Fos, Tead1, and CTCF peaks compared to randomly sampled regions in the C57BL/6 J genome. (D–G) Top enriched 10-mers from gkm-SVM comparing the central 60 bp from n=4,000 randomly selected Fos (with and without masking TGASTCA k-mers), Tead1, and CTCF peaks in C57BL/6 J MEFs (positive set) with n=4000 randomly sampled 60 bp windows across the C57BL/6 J genome (negative set). (H) Top enriched 10-mers from gkm-SVM comparing the central 60 bp from active (positive set) and inactive allele (negative set) at Fos peaks with significantly allele-specific H3K27ac levels. (I) Top enriched 10-mers from gkm-SVM comparing the central 60 bp from n=4000 DNase footprints containing VTGACTCAB k-mers (positive set) and n=4000 random gene-distal genomic windows in the human genome (hg38) centered on VTGACTCAB k-mers (negative set).