Skip to main content
[Preprint]. 2024 Nov 21:2024.05.28.596138. Originally published 2024 May 31. [Version 2] doi: 10.1101/2024.05.28.596138

Figure 3: Enhancers and promoters display differential motif complexity.

Figure 3:

(A) PCA embeddings of internal model representations of sequences at PRO-cap peaks overlapping promoters or distal enhancers. (B) Distributions of the number of identified motif instances in peaks overlapping either promoters or distal enhancers. (C) Fraction of PRO-cap peaks containing at least one instance of a motif, overall vs. in peaks overlapping either promoters or distal enhancers (* = p < 0.005, ** = p < 10−19, two-sided Fisher’s Exact test). (D) Identified motif instance strengths (cosine similarity to the motif CWM) across PRO-cap peaks overlapping promoters and distal enhancers (* = p < 10−2, ** = p < 10−4, two-sided Mann-Whitney U test). (E) Counts task predictions made on held-out PRO-cap peaks by ProCapNet (y-axis) vs. a re-trained version ProCapNet that only saw promoter sequences during training.