Properties of emSeqs in the MPRA
(A) Histogram distribution of emSeq regulatory activity (log2(FC)) in six EBV B cell replicates compared with four plasmid controls. Positive values represent increased regulatory activity and negative values represent decreased activity in EBV B cells relative to plasmid controls. Oligo count is plotted on the y axis.
(B) Volcano plot of emSeq effect sizes (−log10(padj) from DESeq2) in EBV B cells relative to controls. Horizontal red line represents padj ≤ 0.05; vertical red lines (log2(FC) ± 0.58) represent a 1.5 FC difference between the EBV B replicates and plasmid controls.
(C) Proportion of emSeqs within each variant type. Significant differences between the proportion of emSeqs within hQTLs and the other variant types are shown: ∗∗∗∗chi-square p < 0.0001.
(D) Boxplots of emSeq effect sizes (log2(FC)) for each variant type. The x axis is sorted in descending order by mean log2(FC) of the variant types. Significant differences in the means of hQTL effect sizes compared with the other variant types are shown: ∗t test p < 0.05; ∗∗p < 0.01, ∗∗∗p < 0.001, ∗∗∗∗p <0.0001.
(E and F) Significant TFs enriched in all emSeqs (E) and hQTL emSeqs (F). TF rank and HOMER −log10(p) are plotted. FDR≤0.05 effects are highlighted in red. Top TFs are indicated.