Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2018 Oct 17;13(10):e0205608. doi: 10.1371/journal.pone.0205608

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication.

PMC Copyright notice

Fig 1 — A) The computational pipeline used to identify the most enriched motifs in the DNase-accessible CGIs from K562 cells. Bedtools was used to identify CGIs that overlap with ENCODE derived DNase-sensitive peaks. Homer was then used to identify the most enriched DNA sequence motifs in the DNase-accessible CGIs. B) Known transcription factor binding sites in the enriched motif list. C) Heatmaps showing vertebrate conservation and DNase-seq profiles for motifs #7 and #10 including their flanking 50 bp. PhyloP scores represent conservation of individual base-pairs throughout vertebrate genomes. Positive scores in PhyloP heatmaps, shown by red colors, indicate high sequence conservation whereas negative scores (blue colors) indicate acceleration of base pairs. PhastCons heatmaps show probability scores (ranging from 0 to 1) of conserved DNA elements. DNase-seq heatmaps of K562 and HeLa cells indicate DNase accessibility. Low accessibility, as indicated in dark colors, identifies a central DNase-seq footprint associated with motif #10 in both cell lines. Both motifs occur in DNase-sensitive CGIs of many different cell lines, but only motif #10 was consistently associated with a DNase-seq footprint (Fig A in S1 Fig). D) Motif co-occurrence odds-ratio matrix in DNase-sensitive CGIs. The odds-ratio is the value of observed-to-expected coincidence if motifs were distributed by chance. Higher values indicate a higher likelihood of co-occurrence of indicated motifs in the matrix. E) Pie charts indicate frequency of the motif copy number in human CGIs. F) Metagene profiles for all human CGIs, motifs #7 and #10 were generated by the Homer annotated script, showing that both motifs are associated with TSSs. G) Homer annotated script was used to generate genomic annotation enrichment scores to identify genomic regions where these motifs are enriched. The data indicate that motif #7 and #10 are significantly enriched in promoters and CGIs. The enrichment values were calculated using the cumulative hypergeometric distribution method.