Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Jun 18.
Published in final edited form as: Mol Cell. 2015 May 28;58(6):1101–1112. doi: 10.1016/j.molcel.2015.04.006

Upstream Anti-sense Promoters are Hubs of Transcription Factor Binding and Active Histone Modifications

Benjamin S Scruggs 1,, Daniel A Gilchrist 1,*,, Sergei Nechaev 1,#, Ginger W Muse 1, Adam Burkholder 2, David C Fargo 2, Karen Adelman 1
PMCID: PMC4475495  NIHMSID: NIHMS678991  PMID: 26028540

SUMMARY

Anti-sense transcription originating upstream of mammalian protein-coding genes is a well-documented phenomenon, but remarkably little is known about the regulation or function of anti-sense promoters and the non-coding RNAs they generate. Here we define at nucleotide resolution the divergent transcription start sites (TSSs) near mouse mRNA genes. We find that coupled sense and anti-sense TSSs precisely define the boundaries of a nucleosome-depleted region (NDR) that is highly enriched in transcription factor (TF) motifs. Notably, as the distance between sense and anti-sense TSSs increases, so does the size of the NDR, the level of signal-dependent TF binding and gene activation. We further discover a group of anti-sense TSSs in macrophages with an enhancer-like chromatin signature. Interestingly, this signature identifies divergent promoters that are activated during immune challenge. We propose that anti-sense promoters serve as platforms for TF binding and establishment of active chromatin to further regulate or enhance sense-strand mRNA expression.

INTRODUCTION

How cells achieve the appropriate responses to intrinsic and extrinsic signals remains a central question in biology. Changes in gene expression constitute a major facet of cellular responses to all types of stimuli, and one of the great accomplishments of molecular biology has been the elucidation of core promoter sequences around which the transcription machinery is organized (Butler and Kadonaga, 2002). Structural and biochemical studies have helped establish basic rules regarding spatial arrangements of DNA sequence elements, complexes such as the General Transcription Factors and Mediator that facilitate RNA Polymerase II (Pol II) binding, and the role of chromatin in defining the promoter environment (Conaway and Conaway, 2013; Roeder, 2005). However, advances in genomic technology have revealed that far more of the mammalian genome is transcribed than previously anticipated, including intergenic regions upstream of protein-coding genes and distal enhancers (Carninci et al., 2005; Cheng et al., 2005). Thus, there is much still to learn about the logic underlying the transcriptional repertoire of mammalian cells.

Notably, recent studies have revealed that Pol II transcribes in both sense and anti-sense directions near many mRNA genes (Core et al., 2008; Preker et al., 2008; Seila et al., 2008). At such bidirectional promoters, Pol II has been shown to initiate transcription and undergo promoter-proximal pausing at both the protein-coding, sense TSS and from within the upstream region in the anti-sense direction (Core et al., 2008; Duttke et al., 2015; Flynn et al., 2011). Such divergent transcription is particularly widespread at mammalian promoters, which typically lack key core promoter elements such at the TATA motif and are thought to be information-poor due to their richness in CpG dinucleotides. This enigmatic, dual promoter structure raises fundamental questions about the specificity, purpose and regulation of such non-coding upstream anti-sense transcription.

Whereas transcription on the sense strand leads to processive elongation of mRNAs that can extend tens of kilobases (kb) in length, transcription in the anti-sense direction is much less processive and typically results in unstable non-coding RNA (ncRNA) species of limited length (<2 kb, Flynn et al., 2011; Ntini et al., 2013; Preker et al., 2008; Sigova et al., 2013). An elegant explanation for the differential processivity arose from studies of DNA sequence surrounding bidirectional promoters. Sequences recognized by the U1 small nuclear ribonucleoprotein (U1 snRNP), which promotes efficient RNA splicing and processive Pol II elongation (Berg et al., 2012), are enriched downstream of sense TSSs. In contrast, poly-A sequences (PAS) that trigger termination of transcription and RNA poly-adenylation (Proudfoot, 2011) are much more frequent downstream of anti-sense TSSs (Almada et al., 2013; Andersen et al., 2012; Ntini et al., 2013). Thus, a “U1-PAS axis” is proposed to drive the differences in Pol II processivity in the sense and anti-sense directions.

Intriguingly, many distal enhancer regions also display bidirectional transcription initiation and generate short, unstable transcripts (De Santa et al., 2010; Kim et al., 2010), raising parallels between enhancers and anti-sense promoters near protein-coding genes (Core et al., 2014). Enhancers are sites of active chromatin modifications and high transcription factor (TF) occupancy that are suggested to boost mRNA output by looping towards their target gene and increasing the local concentration of TFs and other transcription stimulatory complexes (Calo and Wysocka, 2013).

To address what function(s), if any, are served by the non-coding anti-sense transcription arising upstream of protein-coding genes, we precisely annotated both sense and anti-sense TSSs in murine macrophages. We found that bidirectional transcription is widespread and arises from two discrete, focused TSSs that are adjacent to distinct peaks of TF binding sites and TF occupancy. We present evidence that the distance between divergent TSSs impacts promoter structure: mRNA promoters with distant anti-sense TSSs have larger nucleosome-depleted regions, more accessible TF motifs, elevated TF binding, and greater transcription activation. Further, we discovered a subset of anti-sense TSSs with characteristics of enhancers such as enrichment in histone H3 mono-methylated at K4 (H3K4me1), H3 acetylated at K27 (H3K27ac) and p300 occupancy. These TSSs are also highly bound by PU.1, a master regulator of the macrophage lineage implicated in the establishment of tissue-specific enhancers. Notably, divergent promoters bearing this signature around the anti-sense TSS are highly active and strongly inducible by immune challenge, suggesting that factors associated with anti-sense TSSs might modulate mRNA expression.

RESULTS

Start-seq Precisely Identifies Transcription Start Sites

To elucidate the regulation and function of bidirectional transcription near protein-coding genes, we first obtained a high-resolution view of Pol II activity across the mouse genome using a recently described strategy for isolation of TSS-associated RNAs (Henriques et al., 2013; Nechaev et al., 2010). High-throughput sequencing of these nascent 25–65 nucleotide (nt) capped RNA species from the 5′-end (which we refer to as Start-seq) allows for precise definition of TSSs at nucleotide resolution. This method has previously enabled characterization of TSS-proximal sequence elements and promoter structure in Drosophila (Gilchrist et al., 2010; Nechaev et al., 2010), but such studies have not yet been carried out in mammals.

We sequenced start site-associated RNAs (or Start-RNAs) from murine bone marrow-derived macrophages that were unstimulated or challenged with bacterial lipopolysaccharide (LPS), to capture TSSs that were basally active as well as those induced by immune stimulation. Roughly half of RefSeq genes examined showed significant Start-RNA reads within a 2 kilobase (kb) window around their promoter, indicative of Pol II initiation and gene activity (Figure S1A; 12,228 of 23,747 RefSeq genes). In many cases, the peak of Start-RNA reads on the sense strand was in perfect agreement with RefSeq annotation (Figure 1A), but Start-RNA peaks could also be offset from the annotated TSS (Figure 1B). On a global scale, Start-RNA reads were diffusely distributed around RefSeq TSSs (Figure 1C, left panel, Figure 1D and Figure S1A), suggesting several possibilities: i) the use of dispersed TSSs by Pol II, or ii) modest discrepancies between start site usage in macrophages and the current mouse genomic annotation.

Figure 1. High-Resolution Mapping of TSSs Reveals Focused Transcription Initiation in Sense and Anti-sense Directions.

Figure 1

(A and B) Pol II ChIP-seq data (red) shows peaks of Pol II at the sense (green arrow) and anti-sense (purple arrow) TSSs. Start-seq data is shown for the Sense strand (green) and Anti-sense strand (purple). RefSeq gene models are shown in black with TSS depicted by arrows. In (A) the RefSeq TSS for Socs6 (black arrow) is aligned exactly with the observed, sense-strand TSS identified by Start-seq data. In (B) the observed sense-strand TSS is slightly offset (38 nt) from RefSeq annotation for the Btg2 gene.

(C) Distribution of Start-seq data on the sense strand, aligned around RefSeq annotated TSSs (left, black arrow) or the peak of sense-strand Start-RNA data in this region (right, green arrow), which we term the observed sense TSSs. Color bar at bottom indicates read depth. N=12,228 genes defined as active in mouse macrophages.

(D) Average distribution of Start-seq reads at single nucleotide resolution in a 21 nt window centered on either RefSeq (black) or sense TSSs (red). Data are for genes shown in (C).

(E) Information content in a ± 5 nt window around RefSeq and sense TSSs determined by WebLogo. Position 0 indicates the RefSeq (top) or sense TSS (bottom).

(F) Sense (green) and Anti-sense (purple) strand Start-RNA reads centered around sense TSSs. Note focused initiation around sense TSSs (right) with dispersed anti-sense signal (left).

(G) Sense (green) and Anti-sense (purple) strand Start-RNA reads centered around Anti-sense TSSs (purple arrows) reveal generally focused anti-sense initiation from the 9,219 promoters possessing sufficient Start-seq data for identification of sense and anti-sense TSSs.

(H) Information content around anti-sense TSSs as in (E).

See also Figure S1.

To distinguish between these possibilities, the peaks of Start-RNA data were used to define ‘observed’ TSSs in macrophages. When Start-RNA reads were aligned and graphed with respect to these observed sense-strand TSSs, it revealed a tightening of read distribution (Figure 1C, right panel and Figure 1D) and a clear increase in the information content present near observed sense TSSs as compared to RefSeq annotation (Figure 1E). Further, de novo motif discovery identified sequences resembling the Inr and TATA motif (Butler and Kadonaga, 2002) that were enriched around observed TSSs (Figure S1B). Indeed, motif analysis confirmed a greater occurrence of both Inr and TATA elements at observed vs. RefSeq TSSs (Figure S1C). These data demonstrate the power of Start-seq for defining TSSs with high confidence and bringing promoter regulatory features into focus.

Interestingly, Start-RNAs on the anti-sense strand displayed a broad distribution when aligned against the observed sense TSSs (Figure 1F, left panel), indicating that anti-sense transcription does not initiate at a set distance from sense TSSs. However, anti-sense Start-RNAs frequently exhibited focused peaks at individual loci (Figures 1A and 1B). We thus used our Start-RNA data sets to define the position of anti-sense TSSs occurring within 1 kb upstream of the sense TSS. We were able to clearly define anti-sense TSSs at >75% of active promoters (9,219 of 12,228). Importantly, we note that TSSs with levels of anti-sense transcription that fall below the stringent threshold employed for defining an anti-sense TSS location are not necessarily devoid of anti-sense transcription activity.

Alignment of anti-sense Start-RNAs around these anti-sense TSSs (Figure 1G, left panel) revealed generally focused transcription initiation. To probe the factors underlying this specificity, we examined DNA sequence in the region immediately surrounding anti-sense TSSs. This analysis revealed strong information content at the anti-sense TSSs (Figure 1H) and a frequency of Inr motif occurrence (27%) that was nearly identical to that observed at sense TSSs (28%; Figure S1C).

Together, these data indicate that Pol II initiates transcription specifically and divergently near the majority of active genes in macrophages. However, across different genes, the location of non-coding anti-sense initiation is set at a variable distance upstream of the coding sense TSS location. The median distance between sense and anti-sense TSSs is 176 base pairs (bp), but a considerable fraction of anti-sense TSSs are more distal, with many genes displaying sense/anti-sense distances of greater than 200 bp (Figure S1D).

Sense and Anti-sense TSSs Represent A Coupled Promoter Unit

Although paired sense and anti-sense TSSs flank the same regulatory information, it is not clear whether promoter sequences regulate bidirectional transcription in unison, or if Pol II activity in each direction could show distinct, uncoupled regimes. To investigate this, we examined the transcriptional profile of bidirectional promoters in resting and activated macrophages. RNA Pol II ChIP-seq and Start-RNA-seq were performed on macrophages challenged with LPS for 0, 30 or 120 minutes (referred to hereafter as: untreated, 30′ LPS and 120′ LPS). Protein-coding genes that were transcriptionally activated by LPS were identified by increases in Pol II ChIP-seq signal within the gene body at each time point as compared to untreated macrophages (Figure S2A). In agreement with earlier work (Amit et al., 2009; Bhatt et al., 2012; Escoubet-Lozach et al., 2011; Ramirez-Carrozzi et al., 2006), genes could be readily classified as ‘Early’, where Pol II levels increased across the gene by 30′ (Figure 2A), ‘Late’, where increased Pol II elongation was detected only at 120′ after LPS treatment (Figure 2B), or ‘Control’, where Pol II signal did not increase within the gene at either time point (Figure S2B). These gene classes were validated by analysis of published RNA-seq data, wherein chromatin-associated RNA species that had elongated to >200 nt in length were isolated over a time course of bacterial challenge in macrophages (Bhatt et al., 2012). Indeed, elongated transcripts were evident at Early genes at the 30′ time point, whereas Late genes underwent transcription following 120′ of immune challenge (Figure S2C).

Figure 2. Gene Activation Leads to Coupled Bidirectional Transcription Initiation, but Pol II only Elongates Effectively in the Sense Direction.

Figure 2

(A and B) Pol II ChIP-seq (red), sense Start-RNA (green), and anti-sense Start-RNA (purple) data are shown at representative Early (A) and Late (B) response genes in untreated macrophages or following 30′ or 120′ LPS challenge. Gene models are shown in black with observed sense TSSs depicted by arrows.

(C) Distribution of Pearson correlation coefficients for Start-RNAs levels at sense TSSs paired with adjacent anti-sense TSSs (orange) is significantly higher than when paired randomly with other anti-sense TSSs.

(D) Fold change in Start-RNA levels in the sense (S) or anti-sense (A–S) TSS region (TSS ±100 bp) was calculated for Early, Late, and Control genes. Fold-changes were calculated between untreated and 30′ LPS-treated macrophages (left), and 30′ and 120′ LPS-treated samples (right).

(E) Distribution of Pearson correlation coefficients for fold change in Start-RNA levels between untreated and 30′ LPS-treated macrophages. Bidirectional sense TSSs were paired as in (C).

(F) Fold change in elongated nascent RNA levels in the sense (S) or anti-sense (A–S) TSS region (TSS to +250 bp downstream) was calculated for Early, Late, and Control genes, and shown as in (D).

See also Figure S2.

In general, the number of Start-RNAs from the sense TSS was considerably higher than at the coupled anti-sense TSS (e.g. Figures 2A and 2B). However, Early and Late genes often showed concomitant increases in sense and anti-sense Start-RNAs upon gene activation, consistent with earlier indications of coordinated transcription in the two directions (Sigova et al., 2013). For example, both sense and anti-sense Start-RNAs peaked after 30′ of LPS treatment at the Early gene Cxcl1, and diminished at 120′ as Pol II levels subsided (Figure 2A). Likewise, the Late gene Socs1 showed maximal levels of Start-RNAs in both directions after 120′ of LPS challenge.

To globally evaluate the degree of coordination between sense and anti-sense transcription we first analyzed the levels of Start-RNAs from all active genes in resting macrophages. We calculated Pearson correlation coefficients between Start-RNA reads from coupled sense and anti-sense promoters, as compared to reads from randomly paired sense and anti-sense TSSs (see Extended Experimental Procedures). This analysis revealed stronger correspondence between Start-RNAs at coupled bidirectional TSSs than between TSSs paired at random (Figure 2C, Bidirectional pairs are ‘right-shifted’ towards higher correlation coefficients).

We next determined how broadly anti-sense transcription initiation increased along with sense transcription upon LPS challenge, by calculating the fold change in Start-RNAs at Early, Late and Control genes (Figure 2D). As expected, sense-strand Start-RNA levels increased at Early genes following 30′ LPS challenge (Figure 2D, left panel). Notably, anti-sense Start-RNA reads were similarly elevated. Start-RNA levels also increased equivalently in both directions at Late genes after 120′ LPS treatment (Figure 2D, right panel). Accordingly, the fold change in Start-RNAs following 30′ LPS challenge revealed a very strong correlation when evaluating coupled sense/anti-sense TSS pairs (Figure 2E). By comparison, there was little correspondence between fold changes in Start-RNA levels at randomly paired sense/anti-sense TSSs. Thus, activation of protein-coding genes stimulates coordinated transcription initiation and formation of short (25–65 nt) Start-RNAs at coupled sense and anti-sense TSSs.

To determine whether the burst in anti-sense transcription initiation upon LPS challenge led to generation of long ncRNAs, we evaluated data measuring longer nascent RNA species (Bhatt et al., 2012). As anticipated, there was good correspondence between levels of sense-strand Start-RNAs and elongated RNAs observed in resting macrophages (Figure S2D) and a clear increase in elongated RNA reads was seen on the sense strand for both Early and Late genes at the expected time points (Figure 2F). However, there was a strong bias towards elongated RNA signal in the sense direction, with significantly less long anti-sense RNA produced at any time point (Figure 2F). Thus, even under conditions where transcription initiation and short RNA synthesis is activated similarly in both directions, the processivity of Pol II in the anti-sense direction is markedly lower, with many transcripts failing to reach ~200 nt in length.

Coupled TSSs Define a Nucleosome-Depleted Region of Variable Size

To obtain a clearer view of the regulatory landscape surrounding sense/anti-sense promoter pairs, we ranked promoters by increasing distance between sense and anti-sense TSSs (Figure 3A). Heat maps of data from resting macrophages showed that Pol II and TBP ChIP-seq signal tracked with sense/anti-sense TSS locations (Figure 3B), indicating that two distinct transcription complexes can be located at the divergent TSSs, as shown previously for yeast (Rhee and Pugh, 2012). MNase-seq experiments showed highly positioned nucleosomes located just downstream of Pol II in the sense direction, with regularly spaced nucleosomes extending further into the gene (Figures 3B and 3C, right panel, black lines). Remarkably, precise definition of anti-sense TSSs revealed equally prominent positioned nucleosomes in the anti-sense direction (Figure 3B and 3C, left panel, black lines), in a pattern that mirrored the distribution of nucleosomes in the coding region. Thus, we find a symmetric distribution of nucleosomes on either side of the coupled TSS pair.

Figure 3. Transcription Machinery and Nucleosomes Are Highly Organized around Both Sense and Anti-sense TSSs.

Figure 3

(A) Start-RNA reads obtained from macrophages are shown in the anti-sense (purple) and sense (green) direction, both aligned with respect to sense TSSs. Genes are rank ordered by the distance between the sense and anti-sense TSSs. Data shown throughout this figure are for 8,730 Pol II-bound promoters with sufficient Start-seq reads for identification of sense and anti-sense TSSs in resting macrophages.

(B) Pol II and TBP ChIP-seq, MNase-seq and CpG dinucleotide count are shown, centered on sense TSSs, with genes rank ordered as in (A).

(C) Average distribution of nucleosomes (from MNase-seq, black) and CpG dinucleotide count (red) centered on either the anti-sense (left) or sense (right) TSSs.

(D) CpG island distribution and MNase-seq are shown for bidirectional genes categorized as CpG+ (N=6,464) or CpG− (N=2,266), ranked by increasing distance between sense and anti-sense TSSs.

(E) Conservation score across placental mammals (phyloP) is shown in 2 kb windows centered on anti-sense or sense TSSs as shown in (C). Dashed line indicates average conservation score across the mouse genome.

(F) Number of poly-A sequences (PAS; black) or motifs recognized by U1 snRNP (U1; red) in 50-mer bins centered upon either the anti-sense TSS (left) or sense TSS (right).

See also Figure S3.

The area between the bidirectional TSSs was largely devoid of MNase-seq reads, with the divergent sense and anti-sense TSS pairs bracketing an accessible nucleosome-depleted region. To confirm this, we examined FAIRE-seq data, which provides an independent measure of relative DNA accessibility (Ostuni et al., 2013). FAIRE-seq signal from macrophages was in concordance with MNase-seq data (Figure S3A) and indicated that the greater the distance between sense and anti-sense TSSs, the larger the intervening region of accessible DNA.

CpG-islands can influence the formation of nucleosome-depleted promoter regions (Fenouil et al., 2012), and thus might be involved in establishing variably sized NDRs. Interestingly, in the sense direction, CpG content showed little relationship to TSSs location or nucleosome positioning (Figure 3B and 3C, red lines, right panel). This result held true whether considering CpG density (Figures 3B and 3C) or CpG island designations (Figure S3A): high CpG content typically encompassed both the NDR and highly positioned downstream nucleosomes. Thus, the relationship between CpG content and nucleosome occupancy in the sense direction remains unclear.

Notably, the upstream edge of CpG-richness broadly coincided with anti-sense TSSs (Figure 3C, left panel) and the first positioned nucleosome in the anti-sense direction is found in a region of rapidly declining CpG-frequency (Figure 3C, left panel). To investigate a potential role for CpG content in creating NDRs of different size, we separated bidirectional promoters that intersected a CpG island (CpG+, 74%) from those that didn’t (CpG−). Comparison of the distances between sense/anti-sense TSSs at CpG+ and CpG− genes indicated a modest difference in the median width (179 bp CpG+ vs. 161 bp CpG−; Figure S3B), suggesting that CpG islands, or the factors that bind to them, may influence the upstream reach of the NDR.

However, CpG islands are not necessary to create a wide, defined region of nucleosome deprivation upstream of the sense TSS. Heat maps of MNase-seq data, rank ordered by increasing distance between sense/anti-sense TSSs showed similar nucleosome profiles for CpG+ and CpG− bidirectional genes (Figures 3D). At both gene groups, NDRs were considerably larger at genes with more distal anti-sense TSSs and a well-positioned nucleosome was evident immediately adjacent to the anti-sense TSS. Moreover, CpG islands were not sufficient to generate an extended NDR: examination of the nucleosome profile at CpG+ promoters revealed weak agreement between the upstream edge of CpG islands and the NDR (Figures S3C and S3D). Thus, CpG content may facilitate nucleosome depletion around promoters, but the upstream boundary of the NDR at bidirectional genes corresponds to the position of anti-sense transcription initiation and not the CpG island.

Sequence Conservation Within Bidirectional Promoters

To elucidate the role of DNA sequence in defining anti-sense TSS location, we calculated sequence conservation in the promoter region among placental mammals. As anticipated, sequences immediately adjacent to sense TSSs showed high conservation that extended downstream into the gene, presumably reflecting evolutionary pressure to maintain mRNA promoters and coding regions (Figure 3E, right panel). Examining conservation around anti-sense TSSs revealed high conservation scores around the anti-sense promoter and upstream region, that dropped precipitously just past the TSSs (Figure 3E, left panel). Interestingly, this indicates that conservation is low across the region where an anti-sense transcript is encoded. Thus, strong sequence conservation extends upstream from the protein-coding gene to the anti-sense TSS, but does not reach into to the ncRNA.

A prediction of DNA sequence-driven TSS selection is general conservation of sense/anti-sense TSS locations across cell types. To test this, we generated Start-seq data from mouse embryonic fibroblasts (MEFs) of the same genotype. Alignment of Start-seq data obtained from MEFs against the TSSs identified in macrophages showed strong agreement in positioning of both sense and anti-sense TSSs (Figure S3E), indicating that bidirectional start sites defined in one cell type were often used in cells of different origin. To further define the specificity of anti-sense transcription initiation, we investigated the distribution of Start-RNA reads around sense and anti-sense TSSs. Surprisingly, we found similarly precise transcription initiation around sense TSSs of different classes (unidirectional vs. bidirectional, CpG+ vs. CpG−), with ~40% of sense TSSs in each class showing strongly focused initiation (Figure S3F). Moreover, the level of focus at anti-sense TSSs was comparable to sense TSSs (33% of TSSs classified as focused), consistent with DNA sequence directing anti-sense transcription to initiate at a particular location (Duttke et al., 2015).

Specific sequence motifs showed striking asymmetry downstream of the sense and anti-sense TSSs. As described above, binding of the U1 snRNP promotes Pol II processivity and splicing of mRNA transcripts (Berg et al., 2012), and motifs for U1 association are known to be enriched downstream of sense-strand TSSs. Our motif analysis concurs with these findings and reveals a strong, sharp peak of U1 motifs centered ~55 nt downstream of sense TSSs (Figure 3F, red lines), coinciding precisely with the region where Pol II makes the transition to productive elongation. This is in clear contrast to the distribution of poly-A sequences (PAS) which are important for transcription termination and are depleted downstream of sense TSSs (Almada et al., 2013; Ntini et al., 2013). In the anti-sense direction, the opposite pattern is observed: there is a marked increase in PAS just downstream of anti-sense TSSs (Figure 3F, left panel), and a dearth of U1 motifs. We note that the enrichment of PAS motifs downstream of anti-sense TSSs is consistent with the very limited processivity of anti-sense transcription elongation described above when comparing Start-seq to elongated RNA-seq data (Figure 2D vs. 2F).

Transcription Factor Binding Motifs are Highly Enriched Near both Sense and Anti-sense TSSs

To examine whether sequence conservation between bidirectional promoters could in part reflect TF binding motifs, we determined the frequency of core vertebrate motifs around this region. Heat maps of motif occurrence revealed enrichment in consensus motif frequency that encompassed the region between sense and anti-sense TSSs for both CpG+ and CpG− bidirectional promoters (Figures 4A and S4A, respectively). Notably, the region of focused TF motif enrichment was substantially larger at genes with distant anti-sense TSSs, and there was a striking overlap in the locations of TF motif enrichment and the region of nucleosome deprivation. This suggests that the binding of TFs could contribute to establishment and/or maintenance of an enlarged NDR, consistent with recent work. Likewise, vertebrate motifs were enriched within the smaller region of nucleosome depletion observed upstream of unidirectional TSSs (Figures 4A and S4A).

Figure 4. TF Motifs and Binding are Focused within NDR Between Bidirectional TSSs.

Figure 4

(A) Heat maps depict MNase-seq, locations of a consolidated panel of vertebrate TF binding motifs and PU.1 ChIP-seq in a 2 kb window centered on sense TSSs (green arrow). Active CpG+ genes are categorized as bidirectional (N=6,464) or unidirectional (N=1,585).

(B) Composite distribution of a consolidated panel of 131 non-redundant vertebrate TF binding motifs (black) and PU.1 motifs (red) in 50-mer bins for bidirectional genes, centered on either the anti-sense (left) or sense (right) TSS.

(C) Composite metagene distribution of PU.1 motifs centered on either the anti-sense (left) or sense (right) TSSs at all unidirectional (black) or bidirectional (orange) genes.

(D) Average distribution of PU.1 ChIP-seq reads in 50-mer bins is shown in 2 kb windows centered on anti-sense or sense TSSs as shown in (B).

(E) Number of PU.1 motifs at CpG+ genes that are unidirectional, or bidirectional with the smallest (blue) or largest (red) sense/anti-sense distances. Motifs were summed in promoter-proximal and promoter-distal windows as described in the text. N=1,585 genes in each group.

(F) PU.1 ChIP-seq reads within each window were summed and normalized to the number of PU.1 motifs present.

See also Figure S4.

As expected, there was a peak of vertebrate TF motifs upstream of sense TSSs that was also evident when focusing on individual TFs, such as PU.1 (Figures 4B and 4C, right panel). PU.1 is a master regulator required for establishing macrophage identity (Scott et al., 1994) that is known to occupy thousands of enhancers and promoters in resting macrophages (Kaikkonen et al., 2013; Ostuni et al., 2013). Accordingly, PU.1 ChIP-seq signal showed enrichment at promoters (Figures 4A and 4D, right panel), including at anti-sense TSSs (Figure 4D, left panel).

To determine whether focused PU.1 occupancy near anti-sense promoters was driven by consensus motifs, we aligned data for motif occurrence to anti-sense TSSs. We found that PU.1 motifs displayed a sharp peak in enrichment directly upstream of anti-sense TSSs (Figure 4B, left panel). Moreover, vertebrate motifs in general showed strong enrichment immediately adjacent to anti-sense promoters (Figure 4B, left panel). This suggests that TFs bound to these clustered motifs could help specify the location of anti-sense transcription initiation by directing recruitment of the transcription machinery.

Enhanced TF Binding at Bidirectional Promoters

We noted that whereas PU.1 motifs are distributed around unidirectional promoters, PU.1 ChIP signal is restricted to the region immediately adjacent to the TSSs (Figures 4C and 4D, right panels). By contrast, there was closer agreement between PU.1 motif localization and PU.1 ChIP-seq signal at bidirectional genes. This suggested that focused enrichment of TF motifs at sense and anti-sense TSSs might facilitate PU.1 binding by enabling collaboration between TFs. Indeed, PU.1 is known to access cognate motifs at enhancers through cooperation with partner TFs that work together to increase chromatin accessibility (Heinz et al., 2010).

To probe whether the clustered TF motifs observed near bidirectional TSSs might provide a binding advantage, we first determined the number of consensus PU.1 motifs present upstream of unidirectional and bidirectional promoters, separating the latter into those with the smallest and largest sense/anti-sense distances. Motif frequencies were counted in promoter-proximal regions (from the TSS to 176 bp upstream, with 176 bp being the most common sense/anti-sense distance detected, Figure S1D) and promoter-distal regions (from −1kb to −177 bp upstream of sense TSSs). We found similar PU.1 motif occurrence for each group promoter-proximally, and unidirectional promoters displayed slightly more PU.1 motifs promoter-distally (Figure 4C).

The number of consensus motifs in each window was then used to normalize PU.1 ChIP-seq signal from resting macrophages over each region, to give ChIP signal intensity per motif. This analysis revealed higher PU.1 ChIP-seq signal per motif at bidirectional than unidirectional genes (Figures 4D, left and S4C). This was particularly striking for TSS-distal PU.1 motifs located upstream of genes with the largest sense/anti-sense distances, where the anti-sense TSS and NDR extend into this region (Figure 4D, right). This implies that aspects of bidirectional promoters can facilitate TF binding to cognate motifs located at a distance from sense promoters and suggests that anti-sense TSSs serve as an additional ‘hub’ for TF recruitment. We note that collaboration between TFs could enhance TF occupancy both directly through cooperative interactions, and indirectly through the generation of an accessible chromatin domain (Heinz et al., 2010).

Variably-sized NDRs in Resting Macrophages Impact Signal-dependent Transcription Factor Recruitment and Gene Activation

The observation of variably sized NDRs upstream of bidirectional TSSs in resting macrophages suggested the intriguing possibility that larger NDRs could contain more accessible binding sites for stimulus-dependent TFs like NF-κB. Notably, NF-κB is unable to bind cognate motifs when they are occluded by nucleosomes and thus a majority of rapid, signal-dependent NF-κB binding events occur within regions that were nucleosome deprived prior to immune challenge (Saccani et al., 2001). To probe this possibility, we first calculated the distance between sense and anti-sense TSSs for Early response, Late response and Control genes in macrophages, using this distance as a proxy for the size of the NDR. Surprisingly, despite Early response genes having fewer CpG+ promoters than the other gene classes (Figure S5A), Early genes displayed significantly wider sense/anti-sense distances than Late or Control genes (Figure 5A; Early gene median width=220 bp, Late median width=184 bp). This was true for both CpG+ and CpG− Early genes, which had very similar median sense/anti-sense distances (Figures S5B–S5C).

Figure 5. Increased NF-κB binding and Gene Output at Promoters with Distant Anti-sense TSSs.

Figure 5

(A) Sense/anti-sense TSS distances for Early response, Late response, and Control gene groups. Median width of Early promoters (220 bp) is significantly greater than that of Late and Control promoters.

(B) MNase-seq data shown at Early genes categorized as bidirectional or unidirectional (N=113). Bidirectional genes are ranked by increasing distance between sense/anti-sense TSSs. Genes quartiles with smallest and largest sense and anti-sense distances are indicated at right (N=236 for each).

(C) Average NF-κB (left) ChIP-seq signal in stimulated macrophages. Data is shown for Early unidirectional genes as well as bidirectional genes with the smallest vs. largest inter-TSS distances as defined in (B).

(D) Elongated nascent RNA reads (Bhatt et al., 2012) are shown for Early response genes in untreated macrophages and macrophages challenged for 15 or 30 minutes with LPS. Data is shown for gene groups defined in (B).

See also Figure S5.

Heat maps of MNase-seq profiles around Early promoters verified nucleosome deprivation between bidirectional TSSs and enlarged NDRs at many Early genes (Figure 5B). To investigate whether an extension of the NDR upstream at bidirectional promoters could expose underlying NF-κB motifs, we calculated the number of NF-κB motifs at Early genes using two different windows: the genomic median sense/anti-sense distance (TSS to −176 bp) or the experimentally observed sense/anti-sense distance at each Early gene. Strikingly, using the position of the anti-sense TSS as a measure of accessible promoter size revealed a 1.6-fold increase in NF-κB motifs detected upstream of Early genes (Figure S5D) vs. the median promoter-proximal window.

To evaluate signal-dependent NF-κB binding as a function of NDR size, we stratified Early genes with the smallest sense/anti-sense distances (Figure 5B, denoted at right in blue) and those with the largest distances (denoted in red). Motif analysis indicated that Early genes with the most distant TSSs had considerably more NF-κB consensus elements detected within their NDRs than genes with the smallest sense/anti-sense distances, and more genes with multiple NF-κB motifs (Figure S5E). We then examined ChIP-seq signals for NF-κB at each group of genes shortly after stimulation of macrophages (Kaikkonen et al., 2013) and compared the signal at Early genes transcribed unidirectionally. Both groups of bidirectional genes exhibited higher NF-κB ChIP-seq signals than unidirectional genes, and NF-κB signal spread farther upstream at genes with the largest sense/anti-sense distances (Figure 5C, red line). This is consistent with a larger region of accessible DNA in resting cells exposing more NF-κB motifs for rapid occupancy following LPS exposure (Figure S5E).

To test whether the observed differences in NF-κB ChIP-seq signal among Early genes resulted in differences in gene output, we used elongated RNA-seq data from macrophages that were untreated or challenged with LPS (Bhatt et al., 2012). A strong, rapid increase in elongated RNAs was detected at Early genes with the largest sense/anti-sense distances, while those with closer anti-sense TSSs showed more modest increases in RNA production (Figure 5D). This result was significant at both CpG+ and CpG− promoters (Figure S5F). In agreement with NF-κB ChIP-seq data, genes with unidirectional TSSs underwent the lowest levels of activation upon immune challenge (Figure 5D). We conclude that the larger NDRs found in resting macrophages at genes with distant anti-sense TSSs can facilitate more recruitment of signal-dependent TFs like NF-κB, thereby enabling stronger gene activation.

A Subset of Anti-sense TSSs Exhibit Enhancer-like Chromatin Features

We next investigated whether genes with anti-sense transcription displayed altered epigenetic chromatin modifications upstream of the sense TSS. For example, chromatin features such as tri-methylation of histone H3 K4 (H3K4me3) have been linked to transcriptional activation at mRNA promoters, and mono-methylation of histone H3 K4 (H3K4me1) coincides with enhancer regions (Calo and Wysocka, 2013; Heintzman et al., 2009; Visel et al., 2009). We analyzed the distribution of these histone modifications around bidirectional as compared to unidirectional promoters using ChIP-seq from resting macrophages. As expected, H3K4me3 signal peaked in the sense direction near the first downstream nucleosomes and was present at similar, low levels upstream of bidirectional and unidirectional genes (Figure 6A). In contrast, H3K4me1 was enriched upstream of bidirectional TSSs relative to unidirectional TSSs (Figure 6B). Given the association of H3K4me1 with enhancer regions, we evaluated whether other characteristics of enhancers were present upstream of bidirectional TSSs. Indeed, ChIP-seq signal for two key marks of active enhancers, the histone acetyl-transferase p300 and H3 K27 acetylation (H3K27ac, Calo and Wysocka, 2013) were significantly higher upstream of genes with anti-sense transcription (Figure 6C).

Figure 6. Enhancer-like Chromatin Signature Around Anti-sense TSSs.

Figure 6

(A and B) Average distribution of H3K4me3 (A) and H3K4me1 (B) ChIP-seq signal for Pol II-bound unidirectional promoters (black; N=2,240) and bidirectional promoters (red; N=8,730) in 50 bp bins centered on sense TSSs.

(C) Shown are H3K27Ac and p300 ChIP-seq reads in the region from the sense TSS to −1 kb at unidirectional (black) and bidirectional (orange) genes. Asterisks signify P<0.0001.

(D) H3K4me1 ChIP-seq signal at bidirectional promoters in a 2 kb window centered on anti-sense TSSs. Bidirectional promoters are ranked by decreasing H3K4me1 signal in the region from −1 kb to the observed anti-sense TSS, and divided into the top quartile of H3K4me1 (red) and all other quartiles of H3K4me1 signal (green), as shown at right.

(E) H3K27ac, p300 and H3K4me3 ChIP-seq reads in the region from the sense TSS to −1 kb at genes from the top quartile of upstream H3K4me1 signal (red) and all other quartiles (green), as shown in (D). Asterisks signify P<0.0001.

(F) Elongated nascent RNA (sense TSS to +250 bp) and Start-RNA reads (±100 bp around sense TSS) at H3K4me1 quartiles as shown in (E). Asterisks signify P<0.0001.

(G) Enriched GO biological processes among bidirectional promoters with highest levels of H3K4me1 (top quartile).

See also Figure S6.

Interestingly, not all bidirectional promoters shared these features equally: ranking genes by the level of H3K4me1 near anti-sense TSSs revealed a subset with particularly strong H3K4me1 signal (Figure 6D, top quartile of signal denoted in red). Genes in this group showed a full range of sense/anti-sense distances (Figure S6A) and H3K4me1 signal was independent of NDR size (Figure S6B), indicating that this histone mark was not reflecting levels of open chromatin. Notably, genes in the top quartile of H3K4me1 signal were enriched in other features of enhancers near their anti-sense TSSs, such as H3K27ac and p300 ChIP-seq signal, and displayed lower upstream H3K4me3 signal (Figure 6E). Moreover, bidirectional promoters in this top quartile of H3K4me1 signal produced elevated levels of sense-strand Start-RNAs and elongated RNAs (Figures 6F and S6A). Thus, we suggest that the presence of enhancer-like characteristics around the anti-sense TSS might contribute to transcription activity from the sense TSS partner.

To understand what kinds of genes displayed these features in macrophages, we performed Gene Ontology analysis on the most H3K4me1 enriched genes (top quartile of signal). This revealed a strong enrichment in genes involved in immune and defense responses (Figure 6G), including key regulators such as toll-like receptors, interleukins and key chemokines. In conclusion, anti-sense TSSs endowed with enhancer-like features in macrophages appear to be preferentially coupled to active genes with immune and inflammatory functions.

PU.1 Enrichment at Bidirectional TSSs of Immune-inducible Genes

To probe whether genes within the top quartile of H3K4me1 enrichment were indeed involved in immune responsiveness, we calculated the percentage of Early, Late and Control genes contained within this group. This revealed more Early than Late genes with high levels of H3K4me1, but with both groups present at levels higher than Control genes (Figure 7A). We then analyzed a time course of nascent elongated RNA production following LPS treatment and found that genes in the top quartile of H3K4me1 signal exhibit much greater levels of activated transcription upon bacterial challenge as compared to genes in the lower three quartiles (Figures 7B and S7A), revealing a correlation between histone modifications around the anti-sense TSS and gene inducibility.

Figure 7. PU.1 Enrichment at Bidirectional TSSs Marks Active and Inducible Genes.

Figure 7

(A) Percentage of Early, Late, and Control genes that fall in the top quartile of H3K4me1 signal around anti-sense TSSs.

(B) Average elongated RNA reads (sense TSS to +250 bp) for genes in the top quartile of H3K4me1 signal, vs. all other bidirectional genes, across a two hour time course of LPS challenge.

(C and D) Average distribution of PU.1 motifs (C) and PU.1 occupancy (D) in 50-mer bins for genes in the top quartile of H3K4me1 vs. all other bidirectional genes, centered on either the anti-sense (left) or sense (right) TSS.

(E) Model for transcriptional activity at bidirectional (top) vs. unidirectional (bottom) promoters. At bidirectional promoters, a lineage-specifying factor such as PU.1 (blue) collaborates with other TFs (yellow) to bind the focused region of TF motifs at coupled sense (green) and anti-sense (purple) TSSs. Through the activity of TFs and the transcription machinery, an NDR is established at both TSSs and spreads between the coupled promoters. This process renders additional transcription factor binding motifs accessible (orange boxes). Following stimulation, signal-dependent transcription factors like NF-κB (orange) preferentially bind motifs in the pre-established NDR enabling high transcriptional output.

At unidirectional promoters, a limited NDR size and absence of well-positioned upstream nucleosome is observed, such that promoter-distal TF motifs are often occluded by chromatin and unavailable for binding.

See also Figure S7.

To determine how the H3K4me1 modification was specified at a subset of bidirectional promoters, we considered that deposition of H3K4me1 and other enhancer marks in macrophages often follows binding of the lineage factor PU.1 (Ghisletti et al., 2010; Heinz et al., 2010; Kaikkonen et al., 2013). Quantifying the distribution of PU.1 motifs around sense and anti-sense TSSs revealed that genes enriched in H3K4me1 had considerably more PU.1 motifs near the sense and anti-sense TSSs than genes with lower H3K4me1 signal (Figure 7C). Moreover, genes in the top quartile of H3K4me1 signal in macrophages displayed significantly elevated PU.1 ChIP-seq signal around both TSSs (Figure 7D and S7B). Thus, PU.1 association with bidirectional promoters is enriched at genes with enhancer-like chromatin features and increased activity.

DISCUSSION

We propose that the predominant mammalian promoter unit should be defined to include sense and upstream anti-sense TSSs as well as the intervening DNA sequence. Bidirectional promoter architecture is widespread: we observe clear anti-sense TSSs at >75% of active genes in murine macrophages. Anti-sense transcription initiation is precise and guided by DNA sequence content including distinct core promoter-like elements, as observed in recent work (Duttke et al., 2015). Further, we find peaks of TF motif occurrence focused directly upstream of both sense and anti-sense TSSs (Figure 7E). High TF occupancy at bidirectional TSSs is envisioned to allow collaboration between TFs, to recruit the transcription machinery and define sites of transcription initiation. Once initiated, Pol II could elongate divergently, creating super-helical tension between the promoters that further discourages nucleosome assembly (Core et al., 2008; Preker et al., 2008; Seila et al., 2008). Importantly, increased distance between sense and anti-sense TSSs is associated with a larger NDR in resting macrophages. This creates a broader region of accessible chromatin and exposes more signal-dependent TFs motifs. Thus, identification of anti-sense TSS locations provides valuable information on promoter chromatin architecture and which promoter-distal TF motifs would be readily available for binding.

These results provide insights into the profile of transcriptional responses elicited by bacterial challenge of macrophages. It has long been appreciated that a single immune stimulus activates hundreds of genes to differing magnitudes and with variable kinetics, however the mechanisms underlying this transcriptional diversity have remained unclear (Bhatt et al., 2012; Escoubet-Lozach et al., 2011; Rogatsky and Adelman, 2014). Our findings indicate that the magnitude of gene activation is connected to the distance between sense and anti-sense TSSs, since this impacts the number of accessible TF motifs. We find significantly greater sense/anti-sense TSS distances and larger NDRs among Early response genes than at Late genes (Figure 5A), supporting the idea that NF-κB motifs located in open chromatin enable fast binding and transcriptional responses.

Further, we suggest a role for PU.1 binding near anti-sense TSSs in promoting a unique chromatin environment at a subset of active macrophage genes (Figure 7A). PU.1 is critical for defining macrophage identity and is strictly required in mature cells for inducible gene expression (Eichbaum et al., 1994; Grove and Plumb, 1993). These roles of PU.1 have generally been attributed to its association with enhancers (Kaikkonen et al., 2013; Ostuni et al., 2013) and the majority of PU.1-bound regions are indeed located distal from mRNA TSSs. However, nearly 9,000 binding sites for PU.1 in macrophages are located near gene promoters (Ghisletti et al., 2010), indicating that PU.1 also plays a TSS-proximal role. In support of this idea, we find that a subset of anti-sense TSSs displayed enrichment in PU.1 binding sites and PU.1 ChIP-seq signal, along with accumulation of H3K4me1, H3K27ac and p300. The combination of enrichment in PU.1 enhancer-like chromatin marks and non-processive transcription elongation at anti-sense TSSs is reminiscent of recent studies on macrophage enhancers (Kaikkonen et al., 2013), suggesting the intriguing possibility that anti-sense TSSs might serve as proximal enhancers of mRNA expression.

In conclusion, we find that two previously unappreciated features of promoter architecture impact basal and stimulus-responsive transcription, namely, the distance between sense and anti-sense TSSs and the presence of enhancer-like chromatin features near the anti-sense TSS. We show that anti-sense TSSs organize local chromatin, extending the NDR and enabling the asymmetric acquisition of activating histone modifications. We suggest that bidirectional promoters generate an optimized environment for recruitment of TFs and the transcription machinery, both in resting macrophages and during immune challenge, shedding new light on the regulatory landscape surrounding mammalian protein-coding genes.

EXPERIMENTAL PROCEDURES

Genomic Data Sets

Pol II ChIP-seq was performed as described in (Muse et al., 2007). Start-RNAs were prepared from nuclear RNA as described in (Nechaev et al., 2010). MNase-digested chromatin from macrophages and libraries were prepared as described in (Gilchrist et al., 2010). PU.1, H3K4me1 and H3K4me3 ChIP-seq and FAIRE-seq data from resting bone marrow derived macrophages (Ostuni et al., 2013) were downloaded from GEO (GSE38379). NF-kB (p65) ChIP-seq data from stimulated macrophages (Kaikkonen et al., 2013) were downloaded from GEO (GSE48759) and TBP ChIP-seq data (Escoubet-Lozach et al., 2011) were downloaded from GEO (GSE23622). Elongated nascent RNA-seq data from the chromatin fraction of bone marrow derived macrophages (Bhatt et al., 2012) were downloaded from GEO (GSE32916).

Sense and Anti-sense TSS Identification

Observed sense TSSs were defined within 2000 nt search windows centered on RefSeq annotated TSSs, using the location to which to largest number of Start-RNA reads aligned, with further considerations noted in Extended Experimental Procedures. A preference toward minimal distance for re-annotation relative to RefSeq was employed. Anti-sense TSS search windows were defined within a 1000 nt region upstream of each sense TSS, on the opposite strand using the same procedure described for sense TSS loci with proximity to the sense TSS utilized to break ties between locations with equivalent Start-seq reads.

LPS Responsive Gene Classification

Genes were categorized as Early or Late response genes if they displayed a ≥2-fold increase in Pol II ChIP-seq gene body signal after a 30′ or 120′ LPS challenge, respectively. An equivalent number of control genes were selected at random from the population that showed no increase in Pol II levels at either time point.

Statistics

Thorughout the manuscript, box plots show 25th–75th percentiles and error bars depict 10–90th percentiles. Mann-Whitney tests were used to determine P-values, except where noted otherwise.

Supplementary Material

supplement

Acknowledgments

We thank Paul Wade, Torben Jensen and Chris Burge for helpful comments on this project and manuscript. This research was supported by the Intramural Research Program of the NIH, National Institute of Environmental Health Sciences to K.A. (Z01 ES101987).

Footnotes

AUTHOR CONTRIBUTIONS

D.A.G, B.S.S and K.A. conceived of this project and wrote the manuscript. S.N. prepared Start-RNAs, G.M. performed ChIP-seq and D.A.G carried out remaining experiments. B.S.S, D.A.G, A.B. and D.F. performed data analysis.

ACCESSION NUMBERS

Sequencing data described in this work have been deposited in GEO under accession # GSE62151.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Almada AE, Wu X, Kriz AJ, Burge CB, Sharp PA. Promoter directionality is controlled by U1 snRNP and polyadenylation signals. Nature. 2013;499:360–363. doi: 10.1038/nature12349. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Amit I, Garber M, Chevrier N, Leite AP, Donner Y, Eisenhaure T, Guttman M, Grenier JK, Li W, Zuk O, et al. Unbiased reconstruction of a mammalian transcriptional network mediating pathogen responses. Science. 2009;326:257–263. doi: 10.1126/science.1179050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Andersen PK, Lykke-Andersen S, Jensen TH. Promoter-proximal polyadenylation sites reduce transcription activity. Genes Dev. 2012;26:2169–2179. doi: 10.1101/gad.189126.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Berg MG, Singh LN, Younis I, Liu Q, Pinto AM, Kaida D, Zhang Z, Cho S, Sherrill-Mix S, Wan L, Dreyfuss G. U1 snRNP determines mRNA length and regulates isoform expression. Cell. 2012;150:53–64. doi: 10.1016/j.cell.2012.05.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bhatt DM, Pandya-Jones A, Tong AJ, Barozzi I, Lissner MM, Natoli G, Black DL, Smale ST. Transcript dynamics of proinflammatory genes revealed by sequence analysis of subcellular RNA fractions. Cell. 2012;150:279–290. doi: 10.1016/j.cell.2012.05.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Butler JE, Kadonaga JT. The RNA polymerase II core promoter: a key component in the regulation of gene expression. Genes Dev. 2002;16:2583–2592. doi: 10.1101/gad.1026202. [DOI] [PubMed] [Google Scholar]
  7. Calo E, Wysocka J. Modification of enhancer chromatin: what, how, and why? Mol. Cell. 2013;49:825–837. doi: 10.1016/j.molcel.2013.01.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Carninci P, Kasukawa T, Katayama S, Gough J, Frith MC, Maeda N, Oyama R, Ravasi T, Lenhard B, Wells C, et al. The transcriptional landscape of the mammalian genome. Science. 2005;309:1559–1563. doi: 10.1126/science.1112014. [DOI] [PubMed] [Google Scholar]
  9. Cheng J, Kapranov P, Drenkow J, Dike S, Brubaker S, Patel S, Long J, Stern D, Tammana H, Helt G, et al. Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution. Science. 2005;308:1149–1154. doi: 10.1126/science.1108625. [DOI] [PubMed] [Google Scholar]
  10. Conaway RC, Conaway JW. The Mediator complex and transcription elongation. Biochim Biophys Acta. 2013;1829:69–75. doi: 10.1016/j.bbagrm.2012.08.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Core LJ, Martins AL, Danko CG, Waters CT, Siepel A, Lis JT. Analysis of nascent RNA identifies a unified architecture of initiation regions at mammalian promoters and enhancers. Nat Genet. 2014;46:1311–1320. doi: 10.1038/ng.3142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Core LJ, Waterfall JJ, Lis JT. Nascent RNA sequencing reveals widespread pausing and divergent initiation at human promoters. Science. 2008;322:1845–1848. doi: 10.1126/science.1162228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. De Santa F, Barozzi I, Mietton F, Ghisletti S, Polletti S, Tusi BK, Muller H, Ragoussis J, Wei CL, Natoli G. A large fraction of extragenic RNA pol II transcription sites overlap enhancers. PLoS biology. 2010;8:e1000384. doi: 10.1371/journal.pbio.1000384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Duttke SH, Lacadie SA, Ibrahim MM, Glass CK, Corcoran DL, Benner C, Heinz S, Kadonaga JT, Ohler U. Human promoters are intrinsically directional. Mol Cell. 2015;57:674–684. doi: 10.1016/j.molcel.2014.12.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Eichbaum QG, Iyer R, Raveh DP, Mathieu C, Ezekowitz RA. Restriction of interferon gamma responsiveness and basal expression of the myeloid human Fc gamma R1b gene is mediated by a functional PU.1 site and a transcription initiator consensus. J Exp Med. 1994;179:1985–1996. doi: 10.1084/jem.179.6.1985. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Escoubet-Lozach L, Benner C, Kaikkonen MU, Lozach J, Heinz S, Spann NJ, Crotti A, Stender J, Ghisletti S, Reichart D, et al. Mechanisms establishing TLR4-responsive activation states of inflammatory response genes. PLoS Genet. 2011;7:e1002401. doi: 10.1371/journal.pgen.1002401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fenouil R, Cauchy P, Koch F, Descostes N, Cabeza JZ, Innocenti C, Ferrier P, Spicuglia S, Gut M, Gut I, Andrau JC. CpG islands and GC content dictate nucleosome depletion in a transcription-independent manner at mammalian promoters. Genome Res. 2012;22:2399–2408. doi: 10.1101/gr.138776.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Flynn RA, Almada AE, Zamudio JR, Sharp PA. Antisense RNA polymerase II divergent transcripts are P-TEFb dependent and substrates for the RNA exosome. Proc Natl Acad Sci USA. 2011;108:10460–10465. doi: 10.1073/pnas.1106630108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Ghisletti S, Barozzi I, Mietton F, Polletti S, De Santa F, Venturini E, Gregory L, Lonie L, Chew A, Wei CL, et al. Identification and characterization of enhancers controlling the inflammatory gene expression program in macrophages. Immunity. 2010;32:317–328. doi: 10.1016/j.immuni.2010.02.008. [DOI] [PubMed] [Google Scholar]
  20. Gilchrist DA, Dos Santos G, Fargo DC, Xie B, Gao Y, Li L, Adelman K. Pausing of RNA polymerase II disrupts DNA-specified nucleosome organization to enable precise gene regulation. Cell. 2010;143:540–551. doi: 10.1016/j.cell.2010.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Grove M, Plumb M. C/EBP, NF-kappa B, and c-Ets family members and transcriptional regulation of the cell-specific and inducible macrophage inflammatory protein 1 alpha immediate-early gene. Mol Cell Biol. 1993;13:5276–5289. doi: 10.1128/mcb.13.9.5276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Heintzman ND, Hon GC, Hawkins RD, Kheradpour P, Stark A, Harp LF, Ye Z, Lee LK, Stuart RK, Ching CW, et al. Histone modifications at human enhancers reflect global cell-type-specific gene expression. Nature. 2009;459:108–112. doi: 10.1038/nature07829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK. Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol Cell. 2010;38:576–589. doi: 10.1016/j.molcel.2010.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Henriques T, Gilchrist DA, Nechaev S, Bern M, Muse GW, Burkholder A, Fargo DC, Adelman K. Stable Pausing by RNA Polymerase II Provides an Opportunity to Target and Integrate Regulatory Signals. Mol Cell. 2013;52:517–528. doi: 10.1016/j.molcel.2013.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kaikkonen MU, Spann NJ, Heinz S, Romanoski CE, Allison KA, Stender JD, Chun HB, Tough DF, Prinjha RK, Benner C, Glass CK. Remodeling of the enhancer landscape during macrophage activation is coupled to enhancer transcription. Mol Cell. 2013;51:310–325. doi: 10.1016/j.molcel.2013.07.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Kim TK, Hemberg M, Gray JM, Costa AM, Bear DM, Wu J, Harmin DA, Laptewicz M, Barbara-Haley K, Kuersten S, et al. Widespread transcription at neuronal activity-regulated enhancers. Nature. 2010;465:182–187. doi: 10.1038/nature09033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Muse GW, Gilchrist DA, Nechaev S, Shah R, Parker JS, Grissom SF, Zeitlinger J, Adelman K. RNA polymerase is poised for activation across the genome. Nat Genet. 2007;39:1507–1511. doi: 10.1038/ng.2007.21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Nechaev S, Fargo DC, dos Santos G, Liu L, Gao Y, Adelman K. Global analysis of short RNAs reveals widespread promoter-proximal stalling and arrest of Pol II in Drosophila. Science. 2010;327:335–338. doi: 10.1126/science.1181421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Ntini E, Jarvelin AI, Bornholdt J, Chen Y, Boyd M, Jorgensen M, Andersson R, Hoof I, Schein A, Andersen PR, et al. Polyadenylation site-induced decay of upstream transcripts enforces promoter directionality. Nat Struct Mol Biol. 2013;20:923–928. doi: 10.1038/nsmb.2640. [DOI] [PubMed] [Google Scholar]
  30. Ostuni R, Piccolo V, Barozzi I, Polletti S, Termanini A, Bonifacio S, Curina A, Prosperini E, Ghisletti S, Natoli G. Latent enhancers activated by stimulation in differentiated cells. Cell. 2013;152:157–171. doi: 10.1016/j.cell.2012.12.018. [DOI] [PubMed] [Google Scholar]
  31. Preker P, Nielsen J, Kammler S, Lykke-Andersen S, Christensen MS, Mapendano CK, Schierup MH, Jensen TH. RNA exosome depletion reveals transcription upstream of active human promoters. Science. 2008;322:1851–1854. doi: 10.1126/science.1164096. [DOI] [PubMed] [Google Scholar]
  32. Proudfoot NJ. Ending the message: poly(A) signals then and now. Genes Dev. 2011;25:1770–1782. doi: 10.1101/gad.17268411. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Ramirez-Carrozzi VR, Nazarian AA, Li CC, Gore SL, Sridharan R, Imbalzano AN, Smale ST. Selective and antagonistic functions of SWI/SNF and Mi-2beta nucleosome remodeling complexes during an inflammatory response. Genes Dev. 2006;20:282–296. doi: 10.1101/gad.1383206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Rhee HS, Pugh BF. Genome-wide structure and organization of eukaryotic pre-initiation complexes. Nature. 2012;483:295–301. doi: 10.1038/nature10799. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Roeder RG. Transcriptional regulation and the role of diverse coactivators in animal cells. FEBS Lett. 2005;579:909–915. doi: 10.1016/j.febslet.2004.12.007. [DOI] [PubMed] [Google Scholar]
  36. Rogatsky I, Adelman K. Preparing the first responders: building the inflammatory transcriptome from the ground up. Mol Cell. 2014;54:245–254. doi: 10.1016/j.molcel.2014.03.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Saccani S, Pantano S, Natoli G. Two waves of nuclear factor kappaB recruitment to target promoters. J Exp Med. 2001;193:1351–1359. doi: 10.1084/jem.193.12.1351. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Scott EW, Simon MC, Anastasi J, Singh H. Requirement of transcription factor PU.1 in the development of multiple hematopoietic lineages. Science. 1994;265:1573–1577. doi: 10.1126/science.8079170. [DOI] [PubMed] [Google Scholar]
  39. Seila AC, Calabrese JM, Levine SS, Yeo GW, Rahl PB, Flynn RA, Young RA, Sharp PA. Divergent transcription from active promoters. Science. 2008;322:1849–1851. doi: 10.1126/science.1162253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Sigova AA, Mullen AC, Molinie B, Gupta S, Orlando DA, Guenther MG, Almada AE, Lin C, Sharp PA, Giallourakis CC, Young RA. Divergent transcription of long noncoding RNA/mRNA gene pairs in embryonic stem cells. Proc Natl Acad Sci USA. 2013;110:2876–2881. doi: 10.1073/pnas.1221904110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Visel A, Blow MJ, Li Z, Zhang T, Akiyama JA, Holt A, Plajzer-Frick I, Shoukry M, Wright C, Chen F, et al. ChIP-seq accurately predicts tissue-specific activity of enhancers. Nature. 2009;457:854–858. doi: 10.1038/nature07730. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

supplement

RESOURCES