Abstract
Genome‐wide transcription studies are revealing an increasing number of “dispersed promoters” that, unlike “focused promoters”, lack well‐conserved sequence motifs and tight regulation. Dispersed promoters are nevertheless marked by well‐defined chromatin structures, suggesting that specific sequence elements must exist in these unregulated promoters. Here, we have analyzed regions of transcription initiation in the eukaryotic parasite Trypanosoma brucei, in which RNA polymerase II transcription initiation occurs over broad regions without distinct promoter motifs and lacks regulation. Using a combination of site‐specific and genome‐wide assays, we identified GT‐rich promoters that can drive transcription and promote the targeted deposition of the histone variant H2A.Z in a genomic context‐dependent manner. In addition, upon mapping nucleosome occupancy at high resolution, we find nucleosome positioning to correlate with RNA pol II enrichment and gene expression, pointing to a role in RNA maturation. Nucleosome positioning may thus represent a previously unrecognized layer of gene regulation in trypanosomes. Our findings show that even highly dispersed, unregulated promoters contain specific DNA elements that are able to induce transcription and changes in chromatin structure.
Keywords: core promoter, histone variant, nucleosome occupancy, Trypanosoma brucei
Subject Categories: Chromatin, Epigenetics, Genomics & Functional Genomics; Microbiology, Virology & Host Pathogen Interaction; Transcription
Introduction
Transcription initiation involves the assembly of an elaborate complex of transcription factors to target the polymerase to the correct genomic locus. Two key features of this process are a core promoter, which functions as a platform on which the transcription machinery assembles, and a chromatin structure that allows the transcription machinery to access the promoter DNA (Kadonaga, 2012). Typically, the core promoter is defined as the minimal DNA sequence necessary and sufficient to direct the accurate initiation of transcription by an RNA polymerase (RNA pol) (Roeder, 1996). Core promoters commonly comprise specific DNA elements that confer distinct properties to the promoter; for example, they serve as binding sites for transcription factors (Roeder, 1996) or are involved in defining a specific chromatin structures that leave the DNA depleted of nucleosomes (Raisner et al, 2005; Yuan et al, 2005; Mavrich et al, 2008a).
While most eukaryotic DNA is occupied by nucleosomes, genome‐wide nucleosome occupancy maps that have been generated for more than 30 organisms have revealed a consistent pattern: In all organisms studied thus far, promoters and other regulatory elements are depleted of nucleosomes (Hughes & Rando, 2014). In addition, there is an enrichment of the histone variant H2A.Z at the nucleosomes that flank the nucleosome‐depleted regions (NDRs) (Albert et al, 2007; Barski et al, 2007; Mavrich et al, 2008b). Reports regarding the biological function of H2A.Z are conflicting, with some studies describing that incorporation of H2A.Z stabilizes the nucleosome and others describing a destabilizing effect (reviewed in Zlatanova & Thakar, 2008). Similarly, some studies report an activating role for H2A.Z on transcription, while others describe a repressive influence (reviewed in Marques et al, 2010).
The most important factor in the establishment of NDRs seems to be the DNA sequence (Struhl & Segal, 2013). Homopolymeric sequences such as poly(dA:dT) and poly(dG:dC) are intrinsically rigid, a property that negatively affects the ability of DNA to bend around histone octamers and thus strongly inhibits nucleosome formation (Suter et al, 2000; Segal & Widom, 2009). The mechanisms leading to the deposition of H2A.Z at promoters remain less well understood, but findings in yeast suggest that deposition may be dependent on the presence of NDRs (Raisner et al, 2005), which are recognized by the chromatin remodeling complexes SWR‐C/SWR1 (Yen et al, 2013). SWR‐C/SWR1 had previously been shown to catalyze the exchange of H2A with H2A.Z (Mizuguchi et al, 2004).
Different genome‐wide methods have identified a rapidly increasing number of promoters that lack well‐defined sequence motifs and revealed two general classes of promoters: “focused” and “dispersed” (Carninci et al, 2006). Historically, the majority of research has been devoted to focused promoters where transcription initiation occurs from a single transcription start site (TSS) or a narrow cluster of TSSs that are associated with well‐defined core promoter motifs, for example, a TATAA box. However, it is becoming increasingly clear that a large percentage of promoters (~70% in mammals, Saxonov et al, 2006) are dispersed, typically containing TSSs spread over a range of 50–100 bp, although regions as wide as 10 kb have been reported (Koch et al, 2011). Besides mammals, dispersed promoters have been identified in yeast (Zhang & Dietrich, 2005), Arabidopsis thaliana (Yamamoto et al, 2009), Drosophila melanogaster (Ni et al, 2010), Xenopus laevis (van Heeringen et al, 2011), Leishmania major (Martinez‐Calvillo et al, 2003), and Trypanosoma brucei (Kolev et al, 2010). It is likely that more dispersed promoters will be discovered during similar genome‐wide TSS studies in less well‐studied organisms, which will help to elucidate the biological significance of this apparent promoter dichotomy.
Data from vertebrates and metazoans indicate that focused promoters drive transcription of highly regulated genes while dispersed promoters drive transcription of weakly regulated genes that lack well‐defined sequence motifs (Sandelin et al, 2007; Lenhard et al, 2012). This correlation has raised questions about the role of promoter sequence motifs, for example, whether specific promoter motifs are required for transcription initiation of ubiquitously expressed genes or whether promoter motifs exist in organisms that do not regulate transcription initiation. In the latter case, it is possible that an “open” chromatin structure would permit transcription factors and polymerase to access the DNA and be sufficient for transcription initiation.
To elucidate the link between promoter structure and transcriptional regulation, we investigated the role of promoter sequence elements on transcription and histone variant deposition in an organism lacking transcriptional regulation, such as T. brucei. Unlike any eukaryotes studied thus far, this protozoan parasite belonging to the order of Trypanosomatida, which encompasses clinically important pathogens such as L. major, Trypanosoma cruzi, and T. brucei, appears to completely lack the ability to regulate transcription by RNA pol II (Clayton, 2002).
In these pathogens, RNA pol II‐transcribed protein‐coding genes are organized in long polycistronic transcription units (PTUs) with no clustering of functionally related genes (Berriman et al, 2005). Primary transcripts are co‐transcriptionally processed into individual mRNAs by coupled trans‐splicing and polyadenylation reactions (LeBowitz et al, 1993; Matthews et al, 1994). In trans‐splicing, a 39‐nt spliced‐leader sequence containing the cap structure required for translation and possibly other aspects of mRNA function is added to the 5′‐end of the 5′ UTR (Campbell et al, 1984; Kooter et al, 1984; Sutton & Boothroyd, 1986; Perry et al, 1987). Consistent with the notion that ubiquitously expressed genes lack well‐defined promoter motifs, RNA pol II promoter motifs have been elusive in T. brucei, with the exception of the spliced‐leader RNA promoter (Das et al, 2005; Schimanski et al, 2005). Furthermore, RNA pol II transcription initiation in T. brucei can occur in the absence of identifiable promoter motifs (Marchetti et al, 1998; McAndrew et al, 1998) suggesting that a permissive chromatin structure may be sufficient for transcription initiation in this parasite. In agreement with this hypothesis, several genome‐wide studies have found the boundaries of PTUs to be marked by distinct histone variants, such as H2A.Z, and histone modifications (Siegel et al, 2009; Thomas et al, 2009; Wright et al, 2010).
As seen in other organisms, H2A.Z is found at sites of RNA pol II transcription initiation in T. brucei. However, H2A.Z deposition is not restricted to narrow, well‐defined sites upstream of individual genes; instead, H2A.Z is enriched across regions of ~10 kb in width, encompassing the first few genes of every PTU (Siegel et al, 2009). How H2A.Z is targeted to sites of transcription initiation in the absence of DNA motifs remains an enigma. In addition, high‐throughput sequencing of primary transcripts suggests that RNA pol II transcription initiation occurs over broad genomic regions, matching sites of H2A.Z enrichment, rather than at well‐defined TSSs (Kolev et al, 2010). We thus use the term transcription start regions (TSRs) rather than TSSs and define TSRs as regions enriched in H2A.Z compared to the rest of the genome. Nucleosome occupancy has not been systematically analyzed in trypanosomatids, but several studies find RNA pol I‐transcribed genes to be depleted of nucleosomes (Figueiredo & Cross, 2010; Stanne & Rudenko, 2010).
In this study, we aimed to elucidate the features that define RNA pol II TSRs in T. brucei by determining the importance of DNA sequence elements in transcription initiation, histone variant recruitment, and nucleosome positioning.
We have generated for the first time genome‐wide maps of nucleosome occupancy and RNA pol II enrichment in T. brucei as well as an improved dataset of TSSs. Using these datasets and data from other epigenome studies, we devised a dual‐luciferase reporter assay to systematically test the ability of specific DNA elements to initiate transcription. Our study identified a GT‐rich promoter motif, able to initiate transcription and to aid in the targeted deposition of the histone variant H2A.Z. In addition, we find that TSRs are more sensitive to micrococcal nuclease (MNase) than the remainder of the genome.
Unexpectedly, we find RNA pol II levels and nucleosome occupancy to correlate not only at TSRs but also upstream of most genes, even within PTUs, pointing to a role for nucleosome positioning in RNA processing, rather than transcription initiation.
Results
RNA pol II transcription initiates transcription at the 5′‐end of H2A.Z peaks
In T. brucei, RNA pol II transcription is thought to initiate within ~10‐kb‐wide regions enriched in H2A.Z (Siegel et al, 2009; Kolev et al, 2010). However, in an effort to identify RNA pol II promoter elements, we decided to determine the sites of RNA pol II transcription initiation more precisely following a two‐pronged approach. First, we determined the genome‐wide distribution of RNA pol II enrichment by ChIP‐seq (chromatin immunoprecipitation followed by next‐generation sequencing) using a cell line expressing Ty1‐tagged RPB9, a subunit only found in the RNA pol II complex (Devaux et al, 2006). In addition, we determined the distribution of H2A.Z by MNase‐ChIP‐seq, using a custom‐made, H2A.Z‐specific antibody (Appendix Fig S1). The RPB9‐ChIP‐seq data revealed RNA pol II to be strongly enriched across a ~2‐kb region at the 5′‐end of H2A.Z‐enriched sites (Fig 1A). Since polymerase enrichment correlates with transcriptional pausing and RNA pol II pausing just downstream of promoters has been described in a wide range of organisms (Adelman & Lis, 2012), we hypothesized that the sites of RNA pol II enrichment observed in this study mark sites of transcription initiation.
Next, to corroborate that sites of RNA pol II enrichment indeed mark sites of transcription initiation, we mapped the distribution of small primary transcripts carrying a 5′‐triphosphate. Since triphosphates are removed during RNA maturation, mapping of triphosphate‐containing RNA can be used to identify sites of transcription initiation (Fig 1B). Selecting for short triphosphate‐containing transcripts allowed us to reduce the amount of rRNA and to identify sites of transcription initiation more precisely. As anticipated, we found small primary transcripts to be enriched at the 5′‐end of sites of H2A.Z enrichment (Fig 1C). Interestingly, levels of primary transcripts peaked just upstream of RNA pol II enrichment (compare Fig 1A and C), suggesting that RNA pol lI transcription pauses 100–200 bp downstream of its initiation, similar to the situation observed in metazoans (Adelman & Lis, 2012). However, unlike dispersed promoters in metazoans, which induce bidirectional transcription initiation (Wu & Sharp, 2013), our data indicate that in T. brucei, transcription initiation has a strong strand bias with a ratio of sense to antisense primary transcripts of 4:1.
Together, our RNA pol II ChIP‐seq data and our primary transcripts analysis revealed the 5′‐end of H2A.Z‐enriched regions as the primary sites of RNA pol II initiation.
The ability of DNA sequence elements to initiate transcription depends on the genomic context
To test whether a specific DNA sequence element can drive transcription initiation in vivo in T. brucei, we devised a reporter assay that allowed us to insert different DNA sequence elements upstream of a firefly luciferase reporter gene (FLUC). This required the identification of a non‐transcribed, yet transcriptionally permissive region of the genome. The organization of the T. brucei genome into long PTUs means that most of the core genome is actively transcribed, the exceptions being small regions between PTUs (Kolev et al, 2010; Siegel et al, 2010). However, previous ChIP‐seq assays had indicated that polycistronic transcription terminates in regions enriched in H3.V and H4.V (Fig 2A), leading to the assumption that these histone variants repress transcription (Siegel et al, 2009). Therefore, we inserted the reporter construct between two divergent PTUs, which corresponds to a non‐transcribed region of the genome that contains low levels of H3.V and H4.V (Fig 2B).
To determine the capacity of DNA sequence elements to initiate transcription, we generated two reporter constructs each containing the DNA from a different transcription start region (TSR) upstream of a luciferase gene (regA and regB; Fig 2C, Table EV1). Insertion of the reporter constructs into a region between two divergent PTUs of chr. 1 resulted in an 8.7‐fold and 9.6‐fold increase in luciferase activity compared to insertion of a control reporter construct lacking a promoter sequence. These results indicated that the DNA sequence found at TSRs contained elements able to initiate transcription. Targeting these luciferase reporter constructs to two genomic loci with increased H3.V and H4.V levels did not result in luciferase expression, not even in a cell line lacking H3.V (Fig EV1), underlining the importance of the genomic context for gene expression.
To identify specific DNA elements that are able to initiate transcription within the TSR, we divided the two TSRs into seven and five evenly spaced fragments, respectively (1,800 bp in width and with 500‐bp overlap between adjacent fragments). When repeating the assay using the different TSR fragments, we observed that all 12 fragments were capable of initiating transcription well above background (Fig 2C). The highest luciferase levels were observed for the two fragments originating from the 5′‐end of each TSR (regA2 and regB1). One fragment extending ~300 nt upstream of the TSR (regA1) yielded similar levels.
The observation that all TSR fragments were capable of initiating transcription at similar levels argues against the presence of well‐defined canonical promoter motifs. Instead, the observed pattern is similar to that reported for dispersed promoters that lead to broad regions of transcription initiation (Deaton & Bird, 2011).
GT‐rich promoter elements can induce transcription
Our analysis of primary transcripts indicated that transcription initiation has a strong strand bias. To determine whether TSR‐derived sequence elements are sufficient to ensure directionality (i.e., more sense transcription than antisense transcription), we inverted TSR fragments in the reporter assays (Fig 2C; regA2rc, regB1rc). After inversion, both fragments showed a 4.7‐fold and 3.3‐fold decrease in their capacity to drive luciferase expression, in good agreement with the 4:1 ratio of sense to antisense primary transcripts we observed. Based on these results, we hypothesized the following: (i) Promoter elements capable of ensuring directional transcription initiation are present in T. brucei, (ii) these elements are enriched within TSRs, and (iii) they are unevenly distributed across the coding and non‐coding strands, giving rise to the observed bias in sense vs. antisense transcripts.
To identify such sequence elements, we divided all TSRs into five evenly spaced regions, and in each region, we searched for 10mers that were unevenly distributed between the two strands, that is, 10mers that were present at least sixfold more often on the coding than on the non‐coding strand. The 5′‐end of TSRs contained the highest number of unevenly distributed 10mers, which decreased toward the 3′‐end of the TSR (Fig 3A and Dataset EV1). The vast majority of enriched 10mers contained long stretches of Gs or Ts (Fig 3B and Dataset EV1), which is in good agreement with previous reports of a G‐to‐C and T‐to‐A skew in L. major and T. brucei (McDonagh et al, 2000; Siegel et al, 2009).
To test our hypothesis that these GT‐rich elements are capable of driving directional transcription, we generated two synthetic promoters (210 and 416 bp) composed of 10mers that are enriched on the coding strand (Appendix Fig S2, Table EV2). Integration of the synthetic GT‐rich promoters upstream of the FLUC gene yielded high luciferase activity, 2.4‐fold and 2.0‐fold higher than full‐length TSRs (compare Figs 2C and 3C). Furthermore, transcription was highly dependent on the direction of the GT‐rich element, supporting our hypothesis that DNA sequence elements confer directional transcription. Insertion of GT‐rich sequence elements between divergent TSRs had no effect on the transcript levels of genes upstream or downstream of the insertion site (Fig EV2).
To compare the activity of the GT‐rich sequence elements to that of endogenous RNA pol I and RNA pol II transcription, we targeted an rRNA promoter‐driven luciferase gene to the rDNA spacer and inserted a promoter‐less luciferase gene within a RNA pol II‐transcribed PTU. Not surprisingly, we find rRNA promoter‐driven luciferase expression to be 38.7‐ to 193.6‐fold higher when the transgene is located in the rDNA spacer compared to the region between two divergent RNA pol II TSRs. In agreement with earlier studies (Alsford et al, 2005), we find luciferase levels to vary greatly among clones, probably because the constructs integrated in different genomic locations. The large differences in transgene expression between insertion in the rDNA spacer and between divergent TSRs can be explained by the spatial restriction of RNA pol I to the nucleolus.
For RNA pol II‐driven luciferase, we find the endogenous levels to be 11.6‐fold higher than those induced by the GT‐rich element and the GT‐rich element‐induced levels 18.5‐fold higher than those from our negative control, not containing any promoter upstream of the luciferase gene (Fig EV3).
Thus, our results indicate that GT‐rich elements can ensure directional transcription initiation, albeit not at the levels of endogenous RNA pol II transcription, which likely reflects the influence of the genomic location.
GT‐rich promoter elements induce targeted deposition of H2A.Z
Previous studies have reported that RNA pol II transcription initiation in T. brucei occurs in regions that lack promoter elements, suggesting that a permissive chromatin structure, which allows RNA pol II to access the DNA, may be sufficient for transcription initiation (Marchetti et al, 1998; McAndrew et al, 1998). In support of this hypothesis, we have previously found TSRs to be enriched in specific different histone variants and histone modifications, including the histone variant H2A.Z (Siegel et al, 2009; Wright et al, 2010).
To determine whether GT‐rich promoter elements contribute to the enrichment of histone variants at TSRs, we mapped the genome‐wide distribution of H2A.Z in transgenic cell lines containing GT‐rich sequence elements of two different lengths. The data revealed that both sites containing synthetic GT‐elements were enriched in H2A.Z. However, levels of H2A.Z were lower across GT‐elements than across an endogenous TSR (Fig 3D).
Next, we investigated the ability of GT‐rich elements to drive transcription and to promote H2A.Z deposition over time. Chromatin structures are dynamic, and it is possible that following insertion of the GT‐rich element, it may take several cell divisions for transcription levels and H2A.Z levels to peak. Therefore, we re‐generated the cell lines containing the short and the long GT‐rich promoter construct, repeated the luciferase assays 8 and 30 days post‐infection, and observed a significant change in luciferase expression over time (Fig EV4). In addition, we repeated the H2A.Z MNase‐ChIP‐seq assay 8 days post‐infection. For the short GT‐rich element, we saw a time‐dependent increase in H2A.Z levels, corresponding to the increase in luciferase activity. For the long GT‐rich element, no such increase was observed (Fig 3D).
Thus, while GT‐rich sequence elements can induce transcription and promote the targeted deposition of H2A.Z, the degree to which it does may vary over time. In addition, our data indicate that relocated TSR sequences and GT‐rich promoter‐induced transcription and/or H2A.Z levels do not reach those of endogenous RNA pol II promoters, highlighting again the importance of the genomic context.
Sites enriched in H2A.Z show increased sensitivity to MNase
Numerous studies performed in different organisms, including T. brucei, have suggested that nucleosomes containing H2A.Z are less stable than canonical nucleosomes (Suto et al, 2000; Abbott et al, 2001; Zhang et al, 2005; Jin & Felsenfeld, 2007; Siegel et al, 2009). To understand the biological significance of decreased nucleosome stability, we determined whether DNA associated with H2A.Z, that is, DNA within TSRs, is more accessible to proteins than DNA outside of TSRs.
To assess DNA accessibility, we established an MNase‐ChIP‐seq approach for T. brucei. This approach combines MNase digestion of formaldehyde‐cross‐linked chromatin with immunoprecipitation of nucleosomes and high‐throughput paired‐end DNA sequencing (Fig EV5A). MNase preferentially cleaves within linker DNA between nucleosomes. The resulting MNase digestion products, 147 bp in length, accurately reveal the positions of nucleosomes when they are sequenced and mapped back to the genome and can be used to generate nucleosome occupancy maps (Cole et al, 2012). In addition, MNase‐ChIP‐seq can be used to identify “loosely bound” nucleosomes. Regions containing loosely bound nucleosomes and, as a consequence, more accessible DNA yield on average shorter MNase cleavage products (< 147 bp) than regions that are composed of more compact chromatin (Weiner et al, 2010).
To avoid over‐digestion, we titrated MNase digestion such that a small population of dinucleosomal DNA remained (Fig EV5B and C). Next, nucleosomes were immunoprecipitated with a custom‐made histone H3 antiserum (Gassen et al, 2012), and the DNA was isolated and sequenced. A total of 18.6 million concordantly aligning 100‐bp sequence reads could be mapped to the T. brucei genome corresponding to an average genome coverage of ~53×.
Despite T. brucei utilizing “unusual” mechanisms of gene regulation, the biophysical properties of DNA should be conserved. Given that dinucleotides vary considerably with respect to their bending properties, optimal nucleosome formation occurs when bendable dinucleotides (AT and TA) occur at intervals of 10 bp. The exact positioning of the histone octamer with respect to the ~10‐bp helical repeat is termed “rotational positioning” (Struhl & Segal, 2013). Thus, to validate the quality of our nucleosome occupancy maps, we determined the rotational positioning of T. brucei nucleosome sequences. As described for all organisms analyzed so far, we found a strong periodicity in AA/AT/TT/TA (Fig EV5D). These data indicate that dinucleotide patterns are important for the rotational positioning of trypanosome nucleosomes and suggest that our nucleosome occupancy maps are of high resolution.
Analysis of nucleosome occupancy across TSRs indicated no general depletion of nucleosomes. However, grouping of sequenced MNase digestion products based on size revealed that TSRs are enriched in short sub‐mononucleosomal fragments (100–130 nt) and depleted from long supra‐mononucleosomal fragments (> 175 nt) (Fig 4A). To validate this observation, we plotted the averaged nucleosome occupancy across all TSRs. This further supported an increase in sub‐mononucleosomal DNA fragments across TSRs (Fig 4B).
These findings indicate that DNA associated with H2A.Z‐containing nucleosomes is more easily digested than DNA bound to canonical nucleosomes, revealing TSRs as regions of increased DNA accessibility.
Nucleosome occupancy at exon boundaries correlates with RNA pol II enrichment and transcript levels
From humans to yeast, promoters that drive the transcription of ubiquitously expressed genes contain a well‐defined nucleosome‐depleted region (NDR) flanked by strongly positioned nucleosomes at the site of transcription initiation (Hughes & Rando, 2014). In addition to TSSs, strongly positioned nucleosomes have been observed at intron/exon boundaries in many different eukaryotes including other protozoan parasites such as Plasmodium falciparum (Schwartz et al, 2009; Tilgner et al, 2009; Kensche et al, 2016). Generally, exons show increased nucleosome occupancy compared to introns, and it has been proposed that this increase temporarily slows the rate of RNA pol II transcription, thereby facilitating co‐transcriptional recruitment of splicing factors (Naftelberg et al, 2015).
Given the broad regions of ubiquitous transcription initiation, we investigated whether T. brucei may contain a well‐defined NDR within each TSR. In the absence of sharp peaks of transcription initiation and because genes are organized in PTUs, we averaged the nucleosome occupancy across TSRs using the ATG of the first gene of each PTU as a proxy for TSSs. This analysis revealed a strong depletion of H3 and H2A.Z‐containing nucleosomes ~90 nt upstream of the ATG (Fig 5A and B). However, our primary transcript data had suggested that RNA pol II transcription initiation occurs at the 5′‐end of H2A.Z‐enriched regions, which is much further upstream than the ATG of the first gene. In addition, given that the median 5′ UTR length in T. brucei is ~90 nt (excluding the spliced‐leader RNA) (Siegel et al, 2011), the center of the NDR coincided with the 5′‐end of the 5′ UTR. Thus, our data point to a role of NDRs in RNA processing rather than transcription initiation.
With the exceptions of two genes, no cis‐splicing has been observed in T. brucei (Mair et al, 2000; Berriman et al, 2005). Nevertheless, during trans‐splicing an identical 39‐nt spliced‐leader sequence from the donor‐spliced‐leader RNA is transferred to the 5′‐end of every mRNA (Michaeli, 2011) and, the 5′‐end of the 5′ UTR of each gene serves as a splice acceptor site (SAS). Thus, should strongly positioned nucleosomes mark the exon boundary, rather than sites of transcription initiation, one would expect to find an NDR upstream of every gene, not just in TSRs. We therefore extended the analysis of nucleosomes to the remaining genes of PTUs (all genes except the first gene) and observed a similar pattern of positioning, indicating that most genes are preceded by an NDR (Fig 5C).
Should nucleosomes be able to slow the rate of RNA pol II elongation and thereby enhance splicing efficiency as previously proposed (Naftelberg et al, 2015), we would expect (i) RNA pol II levels to correlate with nucleosome occupancy and (ii) a strong increase in nucleosome occupancy to correlate with efficient trans‐splicing and, as a consequence, high RNA levels. Analyzing our RNA pol II ChIP‐seq data, we find RNA pol II levels to closely resemble those of nucleosome occupancy (Fig 5D). In addition, when grouping genes based on the level of detected transcripts, high (top 25%), intermediate (middle 25%), and low (bottom 25%), we find a well‐defined NDR upstream of genes with high and intermediate transcript levels, but not upstream of genes with low levels (Fig 5D).
Taken together, our data suggest that while TSRs lack well‐defined NDRs in T. brucei, strong nucleosome positioning across exon boundaries may affect the rate of RNA pol II and RNA processing.
Composition of polyY tract affects nucleosome positioning and gene expression
One important contributor to splicing is the pyrimidine‐rich (polyY) tract, located just upstream of the SAS. At the RNA level, polyY tracts serve as binding sites for the U2AF65 subunit of the spliceosome (Kielkopf et al, 2001). In addition, at the DNA level, homopolymeric sequences such as polyY tracts are intrinsically rigid and are thus strongly inhibitory to nucleosome formation (Suter et al, 2000).
Previous transient transfection experiments, in which the effect of nucleosome positioning could not be evaluated, indicated that length and composition of polyY tracts influence trans‐splicing in T. brucei (Siegel et al, 2005). To better understand the cause of nucleosome positioning at exon boundaries, we investigated the influence of the polyY tract in nucleosome positioning in vivo. To this end, we generated three transgenic cell lines carrying a luciferase reporter construct containing the short GT‐rich promoter followed by (i) a polyY tract found upstream of the highly expressed GPEET procyclin genes, (ii) a long T‐rich polyY tract shown to be among the most efficient in transient trans‐splicing assays, or (iii) no polyY tract. All constructs contained an identical GPEET 5′ UTR sequence. Luciferase levels were highest for the GPEET polyY tract, twofold lower for the T‐rich polyY tract, and virtually absent in the cell line lacking a polyY tract (Fig 6A). These measurements are in good agreement with transient transfection experiments and underline the importance of polyY tracts for efficient trans‐splicing.
Next, we used these cell lines to generate three additional high‐resolution nucleosome occupancy maps (as described above). The additional nucleosome occupancy maps were interesting in several aspects. First, we noticed that in all three cell lines, the GT‐rich sequence was strongly depleted of nucleosomes (Fig 6B). We suspect this depletion to be caused by the long homopolymeric G and T stretches present in the promoter element. The low overall amount of nucleosomes in this region may also explain why the H2A.Z levels at GT‐rich regions did not reach those of endogenous TSRs (Fig 3D).
Comparing the three additional nucleosome occupancy maps, it became apparent that the homopolymeric nature of the long T‐rich polyY tract, which should give rise to a rather rigid DNA double helix, resulted in an expansion of the NDR (Fig 6B, middle panel) compared to the nucleosome occupancy observed in the cell line carrying the GPEET polyY tract (Fig 6B, upper panel) and that carrying no polyY tract (Fig 6B, lower panel). Finally, our data revealed the presence of an NDR upstream of the ORF in the absence of a polyY tract (Fig 6B, lower panel).
Thus, while the polyY tract is clearly important for efficient trans‐splicing and can affect nucleosome positioning, there must be other elements that ensure the presence of NDRs. Such elements may be located in the 5′ UTR.
Discussion
Several recent genome‐wide studies performed in different higher eukaryotes revealed a strong association of ubiquitously expressed genes with dispersed promoters, lacking well‐defined sequence motifs. To elucidate the conservation of this association, we analyzed promoter motifs in T. brucei, an evolutionarily highly divergent eukaryote that lacks transcriptional regulation of RNA pol II‐transcribed genes. In addition, promoters and associated transcription factors have remained elusive in T. brucei; hence, how RNA pol II transcription is initiated in this parasite remains a fundamental question.
In this study, we have identified a GT‐rich promoter element capable of conferring transcription initiation and contributing to the targeted deposition of H2A.Z. Our data demonstrate that RNA pol II promoter sequence elements are present even in organisms lacking regulation of RNA pol II transcription initiation. Thus, our findings suggest that the link between a lack of well‐defined sequence elements and a lack of gene regulation may be highly conserved in evolution.
In contrast to the general conservation we have found at the DNA level, our data revealed significant differences regarding the chromatin structure at promoter sites. Unlike NDRs in other eukaryotes, in T. brucei we have shown NDRs to be associated with splice sites and not with sites of transcription initiation. Given the regulatory potential of NDRs, this difference may reflect the lack of transcriptional regulation in T. brucei and the importance of post‐transcriptional mechanisms of gene regulation, such as trans‐splicing. The observed link between nucleosome occupancy, polymerase levels, and splicing elements is not unprecedented. Genome‐wide nucleosome occupancy maps not only revealed NDRs across promoters, but also exposed a general increase of nucleosome occupancy in exons compared to introns and a preference of nucleosomes for constitutively spliced exons over alternatively spliced exons (Schwartz et al, 2009; Huang et al, 2012). Splicing of exons occurs co‐transcriptionally and so it has been proposed that nucleosomes positioned at the 5′‐end of exons may function as “speed bumps” to slow down RNA pol II elongation, thereby promoting inclusion of exons (Schwartz & Ast, 2010). According to this model, the presence or absence of nucleosomes at intron/exon boundaries influences exon selection, thereby affecting alternative splicing.
Cis‐splicing and trans‐splicing share many mechanistic similarities (Michaeli, 2011). Yet, mapping of SASs in T. brucei revealed an unexpected heterogeneity in SASs. For many genes, the major SAS was found to differ from one life cycle stage of the parasite to another, leading to the generation of transcripts with different 5′ UTRs (Kolev et al, 2010; Nilsson et al, 2010; Siegel et al, 2010). Thus, in the absence of transcriptional regulation, alternative trans‐splicing has been proposed as a mechanism to regulate gene expression. The factors contributing to life cycle‐specific SAS preferences are still unknown.
As reported for organisms that undergo cis‐splicing, our data reveal a strong increase in nucleosome occupancy at the 5′‐end of exons. In addition, our RNA pol II ChIP‐seq data suggest that the increase in nucleosome occupancy may act as a barrier and temporarily slow RNA pol II elongation leading to the observed increase in RNA pol II levels. A decrease in RNA pol II elongation speed may increase trans‐splicing efficiency. Furthermore, small changes in nucleosome positioning may slow the polymerase at different positions, thereby affecting the choice of SAS. Analogously, small differences in nucleosome positioning between different life cycle stages could contribute to the observed life cycle‐specific SAS preferences. These hypotheses could be corroborated in the future by comparing nucleosome occupancy maps from different life cycle stages of T. brucei.
One critical finding of our promoter assays was the importance of the genomic context; that is, we found the ability of specific DNA sequence elements to initiate transcription to vary greatly depending on the site of insertion. For example, insertion of an rRNA promoter‐driven luciferase gene between two divergent TSRs yielded low levels of luciferase, while insertion of the same construct into rDNA arrays yielded high levels. Insertion of a TSR sequence between two divergent TSRs led to intermediate levels of transcriptional activity, while insertion of the same construct into a region enriched in H3.V led to no activity. Insertions of GT‐rich elements yielded similar site‐specific effects.
A reason for the importance of the genomic context could be that the T. brucei genome possesses a well‐defined 3D genome architecture. This hypothesis is supported by several microscopy‐based studies that have observed clustering of specific RNA pol I, RNA pol II, and RNA pol lII subunits in distinct loci (Navarro & Gull, 2001; Uzureau et al, 2008; Alsford & Horn, 2011). Most notably, the distribution of RNA pol I is restricted to the nucleolus and the expression site body where the variant surface antigens are transcribed (Navarro & Gull, 2001). Thus, the target site‐specific differences in RNA pol I‐driven luciferase activity are in good agreement with an important role of genome organization.
The lack of transcriptional regulation, the organization of genes in long PTUs, and the apparent absence of promoter motifs in trypanosomatids have suggested fundamental differences in terms of gene regulation compared to other eukaryotes. In this study, we find that specific DNA sequence elements can drive directional transcription and affect local chromatin structure. Furthermore, our data underscore the importance of genomic context in gene expression and establish a link between nucleosome positioning and exon boundaries.
Thus, our findings suggest that despite its evolutionary divergence, T. brucei uses several of the mechanisms found in other eukaryotes to regulate its gene expression, implying that many of the strategies to regulate gene expression are highly conserved in evolution. At the same time, our data revealed fundamental differences regarding the patterns of nucleosome occupancy between higher eukaryotes and trypanosomes. The observed differences may reflect the lack of RNA pol II transcription regulation in T. brucei and its strong dependency on post‐transcriptional mechanisms of gene regulation.
Materials and Methods
For details of T. brucei manipulation, plasmids, and data analysis, see the Appendix Supplementary Methods.
Trypanosoma brucei culture
Trypanosoma brucei wild‐type and genetically modified strains were derived from Lister 427 bloodstream‐form MITat 1.2 (clone 221) or a derivative “single marker” cell line (SM) expressing T7 RNA polymerase and the Tet repressor (Wirtz & Clayton, 1995) and cultured at 37°C in HMI‐11 medium. Where appropriate, the following drug concentrations were used: 2 μg/ml G418, 5 μg/ml hygromycin, 50 μg/ml blasticidin, and 1 μg/ml doxycycline. Transfections were performed using a Nucleofector (Amaxa) as described previously (Scahill et al, 2008).
Sequencing of small 5′‐triphosphate‐containing RNA
Small RNA (< 200 nt) was purified from 5 × 107 cells using a combination of miRNeasy Mini (Qiagen) and RNeasy® MinElute® Cleanup (Qiagen) following the instructions of the manufacturer. Two micrograms of small RNA was treated with 1 unit of Terminator™ 5′‐Phosphate‐Dependent Exonuclease (TEX, Epicentre) in 1× Terminator Reaction Buffer B for 30 min at 42°C to remove 5′‐monophosphate RNA. Next, the RNA was purified and one half (+5′‐polyphosphatase) was treated with 20 units RNA 5′‐polyphosphatase to convert 5′‐triphosphate RNA to 5′‐monophosphate RNA. The second half (−5′‐polyphosphatase) was left untreated and served as a control. The sequencing libraries were constructed using the NEBNext® Multiplex Small RNA Library Prep Set for Illumina® (New England BioLabs) and sequenced on an Illumina® NextSeq 500.
RNA pol II ChIP‐seq
The RNA pol II ChIP was performed in cell lines, in which both alleles of RPB9, a subunit of the RNA pol II complex, contained an endogenous Ty1 tag. Except for minor changes, the ChIP was performed as described previously (Wedel & Siegel, 2017). In brief, 3 × 108 cells were harvested, formaldehyde‐cross‐linked, and permeabilized. After centrifugation, the pellet was resuspended in 600 μl of NP‐S buffer and sonicated for 50 cycles at low strength in a 15‐ml tube. After centrifugation, the supernatant was transferred to a new microcentrifuge tube and 60 μl was separated as input. Immunoprecipitation of DNA bound by RBP9 was performed using Dynabeads® Protein G (Invitrogen) coupled to a BB2 antibody (Bastin et al, 1996) at 4°C overnight and under the presence of 300 mM NaCl. Bound material was washed, cross‐links were reversed, and immunoprecipitated DNA was purified. Sequencing libraries were constructed and sequenced using an Illumina® NextSeq 500.
Dual‐luciferase assays
Luciferase activities were measured with the Dual‐Glo® Luciferase Assay System from Promega. A total of 5 × 106 cells were harvested, washed with 1× phosphate‐buffered saline (PBS; 2.7 nM KCl, 2 mM KH2PO4, 10 mM Na2HPO4, 137 mM NaCl), and resuspended in 200 μL 1× PBS. In a 96‐well plate, 50 μl cell suspension (1.25 × 106 cells) and 50 μl Dual‐Glo® Luciferase Reagent were mixed, and after 10 min, firefly luciferase activity was measured for 1 s in a Victor Light Luminometer. Fifty microliters Dual‐Glo® Stop&Glo® Reagent was added, and after 10 min, Renilla luciferase activity was measured. All measurements were performed at least in duplicate. To normalize for differences in cell number, firefly luciferase activity was normalized with Renilla luciferase activity. All raw and normalized luciferase measurements are listed in Dataset EV3.
Antibody production, affinity purification, and characterization
Polyclonal antibodies specific for H2A.Z were raised by immunizing rabbits with the following peptide: DDAVPQAPLVGGVAMSPEQAS, following a 145‐day immunization protocol (Pineda Antikörper‐Service). The antisera were affinity‐purified using SulfoLink Coupling Gel (ThermoFisher) with the immobilized peptide as described elsewhere (Harlow & Lane, 1999). To determine antibody specificity, different transgenic cell lines were analyzed by Western blotting (Appendix Fig S1) as described previously (Siegel et al, 2008).
Overexpression of Ty1‐tagged H2A.Z in ΔH2A.Z cells
To generate an H2A.Z double‐knockout mutant expressing Ty‐H2A.Z constitutively, we first introduced a Ty‐tagged H2A.Z in a ribosomal locus using pLEW111 (Hoek et al, 2000) upon removal of the tetracycline repressor by digestion with BglII. The knockout of both endogenous H2A.Z (Tb427.07.6360) alleles was introduced by using the pyrFEKO system (Scahill et al, 2008). To amplify the target regions, the following primers were used: upstream of H2A.Z CDS: CGGTACCAACACTAGACGGC, CGTGTCCGTGTATAATGCGC; downstream of H2A.Z CDS: TTGTTGCCTTCAGCTCGCTA, CACTAAAACGGGCCACCTCT.
MNase‐ChIP‐seq
Immunoprecipitation of mononucleosomes was performed as described previously (Wedel & Siegel, 2017). In brief, 2 × 106 cells were harvested, formaldehyde‐cross‐linked, and lysed using digitonin. To fragment the chromatin, the sample was digested with MNase. Immunoprecipitation of nucleosomal DNA was performed using Dynabeads® M‐280 sheep anti‐rabbit IgG (Invitrogen) coupled to polyclonal H3 rabbit antiserum (Gassen et al, 2012) at 4°C for 2 h or coupled to our custom‐made polyclonal affinity‐purified H2A.Z rabbit antibody (this publication) at RT for 30 min in the presence of 0.05% SDS. Bound material was washed and eluted. Cross‐links were reversed, and mononucleosomal DNA was purified. ChIP‐seq libraries were constructed using 35 ng of immunoprecipitated DNA or 35 ng of input DNA and sequenced on an Illumina® HiSeq 2500 or NextSeq 500.
Mapping, normalization, and visualization of sequencing data
Immunoprecipitated DNA samples were sequenced in paired‐end mode using an Illumina HiSeq 2500 or an Illumina NextSeq 500 sequencer with 2 × 100 and 2 × 76 cycles, respectively. The processing of the sequencing data was performed as described previously (Wedel & Siegel, 2017). In brief, the trimmed and clipped reads were mapped with bowtie2 version 2.1.0 (Langmead & Salzberg, 2012) or bowtie version 1.1.1 (Langmead et al, 2009) to the reference genomes Tb927v24 or Tb427v24 with different settings listed in Dataset EV2. The alignments in SAM format were converted to BAM format, sorted, and indexed using samtools version 0.1.19‐44428 cd (Li et al, 2009). If necessary, non‐uniquely mapped reads were removed. The genomes were downloaded from EuPathDB (Aurrecoechea et al, 2013) and used as reference in all analyses. To normalize the number of reads to counts per billion reads (CBR), wiggle files were generated with COVERnant version 0.3.0 subcommand ratio (available for download at GitHub https://github.com/konrad/COVERnant). For visualization of the aligned reads, coverage values for the region of interest were extracted and visualized with GraphPad Prism version 5.0b.
Data and source code availability
All sequencing data generated for this publication have been deposited in NCBI's Gene Expression Omnibus (Edgar et al, 2002) and are accessible through GEO Series accession number GSE98061 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE98061). The computational data analysis was implemented as a Unix shell script, which together with further programs generated for this study is available at https://doi.org/10.5281/zenodo.438156.
Author contributions
TNS designed research; CW and RD performed research; CW, KUF, and TNS analyzed data; and CW and TNS wrote the manuscript.
Conflict of interest
The authors declare that they have no conflict of interest.
Supporting information
Acknowledgements
We thank Stan Gorski and all members of the Siegel Laboratory for critical reading of the manuscript and Christian Janzen, Amelie Kraus, and Jens Hör for many valuable discussions. This work was funded by the Young Investigator Program of the Research Center of Infectious Diseases (ZINF) of the University of Würzburg, Germany, and the grant SI 1610/2‐1 from The German Research Foundation DFG.
The EMBO Journal (2017) 36: 2581–2594
References
- Abbott DW, Ivanova VS, Wang X, Bonner WM, Ausió J (2001) Characterization of the stability and folding of H2A.Z chromatin particles: implications for transcriptional activation. J Biol Chem 276: 41945–41949 [DOI] [PubMed] [Google Scholar]
- Adelman K, Lis JT (2012) Promoter‐proximal pausing of RNA polymerase II: emerging roles in metazoans. Nat Rev Genet 13: 720–731 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Albert I, Mavrich TN, Tomsho LP, Qi J, Zanton SJ, Schuster SC, Pugh BF (2007) Translational and rotational settings of H2A.Z nucleosomes across the Saccharomyces cerevisiae genome. Nature 446: 572–576 [DOI] [PubMed] [Google Scholar]
- Alsford S, Kawahara T, Glover L, Horn D (2005) Tagging a T. brucei RRNA locus improves stable transfection efficiency and circumvents inducible expression position effects. Mol Biochem Parasitol 144: 142–148 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Alsford S, Horn D (2011) Elongator protein 3b negatively regulates ribosomal DNA transcription in African trypanosomes. Mol Cell Biol 31: 1822–1832 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aurrecoechea C, Barreto A, Brestelli J, Brunk BP, Cade S, Doherty R, Fischer S, Gajria B, Gao X, Gingle A, Grant G, Harb OS, Heiges M, Hu S, Iodice J, Kissinger JC, Kraemer ET, Li W, Pinney DF, Pitts B et al (2013) EuPathDB: the eukaryotic pathogen database. Nucleic Acids Res 41: D684–D691 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barski A, Cuddapah S, Cui K, Roh TY, Schones DE, Wang Z, Wei G, Chepelev I, Zhao K (2007) High‐resolution profiling of histone methylations in the human genome. Cell 129: 823–837 [DOI] [PubMed] [Google Scholar]
- Bastin P, Bagherzadeh A, Matthews KR, Gull K (1996) A novel epitope tag system to study protein targeting and organelle biogenesis in Trypanosoma brucei . Mol Biochem Parasitol 77: 235–239 [DOI] [PubMed] [Google Scholar]
- Berriman M, Ghedin E, Hertz‐Fowler C, Blandin G, Renauld H, Bartholomeu DC, Lennard NJ, Caler E, Hamlin NE, Haas B, Bohme U, Hannick L, Aslett MA, Shallom J, Marcello L, Hou L, Wickstead B, Alsmark UC, Arrowsmith C, Atkin RJ et al (2005) The genome of the African trypanosome Trypanosoma brucei . Science 309: 416–422 [DOI] [PubMed] [Google Scholar]
- Burge CB, Tuschl T, Sharp JA (1999) Splicing of precursors to mRNAs by the spliceosomes In The RNA world: the nature of modern RNA suggests a prebiotic RNA, Gesteland RF, Cech T, Atkins JF. (eds), pp 525–559. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press; [Google Scholar]
- Campbell DA, Thornton DA, Boothroyd JC (1984) Apparent discontinuous transcription of Trypanosoma brucei variant surface antigen genes. Nature 311: 350–355 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carninci P, Sandelin A, Lenhard B, Katayama S, Shimokawa K, Ponjavic J, Semple CA, Taylor MS, Engstrom PG, Frith MC, Forrest AR, Alkema WB, Tan SL, Plessy C, Kodzius R, Ravasi T, Kasukawa T, Fukuda S, Kanamori‐Katayama M, Kitazume Y et al (2006) Genome‐wide analysis of mammalian promoter architecture and evolution. Nat Genet 38: 626–635 [DOI] [PubMed] [Google Scholar]
- Clayton CE (2002) Life without transcriptional control? From fly to man and back again. EMBO J 21: 1881–1888 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cole HA, Howard BH, Clark DJ (2012) Genome‐wide mapping of nucleosomes in yeast using paired‐end sequencing. Methods Enzymol 513: 145–168 [DOI] [PubMed] [Google Scholar]
- Das A, Zhang Q, Palenchar JB, Chatterjee B, Cross GAM, Bellofatto V (2005) Trypanosomal TBP functions with the multisubunit transcription factor tSNAP to direct spliced‐leader RNA gene expression. Mol Cell Biol 25: 7314–7322 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deaton AM, Bird A (2011) CpG islands and the regulation of transcription. Genes Dev 25: 1010–1022 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Devaux S, Lecordier L, Uzureau P, Walgraffe D, Dierick JF, Poelvoorde P, Pays E, Vanhamme L (2006) Characterization of RNA polymerase II subunits of Trypanosoma brucei . Mol Biochem Parasitol 148: 60–68 [DOI] [PubMed] [Google Scholar]
- Edgar R, Domrachev M, Lash AE (2002) Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res 30: 207–210 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fadda A, Ryten M, Droll D, Rojas F, Farber V, Haanstra JR, Merce C, Bakker BM, Matthews K, Clayton C (2014) Transcriptome‐wide analysis of trypanosome mRNA decay reveals complex degradation kinetics and suggests a role for co‐transcriptional degradation in determining mRNA levels. Mol Microbiol 94: 307–326 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Figueiredo LM, Cross GA (2010) Nucleosomes are depleted at the VSG expression site transcribed by RNA polymerase I in African trypanosomes. Eukaryot Cell 9: 148–154 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gassen A, Brechtefeld D, Schandry N, Arteaga‐Salas JM, Israel L, Imhof A, Janzen CJ (2012) DOT1A‐dependent H3K76 methylation is required for replication regulation in Trypanosoma brucei . Nucleic Acids Res 40: 10302–10311 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harlow E, Lane D (1999) Using antibodies: a laboratory manual. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press; [Google Scholar]
- van Heeringen SJ, Akhtar W, Jacobi UG, Akkers RC, Suzuki Y, Veenstra GJ (2011) Nucleotide composition‐linked divergence of vertebrate core promoter architecture. Genome Res 21: 410–421 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoek M, Engstler M, Cross GA (2000) Expression‐site‐associated gene 8 (ESAG8) of Trypanosoma brucei is apparently essential and accumulates in the nucleolus. J Cell Sci 113: 3959–3968 [DOI] [PubMed] [Google Scholar]
- Huang H, Yu S, Liu H, Sun X (2012) Nucleosome organization in sequences of alternative events in human genome. Biosystems 109: 214–219 [DOI] [PubMed] [Google Scholar]
- Hughes AL, Rando OJ (2014) Mechanisms underlying nucleosome positioning in vivo . Annu Rev Biophys 43: 41–63 [DOI] [PubMed] [Google Scholar]
- Jin C, Felsenfeld G (2007) Nucleosome stability mediated by histone variants H3.3 and H2A.Z. Genes Dev 21: 1519–1529 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kadonaga JT (2012) Perspectives on the RNA polymerase II core promoter. Wiley Interdiscip Rev Dev Biol 1: 40–51 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kensche PR, Hoeijmakers WA, Toenhake CG, Bras M, Chappell L, Berriman M, Bártfai R (2016) The nucleosome landscape of Plasmodium falciparum reveals chromatin architecture and dynamics of regulatory sequences. Nucleic Acids Res 44: 2110–2124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kielkopf CL, Rodionova NA, Green MR, Burley SK (2001) A novel peptide recognition mode revealed by the X‐ray structure of a core U2AF35/U2AF65 heterodimer. Cell 106: 595–605 [DOI] [PubMed] [Google Scholar]
- Koch F, Fenouil R, Gut M, Cauchy P, Albert TK, Zacarias‐Cabeza J, Spicuglia S, de la Chapelle AL, Heidemann M, Hintermair C, Eick D, Gut I, Ferrier P, Andrau JC (2011) Transcription initiation platforms and GTF recruitment at tissue‐specific enhancers and promoters. Nat Struct Mol Biol 18: 956–963 [DOI] [PubMed] [Google Scholar]
- Kolev NG, Franklin JB, Carmi S, Shi H, Michaeli S, Tschudi C (2010) The transcriptome of the human pathogen Trypanosoma brucei at single‐nucleotide resolution. PLoS Pathog 6: e1001090 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kooter JM, De Lange T, Borst P (1984) Discontinuous synthesis of mRNA in trypanosomes. EMBO J 3: 2387–2392 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory‐efficient alignment of short DNA sequences to the human genome. Genome Biol 10: R25 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Langmead B, Salzberg SL (2012) Fast gapped‐read alignment with Bowtie 2. Nat Methods 9: 357–359 [DOI] [PMC free article] [PubMed] [Google Scholar]
- LeBowitz JH, Smith HQ, Rusche L, Beverley SM (1993) Coupling of poly(A) site selection and trans‐splicing in Leishmania . Genes Dev 7: 996–1007 [DOI] [PubMed] [Google Scholar]
- Lenhard B, Sandelin A, Carninci P (2012) Metazoan promoters: emerging characteristics and insights into transcriptional regulation. Nat Rev Genet 13: 233–245 [DOI] [PubMed] [Google Scholar]
- Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25: 2078–2079 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mair G, Shi H, Li H, Djikeng A, Aviles HO, Bishop JR, Falcone FH, Gavrilescu C, Montgomery JL, Santori MI, Stern LS, Wang Z, Ullu E, Tschudi C (2000) A new twist in trypanosome RNA metabolism: cis‐splicing of pre‐mRNA. RNA 6: 163–169 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marchetti MA, Tschudi C, Silva E, Ullu E (1998) Physical and transcriptional analysis of the Trypanosoma brucei genome reveals a typical eukaryotic arrangement with close interspersion of RNA polymerase II‐ and III‐transcribed genes. Nucleic Acids Res 26: 3591–3598 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marques M, Laflamme L, Gervais AL, Gaudreau L (2010) Reconciling the positive and negative roles of histone H2A.Z in gene transcription. Epigenetics 5: 267–272 [DOI] [PubMed] [Google Scholar]
- Martinez‐Calvillo S, Yan S, Nguyen D, Fox M, Stuart K, Myler PJ (2003) Transcription of Leishmania major Friedlin chromosome 1 initiates in both directions within a single region. Mol Cell 11: 1291–1299 [DOI] [PubMed] [Google Scholar]
- Matthews KR, Tschudi C, Ullu E (1994) A common pyrimidine‐rich motif governs trans‐splicing and polyadenylation of tubulin polycistronic pre‐mRNA in trypanosomes. Genes Dev 8: 491–501 [DOI] [PubMed] [Google Scholar]
- Mavrich TN, Ioshikhes IP, Venters BJ, Jiang C, Tomsho LP, Qi J, Schuster SC, Albert I, Pugh BF (2008a) A barrier nucleosome model for statistical positioning of nucleosomes throughout the yeast genome. Genome Res 18: 1073–1083 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mavrich TN, Jiang C, Ioshikhes IP, Li X, Venters BJ, Zanton SJ, Tomsho LP, Qi J, Glaser RL, Schuster SC, Gilmour DS, Albert I, Pugh BF (2008b) Nucleosome organization in the Drosophila genome. Nature 453: 358–362 [DOI] [PMC free article] [PubMed] [Google Scholar]
- McAndrew M, Graham S, Hartmann C, Clayton C (1998) Testing promoter activity in the trypanosome genome: isolation of a metacyclic‐type VSG promoter, and unexpected insights into RNA polymerase II transcription. Exp Parasitol 90: 65–76 [DOI] [PubMed] [Google Scholar]
- McDonagh PD, Myler PJ, Stuart K (2000) The unusual gene organization of Leishmania major chromosome 1 may reflect novel transcription processes. Nucleic Acids Res 28: 2800–2803 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Michaeli S (2011) Trans‐splicing in trypanosomes: machinery and its impact on the parasite transcriptome. Future Microbiol 6: 459–474 [DOI] [PubMed] [Google Scholar]
- Mizuguchi G, Shen X, Landry J, Wu WH, Sen S, Wu C (2004) ATP‐driven exchange of histone H2AZ variant catalyzed by SWR1 chromatin remodeling complex. Science 303: 343–348 [DOI] [PubMed] [Google Scholar]
- Naftelberg S, Schor IE, Ast G, Kornblihtt AR (2015) Regulation of alternative splicing through coupling with transcription and chromatin structure. Annu Rev Biochem 84: 165–198 [DOI] [PubMed] [Google Scholar]
- Navarro M, Gull K (2001) A pol I transcriptional body associated with VSG mono‐allelic expression in Trypanosoma brucei . Nature 414: 759–763 [DOI] [PubMed] [Google Scholar]
- Ni T, Corcoran DL, Rach EA, Song S, Spana EP, Gao Y, Ohler U, Zhu J (2010) A paired‐end sequencing strategy to map the complex landscape of transcription initiation. Nat Methods 7: 521–527 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nilsson D, Gunasekera K, Mani J, Osteras M, Farinelli L, Baerlocher L, Roditi I, Ochsenreiter T (2010) Spliced leader trapping reveals widespread alternative splicing patterns in the highly dynamic transcriptome of Trypanosoma brucei . PLoS Pathog 6: e1001037 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perry KL, Watkins KP, Agabian N (1987) Trypanosome mRNAs have unusual “cap 4” structures acquired by addition of a spliced leader. Proc Natl Acad Sci USA 84: 8190–8194 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quintales L, Vazquez E, Antequera F (2015) Comparative analysis of methods for genome‐wide nucleosome cartography. Brief Bioinform 16: 576–587 [DOI] [PubMed] [Google Scholar]
- Raisner RM, Hartley PD, Meneghini MD, Bao MZ, Liu CL, Schreiber SL, Rando OJ, Madhani HD (2005) Histone variant H2A.Z marks the 5′ ends of both active and inactive genes in euchromatin. Cell 123: 233–248 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roeder RG (1996) The role of general initiation factors in transcription by RNA polymerase II. Trends Biochem Sci 21: 327–335 [PubMed] [Google Scholar]
- Sandelin A, Carninci P, Lenhard B, Ponjavic J, Hayashizaki Y, Hume DA (2007) Mammalian RNA polymerase II core promoters: insights from genome‐wide studies. Nat Rev Genet 8: 424–436 [DOI] [PubMed] [Google Scholar]
- Saxonov S, Berg P, Brutlag DL (2006) A genome‐wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters. Proc Natl Acad Sci USA 103: 1412–1417 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scahill MD, Pastar I, Cross GAM (2008) CRE recombinase‐based positive‐negative selection systems for genetic manipulation in Trypanosoma brucei . Mol Biochem Parasitol 157: 73–82 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schimanski B, Nguyen TN, Günzl A (2005) Characterization of a multisubunit transcription factor complex essential for spliced‐leader RNA gene transcription in Trypanosoma brucei . Mol Cell Biol 25: 7303–7313 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schwartz S, Meshorer E, Ast G (2009) Chromatin organization marks exon‐intron structure. Nat Struct Mol Biol 16: 990–995 [DOI] [PubMed] [Google Scholar]
- Schwartz S, Ast G (2010) Chromatin density and splicing destiny: on the cross‐talk between chromatin structure and splicing. EMBO J 29: 1629–1636 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Segal E, Widom J (2009) Poly(dA:dT) tracts: major determinants of nucleosome organization. Curr Opin Struct Biol 19: 65–71 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siegel TN, Tan KS, Cross GAM (2005) Systematic study of sequence motifs for RNA trans splicing in Trypanosoma brucei . Mol Cell Biol 25: 9586–9594 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siegel TN, Kawahara T, Degrasse JA, Janzen CJ, Horn D, Cross GAM (2008) Acetylation of histone H4K4 is cell cycle regulated and mediated by HAT3 in Trypanosoma brucei . Mol Microbiol 67: 762–771 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siegel TN, Hekstra DR, Kemp LE, Figueiredo LM, Lowell JE, Fenyo D, Wang X, Dewell S, Cross GAM (2009) Four histone variants mark the boundaries of polycistronic transcription units in Trypanosoma brucei . Genes Dev 23: 1063–1076 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siegel TN, Hekstra DR, Wang X, Dewell S, Cross GAM (2010) Genome‐wide analysis of mRNA abundance in two life‐cycle stages of Trypanosoma brucei and identification of splicing and polyadenylation sites. Nucleic Acids Res 38: 4946–4957 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siegel TN, Gunasekera K, Cross GA, Ochsenreiter T (2011) Gene expression in Trypanosoma brucei: lessons from high‐throughput RNA sequencing. Trends Parasitol 27: 434–441 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stanne TM, Rudenko G (2010) Active VSG expression sites in Trypanosoma brucei are depleted of nucleosomes. Eukaryot Cell 9: 136–147 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Struhl K, Segal E (2013) Determinants of nucleosome positioning. Nat Struct Mol Biol 20: 267–273 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suter B, Schnappauf G, Thoma F (2000) Poly(dA.dT) sequences exist as rigid DNA structures in nucleosome‐free yeast promoters in vivo . Nucleic Acids Res 28: 4083–4089 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suto RK, Clarkson MJ, Tremethick DJ, Luger K (2000) Crystal structure of a nucleosome core particle containing the variant histone H2A.Z. Nat Struct Biol 7: 1121–1124 [DOI] [PubMed] [Google Scholar]
- Sutton RE, Boothroyd JC (1986) Evidence for trans splicing in trypanosomes. Cell 47: 527–535 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomas S, Green A, Sturm NR, Campbell DA, Myler PJ (2009) Histone acetylations mark origins of polycistronic transcription in Leishmania major . BMC Genom 10: 152 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tilgner H, Nikolaou C, Althammer S, Sammeth M, Beato M, Valcarcel J, Guigo R (2009) Nucleosome positioning as a determinant of exon recognition. Nat Struct Mol Biol 16: 996–1001 [DOI] [PubMed] [Google Scholar]
- Uzureau P, Daniels JP, Walgraffe D, Wickstead B, Pays E, Gull K, Vanhamme L (2008) Identification and characterization of two trypanosome TFIIS proteins exhibiting particular domain architectures and differential nuclear localizations. Mol Microbiol 69: 1121–1136 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vasquez JJ, Hon CC, Vanselow JT, Schlosser A, Siegel TN (2014) Comparative ribosome profiling reveals extensive translational complexity in different Trypanosoma brucei life cycle stages. Nucleic Acids Res 42: 3623–3637 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wedel C, Siegel TN (2017) Genome‐wide analysis of chromatin structures in Trypanosoma brucei using high‐resolution MNase‐ChIP‐seq. Exp Parasitol 180: 2–12 [DOI] [PubMed] [Google Scholar]
- Weiner A, Hughes A, Yassour M, Rando OJ, Friedman N (2010) High‐resolution nucleosome mapping reveals transcription‐dependent promoter packaging. Genome Res 20: 90–100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wirtz E, Clayton C (1995) Inducible gene expression in trypanosomes mediated by a prokaryotic repressor. Science 268: 1179–1183 [DOI] [PubMed] [Google Scholar]
- Wright JR, Siegel TN, Cross GAM (2010) Histone H3 trimethylated at lysine 4 is enriched at probable transcription start sites in Trypanosoma brucei . Mol Biochem Parasitol 136: 434–450 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu X, Sharp PA (2013) Divergent transcription: a driving force for new gene origination? Cell 155: 990–996 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yamamoto YY, Yoshitsugu T, Sakurai T, Seki M, Shinozaki K, Obokata J (2009) Heterogeneity of Arabidopsis core promoters revealed by high‐density TSS analysis. Plant J 60: 350–362 [DOI] [PubMed] [Google Scholar]
- Yen K, Vinayachandran V, Pugh BF (2013) SWR‐C and INO80 chromatin remodelers recognize nucleosome‐free regions near +1 nucleosomes. Cell 154: 1246–1256 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yuan GC, Liu YJ, Dion MF, Slack MD, Wu LF, Altschuler SJ, Rando OJ (2005) Genome‐scale identification of nucleosome positions in S. cerevisiae . Science 309: 626–630 [DOI] [PubMed] [Google Scholar]
- Zhang Z, Dietrich FS (2005) Mapping of transcription start sites in Saccharomyces cerevisiae using 5′ SAGE. Nucleic Acids Res 33: 2838–2851 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang H, Roberts DN, Cairns BR (2005) Genome‐wide dynamics of Htz1, a histone H2A variant that poises repressed/basal promoters for activation through histone loss. Cell 123: 219–231 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zlatanova J, Thakar A (2008) H2A.Z: view from the top. Structure 16: 166–179 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.