Skip to main content
RNA logoLink to RNA
. 2026 Feb;32(2):250–266. doi: 10.1261/rna.080659.125

Comprehensive mapping of the 5′ and 3′ untranslated regions of Aspergillus fumigatus reveals diverse mechanisms of mRNA processing including premature transcription termination

Lukas Schrettenbrunner 1,, Corinne Maufrais 2,3, Guilhem Janbon 2, Edward WJ Wallace 4, Matthew G Blango 1
PMCID: PMC12810187  PMID: 41326228

Abstract

In the 20 years since the first genome sequencing of Aspergillus fumigatus, the field has seen an explosion in both the number of sequenced genomes and our molecular understanding of this ubiquitous human fungal pathogen. Despite an improved knowledge of the A. fumigatus genome, we still know little about the transcriptome, with key regulatory sequences like the untranslated regions of mRNA based only on in silico predictions and bulk RNA-seq. Here, we provide an improved description of 5′ and 3′ untranslated regions of A. fumigatus poly(A)-enriched RNA through experimental mapping of transcription start sites and polyadenylation sites using 5′ and 3′ End-Seq. We assigned high-quality 5′ ends to 2747 genes (average length 126 nt), 3′ ends to 7079 genes (average length 268 nt), and improved our understanding of the regulatory landscape of A. fumigatus gene expression. We leveraged the refined 5′ UTRs to identify upstream open reading frames and binding sites for important RNA binding proteins like the translational regulator Ssd1 and the 3′ UTRs to define binding sites for PUF proteins known to contribute to mRNA localization and regulation. Although a single isoform typically dominated expression, we observed 148 instances of alternative start sites and 1675 alternative stop sites. Interestingly, we detected multiple examples of premature transcriptional termination, including the first evidence for promoter-proximal premature transcriptional termination in a member of the Eurotiomycetes. Ultimately, we provide a resource to the Aspergillus community and an accurate starting point for unraveling the complexities of gene regulation in an important human pathogen.

Keywords: Aspergillus fumigatus, End-Seq, untranslated regions (UTRs), fungal pathogen, premature termination

INTRODUCTION

Transcriptome annotations are currently relied upon for myriad molecular investigations, yet in most cases these annotations themselves remain at best a work in progress. The value of a proper transcriptome annotation is obvious, as accuracy in transcription start and end sites facilitates definition of the regulatory elements responsible for the resultant RNA transcript and informs on transcription initiation, elongation, and termination. With correct transcript boundaries comes an improved understanding of upstream regulatory elements like promoters and enhancers; post-transcriptional regulatory elements like upstream open reading frames (uORFs) that ultimately contribute to mRNA stability and translation efficiency; and the coding potential of a given RNA (Aspden et al. 2023). Despite the clear advantages of an accurately annotated transcriptome and coding sequence, these resources are largely lacking from many eukaryotic pathogens and nonmodel organisms, a complicating factor we will partially remedy here for one noteworthy human fungal pathogen, Aspergillus fumigatus.

A. fumigatus is a World Health Organization critical priority fungal pathogen of the Eurotiomycetes class, highly diverged from the ascomycete model yeasts Saccharomyces cerevisiae and Schizosaccharomyces pombe (Li et al. 2021). A. fumigatus was first sequenced in 2005 using the reference strain and clinical isolate Af293, collected from a postmortem lung biopsy of a neutropenic patient (Nierman et al. 2005). A second commonly used laboratory strain of the CEA10 lineage (A1163) was sequenced in 2008 (Fedorova et al. 2008), and the genome has been further improved and expanded by additional sequencing efforts, including a major expansion in sequenced strains through study of the pangenome (Barber et al. 2021; Bertuzzi et al. 2021; Horta et al. 2022; Lofgren et al. 2022; Rhodes et al. 2022). The first telomere-to-telomere assembly of A. fumigatus was published in 2022 using A1160 (a descendant strain of CEA10) and further broadened our understanding of the genomic architecture (Bowyer et al. 2022). The transcriptome of the A. fumigatus reference strain Af293 was initially annotated with the help of RNA-seq data and computational predictions, yielding better predicted UTRs than the more pathogenic CEA10 strain. To date, limited experimental evidence exists confirming the accuracy of the gene predictions for any of the A. fumigatus strains mentioned (Krappmann et al. 2004; Jöchl et al. 2008; Müller et al. 2012).

The boundaries of an RNA transcript, like a messenger RNA (mRNA) or long noncoding RNA (lncRNA), are bookended by a transcription start site at the 5′ end and a transcriptional terminator at the 3′ end. In mRNA and some lncRNA, a polyadenylation site (PAS) at the 3′ end of the transcript determines the site of cleavage prior to poly(A) addition. These features are not precisely defined, but instead are composed of multiple sites that provide regulatory versatility to the transcribed RNA. Much work has been done to assign transcript start sites (TSSs) and PAS in a wide variety of organisms, particularly in model organisms like S. cerevisiae (McMillan et al. 2019; Dang et al. 2022). Software predictions are routinely performed with the support of RNA-seq data to assign untranslated regions (UTRs), but in many cases these predictions are incomplete and miss important regulatory features (Behr et al. 2013; Shenker et al. 2015). Sequencing approaches designed to enrich for transcript ends provide a much more accurate picture of the mature RNA with UTRs than standard RNA-seq (Pelechano et al. 2014; Malabat et al. 2015; Afik et al. 2017).

In many organisms, including fungi like Cryptococcus neoformans and Neurospora crassa, the selection of TSS dictates the inclusion or exclusion of regulatory elements such as mitochondrial localization signals or uORFs (Wallace et al. 2020). The resulting alternative protein forms have been termed echoforms to describe the case where a single gene encodes two or more differentially localized proteins (Yogev and Pines 2011; Bader et al. 2020). Recent work in C. neoformans provided a more complete characterization of TSSs and uncovered a novel transcription factor dictating alternative TSS selection (Dang et al. 2024). In Aspergillus nidulans, the conidiation factor brlA is transcribed in two different isoforms. brlAβ is transcribed during the hyphal stage and contains a uORF, which blocks translation of the main ORF at this developmental stage. The TSS of brlAα, on the other hand, is downstream from the uORF and is utilized during conidiation allowing for translation of the transcript (Han et al. 1993; Prade and Timberlake 1993). In A. fumigatus, the best example of such regulation stems from studies of the conserved transcriptional activator cross-pathway control A (CpcA; Gcn4p in yeast), which coordinates amino acid starvation responses (Krappmann et al. 2004). In S. cerevisiae, four uORFs in the 5′ leader of GCN4 orchestrate translation (Gunišová and Valášek 2014), whereas the pathogenic yeast Candida albicans harbors a single regulatory uORF (Sundaram and Grant 2014). A. fumigatus cpcA has two uORFs but is also capable of autoregulation (Krappmann et al. 2004), highlighting the variability of such elements even at one vital, conserved locus.

Akin to transcription start sites, alternative polyadenylation (aPAS) sites are equally dynamic and important, with clear links to RNA stability (Tian and Manley 2017), RNA localization (Arora et al. 2022), and modulation of protein production (de Prisco et al. 2023), among others. In fungi, a range of roles have been described for aPAS, including regulation of stress resistance and virulence (Franceschetti et al. 2011; Graber et al. 2013). An underestimated and often overlooked mechanism of gene regulation is premature transcription termination (PTT) (Kamieniarz-Gdula and Proudfoot 2019). PTT is most prevalent in metazoans as promoter proximal stalling of RNA polymerase II (Jonkers and Lis 2015). If stalling is not resolved, this can lead to PTT. Whether fungal-specific PTT mechanisms are functional in A. fumigatus is unknown. PTT can also be triggered by cryptic poly(A) signals often located within introns (Hoque et al. 2013). These examples demonstrate the regulatory potential of transcriptional start site and polyadenylation site selection.

Here, we improve our understanding of the A. fumigatus CEA10 transcriptome by performing 5′ and 3′ End-Seq coupled with RNA-seq to experimentally validate the ends of poly(A)-tailed RNA molecules. A. fumigatus has UTRs similar in length to ascomycetes like S. pombe, but relatively shorter 5′ UTRs and longer 3′UTRs compared to the basidiomycete pathogen C. neoformans. We uncover intriguing examples of premature transcription termination and other regulation that will serve as a resource to the community moving forward.

RESULTS

End-Seq defines 5′ and 3′ ends of A. fumigatus CEA10 transcripts

To determine baseline gene expression and transcript start (TSS) and end sites (TES) of A. fumigatus strain CEA10, four biological replicates of mycelium were grown in AMM liquid cultures for 24 h at 37°C (standard growth conditions) or 42°C (mild heat stress), followed by RNA-seq and 5′ and 3′ End-Seq of the same RNA samples. DESeq2 analysis on the RNA-seq data revealed minor changes due to heat stress, with 78 genes significantly downregulated and 75 upregulated (Supplemental Table S1; Supplemental Fig. S1). No GO-terms were enriched for the downregulated genes; however, for the upregulated genes, several enriched categories were identified using FungiDB.org (Amos et al. 2021). In the “Cellular Component” category, four terms related to the cell wall were significantly upregulated (Supplemental Table S2), consistent with the term “1,3-β-D-glucan metabolic process” that was observed in the “Biological Process” category. Collectively, these categories suggest minor cell wall stress, consistent with previous results from the literature indicating that A. fumigatus alters cell wall composition during heat stress (Fabri et al. 2021). Additionally, when we used FungiDB to search for enriched “Molecular Functions,” only the GO-term “HSP90 protein binding” was significantly enriched, hinting at a conserved heat shock response and adjustment of carbohydrate metabolism (Albrecht et al. 2010). To increase coverage of end sites, we combined all replicates for the downstream End-Seq analysis, as we only observed minor differences at the transcriptional level between the two growth conditions (including no alternative TSS/TES usage between differentially expressed genes).

5′ and 3′ End-Seq was performed on total RNA as described in Figure 1A. Of note, for 5′ End-Seq, poly(A)-enriched RNA was heat-fragmented prior to phosphorylation and degradation of uncapped 5′ ends, whereas for 3′ End-Seq, total RNA was heat-fragmented prior to poly(A) enrichment. 5′ End-Seq led to an enrichment upstream of many annotated start codons; however, there were also substantial reads mapping throughout the CDS and adjacent noncoding regions, whereas the resulting 3′ End-Seq reads were primarily located after the annotated end of the respective coding sequence (CDS), as expected.

FIGURE 1.

FIGURE 1.

End-Seq defines UTRs in A. fumigatus. (A) Graphical description of the sequencing strategy using commercially available 5′ and 3′ End-Seq kits from Eclipse Bioinnovations (USA). For 5′ End-Seq, we combined three biological replicates of fungus grown at 37°C and 42°C each for a total of six replicates, and for 3′ End-Seq, we had four biological replicates for each condition resulting in eight total replicates. (Created in BioRender, Blango 2025, https://biorender.com/lguy6bt.) (B) Schematic depiction of the filtering strategy to achieve high-confidence sites. (Created in BioRender, Blango 2025, https://biorender.com/j5p9m7c.) (CF) Example plots of mRNA-seq, 5′ End-Seq, and 3′ End-Seq read coverage. Orange indicates reads aligning to the + strand, while purple indicates the − strand. The annotated AUG is indicated by a black arrow, the CDS is indicated by the thick blue line, and introns by a thin blue line. The y-axis indicates total read counts. Read tracks were smoothened to improve viewability, and this infrequently masks some small introns. Arrowheads indicate the positions of detected high-confidence 5′/3′ end sites. Plots show (C) AFUB_009600, an mRNA splicing factor; (D) AFUB_066690 encoding Aspf2, a major allergen of A. fumigatus; (E) genes of the pseurotin A biosynthetic gene cluster; and (F) AFUB_069420 encoding CpcA with the length of the newly defined 5′ UTR indicated.

To determine the most prominent transcript end site and to exclude artifacts, we extracted the 5′ and 3′ end positions of the respective reads and applied a previously established peak-calling algorithm (Dang et al. 2024). We kept sites identified in at least three replicates and a minimum of 20 total reads (Fig. 1B). The remaining sites were assigned to the closest gene on the same strand. All sites found within the annotated gene (0 nt distance from nearest annotated gene) or further than 800 nt from the annotated translational start or stop site were manually assessed for contiguous read coverage in the RNA-seq analysis indicative of a true TSS/TES. We then assigned each end site a confidence value of “3,” “2,” or “1,” with “3” representing high-confidence sites, “2” indicating a probable end site requiring experimental validation, and “1” delineating an unlikely end site of an annotated gene in CEA10 (Supplemental Table S3).

After manual curation, 2897 TSS and 9460 TES were assigned a high confidence value of “3,” covering 27.1% and 69.9% of 10,124 annotated genes in the CEA10 genome (ASM15014v1), respectively. These percentages refer to genes where we were able to computationally determine an exact site. The location of more TSS/TES can be estimated by looking at the alignment of the 5′/3′ End-Seq reads, respectively. Two representative examples of genes (e.g., AFUB_009600, a splicing factor; and AFUB_066690, encoding the major A. fumigatus allergen Aspf2 [Dasari et al. 2018]) with annotated TSS and TES sites are shown in Figure 1C and D. For comparison, we performed a lift over of genome annotation coordinates from reference strain Af293 to CEA10 using the FLO software package (https://github.com/wurmlab/flo), which generally improved the annotation; however, we also observed cases where the predicted UTRs do not match the experimental validation. In Figure 1E, the annotated start and stop sites of the pseurotin A biosynthetic gene cluster are shown, indicating the ability of End-Seq analysis to provide useful context even in gene dense loci. As an additional positive control, we confirmed the two uORF-containing 5′ end of the cpcA gene of A. fumigatus to occur at position −855, very similar to previous reports of a TSS at position −858 (Fig. 1F; Supplemental Table S3; Krappmann et al. 2004).

A single mRNA isoform typically dominates A. fumigatus gene expression

The 2897 TSS and 9460 TES were annotated to 2747 and 7079 unique genes (with 2615 genes having both the TSS and TES annotated), respectively, resulting in 148 genes with alternative 5′ UTRs and 1675 with alternative 3′ UTRs based on the high-confidence ends. A total of 2599 genes had only one TSS, and 5404 genes had only one TES. This suggests that the majority of A. fumigatus genes are transcribed as a single isoform without alternative TES and TSS under standard growth conditions. Globally, 3′-UTR variants were more common than 5′ UTR variants, with 25% of genes with at least one identified high-confidence TES having multiple TESs versus 5% of genes with at least one identified high-confidence TSS having multiple TSSs. Among the genes exhibiting alternative TSS/TES, most have only one additional alternative (145 genes with two alternative 5′ UTRs; 1354 genes with two alternative 3′ UTRs) (Fig. 2A–C). A few genes have several potential UTR isoforms, for example, three genes exhibit three alternative 5′ UTRs; 293 genes had three alternative 3′ UTRs, 87 with four, 30 with five, nine with six, and one each with seven and nine 3′ UTRs (see example in Fig. 2D). When there was an alternative TSS/TES, this usually correlated with an increase/drop in transcript level with respect to the main start or stop site, which can be observed in the “Total RNA-Seq” track (Fig. 2).

FIGURE 2.

FIGURE 2.

Multiple genes have transcripts with observed alternative 5′ and 3′ RNA end sites. (A) Read alignment of AFUB_097740 gene encoding a protein of unknown function, (B) AFUB_012320 encoding a putative nitrate transporter, (C) AFUB_001520 encoding 60S ribosomal protein L31e, and (D) AFUB_000040 encoding a protein with predicted RNA polymerase II transcription factor activity. Read alignment of several genes; the y-axis indicates total read counts. Arrowheads indicate the positions of detected high-confidence 5′/3′ end sites. Read tracks were smoothened to improve viewability, and this infrequently masks some small introns.

A. fumigatus has longer 3′ UTRs than other reported fungi, despite comparable 5′ UTRs

We analyzed high-confidence end sites (value of 3) to determine the average length of the UTRs in A. fumigatus CEA10 (Fig. 3A; Supplemental Table S3). The mean length of 5′ UTRs was 126 nt with a median length of 49 nt, whereas the mean length of the 3′ UTRs was slightly longer at 268 nt, with a median length of 184 nt. Compared to UTR lengths in C. neoformans and S. pombe, A. fumigatus 5′ UTRs were slightly shorter (177 nt in C. neoformans and 152 nt in S. pombe vs. 126 nt mean length in A. fumigatus) but longer than C. albicans 5′ UTRs, which were 88 nt long on average (Misra et al. 2002; Wilhelm et al. 2008; Sellam et al. 2010; Chen et al. 2011; Graber et al. 2013; Srivastava et al. 2018; McMillan et al. 2019; Wallace et al. 2020). The A. fumigatus 3′ UTRs were notably longer than those of C. neoformans, S. cerevisiae, S. pombe, and C. albicans (186, 144, 169, and 84 nt vs. 268 nt mean length) (Fig. 3B; Graber et al. 1999; Wilhelm et al. 2008; Sellam et al. 2010; Wallace et al. 2020). We performed an additional control analysis excluding any UTRs with a calculated length of zero relative to the annotated gene and determined 5′ UTRs to be 218 nt (mean) with a median length of 135 nt and 3′ UTRs to be 282 nt (mean) with a median of 195 nt.

FIGURE 3.

FIGURE 3.

A. fumigatus exhibits 5′ UTRs of typical length, but relatively longer 3′ UTRs than other fungi. (A) Histograms (bin size equals 5) of UTR length distribution based on the observed high-confidence 5′ and 3′ poly(A)-enriched RNA ends. (B) Comparison of average UTR lengths with other eukaryotes (Graber et al. 1999; Misra et al. 2002; Chen et al. 2011; Srivastava et al. 2018; Wallace et al. 2020; Hong and Jeong 2023). (C) Transcriptomic snapshot of AFUB_035670 together with the measured 5′ and 3′ end sites; the 5′ UTR of AFUB_035670 was determined to be 2150 nt long compared to the prior prediction of 703 nt in Af293. Arrowheads indicate the positions of detected high-confidence 5′/3′ end sites. Read tracks were smoothened to improve viewability, and this infrequently masks some small introns.

We observed a number of TSS (and to a much lesser extent TES) within the boundaries of the annotated CDS of genes. In many cases, this was likely due to incorrect gene annotation and often supported by our RNA-seq coverage data. In 140 of the 207 genes with a TSS inside of the annotated CDS, the reported TSS was confirmed by total RNA-seq data and exhibited transcription downstream from the annotated start codon. In these cases, the current CDS annotation prediction is likely too long and includes an upstream (in-frame) start codon (e.g., see Supplemental Fig. S2), which could be utilized under different growth conditions. We often observed a few reads aligning to the upstream region, indicating that there could be an alternative TSS (as in Fig. 2B) although these events typically did not meet our threshold for inclusion as bona fide TSS.

The inclusion of an upstream start codon has previously been linked to signal peptides that alter protein localization (Natsoulis et al. 1986; Mireau et al. 1996; Mudge et al. 1998; Wallace et al. 2020). Therefore, we extracted the sequences from the high-confidence 5′ UTRs and scanned for upstream start codons (uAUGs). A total of 1180 of 2897 genes contained at least one uAUG (Supplemental Table S4), suggesting a vast, largely uninvestigated regulatory potential.

One particularly interesting example warranting future study is the 2150 nt-long 5′ UTR of AFUB_035670, which contains both a UTR intron and 30 uAUGs and coincides with the promoter of an antisense neighboring gene, AFUB_035680 (Fig. 3C). In silico prediction of the coding sequence using the longer 5′ UTR did not reveal an obvious contiguous open reading frame, suggestive of a regulatory role rather than misannotation in this case.

Similarly, the AFUB_067240 gene showed the highest number of uAUGs at 34 occurrences in its 2111 nt 5′ UTR. In fact, 275 out of 1180 genes with an uAUG within our newly set transcript boundaries had five or more uAUGs (Supplemental Table S4), which makes it unlikely that the annotated start codon is used and suggests a misannotated start codon or unusual regulation. An alternative explanation might be that these are tandem-transcribed noncoding RNA, although further work including ribosome profiling will be needed to fully understand these outliers.

Shortest and longest UTRs reveal novel features of A. fumigatus gene regulation

The length of 5′ UTR has a tremendous impact on translation rate. Shorter 5′ UTRs typically imply efficient translation, while longer 5′ UTRs harbor more regulatory elements that slow down translation (Kozak 1991; Hinnebusch et al. 2016). Short 3′ UTRs are associated with greater RNA stability but show less regulatory/localization sites than their longer counterparts (Nam et al. 2014; Mayr 2017). Therefore, we divided the high-confidence UTRs according to their length and assessed the 10% shortest and longest regions (Supplemental Fig. S3). A GO-term analysis of the 10% shortest 5′ UTRs showed a significant enrichment of ribosomal genes (e.g., Biological Process GO-terms: “structural constituent of ribosome,” “ribosome,” “translation”) and genes associated with mitochondrial membranes (“mitochondrial respiratory chain complex III assembly”) (Supplemental Fig. S3A; Supplemental Table S5). These findings are consistent with the idea that genes requiring high levels of translation, like those encoding ribosomal proteins, harbor shorter UTRs as a means to optimize translational output (for review, see Petibon et al. 2021). Among the genes with the 10% longest 5′ UTRs, phosphatases and kinases were especially common (e.g., “protein phosphorylation,” “protein kinase activity,” “transferase activity, transferring phosphorus-containing groups”) (Supplemental Fig. S3B). Genes involved in RNA/DNA interaction were also enriched (e.g., “DNA-binding transcription factor activity,” “nucleic acid binding,” “chromatin binding”). As longer UTRs are often postulated to possess additional regulatory capacity, these genes with long UTRs may be a particularly appealing place to find novel post-transcriptional regulatory mechanisms in A. fumigatus and related species.

No Biological Process GO-term was enriched among the genes with the 10% shortest 3′ UTRs. Enriched GO-terms for the 10% longest 3′ UTRs could generally be grouped into three categories: (1) transcription (e.g., “regulation of DNA templated transcription,” “DNA-binding transcription factor activity, RNA polymerase II specific,” “sequence specific DNA binding”); (2) phosphatases and kinases (e.g., “protein kinase activity,” “Transferase activity, transferring phosphorus containing groups,” “protein phosphorylation”); and (3) signaling (e.g., “G protein-coupled receptor signaling pathway,” “signal transduction,” “heterotrimeric G-protein complex”) (Supplemental Fig. S3C). Collectively, assessment of the longest and shortest UTRs revealed a reliance on short UTRs for ribosomal genes and longer UTRs for genes kinases and phosphatases.

The sequence context around TSS defines promoter strength, binding of transcription factors, and can also influence transcript diversity (Landry et al. 2003; Smale and Kadonaga 2003; Lenhard et al. 2012). To determine global and specific motifs around the TSS, we next extracted genomic sequences ±25 nt from the transcription start site of the 5′ UTRs. First, we predicted motifs using STREME (version 5.5.7) around all TSSs (Bailey 2021). We observed several motifs with significant, albeit limited, enrichment (Supplemental Fig. S4A). Due to the abundance and proximity, we suspect that the observed CATCAACA motif may be an initiator/initiator-like variant (Lu and Lin 2021), whereas the GARGAGRRRRARRR purine-rich motif may denote open chromatin capable of regulation via G-quadruplexes, RNA:DNA hybrids, or triple helices (Antonov and Medvedeva 2018; Greifenstein et al. 2021; Warner et al. 2021). When we did STREME analysis on the 10% shortest and longest 5′ UTRs, three different motifs were identified in each case (Supplemental Fig. S4B,C); however, the enrichment for these features was also minimal, indicating that a more sophisticated analysis will be required to tease apart these regulatory features of transcription in A. fumigatus. Next, WEB-logos were created from these sequences to assess potential sequence bias around the TSS. For both long and short UTRs, we found an enrichment of a TG at positions −1 and 0 (Supplemental Fig. S5). Interestingly, this enrichment of TG was not observed when the sequences for all identified high-confidence start sites were used. Instead, an enrichment for a CA at positions 1 and 2 was observed, suggesting some specificity in the longest and shortest UTRs (Supplemental Fig. S5). Of note, while A and G were slightly more prevalent upstream of and downstream from the TSS in the short and long UTR genes, C and T were slightly more prevalent upstream of and downstream from positions −1 and 0 when all TSSs were considered. Interestingly, in S. cerevisiae, T-richness upstream of the TSS is positively correlated with promoter activity (Lubliner et al. 2013).

Known RNA binding motifs are enriched in UTRs of expected client genes

Ssd1/SsdA/GUL-1/Sts5 is a fungal-specific RNA-binding protein that regulates growth and cell wall synthesis (Ballou et al. 2020; Hall and Wallace 2022). A. fumigatus SsdA is critical for regulation of morphology by the CotA kinase (Martin-Vicente et al. 2024). The binding motif (CNYTCNYT) of S. cerevisiae Ssd1 is well characterized and highly enriched in the 5′ UTRs of many mRNAs (Hogan et al. 2008; Bayne et al. 2021). For efficient binding, the motif should occur at least twice within the 5′ UTR (Bayne et al. 2021). After ensuring that Ssd1 (AFUB_010850) is expressed in our data set (250 mean read counts), we scanned our newly defined 5′ UTR sequences for instances with ≥2 Ssd1 binding motifs and found 527 such genes (Fig. 4A). AFUB_063890 (encoding the ortholog of S. cerevisiae Ecm33) held the most 5′ UTR Ssd1-binding sites with 14. Ecm33 is a GPI-anchored cell wall protein and known client mRNA of Ssd1 in S. cerevisiae (Li et al. 2013). A GO-term analysis using the 527 genes with ≥2 Ssd1 binding sites and a background list of genes with high-confidence 5′ UTRs revealed an enrichment of the categories “fungal-type cell wall,” “fungal-type cell wall organization,” “carbohydrate metabolic process,” “nucleic acid binding,” and “DNA binding” (Fig. 4B), consistent with the literature (Hogan et al. 2008; Bayne et al. 2021). We also confirmed that the Ssd1 binding motif was indeed enriched in 5′ UTRs by shuffling the 5′ UTR sequences and in parallel running the same analysis on 3′ UTR sequences. In both cases, this showed a significantly reduced number of motifs (Fig. 4C).

FIGURE 4.

FIGURE 4.

Ssd1 and PUF binding sites are enriched in 5′ and 3′ UTRs, respectively. (A) Histogram showing the number of Ssd1 binding motifs in newly defined 5′ UTRs. Only genes with at least two motif occurrences were plotted. (B) GO-term analysis of the same genes as in A with a supplied background list of all genes with an identified 5′ UTR in our study. GO-term analysis was done using https://fungifun3.hki-jena.de/. (C) Percent of CNYTCNYT motif occurrence in different sequence types. For this graph, a single occurrence was also counted. (D) Potential PUF protein family binding motif occurrence in different sequence types. Indicated are the number of sequences with the respective motif and the percent of the total number of input sequences in brackets. Sequence motifs were taken from Hogan et al. (2015) and Wilinski et al. (2017).

The Pumilio and FBF (PUF) family RNA binding proteins mainly interact with the 3′ UTRs of mRNAs encoding for membrane-associated proteins and are known modulators of RNA translation, stabilization, and localization (Wang et al. 2018; Murante and Hogan 2019). Knockouts of PUF genes in A. nidulans showed only minor defects in regard to spore numbers, colony diameter, and asexual development (Son et al. 2021). A. fumigatus encodes six proteins with a PUF domain (AFUB_038030, AFUB_022630, AFUB_093980, AFUB_002890, AFUB_009860, and AFUB_052880) (Galagan et al. 2005; Hogan et al. 2015). In our data set, these six genes exhibited low expression (≤72 mean counts). The literature suggests several potential binding motifs, and we scanned our identified 3′-UTR sequences for their occurrence (Hogan et al. 2008; Wilinski et al. 2017; Sadée et al. 2022). Only the motif UGUA[A/C/U]AUA resulted in enriched binding sites over the background (5′-UTR sequences and shuffled 3′-UTR sequences) (Fig. 4D). We identified 653 genes with at least one of the three sequences occurring in a 3′ UTR. A GO-term analysis with all genes yielded no enrichment; however, when only the 19 genes with two binding sites were considered (two binding sites was the maximum observed), several GO terms were significantly enriched. The terms “uniporter activity,” “mitochondrial calcium ion homeostasis,” and “calcium channel activity” were comparable to binding patterns of PUF family proteins in other species (García-Rodríguez et al. 2007), but the small number of hits in this category may suggest further refinement is necessary to understand PUF protein biology in A. fumigatus.

Overall, the enrichment of known protein-binding motifs in newly annotated UTR sequences argues that these sequences can be used to discover binding motifs and regulatory functions.

End-Seq reveals numerous cases of putative premature (intronic) transcription termination

We believe that the data set provided here will prove to be a tool for hypothesis generation moving forward. One compelling area for future study centers on the 484 (of 9460 total) identified TESs located within the predicted CDS of genes. These events can be classified as cases of (1) early termination shortly after the TSS, (2) slightly shorter mRNA isoforms near the predicted TES likely resulting from poor annotation, and (3) transcripts that are significantly shortened, potentially resulting in truncated peptides. We can also differentiate between intronic and exonic sites. While exonic transcription termination sites most likely lead to degradation of the transcript by mechanisms like nonstop decay, intronic termination can yield shortened proteins (van Hoof et al. 2002; Vasudevan et al. 2002; Kamieniarz-Gdula and Proudfoot 2019). Out of the 484 3′ sites located within CDS, 154 are located either within an intron or adjacent to an intron that is alternatively spliced.

A particularly interesting example comes from the AFUB_001550 transcript. AFUB_001550 is the predicted ortholog of S. cerevisiae ENV9, an oxidoreductase that plays an important role in lipid droplet morphology (Siddiqah et al. 2017). The pre-mRNA consists of four exons and three introns, where the first two introns are short and the third is relatively long at 274 nt, with an in-frame stop codon. Our sequencing data suggest that transcription is primarily terminated within the last intron, and only a fraction of reads reach the purported 3′ UTR. This can be observed in the total-poly(A)-RNA sequencing track as a drop in reads after the last intron, which was also confirmed by RT-qPCR (Fig. 5A,B). In S. cerevisiae, ENV9 functions as a short-chain dehydrogenase, and protein domain predictions for full-length AFUB_001550 reveal similar short-chain dehydrogenase domains. In both orthologs, these domains are in the C-terminal half of the protein and predicted to be absent in the truncated peptide produced from transcript using the TES in intron 3. In S. cerevisiae, ENV9 is important for lipid droplet morphology when the yeast is grown on poor carbon sources like glycerol or ethanol, with the C terminus playing an important role in membrane anchoring (Siddiqah et al. 2017). The short transcript produced by A. fumigatus may hint at an additional layer of regulation during transcription to limit membrane tethering or dehydrogenase activity of AFUB_001550 under favorable growth conditions or alternatively suggest that the gene is undergoing negative (purifying) selection.

FIGURE 5.

FIGURE 5.

3′ End-Seq reveals examples of premature termination in A. fumigatus. (A) Read alignment of AFUB_001550 encoding ortholog of S. cerevisiae ENV9; the y-axis indicates total read counts. Arrowheads indicate the positions of detected high-confidence 5′/3′ end sites. Amplified regions in B are indicated by the red lines in the gene model. The location of the intronic stop codon is indicated by a red arrow. (B) Relative expression level of exon 1 and exon 4 of AFUB_001550 measured by qPCR. Data from three biological replicates. (C) Schematic depiction of the predicted protein domain of AFUB_001550 using data from InterPro; the location of the observed internal stop site is indicated. (D) Read alignment of the previously uncharacterized AFUB_069590 gene; the y-axis indicates total read counts. Arrowheads indicate the positions of detected high-confidence 5′/3′ end sites. Amplified regions in E are indicated by the red line in the gene model. The location of the intronic stop codon is indicated by a red arrow. (E) Relative expression level of 5′ UTR and exon 5 of AFUB_069590 measured by qPCR. Data from three biological replicates. (F) Schematic depiction of the predicted protein domain of AFUB_069590 using data from InterPro; the approximate location of the observed internal stop site is indicated. Read tracks were smoothened to improve viewability, and this infrequently masks some small introns.

A second similar example is the uncharacterized gene AFUB_069590, which consists of five exons and four introns (Fig. 5D). Most transcripts in our analysis ended in the last intron, which appears to be mostly retained. We again confirmed a difference in isoform abundance using RT-qPCR with PCR primers to amplify total (short and long) versus long variants (Fig. 5E) Protein domain prediction of AFUB_069590 identified a Rad1 domain, typical of cell cycle control proteins (Marathi et al. 1998), in the last exon that would be lost using the premature termination site (Fig. 5F). The first four (relatively short) exons encode a predicted oligosaccharyltransferase domain; however, as in the example for AFUB_001550 above, the exact function of these domains in tandem requires further study.

A. fumigatus exhibits putative promoter proximal transcription termination

As premature transcription termination is a relevant regulatory mechanism across multiple kingdoms (Kamieniarz-Gdula and Proudfoot 2019), we next assessed putative PTT sites in our data set. We identified 42 genes with putative promoter proximal PTT using our high-confidence end sites, but it is noteworthy that there are certainly more potential promoter proximal PTT sites in the lower confidence groups, as these are often rare events that would not be expected to meet all our thresholds for high-confidence end sites. Additional examples were identified further downstream in the gene body (Fig. 5; Supplemental Table S2), but here we focus on the promoter proximal examples. Many of our identified cases are likely derived from cryptic intronic PAS. For example, AFUB_004020 exhibits both a detectable main TES and a minor alternative TES downstream from the annotated stop codon (Fig. 6A). According to our sequencing data, AFUB_004020 also has a TES within the first intron, producing a transcript of only ∼300 nt in length. Another interesting example is AFUB_039430, where the internal TES is in the third intron, the longest of the transcript (Fig. 6B). Although both examples shown in Figure 6A,B are also located close to or within introns, they are functionally different from the examples in Figure 5. In the latter case, the premature transcription termination leads to a decrease in transcription and a shortened mRNA, while in the former, no substantial drop in transcription after the premature transcription is observed. These putatively short RNAs could either be targeted for degradation (although they were stable enough to sequence), act as noncoding RNA, or even be translated into micropeptides (Kamieniarz-Gdula and Proudfoot 2019) to influence the A. fumigatus transcriptome/translatome.

FIGURE 6.

FIGURE 6.

Promoter proximal transcription termination occurs in A. fumigatus. Read alignment of (A) AFUB_004020 and (B) AFUB_039430, each encoding proteins of unknown function. The y-axis indicates total read counts for “-Seq” panels. Arrowheads indicate the positions of detected high-confidence 5′/3′ end sites. Read tracks were smoothened to improve viewability, and this infrequently masks some small introns. (C) Table depicting mean number of occurrences of the denoted Nrd1 and Nab3 binding sequences per designated sequence type (either ±100 or +500 nt as indicated). Sequences were taken from Webb et al. (2014).

In S. cerevisiae, an alternative transcription termination mechanism is regulated by the Nrd1–Nab3–Sen1 (NNS) complex. Although the main function of the NNS complex is the transcriptional termination of ncRNA, it has also been shown to be important for the premature termination of mRNAs (Arigo et al. 2006; Tudek et al. 2014; Webb et al. 2014; Chaves-Arquero and Pérez-Cañadillas 2023). For Nrd1 and Nab3, several RNA binding motifs were described using PAR-CLIP (Webb et al. 2014). We scanned the sequences around the sites of premature termination for the occurrence of the highest-ranking hits for Nrd1 (motif: TGTA) and Nab3 (GTAG). The Nrd1 motif was slightly enriched in sequences around PTT sites over sequences neighboring all 3′ sites or downstream from 5′ sites (Fig. 6C). This enrichment was more prominent for the Nab3 motif (Fig. 6C), supporting an involvement of the NNS complex in unconventional transcription termination in A. fumigatus. Additional effort will be required to fully understand the interplay of PTT and the NNS complex in A. fumigatus gene regulation.

DISCUSSION

The current public genome of the common laboratory strain CEA10 of A. fumigatus largely lacks experimental validation or even predictive information about the termini of RNA transcripts. Here, we provide an improved description of A. fumigatus CEA10 transcript start and end sites of poly(A)-tailed RNA from fungal mycelium. We were able to measure high-confidence TSS for 27.1% and TES for 69.9% of all annotated genes. The inclusion of middle- and potentially even low-confidence end sites in Supplemental Table S3 increases these numbers further and will likely prove informative on a case-by-case basis for genes-of-interest.

We expect this resource to be useful to the community in several ways. First, a more complete map of the A. fumigatus UTRs will support improved molecular cloning projects in terms of creation of genetic deletion and overexpression strains. Second, a more accurate description of UTRs will facilitate our understanding of the regulatory potential of such elements, particularly as protein binding hubs. UTR-binding RNA binding proteins are well known to regulate the level of translation, in addition to the stability and localization of the RNA (Kurischko et al. 2011; Mendonsa et al. 2023).

The GO-term analysis performed on the genes with the shortest and longest UTRs confirmed that genes harboring a short 5′ UTR were enriched for ribosomal and mitochondrial genes, consistent with high translation. Genes with the longest UTRs, on the other hand, were enriched in GO-categories where additional regulation on the RNA level could be required, including kinases, phosphatases, metabolic genes, and transcription factors. Interestingly, the nucleotide frequency around the TSS of the long and short UTRs was quite different from the nucleotide frequency around all TSSs (Supplemental Fig. S4D). There, we observed a higher frequency of Ts, which has been shown in S. cerevisiae to be positively correlated with promoter activity (Lubliner et al. 2013). This could suggest that the genes with short/long UTRs are under different regulatory pressures, although it is worth noting that there is a very enriched T at position −1 from the TSS.

While 3′ End-Seq reads mostly accumulated at the expected location close to the end of genes, 5′ End-Seq reads were typically found throughout the gene body. This is likely caused by the technical limitations of identifying 5′ ends using poly(A)-transcript enrichment, which requires selection of intact, full-length transcripts, followed by heat fragmentation, phosphorylation of non-5′-m7G-capped RNA, and finally enzymatic degradation of these phosphorylated fragments prior to library preparation (Afik et al. 2017). If the fragments are not fully phosphorylated and degraded, incorrect RNA fragments can be incorporated into the sequencing library. Nevertheless, for many genes we observed a clear enrichment of reads in proximity to the expected TSS. The nature of the poly(A) enrichment and selection of TES meant that this data set had lower noise, allowing for a more accurate definition of end sites. With this data set, we can more precisely define the A. fumigatus transcriptome, for example, showing it to possess 5′ UTRs of 126 nt mean length similar to other fungi and slightly longer 3′ UTRs at 268 nt. These UTRs harbored numerous uAUGs and motifs for interaction with RNA binding proteins like SsdA and the PUF proteins. It will be interesting to see how UTR lengths change under stress or throughout A. fumigatus development in the future, perhaps supplemented by direct RNA sequencing (Depledge et al. 2019; Parker et al. 2020; Ibrahim et al. 2021) or ribosomal profiling approaches as performed previously (Wallace et al. 2020).

Previous RNA-end analyses have proven that transcription does not always perfectly start or stop (or undergo cleavage) at the same position. Rather, transcript ends vary by a few nucleotides, resulting in a heterogenous population of mature transcripts (Geisberg et al. 2020; Wallace et al. 2020; Dang et al. 2024). As we performed a relatively strict analysis of TSS and TES and only kept peaks that were in at least three replicates, we did not assign TSS and TES to some genes because their window was wider than tolerated by our approach. Nevertheless, in most of these cases, the information about the start and stop regions can still be obtained by using the supplemental alignment files to assess the peak manually. We also noticed that most transcripts in A. fumigatus are transcribed as one isoform under normal growth conditions, with relatively limited numbers of alternative start or stop sites. How this will be affected by growth under stress conditions or during development remains an open question.

We expect such a data set to be an engine for hypothesis generation moving forward. One exciting observation in this direction was the numerous observed examples of premature transcription termination (PTT). Although previously demonstrated in the ascomycete S. cerevisiae (Arigo et al. 2006; Amodeo et al. 2022), to the best of our knowledge, this is the first report of PTT in Eurotiomycetes. We identified 42 high-confidence instances of promoter proximal PTT, but many additional examples appear to be scattered through the transcriptome for termination within the gene body. Clearly, more work will be required to define both the breadth and relevance of these examples in terms of regulation. This work can serve as a starting point to perform ribosome profiling in A. fumigatus (with a special focus on genes with uAUGs). Additionally, it will be interesting to see how targeted manipulation of the binding sites of the PUF proteins and Ssd1 influences the post-transcriptional regulation of their client genes. Ultimately, the definition of 5′ and 3′ ends of A. fumigatus transcripts will facilitate further discovery through an enhanced understanding of the gene regulatory mechanisms underpinning pathogenesis. How these transcriptome features relate to other members of the Aspergillus species complex or fungal pathogens more broadly remains an open question that certainly deserves additional attention.

MATERIALS AND METHODS

Strains and culture conditions

A. fumigatus strain CEA10 was grown on Aspergillus minimal media (AMM)-agar plates for 5 days at 37°C, and the spores were harvested by collection through 30 µm filters (Miltenyi Biotec). AMM was composed of 70 mM NaNO3, 11.2 mM KH2PO4, 7 mM KCl, 2 mM MgSO4, 1% (w/v) glucose, and 1 µL/mL trace element solution at pH 6.5 (1 g of FeSO4 • 7H2O, 8.8 g of ZnSO4 • 7H2O, 0.4 g of CuSO4 • 5H2O, 0.15 g of MnSO4 • H2O, 0.1 g of NaB4O7 • 10 H2O, 0.05 g of (NH4)6Mo7O24 • 4H2O, and ultra-filtrated water to 1000 mL). Fifty milliliters of liquid AMM was inoculated with 1 × 108 CEA10 spores and then incubated for 24 h at either 37°C or 42°C at 200 rpm. Mycelium was collected using Miracloth, washed with water, and ground with mortar and pestle in liquid nitrogen prior to downstream analyses.

RNA-seq and End-Seq library preparation

RNA was isolated using a TRIzol-based approach as described in Kelani et al. (2023) with minor modifications; RNA was DNase-treated according to the manufacturer's recommendation (Thermo Fisher, RNase-free DNase) and purified using an RNA Clean & Concentrator-5 kit (Zymo Research Corporation). DNase-treated RNA was quality checked on an agarose gel, and additionally, the RNA integrity number (RIN) values were determined by Bioanalyzer (Agilent). All samples had RIN values between 9 and 9.9. To sequence 5′ and 3′ poly(A)-containing RNA ends, 3 µg of total RNA was used as input for library preparation using 5′ and 3′ RNA-End-Seq kits (Eclipse Bioinnovations) according to the manufacturer's recommendation. In brief, for 5′ End-Seq, poly(A) RNA was enriched with oligo(dT) beads, the RNA was heat fragmented, uncapped 5′ ends were phosphorylated and digested, and the remaining 5′-terminal RNA fragments were subjected to library preparation. For 3′ End-Seq, total RNA was heat fragmented, the poly(A)-tailed fragments were isolated with oligo(dT) beads, and the AT-hybrid was digested prior to library preparation.

For poly(A)-RNA-seq, library preparation was performed by NovoGene UK according to standard protocols. Briefly, mRNA was purified from total RNA using poly(T) oligo-attached magnetic beads. Following fragmentation, first strand cDNA was synthesized using random hexamer primers. The second strand cDNA was synthesized using dUTP. The directional library was considered complete after end repair, A-tailing, adapter ligation, size selection, USER enzyme digestion, amplification, and finally purification. The library was then checked by Qubit and real-time PCR for quantification and by bioanalyzer for size distribution, prior to pooling for Illumina sequencing using an Illumina NovaSeq.

Reverse transcription-quantitative PCR (RT-qPCR)

Mycelium from 24 h CEA10 cultures was transferred into a 2 mL screw-cap tube filled with 1/3 0.5 mm soda lime glass beads (BioSpec) and mixed with 1 mL TRIzol. After bead beating 3× for 30 sec, with a 1 min break between, samples were incubated for 5 min at RT. Two hundred microliters of chloroform was added, and the tube was shaken vigorously. Following an incubation of 3 min at room temperature, the samples were centrifuged for 15 min at 4°C and 12,000g. The aqueous phase was transferred into a new tube and mixed with the same amount of 100% ethanol. RNA was then further cleaned up with an RNA Clean & Concentrator kit from Zymo Research Corporation according to the manufacturer's recommendation. One microgram of RNA was DNase digested using DNase-treated according to the manufacturer's recommendation (Thermo Fisher, RNase-free DNase). DNase-digested RNA was again purified with an RNA Clean & Concentrator kit, and 100 ng was used for reverse transcription using oligo(dT)-primer (Thermo Fisher, Maxima H Minus First Strand cDNA Synthesis Kit). For qPCR, Luna Universal qPCR Master Mix (NEB) was used with each data point performed in technical triplicate. The primers used are listed in Supplemental Table S6.

Data processing and analysis

Total RNA-seq

The quality of raw reads was assessed with FastQC (v0.12.0). Remaining adaptor sequences were removed with Trim Galore (v0.5.0). FastQC was applied again, and the FASTQ files were aligned to the A. fumigatus CEA10 strain genome (Aspergillus_fumigatusa1163.ASM15014v1) with HiSat (v2.2.1) using default parameters, but allowing for a maximum intron length of 5000 nt to improve mapping. Counts were determined by htseq-count (v0.9.1) and used as input for DESeq2 (v2.11.40.7). DESeq2 was run with the default read cutoff of <21 and a significance cutoff of Padj <0.05.

End-Seq

The quality of raw reads was assessed with FastQC (v0.12.0). Unique molecular identifiers (UMIs) were removed with umi_tools (v0.5.5; with the setting ‐‐bc-pattern=NNNNNNNNNN for recognition of UMIs). Remaining adaptor sequences were removed with Trim Galore (v0.5.0). 5′ End-Seq reads were further treated with fastp (v0.22.0). The resulting FASTQ files were again checked for quality with FastQC and then aligned to the A. fumigatus CEA10 strain genome (Aspergillus_fumigatusa1163.ASM15014v1) with HiSat (v2.2.1), allowing for a maximum intron length of 5000 nt. The resulting BAM files were converted to wig files using bam2wig (v1.6; dependent on samtools v1.14). Both read strands were then combined into a single file. To scale and transform the files into the bedGraph format, wiggletools (v1.2.11) was used with default parameters. As the 3′ end reads were antisense at this step, we swapped the strand with the “scale” option −1.

End-Seq peak calling

We performed End-Seq peak calling as previously described (Dang et al. 2024). To gain high-confidence transcription start and stop sites, only peaks that were in at least three replicates were retained using the overlaps function of wiggletools (v1.2.11). These files were filtered to retain peaks with at least 20 reads for further analysis. For downstream processing, duplicated peaks were removed by applying BEDOPS –merge (v2.4.41). Bedtools closest (v2.30.0) was used to assign the individual peaks to the associated genes on the chromosome. To avoid significant overlap with closely spaced genes on the opposite strand that could potentially interfere with the assignment, both the genome and the peaks were separated by strand. On the forward strand, the 3′ end assignment was limited to genes upstream of the peak. Accordingly, on the reverse strand, assignment was limited to genes downstream from the peak. For the 5′ end, the gene assignment must be reverted. As such, the forward strand assignment was limited to the closest gene downstream from and on the reverse strand to the closest gene upstream of the peak location. The resulting data set was then manually curated, and each peak was assigned a “confidence score” of either 3 (high-confidence TSS/TES), 2 (possible TSS/TES, but further experimental validation required), or 1 (unlikely a TSS/TES of a known gene in A. fumigatus). Genome coverage plots were made with SparK (v2.6.2) (Kurtenbach and William Harbour 2019).

Motif analysis

For motif prediction, genomic sequences were extracted from ±25 nt from the TSS and TES and subjected to STREME (version 5.5.7) (Bailey 2021). Randomly shuffled input sequences with maintained sequence composition were created by the STREME software itself and used as control sequences. To find binding motifs of Ssd1, PUF family proteins, Nrd1 and Nab3 motifs, and upstream translation start codons, an in-house R-script requiring the following packages was used: “BiocGenerics” v0.52.0 (Biocondutor), “tibble” v3.2.1, and “tidyr” v1.3.1.

Data and statistical analysis

Statistical analyses were performed using R version 4.4.2 (October 31, 2024) in R Studio Version 2024.12.0+467 (2024.12.0+467) as described in detail in the Materials and Methods section or figure legends as appropriate.

DATA DEPOSITION

All raw and processed sequencing data generated in this study have been submitted to the NCBI Gene Expression Omnibus (GEO; https://www.ncbi.nlm.nih.gov/geo/) with subseries GSE296593 and associated accession numbers GSE296590, GSE296591, and GSE296592.

SUPPLEMENTAL MATERIAL

Supplemental material is available for this article.

ACKNOWLEDGMENTS

We thank Amelia Barber and the members of the RNA Biology of Fungal Infections Junior Research Group for helpful discussions throughout this project. The work presented here was generously supported by the Federal Ministry of Research, Technology and Space (BMFTR: https://www.bmbf.de/), Germany, Project FKZ 01K12012 “RFIN – RNA-Biologie von Pilzinfektionen.” Additional funding support came from the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany's Excellence Strategy—EXC 2051—Project-ID 390713860 and via Leibniz Competition funds from the Leibniz Association through the Leibniz Collaborative Excellence Program, Project K569/2023 “FuRTHER - Fungal RNA Transmission Impacting Human Epigenome Regulation.” The funders had no role in the design, analysis, decision to publish, or preparation of the document.

Footnotes

Freely available online through the RNA Open Access option.

MEET THE FIRST AUTHOR

Lukas Schrettenbrunner.

Lukas Schrettenbrunner

Meet the First Author(s) is an editorial feature within RNA, in which the first author(s) of research-based papers in each issue have the opportunity to introduce themselves and their work to readers of RNA and the RNA research community. Lukas Schrettenbrunner is the first author of this paper, “Comprehensive mapping of the 5′ and 3′ untranslated regions of Aspergillus fumigatus reveals diverse mechanisms of mRNA processing including premature transcription termination.” Lukas is a PhD student in the laboratory of Dr. Matthew Blango at the Leibniz Institute for Natural Product Research and Infection Biology in Jena, Germany. The Blango lab is studying the RNA biology of fungal infections with a special focus on the mold Aspergillus fumigatus. During his PhD, he has been investigating several aspects of this regulation, focusing on the contribution of RNA interference to extracellular communication and improving our understanding of the fungal transcriptome through 5′ and 3′ poly(A)-RNA End-Seq.

What are the major results described in your paper, and how do they impact this branch of the field?

Our main goal for this study was to comprehensively map the 3′ and 5′ ends of poly(A)-enriched RNA of Aspergillus fumigatus. We believe that this data set will prove to be valuable for the fungal community, as we provide numerous previously unmapped UTR sequences. We also discovered numerous cases of premature transcriptional termination, which potentially lead to truncated proteins that have lost their main domain. Additionally, we show the first examples of promoter proximal termination in the Eurotiomycetes.

What led you to study RNA or this aspect of RNA science?

The University of Regensburg, where I did my Bachelor's and Master's studies, had a special research program on RNA binding proteins. So, my scientific socialization had a focus on RNA biology. In both my Bachelor's and Master's thesis, I studied transcription elongation of RNA-Pol II in Arabidopsis thaliana. I finished my undergraduate studies before the Covid-19 pandemic hit the world, but alongside the pandemic, RNA therapeutics made their broad public appearance. While I was looking for labs for my PhD, the Blango lab sparked my attention, as our overarching goal is to find ways to “fight” A. fumigatus with RNA. For this project, I was able to transfer and expand my wet lab skills and learn bioinformatics on the way.

What are some of the landmark moments that provoked your interest in science or your development as a scientist?

As a kid, I already liked to be outside and explore nature. The first real moment where I felt like science could be something for me was when I learnt about DNA replication in high school. After we were told the initial steps, it suddenly made a lot of sense in my head, and I pieced together the rest of the steps before the teacher told them to us. Initially, I thought I would go into ecology or botany as I still loved to be in the outdoors, but once at university, I quickly discovered that molecular biology was actually my main interest.

If you were able to give one piece of advice to your younger self, what would that be?

Do not say yes to every project that comes your way. You might think you have time to spare right now, but when you need to wrap up a number of projects in parallel this will backfire on you.

What are your subsequent near- or long-term career plans?

I will soon be finishing my PhD, and I am planning to do a postdoc, preferably in the field of transcription or fungal biology. In the long run, I want to transition into a science management/coordination role to offer my strong organizational skills to the science community.

REFERENCES

  1. Afik S, Bartok O, Artyomov MN, Shishkin AA, Kadri S, Hanan M, Zhu X, Garber M, Kadener S. 2017. Defining the 5′ and 3′ landscape of the Drosophila transcriptome with Exo-seq and RNaseH-seq. Nucleic Acids Res 45: e95. 10.1093/nar/gkx133 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Albrecht D, Guthke R, Brakhage AA, Kniemeyer O. 2010. Integrative analysis of the heat shock response in Aspergillus fumigatus. BMC Genomics 11: 32. 10.1186/1471-2164-11-32 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Amodeo ME, Mitchell SPC, Pavan V, Kuehner JN. 2022. RNA polymerase II transcription attenuation at the yeast DNA repair gene DEF1 is biologically significant and dependent on the Hrp1 RNA-recognition motif. G3 (Bethesda) 13: jkac292. 10.1093/g3journal/jkac292 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Amos B, Aurrecoechea C, Barba M, Barreto A, Basenko Evelina Y, Bażant W, Belnap R, Blevins AS, Böhme U, Brestelli J, et al. 2021. VEuPathDB: the eukaryotic pathogen, vector and host bioinformatics resource center. Nucleic Acids Res 50: D898–D911. 10.1093/nar/gkab929 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Antonov I, Medvedeva YA. 2018. Purine-rich low complexity regions are potential RNA binding hubs in the human genome. F1000Res 7: 76. 10.12688/f1000research.13522.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Arigo JT, Carroll KL, Ames JM, Corden JL. 2006. Regulation of yeast NRD1 expression by premature transcription termination. Mol Cell 21: 641–651. 10.1016/j.molcel.2006.02.005 [DOI] [PubMed] [Google Scholar]
  7. Arora A, Goering R, Lo HYG, Lo J, Moffatt C, Taliaferro JM. 2022. The role of alternative polyadenylation in the regulation of subcellular RNA localization. Front Genet 12: 818668. 10.3389/fgene.2021.818668 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Aspden JL, Wallace EWJ, Whiffin N. 2023. Not all exons are protein coding: addressing a common misconception. Cell Genom 3: 100296. 10.1016/j.xgen.2023.100296 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bader G, Enkler L, Araiso Y, Hemmerle M, Binko K, Baranowska E, De Craene J-O, Ruer-Laventie J, Pieters J, Tribouillard-Tanvier D, et al. 2020. Assigning mitochondrial localization of dual localized proteins using a yeast Bi-Genomic Mitochondrial-Split-GFP. eLife 9: e56649. 10.7554/eLife.56649 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Bailey TL. 2021. STREME: accurate and versatile sequence motif discovery. Bioinformatics 37: 2834–2840. 10.1093/bioinformatics/btab203 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Ballou ER, Cook AG, Wallace EWJ. 2020. Repeated evolution of inactive pseudonucleases in a fungal branch of the Dis3/RNase II family of nucleases. Mol Biol Evol 38: 1837–1846. 10.1093/molbev/msaa324 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Barber AE, Sae-Ong T, Kang K, Seelbinder B, Li J, Walther G, Panagiotou G, Kurzai O. 2021. Aspergillus fumigatus pan-genome analysis identifies genetic variants associated with human infection. Nat Microbiol 6: 1526–1536. 10.1038/s41564-021-00993-x [DOI] [PubMed] [Google Scholar]
  13. Bayne RA, Jayachandran U, Kasprowicz A, Bresson S, Tollervey D, Wallace EWJ, Cook Atlanta G. 2021. Yeast Ssd1 is a non-enzymatic member of the RNase II family with an alternative RNA recognition site. Nucleic Acids Res 50: 2923–2937. 10.1093/nar/gkab615 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Behr J, Kahles A, Zhong Y, Sreedharan VT, Drewe P, Rätsch G. 2013. MITIE: simultaneous RNA-seq-based transcript identification and quantification in multiple samples. Bioinformatics 29: 2529–2538. 10.1093/bioinformatics/btt442 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Bertuzzi M, van Rhijn N, Krappmann S, Bowyer P, Bromley MJ, Bignell EM. 2021. On the lineage of Aspergillus fumigatus isolates in common laboratory use. Med Mycol 59: 7–13. 10.1093/mmy/myaa075 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Bowyer P, Currin A, Delneri D, Fraczek MG. 2022. Telomere-to-telomere genome sequence of the model mould pathogen Aspergillus fumigatus. Nat Commun 13: 5394. 10.1038/s41467-022-32924-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Chaves-Arquero B, Pérez-Cañadillas JM. 2023. The Nrd1-Nab3-Sen1 transcription termination complex from a structural perspective. Biochem Soc Trans 51: 1257–1269. 10.1042/BST20221418 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Chen C-H, Lin H-Y, Pan C-L, Chen F-C. 2011. The plausible reason why the length of 5′ untranslated region is unrelated to organismal complexity. BMC Res Notes 4: 312. 10.1186/1756-0500-4-312 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Dang TTV, Colin J, Janbon G. 2022. Alternative transcription start site usage and functional implications in pathogenic fungi. J Fungi 8: 1044. 10.3390/jof8101044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Dang TTV, Maufrais C, Colin J, Moyrand F, Mouyna I, Coppée J-Y, Onyishi CU, Lipecka J, Guerrera IC, May RC, et al. 2024. Alternative TSS use is widespread in Cryptococcus fungi in response to environmental cues and regulated genome-wide by the transcription factor Tur1. PLoS Biol 22: e3002724. 10.1371/journal.pbio.3002724 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Dasari P, Shopova IA, Stroe M, Wartenberg D, Martin-Dahse H, Beyersdorf N, Hortschansky P, Dietrich S, Cseresnyés Z, Figge MT, et al. 2018. Aspf2 from Aspergillus fumigatus recruits human immune regulators for immune evasion and cell damage. Front Immunol 9: 1635. 10.3389/fimmu.2018.01635 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Depledge DP, Srinivas KP, Sadaoka T, Bready D, Mori Y, Placantonakis DG, Mohr I, Wilson AC. 2019. Direct RNA sequencing on nanopore arrays redefines the transcriptional complexity of a viral pathogen. Nat Commun 10: 754. 10.1038/s41467-019-08734-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. de Prisco N, Ford C, Elrod ND, Lee W, Tang LC, Huang K-L, Lin A, Ji P, Jonnakuti VS, Boyle L, et al. 2023. Alternative polyadenylation alters protein dosage by switching between intronic and 3′UTR sites. Sci Adv 9: eade4814. 10.1126/sciadv.ade4814 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Fabri JHTM, Rocha MC, Fernandes CM, Persinoti GF, Ries LNA, Cunha A, Goldman GH, Del Poeta M, Malavazi I. 2021. The heat shock transcription factor HsfA is essential for thermotolerance and regulates cell wall integrity in Aspergillus fumigatus. Front Microbiol 12: 656548. 10.3389/fmicb.2021.656548 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Fedorova ND, Khaldi N, Joardar VS, Maiti R, Amedeo P, Anderson MJ, Crabtree J, Silva JC, Badger JH, Albarraq A, et al. 2008. Genomic islands in the pathogenic filamentous fungus Aspergillus fumigatus. PLoS Genet 4: e1000046. 10.1371/journal.pgen.1000046 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Franceschetti M, Bueno E, Wilson RA, Tucker SL, Gómez-Mena C, Calder G, Sesma A. 2011. Fungal virulence and development is regulated by alternative pre-mRNA 3′end processing in Magnaporthe oryzae. PLoS Pathog 7: e1002441. 10.1371/journal.ppat.1002441 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Galagan JE, Calvo SE, Cuomo C, Ma LJ, Wortman JR, Batzoglou S, Lee SI, Baştürkmen M, Spevak CC, Clutterbuck J, et al. 2005. Sequencing of Aspergillus nidulans and comparative analysis with A. fumigatus and A. oryzae. Nature 438: 1105–1115. 10.1038/nature04341 [DOI] [PubMed] [Google Scholar]
  28. García-Rodríguez LJ, Gay AC, Pon LA. 2007. Puf3p, a Pumilio family RNA binding protein, localizes to mitochondria and regulates mitochondrial biogenesis and motility in budding yeast. J Cell Biol 176: 197–207. 10.1083/jcb.200606054 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Geisberg JV, Moqtaderi Z, Struhl K. 2020. The transcriptional elongation rate regulates alternative polyadenylation in yeast. eLife 9: e59810. 10.7554/eLife.59810 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Graber JH, Cantor CR, Mohr SC, Smith TF. 1999. Genomic detection of new yeast pre-mRNA 3′-end-processing signals. Nucleic Acids Res 27: 888–894. 10.1093/nar/27.3.888 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Graber JH, Nazeer FI, Yeh PC, Kuehner JN, Borikar S, Hoskinson D, Moore CL. 2013. DNA damage induces targeted, genome-wide variation of poly(A) sites in budding yeast. Genome Res 23: 1690–1703. 10.1101/gr.144964.112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Greifenstein AA, Jo S, Bierhoff H. 2021. RNA:DNA triple helices: from peculiar structures to pervasive chromatin regulators. Essays Biochem 65: 731–740. 10.1042/EBC20200089 [DOI] [PubMed] [Google Scholar]
  33. Gunišová S, Valášek LS. 2014. Fail-safe mechanism of GCN4 translational control—uORF2 promotes reinitiation by analogous mechanism to uORF1 and thus secures its key role in GCN4 expression. Nucleic Acids Res 42: 5880–5893. 10.1093/nar/gku204 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Hall RA, Wallace EWJ. 2022. Post-transcriptional control of fungal cell wall synthesis. Cell Surf 8: 100074. 10.1016/j.tcsw.2022.100074 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Han S, Navarro J, Greve RA, Adams TH. 1993. Translational repression of brlA expression prevents premature development in Aspergillus. EMBO J 12: 2449–2457. 10.1002/j.1460-2075.1993.tb05899.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Hinnebusch AG, Ivanov IP, Sonenberg N. 2016. Translational control by 5′-untranslated regions of eukaryotic mRNAs. Science 352: 1413–1416. 10.1126/science.aad9868 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Hogan DJ, Riordan DP, Gerber AP, Herschlag D, Brown PO. 2008. Diverse RNA-binding proteins interact with functionally related sets of RNAs, suggesting an extensive regulatory system. PLoS Biol 6: e255. 10.1371/journal.pbio.0060255 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Hogan GJ, Brown PO, Herschlag D. 2015. Evolutionary conservation and diversification of Puf RNA binding proteins and their mRNA targets. PLoS Biol 13: e1002307. 10.1371/journal.pbio.1002307 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Hong D, Jeong S. 2023. 3′UTR diversity: expanding repertoire of RNA alterations in human mRNAs. Mol Cells 46: 48–56. 10.14348/molcells.2023.0003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Hoque M, Ji Z, Zheng D, Luo W, Li W, You B, Park JY, Yehia G, Tian B. 2013. Analysis of alternative cleavage and polyadenylation by 3′ region extraction and deep sequencing. Nat Methods 10: 133–139. 10.1038/nmeth.2288 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Horta MAC, Steenwyk JL, Mead ME, Dos Santos LHB, Zhao S, Gibbons JG, Marcet-Houben M, Gabaldón T, Rokas A, Goldman GH. 2022. Examination of genome-wide ortholog variation in clinical and environmental isolates of the fungal pathogen Aspergillus fumigatus. mBio 13: e0151922. 10.1128/mbio.01519-22 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Ibrahim F, Oppelt J, Maragkakis M, Mourelatos Z. 2021. TERA-Seq: true end-to-end sequencing of native RNA molecules for transcriptome characterization. Nucleic Acids Res 49: e115. 10.1093/nar/gkab713 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Jöchl C, Rederstorff M, Hertel J, Stadler PF, Hofacker IL, Schrettl M, Haas H, Hüttenhofer A. 2008. Small ncRNA transcriptome analysis from Aspergillus fumigatus suggests a novel mechanism for regulation of protein synthesis. Nucleic Acids Res 36: 2677–2689. 10.1093/nar/gkn123 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Jonkers I, Lis JT. 2015. Getting up to speed with transcription elongation by RNA polymerase II. Nat Rev Mol Cell Biol 16: 167–177. 10.1038/nrm3953 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Kamieniarz-Gdula K, Proudfoot NJ. 2019. Transcriptional control by premature termination: a forgotten mechanism. Trends Genet 35: 553–564. 10.1016/j.tig.2019.05.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Kelani AA, Bruch A, Rivieccio F, Visser C, Krüger T, Weaver D, Pan X, Schäuble S, Panagiotou G, Kniemeyer O, et al. 2023. Disruption of the Aspergillus fumigatus RNA interference machinery alters the conidial transcriptome. RNA 29: 1033–1050. 10.1261/rna.079350.122 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Kozak M. 1991. Structural features in eukaryotic mRNAs that modulate the initiation of translation. J Biol Chem 266: 19867–19870. 10.1016/S0021-9258(18)54860-2 [DOI] [PubMed] [Google Scholar]
  48. Krappmann S, Bignell EM, Reichard U, Rogers T, Haynes K, Braus GH. 2004. The Aspergillus fumigatus transcriptional activator CpcA contributes significantly to the virulence of this fungal pathogen. Mol Microbiol 52: 785–799. 10.1111/j.1365-2958.2004.04015.x [DOI] [PubMed] [Google Scholar]
  49. Kurischko C, Kuravi VK, Herbert CJ, Luca FC. 2011. Nucleocytoplasmic shuttling of Ssd1 defines the destiny of its bound mRNAs. Mol Microbiol 81: 831–849. 10.1111/j.1365-2958.2011.07731.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Kurtenbach S, William Harbour J. 2019. SparK: a publication-quality NGS visualization tool. bioRxiv. 10.1101/845529 [DOI] [Google Scholar]
  51. Landry JR, Mager DL, Wilhelm BT. 2003. Complex controls: the role of alternative promoters in mammalian genomes. Trends Genet 19: 640–648. 10.1016/j.tig.2003.09.014 [DOI] [PubMed] [Google Scholar]
  52. Lenhard B, Sandelin A, Carninci P. 2012. Metazoan promoters: emerging characteristics and insights into transcriptional regulation. Nat Rev Genet 13: 233–245. 10.1038/nrg3163 [DOI] [PubMed] [Google Scholar]
  53. Li L, Miles S, Melville Z, Prasad A, Bradley G, Breeden LL. 2013. Key events during the transition from rapid growth to quiescence in budding yeast require posttranscriptional regulators. Mol Biol Cell 24: 3697–3709. 10.1091/mbc.e13-05-0241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Li Y, Steenwyk JL, Chang Y, Wang Y, James TY, Stajich JE, Spatafora JW, Groenewald M, Dunn CW, Hittinger CT, et al. 2021. A genome-scale phylogeny of the kingdom Fungi. Curr Biol 31: 1653–1665.e5. 10.1016/j.cub.2021.01.074 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Lofgren LA, Ross BS, Cramer RA, Stajich JE. 2022. The pan-genome of Aspergillus fumigatus provides a high-resolution view of its population structure revealing high levels of lineage-specific diversity driven by recombination. PLoS Biol 20: e3001890. 10.1371/journal.pbio.3001890 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Lu Z, Lin Z. 2021. The origin and evolution of a distinct mechanism of transcription initiation in yeasts. Genome Res 31: 51–63. 10.1101/gr.264325.120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Lubliner S, Keren L, Segal E. 2013. Sequence features of yeast and human core promoters that are predictive of maximal promoter activity. Nucleic Acids Resh 41: 5569–5581. 10.1093/nar/gkt256 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Malabat C, Feuerbach F, Ma L, Saveanu C, Jacquier A. 2015. Quality control of transcription start site selection by nonsense-mediated-mRNA decay. eLife 4: e06722. 10.7554/eLife.06722 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Marathi UK, Dahlen M, Sunnerhagen P, Romero AV, Ramagli LS, Siciliano MJ, Li L, Legerski RJ. 1998. RAD1, a human structural homolog of the Schizosaccharomyces pombe RAD1 cell cycle checkpoint gene. Genomics 54: 344–347. 10.1006/geno.1998.5589 [DOI] [PubMed] [Google Scholar]
  60. Martin-Vicente A, Souza ACO, Guruceaga X, Thorn HI, Xie J, Nywening AV, Ge W, Fortwendel JR. 2024. A conserved fungal morphogenetic kinase regulates pathogenic growth in response to carbon source diversity. Nat Commun 15: 8945. 10.1038/s41467-024-53358-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Mayr C. 2017. Regulation by 3′-untranslated regions. Annu Rev Genet 51: 171–194. 10.1146/annurev-genet-120116-024704 [DOI] [PubMed] [Google Scholar]
  62. McMillan J, Lu Z, Rodriguez JS, Ahn T-H, Lin Z. 2019. YeasTSS: an integrative web database of yeast transcription start sites. Database 2019: baz048. 10.1093/database/baz048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Mendonsa S, von Kügelgen N, Dantsuji S, Ron M, Breimann L, Baranovskii A, Lödige I, Kirchner M, Fischer M, Zerna N, et al. 2023. Massively parallel identification of mRNA localization elements in primary cortical neurons. Nat Neurosci 26: 394–405. 10.1038/s41593-022-01243-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Mireau H, Lancelin D, Small ID. 1996. The same Arabidopsis gene encodes both cytosolic and mitochondrial alanyl-tRNA synthetases. Plant Cell 8: 1027–1039. 10.1105/tpc.8.6.1027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Misra S, Crosby MA, Mungall CJ, Matthews BB, Campbell KS, Hradecky P, Huang Y, Kaminker JS, Millburn GH, Prochnik SE, et al. 2002. Annotation of the Drosophila melanogaster euchromatic genome: a systematic review. Genome Biol 3: RESEARCH0083. 10.1186/gb-2002-3-12-research0083 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Mudge SJ, Williams JH, Eyre HJ, Sutherland GR, Cowan PJ, Power DA. 1998. Complex organisation of the 5′-end of the human glycine tRNA synthetase gene. Gene 209: 45–50. 10.1016/S0378-1119(98)00007-9 [DOI] [PubMed] [Google Scholar]
  67. Müller S, Baldin C, Groth M, Guthke R, Kniemeyer O, Brakhage AA, Valiante V. 2012. Comparison of transcriptome technologies in the pathogenic fungus Aspergillus fumigatus reveals novel insights into the genome and MpkA dependent gene expression. BMC Genomics 13: 519. 10.1186/1471-2164-13-519 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Murante D, Hogan DA. 2019. New mitochondrial targets in fungal pathogens. mBio 10: e02258-19. 10.1128/mBio.02258-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Nam J-W, Rissland Olivia S, Koppstein D, Abreu-Goodger C, Jan Calvin H, Agarwal V, Yildirim Muhammed A, Rodriguez A, Bartel David P. 2014. Global analyses of the effect of different cellular contexts on microRNA targeting. Mol Cell 53: 1031–1043. 10.1016/j.molcel.2014.02.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Natsoulis G, Hilger F, Fink GR. 1986. The HTS1 gene encodes both the cytoplasmic and mitochondrial histidine tRNA synthetases of S. cerevisiae. Cell 46: 235–243. 10.1016/0092-8674(86)90740-3 [DOI] [PubMed] [Google Scholar]
  71. Nierman WC, Pain A, Anderson MJ, Wortman JR, Kim HS, Arroyo J, Berriman M, Abe K, Archer DB, Bermejo C, et al. 2005. Genomic sequence of the pathogenic and allergenic filamentous fungus Aspergillus fumigatus. Nature 438: 1151–1156. 10.1038/nature04332 [DOI] [PubMed] [Google Scholar]
  72. Parker MT, Knop K, Sherwood AV, Schurch NJ, Mackinnon K, Gould PD, Hall AJW, Barton GJ, Simpson GG. 2020. Nanopore direct RNA sequencing maps the complexity of Arabidopsis mRNA processing and m6A modification. eLife 9: e49658. 10.7554/eLife.49658 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Pelechano V, Wei W, Jakob P, Steinmetz LM. 2014. Genome-wide identification of transcript start and end sites by transcript isoform sequencing. Nat Protoc 9: 1740–1759. 10.1038/nprot.2014.121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Petibon C, Malik Ghulam M, Catala M, Abou Elela S. 2021. Regulation of ribosomal protein genes: an ordered anarchy. WIREs RNA 12: e1632. 10.1002/wrna.1632 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Prade RA, Timberlake WE. 1993. The Aspergillus nidulans brlA regulatory locus consists of overlapping transcription units that are individually required for conidiophore development. EMBO J 12: 2439–2447. 10.1002/j.1460-2075.1993.tb05898.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Rhodes J, Abdolrasouli A, Dunne K, Sewell TR, Zhang Y, Ballard E, Brackin AP, van Rhijn N, Chown H, Tsitsopoulou A, et al. 2022. Population genomics confirms acquisition of drug-resistant Aspergillus fumigatus infection by humans from the environment. Nat Microbiol 7: 663–674. 10.1038/s41564-022-01091-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Sadée C, Hagler LD, Becker WR, Jarmoskaite I, Vaidyanathan PP, Denny SK, Greenleaf WJ, Herschlag D. 2022. A comprehensive thermodynamic model for RNA binding by the Saccharomyces cerevisiae Pumilio protein PUF4. Nat Commun 13: 4522. 10.1038/s41467-022-31968-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Sellam A, Hogues H, Askew C, Tebbji F, van het Hoog M, Lavoie H, Kumamoto CA, Whiteway M, Nantel A. 2010. Experimental annotation of the human pathogen Candida albicans coding and noncoding transcribed regions using high-resolution tiling arrays. Genome Biol 11: R71. 10.1186/gb-2010-11-7-r71 [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Shenker S, Miura P, Sanfilippo P, Lai EC. 2015. IsoSCM: improved and alternative 3′ UTR annotation using multiple change-point inference. RNA 21: 14–27. 10.1261/rna.046037.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Siddiqah IM, Manandhar SP, Cocca SM, Hsueh T, Cervantes V, Gharakhanian E. 2017. Yeast ENV9 encodes a conserved lipid droplet (LD) short-chain dehydrogenase involved in LD morphology. Curr Genet 63: 1053–1072. 10.1007/s00294-017-0702-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Smale ST, Kadonaga JT. 2003. The RNA polymerase II core promoter. Annu Rev Biochem 72: 449–479. 10.1146/annurev.biochem.72.121801.161520 [DOI] [PubMed] [Google Scholar]
  82. Son S-H, Jang S-Y, Park H-S. 2021. Functions of PUF family RNA-binding proteins in Aspergillus nidulans. J Microbiol Biotechnol 31: 676–685. 10.4014/jmb.2101.01011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Srivastava AK, Lu Y, Zinta G, Lang Z, Zhu J-K. 2018. UTR-dependent control of gene expression in plants. Trends Plant Sci 23: 248–259. 10.1016/j.tplants.2017.11.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Sundaram A, Grant CM. 2014. A single inhibitory upstream open reading frame (uORF) is sufficient to regulate Candida albicans GCN4 translation in response to amino acid starvation conditions. RNA 20: 559–567. 10.1261/rna.042267.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Tian B, Manley JL. 2017. Alternative polyadenylation of mRNA precursors. Nat Rev Mol Cell Biol 18: 18–30. 10.1038/nrm.2016.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Tudek A, Porrua O, Kabzinski T, Lidschreiber M, Kubicek K, Fortova A, Lacroute F, Vanacova S, Cramer P, Stefl R, et al. 2014. Molecular basis for coordinating transcription termination with noncoding RNA degradation. Mol Cell 55: 467–481. 10.1016/j.molcel.2014.05.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. van Hoof A, Frischmeyer PA, Dietz HC, Parker R. 2002. Exosome-mediated recognition and degradation of mRNAs lacking a termination codon. Science 295: 2262–2264. 10.1126/science.1067272 [DOI] [PubMed] [Google Scholar]
  88. Vasudevan S, Peltz SW, Wilusz CJ. 2002. Non-stop decay‐‐a new mRNA surveillance pathway. Bioessays 24: 785–788. 10.1002/bies.10153 [DOI] [PubMed] [Google Scholar]
  89. Wallace EWJ, Maufrais C, Sales-Lee J, Tuck LR, de Oliveira L, Feuerbach F, Moyrand F, Natarajan P, Madhani HD, Janbon G. 2020. Quantitative global studies reveal differential translational control by start codon context across the fungal kingdom. Nucleic Acids Res 48: 2312–2331. 10.1093/nar/gkaa060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Wang M, Ogé L, Perez-Garcia M-D, Hamama L, Sakr S. 2018. The PUF protein family: overview on PUF RNA targets, biological functions, and post transcriptional regulation. Int J Mol Sci 19: 410. 10.3390/ijms19020410 [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Warner EF, Bohálová N, Brázda V, Waller ZAE, Bidula S. 2021. Analysis of putative quadruplex-forming sequences in fungal genomes: novel antifungal targets? Microb Genom 7: 000570. 10.1099/mgen.0.000570 [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Webb S, Hector RD, Kudla G, Granneman S. 2014. PAR-CLIP data indicate that Nrd1-Nab3-dependent transcription termination regulates expression of hundreds of protein coding genes in yeast. Genome Biol 15: R8. 10.1186/gb-2014-15-1-r8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Wilhelm BT, Marguerat S, Watt S, Schubert F, Wood V, Goodhead I, Penkett CJ, Rogers J, Bähler J. 2008. Dynamic repertoire of a eukaryotic transcriptome surveyed at single-nucleotide resolution. Nature 453: 1239–1243. 10.1038/nature07002 [DOI] [PubMed] [Google Scholar]
  94. Wilinski D, Buter N, Klocko AD, Lapointe CP, Selker EU, Gasch AP, Wickens M. 2017. Recurrent rewiring and emergence of RNA regulatory networks. Proc Natl Acad Sci 114: E2816–E2825. 10.1073/pnas.1617777114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Yogev O, Pines O. 2011. Dual targeting of mitochondrial proteins: mechanism, regulation and function. Biochim Biophys Acta 1808: 1012–1020. 10.1016/j.bbamem.2010.07.004 [DOI] [PubMed] [Google Scholar]

Articles from RNA are provided here courtesy of The RNA Society

RESOURCES