Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2018 Mar 27;46(11):5822–5836. doi: 10.1093/nar/gky225

The loss of SMG1 causes defects in quality control pathways in Physcomitrella patens

James P B Lloyd 1,✉,2,3, Daniel Lang 2,2,3, Andreas D Zimmer 2,3, Barry Causier 1, Ralf Reski 2,3, Brendan Davies 1,
PMCID: PMC6009662  PMID: 29596649

Abstract

Nonsense-mediated mRNA decay (NMD) is important for RNA quality control and gene regulation in eukaryotes. NMD targets aberrant transcripts for decay and also directly influences the abundance of non-aberrant transcripts. In animals, the SMG1 kinase plays an essential role in NMD by phosphorylating the core NMD factor UPF1. Despite SMG1 being ubiquitous throughout the plant kingdom, little is known about its function, probably because SMG1 is atypically absent from the genome of the model plant, Arabidopsis thaliana. By combining our previously established SMG1 knockout in moss with transcriptome-wide analysis, we reveal the range of processes involving SMG1 in plants. Machine learning assisted analysis suggests that 32% of multi-isoform genes produce NMD-targeted transcripts and that splice junctions downstream of a stop codon act as the major determinant of NMD targeting. Furthermore, we suggest that SMG1 is involved in other quality control pathways, affecting DNA repair and the unfolded protein response, in addition to its role in mRNA quality control. Consistent with this, smg1 plants have increased susceptibility to DNA damage, but increased tolerance to unfolded protein inducing agents. The potential involvement of SMG1 in RNA, DNA and protein quality control has major implications for the study of these processes in plants.

INTRODUCTION

Eukaryotic gene expression is optimized through many regulated steps, including the differential incorporation of exonic and intronic sequences in mRNA (alternative splicing; AS). AS has the capacity to alter both the protein coding potential of a transcript and its stability. Nonsense-mediated mRNA decay (NMD) is a process that influences the steady state levels of specific transcripts by targeting them for decay (1). NMD was originally characterized as a quality control pathway that degrades transcripts with premature termination codons (PTCs), introduced by mutations or splicing errors (1). However, it is now clear that NMD plays a more substantial role, since it has been shown to influence the steady state expression of between 1% and 10% of genes in flies, worms, mammals and Arabidopsis thaliana (2–7). Additionally, AS-coupled to NMD (AS-NMD) is an emerging regulatory mechanism in plants and animals, in which splicing differentially includes or excludes PTCs from transcripts, thereby altering the stability of transcripts by shifting them in and out of the influence of NMD (8,9).

PTCs are distinguished from normal termination codons due to their location in ‘unusual’ contexts (10). For example, a stop codon positioned ≥50 nucleotides upstream of an splice junction can trigger decay of the transcript (11–14); this is known as the 50 nt rule (14). The presence of a long 3′ untranslated region (3′ UTR) constitutes another ‘unusual’ context, which has been reported to trigger NMD in plants, animals and fungi (12,15–17). Upstream open reading frames (uORFs) and conserved peptide uORFs can also trigger NMD (7,18,19). In mammals, it is estimated that a third of intron-containing genes could be subject to AS-NMD (20,21), and at least 13–17% of A. thaliana intron-containing genes undergo AS-NMD (22,23). Many of these splicing events are conserved, and are thus likely to be regulated events, required for correct expression of the targeted genes (8,24).

The RNA helicase, UPF1, is a core component of the NMD machinery. UPF1 is essential for viability in A. thaliana, with the null mutant allele exhibiting seedling lethality (25). Expression of AtUPF1 in tobacco leaves revealed that, as in animals, plant UPF1 is phosphorylated (26,27). However, the kinase responsible is unknown. In animals, the kinase, SMG1, phosphorylates UPF1, but SMG1 is absent from the A. thaliana genome (28–30); SMG1 is present in nearly all other examined plants, including Arabidopsis lyrata (29,30), making the model plant, A. thaliana, the exception rather than the rule (29,30). We recently showed that SMG1 functions in the NMD pathway of the moss Physcomitrella patens (29). The ubiquitous presence of SMG1 in plants other than A. thaliana and Capsella rubella, and its conserved role in NMD, imply that this RNA quality control mechanism could be more similar in the plant and animal kingdoms than previously thought.

Having established moss as a model to characterize the function of SMG1 in plants, we performed RNA-seq to examine changes in gene expression and alternative splicing upon the loss of SMG1. We found that 32% of expressed multi-isoform genes produce one or more isoforms targeted to NMD, similar to the number described in mammals (20,21). A machine learning approach allowed us to identify features that are important for NMD target recognition: revealing that splice junctions in 3′ UTRs are an important determinant of NMD-sensitivity. Surprisingly, we found that not only was the NMD quality control pathway compromised, but also other quality control pathways were disrupted in smg1 mutant plants. The unfolded protein response (UPR) was activated in smg1 plants and, when exposed to protein unfolding drugs, mutant plants were more resistant than wild-type (WT) plants, suggesting that SMG1 may have a role in repressing the UPR. Additionally, DNA damage repair genes were also activated in smg1 plants, which were more susceptible to DNA damage than WT plants, suggesting that SMG1 is needed for normal DNA damage repair. This highlights the importance of SMG1 in plant growth and the need to study the unfolded protein response and the response to DNA damage in plants other than A. thaliana.

MATERIALS AND METHODS

Plant growth, treatments and phenotypic examination

Physcomitrella patens ssp. patens (Hedwig) ecotype ‘Gransden 2004’ (31,32) was cultured on BCDAT medium at 25°C under continuous light (33). Lawns of protonemal filaments were grown on BCDAT medium overlaid with a cellophane disc and sub-cultured by homogenisation at seven-day intervals. To assess the reaction of moss to drug treatment, moss tissue five- to six-days post-homogenisation, was inoculated as ‘spot inocula’ on BCDAT supplemented with drug or solvent control and grown for three weeks. To induce DNA damage moss was treated with 100 or 200 ng/ml bleomycin (Euro Nippon Kayaku GmbH, Frankfurt, Germany). To induce the unfolded protein response of the endoplasmic reticulum BCDAT medium was supplemented with tunicamycin (Tm) 2.5 μg/ml or DMSO solvent control. To induce the unfolded protein response across the whole cell BCDAT medium was supplemented with 10 mM l-azetidine- 2-carboxylic acid (AZC), a proline analogue, or 10 mM proline as a control for growth. To estimate moss colony area, photographs of moss plates were taken, the image software ImageJ was used to convert the images into binary format and the number of pixels corresponding to a colony/plant measured. Moss colony size was normalised between plates and converted into mm2 by estimating the area of the plate (34). To test for the drug-genotype interaction in the Tm treated plants, we implemented a non-parametric version of the ANOVA test (Aligned Rank Transform test) in R (35), to account for non-normality and non-homogeneity of variances, even after log-transformation.

RNA Isolation, RNA-seq and qRT-PCR

Total RNA was collected from 100 mg of protonema tissue, five-days post homogenisation using the RNeasy Plant Mini Kit with on-column DNase I treatment (Qiagen). RNA for RNA-seq was sent to GTAC Biotech (Constance, Germany) for library preparation using the TruSeq sample preparation kit (Illumina) and sequencing on the Illumina HiSeq2000 to generate single-end 100 nt long reads. Two biological replicates were collected for WT and each of the two mutant lines. Template for quantitative (q)RT-PCR was made using 1 μg of total RNA and iScript cDNA synthesis Kit (Bio-Rad). qRT-PCR was carried out using a Bio-rad CFX96 Real-Time System with SsoFast EvaGreen Supermix (Bio-Rad). To monitor the expression of unfolded protein responsive genes, moss tissue five-days post-homogenisation was inoculated as ‘spot inocula’ on BCDAT supplemented with 1 μg/ml of tunicamycin or DMSO as solvent control and grown for two weeks before tissue was collected for RNA extraction. All primers are listed in Supplemental Table S1.

Transcript assembly, isoform selection and differential analysis of expression and splicing

The reads generated by RNA-seq were mapped to the genome using TopHat (tophat-1.3.1.Linux_x86_64 custom version supporting additional bowtie1 parameters), which can map reads spanning exon-exon junctions (36) employing the parameter configuration: -I 20000 (maximum intron size), –segment-mismatches 2, –segment-length 18, –microexon-search, –maqerr 90, –bowtie-strata, –bowtie-all, –bowtie-best, –max-multihits 50 and –initial-read-mismatch 3, and using the version 1.6 of the moss genome as a reference (37). Transcript- and gene-wise read counts were obtained using htseq-count (38). Differential gene expression analysis was carried out using edgeR (39), DESeq (40) and NOISeq (41) with a significance threshold of corrected P < 0.05. Genes were considered differentially expressed if they were identified by at least two of the three tools in both smg1 mutants. GO and MapMan enrichments were then performed on the differentially expressed genes using custom R scripts utilizing the weight algorithm of the topGO package for GO analysis (for scripts, see Zenodo archive: https://doi.org/10.5281/zenodo.826164).

To identify novel transcripts, mapped reads of the individual libraries were assembled into candidate transcripts using Cufflinks (42), which were subsequently clustered and combined together with the V1.6 gene structures into consensus gene models using PASA and a set of custom perl scripts (for scripts, see Zenodo archive: https://doi.org/10.5281/zenodo.826164). The main open reading frame (ORF) was chosen based on homology, protein length and isoform expression. Using the improved annotation, we counted reads in each library which were specific to individual splicing events and used these event-specific read counts (Perl script; https://doi.org/10.5281/zenodo.826164) to perform differential expression analysis at the splice site level using the general linear model method implemented in edgeR (39). Transcript expression was also measured, with htseq-count (38). We contrasted isoform abundances in mutant and wild-type lines by estimating CPM-abundance, log-Fold change and FDR-corrected P-value for each splicing event and whole isoform separately. The FDR-corrected P-value threshold was P < 0.01. Change of protein structure/function/domains was evaluated considering the BLAST homology and PFAM domain coverage of the reference and the alternative transcripts. For this we compared alignments of both domain and BLAST matches. If these were altered (difference in length or number) then we considered the protein to be significantly altered.

Identification of NMD targeting features and prediction of NMD targets using ensemble (Machine) learning

In order to utilize the classified splice events to identify common NMD targeting features, and to discriminate between true NMD targets and secondarily deregulated isoforms in the smg1 mutants, we collected 361 additional structural and sequence features in addition to the expression data described above. First, the categorical values were split up into binary features (e.g. AS-type.alt_acceptor = 1|0). In addition, we collected numeric attributes describing the expression level in the mutants/WT, measuring 46 features of gene structure and uORFs, 207 sequence compositional features of transcript- and event-level contexts, three features describing cytosine methylation levels (43) at splice sites as well as 67 frequencies of overrepresented motif and K-mers. Subsequently we compacted the feature vector by filtering near-zero variance and reducing correlated (>75%) features using the caret package (44). Two versions of the machine learning approach were performed: The first used all available transcript attributes, including the expression data (EXP), while the second used only structural and sequence features, not features related to expression data (NOEXP).

A training set of 107 transcripts which were either likely NMD targets or non-NMD targets were selected (Supplemental Table S2). NMD targets were chosen by reviewing many loci in the Genome Browser by identifying alternative isoforms which led to increases in expression upon NMD inhibition and introduced an early stop codon. Non-NMD targets were chosen due to the lack of change in expression in smg1 plants. The training set was established in an iterative procedure that involved multiple rounds of manual curation, training, prediction, repeat. We selected a random sample of targets for manual curation. Random sampling was not only carried out on the entire data set as a whole, but on subsets that were defined based on exploratory analysis of features not necessarily linked to NMD, like e.g. alternative donors or CDS changing splicing events, to get a broad representation of the data. We performed cross-validation to ensure that we did not over-fit. Additionally, some of the training set was withheld from rounds of machine learning, and then tested on the excluded training set list. This ensured that some features were not being over-predicted before we proceeded to the final round where we used the whole training set to generate the final, cross-validated model. With this training set, we utilized the resulting feature vector for each of the identified splicing events to perform supervised classification following an ensemble learning/bagging approach that utilized 76 distinct machine learning methods. Each of the 76 compatible, classifiers implemented in the caret package (44) was trained and evaluated using three times repeated 5-fold cross-validation. The trained models were filtered for models with Accuracy ≥95% (EXP) and ≥75% (NOEXP) in the cross-validation procedure. The resulting 19 (EXP) and 11(NOEXP) models were used for predicting the whole dataset. Each model results in an event classification as: is.NMD = true or false.

RESULTS

Analysis of differential gene expression in smg1 moss

To investigate the influence of SMG1 on gene expression, we analysed the transcriptomic changes that occur upon the loss of SMG1 in the moss P. patens. RNA was collected from WT moss and two independently generated deletion mutants (smg1-1 and smg1-2) (29) and subjected to high-throughput RNA-sequencing (RNA-seq). RNA-seq reads were mapped to the moss genome, with an average mapping rate of 89% (Supplemental Table S3). The subsequent analysis followed a two-pronged approach (Figure 1A). First, we identified differentially expressed genes using the published version 1.6 gene models of the moss genome (37). Second, we inferred isoforms via transcript assemblies, examined differential splicing at the event level and then took a machine learning approach to identify the transcript features that target transcript isoforms to NMD (Figure 1A).

Figure 1.

Figure 1.

Analysis of differential expression from smg1 mutant plants. (A) Outline of the analysis pipeline to find differential gene expression and differential alternative splicing in the smg1 mutant plants. (B and C) Genes up-regulated and down-regulated, respectively, in smg1Δ lines when compared to WT (P < 0.05). Three different tools were used to assess if a transcript was up- or down-regulated (DESeq, edgeR and NOISeq). During the first round of selection, only genes that were differentially regulated in at least two tools were taken forward (overlap is indicated with an asterisk).

Identifying dysregulated pathways in smg1 mutant plants

We generated a conservative list of differentially expressed genes by using three computational tools: DESeq, edgeR and NOISeq (40,41,45). Genes that showed significantly changed expression in both mutants (smg1-1 and smg1-2), by at least two tools, were considered to be differential (Supplemental Table S4). Using these strict criteria, we identified 1648 and 3400 genes with increased and decreased steady state expression in smg1, respectively (Figure 1B and C). Therefore, large-scale changes in moss gene expression result from the loss of SMG1, with 13% of the moss genes showing differential expression. This is consistent with the wide-ranging transcriptional changes observed in NMD compromised flies, worms, human cells and A. thaliana (4,46–48).

Given the previously identified smg1 mutant phenotype of reduced gametophore number (29), we examined the expression of genes associated with the early development of gametophores (buds) to see if they were reduced in smg1 plants. Genes with reduced expression in smg1 plants show a significant enrichment for previously identified bud induced genes (49) (Hypergeometric test: P = 1.14 × 10−103), including five known bud marker genes (50) (Table 1). These data demonstrate that our RNA-seq analysis can capture the known phenotypic consequences of the deletion of SMG1.

Table 1. Genes involved in bud development have lower expression in smg1 mutant moss.

Gene ID Gene name Fold change (log2) in smg1 plants Significant tests
Pp1s25_23V6 BIP1 –188.22 2
Pp1s400_1V6 BIP2 –152.61 2
Pp1s31_134V6 BIP3 –69.08 2
Pp1s190_39V6 BIP4 –37.43 3
Pp1s8_135V6 BIP6 –34.66 2

The significant tests column reflects the number of differential testing tools (DESeq, edgeR and NOISeq) in which the gene was significantly changed.

To discover other biological processes that are disrupted after SMG1 deletion in moss, the conservative list of differentially expressed genes was used for gene ontology (GO) and MapMan (51) enrichment analysis. Amongst the genes that showed enhanced expression in smg1 mutants, the most highly enriched MapMan term is DNA synthesis/chromatin structure (Table 2). Manual inspection of these genes revealed that at least four of them have a role in DNA repair. DNA repair and DNA recombination were also enriched terms in the GO analysis (Supplemental Table S5), indicating an over-expression of genes relating to DNA damage repair in the smg1 mutants. A dysregulation of the DNA repair pathway upon the loss of SMG1 is reminiscent of a defect seen in animal cells with knocked down SMG1 (52). The second most highly enriched MapMan term associated with genes with an increased steady state expression in smg1 plants was stress/abiotic/heat, which completely overlapped with another MapMan term, chaperones and co-chaperones/HSP70s/chaperones (Table 2 and Supplemental Table S6). The majority of these genes (21/23) encode predicted chaperones, co-chaperones or proteases with predicted roles in the degradation of unfolded proteins (Supplemental Table S7). These genes appear to be associated with various cellular compartments including the cytosol, chloroplast and mitochondria. The unfolded protein response (UPR) of the endoplasmic reticulum (UPRER) has been the focus of many studies in animals and plants (53,54). A manual search for moss homologues of genes implicated in the UPRER in A. thaliana revealed that expression of genes such as BiP and Derlin-1 is increased in smg1 (Supplemental Table S7). Genes showing reduced expression in the smg1 mutants were also identified and categorized by terms such as photosynthesis, translation, cell wall modification and leucine rich repeat receptor kinases, suggesting a decrease in normal growth and metabolism in the absence of SMG1 (Supplemental Tables S5 and S6). Taken together, analysis of differential gene expression indicates that the loss of SMG1 leads to the dysregulation of many genes, with stress response genes relating to DNA damage and the unfolded protein response being over-expressed; the biological processes enriched among the down-regulated genes are consistent with an overall developmental retardation, hindered stress-response, and metabolic impairment.

Table 2. Over-represented MapMan terms of genes with increased steady state expression in smg1 plants.

MapMan term description P value
DNA synthesis/chromatin structure 4.58E–03
Stress abiotic heat 4.66E–03
Transport ABC transporters and multidrug resistance systems 1.48E–02
Chaperones and cochaperones HSP70s chaperones 2.33E–02
RNA processing splicing 2.60E–02

The loss of SMG1 leads to disrupted DNA and protein quality control systems

Many genes involved in the UPR showed increased steady state expression in smg1 plants compared to WT (Table 2), suggesting that SMG1 is important for the normal response to unfolded proteins. We treated moss with tunicamycin (Tm) to induce unfolded proteins in the ER (55–59) and found that smg1 lines grown on Tm were larger and greener than WT controls (Figure 2A and B). To confirm that this is not a drug-specific effect, we also treated WT and smg1 lines with the proline analogue, L-azetidine-2-carboxylic acid (AZC), previously shown to induce unfolded proteins in A. thaliana (57,59). As with Tm, smg1 plants grown on AZC were larger and greener than WT controls (Supplemental Figure S1). smg1 colonies are typically larger than WT colonies on control media, except when supplemented with proline as a control for AZC (Supplemental Figure S1A). However, we still see smg1 plants outgrowing WT plants on AZC, further supporting our conclusion that smg1 plants are better able to grow under high unfolded protein stress than WT plants. We then selected six UPR-related genes for which the RNA-seq analysis showed stronger expression in the smg1 mutants than in the WT, for validation using qRT-PCR. All six genes showed elevated expression in smg1 mutant lines (Figure 2C), in agreement with our RNA-seq analysis. We found that some of these genes (BiP1, BiP2 and ERjd3A) were also elevated in moss treated with Tm (Figure 2D). BiP2 in particular is highly inducible by Tm treatment, with steady state levels increasing over 50-fold. Taken together, these data show that smg1 mutant lines exhibit partial resistance to inducers of unfolded proteins, which may be due to an increased synthesis of chaperones and other proteins involved in clearing the cell of unfolded proteins.

Figure 2.

Figure 2.

smg1 plants are partially resistant to the unfolded protein inducing drug tunicamycin (Tm). (A) Three week old plants grown on Tm or solvent control (DMSO). Scale bar is 1 mm. (B) Moss colony size on Tm (2.5 μg/ml). n = 5–12. Asterisks represent a statistically significant difference from WT using an unpaired t test P < 0.05. A significant drug/genotype interaction was identified (Aligned Rank Transform test; P = 4.14 × 10−5), along with a significant drug treatment (Aligned Rank Transform test; P = 7.25 × 10−11) and genotype (Aligned Rank Transform test; P = 7.02 × 10−5) effect. (C) qRT-PCR analysis of BiP1 (Pp1s181_3V6), BiP2 (Pp1s288_23V6), Derlin-1a (Pp1s213_66V6), ERjd3A (Pp1s368_19V6), IRE1b (Pp1s34_189V6) and HSF (Pp1s31_388V6), expression in WT and smg1Δ line 1 (DMSO treated as solvent control). (D) qRT-PCR analysis of BiP1 (Pp1s181_3V6), BiP2 (Pp1s288_23V6), Derlin-1a (Pp1s213_66V6), ERjd3A (Pp1s368_19V6), IRE1b (Pp1s34_189V6), and HSF (Pp1s31_388V6), expression in WT treated with Tm (1 μg/ml) or untreated (DMSO solvent control). The fold change indicates the amount of target expression normalized to that of PpEF1α and relative to WT levels. Error bars represent the standard error of the mean from three biological replicates. Asterisks indicate conditions with a statistically significant difference from WT (DMSO solvent control) using an unpaired t test (*P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001). WT DMSO control in panel C and D represent the same data.

The activation of UPRER and some other cellular stresses in animal cells are known to inhibit NMD (19,60–62). Therefore, we tested the expression of several NMD targets in Tm treated moss to determine whether the UPRER also reduces the activity of NMD in moss. Five transcripts were tested, four of which have previously been shown to be NMD targets in moss (29), and one novel PTC-containing splice variant (HSF PTC+) identified in this study. As expected, all these transcripts are increased in the smg1 mutant (Figure 3A), but Tm treatment does not significantly elevate their expression (Figure 3B). Our data suggest that activation of the UPR does not suppress the NMD pathway in moss, as it does in animals (19,60–62).

Figure 3.

Figure 3.

NMD targets are not increased in Tm exposed plants. (A) The expression of NMD targets RS2Z37 (Pp1s69_23V6), Pp108464 (Pp1s270_54V6), HSF Pp1s31_388V6eIF5L1 (Pp1s626_4V6), and SMG7-2 (Pp1s311_73V6) in the smg1 mutant plants treated with DMSO as a solvent control. (B) The expression of NMD targets RS2Z37 (Pp1s69_23V6), Pp108464 (Pp1s270_54V6), HSF (Pp1s31_388V6), eIF5L1 (Pp1s626_4V6), and SMG7-2 (Pp1s311_73V6) in WT plants exposed to two weeks of Tm (1 μg/ml). WT DMSO control in panel A and B represent the same data. The fold change indicates the amount of target expression normalized to that of PpEF1α and relative to WT levels. Error bars represent the standard error of the mean from three biological replicates. Asterisks indicate conditions with a statistically significant difference from WT (DMSO solvent control) using an unpaired t test (*P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001).

Our differential gene expression analysis also showed that many genes with an increased steady state expression in moss smg1 mutant plants were associated with DNA repair (Table 2, Supplemental Tables S5 and S6). In mammals, SMG1 has an NMD-independent role in the DNA repair pathway and the loss of SMG1 increases the expression of DNA damage markers and DNA damage susceptibility (52,63). Therefore, moss was grown on the DNA damage inducing agent bleomycin, which has previously been used to test the susceptibility of moss mutants to DNA damage (34,64,65). After three weeks of growth on bleomycin, smg1 plants were significantly smaller and less green than WT plants (Figure 4), suggesting that the loss of SMG1 leads to an increased susceptibility to DNA damage, implicating SMG1 in the moss DNA repair pathway.

Figure 4.

Figure 4.

smg1 lines are more susceptible than WT to bleomycin. (A) Two week old moss plants grown on media with or without bleomycin. Two representative colonies of each line are shown for each treatment. Scale bar is 1 mm. (B) Moss colony size after three weeks growth on 100 ng/ml bleomycin. Data is the mean from 6–12 plants. Error bars represent standard deviation. Asterisks indicate lines with a statistically significant difference from the WT using an unpaired t test P < 0.05.

Taken together, these data show that the loss of SMG1 leads to the dysregulation of multiple quality control pathways in moss plants, evident in gene expression changes as well as the altered responses to drugs that induce either unfolded protein, or DNA damage. Therefore, SMG1 could be an important factor for quality control at the DNA and protein level, in addition to its established role in RNA quality control.

NMD in moss is triggered by the presence of 3′ UTR splice junctions

Various transcript features, including long 3′ UTRs and 3′ UTR splice junctions, have been proposed to mark a stop codon as aberrant, triggering NMD in various organisms, but it is currently unclear what features determine whether a transcript is an NMD target in moss. Using the RNA-seq data from smg1 plants and a machine learning approach, we were able to identify many features linked to the NMD-sensitivity of a transcript. First, we used Cufflinks (42) and PASA (66) to infer transcript isoforms from RNA-seq reads and existing transcripts, and to classify AS types in the newly assembled transcriptome. Then reads mapped unambiguously to the splicing events identified by PASA were tested for differential abundance in smg1 plants against WT plants using the generalized linear modelling in edgeR (45). This allowed us to identify direct targets of AS-NMD. A list of 108 manually curated NMD-sensitive and NMD-insensitive transcript events (Supplemental Table S2) was used as a training set for cross-validated training of 76 distinct machine learning techniques, available via the caret package (44). To identify the transcripts that trigger NMD, 328 transcript attributes were used by the machine learning approach, including details of transcript expression, splicing, and the presence of motifs. We identified 14128 events as NMD-targeted, representing 32% of annotated genes with expressed multiple isoforms. The transcript attributes most useful in determining NMD-sensitivity are presented in Figure 5A and Supplemental Figure S2. Some of these transcript attributes reveal mechanistic insights into how NMD targets are identified in moss. The transcript attribute ‘3′ UTR spliced introns’ is a positive predictor of a transcript targeted to NMD (Figure 5A), indicating that moss recognizes premature stop codons by the presence of a downstream exon-junction complex, as do animals and other plants (12,14,17,23,67). Surprisingly, we found that long 3′ UTRs are not positive predictors of NMD-targeting (Figure 5A), as has been suggested for animals and other plants (12,15,17). We also found that the more distant a 3′ UTR splice junction was from the stop codon, the less efficient at triggering NMD it appeared to be (Figure 5A). Other features include an under-representation of specific codons in uORFs (ATC, AAC, and GGT) and/or an over-representation (CAGTTGAAATTT and GTGAAAVTTTTC) or under-representation (CCAACATCAT, CTTGGCTA, and THTCAWGGGT) of certain motifs in the NMD targeted transcripts (Figure 5A and Supplemental Figure S2). However, it is unclear how these attributes alter NMD-sensitivity.

Figure 5.

Figure 5.

Transcript attributes influencing NMD target identification. (A) Factors that influenced NMD target status. The plot quantifies how well transcript attributes distinguish between NMD-targeted and non-NMD targeted events/transcripts in the ensemble machine learning approach. Transcript attributes act as either positive (red) or negative (purple) predictors of NMD-targeted status. Transcript attributes are located above the relevant feature of the example transcript models (bottom), with the exception of transcript-wide features (fold change (log2), retained intron, motif: CTACAAGA and alterative donor site), located on the right of the plot. Example transcript models represent two transcript isoforms, one non-NMD target (top) and one NMD target (bottom). Transcript attributes are ranked by ‘median importance’: Importance is the relative predictive power of each transcript attribute in identifying NMD-targeted events/transcripts. The median importance is taken from across the various machine learning tools used in this study. Abbreviations used in plot: TSS is transcriptional start site, CDS is coding sequence, and UTR is untranslated region. (B) The frequency of different AS types with increased, decreased or unchanged alternative splicing after the loss of SMG1. The AS types included: Included exon (retained exon; n = 2260), skipped exon (n = 829), retained intron (n = 58 459), spliced intron (n = 4636), alternative acceptor site (n = 5106), and alternative donor site (n = 4110). The frequency of exitrons (n = 18901) and other retained introns (n = 39 558) after loss of SMG1. Exitrons are defined as retained introns that are within a coding region and do not introduce a PTC. More differential exitrons have increased than differential retained introns (Fisher's exact test one-tailed test, P = 0.0056).

The GO enrichment of predicted direct AS-NMD targets (Supplemental Figure S3) reveals that splicing coupled to NMD is important for controlling the expression of a wide range of genes with diverse functions. Therefore, the mechanism of AS-NMD has a broad role in gene expression and is not limited to controlling a single process. We found that nearly all (96%) splicing introduced PTCs disrupt conserved domains of the encoded protein (Supplemental Figure S4), highlighting the importance of NMD in maintaining the proteome and the need for correct identification of NMD targeted transcript isoforms when curating a computationally generated transcriptome. Taken together, these data suggest that the NMD pathway of moss targets transcripts to NMD predominately via the presence of a 3′ UTR located splice junction but not a long 3′ UTR.

Transcript attributes related to the type of alternative splicing have predictive power in determining whether a transcript is targeted to NMD (Figure 5A). The presence of an alternative donor site is a positive predictor of NMD targeting, while the presence of a retained intron acts as a negative predictor (Figure 5A). Indeed, the steady state expression of transcripts with alternative donor sites tend to increase rather than decrease in the smg1 plants (Figure 5B). This pattern is also seen for transcripts with included exons, skipped exons, spliced introns, and alternative acceptor sites, but not for retained introns (Figure 5B). A similar fraction of retained introns increases and decreases after NMD is inhibited (Figure 5B), suggesting that most changes in transcript with retained introns are linked to secondary changes rather directly leading to NMD. Exitrons are a set of coding retained introns that do not introduce PTCs (68), we found that exitrons and other retained introns has a similar pattern of change after NMD was inhibited (Figure 5B), further evidence that retained introns are not used as a global mechanism to induce NMD. Therefore, not all AS types are equally used to target transcripts to NMD.

Retained intron transcripts have been reported as NMD targets in various eukaryotes (48,69,70). However, we (Figure 5) and others (22,23,71) have found many retained intron transcripts to be resistant to NMD. To confirm this result, we selected three retained intron events to examine in more detail. In addition to having retained intron isoforms, PpSCL30, PpRS27 and PpRS2Z37, all have NMD targeted isoforms resulting from an included exon (PpSCL30 and PpRS27) or alternative acceptor site (PpRS2Z37). qRT-PCR revealed that the predicted NMD targeted isoforms resulting from included exons (Figure 6) or alternative acceptor sites (29) are present in increased abundance in the smg1 mutant lines, but the retained intron isoforms show no such increase (Figure 6), supporting the notion that some retained intron transcripts are resistant to NMD in moss, as has been found for A. thaliana and mammals (22,23,71).

Figure 6.

Figure 6.

Isoforms with PTCs introduced by a retained intron do not have increased steady state expression in smg1 mutants. (A) qRT-PCR analysis of the PpSCL30 (Pp1s183_39V6) retained intron PTC+ variant. (B) qRT-PCR analysis of the PpSCL30 (Pp1s183_39V6) included exon PTC+ variant. (C) qRT-PCR analysis of the PpRS27 (Pp1s173_12V6) retained intron PTC+ variant. (D) qRT-PCR analysis of the PpRS27 (Pp1s173_12V6) included exon PTC+ variant. (E) qRT-PCR analysis of the PpRS2Z37 (Pp1s69_23V6) retained intron PTC+ variant. (A–E) The fold change indicates the amount of target expression normalized to that of PpEF1α and relative to WT levels. Error bars represent the standard error of the mean from three biological replicates. Asterisks indicate conditions with a statistically significant difference from WT using an unpaired t test (*P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001). NS, not significantly different. The black boxes represent constitutive exons, grey boxes represent alternative exons and black lines represent unspliced introns. Light grey lines represent introduced PTCs. Change in expression PpRS2Z37 (Pp1s69_23V6) alternative acceptor site PTC+ variant was already reported (29).

DISCUSSION

We have previously discovered that SMG1 is widespread in the plant kingdom and functions in the NMD pathway of the moss Physcomitrella patens (29). The specific loss of SMG1 in the A. thaliana makes it impossible to study the role of this NMD factor in this widely used plant model system. We therefore took advantage of moss smg1 mutants to examine the functional role of SMG1 in plants at the transcriptomic level. We show that the NMD factor SMG1 has an important role in moss growth and development and that the loss of SMG1 leads to widespread changes in the steady state expression of thousands of genes and splice variants. As in animals and A. thaliana, we find that AS-NMD is a major mechanism of gene regulation, influencing up to 32% of genes with multiple isoforms; we estimate that between 3–18% of all splice events in moss lead to NMD. This suggests that NMD is a frequent consequence of alternative splicing and plays an important role in moss gene regulation.

Many PTC-containing transcript isoforms with retained introns are not NMD targets, adding to the evidence from other systems that retained intron transcript isoforms are NMD-resistant (22,23,71). Finally, we showed that the loss of SMG1 affects the plant's ability to respond to both DNA damage and unfolded protein inducing drugs, suggesting an important role for SMG1 in stress responses of plants.

Loss of SMG1 leads to dysregulation of quality control pathways in moss

SMG1 has an important role in NMD (29). Using differential gene expression analysis, we identified multiple pathways that were dysregulated in smg1 plants, which revealed the surprising result that, in addition to this RNA quality control pathway, genes in the DNA quality control pathway (DNA repair) and protein quality control pathway (unfolded protein response) were also up-regulated (Table 2 and Supplemental Tables S5 and S6). We used a conservative approach to identify differentially expressed genes, looking only at genes reported as significant from at least two of edgeR, DESeq and NOISeq. Given the large overlap between NOISeq and the other tools, we speculate that NOISeq is the most accurate, but also the most conservative of the tools used. Not only did we find many changes in other quality control pathways of smg1 plants, but we found that these gene expression changes translated into an altered response of smg1 plants to both DNA damage and unfolded proteins. When compared with WT plants, smg1 mutants show increased susceptibility to DNA damage (Figure 4) and enhanced resistance to unfolded proteins (Figure 2 and Supplemental Figure S1). This is in contrast to A. thaliana, where inhibition of NMD leads to an auto-immune like state (7,72), due to the increased expression of toll-like leucine-rich repeat receptors (73). In moss, we see the opposite trend, with leucine-rich repeat receptors being enriched among genes with decreased expression (Supplemental Table S6). Additionally, we do not see evidence for an auto-immune like response in moss, with no enrichment of pathogen response genes increasing in smg1 plants (Supplemental Tables S5 and S6).

It is unclear whether the activation of these pathways results indirectly from the decrease in NMD, due to the loss of a central NMD factor (SMG1), causing dysregulation of regulators normally targeted by NMD, or whether it is a direct result of SMG1 having NMD-independent functions in these pathways (Figure 7). Regarding the UPR, it is possible that the loss of NMD increases the level of unfolded proteins, for example through increased expression of truncated proteins, which could prime the mutant plants. This might allow the smg1 plants to tolerate additional unfolded proteins. Alternatively, NMD could act as a regulatory mechanism to repress the UPR in moss (Figure 7). Future work will help to distinguish between these two possibilities. Currently, smg1 mutants are the only known viable NMD factor knockout line. Attempts to knockout other NMD factors have been unsuccessful. For example, when we attempted to knock out the only other single-copy core NMD factor in the moss genome, UPF2 (29), 15 independent knockout lines were obtained, however, all 15 retained a second copy of UPF2, suggesting lethality when removing UPF2 from regenerating protoplasts. Our attempts to generate knockout lines for the two UPF1 homologues have produced similar results.

Figure 7.

Figure 7.

SMG1 is important for multiple quality control pathways in moss. SMG1 activates the NMD pathway, which leads to repression of PTC-containing transcripts. We suggest that SMG1, potentially acting via NMD, represses the unfolded protein response, and activates the DNA repair machinery.

In mammals, NMD targets transcription factors involved in the activation of the UPRER through the uORFs in their 5′ UTRs (19,60,62). In response to unfolded protein stress, NMD is repressed, thus increasing the expression levels of these transcription factors (60). The repression of NMD is achieved through phosphorylation of eIF2-α (60), although how this achieves NMD repression is unclear. In this study, we found no evidence for a global repression of NMD when plants are exposed to inducers of unfolded proteins (Figure 3), suggesting that if NMD regulates the UPR in plants, it operates through a different mechanism than in mammals.

Given the large number of pathways that NMD regulates (3,7), it is possible that SMG1 acts through NMD to prevent DNA damage by activating DNA repair; the compromised NMD pathway in the smg1 mutant could conceivably result in spontaneous DNA damage to accumulating, due to the lack of repair, leading to the increased expression of DNA damage related genes observed (Table 1). smg1 was recently identified in an unbiased screen for activators of DNA repair in C. elegans (74). It was then shown that mutations in other NMD factors also lead to compromised double strand break repair in C. elegans (74), suggesting that the NMD pathway acts to activate DNA repair, similar to what we have observed in moss (Figure 7). We found some DNA repair factors were alternatively spliced and then targeted to NMD (Supplemental Figure S3), although this is unlikely to be the cause of the changes in DNA damage response seen in the smg1 mutant plants: NMD is acting here to degrade the PTC-containing, non-functional transcript isoform, rather than directly altering the protein-producing isoform. However, it is possible that when NMD is inhibited, truncated proteins may be translated and disrupt the activity of the full-length proteins. Alternatively to DNA repair activity being controlled via NMD, SMG1 could act directly in the DNA repair pathway (Figure 7). Such a role has been observed in humans, where SMG1 phosphorylates p53 in response to DNA damage (52,63,75), and the loss of SMG1 leads to spontaneous DNA damage and increased susceptibility to ionizing radiation (52,63). Since plants lack a homologue of p53, SMG1 might phosphorylate a plant equivalent of p53. SOG1 has previously been proposed to be such an equivalent of p53 in plants (76–78). SOG1 is phosphorylated at SQ dipeptides by the ATM kinase, as is p53 in the mammalian DNA repair pathway (75,79). It is also known that the SMG1 kinase has a specificity for SQ dipeptides (80). It is therefore tempting to speculate that in plants SMG1 phosphorylates SOG1 at SQ dipeptides, as does ATM. SMG1 may also act directly in the regulation of splicing of important regulators of DNA repair, as has recently been demonstrated in animal cells (81). The sensitivity to bleomycin of smg1 plants is similar to that of the recently characterised deletion of the moss RecQ4 (65), raises the possibly of a mechanistic link. Especially as some RecQ helicases are phosphorylated by PIK kinases (82). Given the significance of DNA repair, it will be important to investigate further the mechanism through which SMG1 confers tolerance to DNA damage in moss and potentially most other plants. This is not possible with A. thaliana given the genome is lacking an SMG1 orthologue (28–30), highlighting the need for alternative model plants like P. patens that allow the study of the link between DNA damage repair and NMD.

Downstream splice junctions are an important determinant of NMD in moss

Several features have been suggested to target transcripts to NMD in diverse organisms. These include long 3′ UTRs and 3′ UTR located splice junctions at least 50 nucleotides downstream of the stop codon. Long 3′ UTRs are proposed to act as NMD targeting features either via stalling of termination due to a greater distance between the stop codon and the poly-A binding protein or by long 3′ UTRs binding more UPF1 until a critical level is reached and NMD is activated (83). A splicing event in the 3′ UTR may deposit an exon-junction complex (EJC), which can recruit NMD factors to stimulate NMD during termination of translation at the upstream stop codon, although 3′ UTR splice junctions have been reported to trigger NMD in Tetrahymena thermophila without the involvement of the EJC (84). Additionally, fission yeast NMD is enhanced by splicing but is also independent of the EJC (85), suggesting that many organisms are able to sense splicing linked PTCs without involvement of the EJC. We found, among our high confidence NMD targeted isoforms, that a splice junction in the 3′ UTR was a powerful predictor of NMD, but this was not the case for long 3′ UTRs (Figure 5A). While average 3′ UTR lengths do vary between plant species, moss appears to be very typical in this regard (37), suggesting this might not be some quirk of moss. Some studies report that the average 3′ UTR length of NMD targets is greater than non-NMD targets, but this is likely due to these studies not accounting for the presence of 3′ UTR splice junctions. In human cancers, an unbiased machine learning approach identified 3′ UTR splice junctions, but not 3′ UTR length, to be a major determinant of NMD-sensitivity (86). When the presence of 3′ UTR splice junctions are accounted for, there is little or no difference in 3′ UTR length between NMD-sensitive and NMD-insensitive transcripts (84,87). Interestingly, other work has shown that in humans long 3′ UTR transcripts are protected from NMD by certain cis-sequence elements (88) or by the binding of PTBP1 near the stop codon (89), suggesting that many transcripts with naturally long 3′ UTRs are protected from NMD by RNA-binding proteins, explaining why some studies find individual transcripts with long 3′ UTRs to be NMD-sensitive but globally, most are likely protected. Our work suggests that the NMD pathway of moss is similar to that of mammals, and points to the likelihood that a similar mechanism already operated in the last eukaryotic common ancestor, although it remains to be seen whether the EJC itself is involved in NMD in moss, as in other plants and animals (12,17,90,91). The moss genome does encode two copies of all four of the core EJC components (Supplemental Table S9), but EJC-independent NMD involving splicing has also been observed in other species (84,85).

PTC-containing isoforms with retained introns are largely resistant to NMD in moss

Retention of introns frequently introduces PTCs, and retained intron transcripts have been identified as NMD targets in many species (48,70,92). Retained intron transcripts have also been predicted to be NMD targets in plants (93). Recent work in A. thaliana and mammals have identified that many retained intron isoforms are resistant to NMD (22,23,71) and this is potentially due to nuclear localization of retained intron isoforms (71,94). We found this to be the case in moss, as retained intron transcripts are underrepresented amongst NMD targets, in contrast to other products of alternative splicing (Figures 5 and 6). However, a handful of retained intron transcripts are found in association with translating ribosomes in plants (95–97), suggesting that some escape the nucleus. Further work is needed to understand the details of nuclear localization of retained intron transcripts and whether such transcripts become subject to NMD if they escape the nucleus.

NMD targets pose a problem for accurate annotation of genomes

Almost every day new genomes and transcriptomes are added to the public repositories and there has been tremendous progress in the bioinformatic prediction of transcripts in the recent years, however, the accurate reconstruction of spliced transcripts and prediction of their encoded ORFs still poses a difficult task. Our work here, and the work of others (23,98,99), demonstrate how prevalent AS-NMD is, and how it can cause problems for genome annotation approaches. Our data show that 3′ UTR splicing is a powerful predictor of NMD in moss, consistent with studies of other plants (12,17,23) and animals (14,86). Often, the longest ORF of a transcript is chosen when annotating a genome (100–102); however, when annotating NMD targeted transcripts, a stop codon introduced through splicing can be so early in the transcript that a the longest ORF starts from an internal methionine. Selection of the internal methionine masks this transcript as an NMD targeted transcript and simply appears to be a N-terminal truncation. Therefore, start codon selection should be influenced by what are likely translated AUGs, rather than the one leading to the longest ORF. Once an authentic start codon is selected, 3′ UTR splicing is likely to represent NMD targets or an artefact, such as a gene model fusion.

Additionally, not all transcripts with splice junctions in the 3′ UTR will be NMD targets. For example, many retained intron transcripts escape NMD due to nuclear detention (71,94). Without experimental validation of each transcript, computational annotations will need to classify transcripts with caution. Taking a more holistic approach to genome annotation, by taking into account the authentic start codon and type of splicing occurring in the transcript, should provide the research community a more accurate and useful resource.

A biological rationale for SMG1

Notwithstanding the significant progress made in studying plant NMD using the A. thaliana model system, its exceptional status as one of only two known land plant species to lack the NMD kinase SMG1, limits its utility (29,30). The SMG1 gene has been independently lost multiple times over the course of eukaryotic evolution, including in many fungal species and at least twice in land plants (29,30). Furthermore, the loss of SMG1 appears to have little to no effect on the NMD pathways in fruit flies and zebrafish (103–105). Therefore, in many organisms, SMG1 does not play a role in NMD at all. It is likely that another mechanism is able to readily replace SMG1 in the role of UPF1 activation upon PTC recognition, and this mechanism might be as ancient as SMG1 itself (29,30). It has been suggested that NMD is not an active process, but instead a passive process stemming from lack of translating ribosomes on the 3′ end of the gene (106). Intron retention-containing transcripts being NMD-insensitive might be consistent with a translation-dependent passive mechanism, however, much data supports the role of an active NMD pathway, for example: (107,108).

Here we used moss, a basal plant model in which SMG1 plays a role in NMD, to demonstrate that loss of SMG1 affects several quality control pathways (Figure 7). We confirm our previous finding that SMG1 is important for the RNA quality control pathway, NMD (29). However, we also find that SMG1 appears to be needed for the correct functioning of the UPR and the DNA repair pathways. SMG1 could potentially act to integrate quality control pathways for RNA, DNA and protein in plants.

DATA AVAILABILITY

Program to Assemble Spliced Alignments (PASA) is available at https://pasapipeline.github.io/. edgeR is an R package available from Bioconductor (https://bioconductor.org/packages/release/bioc/html/edgeR.html). NOISeq is an R package available from Bioconductor (https://www.bioconductor.org/packages/release/bioc/html/NOISeq.html). DESeq is an R package available from Bioconductor (http://bioconductor.org/packages/release/bioc/html/DESeq.html). TopHat is available at http://ccb.jhu.edu/software/tophat/index.shtml. Cufflinks is available at http://cole-trapnell-lab.github.io/cufflinks/. Datasets and scripts used in analysis of this publication are available as a Zenodo archive (https://doi.org/10.5281/zenodo.826164). htseq-count is part of the Python package HTSeq available at https://pypi.python.org/pypi/HTSeq. topGO is an R package available from Bioconductor (http://bioconductor.org/packages/release/bioc/html/topGO.html). caret is an R package available from CRAN (https://cran.r-project.org/web/packages/caret/). RNA-seq datasets have been deposited at the NCBI’s Short Read Archive under the BioProject accession number: PRJNA417704. The smg1 knockout moss plants are available from the International Moss Stock Center (www.moss-stock-center.org/), smg1-1: #40832 and smg1-2: #40833.

Supplementary Material

Supplementary Data

ACKNOWLEDGEMENTS

We would like to thank Dr Yasuko Kamisugi and Dr Andrew Cuming for materials and assistance for the bleomycin experiments and use of moss tissue culture facilities, and Mary McKay for assistance sub-culturing moss.

SUPPLEMENTARY DATA

Supplementary Data are available at NAR Online.

FUNDING

Gatsby Charitable Foundation studentship (to J.P.B.L.); BBSRC (BB/H00775X/1 and BB/L006170/1 to B.D.); Excellence Initiative of the German Federal and State Governments (EXC 294 to R.R.). Funding for open access charge: University of Leeds.

Conflict of interest statement. None declared.

REFERENCES

  • 1. Mühlemann O. Recognition of nonsense mRNA: towards a unified model. Biochem. Soc. Trans. 2008; 36:497–501. [DOI] [PubMed] [Google Scholar]
  • 2. He F., Li X., Spatrick P., Casillo R., Dong S., Jacobson A.. Genome-wide analysis of mRNAs regulated by the nonsense-mediated and 5′ to 3′ mRNA decay pathways in yeast. Mol. Cell. 2003; 12:1439–1452. [DOI] [PubMed] [Google Scholar]
  • 3. Mendell J.T., Sharifi N.A., Meyers J.L., Martinez-Murillo F., Dietz H.C.. Nonsense surveillance regulates expression of diverse classes of mammalian transcripts and mutes genomic noise. Nat. Genet. 2004; 36:1073–1078. [DOI] [PubMed] [Google Scholar]
  • 4. Rehwinkel J., Letunic I., Raes J., Bork P., Izaurralde E.. Nonsense-mediated mRNA decay factors act in concert to regulate common mRNA targets. RNA. 2005; 11:1530–1544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Guan Q., Zheng W., Tang S., Liu X., Zinkel R.A., Tsui K.-W., Yandell B.S., Culbertson M.R.. Impact of nonsense-mediated mRNA decay on the global expression profile of budding yeast. PLoS Genet. 2006; 2:e203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Kurihara Y., Matsui A., Hanada K., Kawashima M., Ishida J., Morosawa T., Tanaka M., Kaminuma E., Mochizuki Y., Matsushima A. et al. Genome-wide suppression of aberrant mRNA-like noncoding RNAs by NMD in Arabidopsis. Proc. Natl. Acad. Sci. U.S.A. 2009; 106:2453–2458. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Rayson S., Arciga-Reyes L., Wootton L., De Torres Zabala M., Truman W., Graham N., Grant M., Davies B.. A role for nonsense-mediated mRNA decay in plants: pathogen responses are induced in Arabidopsis thaliana NMD mutants. PLoS One. 2012; 7:e31917. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Lareau L.F., Inada M., Green R.E., Wengrod J.C., Brenner S.E.. Unproductive splicing of SR genes associated with highly conserved and ultraconserved DNA elements. Nature. 2007; 446:926–929. [DOI] [PubMed] [Google Scholar]
  • 9. Sureshkumar S., Dent C., Seleznev A., Tasset C., Balasubramanian S.. Nonsense-mediated mRNA decay modulates FLM-dependent thermosensory flowering response in Arabidopsis. Nature Plants. 2016; 2:16055. [DOI] [PubMed] [Google Scholar]
  • 10. Kervestin S., Jacobson A.. NMD: a multifaceted response to premature translational termination. Nat. Rev. Mol. Cell Biol. 2012; 13:700–712. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Nicholson P., Yepiskoposyan H., Metze S., Zamudio Orozco R., Kleinschmidt N., Mühlemann O.. Nonsense-mediated mRNA decay in human cells: mechanistic insights, functions beyond quality control and the double-life of NMD factors. Cell. Mol. Life Sci. 2010; 67:677–700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Kerényi Z., Mérai Z., Hiripi L., Benkovics A., Gyula P., Lacomme C., Barta E., Nagy F., Silhavy D.. Inter-kingdom conservation of mechanism of nonsense-mediated mRNA decay. EMBO J. 2008; 27:1585–1595. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Nyiko T., Kerenyi F., Szabadkai L., Benkovics A.H., Major P., Sonkoly B., Merai Z., Barta E., Niemiec E., Kufel J. et al. Plant nonsense-mediated mRNA decay is controlled by different autoregulatory circuits and can be induced by an EJC-like complex. Nucleic Acids Res. 2013; 41:6715–6728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Nagy E., Maquat L.E.. A rule for termination-codon position within intron-containing genes: when nonsense affects RNA abundance. Trends Biochem. Sci. 1998; 23:198–199. [DOI] [PubMed] [Google Scholar]
  • 15. Singh G., Rebbapragada I., Lykke-Andersen J.. A competition between stimulators and antagonists of Upf complex recruitment governs human nonsense-mediated mRNA decay. PLoS Biol. 2008; 6:e111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Kebaara B.W., Atkin A.L.. Long 3′-UTRs target wild-type mRNAs for nonsense-mediated mRNA decay in Saccharomyces cerevisiae. Nucleic acids Res. 2009; 37:2771–2778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Kertész S., Kerényi Z., Mérai Z., Bartos I., Pálfy T., Barta E., Silhavy D.. Both introns and long 3′-UTRs operate as cis-acting elements to trigger nonsense-mediated decay in plants. Nucleic Acids Res. 2006; 34:6147–6157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Nyikó T., Sonkoly B., Mérai Z., Benkovics A.H., Silhavy D.. Plant upstream ORFs can trigger nonsense-mediated mRNA decay in a size-dependent manner. Plant Mol. Biol. 2009; 71:367–378. [DOI] [PubMed] [Google Scholar]
  • 19. Gardner L.B. Hypoxic inhibition of nonsense-mediated RNA decay regulates gene expression and the integrated stress response. Mol. Cell. Biol. 2008; 28:3729–3741. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Lewis B.P., Green R.E., Brenner S.E.. Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans. Proc. Natl. Acad. Sci. U.S.A. 2003; 100:189–192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Weischenfeldt J., Waage J., Tian G., Zhao J., Damgaard I., Jakobsen J.S., Kristiansen K., Krogh A., Wang J., Porse B.T.. Mammalian tissues defective in nonsense-mediated mRNA decay display highly aberrant splicing patterns. Genome Biol. 2012; 13:R35. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Kalyna M., Simpson C.G., Syed N.H., Lewandowska D., Marquez Y., Kusenda B., Marshall J., Fuller J., Cardle L., McNicol J. et al. Alternative splicing and nonsense-mediated decay modulate expression of important regulatory genes in Arabidopsis. Nucleic Acids Res. 2012; 40:2454–2469. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Drechsel G., Kahles A., Kesarwani A.K., Stauffer E., Behr J., Drewe P., Rätsch G., Wachter A.. Nonsense-mediated decay of alternative precursor mRNA splicing variants is a major determinant of the Arabidopsis steady state transcriptome. Plant Cell. 2013; 25:3726–3742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Lareau L.F., Brenner S.E.. Regulation of splicing factors by alternative splicing and NMD is conserved between kingdoms yet evolutionarily flexible. Mol. Biol. Evol. 2015; 32:1072–1079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Arciga-Reyes L., Wootton L., Kieffer M., Davies B.. UPF1 is required for nonsense-mediated mRNA decay (NMD) and RNAi in Arabidopsis. Plant J. 2006; 47:480–489. [DOI] [PubMed] [Google Scholar]
  • 26. Mérai Z., Benkovics A.H., Nyikó T., Debreczeny M., Hiripi L., Kerényi Z., Kondorosi E., Silhavy D.. The late steps of plant nonsense-mediated mRNA decay. Plant J. 2012; 73:50–62. [DOI] [PubMed] [Google Scholar]
  • 27. Kerényi F., Wawer I., Sikorski P.J., Kufel J., Silhavy D.. Phosphorylation of the N- and C-terminal UPF1 domains plays a critical role in plant nonsense-mediated mRNA decay. Plant J. 2013; 76:836–848. [DOI] [PubMed] [Google Scholar]
  • 28. Grimson A., O’Connor S., Newman C.L., Anderson P.. SMG-1 is a phosphatidylinositol kinase-related protein kinase required for nonsense-mediated mRNA Decay in Caenorhabditis elegans. Mol. Cell. Biol. 2004; 24:7483–7490. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Lloyd J.P.B., Davies B.. SMG1 is an ancient nonsense-mediated mRNA decay effector. Plant J. 2013; 76:800–810. [DOI] [PubMed] [Google Scholar]
  • 30. Causier B., Li Z., De Smet R., Lloyd J.P.B., Van de Peer Y., Davies B.. Conservation of nonsense-mediated mRNA decay complex components throughout eukaryotic evolution. Scientific Rep. 2017; 7:16692. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Ashton N., Cove D.. The isolation and preliminary characterisation of auxotrophic and analogue resistant mutants of the moss, Physcomitrella patens. Mol. Gen. Genet. MGG. 1977; 154:87–95. [Google Scholar]
  • 32. Rensing S.A., Lang D., Zimmer A.D., Terry A., Salamov A., Shapiro H., Nishiyama T., Perroud P.-F., Lindquist E.A., Kamisugi Y. et al. The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants. Science. 2008; 319:64–69. [DOI] [PubMed] [Google Scholar]
  • 33. Nishiyama T., Hiwatashi Y., Sakakibara I., Kato M., Hasebe M.. Tagged mutagenesis and gene-trap in the moss, Physcomitrella patens by shuttle mutagenesis. DNA Res. 2000; 7:9–17. [DOI] [PubMed] [Google Scholar]
  • 34. Kamisugi Y., Schaefer D.G., Kozak J., Charlot F., Vrielynck N., Holá M., Angelis K.J., Cuming A.C., Nogué F.. MRE11 and RAD50, but not NBS1, are essential for gene targeting in the moss Physcomitrella patens. Nucleic Acids Res. 2011; 40:3496–3510. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Wobbrock J.O., Findlater L., Gergle D., Higgins J.J.. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 2011; Vancouver: ACM; 143–146. [Google Scholar]
  • 36. Trapnell C., Pachter L., Salzberg S.L.. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009; 25:1105–1111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Zimmer A.D., Lang D., Buchta K., Rombauts S., Nishiyama T., Hasebe M., Van de Peer Y., Rensing S.A., Reski R.. Reannotation and extended community resources for the genome of the non-seed plant Physcomitrella patens provide insights into the evolution of plant gene structures and functions. BMC Genomics. 2013; 14:498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Anders S., Pyl P.T., Huber W.. HTSeq–a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015; 31:166–169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. McCarthy D.J., Chen Y., Smyth G.K.. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Res. 2012; 40:4288–4297. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Anders S., Huber W.. Differential expression analysis for sequence count data. Genome Biol. 2010; 11:R106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Tarazona S., Garcia-Alcalde F., Dopazo J., Ferrer A., Conesa A.. Differential expression in RNA-seq: a matter of depth. Genome Res. 2011; 21:2213–2223. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Trapnell C., Williams B.A., Pertea G., Mortazavi A., Kwan G., van Baren M.J., Salzberg S.L., Wold B.J., Pachter L.. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat. Biotechnol. 2010; 28:511–515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Zemach A., McDaniel I.E., Silva P., Zilberman D.. Genome-wide evolutionary analysis of eukaryotic DNA methylation. Science. 2010; 328:916–919. [DOI] [PubMed] [Google Scholar]
  • 44. Kuhn M. Building predictive models in R using the caret package. J. Stat. Softw. 2008; 28:26. [Google Scholar]
  • 45. Robinson M.D., McCarthy D.J., Smyth G.K.. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010; 26:139–140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Mendell J.T., ap Rhys C.M.J., Dietz H.C.. Separable roles for rent1/hUpf1 in altered splicing and decay of nonsense transcripts. Science. 2002; 298:419–422. [DOI] [PubMed] [Google Scholar]
  • 47. Rayson S., Ashworth M., de Torres Zabala M., Grant M., Davies B.. The salicylic acid dependent and independent effects of NMD in plants. Plant Signal. Behav. 2012; 7:1434–1437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Ramani A.K., Nelson A.C., Kapranov P., Bell I., Gingeras T.R., Fraser A.G.. High resolution transcriptome maps for wild-type and nonsense-mediated decay-defective Caenorhabditis elegans. Genome Biol. 2009; 10:R101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Frank M.H., Scanlon M.J.. Cell-specific transcriptomic analyses of three-dimensional shoot development in the moss Physcomitrella patens. Plant J. 2015; 83:743–751. [DOI] [PubMed] [Google Scholar]
  • 50. Brun F., Gonneau M., Laloue M., Nogué F.. Identification of Physcomitrella patens genes specific of bud and gametophore formation. Plant Sci. 2003; 165:1267–1274. [Google Scholar]
  • 51. Thimm O., Blasing O., Gibon Y., Nagel A., Meyer S., Kruger P., Selbig J., Muller L.A., Rhee S.Y., Stitt M.. MAPMAN: a user-driven tool to display genomics data sets onto diagrams of metabolic pathways and other biological processes. Plant J. 2004; 37:914–939. [DOI] [PubMed] [Google Scholar]
  • 52. Brumbaugh K.M., Otterness D.M., Geisen C., Oliveira V., Brognard J., Li X., Lejeune F., Tibbetts R.S., Maquat L.E., Abraham R.T.. The mRNA surveillance protein hSMG-1 functions in genotoxic stress response pathways in mammalian cells. Mol. Cell. 2004; 14:585–598. [DOI] [PubMed] [Google Scholar]
  • 53. Iwata Y., Koizumi N.. Plant transducers of the endoplasmic reticulum unfolded protein response. Trends Plant Sci. 2012; 17:720–727. [DOI] [PubMed] [Google Scholar]
  • 54. Sakaki K., Yoshina S., Shen X., Han J., Desantis M.R., Xiong M., Mitani S., Kaufman R.J.. RNA surveillance is required for endoplasmic reticulum homeostasis. Proc. Natl. Acad. Sci. U.S.A. 2012; 109:8079–8084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55. Humbert S., Zhong S., Deng Y., Howell S.H., Rothstein S.J.. Alteration of the bZIP60/IRE1 pathway affects plant response to ER Stress in Arabidopsis thaliana. PLoS One. 2012; 7:e39023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56. Wang S., Narendra S., Fedoroff N.. Heterotrimeric G protein signaling in the Arabidopsis unfolded protein response. Proc. Natl. Acad. Sci. U.S.A. 2007; 104:3817–3822. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Sugio A., Dreos R., Aparicio F., Maule A.J.. The cytosolic protein response as a subcomponent of the wider heat shock response in Arabidopsis. Plant Cell. 2009; 21:642–654. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Iwata Y., Fedoroff N.V., Koizumi N.. Arabidopsis bZIP60 is a proteolysis-activated transcription factor involved in the endoplasmic reticulum stress response. Plant Cell. 2008; 20:3107–3121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59. Kamauchi S., Nakatani H., Nakano C., Urade R.. Gene expression in response to endoplasmic reticulum stress in Arabidopsis thaliana. FEBS J. 2005; 272:3461–3476. [DOI] [PubMed] [Google Scholar]
  • 60. Wang D., Zavadil J., Martin L., Parisi F., Friedman E., Levy D., Harding H., Ron D., Gardner L.B.. Inhibition of nonsense-mediated RNA decay by the tumor microenvironment promotes tumorigenesis. Mol. Cell. Biol. 2011; 31:3670–3680. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Oren Y.S., McClure M.L., Rowe S.M., Sorscher E.J., Bester A.C., Manor M., Kerem E., Rivlin J., Zahdeh F., Mann M. et al. The unfolded protein response affects readthrough of premature termination codons. EMBO Mol. Med. 2014; 6:685–701. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62. Karam R., Lou C.H., Kroeger H., Huang L., Lin J.H., Wilkinson M.F.. The unfolded protein response is shaped by the NMD pathway. EMBO Rep. 2015; 16:599–609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63. Azzalin C.M., Lingner J.. The double life of UPF1 in RNA and DNA stability pathways. Cell Cycle. 2006; 5:1496–1498. [DOI] [PubMed] [Google Scholar]
  • 64. Kamisugi Y., Whitaker J.W., Cuming A.C.. The transcriptional response to DNA-double-strand breaks in Physcomitrella patens. PLoS One. 2016; 11:e0161204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65. Wiedemann G., van Gessel N., Kochl F., Hunn L., Schulze K., Maloukh L., Nogue F., Decker E.L., Hartung F., Reski R.. RecQ helicases function in development, DNA repair, and gene targeting in Physcomitrella patens. Plant Cell. 2018; doi:10.1105/tpc.17.00632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66. Haas B.J., Delcher A.L., Mount S.M., Wortman J.R., Smith R.K. Jr., Hannick L.I., Maiti R., Ronning C.M., Rusch D.B., Town C.D. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 2003; 31:5654–5666. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67. Le Hir H., Gatfield D., Izaurralde E., Moore M.J.. The exon-exon junction complex provides a binding platform for factors involved in mRNA export and nonsense-mediated mRNA decay. EMBO J. 2001; 20:4987–4997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68. Marquez Y., Hopfler M., Ayatollahi Z., Barta A., Kalyna M.. Unmasking alternative splicing inside protein-coding exons defines exitrons and their role in proteome plasticity. Genome Res. 2015; 25:995–1007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69. Ponce de León I., Oliver J.P., Castro A., Gaggero C., Bentancor M., Vidal S.. Erwinia carotovora elicitors and Botrytis cinerea activate defense responses in Physcomitrella patens. BMC Plant Biol. 2007; 7:52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70. Sayani S., Janis M., Lee C.Y., Toesca I., Chanfreau G.F.. Widespread impact of nonsense-mediated mRNA decay on the yeast intronome. Mol. Cell. 2008; 31:360–370. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71. Boutz P.L., Bhutkar A., Sharp P.A.. Detained introns are a novel, widespread class of post-transcriptionally spliced introns. Genes Dev. 2015; 29:63–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72. Riehs-Kearnan N., Gloggnitzer J., Dekrout B., Jonak C., Riha K.. Aberrant growth and lethality of Arabidopsis deficient in nonsense-mediated RNA decay factors is caused by autoimmune-like response. Nucleic Acids Res. 2012; 40:5615–5624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73. Gloggnitzer J., Akimcheva S., Srinivasan A., Kusenda B., Riehs N., Stampfl H., Bautor J., Dekrout B., Jonak C., Jimenez-Gomez J.M. et al. Nonsense-mediated mRNA decay modulates immune receptor levels to regulate plant antibacterial defense. Cell Host Microbe. 2014; 16:376–390. [DOI] [PubMed] [Google Scholar]
  • 74. Gonzalez-Huici V., Wang B., Gartner A.. A role for the nonsense-mediated mRNA decay pathway in maintaining genome stability in Caenorhabditis elegans. Genetics. 2017; 206:1853–1864. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75. Gehen S.C., Staversky R.J., Bambara R.A., Keng P.C., O’Reilly M.A.. hSMG-1 and ATM sequentially and independently regulate the G1 checkpoint during oxidative stress. Oncogene. 2008; 27:4065–4074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76. Yoshiyama K., Conklin P.A., Huefner N.D., Britt A.B.. Suppressor of gamma response 1 (SOG1) encodes a putative transcription factor governing multiple responses to DNA damage. Proc. Natl. Acad. Sci. U.S.A. 2009; 106:12843–12848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77. Yoshiyama K.O., Kobayashi J., Ogita N., Ueda M., Kimura S., Maki H., Umeda M.. ATM-mediated phosphorylation of SOG1 is essential for the DNA damage response in Arabidopsis. EMBO Rep. 2013; 14:817–822. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78. Waterworth W.M., Drury G.E., Bray C.M., West C.E.. Repairing breaks in the plant genome: the importance of keeping it together. New Phytologist. 2011; 192:805–822. [DOI] [PubMed] [Google Scholar]
  • 79. Yamashita A., Ohnishi T., Kashima I., Taya Y., Ohno S.. Human SMG-1, a novel phosphatidylinositol 3-kinase-related protein kinase, associates with components of the mRNA surveillance complex and is involved in the regulation of nonsense-mediated mRNA decay. Genes Dev. 2001; 15:2215–2228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80. Okada-Katsuhata Y., Yamashita A., Kutsuzawa K., Izumi N., Hirahara F., Ohno S.. N- and C-terminal Upf1 phosphorylations create binding platforms for SMG-6 and SMG-5:SMG-7 during NMD. Nucleic Acids Res. 2012; 40:1251–1266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81. Chen J., Crutchley J., Zhang D., Owzar K., Kastan M.B.. Identification of a DNA damage-Induced alternative splicing pathway that regulates p53 and cellular senescence markers. Cancer Discov. 2017; 7:766–781. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82. Croteau D.L., Popuri V., Opresko P.L., Bohr V.A.. Human RecQ helicases in DNA repair, recombination, and replication. Annu. Rev. Biochem. 2014; 83:519–552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83. Hogg J.R., Goff S.P.. Upf1 senses 3′UTR length to potentiate mRNA decay. Cell. 2010; 143:379–389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84. Tian M., Yang W., Zhang J., Dang H., Lu X., Fu C., Miao W.. Nonsense-mediated mRNA decay in Tetrahymena is EJC independent and requires a protozoa-specific nuclease. Nucleic Acids Res. 2017; 45:6848–6863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85. Wen J., Brogna S.. Splicing-dependent NMD does not require the EJC in Schizosaccharomyces pombe. EMBO J. 2010; 29:1537–1551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86. Lindeboom R.G., Supek F., Lehner B.. The rules and impact of nonsense-mediated mRNA decay in human cancers. Nat. Genet. 2016; 48:1112–1118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87. Colombo M., Karousis E.D., Bourquin J., Bruggmann R., Muhlemann O.. Transcriptome-wide identification of NMD-targeted human mRNAs reveals extensive redundancy between SMG6- and SMG7-mediated degradation pathways. RNA. 2017; 23:189–201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88. Toma K.G., Rebbapragada I., Durand S., Lykke-Andersen J.. Identification of elements in human long 3′ UTRs that inhibit nonsense-mediated decay. RNA. 2015; 21:887–897. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89. Ge Z., Quek B.L., Beemon K.L., Hogg J.R.. Polypyrimidine tract binding protein 1 protects mRNAs from recognition by the nonsense-mediated mRNA decay pathway. Elife. 2016; 5:e11155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90. Gehring N.H., Neu-Yilik G., Schell T., Hentze M.W., Kulozik A.E.. Y14 and hUpf3b form an NMD-activating complex. Mol. Cell. 2003; 11:939–949. [DOI] [PubMed] [Google Scholar]
  • 91. Gehring N.H., Kunz J.B., Neu-Yilik G., Breit S., Viegas M.H., Hentze M.W., Kulozik A.E.. Exon-junction complex components specify distinct routes of nonsense-mediated mRNA decay with differential cofactor requirements. Mol. Cell. 2005; 20:65–75. [DOI] [PubMed] [Google Scholar]
  • 92. Jaillon O., Bouhouche K., Gout J.-F., Aury J.-M., Noel B., Saudemont B., Nowacki M., Serrano V., Porcel B.M., Ségurens B. et al. Translational control of intron splicing in eukaryotes. Nature. 2008; 451:359–362. [DOI] [PubMed] [Google Scholar]
  • 93. Wang B.-B., Brendel V.. Genomewide comparative analysis of alternative splicing in plants. Proc. Natl. Acad. Sci. U.S.A. 2006; 103:7175–7180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94. Gohring J., Jacak J., Barta A.. Imaging of endogenous messenger RNA splice variants in living cells reveals nuclear retention of transcripts inaccessible to nonsense-mediated decay in Arabidopsis. Plant Cell. 2014; 26:754–764. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95. Zhang X., Rosen B.D., Tang H., Krishnakumar V., Town C.D.. Polyribosomal RNA-Seq reveals the decreased complexity and diversity of the Arabidopsis translatome. PLoS One. 2015; 10:e0117699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96. Ner-Gaon H., Halachmi R., Savaldi-Goldstein S., Rubin E., Ophir R., Fluhr R.. Intron retention is a major phenomenon in alternative splicing in Arabidopsis. Plant J. 2004; 39:877–885. [DOI] [PubMed] [Google Scholar]
  • 97. Jiao Y., Meyerowitz E.M.. Cell-type specific analysis of translating RNAs in developing flowers reveals new levels of control. Mol. Syst. Biol. 2010; 6:419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98. Steward C.A., Parker A.P.J., Minassian B.A., Sisodiya S.M., Frankish A., Harrow J.. Genome annotation for clinical genomic diagnostics: strengths and weaknesses. Genome Med. 2017; 9:49. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99. Brown J.W., Simpson C.G., Marquez Y., Gadd G.M., Barta A., Kalyna M.. Lost in translation: pitfalls in deciphering plant alternative splicing transcripts. Plant Cell. 2015; 27:2083–2087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100. Wang L., Park H.J., Dasari S., Wang S., Kocher J.P., Li W.. CPAT: Coding-Potential Assessment Tool using an alignment-free logistic regression model. Nucleic Acids Res. 2013; 41:e74. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101. Vitting-Seerup K., Sandelin A.. The landscape of isoform switches in human cancers. Mol. Cancer Res. 2017; 15:1206–1220. [DOI] [PubMed] [Google Scholar]
  • 102. Hsu M.K., Lin H.Y., Chen F.C.. NMD Classifier: a reliable and systematic classification tool for nonsense-mediated decay events. PLoS One. 2017; 12:e0174798. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103. Wittkopp N., Huntzinger E., Weiler C., Saulière J., Schmidt S., Sonawane M., Izaurralde E.. Nonsense-mediated mRNA decay effectors are essential for zebrafish embryonic development and survival. Mol. Cell. Biol. 2009; 29:3517–3528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104. Metzstein M.M., Krasnow M.A.. Functions of the nonsense-mediated mRNA decay pathway in Drosophila development. PLoS Genet. 2006; 2:e180. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105. Chen Z., Smith K.R., Batterham P., Robin C.. Smg1 nonsense mutations do not abolish nonsense-mediated mRNA decay in Drosophila melanogaster. Genetics. 2005; 171:403–406. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106. Brogna S., McLeod T., Petric M.. The meaning of NMD: translate or perish. Trends Genet. 2016; 32:395–407. [DOI] [PubMed] [Google Scholar]
  • 107. Merai Z., Benkovics A.H., Nyiko T., Debreczeny M., Hiripi L., Kerenyi Z., Kondorosi E., Silhavy D.. The late steps of plant nonsense-mediated mRNA decay. Plant J. 2013; 73:50–62. [DOI] [PubMed] [Google Scholar]
  • 108. Huntzinger E., Kashima I., Fauser M., Sauliere J., Izaurralde E.. SMG6 is the catalytic endonuclease that cleaves mRNAs containing nonsense codons in metazoan. RNA. 2008; 14:2609–2617. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Data

Data Availability Statement

Program to Assemble Spliced Alignments (PASA) is available at https://pasapipeline.github.io/. edgeR is an R package available from Bioconductor (https://bioconductor.org/packages/release/bioc/html/edgeR.html). NOISeq is an R package available from Bioconductor (https://www.bioconductor.org/packages/release/bioc/html/NOISeq.html). DESeq is an R package available from Bioconductor (http://bioconductor.org/packages/release/bioc/html/DESeq.html). TopHat is available at http://ccb.jhu.edu/software/tophat/index.shtml. Cufflinks is available at http://cole-trapnell-lab.github.io/cufflinks/. Datasets and scripts used in analysis of this publication are available as a Zenodo archive (https://doi.org/10.5281/zenodo.826164). htseq-count is part of the Python package HTSeq available at https://pypi.python.org/pypi/HTSeq. topGO is an R package available from Bioconductor (http://bioconductor.org/packages/release/bioc/html/topGO.html). caret is an R package available from CRAN (https://cran.r-project.org/web/packages/caret/). RNA-seq datasets have been deposited at the NCBI’s Short Read Archive under the BioProject accession number: PRJNA417704. The smg1 knockout moss plants are available from the International Moss Stock Center (www.moss-stock-center.org/), smg1-1: #40832 and smg1-2: #40833.


Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES