Skip to main content
Life Science Alliance logoLink to Life Science Alliance
. 2018 May 16;1(2):e201800052. doi: 10.26508/lsa.201800052

Pervasive allele-specific regulation on RNA decay in hybrid mice

Wei Sun 1,2,*, Qingsong Gao 2,*, Bernhard Schaefke 1, Yuhui Hu 1, Wei Chen 1,3,
PMCID: PMC6238540  PMID: 30456349

Using a F1 hybrid mouse system, this study globally investigates the effects of cis-regulatory divergence on RNA decay in mammals and reveals the evolutionary roles of RNA decay regulation.

Abstract

Cellular RNA abundance is determined by both RNA transcription and decay. Therefore, change in RNA abundance, which can drive phenotypic diversity between different species, could arise from genetic variants affecting either process. However, previous studies in the evolution of RNA expression have been largely focused on transcription. Here, to globally investigate the effects of cis-regulatory divergence on RNA decay in mammals for the first time, we quantified allele-specific differences in RNA decay rates (ASD) in an F1 hybrid mouse. Out of 8,815 genes with sufficient data, we identified 621 genes exhibiting significant cis-divergence. Systematic analysis of these genes revealed that the genetic variants affecting microRNA binding and RNA secondary structures contribute to the observed divergences. Finally, we demonstrated that although the divergences in RNA abundance were predominantly determined by allelic differences in RNA transcription, most genes with significant ASD did not exhibit significant difference in RNA abundance. For these genes, the apparently compensatory effect between the allelic differences in RNA transcription and ASD suggests that changes in RNA decay could serve as important means to stabilize RNA abundances during mammalian evolution.

Introduction

Eukaryotic gene expression is regulated at multiple steps, and the balance between two opposing biological processes, RNA transcription and its decay, determines the cellular abundance of RNA transcripts (Garneau et al, 2007; Dolken et al, 2008; Schwanhausser et al, 2011; Rabani et al, 2011, 2014). Although to date most studies on RNA expression regulation were focused solely on transcription, recent works have clearly demonstrated the important role of RNA decay (Raghavan et al, 2002; Hao & Baltimore, 2009; Schwanhausser et al, 2011; Rabani et al, 2011, 2014). Often, in response to an extrinsic or intrinsic stimulus, the RNA decay rate can change rapidly to adjust the RNA levels with or without transcriptional change (Raghavan et al, 2002; Hao & Baltimore, 2009). Such regulation is mediated by the interaction between cis-regulatory elements residing within the RNA transcripts and diffusible trans-acting factors, including RNA-binding proteins (RBPs) and regulatory RNAs such as microRNAs. During the past decades, a number of cis-elements have been identified (Caput et al, 1986; Shaw & Kamen, 1986; Xia et al, 1996; Bartel, 2004; Mendell et al, 2004; Vlasova et al, 2008; Ivanov & Anderson, 2013), and importantly, genetic variants affecting these cis-elements often alter the RNA decay rate and can result in pathological phenotypes (Rodningen et al, 1998; Xia et al, 1998; Wang et al, 2008; Puimege et al, 2015; Khabar, 2017; Patel et al, 2017).

Changes in RNA expression constitute one of the major forces driving both phenotypic diversity among individuals within the same species (Albert & Kruglyak, 2015) and evolutionary divergence between different species (Necsulea & Kaessmann, 2014). Such changes could arise from genetic variants affecting either transcription or decay. However, because most previous studies analyzed only the effects of genetic variants on steady-state RNA expression levels, they could not distinguish the effects on transcription from those on decay and thus could not elucidate the underlying regulatory mechanisms. To address this, the Gilad and Pritchard labs analyzed the individual-specific mRNA decay rates of more than 16,000 genes in 70 Yoruba HapMap lymphoblastoid cell lines and identified 31 genes with significant cis-RNA decay quantitative trait loci (rdQTLs) at a false discovery rate (FDR) of 15% (Pai et al, 2012). To increase their detection power, they then focused on single-nucleotide polymorphisms (SNPs) already identified as steady-state expression QTLs (eQTLs) (Pai et al, 2012). Out of 1,257 eQTLs, 195 were also significantly associated with variations in mRNA decay rates. Interestingly, among the joint QTLs, whereas in 55% cases, the alleles with higher steady-state level decay slower, the remaining 45% showed the opposite pattern of allelic bias between the steady-state expression and RNA decay.

A more direct approach to estimate the cis-regulatory effect on RNA degradation is to compare the allele-specific decay rates of RNA transcripts in an F1 hybrid (Dori-Bachash et al, 2011, 2012; Andrie et al, 2014). Those allelic transcripts are subject to the same trans-regulatory environment, so that observed allelic differences should reflect the impact of cis-regulatory divergence. Recently, several studies have investigated allele-specific differences in mRNA decay rates (ASD) for F1 hybrids between different genetically diverse yeast strains (Dori-Bachash et al, 2011, 2012; Andrie et al, 2014). Strikingly, in all these F1 hybrid studies in yeast, for more than 80% of the genes with significant allelic biases in mRNA decay (ASD), their allele-specific mRNA decay and allele-specific RNA expression biased toward opposite alleles, suggesting pervasive compensatory effects between the evolutions of RNA transcription and RNA decay. Such occurrence (>80%) of compensatory effects observed in yeast is much higher than that (45%) observed in the aforementioned human rdQTLs study (Pai et al, 2012). Compared with unicellular organisms such as yeast, more complex gene regulation would be required in multicellular organisms with various organs and cell types. Therefore, such different observation may reflect different evolutionary modes of gene expressing between yeast and mammals. However, alternatively, it can also be due to the different designs of these studies (QTLs versus F1 hybrid). To finally tackle this question, a direct genome-wide profiling of allele-specific RNA decay patterns in multicellular species, such as mammals, would be necessary.

Here, to globally investigate the effects of cis-regulatory divergence on RNA decay in mammals, we quantified ASD in an F1 hybrid between two inbred mouse strains, Mus musculus C57BL/6J (BL6) and Mus spretus SPRET/EiJ mouse strain (SPRET). These two mouse strains diverged ∼1.5 million years ago, resulting in ∼35.4 million SNPs and ∼4.5 million insertions and deletions (indels) between their genomes (Dejager et al, 2009; Keane et al, 2011). Such a high sequence divergence allowed us to unambiguously determine the allelic origin for a large fraction of sequencing reads, thereby enabling accurate measurement of ASD for thousands of genes. In total, out of 8,815 genes with sufficient data for accurate quantification of ASD, we identified 621 genes (7.0%) exhibiting significant cis-divergence. Compared with genes without allelic bias, those with ASD divergence contained higher densities of sequence variants. Systematic analysis of sequence features of the genes with biased allelic decay revealed that miRNA-binding sites within 3′ untranslated regions (UTRs) and the local RNA secondary structure in both coding regions and 3′ UTRs could affect RNA decay. Finally, via investigating the role of ASD in the allele-specific RNA abundances (ASA), we demonstrated that on one hand, the observed ASA divergences were predominantly determined by the allelic differences in RNA transcription (AST) and on the other hand, most (>80%) of the genes with significant ASD did not exhibit significant ASA, indicating the pervasive compensatory effects between AST and ASD also existing in mammalian evolution and suggesting that changes in RNA decay rates could serve as important means to stabilize RNA abundances during evolution.

Results

Pervasive allelic divergence on RNA decay rates in an F1 hybrid mouse

To investigate the allelic divergence of RNA decay rates in a mammalian system, we measured the ASD in a fibroblast cell line derived from an F1 hybrid mouse between the BL6 and SPRET strains. As shown in Fig 1, we monitored the changes of the allelic RNA abundances following transcriptional arrest using actinomycin D. More specifically, paired-end sequencing was performed on poly-A RNA samples isolated from two biological replicates of F1 fibroblast cells collected at 0, 0.5, and 1.5 h subsequent to transcriptional arrest. On average, each sample yielded 130.1 million read pairs (Table S1). Fig S1 shows the good reproducibility between the two replicates for all the three time points. The high density of sequence variants between the genomes of BL6 and SPRET enabled unambiguous assignment of allelic origin for an average of 62.5 million read pairs in each sample (Table S1; see the Materials and Methods section for details).

Figure 1. Overview of experimental design.

Figure 1.

Fibroblast cells were isolated and cultured from the adult F1 hybrid mice between C57BL/6J and SPRET/EiJ. Two replicates of RNAs collected at three different time points following transcriptional arrest were sequenced.

Figure S1. Reproducibility of mRNA sequencing data.

Figure S1.

(A) Scatterplot comparing the abundance of cellular mRNA (log2-transformed sum of both alleles) between two biological replicates at 0 h. Each dot represents one gene. (B) Scatterplot comparing the log2-transformed fold change of the two alleles between two biological replicates at 0 h. (C) Scatterplot comparing the abundance of cellular mRNA (log2-transformed sum of both alleles) between two biological replicates at 0.5 h. Each dot represents one gene. (D) Scatterplot comparing the log2-transformed fold change of the two alleles between two biological replicates at 0.5 h. (E) Scatterplot comparing the abundance of cellular mRNA (log2-transformed sum of both alleles) between two biological replicates at 1.5 h. Each dot represents one gene. (F) Scatterplot comparing the log2-transformed fold change of the two alleles between two biological replicates at 1.5 h.

To estimate the allele-specific RNA decay rate in a quantitative manner, we used the reads with unambiguous allelic origin. More specifically, we used only the reads that were mapped on SNP loci within genic regions. After filtering out the SNP loci with potential allelic read mapping bias due to the incomplete SNP annotation in paralogous genes or pseudogenes, 8,815 genes containing at least five SNPs supported with sufficient allelic reads were retained (Fig S2; see the Materials and Methods section for details).

Figure S2. Filtering of SNP loci with potential allelic mapping and assignment biases.

Figure S2.

(A) Scatterplot comparing log2-transformed fold change of gene expression between parental strains and the two alleles in mock F1 hybrid created by mixing parental strain sequencing reads (see the Materials and Methods section for details). (B) Scatterplot comparing log2-transformed fold change of the two alleles between using uniquely mapped reads only and those including also multiple mapped reads.

To identify the genes with significant ASD, we combined a previously published logistic model and a bootstrapping strategy (Andrie et al, 2014; Muzzey et al, 2014). In brief, we assumed an exponential decay model for each allele. For each time point (0 , 0.5 , and 1.5 h after transcriptional arrest), the read counts derived from one allele given the total were modeled by a binomial distribution. After logit transformation, the parameters could be directly estimated using a linear logistic model in which the regression coefficient for time variable represents the mRNA decay rate difference Δλ=λ1λ2 between the two alleles (see the Materials and Methods section for details). To assess the significance of ASD, we then applied a bootstrapping strategy to estimate the confidence of estimated ∆λ. Specifically, for each gene consisting of a list of at least five SNP loci, we generated 5,000 new lists, each consisting of the same number of SNP loci that were chosen at random with replacement from the original list. For each of the 5,000 random lists, ∆λ was estimated using the same logistic model, and altogether yielded a bootstrap distribution, which was then summarized with a mean and a standard deviation. The larger the bootstrap mean deviates from zero, the larger the decay rate diverges between the two alleles. In contrast, lower bootstrap standard deviation gives higher confidence in the estimation of ∆λ. According to the bootstrap mean and standard deviation, the statistical significance of ASD was then determined for each gene. After applying a threshold of Benjamini–Hochberg–adjusted P-value < 0.05 and |Δλ>0.06| in both replicates (FDR = 4.18%; Fig S3), we identified 621 (7.0%) genes exhibiting significant ASD (Fig 2A). Fig 2B shows two representative examples with significant ASD, biased toward the BL6 and the SPRET allele, respectively.

Figure S3. FDR of Δλ estimation.

Figure S3.

FDR (y-axis) was plotted against different Δλ threshold (x-axis) in identifying genes with significant ASD. See the Materials and Methods section for details.

Figure 2. Identification of genes with significant ASD.

Figure 2.

(A) Scatterplot showing the bootstrap means (x-axis) and standard deviations (y-axis) of estimated ASD. Dashed blue lines indicate the Benjamini–Hochberg–adjusted P-value of 0.05 and dashed purple lines indicate a minimum decay rate difference of 0.06. Out of 8,815 genes (black), 621 (red) exhibited significant ASD. (B) Bar plots showing the number of sequencing reads assigned to BL6 (red) or SPRET (blue) alleles (y-axis) at different SNP loci (x-axis) of three time points (0, 0.5, and 1.5 h). BL6 and SPRET allele degraded faster in Armc7 and Rbak genes, respectively. (C) Scatterplot comparing allelic decay rate difference (Δλ) estimated based on Illumina sequencing data (y-axis) to that based on PacBio sequencing (x-axis) for the 25 randomly selected genes. Δλ estimated based on the two technologies was significantly correlated (rPearson=0.93, P-value<2.0×1011).

To assess the accuracy in quantifying ASD based on short Illumina reads, we randomly selected 25 genes for independent experimental validation. Using the PacBio RS system, we deep-sequenced the RT–PCR products amplified from samples collected at 0 and 1.5 h, using primers targeted at the regions with no sequence variants between the two alleles (see the Materials and Methods section). The longer read length allowed the assignment of the PacBio reads to the parental alleles without any ambiguity. Allelic ratios of the read counts were then compared between the two time points. As shown in Fig 2C, the allelic decay rates estimated in this way were significantly correlated with those determined using the Illumina approach (rPearson=0.93, P-value<2.0×1011).

Genomic features that correlate with ASD divergences

The ASD divergences observed in F1 cells should reflect the effect of the sequence variants influencing cis-regulatory elements within the RNA transcripts. To study the potential cis-features accounting for the observed allelic biases, we first calculated the frequencies of sequence variants for the genes with or without significant ASD. As shown in Figs 3A and S4, the genes with significant ASD (621) exhibit significantly higher density of sequence variants than the genes without significant ASD (1,319 control genes); P-value<2.2×1016, two-sided Kolmogorov–Smirnov test (see the Materials and Methods section for details).

Figure 3. Sequence features that were correlated with ASD.

Figure 3.

(A) The cumulative distribution function (CDF) of SNP density (number of SNPs per kb) for genes with significant ASD (red) and without (control genes, blue). Compared with the control genes, the genes with significant ASD showed significantly higher SNP density (P-value ˂ 2.2 × 10–16, two-sided Kolmogorov–Smirnov test). (B) Box plots and scatterplots showing the distribution of miRNA-binding site number difference between the stable and unstable alleles for genes with significant ASD and controls. For controls, the difference centered around zero (P-value = 0.86, two-sided Mann–Whitney U test), whereas in ASD genes, unstable alleles tend to possess more miRNA target sites than the stable alleles (P-value = 1.0 × 10–4, two-sided Mann–Whitney U test). Only the genes with ≥10 miRNA-binding sites combining the two alleles together and ≥1 different sites between the two alleles were used. (C) Violin plots and scatterplots comparing the distribution of the absolute MFE difference (|ΔMFE|) between ASD genes and controls. The horizontal lines indicate the median. Compared with controls, ASD genes exhibited larger allelic differences (P-value = 4.4 × 10–3 two-sided Mann–Whitney U test).

Figure S4. SNP density comparison.

Figure S4.

The cumulative distribution function (CDF) of SNP density (number of SNPs per kb) in 5′ UTR (A), CDS (B), and 3′ UTR (C) for genes with significant ASD (red) and without (control genes, blue). Compared with the control genes, the genes with significant ASD always showed significantly higher SNP density.

Next, we sought to identify the potential cis-elements accounting for such ASD divergences. Given the well-known importance of miRNA in regulating RNA stability, we first focused on the variants affecting miRNA target sites (Bartel, 2004). For this purpose, we predicted for both alleles the target sites of the miRNAs expressed in the F1 fibroblasts using TargetScan (Friedman et al, 2009) (see the Materials and Methods section). Then we compared the number of miRNA-binding sites between the two alleles for the genes with significant ASD and 621 control genes with similar variant density, but without allelic divergence in decay rates, separately (Fig S5; see the Materials and Methods section for selection of these control genes). For the top 50 highly expressed miRNAs, Fig 3B shows that the difference in the number of their binding sites between the stable (slow-decaying) allele and the unstable (fast-decaying) allele centered symmetrically around zero for the control group (the stable allele was randomly selected here). In contrast, for the ASD genes, the distribution is not symmetric: the unstable alleles tend to possess more miRNA target sites than the stable alleles, demonstrating the contribution of allelic differences in miRNA regulation to the observed ASD. The same trend holds true for the top 100 highly expressed miRNAs (Fig S6) and also holds true when predicting miRNA target sites using a different algorithm, miRanda (Enright et al, 2003) (Fig S7). It is known that miRNAs confer the regulation mainly through binding to the targeting sites at 3′ UTR regions. Therefore, we further separated the genes into coding regions, 5′ UTR and 3′ UTR, predicted the miRNA-binding sites, and repeated the same allelic comparison for the three regions separately. Interestingly, the significant contribution of allelic difference in miRNA-binding sites could only be observed for 3′ UTR regions, consistent with the canonical model of miRNA regulation (Fig S8).

Figure S5. Selection of control genes with similar density of sequence variants.

Figure S5.

The cumulative distribution function (CDF) of SNP density (number of SNPs per kb) in whole gene, 5′ UTR, CDS, and 3′ UTR for genes with significant ASD (red) and a group of selected control genes with similar density of sequence variants.

Figure S6. Comparison of miRNA-binding sites using top 100 highly expressed miRNAs.

Figure S6.

Box plots and scatterplots showing the distribution of top 100 highly expressed miRNA-binding site number difference between the stable and unstable alleles for genes with significant ASD and controls. For controls, the difference centered around zero, whereas in ASD genes, unstable alleles tend to possess more miRNA target sites than the stable alleles

Figure S7. Box plots and scatterplots showing the distribution of miRNA-binding site number difference between the stable and unstable alleles for genes with significant ASD and controls estimated using miRanda.

Figure S7.

To validate our findings for miRNA-binding sites using TargetScan (Fig 3B), a similar analysis was performed using a different miRNA-binding site prediction algorithm miRanda (version 3.3a), with parameters miranda mirna.fa target.fa -sc 180 -en 1 -scale 4, in which mirna.fa was downloaded from miRBase (http://www.mirbase.org/). For controls, the difference centered around zero (P-value = 0.77, two-sided Mann–Whitney U test), whereas in ASD genes, unstable alleles tend to possess more miRNA target sites than the stable alleles (P-value = 0.027, two-sided Mann–Whitney U test). Only the genes with ≥10 miRNA-binding sites combining the two alleles together and ≥1 different sites were used.

Figure S8. Comparison of miRNA-binding sites in different regions.

Figure S8.

Box plots and scatterplots showing the distribution of miRNA-binding site number difference between the stable and unstable alleles for genes with significant ASD and controls for top 50 highly expressed miRNAs in 5′ UTR, CDS, and 3′ UTR, respectively.

RNA secondary structure has been reported to regulate RNA decay (Skripkin et al, 1990; Hamilton et al, 1999; Park & Maquat, 2013; Spitale et al, 2015). To check if the sequence variants affecting RNA secondary structures contribute to the observed ASD, we calculated the minimal free energy (MFE) of RNA segments (20-nt flanking each SNP) along the whole transcript for the two alleles separately using RNAfold (Lorenz et al, 2011), and then compared the allelic differences between ASD genes and control genes (see the Materials and Methods section). As shown in Fig 3C, compared with the control genes, the ASD genes indeed exhibited larger allelic differences in MFE values (|ΔMFE|). The trends remain regardless of the length of RNA fragments used for MFE calculation (Fig S9) and also holds true when calculating MFE using a different algorithm, RNAstructure (Bellaousov et al, 2013) (Fig S10). We again separated the genes into coding regions, 5′ UTR and 3′ UTR, and repeated the analysis for the three regions separately. As shown in Fig S11, interestingly, larger allelic MFE differences in ASD genes could be observed in both the CDS regions and 3′ UTR regions, but not in the 5′ UTR regions.

Figure S9. RNA secondary structure comparison using different window sizes.

Figure S9.

Violin plots and scatterplots comparing the distribution of the absolute MFE difference (|ΔMFE|) between ASD genes and controls using a 21-, 61-, 81-, and 101-nt region surrounding SNPs. Compared with controls, ASD genes always exhibited larger allelic differences.

Figure S10. Violin plots and scatterplots comparing the distribution of the absolute MFE difference (|ΔMFE|) between ASD genes and controls using MFE calculated with RNAstructure.

Figure S10.

To validate our findings for MFE using RNAfold (Fig 3C), a similar analysis was performed using a different MFE calculating algorithm RNAstructure (version 6.0.1, http://rna.urmc.rochester.edu/RNAstructure.html) with the following parameters Fold 41nt-window.fa output -MFE. Specifically, for each sequence variant, we calculated the MFE of a 41-nt RNA segments (20-nt flanking each variant) along the whole transcript for the two alleles separately, and then calculated their absolute difference. For each gene, we used the maximum |ΔMFE| among all the variants to represent the allelic difference in mRNA secondary structure. The horizontal lines indicate the median. Compared with controls, ASD genes exhibited larger allelic differences (P-value = 0.042, two-sided Mann–Whitney U test).

Figure S11. RNA secondary structure comparison in different regions.

Figure S11.

Violin plots and scatterplots comparing the distribution of the absolute MFE difference (|ΔMFE|) between ASD genes and controls using SNPs in 5′ UTR, CDS, and 3′ UTR. Compared with controls, ASD genes exhibited larger allelic differences using SNPs in CDS or 3′ UTR but not in 5′ UTR.

In previous studies, a number of additional sequence motifs have also been reported to affect RNA stability. One of such cis-elements is the well-known AU-rich elements (AREs) (Shaw & Kamen, 1986). It has been demonstrated that depending on the RBPs recruited, AREs could either stabilize or destabilize the host RNA transcripts (Garcia-Maurino et al, 2017). To investigate whether AREs also accounted for the ASD observed in this study, we calculated the ARE difference between the two alleles using the program AREScore (Spasic et al, 2012) (see the Materials and Methods section). However, as shown in Fig S12, no significant difference in allelic ARE divergence was observed between the control and ASD gene groups. Codon usage has recently been shown to play an important role in regulating mRNA stability (Bazzini et al, 2016; Mishima & Tmari, 2016). Here, to investigate whether codon usage differences between the two alleles contributed to the ASD observed in this study, we calculated the codon usage biases of the two alleles using codon adaptation index (Sharp & Li, 1987), but we did not observe any significant correlation between the allelic difference in codon usage and the observed ASD (Fig S13; see the Materials and Methods section).

Figure S12. AREScore comparison.

Figure S12.

Box plots and scatterplots showing the distribution of the value (A) and the absolute value (B) of AREScore difference between the stable and unstable alleles for genes with significant ASD and controls. No significant difference was observed between the control and ASD gene group for both value and the absolute value of AREScore allelic difference.

Figure S13. Codon adaption index comparison.

Figure S13.

Box plots and scatterplots showing the distribution of codon adaptation index difference between the stable and unstable alleles for genes with significant ASD and controls. No significant difference was observed between ASD gene and control group.

The role of ASD in the allelic difference of RNA abundances

In the F1 hybrids, the allele-specific bias in RNA abundance (ASA) results from the balance between AST and ASD. Previous studies in yeast using similar hybrid systems have demonstrated that the allelic biases in the two processes often possess opposite effects on the RNA abundance and some of the evolutionary changes in RNA decay are mechanistically coupled with those in RNA transcription (Dori-Bachash et al, 2011). Considering the higher complexity of gene regulation, here based on our dataset, we sought to address in a mammalian system whether and how the two processes, ASD and AST, coordinated with each other. For this purpose, we first investigated the relative contribution of the ASD to the ASA, the latter being estimated based on our poly-A RNA sequencing data collected at 0 h (steady state, before transcription arresting). Using the same bootstrapping strategy on log2 fold change of allelic expression at the same FDR threshold (adjusted P-value < 0.05, allelic divergence greater than twofold, FDR = 4.76%), out of the 8,815 genes for which we could confidently measure ASD, we identified 1,241 genes exhibiting ASA divergence (Figs 4 and S14).

Figure 4. The role of ASD in the allelic difference of RNA abundances.

Figure 4.

Scatterplot comparing each gene's allele-specific expression (log2-transformed fold change at y-axis) and decay (Δλ at x-axis). Dashed gray lines indicate twofold change for gene expression and 0.06 for decay rate difference, respectively (FDR < 0.05). Genes with significant allelic bias at only RNA abundance level, only decay level, and both levels were depicted in green, orange, and purple, respectively.

Figure S14. Identification of genes with significant ASA.

Figure S14.

(A) FDR (y-axis) was plotted against different Δλ threshold (x-axis) in identifying genes with significant ASA. See the Materials and Methods section for details. (B) Scatterplot showing the bootstrap means (x-axis) and standard deviations (y-axis) of estimated ASA. Dashed blue lines indicate the Benjamini–Hochberg–adjusted P-value of 0.05 and dashed black lines indicate twofold divergence of gene expression. Out of 8,815 genes (black), 1,241 (red) exhibited significant ASE.

To study the role of ASD in ASA, we then compared the genes with significant ASA to those with ASD. On one hand, most of the 1,241 genes with significant ASA (1,136 genes, 91.5%) did not exhibit significant ASD (Fig 4), suggesting that cis-divergence in RNA decay did not contribute much to the observed ASA. Instead, the ASA should largely result from the significant allelic biases in RNA transcription. On the other hand, among the 621 ASD genes, most (516 genes, 83.1%) did not exhibit significant ASA (Fig 4). For these genes, allelic bias in RNA transcription and that in RNA decay have opposite effects on the RNA abundances. To avoid the effect of arbitrary thresholds, we used different combinations of FDRs for ASA and ASD. As shown in Fig S15, the trend is consistently observed at different cutoffs.

Figure S15. The role of ASD in the allelic difference of RNA abundances under different combinations of FDR thresholds.

Figure S15.

Bar plots showing the percentage of ASD genes in those with significant ASA (blue bars) and the percentage of ASA genes in those with significant ASD (red bars) at different combinations of FDR thresholds (0.005, 0.0075, 0.01, 0.025, 0.05, 0.075, and 0.1 for ASD and ASA).

Discussion

The cellular abundance of RNA transcripts is determined by the balance between RNA transcription and decay. Therefore, change in RNA expression could arise from genetic variants affecting either/both of the processes. In spite of this, most of the previous studies in the evolution of RNA expression have been largely focused only on transcription. To globally investigate cis-divergence of RNA decay in mammals, we conducted a first genome–wide ASD profiling in a hybrid mouse system, the F1 cross between the BL6 and SPRET inbred mouse strains. Among all the mouse strains with high-quality genome assembly, SPRET has the largest number of sequence variants relative to BL6, which provides a large number of potential regulatory variants between the two strains (Keane et al, 2011). In total, out of 8,815 genes with sufficient data for accurate quantification of allelic difference in RNA decay rates, we identified 621 genes (7.0%) exhibiting significant cis-divergence, indicating widespread cis-divergences in RNA decay.

To distinguish the effects of transcription from those of decay on the changes of RNA abundance, the Tirosh lab investigated the evolutionary divergence in mRNA decay between closely related yeast species and their F1 hybrid (Dori-Bachash et al, 2011). Interestingly, they found that nearly 80% of the genes with differences in both mRNA degradation and steady-state levels and decay and transcription had opposing effects. In a later study by the Akey Lab, comparing the ASD in an F1 hybrid of two genetically diverse yeast strains, a similar phenomenon was observed. These studies suggest that in yeast, RNA transcription and decay are evolved in an opposite manner, indicating strong stabilizing selection for steady-state RNA expression levels. Compared with simple organisms such as Saccharomyces yeasts, much higher complexity is often required in the regulation of gene expression in multicellular species with various organs and cell types. Thus, it is an intriguing question whether the observed evolutionary patterns of RNA transcription and RNA decay in yeast also hold true for higher organisms, such as mammals. In a population study of the human interindividual variations in RNA decay, although a significant proportion (45%) of rdQTLs exhibited opposite effects to those of RNA transcription (inferred from steady-state mRNA expression levels), this proportion is much smaller than that identified from yeast studies (>80%). There are at least four different possible (not necessarily mutually exclusive) scenarios explaining the different observations between yeast and human: 1) Mechanical coupling for opposing effects in transcription and decay is not as prevalent in mammals as in Saccharomyces yeasts. 2) Pai et al (2012) sought to identify the rdQTLs within the set of significant eQTLs. In this case, if the effect of one rdQTL balanced the effects of other variants (such as QTLs on RNA transcription) on the mRNA expression level of the target gene, resulting in no significant variation among the population, then there would be no eQTL identified for this target gene. Consequently, with this study design, compensatory rdQTLs would be largely ignored and the total amount of rdQTLs as well as the proportion of rdQTLs with opposing effects to transcription remained largely underestimated. 3) Gene expression regulation in yeasts and mammals evolved along different trajectories, with stronger stabilizing selection in Saccharomyces than mammals. Such scenario would be consistent with the vastly greater effective population size of yeasts relative to that of mammals. 4) The divergence time and reproductive isolation between two yeast species or strains is much larger than between variants stemming from the same human population. Therefore, no evidence for compensatory evolution would be expected in the latter case, assuming random mating with regard to the QTLs studied and the absence of population substructure.

In this study, comparing the genes with significant ASD to those with significant ASA, we observed that the majority (1,136 out of 1,241, 91.5%) of the genes exhibiting ASA showed no significant ASD, indicating that allelic difference in transcription should be the predominant contributor to the observed ASA. This observation is in agreement with the previous human QTL study of RNA decay, in which the authors found that most (84.5%) of the identified eQTLs (expressed RNA abundance QTLs) were not rdQTLs (Pai et al, 2012). Taken together, it is likely, in mammals, that most of the divergence on the cellular RNA abundances results from the changes of RNA transcription. Interestingly and more importantly, we observed that 83.1% of the genes with significant ASD did not show allelic biases in RNA abundances, suggesting cis-divergences on RNA transcription and decay in these genes have opposite effects on RNA abundance. This indicates that pervasive opposing effects between transcription and decay observed in yeast also exist in mammals. The second scenario discussed above most likely explained the observation in the previous human QTL study.

The opposite cis-divergent effect could result from two possible scenarios. First, a mechanistic coupling between RNA transcription and decay, where the same cis change simultaneously leads to an increase (decrease) in transcription and an increase (decrease) in RNA decay. Second, to stabilize the RNA abundance, a change causing increased (decreased) transcription (or decay) is followed by an independent change causing increased (decreased) decay (or transcription). By comparing the parental differences and the allelic differences for both RNA transcription and RNA decay in the yeast hybrid system, Dori-Bachash et al (2011) distinguished the cis-/trans-origins of the divergences in RNA transcription and RNA decay. Interestingly, for those genes with opposite effects on RNA transcription and decay, the divergences of RNA transcription and decay often originated either both from cis or both from trans, suggesting that these opposite divergences might result from the same genetic variants, thus mechanistically coupled. Further analyses indeed suggested that the changes in some trans-factors (such as Rpb4/7 and Ccr4-Not protein complexes) might be involved in the coupled evolution of RNA transcription and decay in yeast, a clear demonstration of the first scenario (Dori-Bachash et al, 2011). However, here in our system, to what extent the two scenarios account for the coordinated evolution of RNA transcription and decay awaits future functional studies.

cis-Divergence in RNA decay should result solely from the sequence variants on the mRNA transcripts affecting cis-regulatory elements (e.g., miRNA-binding sites). Therefore, it would be possible to investigate the regulatory mechanisms underlying the cis-divergence in RNA decay by analyzing the sequence differences of ASD genes between the two alleles. Indeed, by such analysis, we demonstrated that sequence variants affecting miRNA binding could contribute to the observed ASD divergence. In contrast, in our previous analysis of allele-specific translation efficiency using the same F1 cells, we did not observe the significant impact of miRNA binding on translation, indicating, at least in the cellular system as used in our studies, that miRNAs regulate gene expression mostly through RNA degradation (Hou et al, 2015). In addition to miRNA-binding sites, our sequence analysis also revealed that variants affecting RNA secondary structures could also lead to the cis-divergence in RNA decay. Interestingly, in contrast to miRNA-binding sites, we did not observe between the two alleles the significant correlation (or anti-correlation) between the stability of RNA secondary structure and the rate of RNA decay (Fig S16). This might reflect the fact that different double-strand RBPs could either accelerate or decelerate RNA decay. For example, it has been shown that Staufen1 could bind to RNA duplexes and trigger the degradation of the bound RNAs (Park & Maquat, 2013), whereas HNRPA2B1 could bind to specific RNA secondary structures and thereafter stabilize the host transcripts (Hamilton et al, 1999).

Figure S16. RNA secondary structure comparison.

Figure S16.

Violin plots and scatterplots comparing the distribution of the MFE difference (ΔMFE) between the stable and unstable alleles for genes with significant ASD and controls. No significant correlation (or anti-correlation) between the stability of RNA secondary structure and the rate of RNA decay was observed.

Surprisingly, we did not find any significant impact of several known cis-regulatory features on the observed allelic biases in RNA decay, such as ARE and codon usage. A possible explanation is that ASD might be due to the combined effects of a large set of diverse mechanisms, and the individual contributions of these specific features with lower frequencies and/or smaller effect sizes might not be sufficient to reach statistical significance.

Finally, this study served as a first proof-of-principle investigation that used a mammalian F1 hybrid system to globally analyze the cis-divergences of RNA decay. One caveat of this study is that the conclusions were drawn from the results observed in mouse fibroblast cells. Thus, one future research direction would be to investigate whether our observations would remain the same in other mammalian tissues and cells. Furthermore, it has been shown that RNA decay plays more important roles during the response to extrinsic or intrinsic stimuli. Thus, future studies using our F1 system under those dynamic conditions would reveal more novel insights into the molecular mechanisms underlying the evolution of RNA decay in mammals.

Materials and Methods

F1(B×S) hybrid mouse fibroblast cell cultures

The F1(B×S) hybrid mice were obtained as described before (Gao et al, 2013). Adult mouse fibroblast cells were isolated and cultured according to the protocol from Encyclopedia of DNA Elements project (https://genome.ucsc.edu/encode/protocols/cell/mouse/Fibroblast_Stam_protocol.pdf) with modification of cell culture medium (RPMI 1640 Medium, GlutaMAX Supplement [Gibco; Life Technologies] with 0.5% FBS and 1% Penicillin/Streptomycin Solution).

Actinomycin D treatment and RNA sequencing

Actinomycin d (10 mg/ml, Sigma-Aldrich) was directly added to cell cultures. Cells were collected at 0, 0.5, and 1.5 h after the addition of actinomycin D. Total RNA from the collected cell samples was extracted using TriZOL reagent (Life Technologies) following the manufacturer's protocol. Stranded mRNA sequencing libraries were prepared with 500 ng total RNA according to the manufacturer's protocol (Illumina). The libraries were sequenced in a 2 × 100 +7 manner on a HiSeq 2000 platform (Illumina).

Reference sequences and gene annotation

The reference sequences and the Ensembl gene annotation of the C57BL/6J genome (mm10) were downloaded from the Ensembl FTP server (http://ftp.ensembl.org, version GRCm38, release 74). The RefSeq gene annotation was downloaded from the University of California, Santa Cruz (UCSC), genome browser (http://hgdownload.soe.ucsc.edu/goldenPath/mm10/database/). The single nucleotide variants and indels between BL6 and SPRET were downloaded from the Mouse Genome Project Web site (http://www.sanger.ac.uk/). The vcf2diploid tool (version 0.2.6) in the AlleleSeq pipeline was used to construct SPRET genome by incorporating the single nucleotide variants and indels into BL6 genome (Rozowsky et al, 2011). The chain file between the two genomes was also reported as an output, which was further used with the UCSC liftOver tool. The liftOver tool from the UCSC Genome Browser (Kuhn et al, 2013) was applied to get SPRET gene annotation.

Allele-specific sequencing read alignment

Flexbar was first used to trim RNA-seq reads that pass the Illumina filter to remove Illumina adapter sequences with parameters -x 6 -u 0 -m 50 -ae RIGHT -at 3 (Dodt et al, 2012). Read pairs that were concordantly mapped to the reference sequences of rRNA, tRNA, snRNA, snoRNA, and miscRNAs (available from Ensembl and RepeatMasker annotation) using Bowtie2 (version 2.1.0) with default parameters (in end-to-end and sensitive mode) were excluded.

The remaining reads were then aligned to the mouse genome reference sequences (see above) using TopHat (version 2.0.8) with default mapping parameters and Ensembl gene annotation (Trapnell et al, 2009). Concordantly mapped read pairs (i.e., mates of a read pair mapped to the same transcript with opposite orientation) were then assigned to the parental allele with less mapping edit distance; read pairs with equal edit distance to either allele were assigned as “common.” Read pairs that mapped to sex chromosomes and mitochondrial DNA were excluded for further analysis. Genomic alignment coordinates for reads from the SPRET/EiJ allele were then converted to the corresponding locations in the C57BL/6J reference genome using the UCSC liftOver tool and their chain files.

Filtering of SNP loci with potential allelic mapping and assignment biases

To estimate ASD, only the reads that could be unambiguously assigned to SNP loci from either allele were counted (see above). To avoid bias due to the potential misalignment of reads to the wrong allele, we used previously published datasets generated from fibroblast cell lines of the two parental strains (Gao et al, 2015). Specifically, we first created a mock F1 hybrid RNA-seq dataset by combining equal amounts of RNA-seq reads derived from the parental strains. We then performed the same alignment analysis as described above on the mock F1 hybrid and the two parental strain datasets. For each SNP locus, the numbers of reads assigned to the parent strains (in the original datasets) or specifically to the parental alleles (in the mock datasets) were then counted and compared and Fisher's exact test was used to filter the SNP loci with potential bias (P-value < 0.05, after Benjamini–Hochberg correction for multiple testing).

Because of potentially incomplete annotation of SNPs at paralogous genes or pseudogenes in the SPRET/EiJ genome, some reads, which could be mapped to multiple gene loci if the C57BL/6J sequence was used as a reference, were mapped to a unique position in the SPRET/EiJ genome. In such cases, removal of multiple mapped reads (only from C57BL/6J allele) could lead to inaccurate calculation of ASD. To avoid such bias, for each SNP locus, based on the mock datasets, we compared the ratio of allele-specific reads, including multiple mapped reads, with that counting only uniquely mapped reads. Fisher's exact test was used to filter the SNP loci with potential bias (P-value < 0.05, after Benjamini–Hochberg correction for multiple testing).

Estimation of allelic differences in mRNA decay rate

After SNP loci filtering (see above), only the genes with at least five SNPs supported by sufficient allelic reads in all different time course samples (i.e., mRNA0 h, BL6 + mRNA0 h, SPRET ≥ 10 and mRNA0.5 h, BL6 + mRNA0.5 h, SPRET ≥ 10 and mRNA1.5 h, BL6 + mRNA1.5 h, SPRET ≥ 10 and mRNA0 h, BL6 + mRNA0.5 h, BL6 + mRNA1.5 h, BL6 ≥ 15 and mRNA0 h, SPRET + mRNA0.5 h, SPRET + mRNA1.5 h, SPRET ≥ 15) were considered for further analysis.

To determine whether a gene exhibited allelic differences in mRNA decay rate, we combined a previously published logistic model and a bootstrapping strategy (Andrie et al, 2014; Muzzey et al, 2014). Specifically, we let Ni(t) be the number of mRNA transcripts for allele i (i = 1, 2, representing BL6 and SPRET) at time t. We assumed an exponential decay dNi(t)dt=λdNi(t) for a constant λ, such that Ni(t)=Ni(0)exp(λt). For each time point t, the number of RNA-seq reads that we can assign to an allele ni(t) is a fraction f(t), of the total number of mRNA transcripts for that allele, such that ni(t)=f(t)Ni(t). We then assumed the model ni(t)Poisson[f(t)Ni(t)]. Under this model, the distribution of the counts for strain 1 (BL6) given the total is binomial:

p(t)=f(t)N1(t)f(t)N1(t)+f(t)N2(t)=N1(0)N2(0)exp([λ1λ2]t)N1(0)N2(0)exp([λ1λ2]t)+1.

Taking the log it gives:

log(p(t)1p(t))=log(N1(0)N2(0))[λ1λ2]t=α+βt.

In this linear logistic model, the mRNA decay rate differences between the two alleles can be directly estimated using the parameter β. The parameter exp(β) represents the change in the odds of observing an mRNA allele of the strain 1 type, given a 1-h increase in time (t is measured in hours). If decay rates are the same in both strains (λ1=λ2), then β = 0.

To assess the uncertainty of estimated mRNA decay rate differences, a bootstrapping procedure was applied (Muzzey et al, 2014). Specifically, for each gene consisting of a list of n (n ≥ 5) SNP loci, we generated 5,000 new lists, each consisting of n SNP loci that were chosen at random with replacement from the original list. For each of the 5,000 random lists, mRNA decay rate differences between the two alleles were estimated using the above logistic model, and then yielded a bootstrap distribution, from which we got the bootstrapping mean and standard deviation. To determine the statistical significance of genes with ASD, we calculated a P-value based on the z-score that represented how many folds of standard deviation the bootstrapping mean deviated from zero. The raw P-values were then adjusted using the Benjamini–Hochberg method. To estimate the FDR, we used a similar permutation strategy as described before (Sterne-Weiler et al, 2013). In brief, gene labels were shuffled for 100 times in both replicates, and in each of the 100 shuffled sets, we calculated the number of genes in both replicates meeting the bootstrapping significance requirement (adjusted P-value < 0.05) and decay rate difference requirement (β=|λ1λ2|>x), and biased toward the same allele. Then, for each of the 100 permutations of each value x, the FDR was estimated as false positives divided by the number of real genes passing the same threshold. Finally, Benjamini–Hochberg–adjusted P-value < 0.05 and |Δλ|>0.06 in both replicates (FDR = 4.18%) was used as the threshold for determining whether a gene exhibited significant allelic differences in mRNA decay rate.

PacBio sequencing and data analysis

Starting with 500 ng of total RNA, DNase treatments were first performed according to the manufacturer's protocol (TURBO DNA-free kit; Thermo Fisher Scientific) for samples collected at 0 and 1.5 h after actinomycin D treatment. Reverse transcription (RT) reactions were followed using random hexamer primers (Thermo Fisher Scientific) and SuperScript II reverse transcriptase (Thermo Fisher Scientific). PCR reactions were then performed using 1 μl of RT products as template in 50 μl of GoTaq PCR system (Promega). PCR primers were designed for amplifying the genic region containing sequence variants between B6 and SP transcripts. All primer sequences are listed in Table S2. The PCR program was as follows: 4 min at 95 °C; followed by 28 cycles of 30 s at 95 °C, 30 s at 55 °C, and 45 at 72 °C; and a final elongation of 10 min at 72 °C. Different PCR products from the same RT product using different primers were then mixed and purified using Agencourt AMPure XP system (Beckman Coulter) and quantified by Qubit HS dsDNA measurement system (Life Technologies). These mixed PCR products were then sequenced on PacBio RS SMRT platform according to the manufacturer's instruction.

Table S2 PCR primers for PacBio validation. (21.8KB, docx)

Sequence reads from the PacBio RS SMRT chip were processed through PacBio's SMRT-Portal analysis suite to generate circular consensus sequences. The circular consensus sequences were then mapped to a reference database containing both alleles of target genes using BLAST with default parameters. The best hit was retained for each aligned sequence read. These reads were then assigned to C57BL/6J or SPRET/EiJ allele with fewer mismatches. The numbers of reads assigned to either allele of each gene at 0 and 1.5 h were counted, respectively. The following equation was used to estimate ASD:

ASD=log2(mRNA1.5h, BL6/mRNA1.5h,SPRETmRNA0h,BL6/mRNA0h,SPRET).

Selection of control genes without ASD

To compare with the genes exhibiting ASD, we selected a separate group of control genes that were also supported by sufficient allelic reads (see above) but did not show any difference in decay rate between the two alleles: 1) P-value from bootstrapping analysis >0.05 for both replicates; 2) |Δλ|<0.03 for both replicates; 3) bootstrapping deviation <0.1 (i.e., 95% quantile of all genes) for both replicates.

To analyze the sequence features of the genes exhibiting ASD, we further selected a subset of these control genes, which possessed similar density of sequence variants as ASD genes, to avoid the potential bias due to the different variant densities between ASD and control genes. Specifically, based on the distribution of sequence variant density across the whole transcript in ASD genes, we randomly selected from all the control genes a subset with the same variant density distribution as ASD genes.

Local RNA secondary structure

Local RNA secondary structure MFE was calculated using RNAfold from ViennaRNA package version 2.1.9 with default parameters at a temperature of 37 °C (Lorenz et al, 2011). Specifically, we calculated the MFE of an i-nt region (i = 21, 41, 61, 81, and 101) flanking each SNP between C57BL/6J allele and SPRET/EiJ allele. The variant of interest was placed at the center of each window if the whole window was within the transcript; however, if the variant was <(i – 1)/2 nt (e.g., 20 bp for i = 41) from the end of the RNA transcript, the i-nt window was shifted such that its boundary lay at the end of the transcript. We then calculated the absolute difference of MFE between the two alleles (|ΔMFE|) for each SNP. For each gene or each region (including 5′ UTR, CDS, and 3′ UTR), we used the maximum |ΔMFE|among all its SNP loci to represent the allelic difference in mRNA secondary structure.

Other sequence features, including miRNA-binding sites, codon usage bias, and AREs

miRNA target sites in each gene were counted using a custom Perl script by matching three site-types (i.e., 8mer, 7mer-m8, and 7mer-1A) using TargetScan v7 (Friedman et al, 2009) as previously described (Hou et al, 2015). For both control and ASD gene groups, we used only the genes with ≥10 miRNA-binding sites combining the two alleles together and ≥1 different sites between the two alleles.

The effects of AREs were estimated using the AREScore algorithm with default parameters (Spasic et al, 2012). Briefly, AREScore calculates a score based on the number of AUUUA pentamers, the distance between these pentamers, and whether they are located within an AU-block. The 3′ UTR sequences for either allele of each gene were submitted to AREScore web server (http://arescore.dkfz.de/arescore.pl).

Codon usage bias was estimated using the CAI calculated using CodonW version 1.4.4 (http://codonw.sourceforge.net/). For each gene, the coding sequence (CDS) of each allele was used as input for CodonW.

In the analysis of these sequence features, the difference between the stable (slow-decaying) and unstable (fast-decaying) alleles was calculated for ASD and control genes (see the selection of control genes without ASD section for details) separately. RefSeq annotation was used to separate each coding gene into 5′ UTR, CDS, and 3′ UTR. For the genes with multiple isoforms, the longest one was used. When 5′ UTR (CDS and 3′ UTR) region is considered, only the genes with 5′ UTR (CDS and 3′ UTR) are used. Note that some genes do not have annotated 5′ UTR or 3′ UTR, and noncoding RNAs are not considered when separating genes into different regions.

Data Availability

The RNA-seq data from this publication have been submitted to the European Nucleotide Archive (http://www.ebi.ac.uk/ena) and assigned the accession no. ERP017147.

Supplemental Information

Supplementary Information is available at https://doi.org/10.26508/lsa.201800052.

Supplementary Material

Reviewer comments

Acknowledgements

We thank Dr. Xi Wang for sharing the script for miRNA analysis and helpful discussions. W Chen was supported by National Natural Science Foundation of China (31771443), basic research grant from Science and Technology Innovation Commission of Shenzhen Municipal Government (JCYJ20170307105752508), China Thousand Talent Program, startup funds from Southern University of Science and Technology, and Peacock Plan of Shenzhen Municipal Government. W Sun and Q Gao were supported by the Chinese Scholarship Council.

Author Contributions

  • W Sun: conceptualization, validation, methodology, writing—original draft, review, and editing, experiments.

  • Q Gao: conceptualization, resources, software, methodology, and writing—original draft, review, and editing.

  • B Schaefke: data curation and writing—original draft, review, and editing.

  • Y Hu: data curation, formal analysis, and writing—original draft, review, and editing.

  • W Chen: conceptualization, supervision, writing—original draft, project administration, and writing—review and editing.

Conflict of Interest Statement

The authors declare that they have no conflict of interest.

References

  1. Albert FW, Kruglyak L (2015) The role of regulatory variation in complex traits and disease. Nat Rev Genet 16: 197–212. 10.1038/nrg3891 [DOI] [PubMed] [Google Scholar]
  2. Andrie JM, Wakefield J, Akey JM (2014) Heritable variation of mRNA decay rates in yeast. Genome Res 24: 2000–2010. 10.1101/gr.175802.114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bartel DP. (2004) MicroRNAs: Genomics, biogenesis, mechanism, and function. Cell 116: 281–297. 10.1016/s0092-8674(04)00045-5 [DOI] [PubMed] [Google Scholar]
  4. Bazzini AA, Del Viso F, Moreno-Mateos MA, Johnstone TG, Vejnar CE, Qin Y, Yao J, Khokha MK, Giraldez AJ (2016) Codon identity regulates mRNA stability and translation efficiency during the maternal-to-zygotic transition. EMBO J 35: 2087–2103. 10.15252/embj.201694699 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bellaousov S, Reuter JS, Seetin MG, Mathews DH (2013) RNAstructure: Web servers for RNA secondary structure prediction and analysis. Nucleic Acids Res 41: W471–W474. 10.1093/nar/gkt290 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Caput D, Beutler B, Hartog K, Thayer R, Brown-Shimer S, Cerami A (1986) Identification of a common nucleotide sequence in the 3′-untranslated region of mRNA molecules specifying inflammatory mediators. Proc Natl Acad Sci USA 83: 1670–1674. 10.1073/pnas.83.6.1670 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Dejager L, Libert C, Montagutelli X (2009) Thirty years of Mus spretus: A promising future. Trends Genet 25: 234–241. 10.1016/j.tig.2009.03.007 [DOI] [PubMed] [Google Scholar]
  8. Dodt M, Roehr JT, Ahmed R, Dieterich C (2012) FLEXBAR-flexible barcode and adapter processing for next-generation sequencing platforms. Biology (Basel) 1: 895–905. 10.3390/biology1030895 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Dolken L, Ruzsics Z, Radle B, Friedel CC, Zimmer R, Mages J, Hoffmann R, Dickinson P, Forster T, Ghazal P, et al. (2008) High-resolution gene expression profiling for simultaneous kinetic parameter analysis of RNA synthesis and decay. RNA 14: 1959–1972. 10.1261/rna.1136108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Dori-Bachash M, Shalem O, Manor YS, Pilpel Y, Tirosh I (2012) Widespread promoter-mediated coordination of transcription and mRNA degradation. Genome Biol 13: R114 10.1186/gb-2012-13-12-r114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Dori-Bachash M, Shema E, Tirosh I (2011) Coupled evolution of transcription and mRNA degradation. PLoS Biol 9: e1001106 10.1371/journal.pbio.1001106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Enright AJ, John B, Gaul U, Tuschl T, Sander C, Marks DS (2003) MicroRNA targets in Drosophila. Genome Biol 5: R1 10.1186/gb-2003-5-1-r1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Friedman RC, Farh KKH, Burge CB, Bartel DP (2009) Most mammalian mRNAs are conserved targets of microRNAs. Genome Research 19: 92–105. 10.1101/gr.082701.108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Gao Q, Sun W, Ballegeer M, Libert C, Chen W (2015) Predominant contribution of cis-regulatory divergence in the evolution of mouse alternative splicing. Mol Syst Biol 11: 816 10.15252/msb.20145970 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Gao Q, Sun W, You X, Froehler S, Chen W (2013) A systematic evaluation of hybridization-based mouse exome capture system. BMC Genomics 14: 492 10.1186/1471-2164-14-492 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Garcia-Maurino SM, Rivero-Rodriguez F, Velazquez-Cruz A, Hernandez-Vellisca M, Diaz-Quintana A, De la Rosa MA, Diaz-Moreno I (2017) RNA Binding Protein Regulation and Cross-Talk in the Control of AU-rich mRNA Fate. Front Mol Biosci 4: 71 10.3389/fmolb.2017.00071 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Garneau NL, Wilusz J, Wilusz CJ (2007) The highways and byways of mRNA decay. Nat Rev Mol Cell Biol 8: 113–126. 10.1038/nrm2104 [DOI] [PubMed] [Google Scholar]
  18. Hamilton BJ, Nichols RC, Tsukamoto H, Boado RJ, Pardridge WM, Rigby WF (1999). hnRNP A2 and hnRNP L bind the 3'UTR of glucose transporter 1 mRNA and exist as a complex in vivo. Biochem Biophys Res Commun 261: 646–651. 10.1006/bbrc.1999.1040 [DOI] [PubMed] [Google Scholar]
  19. Hao S, Baltimore D (2009) The stability of mRNA influences the temporal order of the induction of genes encoding inflammatory molecules. Nat Immunol 10: 281–288. 10.1038/ni.1699 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Hou J, Wang X, McShane E, Zauber H, Sun W, Selbach M, Chen W (2015) Extensive allele-specific translational regulation in hybrid mice. Mol Syst Biol 11: 825 10.15252/msb.156240 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Ivanov P, Anderson P (2013) Post-transcriptional regulatory networks in immunity. Immunol Rev 253: 253–272. 10.1111/imr.12051 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Keane TM, Goodstadt L, Danecek P, White MA, Wong K, Yalcin B, Heger A, Agam A, Slater G, Goodson M, et al. (2011) Mouse genomic variation and its effect on phenotypes and gene regulation. Nature 477: 289–294. 10.1038/nature10413 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Khabar KS. (2017) Hallmarks of cancer and AU-rich elements. Wiley Interdiscip Rev RNA 8 10.1002/wrna.1368 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Kuhn RM, Haussler D, Kent WJ (2013) The UCSC genome browser and associated tools. Brief Bioinform 14: 144–161. 10.1093/bib/bbs038 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Lorenz R, Bernhart SH, Honer Zu Siederdissen C, Tafer H, Flamm C, Stadler PF, Hofacker IL (2011) ViennaRNA package 2.0. Algorithms Mol Biol 6: 26 10.1186/1748-7188-6-26 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Mendell JT, Sharifi NA, Meyers JL, Martinez-Murillo F, Dietz HC (2004) Nonsense surveillance regulates expression of diverse classes of mammalian transcripts and mutes genomic noise. Nat Genet 36: 1073–1078. 10.1038/ng1429 [DOI] [PubMed] [Google Scholar]
  27. Mishima Y, Tomari Y (2016) Codon usage and 3′ UTR length determine maternal mRNA stability in zebrafish. Mol Cell 61: 874–885. 10.1016/j.molcel.2016.02.027 [DOI] [PubMed] [Google Scholar]
  28. Muzzey D, Sherlock G, Weissman JS (2014) Extensive and coordinated control of allele-specific expression by both transcription and translation in Candida albicans. Genome Res 24: 963–973. 10.1101/gr.166322.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Necsulea A, Kaessmann H (2014) Evolutionary dynamics of coding and non-coding transcriptomes. Nat Rev Genet 15: 734–748. 10.1038/nrg3802 [DOI] [PubMed] [Google Scholar]
  30. Pai AA, Cain CE, Mizrahi-Man O, De Leon S, Lewellen N, Veyrieras JB, Degner JF, Gaffney DJ, Pickrell JK, Stephens M, et al. (2012) The contribution of RNA decay quantitative trait loci to inter-individual variation in steady-state gene expression levels. PLoS Genet 8: e1003000 10.1371/journal.pgen.1003000 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Park E, Maquat LE (2013) Staufen-mediated mRNA decay. Wiley Interdiscip Rev RNA 4: 423–435. 10.1002/wrna.1168 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Patel N, Khan AO, Al-Saif M, Moghrabi WN, AlMaarik BM, Ibrahim N, Abdulwahab F, Hashem M, Alshidi T, Alobeid E, et al. (2017) A novel mechanism for variable phenotypic expressivity in Mendelian diseases uncovered by an AU-rich element (ARE)-creating mutation. Genome Biol 18: 144 10.1186/s13059-017-1274-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Puimege L, Van Hauwermeiren F, Steeland S, Van Ryckeghem S, Vandewalle J, Lodens S, Dejager L, Vandevyver S, Staelens J, Timmermans S, et al. (2015) Glucocorticoid-induced microRNA-511 protects against TNF by down-regulating TNFR1. EMBO Mol Med 7: 1004–1017. 10.15252/emmm.201405010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Rabani M, Levin JZ, Fan L, Adiconis X, Raychowdhury R, Garber M, Gnirke A, Nusbaum C, Hacohen N, Friedman N, et al. (2011) Metabolic labeling of RNA uncovers principles of RNA production and degradation dynamics in mammalian cells. Nat Biotechnol 29: 436–442. 10.1038/nbt.1861 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Rabani M, Raychowdhury R, Jovanovic M, Rooney M, Stumpo DJ, Pauli A, Hacohen N, Schier AF, Blackshear PJ, Friedman N, et al. (2014) High-resolution sequencing and modeling identifies distinct dynamic RNA regulatory strategies. Cell 159: 1698–1710. 10.1016/j.cell.2014.11.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Raghavan A, Ogilvie RL, Reilly C, Abelson ML, Raghavan S, Vasdewani J, Krathwohl M, Bohjanen PR (2002) Genome-wide analysis of mRNA decay in resting and activated primary human T lymphocytes. Nucleic Acids Res 30: 5529–5538. 10.1093/nar/gkf682 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Rodningen OK, Tonstad S, Ose L, Berg K, Leren TP (1998) Effects of a 9.6-kb deletion of the LDL receptor gene (FH Helsinki) on structure and levels of mRNA. Hum Mutat 12: 95–102. [DOI] [PubMed] [Google Scholar]
  38. Rozowsky J, Abyzov A, Wang J, Alves P, Raha D, Harmanci A, Leng J, Bjornson R, Kong Y, Kitabayashi N, et al. (2011) AlleleSeq: Analysis of allele-specific expression and binding in a network framework. Mol Syst Biol 7: 522 10.1038/msb.2011.54 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Schwanhausser B, Busse D, Li N, Dittmar G, Schuchhardt J, Wolf J, Chen W, Selbach M (2011). Global quantification of mammalian gene expression control. Nature 473: 337–342. 10.1038/nature10098 [DOI] [PubMed] [Google Scholar]
  40. Sharp PM, Li WH (1987) The codon Adaptation Index—A measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res 15: 1281–1295. 10.1093/nar/15.3.1281 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Shaw G, Kamen R (1986) A conserved AU sequence from the 3′ untranslated region of GM-CSF mRNA mediates selective mRNA degradation. Cell 46: 659–667. 10.1016/0092-8674(86)90341-7 [DOI] [PubMed] [Google Scholar]
  42. Skripkin EA, Adhin MR, de Smit MH, van Duin J (1990) Secondary structure of the central region of bacteriophage MS2 RNA. Conservation and biological significance. J Mol Biol 211: 447–463. 10.1016/0022-2836(90)90364-r [DOI] [PubMed] [Google Scholar]
  43. Spasic M, Friedel CC, Schott J, Kreth J, Leppek K, Hofmann S, Ozgur S, Stoecklin G (2012) Genome-wide assessment of AU-rich elements by the AREScore algorithm. PLoS Genet 8: e1002433 10.1371/journal.pgen.1002433 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Spitale RC, Flynn RA, Zhang QC, Crisalli P, Lee B, Jung JW, Kuchelmeister HY, Batista PJ, Torre EA, Kool ET, et al. (2015) Structural imprints in vivo decode RNA regulatory mechanisms. Nature 519: 486–490. 10.1038/nature14263 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Sterne-Weiler T, Martinez-Nunez RT, Howard JM, Cvitovik I, Katzman S, Tariq MA, Pourmand N, Sanford JR (2013) Frac-seq reveals isoform-specific recruitment to polyribosomes. Genome Res 23: 1615–1623. 10.1101/gr.148585.112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Trapnell C, Pachter L, Salzberg SL (2009) TopHat: Discovering splice junctions with RNA-seq. Bioinformatics 25: 1105–1111. 10.1093/bioinformatics/btp120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Vlasova IA, Tahoe NM, Fan D, Larsson O, Rattenbacher B, Sternjohn JR, Vasdewani J, Karypis G, Reilly CS, Bitterman PB, et al. (2008) Conserved GU-rich elements mediate mRNA decay by binding to CUG-binding protein 1. Mol Cell 29: 263–270. 10.1016/j.molcel.2007.11.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Wang G, van der Walt JM, Mayhew G, Li YJ, Zuchner S, Scott WK, Martin ER, Vance JM (2008) Variation in the miRNA-433 binding site of FGF20 confers risk for Parkinson disease by overexpression of alpha-synuclein. Am J Hum Genet 82: 283–289. 10.1016/j.ajhg.2007.09.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Xia J, Scherer SW, Cohen PT, Majer M, Xi T, Norman RA, Knowler WC, Bogardus C, Prochazka M (1998) A common variant in PPP1R3 associated with insulin resistance and type 2 diabetes. Diabetes 47: 1519–1524. 10.2337/diabetes.47.9.1519 [DOI] [PubMed] [Google Scholar]
  50. Xia Z, Ghildyal N, Austen KF, Stevens RL (1996) Post-transcriptional regulation of chymase expression in mast cells. A cytokine-dependent mechanism for controlling the expression of granule neutral proteases of hematopoietic cells. J Biol Chem 271: 8747–8753. 10.1074/jbc.271.15.8747 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table S2 PCR primers for PacBio validation. (21.8KB, docx)

Reviewer comments

Data Availability Statement

The RNA-seq data from this publication have been submitted to the European Nucleotide Archive (http://www.ebi.ac.uk/ena) and assigned the accession no. ERP017147.


Articles from Life Science Alliance are provided here courtesy of Life Science Alliance LLC

RESOURCES