Skip to main content
eLife logoLink to eLife
. 2025 Jun 6;14:RP104074. doi: 10.7554/eLife.104074

Upstream open reading frames buffer translational variability during Drosophila evolution and development

Yuanqiang Sun 1,, Yuange Duan 1,, Peixiang Gao 1, Chenlu Liu 1, Kaichun Jin 1, Shengqian Dou 2, Wenxiong Tang 1, Hong Zhang 3, Jian Lu 1,4,5,
Editors: Wenfeng Qian6, Alan M Moses7
PMCID: PMC12143884  PMID: 40478227

Abstract

Protein abundance tends to be more evolutionarily conserved than mRNA levels both within and between species, yet the mechanisms underlying this phenomenon remain largely unknown. Upstream open reading frames (uORFs) are widespread cis-regulatory elements in eukaryotic genomes that regulate translation, but it remains unclear whether and how uORFs contribute to stabilizing protein levels. In this study, we performed ribosome translation simulations on mRNA to quantitatively assess the extent to which uORF translation influences the translational variability of downstream coding sequences (CDSs) across varying contexts. Our simulations revealed that uORF translation dampens CDS translational variability, with buffering capacity increasing in proportion to uORF translation efficiency, length, and number. We then compared the translatomes at different developmental stages of two Drosophila species, demonstrating that uORFs buffer mRNA translation fluctuations during both evolution and development. Experimentally, deleting a uORF in the bicoid (bcd) gene—a prominent example of translational buffering—resulted in extensive changes in gene expression and phenotypes in Drosophila melanogaster. Additionally, we observed uORF-mediated buffering between primates and within human populations. Together, our results reveal a novel regulatory mechanism by which uORFs stabilize gene translation during development and across evolutionary time.

Research organism: Drosophila melanogaster, Drosophila simulans, Humans

Introduction

Organisms have evolved various strategies for the spatiotemporal regulation of gene expression (Buccitelli and Selbach, 2020; Lee and Young, 2013; Macneil and Walhout, 2011). This is important because aberrant gene expression can result in phenotypic defects or diseases, while the variation and evolution of gene expression patterns frequently promote phenotypic diversification and adaptation (Hill et al., 2021). Although variations in mRNA abundance are widely observed within or between species, protein abundance tends to show stronger evolutionary constraint (Signor and Nuzhdin, 2018; Vogel, 2013), as observed in yeasts (Artieri and Fraser, 2014; McManus et al., 2014; Wang et al., 2015), primates (Khan et al., 2013; Wang et al., 2018), and other organisms (Kusnadi et al., 2022; Laurent et al., 2010; Schrimpf et al., 2009). Nevertheless, the molecular mechanisms by which the conservation of protein abundance across species is achieved are largely unknown (Signor and Nuzhdin, 2018; Vogel, 2013).

Eukaryotic mRNA translation is a crucial step in gene expression and is highly regulated by multilayered mechanisms (Jackson et al., 2010; Sonenberg and Hinnebusch, 2009; Teixeira and Lehmann, 2019). Upstream open reading frames (uORFs), which are short open reading frames in the 5-terminal untranslated regions (5′ UTRs) of eukaryotic mRNAs, play crucial roles in regulating mRNA translation. Approximately 50% of eukaryotic genes contain uORFs (Zhang et al., 2021), and their evolution has been tightly shaped by natural selection (Churbanov et al., 2005; Neafsey and Galagan, 2007; Resch et al., 2009; Zhang et al., 2018a; Zhang et al., 2019). The functions of uORFs have been explored in various contexts, including development (Chen et al., 2017; Cheng et al., 2018; Komonyi et al., 2005; Kurihara et al., 2018; Malzer et al., 2013; Medenbach et al., 2011; Yang et al., 2017; Zhang et al., 2018a), disease (Calvo et al., 2009; Lee et al., 2021; Liu et al., 1999; Wen et al., 2009; Wiestner et al., 1998), and stress responses (Andreev et al., 2018; Andreev et al., 2015; Bohlen et al., 2020; Costa-Mattioli and Walter, 2020; Hinnebusch, 2005; Lu et al., 2004; Vasudevan et al., 2020; Vattem and Wek, 2004). The prevailing consensus is that uORFs typically repress downstream coding sequence (CDS) translation by sequestering ribosomes, a process influenced by factors such as uORF length, position, and sequence context (Andreev et al., 2018; Calvo et al., 2009; Hinnebusch et al., 2016; Johnstone et al., 2016; Kozak, 1989; Morris and Geballe, 2000; Young and Wek, 2016). However, under stress conditions, certain uORFs can facilitate CDS translation by promoting ribosome reinitiation, illustrating their context-dependent functions (Andreev et al., 2018; Andreev et al., 2015; Baird et al., 2014; Chen et al., 2010; Dever et al., 1992; Palam et al., 2011; Young and Wek, 2016).

Gene expression noise, which arises from the inherent stochasticity of biological processes such as transcription and translation, is generally detrimental to organismal fitness (Fraser et al., 2004) and is primarily determined at the translational level (Thattai and van Oudenaarden, 2001). Recent studies suggest uORFs might play essential roles in buffering translational noise and stabilizing protein expression. For example, Wu et al., 2022 demonstrated that uORFs reduce protein production rates to stabilize TOC1 protein levels, ensuring precise circadian clock function in plants (Wu et al., 2022). Similarly, Bottorff et al., 2022 used a human cell reporter system to show that a ribosome stall in cytomegaloviral UL4 uORFs buffers against CDS translation reductions (Bottorff et al., 2022). Under stress, translation initiation is typically downregulated, yet most human mRNAs resistant to this inhibition contain translated uORFs, with a single uORF often being sufficient for resistance (Andreev et al., 2015). The computational model of Initiation Complexes Interference with Elongating Ribosomes (ICIER) suggests that derepression of downstream translation is a general mechanism of uORF-mediated stress resistance (Andreev et al., 2018). Despite these findings, the current understanding of uORFs in stabilizing translation is limited to single-gene cases or stressed conditions. It remains unclear whether and how uORFs affect gene translation variability on a genome-wide scale during evolution and development, and whether the identified mechanisms are universal or vary substantially among different taxa. To address these questions, a combination of modeling, genome-wide analyses, and comparative studies across species is required.

In this study, we first adapted the ICIER framework (Andreev et al., 2018) to simulate the translating ribosome on an mRNA to quantitatively measure the extent to which uORF translation reduces the translational variability of the downstream CDS under different translation contexts. We then compared the translatomes of two closely related Drosophila species, D. melanogaster and D. simulans, and further supported the notion that uORFs could buffer the fluctuations of CDS translation during the development and evolution of Drosophila. The patterns also reappeared among primates and human populations. We next knocked out the bicoid (bcd) uORF, a case showing significant buffering effect in our data, and observed broad changes in the embryonic transcriptome and phenotypic defects in D. melanogaster. Together, our results demonstrate a novel role for uORFs in maintaining translation stability during Drosophila evolution and development.

Results

An extended ICIER model for quantifying uORF buffering in CDS translation

To quantitatively assess how uORF translation modulates the variability of downstream CDS translation, we adapted the ICIER model (Andreev et al., 2018), originally grounded in the totally asymmetric simple exclusion process (TASEP). TASEP has been extensively utilized to model the stochastic nature of ribosome movement along mRNA, capturing the effects of ribosome traffic jams, where ribosomes may slow down or stall when a site ahead is occupied (Ciandrini et al., 2010; Reuveni et al., 2011; von der Haar, 2012; Zhao and Krishnan, 2014). The ICIER model extends this by simulating the interplay between scanning (40S) and elongating (80S) ribosomes, particularly focusing on how uORFs impact the overall translation rate of the main CDS (Andreev et al., 2018).

We extended the original ICIER model (Andreev et al., 2018) with several major modifications (Figure 1A). First, while the original ICIER model only considered the scenario where the elongating ribosome (80S) causes downstream scanning ribosomes (40S) to dissociate from the mRNA when they move along the mRNA and collide, recent findings have shown that upstream dissociation can also play a critical role in uORF-mediated regulation (Bottorff et al., 2022). To incorporate this, we accounted for more complex ribosome interactions, including three possible scenarios where the 80S collides with 40S: (i) 80S only causes the downstream 40S to dissociate from the mRNA with a probability of Kdown (‘downstream dissociation’, Kdown ranging from 0 to 1), following the original ICIER model; (ii) the 80S only causes the upstream 40S to dissociate from the mRNA with a probability of Kup (‘upstream dissociation’, Kup >0 and Kdown = 0); and (iii) a combination of the downstream and upstream dissociation models (‘double dissociation’, Kup >0 and Kdown >0).

Figure 1. Modeling simulation of uORF-mediated translation buffering.

(A) Model schema of the modified ICIER model (on the top). The parameters are listed in the box below the schema. (B) Heatmap showing the CVs of CDS translation rate (NEC) under different ICDS (x-axis) and IuORF (y-axis) combinations with a uniform distribution of Rin input and the downstream dissociation model. The left panels elicited by the dotted lines from specific squares of the right heatmap were two examples showing the distribution of NEC under ICDS = 0.8 and IuORF = 0 (top panel, without uORF) and ICDS = 0.8 and IuORF = 0.4 (bottom panel, with uORF). (C) Heatmap showing CVs of CDS translation rate (NEC) under different LuORF (x-axis) and IuORF (y-axis) combinations with a uniform distribution of Rin input and the downstream dissociation model. The left panels elicited by the dotted lines from specific squares of the right heatmap were two examples showing the distribution of NEC under LuORF = 2 and IuORF = 0.2 (top panel) and LuORF = 30 and IuORF = 0.2 (bottom panel). (D) Heatmap showing median δ under different ICDS (x-axis) and IuORF (y-axis) combinations with a uniform distribution of Rin input and the downstream dissociation model. The left panel elicited by the dotted line from a specific square of the right heatmap was an example showing the distribution of δ under ICDS = 0.8 and IuORF = 0.2. The vertical dashed line indicated the median value of δ.

Figure 1—source data 1. Raw numerical data underlying the figures.

Figure 1.

Figure 1—figure supplement 1. The input distribution and model simulation flow.

Figure 1—figure supplement 1.

(A) Two distributions of the Rin input were used in our simulation: an exponential distribution and a uniform distribution. A total of 1000 Rin values (ranging from 0 to 0.1) were randomly generated following either distribution. (B) Under a fixed combination of parameters (Rin distribution, dissociation model, IuORF and ICDS), the model simulation produces 1000 NEC values from 1000 Rin input. The median of these 1000 NEC represented the median of translation rate of CDS under the 1000 varying Rin inputs and the other fixed parameters. The coefficient of variation (CV) of these 1000 NEC reflected the variability of translation rate of CDS under the 1000 varying Rin inputs and the other fixed parameters.
Figure 1—figure supplement 1—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 2. The median of CDS translation rate under uniform distribution of Rin.

Figure 1—figure supplement 2.

Heatmaps showing the median CDS translation rate (NEC) under different ICDS (x-axis) and IuORF (y-axis) combinations with a uniform distribution of Rin input and the downstream dissociation model (A), a uniform distribution of Rin input and the upstream dissociation model (B), a uniform distribution of Rin input and the double dissociation model (C). For each heatmap, the left panels elicited by the dotted lines from specific squares of the right heatmap were two examples showing the distribution of NEC under ICDS = 0.8 and IuORF = 0 (top panel, without uORF) and ICDS = 0.8 and IuORF = 0.4 (bottom panel, with uORF).
Figure 1—figure supplement 2—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 3. The median of CDS translation rates under exponential distribution of Rin.

Figure 1—figure supplement 3.

Heatmaps showing the median CDS translation rate (NEC) under different ICDS (x-axis) and IuORF (y-axis) combinations with an exponential distribution of Rin input and the downstream dissociation model (A), an exponential distribution of Rin input and the upstream dissociation model (B), an exponential distribution of Rin input and the double dissociation model (C). For each heatmap, the left panels elicited by the dotted lines from specific squares of the right heatmap were two examples showing the distribution of NEC under ICDS = 0.8 and IuORF = 0 (top panel, without uORF) and ICDS = 0.8 and IuORF = 0.4 (bottom panel, with uORF).
Figure 1—figure supplement 3—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 4. The CVs of CDS translation rates under uniform distribution of Rin.

Figure 1—figure supplement 4.

Heatmaps showing the CVs of CDS translation rate (NEC) under different ICDS (x-axis) and IuORF (y-axis) combinations with a uniform distribution of Rin input and the upstream dissociation model (A), a uniform distribution of Rin input and the double dissociation model (B). For each heatmap, the left panels elicited by the dotted lines from specific squares of the right heatmap were two examples showing the distribution of NEC under ICDS = 0.8 and IuORF = 0 (top panel, without uORF) and ICDS = 0.8 and IuORF = 0.4 (bottom panel, with uORF).
Figure 1—figure supplement 4—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 5. The CVs of CDS translation rate under exponential distribution of Rin .

Figure 1—figure supplement 5.

Heatmaps showing the CVs of CDS translation rate (NEC) under different ICDS (x-axis) and IuORF (y-axis) combinations with an exponential distribution of Rin input and the downstream dissociation model (A), an exponential distribution of Rin input and the upstream dissociation model (B), an exponential distribution of Rin input and the double dissociation model (C). For each heatmap, the left panels elicited by the dotted lines from specific squares of the right heatmap were two examples showing the distribution of NEC under ICDS = 0.8 and IuORF = 0 (top panel, without uORF) and ICDS = 0.8 and IuORF = 0.4 (bottom panel, with uORF).
Figure 1—figure supplement 5—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 6. The CVs of CDS translation rates across different uORF length under uniform distribution of Rin.

Figure 1—figure supplement 6.

Heatmaps showing the CVs of CDS translation rate (NEC) under different LuORF (x-axis) and IuORF (y-axis) combinations with a uniform distribution of Rin input and the upstream dissociation model (A), a uniform distribution of Rin input and the double dissociation model (B). For each heatmap, the left panels elicited by the dotted lines from specific squares of the right heatmap were two examples showing the distribution of NEC under LuORF = 2 and IuORF = 0.2 (top panel, shorter uORF) and LuORF = 30 and IuORF = 0.2 (bottom panel, longer uORF).
Figure 1—figure supplement 6—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 7. The CVs of CDS translation rates across different uORF length under uniform distribution of exponential distribution of Rin.

Figure 1—figure supplement 7.

Heatmaps showing the CVs of CDS translation rate (NEC) under different LuORF (x-axis) and IuORF (y-axis) combinations with an exponential distribution of Rin input and the downstream dissociation model (A), an exponential distribution of Rin input and the upstream dissociation model (B), an exponential distribution of Rin input and the double dissociation model (C). For each heatmap, the left panels elicited by the dotted lines from specific squares of the right heatmap were two examples showing the distribution of NEC under LuORF = 2 and IuORF = 0.2 (top panel, shorter uORF) and LuORF = 30 and IuORF = 0.2 (bottom panel, longer uORF).
Figure 1—figure supplement 7—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 8. The correlations between changes of uORF translation rate and downstream CDS translation rate under uniform distribution of Rin.

Figure 1—figure supplement 8.

Heatmaps showing the Spearman’s correlations between changes of uORF translation rate (ΔNEU) and downstream CDS translation rate (ΔNEC) under different ICDS (x-axis) and IuORF (y-axis) combinations with a uniform distribution of Rin input and the downstream dissociation model (A), a uniform distribution of Rin input and the upstream dissociation model (B), a uniform distribution of Rin input and the double dissociation model (C). For each heatmap, the left panel elicited by the dotted line from the specific square of the right heatmap was an example showing the Spearman’s correlations between changes of uORF translation rate (ΔNEU) and downstream CDS translation rate (ΔNEC) under ICDS = 0.8 & IuORF = 0.2. All p-values <0.001.
Figure 1—figure supplement 8—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 9. The correlations between changes of uORF translation rate and downstream CDS translation rate under exponential distribution of Rin.

Figure 1—figure supplement 9.

Heatmaps showing the Spearman’s correlations between changes of uORF translation rate (ΔNEU) and downstream CDS translation rate (ΔNEC) under different ICDS (x-axis) and IuORF (y-axis) combinations with an exponential distribution of Rin input and the downstream dissociation model (A), an exponential distribution of Rin input and the upstream dissociation model (B), an exponential distribution of Rin input and the double dissociation model (C). For each heatmap, the left panel elicited by the dotted line from the specific square of the right heatmap was an example showing the Spearman’s correlations between changes of uORF translation rate (ΔNEU) and downstream CDS translation rate (ΔNEC) under ICDS = 0.8 and IuORF = 0.2. All p-values <0.001.
Figure 1—figure supplement 9—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 10. The median δ under uniform distribution of Rin.

Figure 1—figure supplement 10.

Heatmaps showing the median δ under different ICDS (x-axis) and IuORF (y-axis) combinations with a uniform distribution of Rin input and the upstream dissociation model (A), a uniform distribution of Rin input, and the double dissociation model (B). For each heatmap, the left panel elicited by the dotted line from the specific square of the right heatmap was an example showing the distribution of δ under ICDS = 0.8 and IuORF = 0.2. The vertical dashed line indicated the median value of δ.
Figure 1—figure supplement 10—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 11. The median δ under exponential distribution of Rin.

Figure 1—figure supplement 11.

Heatmaps showing the median δ under different ICDS (x-axis) and IuORF (y-axis) combinations with an exponential distribution of Rin input and the downstream dissociation model (A), an exponential distribution of Rin input and the upstream dissociation model (B), an exponential distribution of Rin input and the double dissociation model (C). For each heatmap, the left panel elicited by the dotted line from the specific square of the right heatmap was an example showing the distribution of δ under ICDS = 0.8 and IuORF = 0.2. The vertical dashed line indicated the median value of δ.
Figure 1—figure supplement 11—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 12. Two-uORF model simulation measurements.

Figure 1—figure supplement 12.

(A) The two-uORF model schema. The factors and parameters were the same as those illustrated in Figure 1A except where specifically indicated. (B) Heatmap showing the CVs of CDS translation rate (NEC) under different IuORF2 (x-axis) and IuORF1 (y-axis) combinations with a uniform distribution of Rin input and the downstream dissociation model. The left panels elicited by the dotted lines from specific squares of the right heatmap were two examples showing the distribution of NEC under IuORF1=0 & IuORF2=0 (top panel, without uORF) and IuORF1=0.2 & IuORF2=0.2 (bottom panel, with two uORFs). (C) Heatmap showing the median CDS translation rate (NEC) under different IuORF2 (x-axis) and IuORF1 (y-axis) combinations with a uniform distribution of Rin input and the downstream dissociation model. (D) Spearman’s correlations between changes of uORF1 translation rate (ΔNEU1) and downstream CDS translation rate (ΔNEC) under different IuORF2 (x-axis) and IuORF1 (y-axis) combinations with a uniform distribution of Rin input and the downstream dissociation model. (E) Heatmap showing the median δ[log2(ΔNEC/ΔNEU1)] under different IuORF2 (x-axis) and IuORF1 (y-axis) combinations with a uniform distribution of Rin input and the downstream dissociation model. The left panel elicited by the dotted line from the specific square of the right heatmap was an example showing the distribution of δ under IuORF1=0.2 and IuORF2=0.2. The vertical dashed line indicated the median value of δ.
Figure 1—figure supplement 12—source data 1. Raw numerical data underlying the figures.
Figure 1—figure supplement 13. Comparison of the buffering effects between single uORF and two uORFs.

Figure 1—figure supplement 13.

In each panel, the IuORF of single-uORF model equals to IuORF1 of two-uORF model, with both values ranging from 0 to 0.5. The x-axis in each panel denotes the values of different IuORF2 in the two-uORF model, ranging from 0 to 0.5. The y-axis in each panel represents the ratio of the CV of NEC with two uORFs to that with a single uORF.
Figure 1—figure supplement 13—source data 1. Raw numerical data underlying the figures.

Second, the original ICIER model only considered the ribosome collision and dissociation in the uORF and counted the 40S scanning ribosome escaping from the uORF as a proxy of the CDS translation rate. In our extended model, we also considered the ribosome collision and dissociation events in the CDS downstream of the uORF. We recorded the number of 80S ribosomes that completed translation at the stop codon of a CDS (NEC) or uORF (NEu) during a given time interval, using these counts as proxies to quantify the translation rate of CDS or uORF. These indices allowed us to directly and quantitatively measure the impact of uORFs on CDS translation.

Third, while the original ICIER model accounted for only a single uORF within an mRNA, we extended it to consider two uORFs coexisting on an mRNA molecule, allowing us to explore the possible combinatorial effects of uORF-mediated translation regulation in a more complex yet more common scenario, as previous studies have shown that uORFs tend to be clustered within genes (Zhang et al., 2021).

These extensions allow for a more comprehensive exploration of uORF-mediated translational buffering, offering deeper insights into how these regulatory elements might stabilize protein synthesis across varying translation contexts.

uORF-mediated buffering of CDS translation across different parameter settings

To systematically investigate the extent to which uORF translation modulates the variability of downstream CDS translation, we conducted simulations across various parameter settings using three dissociation models: upstream, downstream, and double dissociation, each reflecting different possible interactions between scanning and elongating ribosomes on the mRNA. We considered a range of parameters crucial to the translation process (Supplementary file 1), including the length of the 5′ leader before the uORF (fixed at 150 nucleotides), the length of the uORF itself (ranging from 2 to 100 codons), the distance between the uORF stop codon and the CDS start codon (150 nucleotides), the length of the CDS (500 codons), and the length of the 3′ UTR (150 nucleotides). Additionally, we modeled the probabilities associated with ribosome movement and initiation, such as the probability of a 40S ribosome moving to the next nucleotide (νs=0.3), the probability of an 80S ribosome moving to the next position within the uORF (νEu=0.3) and within the CDS (νEC=0.5), and the probability of loading a new 40S ribosome at the 5' end of the mRNA (Rin). We also explored different probabilities of translation initiation at both the uORF start codon (IuORF) and the CDS start codon (ICDS). Key parameters were adapted from the original ICIER model (Andreev et al., 2018), ensuring a robust basis for comparison while allowing exploration of additional variables that influence uORF-mediated translational buffering.

In our simulations, R values were varied to simulate fluctuations in translational resources, such as ribosome availability, that could arise from genetic differences or environmental changes during evolution or development. By generating 1000 R values following either uniform or exponential distributions (ranging from 0 to 0.1, Figure 1—figure supplement 1A), we aimed to capture the natural variability in ribosome loading rates that might occur across different species, individuals, or developmental stages. These values were then fed into the simulation models to evaluate their impact on both the level and variability of translation rate for CDSs, with the number of ribosomes completing translation on CDSs (NEC) in a given time interval used as a proxy for translational rate (Figure 1—figure supplement 1B).

Across all model settings, uORF translation (IuORF>0) consistently reduced CDS translation rate (NEC) by about 30–80% as IuORF increased from 0.1 to 0.5, compared to scenarios where the uORF was absent or untranslated (IuORF=0; Figure 1—figure supplements 2 and 3). This confirms the inhibitory effect of uORFs on downstream CDS translation. Notably, the coefficient of variation (CV), a measurement of variability for translation rate (NEC), was lower when the uORF was translated (Figure 1B). These CV values further decreased by approximately 10% to 25% as IuORF increased from 0.1 to 0.5 (Figure 1B), and this buffering effect persisted across different parameter settings (Figure 1—figure supplements 4 and 5). Moreover, the CV of translation rate (NEC) further decreased by about 6–30% as uORF length (LuORF) increased from 2 to 100 codons (Figure 1C, Figure 1—figure supplements 6 and 7). These simulation results indicate that uORF translation can reduce variability in downstream CDS translation, with the buffering capacity positively correlated with both uORF translation initiation efficiency and length under the applied simulation conditions.

To more quantitatively investigate the relationship between changes in translation rate in uORF (NEU) and CDS (NEC), for each parameter setting, we used the median value (Rinm) of the 1000 Rin inputs as the baseline, calculating the corresponding NECm and NEUm values. We then calculated the changes of NEC(ΔNEC) and NEU(ΔNEU) relative to NECm and NEUm for each other Rin value, respectively. Across different parameter settings, ΔNEU was consistently and significantly positively correlated with ΔNEC (p<0.001, Spearman’s correlation; Figure 1—figure supplements 8 and 9), indicating that fluctuations of the ribosome loading rate influence the translation of both uORFs and CDSs in the same direction. Nevertheless, our simulations showed that variations in Rin led to a larger change in NEU than in NEC, as the median value of δ [defined by log2(ΔNEC/ΔNEU)] was consistently less than 0 across various IuORF and ICDS combinations (Figure 1D, Figure 1—figure supplements 10 and 11). This finding suggests that translational fluctuation in uORFs is greater than that in downstream CDSs, indicating that uORFs can buffer against upstream fluctuations.

Simulations of mRNAs with two uORFs revealed patterns consistent with a buffering role for uORFs (Figure 1—figure supplement 12). To compare the buffering effects of a single uORF versus two uORFs, we calculated the ratio of the CV of NEC with two uORFs to that with a single uORF. A ratio less than 1 suggests that two uORFs provide greater buffering than a single uORF. For comparability, we examined the CV of NEC where the IuORF in the single-uORF model equals IuORF1 (IuORF of the first uORF) in the two-uORF model, both ranging from 0 to 0.5. The CV ratio consistently remained below 1 across a range of IuORF2 (IuORF of the second uORF; Figure 1—figure supplement 13), indicating that two uORFs offer stronger buffering than a single uORF.

Collectively, these simulations collectively suggest: (1) uORF-mediated translational control buffers against CDS translation variability, (2) uORFs exhibit greater translational fluctuations than downstream CDSs, and (3) the buffering capacity of uORFs positively correlates with their translation initiation efficiency and length. Subsequently, we sought to validate these simulation results in a biological context by confirming the uORF-mediated buffering effect during organismal evolution and development, as both processes face environmental and/or genetic changes that frequently disturb mRNA translation.

Generating matched translatome data from two Drosophila species for comparative analysis

To validate the uORF-mediated translational buffering during Drosophila evolution, we performed a comparative analysis of the translatomes of two closely related Drosophila species, D. melanogaster and D. simulans, which diverged approximately 5.4 million years ago (Tamura et al., 2004). We generated high-throughput sequencing data for D. simulans, including transcriptome (mRNA-Seq) and translatome (Ribo-Seq) profiles from various developmental stages and tissues, including embryos at 0–2 hr, 2–6 hr, 6–12 hr, and 12–24 hr, third-instar larvae, P7–8 pupae, female and male bodies, and female and male heads. In total, we obtained approximately 786 million high-quality reads for D. simulans (Supplementary file 2). These datasets were designed to be directly comparable to the previously published D. melanogaster data (Zhang et al., 2018a), with identical embryonic stages and tissue types. For each sample, we followed established procedures (Chothani et al., 2022; Ingolia, 2016; Ingolia et al., 2009; Ingolia et al., 2011; Zhang et al., 2018a) to calculate the translational efficiency (TE) for each feature (CDS or uORF). TE serves as a proxy for the translation rate at which ribosomes translate mRNA into proteins, typically quantified by comparing the density of ribosome-protected mRNA fragment (RPF) to the mRNA abundance for a feature (see Materials and methods). This comprehensive comparative translatome analysis, utilizing matched developmental stages and tissues between the two species, allowed us to assess the uORF-mediated translational buffering effects during evolution.

Translational conservation and dominance of uORFs between Drosophila species

Given that the translation of uORFs is a crucial determinant of their functional impact, we first characterized the translational profiles of uORFs in D. melanogaster and D. simulans to explore the evolutionary roles of uORFs in translation regulation. We focused on canonical uORFs that initiate with an ATG start codon in the 5′ UTR and terminate with a stop codon (TAA, TAG, or TGA). Because the ATG start codon is the defining feature of a canonical uORF and tends to be more conserved than its downstream sequence (Zhang et al., 2021), we defined uORF conservation based on the presence of the ATG start codon in the 5′ UTR of D. melanogaster and its orthologous positions in D. simulans, regardless of differences in the stop codon. Using this criterion, we identified 18,412 canonical uORFs with conserved start codons between the two species. Additionally, we identified 2789 canonical uORFs specific to D. melanogaster and 2440 canonical uORFs specific to D. simulans. The TE values of the conserved uORFs were highly correlated between the two species across all developmental stages and tissues examined, with Spearman correlation coefficients (rho) ranging from 0.478 to 0.573 (Figure 2A). In contrast, TE of CDSs exhibited a significantly higher correlation between the two species in the corresponding samples compared to that of uORFs, with Spearman’s rho ranging from 0.588 to 0.806 (p=0.002, Wilcoxon signed-rank test; Figure 2A). This observation is consistent with our simulation results, which indicate that uORFs experience greater translational fluctuations than their downstream CDSs. Notably, conserved uORFs exhibited significantly higher TEs compared to species-specific uORFs in both species. The median TE of conserved uORFs was 1.62 times that of non-conserved uORFs in D. simulans, while the corresponding ratio in D. melanogaster was 1.52 (Figure 2B).

Figure 2. Conservation and translation of uORFs between D. melanogaster and D. simulans.

(A) Spearman’s correlation coefficients (rho, represented by the bars) of TEs between Dm (D. melanogaster) and Ds (D. simulans) for CDS (red) and uORFs (blue). All P-values for Spearman’s correlation are less than 0.001. The p value for the comparison between rho values of CDSs TE and uORFs TE is 0.002 (Wilcoxon signed-rank test). Data for the female head sample is shown as an example in the right panel. The x- and y-axes represent the TEs in Dm and Ds. (B) The median of TE of conserved and species-specific uORFs in each sample. Each dot represents the median TE of a sample for a specific uORF class. Data from the female head sample is shown as an example in the right panel. p values were obtained from Wilcoxon rank-sum tests. ***, p<0.001. (C) Fraction of conserved uORFs among dominant uORFs and other translated uORFs in each sample. The paired samples in Dm and Ds were linked together. The p value was obtained by the paired Wilcoxon signed-rank test. ***, p<0.001. (D) Absolute values of the interspecific TE fold changes (log2TE-FC) of dominant uORFs and the other translated uORFs in each sample. The paired samples in Dm and Ds were linked together. The median value of each sample is shown. The p value was obtained via the paired Wilcoxon signed-rank test. ***, p<0.001. Data from the female head sample were used as an example in the right panel.

Figure 2—source data 1. Raw numerical data underlying the figures.

Figure 2.

Figure 2—figure supplement 1. The dominant uORF showed the highest TE than other uORFs with the same gene.

Figure 2—figure supplement 1.

uORFs were ranked by decreasing TEs within each gene. The uORF with the highest TE within each gene was defined as the dominant uORF (#1). ‘#2’ represents the second highest uORF TE and the same goes for ‘#3’ and ‘>3’. Each dot represents the median TE of a sample.
Figure 2—figure supplement 1—source data 1. Raw numerical data underlying the figures.
Figure 2—figure supplement 2. Observed and expected numbers of genes sharing dominant uORFs in all samples.

Figure 2—figure supplement 2.

Among the genes of D. melanogaster (where only the longest transcripts were considered), 7259 (52.2%) genes had no uORF (‘no-uORF’ genes), 2687 (19.3%) genes had one uORF (‘one-uORF’ genes), and 3961 (28.5%) genes had multiple (≥2) uORFs (‘multiple-uORF’ genes). The numbers in the header of each panel represent genes with translated uORFs (TE >0.1) in ≥N samples (2≤N ≤ 10). The TEs of all uORFs were shuffled 1000 times. p values were obtained from randomization tests.
Figure 2—figure supplement 2—source data 1. Raw numerical data underlying the figures.

In D. melanogaster, 7259 (52.2%) genes had no uORFs, 2687 (19.3%) had a single uORF, and 3961 (28.5%) contained multiple uORFs. Among genes with multiple uORFs, we defined the uORF with the highest TE as the dominant uORF for that gene, as TE is one of the most relevant metrics for assessing uORF function (Johnstone et al., 2016; Zhang et al., 2021). The median TE of the dominant uORF was 4.84 times that of the second-highest uORF within the same gene in D. melanogaster, and the corresponding ratio was 5.21 times in D. simulans (Figure 2—figure supplement 1). To assess the consistency of this dominance across different tissues and developmental stages, we identified 3072 multiple-uORF genes in D. melanogaster with at least one translated uORF (TE >0.1 in at least five stages/tissues). Of these, 569 genes consistently used the same dominant uORF across the measured samples, significantly higher than the number expected under randomness (5 genes, 95% confidence interval: 1–10) based on shuffling the TEs of uORFs 1000 times (Figure 2—figure supplement 2). This trend was also observed in D. simulans and persisted under different thresholds for defining ‘translated uORFs’ (Figure 2—figure supplement 2). These results suggest that genes with multiple uORFs tend to retain the same dominant uORF across developmental stages, indicating that the dominant uORF may serve as the key translational regulator of the downstream CDS. Moreover, we found that the dominant uORFs showed a higher proportion of conserved uATGs than the other translated uORFs (median proportion 82.5% vs 78.8%, p<0.001; Figure 2C). Additionally, the absolute values of TE fold-change (|log2TE-FC|) between the two species were, on average, 23.2% smaller for dominant uORFs than for other conserved uORFs (Figure 2D), suggesting that dominant uORFs are more likely under stronger stabilizing selection. These findings suggest that, in genes with multiple uORFs, the dominantly translated uORF may play a more important role in regulating CDS translation than the other uORFs.

uORFs buffer interspecific translational divergence of CDSs

To investigate the relationship between translational changes in uORFs and their downstream CDSs across species, we analyzed the translation efficiency (TE) of uORFs and corresponding downstream CDSs in D. melanogaster and D. simulans. Consistent with our simulations that the translation of a uORF is tightly linked to that of downstream CDS, uORFs exhibited a modest, yet statistically significant, positive correlation with the TE of their downstream CDSs across all samples analyzed (p<0.001, Spearman’s correlation) (Figure 3A). We then compared the interspecific TE change of a uORF (βu=TEuORF,sim/TEuORF,mel) with that of its corresponding CDS (βC=TECDS,sim/TECDS,mel) between D. melanogaster and D. simulans. We found βu is significantly positively correlated with βC across all samples (p-values <0.001, Spearman’s correlation; Figure 3—figure supplement 1). These results align well with our simulations, which showed that fluctuations in translational factors (such as ribosomes) influence both uORF and CDS translation in the same direction (Figure 1—figure supplements 8 and 9).

Figure 3. uORFs reduce CDS translational divergence between D. melanogaster and D. simulans.

(A) The correlation of uORF TEs and the corresponding CDS TEs in 10 samples of Dm (D. melanogaster) and Ds (D. simulans). The bars represent Spearman’s correlation coefficient (rho). In all samples, we obtained both p-values <0.001. Data for the female head sample of Dm and Ds are shown as examples in the right panel. (B) The absolute values of interspecific TE changes for CDS and uORF in each sample between two species. For visualization purposes, all values greater than 4 were assigned a value of 4. ***, p<0.001, Wilcoxon rank-sum test. (C) Genes expressed in female heads (mRNA RPKM >0.1 in both species) were classified into three classes according to whether a gene had a conserved and dominantly translated uORF or not. Boxplots showing interspecific CDS TE variability |log2(βc)| of different gene classes. p values were calculated using Wilcoxon rank-sum tests between the neighboring groups. ***, p<0.001. (D) Genes expressed in female heads were classified into three classes according to the length of translated uORFs. Boxplots showing interspecific CDS TE variability |log2(βc)| of different gene classes. p values were calculated using Wilcoxon rank-sum tests between the neighboring groups. ***, p<0.001.

Figure 3—source data 1. Raw numerical data and statistical analysis underlying the figures.

Figure 3.

Figure 3—figure supplement 1. The positive correlation of interspecific TE changes between uORFs and CDSs.

Figure 3—figure supplement 1.

Correlations between interspecific uORF TE changes (log2βu) and CDS TE changes (log2βC) in 10 samples. The x-axis was divided into 50 equal bins with increasing βu. Spearman’s correlation coefficients (rho) are shown at the top left. ***, p<0.001 in the correlation test.
Figure 3—figure supplement 1—source data 1. Raw numerical data underlying the figures.
Figure 3—figure supplement 2. Gene ontology analysis of the genes with log2(β2)0 in each stage and tissue.

Figure 3—figure supplement 2.

The biological process (BP) terms with q-values <0.05 in each sample type are indicated with red, and others are indicated with white.
Figure 3—figure supplement 3. Reads count distribution of uORFs and CDSs.

Figure 3—figure supplement 3.

Distribution of mRNA read counts and RPF counts mapped to uORFs and CDSs for all expressed uORFs (A) or only highly expressed genes (B) in female head sample. The distribution patterns were similar across other samples, and the data are not shown here. Dm, D. melanogaster; Ds, D. simulans. The read counts were log2 transformed.
Figure 3—figure supplement 3—source data 1. Raw numerical data underlying the figures.
Figure 3—figure supplement 4. The scheme illustrating the calculation of βu, βc and γ.

Figure 3—figure supplement 4.

When the cellular environment causes mRNA translation to be enhanced from baseline level, the TEuORF changed from 2 to 4 (βu=2), while the TEuORF changed with a smaller degree due to uORF’s buffering, from 4 to 6 (βC=1.5). This resulted in γ <1. Conversely, when the cellular environment causes mRNA translation to be reduced from the baseline level, the TEuORF changed from 2 to 1 (βu=0.5), while the TEuORF changed with a smaller degree due to uORF’s buffering, from 4 to 3 (βC=0.76). This resulted in γ >1. Overall, both βu >1 and γ <1, and βu <1 and γ >1, indicated the existence of uORFs’ buffering.
Figure 3—figure supplement 5. Conserved and dominantly translated uORFs showed the stronger buffering effect.

Figure 3—figure supplement 5.

Genes expressed in different stages/tissues (mRNA RPKM >0.1 in both species) were classified into three classes according to whether a gene had a conserved and dominantly translated uORF or not. Boxplots showing interspecific CDS TE variability |log2(βc)| of different gene classes. p values were calculated using Wilcoxon rank-sum tests between the neighboring groups. ***, p<0.001.
Figure 3—figure supplement 5—source data 1. Raw numerical data underlying the figures.
Figure 3—figure supplement 6. Longer uORFs showed the stronger buffering effect.

Figure 3—figure supplement 6.

Genes expressed in different stages/tissues (mRNA RPKM >0.1 in both species) were classified into three classes according to the length of translated uORFs. Boxplots showing interspecific CDS TE variability |log2(βc)| of different gene classes. p values were calculated using Wilcoxon rank-sum tests between the neighboring groups. ***, p<0.001.

While the direction of TE changes for uORFs and CDSs tends to be consistent, our simulations suggest that the magnitude of TE changes in CDSs is generally smaller than that in uORFs, due to the buffering effect of uORF (Figure 1—figure supplements 10 and 11). To validate this, we first identified uORFs and CDSs with significant interspecific TE differences by assessing whether βu or βC significantly deviated from 1, using an established statistical framework (Zhang et al., 2018a). This analysis uncovered 1151–4189 CDSs with significant interspecific TE changes (FDR <0.05; Table 1), with genes involved in development, morphogenesis, and differentiation being significantly enriched during embryonic stages (Figure 3—figure supplement 2). Conversely, genes related to metabolism, response to stimuli, and signaling were enriched in larval, pupal, and adult stages (Figure 3—figure supplement 2). Additionally, we identified 144–1193 uORFs with significant TE differences between species, accounting for approximately 1–15% of expressed uORFs (Table 1). Note that due to their shorter length and generally lower TE, uORFs had considerably lower read counts than CDSs, limiting the statistical power to detect significant interspecific TE differences for uORFs. This trend consistently holds whether analyzing all expressed uORFs (Figure 3—figure supplement 3A) or only highly expressed genes (Figure 3—figure supplement 3B). Thus, the fewer uORFs showing significant TE divergence likely reflects lower read counts and statistical sensitivity rather than reduced translational variability relative to CDSs. In fact, the absolute values of log2(fold change) of TE for uORFs between D. melanogaster and D. simulans were significantly greater than those observed for corresponding CDSs across all samples (p<0.001, Wilcoxon signed-rank test; Figure 3B), suggesting that the magnitude of TE changes in CDSs is generally smaller than that in uORFs, due to the buffering effect of uORF.

Table 1. Numbers of genes showing different magnitudes of TE changes between uORFs and CDS at the interspecific level.

Table 1—source data 1. Raw data and statistical analysis for uORF TE changes.
Table 1—source data 2. Raw data and statistical analysis for CDS TE changes.
Sample types # of expressed uORFs * βu ≠ 1(%) # of expressed CDSs * βc ≠ 1(%) uORF-CDS pairs with βu>1 uORF-CDS pairs with βu<1
Total γ>1 γ<1 Total γ>1 γ<1
0–2 hr embryo 7704 1193
(15.49)
7934 4189
(52.80)
770 0 366 423 69 0
2–6 hr embryo 7822 567
(7.25)
9795 3063
(31.27)
249 0 63 318 135 0
6–12 hr embryo 10,400 1040
(10.00)
10,643 2,924
(27.47)
973 0 641 67 22 1
12–24 hr embryo 11,365 535
(4.71)
11,158 3537
(31.70)
454 0 234 81 8 2
Larva 10,008 464
(4.64)
11,831 3554
(30.04)
110 0 26 354 177 0
Pupa 12,309 635
(5.16)
12,209 4087
(33.48)
136 1 15 499 210 0
Male body 10,894 197 (1.81) 12,284 2432 (19.80) 122 0 29 75 4 0
Male head 10,904 144
(1.32)
10,447 1151
(11.02)
119 0 31 25 5 0
Female body 9809 340 (3.47) 11,002 3605 (32.77) 279 0 17 61 3 0
Female head 10,935 332
(3.04)
10,545 1270
(12.04)
324 0 152 8 3 0
*

Only uORFs and CDSs with an mRNA RPKM >0.1 in both D. melanogaster and D. simulans were considered in each sample pair in the analysis.

βu=TEuORF,sim/TEuORF,mel is the fold change of TEuORF in D. simulans relative to D. melanogaster for each sample. βu=TECDS,sim/TECDS,mel is the fold change of TECDS in D. simulans relative to D. melanogaster for each sample. γ=βc/βu . For each CDS–uORF pair, either βu>1 and γ>1 , or βu<1 and γ>1 means that the magnitude of TE change is lower for a CDS than for a uORF. The statistical significance of βc, βu, and γ were determined according to an FDR < 0.05.

We further quantitatively compared the magnitude of interspecific TE changes between uORFs and their corresponding CDSs using a previous method (Zhang et al., 2018a). We defined γ=βc/βu for a uORF-CDS pair within the same mRNA and tested whether γ was significantly different from 1 to identify pairs with differential TE changes between uORFs and CDSs (Figure 3—figure supplement 4). When γ <1 and βu >1, or γ >1 and βu <1, it indicates that the TE change for the CDS is smaller than that for the uORF (Figure 3—figure supplement 4). Among CDS-uORF pairs where βu >1, nearly all (8–487) showed a significant γ < 1 in each sample, except for one pair from the pupal stage where γ >1 (Table 1). This suggests that the magnitude of TE changes in CDSs was generally smaller than in uORFs when uORF TE increased, and vice versa when uORF TE decreased (βu <1) (Table 1). These comparative translatome analyses indicate that uORFs buffered downstream CDS translation changes during the evolutionary divergence of D. melanogaster and D. simulans.

uORF buffering is influenced by its conservation, dominance, and length

To investigate how the conservation level and translation patterns of uORFs influence their buffering capacity on CDS translation, we categorized genes expressed in each pair of samples into three classes: Class I, genes with conserved uORFs that are dominantly translated (i.e., exhibiting the highest TE among all uORFs within the same gene) in both Drosophila species; Class II, genes with conserved uORFs that are translated in both species but not dominantly translated in at least one; and Class III, the remaining expressed genes. We then compared the absolute values of interspecific TE changes of the CDS (|βc|) across these three categories. Significant differences in |βc| were observed, with a consistent hierarchy of Class I < II < III across all pairs of samples (Figure 3C, Figure 3—figure supplement 5). On average, Class I genes exhibited an average of 8.18% and 23.8% lower |βc| values compared to Class II and Class III, respectively. This indicates that conserved and dominantly translated uORFs exert a stronger buffering effect on CDS translation.

To further validate the simulation results suggesting that longer uORFs have a stronger buffering effect (Figure 1C, Figure 1—figure supplements 6 and 7), we divided genes expressed in each pair of samples into three groups: those without translated uORFs (No), those with short uORFs (short, total length below the median), and those with long uORFs (long, total length above the median). Consistently, longer uORFs were associated with stronger buffering effects on CDS translation across all pairs of samples (Figure 3D, Figure 3—figure supplement 6). Specifically, genes with longer uORFs showed 12.7% and 26.5% lower |βc| values compared to genes with short uORFs or no uORFs, respectively.

Overall, these findings underscore that the buffering capability of a uORF is positively correlated with its conservation level, translation dominance, and length.

uORFs buffer translational fluctuations during Drosophila development

Gene expression undergoes dynamic changes during Drosophila development, with significant alterations in the translation program to meet developmental demands, including shifts in ribosome loading rates (Kronja et al., 2014; Malzer et al., 2013; Qin et al., 2007). Therefore, we extended our analysis to investigate the role of uORFs in buffering these translational fluctuations, hypothesizing that uORFs could mitigate them during development. According to our hypothesis, if a gene has a translated uORF in D. melanogaster but not in its orthologous gene in D. simulans, then the translation of this gene is likely more stable across developmental stages in D. melanogaster than its ortholog in D. simulans, and vice versa.

To test this hypothesis, we compared the CV of CDS TE across 10 developmental stages in D. melanogaster and in D. simulans, respectively. Genes with translated uORFs (TE >0.1 in at least one sample) in D. melanogaster exhibited significantly 22.5% smaller CVs in D. melanogaster than their orthologs lacking these uORFs in D. simulans, indicating more stable translation (Figure 4A). Consistently, for genes with translated uORFs in D. simulans but not in D. melanogaster, the CVs were 13.3% lower in D. simulans compared to their orthologs lacking these uORFs in D. melanogaster (Figure 4B). Consistent results were observed when a uORF is required to be translated (TE >0.1) in all 10 samples (Figure 4—figure supplement 1A), in 4 embryonic stages (Figure 4—figure supplement 1B), or in 6 stages including embryos, larva, and pupa (Figure 4—figure supplement 1C). Moreover, within each species, genes with translated uORFs also showed less variability in CDS TE across developmental stages compared to those without translated uORFs, with a 31.8% reduction in D. melanogaster and a 28.9% reduction in D. simulans (Figure 4C). This effect was consistent across different thresholds for defining ‘translated uORFs’ (Figure 4—figure supplement 2).

Figure 4. uORFs could reduce CDS translational fluctuation during Drosophila development.

(A) The CV of TECDS across 10 Dm (D. melanogaster) samples and 10 Ds (D. simulans) samples. The selected gene with uORFs translated (TE >0.1) in at least one Dm sample but its homologous gene without translated uORF in Ds samples. Each pair of dots linked by a gray line represents a pair of homologous genes in Dm and Ds. ***, p<0.001, Wilcoxon signed-rank test. (B) The CV of TECDS across 10 Dm samples and 10 Ds samples. The selected gene with uORFs translated (TE >0.1) in at least one Ds sample but its homologous gene without translated uORF in Dm samples. Each pair of dots linked by a gray line represents a pair of homologous genes in Dm and Ds. ***, p<0.001, Wilcoxon signed-rank test. (C) Within each Drosophila species, the CV of TECDS of genes with translated uORFs compared to genes without the translated uORFs. The p values are obtained by the Wilcoxon rank-sum test. ***, p<0.001.

Figure 4—source data 1. Raw numerical data underlying the figures.

Figure 4.

Figure 4—figure supplement 1. uORFs reduce CDS translational fluctuation during Drosophila development under different cutoffs on defining ‘translated uORFs’.

Figure 4—figure supplement 1.

The CV of TECDS across Dm (D. melanogaster) samples and Ds (D. simulans) samples was shown as boxplots. The selected genes harbor the translated uORFs in one of Drosophila species but its homologous gene without translated uORFs in another Drosophila species. Each pair of dots linked by a gray line represents a pair of homologous genes in Dm and Ds. (A) Translated uORF was defined as uORF translated (TE >0.1) in all 10 samples. (B) Translated uORF was defined as uORF translated (TE >0.1) in four embryonic stages. The CV of TECDS across these four stages was calculated. (C) Translated uORF was defined as uORF translated (TE >0.1) in the six developmental stages including four embryonic stages, larva, and pupa. The CV of TECDS across these six stages was calculated. Wilcoxon signed-rank sum test. *, p<0.05; ***, p<0.001.
Figure 4—figure supplement 1—source data 1. Raw numerical data underlying the figures.
Figure 4—figure supplement 2. uORFs could reduce CDS translational variation during Drosophila development under different cutoffs on defining ‘translated uORFs’.

Figure 4—figure supplement 2.

Within each species, the CV of TECDS across different samples was measured for genes with translated uORFs and genes without translated uORFs, respectively. Each dot in the boxplot represents one gene. Translated uORF was defined as uORF translated (TE >0.1) in at least four embryonic stages, six developmental stages (left) or translated or in all 10 samples. Wilcoxon rank-sum tests. ***, p<0.001. Dm, D. melanogaster; Ds, D. simulans.
Figure 4—figure supplement 2—source data 1. Raw numerical data underlying the figures.

These results suggest that uORFs function as translational buffers, reducing gene translation fluctuations during Drosophila development.

Knocking out the uORF of bcd increased bcd CDS translation in D. melanogaster

After verifying the uORF-mediated translational buffering during Drosophila evolution and development, we next aimed to directly explore the biological function of these buffering-capable uORFs in vivo. We first applied stringent criteria to identify uORFs with significant buffering effects on CDS translation between D. melanogaster and D. simulans. Specifically, we looked for (1) uORFs with significant TE changes (|log2(βu)| >1.5, adjusted p<0.05), (2) negligible changes in its corresponding CDS translation (|log2(βC)| <0.05, adjusted p>0.05), and (3) a significant difference between the magnitude of these changes (|log₂(γ)| >1.5, adjusted p<0.05). We identified 131 uORF-CDS pairs in 103 genes that meet these criteria in at least one stage/tissue (Supplementary file 3), with a majority of the genes (67%, 69 out of 103) from embryonic stages (Figure 5—figure supplement 1A), suggesting a crucial role for uORFs in maintaining translational stability during early development. Among these genes, one notable case is the bicoid (bcd) gene, a master regulator of anterior-posterior axis patterning during early embryogenesis (Berleth et al., 1988; Driever and Nüsslein-Volhard, 1988a; Driever and Nüsslein-Volhard, 1988b). The bicoid gene contains a 4-codon uORF (excluding the stop codon) in its 5′ UTR, and branch length score (BLS) analysis (Zhang et al., 2021) showed that the start codon (uATG) of the bcd uORF is highly conserved across the Drosophila phylogeny, with a BLS of 0.90 (on a scale from 0 to 1, where higher values indicate greater conservation; Figure 5A, Figure 5—figure supplement 1B). Ribo-Seq data revealed that in 0–2 hr embryos, the TE of the bcd uORF varied more than threefold, while the TE of the CDS was virtually the same between D. melanogaster and D. simulans (Figure 5B and Supplementary file 3). This suggests that the bcd uORF significantly buffers translation during early development.

Figure 5. The strong buffering effect of the bcd uORF on CDS translation between the two Drosophila species.

(A) Multiple sequence alignment of the bcd uORF and partial CDS in D. melanogaster and 20 other Drosophila species. The uORF and CDS are boxed in green and purple, respectively. The start codons of the uORF and CDS are boxed in red. (B) The coverage of mRNA-Seq (top), Ribo-Seq (middle), and TEs (bottom) of the bcd uORF and CDS in 0–2 hr embryos of D. melanogaster (red) and D. simulans (blue). The uORF and CDS are denoted at the lower panel with dark green triangles and purple boxes, respectively. The two dashed lines mark the CDS region. The uORF TE, CDS TE and their interspecific changes were labeled at the bottom.

Figure 5—source data 1. Raw numerical data underlying the figures.

Figure 5.

Figure 5—figure supplement 1. Screening of genes containing uORFs exhibiting strong translational buffering effects.

Figure 5—figure supplement 1.

(A) The proportion (%) of genes with buffering uORFs in all expressed genes across developmental stages/tissues. (B) The distribution of adjusted. P of γ (x-axis, -log10 transformed) and BLS (y-axis) of dominant uORFs in embryonic stages. The bcd was highlighted by red color.
Figure 5—figure supplement 1—source data 1. Raw numerical data underlying the figures.

To investigate the regulatory role of the bcd uORF, we used CRISPR-Cas9 to knock out its start codon in D. melanogaster, generating two mutant homozygotes (uKO1/uKO1 and uKO2/uKO2) with a genetic background matched to that of the wild-type (WT; Figure 6A, Figure 6—figure supplement 1). To determine whether these mutations enhanced bcd CDS translation, we performed ribosome fractionation followed by qPCR (Abdelmohsen et al., 2014; Panda et al., 2016; Panda et al., 2017), comparing bcd mRNA levels in polysome and monosome fractions (P-to-M ratio) from 0 to 2 hr embryos of uKO1/uKO1, uKO2/uKO2, and WT flies (Figure 6B). A larger P-to-M ratio means more mRNAs are enriched in the polysome fractions and bound by more ribosomes, thus indicative of higher translation efficiency. P-to-M ratios were higher in the mutants compared to WT at 29 °C (Figure 6C), with a similar but less pronounced trend at 25 °C, where the difference between uKO2/uKO2 and WT was not statistically significant (Figure 6C). These findings, along with the known impact of temperature on gene expression and phenotypic plasticity (Amourda et al., 2018; Lu et al., 2021), suggest that the regulatory function of the uORF and overall translation efficiency are temperature-sensitive, highlighting a complex interplay between environmental conditions and gene regulation. To further confirm the bcd uORF’s regulatory function, we conducted dual-luciferase reporter assays. The 5′ UTR from bcd mutant (uKO1 or uKO2) or WT was cloned into a Renilla luciferase reporter construct (Figure 6D). Luciferase activity was significantly higher in uKO1 and uKO2 compared to the WT 5′ UTR, confirming the repressive role of the bcd uORF.

Figure 6. Knocking out the bcd uORF increases CDS translation and perturbs the transcriptome during D. melanogaster embryogenesis.

(A) Genotypes of WT and two uORF knock-out strains (uKO1 and uKO2) generated by CRISPR-Cas9 technology. The uORF is boxed in dark green, and the red ATG represents the start codon of the uORF in the D. melanogaster genome. (B) Two ribosome fractions (monosome and polysome) of 0–2 hr embryos were separated in a sucrose density gradient. Relative RNA abundance in the monosome and polysome fractions was quantified by real-time quantitative PCR. (C) P-to-M ratio of bcd mRNA (bcd mRNA abundance in polysome fraction/bcd mRNA abundance in monosome fraction) at 25 °C (left) and 29 °C (right). The P-to-M ratios of mutants were normalized to WT controls at 25 °C and at 29 °C, respectively. Error bars represent the S.E. of six biological replicates. Asterisks indicate statistical significance (*, p<0.05; **, p<0.01; ***, p<0.001; n.s., p>0.05). (D) Dual-luciferase assay for bcd WT uORF and mutated uORF. The reporter structures of the WT and uORF mutants are illustrated on the left. The uORF mutant sequence was the same as that in the fly mutant created with CRISPR-Cas9 technology. The relative activity of Renilla luciferase was normalized to that of firefly luciferase. Error bars represent the S.E. of six biological replicates. Asterisks indicate statistical significance (***, p <0.001). (E) The number of DEGs in each stage and their intersection with each other at 25 °C (top) and 29 °C (bottom). (F) Gene ontology analysis of DEGs at 29 °C in each stage. The biological process (BP) terms with q-values <0.05 in each stage are indicated in red, and others are indicated in white.

Figure 6—source data 1. Raw numerical data underlying the figures.

Figure 6.

Figure 6—figure supplement 1. Overview of homozygous uORF-KO mutant screening.

Figure 6—figure supplement 1.

(A) Procedures for the primary screening of CRISPR/Cas9-induced mutants. Red ‘-’ denotes the bcd-uORF-KO allele. (B) Purification of the genetic background of homozygous mutants by back-crossing with WT lines for nine generations.
Figure 6—figure supplement 2. Heatmap of sample-to-sample distances of RNA-Seq libraries.

Figure 6—figure supplement 2.

Mut, mutant; WT, wild-type; rep1/rep2, biological replicate 1/biological replicate 2.
Figure 6—figure supplement 2—source data 1. Raw numerical data underlying the figures.
Figure 6—figure supplement 3. The ratio of Bcd target genes among DEGs in bcd-uORF-KO embryos.

Figure 6—figure supplement 3.

The enrichment of Bcd targets in each embryo stage at 29 ºC. Enrichment was calculated as the ratio of the Bcd target proportion in DEGs to the Bcd target proportion in non-DEGs. χ2 tests were performed to compare the difference in Bcd target distribution in DEGs and non-DEGs. The number of DEGs and the number of Bcd targets in DEGs for each stage are provided above the bar. Asterisks indicate statistical significance (χ2 tests; *, p<0.05; **, p<0.01; ***, p<0.001; n.s., p>0.05).
Figure 6—figure supplement 3—source data 1. Raw numerical data underlying the figures.
Figure 6—figure supplement 4. The correlation of expression changes (log2FoldChange) between RNA-Seq (y-axis) and RT-qPCR (x-axis) of 20 bcd targets in four embryo stages in uKO2/uKO2 mutant compared to WT.

Figure 6—figure supplement 4.

Figure 6—figure supplement 4—source data 1. Raw numerical data underlying the figures.

bcd uORF mutants show wide transcriptomic alteration during Drosophila embryogenesis

Since Bcd regulates the expression of many zygotic genes (Berleth et al., 1988; Driever and Nüsslein-Volhard, 1988a; Driever and Nüsslein-Volhard, 1988b), we anticipated that the increased translation of bcd resulting from disrupting the uORF would influence Drosophila transcriptomes and phenotypes. To verify this notion, we performed RNA sequencing on embryos from WT and uKO2/uKO2 flies at four developmental stages (0–2 hr, 2–6 hr, 6–12 hr, and 12–24 h) under both 25°C and 29°C conditions, using two biological replicates (Supplementary file 4 and Figure 6—figure supplement 2). Differential expression analysis revealed widespread alterations in gene expression between WT and mutant embryos, with the number of differentially expressed genes (DEGs) increasing over developmental time and at higher temperature (Figure 6E). At 25 °C, we identified 674, 817, 1047, and 2041 DEGs in 0–2 hr, 2–6 hr, 6–12 hr, and 12–24 hr embryos, respectively; while at 29 °C, we detected 3884, 2358, 2901, and 4164 DEGs in the corresponding stages (Figure 6E). The majority of DEGs were stage-specific, with only a small fraction consistently differentially expressed across all four stages. Functional enrichment analysis of the DEGs revealed distinct biological pathways affected at each stage, including cell morphogenesis and pattern specification in 0–2 hr embryos, metabolic processes and tissue development in 2–6 hr and 6–12 hr embryos, and mitochondrial respiration in 12–24 hr embryos (Figure 6F). Notably, direct targets of Bcd (Li et al., 2008) were significantly enriched among the DEGs in three out of the four stages, with the exception of 2–6 hr embryos (Figure 6—figure supplement 3). RT-qPCR validation of 20 target genes of Bcd confirmed the reliability of the RNA-seq differential expression analysis (Figure 6—figure supplement 4). Together, these findings demonstrate that disruption of the bcd uORF leads to widespread transcriptional changes during Drosophila development, affecting processes ranging from embryogenesis to postembryonic metabolism.

bcd uORF mutants display decreased hatching rates and starvation resistance

Given the widespread transcriptome alterations, we anticipated phenotypic abnormalities in the bcd uORF mutants. As expected, both uKO1/uKO1 and uKO2/uKO2 mutants exhibited significantly lower hatching rates compared to WT [p=1.4 × 10–6 and 7.9 × 10–6, respectively, Wilcoxon rank-sum test (WRST); Figure 7A]. At 25 °C, uKO1/uKO1 mutants produced fewer offspring than WT flies (p<0.001, WRST, Figure 7B). Given that bcd is a maternal gene, we expected reciprocal crosses between uKO1/uKO1 mutants and WT flies to produce different outcomes. Indeed, crossing uKO1/uKO1 males with WT females resulted in offspring numbers similar to those from WT crosses, while crossing uKO1/uKO1 females with WT males yielded offspring numbers comparable to crosses between uKO1/uKO1 mutants (Figure 7B). This verified that the reduction in offspring is due to maternal defects in the uKO1/uKO1 mutants. Similar patterns were observed for uKO2/uKO2 mutants at 25 °C (Figure 7B). Notably, the fecundity reduction in uORF-KO mutants was more pronounced at 29 °C (Figure 7C). Furthermore, crossing uKO1/uKO1 and uKO2/uKO2 mutants also produced significantly fewer progeny than WT flies (Figure 7C), ruling out genetic background or off-target effects. Collectively, these data demonstrate that disrupting the bcd uORF significantly impaired hatchability and fertility.

Figure 7. Knockout of the bcd uORF reduces offspring number and starvation resistance.

Figure 7.

(A) Comparison of the hatching rates (%) of mutant and WT offspring (n=20, Wilcoxon rank-sum test; ***, p<0.001). (B) The offspring number per maternal parent in different crosses over 10 days at 25 °C. Asterisks indicate significant differences between various crosses and crosses of WT females with WT males (n=20, Wilcoxon rank-sum test; *, p<0.05; **, p<0.01; ***, p<0.001; n.s., p>0.05). The different crosses were denoted as the x-axis labels. (C) The offspring number per maternal parent in different crosses over 10 days at 29 °C. (D) Survival curves of WT and mutant adult flies of females (left) and males (right) under starvation conditions. The black line represents the WT, the red line represents the uKO1/uKO1 mutant, and the blue line represents the uKO2/uKO2 mutant. Asterisks indicate significant differences compared to the WT. (n=200, log-rank test; ***, p<0.001; n.s., p>0.05).

Figure 7—source data 1. Raw numerical data underlying the figures.

We also found that both uKO1/uKO1 and uKO2/uKO2 female mutants perished significantly faster than WT flies under starvation conditions (Figure 7D). Males showed similar tendencies, although the difference was not statistically significant for uKO1/uKO1 mutants (Figure 7D). These data suggest that the knockout of bcd uORF diminished starvation resistance in adults, likely due to embryogenesis abnormalities induced by the bcd uORF deletion, even in those that successfully developed to adulthood.

Conservation of uORF-mediated translational buffering in primates

To explore the generality of uORF-mediated translational buffering across evolutionary clades, we analyzed previously published transcriptome and translatome data from three tissues (brain, liver, and testis) in humans and macaques (Wang et al., 2020). We identified 33,680 canonical uORFs in humans and 29,516 in macaques, with 24,385 conserved between the two species. Despite the larger number of uORFs in primates compared to Drosophila due to differences in genome size and gene number, the median TE of conserved uORFs was 1.79 times that of non-conserved uORFs in humans, and the corresponding ratio was 3.43 in macaques (Figure 8A, Figure 8—figure supplement 1). TEs of uORFs were positively correlated between humans and macaques across all tissues (p<0.001, Figure 8B). Additionally, significant positive correlations were observed between the TEs of uORFs and their corresponding coding sequences (CDSs) in all tissues (Figure 8—figure supplement 2). Although interspecific TE divergence of uORFs (βu) and CDSs (βC) were positively correlated (p<0.001, Figure 8C), uORFs generally exhibited larger divergence (Supplementary file 5). Notably, longer uORFs showed stronger buffering effects on CDS translation, reducing interspecific TE divergence by 12.9% and 38.0% compared to genes with short or no uORFs (Figure 8D, Figure 8—figure supplement 3).

Figure 8. uORFs function as translational buffers in primates.

(A) Boxplots showing the TEs of conserved and species-specific uORFs between Hs (H. sapiens) and Mm (M. mulatta). Data for the brain is shown as an example. Wilcoxon rank-sum tests. ***, p<0.001. (B) Spearman’s correlation coefficient (rho) of uORFs’ TE between humans and macaques. The rho values in the brain, liver, and testis were shown as bar plots. ***, p<0.001. Data for the brain is shown as an example in the right panel. (C) Correlation between interspecific uORF TE changes (log2βu) and corresponding CDS TE changes (log2βC) in three tissues. The x-axis was divided into 50 equal bins with increasing βu. (D) Genes expressed in brains were classified into three classes according to the total length of translated uORFs. Boxplots showing interspecific CDS TE variability |log2(βc)| of different gene classes. p values were calculated using Wilcoxon rank-sum tests between the neighboring groups. ***, p<0.001. (E) Genes expressed in brains (mRNA RPKM >0.1 in both species) were classified into three classes according to whether a gene had a conserved and dominantly translated uORF (TE >0.1) in both species or not. Boxplots showing interspecific CDS TE variability |log2(βc)| of different gene classes. p values were calculated using Wilcoxon-rank sum tests between the neighboring groups. ***, p<0.001. (F) Boxplot showing the coefficients of variation (CVs) of CDS TE among the 69 lymphoblastoid cell lines (LCLs). Expressed genes (mean mRNA RPKM >0.1) were divided into 20 bins with increased mRNA expression levels. In each bin, the genes were divided into two fractions according to whether the gene had a translated uORF or not. Wilcoxon rank-sum tests. *, p<0.05; **, p<0.01; ***, p<0.001.

Figure 8—source data 1. Raw numerical data underlying the figures.

Figure 8.

Figure 8—figure supplement 1. Boxplots showing the TEs of conserved and species-specific uORFs in the liver and testis of H.

Figure 8—figure supplement 1.

sapiens and M. mulatta. Wilcoxon rank-sum tests. ***, p<0.001.
Figure 8—figure supplement 1—source data 1. Raw numerical data underlying the figures.
Figure 8—figure supplement 2. Correlations of uORF TE and corresponding CDS TE in 3 tissues of H.

Figure 8—figure supplement 2.

sapiens and M. mulatta. The bars represent Spearman’s correlation coefficient (rho). In all samples, we obtained p-values <0.001. Data from brains are shown as examples in the right panel.
Figure 8—figure supplement 2—source data 1. Raw numerical data underlying the figures.
Figure 8—figure supplement 3. Boxplots showing interspecific CDS TE variability |log2(βCDS)| for three gene groups according to the length of their translated uORFs (No, short, long) in the liver (left) and testis (testis).

Figure 8—figure supplement 3.

Wilcoxon rank-sum tests. ***, p<0.001.
Figure 8—figure supplement 3—source data 1. Raw numerical data underlying the figures.
Figure 8—figure supplement 4. Boxplots showing interspecific CDS TE variability |log2(βCDS)| for different gene classes in the liver and testis.

Figure 8—figure supplement 4.

Genes expressed in liver or testis (mRNA RPKM >0.1 in both species) were classified into three classes according to whether a gene had a conserved and dominant translated uORF (TE >0.1) in both species or not. Wilcoxon rank-sum tests. ***, p<0.001.
Figure 8—figure supplement 4—source data 1. Raw numerical data underlying the figures.
Figure 8—figure supplement 5. Left panel: Boxplot showing the coefficients of variation (CVs) of CDS TEs among 69 human lymphoblastoid cell lines (LCLs) for three gene groups.

Figure 8—figure supplement 5.

Genes were classified into three groups according to the total length of translated uORFs. Right panel: boxplot showing the coefficients of variation (CVs) of CDS TEs among the human cell lines. Genes with translated uORFs were divided into 20 bins with increasing mRNA expression levels. In each bin, genes were divided into two fractions according to the total length of translated uORFs.
Figure 8—figure supplement 5—source data 1. Raw numerical data underlying the figures.

As in Drosophila, we categorized expressed human genes into three classes based on uORF conservation and translation: Class I, genes with conserved and dominantly translated uORFs in both humans and macaques; Class II, genes with conserved uORFs translated in both species but not dominantly in at least one; and Class III, the remaining expressed genes (Figure 8E, Figure 8—figure supplement 4). Consistent with findings in Drosophila, significant differences in |βc| between humans and macaques were observed in the order of Class I < II < III, with Class I genes showing 9.8% and 17.1% lower |βc| values compared to Class II and Class III genes, respectively (Figure 8E, Figure 8—figure supplement 4). These similarities between primates and Drosophila—two clades that diverged over 700 million years ago (Hedges et al., 2006) —suggest that uORF-mediated translational buffering is a widespread mechanism for stabilizing gene translation across evolutionary clades.

We also analyzed matched mRNA-Seq and Ribo-Seq data from 69 human lymphoblastoid cell lines (Battle et al., 2015; Lappalainen et al., 2013) to test whether uORFs buffer against translational variability across different individuals within humans. Genes with translated uORFs exhibited, on average, 8.65% lower CVs in CDS TE across individuals than genes without translated uORFs (Figure 8F), with longer uORFs showing stronger buffering effects (Figure 8—figure supplement 5). Collectively, these findings suggest that uORFs play a crucial role in reducing translational variability both across evolutionary clades and within species.

Discussion

Translational control is vital for maintaining protein homeostasis and cellular activities (Sherman and Qian, 2013; Stein and Frydman, 2019). However, the mechanisms underlying the conservation of protein abundance across species remain largely unknown (Signor and Nuzhdin, 2018; Vogel, 2013). In this study, we extended the ICIER model and conducted simulations to explore uORFs’ regulatory roles in buffering translation during evolution. Our simulations demonstrated that uORF translation reduces variability in downstream CDS translation, with buffering capacity positively correlated with uORF translation efficiency, length, and number. Comparative translatome analyses across developmental stages of two Drosophila species provided evidence that uORFs mitigate interspecific differences in CDS translation. Similar patterns were observed between humans and macaques, and between human individuals, suggesting that uORF-mediated buffering is an evolutionarily conserved mechanism in animals. Additionally, in vivo experiments showed that knocking out the bcd uORF led to aberrant embryogenesis and altered starvation resistance, with more pronounced phenotypic effects at higher temperatures, supporting the role of uORFs in fine-tuning translation and phenotypic outcomes.

While the prevailing consensus on uORFs’ functions is that they repress the translation of downstream CDS by sequestering ribosomes (Andreev et al., 2018; Calvo et al., 2009; Hinnebusch et al., 2016; Johnstone et al., 2016; Kozak, 1989; Morris and Geballe, 2000; Young and Wek, 2016), recent studies have suggested uORFs might play essential roles in stabilizing protein expression (Andreev et al., 2018; Bottorff et al., 2022; Wu et al., 2022). While these studies have primarily focused on the role of uORFs in single-gene cases or under stress conditions, our work expands this knowledge by demonstrating that uORFs act as a general mechanism for buffering translational variability on a genome-wide scale across multiple species and developmental stages. Our findings also suggest that uORF-mediated translational buffering is not merely a response to environmental stress but an evolutionarily conserved mechanism integral to maintaining protein homeostasis across species. Organisms can buffer phenotypic variation against environmental or genotypic perturbations through ‘canalization’ (Waddington, 1942). The discovery that uORFs are crucial for Drosophila development links them to canalization, providing evidence that uORFs reduce interspecific CDS translation differences and enhance phenotypic stability. While previous studies primarily focused on the stabilization of transcriptional levels (Denby et al., 2012; Ebert and Sharp, 2012; Jarosz et al., 2010; Lu et al., 2023; Payne and Wagner, 2015) or protein levels (Alon, 2007; Jarosz et al., 2010; Somogyvári et al., 2022; Wu et al., 2022; Zabinsky et al., 2019), our study opens new avenues for exploring how organisms maintain phenotypic robustness at the translational level through molecular mechanisms like uORF-mediated buffering. This study lays the groundwork for future research into the evolutionary and functional roles of uORFs, particularly in the context of canalization and adaptive evolution.

Previous studies have shown that a significant fraction of fixed uORFs in the populations of D. melanogaster and humans were driven by positive Darwinian selection (Zhang et al., 2018a; Zhang et al., 2021), suggesting active maintenance through adaptive evolution rather than purely neutral or deleterious processes. While uORFs have traditionally been recognized for their capacity to attenuate translation of downstream CDSs, accumulating evidence now underscores their critical role in stabilizing gene expression under fluctuating cellular and environmental conditions (Andreev et al., 2018; Bottorff et al., 2022; Wu et al., 2022). Whether the favored evolutionary selection of uORFs acts primarily through their role in translational repression or translational buffering remains a compelling yet unresolved question, as these two functions are inherently linked. Indeed, highly conserved uORFs tend to be translated at higher levels, resulting not only in stronger inhibition of CDS translation (Calvo et al., 2009; Johnstone et al., 2016; Zhang et al., 2021), but also in a more pronounced buffering effect, as demonstrated in this study. This buffering capacity of uORFs potentially provides selective advantages by reducing fluctuations in protein synthesis, thus minimizing gene-expression noise and enhancing cellular homeostasis. This suggests that selection may favor uORFs that contribute to translational robustness, a hypothesis supported by findings in yeast and mammals showing that uORFs are significantly enriched in stress-response genes and control the translation of certain master regulators of stress responses (Andreev et al., 2015; Lawless et al., 2009; Mueller and Hinnebusch, 1986; Vattem and Wek, 2004). Our study suggests that translational buffering, rather than translational repression alone, can also drive evolutionary selection favoring uORFs, although it remains challenging to empirically disentangle these functions. Future comparative genomic analyses, coupled with experimental approaches such as ribosome profiling and functional mutagenesis, will be crucial in elucidating the precise evolutionary forces driving uORF conservation and adaptation.

The original ICIER model was devised to elucidate how a single uORF can confer resistance to global translation inhibition on its host gene during stress conditions (Andreev et al., 2018). It is proposed that an elongating ribosome (80S) causes downstream 40S subunits to dissociate from the mRNA after collision. Our revised model expands this scope to include bidirectional dissociation, where an 80S ribosome can release both upstream and downstream 40S subunits, reflecting recent findings that 80S primarily causes upstream 40S dissociation during 40S/80S collisions (Bottorff et al., 2022). Mechanistically, increasing the loading rate of 40S ribosomes onto the mRNA (Rin) elevates the density of 40S and 80S ribosomes scanning the uORF, thereby raising the likelihood of 40S-80S collisions and subsequent 40S dissociation. This leads to a reduced flow of 40S ribosomes reaching the CDS relative to the initial increase in 40S at the 5’ end of the mRNA and uORF. Conversely, a decrease in the 40S ribosome loading rate reduces the density of scanning ribosomes, diminishing the probability of 40S-80S collisions and 40S dissociation, and partially restoring the flow of 40S ribosomes to the CDS. Overall, the 40S-80S collision mechanism and subsequent 40S dissociation at the uORF moderate downstream CDS translation less than changes in ribosome loading at the 5' end of the mRNA or uORF. The revised model underscores the analogy of uORFs as ‘molecular dams’, adeptly modulating the stream of ribosomes and cushioning downstream CDS translation against fluctuations (Figure 9A). Additionally, our study broadens its perspective to account for the potential interactive effects of multiple uORFs within a gene, recognizing the prevalent clustering of these regulatory elements and their combinatorial influence on translational control. In essence, our study presents a nuanced and comprehensive expansion of the ICIER model, paving the way for a deeper understanding of uORFs as pivotal elements in the maintenance of protein synthesis stability under variable translational conditions. However, we realize the existence of alternative mechanisms governing uORF function, such as those found in traditional repression models (Wu et al., 2022) or ribosome queuing and reinitiation strategies (Bottorff et al., 2022). As such, further computational studies are needed to dissect and understand the full spectrum of uORF-mediated translational regulation.

Figure 9. Model illustrating the uORF-mediated translational buffering.

Figure 9.

(A) uORFs can reduce and stabilize the downstream CDS translation through ‘ribosomes collision and dissociation’ mechanism in an mRNA containing uORFs. In contrast, the CDS translation is higher and more fluctuant in an mRNA without uORFs. (B) The collided and stalled 40S subunit dissociates from mRNA, is ubiquitinated by the E3 ubiquitin ligase RNF10, and degraded by the initiation ribosome quality control (iRQC) pathway.

Ribosome slowdown or stalling on mRNA due to rare codons (Bottorff et al., 2022; Ivanov et al., 2018; Lin et al., 2019; Meijer and Thomas, 2003) or nascent blocking peptides (Ito and Chiba, 2013; Lovett and Rogers, 1996; Vilela and McCarthy, 2003; Wei et al., 2012) frequently triggers ribosome collisions genome-wide (Han et al., 2020; Meydan and Guydosh, 2020; Zhao et al., 2021). Such collisions, especially among elongating 80S ribosomes, often activate ribosome quality control (RQC) pathways that recognize collision interfaces on the 40S subunit, leading to ribosomal subunit dissociation and degradation (Goldman et al., 2021; Juszkiewicz et al., 2020; Simms et al., 2019). In mammals, ZNF598 specifically identifies collided ribosomes to initiate ubiquitin-dependent protein and mRNA quality control pathways (Geng et al., 2024; Juszkiewicz et al., 2018; Narita et al., 2022; Sundaramoorthy et al., 2017; Ugajin et al., 2023). Analogously, yeast employs Hel2-mediated ubiquitination of uS10, initiating dissociation via the RQC-trigger complex (RQT; Best et al., 2023). Furthermore, the human RQT (hRQT) complex recognizes ubiquitinated ribosomes and induces subunit dissociation similarly to yeast RQT (Hashimoto et al., 2020). However, transient ribosome collisions can evade RQC by promoting resumed elongation through mechanical force provided by trailing ribosomes, thereby mitigating stalling (Madern et al., 2025). Beyond 80S collisions, evidence increasingly highlights a distinct collision type involving scanning 40S subunits or pre-initiation (43S) complexes. Recently, an initiation RQC pathway (iRQC) targeting the small ribosomal subunit (40S) has been described, particularly involving collisions between scanning 43S complexes or between stalled 43S and elongating 80S ribosomes (Figure 9B; Ford et al., 2025; Garshott et al., 2021). During iRQC, E3 ubiquitin ligase RNF10 ubiquitinates uS3 and uS5 proteins, resulting in 40S degradation (Ford et al., 2025). This mechanism aligns closely with our ICIER model, proposing collision-driven 43S dissociation in the 5′ UTRs. Future studies exploring these mechanisms in greater detail will clarify how uORFs modulate translational regulation through buffering effects.

The bcd gene, crucial for Drosophila embryogenesis, has been well studied (Cho et al., 2005; Driever and Nüsslein-Volhard, 1988a; Driever and Nüsslein-Volhard, 1988b; Dubnau and Struhl, 1996; Hannon et al., 2017; Singh et al., 2022; Struhl et al., 1989), but its uORF remains underexplored. Our study showed that deleting the bcd uORF led to abnormal phenotypes and reduced fitness, underscoring the importance of uORF-mediated translation. These effects were more severe under heat stress (29℃) than at normal temperatures (25℃), suggesting uORFs play a vital role in buffering phenotypic variation. We demonstrated that the bcd uORF represses CDS translation using sucrose gradient fractionation followed by qPCR—an approach that directly measures translation efficiency while minimizing confounding from RNA/protein degradation. However, detecting Bcd protein levels with antibodies across developmental stages or conditions in the mutants and wild-type controls would provide an even stronger validation of our model and should be explored in future studies. Rescue experiments, typically used to confirm that phenotypic changes are due to specific genetic edits, faced challenges in our study and previous uORF-KO experiments in animals (Miyake et al., 2023; Wethmar et al., 2010) and plants (Si et al., 2020; Xing et al., 2020; Xue et al., 2023; Zhang et al., 2018b). We altered the start codon of the bcd uORF to minimize effects on the 5′ UTR, without impacting the coding sequence, making traditional rescue experiments impractical. To control for genetic background variations, we backcrossed two bcd uORF-KO mutant strains with w1118 flies for nine generations, using w1118 as a control. Both uORF-KO mutants (uKO1/uKO1 and uKO2/uKO2) consistently showed reduced progeny, ruling out background or off-target effects. Crosses between the two mutants produced fewer offspring than WT crosses, confirming that the reduced progeny was due to the uORF deletions, not genetic background. As bcd is maternally inherited, we expected defective phenotypes only in offspring from mutant females crossed with wild-type males. This was confirmed: uKO1/uKO1 males crossed with wild-type females had normal offspring, while uKO1/uKO1 females crossed with wild-type males had reduced offspring, similar to mutant crosses. Similar results were observed for uKO2/uKO2 mutants. These findings confirm that disrupting bcd uORFs significantly reduces hatchability and fertility, effects not attributable to genetic background. Our results highlight the critical role of bcd uORFs in development and fecundity and provide a basis for future studies on uORF regulation in other genes across species.

Taken together, this study demonstrates the role of uORF-mediated translational buffering in mitigating variability in gene translation during species divergence and development, using large-scale comparative transcriptome and translatome data. Our work addresses gaps in understanding the stabilization of gene expression at the translational level and offers new insights into the evolutionary and functional significance of uORFs.

Materials and methods

Key resources table.

Reagent type (species) or resource Designation Source or reference Identifiers Additional information
Gene (D. melanogaster) bicoid FlyBase FBgn0000166 N/A
Strain, strain background (D. simulans) Sim4 (D. simulans) This study N/A Used to generate all D. simulans libraries
Genetic reagent (D. melanogaster) y,v; attp40{nos-Cas9} Tsinghua Fly Center TH00788 Host line injected with sgRNA plasmid
Genetic reagent (D. melanogaster) w1118 Bloomington Drosophila Stock Center BDSC: 3605 Used for outcrossing and wild-type control
Genetic reagent (D. melanogaster) uKO1/uKO1 This study N/A Bcd-uORF-knockout mutant
Genetic reagent (D. melanogaster) uKO2/uKO2 This study N/A Bcd-uORF-knockout mutant
Cell line (D. melanogaster) S2 cells Thermo Fisher Scientific R69007 Mycoplasma-free, regularly tested using PCR-based assay
Biological sample (D. simulans) Embryos, larva, pupa, bodies, heads This study N/A Used for mRNA-seq and Ribo-seq
Biological sample (D. melanogaster) Embryos, larva, pupa, bodies, heads Zhang et al., 2018a N/A Used for mRNA-seq and Ribo-seq
Recombinant DNA reagent pU6B Tsinghua Fly Center N/A sgRNA-expression vector
Recombinant DNA reagent psiCHECK-2 Promega C8021 Used for dual-luciferase reporter assay
Commercial assay or kit PrimeScript II 1st Strand cDNA Synthesis Kit Takara Cat# 6210 A Used for cDNA synthesis
Commercial assay or kit PowerUp SYBR Green Master Mix Thermo Fisher Scientific N/A Used for RT-qPCR
Commercial assay or kit Dual-Luciferase Reporter Assay System Promega E1980 For measuring Renilla and firefly luciferase activity
Chemical compound, drug SuperaseIn RNase inhibitor Ambion Cat# AM2694 RNase inhibitor
Chemical compound, drug Proteinase inhibitor cocktail Roche N/A Proteinase inhibitor
Chemical compound, drug Lipofectamine 3000 Thermo Fisher Scientific L3000001 transfection of plasmids into S2 cells
Software, algorithm R R Foundation for Statistical Computing https://www.r-project.org Version 4.2.3; packages used include ggplot2, DESeq2, etc.
Software, algorithm Python Python Software Foundation https://www.python.org Version 3.10
Software, algorithm Cutadapt 3.0 https://cutadapt.readthedocs.io N/A Used for adaptor trimming
Software, algorithm Bowtie2 v2.2.3 https://bowtie-bio.sourceforge.net/bowtie2/index.shtml N/A Used for read mapping to contaminant/reference genomes
Software, algorithm STAR https://github.com/alexdobin/STAR RRID:SCR_004463 Used for aligning mRNA-Seq and Ribo-Seq reads to reference genomes
Software, algorithm psite (Plastid) https://plastid.readthedocs.io N/A Used to assign P-sites of ribosome-protected fragments (RPFs)
Sequence-based reagent Bcd-uORF-sgRNA-F This paper sgRNA for Bcd uORF knock-out ATCGCAAAAACGCAAAATGT
Sequence-based reagent Bcd-uORF-sgRNA-R This paper sgRNA for Bcd uORF knock-out ACATTTTGCGTTTTTGCGAT
Sequence-based reagent Bcd-qPCR-F This paper RT‒qPCR primers used for Bcd GATGTATCTGGGTGGCTGCT
Sequence-based reagent Bcd-qPCR-F This paper RT‒qPCR primers used for Bcd CCGAAATGTGGGACGATAAC
Sequence-based reagent Bcd-genotyping-F This paper Primers used for Bcd mutant genotyping GCTTTGCCGTACTGTTCGAT
Sequence-based reagent Bcd-genotyping-R This paper Primers used for Bcd mutant genotyping AACTGAAGCTGCGGATGTTG
Sequence-based reagent Rp49-qPCR-F This paper RT‒qPCR primers used for rp49 CACTTCATCCGCCACCAGTC
Sequence-based reagent Rp49-qPCR-R This paper RT‒qPCR primers used for rp49 CGCTTGTTCGATCCGTAACC

Modeling the uORF-mediated buffering effect on CDS translation

We adapted the stochastic ICIER framework developed by Andreev et al., 2018 based on the TASEP model, with several major modifications. First, we concatenated a 500-codon CDS downstream of the uORF. A leaky scanning ribosome from the uORF would initiate translation at the CDS start codon with a probability of ICDS. The probability of elongating ribosomes moving along the CDS (νEC=0.5) in each action was higher than that along uORFs (νEu=0.3) in the model settings, considering that uORFs usually encode blocking peptides (Ito and Chiba, 2013; Lovett and Rogers, 1996; Luo and Sachs, 1996; Vilela and McCarthy, 2003; Wei et al., 2012) or contain stalling codons (Bottorff et al., 2022; Ivanov et al., 2018; Meijer and Thomas, 2003). Second, we recorded the number of elongating ribosomes that completed translation at the stop codon of a CDS (NEC) or uORF (NEu) during a given time period and regarded them as proxies for quantifying the CDS or uORF TE. Third, we considered three models for simulating the consequences of scanning ribosomes colliding with elongating ribosomes mentioned above: a downstream dissociation model, an upstream dissociation model, and a double dissociation model.

In detail, the mRNA molecule structure in the simulations consisted of five parts: a 5' leader before the uORF (150 nucleotides), the uORF (ranging from 2 to 100 codons), a segment between the uORF and CDS (150 nucleotides), the CDS (500 codons) and the 3′ UTR (150 nucleotides). The Rin determined the loading rate of the 40S scanning ribosomes on the mRNA 5′-terminus, and it roughly represented the availability of trans translational resources in the cell. We generated two sets of Rin values (Figure 1—figure supplement 1A) to simulate the variation in the availability of translational resources (40S scanning ribosomes, etc.): (i) 1000 values of Rin were generated from a random generator of a uniform distribution, U (0, 0.1); (ii) 1000 values were first generated from a random generator of an exponential distribution, E (1), and then each of the 1000 values was divided by 70 to obtain 1000 values of Rin. IuORF and ICDS represent the probability of translation at the start codon of a uORF or CDS, respectively. Weused different combinations of IuORF and ICDS to test how translational initiation strength influenced the buffering effect of uORFs. νs determined the scanning rate of the 40S ribosome, and νEu and νEC determined the elongation rate of the 80S ribosome at a uORF or CDS, respectively. All the parameters used in our simulation are listed in Supplementary file 1.

The mRNA molecule was modeled as an array, and the value of each position in the array represented the occupation status of ribosomes along the mRNA molecule. The process of translation was simulated by a series of discrete actions referred to Andreev et al., 2018. Upon each action, there was a probability that certain events would occur, including the addition of a 40S ribosome to the 5'-terminus, the transformation of a 40S ribosome into an 80S ribosome at a CDS or uORF start codon, the movement of a 40S ribosome or 80S ribosome to the next codon in the CDS or uORF, etc. More specific operations in each action were described in Appendix 1.

After 1,000,000 actions, we recorded the number of 80S ribosomes that completed translation at the uORF (NEu) and CDS (NEC) for each Rin input, which was used to represent the protein production rates (i.e. translation rate) of the uORF and CDS, respectively. To test the effects of different factors, we used various combinations of the parameters in Supplementary file 1 in the simulations. For a given distribution of Rin input (1000 values) and the dissociation model, we obtained the corresponding 1000 NEC values and their CV values by calculating the ratio of the standard deviation to the mean of NEC.

In the double-uORF simulation, we only adopted a uniform distribution of Rin input and the downstream dissociation model. Most parameters and simulation processes were similar to those in the single-uORF simulation, except that an additional uORF was introduced upstream of the CDS with an initiation probability of IuORF2.

Annotation of uORFs in D. melanogaster and D. simulans

We downloaded the whole-genome sequence alignment (maf) of D. melanogaster (dm6) and 26 other insect species from the University of California Santa Cruz (UCSC) genome browser (genome.ucsc.edu) (Rosenbloom et al., 2015). To improve the genome correspondence between D. melanogaster and D. simulans, we adopted a newly published reference-quality genome of D. simulans (NCBI, ASM438218v1; Chakraborty et al., 2021) to replace the original D. simulans genome in UCSC maf. We softly masked repetitive sequences in the new genome by RepeatMasker 4.1.1 (http://www.repeatmasker.org) and aligned it to dm6 with lastz (Chiaromonte et al., 2002) in runLastzChain.sh following UCSC guidelines. The lastz alignment parameters and the scoring matrix were the same as the parameters of dm6 and droSim1 in UCSC. Chained alignments were processed into nets by the chainNet and netSyntenic programs. The alignment was integrated into the multiple alignments of 27 species with multiz (Blanchette et al., 2004).

Based on the genome annotation of D. melanogaster (FlyBase r6.04, https://flybase.org/), we used the Galaxy platform to parse the multiple sequence alignments of 5′ UTRs in Drosophila. The 5′ UTR sequences of each annotated transcript from D. melanogaster and the corresponding sequences in D. simulans were extracted from the maf. The start codons of putative uORFs were identified by scanning all the ATG triplets (uATGs) within the 5′ UTRs of D. melanogaster and D. simulans. uATGs that overlapped with any annotated CDS region were removed. The presence or absence of each uATG of D. melanogaster was determined at orthologous sites in D. simulans based on multiple genome alignments, and vice versa. The conserved uORF was defined as a uORF where its uATG was present in both D. melanogaster and the corresponding orthologous positions of D. simulans. For each protein-coding gene, we only considered the canonical transcript. For transcripts containing multiple uORFs, we defined the uORF that showed the highest TE as the dominant uORF. The branch length score (BLS) of the uATGs was calculated as we previously described (Zhang et al., 2021).

Fly materials and general raising conditions

The sim4 strain of D. simulans was used to generate all the libraries of D. simulans in this study. All flies were raised on standard corn medium and grown in 12 hr light: 12 hr dark cycles at 25 °C for general conditions or at 29 °C for specific experimental design. The samples of embryos at different stages, larva, pupa, bodies, and heads were collected following a previous protocol (Zhang et al., 2018a).

Processing Drosophila mRNA-Seq and Ribo-Seq data

Ribo-Seq and matched mRNA-Seq libraries for different developmental stages and tissues of D. simulans were constructed as we previously described (Zhang et al., 2018a) and sequenced on an Illumina HiSeq-2500 sequencer (run type: single-end; read length: 50 nucleotides) according to the manufacturer’s protocol. The 3' adaptor sequences (TGGAATTCTCGGGTGCCAAGG) were trimmed using Cutadapt 3.0 with default parameters (Martin, 2011), and the NGS reads were mapped to the genomes of yeast, Wolbachia, Drosophila viruses and the sequences of tRNAs, ribosomal RNAs (rRNAs), small nuclear RNAs (snRNAs) or small nucleolar RNAs (snoRNAs) of D. melanogaster (FlyBase r6.04) and D. simulans (FlyBase r1.3) using Bowtie2 version 2.2.3 (Langmead and Salzberg, 2012) with the parameters -p8 --local -k1. The mapped reads of these genomes/sequences were further removed in the downstream analysis.

After filtering, the mRNA-Seq and Ribo-Seq reads were mapped to the reference genomes of D. melanogaster (FlyBase, r6.04) and D. simulans (NCBI, ASM438218v1), respectively, using the Spliced Transcripts Alignment to a Reference (STAR) algorithm (Dobin et al., 2013). For Ribo-Seq reads, we assigned a mapped RPF (27–34 nucleotides in length) to its P-site using the psite script from Plastid (Dunn and Weissman, 2016). The uniquely mapped reads were extracted and then mapped to the CDS of D. melanogaster and D. simulans, respectively, using STAR (Dobin et al., 2013). The P-sites of RPF or mRNA-Seq reads that overlapped with a CDS were counted separately. The reads that were not mapped to CDSs were then mapped to the 5′ UTRs of D. melanogaster and D. simulans. The P-sites of RPF or mRNA-Seq reads that overlapped with uORFs were counted separately.

The TE of a given uORF or CDS was calculated as the RPKMP-site/RPKMmRNA ratio. For a few uORFs with RPKMmRNA = 0 but RPKMP-site >0, a pseudocount of 0.1 was added to both RPKMmRNA and RPKMP-site to avoid dividing a positive value by zero. In each sample, expressed genes were defined as the genes with a CDS RPKMmRNA >0.1 in both species, and translated uORFs were defined as uORFs with a TE >0.1.

Testing the statistical significance of the difference in the interspecific TE change between a uORF and its downstream CDS

For this analysis, we adopted the methods developed by Zhang et al., 2018a. Briefly, for the samples of female bodies or male bodies of D. melanogaster with biological replicates, we obtained the log2(TE) and SE of the log2(TE) of a uORF or CDS by contrasting the RPF counts against mRNA-Seq read counts using DESeq2. Then, we fitted the SE values against the normalized mRNA counts and log2(TE) values using the gam function in the R package mgcv, with a log link to obtain the SE ~mRNA counts + log2TE functions. For other samples without biological replicates, we estimated the SE of the log2(TE) for a feature (CDS or uORF) by applying the fitted functions obtained based on the biological replicates of female and male bodies to the observed mRNA counts and log2(TE) values. We identified uORFs whose TEs differed significantly between the paired samples of D. melanogaster and D. simulans by testing whether the value obtained for log2(βu)=log2(TEuORF,sim)log2(TEuORF,mel) was significantly different from 0. Based on the SE of the log2(TE) derived as described above, the SE of the log2(βu) can be derived as follows:

SE_log2(βu)=(SE_log2TEuORF,sim)2+(SE_log2TEuORF,met)2

Then, we defined γ=βcβu (where βc=TECDS,sim/TECDS,mel) and tested whether log2(γ) was significantly different from 0 to determine whether the magnitude of interspecific TE changes in CDSs and uORFs was significantly different. We obtained log2TEuORF,sim, log2TEuORF,mel, log2TECDS,sim, and log2TECDS,mel as described above and estimated SE_log2TEuORF,sim, SE_log2TEuORF,mel, SE_log2TECDS,sim, and SE_log2TECDS,mel based on the biological replicates of female and male bodies of D. melanogaster. Finally, log2(γ) also follows a normal distribution with SE denoted as SE_log2(γ) =

(SE_log2TEuORF,sim)2+(SE_log2TEuORF,mel)2+(SE_log2TECDS,sim)2+(SE_log2TECDS,mel)2

As the Wald statistic, log2(γ)SE_log2(γ), follows a standard normal distribution under the null hypothesis that log2(βu)=0, we calculated the p value as follows: 2(1ϕ (|log2(γ)SE_log2(γ)|)).

uORFs are shorter than CDSs and have substantially lower read counts, reducing statistical power when comparing TE changes. To mitigate this bias, we focused on highly expressed genes, defined as those with both mRNA and RPF RPKM values above the 50th percentile in D. melanogaster and D. simulans. uORFs and CDSs of these genes were used in the calculation of significant TE change.

Knocking out a uORF with CRISPR-Cas9 technology in D. melanogaster

We searched for possible sgRNA target sites near the uATG start codon of a uORF using the Benchling website (https://www.benchling.com/crispr/) to design optimal single guide RNA (sgRNA) sequences with high specificity and low off-target effects. We then synthesized single-stranded complementary DNAs (ssDNAs) and annealed them to obtain double-stranded DNA (dsDNA), which served as the template for sgRNA expression. The template sequences of the sgRNA used for bcd uATG-KO are listed in the Key Resources Table. The dsDNA was then ligated into the BbsI-digested pU6B vector. The pU6B-sgRNA plasmid was purified and injected into the embryos of transgenic Cas9 flies collected within one hour of laying at the Tsinghua Fly Center as described in Ni et al., 2011. The injected embryos were kept at 25 °C and 60% humidity until adulthood (G0). The G0 adult flies that hatched from injected embryos were individually crossed with other flies (y sc v) to increase the number of offspring. Then, the F1 progeny were crossed with flies carrying an appropriate balancer (Dr, e/TM3, Sb). After F2 spawning, the F1 individuals were screened for mutations of interest by genotyping. The primers used for genotyping are listed in the Key Resources Table. The F2 progeny whose parents showed positive genotyping results were then screened for the yellow- gene to separate the chromosome carrying nos-Cas9. The screened F2 males were crossed with the flies containing the same balancer as above. After F2 genotyping, the progeny (F3) of positive F2 individuals showing the same mutation status were crossed individually to generate homozygous mutants in the F4 generation. The original homozygous mutants were sequentially outcrossed with w1118 flies for nine generations to purify the genetic background (Figure 6—figure supplement 1B).

Ribosome fraction analysis by sucrose gradient fractionation and RT‒qPCR

The 0–2 hr embryos obtained from w1118, uKO1/uKO1, and uKO2/uKO2 mutants raised at 25°C and 29°C were collected and homogenized in a Dounce homogenizer with lysis buffer [50 mM Tris pH 7.5, 150 mM NaCl, 5 mM MgCl2, 1% Triton X-100, 2 mM dithiothreitol (DTT), 20 U/ml SuperaseIn (Ambion), 0.5 tablets of proteinase inhibitor (Roche), 100 μg/ml emetine (Sigma Aldrich), and 50 μM guanosine 5′-[β,γ-imido]triphosphate trisodium salt hydrate (GMP-PNP; Sigma Aldrich)] at 4 °C. The lysates were clarified by centrifugation at 4 °C and 20,000×g for 8 min, and the supernatants were transferred to new 1.5 ml tubes. 10–45% sucrose gradients were prepared in buffer (250 mM NaCl, 50 mM Tris pH 7.5, 15 mM MgCl2, 0.5 mM DTT, 12 U/ml RNaseOUT, 0.5 tablets of protease inhibitor, and 20 μg/ml emetine) using a Gradient Master (Biocomp Instruments) in ULTRA-CLEAR Thinwall Tubes (Beckman Coulter). A sample volume of up to 500 μl was applied to the top of each gradient. After ultracentrifugation with a Hitachi P40ST rotor at 35,000×rpm for 3 hr at 4 °C, the monosome and polysome fractions were collected, flash-frozen in liquid nitrogen, and stored at –80 °C until further use.

The RNA in the monosome and polysome fractions was extracted separately using TRIzol reagent (Life Technologies, Inc) and chloroform (Beijing Chemical Works) following the manufacturer’s instructions and was reverse transcribed into cDNA using the PrimeScript II 1st Strand cDNA Synthesis Kit (Takara). RT‒qPCR analysis of bcd cDNA and its targets was performed using PowerUp SYBR Green Master Mix (Thermo Fisher) following the manufacturer’s instructions with rp49 as an internal control. The primer sequences employed for RT-qPCR are listed in the Key Resources Table. For each sample, the ratio of bcd mRNA abundance in the polysome fraction to that in the monosome fraction was calculated as the P-to-M ratio. Six biological replicates were performed for each sample.

Cell lines

Drosophila S2 cells were purchased from Thermo Fisher Scientific (product number: R69007) and their identity was not subsequently confirmed. The cells were regularly tested and found negative for mycoplasma contamination using PCR-based assays.

Dual-luciferase reporter assays

The wild-type (WT) 5′ UTR of bcd was cloned from cDNA by PCR, and uATG mutations were introduced into 5′ UTR using specific amplification primers. The WT and mutated 5′ UTR sequences were ligated into a linearized reporter plasmid (psiCHECK-2 vector, Promega). The whole sequence of all the plasmids was validated by Sanger sequencing.

Drosophila S2 cells were cultured in Schneider’s Insect Medium (Sigma) plus 10% (by volume) heat-inactivated fetal bovine serum, 100 U/ml penicillin and 100 µg/ml streptomycin (Thermo Fisher) at 25℃ without CO2 for 24 hr to reach 1–2×106 cells/ml before further treatments. Plasmid transfection was conducted with Lipofectamine 3000 (L3000001, Thermo Fisher) according to the supplier’s protocol. The Renilla luciferase activity associated with WT or uORF-mutated 5′ UTRs was measured according to the manual of the Dual-Luciferase Reporter Assay System (Promega) 32 hr after transfection and was normalized to the activity of firefly luciferase.

mRNA-Seq in the embryos of mutant and WT flies

We collected 0–2 hr, 2–6 hr, 6–12 hr, and 12–24 hr embryos of w1118 and uKO2/uKO2 mutants raised at 25°C and 29°C and conducted mRNA-Seq. Library construction and sequencing with PE150 were conducted by Annoroad on the Illumina Nova6000 platform. Two biological replicates were sequenced for each sample. The clean data were mapped to the reference genome of D. melanogaster (FlyBase, r6.04) using STAR (Dobin et al., 2013). Reads mapped to the exons of each gene were tabulated with htseq-count (Anders et al., 2015). The differentially expressed genes were identified using DESeq2 (McCarthy et al., 2012). The Gene Ontology (GO) analyses were conducted using the “clusterProfiler” package in R (Yu et al., 2012).

Measurement of embryo hatchability

We collected embryos from WT and mutant flies and manually seeded them in vials containing standard corn medium at a density of 30 embryos per vial and 20 vials per strain. The embryos were cultivated under standard conditions (60% humidity, 12 hr light: 12 hr dark cycles at 25 °C). We counted the number of pupae in each vial after the completion of pupation and calculated the corresponding hatching rate = numberpupa/ 30.

Quantification of offspring number per female fly

Newly hatched virgins were picked out and allowed to mature for two days in separate vials. They were then mated by placing one virgin female with three male flies for two days. After that, each female parent was transferred to a new vial to count the offspring number produced in each 10 day period at 25°C and 29°C. All assays were performed with 20 females per genotype.

Measurement of starvation resistance in adult flies

We selected 3- to 5-day-old adult males and females and placed them in the starvation medium (1.5% agar), with 10 flies per vial and 10 vials for both males and females from each strain. We made observations every 6 hr or 12 hr to count the number of deaths under starvation conditions until all flies had starved to death. The survival curves were plotted by the ggsurvplot package in R.

mRNA-Seq and Ribo-Seq data analysis in primates

The mRNA-Seq and Ribo-Seq data from the brains, livers, and testes of humans and macaques were downloaded from reference (Wang et al., 2020) with accession number E-MTAB-7247 (ArrayExpress). The mRNA-Seq and Ribo-Seq data of human lymphoblastoid cell lines (LCL) from Yoruba individuals were downloaded from references (Battle et al., 2015; Lappalainen et al., 2013) under accession numbers GSE61742 and E-GEUV-1. The matched mRNA-Seq and Ribo-Seq libraries of 69 individuals were used.

The uORF annotation and downstream analysis procedures for the human and macaque data were similar to those applied in Drosophila as described above. The differential analysis of translational efficiency in humans and macaques was conducted by Xtail (Xiao et al., 2016). In each pair of human-macaque samples, expressed genes were defined as the genes with a CDS RPKMmRNA >0.1 in both species. The translated uORFs in a sample were defined as uORFs with a TE >0.1. For the human cell line data, expressed genes were defined as genes with a mean CDS RPKMmRNA >0.1 across the cell lines, and translated uORFs were defined as uORFs with a mean TE >0.1.

Acknowledgements

We thank Drs. Wei Xie, Weiwei Zhai, and Xionglei He for constructive comments and suggestions. We thank the National Center for Protein Sciences at Peking University for their technical assistance. Some of the analyses were performed on the High-Performance Computing Platform of the Center for Life Sciences. This work was supported by grants from the Ministry of Science and Technology of the People’s Republic of China (2022YFE0132000), the Yunnan Provincial Science and Technology Project at Southwest United Graduate School (202302 A0370006), the National Natural Science Foundation of China (32070597) and the Natural Science Foundation of Beijing (5212006). The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Appendix 1

The detailed operations per action in the ICIER simulation

The translation process of an mRNA molecule is modeled as a series of discrete actions. During each action, certain events may occur with certain probabilities for each entity (40S or 80S ribosome). The operations per action are described as follows:

If the first 10 triplets were vacant of the mRNA, a 40S ribosome would be added there with a probability of Rin.

2. If a 40S ribosome was located at the start codon of the uORF, it would transform into an 80S ribosome with a probability of IuORF. If a 40S ribosome was located at the start codon of the CDS, it would transform into an 80S ribosome with a probability of ICDS.

3.If an 80S ribosome was located at the stop codon of the uORF or CDS, it would dissociate from the mRNA. The dissociation of an 80S ribosome from the uORF or CDS stop codon was counted as a completed translation event, and the values of NEC and NEu were incremented by one, respectively.

If a 40S ribosome arrived at the 3' end of the mRNA, it would dissociate from the mRNA.

5. For each 40S ribosome on the mRNA, if its next position was vacant (in the downstream-dissociation model) or not occupied by another 40S ribosome (in the upstream-dissociation model and double-dissociation model), it would move to the next position with a probability of νs.

6. For each 80S ribosome in the uORF, if its next position was vacant (in the upstream-dissociation model) or not occupied by another 80S ribosome (in the downstream-dissociation model and double-dissociation model), it would move to the next position with a probability of νEu. Similarly, for each 80S ribosome in the CDS, if its next position was vacant (in the upstream-dissociation model) or not occupied by an 80S ribosome (in the downstream-dissociation model and double-dissociation model), it would move to the next position with a probability of νEC.

It was checked whether any 80S ribosome collided with a 40S ribosome. If such a collision occurred, the 40S ribosome involved would dissociate from the mRNA with a probability of Kup (upstream dissociation or double-dissociation model) or Kdown (downstream dissociation or double-dissociation model).

These above operations are executed repeatedly until the action count reaches the limit given by the user (1,000,000 in this study).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Jian Lu, Email: luj@pku.edu.cn.

Wenfeng Qian, Chinese Academy of Sciences, China.

Alan M Moses, University of Toronto, Canada.

Funding Information

This paper was supported by the following grants:

  • Ministry of Science and Technology of the People's Republic of China 2022YFE0132000 to Jian Lu.

  • Yunnan Provincial Science and Technology Project at Southwest United Graduate School 202302A0370006 to Jian Lu.

  • National Natural Science Foundation of China 32070597 to Jian Lu.

  • Natural Science Foundation of Beijing 5212006 to Jian Lu.

Additional information

Competing interests

No competing interests declared.

Author contributions

Resources, Data curation, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing.

Data curation, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing.

Formal analysis, Investigation.

Data curation, Formal analysis, Investigation.

Methodology.

Data curation.

Data curation.

Data curation.

Conceptualization, Supervision, Funding acquisition, Validation, Investigation, Writing – original draft, Project administration, Writing – review and editing.

Additional files

Supplementary file 1. Parameters used in our simulation.
elife-104074-supp1.docx (83.8KB, docx)
Supplementary file 2. Mapping statistics of Ribo-Seq and matched mRNA-Seq libraries.
elife-104074-supp2.docx (112.8KB, docx)
Supplementary file 3. Genes with uORFs showing strong evidence of translational buffering.
Supplementary file 4. Mapping statistics of mRNA-Seq libraries for bcd uKO2/uKO2 mutant and WT flies.
elife-104074-supp4.docx (114.9KB, docx)
Supplementary file 5. Numbers of genes showing different magnitudes of TE changes between uORFs and CDS at the interspecific level, H. sapiens and M. mulatta.
MDAR checklist

Data availability

All deep-sequencing data generated in this study, including single-ended mRNA-Seq and Ribo-Seq data of 10 developmental stages and tissues of Drosophila simulans and paired-end mRNA-Seq data of 0-2 h, 2-6 h, 6-12 h, and 12-24 h Drosophila melanogaster embryos, were deposited in the China National Genomics Data Center Genome Sequence Archive (GSA) under accession numbers CRA003198, CRA007425, and CRA007426. The mRNA-Seq and Ribo-Seq data for the different developmental stages and tissues of Drosophila melanogaster were published in our previous paper and were deposited in the Sequence Read Archive (SRA) under accession number SRP067542. All original code has been deposited on GitHub: https://github.com/lujlab/uORF_buffer (copy archived at Lu, 2022) and https://github.com/lujlab/Buffer_eLife2025 (copy archived at Lu, 2025).

The following datasets were generated:

Sun Y, Duan Y, Gao P, Liu C, Jin K, Dou S, Tang W, Zhang H, Lu J. 2024. Upstream open reading frames buffer translational variability during Drosophila evolution and development. China National Genomics Data Center Genome Sequence Archive. CRA003198

Sun Y, Duan Y, Gao P, Liu C, Jin K, Dou S, Tang W, Zhang H, Lu J. 2024. Upstream open reading frames buffer translational variability during Drosophila evolution and development. China National Genomics Data Center Genome Sequence Archive. CRA007425

Sun Y, Duan Y, Gao P, Liu C, Jin K, Dou S, Tang W, Zhang H, Lu J. 2024. Upstream open reading frames buffer translational variability during Drosophila evolution and development. China National Genomics Data Center Genome Sequence Archive. CRA007426

The following previously published dataset was used:

Zhang H, Dou S, He F, Luo J, Wei L, Lu J. 2018. Genome-wide maps of ribosomal occupancy provide insights into adaptive evolution and regulatory roles of uORFs during Drosophila development. NCBI Sequence Read Archive. SRP067542

References

  1. Abdelmohsen K, Panda AC, Kang MJ, Guo R, Kim J, Grammatikakis I, Yoon JH, Dudekula DB, Noh JH, Yang X, Martindale JL, Gorospe M. 7SL RNA represses p53 translation by competing with HuR. Nucleic Acids Research. 2014;42:10099–10111. doi: 10.1093/nar/gku686. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Alon U. Network motifs: theory and experimental approaches. Nature Reviews. Genetics. 2007;8:450–461. doi: 10.1038/nrg2102. [DOI] [PubMed] [Google Scholar]
  3. Amourda C, Chong J, Saunders TE. MicroRNAs buffer genetic variation at specific temperatures during embryonic development. bioRxiv. 2018 doi: 10.1101/444810. [DOI]
  4. Anders S, Pyl PT, Huber W. HTSeq--a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Andreev DE, O’Connor PBF, Fahey C, Kenny EM, Terenin IM, Dmitriev SE, Cormican P, Morris DW, Shatsky IN, Baranov PV. Translation of 5’ leaders is pervasive in genes resistant to eIF2 repression. eLife. 2015;4:e03971. doi: 10.7554/eLife.03971. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Andreev DE, Arnold M, Kiniry SJ, Loughran G, Michel AM, Rachinskii D, Baranov PV. TASEP modelling provides a parsimonious explanation for the ability of a single uORF to derepress translation during the integrated stress response. eLife. 2018;7:e32563. doi: 10.7554/eLife.32563. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Artieri CG, Fraser HB. Evolution at two levels of gene expression in yeast. Genome Research. 2014;24:411–421. doi: 10.1101/gr.165522.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Baird TD, Palam LR, Fusakio ME, Willy JA, Davis CM, McClintick JN, Anthony TG, Wek RC. Selective mRNA translation during eIF2 phosphorylation induces expression of IBTKalpha. Molecular Biology of the Cell. 2014;25:1686–1697. doi: 10.1091/mbc.E14-02-0704. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Battle A, Khan Z, Wang SH, Mitrano A, Ford MJ, Pritchard JK, Gilad Y. Genomic variation. Impact of regulatory variation from RNA to protein. Science. 2015;347:664–667. doi: 10.1126/science.1260793. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Berleth T, Burri M, Thoma G, Bopp D, Richstein S, Frigerio G, Noll M, Nüsslein-Volhard C. The role of localization of bicoid RNA in organizing the anterior pattern of the Drosophila embryo. The EMBO Journal. 1988;7:1749–1756. doi: 10.1002/j.1460-2075.1988.tb03004.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Best K, Ikeuchi K, Kater L, Best D, Musial J, Matsuo Y, Berninghausen O, Becker T, Inada T, Beckmann R. Structural basis for clearing of ribosome collisions by the RQT complex. Nature Communications. 2023;14:921. doi: 10.1038/s41467-023-36230-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Blanchette M, Kent WJ, Riemer C, Elnitski L, Smit AFA, Roskin KM, Baertsch R, Rosenbloom K, Clawson H, Green ED, Haussler D, Miller W. Aligning multiple genomic sequences with the threaded blockset aligner. Genome Research. 2004;14:708–715. doi: 10.1101/gr.1933104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bohlen J, Harbrecht L, Blanco S, Clemm von Hohenberg K, Fenzl K, Kramer G, Bukau B, Teleman AA. DENR promotes translation reinitiation via ribosome recycling to drive expression of oncogenes including ATF4. Nature Communications. 2020;11:4676. doi: 10.1038/s41467-020-18452-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bottorff TA, Park H, Geballe AP, Subramaniam AR. Translational buffering by ribosome stalling in upstream open reading frames. PLOS Genetics. 2022;18:e1010460. doi: 10.1371/journal.pgen.1010460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Buccitelli C, Selbach M. mRNAs, proteins and the emerging principles of gene expression control. Nature Reviews. Genetics. 2020;21:630–644. doi: 10.1038/s41576-020-0258-4. [DOI] [PubMed] [Google Scholar]
  16. Calvo SE, Pagliarini DJ, Mootha VK. Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humans. PNAS. 2009;106:7507–7512. doi: 10.1073/pnas.0810916106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Chakraborty M, Chang CH, Khost DE, Vedanayagam J, Adrion JR, Liao Y, Montooth KL, Meiklejohn CD, Larracuente AM, Emerson JJ. Evolution of genome structure in the Drosophila simulans species complex. Genome Research. 2021;31:380–396. doi: 10.1101/gr.263442.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Chen Y-J, Tan BC-M, Cheng Y-Y, Chen J-S, Lee S-C. Differential regulation of CHOP translation by phosphorylated eIF4E under stress conditions. Nucleic Acids Research. 2010;38:764–777. doi: 10.1093/nar/gkp1034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Chen J, Tresenrider A, Chia M, McSwiggen DT, Spedale G, Jorgensen V, Liao H, van Werven FJ, Ünal E. Kinetochore inactivation by expression of a repressive mRNA. eLife. 2017;6:e27417. doi: 10.7554/eLife.27417. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Cheng Z, Otto GM, Powers EN, Keskin A, Mertins P, Carr SA, Jovanovic M, Brar GA. Pervasive, coordinated protein-level changes driven by transcript isoform switching during meiosis. Cell. 2018;172:910–923. doi: 10.1016/j.cell.2018.01.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Chiaromonte F, Yap VB, Miller W. Scoring pairwise genomic sequence alignments. Pacific Symposium on Biocomputing. 2002;01:115–126. doi: 10.1142/9789812799623_0012. [DOI] [PubMed] [Google Scholar]
  22. Cho PF, Poulin F, Cho-Park YA, Cho-Park IB, Chicoine JD, Lasko P, Sonenberg N. A new paradigm for translational control: inhibition via 5’-3’ mRNA tethering by Bicoid and the eIF4E cognate 4EHP. Cell. 2005;121:411–423. doi: 10.1016/j.cell.2005.02.024. [DOI] [PubMed] [Google Scholar]
  23. Chothani SP, Adami E, Widjaja AA, Langley SR, Viswanathan S, Pua CJ, Zhihao NT, Harmston N, D’Agostino G, Whiffin N, Mao W, Ouyang JF, Lim WW, Lim S, Lee CQE, Grubman A, Chen J, Kovalik JP, Tryggvason K, Polo JM, Ho L, Cook SA, Rackham OJL, Schafer S. A high-resolution map of human RNA translation. Molecular Cell. 2022;82:2885–2899. doi: 10.1016/j.molcel.2022.06.023. [DOI] [PubMed] [Google Scholar]
  24. Churbanov A, Rogozin IB, Babenko VN, Ali H, Koonin EV. Evolutionary conservation suggests a regulatory function of AUG triplets in 5’-UTRs of eukaryotic genes. Nucleic Acids Research. 2005;33:5512–5520. doi: 10.1093/nar/gki847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Ciandrini L, Stansfield I, Romano MC. Role of the particle’s stepping cycle in an asymmetric exclusion process: a model of mRNA translation. Physical Review. E, Statistical, Nonlinear, and Soft Matter Physics. 2010;81:051904. doi: 10.1103/PhysRevE.81.051904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Costa-Mattioli M, Walter P. The integrated stress response: from mechanism to disease. Science. 2020;368:eaat5314. doi: 10.1126/science.aat5314. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Denby CM, Im JH, Yu RC, Pesce CG, Brem RB. Negative feedback confers mutational robustness in yeast transcription factor regulation. PNAS. 2012;109:3874–3878. doi: 10.1073/pnas.1116360109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Dever TE, Feng L, Wek RC, Cigan AM, Donahue TF, Hinnebusch AG. Phosphorylation of initiation factor 2α by protein kinase GCN2 mediates gene-specific translational control of GCN4 in yeast. Cell. 1992;68:585–596. doi: 10.1016/0092-8674(92)90193-G. [DOI] [PubMed] [Google Scholar]
  29. Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Driever W, Nüsslein-Volhard C. The bicoid protein determines position in the Drosophila embryo in a concentration-dependent manner. Cell. 1988a;54:95–104. doi: 10.1016/0092-8674(88)90183-3. [DOI] [PubMed] [Google Scholar]
  31. Driever W, Nüsslein-Volhard C. A gradient of bicoid protein in Drosophila embryos. Cell. 1988b;54:83–93. doi: 10.1016/0092-8674(88)90182-1. [DOI] [PubMed] [Google Scholar]
  32. Dubnau J, Struhl G. RNA recognition and translational regulation by a homeodomain protein. Nature. 1996;379:694–699. doi: 10.1038/379694a0. [DOI] [PubMed] [Google Scholar]
  33. Dunn JG, Weissman JS. Plastid: nucleotide-resolution analysis of next-generation sequencing and genomics data. BMC Genomics. 2016;17:958. doi: 10.1186/s12864-016-3278-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Ebert MS, Sharp PA. Roles for microRNAs in conferring robustness to biological processes. Cell. 2012;149:515–524. doi: 10.1016/j.cell.2012.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Ford PW, Garshott DM, Narasimhan M, Ge X, Jordahl EM, Subramanya S, Bennett EJ. RNF10 and RIOK3 facilitate 40S ribosomal subunit degradation upon 60S biogenesis disruption or amino acid starvation. Cell Reports. 2025;44:115371. doi: 10.1016/j.celrep.2025.115371. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Fraser HB, Hirsh AE, Giaever G, Kumm J, Eisen MB. Noise minimization in eukaryotic gene expression. PLOS Biology. 2004;2:e137. doi: 10.1371/journal.pbio.0020137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Garshott DM, An H, Sundaramoorthy E, Leonard M, Vicary A, Harper JW, Bennett EJ. iRQC, a surveillance pathway for 40S ribosomal quality control during mRNA translation initiation. Cell Reports. 2021;36:109642. doi: 10.1016/j.celrep.2021.109642. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Geng J, Li S, Li Y, Wu Z, Bhurtel S, Rimal S, Khan D, Ohja R, Brandman O, Lu B. Stalled translation by mitochondrial stress upregulates a CNOT4-ZNF598 ribosomal quality control pathway important for tissue homeostasis. Nature Communications. 2024;15:1637. doi: 10.1038/s41467-024-45525-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Goldman DH, Livingston NM, Movsik J, Wu B, Green R. Live-cell imaging reveals kinetic determinants of quality control triggered by ribosome stalling. Molecular Cell. 2021;81:1830–1840. doi: 10.1016/j.molcel.2021.01.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Han P, Shichino Y, Schneider-Poetsch T, Mito M, Hashimoto S, Udagawa T, Kohno K, Yoshida M, Mishima Y, Inada T, Iwasaki S. Genome-wide survey of ribosome collision. Cell Reports. 2020;31:107610. doi: 10.1016/j.celrep.2020.107610. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Hannon CE, Blythe SA, Wieschaus EF. Concentration dependent chromatin states induced by the bicoid morphogen gradient. eLife. 2017;6:e28275. doi: 10.7554/eLife.28275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Hashimoto S, Sugiyama T, Yamazaki R, Nobuta R, Inada T. Identification of a novel trigger complex that facilitates ribosome-associated quality control in mammalian cells. Scientific Reports. 2020;10:3422. doi: 10.1038/s41598-020-60241-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Hedges SB, Dudley J, Kumar S. TimeTree: a public knowledge-base of divergence times among organisms. Bioinformatics. 2006;22:2971–2972. doi: 10.1093/bioinformatics/btl505. [DOI] [PubMed] [Google Scholar]
  44. Hill MS, Vande Zande P, Wittkopp PJ. Molecular and evolutionary processes generating variation in gene expression. Nature Reviews. Genetics. 2021;22:203–215. doi: 10.1038/s41576-020-00304-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Hinnebusch AG. Translational regulation of GCN4 and the general amino acid control of yeast. Annual Review of Microbiology. 2005;59:407–450. doi: 10.1146/annurev.micro.59.031805.133833. [DOI] [PubMed] [Google Scholar]
  46. Hinnebusch AG, Ivanov IP, Sonenberg N. Translational control by 5’-untranslated regions of eukaryotic mRNAs. Science. 2016;352:1413–1416. doi: 10.1126/science.aad9868. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Ingolia NT, Ghaemmaghami S, Newman JRS, Weissman JS. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science. 2009;324:218–223. doi: 10.1126/science.1168978. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Ingolia NT, Lareau LF, Weissman JS. Ribosome profiling of mouse embryonic stem cells reveals the complexity and dynamics of mammalian proteomes. Cell. 2011;147:789–802. doi: 10.1016/j.cell.2011.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Ingolia NT. Ribosome footprint profiling of translation throughout the genome. Cell. 2016;165:22–33. doi: 10.1016/j.cell.2016.02.066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Ito K, Chiba S. Arrest peptides: cis-acting modulators of translation. Annual Review of Biochemistry. 2013;82:171–202. doi: 10.1146/annurev-biochem-080211-105026. [DOI] [PubMed] [Google Scholar]
  51. Ivanov IP, Shin BS, Loughran G, Tzani I, Young-Baird SK, Cao C, Atkins JF, Dever TE. Polyamine control of translation elongation regulates start site selection on antizyme inhibitor mRNA via ribosome queuing. Molecular Cell. 2018;70:254–264. doi: 10.1016/j.molcel.2018.03.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Jackson RJ, Hellen CUT, Pestova TV. The mechanism of eukaryotic translation initiation and principles of its regulation. Nature Reviews. Molecular Cell Biology. 2010;11:113–127. doi: 10.1038/nrm2838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Jarosz DF, Taipale M, Lindquist S. Protein homeostasis and the phenotypic manifestation of genetic diversity: principles and mechanisms. Annual Review of Genetics. 2010;44:189–216. doi: 10.1146/annurev.genet.40.110405.090412. [DOI] [PubMed] [Google Scholar]
  54. Johnstone TG, Bazzini AA, Giraldez AJ. Upstream ORFs are prevalent translational repressors in vertebrates. The EMBO Journal. 2016;35:706–723. doi: 10.15252/embj.201592759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Juszkiewicz S, Chandrasekaran V, Lin Z, Kraatz S, Ramakrishnan V, Hegde RS. ZNF598 is a quality control sensor of collided ribosomes. Molecular Cell. 2018;72:469–481. doi: 10.1016/j.molcel.2018.08.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Juszkiewicz S, Slodkowicz G, Lin Z, Freire-Pritchett P, Peak-Chew SY, Hegde RS. Ribosome collisions trigger cis-acting feedback inhibition of translation initiation. eLife. 2020;9:e60038. doi: 10.7554/eLife.60038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Khan Z, Ford MJ, Cusanovich DA, Mitrano A, Pritchard JK, Gilad Y. Primate transcript and protein expression levels evolve under compensatory selection pressures. Science. 2013;342:1100–1104. doi: 10.1126/science.1242379. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Komonyi O, Pápai G, Enunlu I, Muratoglu S, Pankotai T, Kopitova D, Maróy P, Udvardy A, Boros I. DTL, the Drosophila homolog of PIMT/Tgs1 nuclear receptor coactivator-interacting protein/RNA methyltransferase, has an essential role in development. The Journal of Biological Chemistry. 2005;280:12397–12404. doi: 10.1074/jbc.M409251200. [DOI] [PubMed] [Google Scholar]
  59. Kozak M. The scanning model for translation: an update. The Journal of Cell Biology. 1989;108:229–241. doi: 10.1083/jcb.108.2.229. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Kronja I, Yuan B, Eichhorn SW, Dzeyk K, Krijgsveld J, Bartel DP, Orr-Weaver TL. Widespread changes in the posttranscriptional landscape at the Drosophila Oocyte-to-Embryo transition. Cell Reports. 2014;7:1495–1508. doi: 10.1016/j.celrep.2014.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Kurihara Y, Makita Y, Kawashima M, Fujita T, Iwasaki S, Matsui M. Transcripts from downstream alternative transcription start sites evade uORF-mediated inhibition of gene expression in Arabidopsis. PNAS. 2018;115:7831–7836. doi: 10.1073/pnas.1804971115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Kusnadi EP, Timpone C, Topisirovic I, Larsson O, Furic L. Regulation of gene expression via translational buffering. Biochimica et Biophysica Acta. Molecular Cell Research. 2022;1869:119140. doi: 10.1016/j.bbamcr.2021.119140. [DOI] [PubMed] [Google Scholar]
  63. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nature Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Lappalainen T, Sammeth M, Friedländer MR, ’t Hoen PAC, Monlong J, Rivas MA, Gonzàlez-Porta M, Kurbatova N, Griebel T, Ferreira PG, Barann M, Wieland T, Greger L, van Iterson M, Almlöf J, Ribeca P, Pulyakhina I, Esser D, Giger T, Tikhonov A, Sultan M, Bertier G, MacArthur DG, Lek M, Lizano E, Buermans HPJ, Padioleau I, Schwarzmayr T, Karlberg O, Ongen H, Kilpinen H, Beltran S, Gut M, Kahlem K, Amstislavskiy V, Stegle O, Pirinen M, Montgomery SB, Donnelly P, McCarthy MI, Flicek P, Strom TM, Geuvadis Consortium. Lehrach H, Schreiber S, Sudbrak R, Carracedo A, Antonarakis SE, Häsler R, Syvänen A-C, van Ommen G-J, Brazma A, Meitinger T, Rosenstiel P, Guigó R, Gut IG, Estivill X, Dermitzakis ET. Transcriptome and genome sequencing uncovers functional variation in humans. Nature. 2013;501:506–511. doi: 10.1038/nature12531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Laurent JM, Vogel C, Kwon T, Craig SA, Boutz DR, Huse HK, Nozue K, Walia H, Whiteley M, Ronald PC, Marcotte EM. Protein abundances are more conserved than mRNA abundances across diverse taxa. Proteomics. 2010;10:4209–4212. doi: 10.1002/pmic.201000327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Lawless C, Pearson RD, Selley JN, Smirnova JB, Grant CM, Ashe MP, Pavitt GD, Hubbard SJ. Upstream sequence elements direct post-transcriptional regulation of gene expression under stress conditions in yeast. BMC Genomics. 2009;10:7. doi: 10.1186/1471-2164-10-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Lee TI, Young RA. Transcriptional regulation and its misregulation in disease. Cell. 2013;152:1237–1251. doi: 10.1016/j.cell.2013.02.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Lee DSM, Park J, Kromer A, Baras A, Rader DJ, Ritchie MD, Ghanem LR, Barash Y. Disrupting upstream translation in mRNAs is associated with human disease. Nature Communications. 2021;12:1515. doi: 10.1038/s41467-021-21812-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Li X, MacArthur S, Bourgon R, Nix D, Pollard DA, Iyer VN, Hechmer A, Simirenko L, Stapleton M, Luengo Hendriks CL, Chu HC, Ogawa N, Inwood W, Sementchenko V, Beaton A, Weiszmann R, Celniker SE, Knowles DW, Gingeras T, Speed TP, Eisen MB, Biggin MD. Transcription factors bind thousands of active and inactive regions in the Drosophila blastoderm. PLOS Biology. 2008;6:e27. doi: 10.1371/journal.pbio.0060027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Lin Y, May GE, Kready H, Nazzaro L, Mao M, Spealman P, Creeger Y, McManus CJ. Impacts of uORF codon identity and position on translation regulation. Nucleic Acids Research. 2019;47:9358–9367. doi: 10.1093/nar/gkz681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Liu L, Dilworth D, Gao L, Monzon J, Summers A, Lassam N, Hogg D. Mutation of the CDKN2A 5’ UTR creates an aberrant initiation codon and predisposes to melanoma. Nature Genetics. 1999;21:128–132. doi: 10.1038/5082. [DOI] [PubMed] [Google Scholar]
  72. Lovett PS, Rogers EJ. Ribosome regulation by the nascent peptide. Microbiological Reviews. 1996;60:366–385. doi: 10.1128/mr.60.2.366-385.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Lu PD, Harding HP, Ron D. Translation reinitiation at alternative open reading frames regulates gene expression in an integrated stress response. The Journal of Cell Biology. 2004;167:27–33. doi: 10.1083/jcb.200408003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Lu G, Zhao Y, Chen Q, Lin P, Tang T, Tang Z, Liufu Z, Wu CI. When development is constantly but weakly perturbed - canalization by microRNAs. bioRxiv. 2021 doi: 10.1101/2021.09.04.458966. [DOI] [PMC free article] [PubMed]
  75. Lu J. UORF_buffer. swh:1:rev:56a30b78bab48b8d050d39d4b9362cdfb9c8267aSoftware Heritage. 2022 https://archive.softwareheritage.org/swh:1:dir:20fd2a2283a82064ba72910b68d54fabde8ab18a;origin=https://github.com/lujlab/uORF_buffer;visit=swh:1:snp:ea4e6051aad01eee62eb90b8dd81642b265653f1;anchor=swh:1:rev:56a30b78bab48b8d050d39d4b9362cdfb9c8267a
  76. Lu GA, Zhang J, Zhao Y, Chen Q, Lin P, Tang T, Tang Z, Wen H, Liufu Z, Wu CI. Canalization of Phenotypes-when the transcriptome is constantly but weakly perturbed. Molecular Biology and Evolution. 2023;40:msad005. doi: 10.1093/molbev/msad005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Lu J. Buffer_eLife2025. swh:1:rev:53fe1fed30b812769d7774c8c60b933798aa4e45Software Heritage. 2025 https://archive.softwareheritage.org/swh:1:dir:27b1cd9a33d18d32e6d333dd8c378c85a80425d4;origin=https://github.com/lujlab/Buffer_eLife2025;visit=swh:1:snp:30d3bdf7ecda280c0b5ceac8cb72bbd9db50a54e;anchor=swh:1:rev:53fe1fed30b812769d7774c8c60b933798aa4e45
  78. Luo Z, Sachs MS. Role of an upstream open reading frame in mediating arginine-specific translational control in Neurospora crassa. Journal of Bacteriology. 1996;178:2172–2177. doi: 10.1128/jb.178.8.2172-2177.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Macneil LT, Walhout AJM. Gene regulatory networks and the role of robustness and stochasticity in the control of gene expression. Genome Research. 2011;21:645–657. doi: 10.1101/gr.097378.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Madern MF, Yang S, Witteveen O, Segeren HA, Bauer M, Tanenbaum ME. Long-term imaging of individual ribosomes reveals ribosome cooperativity in mRNA translation. Cell. 2025;188:1896–1911. doi: 10.1016/j.cell.2025.01.016. [DOI] [PubMed] [Google Scholar]
  81. Malzer E, Szajewska-Skuta M, Dalton LE, Thomas SE, Hu N, Skaer H, Lomas DA, Crowther DC, Marciniak SJ. Coordinate regulation of eIF2α phosphorylation by PPP1R15 and GCN2 is required during Drosophila development. Journal of Cell Science. 2013;126:1406–1415. doi: 10.1242/jcs.117614. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.Journal. 2011;17:10–12. doi: 10.14806/ej.17.1.200. [DOI] [Google Scholar]
  83. McCarthy DJ, Chen Y, Smyth GK. Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation. Nucleic Acids Research. 2012;40:4288–4297. doi: 10.1093/nar/gks042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. McManus CJ, May GE, Spealman P, Shteyman A. Ribosome profiling reveals post-transcriptional buffering of divergent gene expression in yeast. Genome Research. 2014;24:422–430. doi: 10.1101/gr.164996.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Medenbach J, Seiler M, Hentze MW. Translational control via protein-regulated upstream open reading frames. Cell. 2011;145:902–913. doi: 10.1016/j.cell.2011.05.005. [DOI] [PubMed] [Google Scholar]
  86. Meijer HA, Thomas AAM. Ribosomes stalling on uORF1 in the Xenopus Cx41 5’ UTR inhibit downstream translation initiation. Nucleic Acids Research. 2003;31:3174–3184. doi: 10.1093/nar/gkg429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Meydan S, Guydosh NR. Disome and trisome profiling reveal genome-wide targets of ribosome quality control. Molecular Cell. 2020;79:588–602. doi: 10.1016/j.molcel.2020.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Miyake T, Inoue Y, Shao X, Seta T, Aoki Y, Nguyen Pham KT, Shichino Y, Sasaki J, Sasaki T, Ikawa M, Yamaguchi Y, Okamura H, Iwasaki S, Doi M. Minimal upstream open reading frame of Per2 mediates phase fitness of the circadian clock to day/night physiological body temperature rhythm. Cell Reports. 2023;42:112157. doi: 10.1016/j.celrep.2023.112157. [DOI] [PubMed] [Google Scholar]
  89. Morris DR, Geballe AP. Upstream open reading frames as regulators of mRNA translation. Molecular and Cellular Biology. 2000;20:8635–8642. doi: 10.1128/MCB.20.23.8635-8642.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Mueller PP, Hinnebusch AG. Multiple upstream AUG codons mediate translational control of GCN4. Cell. 1986;45:201–207. doi: 10.1016/0092-8674(86)90384-3. [DOI] [PubMed] [Google Scholar]
  91. Narita M, Denk T, Matsuo Y, Sugiyama T, Kikuguchi C, Ito S, Sato N, Suzuki T, Hashimoto S, Machová I, Tesina P, Beckmann R, Inada T. A distinct mammalian disome collision interface harbors K63-linked polyubiquitination of uS10 to trigger hRQT-mediated subunit dissociation. Nature Communications. 2022;13:6411. doi: 10.1038/s41467-022-34097-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Neafsey DE, Galagan JE. Dual modes of natural selection on upstream open reading frames. Molecular Biology and Evolution. 2007;24:1744–1751. doi: 10.1093/molbev/msm093. [DOI] [PubMed] [Google Scholar]
  93. Ni JQ, Zhou R, Czech B, Liu LP, Holderbaum L, Yang-Zhou D, Shim HS, Tao R, Handler D, Karpowicz P, Binari R, Booker M, Brennecke J, Perkins LA, Hannon GJ, Perrimon N. A genome-scale shRNA resource for transgenic RNAi in Drosophila. Nature Methods. 2011;8:405–407. doi: 10.1038/nmeth.1592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Palam LR, Baird TD, Wek RC. Phosphorylation of eIF2 facilitates ribosomal bypass of an inhibitory upstream ORF to enhance CHOP translation. The Journal of Biological Chemistry. 2011;286:10939–10949. doi: 10.1074/jbc.M110.216093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Panda AC, Abdelmohsen K, Martindale JL, Di Germanio C, Yang X, Grammatikakis I, Noh JH, Zhang Y, Lehrmann E, Dudekula DB, De S, Becker KG, White EJ, Wilson GM, de Cabo R, Gorospe M. Novel RNA-binding activity of MYF5 enhances Ccnd1/Cyclin D1 mRNA translation during myogenesis. Nucleic Acids Research. 2016;44:2393–2408. doi: 10.1093/nar/gkw023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Panda AC, Martindale JL, Gorospe M. Polysome fractionation to analyze mRNA distribution profiles. Bio-Protocol. 2017;7:e2126. doi: 10.21769/BioProtoc.2126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Payne JL, Wagner A. Mechanisms of mutational robustness in transcriptional regulation. Frontiers in Genetics. 2015;6:322. doi: 10.3389/fgene.2015.00322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Qin X, Ahn S, Speed TP, Rubin GM. Global analyses of mRNA translational control during early Drosophila embryogenesis. Genome Biology. 2007;8:R63. doi: 10.1186/gb-2007-8-4-r63. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Resch AM, Ogurtsov AY, Rogozin IB, Shabalina SA, Koonin EV. Evolution of alternative and constitutive regions of mammalian 5’UTRs. BMC Genomics. 2009;10:162. doi: 10.1186/1471-2164-10-162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Reuveni S, Meilijson I, Kupiec M, Ruppin E, Tuller T. Genome-scale analysis of translation elongation with a ribosome flow model. PLOS Computational Biology. 2011;7:e1002127. doi: 10.1371/journal.pcbi.1002127. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Rosenbloom KR, Armstrong J, Barber GP, Casper J, Clawson H, Diekhans M, Dreszer TR, Fujita PA, Guruvadoo L, Haeussler M, Harte RA, Heitner S, Hickey G, Hinrichs AS, Hubley R, Karolchik D, Learned K, Lee BT, Li CH, Miga KH, Nguyen N, Paten B, Raney BJ, Smit AFA, Speir ML, Zweig AS, Haussler D, Kuhn RM, Kent WJ. The UCSC genome browser database: 2015 update. Nucleic Acids Research. 2015;43:D670–D681. doi: 10.1093/nar/gku1177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Schrimpf SP, Weiss M, Reiter L, Ahrens CH, Jovanovic M, Malmström J, Brunner E, Mohanty S, Lercher MJ, Hunziker PE, Aebersold R, von Mering C, Hengartner MO. Comparative functional analysis of the Caenorhabditis elegans and Drosophila melanogaster proteomes. PLOS Biology. 2009;7:e48. doi: 10.1371/journal.pbio.1000048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Sherman MY, Qian SB. Less is more: improving proteostasis by translation slow down. Trends in Biochemical Sciences. 2013;38:585–591. doi: 10.1016/j.tibs.2013.09.003. [DOI] [PubMed] [Google Scholar]
  104. Si X, Zhang H, Wang Y, Chen K, Gao C. Manipulating gene translation in plants by CRISPR-Cas9-mediated genome editing of upstream open reading frames. Nature Protocols. 2020;15:338–363. doi: 10.1038/s41596-019-0238-3. [DOI] [PubMed] [Google Scholar]
  105. Signor SA, Nuzhdin SV. The evolution of gene expression in cis and trans. Trends in Genetics. 2018;34:532–544. doi: 10.1016/j.tig.2018.03.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Simms CL, Yan LL, Qiu JK, Zaher HS. Ribosome collisions result in +1 frameshifting in the absence of No-Go Decay. Cell Reports. 2019;28:1679–1689. doi: 10.1016/j.celrep.2019.07.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Singh AP, Wu P, Ryabichko S, Raimundo J, Swan M, Wieschaus E, Gregor T, Toettcher JE. Optogenetic control of the Bicoid morphogen reveals fast and slow modes of gap gene regulation. Cell Reports. 2022;38:110543. doi: 10.1016/j.celrep.2022.110543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Somogyvári M, Khatatneh S, Sőti C. Hsp90: from cellular to organismal proteostasis. Cells. 2022;11:2479. doi: 10.3390/cells11162479. [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. Sonenberg N, Hinnebusch AG. Regulation of translation initiation in eukaryotes: mechanisms and biological targets. Cell. 2009;136:731–745. doi: 10.1016/j.cell.2009.01.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Stein KC, Frydman J. The stop-and-go traffic regulating protein biogenesis: how translation kinetics controls proteostasis. The Journal of Biological Chemistry. 2019;294:2076–2084. doi: 10.1074/jbc.REV118.002814. [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Struhl G, Struhl K, Macdonald PM. The gradient morphogen bicoid is a concentration-dependent transcriptional activator. Cell. 1989;57:1259–1273. doi: 10.1016/0092-8674(89)90062-7. [DOI] [PubMed] [Google Scholar]
  112. Sundaramoorthy E, Leonard M, Mak R, Liao J, Fulzele A, Bennett EJ. ZNF598 and RACK1 regulate mammalian ribosome-associated quality control function by mediating regulatory 40S Ribosomal Ubiquitylation. Molecular Cell. 2017;65:751–760. doi: 10.1016/j.molcel.2016.12.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  113. Tamura K, Subramanian S, Kumar S. Temporal patterns of fruit fly (Drosophila) evolution revealed by mutation clocks. Molecular Biology and Evolution. 2004;21:36–44. doi: 10.1093/molbev/msg236. [DOI] [PubMed] [Google Scholar]
  114. Teixeira FK, Lehmann R. Translational control during developmental transitions. Cold Spring Harbor Perspectives in Biology. 2019;11:a032987. doi: 10.1101/cshperspect.a032987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  115. Thattai M, van Oudenaarden A. Intrinsic noise in gene regulatory networks. PNAS. 2001;98:8614–8619. doi: 10.1073/pnas.151588598. [DOI] [PMC free article] [PubMed] [Google Scholar]
  116. Ugajin N, Imami K, Takada H, Ishihama Y, Chiba S, Mishima Y. Znf598-mediated Rps10/eS10 ubiquitination contributes to the ribosome ubiquitination dynamics during zebrafish development. RNA. 2023;29:1910–1927. doi: 10.1261/rna.079633.123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Vasudevan D, Neuman SD, Yang A, Lough L, Brown B, Bashirullah A, Cardozo T, Ryoo HD. Translational induction of ATF4 during integrated stress response requires noncanonical initiation factors eIF2D and DENR. Nature Communications. 2020;11:4677. doi: 10.1038/s41467-020-18453-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  118. Vattem KM, Wek RC. Reinitiation involving upstream ORFs regulates ATF4 mRNA translation in mammalian cells. PNAS. 2004;101:11269–11274. doi: 10.1073/pnas.0400541101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  119. Vilela C, McCarthy JEG. Regulation of fungal gene expression via short open reading frames in the mRNA 5’untranslated region. Molecular Microbiology. 2003;49:859–867. doi: 10.1046/j.1365-2958.2003.03622.x. [DOI] [PubMed] [Google Scholar]
  120. Vogel C. Evolution. Protein expression under pressure. Science. 2013;342:1052–1053. doi: 10.1126/science.1247833. [DOI] [PubMed] [Google Scholar]
  121. von der Haar T. Mathematical and computational modelling of ribosomal movement and protein synthesis: an overview. Computational and Structural Biotechnology Journal. 2012;1:e201204002. doi: 10.5936/csbj.201204002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  122. Waddington CH. Canalization of development and the inheritance of acquired characters. Nature. 1942;150:563–565. doi: 10.1038/150563a0. [DOI] [PubMed] [Google Scholar]
  123. Wang Z, Sun X, Zhao Y, Guo X, Jiang H, Li H, Gu Z. Evolution of gene regulation during transcription and translation. Genome Biology and Evolution. 2015;7:1155–1167. doi: 10.1093/gbe/evv059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  124. Wang SH, Hsiao CJ, Khan Z, Pritchard JK. Post-translational buffering leads to convergent protein expression levels between primates. Genome Biology. 2018;19:83. doi: 10.1186/s13059-018-1451-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  125. Wang Z-Y, Leushkin E, Liechti A, Ovchinnikova S, Mößinger K, Brüning T, Rummel C, Grützner F, Cardoso-Moreira M, Janich P, Gatfield D, Diagouraga B, de Massy B, Gill ME, Peters AHFM, Anders S, Kaessmann H. Transcriptome and translatome co-evolution in mammals. Nature. 2020;588:642–647. doi: 10.1038/s41586-020-2899-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  126. Wei J, Wu C, Sachs MS. The arginine attenuator peptide interferes with the ribosome peptidyl transferase center. Molecular and Cellular Biology. 2012;32:2396–2406. doi: 10.1128/MCB.00136-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  127. Wen Y, Liu Y, Xu Y, Zhao Y, Hua R, Wang K, Sun M, Li Y, Yang S, Zhang X-J, Kruse R, Cichon S, Betz RC, Nöthen MM, van Steensel MAM, van Geel M, Steijlen PM, Hohl D, Huber M, Dunnill GS, Kennedy C, Messenger A, Munro CS, Terrinoni A, Hovnanian A, Bodemer C, de Prost Y, Paller AS, Irvine AD, Sinclair R, Green J, Shang D, Liu Q, Luo Y, Jiang L, Chen H-D, Lo WH-Y, McLean WHI, He C-D, Zhang X. Loss-of-function mutations of an inhibitory upstream ORF in the human hairless transcript cause Marie Unna hereditary hypotrichosis. Nature Genetics. 2009;41:228–233. doi: 10.1038/ng.276. [DOI] [PubMed] [Google Scholar]
  128. Wethmar K, Bégay V, Smink JJ, Zaragoza K, Wiesenthal V, Dörken B, Calkhoven CF, Leutz A. C/EBPbetaDeltauORF mice--a genetic model for uORF-mediated translational control in mammals. Genes & Development. 2010;24:15–20. doi: 10.1101/gad.557910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  129. Wiestner A, Schlemper RJ, van der Maas AP, Skoda RC. An activating splice donor mutation in the thrombopoietin gene causes hereditary thrombocythaemia. Nature Genetics. 1998;18:49–52. doi: 10.1038/ng0198-49. [DOI] [PubMed] [Google Scholar]
  130. Wu H-W, Fajiculay E, Wu J-F, Yan C-CS, Hsu C-P, Wu S-H. Noise reduction by upstream open reading frames. Nature Plants. 2022;8:474–480. doi: 10.1038/s41477-022-01136-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  131. Xiao Z, Zou Q, Liu Y, Yang X. Genome-wide assessment of differential translations with ribosome profiling data. Nature Communications. 2016;7:11194. doi: 10.1038/ncomms11194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  132. Xing S, Chen K, Zhu H, Zhang R, Zhang H, Li B, Gao C. Fine-tuning sugar content in strawberry. Genome Biology. 2020;21:230. doi: 10.1186/s13059-020-02146-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  133. Xue C, Qiu F, Wang Y, Li B, Zhao KT, Chen K, Gao C. Tuning plant phenotypes by precise, graded downregulation of gene expression. Nature Biotechnology. 2023;41:1758–1764. doi: 10.1038/s41587-023-01707-w. [DOI] [PubMed] [Google Scholar]
  134. Yang YF, Zhang X, Ma X, Zhao T, Sun Q, Huan Q, Wu S, Du Z, Qian W. Trans-splicing enhances translational efficiency in C. elegans. Genome Research. 2017;27:1525–1535. doi: 10.1101/gr.202150.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  135. Young SK, Wek RC. Upstream open reading frames differentially regulate gene-specific translation in the integrated stress response. The Journal of Biological Chemistry. 2016;291:16927–16935. doi: 10.1074/jbc.R116.733899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  136. Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  137. Zabinsky RA, Mason GA, Queitsch C, Jarosz DF. It’s not magic - Hsp90 and its effects on genetic and epigenetic variation. Seminars in Cell & Developmental Biology. 2019;88:21–35. doi: 10.1016/j.semcdb.2018.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  138. Zhang H, Dou S, He F, Luo J, Wei L, Lu J. Genome-wide maps of ribosomal occupancy provide insights into adaptive evolution and regulatory roles of uORFs during Drosophila development. PLOS Biology. 2018a;16:e2003903. doi: 10.1371/journal.pbio.2003903. [DOI] [PMC free article] [PubMed] [Google Scholar]
  139. Zhang H, Si X, Ji X, Fan R, Liu J, Chen K, Wang D, Gao C. Genome editing of upstream open reading frames enables translational control in plants. Nature Biotechnology. 2018b;36:894–898. doi: 10.1038/nbt.4202. [DOI] [PubMed] [Google Scholar]
  140. Zhang H, Wang Y, Lu J. Function and evolution of upstream ORFs in Eukaryotes. Trends in Biochemical Sciences. 2019;44:782–794. doi: 10.1016/j.tibs.2019.03.002. [DOI] [PubMed] [Google Scholar]
  141. Zhang H, Wang Y, Wu X, Tang X, Wu C, Lu J. Determinants of genome-wide distribution and evolution of uORFs in eukaryotes. Nature Communications. 2021;12:1076. doi: 10.1038/s41467-021-21394-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  142. Zhao YB, Krishnan J. mRNA translation and protein synthesis: an analysis of different modelling methodologies and a new PBN based approach. BMC Systems Biology. 2014;8:25. doi: 10.1186/1752-0509-8-25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  143. Zhao T, Chen YM, Li Y, Wang J, Chen S, Gao N, Qian W. Disome-seq reveals widespread ribosome collisions that promote cotranslational protein folding. Genome Biology. 2021;22:16. doi: 10.1186/s13059-020-02256-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

eLife Assessment

Wenfeng Qian 1

This study reveals the important role of upstream open reading frames (uORFs) in limiting the translational variability of downstream coding sequences. Through a combination of computational simulations, comparative analyses of translation efficiency across different developmental stages in two closely related Drosophila species, and manipulative, experimental validation of translation buffering by an uORF for a gene, the authors provide convincing evidence supporting their conclusions. This work will be of broad interest to molecular biologists and geneticists.

Reviewer #1 (Public review):

Anonymous

Summary:

The authors set out to explore the role of upstream open reading frames (uORFs) in stabilizing protein levels during Drosophila development and evolution. By utilizing a modified ICIER model for ribosome translation simulations and conducting experimental validations in Drosophila species, the study investigates how uORFs buffer translational variability of downstream coding sequences. The findings reveal that uORFs significantly reduce translational variability, which contributes to gene expression stability across different biological contexts and evolutionary timeframes.

Strengths:

(1) The study introduces a sophisticated adaptation of the ICIER model, enabling detailed simulation of ribosomal traffic and its implications for translation efficiency.

(2) The integration of computational predictions with empirical data through knockout experiments and translatome analysis in Drosophila provides a compelling validation of the model's predictions.

(3) By demonstrating the evolutionary conservation of uORFs' buffering effects, the study provides insights that are likely applicable to a wide range of eukaryotes.

Weaknesses:

(1) Although the study is technically sound, it does not clearly articulate the mechanisms through which uORFs buffer translational variability. A clearer hypothesis detailing the potential molecular interactions or regulatory pathways by which uORFs influence translational stability would enhance the comprehension and impact of the findings.

(2) The study could be further improved by a discussion regarding the evolutionary selection of uORFs. Specifically, it would be beneficial to explore whether uORFs are favored evolutionarily primarily for their role in reducing translation efficiency or for their capability to stabilize translation variability. Such a discussion would provide deeper insights into the evolutionary dynamics and functional significance of uORFs in genetic regulation.

Comments on revisions:

The authors have adequately addressed my previous concerns.

Reviewer #2 (Public review):

Anonymous

uORFs, short open reading frames located in the 5' UTR, are pervasive in genomes. However, their roles in maintaining protein abundance are not clear. In this study, the authors propose that uORFs act as "molecular dam", limiting the fluctuation of the translation of downstream coding sequences. First, they performed in silico simulations using an improved ICIER model, and demonstrated that uORF translation reduces CDS translational variability, with buffering capacity increasing in proportion to uORF efficiency, length, and number. Next, they analysed the translatome between two related Drosophila species, revealing that genes with uORFs exhibit smaller fluctuations in translation between the two species and across different developmental stages within the same species. Moreover, they identified that bicoid, a critical gene for Drosophila development, contains a uORF with substantial changes in translation efficiency. Deleting this uORF in Drosophila melanogaster significantly affected its gene expression, hatching rates, and survival under stress conditions. Lastly, by leveraging public Ribo-seq data, the authors showed that the buffering effect of uORFs is also evident between primates and within human populations. Collectively, the study significantly advances our understanding of how uORFs regulate the translation of downstream coding sequences at the genome-wide scale, as well as during development and evolution. It would be particularly interesting to explore whether similar buffering functions are conserved in other organisms, and whether their regulatory effects could be harnessed for practical applications, such as improving crop traits or benefiting human health.

Comments on revisions:

The authors have fully addressed all of my concerns, and the revisions have substantially improved the manuscript. I have no further comments.

eLife. 2025 Jun 6;14:RP104074. doi: 10.7554/eLife.104074.3.sa3

Author response

Yuanqiang Sun 1, Yuange Duan 2, Peixiang Gao 3, Chenlu Liu 4, Kaichun Jin 5, Shengqian Dou 6, Wenxiong Tang 7, Hong Zhang 8, Jian Lu 9

The following is the authors’ response to the original reviews

Public Reviews:

Reviewer #1 (Public review):

Summary:

The authors set out to explore the role of upstream open reading frames (uORFs) in stabilizing protein levels during Drosophila development and evolution. By utilizing a modified ICIER model for ribosome translation simulations and conducting experimental validations in Drosophila species, the study investigates how uORFs buffer translational variability of downstream coding sequences. The findings reveal that uORFs significantly reduce translational variability, which contributes to gene expression stability across different biological contexts and evolutionary timeframes.

We thank the reviewer for carefully reading our manuscript and providing thoughtful and constructive feedback. We believe the manuscript has been significantly improved by incorporating your suggestions. Please find our detailed responses and corresponding revisions below.

Strengths:

(1) The study introduces a sophisticated adaptation of the ICIER model, enabling detailed simulation of ribosomal traffic and its implications for translation efficiency.

(2) The integration of computational predictions with empirical data through knockout experiments and translatome analysis in Drosophila provides a compelling validation of the model's predictions.

(3) By demonstrating the evolutionary conservation of uORFs' buffering effects, the study provides insights that are likely applicable to a wide range of eukaryotes.

We appreciate your positive feedback and thoughtful summary of the strengths of our study.

Weaknesses:

(1) Although the study is technically sound, it does not clearly articulate the mechanisms through which uORFs buffer translational variability. A clearer hypothesis detailing the potential molecular interactions or regulatory pathways by which uORFs influence translational stability would enhance the comprehension and impact of the findings.

Thanks for your constructive comments. In the Discussion section of our previous submission (Original Lines 470-489), we proposed that uORFs function as “molecular dams” to smooth out fluctuations in ribosomal flow toward downstream CDS regions, primarily via mechanisms involving ribosome collision and dissociation. To further address your concern, we have expanded the Discussion and included a new model figure (Fig. 9) to more clearly articulate the potential biological and mechanistic basis by which translating 80S ribosomes may induce the dissociation of 40S ribosomes. The revised section (Lines 540–557) now reads:

“Ribosome slowdown or stalling on mRNA due to rare codons [56,96-98] or nascent blocking peptides [99-102] frequently triggers ribosome collisions genome-wide [103-105]. Such collisions, especially among elongating 80S ribosomes, often activate ribosome quality control (RQC) pathways that recognize collision interfaces on the 40S subunit, leading to ribosomal subunit dissociation and degradation [106-108]. In mammals, ZNF598 specifically identifies collided ribosomes to initiate ubiquitin-dependent protein and mRNA quality control pathways [109-113]. Analogously, yeast employs Hel2-mediated ubiquitination of uS10, initiating dissociation via the RQC-trigger complex (RQT) [114]. Furthermore, the human RQT (hRQT) complex recognizes ubiquitinated ribosomes and induces subunit dissociation similarly to yeast RQT [115]. However, transient ribosome collisions can evade RQC by promoting resumed elongation through mechanical force provided by trailing ribosomes, thereby mitigating stalling [116]. Beyond 80S collisions, evidence increasingly highlights a distinct collision type involving scanning 40S subunits or pre-initiation (43S) complexes. Recently, an initiation RQC pathway (iRQC) targeting the small ribosomal subunit (40S) has been described, particularly involving collisions between scanning 43S complexes or between stalled 43S and elongating 80S ribosomes (Figure 9B) [117,118]. During iRQC, E3 ubiquitin ligase RNF10 ubiquitinates uS3 and uS5 proteins, resulting in 40S degradation [118]. This mechanism aligns closely with our ICIER model, proposing collision-driven 43S dissociation in the 5' UTRs. Future studies exploring these mechanisms in greater detail will clarify how uORFs modulate translational regulation through buffering effects.”

(2) The study could be further improved by a discussion regarding the evolutionary selection of uORFs. Specifically, it would be beneficial to explore whether uORFs are favored evolutionarily primarily for their role in reducing translation efficiency or for their capability to stabilize translation variability. Such a discussion would provide deeper insights into the evolutionary dynamics and functional significance of uORFs in genetic regulation.

Thank you for this insightful suggestion. We agree that understanding whether uORFs are evolutionarily favored for their role in translational repression or for their capacity to buffer translational variability is a compelling and unresolved question. Our study suggests that translational buffering, rather than translational repression alone, can also drive evolutionary selection favoring uORFs, although it remains challenging to empirically disentangle these functions due to their inherent linkage. We have expanded the discussion in the revised manuscript to address this point in more detail (Lines 494-513), which is reproduced as follows:

“Previous studies have shown that a significant fraction of fixed uORFs in the populations of D. melanogaster and humans were driven by positive Darwinian selection 63,67, suggesting active maintenance through adaptive evolution rather than purely neutral or deleterious processes. While uORFs have traditionally been recognized for their capacity to attenuate translation of downstream CDSs, accumulating evidence now underscores their critical role in stabilizing gene expression under fluctuating cellular and environmental conditions [43,55,56]. Whether the favored evolutionary selection of uORFs acts primarily through their role in translational repression or translational buffering remains a compelling yet unresolved question, as these two functions are inherently linked. Indeed, highly conserved uORFs tend to be translated at higher levels, resulting not only in stronger inhibition of CDS translation [34,45,67] but also in a more pronounced buffering effect, as demonstrated in this study. This buffering capacity of uORFs potentially provides selective advantages by reducing fluctuations in protein synthesis, thus minimizing gene-expression noise and enhancing cellular homeostasis. This suggests that selection may favor uORFs that contribute to translational robustness, a hypothesis supported by findings in yeast and mammals showing that uORFs are significantly enriched in stressresponse genes and control the translation of certain master regulators of stress responses [41,42,94,95]. Our study suggests that translational buffering, rather than translational repression alone, can also drive evolutionary selection favoring uORFs, although it remains challenging to empirically disentangle these functions. Future comparative genomic analyses, coupled with experimental approaches such as ribosome profiling and functional mutagenesis, will be crucial in elucidating the precise evolutionary forces driving uORF conservation and adaptation.”

Reviewer #2 (Public review):

uORFs, short open reading frames located in the 5' UTR, are pervasive in genomes. However, their roles in maintaining protein abundance are not clear. In this study, the authors propose that uORFs act as "molecular dam", limiting the fluctuation of the translation of downstream coding sequences. First, they performed in silico simulations using an improved ICIER model, and demonstrated that uORF translation reduces CDS translational variability, with buffering capacity increasing in proportion to uORF efficiency, length, and number. Next, they analzed the translatome between two related Drosophila species, revealing that genes with uORFs exhibit smaller fluctuations in translation between the two species and across different developmental stages within the same specify. Moreover, they identified that bicoid, a critical gene for Drosophila development, contains a uORF with substantial changes in translation efficiency. Deleting this uORF in Drosophila melanogaster significantly affected its gene expression, hatching rates, and survival under stress condition. Lastly, by leveraging public Ribo-seq data, the authors showed that the buffering effect of uORFs is also evident between primates and within human populations. Collectively, the study advances our understanding of how uORFs regulate the translation of downstream coding sequences at the genome-wide scale, as well as during development and evolution.

The conclusions of this paper are mostly well supported by data, but some definitions and data analysis need to be clarified and extended.

We thank the reviewer for the thoughtful and constructive review. Your summary accurately captures the key findings of our study. We have carefully addressed all your concerns in the revised manuscript, and we believe it has been significantly improved based on your valuable input.

(1) There are two definitions of translation efficiency (TE) in the manuscript: one refers to the number of 80S ribosomes that complete translation at the stop codon of a CDS within a given time interval, while the other is calculated based on Ribo-seq and mRNA-seq data (as described on Page 7, line 209). To avoid potential misunderstandings, please use distinct terms to differentiate these two definitions.

Thank you for highlighting this important point, and we apologize for the confusion. The two definitions of translation efficiency (TE) in our manuscript arise from methodological differences between simulation and experimental analyses. To clarify, in the revised manuscript, we use “translation rate” in the context of simulations to describe the number of 80S ribosomes completing translation at the CDS stop codon per unit time. We retain the conventional “translation efficiency (TE)” for Ribo-seq–based measurements.

In this revised manuscript, we have added a more detailed explanation of TE in the revised manuscript (Lines 202–206), which now reads:

“For each sample, we followed established procedures [62-66] to calculate the translational efficiency (TE) for each feature (CDS or uORF). TE serves as a proxy for the translation rate at which ribosomes translate mRNA into proteins, typically quantified by comparing the density of ribosome-protected mRNA fragment (RPF) to the mRNA abundance for that feature (see Materials and Methods).”

(2) Page 7, line 209: "The translational efficiencies (TEs) of the conserved uORFs were highly correlated between the two species across all developmental stages and tissues examined, with Spearman correlation coefficients ranging from 0.478 to 0.573 (Fig. 2A)." However, the authors did not analyze the correlation of translation efficiency of conserved CDSs between the two species, and compare this correlation to the correlation between the TEs of CDSs. These analyzes will further support the authors conclusion regarding the role of conserved uORFs in translation regulation.

In the revised manuscript, we have incorporated a comparison of translational efficiency (TE) correlations for conserved CDSs between the two species. We found that CDSs exhibit significantly higher interspecific TE correlations than uORFs, with Spearman’s rho ranging from 0.588 to 0.806. This suggests that uORFs tend to show greater variability in TE than CDSs, consistent with our model in which uORFs buffer fluctuations in downstream CDS translation. The updated results were included in the revised manuscript (Lines 223-227) as follows:

“In contrast, TE of CDSs exhibited a significantly higher correlation between the two species in the corresponding samples compared to that of uORFs, with Spearman’s rho ranging from 0.588 to 0.806 (P = 0.002, Wilcoxon signed-rank test; Figure 2A). This observation is consistent with our simulation results, which indicate that uORFs experience greater translational fluctuations than their downstream CDSs.”

(3) Page 8, line 217: "Among genes with multiple uORFs, one uORF generally emerged as dominant, displaying a higher TE than the others within the same gene (Fig. 2C)." The basis for determining dominance among uORFs is not explained and this lack of clarification undermines the interpretation of these findings.

Thank you for pointing this out. We apologize for the confusion. In our study, a “dominant” uORF is defined as the one with the highest translation efficiency (TE) among all uORFs within the same gene. This designation is based solely on TE, which we consider a key metric for uORF activity, as it directly reflects translational output and potential regulatory impact. We have revised the manuscript to clarify this definition (Lines 232–244), now stating:

“Among genes with multiple uORFs, we defined the uORF with the highest TE as the dominant uORF for that gene, as TE is one of the most relevant metrics for assessing uORF function 45,67…… These results suggest that genes with multiple uORFs tend to retain the same dominant uORF across developmental stages, indicating that the dominant uORFs may serve as the key translational regulator of the downstream CDS.

(4) According to the simulation, the translation of uORFs should exhibit greater variability than that of CDSs. However, the authors observed significantly fewer uORFs with significant TE changes compared to CDSs. This discrepancy may be due to lower sequencing depth resulting in fewer reads mapped to uORFs. Therefore, the authors may compare this variability specifically among highly expressed genes.

Thank you for this thoughtful observation. We agree that the lower proportion of uORFs showing significant TE changes compared to CDSs, as reported in Table 1, appears inconsistent with our conclusion that uORFs exhibit greater translational variability. However, this discrepancy is largely attributable to differences in sequencing depth and feature length—uORFs are generally much shorter and more weakly expressed than CDSs, resulting in fewer mapped reads and reduced statistical power (Figure S18A).

To address this issue, we first followed your suggestion and restricted our analysis to genes with both mRNA and RPF RPKM values above the 50th percentile in D. melanogaster and D. simulans. While this filtering increased the total proportion of features with significant TE changes (due to improved read coverage), the proportion of significant uORFs still remained lower than that of CDSs (Table R1). This suggests that even among highly expressed genes, the disparity in read counts between uORFs and CDSs persists (Figure S18B), and thus the issue is not fully resolved.

To better capture biological relevance, we compared the absolute values of log2(TE changes) between D. melanogaster and D. simulans for uORFs and their corresponding CDSs. Across all samples, uORFs consistently exhibit larger TE shifts than their downstream CDSs, supporting our model that uORFs act as translational buffers (Figure 3B).

We have made relevant changes to report the new analysis in this revised manuscript. Specifically, in our original submission, we stated this observation with the sentence “The smaller number of uORFs showing significant TE changes compared to CDSs between D. melanogaster and D. simulans likely reflects their shorter length and reduced statistical power, rather than indicating that uORFs are less variable in translation than CDSs.” To make this point clearer, in the revised version (Lines 275-284), we rephrased this sentence which read as follows:

“Note that due to their shorter length and generally lower TE, uORFs had considerably lower read counts than CDSs, limiting the statistical power to detect significant interspecific TE differences for uORFs. This trend consistently holds whether analyzing all expressed uORFs (Figure S18A) or only highly expressed genes (Figure S18B). Thus, the fewer uORFs showing significant TE divergence likely reflects lower read counts and statistical sensitivity rather than reduced translational variability relative to CDSs. In fact, the absolute values of log2(fold change) of TE for uORFs between D. melanogaster and D. simulans were significantly greater than those observed for corresponding CDSs across all samples (P < 0.001, Wilcoxon signed-rank test; Figure 3B), suggesting that the magnitude of

TE changes in CDSs is generally smaller than that in uORFs, due to the buffering effect of uORF.”

Author response table 1. Proportion of uORFs and CDSs with significant TE changes before and after selecting HEGs.

Original Highly expressed genes
Sample types # of expressed uORFs beta_("III ")!=1 (%) # of expressed CDSs b_(sigma)!=1 (%) # of expressed uORFs beta_("III ")!=1 (%) # of expresse d CDSs b_(c)!=1 (%)
0-2 h embryo 7,704 1,193 (15.49) 7,934 4,189 (52.80) 3,531 703 (19.90) 2760 1,393 (50.47)
2-6h embryo 7,822 567 (7.25) 9,795 3,063 (31.27) 4,666 456 (9.77) 3910 1,926 (49.26)
6-12h embryo 10,400 1,040 (10.00) 10,643 2,924 (27.47) 5,718 866 (15.15) 4271 1,544 (36.15)
12-24h embryo 11,365 535 (4.71) 11,158 3,537 (31.70) 5,454 423 (7.76) 4304 1,852 (43.03)
Larva 10,008 464 (4.64) 11,831 3,554 (30.04) 5,247 377 (7.18) 4546 2,149 (47.27)
Pupa 12,309 635 (5.16) 12,209 4,087 (33.48) 6,028 480 (7.96) 4487 2,196 (48.94)
Male body 10,894 197 (1.81) 12,284 2,432 (19.80) 4,422 135 (3.05) 4461 1,390 (31.16)
Male head 10,904 144 (1.32) 10,447 1,151 (11.02) 5,644 123 (2.18) 3981 605 (15.20)
Female body 9,809 340 (3.47) 11,002 3,605 (32.77) 4,231 255 (6.03) 4064 2,047 (50.37)
Female head 10,935 332 (3.04) 10,545 1,270 (12.04) 5,510 269 (4.88) 4019 618 (15.38)

(5) If possible, the author may need to use antibodies against bicoid to test the effect of ATG deletion on bicoid expression, particularly under different developmental stages or growth conditions.

According to the authors' conclusions, the deletion mutant should exhibit greater variability in bicoid protein abundance. This experiment could provide strong support for the proposed mechanisms.

Thank you for this excellent suggestion. We fully agree that testing Bcd protein levels across developmental stages or stress conditions using antibodies would be a strong validation of our model, which predicts greater variability in Bcd protein abundance upon uORF deletion.

In fact, we attempted such experiments in both wild-type and mutant backgrounds. However, we encountered substantial difficulties in obtaining a reliable anti-Bcd antibody. Some Bcd antibodies referenced in the published literature were homemade and often shared among research groups as gifts [1-3] and some commercially available antibodies cited in previous studies are no longer supplied by vendors [4-6]. We managed to obtain a custom-made antibody from Professor Feng Liu, but unfortunately, it produced inconsistent and unsatisfactory results. Despite considerable effort—including during the COVID-19 pandemic—we were unable to identify a reagent suitable for robust and reproducible detection of Bcd protein.

As an alternative, we used sucrose gradient fractionation followed by qPCR to directly measure the translation efficiency of bicoid in vivo. We believe this approach offers a clear and quantitative readout of translational activity, and it avoids potential confounding from protein degradation, which may vary across conditions and developmental stages. Nonetheless, we recognize the value of antibody-based validation and will pursue this direction in future work if reliable antibodies become available. We have added this limitation to the revised Discussion section (Lines 563–568) as follows:

“We demonstrated that the bcd uORF represses CDS translation using sucrose gradient fractionation followed by qPCR—an approach that directly measures translation efficiency while minimizing confounding from RNA/protein degradation. However, detecting Bcd protein levels with antibodies across developmental stages or conditions in the mutants and wild-type controls would provide an even stronger validation of our model and should be explored in future studies.”

Recommendations for the authors:

Reviewer #1 (Recommendations for the authors):

(1) The authors should provide a more detailed explanation for the modifications made to the ICIER model. Specifically, an explanation of the biological or mechanistic rationale behind the ability of the 80S ribosome to cause upstream 40S ribosomes to dissociate from mRNA would help clarify this aspect of the model.

Thank you for this suggestion. In the original submission, we described our modifications to the ICIER model in the section titled “An extended ICIER model for quantifying uORF buffering in CDS translation” (Lines 88-124 of the revised manuscript).

To further clarify the biological rationale behind this mechanism, we have now included a conceptual model figure (Figure 9) illustrating mechanistically how uORF translation can buffer downstream translation within a single mRNA molecule. Additionally, we expanded the Discussion to summarize the current understanding of how collisions between translating 80S ribosomes and scanning 40S subunits may lead to dissociation, referencing known initial ribosome quality control (iRQC) pathways. These revisions provide a clearer mechanistic framework for interpreting the buffering effects modeled in our simulations. The relevant part is reproduced from Discussion (Lines 540-557) which reads as follows:

“Ribosome slowdown or stalling on mRNA due to rare codons [56,96-98] or nascent blocking peptides [99-102] frequently triggers ribosome collisions genome-wide [103-105]. Such collisions, especially among elongating 80S ribosomes, often activate ribosome quality control (RQC) pathways that recognize collision interfaces on the 40S subunit, leading to ribosomal subunit dissociation and degradation [106-108]. In mammals, ZNF598 specifically identifies collided ribosomes to initiate ubiquitin-dependent protein and mRNA quality control pathways [109-113]. Analogously, yeast employs Hel2-mediated ubiquitination of uS10, initiating dissociation via the RQC-trigger complex (RQT) [114]. Furthermore, the human RQT (hRQT) complex recognizes ubiquitinated ribosomes and induces subunit dissociation similarly to yeast RQT [115]. However, transient ribosome collisions can evade RQC by promoting resumed elongation through mechanical force provided by trailing ribosomes, thereby mitigating stalling [116]. Beyond 80S collisions, evidence increasingly highlights a distinct collision type involving scanning 40S subunits or pre-initiation (43S) complexes. Recently, an initiation RQC pathway (iRQC) targeting the small ribosomal subunit (40S) has been described, particularly involving collisions between scanning 43S complexes or between stalled 43S and elongating 80S ribosomes (Figure 9B) [117,118]. During iRQC, E3 ubiquitin ligase RNF10 ubiquitinates uS3 and uS5 proteins, resulting in 40S degradation [118]. This mechanism aligns closely with our ICIER model, proposing collision-driven 43S dissociation in the 5' UTRs. Future studies exploring these mechanisms in greater detail will clarify how uORFs modulate translational regulation through buffering effects.”

(2) The figure legend references Figure 5C; however, this figure appears to be missing from the document.

We apologize for the oversight. The missing panel previously referred to as Figure 5C has now been incorporated into the revised Figure 6A. The figure and its corresponding legend have been corrected accordingly in the updated manuscript.

Reviewer #2 (Recommendations for the authors):

This is an important study that enhances our understanding of the roles of uORFs in translational regulation. In addition to the suggestions provided in the public review, the following minor points should be addressed before publication in eLife:

(1) Page 7, line 207: "We identified 18,412 canonical uORFs shared between the two species (referred to as conserved uORFs hereafter)." The term "canonical uORFs" requires clarification. Does this refer to uORFs with specific sequence features, conservation, or another defining characteristic?

Thank you for pointing this out. We apologize for the lack of clarity. In our study, a canonical uORF is defined as an open reading frame (ORF) that initiates with a canonical AUG start codon located in the 5′ untranslated region (UTR) and terminates with a stop codon (UAA, UAG, or UGA) within the same mRNA. Conservation of uORFs is defined solely based on the presence of AUG start codons at orthologous positions in the 5′ UTR across species, regardless of differences in the stop codon.

To clarify this definition, we have revised the sentence as follows (Lines 213-219): “We focused on canonical uORFs that initiate with an ATG start codon in the 5′ UTR and terminate with a stop codon (TAA, TAG, or TGA). Because the ATG start codon is the defining feature of a canonical uORF and tends to be more conserved than its downstream sequence [67], we defined uORF conservation based on the presence of the ATG start codon in the 5′ UTR of D. melanogaster and its orthologous positions in D. simulans, regardless of differences in the stop codon. Using this criterion, we identified 18,412 canonical uORFs with conserved start codons between the two species.”

(2) Page 8, line 227: "Furthermore, the dominant uORFs showed a higher proportion of conserved uATGs than the other translated uORFs." There appears to be a typographical error. Should "other uATGs" instead read "other uORFs"?

Thank you for pointing this out. As we addressed in response to your previous concern, in this study, we defined uORF conservation primarily based on the presence of their start codon (uATG) both in D. melanogaster and the orthologous sites of D. simulans, as the start codon is the defining feature of a uORF and tends to be more conserved than the remaining sequence, as demonstrated in our previous study [7]. We used the term “conserved uATGs” to reflect this definition and believe it accurately conveys the intended meaning in this context.

(3) Page 8, line 240: "uORFs exhibited a significant positive correlation with the TE of their downstream CDSs in all samples analyzed (P < 0.001, Spearman's correlation)." A Spearman's rho of 0.11 or 0.21 may not practically represent a "significant" positive correlation. Consider rephrasing this as "a positive correlation."

Thank you for the suggestion. We have revised the sentence in the manuscript to read (Lines 257-259): “uORFs exhibited a modest, yet statistically significant, positive correlation with the TE of their downstream CDSs across all samples analyzed (P < 0.001, Spearman’s correlation).”

(4) Page 9, line 269: The analysis of interspecific TE changes between uORFs and their corresponding CDSs is a crucial piece of evidence supporting the authors' conclusions. Presenting this analysis as part of the figures, rather than in "Table 1," would improve clarity and accessibility.

Thank you for this suggestion. In Table 1, we originally presented the number of uORFs and CDSs that showed significant differences in TE between D. melanogaster and D. simulans during various developmental stages. One key point we aimed to emphasize was that, although TE changes in uORFs and their downstream CDSs are positively correlated, there is a notable difference in the magnitude of these changes. To better convey this, we have summarized the core findings of Table 1 in graphical form.

In Figure 3B of the revised version, we compared the absolute values of interspecific TE changes between CDS and uORF, showing that CDSs consistently exhibit smaller shifts than their upstream uORFs. This result further supports the translational buffering effect of uORFs on downstream CDS expression. We have included the updated results in the revised manuscript (Lines 281-284) as follows:

“In fact, the absolute values of log2(fold change) of TE for uORFs between D. melanogaster and D. simulans was significantly greater than that observed for corresponding CDSs across all samples (P < 0.001, Wilcoxon signed-rank test; Figure 3B), suggesting that the magnitude of TE changes in CDSs is generally smaller than that in uORFs, due to the buffering effect of uORF.”

(5) Page 9, line 279: The phrase "dominantly translated" needs clarification. Does it refer to Figure 2C, where one uORF is dominantly translated within a gene, or does it mean that the uORF's translation is higher than that of its corresponding CDS?

We apologize for the obscurity. The phrase "dominantly translated" means one uORF with the highest TE compared to other uORFs within a gene. We have rephrased the relevant sentence in the revised version (Lines 299-304), which now reads:

“To investigate how the conservation level and translation patterns of uORFs influence their buffering capacity on CDS translation, we categorized genes expressed in each pair of samples into three classes:

Class I, genes with conserved uORFs that are dominantly translated (i.e., exhibiting the highest TE among all uORFs within the same gene) in both Drosophila species; Class II, genes with conserved uORFs that are translated in both species but not dominantly translated in at least one; and Class III, the remaining expressed genes.”

(6) The sequencing data and analysis code should be made publicly available before publication to ensure transparency and reproducibility.

Thank you for this suggestion. As described in the Data availability section, all deepsequencing data generated in this study, including single-ended mRNA-Seq and Ribo-Seq data of 10 developmental stages and tissues of Drosophila simulans and paired-end mRNA-Seq data of 0-2 h, 26 h, 6-12 h, and 12-24 h Drosophila melanogaster embryos, were deposited in the China National Genomics Data Center Genome Sequence Archive (GSA) under accession numbers CRA003198, CRA007425, and CRA007426. The mRNA-Seq and Ribo-Seq data for the different developmental stages and tissues of Drosophila melanogaster were published in our previous paper [8] and were deposited in the Sequence Read Archive (SRA) under accession number SRP067542.

All original code has been deposited on GitHub: https://github.com/lujlab/uORF_buffer; https://github.com/lujlab/Buffer_eLife2025.

Response reference

(1) Li, X.Y., MacArthur, S., Bourgon, R., Nix, D., Pollard, D.A., Iyer, V.N., Hechmer, A., Simirenko, L., Stapleton, M., Luengo Hendriks, C.L., et al. (2008). Transcription factors bind thousands of active and inactive regions in the Drosophila blastoderm. PLoS Biol 6, e27. 10.1371/journal.pbio.0060027.

(2) Horner, V.L., Czank, A., Jang, J.K., Singh, N., Williams, B.C., Puro, J., Kubli, E., Hanes, S.D., McKim, K.S., Wolfner, M.F., and Goldberg, M.L. (2006). The Drosophila calcipressin sarah is required for several aspects of egg activation. Curr Biol 16, 1441-1446. 10.1016/j.cub.2006.06.024.

(3) Lee, K.M., Linskens, A.M., and Doe, C.Q. (2022). Hunchback activates Bicoid in Pair1 neurons to regulate synapse number and locomotor circuit function. Curr Biol 32, 2430-2441 e2433. 10.1016/j.cub.2022.04.025.

(4) Wharton, T.H., Nomie, K.J., and Wharton, R.P. (2018). No significant regulation of bicoid mRNA by Pumilio or Nanos in the early Drosophila embryo. PLoS One 13, e0194865. 10.1371/journal.pone.0194865.

(5) Wang, J., Zhang, S., Lu, H., and Xu, H. (2022). Differential regulation of alternative promoters emerges from unified kinetics of enhancer-promoter interaction. Nat Commun 13, 2714. 10.1038/s41467-022-30315-6.

(6) Xu, H., Sepulveda, L.A., Figard, L., Sokac, A.M., and Golding, I. (2015). Combining protein and mRNA quantification to decipher transcriptional regulation. Nat Methods 12, 739-742. 10.1038/nmeth.3446.

(7) Zhang, H., Wang, Y., Wu, X., Tang, X., Wu, C., and Lu, J. (2021). Determinants of genomewide distribution and evolution of uORFs in eukaryotes. Nat Commun 12, 1076. 10.1038/s41467-021-21394-y.

(8) Zhang, H., Dou, S., He, F., Luo, J., Wei, L., and Lu, J. (2018). Genome-wide maps of ribosomal occupancy provide insights into adaptive evolution and regulatory roles of uORFs during Drosophila development. PLoS Biol 16, e2003903. 10.1371/journal.pbio.2003903.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Data Citations

    1. Sun Y, Duan Y, Gao P, Liu C, Jin K, Dou S, Tang W, Zhang H, Lu J. 2024. Upstream open reading frames buffer translational variability during Drosophila evolution and development. China National Genomics Data Center Genome Sequence Archive. CRA003198 [DOI] [PMC free article] [PubMed]
    2. Sun Y, Duan Y, Gao P, Liu C, Jin K, Dou S, Tang W, Zhang H, Lu J. 2024. Upstream open reading frames buffer translational variability during Drosophila evolution and development. China National Genomics Data Center Genome Sequence Archive. CRA007425 [DOI] [PMC free article] [PubMed]
    3. Sun Y, Duan Y, Gao P, Liu C, Jin K, Dou S, Tang W, Zhang H, Lu J. 2024. Upstream open reading frames buffer translational variability during Drosophila evolution and development. China National Genomics Data Center Genome Sequence Archive. CRA007426 [DOI] [PMC free article] [PubMed]
    4. Zhang H, Dou S, He F, Luo J, Wei L, Lu J. 2018. Genome-wide maps of ribosomal occupancy provide insights into adaptive evolution and regulatory roles of uORFs during Drosophila development. NCBI Sequence Read Archive. SRP067542 [DOI] [PMC free article] [PubMed]

    Supplementary Materials

    Figure 1—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 1—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 2—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 3—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 4—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 5—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 6—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 7—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 8—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 9—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 10—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 11—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 12—source data 1. Raw numerical data underlying the figures.
    Figure 1—figure supplement 13—source data 1. Raw numerical data underlying the figures.
    Figure 2—source data 1. Raw numerical data underlying the figures.
    Figure 2—figure supplement 1—source data 1. Raw numerical data underlying the figures.
    Figure 2—figure supplement 2—source data 1. Raw numerical data underlying the figures.
    Figure 3—source data 1. Raw numerical data and statistical analysis underlying the figures.
    Figure 3—figure supplement 1—source data 1. Raw numerical data underlying the figures.
    Figure 3—figure supplement 3—source data 1. Raw numerical data underlying the figures.
    Figure 3—figure supplement 5—source data 1. Raw numerical data underlying the figures.
    Table 1—source data 1. Raw data and statistical analysis for uORF TE changes.
    Table 1—source data 2. Raw data and statistical analysis for CDS TE changes.
    Figure 4—source data 1. Raw numerical data underlying the figures.
    Figure 4—figure supplement 1—source data 1. Raw numerical data underlying the figures.
    Figure 4—figure supplement 2—source data 1. Raw numerical data underlying the figures.
    Figure 5—source data 1. Raw numerical data underlying the figures.
    Figure 5—figure supplement 1—source data 1. Raw numerical data underlying the figures.
    Figure 6—source data 1. Raw numerical data underlying the figures.
    Figure 6—figure supplement 2—source data 1. Raw numerical data underlying the figures.
    Figure 6—figure supplement 3—source data 1. Raw numerical data underlying the figures.
    Figure 6—figure supplement 4—source data 1. Raw numerical data underlying the figures.
    Figure 7—source data 1. Raw numerical data underlying the figures.
    Figure 8—source data 1. Raw numerical data underlying the figures.
    Figure 8—figure supplement 1—source data 1. Raw numerical data underlying the figures.
    Figure 8—figure supplement 2—source data 1. Raw numerical data underlying the figures.
    Figure 8—figure supplement 3—source data 1. Raw numerical data underlying the figures.
    Figure 8—figure supplement 4—source data 1. Raw numerical data underlying the figures.
    Figure 8—figure supplement 5—source data 1. Raw numerical data underlying the figures.
    Supplementary file 1. Parameters used in our simulation.
    elife-104074-supp1.docx (83.8KB, docx)
    Supplementary file 2. Mapping statistics of Ribo-Seq and matched mRNA-Seq libraries.
    elife-104074-supp2.docx (112.8KB, docx)
    Supplementary file 3. Genes with uORFs showing strong evidence of translational buffering.
    Supplementary file 4. Mapping statistics of mRNA-Seq libraries for bcd uKO2/uKO2 mutant and WT flies.
    elife-104074-supp4.docx (114.9KB, docx)
    Supplementary file 5. Numbers of genes showing different magnitudes of TE changes between uORFs and CDS at the interspecific level, H. sapiens and M. mulatta.
    MDAR checklist

    Data Availability Statement

    All deep-sequencing data generated in this study, including single-ended mRNA-Seq and Ribo-Seq data of 10 developmental stages and tissues of Drosophila simulans and paired-end mRNA-Seq data of 0-2 h, 2-6 h, 6-12 h, and 12-24 h Drosophila melanogaster embryos, were deposited in the China National Genomics Data Center Genome Sequence Archive (GSA) under accession numbers CRA003198, CRA007425, and CRA007426. The mRNA-Seq and Ribo-Seq data for the different developmental stages and tissues of Drosophila melanogaster were published in our previous paper and were deposited in the Sequence Read Archive (SRA) under accession number SRP067542. All original code has been deposited on GitHub: https://github.com/lujlab/uORF_buffer (copy archived at Lu, 2022) and https://github.com/lujlab/Buffer_eLife2025 (copy archived at Lu, 2025).

    The following datasets were generated:

    Sun Y, Duan Y, Gao P, Liu C, Jin K, Dou S, Tang W, Zhang H, Lu J. 2024. Upstream open reading frames buffer translational variability during Drosophila evolution and development. China National Genomics Data Center Genome Sequence Archive. CRA003198

    Sun Y, Duan Y, Gao P, Liu C, Jin K, Dou S, Tang W, Zhang H, Lu J. 2024. Upstream open reading frames buffer translational variability during Drosophila evolution and development. China National Genomics Data Center Genome Sequence Archive. CRA007425

    Sun Y, Duan Y, Gao P, Liu C, Jin K, Dou S, Tang W, Zhang H, Lu J. 2024. Upstream open reading frames buffer translational variability during Drosophila evolution and development. China National Genomics Data Center Genome Sequence Archive. CRA007426

    The following previously published dataset was used:

    Zhang H, Dou S, He F, Luo J, Wei L, Lu J. 2018. Genome-wide maps of ribosomal occupancy provide insights into adaptive evolution and regulatory roles of uORFs during Drosophila development. NCBI Sequence Read Archive. SRP067542


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES