Repurposing the mammalian RNA-binding protein Musashi-1 as an allosteric translation repressor in bacteria

Roswitha Dolcemascolo; María Heras-Hernández; Lucas Goiriz; Roser Montagud-Martínez; Alejandro Requena-Menéndez; Raúl Ruiz; Anna Pérez-Ràfols; R Anahí Higuera-Rodríguez; Guillermo Pérez-Ropero; Wim F Vranken; Tommaso Martelli; Wolfgang Kaiser; Jos Buijs; Guillermo Rodrigo

doi:10.7554/eLife.91777

. 2024 Feb 16;12:RP91777. doi: 10.7554/eLife.91777

Repurposing the mammalian RNA-binding protein Musashi-1 as an allosteric translation repressor in bacteria

Roswitha Dolcemascolo ^1,^2,^†, María Heras-Hernández ^1,^†, Lucas Goiriz ^1,^3,^†, Roser Montagud-Martínez ^1,², Alejandro Requena-Menéndez ¹, Raúl Ruiz ¹, Anna Pérez-Ràfols ^4,⁵, R Anahí Higuera-Rodríguez ^6,⁷, Guillermo Pérez-Ropero ^8,⁹, Wim F Vranken ^10,¹¹, Tommaso Martelli ⁴, Wolfgang Kaiser ⁶, Jos Buijs ^8,¹², Guillermo Rodrigo ^1,^✉

Editors: Joseph T Wade¹³, Christian R Landry¹⁴

PMCID: PMC10942595 PMID: 38363283

Abstract

The RNA recognition motif (RRM) is the most common RNA-binding protein domain identified in nature. However, RRM-containing proteins are only prevalent in eukaryotic phyla, in which they play central regulatory roles. Here, we engineered an orthogonal post-transcriptional control system of gene expression in the bacterium Escherichia coli with the mammalian RNA-binding protein Musashi-1, which is a stem cell marker with neurodevelopmental role that contains two canonical RRMs. In the circuit, Musashi-1 is regulated transcriptionally and works as an allosteric translation repressor thanks to a specific interaction with the N-terminal coding region of a messenger RNA and its structural plasticity to respond to fatty acids. We fully characterized the genetic system at the population and single-cell levels showing a significant fold change in reporter expression, and the underlying molecular mechanism by assessing the in vitro binding kinetics and in vivo functionality of a series of RNA mutants. The dynamic response of the system was well recapitulated by a bottom-up mathematical model. Moreover, we applied the post-transcriptional mechanism engineered with Musashi-1 to specifically regulate a gene within an operon, implement combinatorial regulation, and reduce protein expression noise. This work illustrates how RRM-based regulation can be adapted to simple organisms, thereby adding a new regulatory layer in prokaryotes for translation control.

Research organism: E. coli, Mouse

Introduction

Gene regulation at the post-transcriptional level is pervasive in living organisms of ranging complexity (Waters and Storz, 2009; Holmqvist and Vogel, 2018; Jonas and Izaurralde, 2015; Glisovic et al., 2008). Indeed, the ability to regulate the genetic information flow at different points appears instrumental to maximize the integration of intrinsic and extrinsic signals, which enables an efficient information processing by the organisms. However, the solutions implemented in prokaryotes and eukaryotes greatly differ. In prokaryotes, small RNAs (sRNAs) regulate messenger RNA (mRNA) stability and translation initiation (Waters and Storz, 2009), supported by a series of RNA-binding proteins (e.g., Hfq) that act globally (Holmqvist and Vogel, 2018). Regulatory proteins of specific scope in these simple organisms mainly operate in the transcriptional layer (Madan Babu et al., 2006), what is aligned with the models presented in the early times of molecular biology (Jacob and Monod, 1961). By contrast, eukaryotes deploy a sizeable number of RNA-binding proteins with a variety of functions (Glisovic et al., 2008) that participate in the regulation of mRNA turnover, transport, splicing, and translation in a gene-specific manner and also at a global scale. In animals, in particular, most RNA-binding proteins contain RNA recognition motifs (RRMs) (Maris et al., 2005). RRMs are small globular domains of about 90 amino acids that fold into four antiparallel β-strands and two α-helices, which can bind to single-strand RNAs with sufficient affinity and specificity to control biological processes (Messias and Sattler, 2004). Yet, while important to attain functional diversity in the post-transcriptional layer in animals, RRMs are not prevalent in all organisms. In fact, the scarcity of RRM-containing proteins in prokaryotes and the often-unknown functional role of those identified by bioinformatic methods (Maruyama et al., 1999) question whether RRMs can readily work in organisms with much simpler gene expression machinery and intracellular organization. If so, this would raise the potential to use RRM–RNA interactions as an orthogonal layer to engineer gene regulation in prokaryotes.

To address these intriguing questions, we adopted a synthetic biology approach where a specific RRM-containing protein was incorporated in a bacterium in order to engineer a post-transcriptional control module. Synthetic biology has highlighted how living cells can be (re)programmed through the assembly of independent genetic elements into functional networks for a variety of applications in biotechnology and biomedicine (Khalil and Collins, 2010). Yet, synthetic biology can also be used to disentangle natural systems and probe hypotheses about biological function (Bashor and Collins, 2018). In previous work, some proteins with the ability to recognize RNA have been exploited as translation factors in bacteria for a gene-specific regulation (Belmont and Niles, 2010; Katz et al., 2019; Cao et al., 2015). The first instance was the tetracycline repressor protein (TetR), which naturally functions as a transcription factor, by means of the selection of synthetic RNA aptamers (Belmont and Niles, 2010). The bacteriophage MS2 coat protein (MS2CP) (Katz et al., 2019) and eukaryotic Pumilio homology domains (Cao et al., 2015) were also used in synthetic circuits. Alternatively, a wide palette of post-transcriptional control systems based on sRNAs have been developed in recent years to program gene expression in bacteria (Qi and Arkin, 2014). Of note, these systems are amenable to be combined with regulatory proteins to attain complex dynamic behaviors (Rosado et al., 2018). A heterologous RRM-containing protein with definite regulatory activity, in addition to provide empirical evidence on the adaptability of such RNA-binding domains to different genetic backgrounds, would enlarge the synthetic biology toolkit (Shotwell et al., 2020), boosting applications in which high orthogonality, expression fine-tuning, and signal integrability are required features. In addition, RRMs can themselves be allosterically regulated, opening up new avenues for post-transcriptional regulation by small molecules.

Still, there are instances of bacterial proteins that regulate translation in a gene-specific manner, such as CsrA to control glycogen biosynthesis (Liu and Romeo, 1997) or the ribosomal protein S8 to exert self-repression (Meyer, 2018). Besides, it is worth noting that some bacteriophages follow this mechanism to modulate their infection cycle. These are the cases, for example, of the coat proteins of the phages MS2 (infecting Escherichia coli) or PP7 (infecting Pseudomonas aeruginosa), which regulate the expression of the cognate phage replicases through protein–RNA interactions (Babitzke et al., 2009). However, one limitation for synthetic biology developments is that such phage proteins are not allosteric. At the post-transcriptional level, bacteria mostly rely on a large palette of cis- and trans-acting non-coding RNAs to either activate or repress protein expression, resulting in the regulation of translation initiation, mRNA stability, or transcription termination, and even allowing sensing small molecules (Waters and Storz, 2009; Qi and Arkin, 2014). Thus, there should be efforts to replicate this functional versatility with proteins.

In this work, the mammalian RNA-binding protein Musashi-1 (MSI-1) (Fox et al., 2015) was used as a translation repressor in the bacterium E. coli (Figure 1a). MSI-1 belongs to an evolutionarily conserved family of RRM-containing proteins, of which a member was first identified in Drosophila melanogaster (Nakamura et al., 1994). MSI-1 contains two RRMs in the N-terminal region (RRM1 and RRM2) and recognizes the RNA consensus sequence RU_nAGU on the nanomolar affinity scale (Imai et al., 2001). Importantly, MSI-1 can be allosterically inhibited by fatty acids (in particular, 18–22-carbon ω–9 monounsaturated fatty acids) (Clingman et al., 2014). In mammals, MSI-1 is mainly expressed in stem cells of neural and epithelial lineage and plays crucial roles in differentiation, tumorigenesis, and cell cycle regulation (Fox et al., 2015). Notably, MSI-1 regulates Notch signaling by repressing the translation of a key protein in the pathway (Imai et al., 2001). Hence, rather than moving genetic elements from simple to complex organisms, as it is normally done (e.g., the TetR-aptamer module was implemented in simple eukaryotes Ganesan et al., 2016), we reversed the path by moving an important mammalian gene (from Mus musculus) to E. coli. Some eukaryotic factors have already been implemented in bacteria to regulate gene expression at different levels (Cao et al., 2015; MacDonald et al., 2021), but the case of RRM-containing proteins has remained elusive. In the following, we present quantitative experimental and theoretical results on the response dynamics of a synthetic gene circuit in which MSI-1 works as an allosteric translation repressor. There, MSI-1 is transcriptionally controlled by the lactose repressor protein (LacI), and translation regulation by MSI-1 is accomplished by means of a specific interaction with an mRNA (encoding a reporter protein) that harbors a suitable binding motif in its N-terminal coding region.

Figure 1—figure supplement 1. — (a) Overview of the biotechnological development. In mammals, MSI-1 binds to the 3’ untranslated region (UTR) of its target mRNA to repress translation. Here, the *M. musculus* gene coding for MSI-1 was moved to *E. coli* (transgenesis) to implement a synthetic regulation system at the level of translation. (b) Schematic of the synthetic gene circuit engineered in *E. coli*. A truncated version of MSI-1 (termed MSI-1*) was expressed from the PLlac promoter to be induced with lactose (or isopropyl β-D-1-thiogalactopyranoside [IPTG]) in a genetic background overexpressing LacI. sfGFP was used as a reporter expressed from a constitutive promoter (J23119) and under the control of a suitable RNA motif recognized by MSI-1* in the N-terminal coding region of the transcript (viz., located after the start codon). The activity of MSI-1* could in turn be allosterically inhibited by oleic acid. In electronic terms, this circuit implements an IMPLY logic gate. The inset shows the predicted secondary structure of the N-terminal coding region of the reporter mRNA. Within the motif (blue shaded), the consensus recognition sequences (RU_nAGU) are bolded and the minimal cores (UAG) are marked in red. System implemented with pRM1+ and pREP6. (c) Dose–response curve of the system using lactose as inducer (up to 1 mM). MSI-1* downregulated sfGFP expression by 2.5-fold. The inset shows the dynamic range of the response using lactose or IPTG (1 mM), showing a statistically significant regulation in both cases (Welch’s t-test, two-tailed p<0.05). (d) Transfer function of the system (between sfGFP and MSI-1*). The inset shows the dose–response curve of eBFP2 expressed from the PLlac promoter (proxy of MSI-1* expression) with lactose. (e) Scatter plot of the dynamic response of the system in the Crick space (translation rate vs. transcription rate). The dose–response curve of mScarlet expressed from the J23119 promoter with lactose was used to perform the decomposition (vertical line fitted to 48 AU/h). The inset shows the growth rate of the cells for each induction condition (horizontal line fitted to 0.55 h^–1). In all cases, points correspond to experimental data, while solid lines come from adjusted mathematical models. Error bars correspond to standard deviations (n = 3). (f) Probability-based histograms of sfGFP expression from single-cell data for different lactose concentrations, showing a statistically significant regulation (one-way ANOVA, p<10^–4). The inset shows the percentage of cells in the ON state (sfGFP expressed), according to a specified threshold, for each lactose concentration. AU, arbitrary units.

Figure 1—source data 1. Bulk fluorescence data of sfGFP, eBFP2, and mScarlet with lactose and single-cell data of sfGFP.

elife-91777-fig1-data1.xlsx^{(528.4KB, xlsx)}

Results

A Musashi protein can downregulate translation in bacteria

From the amino acid sequence of M. musculus MSI-1, we generated a nucleotide sequence with codons optimized for E. coli expression. Knowing that the C-terminus of MSI-1 is of low structural complexity (Iwaoka et al., 2017), we cloned a truncated version of the gene encompassing the first 192 amino acids, which include the two RRMs, to implement our synthetic circuit (Figure 1b). The resulting protein (termed MSI-1*) was expressed from a synthetic PL-based promoter repressed by LacI (termed PLlac) (Lutz and Bujard, 1997) lying in a high copy number plasmid. This allowed controlling the expression of the heterologous RNA-binding protein at the transcriptional level with lactose or isopropyl β-D-1-thiogalactopyranoside (IPTG) in a genetic background overexpressing LacI. As a regulated element, we used the superfolder green fluorescent protein (sfGFP) (Pédelacq et al., 2006), which was expressed from a constitutive promoter (J23119) lying in a low copy number plasmid (Figure 1—figure supplement 1). An RNA motif obtained by affinity elution-based RNA selection (SELEX) containing two copies of the consensus recognition sequence (viz., GUUAGU and AUUUAGU) (Imai et al., 2001) was placed in frame after the start codon of sfGFP. This motif folds into a stem-loop structure that allows stabilizing the exposure of the recognition sequence to the solvent. In this way, MSI-1* can repress translation by blocking the binding of the ribosome, presumably by imposing a steric hindrance for the 30S ribosomal subunit. This mode of action differs from the natural one in mammals, in which MSI-1 binds to the 3′ untranslated region (UTR) of its target mRNA (Numb) to repress translation by disrupting the activation function of the poly(A)-binding protein (Kawahara et al., 2008). Here, considering lactose (or IPTG) and oleic acid as the two inputs and sfGFP as the output, MSI-1* being an internal allosteric regulator operating at the post-transcriptional level, an IMPLY gate would model the logic behavior of the resulting circuit (i.e., sfGFP would only turn off with lactose and without oleic acid in the medium).

We first characterized by bulk fluorometry the dose–response curve of the system using a lactose concentration gradient up to 1 mM. Our data show that MSI-1* downregulated sfGFP expression by 2.5-fold (Figure 1c). Fitting a Hill equation, we obtained a regulatory coefficient of 99 μM (lactose concentration at which the repression is half of the maximal) and a Hill coefficient of 1.7 (Appendix 1). We also observed that IPTG (a synthetic compound) triggered a very similar response. To further inspect the activity of the RNA-binding protein, we filtered out the transcriptional regulatory effect. For that, we expressed the enhanced blue fluorescent protein 2 (eBFP2) (Ai et al., 2007) from the PLlac promoter to obtain the corresponding dose–response curve with lactose. In this way, eBFP2 expression was a proxy of MSI-1* expression, which allowed representing the transfer function of the engineered regulation (Figure 1d). A Hill equation with no cooperative binding (i.e., Hill coefficient of 1) explained the data with sufficient agreement, suggesting that only one protein interacted with a given mRNA (i.e., each RRM of MSI-1* binds to a consensus sequence repeat, in agreement with a previous structural model; Iwaoka et al., 2017). We also measured the cell growth rate for all induction conditions, finding that the values were almost constant. This indicates that the expression of the mammalian protein did not produce a significant burden to the bacterial cell.

In simple terms, protein expression comes from the product of the transcription and translation rates of the gene. Hence, we examined such a decomposition in the case of sfGFP expression regulated by MSI-1*. Of note, the low copy number plasmid harbors an additional transcriptional unit to express the monomeric red fluorescent protein mScarlet (Bindels et al., 2017) from a constitutive promoter (J23119). We then monitored its expression profile with lactose. Assuming that sfGFP and mScarlet were equally transcribed, as they were expressed from the same promoter, and that the translation rate of mScarlet was constant, the product of mScarlet expression and cell growth rate was considered a proxy of the transcription rate of sfGFP. Moreover, the ratio of sfGFP and mScarlet expressions was a proxy of the translation rate of sfGFP (Klumpp et al., 2009). This served us to represent the dynamics of the system in a plane defined as translation rate vs. transcription rate (termed Crick space; Hausser et al., 2019), highlighting that the change in sfGFP expression with lactose comes indeed from translation regulation (Figure 1e). A reverse transcription quantitative polymerase chain reaction (RT-qPCR) was used to confirm the preservation of the sfGFP mRNA level (Figure 1—figure supplement 2). Finally, to evaluate the heterogeneity of the response within a bacterial population, we performed single-cell measurements of sfGFP expression by flow cytometry. Unimodal distributions able to shift in response to lactose were observed (Figure 1f). Setting a threshold to categorize expression, we found that the percentage of cells in the ON state dropped from 87% to 15% upon addition of 1 mM lactose. In sum, our results show that MSI-1* can regulate translation in a specific manner in E. coli, and hence that eukaryotic regulators can be borrowed to be functional elements in prokaryotes.

Mechanistic insight into the engineered regulation based on a protein–RNA interaction

We then introduced a series of point mutations into the SELEX RNA motif to assess their effect over the regulatory activity of the RRM-containing protein (Figure 2a). These mutations change the consensus recognition sequence of at least one repeat. A characterization of all systems revealed that the mutations affected both the maximal level and fold change of sfGFP expression (Figure 2b). Of note, a single point mutation in one repeat leading to RU_nCGU (mutant 1) was quite detrimental for the MSI-1*-based regulation (only 1.4-fold reduction in sfGFP expression). Despite the mutation substantially reducing sfGFP expression in the absence of MSI-1*, the presumed repressed state upon addition of lactose did not change much, suggesting the difficulty of the protein for targeting the mutated mRNA. This agrees with the prior observation that, within the consensus sequence, UAG is a minimal core that determines the specific recognition by MSI-1 (Zearfoss et al., 2014). A double point mutation changing the minimal cores of the two repeats (UAC rather than UAG; mutant 5) also resulted in a detrimental action, but not to a greater extent. We also engineered a new reporter system with a minimal RNA motif consisting of a single copy of the shortest possible consensus sequence (AUAGU), but its characterization showed no apparent regulation by MSI-1* (Figure 2—figure supplement 1). Taken together, two copies of the consensus sequence seem necessary for a successful regulation of protein expression.

Figure 2. — (a) Sequences and predicted secondary structures of the different RNA motif variants for MSI-1 binding analyzed in this work. Point-mutations indicated in red. Three-dimensional representations of the RRM1 and RNA motif are also shown. Within the RRM1, the region that recognizes the RNA is shown in blue. (b) Dynamic range of the response of the different genetic systems using lactose (1 mM), showing a statistically significant regulation in all cases (Welch’s t-test, two-tailed p<0.05; although some mutants present a small fold change). (c) Schematic of the heliX biosensor platform. A double-strand DNA nanolever was immobilized on a gold electrode of the chip. The nanolever carried a fluorophore in one end and the RNA motif for MSI-1 binding in the other. Binding between MSI-1_h* (injected analyte) and RNA led to a fluorescence change, whose monitoring in real time served to extract the kinetic constants that characterize the interaction. (d) Scatter plot of the experimentally-determined kinetic constants of association and dissociation between the protein and the RNA for all systems (original and five mutants). Means and deviations calculated in log scale (geometric). (e) Correlation between the maximal sfGFP expression level (in the absence of lactose) and the translation rate predicted with RBS calculator. Linear regression performed. (f) Correlation between the fold change in sfGFP expression and the dissociation constant (K_D). Deviations calculated by propagation. Linear regression performed (vs. 1/K_D). Blue shaded areas indicate 95% confidence intervals. In all cases, error bars correspond to standard deviations (n = 3). AU, arbitrary units.

Figure 2—source data 1. Bulk fluorescence data of sfGFP and binding kinetics measurements.

elife-91777-fig2-data1.xlsx^{(8.3KB, xlsx)}

Figure 2—figure supplement 1. — (a) Sequences and predicted secondary structures of the different RNA motif variants for MSI-1 binding analyzed in this work. Point-mutations indicated in red. Three-dimensional representations of the RRM1 and RNA motif are also shown. Within the RRM1, the region that recognizes the RNA is shown in blue. (b) Dynamic range of the response of the different genetic systems using lactose (1 mM), showing a statistically significant regulation in all cases (Welch’s t-test, two-tailed p<0.05; although some mutants present a small fold change). (c) Schematic of the heliX biosensor platform. A double-strand DNA nanolever was immobilized on a gold electrode of the chip. The nanolever carried a fluorophore in one end and the RNA motif for MSI-1 binding in the other. Binding between MSI-1_h* (injected analyte) and RNA led to a fluorescence change, whose monitoring in real time served to extract the kinetic constants that characterize the interaction. (d) Scatter plot of the experimentally-determined kinetic constants of association and dissociation between the protein and the RNA for all systems (original and five mutants). Means and deviations calculated in log scale (geometric). (e) Correlation between the maximal sfGFP expression level (in the absence of lactose) and the translation rate predicted with RBS calculator. Linear regression performed. (f) Correlation between the fold change in sfGFP expression and the dissociation constant (K_D). Deviations calculated by propagation. Linear regression performed (vs. 1/K_D). Blue shaded areas indicate 95% confidence intervals. In all cases, error bars correspond to standard deviations (n = 3). AU, arbitrary units.

Figure 2—source data 1. Bulk fluorescence data of sfGFP and binding kinetics measurements.

elife-91777-fig2-data1.xlsx^{(8.3KB, xlsx)}

To relate the cellular effects with protein–RNA interactions, we obtained a purified MSI-1* preparation in order to perform in vitro binding kinetics assays (Figure 2—figure supplement 2). For that, a gene coding for a truncated version of the human MSI-1 was expressed from a T7 polymerase promoter in E. coli. With respect to the M. musculus version, this protein only differs in one residue of RRM2 (then termed MSI-1_h*), which is the subsidiary domain for RNA recognition (note also that the human and mouse proteins recognize the same consensus sequence; Zearfoss et al., 2014). To avoid the necessity of labeling the molecules of interest and allow working with very low amounts of protein and RNA, we used the switchSENSE technology, which allows measuring molecular dynamics on a chip (Figure 2c; Cléry et al., 2017). Figure 2d summarizes the resulting protein–RNA association and dissociation rates (k_ON and k_OFF, respectively; see also Figure 2—figure supplement 3). In the case of the original RNA motif, we found an association rate of 1.1 nM^-1 min^–1, which means that a single regulator molecule would take 1–3 min to find its target in the cell, and a residence time of the protein on the RNA of 1.5 min (given by 1/k_OFF). Of note, the reported value of k_ON is relatively close to the upper limit imposed by the diffusion rate (~1 nM^–1 s^–1). This fast rate suggests that MSI-1* is able to find its target mRNA in E. coli, competing with ribosomes and ribonucleases, and then achieve translation regulation. We also found that a single mutation in one of the two UAG minimal cores (mutants 1 and 2) led to similar association but faster dissociation (almost four times faster dissociation), whereas a double mutation affecting the two cores (mutant 5) disturbed both phases (almost 15 times slower association and 10 times faster dissociation). The dissociation constant (K_D = k_OFF/k_ON) was 0.62 nM for the original system, while 87 nM for mutant 5. The switchSENSE technology allowed revealing that affinity on the subnanomolar scale, refining a previous estimate of 4 nM obtained by gel shift assays (Imai et al., 2001). To contextualize these values, we compared to the binding kinetics of MS2CP, a phage RNA-binding protein that has evolved in a prokaryotic context and that we recently exploited to study how expression noise emerges and propagates through translation regulation (Dolcemascolo et al., 2022). Previous work disclosed an association rate to the cognate RNA motif of 0.032 nM^–1 min^–1 and a residence time of 12 min, leading to a dissociation constant of 2.6 nM (Buenrostro et al., 2014). Thus, MSI-1* would target RNA faster than MS2CP, but once this happened the phage protein would remain bound longer.

Next, we tried to predict the impact of the mutations on sfGFP expression. On the one hand, we used an empirical free-energy model (RBS calculator) to obtain an estimate of the mRNA translation rate from the sequence (Salis et al., 2009). However, only a poor correlation (R² = 0.16) with the maximal expression level was observed (Figure 2e), suggesting that additional variables should be considered. For example, it was surprising the higher expression level in the case of mutant 4, despite a minimal change in the structure of the RNA motif (Figure 2—figure supplement 3a; we ensured that sfGFP was in frame in this case). On the other hand, when the fold change was correlated with the inverse of the dissociation constant (1/K_D, i.e., the equilibrium constant) better results were obtained (R² = 0.75; Figure 2f). Mutant 1 is illustrative in this case because, even though a fast association rate was preserved (1.6 nM^–1 min^–1), it displayed a marginal regulatory activity as a result of a shorter residence time (0.41 min). This indicates that the underlying protein–RNA interaction in the bacterial circuit was close to thermodynamic equilibrium.

A mathematical model captured the dynamic response of the system

Translation regulation is more challenging than transcription regulation because mRNA is unstable compared to DNA, especially in bacteria. In E. coli, in particular, the average mRNA half-life is about 5 min (Bernstein et al., 2002). However, it is possible to derive a common mathematical framework from which to analyze the dynamics of both regulatory modes (Figure 3a). The fold change in protein expression is a suitable mesoscopic parameter that is directly related to the kinetic parameters that characterize the interaction in the cell (Garcia and Phillips, 2011). Using mass action kinetics, we obtained a general mathematical description of the fold change as a function of the regulator concentration (R), the association and dissociation rates, the leakage fraction of RNA/peptide-chain elongation, and the nucleic acid degradation rate (Appendix 2). To visualize the impact of the different parameters, we represented the fold change equation as a heatmap. When there is no nucleic acid degradation (DNA), a linear dependence between the first-order association rate (k_ONR) and k_OFF is established to maintain a given fold change value (Figure 3b), which would correspond to the case of transcription regulation. Accordingly, our model converges to the classical description of fold = 1 + R/K_D. However, if the nucleic acid degrades quickly (mRNA), the dependence between the first-order kinetic rates becomes nonlinear (Figure 3c). Indeed, in the case of translation regulation, it is important to note that when k_ONR is lower than the mRNA degradation rate (i.e., the mRNA is degraded faster than the protein binds), the functionality is greatly compromised. To overcome this barrier, the regulator needs to be highly expressed as MSI-1* is in our system (we estimate R > 1 μM with 1 mM lactose). Furthermore, when the residence time is much longer than the mRNA half-life (i.e., the mRNA is degraded before the protein unbinds), K_D is not a suitable parameter to characterize the regulation, which is solely association-dependent, resulting in non-equilibrium thermodynamics (Goiriz and Rodrigo, 2021). According to the aforementioned kinetic rates, this would be the case for MS2CP, but not for MSI-1* (i.e., both k_ON and k_OFF are instrumental to describe the regulation exerted by MSI-1*). Furthermore, given the 2.5-fold downregulation in our system, we estimated an elongation leakage fraction of 40% (using the fold change equation in the limit R → ∞). This leakage would come from the ability of ribosomes to elongate even if MSI-1* is bound and their ability to bind sooner to the sfGFP mRNA due to a conserved transcription–translation coupling mechanism (Kohler et al., 2017).

Figure 3. — (a) Schematics of gene regulation at different levels with proteins that bind to nucleic acids (DNA or RNA). On the left, schematic of transcription regulation (e.g., LacI regulating MSI-1* expression). On the right, schematic of translation regulation (e.g., MSI-1* regulating sfGFP expression). A general mathematical expression (gray shaded) was derived to calculate the fold change in protein expression as a function of the regulator concentration (R), the association and dissociation rates (k_ON and k_OFF), the elongation leakage fraction ( $ε$ ), and the nucleic acid degradation rate ( $δ$ ). (b) Heatmap of the fold change as a function of k_ONR and k_OFF (i.e., the first-order kinetic rates that characterize the protein–DNA/RNA interaction) when $δ = 0$ and $ε = 0.1$ . This would correspond to transcription regulation. (c) Heatmap of the fold change when $δ = 0.14$ min^–1 and $ε = 0.1$ . This would correspond to translation regulation. (d) Total red fluorescence of the cell population (ΣmScarlet) over time without and with 1 mM lactose. In this case, the cell growth rate was fitted to 0.80 h^–1. (e) Total green fluorescence of the cell population (ΣsfGFP) over time without and with 1 mM lactose. The inset shows the dynamic response for different lactose concentrations. (f) Correlation between the experimental values of ΣsfGFP at different times and for different lactose concentrations and the predicted values from a mathematical model that accounts for population growth and gene regulation. Data for t > 2 h. Linear regression performed. (g) Ratio of total green and red fluorescence as a proxy of cellular sfGFP expression over time. Ratio not represented at early times due to the high error obtained given the low number of cells present in the culture (gray shaded area). Deviations calculated by propagation. In all cases, points correspond to experimental data, while solid lines come from an adjusted mathematical model. Error bars correspond to standard deviations (n = 3). AU, arbitrary units.

Figure 3—source data 1. Bulk fluorescence data of sfGFP and mScarlet with time.

elife-91777-fig3-data1.xlsx^{(7.5KB, xlsx)}

Figure 3—figure supplement 1. — (a) Schematics of gene regulation at different levels with proteins that bind to nucleic acids (DNA or RNA). On the left, schematic of transcription regulation (e.g., LacI regulating MSI-1* expression). On the right, schematic of translation regulation (e.g., MSI-1* regulating sfGFP expression). A general mathematical expression (gray shaded) was derived to calculate the fold change in protein expression as a function of the regulator concentration (R), the association and dissociation rates (k_ON and k_OFF), the elongation leakage fraction ( $ε$ ), and the nucleic acid degradation rate ( $δ$ ). (b) Heatmap of the fold change as a function of k_ONR and k_OFF (i.e., the first-order kinetic rates that characterize the protein–DNA/RNA interaction) when $δ = 0$ and $ε = 0.1$ . This would correspond to transcription regulation. (c) Heatmap of the fold change when $δ = 0.14$ min^–1 and $ε = 0.1$ . This would correspond to translation regulation. (d) Total red fluorescence of the cell population (ΣmScarlet) over time without and with 1 mM lactose. In this case, the cell growth rate was fitted to 0.80 h^–1. (e) Total green fluorescence of the cell population (ΣsfGFP) over time without and with 1 mM lactose. The inset shows the dynamic response for different lactose concentrations. (f) Correlation between the experimental values of ΣsfGFP at different times and for different lactose concentrations and the predicted values from a mathematical model that accounts for population growth and gene regulation. Data for t > 2 h. Linear regression performed. (g) Ratio of total green and red fluorescence as a proxy of cellular sfGFP expression over time. Ratio not represented at early times due to the high error obtained given the low number of cells present in the culture (gray shaded area). Deviations calculated by propagation. In all cases, points correspond to experimental data, while solid lines come from an adjusted mathematical model. Error bars correspond to standard deviations (n = 3). AU, arbitrary units.

Figure 3—source data 1. Bulk fluorescence data of sfGFP and mScarlet with time.

elife-91777-fig3-data1.xlsx^{(7.5KB, xlsx)}

In addition, we studied the transient response of the gene circuit with lactose as both MSI-1* and sfGFP expressions changed with time. For that, we quantified the total red fluorescence of the cell population (Figure 3d), which is an estimate of the total number of cells, and the total green fluorescence (Figure 3e), which comes from the composition of population growth and gene regulation. We developed a bottom-up mathematical model based on differential equations to predict sfGFP expression in the cell (Appendix 3), as well as a phenomenological model for the bacterial growth (Appendix 4). The parameter values were adjusted with the curves without and with 1 mM lactose. Then, we used the mathematical model to predict the transient responses for different intermediate lactose concentrations, finding excellent agreement with the experimental data (R² = 0.98; Figure 3f). We also characterized the time-course response of the circuit with IPTG, encountering similar results (Figure 3—figure supplement 1). Moreover, to explore the maintenance of the regulatory behavior when the cell physiology changes, we characterized cells growing in solid medium with a repurposed LigandTracer technology, which initially was developed to monitor molecular interactions in real time (Björke and Andersson, 2006). In this case, a significant difference in the total red fluorescence was observed without and with 1 mM IPTG, suggesting that MSI-1* expression was costly for the cell in these conditions. Besides, the total green fluorescence of the growing population was recapitulated using the model with a 2.6-fold downregulation of cellular sfGFP expression, which is in tune with the results in liquid medium (Figure 3—figure supplement 2). Subsequently, we analyzed the intracellular response. The time-dependent ratio of total green and red fluorescence was used as a proxy of sfGFP expression. A delay in the response is expected because MSI-1* needs to be produced upon addition of lactose (Rosenfeld and Alon, 2003). Nevertheless, our model predicted a faster response than experimentally observed (Figure 3g). Overall, this quantitative inspection of translation regulation backs connections between molecular attributes and cellular behavior.

Rational redesign of the targeted transcript to enhance the dynamic range of the response

The presence of stem-loop structures in the N-terminal coding region contributes to lower the expression level. The more stable and closer to the start codon, the greater the impact on expression (Paulus et al., 2004). We hypothesized that, by destabilizing the RNA motif for MSI-1 binding, we would obtain an alternative regulatory system with higher expression levels. Accordingly, a new reporter system was engineered removing three base pairs from the stem, maintaining the two consensus recognition sequences. An experimental analysis revealed a 4.9-fold increase of the maximal sfGFP expression level and a 2.0-fold downregulation with 1 mM lactose (Figure 4a, redesign 1). We then investigated the possibility of increasing the dynamic range of the response by placing three consecutive RNA motifs. However, we did not observe a greater downregulation with 1 mM lactose (Figure 4a, redesign 2), suggesting that the additional motifs far away from the start codon had no effect; what was noticed is an effect on the maximal expression level.

Figure 4. — (a) Dynamic range of the response of three redesigned genetic systems using lactose (1 mM), showing a statistically significant regulation in the first and third cases (Welch’s t-test, two-tailed p<0.05). The predicted secondary structures of the N-terminal coding regions of the reporter mRNAs are shown on the right. Redesign 1 (red1) was implemented with pREP4b and redesign 2 (red2) with pREP4b3x, which contains three MSI-1 binding sites. These stem-loop structures are less stable than the original one. Redesign 3 (red3) was implemented with pREP7. (b) Dose–response curve of the redesign-3 system using lactose as inducer (up to 1 mM). MSI-1* downregulated sfGFP expression by 8.6-fold (Welch’s t-test, two-tailed p<0.05). The inset shows the scatter plot of the dynamic response in the Crick space (translation rate vs. transcription rate; vertical line fitted to 27 AU/h). The predicted secondary structure of the N-terminal coding region of the reporter mRNA is shown on the right; the mRNA contains two MSI-1 binding sites (blue shaded). In the 5’ UTR, the binding site is formed by two RU_nAGU repeats that flank the RBS without forming secondary structure. In the N-terminal coding region, the binding site is the original one. The minimal cores (UAG) are marked in red. Points correspond to experimental data, while the solid line comes from an adjusted mathematical model. In all cases, error bars correspond to standard deviations (n = 3). (c) Probability-based histograms of sfGFP expression from single-cell data for different lactose concentrations (redesign 3), showing a statistically significant regulation (one-way ANOVA, p<10^–4). The inset shows the percentage of cells in the ON state (sfGFP expressed), according to a specified threshold, for each lactose concentration. AU, arbitrary units.

Figure 4—source data 1. Bulk fluorescence data of sfGFP with lactose.

elife-91777-fig4-data1.xlsx^{(447.8KB, xlsx)}

Figure 4—figure supplement 1. — (a) Dynamic range of the response of three redesigned genetic systems using lactose (1 mM), showing a statistically significant regulation in the first and third cases (Welch’s t-test, two-tailed p<0.05). The predicted secondary structures of the N-terminal coding regions of the reporter mRNAs are shown on the right. Redesign 1 (red1) was implemented with pREP4b and redesign 2 (red2) with pREP4b3x, which contains three MSI-1 binding sites. These stem-loop structures are less stable than the original one. Redesign 3 (red3) was implemented with pREP7. (b) Dose–response curve of the redesign-3 system using lactose as inducer (up to 1 mM). MSI-1* downregulated sfGFP expression by 8.6-fold (Welch’s t-test, two-tailed p<0.05). The inset shows the scatter plot of the dynamic response in the Crick space (translation rate vs. transcription rate; vertical line fitted to 27 AU/h). The predicted secondary structure of the N-terminal coding region of the reporter mRNA is shown on the right; the mRNA contains two MSI-1 binding sites (blue shaded). In the 5’ UTR, the binding site is formed by two RU_nAGU repeats that flank the RBS without forming secondary structure. In the N-terminal coding region, the binding site is the original one. The minimal cores (UAG) are marked in red. Points correspond to experimental data, while the solid line comes from an adjusted mathematical model. In all cases, error bars correspond to standard deviations (n = 3). (c) Probability-based histograms of sfGFP expression from single-cell data for different lactose concentrations (redesign 3), showing a statistically significant regulation (one-way ANOVA, p<10^–4). The inset shows the percentage of cells in the ON state (sfGFP expressed), according to a specified threshold, for each lactose concentration. AU, arbitrary units.

Figure 4—source data 1. Bulk fluorescence data of sfGFP with lactose.

elife-91777-fig4-data1.xlsx^{(447.8KB, xlsx)}

As a further strategy to enhance the dynamic range of the response, we redesigned the 5′ UTR of sfGFP to accommodate two additional RU_nAGU repeats (viz., GUUUAGU and AUUUAGU) flanking the ribosome binding site (RBS), maintaining the original RNA motif after the start codon. In this way, MSI-1* can also block the RNA component of the 30S ribosomal subunit. Indeed, this is a widespread post-transcriptional regulatory strategy in prokaryotes, as it happens, for example, with the MS2 phage replicase (Babitzke et al., 2009). It is worth to note that the new 5′ UTR remained unstructured. We characterized by bulk fluorometry the dose-response curve of this new system, revealing an 8.6-fold downregulation of sfGFP expression by MSI-1* (Figure 4b, redesign 3; see also Figure 4—figure supplement 1 to appreciate the tight control of MSI-1* expression with the PLlac promoter). This was a substantial increase in performance with respect to the 2.5-fold downregulation of the system shown in Figure 1b. Fitting a Hill equation, we obtained a regulatory coefficient of 86 μM and a Hill coefficient of 4.5 (Appendix 1). While the regulatory coefficient was similar than in the original system (99 μM), the Hill coefficient was significantly higher (compared to 1.7). Interestingly, an apparent cooperativity was established between two MSI-1* proteins by binding to adjacent sites. The dynamics of the system was also represented in the Crick space to highlight the change in translation rate. At the single-cell level, we found a 91% of ON cells in the uninduced state that decreased to 5.3% with 1 mM lactose (Figure 4c). Taken together, our data present MSI-1* as a powerful heterologous translation regulator in bacteria.

The regulatory activity of a Musashi protein in bacteria can be externally controlled by a fatty acid

The ability of proteins to respond to small molecules is instrumental for environmental and metabolic sensing. Previous work revealed that MSI-1 can be allosterically inhibited by ω–9 monounsaturated fatty acids and, in particular, by oleic acid (Clingman et al., 2014), an 18-carbon fatty acid naturally found in various animal and plant oils (e.g., olive oil). Oleic acid binds to the RRM1 domain of MSI-1 and induces a conformational change that prevents RNA recognition (Figure 5a). To gain insight about the interactions between the elements of our system, we performed gel electrophoresis mobility shift assays using the purified MSI-1* protein, the RNA motif as a label-free sRNA molecule, and oleic acid. The different mobility of the nucleic acids upon binding to proteins and the coincident staining capacity of nucleic and fatty acids were exploited. We confirmed the MSI-1*–RNA interaction using a protein concentration gradient in this in vitro setup (Figure 5—figure supplement 1a), and we found that the interaction was completely disrupted in the presence of 1 mM oleic acid (Figure 5b). Furthermore, using an oleic acid concentration gradient, we obtained a half-maximal effective inhibitory concentration of about 0.5 mM (Figure 5—figure supplement 1b).

Figure 5. — (a) Three-dimensional structural schematic of the allosteric regulation. RRM1 of MSI-1 is shown alone, in complex with the RNA motif, and in complex with oleic acid. (b) Gel electrophoresis mobility shift assay to test the allosteric inhibition of MSI-1* with oleic acid. A purified MSI-1* protein (45 μM), the RNA motif as a label-free sRNA molecule (11 μM), and oleic acid (1 mM) were mixed in a combinatorial way in vitro. On the left, nucleic acid-stained gel. On the right, protein-stained gel (Coomassie). The different formed species are indicated. M denotes molecular marker (GeneRuler ultra-low range DNA ladder, 10–300 bp, Thermo). BSA was used as a control. (**c, d**) Probability-based histograms of sfGFP expression from single-cell data for different induction conditions (1 mM lactose or 1 mM lactose + 20 mM oleic acid) for the original system (c) and the redesign-3 system (d), showing statistically significant regulation in both cases (one-way ANOVA, p<10^–4). The insets show the percentages of cells in the ON state (sfGFP expressed), according to a specified threshold, for each condition. (e) On the top, images of *E. coli* colonies harboring pRM1+ and pREP6. Bacteria were seeded in LB-agar plates with suitable inducers (1 mM lactose or 1 mM lactose + 20 mM oleic acid). Fluorescence and bright-field images are shown. On the bottom, schematics of the working modes of the synthetic gene circuit according to the different induction conditions. (f) Images of *E. coli* colonies harboring pRM1+ and pREP7. AU, arbitrary units.

Figure 5—source data 1. Single-cell data of sfGFP.

elife-91777-fig5-data1.xlsx^{(1.3MB, xlsx)}

Figure 5—source data 2. Full gel images.

elife-91777-fig5-data2.zip^{(3.1MB, zip)}

Figure 5—figure supplement 1. — (a) Three-dimensional structural schematic of the allosteric regulation. RRM1 of MSI-1 is shown alone, in complex with the RNA motif, and in complex with oleic acid. (b) Gel electrophoresis mobility shift assay to test the allosteric inhibition of MSI-1* with oleic acid. A purified MSI-1* protein (45 μM), the RNA motif as a label-free sRNA molecule (11 μM), and oleic acid (1 mM) were mixed in a combinatorial way in vitro. On the left, nucleic acid-stained gel. On the right, protein-stained gel (Coomassie). The different formed species are indicated. M denotes molecular marker (GeneRuler ultra-low range DNA ladder, 10–300 bp, Thermo). BSA was used as a control. (**c, d**) Probability-based histograms of sfGFP expression from single-cell data for different induction conditions (1 mM lactose or 1 mM lactose + 20 mM oleic acid) for the original system (c) and the redesign-3 system (d), showing statistically significant regulation in both cases (one-way ANOVA, p<10^–4). The insets show the percentages of cells in the ON state (sfGFP expressed), according to a specified threshold, for each condition. (e) On the top, images of *E. coli* colonies harboring pRM1+ and pREP6. Bacteria were seeded in LB-agar plates with suitable inducers (1 mM lactose or 1 mM lactose + 20 mM oleic acid). Fluorescence and bright-field images are shown. On the bottom, schematics of the working modes of the synthetic gene circuit according to the different induction conditions. (f) Images of *E. coli* colonies harboring pRM1+ and pREP7. AU, arbitrary units.

Figure 5—source data 1. Single-cell data of sfGFP.

elife-91777-fig5-data1.xlsx^{(1.3MB, xlsx)}

Figure 5—source data 2. Full gel images.

elife-91777-fig5-data2.zip^{(3.1MB, zip)}

Subsequently, we assessed the effect of oleic acid over the regulatory activity of MSI-1* expressed in E. coli. This bacterium has evolved a machinery to uptake fatty acids from the environment. FadL and FadD are two membrane proteins that act as transporters, and FadE is the first enzyme that processes the fatty acid via the β-oxidation cycle (Fujita et al., 2007). Because of the high turbidity of the cell culture observed in the presence of oleic acid, we characterized the system by single-cell measurements of sfGFP expression by flow cytometry. In the case of the original system, the percentage of cells in the ON state increased from 10% (with 1 mM lactose) to 49% upon addition of 20 mM oleic acid (Figure 5c; see the 2D probability-based histograms in Figure 5—figure supplement 2). However, the initial 93% of ON cells observed in the absence of lactose was not recovered. Arguably, oleic acid was partially degraded once it entered the cell. Nevertheless, the system implemented with the redesign-3 reporter displayed a better dynamic behavior in response to lactose and oleic acid. In particular, the percentage of cells in the ON state increased from almost 0 (with 1 mM lactose) to 71% upon addition of 20 mM oleic acid (Figure 5d; see also Figure 5—figure supplement 2). In addition, we investigated this allosteric regulation by imaging the fluorescence of bacterial colonies grown in solid medium with different inducers. In stationary phase, FadE and the rest of oxidative enzymes could be saturated with the fatty acids generated from the membrane degradation (Navarro Llorens et al., 2010), oleic acid then having more time to interact with MSI-1*. Notably, we found a substantial inhibition of the repressive action of MSI-1* with 20 mM oleic acid in the case of both systems (Figure 5e and f; see also Figure 5—figure supplement 3). Conclusively, these results illustrate how the plasticity of RRM-containing proteins (e.g., MSI-1) can be exploited to engineer, even in simple organisms, gene regulatory circuits that operate in an integrated way at the transcriptional, translational, and post-translational levels.

Application of a Musashi protein for intra-operon, combinatorial, and noise regulation

Transcription regulation has been engineered in E. coli to end with purposeful and versatile gene expression programs (MacDonald et al., 2021; Nielsen et al., 2016). However, this type of control faces limitations, such as to regulate a specific gene within an operon or to implement a definite combinatorial regulation without a large screening of promoter variants. To show that MSI-1* is instrumental to address these issues and ultimately increase our ability to program gene expression (Figure 6a), a new regulatory circuit was engineered in which sfGFP and mScarlet were both forming a single transcriptional unit (i.e., bicistronic operon) under a synthetic PL-based promoter regulated by the tetracycline repressor protein (TetR; promoter termed PLtet) (Lutz and Bujard, 1997). This allowed controlling the expression of both fluorescent proteins at the transcriptional level with anhydrotetracycline (aTC) in a genetic background overexpressing TetR. Furthermore, an RNA motif for MSI-1 binding was placed in front of sfGFP (Figure 6b). A characterization by bulk fluorometry using lactose (1 mM) and aTC (100 ng/mL) in a combinatorial way showed the specific regulation of sfGFP expression by MSI-1* and the ability to combine signals exploiting transcription and translation regulation (Figure 6c; implementation with the redesign-3 motif due to its enhanced dynamic range). A NIMPLY gate would model the logic behavior of the resulting circuit (i.e., sfGFP would only turn on with aTC and without lactose in the medium). These data also excluded the possibility that MSI-1* operated transcriptionally as a result of spurious DNA targeting.

Figure 6. — (a) Overview of the regulatory utility of MSI-1*. It could (i) regulate the expression of a given enzyme belonging to a polycistronic operon for a metabolic pathway control, (ii) be exploited together with transcription factors to implement combinatorial regulations following the genetic information flow, envisioning biosensing applications, and (iii) regulate noise in protein expression with the aim of producing cell populations less disperse, especially for bacterial delivery applications in animals. (b) Schematic of a new synthetic gene circuit engineered in *E. coli*. MSI-1* was always expressed from the PLlac promoter to be induced with lactose in a genetic background overexpressing LacI. sfGFP and mScarlet were used as reporters, both expressed from the PLtet promoter to be induced with anhydrotetracycline (aTC) in a genetic background overexpressing TetR. In this bicistronic operon, only sfGFP was under the control of a suitable RNA motif recognized by MSI-1* in the leader region of the transcript (original motif or redesign-3). In electronic terms, this circuit implements a NIMPLY logic gate considering sfGFP as the output. System implemented with pRM1+ and pREP6α or pREP7α. (c) Dynamic range of the response using lactose (1 mM) and aTC (100 ng/mL) in a combinatorial way. aTC significantly activated the expression of the operon (Welch’s t-test, two-tailed p<0.05) and lactose, through the action of MSI-1*, significantly downregulated sfGFP expression in a specific way (data for pREP7α; Welch’s t-test, two-tailed p<0.05). Error bars correspond to standard deviations (n = 3). (d) Probability-based histograms of sfGFP expression from single-cell data for different inducer concentrations. On the left, pRM1+ and pREP6α with 100 ng/mL aTC and 1 mM lactose (i) or 15 ng/mL aTC (ii). On the right, pRM1+ and pREP7α with 100 ng/mL aTC and 1 mM lactose (iii) or 30 ng/mL aTC (iv). The mean expression and Fano factor are shown. AU, arbitrary units.

Figure 6—source data 1. Bulk fluorescence data of sfGFP and mScarlet and single-cell data of sfGFP.

elife-91777-fig6-data1.xlsx^{(142.4KB, xlsx)}

Figure 6—figure supplement 1. — (a) Overview of the regulatory utility of MSI-1*. It could (i) regulate the expression of a given enzyme belonging to a polycistronic operon for a metabolic pathway control, (ii) be exploited together with transcription factors to implement combinatorial regulations following the genetic information flow, envisioning biosensing applications, and (iii) regulate noise in protein expression with the aim of producing cell populations less disperse, especially for bacterial delivery applications in animals. (b) Schematic of a new synthetic gene circuit engineered in *E. coli*. MSI-1* was always expressed from the PLlac promoter to be induced with lactose in a genetic background overexpressing LacI. sfGFP and mScarlet were used as reporters, both expressed from the PLtet promoter to be induced with anhydrotetracycline (aTC) in a genetic background overexpressing TetR. In this bicistronic operon, only sfGFP was under the control of a suitable RNA motif recognized by MSI-1* in the leader region of the transcript (original motif or redesign-3). In electronic terms, this circuit implements a NIMPLY logic gate considering sfGFP as the output. System implemented with pRM1+ and pREP6α or pREP7α. (c) Dynamic range of the response using lactose (1 mM) and aTC (100 ng/mL) in a combinatorial way. aTC significantly activated the expression of the operon (Welch’s t-test, two-tailed p<0.05) and lactose, through the action of MSI-1*, significantly downregulated sfGFP expression in a specific way (data for pREP7α; Welch’s t-test, two-tailed p<0.05). Error bars correspond to standard deviations (n = 3). (d) Probability-based histograms of sfGFP expression from single-cell data for different inducer concentrations. On the left, pRM1+ and pREP6α with 100 ng/mL aTC and 1 mM lactose (i) or 15 ng/mL aTC (ii). On the right, pRM1+ and pREP7α with 100 ng/mL aTC and 1 mM lactose (iii) or 30 ng/mL aTC (iv). The mean expression and Fano factor are shown. AU, arbitrary units.

Figure 6—source data 1. Bulk fluorescence data of sfGFP and mScarlet and single-cell data of sfGFP.

elife-91777-fig6-data1.xlsx^{(142.4KB, xlsx)}

In addition, we analyzed how MSI-1* regulated noise in protein expression monitoring green fluorescence in single cells. Inducing the circuit of Figure 6b with 100 ng/mL aTC and 1 mM lactose produced almost the same mean expression level than with an intermediate aTC concentration (15 ng/mL when the implementation was with the original motif and 30 ng/mL when it was with the redesign-3 motif). However, the resulting unimodal distributions displayed different dispersions, lower when MSI-1* was not repressed. The Fano factor (the ratio between variance and mean) (Sanchez et al., 2013) was used to quantify the responses, finding reductions of 35 and 65% depending on the implementation (Figure 6d). Furthermore, for the circuit of Figure 1b, we found a 38% lower Fano factor when inducing with 1 mM lactose and 20 mM oleic acid than with 0.1 mM lactose, despite having similar mean expression levels (Figure 6—figure supplement 1). Of note, the response sensitivity was dominated by transcription regulation when the PL-based promoter was induced with an intermediate concentration of lactose (0.1 mM) or aTC (15–30 ng/mL). By contrast, the response sensitivity was dominated by translation regulation when the PL-based promoter was fully induced (1 mM lactose or 100 ng/mL aTC), thereby controlling the heterogeneity of the response (Dolcemascolo et al., 2022). Overall, these results illustrate the utility of repurposed mammalian RNA-binding proteins in bacteria for a fine expression control.

Discussion

The successful incorporation of the mammalian MSI-1 protein as a translation factor in E. coli highlights, in first place, the versatility of RRM-containing proteins to function as specific post-transcriptional regulators in any living cell, from prokaryotes to eukaryotes. Our data show that the protein–RNA association phase is very fast, which is suitable for regulation even in cellular contexts in which RNA molecules are short-lived, such as in E. coli (Bernstein et al., 2002). Nonetheless, it is important to stress that the kinetic parameters in vivo might differ from those measured in vitro due to off-target bindings and crowding effects (Hammar et al., 2012). Moreover, our data show that a downregulation of translation rate up to 8.6-fold can be achieved, with an appropriate design of the target mRNA leader region, and that the engineered cell can sense oleic acid from the environment. Here, the C-terminal low-complexity domain of the native MSI-1 was discarded to create MSI-1* (Iwaoka et al., 2017), in order to increase solubility, even though this domain might contribute to RNA binding (Järvelin et al., 2016). Further work should be conducted to enhance the fold change of the regulatory module and engineer complex circuits with it.

Interestingly, proteins associated with clustered regularly interspaced short palindromic repeats (CRISPR), which belong to the prokaryotic immune system, contain distorted RRM versions (Koonin and Makarova, 2013). Some CRISPR proteins might have evolved, for example, from an ancestral RRM-based (palm) polymerase after duplications, fusions, and diversification. Noting that the palm domain indeed presents an RRM-like fold (Anantharaman et al., 2010), we hypothesize that a boost of functionally diverse RRM-containing proteins took place once the polymerases were confined into the nucleus, as the pressure for efficient replication was relieved in the cytoplasm, which would provide a rationale on the unbalance noticed between eukaryotes and prokaryotes (Maris et al., 2005; Koonin et al., 2020).

In second place, our results pave the way for engineering more complex circuits in bacteria with plastic and orthogonal RNA-binding proteins, such as MSI-1, capable of signal multiplexing. Nature is a formidable reservoir of functional genetic material sculpted by evolution that can be exploited to (re)program specific living cells (Khalil and Collins, 2010). However, to overcome biological barriers, transgenes usually come from related organisms or cognate parasites at the cost of limiting the potential engineering. Therefore, efforts to borrow functional elements from highly diverse organisms are suggestive (e.g., regulatory proteins from mammals to bacteria), with the ultimate goal of developing industrial or biomedical applications.

Notably, advances in synthetic biology have pushed the bioproduction of a wide variety of compounds in bacteria as a result of a better ability to fine-tune enzyme expression (Choi et al., 2019). Translation regulation is instrumental to this end because in multiple cases different enzymes are expressed from the same transcriptional unit (i.e., operon). Previous work exploited regulatory RNAs for such a tuning (Na et al., 2013), but the use of RNA-binding proteins as translation factors is also appealing. We envision the application of MSI-1* as a genetic tool for metabolic engineering. The additional use of RNA-binding proteins able to alter mRNA stability might lead to the implementation of more complex circuits at the post-transcriptional level. Furthermore, MSI-1* is able to respond to fatty acids, which are ideal precursors of potential biofuels due to their long hydrocarbon chains. In particular, biofuel in the form of fatty acid ethyl ester, whose bioproduction in E. coli can be optimized by reengineering the regulation of the β-oxidation cycle with the allosteric transcription factor FadR (Zhang et al., 2012). Arguably, MSI-1* might be used in place of or in combination with FadR for subsequent developments. However, engineering regulatory circuits for efficient bioproduction is not evident in general as the enzymatic expression levels may require fine-tuning, so systems-level mathematical models need to be considered for design along with a wide genetic toolkit for implementation (Choi et al., 2019). We anticipate that other animal RRM-containing proteins might be repurposed in E. coli as translation factors. Moreover, protein design might be used to reengineer MSI-1* in order to respond to new ligands, maintaining high specificity and affinity for a particular RNA sequence, as previously done with the transcription factor LacI (Taylor et al., 2016).

In addition, the Musashi protein family is of clinical importance, as in humans it is involved in different neurodegenerative disorders (e.g., Alzheimer’s disease) and some types of cancer (Fox et al., 2015; Montalbano et al., 2020; Kang et al., 2017). Therefore, the development of simple genetic systems from which to test protein mutants, potential target mRNAs, decoying RNA aptamers, and inhibitory small molecules in a systematic manner is very relevant. Furthermore, isolating human regulatory elements would help to filter out indirect effects that likely occur in the natural context. This might lead to new therapeutic opportunities. Nevertheless, one limitation of using E. coli as a chassis is that some post-translational modifications (PTMs) may be lost, thereby compromising the functionality of the expressed proteins (Sahdev et al., 2008). Fortunately, there are metabolic engineering efforts devoted to implement eukaryotic PTM pathways in E. coli, such as the glycosylation pathway (Valderrama-Rincon et al., 2012).

In conclusion, the functionalization of RRM-containing proteins in bacteria offers exciting prospects, especially as more information becomes available on how individual RRM domains bind to precise RNA sequences, interact with further protein domains, and respond to small molecules through allosteric effects. This work illustrates how synthetic biology, through the rational assembly of heterologous genes and designer cis-regulatory elements into circuits, is useful to generate knowledge about the application range of a fundamental type of proteins in nature.

Materials and methods

Strains, plasmids, and reagents

E. coli Dh5α was used for cloning purposes following standard procedures. To express our genetic circuit for functional characterization, E. coli MG1655-Z1 cells (lacI⁺, tetR⁺) were used. This strain was co-transformed with two plasmids, called pRM1+ (KanR, pSC101-E93R ori; leading to ~230 copies/cell) (Peterson and Phillips, 2008) and pREP6 (CamR, p15A ori; leading to ~15 copies/cell). On the one hand, pRM1+ was obtained by cloning a truncated coding region of the M. musculus MSI-1 protein (the first 192 amino acids, containing the two RRMs; UniProt #Q61474; termed MSI-1*). This gene was under the transcriptional control of the inducible promoter PLlac. On the other hand, pREP6 was obtained by cloning the coding region of sfGFP with an RNA sequence motif recognized by MSI-1. The coding region of mScarlet was also present in the plasmid. These two genes were under the control of the constitutive promoter J23119 in two different transcriptional units. In addition to the original RNA sequence motif, five point-mutated sequences were designed and cloned in pREP6. Additional RNA sequence motifs were cloned in front of sfGFP for control experiments (the resulting plasmids were named pREP4, pREP4b, pREP4b3x, and pREP7). In particular, pREP4b3x incorporates three RNA motifs in tandem after the start codon, and pREP7 has two RU_nAGU repeats flanking the RBS and a full RNA motif after the start codon. Additional reporter plasmids were constructed using the inducible promoter PLtet to assess the intra-operon regulation, the implementation of combinatorial regulation, and the buffering of expression noise (the resulting plasmids were named pREP6α and pREP7α). Suitable genetic cassettes to obtain the final constructions were synthesized by IDT. Appendix 5 lists all plasmids used in this work. Appendix 6 presents the nucleotide sequences of the different genetic elements.

To perform the dynamic assays with LigandTracer (Ridgeview), E. coli BL21(DE3) cells (lacI⁺, T7pol⁺) were used. This strain was also co-transformed with pRM1+ and pREP6. To purify a recombinant Musashi protein, E. coli BL21-Gold(DE3) cells (lacI⁺, T7pol⁺) were used. A truncated coding region of the human MSI-1 protein (the first 200 amino acids; UniProt #O43347; termed MSI-1_h*) was cloned under the control of a T7pol promoter into the plasmid pET29b (KanR, pUC ori).

Luria-Bertani (LB) medium was used for the overnight cultures and M9 minimal medium (1× M9 minimal salts, 2 mM MgSO₄, 0.1 mM CaCl₂, 0.05% thiamine, 0.05% casamino acids, and 1% glycerol or 0.4% glucose) for the characterization cultures. M9-glucose medium was only used for real-time fluorescence quantification in liquid medium with IPTG. LB-agar was used for real-time fluorescence quantification in solid medium. Kanamycin and chloramphenicol were used at a concentration of 50 μg/mL and 34 μg/mL, respectively. Lactose and IPTG were used as the inducers of the system (controlling the expression of MSI-1* in E. coli) at a concentration of 5, 10, 20, 50, 100, 200, 500, or 1000 μM. aTC was also used to induce the modified systems with PLtet at a concentration of 15, 30, or 100 ng/mL. Oleic acid was used as the allosteric inhibitor of MSI-1* at a concentration of 20 mM in the in vivo assays (both in liquid and solid medium). In the in vitro assays, oleic acid was used at a concentration of 0.01, 0.1, 0.2, 0.5, 0.7, 1, 1.5, or 2 mM. It was neutralized with NaOH and used in a medium containing 0.5% tergitol NP-40. Compounds were provided by Merck.

Bulk fluorometry

Cultures (2 mL) inoculated from single colonies (three replicates) were grown overnight in LB medium with shaking (220 rpm) at 37 °C. Cultures were then diluted 1:100 in fresh M9 medium (200 μL) with the appropriate inducer (lactose, IPTG, and/or aTC). The microplate (96 wells, black, clear bottom; Corning) was incubated with shaking (1300 rpm) at 37 °C up to 8–10 h (to reach an OD₆₀₀ around 0.5–0.7). At different times, the microplate was assayed in a Varioskan Lux fluorometer (Thermo) to measure absorbance (600 nm), green fluorescence (excitation: 485 nm, emission: 535 nm), and red fluorescence (excitation: 570 nm, emission: 610 nm). To characterize the time-course response of the system, cultures were grown to exponential phase and then diluted before adding the inducer (to minimize the response lag). Mean background values of absorbance and fluorescence, corresponding to M9 medium, were subtracted to correct the signals. Normalized fluorescence was calculated as the slope of the linear regression between fluorescence and absorbance (assuming fluorophore maturation faster than cell doubling time and no proteolytic degradation) (Leveau and Lindow, 2001). The mean value of normalized fluorescence corresponding to non-transformed cells was then subtracted to obtain a final estimate of expression. In addition, cell growth rate was calculated as the slope of the linear regression between the logarithm of background-subtracted absorbance and time in the exponential phase.

Real-time fluorescence quantification in solid medium

Cultures (2 mL) inoculated from single colonies (three replicates) were grown overnight in LB medium with shaking (220 rpm) at 37 °C. The overnight culture was plated (15 μL) in areas A and D of a MultiDish 2x2 plate (Ridgeview) coated with LB-agar. IPTG was added in areas A and B of the dish at the final concentration of 1 mM. Area C was kept free of cells/inducers as a reference. The dish was then placed in the rotating support of the LigandTracer instrument (Ridgeview) and incubated at 37 °C for 24 h. The fluorescence from sfGFP and mScarlet was quantified with time in the seeded areas of the dish using the BlueGreen (excitation: 488 nm, emission: 535 nm) and OrangeRed (excitation: 568 nm, emission: 620 nm) detectors. The readouts of the opposite parts of the dish were subtracted to correct the signals.

Flow cytometry

Cultures (2 mL) inoculated from single colonies (three replicates) were grown overnight in LB medium with shaking (220 rpm) at 37 °C. Cultures were then diluted 1:100 in fresh LB medium (200 μL) to load a microplate (96 wells, black, clear bottom; Corning) with the appropriate concentrations of lactose (0, 100, 1000 μM), oleic acid (0, 20 mM), and/or aTC (15, 30, 100 ng/mL). The microplate was then incubated with shaking (1300 rpm) at 37 °C until cultures reached a sufficient OD₆₀₀. Cultures (6 μL) were then diluted in PBS (1 mL). Fluorescence was measured in an LSRFortessa flow cytometer (BD) using a 488 nm laser and a 530 nm filter for green fluorescence. Events were gated by using the forward and side scatter signals and compensated (~10⁴ events after this process). The mean value of the autofluorescence of the cells was subtracted to obtain a final estimate of expression. Data analysis was performed with MATLAB (MathWorks).

Purification of a Musashi protein

Cells were grown in LB medium with shaking at 37 °C until OD₆₀₀ reached 0.6–0.8. Subsequently, the expression of MSI-1_h* was induced with 0.5 mM IPTG. Cells were incubated at 37 °C for 4 h and harvested by centrifugation at 7500 rpm for 15 min at 4 °C. The cell pellet was resuspended in a lysis buffer (50 mM Tris–HCl, pH 8.0, 500 mM NaCl, 10% glycerol, with protease inhibitor cocktail), ruptured by sonication, and separated by centrifugation at 30,000 rpm for 35 min at 4 °C. The soluble fraction was collected and treated with a 5% polyethylenimine solution in order to remove DNA/RNA attached to the protein. Resuspension of the protein was done in 20 mM Tris–HCl, pH 9.0, with protease inhibitor cocktail. Soluble protein was filtered with a 0.22 μm membrane and purified by ion-exchange chromatography using an Anion exchange Q FF 16/10 column previously equilibrated in alkaline buffer. The protein was collected on the flow-through. The protein was filtered and further purified to homogeneity by size-exclusion chromatography using a Hi load 26/60 Superdex 75 pg column previously equilibrated in alkaline buffer with NaCl. The purified fractions were collected and buffer exchange chromatography was performed using a HiPrep 26/10 Desalting column previously equilibrated with the final buffer (20 mM MES, pH 6.0, 100 mM NaCl, 0.5 mM EDTA, with protease inhibitor cocktail). Purification performed at Giotto.

Binding kinetics assays of protein–RNA interactions

Binding experiments of the purified MSI-1_h* protein against different RNA ligands were performed using the switchSENSE proximity sensing technology (Cléry et al., 2017; Langer et al., 2013) and a suitable adapter chip on the heliX biosensor platform (Dynamic Biosensors). The adapter chip consists of a microfluidic channel with two gold electrodes functionalized with fluorophore-decorated DNA nanolevers that serve as linkers between the gold surface and the ligand of interest. A constant negative voltage is applied to the electrodes to keep the DNA nanolevers in an upright position. Binding between the injected analyte (MSI-1_h*) and the ligand attached to the sensor surface (RNA) leads to the alteration of the chemical surrounding of the dye, which results in a fluorescence change. Fluorescence change of the dye in real time describes the binding kinetics of the molecule of interest. Kinetic experiments consisted of a protein association phase (5 min) and a dissociation phase (15 min) in which the chip was rinsed with a buffer (50 mM Tris–HCl, 0.5 mM EDTA, 140 mM NaCl, 0.05% Tween 20, 1 mM TCEP, pH 7.2). A flow rate of 100 µL/min was applied and a sampling rate of 1 Hz was used.

Six different RNA ligands (original and five mutants) were attached to the 5′ end of a generic 48 nt DNA ligand strand, which is part of the DNA linker system on the heliX adapter chip surface. All oligonucleotides were synthesized by Ella Biotech. The ligand strand was hybridized with an adapter strand carrying the fluorophore. Different fluorophores were tested toward their sensitivity for protein–RNA interactions. The green fluorophore Gb showed the most significant signal change. The other half of the adapter strand is complementary to a DNA anchor strand, which is pre-attached to the chip surface. The immobilization of the RNA used a standard functionalization procedure on the heliX device. Kinetic rate constants and affinities were obtained by fitting the experimental data with theoretical binding models implemented in the heliOS software (Dynamic Biosensors). Exponential decay models were used. As a negative control to check for unspecific protein–RNA binding, the single-strand RNA sequence CGGCGCCGC was used (without any binding motif). All data were referenced with a blank run and with the negative control.

RT-qPCR

Cultures (2 mL) inoculated from single colonies (three replicates) were grown overnight in LB medium with shaking (220 rpm) at 37 °C. Cultures were then diluted 1:100 in fresh LB medium (2 mL) with the appropriate inducer (lactose) and were grown until OD₆₀₀ reached 0.6–0.8. Then, 500 µL of each culture was mixed with RNAprotect Bacteria Reagent (QIAGEN). Subsequently, RNA extraction was carried out with the RNeasy kit (QIAGEN), choosing the enzymatic lysis and proteinase K digestion of bacteria (recommended for Gram-negative bacteria grown in complex media). The eluted RNA sampled were quantified using a NanoDrop spectrophotometer (Thermo).

The TaqPath 1‐step RT‐qPCR master mix, CG was used. Then, 1 µL of sample was mixed with 500 nM of forward and reverse primers, 250 nM of ssDNA probe, and 5 µL of the master mix for a total volume of 20 µL (adjusted with RNase-free water) in a fast microplate (Applied). Two independent mixes were prepared, one for targeting sfGFP and another for the E. coli b3500 gene, which was employed as the reference gene. Reactions were performed in a QuantStudio 3 equipment (Thermo) with this protocol: incubation at 25 °C for 2 min for uracil-N glycosylation, followed by 50 °C for 15 min for RT (reverse transcription), followed by an inactivation step at 95 °C for 2 min, then followed by 40 cycles of amplification at 95 °C for 3 s and 60 °C for 30 s.

Gel electrophoresis

Mobility shift assays with the purified MSI-1_h* protein and its cognate RNA motif were performed. The RNA motif was generated by in vitro transcription with the TranscriptAid T7 high yield transcription kit (Thermo) from a DNA template. It was then purified using the RNA clean and concentrator column (Zymo) and quantified in a NanoDrop spectrophotometer (Thermo). Bovine serum albumin (BSA) was used as a control protein (at 30 μM). Reactions with different combinations of elements were prepared (MSI-1_h* at 45 μM, RNA at 11 μM, and oleic acid at 1 mM). Reactions with concentration gradients of MSI-1_h* (from 0 to 45 μM) and oleic acid (from 0 to 2 mM) were also performed. Reactions were incubated for 30 min at 37 °C. Reaction volumes were then loaded in 3% agarose gels prepared with 0.5× TBE and stained using RealSafe (Durviz). Gels ran for 45 min at room temperature applying 110 V. The GeneRuler ultra-low-range DNA ladder (10–300 bp, Thermo) was used. This staining served to reveal the RNA and oleic acid (free or in complex with the MSI-1_h* protein) (Perea and Greenbaum, 2020; Fessenden-Raden, 1972). In addition, gels were soaked for 10 min in the Coomassie blue stain (Fisher) at room temperature with shaking to reveal the proteins. Gels were then soaked in a destaining solution overnight to remove the excess of blue stain. Pictures were taken with the Imager2 gel documentation system (VWR).

Microscopy

LB-agar plates seeded with E. coli MG1655-Z1 cells co-transformed with pRM1+ and pREP6 or pREP7 were grown overnight at 37 °C. Lactose (1 mM) and oleic acid (20 mM) were used as supplements. The plates were irradiated with blue light and images were acquired with a 2.8 Mpixel camera with a filter for green fluorescence in a light microscope (Leica MSV269). The commercial software provided by Leica was used to adjust the visualization of the differential fluorescence among plates. The fluorescence intensity of the colonies was quantified with Fiji (Schindelin et al., 2012).

Mathematical modeling

On the one hand, Hill equations were used to empirically model sfGFP expression with lactose/IPTG, eBFP2 expression with lactose, and sfGFP expression with eBFP2 expression (see Appendix 1 for details). On the other hand, a system of ordinary differential equations was developed to model the dynamic response of the synthetic gene circuit from a bottom-up approach. The system accounted for the intracellular mRNA and protein concentrations, considering a scenario of equilibrium to model both LacI-DNA and MSI-1*-RNA binding (see Appendix 3 for details). Parameter values were obtained by nonlinear fitting against our experimental data.

Molecular visualization in silico

The RMM1 of MSI-1 protein structure determined by nuclear magnetic resonance was downloaded from the UniProt database (https://www.uniprot.org/; Bairoch et al., 2005). A 3D structure of the RNA motif subsequence involving the two RU_nAGU repeats was predicted with the RNAComposer software (Popenda et al., 2012). The oleic acid molecule was downloaded from the ChemSpider database (https://www.chemspider.com/). All the molecules were loaded, visualized, colored, trimmed (where necessary), and manually docked using the open-source PyMol software (Schrödinger; pymol.org).

Resources availability

The sequences of all genetic elements used in this work are presented in the Appendixes. Plasmids available upon request to the corresponding author.

Acknowledgements

We thank M Sattler (TUM) for useful discussions. This work was supported by the grants H2020-MSCA-ITN-2018 #813239 (RNAct) from the European Commission and PGC2018-101410-B-I00 (SYSY-RNA) from the Spanish Ministry of Science and Innovation (co-financed by the European Regional Development Fund). RD, APR, RAHR, and GPR acknowledge each a Marie Curie fellowship. LG was supported by a predoctoral fellowship from the Valencia Regional Government (ACIF/2021/183).

Appendix 1

The repression of sfGFP as a function of lactose is modeled by the following Hill equation (Weiss, 1997)

[sfGFP] = \frac{A_{1}}{1 + {(\frac{[Lactose]}{K_{1}})}^{n_{1}}} + B_{1},

where $K_{1}$ is the regulatory coefficient, $n_{1}$ the Hill coefficient, $A_{1} + B_{1}$ the maximal expression level, and $B_{1}$ the basal expression level at full repression. In the case of sfGFP, its concentration is given by the normalized green fluorescence signal in arbitrary units (AU). The adjusted parameter values are $A_{1} = 90.7$ AU, $B_{1} = 62.1$ AU, $K_{1} = 99.1 μ$ M, and $n_{1} = 1.70$ (Figure 1c).

We used the very same equation to model the dynamic response of the system implemented with pREP7. In this case, the adjusted parameter values are $A_{1} = 289.9$ AU, $B_{1} = 39.4$ AU, $K_{1} = 85.7 μ$ M, and $n_{1} = 4.45$ (Figure 4b).

Also, the following Hill equation models the repression of sfGFP by IPTG

[sfGFP] = \frac{A_{2}}{1 + {(\frac{[IPTG]}{K_{2}})}^{n_{2}}} + B_{2} .

The adjusted parameter values are $A_{2} = 76.4$ AU, $B_{2} = 52.3$ AU, $K_{2} = 71.3 μ$ M, and $n_{2} = 2.28$ (Figure 3—figure supplement 1a).

In addition, the activation of eBFP2, proxy of MSI-1*, as a function of lactose is modeled by the following Hill equation:

[MSI-1*] \propto [eBFP2] = \frac{A_{3} {(\frac{[Lactose]}{K_{3}})}^{n_{3}}}{1 + {(\frac{[Lactose]}{K_{3}})}^{n_{3}}} + B_{3},

where $K_{3}$ is the regulatory coefficient, $n_{3}$ the Hill coefficient, $A_{3} + B_{3}$ the maximal expression level, and $B_{3}$ the basal expression level with no activation. In the case of eBFP2, its concentration is given by the normalized blue fluorescence signal in AU. The adjusted parameter values are $A_{3} = 23.1$ AU, $B_{3} = 1.88$ AU, $K_{3} = 359 μ$ M, and $n_{3} = 2.81$ (Figure 1d).

Finally, the following Michaelis equation (a particular case of the Hill equation when there is no cooperativity)

[sfGFP] = \frac{A_{4}}{1 + \frac{[eBFP2]}{K_{4}}}

defines the engineered regulation between MSI-1* (given by eBFP2) and sfGFP. Here, no basal expression level is considered. The adjusted parameter values are $A_{4} = 165$ AU and $K_{4} = 10.2$ AU (Figure 1d).

Appendix 2

The fold change in protein expression can be calculated from the fundamental parameters that model the regulatory system, such as the association rate of the regulator to the nucleic acid ( $k_{ON}$ ), the dissociation rate ( $k_{OFF}$ ), the concentration of the regulator in the cell ( $R$ ), and the degradation rate of the nucleic acid ( $δ$ ). If we denote by $A_{0}$ the concentration of free nucleic acid, by $A_{R}$ the concentration of nucleic acid with the regulator bound, and by $P$ the concentration of the regulated protein, we can write

\frac{d A_{0}}{d t} = α - k_{ON} R A_{0} + k_{OFF} A_{R} - δ A_{0}

\frac{d A_{R}}{d t} = k_{ON} R A_{0} - k_{OFF} A_{R} - δ A_{R}

\frac{d P}{d t} = β A_{0} + ε β A_{R} - μ P,

where $α$ is the synthesis rate of the nucleic acid, $β$ the synthesis rate of the protein, and $ε$ the leakage fraction of protein synthesis when the regulator is bound. Note that in steady state $A_{0 \infty} + A_{R \infty} = \frac{α}{δ}$ .

If $R = 0$ , then $P_{\infty} = \frac{α β}{δ μ}$ (steady state). If $R > 0$ , then $P_{\infty} = \frac{α β}{δ μ} (\frac{ε k_{ON} R + k_{OFF} + δ}{k_{ON} R + k_{OFF} + δ})$ . Therefore, it turns out that

fold = \frac{P_{\infty} (R = 0)}{P_{\infty} (R > 0)} = \frac{k_{ON} R + k_{OFF} + δ}{ε k_{ON} R + k_{OFF} + δ} .

Importantly, this model can be applied either to transcription regulation or translation regulation. The main difference is that in the case of transcription the nucleic acid targeted by the regulator (DNA) is stable (we can model this as $δ = μ$ , and then set $δ ≃ 0$ in the fold change equation), while in the case of translation, the nucleic acid targeted by the regulator (mRNA) is unstable ( $δ ≫ μ$ ).

Appendix 3

The system of ordinary differential equations (ODEs) that governs the dynamics of the engineered circuit, considering the intracellular concentrations of mRNAs and proteins (Rodrigo et al., 2011), reads

\frac{d [m R N A_{MSI-1*}]}{d t} = α_{x} (\frac{ρ_{x} + {(\frac{[L a c t o s e]}{θ_{x}})}^{n_{x}}}{1 + {(\frac{[L a c t o s e]}{θ_{x}})}^{n_{x}}}) - δ [m R N A_{MSI-1*}]

\frac{d [MSI-1*]}{d t} = β_{x} [m R N A_{MSI-1*}] - μ [MSI-1*]

\frac{d [m R N A_{s f G F P}]}{d t} = α_{y} - δ [m R N A_{s f G F P}]

\frac{d [sfGFP]}{d t} = β_{y} (\frac{1}{1 + \frac{[MSI-1*]}{θ_{y}}}) [{mRNA}_{sfGFP}] - μ [sfGFP],

where $α_{x}$ is the maximal transcription rate of the msi-1* gene, $α_{y}$ the maximal transcription rate of the sfGFP gene, $β_{x}$ the maximal translation rate of msi-1*, $β_{y}$ the maximal translation rate of sfGFP, $δ$ the mRNA degradation rate (assumed equal for the msi-1* and sfGFP genes), $ρ_{x}$ the repression fold of LacI at the transcriptional level, $θ_{x}$ the effective dissociation constant between LacI and lactose, $n_{x}$ the effective binding cooperativity of LacI, $θ_{y}$ the effective dissociation constant between MSI-1* and the RNA motif in the sfGFP gene, and μ the bacterial growth rate.

The analytical solution of this system of ODEs can be obtained through the use of the Laplace transform (Bracewell, 2000) and reads

[{mRNA}_{MSI-1*}] (t) = \frac{α_{x}}{δ} (\frac{ρ_{x} + {(\frac{[Lactose]}{θ_{x}})}^{n_{x}}}{1 + {(\frac{[Lactose]}{θ_{x}})}^{n_{x}}}) (1 - e^{- δ t}) + {[{mRNA}_{MSI-1*}]}_{0} e^{- δ t}

\begin{array}{ll} [MSI-1*] (t) = β_{x} \int_{0}^{t} e^{- μ (t - τ)} [{mRNA}_{MSI-1*}] (τ) d τ + {[MSI-1*]}_{0} e^{- μ t} ≃ \\ ≃ \frac{β_{x}}{μ} {[{mRNA}_{MSI-1*}]}_{\infty} (1 - e^{- μ t}) + {[MSI-1*]}_{0} e^{- μ t} \end{array}

[{mRNA}_{sfGFP}] (t) = \frac{α_{y}}{δ} (1 - e^{- δ t}) + {[{mRNA}_{sfGFP}]}_{0} e^{- δ t}

\begin{array}{ll} [sfGFP] (t) = β_{y} \int_{0}^{t} e^{- μ (t - τ)} (\frac{[{mRNA}_{sfGFP}] (τ)}{1 + \frac{[MSI-1*] (τ)}{θ_{y}}}) d τ + {[sfGFP]}_{0} e^{- μ t} ≃ \\ ≃ \frac{α_{y} β_{y}}{δ} \int_{0}^{t} \frac{e^{- μ (t - τ)}}{1 + \frac{[MSI-1*] (τ)}{θ_{y}}} d τ + {[sfGFP]}_{0} e^{- μ t}, \end{array}

where to perform the approximations $δ ≫ μ$ is considered (quasi-steady state scenario).

Then, in the steady state, we have

{[{mRNA}_{MSI-1*}]}_{\infty} = \frac{α_{x}}{δ} (\frac{ρ_{x} + {(\frac{[Lactose]}{θ_{x}})}^{n_{x}}}{1 + {(\frac{[Lactose]}{θ_{x}})}^{n_{x}}})

{[MSI-1*]}_{\infty} = \frac{α_{x} β_{x}}{δ μ} (\frac{ρ_{x} + {(\frac{[Lactose]}{θ_{x}})}^{n_{x}}}{1 + {(\frac{[Lactose]}{θ_{x}})}^{n_{x}}})

{[{mRNA}_{sfGFP}]}_{\infty} = \frac{α_{y}}{δ}

{[sfGFP]}_{\infty} = \frac{α_{y} β_{y}}{δ μ} (\frac{1}{1 + \frac{α_{x} β_{x}}{δ μ θ_{y}} (\frac{ρ_{x} + {(\frac{[Lactose]}{θ_{x}})}^{n_{x}}}{1 + {(\frac{[Lactose]}{θ_{x}})}^{n_{x}}})}) .

From the growth curves, we calculated $μ = 0.8$ h⁻¹. Knowing that in E. coli the average half-life of mRNA is 5 min (Bernstein et al., 2002), we set $δ = 0.14$ min⁻¹. Using our experimental data, the adjusted parameter values are $\frac{α_{x} β_{x}}{θ_{y}} = 13$ h⁻², $α_{y} β_{y} = 17$ AU/h², $ρ_{x} = 0.075$ , $θ_{x} = 150 μ$ M, and $n_{x} = 1.5$ (Figure 3e–g).

Appendix 4

The number of cells ( $N$ ) in a bacterial culture with time can be described by a logistic function (Peleg and Corradini, 2011) as

N (t) = \frac{N_{max}}{1 + e^{- μ (t - ψ)}},

where $N_{max}$ is the maximal capacity of the medium, μ the bacterial growth rate, and $ψ$ the delay of the response (or the time at which the culture reaches half of the capacity).

In our experimental system, the constitutive expression of mScarlet may be used to estimate the total number of cells. Indeed, the absolute red fluorescence level ( $Σ mScarlet$ ) may be assumed proportional to $N$ . Thus, we may write

Σ mScarlet (t) = \frac{Σ {mScarlet}_{max}}{1 + e^{- μ (t - ψ)}}

Σ sfGFP (t) = [sfGFP] (t) \cdot Σ mScarlet (t) .

Using our experimental data, the adjusted parameter values are $Σ {mScarlet}_{max} = 13.9$ AU in the case of no induction, $Σ {mScarlet}_{max} = 12.5$ AU when induced with 1 mM lactose, $μ = 0.8$ h⁻¹, and $ψ = 6.5$ h (Figure 3d). $[sfGFP] (t)$ was calculated as described in Appendix 3.

With the time-dependent experimental data in solid media (from LigandTracer), the adjusted parameter values are $μ = 0.0156$ min⁻¹, $ψ = 513$ min, $Σ {mScarlet}_{max} = 1035$ AU, and $[sfGFP] = 6.23$ AU in the case of no induction, and $μ = 0.0111$ min⁻¹, $ψ = 630$ min, $Σ {mScarlet}_{max} = 821$ AU, and $[sfGFP] = 2.42$ AU when induced with 1 mM IPTG (Figure 3—figure supplement 2). In this case, for simplicity, we considered a quasi-steady state scenario, setting constant the sfGFP expression. Moreover, we noticed a delay of about 100 min ( $= ν$ ) between the mScarlet and sfGFP expressions, so the equation $Σ sfGFP (t) = [sfGFP] \cdot Σ mScarlet (t + ν)$ was used instead to fit the data.

Appendix 5

List of plasmids used in this work.

Name	Insert features	Backbone features	Reference
pRM1+	PLlac:msi-1*	KanR, pSC101(E93R) ori	This work
pRM0	void	KanR, pSC101(E93R) ori	This work
pRKFR2	PLlac:eBFP2	KanR, pSC101(E93K) ori	Dolcemascolo et al., 2022
pREP6	J23119:sfGFP (with RNA motif for MSI-1* binding)	CamR, p15A ori	This work
pREP6-mut1	J23119:sfGFP (with mutated RNA motif for MSI-1* binding)	CamR, p15A ori	This work
pREP6-mut2	J23119:sfGFP (with mutated RNA motif for MSI-1* binding)	CamR, p15A ori	This work
pREP6-mut3	J23119:sfGFP (with mutated RNA motif for MSI-1* binding)	CamR, p15A ori	This work
pREP6-mut4	J23119:sfGFP (with mutated RNA motif for MSI-1* binding)	CamR, p15A ori	This work
pREP6-mut5	J23119:sfGFP (with mutated RNA motif for MSI-1* binding)	CamR, p15A ori	This work
pREP7	J23119:sfGFP (with RNA motif for MSI-1* binding and consensus sequences within RBS)	CamR, p15A ori	This work
pREP4	J23119:sfGFP (with minimal RNA motif for MSI-1* binding)	CamR, p15A ori	This work
pREP4b	J23119:sfGFP (with less structured RNA motif for MSI-1* binding)	CamR, p15A ori	This work
pREP4b3x	J23119:sfGFP (with 3× less structured RNA motifs for MSI-1* binding)	CamR, p15A ori	This work
pGio	T7p:msi-1_h*	KanR, pUC ori	This work
pREP6α	PLtet:sfGFP-mScarlet (with RNA motif for MSI-1* binding in front of sfGFP)	CamR, p15A ori	This work
pREP7α	PLtet:sfGFP-mScarlet (with RNA motif for MSI-1* binding and consensus sequences within RBS in front of sfGFP)	CamR, p15A ori	This work

Open in a new tab

Appendix 6

Nucleotide sequences of the elements used to implement our synthetic gene circuits.

Name	Sequence
PLlac	`AATTGTGAGCGGATAACAATTGACATTGTGAGCGGATAACAAGATACTGAGCAC`
msi-1* (codon optimized to E. coli from M. musculus)	ATGGAAACGGACGCCCCGCAGCCGGGACTGGCCTCTCCTGACTCTCCTCACGACCCA TGCAAGATGTTTATTGGTGGACTTTCTTGGCAGACTACTCAGGAGGGTCTTCGTGAA TACTTCGGTCAATTTGGCGAAGTGAAAGAGTGTCTTGTGATGCGCGATCCTTTAACC AAGCGTAGTCGCGGATTTGGCTTCGTCACGTTCATGGACCAGGCAGGCGTGGATAAG GTGCTGGCGCAGAGTCGTCACGAATTAGATTCAAAAACGATTGACCCCAAAGTGGCG TTCCCACGTCGCGCCCAACCTAAAATGGTTACTCGTACCAAAAAGATTTTCGTAGGA GGCTTATCCGTAAATACCACGGTAGAAGATGTAAAGCATTACTTCGAACAGTTTGGA AAGGTGGATGATGCAATGCTTATGTTTGATAAGACCACAAACCGTCATCGTGGATTC GGCTTTGTGACCTTTGAATCGGAGGATATCGTTGAGAAGGTCTGCGAAATCCACTTT CATGAAATTAATAACAAAATGGTTGAGTGTAAGAAGGCGCAACCGAAAGAAGTCATG TCTCCTTAA
J23119	`TTGACAGCTAGCTCAGTCCTAGGTATAATGCTAGC`
PLtet	`TCCCTATCAGTGATAGAGATTGACATCCCTATCAGTGATAGAGATACTGAGCAC`
sfGFP (RNA motif underlined)	`ATGGGCAGCGTTAGTTATTTAGTTCGTATGCCAACTAGTCGTAAAGGCGAAGAGCTG TTCACTGGTGTCGTCCCTATTCTGGTGGAACTGGATGGTGATGTCAACGGTCAT` AAG TTTTCCGTGCGTGGCGAGGGTGAAGGTGACGCAACTAATGGTAAACTGACGCTGAAG TTCATCTGTACTACTGGTAAACTGCCGGTACCTTGGCCGACTCTGGTAACGACGCTG ACTTATGGTGTTCAGTGCTTTGCTCGTTATCCGGACCATATGAAGCAGCATGACTTC TTCAAGTCCGCCATGCCGGAAGGCTATGTGCAGGAACGCACGATTTCCTTTAAGGAT GACGGCACGTACAAAACGCGTGCGGAAGTGAAATTTGAAGGCGATACCCTGGTAAAC CGCATTGAGCTGAAAGGCATTGACTTTAAAGAAGACGGCAATATCCTGGGCCATAAG CTGGAATACAATTTTAACAGCCACAATGTTTACATCACCGCCGATAAACAAAAAAAT GGCATTAAAGCGAATTTTAAAATTCGCCACAACGTGGAGGATGGCAGCGTGCAGCTG GCTGATCACTACCAGCAAAACACTCCAATCGGTGATGGTCCTGTTCTGCTGCCAGAC AATCACTATCTGAGCACGCAAAGCGTTCTGTCTAAAGATCCGAACGAGAAACGCGAT CATATGGTTCTGCTGGAGTTCGTAACCGCAGCGGGCATCACGCATGGTATGGATGAA CTGTACAAATAA
RNA motif mutant 1 (A>C substitution)	`GGCAGCGTTCGTTATTTAGTTCGTATGCC`
RNA motif mutant 2 (G>C substitution)	`GGCAGCGTTACTTATTTAGTTCGTATGCC`
RNA motif mutant 3 (T>C substitution)	`GGCAGCGTTAGCTATTTAGTTCGTATGCC`
RNA motif mutant 4 (G insertion, a nucleotide was deleted downstream to be in frame)	`GGCAGCGTTAGTTATGTTAGTTCGTATGCC`
RNA motif mutant 5 (two G>C substitutions)	`GGCAGCGTTACTTATTTACTTCGTATGCC`
Consensus sequences within RBS	`ATCATGTGTTTAGTTAGGAGATTTAGTTA`
Minimal RNA motif	`TTTATAGTTT`
Less structured RNA motif	`GGCGTTAGTTATTTAGTTCGCC`
3× less structured RNA motifs	`GGCGTTAGTTATTTAGTTCGCCGACGCGTTAGTTTATTTAGTTCGCGATATCCGGTT AGTTATTTAGTTACGG`
mScarlet	ATGGGATCCGTGAGCAAGGGCGAGGCAGTGATCAAGGAGTTCATGCGGTTCAAGGTG CACATGGAGGGCTCCATGAACGGCCACGAGTTCGAGATCGAGGGCGAGGGCGAGGGC CGCCCCTACGAGGGCACCCAGACCGCCAAGCTGAAGGTGACCAAGGGTGGCCCCCTG CCCTTCTCCTGGGACATCCTGTCCCCTCAGTTCATGTACGGCTCCAGGGCCTTCATC AAGCACCCCGCCGACATCCCCGACTACTATAAGCAGTCCTTCCCCGAGGGCTTCAAG TGGGAGCGCGTGATGAACTTCGAGGACGGCGGCGCCGTGACCGTGACCCAGGACACC TCCCTGGAGGACGGCACCCTGATCTACAAGGTGAAGCTCCGCGGCACCAACTTCCCT CCTGACGGCCCCGTAATGCAGAAGAAGACAATGGGCTGGGAAGCGTCCACCGAGCGG TTGTACCCCGAGGACGGCGTGCTGAAGGGCGACATTAAGATGGCCCTGCGCCTGAAG GACGGCGGCCGTTACCTGGCGGACTTCAAGACCACCTACAAGGCCAAGAAGCCCGTG CAGATGCCCGGCGCCTACAACGTCGACCGCAAGTTGGACATCACCTCCCACAACGAG GACTACACCGTGGTGGAACAGTACGAACGCTCCGAGGGCCGCCACTCCACCGGCGGC ATGGACGAGCTGTACAAGTAA
eBFP2	ATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTG GACGGCGACGTAAACGGCCACAAGTTCAGCGTGAGGGGCGAGGGCGAGGGCGATGCC ACCAACGGCAAGCTGACCCTGAAGTTCATCTGCACCACCGGCAAGCTGCCCGTGCCC TGGCCCACCCTCGTGACCACCCTGAGCCACGGCGTGCAGTGCTTCGCCCGCTACCCC GACCACATGAAGCAGCACGACTTCTTCAAGTCCGCCATGCCCGAAGGCTACGTCCAG GAGCGCACCATCTTCTTCAAGGACGACGGCACCTACAAGACCCGCGCCGAGGTGAAG TTCGAGGGCGACACCCTGGTGAACCGCATCGAGCTGAAGGGCGTCGACTTCAAGGAG GACGGCAACATCCTGGGGCACAAGCTGGAGTACAACTTCAACAGCCACAACATCTAT ATCATGGCCGTCAAGCAGAAGAACGGCATCAAGGTGAACTTCAAGATCCGCCACAAC GTGGAGGACGGCAGCGTGCAGCTCGCCGACCACTACCAGCAGAACACCCCCATCGGC GACGGCCCCGTGCTGCTGCCCGACAGCCACTACCTGAGCACCCAGTCCGTGCTGAGC AAAGACCCCAACGAGAAGCGCGATCACATGGTCCTGCTGGAGTTCCGCACCGCCGCC GGGATCACTCTCGGCATGGACGAGCTGTACAAG

Open in a new tab

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Guillermo Rodrigo, Email: guillermo.rodrigo@csic.es.

Joseph T Wade, New York State Department of Health, United States.

Christian R Landry, Université Laval, Canada.

Funding Information

This paper was supported by the following grants:

European Commission H2020-MSCA-ITN-2018 #813239 to Wim F Vranken, Tommaso Martelli, Wolfgang Kaiser, Jos Buijs, Guillermo Rodrigo.
Ministerio de Ciencia e Innovación PGC2018-101410-B-I00 to Guillermo Rodrigo.
Generalitat Valenciana ACIF/2021/183 to Lucas Goiriz.

Additional information

Competing interests

No competing interests declared.

APR works for Giotto Biotech.

RAHR works for Dynamic Biosensors.

GPR works for Ridgeview Instruments.

TM works for Giotto Biotech.

WFV works for Dynamic Biosensors.

JB works for Ridgeview Instruments.

Author contributions

Formal analysis, Investigation, Writing – review and editing.

Supervision, Investigation.

Investigation.

Formal analysis, Investigation.

Investigation.

Formal analysis, Investigation.

Formal analysis, Supervision, Funding acquisition, Writing – review and editing.

Formal analysis, Supervision, Funding acquisition.

Conceptualization, Formal analysis, Supervision, Funding acquisition, Investigation, Writing - original draft.

Additional files

MDAR checklist

elife-91777-mdarchecklist1.docx^{(99.3KB, docx)}

Data availability

All data generated or analysed during this study are included in the manuscript and supporting files.

References

Ai H, Shaner NC, Cheng Z, Tsien RY, Campbell RE. Exploration of new chromophore structures leads to the identification of improved blue fluorescent proteins. Biochemistry. 2007;46:5904–5910. doi: 10.1021/bi700199g. [DOI] [PubMed] [Google Scholar]
Anantharaman V, Iyer LM, Aravind L. Presence of a classical RRM-fold palm domain in thg1-type 3’- 5’nucleic acid polymerases and the origin of the GGDEF and CRISPR polymerase domains. Biology Direct. 2010;5:43. doi: 10.1186/1745-6150-5-43. [DOI] [PMC free article] [PubMed] [Google Scholar]
Babitzke P, Baker CS, Romeo T. Regulation of translation initiation by RNA binding proteins. Annual Review of Microbiology. 2009;63:27–44. doi: 10.1146/annurev.micro.091208.073514. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Natale DA, O’Donovan C, Redaschi N, Yeh LSL. The universal protein resource (uniprot) Nucleic Acids Research. 2005;33:D154–D159. doi: 10.1093/nar/gki070. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bashor CJ, Collins JJ. Understanding biological regulation through synthetic biology. Annual Review of Biophysics. 2018;47:399–423. doi: 10.1146/annurev-biophys-070816-033903. [DOI] [PubMed] [Google Scholar]
Belmont BJ, Niles JC. Engineering a direct and inducible protein-RNA interaction to regulate RNA biology. ACS Chemical Biology. 2010;5:851–861. doi: 10.1021/cb100070j. [DOI] [PubMed] [Google Scholar]
Bernstein JA, Khodursky AB, Lin PH, Lin-Chao S, Cohen SN. Global analysis of mRNA decay and abundance in Escherichia coli at single-gene resolution using two-color fluorescent DNA microarrays. PNAS. 2002;99:9697–9702. doi: 10.1073/pnas.112318199. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bindels DS, Haarbosch L, van Weeren L, Postma M, Wiese KE, Mastop M, Aumonier S, Gotthard G, Royant A, Hink MA, Gadella TJ. mScarlet: a bright monomeric red fluorescent protein for cellular imaging. Nature Methods. 2017;14:53–56. doi: 10.1038/nmeth.4074. [DOI] [PubMed] [Google Scholar]
Björke H, Andersson K. Measuring the affinity of a radioligand with its receptor using a rotating cell dish with in situ reference area. Applied Radiation and Isotopes. 2006;64:32–37. doi: 10.1016/j.apradiso.2005.06.007. [DOI] [PubMed] [Google Scholar]
Bracewell RN. The Fourier Transform and Its Applications. McGraw Hill; 2000. [Google Scholar]
Buenrostro JD, Araya CL, Chircus LM, Layton CJ, Chang HY, Snyder MP, Greenleaf WJ. Quantitative analysis of RNA-protein interactions on a massively parallel array reveals biophysical and evolutionary landscapes. Nature Biotechnology. 2014;32:562–568. doi: 10.1038/nbt.2880. [DOI] [PMC free article] [PubMed] [Google Scholar]
Cao J, Arha M, Sudrik C, Mukherjee A, Wu X, Kane RS. A universal strategy for regulating mRNA translation in prokaryotic and eukaryotic cells. Nucleic Acids Research. 2015;43:4353–4362. doi: 10.1093/nar/gkv290. [DOI] [PMC free article] [PubMed] [Google Scholar]
Choi KR, Jang WD, Yang D, Cho JS, Park D, Lee SY. Systems metabolic engineering strategies: Integrating systems and synthetic biology with metabolic engineering. Trends in Biotechnology. 2019;37:817–837. doi: 10.1016/j.tibtech.2019.01.003. [DOI] [PubMed] [Google Scholar]
Cléry A, Sohier TJM, Welte T, Langer A, Allain FHT. switchSENSE: a new technology to study protein-RNA interactions. Methods. 2017;118–119:137–145. doi: 10.1016/j.ymeth.2017.03.004. [DOI] [PubMed] [Google Scholar]
Clingman CC, Deveau LM, Hay SA, Genga RM, Shandilya SMD, Massi F, Ryder SP. Allosteric inhibition of a stem cell RNA-binding protein by an intermediary metabolite. eLife. 2014;3:e02848. doi: 10.7554/eLife.02848. [DOI] [PMC free article] [PubMed] [Google Scholar]
Dolcemascolo R, Goiriz L, Montagud-Martínez R, Rodrigo G. Gene regulation by a protein translation factor at the single-cell level. PLOS Computational Biology. 2022;18:e1010087. doi: 10.1371/journal.pcbi.1010087. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fessenden-Raden JM. Effect of fatty acids on the movement and staining of membrane proteins in polyacrylamide gel electrophoresis. Biochemical and Biophysical Research Communications. 1972;46:1347–1353. doi: 10.1016/s0006-291x(72)80123-2. [DOI] [PubMed] [Google Scholar]
Fox RG, Park FD, Koechlein CS, Kritzik M, Reya T. Musashi signaling in stem cells and cancer. Annual Review of Cell and Developmental Biology. 2015;31:249–267. doi: 10.1146/annurev-cellbio-100814-125446. [DOI] [PubMed] [Google Scholar]
Fujita Y, Matsuoka H, Hirooka K. Regulation of fatty acid metabolism in bacteria. Molecular Microbiology. 2007;66:829–839. doi: 10.1111/j.1365-2958.2007.05947.x. [DOI] [PubMed] [Google Scholar]
Ganesan SM, Falla A, Goldfless SJ, Nasamu AS, Niles JC. Synthetic RNA-protein modules integrated with native translation mechanisms to control gene expression in malaria parasites. Nature Communications. 2016;7:10727. doi: 10.1038/ncomms10727. [DOI] [PMC free article] [PubMed] [Google Scholar]
Garcia HG, Phillips R. Quantitative dissection of the simple repression input-output function. PNAS. 2011;108:12173–12178. doi: 10.1073/pnas.1015616108. [DOI] [PMC free article] [PubMed] [Google Scholar]
Glisovic T, Bachorik JL, Yong J, Dreyfuss G. RNA-binding proteins and post-transcriptional gene regulation. FEBS Letters. 2008;582:1977–1986. doi: 10.1016/j.febslet.2008.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
Goiriz L, Rodrigo G. Nonequilibrium thermodynamics of the RNA-RNA interaction underlying a genetic transposition program. Physical Review. E. 2021;103:042410. doi: 10.1103/PhysRevE.103.042410. [DOI] [PubMed] [Google Scholar]
Hammar P, Leroy P, Mahmutovic A, Marklund EG, Berg OG, Elf J. The lac repressor displays facilitated diffusion in living cells. Science. 2012;336:1595–1598. doi: 10.1126/science.1221648. [DOI] [PubMed] [Google Scholar]
Hausser J, Mayo A, Keren L, Alon U. Central dogma rates and the trade-off between precision and economy in gene expression. Nature Communications. 2019;10:68. doi: 10.1038/s41467-018-07391-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Holmqvist E, Vogel J. RNA-binding proteins in bacteria. Nature Reviews. Microbiology. 2018;16:601–615. doi: 10.1038/s41579-018-0049-5. [DOI] [PubMed] [Google Scholar]
Imai T, Tokunaga A, Yoshida T, Hashimoto M, Mikoshiba K, Weinmaster G, Nakafuku M, Okano H. The neural RNA-binding protein musashi1 translationally regulates mammalian numb gene expression by interacting with its mRNA. Molecular and Cellular Biology. 2001;21:3888–3900. doi: 10.1128/MCB.21.12.3888-3900.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
Iwaoka R, Nagata T, Tsuda K, Imai T, Okano H, Kobayashi N, Katahira M. STructural insight into the recognition of r(uag) by musashi-1 rbd2, and construction of a model of musashi-1 rbd1-2 bound to the minimum target rna. Molecules. 2017;22:1207. doi: 10.3390/molecules22071207. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jacob F, Monod J. Genetic regulatory mechanisms in the synthesis of proteins. Journal of Molecular Biology. 1961;3:318–356. doi: 10.1016/s0022-2836(61)80072-7. [DOI] [PubMed] [Google Scholar]
Järvelin AI, Noerenberg M, Davis I, Castello A. The new (dis)order in RNA regulation. Cell Communication and Signaling. 2016;14:9. doi: 10.1186/s12964-016-0132-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jonas S, Izaurralde E. Towards a molecular understanding of microRNA-mediated gene silencing. Nature Reviews. Genetics. 2015;16:421–433. doi: 10.1038/nrg3965. [DOI] [PubMed] [Google Scholar]
Kang MH, Jeong KJ, Kim WY, Lee HJ, Gong G, Suh N, Győrffy B, Kim S, Jeong SY, Mills GB, Park YY. Musashi RNA-binding protein 2 regulates estrogen receptor 1 function in breast cancer. Oncogene. 2017;36:1745–1752. doi: 10.1038/onc.2016.327. [DOI] [PubMed] [Google Scholar]
Katz N, Cohen R, Solomon O, Kaufmann B, Atar O, Yakhini Z, Goldberg S, Amit R. Synthetic 5’ utrs can either up- or downregulate expression upon rna-binding protein binding. Cell Systems. 2019;9:93–106. doi: 10.1016/j.cels.2019.04.007. [DOI] [PubMed] [Google Scholar]
Kawahara H, Imai T, Imataka H, Tsujimoto M, Matsumoto K, Okano H. Neural RNA-binding protein Musashi1 inhibits translation initiation by competing with eIF4G for PABP. The Journal of Cell Biology. 2008;181:639–653. doi: 10.1083/jcb.200708004. [DOI] [PMC free article] [PubMed] [Google Scholar]
Khalil AS, Collins JJ. Synthetic biology: applications come of age. Nature Reviews. Genetics. 2010;11:367–379. doi: 10.1038/nrg2775. [DOI] [PMC free article] [PubMed] [Google Scholar]
Klumpp S, Zhang Z, Hwa T. Growth rate-dependent global effects on gene expression in bacteria. Cell. 2009;139:1366–1375. doi: 10.1016/j.cell.2009.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kohler R, Mooney RA, Mills DJ, Landick R, Cramer P. Architecture of a transcribing-translating expressome. Science. 2017;356:194–197. doi: 10.1126/science.aal3059. [DOI] [PMC free article] [PubMed] [Google Scholar]
Koonin EV, Makarova KS. CRISPR-Cas: evolution of an RNA-based adaptive immunity system in prokaryotes. RNA Biology. 2013;10:679–686. doi: 10.4161/rna.24022. [DOI] [PMC free article] [PubMed] [Google Scholar]
Koonin EV, Krupovic M, Ishino S, Ishino Y. The replication machinery of LUCA: common origin of DNA replication and transcription. BMC Biology. 2020;18:61. doi: 10.1186/s12915-020-00800-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Langer A, Hampel PA, Kaiser W, Knezevic J, Welte T, Villa V, Maruyama M, Svejda M, Jähner S, Fischer F, Strasser R, Rant U. Protein analysis by time-resolved measurements with an electro-switchable DNA chip. Nature Communications. 2013;4:2099. doi: 10.1038/ncomms3099. [DOI] [PMC free article] [PubMed] [Google Scholar]
Leveau JH, Lindow SE. Predictive and interpretive simulation of green fluorescent protein expression in reporter bacteria. Journal of Bacteriology. 2001;183:6752–6762. doi: 10.1128/JB.183.23.6752-6762.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu MY, Romeo T. The global regulator CsrA of Escherichia coli is a specific mRNA-binding protein. Journal of Bacteriology. 1997;179:4639–4642. doi: 10.1128/jb.179.14.4639-4642.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lutz R, Bujard H. Independent and tight regulation of transcriptional units in Escherichia coli via the LacR/O, the TetR/O and AraC/I1-I2 regulatory elements. Nucleic Acids Research. 1997;25:1203–1210. doi: 10.1093/nar/25.6.1203. [DOI] [PMC free article] [PubMed] [Google Scholar]
MacDonald IC, Seamons TR, Emmons JC, Javdan SB, Deans TL. Enhanced regulation of prokaryotic gene expression by a eukaryotic transcriptional activator. Nature Communications. 2021;12:4109. doi: 10.1038/s41467-021-24434-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Madan Babu M, Teichmann SA, Aravind L. Evolutionary dynamics of prokaryotic transcriptional regulatory networks. Journal of Molecular Biology. 2006;358:614–633. doi: 10.1016/j.jmb.2006.02.019. [DOI] [PubMed] [Google Scholar]
Maris C, Dominguez C, Allain FHT. The RNA recognition motif, a plastic RNA-binding platform to regulate post-transcriptional gene expression. The FEBS Journal. 2005;272:2118–2131. doi: 10.1111/j.1742-4658.2005.04653.x. [DOI] [PubMed] [Google Scholar]
Maruyama K, Sato N, Ohta N. Conservation of structure and cold-regulation of RNA-binding proteins in cyanobacteria: probable convergent evolution with eukaryotic glycine-rich RNA-binding proteins. Nucleic Acids Research. 1999;27:2029–2036. doi: 10.1093/nar/27.9.2029. [DOI] [PMC free article] [PubMed] [Google Scholar]
Messias AC, Sattler M. Structural basis of single-stranded RNA recognition. Accounts of Chemical Research. 2004;37:279–287. doi: 10.1021/ar030034m. [DOI] [PubMed] [Google Scholar]
Meyer MM. rRNA Mimicry in RNA regulation of gene expression. Microbiology Spectrum. 2018;6:2017. doi: 10.1128/microbiolspec.RWR-0006-2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
Montalbano M, McAllen S, Puangmalai N, Sengupta U, Bhatt N, Johnson OD, Kharas MG, Kayed R. RNA-binding proteins musashi and tau soluble aggregates initiate nuclear dysfunction. Nature Communications. 2020;11:4305. doi: 10.1038/s41467-020-18022-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
Na D, Yoo SM, Chung H, Park H, Park JH, Lee SY. Metabolic engineering of Escherichia coli using synthetic small regulatory RNAs. Nature Biotechnology. 2013;31:170–174. doi: 10.1038/nbt.2461. [DOI] [PubMed] [Google Scholar]
Nakamura M, Okano H, Blendy JA, Montell C. Musashi, a neural RNA-binding protein required for Drosophila adult external sensory organ development. Neuron. 1994;13:67–81. doi: 10.1016/0896-6273(94)90460-x. [DOI] [PubMed] [Google Scholar]
Navarro Llorens JM, Tormo A, Martínez-García E. Stationary phase in gram-negative bacteria. FEMS Microbiology Reviews. 2010;34:476–495. doi: 10.1111/j.1574-6976.2010.00213.x. [DOI] [PubMed] [Google Scholar]
Nielsen AAK, Der BS, Shin J, Vaidyanathan P, Paralanov V, Strychalski EA, Ross D, Densmore D, Voigt CA. Genetic circuit design automation. Science. 2016;352:aac7341. doi: 10.1126/science.aac7341. [DOI] [PubMed] [Google Scholar]
Paulus M, Haslbeck M, Watzele M. RNA stem-loop enhanced expression of previously non-expressible genes. Nucleic Acids Research. 2004;32:e78. doi: 10.1093/nar/gnh076. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pédelacq JD, Cabantous S, Tran T, Terwilliger TC, Waldo GS. Engineering and characterization of a superfolder green fluorescent protein. Nature Biotechnology. 2006;24:79–88. doi: 10.1038/nbt1172. [DOI] [PubMed] [Google Scholar]
Peleg M, Corradini MG. Microbial growth curves: what the models tell us and what they cannot. Critical Reviews in Food Science and Nutrition. 2011;51:917–945. doi: 10.1080/10408398.2011.570463. [DOI] [PubMed] [Google Scholar]
Perea W, Greenbaum NL. Label-free horizontal EMSA for analysis of protein-RNA interactions. Analytical Biochemistry. 2020;599:113736. doi: 10.1016/j.ab.2020.113736. [DOI] [PubMed] [Google Scholar]
Peterson J, Phillips GJ. New pSC101-derivative cloning vectors with elevated copy numbers. Plasmid. 2008;59:193–201. doi: 10.1016/j.plasmid.2008.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
Popenda M, Szachniuk M, Antczak M, Purzycka KJ, Lukasiak P, Bartol N, Blazewicz J, Adamiak RW. Automated 3D structure composition for large RNAs. Nucleic Acids Research. 2012;40:e112. doi: 10.1093/nar/gks339. [DOI] [PMC free article] [PubMed] [Google Scholar]
Qi LS, Arkin AP. A versatile framework for microbial engineering using synthetic non-coding RNAs. Nature Reviews. Microbiology. 2014;12:341–354. doi: 10.1038/nrmicro3244. [DOI] [PubMed] [Google Scholar]
Rodrigo G, Carrera J, Jaramillo A. Computational design of synthetic regulatory networks from a genetic library to characterize the designability of dynamical behaviors. Nucleic Acids Research. 2011;39:e138. doi: 10.1093/nar/gkr616. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rosado A, Cordero T, Rodrigo G. Binary addition in a living cell based on riboregulation. PLOS Genetics. 2018;14:e1007548. doi: 10.1371/journal.pgen.1007548. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rosenfeld N, Alon U. Response delays and the structure of transcription networks. Journal of Molecular Biology. 2003;329:645–654. doi: 10.1016/s0022-2836(03)00506-0. [DOI] [PubMed] [Google Scholar]
Sahdev S, Khattar SK, Saini KS. Production of active eukaryotic proteins through bacterial expression systems: a review of the existing biotechnology strategies. Molecular and Cellular Biochemistry. 2008;307:249–264. doi: 10.1007/s11010-007-9603-6. [DOI] [PubMed] [Google Scholar]
Salis HM, Mirsky EA, Voigt CA. Automated design of synthetic ribosome binding sites to control protein expression. Nature Biotechnology. 2009;27:946–950. doi: 10.1038/nbt.1568. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sanchez A, Choubey S, Kondev J. Regulation of noise in gene expression. Annual Review of Biophysics. 2013;42:469–491. doi: 10.1146/annurev-biophys-083012-130401. [DOI] [PubMed] [Google Scholar]
Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez JY, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
Shotwell CR, Cleary JD, Berglund JA. The potential of engineered eukaryotic RNA-binding proteins as molecular tools and therapeutics. Wiley Interdisciplinary Reviews. RNA. 2020;11:e1573. doi: 10.1002/wrna.1573. [DOI] [PMC free article] [PubMed] [Google Scholar]
Taylor ND, Garruss AS, Moretti R, Chan S, Arbing MA, Cascio D, Rogers JK, Isaacs FJ, Kosuri S, Baker D, Fields S, Church GM, Raman S. Engineering an allosteric transcription factor to respond to new ligands. Nature Methods. 2016;13:177–183. doi: 10.1038/nmeth.3696. [DOI] [PMC free article] [PubMed] [Google Scholar]
Valderrama-Rincon JD, Fisher AC, Merritt JH, Fan YY, Reading CA, Chhiba K, Heiss C, Azadi P, Aebi M, DeLisa MP. An engineered eukaryotic protein glycosylation pathway in Escherichia coli. Nature Chemical Biology. 2012;8:434–436. doi: 10.1038/nchembio.921. [DOI] [PMC free article] [PubMed] [Google Scholar]
Waters LS, Storz G. Regulatory RNAs in bacteria. Cell. 2009;136:615–628. doi: 10.1016/j.cell.2009.01.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
Weiss JN. The Hill equation revisited: uses and misuses. FASEB Journal. 1997;11:835–841. [PubMed] [Google Scholar]
Zearfoss NR, Deveau LM, Clingman CC, Schmidt E, Johnson ES, Massi F, Ryder SP. A conserved three-nucleotide core motif defines Musashi RNA binding specificity. The Journal of Biological Chemistry. 2014;289:35530–35541. doi: 10.1074/jbc.M114.597112. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zhang F, Carothers JM, Keasling JD. Design of a dynamic sensor-regulator system for production of chemicals and fuels derived from fatty acids. Nature Biotechnology. 2012;30:354–359. doi: 10.1038/nbt.2149. [DOI] [PubMed] [Google Scholar]

eLife. doi: 10.7554/eLife.91777.3.sa0

eLife assessment

Joseph T Wade ¹

This important study demonstrates the use of the mammalian Musashi-1 (MSI-1) RNA-binding protein as a tool for regulating gene expression in Escherichia coli. The authors provide convincing evidence that MSI-1 functions as an effective repressor of translation, and that MSI-1 can be allosterically controlled by oleic acid. This work establishes MSI-1 as a potential tool for synthetic biology applications, and the system developed here can be used for mechanistic studies of MSI-1.

eLife. doi: 10.7554/eLife.91777.3.sa1

Joint Public Review:

Anonymous

The authors develop reporter constructs in E. coli that are repressed by the mammalian Musashi-1 (MSI-1) RNA-binding protein. Using a set of rigorously controlled experiments, the authors convincingly show that MSI-1 can be directed to control translation, and that translational control by MSI-1 can be modulated allosterically by oleic acid. This is a potentially useful tool for synthetic biologists, with the advantage over transcriptional regulation that one gene in an operon could be targeted. The authors' MSI-1-regulated reporter constructs could also be useful for mechanistic studies of MSI-1.

The authors initial construct design led to only weak regulation by MSI-1, presumably because the MSI-1 binding sites were not suitably positioned to repress translation initiation. A more rationally designed construct led to considerably greater repression. A minor weakness of the paper is that the authors used their initial, weakly regulated construct to assess the effect of MSI-1 binding site mutations and for their mathematical modeling; these experiments would be better suited to the more strongly regulated construct.

eLife. 2024 Feb 16;12:RP91777. doi: 10.7554/eLife.91777.3.sa2

Author Response

Roswitha Dolcemascolo ¹, María Heras-Hernández ², Lucas Goiriz ³, Roser Montagud-Martínez ⁴, Alejandro Requena-Menéndez ⁵, Raúl Ruiz ⁶, Anna Pérez-Ràfols ⁷, R Anahí Higuera-Rodríguez ⁸, Guillermo Pérez-Ropero ⁹, Wim Vranken ¹⁰, Tommaso Martelli ¹¹, Wolfgang Kaiser ¹², Jos Buijs ¹³, Guillermo Rodrigo ¹⁴

The following is the authors’ response to the original reviews.

Summary of the reviewers’ discussion:

The development of MSI-1 as a post-transcriptional regulator of gene expression in Escherichia coli represents a valuable addition to the synthetic biology toolkit. MSI-1 has advantages over transcriptional regulators because it has the potential to target single genes in operons. Allosteric control of MSI-1 by oleic acid increases its versatility.

Authors’ response: We thank the reviewers and editor for this evaluation.

We recommend that authors add experiments to test the mechanism of regulation by MSI-1 or soften their claims about translational regulation. We also recommend that the authors expand their discussion of other natural and synthetic regulatory systems that target translation.

Authors’ response: In this revision, we have added new experimental results from RT-qPCR, bulk fluorometry, and flow cytometry assays to further support our conclusions. We have also enlarged the Introduction and Discussion.

Adding an experiment to quantify the effect of oleic acid with the most strongly regulated reporter construct (i.e., flow cytometry with redesign-3) would substantially increase the impact of the work.

Authors’ response: We have done this experimental quantification (see the new Fig. 5d).

Reviewer #1 (Public Review):

The authors develop reporter constructs in E. coli where gene expression, presumably translation, is repressed by MSI-1. This is a potentially useful tool for synthetic biologists, with the advantage over transcriptional regulation that one gene in an operon could be targeted. That being said, an important caveat of translational regulation that is not addressed in the manuscript is the potential for downstream effects on RNA stability and/or transcription termination. The authors' MSI-1-regulated reporter constructs could also be useful for mechanistic studies of MSI-1.

Authors’ response: We thank the reviewer for such appreciation of our work. Regarding the potential effects on RNA stability or transcription termination, we would like to highlight our results with the sfGFP-mScarlet bicistron (Fig. 6c), showing the specific regulation of sfGFP by MSI-1* and not of mScarlet. Anyway, for this revision we have conducted an RT-qPCR experiment to quantify the mRNA level of sfGFP to further support our conclusions (see the new Fig. S2).

The author's initial construct design led to only weak regulation by MSI-1, presumably because the MSI-1 binding sites were not suitably positioned to repress translation initiation. A more rationally designed construct led to considerably greater repression. One weakness of the paper is that the authors did not use their redesigned construct that is more strongly repressed to demonstrate allosteric regulation by oleic acid using a comparable assay (e.g., flow cytometry) to that used in other experiments. The potential for allosteric regulation is a major strength of the MSI-1 system, so this is a significant gap. Similarly, the authors use the weakly regulated constructs to assess the effect of MSI-1 binding site mutations and for their mathematical modeling; these experiments would be better suited to the more strongly regulated construct.

Authors’ response: For this revision, we have performed the flow cytometric quantification of the allosteric regulation by oleic acid in the redesigned-3 system (see the new Fig. 5d). Regarding the kinetic study, we focused on the reporter system with just one recognition motif for simplicity. A reporter system with two recognition motifs, thereby recruiting two different proteins, increases the complexity to distill the effect of point mutations.

Reviewer #1 (Recommendations For The Authors):

1. Figure 5. Panels c-f look at colonies on plates, with numbers from these data being difficult to compare with either the bulk fluorescence or single-cell fluorescence values shown in other figures. Supplementary Figure 8 shows data for single cells; these data would be more appropriate in Figure 5, with the plate-based data moving to the supplement. Moreover, measuring the effect of oleic acid on the redesign-3 reporter using flow cytometry would assess the impact of oleic acid on the most strongly regulated reporter; this would be the most impactful analysis.

Authors’ response: We have redone Fig. 5 to include flow cytometry data (also for the system implemented with the redesign-3 reporter).

2. Paragraph starting line 438. The authors should briefly discuss the potential for translational repression leading to reduced RNA stability, and in the case of rapid repression that impacts transcription-coupled translation, its impact on Rho-dependent transcription termination. These factors could alter the expression of neighboring genes.

Authors’ response: As we have shown with the RT-qPCR experiment, the mRNA level of the target gene does not change in response to protein binding. We agree that mRNA stability could potentially be changed by using other RNA-targeting proteins. But in our view, a reduction of RNA stability is not a regulation of translation. We have added the following sentence in the Discussion: “The additional use of RNA-binding proteins able to alter mRNA stability might lead to the implementation of more complex circuits at the posttranscriptional level.”

3. Figure 1. It would be informative to include a control where cells have an empty plasmid rather than a plasmid expressing MSI-1, to address leakiness of MSI-1 expression.

Authors’ response: We have constructed a void plasmid as suggested and performed new bulk fluorometry assays. The new Fig. S8 shows the tight control of MSI-1* expression with the PLlac promoter. No apparent leakage is observed.

4. Line 132. Where were the two sequences positioned with respect to each other than the start codon? It would be helpful to show the sequence in Figure 1.

Authors’ response: The precise sequence is shown in the inset of Fig. 1b. The motif is placed just after the start codon.

5. Line 135. The authors envisioned repression mechanism isn't clear from the text, specifically the meaning of "block the progression" and "initial phase". As far as I know, there is no precedent for RNA-binding proteins repressing translation in bacteria by preventing translation elongation. Presumably, repression in the context described here would be due to MSI-1 binding over the ribosome-binding site, although the predicted hairpin may also occlude binding of initiating 30S ribosomes in the absence of MSI-1 binding.

Authors’ response: It is difficult to know the exact mode of action. In page 7, we have rewritten a sentence to have: “In this way, MSI-1* can repress translation by blocking the binding of the ribosome, presumably by imposing a steric hindrance for the 30S ribosomal subunit.”

6. Figure 1e is overly complicated and hence is difficult to interpret. The key result is that mScarlet expression is unchanged as a function of lactose concentration. It is sufficient to show the inset graph as a supplementary figure panel and to conclude that regulation of sfGFP is at a post-transcriptional level. Similarly, the inset in Figure 4b is unnecessary.

Authors’ response: The inset of Fig. 1e shows that the growth rate of the cells is almost constant when lactose varies. A change in growth rate will affect protein expression. The use of a two-reporter system, one regulated translationally and the other not, is instrumental to extract from fluorescence data estimates of transcription and translation rates. Of course, showing that mScarlet expression is almost constant when lactose varies would be sufficient, but we believe that performing a fine treatment of the data helps to better understand the regulatory system from a mathematical and mechanistic point of view. Therefore, despite increasing the complexity of the figure, we prefer to keep the representation of the Crick spaces (following Alon’s terminology, see our ref. 32). We have tried to carefully explain Fig. 1e in the text.

7. Figure 1f and Figure 4c would be easier to interpret as two-dimensional plots.

Authors’ response: We decided to use 3D plots to have more compact representations of the data in the main figures. The accompanying insets show the percentage of cells above the threshold, which helps to understand the regulatory effects. In any case, we have provided the corresponding 2D plots in Fig. S10.

8. I don't think Figure 2e is relevant. The key result is shown in Figure 2f, i.e., the effect of mutations on regulation by MSI-1.

Authors’ response: We agree with the reviewer that the key result is shown in panel f. However, we prefer to keep panel e in Fig. 2 because, even if negative, this result may incite further research. In addition, we avoid the rearrangement of the whole figure.

9. Lines 311-313. Without additional evidence that the mutants are toxic, I suggest removing this text.

Authors’ response: As suggested, we have removed that claim.

Reviewer #2 (Public Review):

Summary:

Dolcemascolo and colleagues describe the use of the mammalian RNA-binding protein Musashi-1 (MSI-1) to implement translational regulation systems in E. coli. They perform detailed in vitro studies of MSI-1 and its binding to different RNA sequences. They provide compelling evidence of the effectiveness of the regulatory system in multiple circuits using different mRNA sequence motifs. They harness allosteric inhibition of MSI-1 by omega-9 monounsaturated fatty acids to demonstrate a fatty-acid-responsive circuit in E. coli.

Strengths:

The experimental results are compelling and the characterization of the binding between MSI-1 and different RNA sequences is thorough and performed via multiple complementary techniques. Several new useful circuit components are demonstrated.

Authors’ response: We thank the reviewer for such appreciation of our work.

Weaknesses:

MSI-1 provides 8.6-fold downregulation of sfGFP with an optimized mRNA sequence. In some applications, a larger degree of repression may be required.

Authors’ response: We agree with the reviewer in this point. We expect to conduct further research in the future to optimize the dynamic range of the system. We have added the following sentence in the Discussion: “Further work should be conducted to enhance the fold change of the regulatory module and engineer complex circuits with it.”

Reviewer #2 (Recommendations For The Authors):

Overall, I think this paper is very well done and quite thorough. I only have minor suggestions:

For Figures 1f and 4c, it is quite hard to interpret the fraction of cells above the threshold with the 3d perspective. It would be clearer to use a more standard 2d plot where the histograms are offset along the y-axis and the threshold is indicated by a vertical line.

For Figure 4b, the highlighting of different sequence regions in red3 appears to be offset by one base (e.g. AAU is highlighted rather than AUG).

Authors’ response: This has been corrected.

For line 504, it seems that MSI-1* is used for two different proteins. A different name should be assigned to this 200-residue protein to avoid confusion with the other MSI-1*.

Authors’ response: We now use the term MSI-1h* for the human version of the protein.

The note (Page S12) that A_0 + A_R = alpha/delta only applies in steady-state conditions, which should be stated.

Authors’ response: We have specified that.

It seems that some authors work for the companies that sell some of the instruments/consumables used for the assays, specifically switchSENSE and LigandTracer. This may be something that should be declared under Competing Interests for the paper.

Authors’ response: We are sorry for having missed this point. We have included a Competing Interests section to state that “RAHR and WFV work for Dynamic Biosensors. GPR and JB work for Ridgeview Instruments”.

Reviewer #3 (Public Review):

Summary:

In this work, the authors co-opt the RRM-binding protein Musashi-1 to act as a translational repressor. The novelty of the work is in the adoption of the allosteric RRM protein Musashi-1 into a translational reporter and the demonstration that RRM proteins, which are ubiquitous in eukaryotic systems, but rare in prokaryotic ones, may act effectively as post-translational regulators in E. coli. The extent of repression achieved by the best design presented in this work is not substantially improved compared to other synthetic regulatory schemes developed for E. coli, even those that similarly regulate translation (eg. native PP7 repression is approximately 10-fold, Lim et al. J. Biol. Chem. 2001 276:22507-22513). Furthermore, the mechanism of regulation is not established due to missing key experiments. The work would be of broader interest if the allosteric properties of Musashi-1 were more effective in the context of regulation. Unfortunately, the authors do not demonstrate that fatty acids can completely de-repress expression in the experimental system used for most of their assays, nor do they use this ability in their provided application (NIMPLY gate).

Authors’ response: For this revision, we have performed the flow cytometric quantification of the allosteric regulation by oleic acid in the redesigned-3 system, showing substantial de-repression of the system with the biochemical compound. We have redone Fig. 5 and modified the Results section accordingly. Aligned with the reviewers and editor, we believe that this new result helps to improve our manuscript.

Strengths:

The first major achievement of this work is the demonstration that a eukaryotic RRM protein may be used to posttranscriptionally regulate expression in bacteria. In my limited literature search, this appears to be the first engineering attempt to design an RBP to directly regulate translation in E. coli, although engineered control of translation via other approaches including alterations to RNA structure or via trans-acting sRNAs have been previously described (for review see Vigar and Wieden Biochim Biophys. Acta Gen. Subj. 2017, 1861:3060-3069). Additionally, several viral systems (e.g. MS2 and PP7) have been directly co-opted to work in a similar fashion in the past (utilized recently in Nguyen et al. ACS Synthetic Biol 2022, 11:1710-1718).

Authors’ response: We thank the reviewer for such appreciation of our work.

The second achievement of this work is the demonstration that the allosteric regulation of Musashi-1 binding can be utilized to modulate the regulatory activity. However, the liquid culture demonstration (Suppl. Fig 8) shows that this is not a very effective switch, with de-repressed reporter activity showing substantial change but not approaching un-repressed activity. This effect is stronger when colonies are grown on a solid medium (Fig. 5).

Authors’ response: As we have previously indicated, the flow cytometric quantification of the allosteric regulation by oleic acid in the redesigned-3 system in liquid culture showed substantial de-repression with the biochemical compound. It is now stated in the text the following: “Nevertheless, the system implemented with the redesign-3 reporter displayed a better dynamic behavior in response to lactose and oleic acid. In particular, the percentage of cells in the ON state increased from 0 (with 1 mM lactose) to 71% upon addition of 20 mM oleic acid (Fig. 5d).” This new result helps to improve our manuscript.

Weaknesses:

In this work, the authors codon optimize the mouse Musashi-1 coding sequence for expression in E. coli and demonstrate using an sfGFP reporter that an engineered Musashi-1 binding site near the translational start site is sufficient to enable a modest reduction in reporter gene expression. The authors postulate that the reduction in expression due to inhibition of ribosome translocation along the transcript (lines 134/135), as an expression of a control transcript (mScarlet) driven by the same promoter (Plac) but without the Musashi-1 recognition site does not demonstrate the same repression. However, the situation could be more complex. Other possibilities include inhibition of translation initiation rather than elongation, as well as accelerated mRNA decay of transcripts that are not actively translated. The authors do not present any measurements of sfGFP mRNA levels.

Authors’ response: In page 7, we have rewritten a sentence to have: “In this way, MSI-1* can repress translation by blocking the binding of the ribosome, presumably by imposing a steric hindrance for the 30S ribosomal subunit.” In addition, for this revision we have conducted an RT-qPCR experiment to quantify the mRNA level of sfGFP to further support our conclusions (see the new Fig. S2). As shown, there is no change in the mRNA level upon inducing the system with lactose.

In subsequent sections of the work, the authors create a series of point mutations to assess RNA-protein binding and assess these via both a sfGFP reporter and in vitro binding assays (switchSENSE). Ultimately, it is difficult to fully rationalize and interpret the behavior of these mutants in the context provided. The authors do identify a relationship between equilibrium constant (1/KD) and fold-repression. However, it is not clear from the narrative why this relationship should exist. Fold-repression is one measure of regulator efficacy, but it is an indirect measure determined from unrepressed and repressed expression. It is not clear why unrepressed expression (in the absence of the protein) is expected to be a function of the equilibrium constant.

Authors’ response: A mathematical derivation from mass action kinetics on why the fold change scales with 1/KD is provided in Note S2. It is the ratio between the unrepressed and repressed expression (i.e., fold change) what scales with 1/KD, but not the expression of a particular state. This kind of relationship has been previously established in the case of transcription regulation [see e.g. Garcia & Phillips, PNAS (2011), our ref. 39]. Our mathematical modeling results expand previous work by providing a single picture from which to analyze transcription and translation regulation.

Subsequent rational redesign of the Musashi-1 binding sequence to produce three alternative designs shows that fold-repression may be improved to approximately 8.6-fold. However, the rationalization of why the best design (red3) achieves this increase based on either the extensive modelling or in vitro measured binding constants is not well articulated. Furthermore, this extent of regulation is approximately that which can be achieved from the PP7 system with its native components (Lim et al. J. Biol. Chem. 2001 276:22507-22513).

Authors’ response: In the case of translation control, the regulation is more challenging because the target is quickly degraded, especially in bacteria (in contrast to transcription control, where the target is stable). This is acknowledged in the manuscript. Even though, it is possible to engineer synthetic circuits with sRNAs or RNA-binding proteins with sufficient dynamic range. We expect to conduct further research in the future to optimize the dynamic range of the system. We have added the following sentence in the Discussion: “Further work should be conducted to enhance the fold change of the regulatory module and engineer complex circuits with it.” Regarding the articulation of the results for the mutants and mathematical model, see our responses in the following questions.

The application provided for this regulator (NIMPLY gate), is not an inherently novel regulatory paradigm, and it does not capitalize on the allosteric properties of Musashi-1, but rather treats Musashi-1 as a non-allosteric component of a regulatory circuit.

Authors’ response: The NIMPLY gate refers to lactose and aTC as inputs. Considering oleic acid as an additional input will lead to a more complex logic. In the last Results section, we wanted to show that the post-transcriptional mechanism engineered with Musashi-1 can be useful specifically regulate a gene within an operon, to implement combinatorial regulation (i.e., coupling transcription and translation control), and to reduce protein expression noise. To these ends, the allosteric ability of the Musashi-1 was not so determinant. In this regard, it would be true that such fine regulatory effects might be achieved as well with non-allosteric RNA-binding proteins, such as MS2CP or PP7CP.

Reviewer #3 (Recommendations For The Authors):

1. In the introduction the authors should adequately address the native bacterial mechanisms that allow posttranscriptional regulation in bacteria as well as better discuss previous examples of translational repressors.

Authors’ response: We have added the following paragraph in the Introduction: “Even though bacteria do not appear to exploit proteins to regulate translation in a gene-specific manner, it is worth noting that some bacteriophages do follow this mechanism to modulate their infection cycle. These are the cases, e.g., of the coat proteins of the phages MS2 (infecting Escherichia coli) or PP7 (infecting Pseudomonas aeruginosa), which regulate the expression of the cognate phage replicases through protein-RNA interactions [18]. However, one limitation for synthetic biology developments is that such phage proteins are not allosteric. At the post-transcriptional level, bacteria mostly rely on a large palette of cis- and trans-acting non-coding RNAs to either activate or repress protein expression, resulting in the regulation of translation initiation, mRNA stability, or transcription termination, and even allowing sensing small molecules [1,15]. Thus, there should be efforts to replicate this functional versatility with proteins in bacteria.”

2. Given the location of the Musashi-1 binding site in the sfGFP reporter, it may be blocking translation initiation, rather than blocking the progression of the ribosome once attached (line 134/135). The schematic in Fig 1a. is also not overly clear in describing the differences in mechanisms between eukaryotic and prokaryotic systems described in the text.

Authors’ response: In page 7, we have rewritten a sentence to have: “In this way, MSI-1* can repress translation by blocking the binding of the ribosome, presumably by imposing a steric hindrance for the 30S ribosomal subunit.” In page 14, we have added the following sentence: “In this way, MSI-1* can also block the RNA component of the 30S ribosomal subunit.”

3. The authors did not directly examine mRNA levels of their reporter to establish translational regulation. In many cases, inhibition of translation is accompanied by an increased degradation rate in bacterial systems. The authors do not seem to recognize this as a possible amplifier in their system, relying exclusively on normalization via another transcript produced from the same promoter (mScarlet).

Authors’ response: For this revision we have conducted an RT-qPCR experiment to quantify the mRNA level of sfGFP to further support our conclusions (see the new Fig. S2). As shown, there is no change in the mRNA level upon inducing the system with lactose.

4. The results presented for mutations 1-5 are not consistent with the author's models for what is occurring. In particular, mutant 1 displays a reduction in reporter production in the absence of Musashi-1, but the production in the presence does not change from the unaltered sequence. The claim that mutation 1 (in the UAG binding site) results in less binding and ultimately in less regulation is not substantiated since this loss of regulation is due to a reduction in unrepressed expression rather than an increase in expression when Musashi-1 is present.

Authors’ response: We respectfully disagree with this appreciation. In the case of mutant 1, if the Musashi protein recognized the target mRNA with the same affinity as in the original scenario, the red bar would be much lower. Because the Musashi protein hardly recognizes the mutant-1 mRNA, the blue and red bars are quite similar. To clarify this point, we have added the following text in the manuscript: “Despite that mutation substantially reduced sfGFP expression in absence of MSI-1*, the presumed repressed state upon addition of lactose did not change much, suggesting the difficulty of the protein for targeting the mutated mRNA.”

5. Given point 5 above, it is not clear to me why one would expect the 1/KD to be predictive fold-repression in the presence and absence of the repressor. I would rather see the relationship described as predictive in Fig. 2f (fold change vs. 1/KD) rather than the non-linear relationship. It is difficult to qualitatively evaluate the fit quality with the way the data are currently presented.

Authors’ response: Note S2 provides a mathematical derivation from mass action kinetics on why the fold change scales with 1/KD. The R2 value that we provide for the fitting corresponds to the linear regression between fold and 1/KD, as specified in the figure legend. However, we think that the representation of fold vs. KD in log scale is more illustrative in this case.

6. It is not clear what conclusion is determined from the computational modeling, or how this work contributes to the narrative presented. It does not seem like what is learned from these experiments is utilized for novel designs. Furthermore, several of the assumptions within the model may be problematic including the high rate of "elongation leakage" described and the lack of justification for RNA degradation rates utilized.

Authors’ response: The mathematical modeling was performed to rationalize our experimental data. Our idea was more to recapitulate the observed dynamics than to guide the design of new systems. Our model might be exploited to this end in further research, as the reviewer suggests. Besides, elongation leakage is a concept that applies to both transcription and translation regulation systems, and it is not more than the ability of the RNA polymerase or ribosome to elongate even if there is a protein bound to the nucleic acid. This parameter can be set to 0 in the model if appropriate. Moreover, we cite the paper by Bernstein et al., PNAS (2002), our ref. 38, to justify that in E. coli the average mRNA half-life is about 5 min (i.e., degradation rate of 0.14 min-1).

7. The data presented in Figure 4 are not presented in a consistent way. While it would be somewhat redundant, including the 0 and 1 mM lactose data for red3 in Figure 4a would be helpful for comparison purposes.

Authors’ response: We have added the requested bar plot in Fig. 4a.

8. The presence of additional Musashi-1 sites upstream of the start codon in red3, and their impact on impact on the fold-repression may support an inhibition of the translation initiation model rather than an inhibition of elongation.

Authors’ response: In page 7, we have rewritten a sentence to have: “In this way, MSI-1* can repress translation by blocking the binding of the ribosome, presumably by imposing a steric hindrance for the 30S ribosomal subunit.” In page 14, we have added the following sentence: “In this way, MSI-1* can also block the RNA component of the 30S ribosomal subunit.”

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure 1—source data 1. Bulk fluorescence data of sfGFP, eBFP2, and mScarlet with lactose and single-cell data of sfGFP.

elife-91777-fig1-data1.xlsx^{(528.4KB, xlsx)}

Figure 2—source data 1. Bulk fluorescence data of sfGFP and binding kinetics measurements.

elife-91777-fig2-data1.xlsx^{(8.3KB, xlsx)}

Figure 3—source data 1. Bulk fluorescence data of sfGFP and mScarlet with time.

elife-91777-fig3-data1.xlsx^{(7.5KB, xlsx)}

Figure 4—source data 1. Bulk fluorescence data of sfGFP with lactose.

elife-91777-fig4-data1.xlsx^{(447.8KB, xlsx)}

Figure 5—source data 1. Single-cell data of sfGFP.

elife-91777-fig5-data1.xlsx^{(1.3MB, xlsx)}

Figure 5—source data 2. Full gel images.

elife-91777-fig5-data2.zip^{(3.1MB, zip)}

Figure 5—figure supplement 1—source data 1. Full gel images.

elife-91777-fig5-figsupp1-data1.zip^{(3.3MB, zip)}

Figure 6—source data 1. Bulk fluorescence data of sfGFP and mScarlet and single-cell data of sfGFP.

elife-91777-fig6-data1.xlsx^{(142.4KB, xlsx)}

MDAR checklist

elife-91777-mdarchecklist1.docx^{(99.3KB, docx)}

Data Availability Statement

All data generated or analysed during this study are included in the manuscript and supporting files.

[bib1] Ai H, Shaner NC, Cheng Z, Tsien RY, Campbell RE. Exploration of new chromophore structures leads to the identification of improved blue fluorescent proteins. Biochemistry. 2007;46:5904–5910. doi: 10.1021/bi700199g. [DOI] [PubMed] [Google Scholar]

[bib2] Anantharaman V, Iyer LM, Aravind L. Presence of a classical RRM-fold palm domain in thg1-type 3’- 5’nucleic acid polymerases and the origin of the GGDEF and CRISPR polymerase domains. Biology Direct. 2010;5:43. doi: 10.1186/1745-6150-5-43. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib3] Babitzke P, Baker CS, Romeo T. Regulation of translation initiation by RNA binding proteins. Annual Review of Microbiology. 2009;63:27–44. doi: 10.1146/annurev.micro.091208.073514. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] Bairoch A, Apweiler R, Wu CH, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, Magrane M, Martin MJ, Natale DA, O’Donovan C, Redaschi N, Yeh LSL. The universal protein resource (uniprot) Nucleic Acids Research. 2005;33:D154–D159. doi: 10.1093/nar/gki070. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib5] Bashor CJ, Collins JJ. Understanding biological regulation through synthetic biology. Annual Review of Biophysics. 2018;47:399–423. doi: 10.1146/annurev-biophys-070816-033903. [DOI] [PubMed] [Google Scholar]

[bib6] Belmont BJ, Niles JC. Engineering a direct and inducible protein-RNA interaction to regulate RNA biology. ACS Chemical Biology. 2010;5:851–861. doi: 10.1021/cb100070j. [DOI] [PubMed] [Google Scholar]

[bib7] Bernstein JA, Khodursky AB, Lin PH, Lin-Chao S, Cohen SN. Global analysis of mRNA decay and abundance in Escherichia coli at single-gene resolution using two-color fluorescent DNA microarrays. PNAS. 2002;99:9697–9702. doi: 10.1073/pnas.112318199. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] Bindels DS, Haarbosch L, van Weeren L, Postma M, Wiese KE, Mastop M, Aumonier S, Gotthard G, Royant A, Hink MA, Gadella TJ. mScarlet: a bright monomeric red fluorescent protein for cellular imaging. Nature Methods. 2017;14:53–56. doi: 10.1038/nmeth.4074. [DOI] [PubMed] [Google Scholar]

[bib9] Björke H, Andersson K. Measuring the affinity of a radioligand with its receptor using a rotating cell dish with in situ reference area. Applied Radiation and Isotopes. 2006;64:32–37. doi: 10.1016/j.apradiso.2005.06.007. [DOI] [PubMed] [Google Scholar]

[bib10] Bracewell RN. The Fourier Transform and Its Applications. McGraw Hill; 2000. [Google Scholar]

[bib11] Buenrostro JD, Araya CL, Chircus LM, Layton CJ, Chang HY, Snyder MP, Greenleaf WJ. Quantitative analysis of RNA-protein interactions on a massively parallel array reveals biophysical and evolutionary landscapes. Nature Biotechnology. 2014;32:562–568. doi: 10.1038/nbt.2880. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib12] Cao J, Arha M, Sudrik C, Mukherjee A, Wu X, Kane RS. A universal strategy for regulating mRNA translation in prokaryotic and eukaryotic cells. Nucleic Acids Research. 2015;43:4353–4362. doi: 10.1093/nar/gkv290. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] Choi KR, Jang WD, Yang D, Cho JS, Park D, Lee SY. Systems metabolic engineering strategies: Integrating systems and synthetic biology with metabolic engineering. Trends in Biotechnology. 2019;37:817–837. doi: 10.1016/j.tibtech.2019.01.003. [DOI] [PubMed] [Google Scholar]

[bib14] Cléry A, Sohier TJM, Welte T, Langer A, Allain FHT. switchSENSE: a new technology to study protein-RNA interactions. Methods. 2017;118–119:137–145. doi: 10.1016/j.ymeth.2017.03.004. [DOI] [PubMed] [Google Scholar]

[bib15] Clingman CC, Deveau LM, Hay SA, Genga RM, Shandilya SMD, Massi F, Ryder SP. Allosteric inhibition of a stem cell RNA-binding protein by an intermediary metabolite. eLife. 2014;3:e02848. doi: 10.7554/eLife.02848. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] Dolcemascolo R, Goiriz L, Montagud-Martínez R, Rodrigo G. Gene regulation by a protein translation factor at the single-cell level. PLOS Computational Biology. 2022;18:e1010087. doi: 10.1371/journal.pcbi.1010087. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib17] Fessenden-Raden JM. Effect of fatty acids on the movement and staining of membrane proteins in polyacrylamide gel electrophoresis. Biochemical and Biophysical Research Communications. 1972;46:1347–1353. doi: 10.1016/s0006-291x(72)80123-2. [DOI] [PubMed] [Google Scholar]

[bib18] Fox RG, Park FD, Koechlein CS, Kritzik M, Reya T. Musashi signaling in stem cells and cancer. Annual Review of Cell and Developmental Biology. 2015;31:249–267. doi: 10.1146/annurev-cellbio-100814-125446. [DOI] [PubMed] [Google Scholar]

[bib19] Fujita Y, Matsuoka H, Hirooka K. Regulation of fatty acid metabolism in bacteria. Molecular Microbiology. 2007;66:829–839. doi: 10.1111/j.1365-2958.2007.05947.x. [DOI] [PubMed] [Google Scholar]

[bib20] Ganesan SM, Falla A, Goldfless SJ, Nasamu AS, Niles JC. Synthetic RNA-protein modules integrated with native translation mechanisms to control gene expression in malaria parasites. Nature Communications. 2016;7:10727. doi: 10.1038/ncomms10727. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] Garcia HG, Phillips R. Quantitative dissection of the simple repression input-output function. PNAS. 2011;108:12173–12178. doi: 10.1073/pnas.1015616108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib22] Glisovic T, Bachorik JL, Yong J, Dreyfuss G. RNA-binding proteins and post-transcriptional gene regulation. FEBS Letters. 2008;582:1977–1986. doi: 10.1016/j.febslet.2008.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib23] Goiriz L, Rodrigo G. Nonequilibrium thermodynamics of the RNA-RNA interaction underlying a genetic transposition program. Physical Review. E. 2021;103:042410. doi: 10.1103/PhysRevE.103.042410. [DOI] [PubMed] [Google Scholar]

[bib24] Hammar P, Leroy P, Mahmutovic A, Marklund EG, Berg OG, Elf J. The lac repressor displays facilitated diffusion in living cells. Science. 2012;336:1595–1598. doi: 10.1126/science.1221648. [DOI] [PubMed] [Google Scholar]

[bib25] Hausser J, Mayo A, Keren L, Alon U. Central dogma rates and the trade-off between precision and economy in gene expression. Nature Communications. 2019;10:68. doi: 10.1038/s41467-018-07391-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] Holmqvist E, Vogel J. RNA-binding proteins in bacteria. Nature Reviews. Microbiology. 2018;16:601–615. doi: 10.1038/s41579-018-0049-5. [DOI] [PubMed] [Google Scholar]

[bib27] Imai T, Tokunaga A, Yoshida T, Hashimoto M, Mikoshiba K, Weinmaster G, Nakafuku M, Okano H. The neural RNA-binding protein musashi1 translationally regulates mammalian numb gene expression by interacting with its mRNA. Molecular and Cellular Biology. 2001;21:3888–3900. doi: 10.1128/MCB.21.12.3888-3900.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib28] Iwaoka R, Nagata T, Tsuda K, Imai T, Okano H, Kobayashi N, Katahira M. STructural insight into the recognition of r(uag) by musashi-1 rbd2, and construction of a model of musashi-1 rbd1-2 bound to the minimum target rna. Molecules. 2017;22:1207. doi: 10.3390/molecules22071207. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib29] Jacob F, Monod J. Genetic regulatory mechanisms in the synthesis of proteins. Journal of Molecular Biology. 1961;3:318–356. doi: 10.1016/s0022-2836(61)80072-7. [DOI] [PubMed] [Google Scholar]

[bib30] Järvelin AI, Noerenberg M, Davis I, Castello A. The new (dis)order in RNA regulation. Cell Communication and Signaling. 2016;14:9. doi: 10.1186/s12964-016-0132-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib31] Jonas S, Izaurralde E. Towards a molecular understanding of microRNA-mediated gene silencing. Nature Reviews. Genetics. 2015;16:421–433. doi: 10.1038/nrg3965. [DOI] [PubMed] [Google Scholar]

[bib32] Kang MH, Jeong KJ, Kim WY, Lee HJ, Gong G, Suh N, Győrffy B, Kim S, Jeong SY, Mills GB, Park YY. Musashi RNA-binding protein 2 regulates estrogen receptor 1 function in breast cancer. Oncogene. 2017;36:1745–1752. doi: 10.1038/onc.2016.327. [DOI] [PubMed] [Google Scholar]

[bib33] Katz N, Cohen R, Solomon O, Kaufmann B, Atar O, Yakhini Z, Goldberg S, Amit R. Synthetic 5’ utrs can either up- or downregulate expression upon rna-binding protein binding. Cell Systems. 2019;9:93–106. doi: 10.1016/j.cels.2019.04.007. [DOI] [PubMed] [Google Scholar]

[bib34] Kawahara H, Imai T, Imataka H, Tsujimoto M, Matsumoto K, Okano H. Neural RNA-binding protein Musashi1 inhibits translation initiation by competing with eIF4G for PABP. The Journal of Cell Biology. 2008;181:639–653. doi: 10.1083/jcb.200708004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib35] Khalil AS, Collins JJ. Synthetic biology: applications come of age. Nature Reviews. Genetics. 2010;11:367–379. doi: 10.1038/nrg2775. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] Klumpp S, Zhang Z, Hwa T. Growth rate-dependent global effects on gene expression in bacteria. Cell. 2009;139:1366–1375. doi: 10.1016/j.cell.2009.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib37] Kohler R, Mooney RA, Mills DJ, Landick R, Cramer P. Architecture of a transcribing-translating expressome. Science. 2017;356:194–197. doi: 10.1126/science.aal3059. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib38] Koonin EV, Makarova KS. CRISPR-Cas: evolution of an RNA-based adaptive immunity system in prokaryotes. RNA Biology. 2013;10:679–686. doi: 10.4161/rna.24022. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib39] Koonin EV, Krupovic M, Ishino S, Ishino Y. The replication machinery of LUCA: common origin of DNA replication and transcription. BMC Biology. 2020;18:61. doi: 10.1186/s12915-020-00800-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib40] Langer A, Hampel PA, Kaiser W, Knezevic J, Welte T, Villa V, Maruyama M, Svejda M, Jähner S, Fischer F, Strasser R, Rant U. Protein analysis by time-resolved measurements with an electro-switchable DNA chip. Nature Communications. 2013;4:2099. doi: 10.1038/ncomms3099. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib41] Leveau JH, Lindow SE. Predictive and interpretive simulation of green fluorescent protein expression in reporter bacteria. Journal of Bacteriology. 2001;183:6752–6762. doi: 10.1128/JB.183.23.6752-6762.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib42] Liu MY, Romeo T. The global regulator CsrA of Escherichia coli is a specific mRNA-binding protein. Journal of Bacteriology. 1997;179:4639–4642. doi: 10.1128/jb.179.14.4639-4642.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib43] Lutz R, Bujard H. Independent and tight regulation of transcriptional units in Escherichia coli via the LacR/O, the TetR/O and AraC/I1-I2 regulatory elements. Nucleic Acids Research. 1997;25:1203–1210. doi: 10.1093/nar/25.6.1203. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib44] MacDonald IC, Seamons TR, Emmons JC, Javdan SB, Deans TL. Enhanced regulation of prokaryotic gene expression by a eukaryotic transcriptional activator. Nature Communications. 2021;12:4109. doi: 10.1038/s41467-021-24434-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib45] Madan Babu M, Teichmann SA, Aravind L. Evolutionary dynamics of prokaryotic transcriptional regulatory networks. Journal of Molecular Biology. 2006;358:614–633. doi: 10.1016/j.jmb.2006.02.019. [DOI] [PubMed] [Google Scholar]

[bib46] Maris C, Dominguez C, Allain FHT. The RNA recognition motif, a plastic RNA-binding platform to regulate post-transcriptional gene expression. The FEBS Journal. 2005;272:2118–2131. doi: 10.1111/j.1742-4658.2005.04653.x. [DOI] [PubMed] [Google Scholar]

[bib47] Maruyama K, Sato N, Ohta N. Conservation of structure and cold-regulation of RNA-binding proteins in cyanobacteria: probable convergent evolution with eukaryotic glycine-rich RNA-binding proteins. Nucleic Acids Research. 1999;27:2029–2036. doi: 10.1093/nar/27.9.2029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib48] Messias AC, Sattler M. Structural basis of single-stranded RNA recognition. Accounts of Chemical Research. 2004;37:279–287. doi: 10.1021/ar030034m. [DOI] [PubMed] [Google Scholar]

[bib49] Meyer MM. rRNA Mimicry in RNA regulation of gene expression. Microbiology Spectrum. 2018;6:2017. doi: 10.1128/microbiolspec.RWR-0006-2017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib50] Montalbano M, McAllen S, Puangmalai N, Sengupta U, Bhatt N, Johnson OD, Kharas MG, Kayed R. RNA-binding proteins musashi and tau soluble aggregates initiate nuclear dysfunction. Nature Communications. 2020;11:4305. doi: 10.1038/s41467-020-18022-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib51] Na D, Yoo SM, Chung H, Park H, Park JH, Lee SY. Metabolic engineering of Escherichia coli using synthetic small regulatory RNAs. Nature Biotechnology. 2013;31:170–174. doi: 10.1038/nbt.2461. [DOI] [PubMed] [Google Scholar]

[bib52] Nakamura M, Okano H, Blendy JA, Montell C. Musashi, a neural RNA-binding protein required for Drosophila adult external sensory organ development. Neuron. 1994;13:67–81. doi: 10.1016/0896-6273(94)90460-x. [DOI] [PubMed] [Google Scholar]

[bib53] Navarro Llorens JM, Tormo A, Martínez-García E. Stationary phase in gram-negative bacteria. FEMS Microbiology Reviews. 2010;34:476–495. doi: 10.1111/j.1574-6976.2010.00213.x. [DOI] [PubMed] [Google Scholar]

[bib54] Nielsen AAK, Der BS, Shin J, Vaidyanathan P, Paralanov V, Strychalski EA, Ross D, Densmore D, Voigt CA. Genetic circuit design automation. Science. 2016;352:aac7341. doi: 10.1126/science.aac7341. [DOI] [PubMed] [Google Scholar]

[bib55] Paulus M, Haslbeck M, Watzele M. RNA stem-loop enhanced expression of previously non-expressible genes. Nucleic Acids Research. 2004;32:e78. doi: 10.1093/nar/gnh076. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib56] Pédelacq JD, Cabantous S, Tran T, Terwilliger TC, Waldo GS. Engineering and characterization of a superfolder green fluorescent protein. Nature Biotechnology. 2006;24:79–88. doi: 10.1038/nbt1172. [DOI] [PubMed] [Google Scholar]

[bib57] Peleg M, Corradini MG. Microbial growth curves: what the models tell us and what they cannot. Critical Reviews in Food Science and Nutrition. 2011;51:917–945. doi: 10.1080/10408398.2011.570463. [DOI] [PubMed] [Google Scholar]

[bib58] Perea W, Greenbaum NL. Label-free horizontal EMSA for analysis of protein-RNA interactions. Analytical Biochemistry. 2020;599:113736. doi: 10.1016/j.ab.2020.113736. [DOI] [PubMed] [Google Scholar]

[bib59] Peterson J, Phillips GJ. New pSC101-derivative cloning vectors with elevated copy numbers. Plasmid. 2008;59:193–201. doi: 10.1016/j.plasmid.2008.01.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib60] Popenda M, Szachniuk M, Antczak M, Purzycka KJ, Lukasiak P, Bartol N, Blazewicz J, Adamiak RW. Automated 3D structure composition for large RNAs. Nucleic Acids Research. 2012;40:e112. doi: 10.1093/nar/gks339. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib61] Qi LS, Arkin AP. A versatile framework for microbial engineering using synthetic non-coding RNAs. Nature Reviews. Microbiology. 2014;12:341–354. doi: 10.1038/nrmicro3244. [DOI] [PubMed] [Google Scholar]

[bib62] Rodrigo G, Carrera J, Jaramillo A. Computational design of synthetic regulatory networks from a genetic library to characterize the designability of dynamical behaviors. Nucleic Acids Research. 2011;39:e138. doi: 10.1093/nar/gkr616. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib63] Rosado A, Cordero T, Rodrigo G. Binary addition in a living cell based on riboregulation. PLOS Genetics. 2018;14:e1007548. doi: 10.1371/journal.pgen.1007548. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib64] Rosenfeld N, Alon U. Response delays and the structure of transcription networks. Journal of Molecular Biology. 2003;329:645–654. doi: 10.1016/s0022-2836(03)00506-0. [DOI] [PubMed] [Google Scholar]

[bib65] Sahdev S, Khattar SK, Saini KS. Production of active eukaryotic proteins through bacterial expression systems: a review of the existing biotechnology strategies. Molecular and Cellular Biochemistry. 2008;307:249–264. doi: 10.1007/s11010-007-9603-6. [DOI] [PubMed] [Google Scholar]

[bib66] Salis HM, Mirsky EA, Voigt CA. Automated design of synthetic ribosome binding sites to control protein expression. Nature Biotechnology. 2009;27:946–950. doi: 10.1038/nbt.1568. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib67] Sanchez A, Choubey S, Kondev J. Regulation of noise in gene expression. Annual Review of Biophysics. 2013;42:469–491. doi: 10.1146/annurev-biophys-083012-130401. [DOI] [PubMed] [Google Scholar]

[bib68] Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, Tinevez JY, White DJ, Hartenstein V, Eliceiri K, Tomancak P, Cardona A. Fiji: an open-source platform for biological-image analysis. Nature Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib69] Shotwell CR, Cleary JD, Berglund JA. The potential of engineered eukaryotic RNA-binding proteins as molecular tools and therapeutics. Wiley Interdisciplinary Reviews. RNA. 2020;11:e1573. doi: 10.1002/wrna.1573. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib70] Taylor ND, Garruss AS, Moretti R, Chan S, Arbing MA, Cascio D, Rogers JK, Isaacs FJ, Kosuri S, Baker D, Fields S, Church GM, Raman S. Engineering an allosteric transcription factor to respond to new ligands. Nature Methods. 2016;13:177–183. doi: 10.1038/nmeth.3696. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib71] Valderrama-Rincon JD, Fisher AC, Merritt JH, Fan YY, Reading CA, Chhiba K, Heiss C, Azadi P, Aebi M, DeLisa MP. An engineered eukaryotic protein glycosylation pathway in Escherichia coli. Nature Chemical Biology. 2012;8:434–436. doi: 10.1038/nchembio.921. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib72] Waters LS, Storz G. Regulatory RNAs in bacteria. Cell. 2009;136:615–628. doi: 10.1016/j.cell.2009.01.043. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib73] Weiss JN. The Hill equation revisited: uses and misuses. FASEB Journal. 1997;11:835–841. [PubMed] [Google Scholar]

[bib74] Zearfoss NR, Deveau LM, Clingman CC, Schmidt E, Johnson ES, Massi F, Ryder SP. A conserved three-nucleotide core motif defines Musashi RNA binding specificity. The Journal of Biological Chemistry. 2014;289:35530–35541. doi: 10.1074/jbc.M114.597112. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib75] Zhang F, Carothers JM, Keasling JD. Design of a dynamic sensor-regulator system for production of chemicals and fuels derived from fatty acids. Nature Biotechnology. 2012;30:354–359. doi: 10.1038/nbt.2149. [DOI] [PubMed] [Google Scholar]

PERMALINK

Repurposing the mammalian RNA-binding protein Musashi-1 as an allosteric translation repressor in bacteria

Roswitha Dolcemascolo

María Heras-Hernández

Lucas Goiriz

Roser Montagud-Martínez

Alejandro Requena-Menéndez

Raúl Ruiz

Anna Pérez-Ràfols

R Anahí Higuera-Rodríguez

Guillermo Pérez-Ropero

Wim F Vranken

Tommaso Martelli

Wolfgang Kaiser

Jos Buijs

Guillermo Rodrigo

Roles

Abstract

Introduction

Figure 1. Musashi-1 can downregulate translation in bacteria.

Figure 1—figure supplement 1. Maps of the plasmids used to implement the synthetic gene circuit in which MSI-1* represses the translation of sfGFP.

Figure 1—figure supplement 2. RT-qPCR results for the mRNA level of sfGFP.

Results

A Musashi protein can downregulate translation in bacteria

Mechanistic insight into the engineered regulation based on a protein–RNA interaction

Figure 2. Mechanistic characterization of the Musashi-1–mRNA interaction.

Figure 2—figure supplement 1. Characterization of the system response with lactose using pREP4 as a reporter plasmid.

Figure 2—figure supplement 2. Musashi protein purification.

Figure 2—figure supplement 3. Characterization of different mutant RNA motifs in terms of binding kinetics against the MSI-1h* protein.

A mathematical model captured the dynamic response of the system

Figure 3. A mathematical model captures the dynamic response of the system.

Figure 3—figure supplement 1. Characterization of the system response with IPTG (implemented with pRM1+ and pREP6).

Figure 3—figure supplement 2. Dynamic response of the system in solid medium.

Rational redesign of the targeted transcript to enhance the dynamic range of the response

Figure 4. mRNA redesign to enhance the downregulation by Musashi-1.

Figure 4—figure supplement 1. PLlac promoter tightly controls MSI-1* expression.

The regulatory activity of a Musashi protein in bacteria can be externally controlled by a fatty acid

Figure 5. Oleic acid inhibits the regulatory activity of Musashi-1 in bacteria.

Figure 5—figure supplement 1. Gel electrophoresis mobility shift assays to test the MSI-1h*-RNA and the MSI-1h*-oleic acid interactions (nucleic acid-stained gels).

Figure 5—figure supplement 2. 2D visualization of probability-based histograms of sfGFP expression from single-cell data.

Figure 5—figure supplement 3. Quantification of the green fluorescence of the colonies (denoted by ΣsfGFP as it is from populations; n = 5).

Application of a Musashi protein for intra-operon, combinatorial, and noise regulation

Figure 6. Applications of Musashi-1 for a fine expression control in bacteria.

Figure 6—figure supplement 1. Probability-based histograms of sfGFP expression from single-cell data for different inducer concentrations (1 mM lactose + 20 mM oleic acid on the top, 0.1 mM lactose on the bottom).

Discussion

Materials and methods

Strains, plasmids, and reagents

Bulk fluorometry

Real-time fluorescence quantification in solid medium

Flow cytometry

Purification of a Musashi protein

Binding kinetics assays of protein–RNA interactions

RT-qPCR

Gel electrophoresis

Microscopy

Mathematical modeling

Molecular visualization in silico

Resources availability

Acknowledgements

Appendix 1

Appendix 2

Appendix 3

Appendix 4

Appendix 5

Appendix 6

Funding Statement

Contributor Information

Funding Information

Additional information

Competing interests

Author contributions

Additional files

Data availability

References

eLife assessment

Joseph T Wade

Roles

Joint Public Review:

Anonymous

Roles

Figure 2—figure supplement 3. Characterization of different mutant RNA motifs in terms of binding kinetics against the MSI-1_h* protein.

Figure 5—figure supplement 1. Gel electrophoresis mobility shift assays to test the MSI-1_h-RNA and the MSI-1_h-oleic acid interactions (nucleic acid-stained gels).