Abstract
The engineering of genetic circuits to perform predefined computations is central to synthetic biology, enabling living cells with new functionalities applicable across various domains. However, these circuits are often specifically tailored to particular cellular hosts, with Escherichia coli being the most popular. Consequently, their intended functions may not translate well to other organisms, limiting their scope. Understanding circuit dynamics in less familiar organisms is crucial, especially for niche-specific applications requiring cellular chassis different from model organisms generally used in synthetic biology. Here, we develop a combined experimental and theoretical pipeline to evaluate the performance of NOT logic circuits, also called inverters, in the soil bacterium Pseudomonas protegens Pf-5—a host renowned for its unique environmental functions and a newcomer to genetic circuitry. Inverters were experimentally tested to characterize input-output functionality, and mathematical modelling was used to infer the dynamic principles of circuit modules. The model quantified the individual impacts of key parameters—such as translation efficiency, repressor performance, and promoter activity—on output levels, enabling predictions about inter-host circuit portability. This parameter calibration revealed unique properties of the chassis, including steeper transitions between on and off circuit states compared to the synthetic biology workhorse Pseudomonas putida. Moreover, model parameters guided the successful serial connection of two NOT gates, enabling YES logic function. As a result, our work provides DNA parts, circuits and mathematical characterizations to establish P. protegens Pf-5 as a viable chassis for environmental synthetic biology.
1. Introduction
Synthetic biology focuses on the rational design and implementation of predefined computations in living cells [1, 2, 3]. To achieve this goal, the programming of genetic circuits plays a crucial role in sensing inputs and generating outputs according to rules and instructions encoded in DNA sequences [4, 5]. These circuits function as various entities, such as Boolean logic gates [6, 7], counters [8], switches [9], feedback controllers [10], and memories [11], serving as tools to introduce innovative functions into host organisms.
While successful in various applications [12, 13], spanning from biotechnology [14] to bioremediation [15], the programming of biological functions often gives greater emphasis to the software (i.e., the genetic circuit), overlooking the crucial role played by the hardware (i.e., the host cell) [16]. For instance, the optimization of genetic circuits typically relies on testing libraries of DNA parts [17, 18], such as promoters [19] or ribosome binding sites [20], to identify the set of parts that accurately produces the desired phenotypic response. However, it is essential to recognize that the hardware is equally crucial for ensuring the proper performance of the software. For example, certain DNA parts may function effectively in some hosts but poorly in others—the function of genetic circuits is highly context-dependent [21, 22]. This underscores the necessity of tailoring genetic circuits specifically to a particular type of cellular hardware. It could be argued that circuits alone are not sufficient to achieve predefined functionality; rather, the tandem of circuit and host is the actual functional unit. Consequently, what works well in one host often needs to be re-engineered to perform effectively in a different organism [23]. Far from being a disadvantage, this requirement has been leveraged to expand the functional abilities of circuits [21].
Each cellular host possesses specific dynamics that distinguish it from other organisms, ultimately influencing the performance of genetic circuits. While distinctions are more apparent between bacteria and eukaryotic cells, significant mechanistic differences also exist among bacterial species. These distinctions extend beyond important differences in shape [24] or doubling times [25], impacting circuit performance by encompassing variations in resource distribution [26] (e.g., ribosomes) and cellular functions [27] (e.g., transcription vs translation). In synthetic biology, the concept of a chassis [28] refers to a well-characterized host organism that serves as a framework for bioengineering, offering a comprehensive understanding of its impact on the performance of genetic circuits. To date, there is a lack of such well-characterized hosts—a gap that underscores the challenge addressed in this work.
The model bacterium Escherichia coli has been pivotal in synthetic biology, benefiting from the extensive knowledge accumulated about this chassis since the early days of molecular biology [29]. However, extending the applications of genetic circuits beyond the laboratory requires customising their functions to suit niche-specific cellular hosts. For instance, there are cells with naturally evolved traits ideal for targeting sustainability applications [30]. Examples include the engineering of cells to link greenhouse gas sequestration with the bioproduction of valuable compounds using Clostridium autoethanogenum and Cupriavidus necator [31, 32]. Additionally, rhizosphere-specific applications have been implemented using nitrogen-fixing hosts such as Azotobacter vinelandii, Stutzerimonas stutzeri (previously Pseudomonas stutzeri), and Klebsiella variicola [33, 34]. Other well-studied cellular chassis for various synthetic biology applications include Bacillus subtilis [35], Pseudomonas putida [36], Geobacter sulfurreducens [37, 38], and non-bacterial organisms like the yeast Saccharomyces cerevisiae [39]. In all cases, circuits need to be tailored to their host, not only to adapt to the specific internal workings of the cell but also to leverage this specificity, which may provide advantages in various application domains.
Here, we focus on the soil bacterium Pseudomonas protegens, an important environmental host with unique interactions with soil’s biotic (e.g., plants and other microbes) and abiotic (e.g., pollutants) components. For example, this bacterium excels in biocontrol activity against plant pathogens [40, 41]. However, compared to other organisms, the engineering of predefined functions in P. protegens has lagged behind. Recent examples, though, highlight its potential as a host for bioengineering endeavours [42, 43]. Despite these advancements, thorough characterization of genetic parts, circuits, context dependencies, and dynamic details is still lacking—an issue that this paper aims to address, unlocking the potential of synthetic biology in this host. To evaluate the context dependency of circuits in P. protegens, we performed parallel experiments in the related soil bacterium P. putida [44, 45]—a closely related organism with other important environmental traits. By comparing circuit dynamics between the two hosts and developing mathematical models, we were able to make reliable predictions of circuit behavior, though not in all cases with complete accuracy. Importantly, the model also allowed us to design new circuits by identifying which ones could be coupled to achieve specific performances. Thus, our approach supports both prediction of circuit output prior to experimental characterization and the rational design of novel circuit combinations. Although mathematical modelling [46, 47] and computational tools [48, 49] are crucial to genetic circuit design and have become fundamental in synthetic biology [50, 51], such predictive capabilities are still elusive and constitute an overarching challenge in the field [52].
In the following section, we present experimental and theoretical results illustrating the performance of a library of genetic circuits in P. protegens. Key dynamic parameters, considered the fingerprint of each circuit, assist in predicting circuit portability between this host and P. putida. Altogether, these results contribute to establishing P. protegens as a synthetic biology chassis.
2. Results and discussion
Metrics for evaluating genetic circuit performance
To assess the performance of genetic circuits in Pseudomonas protegens Pf-5, we used a set of standardized NOT logic gates, or inverters. This benchmarking set is a library of 10 circuits that perform a NOT function: if the input regulator is absent (logic 0), the output protein will be expressed at high levels (logic 1), and vice versa. In other words, the output inverts the input.
Each circuit within the library was independently analysed to measure input-output function. The circuit’s mathematical model was then fitted to the experimental data, and the inverters functionality was compared to the performance of the same testing library in the host Pseudomonas putida KT2440. Comparing circuit functions across organisms is challenging because a circuit’s performance is significantly influenced by its host context. To address this, we used measurements of the varying background activities observed in each cell type as controls. These measurements accounted for both background fluorescence and the signal of an empty NOT gate (i.e., the construct containing only the reporter gene under the control of the Biobricks BBA J23101 standard promoter). This activity was used to normalize circuit behaviour against background activity, allowing mathematical modelling to account for both the contextual dependencies impacting the circuit and its intrinsic function. The substantial variance observed in the standard promoter activity between the two hosts underscores the importance of this step.
The structure of the inverter circuits and their mathematical counterparts are depicted in Figure 1A. All synthetic constructs share a common architecture, comprising the reporter gene (yfp) used as the output and the repressor-promoter pair (LacI-pTac) that senses input concentration (IPTG). While input and output molecules are the same, the variable component within the library, which defines the unique characteristics of each inverter, is a distinct repressor (Rx) and its corresponding promoter (pRx). The benchmarking set includes seven different repressor-promoter pairs, plus three additional variants for circuits regulated by the QacR or SrpR repressors, making a total of 10 circuits. The mathematical model simulates the NOT function of the gates by considering the mechanistic kinetic rates within each circuit. As depicted in the model scheme, a circuit can be divided into two distinct modules: rates K describe the dynamics of the repressor regulator, while rates C describe those of the reporter. The actual binding and repression of the target promoter pRx is modelled using a Hill equation.
Figure 1. Testing metrics for evaluating genetic circuits performance.
A. The NOT circuits (top) consist of a common architecture shared among all, with a unique set of a repressor and its cognate promoter, denoted here as Rx and pRx, respectively. Circuits were mechanistically modelled (bottom) with the following key rates for parameterizing their performance: K0, K1, K2, K3 and Kd for the transcription, translation, degradations and dissociation constant of the repressor machinery; and C0, C1, C2, and C3 for the same dynamics involved in YFP expression. Repression in the NOT gates was modelled using a Hill function, and IPTG activation in the control model was also described with a Hill function. B. The rates were clustered into 4 testing metrics that can be directly related to the performance of various DNA sequences within the circuits: the reporter expression cassette (a), the repressor expression cassette and the Rx-pRx binding dissociation constant (g), the leakiness of the central repressible promoter pRx (β), and the Rx-pRx binding cooperativity (n). C. Each NOT logic characterization, resulting in a graph that relates the level of input (IPTG, x-axis) to the level of output (YFP, y-axis), was analyzed by fitting the circuit’s mathematical model to the experimental data, which returns a specific and unique set of testing metrics. D. The effect of increasing the value of each of the four metrics on NOT logic output.
To characterize the gates, models were fitted to the experimental data. However, models based on the full set of rates (K and C) exhibited overfitting, which refers to errors in parameter estimation caused by an excessive number of variables. This occurred because these models were overly complex for the data aimed to match, namely fluorescence measurements. In simpler terms, the models failed to discern the dynamics responsible for variations in performance. Consequently, in order to derive mechanistic insights from the fitting process, we merged all parameters K—along with the Rx-pRx binding dissociation constant—and C into single values, denoted as g (repressor cassette) and a (reporter cassette), respectively. Additionally, new parameters, denoted as β (leakiness) and n (Hill coefficient), were introduced to represent the central promoter repressibility (i.e., the leakiness of pRx at maximum repression) and the cooperativity of the repressor binding the output promoter pRx (i.e., the affinity of Rx to bind pRx after other Rx was bound), respectively (Figure 1B). This approach yielded identifiable values that could be fitted to our data and also elucidated mechanistic details inherent to the genetic construct. As a result, for each characterization of a genetic inverter—where input concentration (IPTG) is linked to output expression (YFP) (Figure 1C, top)—the model estimated a specific set of values for the four parameters (a, g, β, n) (Figure 1C, bottom). Increasing or decreasing each parameter directly affects inverter functions (Figure 1D), making them suitable for examining experimental results that display differences in curves. For example, increasing parameter a shifts inverter curves upwards, particularly at low input (IPTG) values, while high input values result in similar output responses. This occurs because parameter a serves as a proxy for the YFP expression machinery; thus, the stronger it is, the more responsive it becomes to low IPTG values (i.e., output 1), while its inhibited state exhibits only a mild increase. Conversely, parameter g shifts the curve downwards while maintaining the initial point at low IPTG values relatively constant. The latter happens also in the effect of parameter β but shifting the curve upwards. Parameter n exhibits a nonlinear impact on the circuit, consistent with what was expected for repression interactions (see Methods - equation 4).
These four metrics effectively described inverter variations and were utilized to analyze the results from the library, as described next.
Genetic inverters performance in Pseudomonas protegens Pf-5
The library of 10 genetic inverters was initially characterized through input-output expression analysis, with Figure 2 showing results normalized against background activity (the RPUs of pTac in contrast to those for each pRx and their experimental variability for P. protegens are depicted in Supplementary Figure S1). For each construct, the output (YFP) was measured following induction with increasing concentrations of the input (IPTG). This process generated 10 transfer functions exhibiting distinct differences while maintaining the NOT logic: low IPTG resulted in high YFP and vice versa. Next, experimental observations were used by the mathematical model to determine parameter values for the four performance metrics: a (reporter cassette), g (repressor cassette), β (leakiness), and n (Hill coefficient) (Figure 3A). Error bars on parameter values represent the standard errors of the fitted parameters, reflecting the uncertainty in their estimated values from the model fit. In other words, greater uncertainty indicates that the parameter can vary widely (within the range of the uncertainty) while still fitting the experimental data successfully. Conversely, low uncertainty means the model found a specific value for that parameter. This secondary dataset revealed performance alterations that were not easily estimated through visual inspection alone. For example, certain inverter curves in P. protegens may appear similar visually but display significant differences in metric values (e.g., LitRL1 vs QacRQ1). In this study, the four metrics generated from our combined experimental and theoretical pipeline were used as the unique fingerprint of each genetic inverter. We also applied this mathematical process to previously reported experimental data in P. putida [53], fitting the model to obtain the same parameters and facilitate comparisons.
Figure 2. Evaluation of genetic inverters in P. protegens Pf-5.
Each of the 10 genetic inverters was characterized using input concentrations of IPTG ranging from 0 to 1000 µM. Output fluorescence was measured as relative units adjusting the standards to those of P. protegens, facilitating direct comparisons. A. The dynamics of the 10 inverters within each host were plotted fixing the Y axis scale to showcase the variability of the library. B. The results of each inverter were plotted against those of P. putida to facilitate the comparison between hosts. In all plots, experimental data and model fit are depicted using dots and solid lines, respectively. Average and standard error of the mean were calculated from three independent biological replicates. The corresponding goodness-of-fit values (R2) for each gate and host are summarized in Supplementary Table S5.
Figure 3. Contextual dependencies of the genetic inverters performance metrics.
A. Scale of the parameters a (purple), g (green), β (grey) and n (blue) extracted from the fitting of the mathematical model to the experimental data for each inverter. The values calculated for each inverter in Pseudomonas protegens Pf-5 (coloured bars) are depicted against their equivalents in Pseudomonas putida KT2440 (white bars). Parameters a, β and n were fixed within the QacR and SrpR sets of variants. Parameter g, including the variable repressor expression cassette, was calculated independently. Error bars represent the standard errors of the fitted parameter estimates. B. Correlations of the inverters performance metrics between P. putida KT2440 (x-axis) and P. protegens Pf-5 (y-axis). The trajectory and the deviation for each parameter are represented as dashed lines and shadowed areas respectively. Solid dots mark the intersection for each inverter between both hosts. Parameter values except g (inverse concentration arbitrary units) in panels A and B are dimensionless.
A closer examination of parameter values and their uncertainty reveals certain features within each inverter in the library. For instance, the reporter expression activity, captured by parameter a, ranges from 0 to 2 (dimensionless parameter values), with circuits covering most values in between—a common behaviour for both P. protegens and P. putida (Figure 3A). Since the RBS and coding sequence are common to all inverters, this suggests that the variability is mainly due to changes in the promoter, thus offering a wide range of activity to meet future application requirements. The variability of parameter g, which encompasses repressor expression and Rx-pRx binding dissociation constant, ranges from 0 to 100 (inverse concentration arbitrary units). However, g displays different performance in the two hosts: while P. putida consistently shows relatively low values, P. protegens covers the entire range from very low (e.g., circuit BetIE1) to very high (e.g., circuit SrpRS1).
The fact that all circuits share a common backbone makes it easier in some cases to interpret parameter values in terms of mechanistic dynamics. For instance, changes in parameter a can be fully attributed to the transcription of the output promoter (rate C0 in the model). However, parameter g, which represents the repressor expression cassette, captures the dynamics of only one common DNA part: the IPTG-inducible promoter pTac, used in all inverters. Although this promoter initiates transcription, the RBS and coding sequence are unique to each construct. Therefore, mRNA sequences vary from one repressor to another, affecting rate K0 due to differences in length or complexity. Rates K1, K2, and K3 (Figure 1A) may also differ among the inverters because of variations in translation initiation, protein synthesis, and degradation rates of mRNAs and repressors. These changes in the mechanistic processes underlying parameter g (including the repressor-promoter dissociation constant, Kd) likely cause the observed variability in P. protegens Pf-5. To provide a mechanistic interpretation, the role of the RBS in the dynamics of parameter g is explored later in this study.
The values for parameters β and n are more uniformly distributed across the ten inverters and do not show much variability (Figure 3A). Despite this, these parameters provide crucial insights and allowed us to draw significant conclusions from their values. Parameter β captures the repressibility of the output promoter by its cognate repressor, showing slight variations between 0 and 0.2, with the remarkable exception of gate BetIE1. Unlike the other three parameters, which are unbounded (i.e., their values are not limited to any upper bound), β must have a value between 0 and 1, as it represents the percentage relationship between the on and off states of the inverter. Therefore, all inverters perform well in that regard, and the difference between the on and off states is clear and nearly as wide as possible. In the exceptional case of BetIE1, the off state (Figure 2B) is only as low as half its maximum on value, indicating that this gate is not working as well as the others in P. putida—as a result its β is highly uncertain (Figure 3A).
Parameter n showed a clear trend when comparing its values between the two hosts (Figure 3A): it is consistently higher in P. protegens Pf-5. Since this parameter captures the cooperativity of the repressor binding to the output promoter within a Hill equation, this higher trend suggests there may be a different regime of nonlinearity between the two organisms. Nonlinear dynamics are crucial for biological systems to run key motifs such as bistability or oscillations, and a consistently higher n may provide a library of DNA parts with unique nonlinear trends, aiding the rational design of complex circuits [53].
Contextual dependencies in Pseudomonas protegens Pf-5 compared to Pseudomonas putida KT2440
After determining the parameters a, g, β, and n for each construct, the next question is whether Pseudomonas protegens Pf-5 imposes modifications on them that are significantly different from those imposed by the reference strain Pseudomonas putida KT2440. While the DNA sequence of the components remains unchanged, their phenotypic response is decisively altered by contextual dependencies. Therefore, from a design perspective, given that DNA sequences can be entirely controlled (mutations and evolutionary dynamics aside), it is desirable to identify the effect of such dependencies.
In order to analyse these dependencies and compare circuit performance metrics between P. protegens and P. putida, the mathematical model was adjusted to each bacterial host by calibrating the IPTG sensor module shared by all synthetic constructs (LacI-pTac) (Supplementary Figure S2). For each of the four key parameters, circuits displayed different correlation types (Figure 3B): positive linear correlation for parameters a and n, exponential in the case of g and seemingly negative linear tendency for β. Contextual dependencies are to some extent encoded within these correlations.
For instance, something that may not be obvious looking at Figure 2B, where P. protegens shows lower initial fluorescence output values in most cases, is that the activity of the output cassettes (parameter a) is virtually the same in both hosts. This is inferred by the fact that there is a positive linear correlation in this parameter; besides, values for P. putida (x-axis) and P. protegens (y-axis) are within the same range. Two additional results strengthen this conclusion, one theoretical one experimental. Firstly, the ratios between rates C0 (output promoter transcription) and K0 (input promoter transcription) also exhibit a similar distribution across inverters and hosts (Supplementary Figure S3). Secondly, the experimentally obtained dynamic ranges (i.e., fold-change between on and off scenarios) are equal or wider to those of P. putida for 7 out of 10 inverters in P. protegens (Supplementary Figure S4).
Parameters β and n are consistently higher in P. protegens both concurrently, and linear correlations suggest this trend is stable. This indicates intriguing dynamics: higher β values make NOT gates difficult to repress at high input levels, while higher n values make NOT gates easy to repress at high input levels (Figure 1D). A potential multi-objective optimization task (beyond the scope of this study) would be to find a sequence that minimizes β while maximizing n to achieve the best possible NOT function. However, this may not be feasible, as experiments here suggest the host context influences both parameters simultaneously.
However, it is parameter g that shows the most significant performance differences due to context effects: exponential correlation and very different range values on the axis. We now take a closer look at this parameter, which accounts for the translation dynamics of the repressor.
The P. protegens Pf-5 context enhances the RBS effect over the repressor expression cassette activity
There are 5 variants within the library of inverters (2 regulated by QacR and 3 by SrpR) which only differ on the RBS sequence of the repressor. Figure 4A sketches the general architecture of these inverters and the mechanistic rates integrating the parameter g. For all constructs sharing the same repressor, the Hill function parameters a, β, and n were fixed, and only the translation rate parameter g was fitted individually to each construct to capture the effect of the RBS on repressor activity. The plots of either QacRQ1 and QacRQ2 (Figure 4B) or SrpRS1, SrpRS2 and SrpRS4 (Figure 4C) were merged for both P. protegens Pf-5 (top) and P. putida KT2440 (bottom) along with the four performance metrics for each host. The visual inspection of the inverter functions show that altering the repressor RBS sequence is enough to modify the input-output relationship—with this effect being clearly more pronounced in P. protegens Pf-5. Moreover, this correlates with the changes attributed by the model to the activity (parameter g) of the repressor expression cassette.
Figure 4. Differential impact of RBS sequence over parameter g.
A. Schematic representation of the inverters architecture highlighting the main genetic elements integrating the repressor expression cassette, thus impacting the parameter g (top); draft of the QacR and SrpR repressor expression cassette variants (middle); detail on the mechanistic processes involved in the repressor expression cassette activity (i.e., transcription, translation, mRNA degradation, protein degradation and dissociation constant), and their rates (K0, K1, K2, K3 and Kd) showcasing protein synthesis (K1 – in red) as a key process (bottom). B. Experimental data from three independent biological replicates and model fits of the QacR-based inverter variants in P. protegens Pf-5 (green) and P. putida KT2440 (red) are depicted using dots and solid lines, respectively. The 4 testing metrics a, g, β, and n were calculated and presented in bar plots - g1 and g2 correspond to Q1 and Q2 RBS variants respectively. Error bars represent the standard errors of the fitted parameter estimates. C. Experimental data from three independent biological replicates and model fits of the SrpR-based inverter variants in P. protegens Pf-5 (green) and P. putida KT2440 (red) are depicted using dots and solid lines, respectively. The 4 testing metrics a, g, β, and n were calculated and presented in bar plots - g1, g2 and g4 correspond to S1, S2 and S4 RBS variants respectively. Error bars represent the standard errors of the fitted parameter estimates. Parameter values except g (inverse concentration arbitrary units) in panels B and C are dimensionless using intervals: 0-2 for a, 0-2 (x102) for g, 0-4 (x10−1) for β, and 0-3 for n. D. Nucleotide sequences of the RBS variants and their predicted translation initiation rate for the QacR-based inverters. E. Nucleotide sequences of the RBS variants and their predicted translation initiation rate for the SrpR-based inverters. Positions with nucleotide substitutions or deletions are highlighted in red in panels D and E.
The predicted translation initiation rates (TIRs) of the mRNA variants for QacR and SrpR were calculated through the De Novo DNA [54, 55, 56] algorithms (Supplementary Table S3). An extract of the relevant information including the name, differential sequence and TIR is also presented in Figure 4D (QacR) and 4E (SrpR)—of note that the RBS Calculator produced identical values for both bacterial hosts. In general terms, if we compare the predicted TIRs with the g values calculated through our experimental and theoretical pipeline, both (TIR and g values) follow the same trend: the higher the predicted TIR, the higher also the value obtained for g. This result supports our model as a tool to evaluate functional modules within genetic circuits while inferring the impact of individual rates (e.g., protein translation) over the modules and the circuits themselves.
Modulation of protein accumulation or circuit performance through RBS modifications is not unconventional. However, here we go a step further by providing a framework to study the effect of such modifications in a context-dependent manner. Our results suggest that there is an intrinsic contextual effect over RBS modifications on phenotypic variance. As a consequence, the cell’s translation machinery can be leveraged to control how RBS modifications influence input-output functions as desired. Indeed, RBS changes produce an estimated 40-fold (S1/S2) increase in the overall repressor expression cassette activity for SrpR when implemented as a P. protegens Pf-5 logic circuit, in contrast to the modest (5-fold) phenotypic response observed within the context of P. putida.
Nonetheless, this result could be attributed to more than the TIR alone. For instance, translation processivity could favor the RBS effect. We calculated the codon adaptation index (CAI) for the 7 NOT repressors, YFP, and LacI in both bacterial species [57]. The resulting CAI (Supplementary Table S4) is lower in P. protegens Pf-5 for all of them. These results raise intriguing questions: Are RBS-related dependencies linked to differences in protein translation across hosts, or are they buffered by other factors like mRNA levels and protein degradation? While these remain unanswered, our findings suggest that combining RBS designs with host context is key to improving genetic circuit portability. A recent study also supports this, showing that while RBS modulation fine-tunes function within a host, considering host context can shift overall circuit performance [58]. Despite some uncertainties about the underlying mechanisms, we can leverage this phenomenon to engineer circuits and even predict their function in different hosts with varying intrinsic properties.
Prediction of circuit portability between P. putida KT2440 and P. protegens Pf-5
Portability between bacterial hosts remains a major challenge for the application of synthetic genetic circuits. What functions well in one host often fails in another. The ability to predict a circuit’s performance in a new host would be highly valuable to the field. Here, we developed a predictor tool based on the four parameters a, g, β, and n.
Figures 5A–C illustrate the process of predicting the function of a circuit tested in P. putida KT2440 and applying it to P. protegens Pf-5. The LitRL1 gate was selected for prediction (Figure 5A). The parameter values of the remaining nine gates in the library were correlated between P. putida and P. protegens, providing a setup to interpolate P. putida LitRL1 parameter values and predict the parameters for this gate in P. protegens (Figure 5B). Importantly, LitRL1 was left out of this correlation process. Leave-One-Out Cross-Validation (LOOCV) [59] is a model evaluation technique where the model is trained on n-1 samples, with one left out as the test set. Figure 5C shows the prediction of how LitRL1 will perform in P. protegens (black line) compared against the experimental data of LitRL1 in that host (green dots), showing good agreement between them.We assessed prediction accuracy by comparing predicted and experimental fits, with an R2 value above 0.8 for this case (Supplementary Table S6), where 1 represents the best possible outcome.
Figure 5. Roadmap for the prediction of circuit portability between hosts.
Panels A, B and C represent a visual breakdown of the process from experimental data to inter-host prediction of inverter dynamics. A. Experimental data and model fit of the LitRL1 inverter in P. putida KT2440 are depicted using dots and solid lines, respectively. Parameter values except g (inverse concentration arbitrary units), represented as bars are dimensionless using intervals: 0-2 for a, 0-2 (x102) for g, 0-4 (x10−1) for β, and 0-3 for n. Error bars represent the standard errors of the fitted parameter estimates. B. Interpolation of the parameters (a, g, β and n) from the LitRL1 inverter in P. putida KT2440 into the correlations generated with the other nine gates (all but LitRL1); the predicted and the experimentally-assessed intersections for each parameter between P. putida KT2440 (x-axis) and P. protegens Pf-5 (y-axis) are represented as black and white dots, respectively. C. The predicted LitRL1 inverter dynamics in P. protegens Pf-5 is plotted along with the experimental data as a black dashed line and green dots, respectively. D. Predicted inverter dynamics and experimental data from three independent biological replicates represented as black dashed lines and green dots for each of the 10 NOT logic circuits included in this study using the correlations of the other nine.
This process was repeated for all 10 gates in the library, completing the LOOCV technique to thoroughly assess predictability (full results in Supplementary Figure S5). As shown in Figure 5D, four gates were exceptionally well predicted, with confidence metrics (Supplementary Table S6) exceeding 0.8 (out of 1) for LitRL1, QacRQ1, SrpRS2, and SrpRS4. SrpRS1 had an intermediate score of 0.69, while the remaining gates showed low to very low predictability. These results mark a significant step forward in predicting genetic circuit performance in P. protegens Pf-5.
Besides, a simulation of 1000 sets of random inverter circuits was performed to confirm or reject the model-based predictions. In each set, all 10 gates were assigned parameter values sampled uniformly within the following ranges: a ∈ [0, 2], g ∈ [0, 200], β ∈ [0, 1], and n ∈ [0, 2.5]. Each random circuit was evaluated using the same confidence metric (R2) as applied to the experimental gates, by comparing predicted and experimental responses. The probability of achieving confidence metrics within the range observed in this study was below 0.05 after excluding the HlyIIRH1 and BetIE1 outliers (Supplementary Figure S6). The exclusion of these two gates was decided as follows, only for this particular test. Their prediction coefficient of determination after fitting to the experimental data is far below that of the other gates in the library (-17.85 for BetIE1 and -51.47 for HlyIIRH1)-while all others fall roughly between 1 and -1 (Supplementary Table S6). Taking a closer look at the parameter correlations (Figure 3B and Supplementary Figure S5), we observe that parameters g (BetIE1) and a (HlyIIRH1) are very close to the bottom left corner. In this scenario, deviations (even small ones) between predicted and actual parameter values result in significant relative deviations. Indeed, BetIE1 parameter g for P. protegens is only slightly above the intersection between the two hosts (Supplementary Figure S5). However, in combination with an overestimated β seems enough to provide a curve that significantly deviates from its experimental characterization (Figure 5D). A similar scenario is observed for the HlyIIRH1 parameter a. The overestimation of this parameter results in a much higher predicted ON state than its experimental counterpart. Altogether, these results validate our roadmap towards the prediction of circuit portability to novel synthetic biology chassis.
Model-guided assembly of a digital YES gate in P. protegens Pf-5
For two gates to be coupled, that is, the output of the first serves as the input to the second—they must be mutually compatible. Unlike electronic systems with well-defined binary states, genetic circuits exhibit noisy behavior and variable input/output levels, especially in host-dependent contexts. This variability complicates the design of reliable gate interactions.
As a case study, guided by our modelling results, we built a YES function by connecting two NOT gates in series (Figure 6A). Several design criteria were established. For the first inverter (Rx-pRx; Figure 6A), the minimum activity of pRx at low IPTG (inverter ON state) was set at 0.5 RPU, based on the input promoter levels required for full repression of downstream gates in P. protegens Pf-5 (Figure 2 and Supplementary Figures S1 and S2). Similarly, the maximum activity of pRx at high IPTG (inverter off state) was set at 0.1 RPU to switch the second inverter (Ry-pRy; Figure 6A) to its on state. Both inverters were required to exhibit low leakiness, indicated by low β value (Figure 3A), and a dynamic range of at least threefold to allow clear on/off discrimination (Supplementary Figure S4). Among the candidates, only BetIE1, QacRQ1, and LitRL1 met the experimental criteria for the on/off output promoter activities and the dynamic range.
Figure 6. Model-guided host-dependent digital YES gate.
A. Schematic representation of the YES gate architecture (top) illustrating the serial arrangement of two transcriptional inverters regulated by QacR and SrpR to implement a digital YES logic function (bottom). B. Depiction of the logical function glyph, the RBS variant of the QacR (Rx) inverter, and the bacterial host combinatorial design to achieve the predicted digital YES function (top) and the corresponding experimental characterization using input concentrations of IPTG ranging from 0 to 1000 µM (bottom). C. Alternative RBS-host combinations (right) and their corresponding experimental characterizations using input concentrations of IPTG ranging from 0 to 1000 µM (left). Same RBS (Q1), different host (P. putida) (top); different RBS (Q2) and host (P. putida) (middle); different RBS (Q2) same host (P. protegens) (bottom). In panels B and C, output expression levels are shown in Relative Promoter Units (RPUs) for P. protegens Pf-5 (green) and P. putida KT2440 (red) as dots connected with straight solid lines for visual clarity. Y-axis scale was fixed to facilitate visual comparison. Average and standard error of the mean were calculated from three independent biological replicates.
To achieve digital-like behavior, we aimed to leverage the host-specific ribosome binding site (RBS) effects, captured by the model parameter g, which significantly influenced the inverters dynamics in P. protegens Pf-5 (Figure 4). QacRQ1 exhibited the highest g value (Figure 3A), shifting the response curve downward and leftward (Figure 1D), and was thus selected as the first inverter. The availability of two QacR variants with different TIRs (Q1 and Q2 - Figure 4) enabled further investigation into whether RBS effects observed in NOT gates (Figure 4) are conserved in the YES gate configuration. Selection of the second inverter focused on compatibility with the output of QacRQ1. SrpR variants showed the lowest β values (Figure 3A), and SrpRS2 exhibited the highest on state activity in P. protegens Pf-5 (Supplementary Figure S1). This gate is fully repressed at 0.5 RPU and reaches near-maximum activation at approximately 0.1 RPU, indicating compatibility with the QacRQ1 output profile.
The target YES gate QacRQ1SrpRS2 and its RBS variant QacRQ2SrpRS2 were cloned into the pSEVA221 backbone and conjugated to P. protegens Pf-5 (Supplementary Tables S1 and S2) to experimentally validate the expected digital YES function of the QacRQ1:Pf-5 tandem (Figure 6B). These gates were also tested in P. putida KT2440 (Figure 6C - top and middle) to assess the influence of host background on digital performance. The first objective was to determine whether the QacRQ1 and SrpRS2 inverters could be combined to form a functional YES gate, as predicted. In both hosts, a clear activation of the output promoter (pRy) was observed in response to increasing IPTG concentrations, which confirms the expected YES functionality.
Moreover, QacRQ1:Pf-5 combination is required in order to achieve sharper digital performance (as opposed to gradual analog-like dynamics). The output promoter (pSrpR) was in fact strongly activated at low IPTG (50-200 µM) (Figure 6B), recovering the maximum output promoter activity observed for the SrpRS2 inverter (Supplementary Figure S1). Tests with different combinations of RBS:host (Figure 6C) failed to provide such digital performance. The QacRQ1:KT2440 and QacRQ2:KT2440 tandems showed weaker and more gradual activation with only a slight improvement in responsiveness from the Q1 RBS. Therefore, gates that were unsuitable in P. putida could be reused in P. protegens for ideal performance. On its side, QacRQ2:Pf-5 resulted in a barely functional YES gate, once again demonstrating the RBS effect in this host (Figure 6C - bottom).
In summary, the model proved instrumental in selecting suitable gate pairs to construct a larger circuit with the predefined host-dependent dynamics.
3. Conclusion
Studying non-model species is crucial for applying synthetic biology tools and engineering them effectively. In this study, we characterized a library of genetic circuits (NOT logic gates) in the soil bacterium Pseudomonas protegens Pf-5, examining their dynamics, host-circuit interactions, and developing parameterized mathematical models specific to this organism. Additionally, we developed a method to predict the performance of circuits previously tested in Pseudomonas putida KT2440 within this new chassis.
The mathematical model was able to quantify the activity of key circuit components: the reporter cassette (for readout), repressor cassette (implementing the NOT logic), promoter leakiness, and the Hill coefficient (for repressor interactions). Fluorescence measurements for each gate were fitted to these parameters, yielding unique values for each Rx-pRx pair. Analysis of these values led to novel mechanistic insights about P. protegens Pf-5 that are crucial for the success of bioengineering efforts.
Our results suggest that the LacI-pTac inducible system is highly sensitive to very low IPTG concentrations in this host. Given that low-dose systems are advantageous but challenging to engineer, as recently in E. coli [60], this trait positions P. protegens as a promising chassis to this end. Furthermore, RBS sequences with high TIR efficiently initiate translation in this host, which, along with the system’s sensitivity to low IPTG levels, may explain the narrower transition phase compared to the synthetic biology chassis P. putida KT2440. This characteristic is beneficial for applications requiring a digital input-output response [61, 62]. The consistently high Hill coefficients across all circuit models suggest a strong non-linear behaviour in this bacterium, which is key dynamical feature to consider when designing and implementing genetic circuits. All this information—and more that is described in the Results section—aids engineering and enhances predictability in this unconventional chassis.
Bioengineering applications targeting ecosystems need environmental microbes as workhorses. We advocate for the synthetic biology potential of Pseudomonas protegens Pf-5 to engineer beneficial traits in soil environments.
4. Materials and Methods
Strains, media and general culture conditions
The bacterial strains used in this study are detailed in Supplementary Tables S1 and S2. As a general overview, Escherichia coli strains with inverters from the pSEVA221 library previously used by Tas et al [21] or the YES gates designed in this study were used as donor strains in triparental matings with P. protegens Pf-5 or P. putida KT2440 as recipient strains in a similar manner to that previously reported [63]. In brief, E. coli carrying either the control plasmids pSEVA221:1201, pSEVA221:1717, pSEVA221:1818, the pSEVA221:Inverters or the pSEVA221:YES (Supplementary Table S1) were mixed with the helper strain E. coli HB101/pRK600 and the recipient strain (namely, P. protegens Pf-5 or P. putida KT2440) in a 1:1:1 ratio based on their OD600. All cells were washed with 10 mM MgSO4 prior to mixing to reduce the amount of antibiotics carried from the culture medium. The cell suspensions were then centrifuged for 5 minutes at 5000g, resuspended in 10–20 µL of 10 mM MgSO4 and a single drop was laid in the center of a Luria-Bertani agar plate. Plates were incubated at 30°C for 20-22 h and the whole cell-patch was resuspended in 1 ml of 10 mM MgSO4. Undiluted cell suspensions or up to 10−3 dilution were plated in M9-citrate agar plus kanamycin 50 µg/ml. Colonies were clearly visible after 24-48 h of incubation at 30ºC. Whole Plasmid Sequencing was performed by Plasmidsaurus using Oxford Nanopore Technology with custom analysis and annotation.
Cloning of YES gates
The plasmid pSEVA221:QacRQ1SrpRS2 was built using the Gibson Assembly Master Mix (NEB) and the plasmids pSEVA221:QacRQ1 (Tas15) and pSEVA221:SrpRS2 (Tas18) as templates (Supplementary table S1). In brief, 3 fragments were amplified by PCR with Phusion High-Fidelity DNA Polymerase (NEB) (2 from Tas15, primers 470/471 and 466/488, and 1 from Tas18, primers 469/486 - see details in Supplementary Table S7). The three fragments were purified using the Monarch PCR and DNA Cleanup Kit (NEB), quantified by nanodrop and mixed in equimolar amounts (0.06 pmol each). The 20 µl reaction was incubated at 50ºC for 1 hour and 2 µl transformed into NEB Stable Competent E. coli (NEB). Transformants were selected on LB agar plus kanamycin 50 µg/ml and confirmed by whole plasmid sequencing by Plasmidsaurus using Oxford Nanopore Technology with custom analysis and annotation prior mating. The QacRQ2SrpRS2 YES gate was fully synthesized and subcloned into the pSEVA221 PacI/SpeI restriction sites by GeneArt (Thermofisher). The resulting plasmid was subsequently transformed into E. coli Thermo Scientific DH5α competent cells and transformants were selected on LB agar plus kanamycin 50 µg/ml prior mating.
Phenotypic characterization
Induction assays for flow cytometry assessment of gate performance
Bacterial strains of P. protegens Pf-5 (Supplementary Table S2) were grown in M9 medium supplemented with citrate as previously described [64]. In summary, saturated overnight cultures were diluted 10-fold and 3 µl were inoculated into the wells of a microtiter plate to a final volume of 200 µl per well, with IPTG concentrations ranging 0 to 1000 µM and incubated at 30°C with shaking at 400 rpm for 24 h. Cultures were then kept on ice for the rest of the procedure. YFP fluorescence distribution of each sample was measured with a Miltenyi Biotec MACSQuant VYB flow cytometer at channel B1 with an excitation of 488 nm and emission of 525/50 nm. For each sample 30 thousand events were collected with singlet gating. Calibration was done by using MACSQuant Calibration Beads. An example of the gating strategy is shown in Supplementary Figure S7.
Data analysis
Flow cytometry
The medians for the YFP (B1 channel) fluorescence distribution for each P. protegens Pf-5 listed at Supplementary Table S2 were calculated using the MACSQuantify software V2.13.3. The singlet gating dataset for each IPTG concentration (ranging from 0 to 1000 µM) was analyzed.
Assessment of the relative promoter units (RPU)
The medians obtained from the flow cytometry data analysis were included in the following formula to assess the relative promoter units (RPU):
| (1) |
where Y FP stands for the median fluorescence value of the inverter that is to be standardised into RPU, Y FP1201 is the median fluorescence value of the autofluorescence plasmid and Y FP1717 indicates the median fluorescence value from the standardisation plasmid.
Control Model
A typical Hill function where IPTG induces the production of YFP has been used for the control gate (1818).
| (2) |
The parameters of the control model are:
| (3) |
where γ is the maximum theoretical Y FPc, Kdc is the dissociation constant and nc is the Hill coefficient. Kdc and nc values are used for fitting the NOT gates. Leakage (β) was neglected in the control model, since Supplementary Figure S2 shows that at IPTG = 0 the input signal is essentially absent.
Model
NOT gates are modelled by:
| (4) |
Given our data, we cannot differentiate between some parameters such as C0 and C1, that is why we group certain parameters into other new parameters, so that the fit is performed concisely and the result is reliable, avoiding overfitting. Thus:
| (5) |
β represents the gate leakage, defined as the normalized YFP output at saturating IPTG. the parameters of the model used are:
| (6) |
where a and g group undecidable parameters, β refers to the leakage percentage (from 0 to 1), n is the Hill coefficient, and Kdc and nc are the Hill parameters of the control gate. Kdc and nc are fixed parameters for each host given by the control model (Supplementary Figure S2).
Fitting methods
The model fitting was performed using non-linear least squares minimization. Specifically, we used the minimize function from the lmfit package with method =′ least − squares′ in Python. The standard error was the uncertainty measure used in each fitted parameter which was obtained directly from the fitting process. After fitting, these uncertainties are provided by the stderr attribute of the parameter objects in the lmfit results.
Supplementary Material
Acknowledgments
This work was supported by the the ECCO (ERC-2021-COG-101044360) Contract of the EU, grants MULTI-SYSBIO (PID2020-117205GA-I00), BIOELECTRIC (CNS2022-135951), and MULTI-SYNBIO (PID2023-152470NB-I00) funded by MICIU/AEI/ 10.13039/501100011033, grant CONTEXT (2019-T1/BIO-14053) of the Comunidad de Madrid, and grant CEX2020-000999-S to the CBGP by the Severo Ochoa Program for Centres of Excellence / AEI.
Code and data availability
The code to reproduce simulations, along with parameter values, can be publicly accessed at https://github.com/PabloJap/ Plasmid maps as GenBank files in Supplemenary Material.
References
- [1].Ausländer S, Ausländer D, Fussenegger M. Synthetic biology—the synthesis of biology. Angewandte Chemie International Edition. 2017;56(23):6396–6419. doi: 10.1002/anie.201609229. [DOI] [PubMed] [Google Scholar]
- [2].Amos M, Goñi-Moreno A. Cellular computing and synthetic biology. Computational matter. 2018:93–110. [Google Scholar]
- [3].Benenson Y. Biomolecular computing systems: principles, progress and potential. Nature Reviews Genetics. 2012;13(7):455–468. doi: 10.1038/nrg3197. [DOI] [PubMed] [Google Scholar]
- [4].Grozinger L, Amos M, Gorochowski TE, Carbonell P, Oyarzún DA, Stoof R, Fellermann H, Zuliani P, Tas H, Goñi-Moreno A. Pathways to cellular supremacy in biocomputing. Nature communications. 2019;10(1):5250. doi: 10.1038/s41467-019-13232-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].Goñi-Moreno A. Biocomputation: Moving beyond Turing with living cellular computers. Communications of the ACM. 2024;67(6):70–77. [Google Scholar]
- [6].Tas H, Grozinger L, Goñi-Moreno A, de Lorenzo V. Automated design and implementation of a NOR gate in Pseudomonas putida. Synthetic Biology. 2021;6(1):ysab024. doi: 10.1093/synbio/ysab024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Nielsen AA, Der BS, Shin J, Vaidyanathan P, Paralanov V, Strychalski EA, Ross D, Densmore D, Voigt CA. Genetic circuit design automation. Science. 2016;352(6281):aac7341. doi: 10.1126/science.aac7341. [DOI] [PubMed] [Google Scholar]
- [8].Friedland AE, Lu TK, Wang X, Shi D, Church G, Collins JJ. Synthetic gene networks that count. Science. 2009;324(5931):1199–1202. doi: 10.1126/science.1172005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Lugagne J-B, Sosa Carrillo S, Kirch M, Köhler A, Batt G, Hersen P. Balancing a genetic toggle switch by real-time feedback control and periodic forcing. Nature communications. 2017;8(1):1–8. doi: 10.1038/s41467-017-01498-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Filo M, Chang C-H, Khammash M. Biomolecular feedback controllers: from theory to applications. Current Opinion in Biotechnology. 2023;79:102882. doi: 10.1016/j.copbio.2022.102882. [DOI] [PubMed] [Google Scholar]
- [11].Siuti P, Yazbek J, Lu TK. Synthetic circuits integrating logic and memory in living cells. Nature biotechnology. 2013;31(5):448–452. doi: 10.1038/nbt.2510. [DOI] [PubMed] [Google Scholar]
- [12].Brooks SM, Alper HS. Applications, challenges, and needs for employing synthetic biology beyond the lab. Nature Communications. 2021;12(1):1390. doi: 10.1038/s41467-021-21740-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Meng F, Ellis T. The second decade of synthetic biology: 2010–2020. Nature Communications. 2020;11(1):5174. doi: 10.1038/s41467-020-19092-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Chubukov V, Mukhopadhyay A, Petzold CJ, Keasling JD, Martín HG. Synthetic and systems biology for microbial production of commodity chemicals. NPJ systems biology and applications. 2016;2(1):1–11. doi: 10.1038/npjsba.2016.9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].de Lorenzo V, Prather KL, Chen G-Q, O’Day E, von Kameke C, Oyarzuń DA, Hosta-Rigau L, Alsafar H, Cao C, Ji W, et al. The power of synthetic biology for bioproduction, remediation and pollution control: the UN’s Sustainable Development Goals will inevitably require the application of molecular biology and biotechnology on a global scale. EMBO reports. 2018;19(4):e45658. doi: 10.15252/embr.201745658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Danchin A. Bacteria as computers making computers. FEMS microbiology reviews. 2008;33(1):3–26. doi: 10.1111/j.1574-6976.2008.00137.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].Naseri G, Koffas MA. Application of combinatorial optimization strategies in synthetic biology. Nature communications. 2020;11(1):2446. doi: 10.1038/s41467-020-16175-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Brödel AK, Jaramillo A, Isalan M. Engineering orthogonal dual transcription factors for multi-input synthetic promoters. Nature communications. 2016;7(1):13858. doi: 10.1038/ncomms13858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Hammer K, Mijakovic I, Jensen PR. Synthetic promoter libraries–tuning of gene expression. Trends in biotechnology. 2006;24(2):53–55. doi: 10.1016/j.tibtech.2005.12.003. [DOI] [PubMed] [Google Scholar]
- [20].Moore SJ, Lai H-E, Kelwick RJ, Chee SM, Bell DJ, Polizzi KM, Freemont PS. EcoFlex: a multifunctional MoClo kit for E. coli synthetic biology. ACS Synthetic Biology. 2016;5(10):1059–1069. doi: 10.1021/acssynbio.6b00031. [DOI] [PubMed] [Google Scholar]
- [21].Tas H, Grozinger L, Stoof R, de Lorenzo V, Goñi-Moreno A. Contextual dependencies expand the re-usability of genetic inverters. Nature communications. 2021;12(1):355. doi: 10.1038/s41467-020-20656-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].Boo A, Ellis T, Stan G-B. Host-aware synthetic biology. Current Opinion in Systems Biology. 2019;14:66–72. [Google Scholar]
- [23].Chen Y, Zhang S, Young EM, Jones TS, Densmore D, Voigt CA. Genetic circuit design automation for yeast. Nature Microbiology. 2020;5(11):1349–1360. doi: 10.1038/s41564-020-0757-2. [DOI] [PubMed] [Google Scholar]
- [24].Young KD. The selective value of bacterial shape. Microbiology and molecular biology reviews. 2006;70(3):660–703. doi: 10.1128/MMBR.00001-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Gibson B, Wilson DJ, Feil E, Eyre-Walker A. The distribution of bacterial doubling times in the wild. Proceedings of the Royal Society B. 2018;285(1880):20180789. doi: 10.1098/rspb.2018.0789. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Darlington AP, Kim J, Jiménez JI, Bates DG. Engineering translational resource allocation controllers: mechanistic models, design guidelines, and potential biological implementations. ACS Synthetic Biology. 2018;7(11):2485–2496. doi: 10.1021/acssynbio.8b00029. [DOI] [PubMed] [Google Scholar]
- [27].Kim J, Goñi-Moreno A, Calles B, de Lorenzo V. Spatial organization of the gene expression hardware in Pseudomonas putida. Environmental microbiology. 2019;21(5):1645–1658. doi: 10.1111/1462-2920.14544. [DOI] [PubMed] [Google Scholar]
- [28].de Lorenzo V, Krasnogor N, Schmidt M. For the sake of the bioeconomy: define what a synthetic biology chassis is! New Biotechnology. 2021;60:44–51. doi: 10.1016/j.nbt.2020.08.004. [DOI] [PubMed] [Google Scholar]
- [29].Ruiz N, Silhavy TJ. How Escherichia coli became the flagship bacterium of molecular biology. Journal of bacteriology. 2022;204(9):e00230–22. doi: 10.1128/jb.00230-22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [30].Jones EM, Marken JP, Silver PA. Synthetic microbiology in sustainability applications. Nature Reviews Microbiology. 2024;22:345–359. doi: 10.1038/s41579-023-01007-9. [DOI] [PubMed] [Google Scholar]
- [31].Liew FE, Nogle R, Abdalla T, Rasor BJ, Canter C, Jensen RO, Wang L, Strutz J, Chirania P, De Tissera S, Mueller AP, et al. Carbon-negative production of acetone and isopropanol by gas fermentation at industrial pilot scale. Nature Biotechnology. 2022;40:335–344. doi: 10.1038/s41587-021-01195-w. [DOI] [PubMed] [Google Scholar]
- [32].Torella JP, Gagliardi CJ, Chen JS, Bediako DK, Colón B, Way JC, Silver PA, Nocera DG. Efficient solar-to-fuels production from a hybrid microbial–water-splitting catalyst system. PNAS. 2015;112:2337–2342. doi: 10.1073/pnas.1424872112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33].Venkataraman M, Yñigez-Gutierrez A, Infante V, MacIntyre A, Fernandes-Junior PI, Ané J-M, Pfleger B. Synthetic biology toolbox for nitrogen-fixing soil microbes. ACS Synthetic Biology. 2023;12:3623–3634. doi: 10.1021/acssynbio.3c00414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [34].Martinez-Feria R, Simmonds MB, Ozaydin B, Lewis S, Schwartz A, Pluchino A, McKellar M, Gottlieb SS, Kayatsky T, Vital R, Mehlman SE, et al. Genetic remodeling of soil diazotrophs enables partial replacement of synthetic nitrogen fertilizer with biological nitrogen fixation in maize. Scientific Reports. 2024 Nov;14:27754. doi: 10.1038/s41598-024-78243-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [35].Liu Y, Liu L, Li J, Du G, Chen J. Synthetic biology toolbox and chassis development in Bacillus subtilis. Trends in biotechnology. 2019;37(5):548–562. doi: 10.1016/j.tibtech.2018.10.005. [DOI] [PubMed] [Google Scholar]
- [36].Martínez-García E, de Lorenzo V. Pseudomonas putida as a synthetic biology chassis and a metabolic engineering platform. Current Opinion in Biotechnology. 2024;85:103025. doi: 10.1016/j.copbio.2023.103025. [DOI] [PubMed] [Google Scholar]
- [37].Ueki T, Nevin KP, Woodard TL, Lovley DR. Genetic switches and related tools for controlling gene expression and electrical outputs of Geobacter sulfurreducens. Journal of Industrial Microbiology and Biotechnology. 2016;43(11):1561–1575. doi: 10.1007/s10295-016-1836-5. [DOI] [PubMed] [Google Scholar]
- [38].Grozinger L, Heidrich E, Goñi-Moreno A. An electrogenetic toggle switch model. Microbial Biotechnology. 2023;16(3):546–559. doi: 10.1111/1751-7915.14153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [39].Blount BA, Weenink T, Ellis T. Construction of synthetic regulatory networks in yeast. FEBS letters. 2012;586(15):2112–2121. doi: 10.1016/j.febslet.2012.01.053. [DOI] [PubMed] [Google Scholar]
- [40].Lee J, Kim S, Jung H, Koo B-K, Han JA, Lee H-S. Exploiting bacterial genera as biocontrol agents: Mechanisms, interactions and applications in sustainable agriculture. Journal of Plant Biology. 2023;66:485–498. [Google Scholar]
- [41].Müller T, Behrendt U. Exploiting the biocontrol potential of plant-associated pseudomonads – a step towards pesticide-free agriculture? Biological Control. 2021;155:104538 [Google Scholar]
- [42].Ryu M-H, Zhang J, Toth T, Khokhani D, Geddes BA, Mus F, Garcia-Costas A, Peters JW, Poole PS, Ané J-M, Voigt CA. Control of nitrogen fixation in bacteria that associate with cereals. Nature Microbiology. 2020;5:314–330. doi: 10.1038/s41564-019-0631-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [43].Jing X, Cui Q, Li X, Yin J, Ravichandran V, Pan D, Fu J, Tu Q, Wang H, Bian X, et al. Engineering Pseudomonas protegens pf-5 to improve its antifungal activity and nitrogen fixation. Microbial Biotechnology. 2020;13(1):118–133. doi: 10.1111/1751-7915.13335. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [44].Boo A, Toth T, Yu Q, Pfotenhauer A, Fields BD, Lenaghan SC, Stewart CN, Jr, Voigt CA. Synthetic microbe-to-plant communication channels. Nature Communications. 2024;15:1817. doi: 10.1038/s41467-024-45897-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [45].Espeso DR, Martínez-García E, de Lorenzo V, Goñi-Moreno A. Physical forces shape group identity of swimming Pseudomonas putida cells. Frontiers in microbiology. 2016;7:217689. doi: 10.3389/fmicb.2016.01437. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [46].Stoof R, Wood A, Goñi-Moreno A. A model for the spatiotemporal design of gene regulatory circuits. ACS synthetic biology. 2019;8(9):2007–2016. doi: 10.1021/acssynbio.9b00022. [DOI] [PubMed] [Google Scholar]
- [47].Sequeiros C, Vázquez C, Banga JR, Otero-Muras I. Automated design of synthetic gene circuits in the presence of molecular noise. ACS Synthetic Biology. 2023;12(10):2865–2876. doi: 10.1021/acssynbio.3c00033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [48].Marchisio MA, Stelling J. Computational design of synthetic gene circuits with composable parts. Bioinformatics. 2008;24(17):1903–1910. doi: 10.1093/bioinformatics/btn330. [DOI] [PubMed] [Google Scholar]
- [49].Crowther M, Wipat A, Goñi-Moreno A. A network approach to genetic circuit designs. ACS synthetic biology. 2022;11(9):3058–3066. doi: 10.1021/acssynbio.2c00255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [50].Goñi-Moreno A, Carcajona M, Kim J, Martinez-Garcia E, Amos M, de Lorenzo V. An implementation-focused bio/algorithmic workflow for synthetic biology. ACS synthetic biology. 2016;5(10):1127–1135. doi: 10.1021/acssynbio.6b00029. [DOI] [PubMed] [Google Scholar]
- [51].Jones TS, Oliveira SM, Myers CJ, Voigt CA, Densmore D. Genetic circuit design automation with Cello 2.0. Nature protocols. 2022;17(4):1097–1113. doi: 10.1038/s41596-021-00675-2. [DOI] [PubMed] [Google Scholar]
- [52].Lopatkin AJ, Collins JJ. Predictive biology: modelling, understanding and harnessing microbial complexity. Nature Reviews Microbiology. 2020 Sep;18:507–520. doi: 10.1038/s41579-020-0372-5. [DOI] [PubMed] [Google Scholar]
- [53].Stoof R, Goñi-Moreno A. Modelling co-translational dimerization for programmable nonlinearity in synthetic biology. Journal of the Royal Society Interface. 2020;17(172):20200561. doi: 10.1098/rsif.2020.0561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [54].Espah Borujeni A, Channarasappa AS, Salis HM. Translation rate is controlled by coupled trade-offs between site accessibility, selective RNA unfolding and sliding at upstream standby sites. Nucleic Acids Research. 2013;42:2646–2659. doi: 10.1093/nar/gkt1139. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [55].Espah Borujeni A, Salis HM. Translation initiation is controlled by RNA folding kinetics via a ribosome drafting mechanism. Journal of the American Chemical Society. 2016;138:7016–7023. doi: 10.1021/jacs.6b01453. [DOI] [PubMed] [Google Scholar]
- [56].Espah Borujeni A, Cetnar D, Farasat I, Smith A, Lundgren N, Salis HM. Precise quantification of translation inhibition by mRNA structures that overlap with the ribosomal footprint in N-terminal coding sequences. Nucleic Acids Research. 2017;45:5437–5448. doi: 10.1093/nar/gkx061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [57].Puigbò P, Bravo IG, Garcia-Vallve S. CAIcal: A combined set of tools to assess codon usage adaptation. Biology Direct. 2008;3:38. doi: 10.1186/1745-6150-3-38. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [58].Tin Chat Chan D, Winter L, Bjerg J, Krsmanovic S, Baldwin GS, Bernstein HC. Fine tuning genetic circuits via host context and RBS modulation. ACS Synthetic Biology. 2025;14:193–205. doi: 10.1021/acssynbio.4c00551. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [59].Molinaro AM, Simon R, Pfeiffer RM. Prediction error estimation: a comparison of resampling methods. Bioinformatics. 2005;21(15):3301–3307. doi: 10.1093/bioinformatics/bti499. [DOI] [PubMed] [Google Scholar]
- [60].Zhao M, Tao X-Y, Wang F-Q, Ren Y-H, Wei D-Z. Establishment of a low-dosage-IPTG inducible expression system construction method in Escherichia coli. Journal of Basic Microbiology. 2018;9:806–810. doi: 10.1002/jobm.201800160. [DOI] [PubMed] [Google Scholar]
- [61].Roquet N, Lu TK. Digital and analog gene circuits for biotechnology. Biotechnology Journal. 2014;9:597–608. doi: 10.1002/biot.201300258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [62].Li X, Daniel R. Synthetic nonlinear computation for genetic circuit design. Current Opinion in Biotechnology. 2022;76:102727. doi: 10.1016/j.copbio.2022.102727. [DOI] [PubMed] [Google Scholar]
- [63].Alejaldre L, Anhel A-M, Goñi-Moreno A. pBLAM1-x: standardized transposon tools for high-throughput screening. Synthetic Biology. 2023;8:1–6. doi: 10.1093/synbio/ysad012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [64].Tas H, Goñi-Moreno A, de Lorenzo V. A standardized inverter package borne by broad host range plasmids for genetic circuit design in gram-negative bacteria. ACS Synthetic Biology. 2021;10:213–217. doi: 10.1021/acssynbio.0c00529. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The code to reproduce simulations, along with parameter values, can be publicly accessed at https://github.com/PabloJap/ Plasmid maps as GenBank files in Supplemenary Material.






