Modeling with uncertainty quantification reveals the essentials of a non-canonical algal carbon-concentrating mechanism

Anne K Steensma; Joshua A M Kaste; Junoh Heo; Douglas J Orr; Chih-Li Sung; Yair Shachar-Hill; Berkley J Walker

doi:10.1093/plphys/kiae629

. 2024 Dec 4;197(2):kiae629. doi: 10.1093/plphys/kiae629

Modeling with uncertainty quantification reveals the essentials of a non-canonical algal carbon-concentrating mechanism

Anne K Steensma ^1,^2,², Joshua A M Kaste ^3,^2,^✉, Junoh Heo ⁴, Douglas J Orr ⁵, Chih-Li Sung ⁶, Yair Shachar-Hill ⁷, Berkley J Walker ^8,^9,^4,^✉,⁵

PMCID: PMC11836721 PMID: 39656810

Abstract

The thermoacidophilic red alga Cyanidioschyzon merolae survives its challenging environment likely in part by operating a carbon-concentrating mechanism (CCM). Here, we demonstrated that C. merolae's cellular affinity for CO₂ is stronger than the affinity of its rubisco for CO₂. This finding provided additional evidence that C. merolae operates a CCM while lacking the structures and functions characteristic of CCMs in other organisms. To test how such a CCM could function, we created a mathematical compartmental model of a simple CCM, distinct from those we have seen previously described in detail. The results of our modeling supported the feasibility of this proposed minimal and non-canonical CCM in C. merolae. To facilitate the robust modeling of this process, we measured and incorporated physiological and enzymatic parameters into the model. Additionally, we trained a surrogate machine-learning model to emulate the mechanistic model and characterized the effects of model parameters on key outputs. This parameter exploration enabled us to identify model features that influenced whether the model met the experimentally derived criteria for functional carbon concentration and efficient energy usage. Such parameters included cytosolic pH, bicarbonate pumping cost and kinetics, cell radius, carboxylation velocity, number of thylakoid membranes, and CO₂ membrane permeability. Our exploration thus suggested that a non-canonical CCM could exist in C. merolae and illuminated the essential features generally necessary for CCMs to function.

The thermoacidophilic red alga Cyanidioschyzon merolae possesses a unique carbon-concentrating mechanism (CCM), and a model illuminates the essential features generally required for CCM function.

Introduction

Cyanidioschyzon merolae is a red microalga found in moist environments surrounding geothermal sulfur springs. This species is extremophilic, with optimal laboratory growth conditions including low pH (∼2) and high temperatures (∼42°C) (Miyagishima and Wei 2017; Miyagishima et al. 2017). C. merolae and other thermoacidophilic red algae draw interest for their unique biology and simple characteristics, which position them as useful model organisms and as candidates for biotechnology applications (Rahman et al. 2017; Miyagishima and Tanaka 2021; Seger et al. 2023; Villegas-Valencia et al. 2023). For example, C. merolae is of interest because it is one of few organisms which relies on photosynthesis in geothermal spring environments, where hot and acidic conditions restrict the availability of inorganic carbon and challenge biological carbon fixation (Gross 2000; Miyagishima et al. 2017). Notably, organisms of acid waters can only access approximately 10 μM inorganic carbon, as the inorganic carbon pool at acid pH is primarily the volatile species CO_2. In comparison, organisms of near-neutral and alkaline waters may have access to several millimolar of inorganic carbon, due to accumulation of the involatile bicarbonate (Oesterhelt et al. 2007).

C. merolae is thought to survive in its challenging environment in part by operating a carbon-concentrating mechanism (CCM) (Zenvirth et al. 1985; Rademacher et al. 2017; Steensma et al. 2023). CCMs boost carbon-fixation efficiency by concentrating CO₂ around rubisco, providing ample substrate for carbon-fixation and inhibiting a competing oxygen-fixation reaction of rubisco. Evidence supporting a CCM in C. merolae includes measured accumulation of radiolabeled carbon in the cell, d¹³C consistent with a CCM, transcriptional response of potential CCM genes to CO₂ fluctuations, and substantial CO₂ assimilation at low environmental CO₂ concentrations (Zenvirth et al. 1985; Rademacher et al. 2017; Steensma et al. 2023). However, many of these indications of the CCM are not definitive: in particular, it is not known how much of C. merolae's ability to assimilate CO₂ efficiently could be explained by the affinity of C. merolae rubisco for CO₂. Thus, we here provide further evidence for the CCM in C. merolae by demonstrating that the affinity of C. merolae cells for CO₂ is better than could be explained by the affinity of C. merolae rubisco for CO₂.

C. merolae's CCM may be described as a “non-canonical” CCM, since the C. merolae CCM must operate differently from the few CCM types which are well-characterized.

For example, unlike algae and cyanobacteria with well-characterized CCMs, C. merolae is not able to take up external bicarbonate, and C. merolae lacks anatomy associated with the pyrenoid CCM organelle (Zenvirth et al. 1985; Badger et al. 1998; Misumi et al. 2005; Steensma et al. 2023). The absence of these CCM features in C. merolae challenges our understanding of what components are required for a functional CCM, and presents the opportunity to define essential CCM components. While previous work has discussed CO₂ as a source of carbon for the CCM (Fridlyand et al. 1996; Price 2011), there has been little quantitative exploration of whether a CCM could function while lacking both facilitated carbon uptake and specialized compartments such as the pyrenoid or carboxysome. We thus used mathematical modeling, informed by our experimental measurements, to explore how the C. merolae CCM may function.

Research on CCMs has long employed mathematical models to understand the components of functional CCMs in model cyanobacteria and algae, with a particular area of interest in CCM modeling being the possibility of boosting crop productivity by engineering CCMs into crops which lack CCMs (Price et al. 2013; McGrath and Long 2014; Fei et al. 2022; Kaste et al. 2024). By developing modeling approaches to robustly describe CCMs in organisms where biochemical data is limited, such as extremophile algae, we can better understand how organisms survive environmental challenges. Here we add to these engineering efforts by modeling a heat-tolerant CCM with minimal components which offers unique possibilities for plant synthetic biology (Misumi et al. 2017). To draw robust conclusions about cellular characteristics which can support a CCM, we used state-of-the-art statistical methods to define the effects of model parameters on the predicted photosynthetic phenotype while limiting unwarranted a priori assumptions. We demonstrate an interdisciplinary modeling approach which efficiently sampled from large parameter spaces and identified features (e.g. compartment permeability, pH, enzyme characteristics) that determine the function and energy cost of a simple CCM. This adds a useful tool for compartmental photosynthetic modeling, and could facilitate effective use of models to inform experiments and rational engineering.

Some sets of model input parameters produced model outputs which met empirically based criteria for functional carbon concentration and efficient energy usage, and we identified input parameters which have substantial impacts on the model outputs. Overall, our model of a hypothetical biophysical CCM which requires minimal enzymes and anatomical features (Fig. 1) appears to represent a feasible CCM structure in C. merolae, which invites further research into the sources of environmental resilience in extremophile algae.

Figure 1. — Cross-section of model structure. This model describes fluxes (indicated by arrows and “V#” notation) and pools (indicated by molecular formulas) of a simplified dissolved inorganic carbon system (CO₂, ${HCO}_{3}^{-}$ ) and of oxygen (O₂). Fluxes are as defined in Supplementary Methods section “Model fluxes.” Molecule pools can be present in several well-mixed compartments: the bulk external medium surrounding the cell, an unstirred boundary layer of medium around the cell, the cytosol, or a central stromal space of the chloroplast. Circles mark enzymatically-catalyzed fluxes. Compartments are not drawn to scale. PR = photorespiratory CO₂ release, *R_L* = respiration in the light. All fluxes are reversible and are assigned an arbitrary direction, except those fluxes which represent producing or consuming material.

Results and discussion

Rubisco kinetics demonstrated that C. merolae operates a CCM

In previous work, we determine that if C. merolae has rubisco kinetics similar to other red algae, then this alga must operate a CCM to maintain its measured photosynthetic efficiency. Alternatively, its measured photosynthetic efficiency could be explained by unprecedented rubisco kinetics, meaning enzyme properties favoring carbon-fixation over oxygen-fixation to an unprecedented degree (Steensma et al. 2023). Here we confirmed that C. merolae rubisco kinetics are similar to those of other red-type (Form 1D) rubiscos (Read and Tabita 1994; Uemura et al. 1997; Whitney et al. 2001). C. merolae rubisco had a strong affinity for CO₂ (low K_C), a poor affinity for O₂ (high K_O), and a slow carboxylation rate (low kcat_C) (Fig. 2). Consistent with other studies, kcat_C and K_C were higher when measured at increased temperature, while K_O was lower. Although K_O is in the denominator of rubisco specificity (S_c/o) and S_c/o decreases with increased temperature, in vitro K_O is observed to decrease with increased assay temperature in some species (Jordan and Ogren 1984; Uemura et al. 1997; Prins et al. 2016).

Figure 2. — Experimental data incorporated into the model. **A, B)**. Response of net assimilation in *C. merolae* to A) CO₂ availability and B) light availability. Points are mean ± SE (n = 3), and parameters calculated from the data are indicated in the upper left corner of each plot as mean ± SE. Dashed lines indicate trend fits used to determine Michaelis–Menten constant of CO₂ fixation (K_C) and respiration in the light (R_L). The linear fit used to determine the CO₂ compensation point (Γ_CO2) is not pictured but is described in Methods. C) Kinetic properties of *C. merolae* rubisco. Rubisco turnover rate for CO₂ fixation (kcat_C), Michaelis–Menten constant of CO₂ fixation (K_C), and Michaelis–Menten constant of O₂ fixation (K_O) were measured at 25 and 45°C. Data are mean ± SE, n = 4.

These kinetics findings indicated C. merolae does operate a CCM, as C. merolae cells had higher affinity for CO₂ than C. merolae rubisco (8.71 ± 1.7 µM cell K_C vs. 24.9 ± 3.2 µM rubisco K_C at 45°C, P = 0.008 by two-sample t-test) (Fig. 2). This result adds to the evidence of a CCM in C. merolae (Zenvirth et al. 1985; Rademacher et al. 2017; Steensma et al. 2023).

Quantitative modeling showed that a hypothesized CCM can explain C. merolae's carbon-concentrating behavior

To explore how the C. merolae CCM may operate, we constructed a functional model of a CCM (Fig. 1). This model demonstrated that there were parameter sets consistent with the empirical literature that result in a functional CCM, despite the minimal model structure lacking structures like a pyrenoid or carboxysome (Fig. 3). Cyanobacterial CCM models have also supported reduction to a simple model with only two compartments from the cell membrane inwards (Mangan and Brenner 2014).

Figure 3. — Values of key model outputs. A) Parameter sets are organized into a 2-dimensional histogram according to their output values of Γ_CO2 and ATP per CO₂, with dashed lines indicating bounds for acceptable values of these outputs. The dataset for the figure was the 240,000 total parameter sets. However, 80 parameter sets (0.03% of the total) are not pictured in this panel, as they produced negative ATP per CO₂ values and could not be log-transformed. B) Percentages of parameter sets meeting various combinations of output criteria. Model output criteria and associated units are as defined in Methods: Definition of reasonable model output values.

Our results provided quantitative support for a CCM taking inorganic carbon from the environment solely through CO₂ diffusion into the cell without specialized compartments, which we term a “non-canonical” CCM due to its differences in structure and function from CCMs that have been characterized in detail. C. merolae has a different structure and environment than the “canonical” CCMs of Chlamydomonas reinhardtii and of model cyanobacteria, which allowed us to explore a biology and a parameter space which are different from those in previous CCM models.

Though there is speculation that extremophilic red algae may use a C₄-like CCM, it has been previously proposed that acidophile algae may accumulate carbon by a “bicarbonate-trap” or “acid-loading” mechanism similar to our modeled CCM (Gehl and Colman 1985; Fridlyand 1997; Gross 2000; Rademacher et al. 2016; Curien et al. 2021; Fei et al. 2022). Briefly, this mechanism would involve bicarbonate being concentrated for enzymatic action by bringing inorganic carbon speciation near equilibrium in near-neutral cellular compartments, since the predominant inorganic carbon species from pH ∼6 to ∼10 is the poorly-membrane-permeable bicarbonate.

Various facilitated CO₂ uptake mechanisms exist in CCM-containing organisms, such as the NAD(P)H dehydrogenase-I complexes in cyanobacteria and the periplasmic carbonic anhydrase (CA) system in algae (Fridlyand et al. 1996; Moroney et al. 2011; Price 2011). We here test a different model where inorganic carbon enters the cell solely by passive CO₂ diffusion into the cytosol, followed by the action of non-vectorial cytosolic carbonic anhydrase. In contrast to the well-studied cyanobacterial and algal systems, where growth under limiting CO₂ is supported by active bicarbonate uptake and the accumulation of cytosolic bicarbonate above equilibrium levels (Price and Badger 1989; Price et al. 2004; Duanmu et al. 2009), our model functions as a CCM without taking any bicarbonate from the environment.

Another unique feature of our model is the nature of the diffusion barrier surrounding rubisco. Cyanobacteria encapsulate rubisco in a proteinaceous shell called the carboxysome, which is thought to provide a diffusion barrier to CO₂ (Price et al. 2008). The model alga C. reinhardtii aggregates rubisco into an organelle called the pyrenoid, which in wild-type cells is surrounded by a starch sheath that may serve as a diffusion barrier. In contrast to the well-studied system of C. reinhardtii, there has been comparatively less investigation into algae which lack starch sheaths or lack pyrenoids entirely (Morita et al. 1999; Barrett et al. 2021). Thus, to broaden our knowledge of CCM anatomy, we modeled an arrangement where rubisco is diffuse within a series of concentric thylakoid membranes. This allowed us to further investigate whether membranes, which are thought to be highly permeable to CO₂ (Gutknecht et al. 1977; Missner et al. 2008), could impact carbon concentration, and how carbon concentration could function without a carboxysome or pyrenoid.

To investigate these and other features of interest, we used two strategies to deeply explore the model parameter space and ensure that our conclusions were robust. First, the model included our experimental data on gas-exchange and rubisco parameters central to photosynthetic efficiency (Fig. 2). Second, we developed a method for thoroughly assessing the model's sensitivity to the value of model parameters of interest. Specifically, we were interested in 19 of the 43 model parameters which were biologically interesting in relation to the function of a hypothetical C. merolae CCM and which were not well-characterized physical constants (Supplementary Table S1). We thus sampled input parameter sets with varying numbers for these parameters of interest. We sampled parameter sets through a Latin hypercube design (McKay et al. 1979) which enhanced analysis accuracy by mitigating sampling bias, as it produced parameter sets distributed throughout the 19-dimensional parameter space of interest. Then, each input parameter set was used to parameterize the model and to generate a set of outputs for analysis.

Some of the input parameter sets produced outputs consistent with a functional CCM with reasonable energy cost. Of particular interest were the parameter sets which met all the empirically based criteria for a realistic and functional CCM (criteria selection described in Methods Supplementary S1 Supporting Information). Of 240,000 parameter sets, 13,998 (6%) parameter sets fulfilled the two competing objectives of functional carbon concentration (corresponding to outputs of low Γ_CO2, high stromal CO₂, and low v_o/v_c) and efficient energy usage (corresponding to output of low ATP per CO₂) (Fig. 2).

The generated parameter sets allowed us to explore the tradeoffs associated with various features related to the CCM. For example, adding additional concentric thylakoids slightly improved carbon concentration by presenting barriers to CO₂ leakage out of the chloroplast, but incurred additional energy costs of carbon transport (Fig. 4, Supplementary Figs. S1 and S2). This is consistent with other modeling studies indicating that thylakoid membranes could affect inorganic carbon diffusion and with observations of pyrenoids surrounded by layers of thylakoids in hornworts (Thoms et al. 2001; Fei et al. 2022; Robison et al. 2024).

Figure 4. — Effect of select input parameters on key model outputs. **A, B)** Effect of model input parameter membranes (x-axis) on key model outputs. Distribution of parameter set outputs for each value of membranes is represented by a box plot overlaid on a violin plot. Shaded areas represent unacceptable values of outputs. A) Effect of membranes on model output Γ_CO2. B) Effect of membranes on model output ATP per CO₂ 80 parameter sets (0.03% of total) are not pictured in this panel, as they produced negative ATP per CO₂ values and could not be log-transformed. **C, D)** Effect on key model outputs where carbonic anhydrases or bicarbonate transport activity at the chloroplast boundary are removed from the model. Distribution of parameter set outputs for each scenario is represented by a box plot overlaid on a violin plot. Shaded areas represent out-of-bounds values of outputs. The same sampling of input parameter sets was run through models representing each scenario. C) Γ_CO2 in model scenarios where various model features removed, with an indication of how many parameter sets met output criteria in each scenario. D) ATP per CO₂ in model scenarios where carbonic anhydrases or bicarbonate transport activity at the chloroplast boundary are removed from the model. 6,991 parameter sets producing negative ATP per CO₂ values (0.6% of total) are not pictured in this panel. All panels: Model output criteria and associated units are as defined in Methods: Definition of reasonable model output values. 240,000 total parameter sets were graphed for each panel of the figure, except where otherwise noted. The plots pictured were created with geom_violin and geom_boxplot in the R ggplot2 library. geom_boxplot visualizes the median, two hinges marking the first and third quartiles of the data, and two whiskers extending to the largest and smallest values no farther than 1.5 interquartile ranges from the hinge.

Machine-learning-based surrogate models identified the parameters that most influence CCM efficiency

Like most mathematical models of photosynthetic systems, this model faced the challenge of drawing robust conclusions while using parameters which, although bounded by their relationship to physical processes, have substantial uncertainty (Supplementary Table S1). To model a system with limited biochemical data while not constraining input parameters to a greater degree than was supported by the literature, it was important to assess uncertainties which seemed likely to have substantial and interdependent effects on the model. For example, the input parameter describing the permeability of a lipid bilayer to CO₂ (Plip_CO2) has reported values ranging over several orders of magnitude (Supplementary Table S1). Furthermore, the effect of Plip_CO2 in the model depended on the value of other parameters, such as the number of lipid bilayers which pose a barrier to carbon moving between the stroma and cytosol (Membranes). Various sensitivity analyses are available for ordinary differential equation (ODE) models, but Plip_CO2 and similar parameters were unlikely to be satisfactorily explored by classical local sensitivity analyses, which involve tracking model outputs when individual parameters are varied by a set fraction of the parameter's original value. Therefore, to reveal which model conditions were necessary for the modeled CCM to function biologically, and to identify interesting directions for future investigation, we used statistical methods to identify impactful parameters and to identify which input spaces corresponded to target output ranges. These statistical methods involved training a surrogate machine-learning model on our CCM model inputs and outputs. Interpretations of this surrogate model identified which zones in the input parameter space contained the most combinations fulfilling output criteria (Fig. 5lower left), quantified how much each input parameter affected the prediction of outputs by the surrogate model (Fig. 5upper right), and visualized the response of model outputs to inputs (Supplementary Figs. S4 to S7).

Figure 5. — Statistical investigation of parameters affecting model output. Upper right bar plots: Mean absolute SHapley Additive exPlanations (SHAP) plots showing which inputs most affect each output criterion based on the 240,000 samples of the surrogate model. Lower left density plots: Density plots of parameter sets meeting all output criteria, based on the 240,000 samples of the surrogate model and organized by selected pairwise input parameters (input parameters pictured are those input parameters with high SHAP values for all output criteria). Scale bars by each density plot indicate relative densities, with darker areas indicating areas where more parameter sets meeting criteria occur. (Scales of color vary for each plot.) Parameters and associated units are as defined in Supplementary Table S1.

Some input parameters had little impact on model outputs with the tested input ranges. For these parameters, values from across the input range were evenly represented in the parameter sets meeting all output criteria. The parameters with relatively little impact on outputs included values related to carbonic anhydrase concentration and kinetics ([CA], CAkcat, Km_CO2, and Km_HCO3− for carbonic anhydrases), chloroplast pH, and values related to bicarbonate membrane permeability (Plip_HCO3−, Q10_PlipHCO3−, Fig. 5, Supplementary Figs. S4 to S8). While it is possible that these aspects of the CCM may become impactful if varied beyond the tested range (e.g. if engineering efforts produce carbonic anhydrase concentrations falling outside the range of literature values we used), these parameters did not emerge as particularly impactful in our exploration. Due to how fast the interconversion of inorganic carbon species by carbonic anhydrase is, the enzyme is likely capable of keeping inorganic carbon species close to their equilibrium concentrations across the range of values we explored for its kinetics. Given this, it is unsurprising that model outputs varied little with respect to carbonic-anhydrase-related parameters, even though the complete absence of these enzymes was deleterious (Fig. 4).

Other parameters were more constraining in the model, indicating their importance in producing a functional CCM. For example, six parameters appeared to impact all four of the target model outputs in the mean absolute SHapley Additive exPlanations (SHAP) plots: V_c, Vmax_pump, Km_pump, pH in the cytosol, PlipCO₂, and Membranes (Fig. 5). Sobolʹ analysis (Sobol′ 2001) of the surrogate model produced similar results (Supplementary Fig. S9). As might be expected in a model relying on a cytosolic bicarbonate trap followed by bicarbonate pumping, parameter sets that successfully and efficiently concentrated carbon tended to have cytosolic pH at or above the pH where bicarbonate predominates (cytosol pH above 6) and tended to have a lower ATP cost of pumping bicarbonate (low Pump_cost), as well as faster and higher-affinity bicarbonate pumps (high Vmax_pump, low Km_pump) (Fig. 5).

Other features enriched in parameter sets meeting output criteria were a cell radius in the middle of the input range (moderate Radius_cell) and a lower CO₂ membrane permeability (low Plip_CO2, Fig. 5, Supplementary Figs. S4 to S9). This suggested an important relationship between the volumes where metabolism occurs and the surface areas which present diffusion barriers between compartments. As the radius of the cell increases, CO₂ loss from R_L may overcome the ability of the cell to acquire carbon through passive diffusion into the cell. Conversely, as the radius of the cell decreases, less absolute bicarbonate pumping would be necessary to achieve high rubisco saturation, especially when rubisco is slow (low V_c). In low-radius scenarios, “over-pumping” bicarbonate could reduce energy efficiency.

In silico knockouts identified experimental targets for further characterization of the C. merolae CCM

The modeling also suggested interesting directions for investigating enzymatic components of the CCM. Alternative models with CCM enzymes removed (carbonic anhydrases or bicarbonate pumping not functional) were less likely to meet the criterion of a Γ_CO2 indicative of functional carbon concentration, but tended to have lower ATP per CO₂ cost than the model with all enzymes present (Fig. 4, Supplementary Figs. S1 and S2).

The modeled CCM functioned without fine details of cellular structure that support photosynthesis in other organisms, such as rubisco aggregation into an area smaller than the stroma, carbonic anhydrases with restricted distributions and directions (i.e. lumenal and vectorial carbonic anhydrases), recapture of mitochondrially respired CO₂, and perforations or interconnections in concentric thylakoids (Nevo et al. 2007; Rademacher et al. 2017; Barrett et al. 2021). Our work thus expands on previous models with detailed chloroplast geometry (Fei et al. 2022) by demonstrating that efficient carbon capture may occur in a simple case when rubisco and carbonic anhydrase are diffuse within a series of concentric thylakoid spheres. It may still be of interest to explore what chloroplast structures support photosynthesis in C. merolae and to investigate the biochemical and molecular basis for this non-canonical CCM.

Further applications of surrogate modeling and uncertainty quantification

More broadly, the statistical approach adopted in this paper represents an advance in metabolic and biochemical modeling. By training a surrogate model on the parameter space of mechanistic biological models, we can understand and account for high-dimensional uncertainty in model parameters. Metabolic modeling in general, especially complex metabolic modeling, has been highlighted as a particularly promising application of surrogate modeling, as metabolic modeling has biotechnological potential but is challenged by the complexity of metabolism and by the “trial and error” process which is often required to produce a working metabolic model (Gherman et al. 2023). Surrogate modeling has found uses in dynamic flux balance analysis and process modeling for bioprocesses (Mountraki et al. 2020; de Oliveira et al. 2021). Our work expands on these investigations by demonstrating what is to our knowledge the first application of surrogate modeling to ODE-based compartmental modeling of biological systems. Our methods may be particularly valuable for models that have poorly defined parameters or are extremely computationally expensive. For example, the implementation of surrogate modeling described here could alleviate current limitations in interpreting reaction-diffusion models and genome-scale metabolic models (Gherman et al. 2023). Even for our relatively-simple model, the run time for 240,000 simulations was several hours and required the use of a computing cluster. In contrast, surrogate modeling could be run locally on a laptop computer and was able to generate 240,000 predictions for all four outputs of interest in less than 10 s, easily creating a large dataset for analysis and allowing for precise sensitivity estimation. We compared this with Sobolʹ sensitivity analysis (Sobol′ 2001) performed with the original model with a sample size of n = 163,840, comparable to the number of parameter sets and outputs used to train the surrogate model. Despite the generation of these samples taking several hours of computation time, this approach yielded extremely imprecise and uninterpretable results, suggesting that substantially more computational investment would be necessary to achieve acceptably precise sensitivity estimates (Supplementary Fig. S10). With normalized root-mean-square error (NRMSE) below 1.5% in our validation (Supplementary Table S2), the computational gains associated with the surrogate modeling approach outweighed the near-negligible potential error introduced by an inexact surrogate.

Important considerations in any surrogate modeling application include the sample size required to train the model and limitations of surrogate models for out-of-sample predictions. Surrogates should be used cautiously for out-of-sample predictions, particularly in high-dimensional settings where training data are limited (Forrester et al. 2008). Regarding the sample size, early studies (Chapman et al. 1994; Jones et al. 1998; Loeppky et al. 2009) suggested using around 10d samples, where d is the input dimension, for building an accurate Gaussian Process (GP) surrogate model. GP surrogates are particularly effective for small datasets and provide uncertainty quantification, which is valuable for assessing the confidence of out-of-sample predictions (Gramacy 2020). If the desired accuracy is not achieved, one can improve the model by increasing the sample size through adaptive strategies such as active learning (MacKay 1992), which allows for more efficient use of additional data to further enhance accuracy. Recent studies have also provided guidance on determining the run size required for a GP surrogate to achieve a pre-specified level of out-of-sample prediction accuracy (Harari et al. 2018). In scenarios where high extrapolation performance is critical, one may consider using physics-informed surrogates, which tend to be more reliable in out-of-sample contexts. These surrogate models incorporate physical laws into their training process and offer improved performance for out-of-sample predictions, especially when physical dynamics are a key feature of a model. Examples of physics-informed surrogates include a manifold-constrained GP surrogate that adheres to an underlying ODE system (Yang et al. 2021) or Physics-Informed Neural Networks (PINNs) (Raissi et al. 2019).

Effective parameter exploration and analysis may generally be useful in confronting global challenges. Here, we used statistical sampling, surrogate modeling, and uncertainty quantification methods to investigate how a particular aquatic organism achieves the high photosynthetic efficiency that enables them collectively to be responsible for approximately half of global photosynthetic CO₂ consumption (Field et al. 1998). Similar modeling techniques may be applied effectively to any system: for example, as part of engineering efforts for bioproduction, crop resilience, and other goals, it may be useful to in silico determine which features of a system are essential or inflexible throughout ranges of interest before devoting resources to in vivo experimentation.

Conclusions

The extremophilic red microalga C. merolae operates a CCM, as evidenced by this alga having gas-exchange behavior which was not explained by its rubisco properties. Mathematical modeling suggested that this CCM could consist of a minimal mechanism. Robust parameter exploration and statistical analysis, aided by the use of a surrogate model, allowed us to quantify the sensitivity of our model to parameter uncertainties, identify important parameter interactions, and identify key determinants of CCM efficiency. Therefore, in addition to supporting the presence of a non-canonical CCM in C. merolae, our results shed light on what conditions must be met for this CCM to function and the essential elements of biophysical CCMs in general.

Materials and methods

Experimental data collection: gas-exchange measurements

Cyanidioschyzon merolae 10D was grown as cultures in Erlenmeyer flasks in 50 mL of medium containing 40 mm (NH₄)₂SO₄, 4 mm MgSO₄ · 7H₂O, 8 mm KH₂PO₄, 0.75 mm CaCl₂ · 2H₂O, 1 mL L⁻¹ Hutner's Trace Elements solution, and H₂SO₄ to pH 2.7 (recipe modified from MA2 medium recipe of (Fujiwara and Ohnuma 2017)). Cultures were maintained at 40°C under 100 µmol m⁻² s⁻¹ white light, with aeration by shaking at 100 rpm. For gas-exchange measurements, three replicate cultures of OD₇₅₀ 1.0 to 1.2 were resuspended in growth medium to OD₇₅₀ 0.6 (1.60 × 10⁷ to 3.68 × 10⁷cells/mL). Gas-exchange parameters were measured in a LI-6800-18 Aquatic Chamber (LI-COR Biosciences) at 45°C and with normalization to cell count data from a hemocytometer slide, following the procedures of Steensma et al. (2023) and with a protocol similar to the study by Davey and Lawson (2024).

Experimental data collection: rubisco kinetics measurements

We purified rubisco from C. merolae biomass with a protocol adapted from the studies Miyagishima and Wei (2017) and Orr and Carmo-Silva (2018). Approximately 60 g of biomass was lysed by freeze–thawing followed by mechanical homogenization. Crude rubisco was polyethylene–glycol-precipitated from clarified homogenate and purified by fast protein liquid chromatography (FPLC). FPLC fractions eluting under the major UV trace peak were assayed by SDS–PAGE and by spectrophotometric rubisco activity assay (procedures adapted from Kubien et al. (2010) and Carter et al. (2013) (Supplementary Fig. S3). Fractions containing active semi-pure rubisco were pooled, concentrated with a 100-kDa centrifugal concentration filter, and snap-frozen for use in rubisco assays.

Purified rubisco was used in four technical replicate experiments to determine catalytic properties as described previously in detail (Prins et al. 2016), with some alterations to protein desalting and activation: concentrated protein aliquots were first diluted with activation mix containing 100 mm Bicine-NaOH pH 8.0, 20 mm MgCl₂, 10 mm NaHCO₃, and 1% (v/v) Plant Protease Inhibitor cocktail (Sigma-Aldrich, UK). Rubisco was then activated at 45°C for 15 min before being used in ¹⁴CO₂ consumption assays at either 25°C or 45°C with CO₂ concentrations of 8, 16, 24, 36, 68, and 100 µM. To determine K_O, these CO₂ concentrations were combined with concentrations of 0, 21, 40, or 70% (v/v) O₂. kcat_C was determined using measurements with 0% O₂. An aliquot of the activated protein was used for determination of Rubisco active sites via ¹⁴C-CABP binding using the method of (Sharwood et al. 2016). For ¹⁴C-CABP binding, protein aliquots were incubated at 45°C for 15 mins with ¹⁴C-CABP to maximize binding, prior to application to Sephadex columns as previously described (Loganathan et al. 2016). Aliquots were also analyzed via SDS–PAGE alongside known concentrations of plant-type Rubisco to strengthen estimates of Rubisco content.

Model details

The hypothetical CCM described in this study (Fig. 1) was modeled as a set of well-mixed compartments and represented as a system of ordinary differential equations (ODEs). In this minimal biophysical CCM, carbon diffuses into the cell as CO₂, is trapped in the cytosol as bicarbonate by the action of carbonic anhydrase, and is pumped into the chloroplast, where a second carbonic anhydrase provides CO₂ around rubisco. No pyrenoid diffusion barrier is present, as neither a starch sheath nor a clear organized subcompartment for rubisco has been described in C. merolae. However, we accounted for potential effects of the concentric thylakoids which are present in C. merolae and many other aquatic photosynthetic organisms (Ichinose and Iwane 2017). CAs and bicarbonate transporters are essential components of known biophysical CCMs and thus essential components of a CCM model (Beardall and Raven 2020). These components (V4, V11, and V8) are discussed in more detail below.

The model geometry is based on the cellular structure of C. merolae as apparent in published micrographs of this alga (Kuroiwa 1998; Miyagishima et al. 1998; Toda et al. 1998; Itoh et al. 1999; Yagisawa et al. 2012, 2016; Ichinose and Iwane 2017; Reimer et al. 2017; Sato et al. 2017; Moriyama et al. 2018). The modeled cell and its boundary layer form a series of concentric spherical well-mixed compartments. The cell is enclosed by a lipid bilayer of radius $R a d i u s_{c e l l}$ . The cell contains a cytosol of radius $R a d i u s_{c e l l}$ and a chloroplast stroma space of radius $0.25 *$ . The cell is surrounded by a medium boundary layer of radius $2 *$ , beyond which lies an infinite external medium. Though varying fluid dynamic conditions strongly impact the size of boundary layers such as gas surface films or phycospheres, these layers are reported to be on the order of magnitude of 1 cell radius (Guterman and Ben-Yaakov 1987; Seymour et al. 2017).

Molecules cross the boundary of the stroma space according to diffusion or transport equations. For flux calculations, the boundary consists of 1–7 lipid bilayers of negligible thickness that are evenly spaced from $0.5 *$ to $0.25 *$ . This boundary structure represents the fact that the C. merolae chloroplast is surrounded by a chloroplast envelope and by approximately 4 to 6 thylakoids which appear as concentric circles or spirals in microscopy examinations (Ichinose and Iwane 2017). A range of possible transport scenarios (how many membranes molecules must cross when crossing between the cytosol and stroma, and how much energy this crossing costs) are captured by varying parameters Membranes and Pump_cost.

Diffusion through lipid membranes (V1, V6, V5, V7, V15) was described using estimates of conductivity of lipid membranes to the chemical species in question:

\begin{matrix} J_{m e m b r a n e d i f f u s i o n} = C o n d u c t i v i t y_{X} * ({[X]}_{A} - {[X]}_{B}) \end{matrix}

(1)

Where Conductivity_x is the conductivity—in units of µm³/s—of chemical species X through a lipid bilayer, and [X]_A and [X]_B are the concentrations of that species on the two sides of that lipid bilayer. Diffusion between the medium boundary layer and bulk medium (V18, V19) was described as an analogous simple diffusion flux, with conductivity determined according to diffusion coefficients through water at the boundary layer thickness. Lipid permeability coefficients for CO₂ and ${HCO}_{3}^{-}$ and the water diffusion coefficient for O₂ were sourced from the literature (Supplementary Table S1), and other necessary gas permeability and diffusion coefficients were determined from the literature values by Graham's law of diffusion:

\begin{matrix} \frac{r_{1}}{r_{2}} = \sqrt{\frac{M_{1}}{M_{2}}} \end{matrix}

(2)

where the rates of diffusion r₁ and r₂ for two different ideal gases, here CO₂ and O₂, are related according to their two molar masses M₁ and M₂.

To describe diffusion of CO₂ (V5), ${HCO}_{3}^{-}$ (V7), and O₂ (V15) through variable numbers of stacked thylakoid membranes, an overall conductivity through all of the layers was calculated as:

\begin{matrix} O v e r a l l C o n d u c t i v i t y = {(\sum_{i = 1}^{n} {(4 π r_{n}^{2} * C o n d u c t i v i t y_{X})}^{- 1})}^{- 1} \end{matrix}

(3)

where r_n is the radius of the sphere formed by the nth thylakoid membrane. This overall conductivity value is then used in (E1) to describe the movement of a chemical species from the outer stroma into the inner stroma space, as shown in Fig. 1. We assume that small gas molecules diffuse easily around membrane proteins, so that the diffusion of CO₂ and O₂ through any modeled membrane is potentially impeded by increased path length, but is not impeded by CO₂ and O₂ passing through high-resistance protein material.

Spontaneous interconversion of CO₂ and ${HCO}_{3}^{-}$ , as in V2, V3, V9, and V10 (E4-5), was described using simple first-order kinetics, according to the rate constant of the dehydration (slower) step of the interconversion:

\begin{matrix} J_{C O_{2} h y d r a t i o n} = k_{2} [C O_{2}] \end{matrix}

(4)

\begin{matrix} J_{H C O_{3}^{-} d e h y d r a t i o n} = k_{- 2} [H C O_{3}^{-}] [H^{+}] \end{matrix}

(5)

Note that CO₂ must first be hydrated to H₂CO₃, which is then deprotonated to yield the ${HCO}_{3}^{-}$ ion. However, because the interconversion of ${HCO}_{3}^{-}$ and H₂CO₃ is essentially instantaneous relative to the hydration–dehydration reaction, here we ignore the H₂CO₃ species and approximate the spontaneous interconversion as the hydration–dehydration reaction. It was observed in Mangan et al. (2016) that the substantially higher permeability of H₂CO₃ relative to ${HCO}_{3}^{-}$ , coupled with the rapid interconversion of these species, results in a greater permeability through lipid membranes of this joint H₂CO₃/ ${HCO}_{3}^{-}$ pool than would be expected from ${HCO}_{3}^{-}$ permeability alone. To account for this while accommodating the simplification of not including the H₂CO₃ species, we explored a range of possible lipid permeabilities to ${HCO}_{3}^{-}$ and CO₂ that substantially overlaps with the range of inorganic carbon permeability values from Mangan et al. (2016).

The interconversion of CO₂ and ${HCO}_{3}^{-}$ by carbonic anhydrase (V4, V11) was described as in the study by McGrath and Long (2014):

\begin{matrix} J_{C A} = \frac{[C A] * C A_{k c a t} * ([C O_{2}] - \frac{[H C O_{3}^{-}] [H^{+}]}{K_{a}})}{K_{m}^{C O_{2}} + [H C O_{3}^{-}] (\frac{K_{m}^{C O_{2}}}{K_{m}^{H C O_{3}^{-}}}) + [C O_{2}]} \end{matrix}

(6)

where the K_a value is the overall K_a for the CO₂/ ${HCO}_{3}^{-}$ system. This value is temperature-sensitive and was calculated using the R package seacarb package (Lavigne et al. 2019). Other potentially temperature-sensitive parameters receive temperature adjustments according to Q₁₀ or Q₁₅ factors as in von Caemmerer (2000). In C. merolae, CA inhibitors have not been shown to affect oxygen evolution, but it remains plausible that CAs are involved in photosynthesis, since genes homologous to CCM CAs show transcript increases in response to lowered CO₂ availability (Rademacher et al. 2017; Parys et al. 2021). Of the two putative CAs with the most dramatic transcriptional response to CO₂, one protein has a computationally-predicted chloroplast targeting sequence and has been fluorescence-localized between the mitochondrion and chloroplast, while the other protein has no predicted targeting sequence and has been fluorescence-localized in the cytosol (Rademacher et al. 2017; Steensma et al. 2023).

Carboxylation by rubisco (V12) was described with the assumption that CO₂ is limiting, as in Farquhar et al. (1980):

\begin{matrix} v_{c} = \frac{V m a x_{c a r b o x y l a t i o n} [C O_{2}]}{([C O_{2}] + K_{m}^{C O_{2}} (1 + \frac{[O_{2}]}{K_{m}^{O_{2}}}))} \end{matrix}

(7)

To estimate oxygenation (V13), we estimate v_c/v_o (carboxylation flux over oxygenation flux) from the CO₂/O₂ specificity (S_c/o) of rubisco and chloroplast CO₂ and O₂ concentrations (E8), and then use this to arrive at v_o.

\begin{matrix} \frac{v_{c}}{v_{o}} = S_{c o} (\frac{[C O_{2}]}{[O_{2}]}) \end{matrix}

(8)

The pumping of ${HCO}_{3}^{-}$ across the stack of thylakoid membranes by a bicarbonate pump (V8) was described by simple Michaelis–Menten kinetics:

\begin{matrix} J_{H C O_{3}^{-} p u m p} = (\frac{V_{m a x} [H C O_{3}^{-}]}{K_{m} + [H C O_{3}^{-}]}) (S u r f a c e A r e a) \end{matrix}

(9)

Concerning what is known about bicarbonate transport in C. merolae, it is difficult to identify bicarbonate transporters by homology (Price and Howitt 2011; Steensma et al. 2023). C. merolae would have minimal access to extracellular bicarbonate even if bicarbonate were substantially available in its acidic environment, as is evident from radiolabeling and from gas-exchange conducted at varying pH (Zenvirth et al. 1985; Steensma et al. 2023). Bicarbonate transport at the chloroplast or thylakoids is a key feature of biophysical CCMs (Price et al. 2008; Spalding 2008).

Photorespiratory CO₂ release (V14) and photosynthetic oxygen evolution (V16) were determined by the stoichiometry described in Supplementary S1 Supporting Information. Non-photorespiratory CO₂ release occurring during photosynthesis, known as respiration in the light (R_L) (Xu et al. 2021), was estimated from gas-exchange data according to a modified Kok method (V17). Assimilation was measured under sub-saturating light intensities and extrapolated to estimate CO₂ release in the absence of light (Fig. 2B). The resulting mean measured value of R_L was normalized to cell size for use in the model: we assume that the empirical measurement of R_L we obtained was, on a per-cell basis, characteristic of a C. merolae cell of a radius of 1 µm. Under the assumption that R_L should vary proportionally with cell volume, we normalized R_L as follows:

\begin{matrix} R_{L n o r m a l i z e d} = R_{L m e a s u r e d} \frac{V o l u m e}{V o l u m e_{1 u m}} \end{matrix}

(10)

ATP costs for the cell were estimated as:

\begin{matrix} A T P_{t o t a l} = 3 v_{c} + 3.5 v_{o} + (J_{H C O_{3}^{-} p u m p} * M e m b r a n e s * P u m p_{c o s t}) \end{matrix}

(11)

where Membranes is the number of thylakoid stacks and Pump_cost is the assumed cost, in ATP, of pumping a single ${HCO}_{3}^{-}$ ion across a lipid bilayer by the hypothesized pump.

A full list of all flux equations and the system of ODEs used to describe the system can be found in Supplementary S1 Supporting Information.

Definition of reasonable model output values

To ensure the model reproduced experimental results we measured, or obtained from the literature, experimental data to set acceptable bounds for the following model outputs: CO₂ compensation point (Γ_CO2), the ratio of ATP consumption flux to net CO₂ assimilation flux (ATP per CO₂), the steady-state CO₂ concentration in the chloroplast stroma (stromal CO₂), and the ratio of oxygen-fixation flux to carbon-fixation flux (v_o/v_c).

CO₂ compensation point (Γ_CO2)

We accepted Γ_CO2 values less than or equal to 2.70 µM, corresponding to no more than twice the mean measured value (Fig. 2).

Ratio of ATP consumption flux to net CO₂ assimilation flux (ATP per CO₂)

We accepted ATP per CO₂ values which were less than or equal to 25 and greater than 0. These bounds are supported by measured light response curves which indicated how much additional light absorption drives a certain amount of additional CO₂ assimilation (Fig. 2). We used these data to estimate how much additional ATP production drives an additional CO₂ assimilation, using the photon per ATP values for various light-reaction pathways (Walker et al. 2020), the cylindrical geometry of the gas-exchange sample chamber, and the measured density of cells in the sample. The resulting estimated values were as follows: 13.8 ± 2.19 ATP produced/CO₂ assimilated (mean ± SE, assuming cyclic and linear electron flow operating equally) or 17.4 ± 2.76 ATP produced/CO₂ assimilated (mean ± SE, assuming linear electron flow only operating). This suggests that ATP per CO₂ values of up to ∼25 are supported by photosynthetic electron flow. The lower bound of the acceptable range excludes a few parameter sets outputting negative ATP per CO₂, since these parameter sets represented particularly non-functional CCM scenarios with negative net assimilation values under ambient CO₂ conditions.

Steady-state CO₂ concentration in the chloroplast stroma (stromal CO₂)

We accepted chloroplast CO₂ concentration values of greater than or equal to the CO₂ concentration in the medium under 400 ppm CO₂ atmosphere, by the logic that a functional CCM should result in rubisco accessing a greater CO₂ concentration than is available from ambient medium.

Ratio of oxygen-fixation flux to carbon-fixation flux (v_o/v_c)

We accepted v_o/v_c values less than or equal to 0.3, based on data and models indicating that plants without CCMs are unlikely to achieve v_o/v_c less than approximately 0.3 (Bellasio et al. 2014).

Model optimization and estimation of simulated compensation point

Steady-state fluxes and metabolite concentrations were solved using odeint() from Python's SciPy library (Virtanen et al. 2020) with error control handled by maintaining the following inequality:

max (\frac{e r r o r s (y)}{e r r o r_{w e i g h t s} (y)}) \leq 1

where errors is a vector of local errors against computed outputs y and error_weights is a vector of weights:

e r r o r_{w e i g h t s} = t o l e r a n c e_{r e l a t i v e} * | y | + t o l e r a n c e_{a b s o l u t e}

where tolerance_relative and tolerance_absolute are the relative and absolute tolerance values set in the odeint() solver. We use the default value for these tolerances from SciPy version 1.10.0. All simulations were verified to reach steady state (metabolite concentration solutions changing 0.01% or less from the previous value). An end time of sufficient length was chosen to ensure that simulations successfully reached steady state. The maximum number of step sizes allowed for each time point was manually set to 5,000 as this was found to allow our simulations to reach steady-state without optimization difficulties. Other optimization parameters, such as the maximum and minimum step sizes, were left at their default settings as well and controlled by the optimizer. Using these settings, 100% (240,000/240,000) of all simulations successfully reached a steady-state solution in all model architectures.

In order to characterize the response of key outputs and robustness of conclusions to a wide range of possible parameterizations of the model, we used Latin Hypercube Sampling (McKay et al. 1979) to explore 240,000 parameter combinations according to the bounds specified in (Supplementary Table S1). These simulations were run on Michigan State University's High Performance Computing Cluster. CO₂ compensation point estimates were generated for every parameter set by running the model at external CO₂ concentrations ranging from 0.0001 to 1000 µM, constructing a cubic spline from the resulting curve of net CO₂ assimilation vs. external CO₂ concentration, and identifying the root of this spline to find the compensation point.

Parameter exploration and surrogate model selection

In order to thoroughly explore the 19-dimensional parameter space in a computationally feasible way, we trained a surrogate machine-learning model on the mechanistic CCM model. By emulating the intricacies of the mechanistic model, surrogate modeling faithfully captures the dynamics of complex systems while alleviating the substantial computational costs associated with obtaining additional results from a mechanistic model. Surrogate modeling additionally gave us access to powerful statistical tools for machine-learning model analysis, including SHAP (Lundberg and Lee 2017) and partial dependence (PD) plots (Friedman 2001).

To identify the optimal surrogate model for parameter exploration, we compared four popular machine-learning models: eXtreme Gradient Boosting (XGBoost) (Chen and Guestrin 2016), Local approximate Gaussian Process (laGP) (Gramacy and Apley 2015), single-layer Neural Network (NN) (James et al. 2013), and Deep Neural Network (DNN) (Chen and Guestrin 2016). We collected a 240,000-sized dataset, where the outputs were simulated from the mechanistic CCM model at space-filling input locations. 90% of the data was used for training the surrogate, and the remaining 10% was used as the test dataset to validate the model performance. The dataset was divided into training and test sets using a random sampling approach. Specifically, we used the sample() function in R with a fixed seed. The evaluation of prediction performance was based on the RMSE:

R M S E = \sqrt{\sum_{i = 1}^{n_{t e s t}} \frac{{(y_{i} - {\hat{y}}_{i})}^{2}}{n_{t e s t}}},

where $y_{i}$ is the ith test output and ${\hat{y}}_{i}$ is the i -th predicted model output.

Model outputs had varying scales and degrees of skew, so to effectively compare prediction performance on different model outputs, an NRMSE was calculated. The NRMSE was calculated as the RMSE divided by $y_{m a x} - y_{m i n}$ , where $y_{m a x}$ is the highest test output and $y_{m i n}$ is the lowest test output.

From the model evaluation (Supplementary Table S2), it appears that XGBoost outperformed other models for v_o/v_c and ATP per CO₂, and remained comparable for Γ_CO2 and stromal CO_2. As such, XGBoost was used as the surrogate model for further analyses.

The XGBoost model was trained using a max number of boosting iterations of 1,000 with the evaluation metric of the root-mean-square error. The laGP model used the nearest neighbor method for prediction. The NN model is a simple feedforward neural network with a logistic activation function $(1 / (1 + e^{- x}))$ for regression tasks. The error function used for the calculation of the error was the sum of squared errors. The threshold parameter for the partial derivatives of the error function as stopping criteria for the NN model was set to half the range of the target variable.

The DNN model consists of 2 hidden layers containing 64 and 32 units respectively, both using rectified linear unit (ReLU) activation functions max(x, 0). The DNN model was trained using the adaptive moment estimation (Adam) optimizer and mean squared error (MSE) as the loss function. The model was trained for 40 epochs, with the learning algorithm processing the entire training dataset 40 times. A batch size of 240 was used, indicating the number of samples processed before updating the model's internal parameters. Moreover, 20% of the training data was set aside for validation purposes during the training process.

Accession numbers

Sequence data relevant to this article can be found in the KEGG and Cyanidioschyzon merolae Genome Project v3 data libraries under accession numbers:

EC 4.1.1.39, gene IDs CMV013C and CMV014C (Cyaniodioschyzon merolae rubisco).

Supplementary Material

kiae629_Supplementary_Data

kiae629_supplementary_data.zip^{(3.4MB, zip)}

Acknowledgments

We thank Dr. Mark Seger and Dr. Peter Lammers for kindly providing innocula of C. merolae as well as C. merolae biomass needed for rubisco purification. We additionally thank the following individuals for valuable supplies and technical insight: Dr. Sigal Lechno-Yossef and Damien Sheppard (Fast Protein Liquid Chromatography technical consultation), Ludmila Roze (protein gel technical consultation), Dr. Josh Vermaas and Dr. Daipayan Sarkar (insightful discussion of lipid membrane permeability).

Contributor Information

Anne K Steensma, Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA; Michigan State University—Department of Energy Plant Research Laboratory, Michigan State University, East Lansing, MI 48824, USA.

Joshua A M Kaste, Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI 48824, USA.

Junoh Heo, Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA.

Douglas J Orr, Lancaster Environment Center, Lancaster University, Lancaster, LA1 4YQ, UK.

Chih-Li Sung, Department of Statistics and Probability, Michigan State University, East Lansing, MI 48824, USA.

Yair Shachar-Hill, Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA.

Berkley J Walker, Department of Plant Biology, Michigan State University, East Lansing, MI 48824, USA; Michigan State University—Department of Energy Plant Research Laboratory, Michigan State University, East Lansing, MI 48824, USA.

Author contributions (CRediT taxonomy definitions)

Conceptualization: A.K.S., J.A.M.K., C.L.S., Y.S.H., B.J.W. Data Curation: A.K.S., J.A.M.K. Formal Analysis: A.K.S., J.A.M.K., J.H., D.J.O. Funding acquisition: A.K.S., J.A.M.K., C.L.S., Y.S.H., and B.J.W. Investigation: A.K.S., J.A.M.K., and D.J.O. Methodology: all authors. Project Administration: A.K.S., J.A.M.K., C.L.S., Y.S.H., B.J.W. Resources: all authors. Software: A.K.S., J.A.M.K., J.H. Supervision: A.K.S., J.A.M.K., C.L.S., Y.S.H., B.JW. Validation: A.K.S. and J.A.M.K. Visualization: A.K.S. and J.H. Writing—Original Draft Preparation: A.K.S. Writing—Review & Editing: all authors.

Supplementary data

The following materials are available in the online version of this article.

Supplementary methods

Supplementary Table S1. Parameter values or ranges used in the model.

Supplementary Table S2. The test root-mean-square errors (RMSEs) and normalized RMSEs (NRMSEs) of four machine-learning surrogate models.

Supplementary Figure S1. Effect of model input parameter Membranes (x-axis) on CO₂ leakage from the chloroplast.

Supplementary Figure S2. Effect on key model outputs when bicarbonate transport or carbonic anhydrases (CAs) are removed from the model.

Supplementary Figure S3. SDS–PAGE analysis of rubisco preparation

Supplementary Figure S4. Partial dependence (PD) plots of first-order effects for Γ_CO2.

Supplementary Figure S5. Partial dependence (PD) plots of first-order effects for stromal CO₂.

Supplementary Figure S6. Partial dependence (PD) plots of first-order effects for v_o/v_c.

Supplementary Figure S7. Partial dependence (PD) plots of first-order effects for ATP per CO₂.

Supplementary Figure S8. Ranges of parameters in all parameter sets versus in parameter sets meeting all output criteria.

Supplementary Figure S9. Total sensitivity of outputs to each parameter as determined by Sobolʹ analysis of the surrogate model.

Supplementary Figure S10. Total Sobolʹ sensitivity indices calculated by analyzing a sample set of size 163,840.

Funding

Work in the laboratory of BJW is supported by Grant Number DE-FG02-91ER20021 from the Division of Chemical Sciences, Geosciences and Biosciences, Office of Basic Energy Sciences of the United States Department of Energy. Work in the laboratory of YSH was supported by Grant Number DE-SC0018269 from the United States Department of Energy (https://www.energy.gov/). A.K.S. and J.A.M.K. were additionally supported by a predoctoral training award from Grant Number T32-GM110523 from the National Institute of General Medical Sciences of the National Institutes of Health. J.A.M.K. received additional support from the National Science Foundation Research Traineeship Program, grant number DGE-1828149. J.H. and C.L.S. are supported by Grant Number DMS-2113407 from the National Science Foundation. The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the funding agencies.

Data availability

Data and model code used in this study can be accessed via GitHub: https://github.com/anne-steensma/Cmerolae_CCM_model.

Preprint servers

This manuscript was deposited as a preprint at BioRciv under CC-BY license (https://doi.org/10.1101/2024.04.12.589284).

Dive Curated Terms

The following phenotypic, genotypic, and functional terms are of significance to the work described in this paper:

References

Badger MR, Andrews TJ, Whitney SM, Ludwig M, Yellowlees DC, Leggat W, Price GD. The diversity and coevolution of rubisco, plastids, pyrenoids and chloroplast-based CO2-concentrating mechanisms in the algae. Can J Bot. 1998:76(6):1052–1071. 10.1139/b98-074 [DOI] [Google Scholar]
Barrett J, Girr P, Mackinder LCM. Pyrenoids: CO2-fixing phase separated liquid organelles. Biochim Biophys Acta—Mol Cell Res. 2021:1868(5):118949. 10.1016/j.bbamcr.2021.118949 [DOI] [PubMed] [Google Scholar]
Beardall J, Raven JA. Structural and biochemical features of carbon acquisition in Algae. In: Larkum AWD, Grossman AR, Raven JA, editors. Photosynthesis in algae: biochemical and physiological mechanisms. Cham: Springer International Publishing; 2020. p. 141–160. [Google Scholar]
Bellasio C, Burgess SJ, Griffiths H, Hibberd JM. A high throughput gas exchange screen for determining rates of photorespiration or regulation of C₄ activity. J Exp Bot. 2014:65(13):3769–3779. 10.1093/jxb/eru238 [DOI] [PMC free article] [PubMed] [Google Scholar]
Carter J, Petersen BP, Printz SA, Sorey TL, Kroll TT. Quantitative application for SDS—PAGE in a biochemistry lab. J Chem Educ. 2013:90(9):1255–1256. 10.1021/ed300390j [DOI] [Google Scholar]
Chapman WL, Welch WJ, Bowman KP, Sacks J, Walsh JE. Arctic sea ice variability: model sensitivities and a multidecadal simulation. J Geophys Res Oceans. 1994:99(C1):919–935. 10.1029/93JC02564 [DOI] [Google Scholar]
Chen T, Guestrin C. XGBoost: a scalable tree boosting system. In: KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco, California, USA: Association for Computing Machinery, 2016. p. 785–794. 10.1145/2939672.2939785 [DOI] [Google Scholar]
Curien G, Lyska D, Guglielmino E, Westhoff P, Janetzko J, Tardif M, Hallopeau C, Brugière S, Dal Bo D, Decelle J, et al. Mixotrophic growth of the extremophile Galdieria sulphuraria reveals the flexibility of its carbon assimilation metabolism. New Phytol. 2021:231(1):326–338. 10.1111/nph.17359 [DOI] [PMC free article] [PubMed] [Google Scholar]
Davey P, Lawson T. Measurements of carbon assimilation in aquatic systems. Methods Mol Biol. 2024:2790:95–120. 10.1007/978-1-0716-3790-6_6 [DOI] [PubMed] [Google Scholar]
de Oliveira RD, Guedes MN, Matias J, Le Roux GAC. Nonlinear predictive control of a bioreactor by surrogate model approximation of flux balance analysis. Ind Eng Chem Res. 2021:60(40):14464–14475. 10.1021/acs.iecr.1c01242 [DOI] [Google Scholar]
Duanmu D, Miller AR, Horken KM, Weeks DP, Spalding MH. Knockdown of limiting-CO2-induced gene HLA3 decreases HCO3- transport and photosynthetic Ci affinity in Chlamydomonas reinhardtii. Proc Natl Acad Sci U S A. 2009:106(14):5990–5995. 10.1073/pnas.0812885106 [DOI] [PMC free article] [PubMed] [Google Scholar]
Farquhar GD, von Caemmerer S, Berry JA. A biochemical model of photosynthetic CO₂ assimilation in leaves of C₃ species. Planta. 1980:149(1):78–90. 10.1007/BF00386231 [DOI] [PubMed] [Google Scholar]
Fei C, Wilson AT, Mangan NM, Wingreen NS, Jonikas MC. Modelling the pyrenoid-based CO₂-concentrating mechanism provides insights into its operating principles and a roadmap for its engineering into crops. Nat Plants. 2022:8(5):583–595. 10.1038/s41477-022-01153-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
Field CB, Behrenfeld MJ, Randerson JT, Falkowski P. Primary production of the biosphere: integrating terrestrial and oceanic components. Science. 1998:281(5374):237–240. 10.1126/science.281.5374.237 [DOI] [PubMed] [Google Scholar]
Forrester AIJ, Sóbester A, Keane AJ. Engineering design via surrogate modelling: a practical guide. Chichester, United Kingdom: John Wiley; 2008. [Google Scholar]
Fridlyand L, Kaplan A, Reinhold L. Quantitative evaluation of the role of a putative CO2-scavenging entity in the cyanobacterial CO2-concentrating mechanism. Biosystems. 1996:37(3):229–238. 10.1016/0303-2647(95)01561-2 [DOI] [PubMed] [Google Scholar]
Fridlyand LE. Models of CO2 concentrating mechanisms in microalgae taking into account cell and chloroplast structure. Biosystems. 1997:44(1):41–57. 10.1016/s0303-2647(97)00042-7 [DOI] [PubMed] [Google Scholar]
Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001:29(5):1189–1232. 10.1214/aos/1013203451 [DOI] [Google Scholar]
Fujiwara T, Ohnuma M. Procedures for transformation and their applications in Cyanidioschyzon merolae. In: Kuroiwa T, Miyagishima S, Matsunaga S, Sato N, Nozaki H, Tanaka K, Misumi O, editors. Cyanidioschyzon merolae: a new model eukaryote for cell and organelle biology. Singapore: Springer Nature; 2017. p. 87–103. [Google Scholar]
Gehl KA, Colman B. Effect of external pH on the internal pH of Chlorella saccharophila. Plant Physiol. 1985:77(4):917–921. 10.1104/pp.77.4.917 [DOI] [PMC free article] [PubMed] [Google Scholar]
Gherman IM, Abdallah ZS, Pang W, Gorochowski TE, Grierson CS, Marucci L. Bridging the gap between mechanistic biological models and machine learning surrogates. PLoS Comput Biol. 2023:19(4):e1010988. 10.1371/journal.pcbi.1010988 [DOI] [PMC free article] [PubMed] [Google Scholar]
Gramacy RB. Surrogates: Gaussian process modeling, design, and optimization for the applied sciences. Boca Raton, Florida: Taylor & Francis Group (Texts in Statistical Science); 2020. [Google Scholar]
Gramacy RB, Apley DW. Local Gaussian process approximation for large computer experiments. J Comput Graph Stat. 2015:24(2):561–578. 10.1080/10618600.2014.914442 [DOI] [Google Scholar]
Gross W. Ecophysiology of algae living in highly acidic environments. Hydrobiologia. 2000:433(1/3):31–37. 10.1023/A:1004054317446 [DOI] [Google Scholar]
Guterman H, Ben-Yaakov S. Exchange rates of O2 and CO2 between an algal culture and atmosphere. Water Res. 1987:21(1):25–34. 10.1016/0043-1354(87)90095-9 [DOI] [Google Scholar]
Gutknecht J, Bisson MA, Tosteson FC. Diffusion of carbon dioxide through lipid bilayer membranes: effects of carbonic anhydrase, bicarbonate, and unstirred layers. J Gen Physiol. 1977:69(6):779–794. 10.1085/jgp.69.6.779 [DOI] [PMC free article] [PubMed] [Google Scholar]
Harari O, Dean A, Bingham D, Higdon D. Computer experiments: prediction accuracy, sample size and model complexity revisited. Stat Sin. 2018:28:899–919. 10.5705/ss.202016.0217 [DOI] [Google Scholar]
Ichinose TM, Iwane AH. Cyotological analyses by advanced electron microscopy. In: Kuroiwa T, Miyagishima SY, Matsunaga S, Sato N, Nozaki H, Tanaka K, editors. Cyanidioschyzon merolae: a new model eukaryote for cell and organelle Biology. Singapore: Springer Nature; 2017. p. 129–151. [Google Scholar]
Itoh R, Takano H, Ohta N, Miyagishima S-Y, Kuroiwa H, Kuroiwa T. Two ftsH-family genes encoded in the nuclear and chloroplast genomes of the primitive red alga Cyanidioschyzon merolae. Plant Mol Biol. 1999:41(3):321–337. 10.1023/a:1006369104530 [DOI] [PubMed] [Google Scholar]
James G, Witten D, Hastie T, Tibshirani R. An introduction to statistical learning with applications in R. New York: Springer Science + Business Media (Springer Texts in Statistics); 2013. [Google Scholar]
Jones DR, Schonlau M, Welch WJ. Efficient global optimization of expensive black-box functions. J Glob Optim. 1998:13(4):455–492. 10.1023/A:1008306431147 [DOI] [Google Scholar]
Jordan DB, Ogren WL. The CO₂/O₂ specificity of ribulose 1,5-bisphosphate carboxylase/oxygenase: dependence on ribulose bisphosphate concentration, pH and temperature. Planta. 1984:161(4):308–313. 10.1007/BF00398720 [DOI] [PubMed] [Google Scholar]
Kaste JAM, Walker BJ, Shachar-Hill Y. Reaction-diffusion modeling provides insights into biophysical carbon concentrating mechanisms in land plants. Plant Physiol. 2024:196(2):1374–1390. 10.1093/plphys/kiae324 [DOI] [PMC free article] [PubMed] [Google Scholar]
Kubien DS, Brown CM, Kane HJ. Quantifying the amount and activity of rubisco in leaves. In: Carpentier R, editors. Photosynthesis research protocols. Totowa, NJ: Humana Press; 2010. p. 349–362. [Google Scholar]
Kuroiwa T. The primitive red algae Cyanidium caldarium and Cyanidioschyzon merolae as model system for investigating the dividing apparatus of mitochondria and plastids. BioEssays. 1998:20(4):344–354. 10.1002/(SICI)1521-1878(199804)20:4<344::AID-BIES11>3.0.CO;2-2 [DOI] [Google Scholar]
Lavigne H, Proye A, Gattuso J-P. seacarb: calculates parameters of the seawater carbonate system. 2019. https://rdrr.io/rforge/seacarb/. [Google Scholar]
Loeppky JL, Sacks J, Welch WJ. Choosing the sample size of a computer experiment: a practical guide. Technometrics. 2009:51(4):366–376. 10.1198/TECH.2009.08040 [DOI] [Google Scholar]
Loganathan N, Tsai YC, Mueller-Cajar O. Characterization of the heterooligomeric red-type rubisco activase from red algae. Proc Natl Acad Sci U S A. 2016:113(49):14019–14024. 10.1073/pnas.1610758113 [DOI] [PMC free article] [PubMed] [Google Scholar]
Lundberg SM, Lee S-I. A unified approach to interpreting model predictions. In: NIPS'17: Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, California, USA: Curran Associates Inc., 2017. p. 4768–4777. https://dl.acm.org/doi/10.1007/s10207-024-00926-9 [Google Scholar]
MacKay DJC. Information-based objective functions for active data selection. Neural Comput. 1992:4(4):590–604. 10.1162/neco.1992.4.4.590 [DOI] [Google Scholar]
Mangan NM, Brenner MP. Systems analysis of the CO₂ concentrating mechanism in cyanobacteria. eLife. 2014:3:e02043. 10.7554/eLife.02043 [DOI] [PMC free article] [PubMed] [Google Scholar]
Mangan NM, Flamholz A, Hood RD, Milo R, Savage DF. Ph determines the energetic efficiency of the cyanobacterial CO₂ concentrating mechanism. Proc Natl Acad Sci U S A. 2016:113(36):E5354–E5362. 10.1073/pnas.1525145113 [DOI] [PMC free article] [PubMed] [Google Scholar]
McGrath JM, Long SP. Can the cyanobacterial carbon-concentrating mechanism increase photosynthesis in crop species? A theoretical analysis. Plant Physiol. 2014:164(4):2247–2261. 10.1104/pp.113.232611 [DOI] [PMC free article] [PubMed] [Google Scholar]
McKay MD, Beckman RJ, Conover WJ. A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics. 1979:21(2):239–245. 10.2307/1268522 [DOI] [Google Scholar]
Missner A, Kügler P, Saparov SM, Sommer K, Mathai JC, Zeidel ML, Pohl P. Carbon dioxide transport through membranes. J Biol Chem. 2008:283(37):25340–25347. 10.1074/jbc.M800096200 [DOI] [PMC free article] [PubMed] [Google Scholar]
Misumi O, Kuroiwa T, Hirooka S. Application of the tolerance to extreme environment to land plants. In: Kuroiwa T, Miyagishima S, Matsunaga S, Sato N, Nozaki H, Tanaka K, Misumi M, et al., editors. Cyanidioschyzon merolae: a new model eukaryote for cell and organelle biology. Singapore: Springer Singapore; 2017. p. 325–341. [Google Scholar]
Misumi O, Matsuzaki M, Nozaki H, Miyagishima S-Y, Mori T, Nishida K, Yagisawa F, Yoshida Y, Kuroiwa H, Kuroiwa T. Cyanidioschyzon merolae genome. A tool for facilitating comparable studies on organelle biogenesis in photosynthetic eukaryotes. Plant Physiol. 2005:137(2):567–585. 10.1104/pp.104.053991 [DOI] [PMC free article] [PubMed] [Google Scholar]
Miyagishima M, Itoh R, Toda K, Takahashi H, Kuroiwa H, Kuroiwa T. Visualization of the microbody division in Cyanidioschyzon merolae with the fluorochrome brilliant sulfoflavin. Protoplasma. 1998:201(1–2):115–119. 10.1007/BF01280718 [DOI] [Google Scholar]
Miyagishima S, Wei JL, Nozaki H, Hirooka S. Cyanidiales: evolution and habitats. In: Kuroiwa T, Miyagishima S, Matsunaga S, Sato N, Nozaki H, Tanaka K, Misumi O, editors. Cyanidioschyzon merolae: a new model eukaryote for cell and organelle biology. Singapore: Springer Nature; 2017. p. 3–15. [Google Scholar]
Miyagishima SY, Tanaka K. The unicellular red alga Cyanidioschyzon merolae—the simplest model of a photosynthetic eukaryote. Plant Cell Physiol. 2021:62(6):926–941. 10.1093/pcp/pcab052 [DOI] [PMC free article] [PubMed] [Google Scholar]
Miyagishima S-Y, Wei JL. Procedures for cultivation, observation, and conventional experiments in Cyanidioschyzon merolae. In: Kuroiwa T, Miyagishima S, Matsunaga S, Sato N, Nozaki H, Tanaka K, Misumi O, editors. Cyanidioschyzon merolae: a new model eukaryote for cell and organelle biology. Singapore: Springer Nature; 2017. p. 31–41. [Google Scholar]
Morita E, Abe T, Tsuzuki M, Fujiwara S, Sato N, Hirata A, Sonoike K, Nozaki H. Role of pyrenoids in the CO2-concentrating mechanism: comparative morphology, physiology and molecular phylogenetic analysis of closely related strains of Chlamydomonas and Chloromonas (Volvocales). Planta. 1999:208(3):365–372. 10.1007/s004250050571 [DOI] [Google Scholar]
Moriyama T, Mori N, Nagata N, Sato N. Selective loss of photosystem I and formation of tubular thylakoids in heterotrophically grown red alga Cyanidioschyzon merolae. Photosynth Res. 2018:140(3):275–287. 10.1007/s11120-018-0603-z [DOI] [PubMed] [Google Scholar]
Moroney JV, Ma Y, Frey WD, Fusilier KA, Pham TT, Simms TA, DiMario RJ, Yang J, Mukherjee B. The carbonic anhydrase isoforms of Chlamydomonas reinhardtii: intracellular location, expression, and physiological roles. Photosynth Res. 2011:109(1–3):133–149. 10.1007/s11120-011-9635-3 [DOI] [PubMed] [Google Scholar]
Mountraki AD, Benjelloun-Mlayah B, Kokossis AC. A surrogate modeling approach for the development of biorefineries. Front Chem Eng. 2020:2. 10.3389/fceng.2020.568196 [DOI] [Google Scholar]
Nevo R, Charuvi D, Shimoni E, Schwarz R, Kaplan A, Ohad I, Reich Z. Thylakoid membrane perforations and connectivity enable intracellular traffic in cyanobacteria. EMBO J. 2007:26(5):1467–1473. 10.1038/sj.emboj.7601594 [DOI] [PMC free article] [PubMed] [Google Scholar]
Oesterhelt C, Schmälzlin E, Schmitt JM, Lokstein H. Regulation of photosynthesis in the unicellular acidophilic red alga Galdieria sulphuraria. Plant J. 2007:51(3):500–511. 10.1111/j.1365-313X.2007.03159.x [DOI] [PubMed] [Google Scholar]
Orr DJ, Carmo-Silva E. Extraction of RuBisCO to determine catalytic constants. Methods Mol Biol. 2018:1770:229–238. 10.1007/978-1-4939-7786-4_13 [DOI] [PubMed] [Google Scholar]
Parys E, Krupnik T, Kułak I, Kania K, Romanowska E. Photosynthesis of the Cyanidioschyzon merolae cells in blue, red, and white light. Photosynth Res. 2021:147(1):61–73. 10.1007/s11120-020-00796-x [DOI] [PMC free article] [PubMed] [Google Scholar]
Price GD. Inorganic carbon transporters of the cyanobacterial CO2 concentrating mechanism. Photosynth Res. 2011:109(1–3):47–57. 10.1007/s11120-010-9608-y [DOI] [PubMed] [Google Scholar]
Price GD, Badger MR. Expression of human carbonic anhydrase in the cyanobacterium Synechococcus PCC7942 creates a high CO₂-requiring phenotype: evidence for a central role for carboxysomes in the CO₂ concentrating mechanism. Plant Physiol. 1989:91(2):505–513. 10.1104/pp.91.2.505 [DOI] [PMC free article] [PubMed] [Google Scholar]
Price GD, Badger MR, Woodger FJ, Long BM. Advances in understanding the cyanobacterial CO₂-concentrating-mechanism (CCM): functional components, Ci transporters, diversity, genetic regulation and prospects for engineering into plants. J Exp Bot. 2008:59(7):1441–1461. 10.1093/jxb/erm112 [DOI] [PubMed] [Google Scholar]
Price GD, Howitt SM. The cyanobacterial bicarbonate transporter BicA: its physiological role and the implications of structural similarities with human SLC26 transporters. Biochem Cell Biol. 2011:89(2):178–188. 10.1139/O10-136 [DOI] [PubMed] [Google Scholar]
Price GD, Pengelly JJL, Forster B, Du J, Whitney SM, von Caemmerer S, Badger MR, Howitt SM, Evans JR. The cyanobacterial CCM as a source of genes for improving photosynthetic CO2 fixation in crop species. J Exp Bot. 2013:64(3):753–768. 10.1093/jxb/ers257 [DOI] [PubMed] [Google Scholar]
Price GD, Woodger FJ, Badger MR, Howitt SM, Tucker L. Identification of a SulP-type bicarbonate transporter in marine cyanobacteria. Proc Natl Acad Sci U S A. 2004:101(52):18228–18233. 10.1073/pnas.0405211101 [DOI] [PMC free article] [PubMed] [Google Scholar]
Prins A, Orr DJ, Andralojc PJ, Reynolds MP, Carmo-Silva E, Parry MAJ. Rubisco catalytic properties of wild and domesticated relatives provide scope for improving wheat photosynthesis. J Exp Bot. 2016:67(6):1827–1838. 10.1093/jxb/erv574 [DOI] [PMC free article] [PubMed] [Google Scholar]
Rademacher N, Kern R, Fujiwara T, Mettler-Altmann T, Miyagishima S-Y, Hagemann M, Eisenhut M, Weber APM. Photorespiratory glycolate oxidase is essential for the survival of the red alga Cyanidioschyzon merolae under ambient CO₂ conditions. J Exp Bot. 2016:67(10):3165–3175. 10.1093/jxb/erw118 [DOI] [PMC free article] [PubMed] [Google Scholar]
Rademacher N, Wrobel TJ, Rossoni AW, Kurz S, Bräutigam A, Weber APM, Eisenhut M. Transcriptional response of the extremophile red alga Cyanidioschyzon merolae to changes in CO₂ concentrations. J Plant Physiol. 2017:217:49–56. 10.1016/j.jplph.2017.06.014 [DOI] [PubMed] [Google Scholar]
Rahman DY, Sarian FD, van Wijk A, Martinez-Garcia M, van der Maarel MJEC. Thermostable phycocyanin from the red microalga Cyanidioschyzon merolae, a new natural blue food colorant. J Appl Phycol. 2017:29(3):1233–1239. 10.1007/s10811-016-1007-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
Raissi M, Perdikaris P, Karniadakis GE. Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J Comput Phys. 2019:378:686–707. 10.1016/j.jcp.2018.10.045 [DOI] [Google Scholar]
Read BA, Tabita FR. High substrate specificity factor ribulose bisphosphate carboxylase/oxygenase from eukaryotic marine algae and properties of recombinant cyanobacterial RubiSCO containing “algal” residue modifications. Arch Biochem Biophys. 1994:312(1):210–218. 10.1006/abbi.1994.1301 [DOI] [PubMed] [Google Scholar]
Reimer KA, Stark MR, Aguilar L-C, Stark SR, Burke RD, Moore J, Fahlman RP, Yip CK, Kuroiwa H, Oeffinger M, et al. The sole LSm complex in Cyanidioschyzon merolae associates with pre-mRNA splicing and mRNA degradation factors. RNA. 2017:23(6):952–967. 10.1261/rna.058487.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
Robison TA, Oh ZG, Lafferty D, Xu X, Villarreal JCA, Gunn LH, Li F-W. Hornworts reveal a spatial model for pyrenoid-based CO2-concentrating mechanisms in land plants. bioRxiv. 2024. 10.1101/2024.06.26.600872, preprint: not peer reviewed. [DOI] [PubMed] [Google Scholar]
Sato N, Moriyama T, Mori N, Toyoshima M. Lipid metabolism and potentials of biofuel and high added-value oil production in red algae. World J Microbiol Biotechnol. 2017:33(4):74. 10.1007/s11274-017-2236-3 [DOI] [PubMed] [Google Scholar]
Seger M, Mammadova F, Villegas-Valencia M, Bastos de Freitas B, Chang C, Isachsen I, Hemstreet H, Abualsaud F, Boring M, Lammers PJ, et al. Engineered ketocarotenoid biosynthesis in the polyextremophilic red microalga Cyanidioschyzon merolae 10D. Metab Eng Commun. 2023:17:e00226. 10.1016/j.mec.2023.e00226 [DOI] [PMC free article] [PubMed] [Google Scholar]
Seymour JR, Amin SA, Raina J-B, Stocker R. Zooming in on the phycosphere: the ecological interface for phytoplankton–bacteria relationships. Nat Microbiol. 2017:2(7):17065. 10.1038/nmicrobiol.2017.65 [DOI] [PubMed] [Google Scholar]
Sharwood RE, Ghannoum O, Whitney SM. Prospects for improving CO₂ fixation in C₃-crops through understanding C₄-rubisco biogenesis and catalytic diversity. Curr Opin Plant Biol. 2016:31:135–142. 10.1016/j.pbi.2016.04.002 [DOI] [PubMed] [Google Scholar]
Sobol′ IM. Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. Math Comput Simul. 2001:55(1–3):271–280. 10.1016/S0378-4754(00)00270-6 [DOI] [Google Scholar]
Spalding MH. Microalgal carbon-dioxide-concentrating mechanisms: Chlamydomonas inorganic carbon transporters. J Exp Bot. 2008:59(7):1463–1473. 10.1093/jxb/erm128 [DOI] [PubMed] [Google Scholar]
Steensma AK, Shachar-Hill Y, Walker BJ. The carbon-concentrating mechanism of the extremophilic red microalga Cyanidioschyzon merolae. Photosynth Res. 2023:156(2):247–264. 10.1007/s11120-023-01000-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
Thoms S, Pahlow M, Wolf-Gladrow DA. Model of the carbon concentrating mechanism in chloroplasts of eukaryotic algae. J Theor Biol. 2001:208(3):295–313. 10.1006/jtbi.2000.2219 [DOI] [PubMed] [Google Scholar]
Toda K, Takano H, Miyagishima S-Y, Kuroiwa H, Kuroiwa T. Characterization of a chloroplast isoform of serine acetyltransferase from the thermo-acidiphilic red alga Cyanidioschyzon merolae. Biochim Biophys Acta. 1998:1403(1):72–84. 10.1016/S0167-4889(98)00031-7 [DOI] [PubMed] [Google Scholar]
Uemura K, Anwaruzzaman, Miyachi S, Yokota A. Ribulose-1,5-bisphosphate carboxylase/oxygenase from thermophilic red algae with a strong specificity for CO₂ fixation. Biochem Biophys Res Commun. 1997:233(2):568–571. 10.1006/bbrc.1997.6497 [DOI] [PubMed] [Google Scholar]
Villegas-Valencia M, González-Portela RE, de Freitas BB, Al Jahdali A, Romero-Villegas GI, Malibari R, Kapoore RV, Fuentes-Grünewald C, Lauersen KJ. Cultivation of the polyextremophile Cyanidioschyzon merolae 10D during summer conditions on the coast of the red sea and its adaptation to hypersaline sea water. Front Microbiol. 2023:14:1157151. 10.3389/fmicb.2023.1157151 [DOI] [PMC free article] [PubMed] [Google Scholar]
Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, Burovski E, Peterson P, Weckesser W, Bright J, et al. Scipy 1.0: fundamental algorithms for scientific computing in python. Nat Methods. 2020:17(3):261–272. 10.1038/s41592-019-0686-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
von Caemmerer S. Biochemical models of leaf photosynthesis. Collingwood, Australia: CSIRO Publishing; 2000. [Google Scholar]
Walker BJ, Kramer DM, Fisher N, Fu X. Flexibility in the energy balancing network of photosynthesis enables safe operation under changing environmental conditions. Plants. 2020:9(3):301. 10.3390/plants9030301 [DOI] [PMC free article] [PubMed] [Google Scholar]
Whitney SM, Baldet P, Hudson GS, Andrews TJ. Form I Rubiscos from non-green algae are expressed abundantly but not assembled in tobacco chloroplasts. Plant J. 2001:26(5):535–547. 10.1046/j.1365-313x.2001.01056.x [DOI] [PubMed] [Google Scholar]
Xu Y, Fu X, Sharkey TD, Shachar-Hill Y, Walker ABJ. The metabolic origins of non-photorespiratory CO2 release during photosynthesis: a metabolic flux analysis. Plant Physiol. 2021:186(1):297–314. 10.1093/plphys/kiab076 [DOI] [PMC free article] [PubMed] [Google Scholar]
Yagisawa F, Fujiwara T, Kuroiwa H, Nishida K, Imoto Y, Kuroiwa T. Mitotic inheritance of endoplasmic reticulum in the primitive red alga Cyanidioschyzon merolae. Protoplasma. 2012:249(4):1129–1135. 10.1007/s00709-011-0359-1 [DOI] [PubMed] [Google Scholar]
Yagisawa F, Kuroiwa H, Fujiwara T, Kuroiwa T. Intracellular structure of the unicellular red alga Cyanidioschyzon merolae in response to phosphate depletion and resupplementation. Cytologia (Tokyo). 2016:81(3):341–347. 10.1508/cytologia.81.341 [DOI] [Google Scholar]
Yang S, Wong SWK, Kou SC. Inference of dynamic systems from noisy and sparse data via manifold-constrained Gaussian processes. Proc Natl Acad Sci U S A. 2021:118(15):e2020397118. 10.1073/pnas.2020397118 [DOI] [PMC free article] [PubMed] [Google Scholar]
Zenvirth D, Volokita M, Kaplan A. Photosynthesis and inorganic carbon accumulation in the acidophilic alga Cyanidioschyzon merolae. Plant Physiol. 1985:77(1):237–239. 10.1104/pp.77.1.237 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

kiae629_Supplementary_Data

kiae629_supplementary_data.zip^{(3.4MB, zip)}

Data Availability Statement

Data and model code used in this study can be accessed via GitHub: https://github.com/anne-steensma/Cmerolae_CCM_model.

[kiae629-B1] Badger MR, Andrews TJ, Whitney SM, Ludwig M, Yellowlees DC, Leggat W, Price GD. The diversity and coevolution of rubisco, plastids, pyrenoids and chloroplast-based CO2-concentrating mechanisms in the algae. Can J Bot. 1998:76(6):1052–1071. 10.1139/b98-074 [DOI] [Google Scholar]

[kiae629-B2] Barrett J, Girr P, Mackinder LCM. Pyrenoids: CO2-fixing phase separated liquid organelles. Biochim Biophys Acta—Mol Cell Res. 2021:1868(5):118949. 10.1016/j.bbamcr.2021.118949 [DOI] [PubMed] [Google Scholar]

[kiae629-B3] Beardall J, Raven JA. Structural and biochemical features of carbon acquisition in Algae. In: Larkum AWD, Grossman AR, Raven JA, editors. Photosynthesis in algae: biochemical and physiological mechanisms. Cham: Springer International Publishing; 2020. p. 141–160. [Google Scholar]

[kiae629-B4] Bellasio C, Burgess SJ, Griffiths H, Hibberd JM. A high throughput gas exchange screen for determining rates of photorespiration or regulation of C₄ activity. J Exp Bot. 2014:65(13):3769–3779. 10.1093/jxb/eru238 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B5] Carter J, Petersen BP, Printz SA, Sorey TL, Kroll TT. Quantitative application for SDS—PAGE in a biochemistry lab. J Chem Educ. 2013:90(9):1255–1256. 10.1021/ed300390j [DOI] [Google Scholar]

[kiae629-B6] Chapman WL, Welch WJ, Bowman KP, Sacks J, Walsh JE. Arctic sea ice variability: model sensitivities and a multidecadal simulation. J Geophys Res Oceans. 1994:99(C1):919–935. 10.1029/93JC02564 [DOI] [Google Scholar]

[kiae629-B7] Chen T, Guestrin C. XGBoost: a scalable tree boosting system. In: KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. San Francisco, California, USA: Association for Computing Machinery, 2016. p. 785–794. 10.1145/2939672.2939785 [DOI] [Google Scholar]

[kiae629-B8] Curien G, Lyska D, Guglielmino E, Westhoff P, Janetzko J, Tardif M, Hallopeau C, Brugière S, Dal Bo D, Decelle J, et al. Mixotrophic growth of the extremophile Galdieria sulphuraria reveals the flexibility of its carbon assimilation metabolism. New Phytol. 2021:231(1):326–338. 10.1111/nph.17359 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B9] Davey P, Lawson T. Measurements of carbon assimilation in aquatic systems. Methods Mol Biol. 2024:2790:95–120. 10.1007/978-1-0716-3790-6_6 [DOI] [PubMed] [Google Scholar]

[kiae629-B10] de Oliveira RD, Guedes MN, Matias J, Le Roux GAC. Nonlinear predictive control of a bioreactor by surrogate model approximation of flux balance analysis. Ind Eng Chem Res. 2021:60(40):14464–14475. 10.1021/acs.iecr.1c01242 [DOI] [Google Scholar]

[kiae629-B11] Duanmu D, Miller AR, Horken KM, Weeks DP, Spalding MH. Knockdown of limiting-CO2-induced gene HLA3 decreases HCO3- transport and photosynthetic Ci affinity in Chlamydomonas reinhardtii. Proc Natl Acad Sci U S A. 2009:106(14):5990–5995. 10.1073/pnas.0812885106 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B12] Farquhar GD, von Caemmerer S, Berry JA. A biochemical model of photosynthetic CO₂ assimilation in leaves of C₃ species. Planta. 1980:149(1):78–90. 10.1007/BF00386231 [DOI] [PubMed] [Google Scholar]

[kiae629-B13] Fei C, Wilson AT, Mangan NM, Wingreen NS, Jonikas MC. Modelling the pyrenoid-based CO₂-concentrating mechanism provides insights into its operating principles and a roadmap for its engineering into crops. Nat Plants. 2022:8(5):583–595. 10.1038/s41477-022-01153-7 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B14] Field CB, Behrenfeld MJ, Randerson JT, Falkowski P. Primary production of the biosphere: integrating terrestrial and oceanic components. Science. 1998:281(5374):237–240. 10.1126/science.281.5374.237 [DOI] [PubMed] [Google Scholar]

[kiae629-B15] Forrester AIJ, Sóbester A, Keane AJ. Engineering design via surrogate modelling: a practical guide. Chichester, United Kingdom: John Wiley; 2008. [Google Scholar]

[kiae629-B16] Fridlyand L, Kaplan A, Reinhold L. Quantitative evaluation of the role of a putative CO2-scavenging entity in the cyanobacterial CO2-concentrating mechanism. Biosystems. 1996:37(3):229–238. 10.1016/0303-2647(95)01561-2 [DOI] [PubMed] [Google Scholar]

[kiae629-B17] Fridlyand LE. Models of CO2 concentrating mechanisms in microalgae taking into account cell and chloroplast structure. Biosystems. 1997:44(1):41–57. 10.1016/s0303-2647(97)00042-7 [DOI] [PubMed] [Google Scholar]

[kiae629-B18] Friedman JH. Greedy function approximation: a gradient boosting machine. Ann Stat. 2001:29(5):1189–1232. 10.1214/aos/1013203451 [DOI] [Google Scholar]

[kiae629-B19] Fujiwara T, Ohnuma M. Procedures for transformation and their applications in Cyanidioschyzon merolae. In: Kuroiwa T, Miyagishima S, Matsunaga S, Sato N, Nozaki H, Tanaka K, Misumi O, editors. Cyanidioschyzon merolae: a new model eukaryote for cell and organelle biology. Singapore: Springer Nature; 2017. p. 87–103. [Google Scholar]

[kiae629-B20] Gehl KA, Colman B. Effect of external pH on the internal pH of Chlorella saccharophila. Plant Physiol. 1985:77(4):917–921. 10.1104/pp.77.4.917 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B21] Gherman IM, Abdallah ZS, Pang W, Gorochowski TE, Grierson CS, Marucci L. Bridging the gap between mechanistic biological models and machine learning surrogates. PLoS Comput Biol. 2023:19(4):e1010988. 10.1371/journal.pcbi.1010988 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B22] Gramacy RB. Surrogates: Gaussian process modeling, design, and optimization for the applied sciences. Boca Raton, Florida: Taylor & Francis Group (Texts in Statistical Science); 2020. [Google Scholar]

[kiae629-B23] Gramacy RB, Apley DW. Local Gaussian process approximation for large computer experiments. J Comput Graph Stat. 2015:24(2):561–578. 10.1080/10618600.2014.914442 [DOI] [Google Scholar]

[kiae629-B24] Gross W. Ecophysiology of algae living in highly acidic environments. Hydrobiologia. 2000:433(1/3):31–37. 10.1023/A:1004054317446 [DOI] [Google Scholar]

[kiae629-B25] Guterman H, Ben-Yaakov S. Exchange rates of O2 and CO2 between an algal culture and atmosphere. Water Res. 1987:21(1):25–34. 10.1016/0043-1354(87)90095-9 [DOI] [Google Scholar]

[kiae629-B26] Gutknecht J, Bisson MA, Tosteson FC. Diffusion of carbon dioxide through lipid bilayer membranes: effects of carbonic anhydrase, bicarbonate, and unstirred layers. J Gen Physiol. 1977:69(6):779–794. 10.1085/jgp.69.6.779 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B27] Harari O, Dean A, Bingham D, Higdon D. Computer experiments: prediction accuracy, sample size and model complexity revisited. Stat Sin. 2018:28:899–919. 10.5705/ss.202016.0217 [DOI] [Google Scholar]

[kiae629-B28] Ichinose TM, Iwane AH. Cyotological analyses by advanced electron microscopy. In: Kuroiwa T, Miyagishima SY, Matsunaga S, Sato N, Nozaki H, Tanaka K, editors. Cyanidioschyzon merolae: a new model eukaryote for cell and organelle Biology. Singapore: Springer Nature; 2017. p. 129–151. [Google Scholar]

[kiae629-B29] Itoh R, Takano H, Ohta N, Miyagishima S-Y, Kuroiwa H, Kuroiwa T. Two ftsH-family genes encoded in the nuclear and chloroplast genomes of the primitive red alga Cyanidioschyzon merolae. Plant Mol Biol. 1999:41(3):321–337. 10.1023/a:1006369104530 [DOI] [PubMed] [Google Scholar]

[kiae629-B30] James G, Witten D, Hastie T, Tibshirani R. An introduction to statistical learning with applications in R. New York: Springer Science + Business Media (Springer Texts in Statistics); 2013. [Google Scholar]

[kiae629-B31] Jones DR, Schonlau M, Welch WJ. Efficient global optimization of expensive black-box functions. J Glob Optim. 1998:13(4):455–492. 10.1023/A:1008306431147 [DOI] [Google Scholar]

[kiae629-B32] Jordan DB, Ogren WL. The CO₂/O₂ specificity of ribulose 1,5-bisphosphate carboxylase/oxygenase: dependence on ribulose bisphosphate concentration, pH and temperature. Planta. 1984:161(4):308–313. 10.1007/BF00398720 [DOI] [PubMed] [Google Scholar]

[kiae629-B33] Kaste JAM, Walker BJ, Shachar-Hill Y. Reaction-diffusion modeling provides insights into biophysical carbon concentrating mechanisms in land plants. Plant Physiol. 2024:196(2):1374–1390. 10.1093/plphys/kiae324 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B34] Kubien DS, Brown CM, Kane HJ. Quantifying the amount and activity of rubisco in leaves. In: Carpentier R, editors. Photosynthesis research protocols. Totowa, NJ: Humana Press; 2010. p. 349–362. [Google Scholar]

[kiae629-B35] Kuroiwa T. The primitive red algae Cyanidium caldarium and Cyanidioschyzon merolae as model system for investigating the dividing apparatus of mitochondria and plastids. BioEssays. 1998:20(4):344–354. 10.1002/(SICI)1521-1878(199804)20:4<344::AID-BIES11>3.0.CO;2-2 [DOI] [Google Scholar]

[kiae629-B36] Lavigne H, Proye A, Gattuso J-P. seacarb: calculates parameters of the seawater carbonate system. 2019. https://rdrr.io/rforge/seacarb/. [Google Scholar]

[kiae629-B37] Loeppky JL, Sacks J, Welch WJ. Choosing the sample size of a computer experiment: a practical guide. Technometrics. 2009:51(4):366–376. 10.1198/TECH.2009.08040 [DOI] [Google Scholar]

[kiae629-B38] Loganathan N, Tsai YC, Mueller-Cajar O. Characterization of the heterooligomeric red-type rubisco activase from red algae. Proc Natl Acad Sci U S A. 2016:113(49):14019–14024. 10.1073/pnas.1610758113 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B39] Lundberg SM, Lee S-I. A unified approach to interpreting model predictions. In: NIPS'17: Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach, California, USA: Curran Associates Inc., 2017. p. 4768–4777. https://dl.acm.org/doi/10.1007/s10207-024-00926-9 [Google Scholar]

[kiae629-B40] MacKay DJC. Information-based objective functions for active data selection. Neural Comput. 1992:4(4):590–604. 10.1162/neco.1992.4.4.590 [DOI] [Google Scholar]

[kiae629-B41] Mangan NM, Brenner MP. Systems analysis of the CO₂ concentrating mechanism in cyanobacteria. eLife. 2014:3:e02043. 10.7554/eLife.02043 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B42] Mangan NM, Flamholz A, Hood RD, Milo R, Savage DF. Ph determines the energetic efficiency of the cyanobacterial CO₂ concentrating mechanism. Proc Natl Acad Sci U S A. 2016:113(36):E5354–E5362. 10.1073/pnas.1525145113 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B43] McGrath JM, Long SP. Can the cyanobacterial carbon-concentrating mechanism increase photosynthesis in crop species? A theoretical analysis. Plant Physiol. 2014:164(4):2247–2261. 10.1104/pp.113.232611 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B44] McKay MD, Beckman RJ, Conover WJ. A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics. 1979:21(2):239–245. 10.2307/1268522 [DOI] [Google Scholar]

[kiae629-B45] Missner A, Kügler P, Saparov SM, Sommer K, Mathai JC, Zeidel ML, Pohl P. Carbon dioxide transport through membranes. J Biol Chem. 2008:283(37):25340–25347. 10.1074/jbc.M800096200 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B46] Misumi O, Kuroiwa T, Hirooka S. Application of the tolerance to extreme environment to land plants. In: Kuroiwa T, Miyagishima S, Matsunaga S, Sato N, Nozaki H, Tanaka K, Misumi M, et al., editors. Cyanidioschyzon merolae: a new model eukaryote for cell and organelle biology. Singapore: Springer Singapore; 2017. p. 325–341. [Google Scholar]

[kiae629-B47] Misumi O, Matsuzaki M, Nozaki H, Miyagishima S-Y, Mori T, Nishida K, Yagisawa F, Yoshida Y, Kuroiwa H, Kuroiwa T. Cyanidioschyzon merolae genome. A tool for facilitating comparable studies on organelle biogenesis in photosynthetic eukaryotes. Plant Physiol. 2005:137(2):567–585. 10.1104/pp.104.053991 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B48] Miyagishima M, Itoh R, Toda K, Takahashi H, Kuroiwa H, Kuroiwa T. Visualization of the microbody division in Cyanidioschyzon merolae with the fluorochrome brilliant sulfoflavin. Protoplasma. 1998:201(1–2):115–119. 10.1007/BF01280718 [DOI] [Google Scholar]

[kiae629-B49] Miyagishima S, Wei JL, Nozaki H, Hirooka S. Cyanidiales: evolution and habitats. In: Kuroiwa T, Miyagishima S, Matsunaga S, Sato N, Nozaki H, Tanaka K, Misumi O, editors. Cyanidioschyzon merolae: a new model eukaryote for cell and organelle biology. Singapore: Springer Nature; 2017. p. 3–15. [Google Scholar]

[kiae629-B50] Miyagishima SY, Tanaka K. The unicellular red alga Cyanidioschyzon merolae—the simplest model of a photosynthetic eukaryote. Plant Cell Physiol. 2021:62(6):926–941. 10.1093/pcp/pcab052 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B51] Miyagishima S-Y, Wei JL. Procedures for cultivation, observation, and conventional experiments in Cyanidioschyzon merolae. In: Kuroiwa T, Miyagishima S, Matsunaga S, Sato N, Nozaki H, Tanaka K, Misumi O, editors. Cyanidioschyzon merolae: a new model eukaryote for cell and organelle biology. Singapore: Springer Nature; 2017. p. 31–41. [Google Scholar]

[kiae629-B52] Morita E, Abe T, Tsuzuki M, Fujiwara S, Sato N, Hirata A, Sonoike K, Nozaki H. Role of pyrenoids in the CO2-concentrating mechanism: comparative morphology, physiology and molecular phylogenetic analysis of closely related strains of Chlamydomonas and Chloromonas (Volvocales). Planta. 1999:208(3):365–372. 10.1007/s004250050571 [DOI] [Google Scholar]

[kiae629-B53] Moriyama T, Mori N, Nagata N, Sato N. Selective loss of photosystem I and formation of tubular thylakoids in heterotrophically grown red alga Cyanidioschyzon merolae. Photosynth Res. 2018:140(3):275–287. 10.1007/s11120-018-0603-z [DOI] [PubMed] [Google Scholar]

[kiae629-B54] Moroney JV, Ma Y, Frey WD, Fusilier KA, Pham TT, Simms TA, DiMario RJ, Yang J, Mukherjee B. The carbonic anhydrase isoforms of Chlamydomonas reinhardtii: intracellular location, expression, and physiological roles. Photosynth Res. 2011:109(1–3):133–149. 10.1007/s11120-011-9635-3 [DOI] [PubMed] [Google Scholar]

[kiae629-B55] Mountraki AD, Benjelloun-Mlayah B, Kokossis AC. A surrogate modeling approach for the development of biorefineries. Front Chem Eng. 2020:2. 10.3389/fceng.2020.568196 [DOI] [Google Scholar]

[kiae629-B56] Nevo R, Charuvi D, Shimoni E, Schwarz R, Kaplan A, Ohad I, Reich Z. Thylakoid membrane perforations and connectivity enable intracellular traffic in cyanobacteria. EMBO J. 2007:26(5):1467–1473. 10.1038/sj.emboj.7601594 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B57] Oesterhelt C, Schmälzlin E, Schmitt JM, Lokstein H. Regulation of photosynthesis in the unicellular acidophilic red alga Galdieria sulphuraria. Plant J. 2007:51(3):500–511. 10.1111/j.1365-313X.2007.03159.x [DOI] [PubMed] [Google Scholar]

[kiae629-B58] Orr DJ, Carmo-Silva E. Extraction of RuBisCO to determine catalytic constants. Methods Mol Biol. 2018:1770:229–238. 10.1007/978-1-4939-7786-4_13 [DOI] [PubMed] [Google Scholar]

[kiae629-B59] Parys E, Krupnik T, Kułak I, Kania K, Romanowska E. Photosynthesis of the Cyanidioschyzon merolae cells in blue, red, and white light. Photosynth Res. 2021:147(1):61–73. 10.1007/s11120-020-00796-x [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B60] Price GD. Inorganic carbon transporters of the cyanobacterial CO2 concentrating mechanism. Photosynth Res. 2011:109(1–3):47–57. 10.1007/s11120-010-9608-y [DOI] [PubMed] [Google Scholar]

[kiae629-B61] Price GD, Badger MR. Expression of human carbonic anhydrase in the cyanobacterium Synechococcus PCC7942 creates a high CO₂-requiring phenotype: evidence for a central role for carboxysomes in the CO₂ concentrating mechanism. Plant Physiol. 1989:91(2):505–513. 10.1104/pp.91.2.505 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B62] Price GD, Badger MR, Woodger FJ, Long BM. Advances in understanding the cyanobacterial CO₂-concentrating-mechanism (CCM): functional components, Ci transporters, diversity, genetic regulation and prospects for engineering into plants. J Exp Bot. 2008:59(7):1441–1461. 10.1093/jxb/erm112 [DOI] [PubMed] [Google Scholar]

[kiae629-B63] Price GD, Howitt SM. The cyanobacterial bicarbonate transporter BicA: its physiological role and the implications of structural similarities with human SLC26 transporters. Biochem Cell Biol. 2011:89(2):178–188. 10.1139/O10-136 [DOI] [PubMed] [Google Scholar]

[kiae629-B64] Price GD, Pengelly JJL, Forster B, Du J, Whitney SM, von Caemmerer S, Badger MR, Howitt SM, Evans JR. The cyanobacterial CCM as a source of genes for improving photosynthetic CO2 fixation in crop species. J Exp Bot. 2013:64(3):753–768. 10.1093/jxb/ers257 [DOI] [PubMed] [Google Scholar]

[kiae629-B65] Price GD, Woodger FJ, Badger MR, Howitt SM, Tucker L. Identification of a SulP-type bicarbonate transporter in marine cyanobacteria. Proc Natl Acad Sci U S A. 2004:101(52):18228–18233. 10.1073/pnas.0405211101 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B66] Prins A, Orr DJ, Andralojc PJ, Reynolds MP, Carmo-Silva E, Parry MAJ. Rubisco catalytic properties of wild and domesticated relatives provide scope for improving wheat photosynthesis. J Exp Bot. 2016:67(6):1827–1838. 10.1093/jxb/erv574 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B67] Rademacher N, Kern R, Fujiwara T, Mettler-Altmann T, Miyagishima S-Y, Hagemann M, Eisenhut M, Weber APM. Photorespiratory glycolate oxidase is essential for the survival of the red alga Cyanidioschyzon merolae under ambient CO₂ conditions. J Exp Bot. 2016:67(10):3165–3175. 10.1093/jxb/erw118 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B68] Rademacher N, Wrobel TJ, Rossoni AW, Kurz S, Bräutigam A, Weber APM, Eisenhut M. Transcriptional response of the extremophile red alga Cyanidioschyzon merolae to changes in CO₂ concentrations. J Plant Physiol. 2017:217:49–56. 10.1016/j.jplph.2017.06.014 [DOI] [PubMed] [Google Scholar]

[kiae629-B69] Rahman DY, Sarian FD, van Wijk A, Martinez-Garcia M, van der Maarel MJEC. Thermostable phycocyanin from the red microalga Cyanidioschyzon merolae, a new natural blue food colorant. J Appl Phycol. 2017:29(3):1233–1239. 10.1007/s10811-016-1007-0 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B70] Raissi M, Perdikaris P, Karniadakis GE. Physics-informed neural networks: a deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations. J Comput Phys. 2019:378:686–707. 10.1016/j.jcp.2018.10.045 [DOI] [Google Scholar]

[kiae629-B71] Read BA, Tabita FR. High substrate specificity factor ribulose bisphosphate carboxylase/oxygenase from eukaryotic marine algae and properties of recombinant cyanobacterial RubiSCO containing “algal” residue modifications. Arch Biochem Biophys. 1994:312(1):210–218. 10.1006/abbi.1994.1301 [DOI] [PubMed] [Google Scholar]

[kiae629-B72] Reimer KA, Stark MR, Aguilar L-C, Stark SR, Burke RD, Moore J, Fahlman RP, Yip CK, Kuroiwa H, Oeffinger M, et al. The sole LSm complex in Cyanidioschyzon merolae associates with pre-mRNA splicing and mRNA degradation factors. RNA. 2017:23(6):952–967. 10.1261/rna.058487.116 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B73] Robison TA, Oh ZG, Lafferty D, Xu X, Villarreal JCA, Gunn LH, Li F-W. Hornworts reveal a spatial model for pyrenoid-based CO2-concentrating mechanisms in land plants. bioRxiv. 2024. 10.1101/2024.06.26.600872, preprint: not peer reviewed. [DOI] [PubMed] [Google Scholar]

[kiae629-B74] Sato N, Moriyama T, Mori N, Toyoshima M. Lipid metabolism and potentials of biofuel and high added-value oil production in red algae. World J Microbiol Biotechnol. 2017:33(4):74. 10.1007/s11274-017-2236-3 [DOI] [PubMed] [Google Scholar]

[kiae629-B75] Seger M, Mammadova F, Villegas-Valencia M, Bastos de Freitas B, Chang C, Isachsen I, Hemstreet H, Abualsaud F, Boring M, Lammers PJ, et al. Engineered ketocarotenoid biosynthesis in the polyextremophilic red microalga Cyanidioschyzon merolae 10D. Metab Eng Commun. 2023:17:e00226. 10.1016/j.mec.2023.e00226 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B76] Seymour JR, Amin SA, Raina J-B, Stocker R. Zooming in on the phycosphere: the ecological interface for phytoplankton–bacteria relationships. Nat Microbiol. 2017:2(7):17065. 10.1038/nmicrobiol.2017.65 [DOI] [PubMed] [Google Scholar]

[kiae629-B77] Sharwood RE, Ghannoum O, Whitney SM. Prospects for improving CO₂ fixation in C₃-crops through understanding C₄-rubisco biogenesis and catalytic diversity. Curr Opin Plant Biol. 2016:31:135–142. 10.1016/j.pbi.2016.04.002 [DOI] [PubMed] [Google Scholar]

[kiae629-B78] Sobol′ IM. Global sensitivity indices for nonlinear mathematical models and their Monte Carlo estimates. Math Comput Simul. 2001:55(1–3):271–280. 10.1016/S0378-4754(00)00270-6 [DOI] [Google Scholar]

[kiae629-B79] Spalding MH. Microalgal carbon-dioxide-concentrating mechanisms: Chlamydomonas inorganic carbon transporters. J Exp Bot. 2008:59(7):1463–1473. 10.1093/jxb/erm128 [DOI] [PubMed] [Google Scholar]

[kiae629-B80] Steensma AK, Shachar-Hill Y, Walker BJ. The carbon-concentrating mechanism of the extremophilic red microalga Cyanidioschyzon merolae. Photosynth Res. 2023:156(2):247–264. 10.1007/s11120-023-01000-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B81] Thoms S, Pahlow M, Wolf-Gladrow DA. Model of the carbon concentrating mechanism in chloroplasts of eukaryotic algae. J Theor Biol. 2001:208(3):295–313. 10.1006/jtbi.2000.2219 [DOI] [PubMed] [Google Scholar]

[kiae629-B82] Toda K, Takano H, Miyagishima S-Y, Kuroiwa H, Kuroiwa T. Characterization of a chloroplast isoform of serine acetyltransferase from the thermo-acidiphilic red alga Cyanidioschyzon merolae. Biochim Biophys Acta. 1998:1403(1):72–84. 10.1016/S0167-4889(98)00031-7 [DOI] [PubMed] [Google Scholar]

[kiae629-B83] Uemura K, Anwaruzzaman, Miyachi S, Yokota A. Ribulose-1,5-bisphosphate carboxylase/oxygenase from thermophilic red algae with a strong specificity for CO₂ fixation. Biochem Biophys Res Commun. 1997:233(2):568–571. 10.1006/bbrc.1997.6497 [DOI] [PubMed] [Google Scholar]

[kiae629-B84] Villegas-Valencia M, González-Portela RE, de Freitas BB, Al Jahdali A, Romero-Villegas GI, Malibari R, Kapoore RV, Fuentes-Grünewald C, Lauersen KJ. Cultivation of the polyextremophile Cyanidioschyzon merolae 10D during summer conditions on the coast of the red sea and its adaptation to hypersaline sea water. Front Microbiol. 2023:14:1157151. 10.3389/fmicb.2023.1157151 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B85] Virtanen P, Gommers R, Oliphant TE, Haberland M, Reddy T, Cournapeau D, Burovski E, Peterson P, Weckesser W, Bright J, et al. Scipy 1.0: fundamental algorithms for scientific computing in python. Nat Methods. 2020:17(3):261–272. 10.1038/s41592-019-0686-2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B86] von Caemmerer S. Biochemical models of leaf photosynthesis. Collingwood, Australia: CSIRO Publishing; 2000. [Google Scholar]

[kiae629-B87] Walker BJ, Kramer DM, Fisher N, Fu X. Flexibility in the energy balancing network of photosynthesis enables safe operation under changing environmental conditions. Plants. 2020:9(3):301. 10.3390/plants9030301 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B88] Whitney SM, Baldet P, Hudson GS, Andrews TJ. Form I Rubiscos from non-green algae are expressed abundantly but not assembled in tobacco chloroplasts. Plant J. 2001:26(5):535–547. 10.1046/j.1365-313x.2001.01056.x [DOI] [PubMed] [Google Scholar]

[kiae629-B89] Xu Y, Fu X, Sharkey TD, Shachar-Hill Y, Walker ABJ. The metabolic origins of non-photorespiratory CO2 release during photosynthesis: a metabolic flux analysis. Plant Physiol. 2021:186(1):297–314. 10.1093/plphys/kiab076 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B90] Yagisawa F, Fujiwara T, Kuroiwa H, Nishida K, Imoto Y, Kuroiwa T. Mitotic inheritance of endoplasmic reticulum in the primitive red alga Cyanidioschyzon merolae. Protoplasma. 2012:249(4):1129–1135. 10.1007/s00709-011-0359-1 [DOI] [PubMed] [Google Scholar]

[kiae629-B91] Yagisawa F, Kuroiwa H, Fujiwara T, Kuroiwa T. Intracellular structure of the unicellular red alga Cyanidioschyzon merolae in response to phosphate depletion and resupplementation. Cytologia (Tokyo). 2016:81(3):341–347. 10.1508/cytologia.81.341 [DOI] [Google Scholar]

[kiae629-B92] Yang S, Wong SWK, Kou SC. Inference of dynamic systems from noisy and sparse data via manifold-constrained Gaussian processes. Proc Natl Acad Sci U S A. 2021:118(15):e2020397118. 10.1073/pnas.2020397118 [DOI] [PMC free article] [PubMed] [Google Scholar]

[kiae629-B93] Zenvirth D, Volokita M, Kaplan A. Photosynthesis and inorganic carbon accumulation in the acidophilic alga Cyanidioschyzon merolae. Plant Physiol. 1985:77(1):237–239. 10.1104/pp.77.1.237 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Modeling with uncertainty quantification reveals the essentials of a non-canonical algal carbon-concentrating mechanism

Anne K Steensma

Joshua A M Kaste

Junoh Heo

Douglas J Orr

Chih-Li Sung

Yair Shachar-Hill

Berkley J Walker

Abstract

Introduction

Figure 1.

Results and discussion

Rubisco kinetics demonstrated that C. merolae operates a CCM

Figure 2.

Quantitative modeling showed that a hypothesized CCM can explain C. merolae's carbon-concentrating behavior

Figure 3.

Figure 4.

Machine-learning-based surrogate models identified the parameters that most influence CCM efficiency

Figure 5.

In silico knockouts identified experimental targets for further characterization of the C. merolae CCM

Further applications of surrogate modeling and uncertainty quantification

Conclusions

Materials and methods

Experimental data collection: gas-exchange measurements

Experimental data collection: rubisco kinetics measurements

Model details

Definition of reasonable model output values

CO2 compensation point (ΓCO2)

Ratio of ATP consumption flux to net CO2 assimilation flux (ATP per CO2)

Steady-state CO2 concentration in the chloroplast stroma (stromal CO2)

Ratio of oxygen-fixation flux to carbon-fixation flux (vo/vc)

Model optimization and estimation of simulated compensation point

Parameter exploration and surrogate model selection

Accession numbers

Supplementary Material

Acknowledgments

Contributor Information

Author contributions (CRediT taxonomy definitions)

Supplementary data

Supplementary methods

Funding

Data availability

Preprint servers

Dive Curated Terms

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

CO₂ compensation point (Γ_CO2)

Ratio of ATP consumption flux to net CO₂ assimilation flux (ATP per CO₂)

Steady-state CO₂ concentration in the chloroplast stroma (stromal CO₂)

Ratio of oxygen-fixation flux to carbon-fixation flux (v_o/v_c)