Abstract
This paper presents the Multi Clone Kinetic Model (MCKM), a novel generalized kinetic mechanistic model for fed-batch cultivations of diverse Chinese hamster ovary (CHO) cell lines, producing different recombinant monoclonal antibodies (mAbs). Unlike traditional kinetic models requiring multiple cultures for one parameter regression, MCKM derives a complete set of 13 kinetic parameters from a single fed-batch cell line culture of 49 data points. This enables per-cell-line metabolic characterization during cell line development, as well as direct comparisons of kinetics across clones, passages, and different recombinant mAbs. To enable MCKM to be broadly applicable across many cell lines and mAbs, and to address the high-dimensional challenge of estimating 13 kinetic parameters from a small number of datapoints, the model uniquely incorporates a mechanistic growth constraint, a glucose-dependent lactate switch, and automated parameter balancing. MCKM demonstrated successful regression of 656 fed-batch culture runs of 157 unique CHO cell lines across four passage generations, recombinant for three different mAbs, achieving high accuracy in biomass and mAb titre (average
≈ 0.96 ± 0.07 and
≈ 0.97 ± 0.05, respectively). MCKM could facilitate automated cell line selection, identification of critical process parameters and biomarkers, guide media and feeding strategies, predict metabolite profiles, and support scale-up and quality-by-design studies, delivering overall reduction of experimental workload.
One-Sentence Summary: This paper presents a novel kinetic model that derives distinct parameter sets from a single fed-batch run, enabling characterization of individual CHO clones across different mAb targets.
Keywords: cell line development, monoclonal antibodies, CHO cells, kinetic modelling, biotherapeutics
Graphical Abstract
Graphical Abstract.

Introduction
Monoclonal antibodies (mAb) are the largest growing class of biotherapeutics with global sales reaching $343 billion in 2021 (Walsh & Walsh, 2022). Chinese hamster ovary (CHO) cells have become the workhorse of industrial recombinant therapeutic protein production due to their flexibility in synthesising various proteins, human-like post-translational modifications, and adaptability to cell culture conditions (Kawabe et al., 2012; Pilbrough et al., 2009; Yang et al., 2015). Moreover, the Glutamine Synthetase (GS)-CHO cell line has gained prominence for its ability to produce complex biotherapeutics with high yields and product quality (Kim et al., 2012; Patel et al., 2018; Ye et al., 2009). However, CHO cells’ flexibility comes paired with its genomic plasticity, making the cells prone to genetic rearrangements and fluctuations in productivity over longer cultivations (Du et al., 2013; Kim et al., 2011). Consequently, after transfection, an upstream cell line screening study is required to identify cell lines that demonstrate consistently high productivity over multiple generations and cultivations, a process referred to as cell line development (CLD) (DeMaria et al., 2007).
CLD encompasses a series of experiments that aim to develop a clonal cell line with desired performance in production, and which commences with transfection. Transfection results in a heterogeneous pool of cells that differ in copy number and chromosomal positioning of the recombinant gene resulting from non-site-specific transfection and, therefore, leading to different growth and productivity behaviours (Reisinger et al., 2008). By cloning each cell into hundreds of clonal cell line cultures, the purpose of CLD is to assess interconal variation and to select the ‘lead’ clone or cell line, most appropriate for production in terms of growth, stable productivity and product quality. Cell line instability is still poorly understood; it introduces a stochastic level of heterogeneity between cell lines, which renders the process of identifying the lead cell line challenging (Kildegaard et al., 2013; Yusufi et al., 2017). The final stages of CLD consist of multiple rounds of fed-batch scale-down production runs of all cell lines, for example using the 15 mL microbioreactor (MBR) system Ambr15™ (Warr, 2020), aimed at selecting the cell line with adequate mAb productivity, product quality and cell line stability (i.e. stable productivity or product titre over >70 generations) (Dahodwala & Lee, 2019).
The optimization of the CLD screening and selection process would demand a deep understanding of the cellular kinetics governing cell line performance. Mechanistic modelling has been widely used in biochemical engineering to obtain a structured representation of knowledge about a cultivation process, developing fundamental understanding of the underlying biochemical mechanisms governing cell line behaviour (Costa et al., 2016). Among the various types of mechanistic modelling approaches, unstructured unsegregated kinetic models are most commonly used to capture the dynamic behaviour of cell lines, while assuming homogeneity in intracellular kinetics and cells (Kyriakopoulos et al., 2018). It is the hypothesis of this work that mechanistic modelling can provide a means to understand and predict the behaviour of the underlying processes and mechanisms of CHO cell culture. Furthermore, mechanistic modelling would allow for in silico simulation of fed-batch process, providing means to reduce the need for experiments. Mechanistic modelling of cell culture behaviour can be done through stoichiometric or kinetic modelling, while only kinetic modelling captures the dynamic behaviour of cell lines, which may be used in de-risking stability of cell culture scale-up, for example in Ambr15™ (Sha et al., 2018).
Developing a kinetic model for upstream CLD is challenging because it must be broadly applicable across different CHO host strains, clonal cell lines, and recombinant mAbs. Traditional kinetic models at the production scale focus on a single clone and mAb at a time, typically requiring multiple culture runs to generate a complete parameter set for one regression (Jang & Barford, 2000; López-Meza et al., 2016; Tang et al., 2020). This paper introduces a novel Multi Clone Kinetic Model (MCKM), which uniquely derives a set of 13 kinetic parameters from a single culture run for each CHO clonal cell line. Figure 1 illustrates how MCKM differs from conventional kinetic modelling, highlighting its ability to generate parameters for individual cell line runs. By characterizing each culture run in this way, the MCKM provides an additional layer of metabolic information from CLD data that can inform data-driven cell line selection.
Fig. 1.
A comparison of conventional kinetic modelling that accumulates many cell cultures to regress one set of kinetic paraemters, versus the MCKM that uses a single cell line culture run to regress the kinetic parameters of each cell line culture run.
The paper describes the MCKM model design in detail, and demonstrates its successful application to 656 fed-batch cell culture runs in the 15-mL minibioreactor Ambr15™. These data come from four historic CLD campaigns, encompassing 157 unique CHO cell lines, producing three distinct recombinant mAbs across two different CHO host strains. The paper also describes a sensitivity analysis and model identifiability study to evaluate the accuracy of the MCKM. Finally, the paper demonstrates an application of the MCKM to investigate metabolic differences among cell lines in relation to performance and stable productivity, using linear discriminant analysis (LDA).
Methods
Bioreactor Operation
The kinetic model was developed for fed-batch cultivation of recombinant CHO cells expressing high levels of mAb using the Glutamine Synthetase (GS) expression system. The data were recorded by GlaxoSmithKline, Stevenage, UK. The bioreactor system used was the Ambr15™ (TAP Biosystems, Greenville, DE) with a working volume of 15 mL. The data originate from four CLD campaigns where the different cell lines were all derived from the same CHO host. The campaigns are referred to as mAb-A, mAb-B1, mAb-B2, and mAb-C. These four campaigns involve three different mAbs, with mAb-B being campaigned twice using a different cell line platform (the exact difference is kept confidential). A total of 157 unique cell lines were studied, each progressing through four sequential Ambr runs to capture productivity data over at least 70 generations, enabling assessment of cell line stability. Cell line stability refers to consistent production across multiple passages or culture runs (Dahodwala & Lee, 2019). The four sequential runs are referred to as Ambr-1, Ambr-2, Ambr-3, and Ambr-4. Including some manually selected promising cell lines run in duplicates or triplicates, there are a total of 656 cell line culture runs (N = 656). Table 1 gives an overview of how many unique cell lines and cell cultures were run. Between each sequential Ambr run, cel lines are occasionally and manually selected and omitted based on poor cell line productivity, viability, or product quality, resulting in decreasing N over runs (exact selection criteria are kept confidential).
Table 1.
An Overview of the Number of Cell Lines in Each Dataset and the Number of Cell Culture Runs (Batches)
| Ambr-1 | Ambr-2 | Ambr-3 | Ambr-4 | |||||
|---|---|---|---|---|---|---|---|---|
| N | n | N | n | N | n | N | n | |
| mAb-A | 48 | 48 | 35 | 35 | 33 | 35 | 33 | 36 |
| mAb-B1 | 19 | 20 | 18 | 18 | 18 | 18 | 18 | 20 |
| mAb-B2 | 47 | 47 | 47 | 48 | 32 | 48 | 32 | 48 |
| mAb-C | 43 | 52 | 43 | 63 | 43 | 50 | 43 | 62 |
N is the number of unique cell lines, and n is the number of experiments, as sometimes particular cell lines are run in duplicate or triplicate. mAb-A, B1, B2, and C are the four CLD campaigns; Ambr-1, 2, 3, and 4 are the four sequential Ambr runs cultivating the same cell line over increasing generation numbers.
Dataset
The total length of each Ambr15™ fed-batch cultivation is approximately 15 days. At-line samples were analysed on approximately day 0, 3, 6, 8, 10, 13, and 15, resulting in 7 (T = 7) samples in total, recording metabolic profiles of 7 metabolites (M = 7): viable cell concentration (VCC) (Xυ), glucose (glc), glutamine (gln), glutamate (glu), mAb (P), ammonium (amm), lactate (lac), and for each Ambr15™ run. Furthermore, the fed-batch cultivations were run with bolus feeding, i.e. discrete feeding over time, of glucose and glutamate. Feeding of glucose and glutamate was executed right after sampling, with a total of five feeding times, commencing after the second sampling time. The feeding schedule of one individual run of mAb-A Ambr-1 was used as an approximation of the feeding of all other cell culture runs. Each cultivation is a dataset of seven-time measurements of the seven metabolites, resulting in a [7 × 7] matrix for each cell line culture run, as shown in Eq. 1.
![]() |
(1) |
Missing data
Some data were missing, such as day 2 lactate for mAb-A, Ambr-2, and day 1 protein titer for mAb-A, Ambr-2. For these samples, the kinetic model regression was run with fewer datapoints.
Kinetic Model Development
Given the complexity of kinetic models—particularly in defining sufficient number of equations and parameters while avoiding overfitting—many mammalian cell bioprocess models have adopted a macroscopic approach (Chotteau et al., 2022). For the MCKM, the first step was to identify the key metabolic pathways governing growth and production, expressed through stoichiometric equations describing the interrelationships between major metabolites (Nolan & Lee, 2011). These were then translated into ordinary differential equations (ODEs) to capture temporal changes in metabolite concentrations. Mass balance expressions for production and consumption rates were formulated using established kinetics, including the Monod equation (Monod, 1949), derived from the Michaelis–Menten model (Michaelis & Menten, 1913), which has been widely favoured in describing enzyme kinetics due to its simplicity and ease of use for mathematical computations (Tang et al., 2020). For other metabolites, the Luedeking–Piret equation (Luedeking & Piret, 1959) was applied, relating production or consumption rates to growth rate and yield.
Biomass
The ODE of the viable biomass concentration (Xv in 109 cells L−1) is a function of the growth rate (µ in h−1) and death rate, Eq. 2.
![]() |
(2) |
In kinetic models, growth is typically expressed as a function of limiting substrates such as glucose and glutamine, while toxic by-products (ammonium, lactate) may appear in the growth or death rate equations (Kornecki & Strube, 2019; López-Meza et al., 2016; Xing et al., 2010). In this study, glutamate was supplied in excess and thus not growth-limiting, while prior data analysis showed no significant correlation between VCC and ammonium or lactate in the growth phase (Sietaram et al., 2025). Consequently, glucose is the only growth-limiting substrate considered. The growth rate is described by Monod kinetics, as shown in Eq. 3.
![]() |
(3) |
where µmax is the maximum growth rate (in h−1) and Kglc (in mM) is the Michaelis–Menten constant of glucose or half-saturation concentration.
Previous data analysis (Sietaram et al., 2025) showed a significant negative Spearman correlation between VCC and ammonium during the death phase, suggesting ammonium-induced cell death. In line with the literature, where ammonium and lactate are both reported as cytotoxic by-products of protein metabolism (Karra et al., 2010), many CHO kinetic models include these metabolites in the death rate equation (Xing et al., 2010). However, the MCKM must remain robust across diverse behaviours, from clones that only produce lactate to those that switch from lactate production to consumption in the stationary phase (Carinhas et al., 2013; Gašperšič et al., 2018; Martínez et al., 2013). To ensure consistency, lactate was excluded, and the death rate equation was instead formulated as a Monod-type function of ammonium concentration alone, Eq. 4.
![]() |
(4) |
where kD is the death rate (in h−1) and KDamm (in mM) is the Monod constant of ammonium inhibition.
Monoclonal antibody
The amount of secreted mAb in the extracellular environment (P in mM) formed over time is described by Eq. 5.
![]() |
(5) |
The form of the production rate for mAb (qP in mM h−1) can be described by the Luedeking–Piret model (López-Meza et al., 2016) as a product YP/X (in mmol/109 cells)—the yield of mAb, and biomass growth rate, Eq. 6.
![]() |
(6) |
Glucose
Glucose is the only growth-limiting substrate and is both being consumed, as well as being fed during the fed-batch cultivation, resulting in the ODE shown as Eq. 7.
![]() |
(7) |
where F(t) is a discrete feeding function (in L h−1), representing the fed-batch bolus feeding, glcF,in (in mM) is the concentration of glucose in the feed, and VL(t) is the volume of the bioreactor (in L). The consumption rate of glucose (qglc in mM h−1) is described by the Luedeking–Piret model (López-Meza et al., 2016), Eq 8.
![]() |
(8) |
where YX/glc in (109 cells mM−1) is the yield of biomass over glucose and mglc (in mmol L−1 h−1) is consumption of glucose for maintenance.
Glutamate
GS-CHO consumes glutamate as a substrate, where the majority of glutamate is converted into glutamine through the GS-system, as well as consumption for tricarboxylic acid (TCA) cycle intermediates or amino acids for protein synthesis (Zhang et al., 2006). Glutamate is fed throughout the cultivation, thus can be described as shown in Eq. 9.
![]() |
(9) |
The consumption rate of glutamate (qglu in mM h−1) is described by the Luedeking–Piret model (López-Meza et al., 2016), Eq. 10.
![]() |
(10) |
Glutamine
Glutamine is mainly formed through the activity of GS that converts glutamate and ammonium into glutamine. Henceforth, glutamine production is a function of the glutamate consumption rate (qglu) with Ygln/glu (in mol mol−1) the yield of glutamine on glutamate. Furthermore, glutamine is consumed at a rate qgln for protein and biomass synthesis, Eq. 11.
![]() |
(11) |
The rate of consumption of glutamine (qgln) is described by the Luedeking–Piret model (López-Meza et al., 2016) as a function of yield YX/gln (in 109 cells mM−1), Eq. 12.
![]() |
(12) |
Ammonium
Ammonium (NH₃) is a by-product of cell growth and protein production. In GS-expression systems, glutamine synthase (GS) converts glutamate and ammonia into glutamine. The literature often links ammonium profiles to glutamine degradation (Xing et al., 2010), and for the GS-system specifically ammonium to be a function of glutamate (Kiparissides et al., 2015). Data analysis of the current dataset showed that ammonium correlates positively with glutamate only during the growth phase, and not in stationary or death phases (Sietaram et al., 2025). Most cell lines exhibited steadily increasing ammonium profiles, with occasional drops due to GS conversion. MCKM requires to accommodate the lack of a consistent link of ammonium to glutamate across all cell lines. The ammonium equation in MCKM is simplified to a general production term arising from aminoacid metabolism, Eq. 13.
![]() |
(13) |
where the ammonium production rate (qamm in mM h−1) with Luedeking–Piret as a function of YX/amm (in 109 cells mM−1), Eq. 14.
![]() |
(14) |
Lactate
Lactate production is a result of the anaerobic conversion of glucose, where excess pyruvate is converted into lactate. This lactate production is often a function of the glucose consumption rate (Jang & Barford, 2000). A constraint could be used to introduce the lactate switch to the model, the transition from lactate production to consumption as cells enter the late exponential phase. The model must be flexible to be able to capture all cell lines behaviours where some cells exhibit and do not exhibit the switch to lactate consumption.
To the best of our knowledge, no kinetic model in the literature of CHO incorporates an accurate mathematical description of the lactate switch. The main identified cause for the lactate switch was glucose depletion (Hartley et al., 2018; Martínez et al., 2013). This was further validated in the current data set; data analysis has shown that lactate was found to significantly correlate to glucose profiles (Sietaram et al., 2025). Therefore, MCKM was designed such that the lactate switch depends on glucose depletion by incorporating an arbitrary minimum glucose concentration at which lactate consumption begins. This threshold was determined by analysing the 656 cell cultures and averaging glucose concentrations at which all cell lines started consuming lactate, resulting in a model constant aglc (in mM). This value is deduced only once from the data and further treated as a model constant. This is implemented into the kinetic model equation for lactate, as shown in Eqs. 15 and 16.
![]() |
(15) |
![]() |
(16) |
where Ylac/glc (in mol mol−1) is the yield of lactate on glucose, qglc is the glucose consumption rate, qlac (in mM h−1) is the lactate consumption rate, and aglc (in mM) is the concentration of glucose below which lactate consumption is activated.
Model Implementation
Additional ODE for growth rate
The nature of the kinetic regression problem at hand is rather challenging. MCKM must estimate 13 kinetic parameters from a limited number of the available experimental data (49 datapoints) of a single culture run, while capturing the diverse behaviours of multiple CHO cell lines and mAbs. Thus, it is useful to introduce additional mechanistic constraints to attain the desired model accuracy. The unique feature of MCKM is the incorporation of an additional ODE describing the evolution of the growth rate (µ) over time. This acts as a mechanistic constraint, guiding regression towards parameter sets consistent with realistic growth dynamics. Prior to regression, the growth rate is estimated from raw VCC data (µest, see the section ‘Expansion of input data with growth kinetics’) and treated as input alongside metabolite profiles. The model then regresses µest against the growth rate ODE, which is derived using the quotient rule, Eq. 17.
![]() |
(17) |
where glc is the glucose concentration (mM), Kglc is the half-saturation constant of glucose (mM), µmax is the maximum growth rate (h−1) and µest is the estimated growth rate from VCC data (h−1).
Bolus fed-batch model implementation
The kinetic model was implemented in Matlab®2019b as a set of ODEs. Since Matlab®2019b requires all variables, including time, to be continuous, the discrete nature of feeding poses a challenge. To address this, the fed-batch model equations were simplified into a batch kinetic model by removing the feeding terms. Fed-batch simulations are then run iteratively between feeding events, updating concentrations at each feeding step as new initial concentrations. The error of fed-batch regression is then calculated as a sum of errors of the iterative batch regressions. The seven ODEs as a batch model can be summarized as shown in Eq. 18.
![]() |
(18) |
To run the bolus fed-batch kinetic model, the batch model is executed iteratively between the seven sampling times [ti, ti+1] resulting in six iterations. The model variables are X(t)=[Xv, P, glc, glu, gln, amm, lac, µest] and the kinetic parameters are:
![]() |
The first batch run starts with the defined initial concentrations Xinitial(t0). After each iteration, the ode45 solver in Matlab®2019b computes Xfinal(ti+1) between ti and ti+1. The discrete feeding Xfeeding (of substrates glc and glu) is added at ti+1 to the Xfinal, producing the new Xinitial(ti+1) for the next iteration. An overview of the bolus fed-batch model in pseudocode is given in Table 2.
Table 2.
A Pseudocode of the Fed-Batch Bolus Feeding Model Implemented in Matlab as Iterations of a Batch Model fbatch (Eq. 18)
| 1. Function run_fed_batch_bolus 1.1 Xinitial = [VCC0,glc0,glu0, gln0, amm0, lac0, P0, μest,0] 1.2 tsample = [t1,t2,t3,t4,t5,t6,t7] 1.3 For iteration i = (length(tsample) − 1) 1.3.1 tinitial = tsample(i) 1.3.2 tfinal = tsample(i + 1) 1.3.3 Xfinal = solve ode45 for fbatch(tinitial: tfinal,Xinitial,θ) 1.3.4 Xinitial = Xfinal + Xfeeding |
Parameter Regression
Expansion of input data with growth kinetics
The kinetic model is designed to regress a unique set of θ from one single cell culture run. In this study, a total of 656 (r = 656) Ambr15™ data cell line cultivations are regressed. To improve the accuracy of the model, an additional ODE for the estimated growth rate (µ) was incorporated into the model. No direct measurements of µ were recorded; hence the growth rate was rather estimated (µest) from raw VCC data, derived from the assumption that the growth rate corresponds to the exponential of the VCC according to Goudar et al. (2008) and Xing et al. (2010), see Eq. 19.
![]() |
(19) |
Note that this assumes Xv follows exponential growth at all times, which is an approximation, since growth may deviate from exponential behavior outside the true exponential phase.
For the first µest,t1 at t1 the value calculated for µest,t2 is used twice. Thus, the raw input data Xraw,data is expanded with µest, resulting in Xdata. Xdata combines metabolite profiles with the growth rate estimates and is used as the actual input for regression; see Xdata described in Eq. 20.
![]() |
(20) |
Prior to regression, Xdata was normalized into |Xdata| using min-max scaling:
![]() |
(21) |
Objective function
The parameter regression was performed using least squared regression. Alternative methods such as Markov Chain Monte Carlo method or Bayesian optimization were considered impractical because the regression being performed on one cell culture run, as these optimization methods generally require more cell culture runs to perform parameter regression due to their inherent computational demands and the need for extensive sampling to ensure accuracy and convergence (Hernández Rodríguez et al., 2019; Ridgeway & Madigan, 2003; Xing et al., 2010).
The prediction variables are given as shown in Eq. 22.
![]() |
(22) |
The objective function J was defined as shown in Eq. 23.
![]() |
(23) |
fbatch(tdata, Xdata(t1),θ) (Eq. 18) returns Xpred, predicted metabolite profiles, solved using ode45 in Matlab®2019b. Due to the batch implementation of the kinetic model, the cost function is a summation in between time intervals to generate the fed-batch objective function. The θ parameters are regressed using the function fmincon and the algorithm interior-point (maximum number of function evaluations allowed 3E3) in Matlab®2019b according to:
![]() |
(24) |
Global optimization methods were explored as an alternative to the interior-point algorithm, since the cost function is likely multi-modal with many local minima (Gábor et al., 2017; Miró et al., 2012). However, due to the nonlinear differential–algebraic equations and additional constraints, the optimization problem is ill-conditioned and highly sensitive to initial conditions. Small perturbations in input can cause large output variations, leading algorithms to converge to local optima or diverge (Sawicki et al., 2022; Villaverde et al., 2019). Moreover, global methods are computationally demanding, which is particularly problematic for such irregular optimization landscapes (Pal et al., 2005).
Initial and boundary conditions
This section describes how the initial guesses and bounds for the kinetic parameters, either mathematically or from the literature, were derived.
Initial guesses and bounds for yield
The initial guesses of the yields were estimated for each cell line were using the equation (Xing et al., 2010), see Eq. 25.
![]() |
(25) |
where ci(t1), cj(t1), ci(t7) and cj(t7) are the initial and the final concentrations of components i and j, respectively, VL(t1) and VL(t7) are the initial and the final volumes in the bioreactor, and ci,feed is the concentration of component i in the feed with a feeding volume Vfeed. When computing Yi/j for all cell culture runs, maximum and minimum values were used as the upper bounds (θub) and lower bounds (θlb), respectively for parameter regression.
Initial guesses for µmax and Kglc
Prior to MCKM regression, a separate regression was performed only using concentration of glc and µest data, to generate the initial guesses (IGs) for µmax and Kglc. µest was calculated from Eq. 19 and regressed against Eq. 3, assuming exponential growth of VCC. Only the first four time points were used here (µest, t1…µest, t4), corresponding to the strictly exponential growth phase (day 0–8). The sub dataset for this particular regression can be mathematically summarized as shown in Eq. 26.
![]() |
(26) |
Using the function nlinfit in Matlab®2019b, Xsubdata is non-linearly regressed upon the growth rate equation (Eq. 3). The regressed parameters are used as the initial guesses for µmax,IG and Kglc,IG.
Remaining initial and boundary conditions
To define bounds and initial guesses of the remaining kinetic parameter, values from literature were used. The upper bounds for the yield kinetic parameters Ylac/glc and Ygln/glc were deduced from stoichiometry. All initial guesses and bounds used in the regression are given in Table 3.
Table 3.
An Overview of Bounds and Initial Guesses of Kinetic Parameters
| θ | unit | θlb | θ0 | θub | Literature | Reference |
|---|---|---|---|---|---|---|
| μmax | h−1 | 0.01 | * | 0.1 | 0.029–0.090 | (López-Meza et al., 2016; Shakibaie et al., 2011; Xing et al., 2010) |
| kd | h−1 | 0.0001 | 0.016 | 0.1 | 0.016 | (Xing et al., 2010) |
| Kglc | g L−1 | 0.1 | * | 10 | 0.18–3.5 | (Kiparissides et al., 2015; López-Meza et al., 2016; Shakibaie et al., 2011; Xing et al., 2010) |
| mglc | g L−1 | 0 | 0.006 192 | 1 | 0–0.036 | (Xing et al., 2010) |
| Y X/glc | 109cells g−1 | 0.1 | 1** | 10 | 0.42–2.8 | (López-Meza et al., 2016; Shakibaie et al., 2011) |
| Y X/gln | 109cells mmol−1 | 0.1 | 0.974** | 10 | 0.4–0.97 | (Acosta et al., 2007; Xing et al., 2010) |
| Y X/glu | 109cells mmol−1 | 0.001 | 1** | 20 | - | - |
| Y lac/glc | g g−1 | 0.1 | 0.615 | 2*** | 0.5–0.615 | (Martínez et al., 2013; Xing et al., 2010) |
| Y X/amm | 109cells mmol−1 | 0.1 | 0.67** | 20 | - | - |
| Y gln/glu | mol mol−1 | 0.001 | 0.7 | 1*** | - | |
| Y X/lac | 109cells g−1 | 0.01 | 2.7** | 5 | 0.89 | (Shakibaie et al., 2011) |
| Y P/X | g/109cells | 0.0001 | 0.2** | 1 | - | - |
| KDamm | mM | 0 | 5 | 100 | 1.44–14.28 | (Kotidis et al., 2019a; Xing et al., 2010) |
computed for each cell line as described in ‘Initial guesses of µmax and Kglc’
derived from an average of the estimated yields as described in ‘Initial guesses yields’
derived from stoichiometry. For ‘-’ no values were found in the literature.
Model accuracy
For each regression or cell line, the goodness of fit was assessed through computing R2 for each variable i in Xpred, computed according to Eq. 27.
![]() |
(27) |
where Xdata,i is the data of variable i, n is the number of datapoints (7) and
is the mean of the observed values of the ith variable. Furthermore, the normalized vector of the Root Mean Squared Error (RMSE) is computed as the % normalized RMSE (NMRSE) using min-max normalization (Eq. 21) upon |Xdata| and |Xpred|:
![]() |
(28) |
Automated parameter balancing
A common approach to model regression is parameter balancing, keeping some kinetic parameters fixed to reduce the parameter space to find optimal solutions (Lubitz et al., 2010).
The kinetic model is now referred to as the ‘free-model’, comprising 13 kinetic parameters. During model development, potential co-linearity was identified between µmax, kd and Kglc which can destabilize regression. In the literature, this is often addressed by fixing one or more parameters based on prior experimental measurements (Xing et al., 2010). However, such data were not available for all individual cell lines in this study. To address this, MCKM implements an automated parameter balancing procedure: if regression upon a certain cell line using the ‘free model’ fails, failing defined as
<0.90 or
<0.90, the parameter space is reduced to 12 instead of 13, keeping either µmax or Kglc fixed. The values at which they are kept fixed are the initial guesses (µmax,IG and Kglc,IG). Figure 2 shows an overview of the workflow of MCKM structure, and visualizes the automated parameter balancing.
Fig. 2.
An overview of the Multi Clone Kinetic Model work flow. Each cell line culture run N regresses separately through the model. consisting of 49 datapoints (7 metabolites and 7 timepoints, additional constrains are implemented to result in a robust widely applicable model for many cell lines and different mAbs.
Model Identifiability
Sensitivity analysis
Whilst R2 and RMSE assess accuracy of the kinetic model to fit the experimental metabolite profiles, parameter identifiability analysis is useful to confirm whether one unique solution for the set of kinetic parameters exists. This validation process ensures the biological significance of the regressed kinetic parameters, allowing the model to be utilized for metabolic analysis. A sensitivity analysis was executed for each of the parameters in θ). The sensitivity index (sij) for each kinetic parameter j and each metabolite i was computed, averaging over the total number of timepoints (T = 7) was computed as shown in Eq. 29.
![]() |
(29) |
where Xi is the observed variable or concentration of the ith metabolite. Averaging over all metabolites, the mean squared sensitivity of each parameter is given by Eq. 30 (Gábor et al., 2017).
![]() |
(30) |
Parameter identifiability
To assess if the parameters are uniquely identifiable, it was verified whether changes in parameters can be cancelled out by other parameters, due to collinearity. This is computed through the collinearity index (CI) between parameter pairs. The sensitivity index is first normalized:
![]() |
(31) |
The normalized matrix
. The CI for each parameter pair K was computed as:
![]() |
(32) |
where
is the smallest eigenvalue of
(Brun et al., 2002) implemented using eig() in Matlab®2019b.
Biological interpretation of parameter estimates
The parameter regression was run a total of 656 times for each cell line culture run. The identified kinetic parameters were averaged for each of the four different mAbs campaigns separately, computing the mean values of the parameters across the cell lines and standard deviation which reflect the variance amongst cell lines. The identified kinetic parameter values were compared to those reported in the literature.
Case study: using the MCKM to identify key metabolic patterns in CLD
To demonstrate a biologically relevant application of the MCKM kinetic model, a case study is presented that uses the regressed kinetic parameters of the cell lines to study metabolic differences between well-performing and poorly performing, or stable and unstable cell lines. Well-performing cell lines are defined as scoring the top 50% mAb titre during Ambr-1, 2, 3, and 4, exhibiting minimal changes in productivity over time (i.e. low slope when productivity is plotted against generation) and maintaining a sufficiently high productivity level. Specific numerical thresholds were not disclosed due to confidentiality agreements. LDA, as proposed by (Fisher, 1936), was applied to the kinetic parameters acquired from both groups; well-performing versus poor-performing, or stable versus unstable cell lines. This quantifies the contribution of each kinetic parameter to both performance and stability.
Results and Discussion
Kinetic Model Regression
In this study, a generalized Multi Clone Kinetic Model (MCKM) was developed to regress kinetic parameters of individual CHO cell cultures producing different recombinant mAbs. The MCKM estimates 13 kinetic parameters per culture and was applied to 656 Ambr15™ fed-batch cultures spanning 157 unique CHO clonal cell lines, recombinant for three distinct mAbs. Table 4 contains the average accuracy of the 656 MCKM regressions in terms of R2 and RMSE of the eight process variables, averaged overall all the cell lines that are recombinant for the same mAb as this data originates from four different CLD campaigns.
Table 4.
MCKM Regression Performance Metrics of the Four Different CLD Campaigns
| mAb-A | mAb-B1 | mAb-B2 | mAb-C | |||||
|---|---|---|---|---|---|---|---|---|
| Mean | Std | Mean | Value | Mean | Std | Mean | Std | |
|
0.920 | 0.078 | 0.959 | 0.055 | 0.967 | 0.065 | 0.954 | 0.067 |
|
0.935 | 0.045 | 0.970 | 0.023 | 0.975 | 0.057 | 0.985 | 0.043 |
|
-0.240 | 0.480 | 0.247 | 0.449 | -0.041 | 0.374 | 0.246 | 0.520 |
|
0.671 | 0.501 | 0.888 | 0.124 | 0.866 | 0.208 | 0.885 | 0.169 |
|
0.355 | 0.354 | 0.508 | 0.359 | 0.271 | 0.331 | 0.336 | 0.537 |
|
0.206 | 1.984 | -0.238 | 1.961 | -0.220 | 3.455 | -0.524 | 2.516 |
|
0.375 | 0.586 | 0.729 | 0.486 | 0.869 | 0.226 | 0.672 | 0.320 |
|
5.8% | 3.9% | 6.7% | 3.1% | 6.1% | 4.3% | 7.4% | 0.035 |
| NRMSEP | 8.6% | 3.9% | 5.7% | 2.3% | 5.5% | 3.9% | 4.1% | 0.033 |
| NRMSEglc | 46.3% | 12.4% | 49.7% | 14.6% | 50.6% | 10.6% | 50.4% | 21.8% |
| NRMSEgln | 15.8% | 8.4% | 10.9% | 4.6% | 12.9% | 5.8% | 10.3% | 5.0% |
| NRMSEglu | 34.2% | 13.2% | 33.8% | 14.7% | 39.8% | 10.5% | 35.1% | 13.8% |
| NRMSElac | 33.5% | 0.265 | 40.0% | 21.1% | 42.2% | 28.0% | 44.4% | 22.0% |
| NRMSEamm | 22.5% | 10.9% | 13.5% | 10.4% | 11.4% | 6.8% | 19.3% | 7.9% |
Tabulated are R2 and NRMSE values of regression of averaged over all cell lines of the same CLD campaign (mAb-A, B1, B2, or C). Values from different CLD campaigns are reported separately, as differences in mAb or platform can substantially influence cell line kinetics.
Modelling of growth and titre
Overall, MCKM demonstrated strong regression performance for biomass and mAb titre, with an average
0.96 ± 0.07 and
0.97 ± 0.05 (Table 4) averaged over the 656 regressions, indicating that the MCKM effectively captures diverse cell line behaviours. Out of the 656 parameter regressions, 99% runs are being predicted with
> 0.90 and RP > 0.90. Compared to existing literature, the accuracy of the regression achieved in this study is comparable (Goudar et al., 2010; Kotidis et al., 2019b; López-Meza et al., 2016; Xing et al., 2010). However, it is important to note that MCKM operates very differently from the literature’s kinetic models, as it does not accumulate hundreds of cultures for one parameter regression but rather uses only one cell culture run (with 49 available data points in our case) to regress a set 13 kinetic parameters. Each regression of MCKM was performed independently, with no parameter sharing or transfer learning across experiments; the fitted parameters are specific to kinetics of each individual cell line culture run. Given this high dimensionality problem, MCKM’s performance can be considered remarkably robust.
and NRMSEP of 6–7% and 4–9%, respectively outperforms hybrid modelling strategies and transfer models which range between 10% and 20% (Bayer et al., 2021).
Three of the 656 MCKM cell line culture regressions are plotted in Fig. 3 to show a variety of cell line behaviours MCKM is able to capture. Figure 3a plots a cell line that was optimized using MCKM ‘automated parameter balancing’ described in the section ‘Initial guesses for µmax and Kglc’. The initial ‘regression free model’ resulted in
=0.87, which is below the threshold of accurate regression (
<0.90). Consequently, MCKM automatically triggers its automated parameter balancing procedure, re-running the regression in a reduced parameter space of 12 instead of 13, keeping either Kglc or µmax fixed. Figure 3a plots the ‘regression fixed Kglc’ and shows improvement in biomass, protein, glucose, ammonium, and lactate regression compared to the ‘regression free model’, to
= 0.99 and
= 0.91. The other metabolites show only slight improvement, comparing ‘regression free model’ to ‘regression fixed Kglc’, respectively, highlighting the model’s capacity to find multiple kinetic parameter solutions that still result it a similar plot of the metabolite profiles. This will be further analysed in the parameter identifiability study in the section ‘Parameter identifiability’.
Fig. 3.
Regression results of the MCKM of four example cell line culture runs. To demonstrate a variety of cell line behaviours, plotted are (a) a cell line with significant lactate consumption (mab-A, Ambr-1, cell line A1); (b) a cell line with only increasing lactate (mab-A, Ambr-1, cell line A2); (c) a poorly growing cell line (mAb-A, Ambr-1, cell line A3), and (d) a cell line that fluctuates in lactate production to consumption (mAb-C, Ambr-1).
Figure 3b shows a cell line with an earlier and steeper death phase, resulting in an overall reduced cell growth, which MCKM accurately captures with
= 0.97 and
= 0.98. Figure 3b and d also reveal that around the stationary phase (t = 250), MCKM predicts a dip in the VCC growth curve which is not observed in the experimental profiles. This arises because in MCKM death rate is defined as a function of ammonium, while ammonium is represented as a simple mathematical expression proportional to the growth rate. In contrast, the experimental data show a drop in ammonium around the stationary phase, see Figs. 3b and d, which MCKM regression cannot mathematically reproduce. Nevertheless, MCKM still predicts an appropriate peak of VCC. Peak value of VCC is often reported as a key metric in CLD, as it reflects the maximum viable biomass achieved during culture, and is frequently used in preliminary cell line assessments (Moore et al., 2024).
The glucose regressions in Fig. 3 visually demonstrate accurate fits of the profiles across the four cell lines. However, the coefficient of determination is negative and highly variable for glucose (
=-0.24 ± 0.48 and NRMSEglc ~50% for mAb-A in Table 4). Since MCKM applies average feeding amounts of glucose and glutamate across cell lines, while actual experiments involve slight variations in feeding amounts and timing, the statistical fit metric is sensitive to these discrepancies. This suggests that incorporating more precise feeding data or adopting a dynamic feeding model could further improve prediction accuracy for glucose and glutamate.
In all four cell line regressions in Fig. 3, µest predicted by MCKM aligns well with the computed µest from the experimental data during the exponential phase, but deviates in the stationary and death phases. This is because µest is computed assuming exponential changes in VCC at all time points (Eq. 19). During the stationary and death phases, cells no longer follow strict exponential kinetics, leading to an inaccurate µest in the stationary and death phase.
Modelling of the lactate switch
MCKM takes a mechanistic approach to modelling the lactate switch by introducing a minimum glucose threshold below which cells transition to lactate consumption. This successfully captures the characteristic parabolic lactate profiles of cell lines that have the lactate switch, see Fig. 3c, while also accommodates cell lines that lack the switch and solely produce lactate, see Fig. 3b. Figure 3d shows a cell line that activates the lactate switch twice, a fluctuation MCKM only partially captures. Table 4 shows a high average NRMSElac of 33.5–44.4% across the four mAb campaigns. Nevertheless, despite its simplified formulation of the lactate switch, MCKM provides sufficient information to compare kinetic lactate profiles across different cell lines. Its strength lies not in absolute predictive accuracy, but in providing biologically interpretable parameters that allow comparison of lactate metabolism across cell lines—specifically, whether and how frequently cells undergo a lactate switch—while remaining computationally efficient and useful for CLD.
Future refinements in MCKM could link the switch to additional physiological drivers, such as redox balance or pH, to better capture complex fluctuations as the lactate switch is influenced by multiple factors, such as pH, NAD⁺/NADH ratio, mitochondrial function, and substrate availability (Hartley et al., 2018) as previous models have relied on data-driven or dynamic formulations (Villaverde et al., 2016; Zalai et al., 2015)
Modelling of the GS-metabolism
MCKM regressed well upon cell lines that are overall increasing in glutamate and ammonium, as shown in Figs. 3a and b, but does not capture drops in profiles glutamate and ammonium, as observed in Figs. 3c and d. This is also reflected in the low average R2 in Table 4:
=0.355 ± 0.354 and
=0.375 ± 0.586 for mAb-A. This is a logical consequence of the simplification made in the ammonium equation, which does not mathematically link ammonium to glutamate in order to accommodate the wide variety of ammonium profiles across different cell lines. Additionally, MCKM uses an estimate of glutamate feeding, not the actual values that were used in the experiments, thus sensitivity of the R² metric to outliers contributes to these lower values (Chicco et al., 2021).
Notably, these drops in the glutamate and ammonium profiels are linked to the lactate switch as previous analysis found a significant correlation between lactate and glutamate in the death phase (Sietaram et al., 2025). Glucose limitation during this phase drives higher lactate and glutamate consumption, with glutamate potentially influencing lactate production via its role in the TCA cycle and amino acid biosynthesis (Széliová et al., 2020). This indicates an area for further refinement, such as integrating additional metabolic pathways or dynamic feedback mechanisms linking glutamate and ammonium consumption to glucose depletion. MCKM simplifies ammonium expression by not explicitly linking it to glutamate or glutamine, even though in reality this occurs via the GS system. This simplification was necessary because MCKM needed to accommodate a wide variety of cell lines and profiles, as revealed by data analysis the correlation between glutamate and ammonium is not consistent across cell lines (Sietaram et al., 2025).
The regression coefficient for glutamine differs notably between mAb-A and the other mAbs (Table 4). For mAb-A, the average
=0.671 ± 0.501, substantially lower than the values observed for the other three campaigns (
> 0.8, Table 4). The NRMSEamm of mAb is also higher (22.5%) than the other CLD campaigns (13.5 and 11.4% of mAb-B1, B2 Table 4). This suggests greater metabolic variability in glutamine utilization by the CHO cells producing mAb-A. The broad variance of the
(std ± 0.501 for mAb-A) further indicates instability or inconsistency in how glutamine impacts production. In contrast, the higher and more consistent R² values for the other mAbs may reflect a tighter regulation of glutamine metabolism and a closer coupling between glutamine availability and protein synthesis (Goudar et al., 2010; Jeong et al., 2016). NRMSEgln and NRMSEamm in Table 4 are 11–22%, which is a reasonable level of error compared to the literature, given the simplified equation of MCKM for these metabolites.
Model Identifiability
When using MCKM predictions to study kinetics across cell lines in CLD, their biological interpretability is crucial. To assess this, model identifiability is evaluated—determining whether the model parameters can be accurately and uniquely inferred from the input-output data (Sidoli et al., 2003). This involves performing a sensitivity analysis, followed by quantifying the co-linearity between parameters.
Sensitivity analysis
To assess model identifiability, a sensitivity analysis was first performed to evaluate how variables in the kinetic model respond to the 13 kinetic parameters. Figure 4a shows normalized sensitivity coefficients. As can be seen, MCKM process variables exhibit sensitivity to all 13 kinetic parameters. Biomass is naturally sensitive to all the parameters that occur in the growth rate equation (µmax, Kglc, mglc) and death rate equation (kD and KDamm). The kinetic model characterizes cell growth, with glucose serving as the sole growth-limiting substrate and ammonium as the sole growth inhibitor, resulting in a normalized sensitivity coefficient of Xv to YX/glc of 0.92 and YX/amm of 0.72, see Fig. 4a.
Fig. 4.
Parameter identifiability study: (a) normalized sensitivity coefficients, (b) collinearity index. M stands for million. The plot (a) shows normalized standard sensitivity coefficients, where positive values indicate that increasing the parameter increases the model output, and negative values indicate an inverse effect. Values near zero suggest low sensitivity. The collinearity index in (b) quantifies the degree of parameter correlation, with higher values indicating potential identifiability issues due to parameter interdependencies.
The substrates glucose and glutamate in Fig. 4a both show sensitivity to growth and death rate kinetic parameters (µmax, Kglc, mglc, kD and KDamm), which is expected, since substrate availability directly affects cell proliferation and death. Notably, the growth model is highly sensitive to KDamm (normalized sensitivity of 0.96, Fig. 4a). Lactate is also significantly sensitive to µmax (normalized sensitivity of 0.59) and kD (normalized sensitivity of 0.36) parameters, see Fig. 4a, highlighting that lactate production and consumption are closely tied to the cells’ metabolic state. While the model captures lactate as a consequence of growth and death, the causal relationship may be bidirectional: lactate could also influence or induce cell death, a potential interaction not currently included in MCKM to maintain generality across diverse cell line lactate profiles. Additionally, lactate shows sensitivity to YX/amm (normalized sensitivity of 0.43, Fig. 4a), reflecting MCKM's mathematical link between ammonium and death rate, and the biological association between lactate accumulation and cell death.
Parameter identifiability
To assess if all parameters are uniquely identifiable, the co-linearity index was computed from the normalized sensitivity index (through Eq. 28–30), of which the computed values are shown in Fig. 4b. A co-linearity index of CIk > 20 is considered co-linear where roughly 95% of the variation in the model output by changing one parameter can be compensated by the other parameter (Gábor et al., 2017). The co-linearity index between µmax and Kglc is CIk = 14.33, implying a slight co-linearity but still below the threshold. This co-linearity arises because both parameters influence the rate of growth and are often interdependent in their effects on cell growth kinetics. Conceptually, Kglc describes the cell’s affinity for glucose, which naturally impacts the maximum rate at which the cells can grow. The kinetic model has the automated parameter balacning for regression runs that fixies either µmax or Kglc based on µmax,IG or Kglc,IG, respectively, in which the co-linearity is directly overcome.
mglc and KDamm also demonstrate some co-linearity, with the growth parameters µmax or Kglc and kD, although the CIk values remain below the identifiability threshold. This mild co-dependence reflects the underlying biology, as both basal glucose consumption and ammonium-induced cell death are inherently linked to growth dynamics.
A high degree of collinearity is observed between Ylac/glc and YX/lac (CIK = 5.8×106, see Fig. 4b). The structure of the ODE for lactate (equation) naturally leads to collinearity between Ylac/glc and YX/lacas both parameters govern the same underlying biological process. How much of the cell's glucose is consumed for lactate (Ylac/glc) naturally impacts the yield of biomass upon lactate consumption; in other words, YX/lac is biologically a function of Ylac/glc. It is common for kinetic models of biochemical systems to be practically unidentifiable, as there is a complex interplay between biological processes that these parameters represent, making it difficult to decouple their effects (Gábor et al., 2017). While Ylac/glc and YX/lac cannot be uniquely determined, MCKM reliably captures lactate profiles, such as if and when the cells switch from lactate consumption to production, allowing comparison of metabolic switching and lactate utilization of patterns between cell lines.
The same principle applies to Ygln/glu and YX/gln, which are co-linear (CIk = inf, Fig. 4b). While fixing one of the co-linear parameters is necessary to achieve unique parameter estimates, caution is needed when interpreting the absolute parameter values, as they may not fully represent the true metabolism. However, relative differences in glutamine consumption and conversion between cell lines can still be assessed, providing useful information for evaluating cell line metabolic behavior.
Parameter Estimates of Different Cell Lines
The regressed kinetic parameters were averaged across all cell lines producing of the mAb CLD campaign, with mean values and standard deviations reported in Table 5. The standard deviation reflects variability amongst the cell lines of he same mAb campaign. For all four mAbs, the maximum growth rate µmax ≈ 0.02 h−1 (Table 5) is on the lower end of the literature values for recombinant CHO: µmax ≈ 0.029–0.090 h−1 (Aehle et al., 2010; Kotidis et al., 2019a; López-Meza et al., 2016; Shakibaie et al., 2011; Xing et al., 2010). The low standard deviation in µmax (±0.00831 for mAb-A amongst cell lines, Table 5) suggests this is an intrinsic property of the CHO host rather than a modelling artefact. From the computations of estimates of the growth rates, µest (Eq. 19), the highest observed value was 0.0238 h−1, which aligns well with the regressed µmax ≈ 0.0258 ± 0.00831 h−1, supporting the plausibility of the accurate computation of µmax through MCKM.
Table 5.
Mean Estimated Kinetic Parameters for All Cell Lines of the Four CLD Campaigns mAb-A, B1, B2, and C Ambr-1
| mAb-A | mAb-B1 | mAb-B2 | mAb-C | ||||||
|---|---|---|---|---|---|---|---|---|---|
| θ | Unit | Mean* | Std** | Mean* | Std | Mean* | Std | Mean* | Std |
| μmax | h−1 | 0.0258 | ± 0.00 831 | 0.0221 | ± 0.0174 | 0.0237 | ± 0.00 390 | 0.0237 | ± 0.00912 |
| kd | h−1 | 0.0180 | ± 0.0194 | 0.0166 | ± 0.0146 | 0.0176 | ± 0.0170 | 0.0185 | ± 0.0163 |
| Kglc | g L−1 | 1.05 | ± 2.60 | 0.788 | ± 3.263 | 0.501 | ± 1.35 | 0.641 | ± 2.67 |
| Y X/glc | 109 cells g−1 | 2.03 | ± 1.29 | 1.52 | ± 2.028 | 2.46 | ± 2.13 | 2.45 | ± 1.97 |
| Y X/gln | 109cells mmol−1 | 43.3 | ± 21.9 | 37.68 | ± 24.71 | 25.82 | ± 22.2 | 30.03 | ± 27.3 |
| Y X/glu | 109cells mmol−1 | 8.94 | ± 4.19 | 7.680 | ± 4.774 | 7.95 | ± 3.65 | 8.828 | ± 5.49 |
| Y X/amm | 109cells mmol−1 | 11.0 | ± 4.71 | 8.679 | ± 5.232 | 9.51 | ± 4.54 | 9.88 | ± 5.62 |
| Y lac/glc | glac/gglc | 0.286 | ± 0.158 | 0.2684 | ± 0.1610 | 0.225 | ± 0.151 | 0.168 | ± 0.135 |
| Y gln/glu | molgln/molglu | 0.387 | ± 0.165 | 0.5227 | ± 0.2623 | 0.482 | ± 0.265 | 0.715 | ± 0.260 |
| Y X/lac | (109cells)/glac | 3.16 | ± 5.37 | 3.295 | ± 4.646 | 4.37 | ± 4.32 | 4.07 | ± 5.09 |
| Y P/X | gP/(109cells) | 0.0481 | ± 0.0258 | 0.0898 | ± 0.0495 | 0.0561 | ± 0.0356 | 0.0892 | ± 0.0587 |
| mglc |
|
1.51*10−6 | 2.07*10−3 | 1.60*10−4 | ± 2.92*10−3 | 1.71*10−5 | ± 1.48*10−3 | 8.80*10−4 | ± 3.07*10−3 |
| KDamm | mM | 6.47 | ± 3.14 | 6.4640 | ± 3.3144 | 4.17 | ± 3.30 | 6.29 | ± 3.42 |
Values from different CLD campaigns are reported separately, as differences in mAb or platform can substantially influence cell line kinetics. *These values are the average over the ~20–50 cell lines of the same mAb campaign in Ambr-1. **Std reflects the variation amongst cell lines.
MCKM finds average death rates of kd = 0.0116–0.0185 h−1 (Table 5) which match literature reports of 0.016 h−1 (Xing et al., 2010). Similarly, the biomass yield on glucose YX/glc (≈2 × 109 cells/gglc) and the glucose saturation constant Kglc (≈1.05 g L−1 for mAb-A) fall within the reported ranges in the literature (Kiparissides et al., 2015; López-Meza et al., 2016). However, Kglc shows a substantial variability between cell lines (standard deviation of ± 2.60 g L−1 for mAb-A). Considering the unstable CHO karyotype as a result of common chromosomal aberrations, random gene expression fluctuations, genomic rearrangements, gene copy number loss variance in cell culture conditions and cell line adaptation, a high variance in glucose metabolism amongst cell lines is plausible (Dahodwala & Lee, 2019; Lai et al., 2013).
Glutamine-related parameters,
and
, exhibit larger variability (e.g.
= 43.3 ± 21.9 × 109 cells mM−1 for mAb-A) primarily due to co-linearity in the model and the nature of GS-CHO cultures. From the currently measured experimental data, the kinetic model has no exact way of deriving how much glutamine is consumed as there is always a net increase of glutamine from the GS-conversion and glutamine is not a limiting nutrient, which is common in GS-CHO culture (Xu et al., 2014). The large variability in
(8.94± 4.19 × 109 cells mM−1 for mAb-A) may reflect the large differences in GS-expression amongst cell lines as a result of random site of integration of the recombinant gene as well as gene copy number variation (Kuo et al., 2018; Würtele et al., 2003).
The lactate yields Ylac/glc and YX/lac were found to be co-linear in the section ‘Parameter identifiability’, yet Ylac/glc remains relatively stable (0.286 ± 0.158 glac/gglc for mAb-A, see Table 5), indicating consistent glucose partitioning to lactate across cell lines of the same mAb-campaign. This likely reflects the controlled microenvironment in Ambr15™ systems, which provide good control over dissolved oxygen levels, which can stabilize the reliance on anaerobic pathways (Kelly et al., 2018; Martínez et al., 2013; Xing et al., 2010).
Finally, the ammonium death rate constant KDamm is consistent across CLD the mAb-A, B1, and C campaigns and cell lines around ~6 ± 3 mM (Table 5). This is slightly higher than literature values (1.44–4.5 mM; Xing et al., 2010), likely due to model simplifications where cell death depends solely on ammonium concentration, omitting other potential stressors such as nutrient depletion or hypoxia (Karra et al., 2010). The results obtained with the MCKM model demonstrate how kinetic parameters provide insight into comparison of CLD campaigns, highlighting an altered ammonium metabolism for mAb-B1 (~4.2 ± 3 mM compared to 6 mM for the other three campaigns).
The mean mglc regressed by the kinetic model (1.51*10−6± 2.07*10−3
for mAb-A, Table 5) varies significantly amongst cell lines and is lower than values observed in the literature of 0.036
(Xing et al., 2010) as a result of the parameter co-linearity (as was shown in the section ‘Parameter identifiability’). Thus, this value cannot be accurately deduced by the MCKM alone and would require complementary experimental measurements of basal glucose uptake to be reliably estimated.
Case study: Using MCKM to Identify Key Metabolic Patterns in CLD
This case study demonstrates one example of how MCKM can be used to study metabolic differences amongst cell lines during CLD, namely to understand the biological mechanisms that underpin cell line stability, something that is currently poorly understood in the literature (Yusufi et al., 2017). Beyond this, MCKM could be leveraged in several other ways: for biomarker discovery to link metabolic parameters to productivity or stress response, for media and feeding strategy optimization, or in combination with machine learning approaches to predict high-performing clones or anticipate metabolic bottlenecks. In this study, we demonstrate a LDA applied to the regressed kinetic parameters, distinguishing well- versus poorly performing and stable versus unstable cell lines, with the LDA coefficients identifying the key kinetic parameters driving these differences. This is done for two of the CLD campaigns, mAb-A and mAb-B1.
The LDA results comparing well- versus poorly performing cell lines producing mAb-A and mAb-B1 are shown in Fig. 5a, c, respectively. Cells that perform ‘well’ were labelled as such, based on scoring top 50% in titre, and having high peak productivity. As expected, YP/X provides the strongest discriminatory power between ‘well-’ and ‘poor’ performing cells, reflecting its direct link to productivity and titer. Interestingly, YX/glu shows comparable discriminative power as µmax and kd for distinguishing ‘well’ performing cells, indicating that cell growth is as critical as the efficiency of glutamate conversion via the GS system for cell line performance. The LDA coefficients for YX/glc differ remarably between mAb-A and mAb-B1 (≈ −0.7 vērsus ≈ 7 in Fig. 5a, c, respectively), suggesting that glucose metabolism may be influenced by the type of mAb that is being produced. When producing different antibodies, cells may experience changes in metabolic demands, protein synthesis rates, and cellular stress responses, all of which can influence glucose metabolism (Fan et al., 2015).
Fig. 5.
LDA upon the MCKM's regressed kinetic parameters to study the biological differences amongst ‘well performing’ versus ‘poorly performing’ or ‘stable’ versus ‘unstable’ cell lines during CLD. Plotted are the LDA coefficients of the kinetic parameters that distinguish the classes (a) well performing versus poorly performing cell lines of mAb-A campaign (classification accuracy 90%), (b) stable versus unstable cells of the mAb-A campaign (classification accuracy 40%), (c) well performing versus poorly performing cell lines of the mAb-B1 campaign (classification accuracy 75%), (d) stable versus unstable cells of the mAb-B1 campaign (classification accuracy 60%).
When comparing ‘stable’ (i.e. stable productivity over Ambr-1, 2, 3, 4) versus ‘unstable cell’ lines, and the discrimanitory power the kinetic parameters have towards this (Fig. 5b, d for mAb-A and mAb-B1, respectively), µmax, kD, Kglc and YX/glc provide the greatest discriminatory power, highlighting the importance of growth and glucose metabolism in maintaining cell line stability. Efficient conversion of glucose into biomass rather than by-products appears critical for stable performance (Vergara et al., 2018).
The LDA coefficient of YX/amm for distinguishing stable versus unstable cell lines, differs between the mAbs: strongly positive for mAb-A, see Fig. 5b, but slightly negative for mAb-B1, Fig. 5d. This suggests that higher ammonium utilization may support stability in mAb-A, whereas lower ammonium utilization may be preferable for mAb-B1. Such differences reflect the known variability in how CHO cells handle metabolic by-products depending on the recombinant protein produced (Noh et al., 2017).
Conclusions
In this paper, we present the MCKM, a novel generalized kinetic model capable of dynamically simulating a wide range of CHO clonal cell lines and mAb targets. The key innovation of MCKM is its ability to characterize the metabolic kinetics of individual clones derived from the same CHO host, computing a distinct profile of 13 kinetic parameters from a single cell culture run containing only 49 datapoints. This contrasts with most kinetic models in the literature, which typically focus on population-averaged behaviour, require multiple replicate cultures per regression, or aim primarily at production-level fitting rather than capturing clonal heterogeneity. MCKM enables the characterization of individual clones during upstream CLD, providing substantial knowledge gain for cell line selection and for comparing kinetic behaviour across different clones during a CLD campaign, as well as comparing different mAb targets and CHO hosts across campaigns. The model was developed and validated on four historical CLD campaigns, comprising 656 unique fed-batch cell culture regressions (15 mL minibioractor Ambr15™) from 157 unique clones producing three distinct mAbs.
Across 656 regressions, MCKM achieved high accuracy with the average R2 of viable biomass and mAb titre
0.96 ± 0.07 and
0.97 ± 0.05 across cell line cultures, respectively, with 99% of runs exceeding R² > 0.90. This robustness is remarkable given that the model regresses 13 kinetic parameters from a single cell culture run of only 49 datapoints, whereas most approaches reported in the literature rely on accumulating multiple replicates. The NRMSE for biomass (6–7%) and mAb product (4–9%) even outperformed hybrid and transfer models (typically 10–20%).
To achieve such broad applicability across hundreds of clones and three mAb targets, MCKM required unique implementations. One is an automated parameter balancing procedure: if regressions fail due to dimensionality, they are re-run with one growth kinetic parameter fixed from an initial guess, ensuring all 656 runs converge to a biologically interpretable solution. MCKM is novel in its implementation of the lactate switch, implemented as a glucose-limitation function, which successfully captured diverse behaviours—from clones that switch once or twice, to those that continuously produce lactate. Although lactate predictions showed higher error (~50%), the strength of the model lies not in the absolute predictive accuracy, but in providing biologically interpretable parameters that enable cross-clone comparison of lactate metabolism, particularly the frequency and timing of lactate switches. MCKM could be further refined by linking to physiological drivers such as pH, redox balance, or mitochondrial function.
The GS-system was simplified by modelling glutamate and ammonium as functions of growth, allowing MCKM to cover a wide variety of clone-specific profiles. While this abstraction limits accuracy during the death phase, the model still achieved ~10% NRMSE for ammonium and glutamine in two of the four CLD campaigns. Lower R² for glucose and glutamate were largely due to estimated feeding times rather than model structure.
A parameter identifiability analysis revealed expected co-linearity among growth kinetics, reflecting underlying biology. Some yields (e.g. glutamine/glutamate, lactate/glucose) were not uniquely identifiable, suggesting that further experimental studies would be required to refine their biological interpretation. Importantly, even with non-unique values, MCKM provides meaningful discrimination when comparing lactate, glutamine, and glutamate profiles across cell lines.
A case study application of MCKM was shown demonstrating its ability to capture kinetic differences between well-performing, stable clones and poorly performing, unstable clones. MCKM could serve as a standard tool in CLD to characterize cell line metabolism, providing additional data for lead clone selection. Beyond this case study, the applications of MCKM are broad. It could predict which clones are likely to remain stable and productive over long culture periods (Dean & Reddy, 2013; Zagari et al., 2013), identify biomarkers or critical growth rates linked to productivity, and guide feeding strategy optimization. Moreover, the model could support metabolic engineering by highlighting genetic or metabolic engineering targets, as well as providing a tool for simulating how clones behave under different conditions in bioreactor conditions.
Acknowledgements
This study was co-funded by BBSRC and GlaxoSmithKline via the CASE Award programme. Furthermore, an acknowledgement goes to Ruth Rowland-Jones and Paloma Diaz-Fernandez within GSK for collecting and sharing data used in this study.
Contributor Information
Devi Sietaram, Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge CB3 0AS, UK.
Pavlos Kotidis, BioPharm Process Research, GSK, Stevenage SG1 2NY, UK.
Gary Finka, BioPharm Process Research, GSK, Stevenage SG1 2NY, UK.
Alexei A Lapkin, Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge CB3 0AS, UK.
Author contributions
D.S.: study design, performing the study, drafting, and editing the manuscript. A.A.L.: study conceptualization, funding acquisition, project supervision, manuscript editing. P.K.: project supervision, manuscript editing. GF: funding acquisition, manuscript editing. R.R.J.: project supervision.
Funding
This work was supported by Biotechnology and Biological Sciences Research Council and GlaxoSmithKline via the Collaborative Awards in Science and Engineering programme.
Conflict of interest
The authors acknowledge funding of this study by GSK via the BBSRC CASE award.
Data Availability
The data underlying this article were provided by GlaxoSmithKline under a confidentiality agreement and are available only to the authorized parties in accordance with that agreement.
References
- Aehle M., Simutis R., Lübbert A. (2010). Comparison of viable cell concentration estimation methods for a mammalian cell cultivation process. Cytotechnology, 62(5), 413–422. 10.1007/s10616-010-9291-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bayer B., Duerkop M., Striedner G., Sissolak B. (2021). Model transferability and reduced experimental burden in cell culture process development facilitated by hybrid modeling and intensified design of experiments. Frontiers in Bioengineering and Biotechnology, 9, 740215. 10.3389/fbioe.2021.740215 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brun R., Kühni M., Siegrist H., Gujer W., Reichert P. (2002). Practical identifiability of ASM2d parameters—Systematic selection and tuning of parameter subsets. Water Research, 36(16), 4113–4127. 10.1016/S0043-1354(02)00104-5 [DOI] [PubMed] [Google Scholar]
- Carinhas N., Duarte T. M., Barreiro L. C., Carrondo M. J. T., Alves P. M., Teixeira A. P. (2013). Metabolic signatures of GS-CHO cell clones associated with butyrate treatment and culture phase transition: Metabolic signatures of GS-CHO cell. Biotechnology and Bioengineering, 110(12), 3244–3257. 10.1002/bit.24983 [DOI] [PubMed] [Google Scholar]
- Chicco D., Warrens M. J., Jurman G. (2021). The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Computer Science, 7, e623. 10.7717/peerj-cs.623 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chotteau V., Hagrot E., Zhang L., Mäkinen M. E. (2022). Mathematical modelling of cell culture processes. In: Cell Culture Engineering and Technology(pp. 431–466.). Springer, Cham. 10.1007/978-3-030-79871-0_14 [DOI] [Google Scholar]
- Costa R. S., Hartmann A., Vinga S. (2016). Kinetic modeling of cell metabolism for microbial production. Journal of Biotechnology, 219, 126–141. 10.1016/j.jbiotec.2015.12.023 [DOI] [PubMed] [Google Scholar]
- Dahodwala H., Lee K. H. (2019). The fickle CHO: A review of the causes, implications, and potential alleviation of the CHO cell line instability problem. Current Opinion in Biotechnology, 60, 128–137. 10.1016/j.copbio.2019.01.011 [DOI] [PubMed] [Google Scholar]
- Dean J., Reddy P. (2013). Metabolic analysis of antibody producing CHO cells in fed-batch production. Biotechnology and Bioengineering, 110(6), 1735–1747. 10.1002/bit.24826 [DOI] [PubMed] [Google Scholar]
- DeMaria C. T., Cairns V., Schwarz C., Zhang J., Guerin M., Zuena E., Estes S., Karey K. P. (2007). Accelerated clone selection for recombinant CHO cells using a FACS-based high-throughput screen. Biotechnology Progress, 23(2), 465–472. 10.1021/bp060298i [DOI] [PubMed] [Google Scholar]
- Du Z., Mujacic M., Le K., Caspary G., Nunn H., Heath C., Reddy P. (2013). Analysis of heterogeneity and instability of stable mAb-expressing CHO cells. Biotechnology and Bioprocess Engineering, 18(2), 419–429. [Google Scholar]
- Fan Y., Jimenez Del Val I., Müller C., Wagtberg Sen J., Rasmussen S. K., Kontoravdi C., Weilguny D., Andersen M. R. (2015). Amino acid and glucose metabolism in fed-batch CHO cell culture affects antibody production and glycosylation. Biotechnology and Bioengineering, 112(3), 521–535. 10.1002/bit.25450 [DOI] [PubMed] [Google Scholar]
- Fisher R. A. (1936). The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7(2), 179–188. [Google Scholar]
- Gábor A., Villaverde A. F., Banga J. R. (2017). Parameter identifiability analysis and visualization in large-scale kinetic models of biosystems. BMC Systems Biology, 11(1), 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gašperšič J., Kastelic M., Novak U., Likozar B. (2018). Metabolic network modelling of Chinese hamster ovary (CHO) culture bioreactors operated as microbial cell factories. Acta Chimica Slovenica, 65(4), 769–786. 10.17344/acsi.2018.4591 [DOI] [PubMed] [Google Scholar]
- Goudar C. T., Joeris K., Konstantinov K. B., Piret J. M. (2008). Logistic equations effectively model mammalian cell batch and fed-batch kinetics by logically constraining the fit. Biotechnology Progress, 21(4), 1109–1118. 10.1021/bp050018j [DOI] [PubMed] [Google Scholar]
- Goudar C., Biener R., Boisart C., Heidemann R., Piret J., de Graaf A., Konstantinov K. (2010). Metabolic flux analysis of CHO cells in perfusion culture by metabolite balancing and 2D [13C, 1H] COSY NMR spectroscopy. Metabolic Engineering, 12(2), 138–149. [DOI] [PubMed] [Google Scholar]
- Hartley F., Walker T., Chung V., Morten K. (2018). Mechanisms driving the lactate switch in Chinese hamster ovary cells. Biotechnology and Bioengineering, 115(8), 1890–1903. [DOI] [PubMed] [Google Scholar]
- Hernández Rodríguez T., Posch C., Schmutzhard J., Stettner J., Weihs C., Pörtner R., Frahm B. (2019). Predicting industrial-scale cell culture seed trains–A Bayesian framework for model fitting and parameter estimation, dealing with uncertainty in measurements and model parameters, applied to a nonlinear kinetic cell culture model, using an MCMC method. Biotechnology and Bioengineering, 116(11), 2944–2959. 10.1002/bit.27125 [DOI] [PubMed] [Google Scholar]
- Jang J. D., Barford J. P. (2000). An unstructured kinetic model of macromolecular metabolism in batch and fed-batch cultures of hybridoma cells producing monoclonal antibody. Biochemical Engineering Journal, 4(2), 153–168. [Google Scholar]
- Jeong D. H., Yoo S. J., Kim J. H., Lee J. M. (2016). Computationally efficient dynamic simulation of cellular kinetics via explicit solution of flux balance analysis: XDFBA modelling and its biochemical process applications. Chemical Engineering Research and Design, 113, 85–95. [Google Scholar]
- Karra S., Sager B., Karim M. N. (2010). Multi-scale modeling of heterogeneities in mammalian cell culture processes. Industrial & Engineering Chemistry Research, 49(17), 7990–8006. 10.1021/ie100125a [DOI] [Google Scholar]
- Kawabe Y., Makitsubo H., Kameyama Y., Huang S., Ito A., Kamihira M. (2012). Repeated integration of antibody genes into a pre-selected chromosomal locus of CHO cells using an accumulative site-specific gene integration system. Cytotechnology, 64(3), 267–279. 10.1007/s10616-011-9397-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kelly W., Veigne S., Li X., Subramanian S. S., Huang Z., Schaefer E. (2018). Optimizing performance of semi-continuous cell culture in an ambr15TM microbioreactor using dynamic flux balance modeling. Biotechnology Progress, 34(2), 420–431. 10.1002/btpr.2585 [DOI] [PubMed] [Google Scholar]
- Kildegaard H. F., Baycin-Hizal D., Lewis N. E., Betenbaugh M. J. (2013). The emerging CHO systems biology era: Harnessing the ‘omics revolution for biotechnology. Current Opinion in Biotechnology, 24(6), 1102–1107. 10.1016/j.copbio.2013.02.007 [DOI] [PubMed] [Google Scholar]
- Kim J. Y., Kim Y.-G., Lee G. M. (2012). CHO cells in biotechnology for production of recombinant proteins: Current state and further potential. Applied Microbiology and Biotechnology, 93(3), 917–930. 10.1007/s00253-011-3758-5 [DOI] [PubMed] [Google Scholar]
- Kim M., O'Callaghan P. M., Droms K. A., James D. C. (2011). A mechanistic understanding of production instability in CHO cell lines expressing recombinant monoclonal antibodies. Biotechnology and Bioengineering, 108(10), 2434–2446. 10.1002/bit.23189 [DOI] [PubMed] [Google Scholar]
- Kiparissides A., Pistikopoulos E. N., Mantalaris A. (2015). On the model-based optimization of secreting mammalian cell (GS-NS0) cultures: Optimization of secreting mammalian cell cultures. Biotechnology and Bioengineering, 112(3), 536–548. 10.1002/bit.25457 [DOI] [PubMed] [Google Scholar]
- Kornecki M., Strube J. (2019). Accelerating biologics manufacturing by upstream process modelling. Processes, 7(3), 166. 10.3390/pr7030166 [DOI] [Google Scholar]
- Kotidis P., Jedrzejewski P., Sou S. N., Sellick C., Polizzi K., del Val I. J., Kontoravdi C. ((2019).). Model-based optimization of antibody galactosylation in CHO cell culture. Biotechnology and Bioengineering, 116(7), 1612–1626. 10.1002/bit.26960 [DOI] [PubMed] [Google Scholar]
- Kuo C.-C., Chiang A. W., Shamie I., Samoudi M., Gutierrez J. M., Lewis N. E. (2018). The emerging role of systems biology for engineering protein production in CHO cells. Current Opinion in Biotechnology, 51, 64–69. 10.1016/j.copbio.2017.11.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kyriakopoulos S., Ang K. S., Lakshmanan M., Huang Z., Yoon S., Gunawan R., Lee D.-Y. (2018). Kinetic modeling of mammalian cell culture bioprocessing: The quest to advance biomanufacturing. Biotechnology Journal, 13(3), 1700229. 10.1002/biot.201700229 [DOI] [PubMed] [Google Scholar]
- Lai T., Yang Y., Ng S. (2013). Advances in mammalian cell line development technologies for recombinant protein production. Pharmaceuticals, 6(5), 579–603. 10.3390/ph6050579 [DOI] [PMC free article] [PubMed] [Google Scholar]
- López-Meza J., Araíz-Hernández D., Carrillo-Cocom L. M., López-Pacheco F., Rocha-Pizaña M., del R., Alvarez M. M. (2016). Using simple models to describe the kinetics of growth, glucose consumption, and monoclonal antibody formation in naive and infliximab producer CHO cells. Cytotechnology, 68(4), 1287–1300. 10.1007/s10616-015-9889-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lubitz T., Schulz M., Klipp E., Liebermeister W. (2010). Parameter balancing in kinetic models of cell metabolism. The Journal of Physical Chemistry B, 114(49), 16298–16303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luedeking R., Piret E. L. (1959). A kinetic study of the lactic acid fermentation. Batch process at controlled pH. Journal of Biochemical and Microbiological Technology and Engineering, 1(4), 393–412. [DOI] [PubMed] [Google Scholar]
- Acosta ML, Sanchez A., Garcia F., Contreras A., Molina E, (2007). Analysis of kinetic, stoichiometry and regulation of glucose and glutamine metabolism in hybridoma batch cultures using logistic equations. Cytotechnology, 54(3), 189–200. 10.1007/s10616-007-9089-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martínez V. S., Dietmair S., Quek L., Hodson M. P., Gray P., Nielsen L. K. (2013). Flux balance analysis of CHO cells before and after a metabolic switch from lactate production to consumption. Biotechnology and Bioengineering, 110(2), 660–666. [DOI] [PubMed] [Google Scholar]
- Michaelis L., Menten M. L. (1913). Die kinetik der invertinwirkung. Biochem. z, 49(333-369), 352. [Google Scholar]
- Miró A., Pozo C., Guillén-Gosálbez G., Egea J. A., Jiménez L. (2012). Deterministic global optimization algorithm based on outer approximation for the parameter estimation of nonlinear dynamic biological systems. BMC Bioinformatics [Electronic Resource], 13(1), 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Monod J. (1949). The growth of bacterial cultures. Annual Review of Microbiology, 3(1), 371–394. 10.1146/annurev.mi.03.100149.002103 [DOI] [Google Scholar]
- Moore B., Georgakis C., Antoniou C., Khattak S. (2024). A two-phase approach optimizing productivity for a mAb-producing CHO cell culture process using dynamic response surface methodology models. Biochemical Engineering Journal, 201, 109137. 10.1016/j.bej.2023.109137 [DOI] [Google Scholar]
- Noh S. M., Park J. H., Lim M. S., Kim J. W., Lee G. M. (2017). Reduction of ammonia and lactate through the coupling of glutamine synthetase selection and downregulation of lactate dehydrogenase-A in CHO cells. Applied Microbiology and Biotechnology, 101(3), 1035–1045. [DOI] [PubMed] [Google Scholar]
- Nolan R. P., Lee K. (2011). Dynamic model of CHO cell metabolism. Metabolic Engineering, 13(1), 108–124. 10.1016/j.ymben.2010.09.003 [DOI] [PubMed] [Google Scholar]
- Pal S., Krusienski D. J., Jenkins W. K. (2005). Structured stochastic optimization strategies for problems with ill-conditioned error surfaces. In 2005 IEEE International Symposium on Circuits and Systems(pp. 2291–2294.). 10.1109/ISCAS.2005.1465081 [DOI] [Google Scholar]
- Patel N. A., Anderson C. R., Terkildsen S. E., Davis R. C., Pack L. D., Bhargava S., Clarke H. R. (2018). Antibody expression stability in CHO clonally derived cell lines and their subclones: Role of methylation in phenotypic and epigenetic heterogeneity. Biotechnology Progress, 34(3), 635–649. [DOI] [PubMed] [Google Scholar]
- Pilbrough W., Munro T. P., Gray P. (2009). Intraclonal protein expression heterogeneity in recombinant CHO cells. PLoS ONE, 4(12), e8432. 10.1371/journal.pone.0008432 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reisinger H., Steinfellner W., Stern B., Katinger H., Kunert R. (2008). The absence of effect of gene copy number and mRNA level on the amount of mAb secretion from mammalian cells. Applied Microbiology and Biotechnology, 81(4), 701–710. [DOI] [PubMed] [Google Scholar]
- Ridgeway G., Madigan D. (2003). A sequential Monte Carlo method for bayesian analysis of massive datasets. Data Mining and Knowledge Discovery, 7(3), 301–319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sawicki J., Łoś M., Smołka M., Schaefer R. (2022). Understanding measure-driven algorithms solving irreversibly ill-conditioned problems. Natural Computing, 21(2), 289–315. 10.1007/s11047-020-09836-w [DOI] [Google Scholar]
- Sha S., Huang Z., Wang Z., Yoon S. (2018). Mechanistic modeling and applications for CHO cell culture development and production. Current Opinion in Chemical Engineering, 22, 54–61. 10.1016/j.coche.2018.08.010 [DOI] [Google Scholar]
- Shakibaie M., Tabandeh F., Zomorodipour A. R., Mohammad-Beigi H., Ebrahimi S., Habib-Ghomi H. (2011). Kinetics, experimental and simulation studies of Chinese hamster ovary cell growth in a packed-bed bioreactor. World Applied Sciences Journal, 15(11), 1568–1575. [Google Scholar]
- Sidoli F. R., Mantalaris A., Asprey S. P. (2003). Parametric identifiability of a structured kinetic model for mammalian cell cultures. IFAC Proceedings Volumes, 36(15), 521–526. 10.1016/S1474-6670(17)33558-9 [DOI] [Google Scholar]
- Sietaram D., Kotidis P., Rowland-Jones R., Finka G., Lapkin A. (2025). Multivariate Analysis of CHO Cell Line Development Data to Identify Cell Line Selection Criteria. Chemrxiv. 10.26434/chemrxiv-2025-dvjpd [DOI] [Google Scholar]
- Széliová D., Ruckerbauer D. E., Galleguillos S. N., Petersen L. B., Natter K., Hanscho M., Troyer C., Causon T., Schoeny H., Christensen H. B., Lee D.-Y., Lewis N. E., Koellensperger G., Hann S., Nielsen L. K., Borth N., Zanghellini J. (2020). What CHO is made of: Variations in the biomass composition of Chinese hamster ovary cell lines. Metabolic Engineering, 61, 288–300. 10.1016/j.ymben.2020.06.002 [DOI] [PubMed] [Google Scholar]
- Tang P., Xu J., Louey A., Tan Z., Yongky A., Liang S., Li Z. J., Weng Y., Liu S. (2020). Kinetic modeling of Chinese hamster ovary cell culture: Factors and principles. Critical Reviews in Biotechnology, 40(2), 265–281. 10.1080/07388551.2019.1711015 [DOI] [PubMed] [Google Scholar]
- Vergara M., Torres M., Müller A., Avello V., Acevedo C., Berrios J., Reyes J.G., Valdez-Cruz N.A., Altamirano C. (2018) High glucose and low specific cell growth but not mild hypothermia improve specific r-protein productivity in chemostat culture of CHO cells. PLOS ONE. 13(8):e0202098. 10.1371/journal.pone.0202098 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Villaverde A. F., Bongard S., Mauch K., Balsa-Canto E., Banga J. R. (2016). Metabolic engineering with multi-objective optimization of kinetic models. Journal of Biotechnology, 222, 1–8. [DOI] [PubMed] [Google Scholar]
- Villaverde A. F., Fröhlich F., Weindl D., Hasenauer J., Banga J. R. (2019). Benchmarking optimization methods for parameter estimation in large kinetic models. Bioinformatics, 35(5), 830–838. 10.1093/bioinformatics/bty736 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walsh G., Walsh E. (2022). Biopharmaceutical benchmarks 2022. Nature Biotechnology, 40(12), 1722–1760. 10.1038/s41587-022-01582-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Warr S. R. C. (2020). Ambr® 15 microbioreactors for CHO cell cultivation. In Animal Cell Biotechnology: Methods and Protocols(pp. 43–67.). Springer US, New York. 10.1007/978-1-0716-0191-4_4 [DOI] [PubMed] [Google Scholar]
- Würtele H., Little K., Chartrand P. (2003). Illegitimate DNA integration in mammalian cells. Gene Therapy, 10(21), 1791–1799. [DOI] [PubMed] [Google Scholar]
- Xing Z., Bishop N., Leister K., Li Z. J. (2010). Modeling kinetics of a large-scale fed-batch CHO cell culture by Markov chain Monte Carlo method. Biotechnology Progress,26(1), 208–219. 10.1002/btpr.284 [DOI] [PubMed] [Google Scholar]
- Xu P., Dai X., Graf E., Martel R., Russell R. (2014). Effects of glutamine and asparagine on recombinant antibody production using CHO-GS cell lines. Biotechnology Progress, 30(6), 1457–1468. 10.1002/btpr.1957 [DOI] [PubMed] [Google Scholar]
- Yang Z., Wang S., Halim A., Schulz M. A., Frodin M., Rahman S. H., Vester-Christensen M. B., Behrens C., Kristensen C., Vakhrushev S. Y., Bennett E. P., Wandall H. H., Clausen H. (2015). Engineered CHO cells for production of diverse, homogeneous glycoproteins. Nature Biotechnology, 33(8), 842–844. 10.1038/nbt.3280 [DOI] [PubMed] [Google Scholar]
- Ye J., Kober V., Tellers M., Naji Z., Salmon P., Markusen J. F. (2009). High-level protein expression in scalable CHO transient transfection. Biotechnology and Bioengineering, 103(3), 542–551. 10.1002/bit.22265 [DOI] [PubMed] [Google Scholar]
- Yusufi F. N. K., Lakshmanan M., Ho Y. S., Loo B. L. W., Ariyaratne P., Yang Y., Ng S. K., Tan T. R. M., Yeo H. C., Lim H. L., Ng S. W., Hiu A. P., Chow C. P., Wan C., Chen S., Teo G., Song G., Chin J. X., Ruan X., Lee D.-Y. (2017). Mammalian Systems Biotechnology reveals global cellular adaptations in a recombinant CHO cell line. Cell Systems, 4(5), 530–542.e6. 10.1016/j.cels.2017.04.009 [DOI] [PubMed] [Google Scholar]
- Zagari F., Jordan M., Stettler M., Broly H., Wurm F. M. (2013). Lactate metabolism shift in CHO cell culture: The role of mitochondrial oxidative activity. New Biotechnology, 30(2), 238–245. [DOI] [PubMed] [Google Scholar]
- Zalai D., Koczka K., Párta L., Wechselberger P., Klein T., Herwig C. (2015). Combining mechanistic and data-driven approaches to gain process knowledge on the control of the metabolic shift to lactate uptake in a fed-batch CHO process. Biotechnology Progress, 31(6), 1657–1668. 10.1002/btpr.2179 [DOI] [PubMed] [Google Scholar]
- Zhang F., Sun X., Yi X., Zhang Y. (2006). Metabolic characteristics of recombinant Chinese hamster ovary cells expressing glutamine synthetase in presence and absence of glutamine. Cytotechnology, 51(1), 21–28. 10.1007/s10616-006-9010-y [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The data underlying this article were provided by GlaxoSmithKline under a confidentiality agreement and are available only to the authorized parties in accordance with that agreement.






































