Skip to main content
ACS AuthorChoice logoLink to ACS AuthorChoice
. 2025 Oct 18;14(11):4424–4438. doi: 10.1021/acssynbio.5c00352

Cell-Free-Based Thermophilic Biocatalyst for the Synthesis of Amino Acids from One-Carbon Feedstocks

Ray Westenberg †,, Shaafique Chowdhury , Ryan Cardiff §, Kimberly Wennerholm , Alexander S Beliaev ⊥,#, James M Carothers §,¶,*, Pamela Peralta-Yahya †,‡,∥,*
PMCID: PMC12645577  PMID: 41108744

Abstract

Bioproduction from one-carbon compounds, such as formate, is an attractive prospect due to reduced energy requirements and the possibility for using CO2 as a sustainable feedstock. Formate-fixing pathways engineered using Escherichia coli lysate-based cell-free expression (CFE) biocatalysts have the potential to route 100% of feedstock carbon toward chemical synthesis but are undermined by siphoning of in-pathway metabolites and cofactors by the CFE background metabolism. To address this limitation, we engineer a CFE-based thermophilic multienzyme biocatalyst for the synthesis of serine and glycine from formate, bicarbonate, and ammonia. After expression of the thermophilic formate-to-serine pathway in a one-pot reaction, the mesophilic E. coli CFE background machinery is removed by simple heat denaturation, eliminating the siphoning of cofactors, in-pathway metabolites, and products. After bioprocess optimization, including pathway gene expression duration and chemical synthesis temperature, we achieve near stoichiometric conversion of formate and bicarbonate to serine and glycine, reaching 97% of stoichiometric yield. The use of a moderately thermophilic biocatalyst allowed chemical synthesis to take place at mesophilic temperatures, enabling the balance of optimal enzyme activity with minimal metabolite/cofactor thermal degradation. In a fed-batch experiment, the biocatalyst shows sustained chemical synthesis rates for 8 h, paving the way toward a continuous bioprocess. Finally, a sensitivity analysis of cofactor usage revealed that the most expensive cofactors, THF and NADPH, can be reduced by 5-fold without significantly lowering product yields. To the best of our knowledge, this is the first instance of expressing a thermophilic pathway in an E. coli lysate-based CFE system to generate a thermophilic biocatalyst for use at mesophilic temperatures. The CFE-based thermophilic formate-to-serine biocatalyst triples the combined serine and glycine yield previously obtained by a CFE-based mesophilic formate-to-serine biocatalyst (30%), and quadruple the yield obtained by a purified enzyme system (22%). Ultimately, this work opens the door to using E. coli lysate-based CFE for thermophilic biocatalyst generation to achieve high chemical synthesis yields.

Keywords: carbon-negative synthesis, chemical bioproduction, cell-free expression, metabolic engineering, thermophilic biocatalyst


graphic file with name sb5c00352_0009.jpg


graphic file with name sb5c00352_0007.jpg

Introduction

Biological upgrading of CO2 or its equivalentssuch as the more biologically accessible formatedirectly into value-added chemicals has the potential to remove CO2 from the environment at a lower energy and cost than starting from other feedstocks. Limitations with natural carbon fixing organisms, including their slow growth and low carbon fixation efficiency, have led to the implementation of natural and synthetic carbon fixation pathways in biotechnology-friendly chassis, such as Escherichia coli , and Saccharomyces cerevisiae, , to leverage their fast growth rate and the extensive body of knowledge about their metabolic pathway optimization. Among carbon fixation pathways, the tetrahydrofolate (THF)-dependent formate fixation/reductive glycine synthesis (THF/rGS) pathway is of particular interest, given its low energy requirements (2 ATP, 3 NAD­(P)­H), thermodynamic favorability (ΔG = −4.9 kJ/mol from formate to glycine), and minimal enzyme requirements (9 enzymes). To date, the THF/rGS pathway has been implemented in Escherichia coli ,− and yeast , to support cellular growth and has enabled 10% yield of lactate from formate.

Recently, the THF/rGS pathway has been implemented in an E. coli lysate-based cell-free expression (CFE) system to generate a 10-enzyme biocatalyst for the carbon-negative synthesis of serine and glycine from formate. Briefly, CFE systems can be generated using either (1) cell lysate, i.e., unpurified cell extract containing transcription/translation machinery as well as enzymes involved in primary metabolism, or (2) purified cell machinery (PURE), i.e., the purified 36 enzymes and ribosomes involved in transcription/translation. The enthusiasm around engineering CFE systems for chemical synthesis is because they are nonliving; consequently, carbon feedstock is not needed for cell growth and maintenance; thus, they are capable of potentially routing 100% of carbon feedstock to chemical synthesis. The PURE system has very low CFE background metabolism; however, it is more time-consuming and expensive to generate ($600–$2000/L), likely rendering it cost-prohibitive for bioindustrial applications. On the other hand, lysate-based CFE systems have a lower production cost ($90/L) and can be rapidly scaled up; however, the significant CFE background metabolism (or CFE background machinery) siphons cofactors and intermediates away from the desired metabolic pathway, reducing the desired product yields. Specifically, the lysate-based CFE-based mesophilic THF/rGS biocatalyst resulted in a 30% yield of serine and glycine, with intermediates and cofactors siphoned by CFE background metabolismin particular NAD­(P)­H-consuming pathways , which was attributed as one of the major process limitations to achieving higher product yields.

Synthetic biology and bioprocess engineering strategies can be implemented to reduce the CFE background metabolism. Synthetic biology strategies include (1) knocking out competing metabolic pathways in the microbes used to generate the CFE cell lysate and (2) adding inhibitors directly to the CFE to block competing pathways. , Knocking out competing pathways in the E. coli used to make the CFE cell lysate resulted in more than a 2-fold increase in protein synthesis yields. Although there is no concrete evidence that this strategy will result in higher chemical synthesis yields in the context of a CFE-based biocatalyst, it is likely to do so given prior work around chemical synthesis in crude cell lysates. For example, knocking out competing pathways in the E. coli used to make the crude cell lysate led to an 85% reduction in the use of ATP for the synthesis of dihydroxyacetone phosphate. Direct addition of inhibitors to the CFE system to block the TCA cycle has resulted in a 20% increase in malate synthesis. Looking ahead, the use of pathway enzymes engineered to use artificial cofactors is also likely to improve product yields. Bioprocess engineering strategies shown to reduce the effect of CFE background metabolism include (1) tagging key genes for affinity purification removal before generating the CFE , and (2) diluting the CFE to minimize background metabolism. , In crude cell lysates, tagging and removing key enzymes has led to a 4-fold improvement in ethanol synthesis and 40-fold improvement in pyruvate synthesis, suggesting it could improve yields for a CFE-based biocatalyst. Nevertheless, this strategy is laborious, requiring tagging of the lysate-source genome, followed by bead-based purification. Although CFE dilution reduces background metabolism, at 10× dilution, siphoning of NADH by the background CFE metabolism was still significant.

A common strategy to rapidly and inexpensively purify enzymes is the expression of thermophilic genes in E. coli to generate enzymes with a temperature optimum between 50° and 90 °C, followed by cell lysis and heat denaturation to remove the native E. coli proteins that precipitate between 45° and 50 °C. This heat denaturation strategy has been used to produce thermophilic multienzyme biocatalysts for the synthesis of pyruvate (9 enzymes), glutathione (2 enzymes), acetoin (2 enzymes), fructose 1,6-diphosphate (4 enzymes), myo-inositol (4 enzymes and 11 enzymes), glucaric acid (3 thermostable enzymes), and cysteine (11 enzymes). In all of these cases, chemical synthesis has taken place at the optimal temperature of the biocatalyst, 70°–90 °C. A key challenge of performing chemical synthesis at thermophilic temperatures is the instability of cofactors and metabolites. ,, Of note, not all thermophilic proteins will fold correctly at mesophilic temperatures or in mesophilic organisms such as E. coli.

An analogous strategy in CFE involves expression of a thermophilic pathway in an E. coli lysate-based CFE to generate the thermophilic biocatalyst followed by heat denaturation to remove the E. coli proteins, i.e., the CFE background machinery. Such a strategy would allow the rapid prototyping of thermophilic biocatalysts in the absence of CFE background metabolism. Additionally, the lack of a cell membrane and the ability of CFE to withstand toxic compounds also open the door to complementary bioprocess optimization solutions, such as direct addition of inhibitors to improve chemical synthesis. Further, available tools for codon optimization of thermophilic genes to match the E. coli codon usage together with inexpensive DNA synthesis improve the feasibility of prototyping thermophilic genes for biocatalyst generation. An important consideration is that not all thermophilic genes will express in a mesophilic system, thus, thermophilic CFE platforms have been developed for this purpose. ,− However, the expression of thermophilic genes in a thermophilic CFE system would not facilitate the rapid removal of the CFE machinery by simply increasing the reaction temperature.

Here, we engineer a 9-enzyme CFE-based thermophilic biocatalyst for the synthesis of serine and glycine from formate, bicarbonate, and ammonia (Figure A). The biocatalyst captures 3 total CO2 equivalents with 2 CO2 equivalents (1 formate and 1 bicarbonate) captured per glycine molecule produced and an additional 1 CO2 equivalent (1 formate) per serine molecule produced (Figure B). The use of a thermophilic biocatalyst enables the removal of the mesophilic E. coli CFE background machinery after gene expression, impeding the siphoning of intermediates and cofactors away from the thermophilic biocatalyst. The 9-gene THF/rGS thermophilic pathway is generated in a one-pot reaction using E. coli lysate-based CFE. Since previous efforts have focused on reconstituting the mesophilic THF/rGS pathway, ,,,,, this is the first generation of a thermophilic THF/rGS pathway. To balance enzyme activity with minimal cofactor degradation, chemical synthesis was performed at mesophilic temperatures (30 °C) rather than thermophilic temperatures (over 50 °C). Via bioprocess optimizations, the CFE-based thermophilic biocatalyst achieves near stoichiometric yield of serine and glycine (97%) from one-carbon feedstocks using a 20 h multigene expression step and a 4 h chemical synthesis step. This yield is 3-fold higher than that using a CFE-based mesophilic formate-to-serine biocatalyst (30%), 4-fold higher than that using a purified enzyme system (22%), and almost 8-fold higher than previous microbial engineering efforts to produce lactate from formate in E. coli. This study is therefore the highest reported conversion of CO2 equivalents into amino acids compared to any previous E. coli whole cell, purified, or CFE-based biocatalyst.

1.

1

Cell-free expression (CFE)-based thermophilic biocatalyst for the carbon-negative synthesis of serine and glycine from formate, bicarbonate, and ammonia. (A) CFE-based thermophilic formate-to-serine biocatalyst. Enzyme (blue): MT_fhs, Moorella thermoacetica formate-tetrahydrofolate ligase; MT_folD, M. thermoacetica bifunctional protein performing the activities of methenyltetrahydrofolate cyclohydrolase and methylenetetrahydrofolate dehydrogenase; MT_gcvHLPT, M. thermoacetica glycine cleavage system H, L, P, and T proteins; MT_lipM, M. thermoacetica octanoyltransferase; MT_shmt, M. thermoacetica serine hydroxymethyltransferase; ptdh*, Pseudomonas stutzeri phosphite dehydrogenase mutant. Substrates (green, red): H2CO3, bicarbonate, NH3, ammonia. Metabolites (gray): CHO–THF, 10-formyltetrahydrofolate, CH2–THF, 5,10-methylenetetrahydrofolate. Cofactors (brown): THF, tetrahydrofolate, ATP, NAD­(P)­H. (B) Cartoon representation of carbons (formate, bicarbonate) incorporated into glycine and serine. (C) Equations for stoichiometric yield and formate incorporation used in this study. (D) CFE-based thermophilic biocatalyst bioprocess. Step 1: One-pot expression of pathway genes for biocatalyst generation. Step 2: Biocatalyst dilution. Step 3: Heat denaturation of the mesophilic CFE machinery followed by debris removal via centrifugation. Step 4: Thermophilic biocatalysis performs chemical synthesis at mesophilic temperatures (30 °C). (D) Synthesis of serine and glycine from formate, bicarbonate, and ammonia obtained via a CFE-based mesophilic formate-to-serine biocatalyst (Chowdhury et al. 2024) or the CFE-based thermophilic formate-to-serine biocatalyst (this work). Reaction conditions: Gene expression: 30 °C, 16 h; dilution: 10×; heat denaturation: 50 °C, 10 min; chemical synthesis: 30 °C, 4 h; supplementation: 2 mM CH2O2, 2 mM THF, 1 mM H2CO3, 1 mM NH3, 2 mM ATP, 1 mM NADH, 2 mM NADPH, 5 mM Na2HPO3. The bars represent the mean ± standard error of the mean (SEM), n = 3, ****p < 0.0001. Data were analyzed using a two-way ANOVA followed by a multiple comparisons test via the Tukey method. Calculations can be found in Table S2.

Results

Formate-to-Serine Biocatalyst

For engineering purposes, we split the formate-to-serine pathway into three modules (Figure A). In module 1, THF-dependent formate fixation, formate (CH2O2) attaches to THF to become the C1 donor CH2–THF. In module 2, reductive glycine synthesis, CH2–THF combines with bicarbonate (H2CO3) and ammonia (NH3) to generate glycine. In module 3, the serine synthesis, glycine combines with a second CH2–THF to generate serine. Modules 2 and 3 recycle THF. Regeneration of NAD­(P)H is achieved via a Pseudomonas stutzeri phosphite dehydrogenase mutant. The cofactor ATP is not recycled in the system.

Based on pathway stoichiometry, 1 mol CH2O2, 1 mol H2CO3 (2 CO2 equivalents), and 1 mol NH3 are needed per mol of glycine generated, while an additional mol of CH2O2 is needed per mole of serine produced (Figure B). To calculate yields of the system, all our experiments were conducted at stoichiometric substrate concentrations, that is, 2 mM CH2O2, 1 mM H2CO3, and 1 mM NH3, which could result in a maximum concentration of 1 mM combined glycine and serine. Thus, the stoichiometric yield is defined as the actual concentration of serine and glycine divided by the theoretical maximum concentration of serine and glycine produced (1 mM) (Figure C and Table S2). We were also interested in determining the percent of formate incorporated into amino acids. This is calculated as two times the serine concentration (2 CH2O2 per serine generated) plus the glycine concentration divided by the concentration of fed formate (Figure C and Table S3).

Cell-Free Expression-Based Thermophilic Biocatalyst Process Strategy

Removal of the CFE background machinery should eliminate siphoning of metabolic intermediates and cofactors away from the formate-to-serine biocatalyst, thus achieving a close to 100% product yield. Figure D shows the envisioned CFE-based thermophilic biocatalyst process. Step 1, gene expression, involves the one-pot expression of the thermophilic formate-to-serine pathway genes in an E. coli lysate-based CFE at 30 °C. During the gene expression step, optimal E. coli lysate-based CFE transcription/translation conditions in terms of temperature, cell lysate, energy molecules, cofactors, and buffer are set. In Step 2, biocatalyst dilution, the thermophilic biocatalyst is diluted 10-fold using buffer to increase substrate loading and reach higher product levels. Step 3, biocatalyst purification, entails removal of the mesophilic E. coli CFE background machinery by heat denaturation. The precipitated E. coli proteins are separated via centrifugation, leaving the thermophilic biocatalyst in the solution phase. In Step 4, chemical synthesis, to the solution phase containing the thermophilic biocatalyst, substrate and cofactors are added and chemical synthesis is run at mesophilic temperatures (30 °C). Use of mesophilic temperatures limits thermal degradation of cofactors, including ATP, NAD+, NADH, ,, and THF , that would have occurred at higher temperatures, thus bypassing a major obstacle when using thermophilic biocatalysts.

CFE-Based Thermophilic Formate-to-Serine Biocatalyst Design

Due to previous challenges with thermal degradation of cofactors at hyperthermophilic temperatures (70°–90 °C), ,, we sought to design a thermophilic formate-to-serine biocatalyst that would withstand heat denaturation of the E. coli-based CFE machinery at 45°–50 °C yet have high activity during the chemical synthesis step performed at mesophilic temperatures (30 °C). Hyperthermophilic enzymes often have greatly reduced activities at lower temperatures, experiencing over 80% loss of activity below 60 °C. Thus, we focused on identifying thermophilic enzymes with experimental data confirming optimal activity at moderately thermophilic conditions (50–70 °C). Such enzymes, we hypothesized, would allow the balance of optimal enzyme activity with minimal metabolite thermal degradation at mesophilic chemical synthesis temperatures.

Despite the sparse kinetic data on thermophilic enzymes, activity for module 1 enzymes is available for the moderately thermophilic Moorella thermoacetica (formerly named Clostridium thermoaceticum). M. thermoacetica is an anaerobic organism with an optimal growth temperature of 55–60 °C, naturally performs THF-dependent CO2 fixation, and is a well-studied thermophile, having been the gene source for the engineering of a thermophilic ethanol fermentation pathway.

The thermophilic module 1 is composed of M. thermoacetica formate-tetrahydrofolate ligase (Fhs) and the bifunctional NADP+-dependent methenyltetrahydrofolate cyclohydrolase/methylenetetrahydrofolate dehydrogenase (FolD). Compared to the mesophilic Methylorubrum extorquens AM1 THF-dependent formate fixation pathway, which has been widely used to reconstitute the THF pathway in mesophilic biotechnology amenable organisms, ,,,,,, the activity of M. thermoacetica FolD is performed by two enzymes, M. extorquens methenyltetrahydrofolate cyclohydrolase (Fch) and the NADP+-methylenetetrahydrofolate dehydrogenase (MtdA). In the M. extorquens THF pathway, MtdA is the rate limiting step due to its high reversibility, requiring NADP+ recycling to drive the reaction forward. Kinetic data for Fhs and FolD suggest that the M. thermoacetica enzymes are faster than the equivalents from M. extorquens (Table S1).

There are no kinetic data for thermophilic enzymes in the reductive glycine pathway (rGS, module 2). Therefore, we used rGS enzymes from the same organism, M. thermoacetica, as using enzymes from the same organism supports recapitulation of protein–protein interactions important for protein complex formation. Briefly, M. thermoacetica rGS genes were identified using automated gene predictions from the BioCyc genome database, as derived from protein homology. BLAST protein analyses were performed to compare the thermophilic protein sequences with their mesophilic homologues. Although the similarity scores were relatively low for most enzymes (Table S1), the statistical significance of the protein alignments was high. Taken together, the thermophilic module 2 is composed of M. thermoacetica glycine cleavage system proteins H, L, P, and T (GcvHLPT) and lipoyl­(octanoyl) transferase (LipM). Of note, mesophilic rGS systems have relied on the E. coli rGS pathway, which uses lipoate-protein ligase (LplA, EC 6.3.1.20) to perform the sequential reactions of ATP transfer to lipoic acid to generate lipoate-AMP, followed by lipoate transfer to GcvH. In M. thermoacetica, a general transferase, LipM, is used to perform both reactions. Finally, the thermophilic module 3 is composed of M. thermoacetica serine hydroxymethyltransferase (Shmt). Recycling of NAD­(P)H was performed by a previously engineered thermostable P. stutzeri phosphite dehydrogenase (ptdh*), which has been used for cofactor recycling in CFE.

Evaluation of the CFE-Based Thermophilic Formate-to-Serine Biocatalyst

The CFE-based mesophilic formate-to-serine biocatalyst achieved a 30% stoichiometric yield using a bioprocess composed of a 16 h gene expression step, 10-fold biocatalyst dilution, and a 4 h chemical synthesis step. Like in the mesophilic system, in the CFE-based thermophilic biocatalyst, the genes were expressed from a PT70 promoter using linear DNA. The formate-to-serine pathway gene ratios resulting in maximal glycine/serine synthesis in the mesophilic system (gcvHLPT/lplA = 192:2:1:4:2 and ftl/fch/mdta/shmt/ptdh* = 3:3:3:3:3) were used in the thermophilic system. Using the same bioprocess parameters plus heat denaturation to remove the mesophilic CFE background machinery prior to chemical synthesis, the CFE-based thermophilic biocatalyst achieved 88% of the stoichiometric yield of combined serine and glycine from formate and bicarbonate (0.49 mM ± 0.02 mM serine and 0.39 mM ± 0.05 mM glycine), with 68.5% formate incorporation (Figure E, Tables S2 and S3). Formate-derived CH2–THF is pulled into both module 2 to synthesize glycine (via incorporation of H2CO3) and module 3 to convert glycine into serine. The remaining ∼30% formate may be in the form of formate or CH2–THF, but it has not been combined with glycine to form serine. Taken together, the CFE-based thermophilic formate-to-serine biocatalyst and the outlined bioprocess nearly triple the combined serine and glycine yields achieved by the mesophilic system.

Impact of the Mesophilic CFE Machinery on Amino Acid Yields

The drastic improvement in amino acid synthesis when using the CFE-based thermophilic biocatalyst coupled to the heat-denaturing treatment led us to investigate the role of the CFE background machinery on amino acid yields. As shown in Figure A, we measured the yield of serine and glycine from formate in the presence and absence (due to heat denaturation treatment) of the CFE background machinery over a period of 12 h. For up to 4 h of chemical synthesis, there is no difference in the synthesis of glycine or serine in the presence or absence of the CFE machinery. The chemical synthesis for both serine and glycine peaks at 4 h, reaching 0.39 ± 0.05 mM glycine and 0.49 ± 0.02 mM serine in the absence of the CFE machinery and 0.32 ± 0.01 mM glycine and 0.44 ± 0.04 mM serine in the presence of it. Starting at 8 h, however, the glycine concentration in the presence of the CFE machinery starts to drop while it remains constant in the absence of the CFE machinery. Most dramatically, after 8 h of chemical synthesis, the glycine concentration in the presence of the CFE machinery is 33% lower than in the absence of it. There is a similar trend in the synthesis of serine. After 12 h of chemical synthesis, the serine concentration is 53% lower in the presence of the CFE machinery versus in the absence of it. Given that heat denaturation is the only difference between treatments, the CFE background machinery must consume the serine and glycine synthesized by the biocatalyst from formate, thus reducing the overall chemical yield.

2.

2

Impact of mesophilic CFE background machinery on amino acid yield and cofactor consumption. (A) Synthesis of serine and glycine from formate, bicarbonate, and ammonia by the thermophilic formate-to-serine biocatalyst in the presence (black) and absence via heat denaturation (red) of the mesophilic CFE background machinery. Left: Process schematic. Right: Glycine or serine concentration as a function of chemical synthesis time. Reaction conditions: Gene expression: 30 °C, 16 h; dilution: 10×; heat denaturation: 50 °C, 10 min or none; chemical synthesis: 30 °C, 2–12 h; supplementation: 2 mM CH2O2, 2 mM THF, 1 mM H2CO3, 1 mM NH3, 2 mM ATP, 1 mM NADH, 2 mM NADPH, 5 mM Na2HPO3. Data at the t = 0 h time point were taken from the Fmoc glycine and Fmoc serine standard curves using plain CFE (no genes expressed) with Tris buffer and chemical denaturant. Data points represent the mean ± standard error of the mean (SEM), n = 3, with *p < 0.05, **p < 0.005, ****p < 0.0001 as a comparison of mesophilic CFE machinery absent (red) versus mesophilic CFE machinery present (black) at the same chemical synthesis time. Data were analyzed using a two-way ANOVA followed by a multiple comparisons test via the Šídák method. (B) Consumption of NADPH incubated in plain CFE in the presence (black) and absence via heat denaturation (red) of the mesophilic CFE background machinery. Reaction conditions: Gene expression: 30 °C, 16 h; dilution: 10×; heat denaturation: 50 °C, 10 min or none; chemical incubation: 30 °C, 2–12 h; supplementation: 1 mM NADPH. Left: Process schematic. Right: NADPH or NADP+ concentration as a function of chemical incubation time. Data points represent mean ± SEM, n = 3, with ****p < 0.0001 as a comparison of mesophilic CFE background machinery absent (red) versus mesophilic CFE background machinery present (black) at the same chemical incubation time. Data were analyzed using a two-way ANOVA followed by a multiple comparisons test via the Šídák method.

Impact of the Mesophilic CFE Machinery on Cofactor Availability

Previous work has shown that the CFE background machinery siphons NADH. Thus, we investigated the effect of the CFE machinery on NADPH concentration as two NADPH equivalents are used per serine synthesized. Using plain CFE (i.e., CFE expressing no genes) incubated at 30 °C for 16 h to mimic the gene expression step and supplemented with 1 mM NADPH, we measured NADPH concentrations in the presence and absence (due to heat-denatured treatment) of the CFE background machinery over a 12 h period. In the presence of the CFE background machinery, close to 40% of the NADPH is consumed in the first 2 h, reaching almost 100% consumption after 8 h (Figure B). In the absence of the CFE background machinery, NADPH concentration is unchanged for the first 4 h (1.08 ± 0.03 mM). However, NADPH concentration drops 46% between 4 and 8 h followed by a 34% drop between 8 and 12 h. The halving of the NAPDH concentration after 8 h can be explained by the thermal degradation of NADPH, which has a half-life of ∼8.6 h at pH 7 at 30 °C.

Regarding the NADP+ concentration, in the presence of the CFE background machinery, there is an exponential increase in NADP+ reaching 0.83 ± 0.01 mM after 8 h and plateauing thereafter. In the absence of the CFE machinery, the NADP+ concentration increases linearly, reaching 0.67 ± 0.02 mM after 12 h. The increase in NADP+ concentration between 0 and 4 h in the absence of the CFE background machinery is surprising as there is no decrease in the NADPH concentration during the same time period. We hypothesize that the 36% increase in NADP+ over the first 4 h may be due to some of the CFE background machinery not being completely removed upon heat denaturation, since some proteins can remain partially soluble in their unfolded states. , Unfolded E. coli proteins in the supernatant may refold and regain activity upon returning to mesophilic temperatures, reconstituting some of the CFE machinery components. Indeed, we observe that although most of the proteins precipitate upon heat denaturation, some proteins are still precipitated out by a subsequent chemical denaturation step (Figure S1). Taken together, the heat denaturation step removes sufficient CFE background machinery to stop NADPH consumption for the first 4 h, which represents the time required by the thermophilic biocatalyst to achieve close to stoichiometric yields of serine and glycine from formate.

Biosynthetic Contribution of Enzymes in the THF-Dependent Formate Fixation Module

The presence of the CFE background machinery does not reduce the glycine or serine concentration obtained by the thermophilic biocatalyst up to the 4 h chemical synthesis mark. Thus, we could compare head-to-head the efficiency of the mesophilic and thermophilic THF/rGS biocatalyst using a 4 h chemical synthesis step in the presence of the CFE background machinery. In particular, we were curious about comparing the efficiency of the THF-dependent formate fixation (module 1, Figure A), as the mesophilic system relies on two enzymes (M. extorquens Fch and MdtA), whereas the thermophilic system uses a single bifunctional enzyme (M. thermoacetica FolD). Given the chemical instability of CH2–THF (Figure S2), we perform the module 1 experiments using a 30 min chemical synthesis step.

3.

3

Biosynthetic contribution of thermophilic and mesophilic enzymes in the THF-dependent formate fixation module. (A) Left: Schematic of the mesophilic THF-dependent formate fixation pathway (module 1). Enzyme abbreviations shown in blue: ME_ftl, Methylorubrum extorquens formate-tetrahydrofolate ligase; ME_fch, M. extorquens methenyl-THF cyclohydrolase; ME_mtdA, M. extorquens methylene THF dehydrogenase; ptdh*, phosphite dehydrogenase mutant. Right: Schematic of the thermophilic module 1. Enzyme abbreviations shown in red/blue: MT_fhs, Moorella thermoacetica formate-tetrahydrofolate ligase; MT_folD, M. thermoacetica bifunctional protein performing the activities of methenyltetrahydrofolate cyclohydrolase and methylenetetrahydrofolate dehydrogenase; ptdh*, phosphite dehydrogenase mutant. (B) Concentration of module 1 metabolites (THF, CH = THF, and CH2–THF) as a function of the module 1 enzyme source (mesophilic, thermophilic) and bioprocess utilized. Of note, heat denaturation of the CFE background machinery was only done in the thermophilic module 1, CFE absent samples. Reaction conditions for (B,C): Gene expression: 30 °C, 16 h; biocatalyst dilution: 10×; heat denaturation, 50 °C, 10 min or none; chemical synthesis, 30 °C, 30 min; substrate/cofactor supplementation, 1 mM CH2O2, 1 mM THF, 1 mM ATP, 2 mM NADPH, 5 mM Na2HPO3. The bars represent the mean ± standard error of the mean (SEM), n = 3, *p < 0.05, ****p < 0.0001. Data were analyzed using a two-way ANOVA followed by a multiple comparisons test via the Tukey method. (C) Concentration of module 1 metabolites (THF, CH = THF, and CH2–THF) as a function of the module 1 enzyme source. The CFE machinery is present in all samples (no heat denaturation). The bars represent mean ± SEM, n = 3, *p < 0.05, **p < 0.005, ****p < 0.0001. Data were analyzed using a two-way ANOVA followed by a multiple comparisons test via the Tukey method. Metabolite abbreviations: THF, tetrahydrofolate; CH = THF, 5,10-methenyltetrahydrofolate; CH2–THF, 5,10-methylenetetrahydrofolate.

As shown in Figure B, plain CFE supplemented with 1 mM THF does not synthesize CH = THF or CH2–THF. The mesophilic Module 1 converts all THF accumulating CH = THF (0.65 ± 0.02 mM) and has nondetectable levels of CH2–THF. As CFE background reactions are known to consume CH2–THF, a C1 donor used in purine, methionine, and thymidylate biosynthesis, it is possible that ∼0.4 mM of CH2–THF were generated and then consumed by CFE-background reactions.

The thermophilic module 1 in the presence of the CFE background machinery converts THF to CH = THF slowly. After 30 min, CH = THF reaches 0.50 ± 0.1 mM with 0.34 ± 0.1 mM THF remaining and limited accumulation of CH2–THF (0.01 ± 0.0 mM) is observed. In the absence of the CFE background machinery, the thermophilic module 1 converts THF to CH = THF even more slowly, with 0.87 ± 0.1 mM THF remaining after 30 min, yet accumulates CH2–THF to 0.17 ± 0.1 mM. Thus, at the same DNA loading, the mesophilic module 1 converts THF to CH = THF faster than the thermophilic module 1. The removal of the CFE background machinery from thermophilic module 1 enables accumulation of CH2–THF, supporting higher glycine/serine synthesis.

Next, we tested module 1 hybrids generated by a mixture of mesophilic and thermophilic enzymes in the presence of the CFE machinery. A module 1 hybrid composed of the mesophilic Ftl and the thermophilic FolD resulted in complete THF conversion, achieving 0.60 ± 0.0 mM CH = THF. A module 1 hybrid composed of the thermophilic Fhs and the mesophilic Fch/MdtA converted THF more slowly, reaching only 0.56 ± 0.0 mM CH = THF and leaving 0.27 ± 0.06 mM unreacted THF. No CH2–THF was detectable in either of the module 1 hybrids due to the presence of the CFE background. Taken together, these results show that the presence of the mesophilic CFE machinery limits the accumulation of CH2–THF.

Bioprocess Optimization of the Thermophilic CFE-Based Biocatalyst

To further increase amino acid synthesis yields, we optimized the bioprocess conditions, including (1) heat denaturation temperature, (2) the need to physically separate the denatured mesophilic CFE machinery, (3) chemical synthesis temperature, and (4) pathway gene expression duration (Figure ).

4.

4

Bioprocess optimization of the CFE-based thermophilic formate-to-serine biocatalyst. Assessment of (A) heat denaturation temperature optimization, (B) the need to physically separate the denatured mesophilic CFE machinery via centrifugation, and (C) chemical synthesis temperature. Reactions conditions for (A–C): Gene expression: 30 °C, 16 h; biocatalyst dilution: 10×; heat denaturation; 50 °C or otherwise noted for 10 min followed by centrifugation or none; chemical synthesis: 30 °C or otherwise noted, 4 h; substrate/cofactor supplementation, 2 mM CH2O2, 2 mM THF, 1 mM H2CO3, 1 mM NH3, 2 mM ATP, 1 mM NADH, 2 mM NADPH, 5 mM Na2HPO3. Data shown in (A–C): Bars represent mean ± standard error of the mean (SEM), n = 3, *p < 0.05. Data were analyzed using a two-way ANOVA followed by a multiple comparisons test via the Šídák method. (D) Impact of gene expression duration on serine and glycine yields. Middle: Serine and glycine concentrations as a function of gene expression duration. Data points represent mean ± SEM, n = 3. Data were analyzed using a two-way ANOVA followed by a multiple comparisons test via the Tukey method. Multiple comparison of glycine concentration at 8 h vs 16 h (*p < 0.05) was statistically significant. Multiple comparisons of serine concentration at 8 h vs 16 h (**p < 0.005) and 12 h vs 20 h (*p < 0.05) were statistically significant. Full representation of all significant comparisons is shown in Figure S3. Right: Combined serine and glycine concentration as a function of gene expression duration. Data points represent mean ± SEM, n = 3. Calculations can be found in Table S3.

With regard to the heat denaturation temperature, we initially used 50 °C as E. coli enzymes denature above 45 °C, but M. thermoacetica enzymes would not likely be impacted, as M. thermoacetica has an optimal growth temperature of 55–60 °C. As shown in Figure A, a heat denaturation step of 50 or 45 °C results in similar serine and glycine concentrations, suggesting no major impact on thermophilic biocatalyst activity. To ensure effective mesophilic CFE machinery denaturation, we used 50 °C in all subsequent experiments.

Next, we determined the impact of physically separating the denatured mesophilic CFE machinery from the thermophilic biocatalyst that remains in solution. As Figure B shows, failure to remove the denatured CFE machinery via centrifugation results in a significantly lower concentration of glycine (0.39 mM ± 0.05 mM vs 0.27 mM ± 0.00 mM) but similar concentrations of serine (0.49 mM ± 0.02 mM vs 0.43 mM ± 0.00 mM vs). To maximize product yields, we continued to remove the denatured CFE machinery in subsequent experiments.

Then, we delved into the temperature of the chemical synthesis step. Thus far, we used 30 °C for chemical synthesis to minimize cofactor degradation. High temperatures are known to decrease the stability of several key metabolites and cofactors, with THF degrading rapidly above 37 °C, THF-based metabolites above 40C, and significant degradation of NAD­(P)H at 50 °C. However, the M. thermoacetica enzymes have an optimal temperature of over 55 °C while the P. stutzeri ptdh* mutant has an optimal temperature of 59 °C (Table S1). It is possible that increased enzyme activity may outweigh any cofactor degradation during the chemical synthesis step. As Figure C shows, the thermophilic formate-to-serine biocatalyst results in similar serine and glycine yields at 30 and 50 °C. Given that 30 °C is far from the optimal temperature for thermophilic enzyme activity ,, and thermal degradation of cofactors is a concern, this suggests the optimal chemical synthesis temperature depends not only on enzyme activity but also on metabolite stability. Therefore, in this thermophilic THF/rGS biocatalyst, any benefit from increased temperature and activity may be outweighed by the thermal instability of the cofactors. Thus, we held the chemical synthesis temperature at 30 °C in subsequent experiments.

Finally, we focused on optimizing the gene expression duration. While a longer gene expression time resultsup to a certain pointin more biocatalyst being generated, it also stretches the overall bioprocess run time. Initially, we chose a 16 h gene expression step as cell-free expression of PT70-GFP reaches saturation at that time. Gene expression, however, is affected by the gene sequence and length as well as the number of genes expressed in the one-pot CFE reaction. Thus, we analyzed the impact of varying the pathway gene expression duration on the serine and glycine yields. As shown in Figure D, starting at a 12 h gene expression step, the concentration of the intermediate glycine stabilizes at ∼0.39 mM. The concentration of serine, however, steadily increases over time, peaking at 0.58 mM ± 0.03 mM after a 20 h gene expression step. Indeed, after a 20 h gene expression step, the thermophilic formate-to-serine biocatalyst achieves a 0.97 mM combined serine and glycine concentration or 97% of the stoichiometric yield of combined serine and glycine from formate and bicarbonate with a 77.5% formate incorporation (Figure D, Tables S2 and S3). As using a 20 h gene expression step was not statistically different than using a 16 h gene expression step (88% of the stoichiometric yield), we used 16 h in all subsequent experiments to reduce the overall process time.

Impact of Lowering the Cofactor Concentration on Biocatalyst and Product Cost

Key challenges in using enzyme biocatalysts over a microbial biocatalyst for the synthesis of large-scale chemicals are the costs associated with (1) enzyme purification and the (2) exogenous addition of cofactors. As shown in this work, the CFE-based thermophilic biocatalyst can be separated in bulk from the mesophilic CFE background machinery by rapid and inexpensive heat denaturation. In the CFE-based formate-to-serine biocatalyst, the cost of the cofactors THF ($970/g), NADPH ($428/g), NADH ($60/g), and ATP ($33.6/g) outweighs the cost of the CFE cell lysate ($90/L). Thus, robust cofactor regeneration is pivotal to reducing the cost of the bioprocess.

Based on previous calculations, a 10-fold diluted CFE-based formate-to-serine biocatalyst costs $1.66/mL, when using equimolar concentrations of substrates and cofactors (2 mM formate, 1 mM NH3, 1 mM H2CO3, 2 mM THF, 2 mM NADPH, 1 mM NADH, 2 mM ATP), i.e., the base case. Figure A shows the impact of reducing the initial concentration of THF, NADPH, NADH, and ATP on biocatalyst cost. While reducing the concentration of THF or NADPH by 5-fold reduces the biocatalyst cost by ∼40%, a similar reduction in NADH concentration has a minimal impact on the biocatalyst cost (2% reduction). Combining 5-fold reductions in THF and NADH concentrations lowers the biocatalyst cost drastically, to $0.40/mL. Further reductions in biocatalyst cost require reductions in the concentration of NADH first and ATP last to ultimately achieve $0.09/mL. CFE-based formate-to-serine biocatalysts beyond $0.09/mL will require a reduction in the cost of the CFE lysate. Figure B shows the cost of synthesizing serine and glycine using the CFE-based thermophilic formate-to-serine biocatalyst as a function of the reduction in cofactor concentration. In the base case, the cost of serine is $27,299/g while the cost of glycine is $56,833/g. After applying cofactor reductions of THF (25-fold), NADPH (25-fold), NADH (5-fold), and ATP (5-fold), the cost of serine is $1,477/g and the cost of glycine is $3,074/g. Reductions in CFE lysate cost or reuse of the CFE-based catalyst over multiple cycles will be needed to bring down the serine and glycine price to $1–10/g.

5.

5

Impact of lowering the concentration of regenerated cofactors on amino acid yields. (A) Biocatalyst cost reduction ($/mL) as a function of cofactor reduction. Calculations are based using a 10-fold diluted CFE-based thermophilic formate-to-serine with an initial cost of $1.66/mL. Base case cofactor concentration (1×-fold [cofactor]): 2 mM THF, 2 mM NADPH, 2 mM NADH, 2 mM ATP. Commercial reagent prices used in cost analysis can be found in Table S11. (B) Cost of serine and glycine from formate produced by the CFE-based thermophilic biocatalyst as a function of cofactor concentration. (C–E) Concentration of serine and glycine from formate as a function of reductions in THF (C), NADPH (D), and NADH (E) concentrations. Reaction conditions: Gene expression: 30 °C, 16 h; biocatalyst dilution, 10×; heat denaturation: 50 °C, 10 min; chemical synthesis: 30 °C, 4 h; substrate/cofactor supplementation, 2 mM CH2O2, 1 mM H2CO3, 1 mM NH3, 2 mM ATP, 5 mM Na2HPO3, or the specified cofactor concentration. Bars represent mean ± standard error of the mean (SEM), n = 3, *p < 0.05, **p < 0.005, ****p < 0.0001. Data were analyzed using two-way ANOVA followed by a multiple comparisons test via the Tukey method.

Efficiency of the CFE-Based Thermophilic Formate-to-Serine Biocatalyst at Lower Cofactor Concentrations

To determine how well the CFE-based thermophilic formate-to-serine biocatalyst regenerates THF, NADPH, and NADH, we performed the bioprocess at reduced cofactor concentrations. As shown in Figure C, reducing the THF concentration 5-fold (0.4 mM) reduces the biocatalyst cost by 42% ($0.97/mL), while reaching a similar concentration of serine (0.48 ± 0.01 mM) and glycine (0.35 ± 0.02 mM). Reducing the concentration of THF 10-fold (0.2 mM THF) leads to similar glycine concentration but significantly lower serine concentration (0.25 ± 0.02 mM). With respect to NADPH, even a 5-fold reduction in concentration (0.4 mM) significantly reduces the serine (0.36 ± 0.03 mM) and glycine (0.24 ± 0.01 mM) (Figure D). We hypothesize that the speed at which NADP+ is regenerated to NADPH by ptdh* and the catalytic rate of FolDthe only enzyme that utilizes NADPH as a cofactordo not match. Lastly, dropping the NADH concentration by 5-fold (0.2 mM) results in similar serine levels but significantly lower glycine concentration at 0.23 ± 0.01 mM (Figure E). These data suggest a mismatched activity between NADH recycling by ptdh* and the reductive glycine synthesis module. Finally, we attempted to simultaneously reduce the THF and NADH concentration by 5-fold, but it resulted in a significant reduction of serine and glycine (Figure S4). Taken together, cofactor recycling in the formate-to-serine biocatalyst is complex as THF regeneration is intertwined with NAD­(P)­H regeneration. NADPH regeneration is needed in the THF-formate fixation module that consumes THF, while NADH regeneration is needed in the reductive glycine synthesis module that regenerates THF.

CFE-Based Thermophilic Formate-to-Serine Pathway as a Fed-Batch Process

The cost of the CFE-based thermophilic biocatalyst process could also be reduced by transitioning from a batch processwhere the enzymes are single-useto a fed-batch process, where the enzymes remain in the reactor and substrates/cofactors are periodically added to the system. For the batch process, we observe plateauing of chemical synthesis after a 4 h chemical synthesis step (Figure A). Hypothesizing that the biocatalyst runs out of substrates and cofactors after 4 h rather than the enzymes losing their activity, we used the CFE-based thermophilic biocatalyst in a fed-batch process. Specifically, we supplemented substrates and cofactors at the beginning of the chemical synthesis step (t = 0 h) and after 4 h of chemical synthesis (t = 4 h) and measured serine and glycine concentration at t = 4 h, 8 h, 12 h, and 24 h (Figure A).

6.

6

CFE-based thermophilic formate-to-serine biocatalyst fed-batch process: (A) Reaction conditions for fed-batch reactions with protocol changes highlighted in yellow. Complete reaction conditions: Gene expression: 30 °C, 16 h; Biocatalyst dilution, 10×; heat denaturation, 50 °C, 10 min; 2nd batch chemical synthesis, 30 °C, 4–24 h; substrate/cofactor supplementation at first and second batch, 2 mM CH2O2, 2 mM THF, 1 mM H2CO3, 1 mM NH3, 2 mM ATP, 1 mM NADH, 2 mM NADPH, 5 mM Na2HPO3. (B) Combined serine and glycine concentration as a function of chemical synthesis time. Chemicals were spiked to initiate a 2nd batch at the 4 h point (purple line and arrow). Black line: reaction performance when chemicals and cofactors are not spiked (data from Figure A). Red dotted lines indicate the stoichiometric per-batch yield from the biocatalyst given the initial or spiked substrate/cofactor concentrations. Stoichiometric yield for each batch is indicated next to the graph. Full yield calculations can be found in Table S2. Metabolite production rates from 0–4 h and 4–8 h are indicated in the table on the right. Data points represent mean ± SEM, n = 3. Individual serine and glycine synthesis data are shown in Figure S5.

The biocatalyst was generated as previously described and supplemented with equimolar concentrations of substrates and cofactors. After 4 h of chemical synthesis (t = 4 h), the biocatalyst generated 0.49 ± 0.02 mM serine (0.12 mM/h ± 0.00 mM/h) and 0.39 ± 0.05 mM glycine (0.10 ± 0.00 mM/h) for an overall 88% of the stoichiometric yield (Figure B). The biocatalyst was then spiked with equimolar concentration of substrates and cofactors to run a second batch of the process. During the second 4 h batch (t = 8 h), the biocatalyst produced 0.23 mM glycine and 0.33 mM serine or 56% of the stoichiometric yield with a productivity of 0.08 ± 0.00 mM/h serine and 0.06 ± 0.01 mM/h glycine (Figure B, Tables S2 and S3). Therefore, the biocatalyst was still active during the second fed batchhowever, its biosynthetic productivity dropped by ∼40% from batch to batch. Beyond 8 h (t = 12 h and t = 24 h), the combined concentration of serine and glycine plateaus (Figure B). These results show that the CFE-based thermophilic formate-to-serine biocatalyst can be used in a fed-batch process with a ∼25% decrease in product yield from batch to batch. In the future, introduction of a third and a fourth batch will help determine the limits of the CFE-based biocatalyst.

Discussion

In this work, we used a mesophilic E. coli lysate-based CFE to generate a 9-gene thermophilic formate-to-serine biocatalyst in a one-pot reaction for the synthesis of serine and glycine from formate, bicarbonate, and ammonia. The thermophilic nature of the biocatalyst enabled its rapid and cost-effective purification away from the mesophilic CFE machinery, which siphons the amino acid products, in-pathway intermediates, and cofactors if not removed. The thermophilic biocatalyst was built using enzymes from the moderate thermophile M. thermoacetica, which allowed the chemical synthesis to be run at mesophilic temperatures, which limited thermal cofactor degradation. The thermophilic biocatalyst performance did not improve when running the chemical synthesis at moderately thermophilic temperatures, supporting the hypothesis that optimal enzyme activity and thermal cofactor degradation need to be balanced when using thermophilic enzymes. Ultimately, the CFE-based thermophilic biocatalyst achieves a 97% of the stoichiometric yield of combined serine and glycine from one-carbon feedstocks using a 20 h gene expression step and a 4 h chemical synthesis. Finally, with an eye toward process scalability, we demonstrate the potential of using a CFE-based biocatalyst in a fed-batch process over 8 h of performance. To the best of our knowledge, this is the first instance of expressing enzymes from a thermophilic organism for chemical synthesis and utilizing those enzymes at a mesophilic temperature in an E. coli-based CFE system.

The commercial feasibility of CFE-based biocatalysts will depend on lowering the cost of the cell lysate ($90/L) and the cofactors. In this work, we lowered the bioprocess cost by reducing the cofactor loading. With a cost of $1.66/mL, the current CFE-based thermophilic biocatalyst produces serine and glycine at a cost of $27,299/g and $56,833/g, respectively. Reducing the concentration of the most expensive cofactor, THF, by 5-fold did not significantly reduce the yield of glycine or serine from formate while allowing a 42% reduction in bioprocess cost ($1.66/mL to $0.97/mL). The biocatalyst does not regenerate ATPthe least expensive cofactorwhich does not significantly affect the biocatalyst cost. Nevertheless, in the future, ATP regeneration could be implemented with a thermophilic polyphosphate kinase, as has been demonstrated in multiple systems. ,, Reductions in cofactor concentration could bring the cost of the biocatalyst down to $0.09/mL, reducing the cost of serine and glycine to $1,477/g and $3,074/g, respectively. Further reductions in biocatalyst cost and ultimately product cost will come from reducing the cost of the cell lysate or the ability to reuse the biocatalyst in a multi-fed-batch or continuous process.

Toward developing a continuous process, we assess the ability of the biocatalyst in a fed-batch system and observe limitations with the CFE-based biocatalyst activity lifetime. In this work, the thermophilic biocatalyst displayed sustained activity over an 8 h period with 2 batches of reagents (substrates and cofactors) with a 24% reduction in the product yield from batch to batch. In the future, the bioprocess could be extended beyond two fed batches to a multibatch or continuous process to push the limits of the biocatalyst lifetime. Furthermore, the biocatalyst could be spiked only with substrates and not with cofactors to make full use of the cofactor regeneration systems.

Future improvements to the thermophilic biocatalyst process will require a closer examination of pathway kinetics. For example, the current biocatalyst shows a biosynthetic efficiency imbalance between module 2 and module 3, both of which use CH2–THF, as glycine is accumulated in the system. Specifically, module 2 efficiently combines CH2–THF and H2CO3 to generate glycine, but module 3 is not able to combine all of the glycine with CH2–THF to generate serine. This indicates a pathway bottleneck in either module 3 or module 1 that generates CH2–THF from formate. Future investigation of pathway intermediates will determine the best places for optimization. Increasing cofactor recycling or using higher-activity enzyme homologues could improve individual modules. Balancing the biosynthetic efficiency of glycine conversion could be achieved by altering the module 2/module 3 gene ratios or timing the degradation of module 2 enzymes to facilitate the complete conversion of formate into serine. In this work, we investigated the activities of the module 1 enzymes while future studies should examine the Shmt enzyme of module 3the most thermodynamically unfavorable stepto improve serine production. Extending the pathway to other products is also an exciting possibility, as serine is just one step from pyruvate, a gateway to multiple bioproducts. This could provide a sink for serine to pull metabolites toward a single product. Whole-pathway insight could ultimately be gained through data-driven machine learning or kinetic models in which heat denaturation provides a significant advantage. Without interference from the background metabolism, the pathway can be more accurately investigated, resulting in more powerful models and predictions in the chemical yield.

Materials and Methods

Materials

All materials, including chemicals, solvents, kits, plasmids, primers, and gene sequences, can be found in the Supporting Information Tables S4–S10.

Bioinformatics Analysis

BRENDA and Uniprot were used to identify thermophilic enzymes with known experimental catalytic and thermostability parameters. Module 1 thermophilic enzymes were chosen based on their optimal kinetic and thermostability parameters at moderately thermophilic temperatures (50–70 °C). There were no experimental parameters for thermophilic glycine cleavage pathway (Gcv) enzymes or serine hydroxymethyltransferase (Shmt). Gene sequences for gcv and shmt were identified based on protein homology and automated gene predictions from the BioCyc database.

Thermophilic Formate-to-Serine Pathway Construction

M. thermoacetica fhs, foldD, gcvP, gcvH, gcvT, gcvL, lipM, and shmt were codon optimized for E. coli and commercially synthesized. Genes were cloned under control of the PT70 promoter in plasmid pTX/TL-P70a-deGFP between NdeI/XhoI using Gibson assembly. Clones were confirmed by whole plasmid DNA sequencing. The plasmid carrying ptdh* (p70a-P. stutzeri_ptdh*) has been previously disclosed. To generate the linear DNA for use in the CFE, the nine genes (genes fhs, foldD, gcvP, gcvH, gcvT, gcvL, lipM, shmt, and ptdh*) were amplified from their respective vectors using RW9/RW10 that bound ∼100bp upstream from the promoter and downstream from the terminator to protect the sequence from exonuclease degradation.

Synthesis of Serine and Glycine from Formate, Bicarbonate, and Ammonia

Reactions were set up in a 96-well plate using a Labcyte Echo 525. The reactions contained 100 μM PLP, 100 μM α-lipoic acid (dissolved in water), 3 nM PT70-M. thermoacetica_fhs, 3 nM PT70-M. thermoacetica_folD, 192 nM PT70-M. thermoacetica_gcvH, 2 nM PT70-M. thermoacetica_gcvL, 1 nM PT70-M. thermoacetica_gcvP, 4 nM PT70-M. thermoacetica_gcvT, 2 nM PT70-M. thermoacetica_lipM, 3 nM PT70-M. thermoacetica_shmt, and 3 nM PT70-P. stutzeri_ptdh*. By hand, the transcription/translation mixture (TX/TL, 75% vol) and water were added to reach 5 μL. Gene expression step: 4–20 h at 30 °C and shaken at 1.8g. Biocatalyst dilution step: 10× dilution was obtained by moving the biocatalyst to a PCR tube and adding 0.1 M Tris HCL at pH 8 to reach 50 μL. Heat denaturation step: The biocatalyst was overlaid with argon, incubated at 50 °C for 10 min, and centrifuged for 10 min to pellet the precipitated E. coli CFE background machinery. Chemical synthesis step: The biocatalyst supernatant was moved to a new PCR tube and supplemented with substrate, cofactors, and buffer to reach 50 μL. Final concentrations: 20 mM DTT, 100 μM α-lipoic acid (dissolved in DMSO), 2 mM THF, 2 mM formate, 2 mM NADPH, 2 mM ATP, 1 mM NH3, 1 mM NaHCO3, 1 mM NADH, and 5 mM Na2HPO3. Reactions with 5×, 10×, and 25× reduced THF concentration contained 0.4 mM, 0.2 mM, and 0.08 mM THF, respectively. Reactions with 5×, 10×, and 25× reduced NADPH concentration contained 0.4 mM, 0.2 mM, and 0.08 mM NADPH, respectively. Reactions with 5×, 10×, and 25× reduced NADH concentration contained 0.2 mM, 0.1 mM, and 0.04 mM NADH, respectively. The reactions were overlaid with argon and sealed to establish semianaerobic conditions. Chemical synthesis took place for 2–12 h at 30 °C shaken at 1.8g.

NADPH Consumption by Plain CFE with and without Heat Denaturation

Reactions were set up in a 96-well plate containing a TX/TL mixture (75% vol) and water to reach 5 μL. Reactions were incubated for 16 h at 30 °C and shaken at 1.8g. Plain CFE dilution step: 10× dilution was obtained by moving the CFE reaction mixture to a PCR tube and adding 0.1 M pH 8 Tris HCL to reach 50 μL. Heat denaturation step: Reactions that were heat denatured were incubated at 50 °C for 10 min and centrifuged for 10 min to pellet the precipitated E. coli CFE machinery. Heat-denatured supernatants were then moved to a new PCR tube and all reactions were supplemented with 1 mM NADPH. Zero-hour reactions were measured immediately while all other reactions were incubated for 2–12 h at 30 °C shaken at 1.8g.

Module 1 Reactions: Synthesis of CH2–THF from Formate

Reactions were set up in a 96-well plate using a Labcyte Echo 525. The reactions contained combinations of 3 nM PT70-M. thermoacetica_fhs, 3 nM PT70-M. thermoacetica_folD, PT70-E. coli _folD, PT70-M. extorquens_ftl, PT70-M. extorquens_fch, and PT70-M. extorquens_mtdA. All reactions contained 3 nM PT70-P. stutzeri_ptdh*. By hand, TX/TL (75% vol) and water were added to reach 5 μL. Gene expression step: 16 h at 30 °C shaken at 1.8g. Biocatalyst dilution step: 10× dilution was obtained by moving the biocatalyst to a PCR tube and adding 0.1 M pH 8 Tris HCL to reach 50 μL. Heat denaturation step: For heat-denatured samples, the biocatalyst was overlaid with argon, incubated at 50 °C for 10 min, and centrifuged for 10 min to pellet the precipitated E. coli CFE machinery. Chemical synthesis step: For heat-denatured samples, the biocatalyst supernatant was moved to a new PCR tube. All reactions were supplemented with substrate, cofactors, and buffer to reach 50 μL. Final concentrations: 1 mM THF, 1 mM formate, 1 mM NADPH, 1 mM ATP, and 5 mM Na2HPO3. The reactions were overlaid with argon and sealed to establish semianaerobic conditions. Chemical synthesis took place for 30 min at 30 °C and shaken at 1.8g.

Fed-Batch Synthesis of Serine and Glycine from Formate, Bicarbonate, and Ammonia

Reactions were set up and incubated by following the same protocol as the batch reactions. After the first 4 h chemical synthesis step, tubes were unsealed and all chemicals (20 mM DTT, 100 μM α-lipoic acid (dissolved in DMSO), 2 mM THF, 2 mM formate, 2 mM NADPH, 2 mM ATP, 1 mM NH3, 1 mM NaHCO3, 1 mM NADH, and 5 mM Na2HPO3) were spiked into the reaction in an additional 5 μL of volume to reach the same initial concentrations (10% volume increase assumed to be negligible). The reactions were then overlaid with argon and resealed. Chemical synthesis was resumed for an additional 4–20 h at 30 °C and shaken at 1.8g.

Amino Acid Derivatization

For liquid chromatography/mass spectrometry (LC/MS) detection and quantification of serine and glycine, the amino acids were derivatized to their Fmoc-protected versions using 9-fluorenylmethoxycarbonyl chloride (Fmoc-Cl) and yield calculations were done assuming 100% derivatization efficiency in both the standard curves and experimental data (Figure S6). To the 10× diluted biocatalyst (50 μL) was added 50 μL of 5% acetic acid in methanol containing 2 mM Boc-serine (internal standard) for a final concentration of 1 mM Boc-serine in the reaction. The denatured reaction was centrifuged for 10 min, and 25 μL of the supernatant was moved to a new tube. The pH was adjusted to 8.3 using 50 μL of saturated NaHCO3 and then 100 μL of 3 mM Fmoc-Cl dissolved in acetone was added. The reactions were performed at room temperature for 10 min. The Fmoc-derivatized amino acids were extracted using 500 μL of ethyl acetate (3×), dried under a vacuum, and resuspended in 200 μL of methanol. Derivatized amino acids were spun at 21,000g for 15 min, and then 100 μL of the supernatant was removed for analysis.

Metabolite Quantification

The Fmoc-derivatized amino acids were quantified using an Agilent 1260 Infinity II HPLC system equipped with an Agilent Q-TOF 6530 detector using a Poroshell 120 SB-C18 3.0 × 50 × 2.7 μm column. LC conditions: Solvent Awater with 0.1% formic acid, solvent Bacetonitrile with 0.1% formic acid. Gradient: 0 min, 5% B; 3 min, 5% B; 15 min, 100% B; 18 min, 100% B; 20 min, 5% B; 22 min, 5% B. MS acquisition: Extracted ion chromatograms in positive ion mode were used to detect and quantify Fmoc serine (m/z 328.11) and Fmoc glycine (m/z 298.11). Fmoc-derivatized commercial serine and glycine were used to determine retention times and generate standard curves for chemical quantification.

All other metabolites were detected and quantified via LC/MS without derivatization using an Agilent 1260 Infinity II HPLC system equipped with an Agilent LC/MSD iQ Single Quadrapole MS and an electrospray ion source using a Poroshell 120 EC-C18 2.1 × 50 × 2.7 μm column. The proteins in the 10× diluted biocatalyst were denatured by adding 50 μL of 5% acetic acid in methanol spiked with 0.1 mM catechol (internal standard). The denatured reactions were centrifuged at 3,600g for 10 min, and the supernatant was directly analyzed. Column temperature was kept constant at 28 °C. LC conditions: Solvent Awater with 3% methanol, 10 mM tributylamine, and 15 mM acetic acid, solvent Bmethanol. Gradient: 0 min, 0% B; 1 min, 0% B; 2 min, 65% B; 4.5 min, 77.5% B; 5 min, 95% B; 6 min, 0% B; and 8.5 min, 0% B. MS acquisition: negative ion scan was used to extract ion chromatograms and quantify THF (m/z 444), CH = THF (m/z 454), CH2–THF (m/z 456), NADP+ (m/z 742), and NADPH (m/z 744). Commercial THF, CH = THF, CH2–THF, NADPH, and NADP+ were used to determine retention times and generate standard curves for chemical quantification.

Supplementary Material

sb5c00352_si_001.pdf (922.9KB, pdf)

Acknowledgments

The information, data, or work presented herein was funded in part by the Advanced Research Project Agency-Energy (ARPA-E), U.S. Department of Energy under Award Number DE-AR0001514 and the U.S. Department of Energy, Office of Science, Biological and Environmental Research Program, under Award Number DE-SC0023091. This report was prepared as an account of work sponsored by an agency of the United States Government. Neither the United States Government nor any agency thereof, nor any of their employees, makes any warranty, express or implied, or assumes any legal liability or responsibility for the accuracy, completeness, or usefulness of any information, apparatus, product, or process disclosed, or represents that its use would not infringe privately owned rights. Reference herein to any specific commercial product, process, or service by trade name, trademark, manufacturer, or otherwise does not necessarily constitute or imply its endorsement, recommendation, or favoring by the United States Government or any agency thereof. The views and opinions of the authors expressed herein do not necessarily state or reflect those of the United States Government or any agency thereof.

All the data generated or analyzed during this study is included in the published article and its Supporting Information files.

The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acssynbio.5c00352.

  • Properties of the enzymes in the thermophilic formate-to-serine biocatalyst; stoichiometric yield calculations; formate incorporation calculations; table of reagents; table of solvents; table of kits; table of plasmids; table of primers; table of promoter and terminator; sequences of genes evaluated; table of commercial reagent prices used in cost analysis; images of plain CFE and the CFE-based thermophilic module 1 biocatalyst during different steps of the bioprocess workflow; stability of CH2–THF at 30 °C over time; extended comparisons between groups shown in Figure 4D; concentration of glycine/serine from formate as a function of simultaneous reduction in THF and NADH concentrations; individual serine and glycine concentrations shown in Figure 6B; efficiency of Fmoc derivatization of serine and glycine; LC/MS traces of commercial Fmoc serine and Fmoc glycine in plain cell-free expression (CFE); standard curves of Fmoc serine and Fmoc glycine using derivatized Fmoc serine and glycine under the bioprocess workflow conditions used to quantify serine and glycine concentrations in this study; LC/MS traces of commercial tetrahydrofolate (THF), 5,10-methenyltetrahydrofolate (CH = THF), 5,10-methylenetetrahydrofolate (CH2–THF), NADPH, and NADP+ in plain cell-free expression (CFE); and standard curves using commercial tetrahydrofolate (THF), 5,10-methenyltetrahydrofolate (CH = THF), 5,10-methylenetetrahydrofolate (CH2–THF), NADPH, and NADP+ (PDF)

∇.

P.P.-Y. is the lead contact. The project was conceived by P.P.-Y, R.W., S.C., R.C., J.M.C., and A.S.B. CFE-based catalyst engineering and process design were done by R.W., S.C., K.W., and P.P.-Y. The experiments were performed by R.W., S.C., and K.W. Data analysis was performed by R.W., S.C., K.W., and P.P.-Y. Project supervision was done by P.P.-Y. The manuscript was written by P.P.-Y. and R.W. The manuscript was edited by S.C., R.C., K.W., A.S.B., and J.M.C. All authors approved the submission of the manuscript.

The authors declare the following competing financial interest(s): R.W., S.C., R.C., K.W., A.S.B., J.M.C., and P. P-Y have filed a provisional patent application based on this work.

References

  1. Yishai O., Bouzon M., Döring V., Bar-Even A.. In Vivo Assimilation of One-Carbon via a Synthetic Reductive Glycine Pathway in Escherichia coli. ACS Synth. Biol. 2018;7:2023–2028. doi: 10.1021/acssynbio.8b00131. [DOI] [PubMed] [Google Scholar]
  2. Liu Z., Wang K., Chen Y., Tan T., Nielsen J.. Third-generation biorefineries as the means to produce fuels and chemicals from CO2. Nat. Catal. 2020;3:274–288. doi: 10.1038/s41929-019-0421-5. [DOI] [Google Scholar]
  3. Nieh L.-Y.. et al. Evolutionary engineering of methylotrophic E. coli enables fast growth on methanol. Nat. Commun. 2024;15:8840. doi: 10.1038/s41467-024-53206-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Yu H., Liao J. C.. A modified serine cycle in Escherichia coli coverts methanol and CO2 to two-carbon compounds. Nat. Commun. 2018;9:3992. doi: 10.1038/s41467-018-06496-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bysani V. R., Alam A. S., Bar-Even A., Machens F.. Engineering and evolution of the complete Reductive Glycine Pathway in Saccharomyces cerevisiae for formate and CO2 assimilation. Metab. Eng. 2024;81:167–181. doi: 10.1016/j.ymben.2023.11.007. [DOI] [PubMed] [Google Scholar]
  6. Gonzalez De La Cruz J., Machens F., Messerschmidt K., Bar-Even A.. Core Catalysis of the Reductive Glycine Pathway Demonstrated in Yeast. ACS Synth. Biol. 2019;8:911–917. doi: 10.1021/acssynbio.8b00464. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Pontrelli S.. et al. Escherichia coli as a host for metabolic engineering. Metab. Eng. 2018;50:16–46. doi: 10.1016/j.ymben.2018.04.008. [DOI] [PubMed] [Google Scholar]
  8. Bang J., Lee S. Y.. Assimilation of formic acid and CO2 by engineered Escherichia coli equipped with reconstructed one-carbon assimilation pathways. Proc. Natl. Acad. Sci. U.S.A. 2018;115:E9271–E9279. doi: 10.1073/pnas.1810386115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bang J., Hwang C. H., Ahn J. H., Lee J. A., Lee S. Y.. Escherichia coli is engineered to grow on CO2 and formic acid. Nat. Microbiol. 2020;5:1459–1463. doi: 10.1038/s41564-020-00793-9. [DOI] [PubMed] [Google Scholar]
  10. Tashiro Y., Hirano S., Matson M. M., Atsumi S., Kondo A.. Electrical-biological hybrid system for CO2 reduction. Metab. Eng. 2018;47:211–218. doi: 10.1016/j.ymben.2018.03.015. [DOI] [PubMed] [Google Scholar]
  11. Mitic B. M., Troyer C., Lutz L., Baumschabl M., Hann S., Mattanovich D.. The oxygen-tolerant reductive glycine pathway assimilates methanol, formate and CO2 in the yeast Komagataella phaffii. Nat. Commun. 2023;14:7754. doi: 10.1038/s41467-023-43610-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Kim S., Giraldo N., Rainaldi V., Machens F., Collas F., Kubis A., Kensy F., Bar-Even A., Lindner S. N.. Optimizing E. coli as a formatotrophic platform for bioproduction via the reductive glycine pathway. Front Bioeng Biotechnol. 2023;11:1091899. doi: 10.3389/fbioe.2023.1091899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Chowdhury S.. et al. Carbon Negative Synthesis of Amino Acids Using a Cell-Free-Based Biocatalyst. ACS Synth. Biol. 2024;13:3961–3975. doi: 10.1021/acssynbio.4c00359. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Kruyer N. S.. et al. Membrane Augmented Cell-Free Systems: A New Frontier in Biotechnology. ACS Synth. Biol. 2021;10:670–681. doi: 10.1021/acssynbio.0c00625. [DOI] [PubMed] [Google Scholar]
  15. Laohakunakorn N., Grasemann L., Lavickova B., Michielin G., Shahein A., Swank Z., Maerkl S. J.. Bottom-Up Construction of Complex Biomolecular Systems With Cell-Free Synthetic Biology. Front Bioeng Biotech. 2020;8:213. doi: 10.3389/fbioe.2020.00213. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Rasor B. J.. et al. Toward sustainable, cell-free biomanufacturing. Curr. Opin. Biotechnol. 2021;69:136–144. doi: 10.1016/j.copbio.2020.12.012. [DOI] [PubMed] [Google Scholar]
  17. Cardiff R. A. L., Chowdhury S., Sugianto W., Tickman B. I., Burbano D. A., Meyer P. A., Cook M., King B., Garenne D., Beliaev A. S.. et al. Carbon-conserving Bioproduction of Malate in an E. coli-based Cell-Free System. Metab. Eng. 2025;91:59–76. doi: 10.1101/2024.11.26.623433. [DOI] [PubMed] [Google Scholar]
  18. Michel-Reydellet N., Calhoun K., Swartz J.. Amino acid stabilization for cell-free protein synthesis by modification of the Escherichia coli genome. Metab. Eng. 2004;6:197–203. doi: 10.1016/j.ymben.2004.01.003. [DOI] [PubMed] [Google Scholar]
  19. Dudley Q. M., Anderson K. C., Jewett M. C.. Cell-Free Mixing of Escherichia coli Crude Extracts to Prototype and Rationally Engineer High-Titer Mevalonate Synthesis. ACS Synth. Biol. 2016;5:1578–1588. doi: 10.1021/acssynbio.6b00154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Bujara M., Schümperli M., Billerbeck S., Heinemann M., Panke S.. Exploiting cell-free systems: Implementation and debugging of a system of biotransformations. Biotechnol. Bioeng. 2010;106:376–389. doi: 10.1002/bit.22666. [DOI] [PubMed] [Google Scholar]
  21. Levitskaya Z.. et al. Engineering cell-free systems by chemoproteomic-assisted phenotypic screening. RSC Chemical Biology. 2024;5:372–385. doi: 10.1039/D4CB00004H. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Yi X.. et al. Establishing a versatile toolkit of flux enhanced strains and cell extracts for pathway prototyping. Metab. Eng. 2023;80:241–253. doi: 10.1016/j.ymben.2023.10.008. [DOI] [PubMed] [Google Scholar]
  23. Richardson K. N., Black W. B., Li H.. Aldehyde Production in Crude Lysate-and Whole Cell-Based Biotransformation Using a Noncanonical Redox Cofactor System. ACS Catal. 2020;10:8898–8903. doi: 10.1021/acscatal.0c03070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Dinglasan J. L. N., Doktycz M. J.. Rewiring cell-free metabolic flux in E. coli lysates using a block-push-pull approach. Synthetic Biology. 2023;8:ysad007. doi: 10.1093/synbio/ysad007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Garcia D. C., Dinglasan J. L. N., Shrestha H., Abraham P. E., Hettich R. L., Doktycz M. J.. A lysate proteome engineering strategy for enhancing cell-free metabolite production. Metab Eng. Commun. 2021;12:e00162. doi: 10.1016/j.mec.2021.e00162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Mackey B. M., Miles C. A., Seymour D. A., Parsons S. E.. Thermal denaturation and loss of viability in Escherichia coli and Bacillus stearothermophilus. Lett. Appl. Microbiol. 1993;16:56–58. doi: 10.1111/j.1472-765X.1993.tb00341.x. [DOI] [Google Scholar]
  27. Ninh P. H., Honda K., Sakai T., Okano K., Ohtake H.. Assembly and Multiple Gene Expression of Thermophilic Enzymes in Escherichia coli for In Vitro Metabolic Engineering. Biotechnol. Bioeng. 2015;112:189–196. doi: 10.1002/bit.25338. [DOI] [PubMed] [Google Scholar]
  28. Zhang X., Wu H., Huang B., Li Z., Ye Q.. One-pot synthesis of glutathione by a two-enzyme cascade using a thermophilic ATP regeneration system. J. Biotechnol. 2017;241:163–169. doi: 10.1016/j.jbiotec.2016.11.034. [DOI] [PubMed] [Google Scholar]
  29. Jia X., Liu Y., Han Y.. A thermophilic cell-free cascade enzymatic reaction for acetoin synthesis from pyruvate. Sci. Rep. 2017;7:4333. doi: 10.1038/s41598-017-04684-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Wang W., Liu M., You C., Li Z., Zhang Y. H. P.. ATP-free biosynthesis of a high-energy phosphate metabolite fructose 1,6-diphosphate by in vitro metabolic engineering. Metab. Eng. 2017;42:168–174. doi: 10.1016/j.ymben.2017.06.006. [DOI] [PubMed] [Google Scholar]
  31. You C., Shi T., Li Y., Han P., Zhou X., Zhang Y. P.. An In Vitro Synthetic Biology Platform for the Industrial Biomanufacturing of Myo-Inositol From Starch. Biotechnol. Bioeng. 2017;114:1855–1864. doi: 10.1002/bit.26314. [DOI] [PubMed] [Google Scholar]
  32. Cheng K., Zheng W., Chen H., Zhang Y. H. P. J.. Upgrade of wood sugar D-xylose to a value-added nutraceutical by in vitro metabolic engineering. Metab. Eng. 2019;52:1–8. doi: 10.1016/j.ymben.2018.10.007. [DOI] [PubMed] [Google Scholar]
  33. Petroll K., Care A., Bergquist P. L., Sunna A.. A novel framework for the cell-free enzymatic production of glucaric acid. Metab. Eng. 2020;57:162–173. doi: 10.1016/j.ymben.2019.11.003. [DOI] [PubMed] [Google Scholar]
  34. Imura M., Etoh S., Iwakiri R., Okano K., Honda K.. Improvement of production yield of L-cysteine through in vitro metabolic pathway with thermophilic enzymes. J. Biosci. Bioeng. 2021;132:585–591. doi: 10.1016/j.jbiosc.2021.09.003. [DOI] [PubMed] [Google Scholar]
  35. Endoh T.. et al. Cell-free protein synthesis at high temperatures using the lysate of a hyperthermophile. J. Biotechnol. 2006;126:186–195. doi: 10.1016/j.jbiotec.2006.04.010. [DOI] [PubMed] [Google Scholar]
  36. Krutsakorn B.. et al. In vitro production of n-butanol from glucose. Metab. Eng. 2013;20:84–91. doi: 10.1016/j.ymben.2013.09.006. [DOI] [PubMed] [Google Scholar]
  37. Kruglikov A., Wei Y., Xia X.. Proteins from Thermophilic Thermus thermophilus Often Do Not Fold Correctly in a Mesophilic Expression System Such as Escherichia coli. ACS Omega. 2022;7:37797–37806. doi: 10.1021/acsomega.2c04786. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Salas-Bruggink D. I. J., Martín J. S.-S., Leiva G., Blamey J. M.. Extremozymes: Challenges and opportunities on the road to novel enzymes production. Process Biochem. 2024;143:323–336. doi: 10.1016/j.procbio.2024.04.035. [DOI] [Google Scholar]
  39. Lo Gullo G.. et al. Optimization of an in Vitro Transcription/Translation System Based on Sulfolobus solfataricus Cell Lysate. Archaea. 2019;2019:1. doi: 10.1155/2019/9848253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Ribeiro A. L. J. L.. et al. Thermostable in vitro transcription-translation compatible with microfluidic droplets. Microb Cell Fact. 2024;23:169. doi: 10.1186/s12934-024-02440-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Xu Y. Y., Ren J., Wang W., Zeng A. P.. Improvement of glycine biosynthesis from one-carbon compounds and ammonia catalyzed by the glycine cleavage system in vitro. Eng. Life Sci. 2022;22:40–53. doi: 10.1002/elsc.202100047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Wu R.. et al. Enzymatic Electrosynthesis of Glycine from CO(2) and NH(3) Angew. Chem., Int. Ed. Engl. 2023;62:e202218387. doi: 10.1002/anie.202218387. [DOI] [PubMed] [Google Scholar]
  43. Zou Y.. et al. Crystal Structures of Phosphite Dehydrogenase Provide Insights into Nicotinamide Cofactor Regeneration. Biochemistry. 2012;51:4263–4270. doi: 10.1021/bi2016926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Garamella J., Marshall R., Rustad M., Noireaux V.. The All E-coli TX-TL Toolbox 2.0: A Platform for Cell-Free Synthetic Biology. ACS Synth. Biol. 2016;5:344–355. doi: 10.1021/acssynbio.5b00296. [DOI] [PubMed] [Google Scholar]
  45. Taniguchi H., Imura M., Okano K., Honda K.. Developing a single strain for in vitro salvage synthesis of NAD+ at high temperatures and its potential for bioconversion. Microb Cell Fact. 2019;18:75. doi: 10.1186/s12934-019-1125-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Jones M. L., Nixon P. F.. Tetrahydrofolates Are Greatly Stabilized by Binding to Bovine Milk Folate-Binding Protein. J. Nutr. 2002;132:2690–2694. doi: 10.1093/jn/132.9.2690. [DOI] [PubMed] [Google Scholar]
  47. Indrawati, Van Loey A., Hendrickx M.. Pressure and temperature stability of 5-methyltetrahydrofolic acid: A kinetic study. J. Agric. Food Chem. 2005;53:3081–3087. doi: 10.1021/jf048370k. [DOI] [PubMed] [Google Scholar]
  48. Chen L., Zhou C., Yang H., Roberts M. F.. Inositol-1-phosphate synthase from Archaeoglobus fulgidus is a class II aldolase. Biochemistry. 2000;39:12415–12423. doi: 10.1021/bi001517q. [DOI] [PubMed] [Google Scholar]
  49. Chang A.. et al. BRENDA, the ELIXIR core data resource in 2021: New developments and updates. Nucleic Acids Res. 2021;49:D498–D508. doi: 10.1093/nar/gkaa1025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Radfar R.. et al. Cation binding and thermostability of FTHFS monovalent cation binding sites and thermostability of N10-formyltetrahydrofolate synthetase from Moorella thermoacetica. Biochemistry. 2000;39:14481–14486. doi: 10.1021/bi001577w. [DOI] [PubMed] [Google Scholar]
  51. O’Brien W. E., Brewer J. M., Ljungdahl L. G.. Purification and characterization of thermostable 5,10-methylenetetrahydrofolate dehydrogenase from Clostridium thermoaceticum. J. Biol. Chem. 1973;248:403–408. doi: 10.1016/S0021-9258(19)44387-1. [DOI] [PubMed] [Google Scholar]
  52. Ljungdahl L. G., O’Brien W. E., Moore M. R., Liu M.-T.. Methylenetetrahydrofolate Dehydrogenase from Clostridium formicoaceticum and Methylenetetrahydrofolate Dehydrogenase, Methenyltetrahydrofolate Cyclohydrolase (Combined) from Clostridium thermoaceticum. Methods Enzymol. 1980;66:599–609. doi: 10.1016/0076-6879(80)66513-6. [DOI] [PubMed] [Google Scholar]
  53. Leaphart A. B., Trent Spencer H., Lovell C. R.. Site-directed mutagenesis of a potential catalytic and formyl phosphate binding site and substrate inhibition of N10-formyltetrahydrofolate synthetase. Arch. Biochem. Biophys. 2002;408:137–143. doi: 10.1016/S0003-9861(02)00552-0. [DOI] [PubMed] [Google Scholar]
  54. Jia D., Deng W., Hu P., Jiang W., Gu Y.. Thermophilic Moorella thermoacetica as a platform microorganism for C1 gas utilization: physiology, engineering, and applications. Bioresour Bioprocess. 2023;10:61. doi: 10.1186/s40643-023-00682-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Hu P., Rismani-Yazdi H., Stephanopoulos G.. Anaerobic CO2 fixation by the acetogenic bacterium Moorella thermoacetica. AIChE J. 2013;59:3176–3183. doi: 10.1002/aic.14127. [DOI] [Google Scholar]
  56. Rahayu F.. et al. Thermophilic ethanol fermentation from lignocellulose hydrolysate by genetically engineered Moorella thermoacetica. Bioresour. Technol. 2017;245:1393–1399. doi: 10.1016/j.biortech.2017.05.146. [DOI] [PubMed] [Google Scholar]
  57. Sarria S., Bartholow T. G., Verga A., Burkart M. D., Peralta-Yahya P.. Matching Protein Interfaces for Improved Medium-Chain Fatty Acid Production. ACS Synth. Biol. 2018;7:1179–1187. doi: 10.1021/acssynbio.7b00334. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Karp P. D.. et al. The BioCyc collection of microbial genomes and metabolic pathways. Brief Bioinform. 2019;20:1085–1093. doi: 10.1093/bib/bbx085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Wu J. T., Wu L. H., Knight J. A.. Stability of NADPH: Effect of Various Factors on the Kinetics of Degradation. Clin Chem. 1986;32:314. doi: 10.1093/clinchem/32.2.314. [DOI] [PubMed] [Google Scholar]
  60. Bowman M. A.. et al. Properties of protein unfolded states suggest broad selection for expanded conformational ensembles. Proc. Natl. Acad. Sci. U.S.A. 2020;117:23356–23364. doi: 10.1073/pnas.2003773117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Clark P. L., Plaxco K. W., Sosnick T. R.. Water as a Good Solvent for Unfolded Proteins: Folding and Collapse are Fundamentally Different. J. Mol. Biol. 2020;432:2882–2889. doi: 10.1016/j.jmb.2020.01.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Hyatt D. C., Maley F., Montfort W. R.. Use of Strain in a Stereospecific Catalytic Mechanism: Crystal Structures of Escherichia coli Thymidylate Synthase Bound to FdUMP and Methylenetetrahydrofolate. Biochemistry. 1997;36:4585–4594. doi: 10.1021/bi962936j. [DOI] [PubMed] [Google Scholar]
  63. Harvey R. J., Dev I. K.. Regulation in the foláte pathway of Escherichia coli. Adv. Enzym Regul. 1975;13:97–124. doi: 10.1016/0065-2571(75)90010-2. [DOI] [PubMed] [Google Scholar]
  64. Verlinde P. H. C. J., Oey I., Deborggraeve W. M., Hendrickx M. E., Van Loey A. M.. Mechanism and Related Kinetics of 5-Methyltetrahydrofolic Acid Degradation during Combined High Hydrostatic Pressure–Thermal Treatments. J. Agric. Food Chem. 2009;57:6803–6814. doi: 10.1021/jf900832g. [DOI] [PubMed] [Google Scholar]
  65. Lewin A. H.. et al. Synthesis and physicochemical characterization of the one-carbon carrier 10-formyltetrahydrofolate; a reference standard for metabolomics. Metabolomics. 2017;13:117. doi: 10.1007/s11306-017-1256-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Tyagi A., Penzkofer A., Batschauer A., Wolf E.. Thermal degradation of (6R,S)-5,10-methenyltetrahydrofolate in aqueous solution at pH 8. Chem. Phys. 2009;358:132–136. doi: 10.1016/j.chemphys.2009.01.005. [DOI] [Google Scholar]
  67. Hofmann D., Wirtz A., Santiago-Schübel B., Disko U., Pohl M.. Structure elucidation of the thermal degradation products of the nucleotide cofactors NADH and NADPH by nano-ESI-FTICR-MS and HPLC-MS. Anal. Bioanal. Chem. 2010;398:2803–2811. doi: 10.1007/s00216-010-4111-z. [DOI] [PubMed] [Google Scholar]
  68. Johannes T. W., Woodyer R. D., Zhao H.. Directed evolution of a thermostable phosphite dehydrogenase for NAD­(P)H regeneration. Appl. Environ. Microbiol. 2005;71:5728–5734. doi: 10.1128/AEM.71.10.5728-5734.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Bowie J. U.. et al. Synthetic Biochemistry: The Bio-inspired Cell-Free Approach to Commodity Chemical Production. Trends Biotechnol. 2020;38:766–778. doi: 10.1016/j.tibtech.2019.12.024. [DOI] [PubMed] [Google Scholar]
  70. Iwamoto S.. et al. Use of an Escherichia coli recombinant producing thermostable polyphosphate kinase as an ATP regenerator to produce fructose 1,6-diphosphate. Appl. Environ. Microbiol. 2007;73:5676–5678. doi: 10.1128/AEM.00278-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Restiawaty E.. et al. Feasibility of thermophilic adenosine triphosphate-regeneration system using Thermus thermophilus polyphosphate kinase. Process Biochem. 2011;46:1747–1752. doi: 10.1016/j.procbio.2011.05.021. [DOI] [Google Scholar]
  72. Bateman A.. et al. UniProt: the Universal Protein Knowledgebase in 2025. Nucleic Acid Res. 2025;53:D609–D617. doi: 10.1093/nar/gkae1010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Ziegler J., Abel S.. Analysis of amino acids by HPLC/electrospray negative ion tandem mass spectrometry using 9-fluorenylmethoxycarbonyl chloride (Fmoc-Cl) derivatization. Amino Acids. 2014;46:2799–2808. doi: 10.1007/s00726-014-1837-5. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

sb5c00352_si_001.pdf (922.9KB, pdf)

Data Availability Statement

All the data generated or analyzed during this study is included in the published article and its Supporting Information files.


Articles from ACS Synthetic Biology are provided here courtesy of American Chemical Society

RESOURCES