Skip to main content
Open Biology logoLink to Open Biology
. 2021 Feb 3;11(2):200173. doi: 10.1098/rsob.200173

Concerns with computational protein engineering programmes IPRO and OptMAVEn and metabolic pathway engineering programme optStoic

Thomas K Wood 1,
PMCID: PMC8061685  PMID: 33529550

Abstract

It has become customary in engineering to require a modelling component in research endeavours. In addition, as the code for these models becomes more byzantine in complexity, it is difficult for reviewers and readers to discern their value and understand the underlying code. This opinion piece summarizes the negative experience of the author with the IPRO and OptMAVEn computational protein engineering models as well as problems with the optStoic metabolic pathway model. In our hands, these models often fail to predict reliable ways to engineer proteins and metabolic pathways.

Keywords: protein engineering, modelling, computational pathway engineering

1. Review

In the following, I describe our experience with three computational programmes that were designed to improve proteins and metabolic pathways.

1.1. Optimal method for antibody variable region engineering (OptMAVEn) for single-chain antibodies

As part of a National Science Foundation grant (CBET 1133040), we investigated the de novo protein design of fully human antibody variable domains for binding a specified antigen using OptMAVEn [1]. In OptMAVEn, possible antigen-binding conformations are generated for a given antigen, then the top scored antigen conformations and antibody models are assembled by combinations of six modular antibody parts and random mutations are introduced to the antibody models for improved antigen-binding affinity.

We focused on trying to improve an existing antibody, 2D10, which is a single-chain antibody (scFv) that recognizes the dodecapeptide DVFYPYPYASGS, a peptide mimic of mannose-containing carbohydrates. As a result of the OptMAVEn predictions [2], we cloned and purified five de novo designed scFvs and verified their correct folding. Of these five, two predicted ScFvs had no binding to the dodecapeptide, and the other three ScFvs had less superior binding by a factor of 2.3–6.2 fold [2]. Hence, in our experience, OptMAVEn was unable to create ScFvs that have superior binding or even equivalent binding.

1.2. Iterative protein redesign and optimization procedure (IPRO) for improved dioxygenases

As part of a National Science Foundation grant (BES-0114126), we successfully optimized naphthalene dioxygenase (NDO) of Ralstonia sp. strain U2 for oxidizing the benzene ring and the methyl-group of substituted toluene compounds using random protein engineering [3]. We had hoped to use computational methods to help identify additional substitutions that would be beneficial for increasing the activity of the dioxygenase for nitroaromatic pollutants. With this goal, IPRO [4] was performed on the large subunit NagAc of NDO with the aim of optimizing the docking of the substrate 2,3-dinitrotoluene to favour the formation of the intermediate 4-methyl-3-nitrocatechol. IPRO is designed to use energy-based scoring to identify beneficial substitutions in native protein sequences.

Unfortunately, after its construction via site-directed mutagenesis, the IPRO-predicted NagAc variant, L225R/L251I/V258N/F350I, when analysed for 2,3-dinitrotoluene activity by HPLC and a nitrite assay, was completely inactive toward the substrate 2,3-dinitrotoluene. SDS–PAGE analysis showed that this IPRO-predicted variant NagAc L225R/L251I/V258N/F350I was produced at normal levels. Hence, in our experience, IPRO was unable to predict amino acid substitutions that increase protein activity.

1.3. IPRO for improved anaerobic oxidation of methane

As part of an Advanced Research Projects Agency–Energy grant, our group reversed methanogenesis for the first time by engineering an archaeal strain so that it could grow on methane as a pure culture [5]. This required cloning methyl-coenzyme M reductase (Mcr) from an unculturable organism (which was part of an anaerobic methanotrophic archaeal population in a Black Sea mat) into Methanosarcina acetivorans and using 10 mM FeCl3 as an electron acceptor [5]. This led to the engineered archaeal strain that could produce acetate [5], lactate [6] and electricity [7].

As part of this project, we tested the IPRO predictions for engineering Mcr, the enzyme for capturing methane as part of reversed methanogenesis, by assaying for improved enzyme kinetics (i.e. greater consumption of methane). Critically, we tested the top two model predictions for engineering McrA to improve F430 cofactor binding. After sequencing to confirm the plasmid constructs were correct, we tested McrA V419 K and found the predicted substitution not only did not improve methane capture, it abolished methane capture (i.e. it inactivated the enzyme). We also tested the predicted substitutions McrA M78R/H157D/V419 K and found these predictions also abolished methane capture. Hence, in our hands, IPRO is unable to improve Mcr.

1.4. Optimum overall stoichiometry (optStoic) de novo metabolic pathway modelling

As noted in the previous section, our laboratory discovered how to reverse methanogenesis by using the external electron acceptor ferric iron, to generate acetate in the autumn of 2014; these results were subsequently published in 2016 by our group [5]. To our surprise, our laboratory results on using ferric iron to reverse methanogenesis with cloned Mcr in an archaeal strain were published without our approval and without acknowledgement as a means to give an example of the power of the optStoic metabolic pathway modelling routine [8]. optStoic is designed to identify the optimum overall stoichiometry that maximizes carbon, energy or price efficiency based on thermodynamic constraints. Our laboratory research results were known to the group that published them since this group was part of the same ARPA-E research grant and received monthly reports. Strikingly, they used our laboratory results as one of the examples to demonstrate the robustness of their ‘model’ prior to our own publication of these results.

The common features between our proven laboratory discovery and their ‘modelling predictions' include: (i) anaerobic capture of methane for growth by a pure culture (which had not been demonstrated previously); (ii) methane conversion to the same end product, acetate (iii) metabolism based on the same electron acceptor, ferric iron; (iv) metabolism based on the same enzyme, Mcr; and (v) metabolism based on the identical archaeal host, M. acetivorans. Hence, the model ‘predictions’ (e.g. the importance of ferric iron, generation of acetate) were published using our known, albeit unacknowledged, experimental results.

2. Perspectives

The main conclusion from our use of the IPRO and OptMAVEn protein engineering models is that they are often incapable of predicting substitutions that improve protein function. Hence, evaluation of these kinds of models should be predicated on a positive control being performed in which the model predicts a priori some known beneficial substitution that is experimentally verified. It should be noted that other in silico protein engineering approaches exist, such as computer-aided directed evolution of enzymes [9], which successfully screened the effect of 128 substitutions in triosephosphate isomerase from Saccaromyces cerevisiae. Moreover, a Rosetta enzyme design approach for sampling the sequence and conformational space of partial active site randomization led to the synthesis of four chiral β-amino acids via an aspartase from Bacillus sp. YM55-1, without laboratory evolution [10].

In addition, it is worth noting that without a doubt, experimentalists often get it wrong, too, without the aid of protein modellers. For example, it has been predicted in a prestigious journal that the archaeal strain M. acetivorans can grow on methane faster than Escherichia coli can grow on glucose [11], which is highly unlikely.

Furthermore, with 19 possible substitutions for every amino acid and an average protein size of 333 aa, there are 19333 possible substitutions, so the protein space for models (and experimentalists) have to cover is enormous. However, of the two approaches for protein engineering, experimental random mutagenesis versus computational protein engineering, a Nobel Prize has been given only for the experimental approach [12]. Moreover, over two decades, we have had success in using random protein engineering to create better catalysts and regulators by using DNA shuffling with monooxygenases [1317], dioxygenases [3], epoxide hydrolases [18], toxin/antitoxin systems [19] and biofilm regulators [2023]. However, we have never had success in using computational protein engineering for improving either enzyme activity or protein binding. The crux is that these computational models should be vetted more thoroughly before their predictions are accepted, and experimentalists should use these programmes with discretion.

Data accessibility

This article has no additional data.

Competing interests

I declare I have no competing interests.

Funding

This work was supported by funds derived from the Biotechnology Endowed Professorship at the Pennsylvania State University.

References

  • 1.Li T, Pantazes RJ, Maranas CD. 2014. OptMAVEn—a new framework for the de novo design of antibody variable region models targeting specific antigen epitopes. PLoS ONE 9, e105954. ( 10.1371/journal.pone.0105954) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Poosarla VG, Li T, Goh BC, Schulten K, Wood TK, Maranas CD. 2017. Computational de novo design of antibodies binding to a peptide with high affinity. Biotechnol. Bioeng. 114, 1331-1342. ( 10.1002/bit.26244) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Keenan BG, Leungsakul T, Smets BF, Mori M-A, Henderson DE, Wood TK. 2005. Protein engineering of the archetypal nitroarene dioxygenase of Ralstonia sp. strain U2 for activity on aminonitrotoluenes and dinitrotoluenes through alpha-subunit residues leucine 225, phenylalanine 350, and glycine 407. J. Bacteriol. 187, 3302-3310. ( 10.1128/JB.187.10.3302-3310.2005) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Saraf MC, Moore GL, Goodey NM, Cao VY, Benkovic SJ, Maranas CD. 2006. IPRO: an iterative computational protein library redesign and optimization procedure. Biophys. J. 90, 4167-4180. ( 10.1529/biophysj.105.079277) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Soo VWC, et al. 2016. Reversing methanogenesis to capture methane for liquid biofuel precursors. Microb. Cell Factories 15, 11. ( 10.1186/s12934-015-0397-z) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.McAnulty MJ, Poosarla VG, Li J, Soo VWC, Zhu F, Wood TK. 2017. Metabolic engineering of Methanosarcina acetivorans for lactate production from methane. Biotechnol. Bioeng. 114, 852-861. ( 10.1002/bit.26208) [DOI] [PubMed] [Google Scholar]
  • 7.McAnulty MJ, Poosarla VG, Kim KY, Jasso-Chávez R, Logan BE, Wood TK. 2017. Electricity from methane by reversing methanogenesis. Nat. Commun. 8, 15419. ( 10.1038/ncomms15419) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Chowdhury A, Maranas CD. 2015. Designing overall stoichiometric conversions and intervening metabolic reactions. Sci. Rep. 5, 16009. ( 10.1038/srep16009) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Amrein BA, Steffen-Munsberg F, Szeler I, Purg M, Kulkarni Y, Kamerlin SCL. 2017. CADEE: computer-aided directed evolution of enzymes. IUCrJ 4, 50-64. ( 10.1107/S2052252516018017) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Li R, et al. 2018. Computational redesign of enzymes for regio- and enantioselective hydroamination. Nat. Chem. Biol. 14, 664-670. ( 10.1038/s41589-018-0053-0) [DOI] [PubMed] [Google Scholar]
  • 11.Yan Z, Joshi P, Gorski CA, Ferry JG. 2018. A biochemical framework for anaerobic oxidation of methane driven by Fe(III)-dependent respiration. Nat. Commun. 9, 1642. ( 10.1038/s41467-018-04097-9) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Arnold FH. 2019. Innovation by evolution: bringing new chemistry to life (Nobel Lecture). Angew. Chem. Int. Ed. 58, 14 420-14 426. ( 10.1002/anie.201907729) [DOI] [PubMed] [Google Scholar]
  • 13.Rui L, Reardon KF, Wood TK. 2005. Protein engineering of toluene ortho-monooxygenase of Burkholderia cepacia G4 for regiospecific hydroxylation of indole to form various indigoid compounds. Appl. Microbiol. Biotechnol. 66, 422-429. ( 10.1007/s00253-004-1698-z) [DOI] [PubMed] [Google Scholar]
  • 14.Canada KA, Iwashita S, Shim H, Wood TK. 2002. Directed evolution of toluene ortho-monooxygenase for enhanced 1-naphthol synthesis and chlorinated ethene degradation. J. Bacteriol. 184, 344-349. ( 10.1128/JB.184.2.344-349.2002) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Fishman A, Tao Y, Bentley W, Wood TK. 2004. Protein engineering of toluene 4-monooxygenase of Pseudomonas mendocina KR1 for synthesizing 4-nitrocatechol from nitrobenzene. Biotechnol. Bioeng. 87, 779-790. ( 10.1002/bit.20185) [DOI] [PubMed] [Google Scholar]
  • 16.Vardar G, Wood TK. 2005. Protein engineering of toluene-o-xylene monooxygenase from Pseudomonas stutzeri OX1 for oxidizing nitrobenzene to 3-nitrocatechol, 4-nitrocatehcol, and nitrohydroquinone. J. Biotechnol. 115, 145-156. ( 10.1016/j.jbiotec.2004.08.008) [DOI] [PubMed] [Google Scholar]
  • 17.Vardar G, Wood TK. 2004. Protein engineering of toluene-o-xylene monooxygenase from Pseudomonas stutzeri OX1 for synthesizing 4-methylresorcinol, methylhydroquinone, and pyrogallol. Appl. Environ. Microbiol. 70, 3253-3262. ( 10.1128/AEM.70.6.3253-3262.2004) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Rui L, Cao L, Chen W, Reardon KF, Wood TK. 2005. Protein engineering of epoxide hydrolase from Agrobacterium radiobacter AD1 for enhanced activity and enantioselective production of (R)-1-phenylethane-1,2-diol. Appl. Environ. Microbiol. 71, 3995-4003. ( 10.1128/AEM.71.7.3995-4003.2005) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Soo VWC, Cheng H-Y, Kwan BW, Wood TK. 2014. de novo synthesis of a bacterial toxin/antitoxin system. Sci. Rep. 4, 4807. ( 10.1038/srep04807) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Hong SH, Lee J, Wood TK. 2010. Engineering global regulator Hha of Escherichia coli to control biofilm dispersal. Microb. Biotechnol. 3, 717-728. ( 10.1111/j.1751-7915.2010.00220.x) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Hong SH, Wang X, Wood TK. 2010. Controlling biofilm formation, prophage excision, and cell death by rewiring global regulator H-NS of Escherichia coli. Microb. Biotechnol. 3, 344-356. ( 10.1111/j.1751-7915.2010.00164.x) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lee J, Maeda T, Hong SH, Wood TK. 2009. Reconfiguring the quorum-sensing regulator SdiA of Escherichia coli to control biofilm formation via indole and N-acylhomoserine lactones. Appl. Environ. Microbiol. 75, 1703-1716. ( 10.1128/AEM.02081-08) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Ma Q, Zhang G, Wood TK. 2011. Escherichia coli BdcA controls biofilm dispersal in Pseudomonas aeruginosa and Rhizobium meliloti. BMC Res. News 4, 447. ( 10.1186/1756-0500-4-447) [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

This article has no additional data.


Articles from Open Biology are provided here courtesy of The Royal Society

RESOURCES