Abstract
Machine learning (ML) is rapidly turning into a key technology for biocatalysis. By learning patterns in amino acid sequences, protein structures, and functional data, ML models can help navigate complex fitness landscapes, uncover new enzymes in databases, and even design biocatalysts de novo. Along with advances in DNA synthesis and sequencing, laboratory automation, and high-throughput screening, ML is increasing the speed and efficiency of enzyme development. In this Outlook, we highlight recent applications of ML in the fields of enzyme discovery, design, and engineering, with a focus on current challenges and emerging solutions. Furthermore, we discuss barriers that impede a broader and faster adoption of ML-based workflows in the biocatalysis community. We conclude by suggesting best practices for fostering effective collaborations in this interdisciplinary field.


Introduction
In just a few years, the landscape of protein design and engineering has experienced a remarkable transformation. Among many breakthroughs, we have witnessed the development of tailor-made protein binders capable of neutralizing snake venom, protein nanoparticle vaccines that protect against emerging viruses, and synthetic sensors for the detection of small molecules. These developments are the result of synergistic advances across several technological frontiers. In particular, the integration of machine learning (ML) into tasks like protein structure prediction, de novo design, and sequence-function mapping has significantly enhanced the capabilities of computational tools in protein science. In parallel, the decreasing cost of DNA synthesis and sequencing, as well as advances in high-throughput screening and lab automation, greatly facilitate the characterization of large numbers of proteins, which in turn yields valuable training data for ML models.
These developments also have a profound impact on biocatalysis. Enzyme development has long relied on the optimization of natural enzymes by directed evolution, which has enabled numerous industrial applications of biocatalysis. , Over time, this workflow has increasingly been augmented by computational tools. This transformation has accelerated with the introduction of ML techniques, which hold considerable promise in addressing some of the most significant challenges within biocatalysis. Key tasks in this field include the identification of promising enzymes in databases (enzyme discovery), the de novo design of entirely new enzymes (enzyme design), and the optimization of existing enzymes with respect to important properties (enzyme engineering). ML tools already often outperform previous approaches in enzyme design, and numerous studies have demonstrated the utility of ML for tasks such as enzyme discovery or engineering (see Figure for an overview of common ML-assisted workflows). At the same time, compared to the development of proteins for tasks like binding, enzymes present additional challenges for both computational and experimental approaches, for example due to the complexity of enzymatic mechanisms and the need for specialized analytical techniques. Thus, in many cases bespoke solutions are required to further enhance and accelerate enzyme development and render biocatalysis more widely applicable. Here, we provide an overview of new paradigms in enzyme development, with a particular emphasis on current challenges and emerging solutions. We consider computational tools, experimental technologies, as well as the sociocultural factors influencing the future trajectory of this field. For a more detailed discussion of specific topics, we refer the reader to recent reviews on enzyme discovery, enzyme design, , and computational enzyme engineering. ,,
1.
Exemplary ML-assisted workflows for the discovery, de novo design, and engineering of enzymes. (a) Enzyme discovery often begins with the mining of databases for enzymes that share some similarity to a template. This is aided by the facile structure prediction enabled by ML tools. Homology can be quantified using various metrics, including ML-based embeddings. Candidate sequences can also be evaluated based on predicted EC numbers and predicted properties such as thermal stability or ease of recombinant expression. (b) ML-assisted de novo enzyme design typically requires a model of a minimal active site, around which a protein backbone is designed. Subsequently, a compatible amino acid sequence is designed using an inverse folding model. Various in silico tools can then be employed to validate designs and select variants for experimental testing. (c) ML-assisted enzyme engineering is used to improve the activity (or other properties) of a given enzyme. This can involve the use of zero-shot ML methods to identify hotspots or promising mutations without the need for experimental data. Subsequently, selected variants are tested experimentally. The resulting sequence-function data are used to train a model, which can be used to predict the properties of unseen variants. Frequently, several rounds of experimental testing and modeling are performed.
Emerging Paradigms in Enzyme Development
Enzyme Discovery
Modern genome and metagenome projects have generated extensive repositories of protein sequences, numbering in the hundreds of millions. However, only a small fraction of these proteins has been experimentally characterized. Clearly, these sequence databases contain a multitude of enzymes with useful properties, but linking the sequences to their corresponding functions is challenging. Current initiatives aimed at addressing this issue are pursuing different approaches. On the one hand, automatically annotating sequences with functional attributes has been a longstanding goal in the field. On the other hand, researchers interested in a specific reaction can mine databases for enzymes that are likely to catalyze this reaction. Historically, both endeavors have relied primarily on sequence or structural similarity to known enzymes.
Enzyme mining typically requires a query sequence that has been shown to catalyze the reaction of interest. This sequence can be used to search for related sequences using BLAST, or more recent and faster methods such as MMseqs2. This step will typically yield hundreds or even thousands of sequence hits, which need to be filtered based on expert domain knowledge about active sites or other determinants of the desired property. This process can be partially automated using tools like EnzymeMiner, which combines sequence-based homology searches with filtering based on user-specified essential residues. To assist users in selecting variants for experimental testing, several additional properties are predicted, such as the likelihood of expressing the enzyme in soluble form in Escherichia coli.
Until recently, the search for enzymes based on structural motifs was restricted to a comparatively small number of proteins with experimentally determined structures. However, breakthroughs in protein structure prediction using ML, along with databases such as AlphaFold DB or ESM Metagenomic Atlas and structural homology search tools such as FoldSeek, now enable structure-based mining at unprecedented scale. This development is very promising, as structure is more conserved than sequence during evolution. The search can also focus on active site properties, facilitating the discovery of promiscuous enzymes that share similar active sites but very different sequences and folds. Despite these advances, the need for a template enzyme with rudimentary activity for the desired reaction (or a closely related one) remains a limitation of current enzyme mining approaches.
High-quality, standardized data sets remain a bottleneck for generalizable ML models of enzymatic properties.
Parallel efforts to reliably annotate enzyme sequences in databases can benefit enzyme mining projects and provide insights into various questions in basic research. Over the past years, numerous ML models have been developed that predict Enzyme Commission (EC) numbers from sequence. A particularly noteworthy example is a model dubbed CLEAN, which uses a contrastive learning framework to embed enzyme sequences in a space where distance correlates with functional similarity. This method, combined with powerful protein language model embeddings, has achieved a high accuracy in EC number prediction and was able to predict promiscuous activity. Other models also take predicted structures into consideration. For example, GraphEC uses geometric graph learning and first predicts the location of the active site, followed by EC number prediction. This led to further gains in accuracy.
Besides EC numbers, models have been developed to predict other relevant properties such as kinetic parameters. , As generalist models may not always achieve the required level of detail or accuracy, task-specific models have been developed to predict the properties of enzymes from specific families (e.g., glycosyltransferases , ). However, such approaches remain challenging due to the limited availability of functional data, the multifunctionality of enzymes, as well as inconsistent experimental procedures and reporting practices. Recent advances in automated reaction profiling promise to enhance the robustness of functional annotations, particularly in the context of revealing enzyme promiscuity.
In general, data-driven modeling paradigms rely on the accessibility, quantity, and quality of experimental data for training. The success of structure prediction by ML models is largely attributable to the availability of large, structured, consistent, and high-quality data sets on protein sequences and structures. In contrast, quantitative functional datasuch as k cat or K M values of enzyme-catalyzed reactionsare available only for a relatively small number of enzymes. Moreover, the fraction of enzyme sequences with functional experimental characterization is biased toward certain enzyme classes (e.g., hydrolases), while others are underrepresented. Another limitation is the accuracy of experimental annotations – enzyme databases often contain errors or lack critical metadata and details on data processing, which results in published values that may not be reproducible. Thus, it would be important to adopt standardized reporting practices (such as the STRENDA guidelines , ) and data formats (such as the markup language EnzymeML). Similarly, the reporting of model performance can be inconsistent, but community-driven standards are currently emerging.
Enzyme Design
Early examples of de novo enzyme design typically centered on conceptualizing an idealized active site – referred to as a theozyme – consisting of key catalytic residues positioned around a reaction’s transition state. This was followed by a search for native protein scaffolds capable of accommodating this active site using tools from the Rosetta macromolecular modeling suite, in some cases followed by short molecular dynamics (MD) simulations. This paradigm saw success in designing enzymes for reactions such as Kemp elimination, retro aldol reactions, and Diels–Alder reactions (among others), however, the activity of the resulting enzymes was low. Accordingly, extensive directed evolution efforts were required to turn the designed enzymes into proficient biocatalysts. , These early de novo designs were restricted by the limited number of naturally occurring, characterized scaffolds as well as the insufficient accuracy of design tools.
The significant leap in the accuracy of protein structure predictions enabled by deep learning tools such as AlphaFold2 and RosettaFold has profoundly influenced the field of enzyme design. On the one hand, these computational tools can be used to validate designs in silico, enabling researchers to focus experimental efforts on candidates most likely to adopt the desired fold. On the other hand, structure prediction models can be used to “hallucinate” novel structures by iteratively optimizing a random input sequence until it is predicted to fold into a stable protein. This process can be constrained in such a way that it generates a de novo scaffold around a desired motif, such as an enzyme active site. For example, constrained hallucination has been combined with Rosetta-based design to create novel luciferases exhibiting high activity and selectivity, although experimental screening of several thousand variants was still required.
More recently, RosettaFold has been refined for protein structure denoising tasks to create a generative model dubbed RFdiffusion. By starting from random noise, this model can generate plausible backbone structures that can be conditioned on various constraints, including binding sites or specific motifs. Compared to hallucination, RFdiffusion is more computationally efficient and offers ways of biased sampling to provide specific target structures. Further improvements were introduced with RFdiffusion2, which can generate enzymes directly from atom-level active site descriptions and was able to design scaffolds around a greater variety of active sites than RFdiffusion (41/41 vs 16/41) on an in silico benchmark.
Following the design of the protein’s backbone structure, an amino acid sequence that folds into the desired structure needs to be identified. This inverse folding problem has been addressed by ProteinMPNN, a graph neural network that outperforms previous tools such as Rosetta in terms of accuracy. Expanding on this framework, LigandMPNN enables the design of sequences that bind specific ligands. This is highly interesting for enzyme design as it can facilitate the design of active sites or cofactor binding regions. In this regard, it is also noteworthy that recent structure-prediction models like AlphaFold3, Chai-1, or Boltz-1 possess the capability to predict the structure of protein–ligand complexes. These tools promise to further enhance the design process and provide an alternative to classical protein–ligand docking methods.
In parallel to these structure-focused protein design strategies, a number of ML models that generate new protein sequences without explicitly considering structure have emerged in recent years. Among these, transformer-based protein language models (pLMs) trained on large data sets of natural protein sequences have demonstrated the ability to generate new proteins that are only distantly related to the ones in the training data sets. A challenge lies in steering these unsupervised models toward generating proteins with desired properties, such as specific enzymatic activities. A notable step in this direction is ZymCTRL, which has been trained on enzyme sequences as well as corresponding EC numbers. Moreover, ESM3 is a multimodal pLM that integrates information from protein sequences, structures, and functional annotations. It enables numerous tasks, including the scaffolding of active sites or function prediction from sequence alone. In addition, it has been demonstrated that pLMs can be fine-tuned to produce desired outputs by means of reinforcement learning, , for example using supervised models trained on experimental activity data. Despite their ability to generate new sequence variants for native enzymatic activities, such as highly diverse lysozymes, the use of pLMs for the design of entirely new enzymes (e.g., catalyzing non-natural reactions) remains to be demonstrated.
The development of ML-based design routines has led to noteworthy advances over previous enzyme design efforts. Most notably, recent methodologies enable the placement of catalytic residues within de novo backbones with very high accuracy, thus addressing one of the primary limitations of initial Rosetta-based design routines. However, to obtain highly active enzymes, it is critical to start the design process from a catalytically competent theozyme. In a recent study, the catalytic tetrad of an experimentally optimized retro-aldolase was placed in de novo backbones with custom substrate pockets using a diffusion-based pipeline. The resulting designs exhibited remarkable accuracy and displayed activities that exceeded those of earlier computationally designed retro-aldolases by several orders of magnitude. However, accomplishing similar outcomes for novel reactions will require more advanced pipelines for computing highly accurate theozymes.
While transition-state stabilization is a basic tenet of enzymatic catalysis, the findings from the field of enzyme design underscore previous observations that this factor alone is not sufficient to rationalize the formidable activity of enzymes. Another critical aspect is the influence of conformational dynamics and active-site preorganization. While design routines typically operate on static structures, it is more accurate to consider the ensemble of structures that an enzyme can adopt in solution. Some structures in this ensemble will be more catalytically competent than others, and mutations that improve preorganization by shifting the distribution toward the competent conformation can have a substantial impact on activity. Predicting which mutations will trigger the desired conformational change is challenging, although some strategies based on MD simulations in combination with the correlation-based method Shortest Path Map (SPM) have been developed. , Preorganization is frequently assessed by running nanosecond-time scale MD simulations. Due to the high computational cost of MD simulations, there is growing interest in approximating such simulations using ML tools. For example, AlphaFold2 has been adapted to predict multiple conformations of a protein, and the deep learning model BioEmu has been developed to emulate protein equilibrium ensembles on consumer-grade hardware.
Many enzymes undergo multiple conformational changes during the course of a catalytic cycle, which poses a substantial challenge for enzyme design. A notable step toward multistate enzyme design has recently been reported for serine hydrolases that rely on a four-step reaction mechanism. In this instance, researchers placed a catalytic triad and oxyanion hole into de novo backbones using RFdiffusion and subsequently filtered designs using PLACER (Protein–Ligand Atomistic Conformational Ensemble Resolver), a neural network that predicts conformational ensembles of proteins and small molecules. Assessing the preorganization in each step of the catalytic cycle using PLACER increased the design success rate substantially and resulted in enzymes with noteworthy catalytic efficiencies of up to 2.2 × 105 M–1 s–1.
Another important facet of enzymatic catalysis is the influence of electrostatic effects. Over the past decades, it has become increasingly clear that enzymes can produce strong electric fields that promote the reaction. − As electric fields over individual bonds can be computed quickly, there is potential to further enhance design workflows by considering such fields. This notion is supported by the successful in silico optimization of a designed Kemp eliminase based on electric fields.
The examples provided above illustrate that enzyme design has made remarkable progress in terms of design accuracy. Nevertheless, a major bottleneck persists in the challenge of predicting the most active variants among designed enzymes. Addressing this challenge will require a computational analysis of factors such as conformational dynamics and electric fields, which can be used to filter designs and thus further reduce the number of variants that need to be tested to identify active enzymes. Even better results may be obtained by implementing design and filtering in an iterative manner, such that final designs are optimized with regard to catalytically relevant properties.
Enzyme Engineering
The past decades have witnessed major progress in enzyme engineering, largely enabled by directed evolution and the development of high-throughput screening platforms. However, the vastness of protein sequence space, combined with the propensity to become trapped in local optima, renders traditional approaches to exploring fitness landscapes inefficient and prone to suboptimal outcomes. To address these challenges, numerous computational tools have been developed to propose mutations or identify hot spots for improving properties such as activity, thermostability, and solubility. These tools frequently leverage evolutionary information or biophysical calculations and increasingly take advantage of ML algorithms. For example, MutCompute is a convolutional neural network that can identify amino acids that are not optimized for their local chemical microenvironment. Mutations suggested by this tool improved the activity of a PET-degrading enzyme at ambient temperatures while keeping the experimental effort modest. This kind of zero-shot enzyme engineeringi.e., approaches that do not rely on functional experimental datashows great promise, but is likely most powerful in combination with directed evolution or supervised ML strategies.
While screenings over multiple rounds are typically still required, ML can help to make these more efficient. The predictive power of sequence-function data sets has long been recognized, , enabling the prediction of properties for untested variants and thereby guiding engineering campaigns. ML models with their ability to capture complex, nonlinear relationships are well suited to this task. Consequently, machine learning-assisted directed evolution (MLDE) has gained substantial attention in recent years. −
MLDE treats enzyme engineering as a supervised learning problem: starting from an initial sequence-function data set, a model is trained to predict function (often activity) from the amino acid sequence. Subsequently, this model can be used to design a new library of promising variants for experimental testing. This process can be iterated to improve the model and explore the sequence space in a model-guided fashion (referred to as active learning). , Using deep-mutational scanning data sets, it has been demonstrated that MLDE can identify desired variants faster and more reliably than classical directed evolution strategies. However, in practice the success of MLDE campaigns depends on numerous factors, from the choice of suitable encodings and model architectures to the data set size and quality. Countless encodings of amino acid sequences as well as model architectures have been suggested in the literature, and while a few comparative studies have been performed, , the best choice is very much case-dependent. Similarly, the required size of the training data set varies, but a few dozen to a few hundred data points have frequently been reported to be sufficient to develop useful models. − However, larger data sets are likely to result in more reliable models. It is also important to note that some data sets are more informative as training data than others. Strategies aiming to increase the information density include free-energy calculations to exclude destabilizing mutations, as well as advanced zero-shot predictors based on pLMs. Noise and measurement errors can likewise impact model performance substantially. Once a model has been developed, a sampling strategy that balances exploration and exploitation needs to be selected for subsequent rounds. Frequently, active-learning algorithms relying on Bayesian Optimization are applied to achieve this balance. ,
While current tools are often sufficient to perform successful MLDE campaigns, further advances with respect to informative encodings and zero-shot prediction are desirable, particularly for settings where little training data are available. Moreover, most models exhibit limitations in their capability to reliably extrapolate beyond the training distribution or to generalize to related reactions. Therefore, transfer learning strategies that leverage evolutionary information, predicted biophysical properties, or performance data on related, yet easily screenable reactions are interesting areas of research.
Most studies to date target a rather small search space and thus do not deliver on the promise of thorough protein engineering beyond the capabilities of advanced high-throughput screening methods. Similarly, multiobjective optimization remains underexplored, despite the need for enzymes to meet a range of requirements beyond the target activity for industrial application. Thus, ample opportunities remain to further augment enzyme engineering with the help of ML.
Experimental Technologies
Besides breakthroughs in computer science and increased computing power, the progress in ML-driven enzyme development has also been enabled by advances on the experimental side. To fully harness the potential of ML-driven workflows, further progress in experimental methods and approaches is imperative.
One crucial component is the remarkable decline in the cost of DNA synthesis and sequencing, as it provides both training data and experimental validation on a scale that was previously out of reach. Nonetheless, the costs associated with synthesis and sequencing can still constrain library sizes and impede the widespread adoption of ML-based methodologies. For example, while ML can reduce the screening effort during enzyme engineering, the additional sequencing costs may render traditional strategies, where sequence information is only obtained for selected hits, more economically viable. A potential solution is to rely on next-generation sequencing (NGS), which is highly cost-effective on a per-variant basis, but may require bespoke barcoding strategies. , With regard to library generation, strategies to assemble full-length genes from oligonucleotide pools are a way to reduce costs. , Moreover, variational synthesis is a cost-effective way to create extremely large libraries that are the output of a generative model with knowledge of chemical DNA synthesis.
It is important that requirements and expectations are discussed early on in a collaboration, which may involve planning preliminary experiments.
Testing large numbers of enzyme variants can be labor-intensive and time-consuming. Accordingly, screening methods that reduce manual labor and allow for highly parallelized experimentation are desirable for data-driven enzyme development. A possible solution is to make use of lab automation, which also holds the promise of enhanced reproducibility. However, setting up an automated screening procedure can be a time-consuming endeavor in itself. More flexible and modular lab automation solutions could greatly facilitate the adoption of automation and the generation of data for MLDE and other purposes. Moreover, integrating automation with computational pipelines or AI agents may eventually enable the development of enzymes with no or minimal human intervention. In fact, the repetitive nature of enzyme screenings may make the implementation of such “self-driving” laboratories easier than in other fields of research.
Beyond automation, several methods have been developed to screen and characterize enzyme variants at scale, yielding data sets that are highly attractive for ML applications. For example, enzymatic reactions can be miniaturized and parallelized on microfluidic chips, and NGS can be combined with cell sorting or DNA recorders to perform high-throughput sequence-function mapping. However, adapting such methods to specific enzymatic reactions is often challenging. Consequently, the development of a high-throughput sequence-function mapping strategy that is broadly applicable to enzymes remains an outstanding challenge. Biosensors linked to cell growth or DNA modification as a readout could yield very large data sets in a relatively straightforward manner, but as suitable sensors are lacking for most relevant products, better biosensor design methods are required as a first step. Droplet microfluidics coupled with mass spectrometry could be another potent and flexible method; however, maintaining the genotype-phenotype linkage is crucial. It should also be considered that data from high-throughput methods is frequently more noisy than results from classical well-plate assays, which may pose a challenge for the development of accurate ML models.
Challenges in Adopting and Advancing ML-Based Workflows
Besides the technical challenges outlined above, the implementation and development of ML-based tools and workflows can be hindered by numerous practical challenges, for example related to the accessibility of published tools or the lack of a common language between wet lab and dry lab teams. While seemingly mundane, such issues can have a profound impact on the success of projects and the speed of progress in the field.
Currently, experimentalists aiming to integrate ML into their workflows can face a number of hurdles. While some tools are available via special web servers or Colab implementations, others are only available on GitHub and may be challenging to use without computational expertise. Experience is also required with regard to data processing, parameter choices, understanding mathematical models, and the interpretation of results. Efforts from publicly funded organizations to build and maintain accessible implementations of popular tools and provide the necessary training are needed to remove such barriers.
Integrating enzyme design and engineering into a unified pipeline could merge bold leaps in sequence space with fine-grained optimization.
In many cases, progress in the field also depends on the effective collaboration between experimentalists and computer scientists. As most experimentalists and computer scientists possess limited training in ML and biochemistry, respectively, such collaborations can be prone to misunderstandings and unrealistic expectations on both sides. For example, the ideal data set for ML may differ substantially from what can be readily generated in the lab, and proper handling of the data requires an understanding of how the data has been generated. Moreover, it is not uncommon that experimental conditions vary from one screening round to the next (for example due to unreliable equipment or the desire to continuously optimize conditions), but such variations pose a challenge for the computational analysis. For these reasons, it is important that such requirements and expectations are discussed early on in a collaboration (see Figure for a “collaboration checklist”). This may also involve planning preliminary experiments to determine whether the data that can be generated is suitable for computational analysis. Despite such challenges, collaborations between wet lab and dry lab teams are often fruitful and will continue to play a crucial role in biocatalysis. Consequently, it will be important to update curricula at universities to reflect the growing importance of ML in the life sciences and provide future generations of scientists with the required interdisciplinary training.
2.
Checklist for collaborations between wet lab and dry lab teams.
Future Directions
The computational and experimental toolbox available to enzyme designers and engineers has grown at a rapid pace in recent years, and ML now plays a pivotal role in enzyme development pipelines. While some ML tools and strategies (e.g., for protein structure prediction) can be considered mature and well-established, others require further development and validation.
Democratizing access to ML-driven enzyme development will require open-source tools, centralized biofoundries, and low-cost automation.
Frequently, the development of enzyme-specific ML models is hindered by the limited availability of large, high-quality data sets on enzymatic properties across diverse enzyme families, resulting in tools with limited scope and generalizability. Initiatives from funding agencies to support the generation and sharing of such data sets could therefore be highly valuable. Ideally, such data sets should provide information on multiple enzymatic properties (e.g., catalytic parameters, expression yield, thermostability) across various substrates and reactions conditions. In addition, standardized protocols and comprehensive metadata reporting are critical for ML applications. Models trained on such data could enable more accurate zero-shot predictions, which would be useful in a wide range of scenarios. For example, even a coarse-grained activity estimate can greatly reduce the number of variants that need to be tested during enzyme discovery or following de novo design. Data sets on multiple properties could also accelerate the development of multimodal models, which promise to be powerful tools, for example with respect to multiobjective optimization.
Further advancements could come from pipelines that integrate various tools and data sources into combined workflows. In particular, tightly integrating enzyme design and engineering into one unified pipeline can merge the capability of generative models to perform large jumps in sequence space with the local optimization provided by engineering methods. Moreover, such approaches enable iterative improvements to design models based on experimental feedback.
As the number of computational tools continues to grow, it will be crucial to rigorously test and validate them to build trust within the community. In the case of protein structure prediction, the Critical Assessment of Structure Prediction (CASP) competition provided an independent evaluation, quickly fostering confidence in the structure prediction tools. In the case of protein design or engineering, systematic tests by means of comparative studies or competitions are less established. Just recently, a number of competitions have emerged and benchmarks (such as ProteinGym) have been put forward. Such efforts could be highly valuable by providing guidance as to which tools and strategies are the most powerful and reliable within this rapidly evolving field. Moreover, it will be important to address biases in training data sets and remove data leakage between training and test sets to provide a reliable performance evaluation.
Progress on the experimental side will also continue to be important to the field, for example with respect to the cost, throughput, and reproducibility of data acquisition. In addition, active learning strategies would benefit from shorter feedback loops between computation and experiment, which could be facilitated by advances in DNA synthesis (such as enzymatic oligo synthesis and DNA assembly) and lab automation platforms (such as self-driving labs or low-cost benchtop systems). In addition, cell-free gene synthesis and expression systems, possibly in combination with microfluidics, can markedly accelerate the build and test phase of design-build-test-learn cycles. ,, However, it should be considered that differences in expression and reaction conditions between screening and the final application are often problematic. Models that account for such differences and predict performance across a range of conditions could mitigate such problems and facilitate the industrial scale-up process. In particular, multimodal models that integrate sequence and structure with stability and solubility data hold promise for capturing these complexities more comprehensively. In the long term, advanced biophysical simulations (or ML-based approximations thereof) could substantially reduce the need for experimental testing, but such approaches are still in their infancy in the context of biological systems.
Numerous computational and experimental methods are available for the different phases of the DBTL cycle. When selecting which methods to use, a challenge lies in aligning the capabilities and requirements of these methods with each other and with external constraints, such as project budgets and timelines. Moreover, the great diversity of enzymes and enzymatic reactions make the development of universal platforms challenging, both from the computational and experimental perspective. Nonetheless, it is possible to establish broadly applicable pipelines that integrate the discovery or design of candidate sequences with DNA synthesis, expression, and activity assays as well as iterative refinements in a largely or fully automated manner. Depending on the available throughput, different strategies can be envisioned (Figure ). If only a small number of variants can be experimentally tested, virtual screenings (e.g., using docking methods or MD) can be performed to identify the most promising variants, and transfer learning and foundation models can be leveraged to obtain better predictions. If a higher throughput is possible, cost-efficient DNA synthesis methods become important, and the resulting data can be used to train large, possibly multimodal models.
3.
Enabling technologies and synergistic strategies for every stage of the design-build-test-learn cycle. As a strategy for designing novel enzyme variants, generative design (relying on fine-tuned diffusion, language, or flow-based models) is proposed for both low- and high-throughput scenarios. Depending on the available experimental throughput, genes encoding the predicted variants are either directly synthesized (for example, via enzymatic oligo synthesis and assembly) or generated through automated mutagenesis and cloning strategies (for example, relying on ssDNA oligo pools). In low-throughput scenarios, virtual prescreening (for instance, using AI-based protein–ligand complex predictions and MD simulations) offers substantial potential to narrow down the number of variants to be tested experimentally. Low-cost benchtop robotics platforms and cell-free expression systems are viewed as enabling technologies for democratized medium-throughput screening, while further advances in growth-based assays and microfluidics would qualify these technologies for future high-throughput screening applications. This would be particularly valuable for generating large data sets suitable for pretraining multimodal models. Low-throughput scenarios can benefit from pretrained foundation models, as fine-tuning them on specialized data sets enables transfer learning to novel downstream tasks even when data set sizes are limited.
As setting up such “lab-in-a-loop” operations requires substantial financial resources and expertise across multiple disciplines, a critical question will be how to democratize access to state-of-the-art technologies. Centralized biofoundries could play an important role in this regard, in particular if access can be provided via simple web interfaces. Low-cost automation platforms with accessible, chatbot-assisted programming interfaces offer a complementary, decentralized solution by facilitating the adoption of medium-throughput benchtop automation platforms. The latter would benefit from streamlined (cell-free) gene synthesis and expression (vide supra) to reduce the complexity of liquid handling and experimental procedures. In parallel, the development and dissemination of open-source computational tools will be essential to ensure that powerful ML models are broadly accessible, reducing dependence on proprietary platforms and enabling transparent, community-driven innovation.
As both computational and experimental technologies become more mature and powerful, it will be time to move from low-hanging fruit to more challenging but potentially impactful targets. Biocatalysis has long been hailed as an important puzzle piece in the urgently required transition toward a greener economy, and a concerted effort to develop enzymes for impactful “dream reactions” (e.g., new CO2 fixation routes or upcycling of waste products) could be instrumental in realizing this potential.
Acknowledgments
This manuscript was inspired by the talks and discussions at the “ML for Biocatalysis Workshop 2025” in Zurich. We would like to thank Adrian Bunzel, Mattia Gollub, Julian Englert, David Harding-Larsen, as well as all other participants for their contributions to this event. Figures 1 and 3 were created with BioRender (Stockinger, P. (2025) https://BioRender.com/gqwz2de and https://BioRender.com/qxasamn, respectively).
∇.
T.V., P.S., and M.M. contributed equally.
The authors declare no competing financial interest.
References
- Vázquez Torres S., Benard Valle M., Mackessy S. P., Menzies S. K., Casewell N. R., Ahmadi S.. et al. De Novo Designed Proteins Neutralize Lethal Snake Venom Toxins. Nature. 2025;639:225–231. doi: 10.1038/s41586-024-08393-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walls A. C., Fiala B., Schäfer A., Wrenn S., Pham M. N., Murphy M.. et al. Elicitation of Potent Neutralizing Antibody Responses by Designed Protein Nanoparticle Vaccines for SARS-CoV-2. Cell. 2020;183:1367–1382. doi: 10.1016/j.cell.2020.10.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- An L., Said M., Tran L., Majumder S., Goreshnik I., Lee G. R.. et al. Binding and Sensing Diverse Small Molecules Using Shape-Complementary Pseudocycles. Science. 2024;385:276–282. doi: 10.1126/science.adn3780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heckmann C. M., Paradisi F.. Looking Back: A Short History of the Discovery of Enzymes and How They Became Powerful Chemical Tools. ChemCatChem. 2020;12:6082–6102. doi: 10.1002/cctc.202001107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buller R., Lutz S., Kazlauskas R. J., Snajdrova R., Moore J. C., Bornscheuer U. T.. From Nature to Industry: Harnessing Enzymes for Biocatalysis. Science. 2023;382:eadh8615. doi: 10.1126/science.adh8615. [DOI] [PubMed] [Google Scholar]
- Markus B., Gruber C. C., Andreas K., Arkadij K., Stefan L., Gustav O.. et al. Accelerating Biocatalysis Discovery with Machine Learning: A Paradigm Shift in Enzyme Engineering, Discovery, and Design. ACS Catal. 2023;13:14454–14469. doi: 10.1021/acscatal.3c03417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chu A. E., Lu T., Huang P.-S.. Sparks of Function by de Novo Protein Design. Nat. Biotechnol. 2024;42:203–215. doi: 10.1038/s41587-024-02133-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang J., Li F.-Z., Arnold F. H.. Opportunities and Challenges for Machine Learning-Assisted Enzyme Engineering. ACS Cent. Sci. 2024;10:226–241. doi: 10.1021/acscentsci.3c01275. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ariaeenejad S., Gharechahi J., Foroozandeh Shahraki M., Fallah Atanaki F., Han J.-L., Ding X.-Z., Hildebrand F., Bahram M., Kavousi K., Hosseini Salekdeh G.. Precision Enzyme Discovery through Targeted Mining of Metagenomic Data. Nat. Prod. Bioprospecting. 2024;14:7. doi: 10.1007/s13659-023-00426-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lovelock S. L., Crawshaw R., Basler S., Levy C., Baker D., Hilvert D.. et al. The Road to Fully Programmable Protein Catalysis. Nature. 2022;606:49–58. doi: 10.1038/s41586-022-04456-z. [DOI] [PubMed] [Google Scholar]
- Mazurenko S., Prokop Z., Damborsky J.. Machine Learning in Enzyme Engineering. ACS Catal. 2020;10:1210–1223. doi: 10.1021/acscatal.9b04321. [DOI] [Google Scholar]
- Yang K. K., Wu Z., Arnold F. H.. Machine-Learning-Guided Directed Evolution for Protein Engineering. Nat. Methods. 2019;16:687–694. doi: 10.1038/s41592-019-0496-6. [DOI] [PubMed] [Google Scholar]
- The UniProt Consortium. UniProt: The Universal Protein Knowledgebase in 2023. Nucleic Acids Res. 2023, 51, D523–D531. 10.1093/nar/gkac1052 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Altschul S. F., Gish W., Miller W., Myers E. W., Lipman D. J.. Basic Local Alignment Search Tool. J. Mol. Biol. 1990;215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- Steinegger M., Söding J.. MMseqs2 Enables Sensitive Protein Sequence Searching for the Analysis of Massive Data Sets. Nat. Biotechnol. 2017;35:1026–1028. doi: 10.1038/nbt.3988. [DOI] [PubMed] [Google Scholar]
- Hon J., Borko S., Stourac J., Prokop Z., Zendulka J., Bednar D.. et al. EnzymeMiner: Automated Mining of Soluble Enzymes with Diverse Structures, Catalytic Properties and Stabilities. Nucleic Acids Res. 2020;48:W104–W109. doi: 10.1093/nar/gkaa372. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Varadi M., Bertoni D., Magana P., Paramval U., Pidruchna I., Radhakrishnan M.. et al. AlphaFold Protein Structure Database in 2024: Providing Structure Coverage for over 214 Million Protein Sequences. Nucleic Acids Res. 2024;52:D368–D375. doi: 10.1093/nar/gkad1011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin Z., Akin H., Rao R., Hie B., Zhu Z., Lu W.. et al. Evolutionary-Scale Prediction of Atomic-Level Protein Structure with a Language Model. Science. 2023;379:1123–1130. doi: 10.1126/science.ade2574. [DOI] [PubMed] [Google Scholar]
- van Kempen M., Kim S. S., Tumescheit C., Mirdita M., Lee J., Gilchrist C. L. M.. et al. Fast and Accurate Protein Structure Search with Foldseek. Nat. Biotechnol. 2024;42:243–246. doi: 10.1038/s41587-023-01773-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Illergård K., Ardell D. H., Elofsson A.. Structure Is Three to Ten Times More Conserved than SequenceA Study of Structural Response in Protein Cores. Proteins Struct. Funct. Bioinforma. 2009;77:499–508. doi: 10.1002/prot.22458. [DOI] [PubMed] [Google Scholar]
- Steinkellner G., Gruber C. C., Pavkov-Keller T., Binter A., Steiner K., Winkler C.. et al. Identification of Promiscuous Ene-Reductase Activity by Mining Structural Databases Using Active Site Constellations. Nat. Commun. 2014;5:4150. doi: 10.1038/ncomms5150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu T., Cui H., Li J. C., Luo Y., Jiang G., Zhao H.. Enzyme Function Prediction Using Contrastive Learning. Science. 2023;379:1358–1363. doi: 10.1126/science.adf2465. [DOI] [PubMed] [Google Scholar]
- Song Y., Yuan Q., Chen S., Zeng Y., Zhao H., Yang Y.. Accurately Predicting Enzyme Functions through Geometric Graph Learning on ESMFold-Predicted Structures. Nat. Commun. 2024;15:8180. doi: 10.1038/s41467-024-52533-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu H., Deng H., He J., Keasling J. D., Luo X.. UniKP: A Unified Framework for the Prediction of Enzyme Kinetic Parameters. Nat. Commun. 2023;14:8211. doi: 10.1038/s41467-023-44113-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boorla V. S., Maranas C. D.. CatPred: A Comprehensive Framework for Deep Learning in Vitro Enzyme Kinetic Parameters. Nat. Commun. 2025;16:2072. doi: 10.1038/s41467-025-57215-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang M., Fehl C., Lees K. V., Lim E.-K., Offen W. A., Davies G. J.. et al. Functional and Informatics Analysis Enables Glycosyltransferase Activity Prediction. Nat. Chem. Biol. 2018;14:1109–1117. doi: 10.1038/s41589-018-0154-9. [DOI] [PubMed] [Google Scholar]
- Harding-Larsen D., Madsen C. D., Teze D., Kittilä T., Langhorn M. R., Gharabli H.. et al. GASP: A Pan-Specific Predictor of Family 1 Glycosyltransferase Acceptor Specificity Enabled by a Pipeline for Substrate Feature Generation and Large-Scale Experimental Screening. ACS Omega. 2024;9:27278–27288. doi: 10.1021/acsomega.4c01583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khersonsky O., Tawfik D. S.. Enzyme Promiscuity: A Mechanistic and Evolutionary Perspective. Annu. Rev. Biochem. 2010;79:471–505. doi: 10.1146/annurev-biochem-030409-143718. [DOI] [PubMed] [Google Scholar]
- Kumar A., Taubitz J., Meyer F., Imstepf N., Peng J., Tassano E., Moore C., Lochmann T., Snajdrova R., Buller R.. Streamlining Enzyme Discovery and Development through Data Analysis and Computation. Chem. Catal. 2025:101445. doi: 10.1016/j.checat.2025.101445. [DOI] [Google Scholar]
- Tipton K. F., Armstrong R. N., Bakker B. M., Bairoch A., Cornish-Bowden A., Halling P. J.. et al. Standards for Reporting Enzyme Data: The STRENDA Consortium: What It Aims to Do and Why It Should Be Helpful. Perspect. Sci. 2014;1:131–137. doi: 10.1016/j.pisc.2014.02.012. [DOI] [Google Scholar]
- Malzacher S., Meißner D., Range J., Findrik Blažević Z., Rosenthal K., Woodley J. M.. et al. The STRENDA Biocatalysis Guidelines for Cataloguing Metadata. Nat. Catal. 2024;7:1245–1249. doi: 10.1038/s41929-024-01261-x. [DOI] [Google Scholar]
- Lauterbach S., Dienhart H., Range J., Malzacher S., Sporing J.-D., Rother D., Pinto M. F., Martins P., Lagerman C. E., Bommarius A. S., Høst A. V., Woodley J. M., Ngubane S., Kudanga T., Bergmann F. T., Rohwer J. M., Iglezakis D., Weidemann A., Wittig U., Kettner C., Swainston N., Schnell S., Pleiss J.. EnzymeML: Seamless Data Flow and Modeling of Enzymatic Data. Nat. Methods. 2023;20:400–402. doi: 10.1038/s41592-022-01763-1. [DOI] [PubMed] [Google Scholar]
- Walsh I., Fishman D., Garcia-Gasulla D., Titma T., Pollastri G., Harrow J.. et al. DOME: Recommendations for Supervised Machine Learning Validation in Biology. Nat. Methods. 2021;18:1122–1127. doi: 10.1038/s41592-021-01205-4. [DOI] [PubMed] [Google Scholar]
- Röthlisberger D., Khersonsky O., Wollacott A. M., Jiang L., DeChancie J., Betker J.. et al. Kemp Elimination Catalysts by Computational Enzyme Design. Nature. 2008;453:190–195. doi: 10.1038/nature06879. [DOI] [PubMed] [Google Scholar]
- Jiang L., Althoff E. A., Clemente F. R., Doyle L., Röthlisberger D., Zanghellini A.. et al. De Novo Computational Design of Retro-Aldol Enzymes. Science. 2008;319:1387–1391. doi: 10.1126/science.1152692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siegel J. B., Zanghellini A., Lovick H. M., Kiss G., Lambert A. R., St. Clair J. L., Gallaher J. L., Hilvert D., Gelb M. H., Stoddard B. L., Houk K. N., Michael F. E., Baker D.. Computational Design of an Enzyme Catalyst for a Stereoselective Bimolecular Diels-Alder Reaction. Science. 2010;329:309–313. doi: 10.1126/science.1190239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blomberg R., Kries H., Pinkas D. M., Mittl P. R. E., Grütter M. G., Privett H. K.. et al. Precision Is Essential for Efficient Catalysis in an Evolved Kemp Eliminase. Nature. 2013;503:418–421. doi: 10.1038/nature12623. [DOI] [PubMed] [Google Scholar]
- Obexer R., Godina A., Garrabou X., Mittl P. R. E., Baker D., Griffiths A. D.. et al. Emergence of a Catalytic Tetrad during Evolution of a Highly Active Artificial Aldolase. Nat. Chem. 2017;9:50–56. doi: 10.1038/nchem.2596. [DOI] [PubMed] [Google Scholar]
- Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O.. et al. Highly Accurate Protein Structure Prediction with AlphaFold. Nature. 2021;596:583–589. doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baek M., DiMaio F., Anishchenko I., Dauparas J., Ovchinnikov S., Lee G. R.. et al. Accurate Prediction of Protein Structures and Interactions Using a Three-Track Neural Network. Science. 2021;373:871–876. doi: 10.1126/science.abj8754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Anishchenko I., Pellock S. J., Chidyausiku T. M., Ramelot T. A., Ovchinnikov S., Hao J.. et al. De Novo Protein Design by Deep Network Hallucination. Nature. 2021;600:547–552. doi: 10.1038/s41586-021-04184-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang J., Lisanza S., Juergens D., Tischer D., Watson J. L., Castro K. M.. et al. Scaffolding Protein Functional Sites Using Deep Learning. Science. 2022;377:387–394. doi: 10.1126/science.abn2100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yeh A. H.-W., Norn C., Kipnis Y., Tischer D., Pellock S. J., Evans D.. et al. De Novo Design of Luciferases Using Deep Learning. Nature. 2023;614:774–780. doi: 10.1038/s41586-023-05696-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watson J. L., Juergens D., Bennett N. R., Trippe B. L., Yim J., Eisenach H. E.. et al. De Novo Design of Protein Structure and Function with RFdiffusion. Nature. 2023;620:1089–1100. doi: 10.1038/s41586-023-06415-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ahern, W. ; Yim, J. ; Tischer, D. ; Salike, S. ; Woodbury, S. M. ; Kim, D. ; et al. Atom Level Enzyme Active Site Scaffolding Using RFdiffusion2. bioRxiv, 2025, 10.1101/2025.04.09.648075. [DOI]
- Dauparas J., Anishchenko I., Bennett N., Bai H., Ragotte R. J., Milles L. F.. et al. Robust Deep Learning-Based Protein Sequence Design Using ProteinMPNN. Science. 2022;378:49–56. doi: 10.1126/science.add2187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dauparas J., Lee G. R., Pecoraro R., An L., Anishchenko I., Glasscock C., Baker D.. Atomic Context-Conditioned Protein Sequence Design Using LigandMPNN. Nat. Methods. 2025;22:717–723. doi: 10.1038/s41592-025-02626-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Abramson J., Adler J., Dunger J., Evans R., Green T., Pritzel A.. et al. Accurate Structure Prediction of Biomolecular Interactions with AlphaFold 3. Nature. 2024;630:493–500. doi: 10.1038/s41586-024-07487-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chai Discovery team ; Boitreaud, J. ; Dent, J. ; McPartlon, M. ; Meier, J. ; Reis, V. ; et al. Chai-1: Decoding the Molecular Interactions of Life. bioRxiv, 2024, 10.1101/2024.10.10.615955. [DOI]
- Wohlwend, J. ; Corso, G. ; Passaro, S. ; Getz, N. ; Reveiz, M. ; Leidal, K. ; et al. Boltz-1 Democratizing Biomolecular Interaction Modeling. bioRxiv, 2025, 10.1101/2024.11.19.624167. [DOI]
- Madani A., Krause B., Greene E. R., Subramanian S., Mohr B. P., Holton J. M.. et al. Large Language Models Generate Functional Protein Sequences across Diverse Families. Nat. Biotechnol. 2023;41:1099–1106. doi: 10.1038/s41587-022-01618-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Munsamy, G. ; Illanes-Vicioso, R. ; Funcillo, S. ; Nakou, I. T. ; Lindner, S. ; Ayres, G. ; et al. Conditional Language Models Enable the Efficient Design of Proficient Enzymes. bioRxiv 2024, 10.1101/2024.05.03.592223. [DOI] [Google Scholar]
- Hayes T., Rao R., Akin H., Sofroniew N. J., Oktay D., Lin Z.. et al. Simulating 500 Million Years of Evolution with a Language Model. Science. 2025;387:850–858. doi: 10.1126/science.ads0018. [DOI] [PubMed] [Google Scholar]
- Stocco, F. ; Artigues-Lleixà, M. ; Hunklinger, A. ; Widatalla, T. ; Güell, M. ; Ferruz, N. . Guiding Generative Protein Language Models with Reinforcement Learning. arXiv, 2025, arXiv:2412.12979, 10.48550/arXiv.2412.12979. [DOI]
- Blalock, N. ; Seshadri, S. ; Babbar, A. ; Fahlberg, S. A. ; Kulkarni, A. ; Romero, P. A. . Functional Alignment of Protein Language Models via Reinforcement Learning. bioRxiv, 2025, 10.1101/2025.05.02.651993. [DOI]
- Braun, M. ; Tripp, A. ; Chakatok, M. ; Kaltenbrunner, S. ; Fischer, C. ; Stoll, D. ; et al. Computational Enzyme Design by Catalytic Motif Scaffolding. bioRxiv, 2025, 10.1101/2024.08.02.606416. [DOI]
- Chaturvedi S. S., Bím D., Christov C. Z., Alexandrova A. N.. From Random to Rational: Improving Enzyme Design through Electric Fields, Second Coordination Sphere Interactions, and Conformational Dynamics. Chem. Sci. 2023;14:10997–11011. doi: 10.1039/D3SC02982D. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frushicheva M. P., Cao J., Chu Z. T., Warshel A.. Exploring Challenges in Rational Enzyme Design by Simulating the Catalysis in Artificial Kemp Eliminase. Proc. Natl. Acad. Sci. U. S. A. 2010;107:16869–16874. doi: 10.1073/pnas.1010381107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Osuna S.. The Challenge of Predicting Distal Active Site Mutations in Computational Enzyme Design. WIREs Comput. Mol. Sci. 2021;11:e1502. doi: 10.1002/wcms.1502. [DOI] [Google Scholar]
- Duran C., Kinateder T., Hiefinger C., Sterner R., Osuna S.. Altering Active-Site Loop Dynamics Enhances Standalone Activity of the Tryptophan Synthase Alpha Subunit. ACS Catal. 2024;14:16986–16995. doi: 10.1021/acscatal.4c04587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Casadevall G., Duran C., Osuna S.. AlphaFold2 and Deep Learning for Elucidating Enzyme Conformational Flexibility and Its Application for Design. JACS Au. 2023;3:1554–1562. doi: 10.1021/jacsau.3c00188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lewis S., Hempel T., Jiménez-Luna J., Gastegger M., Xie Y., Foong A. Y. K.. et al. Scalable Emulation of Protein Equilibrium Ensembles with Generative Deep Learning. Science. 2025;389:eadv9817. doi: 10.1126/science.adv9817. [DOI] [PubMed] [Google Scholar]
- Lauko A., Pellock S. J., Sumida K. H., Anishchenko I., Juergens D., Ahern W.. et al. Computational Design of Serine Hydrolases. Science. 2025;388:eadu2454. doi: 10.1126/science.adu2454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Warshel A., Sharma P. K., Kato M., Xiang Y., Liu H., Olsson M. H. M.. Electrostatic Basis for Enzyme Catalysis. Chem. Rev. 2006;106:3210–3235. doi: 10.1021/cr0503106. [DOI] [PubMed] [Google Scholar]
- Fried S. D., Boxer S. G.. Electric Fields and Enzyme Catalysis. Annu. Rev. Biochem. 2017;86:387–415. doi: 10.1146/annurev-biochem-061516-044432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eberhart M. E., Alexandrova A. N., Ajmera P., Bím D., Chaturvedi S. S., Vargas S.. et al. Methods for Theoretical Treatment of Local Fields in Proteins and Enzymes. Chem. Rev. 2025;125:3772–3813. doi: 10.1021/acs.chemrev.4c00471. [DOI] [PubMed] [Google Scholar]
- Vaissier V., Sharma S. C., Schaettle K., Zhang T., Head-Gordon T.. Computational Optimization of Electric Fields for Improving Catalysis of a Designed Kemp Eliminase. ACS Catal. 2018;8:219–227. doi: 10.1021/acscatal.7b03151. [DOI] [Google Scholar]
- Merlicek L. P., Neumann J., Lear A., Degiorgi V., de Waal M. M., Cotet T.-S.. et al. AI.Zymes: A Modular Platform for Evolutionary Enzyme Design. Angew. Chem., Int. Ed. 2025;64:e202507031. doi: 10.1002/anie.202507031. [DOI] [PubMed] [Google Scholar]
- Shroff R., Cole A. W., Diaz D. J., Morrow B. R., Donnell I., Annapareddy A.. et al. Discovery of Novel Gain-of-Function Mutations Guided by Structure-Based Deep Learning. ACS Synth. Biol. 2020;9:2927–2935. doi: 10.1021/acssynbio.0c00345. [DOI] [PubMed] [Google Scholar]
- Lu H., Diaz D. J., Czarnecki N. J., Zhu C., Kim W., Shroff R.. et al. Machine Learning-Aided Engineering of Hydrolases for PET Depolymerization. Nature. 2022;604:662–667. doi: 10.1038/s41586-022-04599-z. [DOI] [PubMed] [Google Scholar]
- Liu C., Wu J., Chen Y., Liu Y., Zheng Y., Liu L., Zhao J.. Advances in Zero-Shot Prediction-Guided Enzyme Engineering Using Machine Learning. ChemCatChem. 2025;17:e202401542. doi: 10.1002/cctc.202401542. [DOI] [Google Scholar]
- Fox R. J., Davis S. C., Mundorff E. C., Newman L. M., Gavrilovic V., Ma S. K.. et al. Improving Catalytic Function by ProSAR-Driven Enzyme Evolution. Nat. Biotechnol. 2007;25:338–344. doi: 10.1038/nbt1286. [DOI] [PubMed] [Google Scholar]
- Liao J., Warmuth M. K, Govindarajan S., Ness J. E, Wang R. P, Gustafsson C., Minshull J.. Engineering Proteinase K Using Machine Learning and Synthetic Genes. BMC Biotechnol. 2007;7:1–19. doi: 10.1186/1472-6750-7-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Romero P. A., Krause A., Arnold F. H.. Navigating the Protein Fitness Landscape with Gaussian Processes. Proc. Natl. Acad. Sci. U. S. A. 2013;110:E193–E201. doi: 10.1073/pnas.1215251110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buchler J., Malca S. H., Patsch D., Voss M., Turner N. J., Bornscheuer U. T., Allemann O., Le Chapelain C., Lumbroso A., Loiseleur O., Buller R.. Algorithm-Aided Engineering of Aliphatic Halogenase WelO5* for the Asymmetric Late-Stage Functionalization of Soraphens. Nat. Commun. 2022;13:371. doi: 10.1038/s41467-022-27999-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bedbrook C. N., Yang K. K., Robinson J. E., Mackey E. D., Gradinaru V., Arnold F. H.. Machine Learning-Guided Channelrhodopsin Engineering Enables Minimally Invasive Optogenetics. Nat. Methods. 2019;16:1176–1184. doi: 10.1038/s41592-019-0583-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saito Y., Oikawa M., Nakazawa H., Niide T., Kameda T., Tsuda K., Umetsu M.. Machine-Learning-Guided Mutagenesis for Directed Evolution of Fluorescent Proteins. ACS Synth. Biol. 2018;7:2014–2022. doi: 10.1021/acssynbio.8b00155. [DOI] [PubMed] [Google Scholar]
- Biswas S., Khimulya G., Alley E. C., Esvelt K. M., Church G. M.. Low-N Protein Engineering with Data-Efficient Deep Learning. Nat. Methods. 2021;18:389–396. doi: 10.1038/s41592-021-01100-y. [DOI] [PubMed] [Google Scholar]
- Honda Malca S., Duss N., Meierhofer J., Patsch D., Niklaus M., Reiter S.. et al. Effective Engineering of a Ketoreductase for the Biocatalytic Synthesis of an Ipatasertib Precursor. Commun. Chem. 2024;7:46. doi: 10.1038/s42004-024-01130-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vornholt T., Mutný M., Schmidt G. W., Schellhaas C., Tachibana R., Panke S.. et al. Enhanced Sequence-Activity Mapping and Evolution of Artificial Metalloenzymes by Active Learning. ACS Cent. Sci. 2024;10:1357–1370. doi: 10.1021/acscentsci.4c00258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Freschlin C. R., Fahlberg S. A., Romero P. A.. Machine Learning to Navigate Fitness Landscapes for Protein Engineering. Curr. Opin. Biotechnol. 2022;75:102713. doi: 10.1016/j.copbio.2022.102713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu Z., Kan S. B. J., Lewis R. D., Wittmann B. J., Arnold F. H.. Machine Learning-Assisted Directed Protein Evolution with Combinatorial Libraries. Proc. Natl. Acad. Sci. U. S. A. 2019;116:8852–8858. doi: 10.1073/pnas.1901979116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xu Y., Verma D., Sheridan R. P., Liaw A., Ma J., Marshall N. M.. et al. Deep Dive into Machine Learning Models for Protein Engineering. J. Chem. Inf. Model. 2020;60:2773–2790. doi: 10.1021/acs.jcim.0c00073. [DOI] [PubMed] [Google Scholar]
- Wittmann B. J., Yue Y., Arnold F. H.. Informed Training Set Design Enables Efficient Machine Learning-Assisted Directed Protein Evolution. Cell Syst. 2021;12:1026–1045. doi: 10.1016/j.cels.2021.07.008. [DOI] [PubMed] [Google Scholar]
- Patsch D., Schwander T., Voss M., Schaub D., Hüppi S., Eichenberger M.. et al. Enriching Productive Mutational Paths Accelerates Enzyme Evolution. Nat. Chem. Biol. 2024;20:1662. doi: 10.1038/s41589-024-01712-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Meier, J. ; Rao, R. ; Verkuil, R. ; Liu, J. ; Sercu, T. ; Rives, A. . Language Models Enable Zero-Shot Prediction of the Effects of Mutations on Protein Function. In Advances in Neural Information Processing Systems; Ranzato, M. , Beygelzimer, A. , Dauphin, Y. , Liang, P. S. , Vaughan, J. W. , Eds.; Curran Associates, Inc., 2021; Vol. 34, pp 29287–29303. [Google Scholar]
- Hie B. L., Yang K. K.. Adaptive Machine Learning for Protein Engineering. Curr. Opin. Struct. Biol. 2022;72:145–152. doi: 10.1016/j.sbi.2021.11.002. [DOI] [PubMed] [Google Scholar]
- Frazer J., Notin P., Dias M., Gomez A., Min J. K., Brock K.. et al. Disease Variant Prediction with Deep Generative Models of Evolutionary Data. Nature. 2021;599:91–95. doi: 10.1038/s41586-021-04043-8. [DOI] [PubMed] [Google Scholar]
- Gelman, S. ; Johnson, B. ; Freschlin, C. ; Sharma, A. ; D’Costa, S. ; Peters, J. ; et al. Biophysics-Based Protein Language Models for Protein Engineering. bioRxiv, 2025, 10.1101/2024.03.15.585128. [DOI] [PMC free article] [PubMed]
- Long Y., Mora A., Li F.-Z., Gürsoy E., Johnston K. E., Arnold F. H.. LevSeq: Rapid Generation of Sequence-Function Data for Directed Evolution and Machine Learning. ACS Synth. Biol. 2025;14:230–238. doi: 10.1021/acssynbio.4c00625. [DOI] [PubMed] [Google Scholar]
- Plesa C., Sidore A. M., Lubock N. B., Zhang D., Kosuri S.. Multiplexed Gene Synthesis in Emulsions for Exploring Protein Functional Landscapes. Science. 2018;359:343–347. doi: 10.1126/science.aao5167. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lund S., Potapov V., Johnson S. R., Buss J., Tanner N. A.. Highly Parallelized Construction of DNA from Low-Cost Oligonucleotide Mixtures Using Data-Optimized Assembly Design and Golden Gate. ACS Synth. Biol. 2024;13:745–751. doi: 10.1021/acssynbio.3c00694. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weinstein, E. N. ; Gollub, M. G. ; Slabodkin, A. ; Gardner, C. L. ; Dobbs, K. ; Cui, X.-B. ; et al. Manufacturing-Aware Generative Model Architectures Enable Biological Sequence Design and Synthesis at Petascale. bioRxiv, 2024, 10.1101/2024.09.13.612900. [DOI]
- Angelopoulos A., Cahoon J. F., Alterovitz R.. Transforming Science Labs into Automated Factories of Discovery. Sci. Robot. 2024;9:eadm6991. doi: 10.1126/scirobotics.adm6991. [DOI] [PubMed] [Google Scholar]
- Rapp J. T., Bremer B. J., Romero P. A.. Self-Driving Laboratories to Autonomously Navigate the Protein Fitness Landscape. Nat. Chem. Eng. 2024;1:97–107. doi: 10.1038/s44286-023-00002-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Markin C. J., Mokhtari D. A., Sunden F., Appel M. J., Akiva E., Longwell S. A.. et al. Revealing Enzyme Functional Architecture via High-Throughput Microfluidic Enzyme Kinetics. Science. 2021;373:eabf8761. doi: 10.1126/science.abf8761. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Höllerer S., Desczyk C., Muro R. F., Jeschek M.. From Sequence to Function and Back - High-Throughput Sequence-Function Mapping in Synthetic Biology. Curr. Opin. Syst. Biol. 2024;37:100499. doi: 10.1016/j.coisb.2023.100499. [DOI] [Google Scholar]
- Holland-Moritz D. A., Wismer M. K., Mann B. F., Farasat I., Devine P., Guetschow E. D.. et al. Mass Activated Droplet Sorting (MADS) Enables High-Throughput Screening of Enzymatic Reactions at Nanoliter Scale. Angew. Chem., Int. Ed. 2020;59:4470–4477. doi: 10.1002/anie.201913203. [DOI] [PubMed] [Google Scholar]
- Gantz M., Mathis S. V., Nintzel F. E. H., Lio P., Hollfelder F.. On Synergy between Ultrahigh Throughput Screening and Machine Learning in Biocatalyst Engineering. Faraday Discuss. 2024;252:89–114. doi: 10.1039/D4FD00065J. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Armer, C. ; Kane, H. ; Cortade, D. L. ; Redestig, H. ; Estell, D. A. ; Yusuf, A. ; et al. Results of the Protein Engineering Tournament: An Open Science Benchmark for Protein Modeling and Design. Proteins Struct. Funct. Bioinforma. 2025, 10.1002/prot.70008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Notin, P. ; Kollasch, A. W. ; Ritter, D. ; Van Niekerk, L. ; Paul, S. ; Spinner, H. ; et al. ProteinGym: Large-Scale Benchmarks for Protein Design and Fitness Prediction. bioRxiv, 2023, 10.1101/2023.12.07.570727. [DOI]
- Graber, D. ; Stockinger, P. ; Meyer, F. ; Mishra, S. ; Horn, C. ; Buller, R. . GEMS: A Generalizable GNN Framework For Protein-Ligand Binding Affinity Prediction Through Robust Data Filtering and Language Model Integration. bioRxiv, 2024, 10.1101/2024.12.09.627482. [DOI]
- Ma Y., Zhang Z., Jia B., Yuan Y.. Automated High-Throughput DNA Synthesis and Assembly. Heliyon. 2024;10:e26967. doi: 10.1016/j.heliyon.2024.e26967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Landwehr G. M., Bogart J. W., Magalhaes C., Hammarlund E. G., Karim A. S., Jewett M. C.. Accelerated Enzyme Engineering by Machine-Learning Guided Cell-Free Expression. Nat. Commun. 2025;16:865. doi: 10.1038/s41467-024-55399-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhu J., Meng Y., Gao W., Yang S., Zhu W., Ji X.. et al. AI-Driven High-Throughput Droplet Screening of Cell-Free Gene Expression. Nat. Commun. 2025;16:2720. doi: 10.1038/s41467-025-58139-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Citations
- Ahern, W. ; Yim, J. ; Tischer, D. ; Salike, S. ; Woodbury, S. M. ; Kim, D. ; et al. Atom Level Enzyme Active Site Scaffolding Using RFdiffusion2. bioRxiv, 2025, 10.1101/2025.04.09.648075. [DOI]
- Chai Discovery team ; Boitreaud, J. ; Dent, J. ; McPartlon, M. ; Meier, J. ; Reis, V. ; et al. Chai-1: Decoding the Molecular Interactions of Life. bioRxiv, 2024, 10.1101/2024.10.10.615955. [DOI]
- Wohlwend, J. ; Corso, G. ; Passaro, S. ; Getz, N. ; Reveiz, M. ; Leidal, K. ; et al. Boltz-1 Democratizing Biomolecular Interaction Modeling. bioRxiv, 2025, 10.1101/2024.11.19.624167. [DOI]
- Stocco, F. ; Artigues-Lleixà, M. ; Hunklinger, A. ; Widatalla, T. ; Güell, M. ; Ferruz, N. . Guiding Generative Protein Language Models with Reinforcement Learning. arXiv, 2025, arXiv:2412.12979, 10.48550/arXiv.2412.12979. [DOI]
- Blalock, N. ; Seshadri, S. ; Babbar, A. ; Fahlberg, S. A. ; Kulkarni, A. ; Romero, P. A. . Functional Alignment of Protein Language Models via Reinforcement Learning. bioRxiv, 2025, 10.1101/2025.05.02.651993. [DOI]
- Braun, M. ; Tripp, A. ; Chakatok, M. ; Kaltenbrunner, S. ; Fischer, C. ; Stoll, D. ; et al. Computational Enzyme Design by Catalytic Motif Scaffolding. bioRxiv, 2025, 10.1101/2024.08.02.606416. [DOI]
- Gelman, S. ; Johnson, B. ; Freschlin, C. ; Sharma, A. ; D’Costa, S. ; Peters, J. ; et al. Biophysics-Based Protein Language Models for Protein Engineering. bioRxiv, 2025, 10.1101/2024.03.15.585128. [DOI] [PMC free article] [PubMed]
- Weinstein, E. N. ; Gollub, M. G. ; Slabodkin, A. ; Gardner, C. L. ; Dobbs, K. ; Cui, X.-B. ; et al. Manufacturing-Aware Generative Model Architectures Enable Biological Sequence Design and Synthesis at Petascale. bioRxiv, 2024, 10.1101/2024.09.13.612900. [DOI]
- Notin, P. ; Kollasch, A. W. ; Ritter, D. ; Van Niekerk, L. ; Paul, S. ; Spinner, H. ; et al. ProteinGym: Large-Scale Benchmarks for Protein Design and Fitness Prediction. bioRxiv, 2023, 10.1101/2023.12.07.570727. [DOI]
- Graber, D. ; Stockinger, P. ; Meyer, F. ; Mishra, S. ; Horn, C. ; Buller, R. . GEMS: A Generalizable GNN Framework For Protein-Ligand Binding Affinity Prediction Through Robust Data Filtering and Language Model Integration. bioRxiv, 2024, 10.1101/2024.12.09.627482. [DOI]



