Abstract
High-throughput yeast engineering is being transformed by biofoundries that integrate automation, artificial intelligence (AI), and standardized workflows. This review examines how these facilities accelerate strain development through the Design-Build-Test-Learn (DBTL) cycle, with advances in genome editing, phenotypic screening, and predictive modelling. It highlights Australia’s involvement through the Australian Genome Foundry, Idea-BIO, and the CSIRO Biofoundiry and explores global efforts to overcome reproducibility and standardization challenges. Despite progress, key barriers remain, including protocol variability and integration of AI tools. We also highlight the opportunity for a shift toward autonomous, self-optimizing ‘self-driving labs’ that transition from DBTL to Design-Build-Deploy cycles. The future of yeast engineering depends not only on technological innovation, but also on the harmonization of international standards, data governance, and ethical safeguards. If fully realized, the convergence of robotics, AI, and synthetic biology will redefine yeast engineering, leading to step changes in strain performance for a variety of important products, thus enabling economic and sustainable biomanufacturing at scale.
Keywords: biofoundry, synthetic biology, engineering biology, machine learning
This minireview provides a contemporary overview of the latest developments in biofoundry level yeast strain engineering in Australia and in general.
Introduction
Saccharomyces cerevisiae has long been central to biotechnology due to its genetic tractability, robust industrial performance, and generally regarded as safe (GRAS) status (Wang et al. 2024). Recent advances in genome editing, systems biology, and high-throughput screening have expanded its role as a chassis to produce a wide range of valuable chemicals. A major focus now lies in optimizing strain performance by improving titre, rate, and yield (TRY) key parameters that determine industrial viability (Shi et al. 2025). The field is also rapidly broadening to include non-conventional yeasts, each offering specialized biosynthetic capabilities. Examples include Komagataella phaffii (Pichia pastoris) for heterologous protein production, Yarrowia lipolytica for lipid biosynthesis, and Kluyveromyces marxianus and Scheffersomyces stipitis for efficient bioethanol generation from diverse carbon sources (Borodina and Nielsen 2014, Kuivanen et al. 2018, Cai et al. 2019, J. Zhang et al. 2021, Wives et al. 2024).
To enable high-throughput (HTP) engineering of these diverse hosts, standardized genetic toolkits, transformation protocols, and high-throughput robotics enabled workflows are essential (Holowko et al. 2021, Koray et al. 2022). Technologies like CRISPR, modular cloning, in vivo recombination, and site-specific integrases support this expansion (Shen et al. 2024).
HTP yeast engineering, powered by biofoundries, accelerates the Design-Build-Test-Learn (DBTL) cycle through automation, AI, and advanced analytics (Nielsen and Keasling 2016, Chao et al. 2017, Dixon et al. 2023). These platforms enable rapid, reproducible strain optimization for applications in biomanufacturing, pharmaceuticals, and sustainability (Hillson et al. 2019). Robotics increases the number of strains that can be built and tested and ensure consistency across design, assembly, and cultivation, reducing the time from concept to strain (Holub and Agena 2023, Goold et al. 2025).
Despite these advances, challenges remain in standardization and reproducibility across labs (Lux et al. 2023). Integrating AI for strain selection and predictive modelling presents both opportunities and implementation hurdles (Pretorius 2017, Jervis et al. 2018). Recent advances emphasize that overcoming these hurdles will require the deep integration of AI with mechanistic metabolic models to enable accurate, data-driven strain design (Lu et al. 2024). Still, the convergence of multiomics, single-cell tools, and AI is transforming yeast engineering into a data-rich, precision field, as demonstrated by (Gao et al. 2022) and a recent preprint (Lyu et al. 2024).
The Global Biofoundry Alliance (GBA), established in 2019, fosters open standards and shared infrastructure among public foundries worldwide (Hillson et al. 2019). Australia plays a key role, with facilities like the Australian Genome Foundry, Idea-BIO, and the CSIRO Biofoundry contributing to global efforts in HTP engineering, data interoperability, and protocol harmonization (Dixon et al. 2023, Holub and Agena 2023, Goold et al. 2025, IDEA–Bio 2025). Regional collaborations include an Asia–Pacific Workshop, held in August 2023, which brought together 40 participants from 13 countries across industry, academia, and government to advance the harmonization of metrics and standards (Freemont 2024).
As biofoundries evolve, DBTL is increasingly shifting to Design–Build–Deploy (DBD), emphasizing rapid, AI–guided strain delivery (Dixon et al. 2022). Improvements in knowledge, modelling, and AI, fuelled by high–volume, high–quality omics, and high–throughput datasets, are markedly enhancing design predictability and accuracy, so that fewer iterative wet–lab tests are needed (e.g. ML models trained on multi–omics data now forecast pathway performance with greatly reduced experimental burden (Patra et al. 2023, Kugler and Stensjö 2024); deep integration of AI with genome–scale and enzyme–constrained models delivers more reliable TRY predictions (Kundu et al. 2024, Lu et al. 2024) retrieval–augmented LLM platforms automate knowledge extraction and design suggestions, speeding the transition from design to experiment (Li et al. 2025, Mao et al. 2025); and hybrid AI–automation systems cross–validate in silico predictions with in–line screening, ensuring robust strain delivery (Boob et al. 2024). Globally coordinated, AI–powered biofoundries represent the most scalable and strategic path forward for yeast strain engineering (Castaño-Cerezo et al. 2024). Establishing the infrastructure needed to support this next generation of biofoundries is critical. Australia offers several emerging models that demonstrate how investment in automation, standardization, and data interoperability can enable globally competitive synthetic biology platforms.
Advances in yeast genome synthesis and synthetic biology toolkits
Australia is an important participant in the international synthetic biology community, with major advancements in yeast engineering and investment in automated research platforms (Pretorius 2017, Dixon et al. 2023). Integration of diverse scientific disciplines, coupled with progress in DNA sequencing and synthesis, underpins this growth. Saccharomyces cerevisiae has long served as a model for metabolic engineering, recently demonstrated by Australian-led innovations in isoprenoid biosynthesis. For example, an auxin-inducible protein degradation was implemented to conditionally redirect carbon flux toward mono- and sesquiterpene production, achieving up to 3.5 g l⁻¹ nerolidol by selectively depleting key enzymes (Lu et al. 2021). Similarly, combinatorial terpene pathway engineering and biosensor-guided strain optimization for high-value food and fragrance compounds were also explored (Bongers et al. 2020, Peng and Wei 2025). CRISPR has also been applied by Australian researchers to enhance yeast traits relevant to the wine and fermentation industries (Jagtap et al. 2017).
Beyond traditional engineering, yeast lipid droplets are being used for biocatalysis and biosensing (Suri et al. 2024). Synthetic genome minimization and biosafety strategies, like auxotrophy and toxin-antitoxin systems, are being explored to ensure environmental containment (Pretorius 2017).
Australian-led efforts have spearheaded two major chromosome-scale constructions within the global Yeast 2.0 (Sc2.0) project. Sc2.0 incorporated SCRaMbLE (Synthetic Chromosome Rearrangement and Modification by LoxP-mediated Evolution), an inducible Cre-loxPsym system that generates combinatorial genomic diversity through stochastic deletions, inversions and translocations. By placing loxPsym sites downstream of nonessential loci, SCRaMbLE enabled rapid exploration of genotype–phenotype landscapes, facilitating adaptive gains in pathway performance, stress tolerance or novel trait emergence (Shen et al. 2016). This capability has been applied not only to native chromosomes but also to synthetic neochromosomes (Erpf et al. 2025).
In 2022, Kutyna and colleagues at Macquarie designed and built a 211 kb pan-genome neochromosome by concatenating diverse sequence elements from eight industrial and environmental yeast strains, embedding 63 loxPsym sites to enable SCRaMbLE-mediated variability and providing new metabolic capabilities (Kutyna et al. 2022). More recently, in late 2024 the same team released the fully synthetic 903 kb synXVI, complete with optimized chunk termini, rebalanced tRNA content and corrected loxPsym insertions, demonstrating both high-fidelity assembly and iterative CRISPR-D-BUGS debugging to restore wild-type fitness (Goold et al. 2025).
Since its inception in 2006, the Sc2.0 consortium has pursued a bottom-up DBTL strategy to reengineer all 16 native chromosomes of S. cerevisiae, plus a synthetic tRNA neochromosome (Annaluru et al. 2014). Core design principles include replacement of TAG stop codons with TAA, removal of repetitive and nonessential elements, and integration of standardized PCRTags for assembly tracking. This ‘connecting-the-dots’ effort has unified computational design (BioStudio), hierarchical SwAP-In assembly, and centralized debugging workflows under the Yeast 2.0 banner (Pretorius and Boeke 2018).
High-throughput screening (HTS) systems are advancing recombinant protein production in S. cerevisiae and K. phaffii with work on promoter variants, terminators, and co-expression of folding chaperones like HAC1 and PDI improving yields (Peng et al. 2015, 2018, Navone et al. 2021). The advances for HTS also include yeast-based protein secretion biosensors for such organisms (Peng et al. 2021, Navone et al. 2023, Cleaver et al. 2024), and synthetic circuits for environmental and functional monitoring (Scott et al. 2018, 2020). Biosensor applications go beyond detecting proteins, as exemplified by the G-protein–coupled receptor biosensor detecting serotonin in S. cerevisiae (Saleski et al. 2024).
In sum, Australia’s strategic investments in synthetic genomics, metabolic engineering, automation, and AI have enabled important contributions to global synthetic biology innovation (Pretorius 2019, Dixon et al. 2023, Freemont 2024, Goold et al. 2025). However, to accelerate progress in synthetic biology, it will be essential to scale up the deployment of HTP platforms. When combined with robust automation, standardized workflows, and international collaboration, these systems can unlock faster DBTL cycles, enhance reproducibility, and support the translation of innovations into a design-built-deploy model and real-world applications.
High-throughput phenotypic screening for complex traits
Synthetic biology’s shift to HTS and automation is driven by the DBTL cycle, which applies engineering principles to biological design (Kittleson et al. 2012, Carbonell et al. 2018, Pleiss 2024). The design phase uses computational tools like Benchling and Teselagento streamline construct planning, codon optimization, and constraint-based modelling (Nathan et al. 2012, Linshiz et al. 2014, Davies 2020, Fero et al. 2020).
The build phase leverages synthetic DNA, genome editing, and automation platforms such as AssemblyTron and AutoBiotech. Liquid-handling robots and acoustic droplet ejection systems reduce human error and reagent use whilst increasing throughput (Hadimioglu et al. 2016, Bryant et al. 2023).
In the test phase, yeast strains are evaluated using HTS platforms including microplate assays, FACS, multi-omics, and droplet microfluidics (Zhang et al. 2023, Lopez-Barbera et al. 2024, Shiraishi et al. 2024). Microfluidic systems encapsulate single cells in nanolitre droplets that behave as microbioreactors, enabling millions of miniaturized, parallel assays (Panwar et al. 2023) and link genotype to real-time reporter signals, enriching top variants via FACS (Zhang et al. 2023, Saleski et al. 2024).
The learn phase range from the use of simple tools, such as Excel and R, to advanced AI-driven models that predict genotype–phenotype relationships and refine experimental designs (Zhang et al. 2020). Robust laboratory information management systems (LIMS) and ML pipelines are increasingly critical for managing large datasets (Berezin et al. 2023, Zhang et al. 2025).
Modern biofoundries integrate these technologiesthrough automation, HTS, and data analysis evolving toward a DBD model focused on rapid translation to real-world applications (Dixon et al. 2022, Tu et al. 2022, Bryant et al. 2023). Table 1 provides an overview of the core hardware and software technologies enabling scalable HTP DBTL workflows in modern biofoundries.
Table 1.
Key hardware and software technologies accelerating HTP DBTL workflows in synthetic biology. This table highlights representative tools used across the DBTL cycle in yeast synthetic biology. It includes automation platforms, microfluidic systems, liquid handling hardware, and computational tools that enable rapid strain prototyping, screening, and data-driven optimization. These technologies are central to modern biofoundries and the transition toward digitally integrated, deployable microbial solutions.
| Workflow stage | Traditional approach | Automated biofoundry approach | Performance gain | References |
|---|---|---|---|---|
| Design | Manual CAD tools, limited rational design | AI-aided design using LLMs, PLMs, genome-scale metabolic models, BioCAD, SBOL, design Space exploration | +speed, +scope, +predictive power, +design diversity | (Gurdo et al. 2023, Ding et al. 2024, Ruan et al. 2024) |
| Build | Manual cloning, transformation | Automated modular DNA assembly (Golden Gate, MoClo, YTK) and genome editing (CRISPR/Cas, base editors) with robotics | +throughput, +modularity, +standardization, +multiplexing | (Chao et al. 2017, Koray et al. 2022, Gurdo et al. 2023) |
| Test | Plate-based assays, colony picking | HTS, microfluidics, FACS, omics, biosensors, single-cell analytics, automated data acquisition | +resolution, +data volume, +cell-level granularity, +speed | (Holland and Blazeck 2022, Martin et al. 2023) |
| Learn | Manual analysis, expert intuition | ML/AI integration (Bayesian optimization, deep learning, active learning), LIMS, cloud-based feedback loops | +predictivity, +cycle speed, +hypothesis generation | (Samek 2017, Jervis et al. 2018, Ding et al. 2024) |
| Iteration time | Weeks/months | Hours/days via closed-loop AI–robotics integration | ×10–100 faster | (Gurdo et al. 2023, Martin et al. 2023) |
| Standardization | Informal protocols, lab-dependent/specific | Use of SBOL, SEVA, BioBricks, MIQE, modular toolkits, harmonized protocols across foundries | +reproducibility, +collaboration, +interoperability | (Bustin et al. 2009, Smolke 2009, Quinn et al. 2015, Chao et al. 2017, Freemont 2024) |
| Automation level | Manual, stepwise, pipetting robots | Fully autonomous self-driving labs (SDLs), digital twins, automated liquid handling, robotics integrated with AI agents | +consistency, +scalability, +parallelism, +24/7 uptime | (Gurdo et al. 2023, Martin et al. 2023, Kim 2025) |
| Interoperability | Siloed tools, format mismatches | Plug-and-play instrument modules via OPC UA, standard APIs, cloud-based coordination platforms | +flexibility, +device compatibility, +scale-out capability | (Freemont 2024, Li et al. 2024) |
| Data infrastructure | Spreadsheets, local servers | Integrated LIMS + data lakes, metadata tracking, real-time ML pipelines, FAIR-compliant architecture | +traceability, +ML-readiness, +multi-foundry learning | (Gurdo et al. 2023, Martin et al. 2023, Freemont 2024) |
| Feedback mechanism | Manual literature review, empirical design | Active learning loops, ML-guided optimization, LLM-based experiment planning, real-time sensor feedback | +exploration efficiency, +adaptability, +autonomy | (H. Yang et al. 2022, Ruan et al. 2024, Zhang et al. 2025) |
| Sustainability | Not evaluated routinely | Life cycle assessment (LCA), green chemistry metrics, low-waste microfluidics | +environmental compliance, +process efficiency, +public trust | (Asin-Garcia et al. 2024, Freemont 2024) |
| Regulatory readiness | Case-by-case, post hoc | Data-rich process logs, standardization for safety assessment, traceable workflows, reproducible synthetic routes | +approval speed, +regulatory transparency | (Freemont 2024) |
| Biosafety & biosecurity | Manual containment, SOPs | Automated risk classification, digital containment protocols, controlled access to genetic elements | +risk mitigation, +auditability, +compliance | (Hoffmann et al. 2023, Freemont 2024) |
| Knowledge sharing | Manual methods, internal notebooks | Open-source hardware/software, multi-foundry standard exchange formats | +community acceleration, +comparability | (Hillson et al. 2019, Gupta et al. 2024) |
While HTP testing offers unparalleled data generation, its full potential is unlocked only when paired with computational tools capable of interpreting and using those data.
AI, machine learning, and generative models in the DBTL cycle
Artificial intelligence (AI) and machine learning (ML) have reshaped synthetic biology by providing predictive modelling, accelerating experimental workflows, and integrating complex datasets. In yeast engineering, AI-driven pipelines can refine the Design–Build–Test–Learn (DBTL) cycle by guiding construct design, predicting performance, and optimizing metabolic pathways (Lee et al. 2023, Ramos et al. 2024). This section outlines how AI and generative models, combined with specialized software and high-throughput platforms, enhance DBTL accuracy, scope, and efficiency.
Enhanced speed and predictive accuracy
Biofoundries increasingly incorporate AI solutions throughout DBTL to reduce trial-and-error, enable real-time feedback, and automate large-scale experimentation (Goshisht 2024, Zhang et al. 2025). Since every wet-lab assay carries significant cost and only a finite number of designs can be empirically tested, AI-driven in silico exploration of thousands of genetic configurations allows prioritization of high-confidence variants for validation, preserving scarce financial, temporal and material resources (Ramos et al. 2024). AI also assists in evaluating HTS data fidelity by flagging anomalies from robotic errors or instrument drift(Camacho et al. 2018), and supports standardization through domain-specific ‘Construction File’ (CF) formats that encode molecular biology operations and enable automated detection of design flaws and validation of workflows across labs (Ataii et al. 2023).
Generative models in yeast engineering: PLMs and LLMs
Generative AI frameworks, including large language models (LLMs) and protein language models (PLMs), accelerate yeast engineering by automating pathway discovery and in silico design of genetic constructs (Radivojević et al. 2020, Cheng et al. 2023). ESM–2 predicts enzyme fitness metrics from sequence data and suggests mutations for downstream experimental validation in S. cerevisiae (Zhang et al. 2025), whilst LLM–driven pipelines could be adapted to draft yeast strain construction protocols and interface with liquid–handling robots or cell–free protein synthesis workflows (Herisson et al. 2024). Unsupervised generative models such as Generative Adversarial Networks or Variational Autoencoders could in principle explore combinatorial libraries of promoters, terminators and coding sequences to uncover novel regulatory elements that enhance metabolite production; an application that still awaits in vivo validation in yeast (Radivojević et al. 2020). As these tools mature, they could guide the design of bespoke biosynthetic pathways and tailor regulatory networks to the unique physiology of yeast hosts.
AI-Driven data analysis and predictive modelling
AI–driven analytics underpin yeast metabolic engineering by transforming high–volume screening and multi–omics datasets into actionable design insights. Supervised methods such as support vector machines, random forests and deep neural networks, trained on integrated transcriptomic, metabolomic, and phenotypic profiles, enable accurate genotype–phenotype predictions that outperform linear models for complex traits (Goshisht 2024, Herisson et al. 2024). For example, deep learning frameworks have been used to predict metabolic flux distributions in engineered S. cerevisiae, closing the feedback loop in the ‘learn’ phase of DBTL and guiding subsequent strain optimization (Kim et al. 2020).
Hybrid mechanistic–ML pipelines embed genome–scale metabolic or kinetic models within transformer architectures to pinpoint pathway bottlenecks and recommend targeted gene interventions (Song et al. 2024). Retrosynthetic tools, RetroPath 2.0 and RetroBioCat, leverage curated enzyme databases alongside ML–guided route planning to propose multi–step biosynthetic pathways specifically optimized for yeast hosts (Finnigan et al. 2021, Yonet et al. 2024). Automated guide–RNA design with CRISPR–GPT has streamlined multiplexed genome by generating high–confidence editing libraries (Qu et al. 2024).
Underpinning these capabilities are robust data standards and tight integration with laboratory information management systems. Structured formats (e.g. SBOL) and ‘Construction File’ representations ensures seamless data exchange, automated error checking, and full provenance tracking across biofoundry workflows (Ataii et al. 2023, Cheng et al. 2023, Song et al. 2024, Vidal et al. 2025). A recent preprint underscores the importance of interpretability, traceability, and auditability in AI–driven experimentation, makes compliance an essential design consideration for any large–scale yeast engineering project (Boiko et al. 2023).
Future prospects for ‘speed-to-strain’ engineering
As LLMs and generative models converge with robotic platforms, fully autonomous DBTL pipelines could drastically shorten design cycles (Ramos et al. 2024). Reduced trial-and-error and targeted combinatorial searches enable ‘speed-to-strain’ approaches, where AI prunes infeasible designs and proposes high-yield constructs, while improving scale-up decisions by factoring in real-time quality data (Yook and Alper 2025).
These integrated AI frameworks promise yeast engineering guided by flexible, multi-modal models that unify text, structure, and experimental data, shifting synthetic biology toward large-scale, autonomous engineering practices (Boiko et al. 2023, Cheng et al. 2023). Table 2 summarizes the main AI/ML tools used in yeast engineering with other relevant tools for automation that have not been validated in yeast systems are in Table S1.
Table 2.
AI/ML tools used in yeast synthetic biology and their functions.
| Tool/model | Type | DBTL role | Function/ application | Input/output | Yeast eng. application | Reference |
|---|---|---|---|---|---|---|
| ESM-2 | Protein language model (LLM) | DL | Zero-shot fitness prediction of protein variants, design of enzyme libraries | Protein sequences Fitness likelihood scores | Predicts the impact of coding mutations on yeast‐expressed enzymes. | (Zhang et al. 2025) |
| Digital twins | Simulation/hybrid ML model | D | Predict pathway flux, simulate strain behaviour, virtual strain construction | Design specs → phenotype simulations | Simulates strain performance to rank designs before experiments. | (Gurdo et al. 2023) |
| Alphafold all atom/multimer | Structure prediction + ML | D | Predicts 3D structure of designed enzymes in new pathways, even conformed as multimers | Sequence → structure (PDB) | Predicts structures of engineered proteins before expression. | (Evans et al. 2021, Hou et al. 2025) |
| AUTOGPT + LIMS | Agent framework + LLM | BL | Self-guided agents for planning, executing, and refining biofoundry tasks, including multigene construct synthesis | Text prompt + context → experimental plan | Orchestrates end–to–end strain construction workflows. | (Zhang et al. 2025) |
| Metabolomics-ML | Random Forests/DNN | TL | Predict phenotypic outputs from metabolomic or fluxomics profiles | Omics data → trait classification/regression | Predicts yeast production traits from metabolomics data. | (Goshisht 2024) |
| ecFactory | Constraint-based + ML-guided design | D | Predicts optimal metabolic engineering interventions and platform-strain targets | ecYeastGEM + product list → ranked OE/KO/KD gene targets | Designs S. cerevisiae strains for 100 + chemicals; prioritizes edits before Build/Test | (Domenzain et al. 2025) |
| Camformer | Deep CNN for promoters | L→D | Learns promoter→ expression mapping; enables in-silico promoter design | Promoter sequences → predicted expression | Trained on millions of S. cerevisiae promoters; guides regulatory-part libraries | (Dash and Bornelöv 2025) |
| Pymaker (pre-trained promoter model) | Transfer learning/foundation DNA model | L→D | Pre-training + fine-tuning to improve promoter activity prediction/design | Unlabeled promoter + labeled yeast data → activity predictor | Improves S. cerevisiae promoter design with less data | (Yang et al. 2025) |
| Codon-choice PLM | Protein language model | L→D | Infers synonymous-codon constraints directly from protein sequence; recommends codon usage | AA sequence → codon choice/optimization signals | Codon optimization for yeast expression driven by PLM-learned constraints | (Sakharova and Lareau 2025) |
| DLTRNM (distributed large-scale transcriptional regulatory neural network) | Knowledge-integrated ML model | L | Reconstructs transcriptional regulatory networks using pre-trained ML + prior knowledge | Multi-omics + priors → TRN edges & regulators | Maps S. cerevisiae regulation to reveal engineering levers | (Fan et al. 2025) |
| Polygraph | Software framework (ML-assisted assessment) | TL | Systematic evaluation of native and designed promoters | Promoter sequences + models → divergence/feature analyses | Benchmarks computationally designed S. cerevisiae promoters prior to build | (Lal et al. 2025) |
| DL-based yeast colony counter | Deep learning (object detection) | T | High-throughput CFU/colony counting from plate images (species-aware) | Plate images → colony counts & labels | Automates test-stage imaging; distinguishes S. cerevisiae vs. K. humilis | (Aiki et al. 2025) |
| ML-assisted signal peptide design (Y. lipolytica) | ML-guided sequence design + directed evolution | L→B→T | Optimizes secretion via data-driven SP selection/engineering | SP sequences + secretion data → next-round variants | Boosts heterologous protein secretion in yeast | (Z. Wu et al. 2025) |
| Automated hypothesis & experiment planner | LLM agent + automated lab | DBTL | Generates hypotheses, plans/executes experiments, logs outcomes | Objective + assay APIs → next experiment + results graph | Demonstrated on S. cerevisiae interaction experiments; fits closed-loop biofoundry ops | (Brunnsåker et al. 2025) |
Importance of manually curated data
AI enables powerful data analysis and predictive modelling by identifying complex patterns in large, high-dimensional datasets (Camacho et al. 2018). However, its performance depends heavily on training data quality; biased or sparse datasets can yield unreliable results compared to expert-guided methods (Goshisht 2024). This underscores the need for human–in–the–loop workflows, where domain experts curate training data, validate AI outputs, and provide continuous feedback to refine models. Manually curated data, shaped by expert knowledge, are particularly valuable for rare events or poorly characterized phenomena and often offer greater interpretability when exploring biological mechanisms, as discussed in a preprint (Samek 2017).
Hybrid approaches are emerging to combine the strengths of both. BioNursery, for example, uses LLMs to generate hypotheses, refined via crowd-sourced expert input to avoid hallucinations and ensure relevance (Jamil et al. 2024), while domain-specific tools like CRISPR-GPT and GP-GPT rely on manual verification, (Lyu et al. 2024, Qu et al. 2024).
This integration of AI and expert curation offers a pragmatic middle ground, accelerating discovery without compromising reliability. As data infrastructures mature and AI systems gain self-correction capabilities, the field advances toward fully autonomous biofoundries that execute design, experimentation and learning cycles with minimal human intervention whilst preserving scientific oversight
Toward autonomous and self-optimizing biofoundries
Automation is central to biofoundries, with fully autonomous, self-optimizing ‘self-driving labs’ integrating robotic execution with AI-driven decision-making to cycle through DBTL phases with minimal human input (Kim 2025). Protein language model-guided systems have autonomously generated enzyme variants, built and tested them, and refined designs over multiple iterations (Zhang et al. 2025), whilst cell-free platforms using a language model and active learning achieved improved protein yields without manual intervention (Herisson et al. 2024).
Advances in hardware, such as automated pickers, fermenters, analysers, and software, including Bayesian optimization and AI-guided design, are accelerating strain engineering. These tools help prioritize informative experiments and avoid unproductive iteration (‘involution’) in DBTL cycles (Liao et al. 2022). Early autonomous platforms have shown success in narrow tasks like enzyme optimization; the next step is scaling to whole-cell engineering (Bozkurt et al. 2025).
Importantly, these advances extend beyond S. cerevisiae. Modern biofoundries increasingly support other yeasts like K. phafii and Y. lipolytica, aided by modular toolkits and adaptable automation. Biofoundries have integrated multiple organisms for various applications, though each requires tailored methods. For example, Y. lipolytica miniaturized workflows leads to perturbations in phenotype such as flocculation, necessitating hardware innovations (Celińska and Gorczyca 2024).
The effectiveness of autonomous, self-optimizing biofoundries depends on data quality within iterative DBTL cycles. Despite rapid progress, technical bottlenecks constrain full autonomy: high-throughput readouts produce noisy proxy measurements that impede robust model training (Chafai et al. 2023); heterogeneous data integration remains challenged by differing experimental scales, temporal patterns, and batch effects (de Crécy-Lagard et al. 2025). Hardware reliability issues, including robot failures, and inter-instrument differences, impede large-scale data generation and sharing (Kim et al. 2025, Tobias and Wahab 2025). Optimization landscapes for whole-cell engineering are high-dimensional and rugged; strategies that work for low-dimensional enzyme optimization do not yet scale efficiently to multi-trait, environment-dependent cellular phenotypes within affordable experimental budgets (Aghdam et al. 2024). Finally, insufficient provenance and metadata prevent reliable federated learning across facilities, limiting sample diversity for robust model generalization. Advancing autonomous biofoundries therefore requires concurrent improvements in measurement science, instrument standardization, robust ML methods for distribution shift, and richer, large multi-modal datasets.
Beyond technical challenges, economic costs and human capital present additional barriers. Establishing a modern biofoundry requires substantial capital expenditure for automation, instrumentation, and compute infrastructure; operating costs and high-availability staffing elevate total cost of ownership (Vickers and Freemont 2022, Asin-Garcia et al. 2024). Capital intensity and uncertain return timelines deter academic groups and SMEs from investing in in-house biofoundries, driving reliance on shared facilities (Vickers and Freemont 2022). Although automation reduces demand for traditional wet-lab expertise, biofoundries require a fundamentally different skill set: computational biologists, automation engineers, and data scientists. Current gaps in AI literacy, systems-level automation, and data engineering reflect a mismatch between available skills and those demanded by autonomous DBTL workflows (Trevisan et al. 2024). These barriers argue for policy interventions (shared-facility funding, subsidized training pipelines), standardized modular equipment to reduce entry costs, and open curricula building an ‘AI-ready’ biotech workforce.
Standardization for reproducibility
While bespoke protocols produce reliable data within a single biofoundry, robust standardization of assay methods, data schemas, and metadata ontologies are critical for sharing and aggregating datasets across facilities, a prerequisite for training generalizable AI models. Pre-defined, validated workflows and uniform file formats also reduce lead times for new experiments by eliminating bespoke method development (Casas et al. 2024, Kim et al. 2025). Without harmonization, cross-site integration stalls and automated DBTL efficiency is severely limited.
Biofoundries advance standardization through modular, automated workflows that reduce human error (Gupta et al. 2024). Toolkits like YeastFab, Yeast Optogenetic Toolkit, and Multiplex Yeast Toolkit have improved DNA assembly efficiency (Shaw et al. 2023, Harmer and McClean 2025, Hoffmann 2025), though variations in enzymes, reaction conditions, and equipment calibration still affect outcomes, highlighting the need for benchmarking across diverse chassis organisms.
Phenotypic screening encompasses assays tailored to the trait under investigation: growth kinetics, product titres, reporter-based fluorescence, cell viability, enzyme activity, and metabolite quantification (Pham et al. 2021). Core protocol elements such as media composition, pH, inoculum density, temperature, and calibration workflows should be standardized to ensure comparability across experiments and laboratories (Lloyd 2020). Conversely, bespoke methods remain necessary for novel measurements such as custom droplet-based assays or specialized LC-MS/MS workflows, where detection chemistries and instrument parameters vary with each target. Differences in statistical analysis approaches and proprietary software further undermine cross-study comparability (Gilbertson et al. 2024). Reproducibility therefore requires harmonization of core assay conditions and data-processing pipelines for routine traits, alongside clearly documented custom protocols for specialized measurements.
Community standards are advancing standardization. SBOL enables genetic design exchange, whilst repositories like BioBrick, SEVA, and SynBioHub support part reuse (Smolke 2009, Chao et al. 2017, Beal et al. 2020). Open-source platforms such as Synthace and OpenWorkstation encapsulate validated protocols in programmable formats (Anhel et al. 2023, Torres-Acosta et al. 2024), while MQTT-based systems and Python APIs support cross-platform calibration (Eggert et al. 2020, Gervasi et al. 2021).
Standardized phenotypic metrics, such as TRY, are also gaining traction (Beal et al. 2020). As biofoundries generate increasingly large datasets, alignment on QC thresholds, metadata standards, and findable, accessible, interoperable, reusable (FAIR) data principles becomes crucial for enabling machine learning and cross-site validation (Pleiss 2024).
To move from principles to practice, we recommend an evidence-backed harmonization roadmap that leverages existing community standards and GBA coordination. Immediately actionable steps include: (1) adopt common exchange formats and ontologies across DBTL phases (e.g. SBOL v3 for design, COMBINE/OMEX for model, and simulation packaging, MIxS/GenBank for sequence deposition and MIFlowCyt/MIQE/MIAPE for assay metadata where applicable) to ensure machine-readable interoperability and re-use; (2) require ‘minimum information’ checklists for DNA constructs on publication, building on proposals for complete sequence disclosure, machine-readable files, and provenance metadata to enable benchmarking and replication; (3) define a focused, publishable minimal reporting standard for DNA assembly and genome editing experiments to be endorsed by journals and funders, capturing full sequences, assembly methods, fragment conditions, QC methods and readouts, and strain deposition identifiers; (4) implement cross-location benchmarking exercises and ring-trials coordinated by the Global Biofoundry Alliance to quantify inter-lab variance for core assays and publish standard operating ranges and QC thresholds; (5) mandate persistent identifiers and centralized deposition (Addgene/PIPdb/SynBioHub/ENA) for constructs and metadata with machine-readable schemas (JSON-LD or SBOL) to support automated aggregation; (6) promote open reagent and enzyme formulation reporting so that enzymatic sources are not hidden variables in transferability; and (7) couple technical steps with capacity building, coordinated training modules, published protocol libraries in executable formats (Autoprotocol/OPIL/SBOL), and a shared hardware calibration package to reduce entry costs for new sites. Many elements align with recent community efforts: SBOL v3 provides a robust design exchange format, and coordinated benchmarking has shown that systematic harmonization materially reduces variance between laboratory locations (Hillson et al. 2019, McLaughlin et al. 2020, Bock et al. 2022, Thuronyi et al. 2023, Claussnitzer et al. 2024, Golebiewski et al. 2024, Vegh et al. 2024). Operationalizing these steps will require coordinated endorsement by the Global Biofoundry Alliance, major funders and journals, coupled with modest investments in tooling for metadata capture and automated deposition to reduce compliance barriers for experimental teams.
DNA assembly: advances and standardization challenges
DNA assembly methods like Golden Gate (Engler et al. 2008, Lee et al. 2015) and Gibson Assembly (Gibson et al. 2009) underpin HTP DBTL cycles in yeast engineering. Modular Golden Gate–based toolkits, including the scarless Golden EGG system (Biró et al. 2024), and a MoClo–compatible plasmid set for hierarchical assembly were created (de Vries et al. 2025). Integration with automation platforms, such as AssemblyTron on Opentrons OT–2 (Bryant et al. 2023) and PlasmidMaker iBioFAB (Enghiad et al. 2022), enables parallel construction of thousands of constructs with minimal manual intervention. Yeast’s native homologous recombination simplifies large fragment assembly, enabling entire synthetic chromosomes like synXVI (>900 kb) (Goold et al. 2025).
Enzymatic DNA synthesis delivers kilobase fragments with gene–scale assembly compatibility (Simmons et al. 2023). HTP liquid–handling platforms integrates one–pot HiFi/Type IIS reactions, generates thousands of constructs per day with error frequencies below one per 10 kb (Ma et al. 2024). Continuous–flow microfluidic systems miniaturize cell–free assemblies to nanolitre volumes, cutting reagent costs by two orders of magnitude (Baranwal and Maerkl 2024). Updated multi–kingdom Golden Gate toolkits define consensus overhang sets and reaction buffers that increase cross–lab success rates, and modular design pipelines now feed directly into robotic execution (Vegh et al. 2025). Commercial bench–top printers such as the BioXp 3250 bring chip–based oligo synthesis and gene assembly together, shortening design–to–build cycle to one day (Ma et al. 2024). Complementary standards like SEVA 4.0 provide curated vectors with harmonized origins and selection markers to enhance plug–and–play interoperability (Martínez-García et al. 2023).
Reproducibility remains a bottleneck: Inter–lab comparisons show order–of–magnitude differences in Golden Gate yields due to proprietary enzyme blends and undocumented buffer additives (Bell and Molloy 2022). Overhang sequence context can halve efficiency, underscoring the need for agreed design rules and minimum-information checklists capturing enzyme provenance, ligation kinetics and fragment purity (Lux et al. 2023, Strzelecki et al. 2024). Comprehensive user guides have therefore recommended minimum–information checklists that capture enzyme provenance, ligation kinetics and fragment purity to facilitate benchmarking (Bird et al. 2022). Another study highlighted that despite automated protocols, construct design choices strongly influence cross–facility performance (Cummins et al. 2023). Calls for stronger, simpler publishing standards argue that complete sequence disclosure and open reagent formulations are prerequisites for scalable, community–wide validation (Thuronyi et al. 2023). In practice, journals and community started operationalizing these ideas: A proposed mandatory sequence deposition and a short ‘Box 1’ style minimum reporting checklist for DNA constructs (report full sequences in GenBank/FASTA, note which parts are empirically validated, deposit physical samples where possible), and more specialized minimum-information standards (for example, MAVE and pooled-CRISPR reporting guidelines) show how compact, targeted checklists can accelerate reuse of DNA construct designs, sequence-resolved datasets, and associated phenotype measurements in benchmarking, meta-analysis, and ML applications (Bustin et al. 2009, Bock et al. 2022, Claussnitzer et al. 2024).
Genome editing advances and standardization challenges
Genome editing in yeast has diversified beyond CRISPR–Cas9 to include a range of novel nucleases and precision–editing modalities. CRISPR–Cas system remains the foundation for most workflows in S. cerevisiae, praised for its simplicity, high targeting specificity, and operational efficiency (X. Wu et al. 2025). Alternative nucleases such as ErCas12a offer T–rich PAM recognition and near–100% editing efficiencies (Bennis et al. 2023). Base editors have also been engineered in yeast, including a first–of–its–kind diversifying cytosine deaminase (AID) fusion that enables rapid C–to–T transitions without double–strand breaks (Cazier et al. 2023). Multiplexed editing combining optimized sgRNA arrays and high–throughput transformation protocols have reduced strain engineering timelines from weeks to days (Zhang et al. 2021). A Cas9–fusion strategy (Cas9–Brex27–FadR) recruits achieved integration efficiencies of 98% for 10 kb fragments and nearly 80% for 40 kb fragments (Xu et al. 2024). For even larger assemblies, the HAnDy protocol employs CRISPR–assisted chromosome elimination enables Mb–scale pathway engineering (Ma et al. 2025).
An emerging alternative to conventional editing is the de novo synthesis and deployment of entire synthetic chromosomes. In November 2023, the Sc2.0 consortium reported a strain containing more than 50% synthetic DNA across seven artificial chromosomes, demonstrating that cells tolerate extensive genome rewiring (Goold et al. 2025). Subsequent efforts have consolidated multiple synthetic chromosomes into single strains, revealing novel genetic interactions and streamlined strain backbones for reproducible engineering (Zhao et al. 2023).
To date, the most pressing obstacle is cross–lab reproducibility. Comparative studies show that variations in sgRNA scaffold design, Cas expression vectors, and donor–template architecture can shift editing efficiencies by an order of magnitude, highlighting the need for harmonized reporting standards (Antony et al. 2022). Recent reviews echo this concern and propose minimum–metadata checklists, covering PAM context, off–target assays, and transformation parameters, to enable meaningful benchmarking across platforms (X. Wu et al. 2025). For genome editing, community recommendations provide templates that can be adapted into a compact ‘Minimum Information for Genome Editing’ checklist covering sgRNA sequences and scaffolds, delivery vectors and expression context, donor template sequence and homology arms, off-target assay methods and sensitivity, transformation efficiencies, and deposition of editing reagents/cells to repositories (Beal et al. 2020, Bock et al. 2022, Malci et al. 2022, Skrekas 2022, Thuronyi et al. 2023, Claussnitzer et al. 2024). Formalizing these elements as a field-endorsed minimum standard, and linking compliance to routine deposition in public repositories, would substantially reduce ambiguity in cross-facility comparisons and enable automated QC in biofoundry networks.
Standards and metrics for reproducibility and interoperability
Achieving reliable, scalable bioengineering requires community standards for data, methods, and performance metrics (Hillson et al. 2019, Beal et al. 2020). Over the last few years, there has been a strong push toward technical standards that promote reproducibility and interoperability in synthetic biology. One example is the Synthetic Biology Open Language (SBOL), which has been updated (v3.0/3.1) to facilitate exchanging genetic designs and engineering intent across software platforms (Quinn et al. 2015, Vidal et al. 2025). SBOL provides a structured format to describe DNA parts, circuits, and assemblies, enabling design compatibility across laboratories and automation systems.
Standard ontologies and COMBINE archives are increasingly adopted to package datasets, models, and metadata together, ensuring consistent interpretation and enabling data-centric automation (Matzko and Konur 2024). These structured formats form a ‘digital language’ for biofoundries, aligning design environments with build and test systems across institutional boundaries.
International consortia, academic institutions, industry partners, and regulatory bodies all recognize the need for unified standards. The Global Biofoundry Alliance (GBA) supports international collaboration by promoting standardized workflows, data formats, and best practices (Hillson et al. 2019). Similarly, the International Genetically Engineered Machine (iGEM) Competition reinforces standardized design principles, contributing over 20 000 modular parts to the Registry of Standard Biological Parts and training thousands of researchers (Smolke 2009, Gupta et al. 2024).
Recent studies highlight the importance of performance metrics and data quality control in synthetic biology. For instance, a remote robotic lab replicated a classic yeast gene circuit experiment at scale, revealing which results remained robust under HTP scrutiny (Golebiewski et al. 2024). Standard performance metrics such as TRY are now commonly used to evaluate industrial strain development and bioprocess viability.
Another key technical focus is laboratory interoperability. Efforts from other fields, such as medical imaging’s DICOM standard, have inspired sharable protocol formats and toolchains. Frameworks such as MoClo, Phytobricks, and Loop provide standardized DNA assembly methods compatible with automation platforms (Vidal et al. 2025). Open-access repositories like Addgene, SEVA, and SynBioHub facilitate access to validated plasmids, strains, and parts (Vegh et al. 2025).
Ultimately, standardization of methods, formats, and metrics is driving a new era of collaborative synthetic biology. As biofoundries proliferate worldwide, they rely on common data structures, interoperable instrumentation, and community-wide best practices. These foundations enable distributed DBTL workflows, where design, build, and test operations occur in different institutions without the need for extensive reformatting or re-validation. By ensuring experiments can be repeated, improved, and transferred with confidence, standardization becomes a cornerstone of scalable, reliable bioengineering.
Policy, regulatory, and data governance for global biofoundries
As biofoundries expand globally, policy and governance frameworks are evolving to support safe, collaborative, and efficient development. A central focus is data governance: protocols for sharing designs, metadata, and materials enable reproducibility across labs (Asin-Garcia et al. 2024, Watkins et al. 2024). A yeast strain producing a novel antibiotic, for instance, can be transferred between facilities if both adhere to shared standards such as SBOL and minimal phenotypic datasets. This effort also includes open MTAs and cloud-based experimental logging systems, like those from the DARPA SD2 program (Leins et al. 2023).
Standardized phenotypic screening and quality metrics are critical for HTP workflows (Tancheva et al. 2025). Researchers are aligning assay conditions, media, and reporter systems (Beal et al. 2020, John et al. 2020), while guidelines like MIACA, and FAIR principles promote transparency (Nault et al. 2023). Platforms such as protocols.io and FAIRDOMHub facilitate method sharing and reduce variability (Gygli and Pleiss 2020, ‘Welcoming protocols.io,’ 2024). Reporting negative results is also encouraged to support iterative improvement (Brazil 2024).
Regulatory landscapes increasingly shape foundry operations. Australia’s Gene Technology Regulations 2001 exempt SDN1 edits from GMO oversight, directly affecting strain design strategies and regulatory data packages; however, this exemption is sometimes misrepresented, prompting calls for precise regulatory mapping in distributed DBTL programs (Thygesen 2024). China’s Biosafety/Biosecurity Law (2020/2021) consolidates governance of biotechnology R&D, human genetic resources, and pathogen laboratories, while analyses note persistent implementation gaps relevant to distributed foundry networks (Cao 2021). Singapore’s risk-based system and investments in SynCTI and SINERGY illustrate a supportive innovation ecosystem attentive to dual-use concerns and interoperable data sharing (Trump et al. 2020, Farzaneh and Freemont 2021, Millett et al. 2023). Comparative analyses show China pairing statutory controls with soft-law frameworks such as the Tianjin Biosecurity Guidelines, whereas Singapore integrates infrastructure and governance to support regional DBTL interoperability (Trump et al. 2020, Hynek 2025).
Technical standards are increasingly tied to regulatory compliance, especially as biofoundries move toward deployment phases (Freemont 2024). ISO Technical Committee 276 is drafting guidelines on bioprocessing and data traceability, supported by efforts from metrology institutes like NIST (Farzaneh and Freemont 2021). Biofabrication standards, paralleling GMP, aim to ensure consistent yields and reliable data across labs (Beal et al. 2020, Vidal et al. 2025). Internationally, biosafety oversight remains uneven: ISO 35 001:2019 offers the first global biorisk-management standard, but it is still not widely adopted, and many regions continue to rely on BMBL or the WHO Laboratory Biosafety Manual; frameworks that currently omit explicit AI safeguards (Undheim 2024).
With AI-driven automation, biosecurity and ethics are becoming more pressing. A 2024 publication portrays the situation as a ‘whack-a-mole’ governance challenge, advocating a dynamic blend of precautionary hard law, stewardship-oriented soft law, bottom-up community norms, and industry-led initiatives to keep risks in check (Undheim 2024).
A multi-stakeholder red-team study demonstrated that generative protein design tools could produce synthetic homologs that initially evaded standard sequence-order screening (Wittmann et al. 2025). Following responsible disclosure, industry screening systems were patched to ∼97% detection accuracy, restricting ability to order genes encoding AI-designed risky proteins. Separately, public LLMs show tightening safeguards: certain models refused disinformation prompts even under jailbreaking, whilst others improved safeguards over time, indicating operational restrictions applied by AI developers relevant to bioscience communication risk (Menz et al. 2024). Policy scholarship argues for mandatory evaluations of biological foundation models, linking general-purpose AI regulation to proposed model-level safeguards and controlled access to AI-enabled biological design (Grinbaum and Adomaitis 2024).
Beyond design and predictive modelling, specialized workflows extend into gene editing, genotype–phenotype mapping, and biosafety. GP-GPT captures genotype–phenotype links across large datasets (Lyu et al. 2024), whilst ThreatSeq screens for potentially hazardous sequences in real time to strengthen sequence-level biosurveillance (Hoffmann et al. 2023). Deploying protein language models at scale depends on consistently labelled training data and integration with robust laboratory information management systems synchronized with automated build–test pipelines (Cheng et al. 2023, Song et al. 2024). Regulators increasingly emphasize traceability and interpretability in AI-driven experimentation; early engagement is prudent because LLMs and generative models can propose constructs outside typical design boundaries (Boiko et al. 2023). Proposals include screening AI-generated designs and tracking engineered organisms to prevent misuse (Holub and Agena 2023). Data ownership and privacy have emerged as central sociolegal concerns in collaborative synthetic biology as the balance between openness and commercial becomes more relevant (McLennan and Maslen 2025). Federated data trust architectures and controlled access repositories using attribute-based access controls and blockchain-enabled provenance tracking are being implemented to secure data sharing whilst preserving intellectual property rights (Taddese et al. 2025). Targeted training initiatives including embedded science data literacy programs and interdisciplinary workshops have been incorporated into graduate and professional curricula to equip scientists with competencies in data curation, ethical sharing and regulatory compliance (Qiao et al. 2024).
Automation tools further reinforce standardization. Platforms like AssemblyTron, PyLabRobot, and DNAda translate genetic designs into reproducible liquid-handling protocols with built-in monitoring (Bryant et al. 2023, Nava et al. 2023, Wierenga et al. 2023). In a recent preprint, RoboCulture, an AI-driven, hardware-agnostic system enhances portability across robotic setups (Angers et al. 2025), while benchmarking tools support strategic automation choices (Rupp et al. 2024). These systems align with FAIR principles and improve traceability (Marilene et al. 2021, Pleiss 2024).
Global interoperability depends on shared standards and mutual trust. The UK, Singapore, and China are building compatible infrastructures where design, testing, and analytics can be distributed across locations (Asin-Garcia et al. 2024, Watkins et al. 2024). Regulatory bodies are updating submission protocols to track strains developed through automated cycles, with blockchain-style traceability and kill switch safeguards under discussion (Farzaneh and Freemont 2021). Emerging frameworks call for AI-enabled early-warning systems to detect and mitigate biohazards and for broadening the Dual Use Research of Concern (DURC) regime so that generative-AI design tools fall within its scope. Multi-level governance continuums, spanning global, national, corporate, laboratory, and citizen tiers, are being proposed to ensure that precautionary, stewardship, bottom-up, and market-driven measures act in concert (Undheim 2024).
In summary, data governance and policy are key to scaling the biofoundry revolution. Shared protocols, robust quality metrics, and strong biosecurity frameworks are enabling reproducible, distributed, and responsible innovation (Hillson et al. 2019, Leavell et al. 2020). Through global harmonization of DBTL workflows, synthetic biology is transitioning from ad hoc experimentation to a coordinated, scalable, and secure engineering discipline (Holub and Agena 2023).
Cross-domain inspirations: learning from materials and pharma
Synthetic biology increasingly draws on advances from adjacent fields like nanomaterials, chemical engineering, and drug discovery. Self-driving labs, originating in chemistry and materials science, employ closed-loop systems where robots run experiments and AI selects the next ones, accelerating discoveries in catalysts and light-harvesting materials (Martin et al. 2023, Kim 2025). Bioengineers are adapting similar approaches, using Bayesian optimization and evolutionary algorithms, first popular in drug formulation, to guide strain engineering by selecting gene combinations or culture parameters.
AlphaFlow, an AI-controlled chemical synthesis system, parallels biofoundry pipelines where multiple reactions or design steps must be orchestrated (Volk et al. 2023). Lessons from AlphaFlow’s decision logic can inform control systems for complex metabolic engineering. Similarly, HTS in pharma, used to test millions of compounds, is analogous to testing large strain libraries or combinatorial pathways in synthetic biology (Salunke et al. 2022). Drug discovery workflows increasingly rely on in silico design and cloud labs, like IBM’s RoboRXN, where users specify outcomes and automation handles the rest (O’Neill 2021). A similar model is emerging for biofoundries: users could submit a target compound and organism and receive a custom-engineered yeast strain. Standards from pharmaceutical biotechnology, such as SiLA 2 and ANSI/ISA protocols, are being adopted to improve device interoperability (Juchli 2022). Even MLOps, Machine Learning Operations from the software industry, is being applied to maintain and update predictive models used in biofoundry workflows (Alla and Adari 2020).
Regarding materials informatics, active learning techniques used to optimize carbon nanotube production are now applied to protein engineering and metabolic pathway optimization (B. Yang et al. 2022). Digital twins, computational models that simulate experiments, are already being used in synthetic biology to model bioreactors or predict metabolic outcomes, reducing trial-and-error in the Design phase (Martin et al. 2023).
Operational practices from semiconductor manufacturing and industrial automation also offer valuable lessons. Cleanroom protocols, QA/QC procedures, and continuous improvement philosophies (kaizen) are being adopted to enhance biofoundry reproducibility (Chien et al. 2020, Zhao and Yoshikuni 2021). Similarly, pre-competitive collaborations in pharma and materials science, like the Materials Genome Initiative, have inspired data-sharing efforts such as the Global Biofoundry Alliance (Holub and Agena 2023).
Regulatory insights from other domains, such as FDA oversight of automated drug production, are shaping discussions about oversight for engineered microbes produced by biofoundries (Niazi 2023). Synthetic biology is not evolving in isolation but converging with automation, AI and data science. This cross-pollination is making biofoundries more efficient and integrated by design.
Conclusion
High-throughput yeast engineering is shifting from labour-intensive trial-and-error to AI-driven, automated platforms that can design, build, test, and learn at unprecedented speed. Biofoundries are no longer just tools to increase throughput; they are cyber-physical infrastructures transforming how we prototype and deploy biological systems. As synthetic biology advances, the convergence of robotics, machine learning, and multi-omics is enabling truly autonomous, self-optimizing ‘self-driving labs.’ Yet technology alone is not enough. Scalable and responsible innovation requires standardization, robust data governance, and harmonized policy frameworks.
Governance must evolve alongside capability. The 1975 Asilomar conference on recombinant DNA set in progress proactive, precautionary frameworks, showing that transparency and regulation can enable innovation while safeguarding society. At the 2025 Asilomar conference on the future of biotechnology, a range of topics were discussed (resulting entreaties are available at https://repository.rice.edu/collections/e4e3a33e-08a4-427a-9b59-cb30d93b8bbe), including around AI-driven design and biosafety. Embedding auditability, explainability, and robust oversight into biofoundry operations will be essential to maintain public trust as the scale and autonomy of these systems increase.
Equitable access is also central to the long-term impact of high-throughput yeast engineering. Without deliberate investment in shared tools and distributed capacity, advanced workflows risk remaining concentrated in countries with existing research strengths. Initiatives that work on open-source biotechnology and local biomanufacturing illustrate how this can be addressed—through open enzyme collections, community-driven reagent production, and training programmes that lower barriers to participation. These examples demonstrate that equitable access is not peripheral but foundational, ensuring that biofoundries evolve from elite infrastructures into global engines of collaboration and innovation.
The next generation of yeast bioengineering will be defined not only by smarter tools but by a smarter system. Biofoundries are becoming the operating system for a new era of biological design; one that demands coordinated investment in automation, AI, and governance while embedding transparency, equity, and sustainability at its core.
Supplementary Material
Contributor Information
Juan P O Martinez, Advanced Engineering Biology Future Science Platform, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Dutton Park, Brisbane, QLD 4102, Australia; School of Biology and Environmental Science, Queensland University of Technology, Brisbane, QLD 4000, Australia.
Robert E Speight, Advanced Engineering Biology Future Science Platform, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Dutton Park, Brisbane, QLD 4102, Australia; School of Biology and Environmental Science, Queensland University of Technology, Brisbane, QLD 4000, Australia.
Conflicts of interest
The authors declare no conflicts of interest related to the content of this manuscript. Both authors have read and approved the final version of the manuscript and agree with its submission.
Funding
This work was supported by the Advanced Engineering Biology Future Science Platform at the Commonwealth Scientific and Industrial Research Organisation (CSIRO), Australia.
References
- Aghdam R, Tang X, Shan S et al. Human limits in machine learning: prediction of potato yield and disease using soil microbiome data. BMC Bioinf. 2024;25:366. 10.1186/s12859-024-05977-2. [DOI] [Google Scholar]
- Aiki K, Tsuchiya R, Kushida A et al. Rapid counting of Kazachstania humilis and Saccharomyces cerevisiae in sourdough by deep learning-based classifier. J Microbiol Methods. 2025;236:107183. 10.1016/j.mimet.2025.107183. [DOI] [PubMed] [Google Scholar]
- Alla S, Adari SK. What is mlops?In Beginning MLOps with MLFlow: Deploy Models in AWS SageMaker, Google Cloud, and Microsoft Azure. Berkeley, CA: Apress, 2020, pp. 79–124. [Google Scholar]
- Angers K, Darvish K, Yoshikawa N et al. RoboCulture: a robotics platform for automated biological experimentation. arXiv. 2025. 10.48550/arXiv.2505.14941 [DOI] [Google Scholar]
- Anhel A-M, Alejaldre L, Goñi-Moreno Á. The laboratory automation protocol (LAP) format and repository: a platform for enhancing workflow efficiency in synthetic biology. ACS Synth Biol. 2023;12:3514–20. 10.1021/acssynbio.3c00397. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Annaluru N, Muller H, Mitchell LA et al. Total synthesis of a functional designer eukaryotic chromosome. Science. 2014;344:55–58. 10.1126/science.1249252. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Antony JS, Hinz JM, Wyrick JJ. Tips, tricks, and potential pitfalls of CRISPR genome editing in Saccharomyces cerevisiae. Front Bioeng Biotechnol. 2022;10:924914. 10.3389/fbioe.2022.924914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Asin-Garcia E, Fawcett JD, Batianis C et al. A snapshot of biomanufacturing and the need for enabling research infrastructure [Review]. Trends Biotechnol. 2025;43:1000–14. 10.1016/j.tibtech.2024.10.014. [DOI] [PubMed] [Google Scholar]
- Ataii N, Bakshi S, Chen Y et al. Enabling AI in synthetic biology through construction file specification. PLoS One. 2023;18:e0294469. 10.1371/journal.pone.0294469. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baranwal AK, Maerkl SJ. A comprehensive review of microfluidic approaches in cell-free synthetic biology. Front Synth Biol. 2024;2:1397533. 10.3389/fsybi.2024.1397533. [DOI] [Google Scholar]
- Beal J, Goñi-Moreno A, Myers C et al. The long journey towards standards for engineering biosystems: are the Molecular Biology and the Biotech communities ready to standardise?. EMBO Rep. 2020;21:e50521. 10.15252/embr.202050521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bell NA, Molloy JE. Efficient golden gate assembly of DNA constructs for single molecule force spectroscopy and imaging. Nucleic Acids Res. 2022;50:e77. 10.1093/nar/gkac300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bennis NX, Anderson JP, Kok SM et al. Expanding the genome editing toolbox of Saccharomyces cerevisiae with the endonuclease Er Cas12a. FEMS Yeast Res. 2023;23:foad043. 10.1093/femsyr/foad043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Berezin C-T, Aguilera LU, Billerbeck S et al. Ten simple rules for managing laboratory information. PLoS Comput Biol. 2023;19:e1011652. 10.1371/journal.pcbi.1011652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bird JE, Marles-Wright J, Giachino A. A user’s guide to golden gate cloning methods and standards. ACS Synth Biol. 2022;11:3551–63. 10.1021/acssynbio.2c00355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Biró JB, Kecskés K, Szegletes Z et al. Golden EGG, a simplified Golden Gate cloning system to assemble multiple fragments. Sci Rep. 2024;14:25288. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bock C, Datlinger P, Chardon F et al. High-content CRISPR screening. Nat Rev Methods Primers. 2022;2:8. 10.1038/s43586-021-00093-4. [DOI] [Google Scholar]
- Boiko DA, MacKnight R, Gomes G. Emergent autonomous scientific research capabilities of large language models. arXiv. 2023. 10.48550/arXiv.2304.05332 [DOI] [Google Scholar]
- Bongers M, Perez-Gil J, Hodson MP et al. Adaptation of hydroxymethylbutenyl diphosphate reductase enables volatile isoprenoid production. eLife. 2020;9:e48685. 10.7554/eLife.48685. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Boob AG, Chen J, Zhao H. Enabling pathway design by multiplex experimentation and machine learning. Metab Eng. 2024;81:70–87. 10.1016/j.ymben.2023.11.006. [DOI] [PubMed] [Google Scholar]
- Borodina I, Nielsen J. Advances in metabolic engineering of yeast Saccharomyces cerevisiae for production of chemicals. Biotechnol J. 2014;9:609–20. 10.1002/biot.201300445. [DOI] [PubMed] [Google Scholar]
- Bozkurt EU, Ørsted EC, Volke DC et al. Accelerating enzyme discovery and engineering with high-throughput screening. Nat Prod Rep. 2026. 10.1039/D4NP00031E [DOI] [Google Scholar]
- Brazil R. Illuminating’the ugly side of science': fresh incentives for reporting negative results. Nature. 2024. 10.1038/d41586-024-01389-7. [DOI] [Google Scholar]
- Brunnsåker D, Gower AH, Naval P et al. Self-driven biological discovery through automated hypothesis generation and experimental validation. BioRxiv. 2025. 10.1101/2025.06.24.661378. [DOI] [Google Scholar]
- Bryant JA Jr, Kellinger M, Longmire C et al. AssemblyTron: flexible automation of DNA assembly with Opentrons OT-2 lab robots. Synth Biol (Oxf). 2023;8:ysac032. 10.1093/synbio/ysac032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bustin SA, Benes V, Garson JA et al. The MIQE guidelines: minimum information for publication of quantitative real-time PCR experiments. Clin Chem. 2009;55:611–22. 10.1373/clinchem.2008.112797. [DOI] [PubMed] [Google Scholar]
- Cai P, Gao J, Zhou Y. CRISPR-mediated genome editing in non-conventional yeasts for biotechnological applications. Microb Cell Fact. 2019;18:1–12. 10.1186/s12934-019-1112-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Camacho DM, Collins KM, Powers RK et al. Next-generation machine learning for biological networks. Cell. 2018;173:1581–92. 10.1016/j.cell.2018.05.015. [DOI] [PubMed] [Google Scholar]
- Cao C. China’s evolving biosafety/biosecurity legislations. J Law Biosci. 2021;8:lsab020. 10.1093/jlb/lsab020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carbonell P, Jervis AJ, Robinson CJ et al. An automated design-build-test-learn pipeline for enhanced microbial production of fine chemicals. Commun Biol. 2018;1:66. 10.1038/s42003-018-0076-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Casas A, Bultelle M, Kitney R. An engineering biology approach to automated workflow and biodesign [Article]. Synth Biol. 2024;9:Article ysae009. 10.1093/synbio/ysae009. [DOI] [Google Scholar]
- Castaño-Cerezo S, Chamas A, Kulyk H et al. Combining systems and synthetic biology for in vivo enzymology. EMBO J. 2024;43:5169–5185. 10.1038/s44318-024-00251-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cazier AP, Irvin OM, Chávez LS et al. A rapid antibody enhancement platform in Saccharomyces cerevisiae using an improved, diversifying CRISPR base editor. ACS Synth Biol. 2023;12:3287–300. 10.1021/acssynbio.3c00299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Celińska E, Gorczyca M.‘Small volume—big problem’: culturing Yarrowia lipolytica in high-throughput micro-formats. Microb Cell Fact. 2024;23:184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chafai N, Hayah I, Houaga I et al. A review of machine learning models applied to genomic prediction in animal breeding. Front Genet. 2023;14:1150596. 10.3389/fgene.2023.1150596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chao R, Mishra S, Si T et al. Engineering biological systems using automated biofoundries. Metab Eng. 2017;42:98–108. 10.1016/j.ymben.2017.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng K, Guo Q, He Y et al. Exploring the potential of GPT-4 in biomedical engineering: the dawn of a new era. Ann Biomed Eng. 2023;51:1645–53. 10.1007/s10439-023-03221-1. [DOI] [PubMed] [Google Scholar]
- Chien C-F, Chen Y-H, Lo M-F. Advanced quality control (AQC) of silicon wafer specifications for yield enhancement for smart manufacturing. IEEE Trans Semicond Manufact. 2020;33:569–77. 10.1109/TSM.2020.3010200. [DOI] [Google Scholar]
- Claussnitzer M, Parikh VN, Wagner AH et al. Minimum information and guidelines for reporting a multiplexed assay of variant effect. Genome Biol. 2024;25:100. 10.1186/s13059-024-03223-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cleaver A, Luo R, Smith OB et al. High-throughput optimisation of protein secretion in yeast via an engineered biosensor [Article]. Trends Biotechnol. 2025;43:838–67., 10.1016/j.tibtech.2024.11.010. [DOI] [PubMed] [Google Scholar]
- Cummins B, Vrana J, Moseley RC et al. Robustness and reproducibility of simple and complex synthetic logic circuit designs using a DBTL loop. Synth Biol. 2023;8:ysad005. 10.1093/synbio/ysad005. [DOI] [Google Scholar]
- Dash T, Bornelöv S. Predicting gene expression using millions of yeast promoters reveals cis-regulatory logic. Bioinform Adv. 2024;5:vbaf130. 10.1093/bioadv/vbaf130. [DOI] [Google Scholar]
- Davies K. From the bench to benchling. GEN Edge. 2020;2:303–9. 10.1089/genedge.2.1.52. [DOI] [Google Scholar]
- de Crécy-Lagard V, Dias R, Sexson N et al. Limitations of current machine learning models in predicting enzymatic functions for uncharacterized proteins. G3: Genes, Genomes, Genetics. 2025;15:jkaf169. 10.1093/g3journal/jkaf169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- de Vries ST, Kley L, Schindler D. Use of a golden gate plasmid set enabling scarless MoClo-compatible transcription unit assembly. In Golden Gate Cloning. Methods in Molecular Biology. New York, NY: Humana, 2025;2850. 10.1007/978-1-0716-4220-7_7 [DOI] [Google Scholar]
- Ding N, Yuan Z, Ma Z et al. AI-assisted rational design and activity prediction of biological elements for optimizing transcription-factor-based biosensors. Molecules. 2024;29:3512. 10.3390/molecules29153512. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dixon TA, Walker RSK, Pretorius IS. Visioning synthetic futures for yeast research within the context of current global techno-political trends [Review]. Yeast. 2023;40:443–56. 10.1002/yea.3897. [DOI] [PubMed] [Google Scholar]
- Dixon TA, Williams TC, Pretorius IS. Bioinformational trends in grape and wine biotechnology. Trends Biotechnol. 2022;40:124–35. 10.1016/j.tibtech.2021.05.001. [DOI] [PubMed] [Google Scholar]
- Domenzain I, Lu Y, Wang H et al. Computational biology predicts metabolic engineering targets for increased production of 103 valuable chemicals in yeast. Proc Natl Acad Sci USA. 2025;122:e2417322122. 10.1073/pnas.2417322122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Eggert S, Mieszczanek P, Meinert C et al. OpenWorkstation: a modular open-source technology for automated in vitro workflows. HardwareX. 2020;8:e00152. 10.1016/j.ohx.2020.e00152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Enghiad B, Xue P, Singh N et al. PlasmidMaker is a versatile, automated, and high throughput end-to-end platform for plasmid construction. Nat Commun. 2022;13:2697. 10.1038/s41467-022-30355-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Engler C, Kandzia R, Marillonnet S. A one pot, one step, precision cloning method with high throughput capability. PLoS One. 2008;3:e3647. 10.1371/journal.pone.0003647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Erpf PE, Meier F, Walker RS et al. Building synthetic chromosomes one yeast at a time: insights from Sc2. 0. Nat Biotechnol. 2025:1–8. [DOI] [PubMed] [Google Scholar]
- Evans R, O’Neill M, Pritzel A et al. Protein complex prediction with Alpha Fold-Multimer. bioRxiv. 2021. 10.1101/2021.10.04.463034 [DOI] [Google Scholar]
- Fan X, Xiang G, Liao W et al. Decoding yeast transcriptional regulation via a data-and mechanism-driven distributed large-scale network model. Synth Syst Biotechnol. 2025;10:1140–9. 10.1016/j.synbio.2025.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Farzaneh T, Freemont PS. Biofoundries are a nucleating hub for industrial translation. Synth Biol. 2021;6:ysab013. 10.1093/synbio/ysab013. [DOI] [Google Scholar]
- Fero MJ, Craft JK, Vu T et al. Combinatorial-Hierarchical DNA Library Design Using the TeselaGen DESIGN Module with j5. In: Chandran S., George K. (eds), DNA Cloning and Assembly. Methods in Molecular Biology. New York, NY: Humana, 2020;2205. [Google Scholar]
- Finnigan W, Hepworth LJ, Flitsch SL et al. RetroBioCat as a computer-aided synthesis planning tool for biocatalytic reactions and cascades. Nat Catal. 2021;4:98–104. 10.1038/s41929-020-00556-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Freemont, P.S., Ni, C., Aurand, E. et al. . Engineering Biology Metrics and Technical Standards for the Global Bioeconomy. London, UK, 2024. 10.25561/110822. [DOI] [Google Scholar]
- Gao J, Xu J, Zuo Y et al. Synthetic biology toolkit for marker-less integration of multigene pathways into Pichia pastoris via CRISPR/Cas9. ACS Synth Biol. 2022;11:623–33. 10.1021/acssynbio.1c00307. [DOI] [PubMed] [Google Scholar]
- Gervasi A, Cardol P, Meyer PE. Open-hardware wireless controller and 3D-printed pumps for efficient liquid manipulation. HardwareX. 2021;9:e00199. 10.1016/j.ohx.2021.e00199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gibson DG, Young L, Chuang R-Y et al. Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat Methods. 2009;6:343–5. 10.1038/nmeth.1318. [DOI] [PubMed] [Google Scholar]
- Gilbertson C, Mundt M, Teves J et al. Towards evidence-based software quality practices for reproducibility: practices and aligned software qualities. In: Proceedings of the 2nd ACM Conference on Reproducibility and Replicability, Rennes, France, 2024. 10.1145/3641525.3663624. [DOI] [Google Scholar]
- Golebiewski M, Bader G, Gleeson P et al. Specifications of standards in systems and synthetic biology: status, developments, and tools in 2024. Journal of Integrative Bioinformatics. 2024;21:20240015. 10.1515/jib-2024-0015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goold HD, Kroukamp H, Erpf PE et al. Construction and iterative redesign of synXVI a 903 kb synthetic Saccharomyces cerevisiae chromosome [Article]. Nat Commun. 2025;16, Article841. 10.1038/s41467-024-55318-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goshisht MK. Machine learning and deep learning in synthetic biology: key architectures, applications, and challenges. ACS Omega. 2024;9:9921–45. 10.1021/acsomega.3c05913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grinbaum A, Adomaitis L. Dual use concerns of generative AI and large language models. J Resp Innov. 2024;11:2304381. 10.1080/23299460.2024.2304381. [DOI] [Google Scholar]
- Gupta A, Lee SG, Sung BH et al. Advancing biofoundry development: strategies and challenges [Review]. Biotechnol Bioproc E. 2024;29:771–8. 10.1007/s12257-024-00121-6. [DOI] [Google Scholar]
- Gurdo N, Volke DC, McCloskey D et al. Automating the design-build-test-learn cycle towards next-generation bacterial cell factories [Article]. New Biotechnol. 2023;74:1–15. 10.1016/j.nbt.2023.01.002. [DOI] [Google Scholar]
- Gygli G, Pleiss J. Simulation foundry: automated and FAIR molecular modeling. J Chem Inf Model. 2020;60:1922–7. 10.1021/acs.jcim.0c00018. [DOI] [PubMed] [Google Scholar]
- Hadimioglu B, Stearns R, Ellson R. Moving liquids with sound: the physics of acoustic droplet ejection for robust laboratory automation in life sciences. SLAS Technology. 2016;21:4–18. 10.1177/2211068215615096. [DOI] [Google Scholar]
- Harmer ZP, McClean MN. The yeast optogenetic toolkit (yOTK) for spatiotemporal control of gene expression in budding yeast. In Optogenetics: Methods in Molecular Biology. New York, NY: Humana, 2025;2840. [Google Scholar]
- Herisson J, Hoang NA, El Sawah A et al. Operate a cell-free biofoundry using large language models. bioRxiv. 2024. 10.1101/2024.10.28.619828 [DOI] [Google Scholar]
- Hillson N, Caddick M, Cai Y et al. Building a global alliance of biofoundries. Nat Commun. 2019;10:2040. 10.1038/s41467-019-10079-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoffmann SA, Diggans J, Densmore D et al. Safety by design: biosafety and biosecurity in the age of synthetic genomics. iScience. 2023;26:106165. 10.1016/j.isci.2023.106165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoffmann SA. YeastFab cloning of toxic genes and protein expression optimization in yeast. In Schindler D. (ed.), Golden Gate Cloning: Methods in Molecular Biology. New York, NY: Humana, 2025;2850. 10.1007/978-1-0716-4220-7_24 [DOI] [Google Scholar]
- Holland K, Blazeck J. High throughput mutagenesis and screening for yeast engineering. J Biol Eng. 2022;16:37. 10.1186/s13036-022-00315-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Holowko MB, Frow EK, Reid JC et al. Building a biofoundry. Synth Biol (Oxf). 2021;6:ysaa026. 10.1093/synbio/ysaa026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Holub M, Agena E. Biofoundries and citizen science can accelerate disease surveillance and environmental monitoring [Article]. Front Bioeng Biotechnol. 2023;10:1110376. 10.3389/fbioe.2022.1110376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hou M, Xia Y, Wang P et al. High-accuracy protein complex structure modeling based on sequence-derived structure complementarity. Nat Commun. 2025;16:10182. 10.1038/s41467-025-65090-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hynek N. Synthetic biology/AI convergence (SynBioAI): security threats in frontier science and regulatory challenges. AI & SOCIETY. 2025;1–18. [Google Scholar]
- IDEA–Bio . Integrated Design Environment for Advanced Biomanufacturing. Australian Institute for Bioengineering and Nanotechnology. 2025. https://www.ideabio.org.au (20 December 2025, date last accessed). [Google Scholar]
- Jagtap UB, Jadhav JP, Bapat VA et al. Synthetic biology stretching the realms of possibility in wine yeast research [Review]. Int J Food Microbiol. 2017;252:24–34. 10.1016/j.ijfoodmicro.2017.04.006. [DOI] [PubMed] [Google Scholar]
- Jamil H, Krawetz S, Gow A. Knowledge synthesis using large language models for a computational biology workflow ecosystem. In: Proceedings of the 39th ACM/SIGAPP Symposium on Applied Computing, Avila, Spain. 2024. [Google Scholar]
- Jervis AJ, Carbonell P, Vinaixa M et al. Machine learning of designed translational control allows predictive pathway optimization in Escherichia coli. ACS Synth Biol. 2019;8:127–36. 10.1021/acssynbio.8b00398. [DOI] [PubMed] [Google Scholar]
- John MP, John MP, John MP et al. Enabling one-pot Golden Gate assemblies of unprecedented complexity using data-optimized assembly design. PLoS One. 2020;15:e0238592. 10.1371/journal.pone.0238592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Juchli D. Sila 2: the next generation lab automation standard. In Beutel S., Lenk F. (eds), Smart Biolabs of the Future. Advances in Biochemical Engineering/Biotechnology. Springer, Cham, 2022;182:147–74. 10.1007/10_2022_204 [DOI] [Google Scholar]
- Kim Gi Bae, Kim Won Jun, Kim Hyun Uk, Lee Sang Yup. Machine learning applications in systems metabolic engineering. Current Opinion in Biotechnology. 2020;64:1–9. 10.1016/j.copbio.2019.08.010 [DOI] [PubMed] [Google Scholar]
- Kim H, Hillson NJ, Cho B-K et al. Abstraction hierarchy to define biofoundry workflows and operations for interoperable synthetic biology research and applications. Nat Commun. 2025;16:6056. 10.1038/s41467-025-61263-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kim M. The Rise of Self-Driving Labs. Opt Photonics News. 2025;36:36–43. 10.1364/OPN.36.4.000036. [DOI] [Google Scholar]
- Kittleson JT, Wu GC, Anderson JC. Successes and failures in modular genetic engineering. Curr Opin Chem Biol. 2012;16:329–36. 10.1016/j.cbpa.2012.06.009. [DOI] [PubMed] [Google Scholar]
- Koray M, Koray M, Emma W et al. Standardization of synthetic biology tools and assembly methods for Saccharomyces cerevisiae and emerging yeast species. ACS Synth Biol. 2022;11:2527–47. 10.1021/acssynbio.1c00442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kugler A, Stensjö K. Machine learning predicts system-wide metabolic flux control in cyanobacteria. Metab Eng. 2024;82:171–82. 10.1016/j.ymben.2024.02.013. [DOI] [PubMed] [Google Scholar]
- Kuivanen J, Holmstrom S, Lehtinen B et al. A high-throughput workflow for CRISPR/Cas9 mediated combinatorial promoter replacements and phenotype characterization in yeast. Biotechnol J. 2018;13:e1700593. 10.1002/biot.201700593. [DOI] [Google Scholar]
- Kundu P, Beura S, Mondal S et al. Machine learning for the advancement of genome-scale metabolic modeling. Biotechnol Adv. 2024;74:108400. 10.1016/j.biotechadv.2024.108400. [DOI] [PubMed] [Google Scholar]
- Kutyna DR, Onetto CA, Williams TC et al. Construction of a synthetic Saccharomyces cerevisiae pan-genome neo-chromosome [Article]. Nat Commun. 2022;13: 3628. 10.1038/s41467-022-31305-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lal A, Gunsalus L, Gupta A et al. Polygraph: a software framework for the systematic assessment of synthetic regulatory DNA elements. Genome Biol. 2025;26:114. 10.1186/s13059-025-03584-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leavell MD, Singh AH, Kaufmann-Malaga BB. High-throughput screening for improved microbial cell factories, perspective and promise. Curr Opin Biotechnol. 2020;62:22–28. 10.1016/j.copbio.2019.07.002. [DOI] [PubMed] [Google Scholar]
- Lee DH, Kim H, Sung BH et al. Biofoundries: bridging automation and biomanufacturing in synthetic biology [review]. Biotechnol Bioproc E. 2023;28:892–904. 10.1007/s12257-023-0226-x. [DOI] [Google Scholar]
- Lee ME, DeLoache WC, Cervantes B et al. A highly characterized yeast toolkit for modular, multipart assembly. ACS Synth Biol. 2015;4:975–86. 10.1021/sb500366v. [DOI] [PubMed] [Google Scholar]
- Leins DA, Haase SB, Eslami M et al. Collaborative methods to enhance reproducibility and accelerate discovery. Digital Discovery. 2023;2:12–27. 10.1039/D2DD00061J. [DOI] [Google Scholar]
- Li S, Ma C, Yang F et al. Research on integrated technology of synthetic biology automated platform. In: 2024 International Conference on Advanced Control Systems and Automation Technologies (ACSAT). Nanjing, China, 2024;221–228. 10.1109/ACSAT63853.2024.10823805 [DOI] [Google Scholar]
- Li W, Mao Z, Xiao Z et al. Large language model for knowledge synthesis and AI-enhanced biomanufacturing. Trends Biotechnol. 2025;43:1864–1875. 10.1016/j.tibtech.2025.02.008 [DOI] [PubMed] [Google Scholar]
- Liao X, Ma H, Tang YJ. Artificial intelligence: a solution to involution of design–build–test–learn cycle. Curr Opin Biotechnol. 2022;75:102712. 10.1016/j.copbio.2022.102712. [DOI] [PubMed] [Google Scholar]
- Linshiz G, Stawski N, Goyal G et al. PR-PR: cross-platform laboratory automation system. ACS Synth Biol. 2014;3:515–24. 10.1021/sb4001728. [DOI] [PubMed] [Google Scholar]
- Lloyd MD. High-throughput screening for the discovery of enzyme inhibitors. J Med Chem. 2020;63:10742–72. 10.1021/acs.jmedchem.0c00523. [DOI] [PubMed] [Google Scholar]
- Lopez-Barbera A, Abasolo N, Torrell H et al. Integrative transcriptomic and target metabolite analysis as a new tool for designing metabolic engineering in yeast. Biomolecules. 2024;14:1536. 10.3390/biom14121536. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lu H, Xiao L, Liao W et al. Cell factory design with advanced metabolic modelling empowered by artificial intelligence. Metab Eng. 2024;85:61–72. 10.1016/j.ymben.2024.07.003. [DOI] [PubMed] [Google Scholar]
- Lu Z, Peng B, Ebert BE et al. Auxin-mediated protein depletion for metabolic engineering in terpene-producing yeast [Article]. Nat Commun. 2021;12:1051. 10.1038/s41467-021-21313-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lux MW, Strychalski EA, Vora GJ. Special issue:Reproducibility in synthetic biology. Synthetic Biology. 2023;8:ysad015. 10.1093/synbio/ysad015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lyu Y, Wu Z, Zhang L et al. Gp-gpt: large language model for gene-phenotype mapping. arXiv. 2024. arXiv2409.09825 [Google Scholar]
- Ma Y, Xiong Y, Xu J et al. Assembly and delivery of large DNA via chromosome elimination in yeast. Nat Protoc. 2025:1–28. [Google Scholar]
- Ma Y, Zhang Z, Jia B et al. Automated high-throughput DNA synthesis and assembly. Heliyon. 2024;10:e26967. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Malci K, Jonguitud-Borrego N, van der Straten Waillet H et al. ACtivE: assembly and crispr-targeted in vivo editing for yeast genome engineering using minimum reagents and time. ACS Synth Biol. 2022;11:3629–43. 10.1021/acssynbio.2c00175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mao Z, Du J, Wang R et al. SynBioGPT: a retrieval-augmented large language model platform for AI-guided microbial strain development. bioRxiv. 2025. 10.1101/2025.03.23.644789 [DOI] [Google Scholar]
- Pavan M, Setting up an automated biomanufacturing laboratory. In: Menolascina F. (ed.) Synthetic Gene Circuits. Methods of Molecular Biology. New York, NY: Humana, 2021;2229. 10.1007/978-1-0716-1032-9_5. [DOI] [Google Scholar]
- Martin HG, Radivojevic T, Zucker J et al. Perspectives for self-driving labs in synthetic biology. Curr Opin Biotechnol. 2023;79:102881. 10.1016/j.copbio.2022.102881. [DOI] [PubMed] [Google Scholar]
- Martínez-García E, Fraile S, Algar E et al. SEVA 4.0: an update of the Standard European Vector Architecture database for advanced analysis and programming of bacterial phenotypes. Nucleic Acids Res. 2023;51:D1558–67. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matzko R, Konur S. Technologies for design-build-test-learn automation and computational modelling across the synthetic biology workflow: a review. Netw Model Anal Health Inform Bioinforma. 2024;13:1–23. 10.1007/s13721-024-00455-4. [DOI] [Google Scholar]
- McLaughlin JA, Beal J, Mısırlı G et al. The synthetic biology open language (SBOL) version 3: simplified data exchange for bioengineering. Front Bioeng Biotechnol. 2020;8:1009. 10.3389/fbioe.2020.01009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McLennan A, Maslen S.“Open science” meets commercial realities: a qualitative study of factors influencing sharing in synthetic biology research in Australia. Front Bioeng Biotechnol. 2025;13:1604509. 10.3389/fbioe.2025.1604509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Menz BD, Kuderer NM, Bacchi S et al. Current safeguards, risk mitigation, and transparency measures of large language models against the generation of health disinformation: repeated cross sectional analysis. BMJ. 2024;384:e078538. 10.1136/bmj-2023-078538. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Millett P, Naik S, Otim G et al. Somatic genome editing governance approaches and regulatory capacity in different countries. SSRN J. 2023. 10.2139/ssrn.4375726 [DOI] [Google Scholar]
- Hillson NJ, Rosengarten RD, Keasling JD J5 DNA assembly design automation software. ACS Synth Biol. 2012;1:14–21. 10.1021/sb2000116. [DOI] [PubMed] [Google Scholar]
- Nault R, Cave MC, Ludewig G et al. A case for accelerating standards to achieve the FAIR principles of environmental health research experimental data. Environ Health Perspect. 2023;131:065001. 10.1289/EHP11484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nava AA, Fear AL, Lee N et al. Automated platform for the plasmid construction process. ACS Synth Biol. 2023;12:3506–13. 10.1021/acssynbio.3c00292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Navone L, Moffitt K, Behrendorff J et al. Biosensor-guided rapid screening for improved recombinant protein secretion in Pichia pastoris [Article]. Microb Cell Fact. 2023;22:92. 10.1186/s12934-023-02089-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Navone L, Vogl T, Luangthongkam P et al. Disulfide bond engineering of AppA phytase for increased thermostability requires co-expression of protein disulfide isomerase in Pichia pastoris [Article]. Biotechnol Biofuels. 2021;14:80. 10.1186/s13068-021-01936-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Niazi SK. The coming of age of AI/ML in drug discovery, development, clinical testing, and manufacturing: the FDA perspectives. Drug Des Devel Ther. 2023;17:2691–725. 10.2147/DDDT.S424991. [DOI] [Google Scholar]
- Nielsen J, Keasling JD. Engineering cellular metabolism. Cell. 2016;164:1185–97. 10.1016/j.cell.2016.02.004. [DOI] [PubMed] [Google Scholar]
- O’Neill S. AI-driven robotic laboratories show promise. Engineering. 2021;7:1351. [Google Scholar]
- Panwar J, Utharala R, Fennelly L et al. iSort enables automated complex microfluidic droplet sorting in an effort to democratize technology. Cell Reports Methods. 2023;3:100478. 10.1016/j.crmeth.2023.100478. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Patra P, Disha B, Kundu P et al. Recent advances in machine learning applications in metabolic engineering. Biotechnol Adv. 2023;62:108069. 10.1016/j.biotechadv.2022.108069. [DOI] [PubMed] [Google Scholar]
- Peng B, Nielsen LK, Kampranis SC et al. Engineered protein degradation of farnesyl pyrophosphate synthase is an effective regulatory mechanism to increase monoterpene production in Saccharomyces cerevisiae [Article]. Metab Eng. 2018;47:83–93. 10.1016/j.ymben.2018.02.005. [DOI] [PubMed] [Google Scholar]
- Peng B, Wei S. Synthetic Engineering of Microbes for Production of Terpenoid Food Ingredients. J Agric Food Chem. 2025;73:10052–68. 10.1021/acs.jafc.5c01724. [DOI] [PubMed] [Google Scholar]
- Peng B, Williams TC, Henry M et al. Controlling heterologous gene expression in yeast cell factories on different carbon substrates and across the diauxic shift: a comparison of yeast promoter activities [Article]. Microb Cell Fact. 2015;14:91. 10.1186/s12934-015-0278-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peng K, Kroukamp H, Pretorius IS et al. Yeast synthetic minimal biosensors for evaluating protein production [Article]. ACS Synth Biol. 2021;10:1640–50. 10.1021/acssynbio.0c00633. [DOI] [PubMed] [Google Scholar]
- Pham T, Tyagi A, Wang YS et al. Single-cell proteomic analysis. WIREs mechanisms of disease. 2021;13:e1503. 10.1002/wsbm.1503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pleiss J. FAIR data and software: improving efficiency and quality of biocatalytic science. ACS Catal. 2024;14:2709–18. 10.1021/acscatal.3c06337. [DOI] [Google Scholar]
- Pretorius I, Boeke J. Yeast 2.0—connecting the dots in the construction of the world’s first functional synthetic eukaryotic genome. FEMS Yeast Res. 2018;18:foy032. 10.1093/femsyr/foy032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pretorius IS. Synthetic genome engineering forging new frontiers for wine yeast [Review]. Crit Rev Biotechnol. 2017;37:112–36. 10.1080/07388551.2016.1214945. [DOI] [PubMed] [Google Scholar]
- Pretorius IS. Yeast 2.0-synthetic genome engineering pioneers new possibilities for wine yeast research. In: Yeasts in the Production of Wine. Springer; New York, 2019;211–27. 10.1007/978-1-4939-9782-4_6. [DOI] [Google Scholar]
- Qiao C, Chen Y, Guo Q et al. Understanding science data literacy: a conceptual framework and assessment tool for college students majoring in STEM. IJ STEM Ed. 2024;11:25. 10.1186/s40594-024-00484-5. [DOI] [Google Scholar]
- Qu Y, Huang K, Cousins H et al. CRISPR-GPT: an LLM Agent for Automated Design of Gene-Editing Experiments. bioRxiv. 2024. 10.1101/2024.04.25.591003 [DOI] [Google Scholar]
- Quinn JY, Cox RS III, Adler A et al. SBOL visual: a graphical language for genetic designs. PLoS Biol. 2015;13:e1002310. 10.1371/journal.pbio.1002310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Radivojević T, Costello Z, Workman K et al. A machine learning Automated Recommendation Tool for synthetic biology. Nat Commun. 2020;11:4879. 10.1038/s41467-020-18008-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramos MC, Collison CJ, White AD. A review of large language models and autonomous agents in chemistry. Chem Sci. 2024;16:2514–72., 10.1039/d4sc03921a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruan Y, Lu C, Xu N et al. An automatic end-to-end chemical synthesis development platform powered by large language models. Nat Commun. 2024;15:10160. 10.1038/s41467-024-54457-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rupp N, Ries R, Wienbruch R et al. Can I benefit from laboratory automation? A decision aid for the successful introduction of laboratory automation. Anal Bioanal Chem. 2024;416:5–19. 10.1007/s00216-023-05038-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sakharova H, Lareau LF. Protein language models reveal evolutionary constraints on synonymous codon choice. bioRxiv. 2025. 10.1101/2025.05.13.653614 [DOI] [Google Scholar]
- Saleski TE, Peng H, Lengger B et al. High-throughput G protein-coupled receptor-based autocrine screening for secondary metabolite production in yeast [Article]. Biotech & Bioengineering. 2024;121:3283–96. 10.1002/bit.28797. [DOI] [Google Scholar]
- Salunke S, Wasmate D, Bawage S. Automation in pharmaceutical industry. World J Pharm Res. 2022;11:1864–77. [Google Scholar]
- Samek W, Wiegand T, Muller KR Explainable artificial intelligence: understanding, visualizing and interpreting deep learning models. arXiv. 2017. 10.48550/arXiv.1708.08296 [DOI] [Google Scholar]
- Scott LH, Mathews JC, Filipovska A et al. Building artificial genetic circuits to understand protein function. In Shukla AK (ed.), Methods in Enzymology. Academic Press Inc, 2020;633:231–50. [Google Scholar]
- Scott LH, Mathews JC, Flematti GR et al. An artificial yeast genetic circuit enables deep mutational scanning of an antimicrobial resistance protein [article]. ACS Synth Biol. 2018;7:1907–17. 10.1021/acssynbio.8b00121. [DOI] [PubMed] [Google Scholar]
- Shaw WM, Khalil AS, Ellis T. A multiplex MoClo toolkit for extensive and flexible engineering of Saccharomyces cerevisiae. ACS Synth Biol. 2023;12:3393–405. 10.1021/acssynbio.3c00423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shen Q, Yan F, Li YW et al. Expansion of YALIcloneHR toolkit for Yarrowia lipolytica combined with Golden Gate and CRISPR technology. Biotechnol Lett. 2024;46:37–46. 10.1007/s10529-023-03444-1. [DOI] [PubMed] [Google Scholar]
- Shen Y, Stracquadanio G, Wang Y et al. SCRaMbLE generates designed combinatorial stochastic diversity in synthetic chromosomes. Genome Res. 2016;26:36–49. 10.1101/gr.193433.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shi S, Chen Y, Nielsen J. Metabolic engineering of yeast. Annu Rev Biophys. 2025;54:101–20. 10.1146/annurev-biophys-070924-103134. [DOI] [PubMed] [Google Scholar]
- Shiraishi K, Arima Y, Nakamura M et al. A novel fluorescence-activated cell sorting (FACS)-based screening identified ATG14, the gene required for pexophagy in the methylotrophic yeast. FEMS Yeast Res. 2024;24:foae022. 10.1093/femsyr/foae022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simmons BL, McDonald ND, Robinett NG. Assessment of enzymatically synthesized DNA for gene assembly. Front Bioeng Biotechnol. 2023;11:1208784. 10.3389/fbioe.2023.1208784. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Skrekas C. CRISPR-based Technologies for High Throughput Metabolic Engineering of Yeast. Gothenburg, Sweden: Chalmers University of Technology, 2022. [Google Scholar]
- Smolke CD. Building outside of the box: iGEM and the BioBricks Foundation. Nat Biotechnol. 2009;27:1099–102. 10.1038/nbt1209-1099. [DOI] [PubMed] [Google Scholar]
- Song Y, Xu H, Li C et al. HMT: hybrid mechanistic transformer for bio-fabrication prediction under complex environmental conditions. Expert Syst Appl. 2024;255:124389. 10.1016/j.eswa.2024.124389. [DOI] [Google Scholar]
- Strzelecki P, Joly N, Hébraud P et al. Enhanced Golden Gate Assembly: evaluating overhang strength for improved ligation efficiency. Nucleic Acids Res. 2024;52:e95–. 10.1093/nar/gkae809. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suri A, Hu KKY, Younas T et al. Functionalizing yeast lipid droplets as versatile biomaterials [article]. Small. 2024;20:2308463. 10.1002/smll.202308463. [DOI] [Google Scholar]
- Taddese AA, Addis AC, Tam BT. Data stewardship and curation practices in AI-based genomics and automated microscopy image analysis for high-throughput screening studies: promoting robust and ethical AI applications. Hum Genomics. 2025;19:16. 10.1186/s40246-025-00716-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tancheva G, Hongisto V, Patyra K et al. High-throughput screening data generation, scoring and FAIRification: a case study on nanomaterials. J Cheminform. 2025;17:59. 10.1186/s13321-025-01001-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thuronyi BW, DeBenedictis EA, Barrick JE. No assembly required: time for stronger, simpler publishing standards for DNA sequences. PLoS Biol. 2023;21:e3002376. 10.1371/journal.pbio.3002376. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thygesen P. Regulation of genome edited organisms in Australia. Transgenic Res. 2024;33:545–50. 10.1007/s11248-024-00411-y. [DOI] [PubMed] [Google Scholar]
- Tobias AV, Wahab A. Autonomous ‘self-driving’laboratories: a review of technology and policy implications. R Soc Open Sci. 2025;12:250646. 10.1098/rsos.250646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Torres-Acosta MA, Bozeat P, Ceballos Rodriguez-Conde F et al. Practical considerations for the high-level automation of a biosciences research laboratory. Biochem Eng J. 2024;201: 109154. 10.1016/j.bej.2023.109154. [DOI] [Google Scholar]
- Trevisan AH, Acerbi F, Dukovska-Popovska I et al. Skills for the twin transition in manufacturing: a systematic literature review. J Cleaner Prod. 2024;474:143603. 10.1016/j.jclepro.2024.143603. [DOI] [Google Scholar]
- Trump BD, Siharulidze G, Cummings CL. Synthetic biology and risk regulation: the case of Singapore. In: Trump B., Cummings C., Kuzma J., Linkov I. (eds), Synthetic Biology 2020: Frontiers in Risk Analysis and Governance. Risk, Systems and Decisions. Springer, Cham, 2020;297–312. 10.1007/978-3-030-27264-7_13 [DOI] [Google Scholar]
- Tu R, Mao Y, Liu Y et al. Advances in automated high-throughput editing and screening of engineered strains [Article]. Shengwu Gongcheng Xuebao/Chin J Biotechnol. 2022;38:4162–79. 10.13345/j.cjb.220582. [DOI] [Google Scholar]
- Undheim TA. The whack-a-mole governance challenge for AI-enabled synthetic biology: literature review and emerging frameworks. Front Bioeng Biotechnol. 2024;12:1359768. 10.3389/fbioe.2024.1359768. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vegh P, Chapman E, Gilmour C et al. Modular DNA construct design for high-throughput golden gate assembly. In Schindler D (eds), Golden Gate Cloning. Methods in Molecular Biology. New York, NY: Humana, 2025;2850:61–77. 10.1007/978-1-0716-4220-7_4 [DOI] [Google Scholar]
- Vegh P, Donovan S, Rosser S et al. Biofoundry-scale DNA assembly validation using cost-effective high-throughput long-read sequencing [article]. ACS Synth Biol. 2024;13:683–6. 10.1021/acssynbio.3c00589. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vickers CE, Freemont PS. Pandemic preparedness: synthetic biology and publicly funded biofoundries can rapidly accelerate response time. Nat Commun. 2022;13:453. 10.1038/s41467-022-28103-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vidal G, Vitalis C, Guillén J. Standardized golden gate assembly metadata representation using SBOL. In Schindler D (eds), Golden Gate Cloning. Methods in Molecular Biology. Humana, New York, NY, 2025;2850:89–104. [Google Scholar]
- Volk AA, Epps RW, Yonemoto DT et al. AlphaFlow: autonomous discovery and optimization of multi-step chemistry using a self-driven fluidic lab guided by reinforcement learning. Nat Commun. 2023;14:1403. 10.1038/s41467-023-37139-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang S, Zhao F, Yang M et al. Metabolic engineering of Saccharomyces cerevisiae for the synthesis of valuable chemicals. Crit Rev Biotechnol. 2024;44:163–90. 10.1080/07388551.2022.2153008. [DOI] [PubMed] [Google Scholar]
- Watkins A, McCarthy A, Holland C et al. Public biofoundries as innovation intermediaries: the integration of translation, sustainability, and responsibility [article]. J Technol Transf. 2024;49:1259–86. 10.1007/s10961-023-10039-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Welcoming protocols.io . Nat Protoc. 2024;19:2527. 10.1038/s41596-024-01012-z. [DOI] [Google Scholar]
- Wierenga RP, Golas SM, Ho W et al. PyLabRobot: an open-source, hardware-agnostic interface for liquid-handling robots and accessories. Device. 2023;1:100111. 10.1016/j.device.2023.100111. [DOI] [Google Scholar]
- Wittmann BJ, Alexanian T, Bartling C et al. Strengthening nucleic acid biosecurity screening against generative protein design tools. Science. 2025;390:82–87. 10.1126/science.adu8578. [DOI] [PubMed] [Google Scholar]
- Wives AP, Mendes ISdM, Santos STd et al. Molecular techniques employed in CTG (Ser1) and CTG (Ala) D-xylose metabolizing yeast clades for strain design and industrial applications. arXiv. 2024. 10.48550/arXiv.2405.11010 [DOI] [Google Scholar]
- Wu X, Wan X, Yu H et al. Recent advances in CRISPR-Cas system for Saccharomyces cerevisiae engineering. Biotechnol Adv. 2025;81:108557. 10.1016/j.biotechadv.2025.108557. [DOI] [PubMed] [Google Scholar]
- Xu S, Meng J, Zhang Q et al. CILF: cRISPR/Cas9 based integration of large DNA fragments in Saccharomyces cerevisiae. Biotech Bioeng. 2024;121:3906–11. 10.1002/bit.28830. [DOI] [Google Scholar]
- Yang B, Misirli G, Wipat A et al. Modelling the fitness landscapes of a SCRaMbLEd yeast genome [Article]. Biosystems. 2022;219, Article104730. 10.1016/j.biosystems.2022.104730. [DOI] [PubMed] [Google Scholar]
- Yang G, Chen Y, Guo Q et al. Leveraging pre-trained AI models for robust promoter sequence design in synthetic biology. swwlxb. 2025;11:1. 10.52601/bpr.2025.240033. [DOI] [Google Scholar]
- Yang H, Li J, Lim KZ et al. Automatic strain sensor design via active learning and data augmentation for soft machines. Nat Mach Intell. 2022;4:84–94. 10.1038/s42256-021-00434-8. [DOI] [Google Scholar]
- Yonet N, Ari Yuka S, Turker M et al. Comparative analysis of retrosynthesis applications for predicting of pathways for plant Secondary metabolite production. Biotechnol Biotechnol Equip. 2024;38:2431041. [Google Scholar]
- Yook S, Alper HS. Recent advances in genetic engineering and chemical production in yeast species. FEMS Yeast Res. 2025;25:foaf009. 10.1093/femsyr/foaf009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang J, Chen Y, Fu L et al. Accelerating strain engineering in biofuel research via build and test automation of synthetic biology. Curr Opin Biotechnol. 2021;67:88–98. 10.1016/j.copbio.2021.01.010. [DOI] [PubMed] [Google Scholar]
- Zhang J, Petersen SD, Radivojevic T et al. Combining mechanistic and machine learning models for predictive engineering and optimization of tryptophan metabolism. Nat Commun. 2020;11:4880. 10.1038/s41467-020-17910-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Q, Chen W, Qin M et al. Integrating protein language models and automatic biofoundry for enhanced protein evolution [Article]. Nat Commun. 2025;16:1553. 10.1038/s41467-025-56751-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Z-X, Wang L-R, Xu Y-S et al. Recent advances in the application of multiplex genome editing in Saccharomyces cerevisiae. Appl Microbiol Biotechnol. 2021;105:3873–82. 10.1007/s00253-021-11287-x. [DOI] [PubMed] [Google Scholar]
- Zhang Z, Guo Q, Wang Y et al. High-throughput screening of microbial strains in large-scale microfluidic droplets. Front Bioeng Biotechnol. 2023;11:1105277. 10.3389/fbioe.2023.1105277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao H, Yoshikuni Y. Editorial overview: energy biotechnology. Current Opinion in Biotechnology. 2021;67:iii–v. [DOI] [PubMed] [Google Scholar]
- Zhao Y, Coelho C, Hughes AL et al. Debugging and consolidating multiple synthetic chromosomes reveals combinatorial genetic interactions. Cell. 2023;186:5220–5236.e16. e5216. 10.1016/j.cell.2023.09.025. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
