Skip to main content
Frontiers in Artificial Intelligence logoLink to Frontiers in Artificial Intelligence
. 2026 Apr 8;9:1739303. doi: 10.3389/frai.2026.1739303

The transformative impact of AI-enabled AlphaFold 3: evolution, current status, and future prospects in structural biology

Chiranjib Chakraborty 1,2,*, Manojit Bhattacharya 3, Sang-Soo Lee 4,*
PMCID: PMC13099841  PMID: 42027778

Abstract

The AlphaFold (AF) initiative profoundly impacted structural biology, evidenced by its 2024 Nobel Prize. AlphaFold progressed from AF1 to AF2, which achieved near-experimental accuracy in single-chain protein folding, and then to AF3, expanding predictions to protein-ligand, protein-nucleic acid, and protein–protein complexes. This evolution led to the widespread adoption of AF tools, expanded structural coverage, and greater accessibility through the AlphaFold Database (AFDB), accelerating translational research, especially in structure-based drug discovery (SBDD) and the study of complex macromolecular assemblies. AF1 uses deep neural networks (DNNs), AF2 employs the Evoformer to model evolutionarily related sequences, and AF3 applies the Pairformer for pairwise amino acid interactions. The main differences between AF versions are architectural. Remaining challenges include predicting protein dynamics and multiple conformational states. This review first outlines AlphaFold’s architectural evolution, then explores the post-AlphaFold landscape and its global impact, discusses translational research applications, and addresses limitations and future directions. Despite challenges, AlphaFold is poised to further advance structural biology, particularly in biotechnology and medicine.

Keywords: AlphaFold, CASP, protein structure prediction, revolution in application, structural biology

1. Introduction

One of the major challenges in structural biology is determining the three-dimensional conformation of proteins, which defines their function. The field advanced in the 1950s with techniques like X-ray crystallography (Shi, 2014), culminating in John Kendrew’s determination of the first protein structure (myoglobin) in 1958. In 1962, Max Perutz and John Kendrew received the Nobel Prize for their pioneering work (Jaskolski et al., 2014). Increasing structural data led to the establishment of the Protein Data Bank (PDB) at Brookhaven National Laboratory in 1971, which became the central global archive for protein structures (Figure 1). The open-access model of the PDB fostered broad research usage and critical data sharing (Kurisu, 2022; Berman and Burley, 2025). Long-term support by the NSF, NIH, and DOE, with NIH as the main contributor, ensured PDB’s impact on subsequent advances, including AlphaFold (Burley et al., 2022; Kurisu, 2022). Despite the PDB’s growth, the gap between known sequences and solved structures widened due to the limitations of experimental methods. This led to the launch of CASP in 1994, a biennial challenge for computational prediction of protein structure (Staples et al., 2019). Traditional methods, mainly homology modeling and ab initio methods, relied on metrics such as the Global Distance Test (GDT) to measure model accuracy (Kryshtafovych et al., 2019; Zemla, 2003; Bieri and Kiefhaber, 1999; Li et al., 2016). By 2016, these techniques reached a GDT of about 40/100 for the most difficult targets, showing major computational limitations (Anishchenko et al., 2014; Hameduh et al., 2020; Wuyun et al., 2024). A breakthrough arrived when Google DeepMind’s AlphaFold, introduced at CASP13 in 2018, used deep learning and coevolutionary information to accurately predict protein structures, ranking first in the Free Modeling category (AlQuraishi, 2019; Senior et al., 2020).

Figure 1.

Horizontal timeline graphic illustrating major milestones in protein structure prediction. Key events include the 1958 first 3D protein structure, establishment of Protein Data Bank in 1971, CASP in 1994, AlphaFold and RosettaFold developments from 2018 to 2024, Nobel Prizes in 1962 and 2024, and release of AlphaFill and ESMFold.

The figure illustrates the timeline of structural biology development, from the inception of the Protein Data Bank (PDB) to the emergence of AlphaFold.

In 2020, AlphaFold 2 (AF2) by DeepMind showed excellent accuracy in the CASP14 competition. Experts stated that AF2 achieved high accuracy in predicting single-chain protein structures, with a GDT score of about 92.4%. This breakthrough has revolutionized structural biology research (Perrakis and Sixma, 2021; Callaway, 2022; Editorial, 2022). As a result, protein structure prediction has since shifted toward more complex structures. Researchers consequently focus on challenges such as predicting protein interactions, molecular binding, and conformational changes, which are actively shaping the field’s future. In this review, we first outline AlphaFold’s architectural trajectory from co-evolution to universal modelling and discuss the post-AlphaFold landscape and its global impact. We then illustrate AlphaFold’s translational research applications. Finally, we discuss limitations, next-generation challenges, and the future trajectory.

2. AlphaFold: from co-evolution to universal modelling

Each version of AlphaFold embodies a distinct architectural strategy, meticulously designed to address increasingly complex challenges in biomolecular prediction. We have described each version of AlphaFold, which is as follows:

2.1. AlphaFold 1 (CASP13): deep learning (DL) for contact prediction

AlphaFold 1 (AF1) aimed to predict which parts of a protein are physically close in the CASP13 competition. The researchers used two main approaches: first, they compared many similar protein sequences to identify regions that change together, suggesting these parts likely interact in the 3D structure. Second, they used deep learning methods to study these relationships. Then, they turned this data into maps showing which parts of the protein are close together (Hou et al., 2019). AF1 built the final 3D shapes using these advanced algorithms. This success shows how artificial intelligence can help us understand protein structures (AlQuraishi, 2019; Hou et al., 2019).

2.2. AlphaFold 2 (CASP14): deep learning (DL) revolution

The AF2 was significantly improved from the previous version. It utilized a DNN to generate protein structures from amino acid (AA) sequences. At the same time, it also utilizes AF3 capabilities, multiple sequence alignments (MSA), and related proteins (Pratt et al., 2025). The central part of the analysis was performed using Evoformer (Jumper et al., 2021b). It worked with the MSA data by assigning different weights to different areas, which helped manage complex, long-range connections within the structure. This mechanism, known as attention, allows the Evoformer to focus computational resources by assigning weights to different parts of the input data. Specifically, Evoformer uses axial or criss-cross attention, which applies row-wise gated self-attention within each sequence and column-wise gated self-attention across sequences in the MSA. This approach enables the model to prioritize relevant information when analyzing data. The performance of AF2 was assessed using the TM-score, which ranges from 0 to 1. Usually, a score exceeding 0.4 signifies structural similarity (Schlievert et al., 1996; Zhang and Skolnick, 2005; Skolnick et al., 2021; Pratt et al., 2025). AF2 achieved a remarkable 92.4 GDT, and it can generate models nearly as precise as those derived from experimental methods (Skolnick et al., 2021). These accurate models were promptly integrated into traditional experiments. It facilitates the determination of structures in X-ray crystallography by providing a useful starting point for molecular replacement. In addition to X-ray crystallography-based structures in PDB, NMR structures in PDB were also included, which caused a complication. These structures are essential for the rapid growth of structural data. However, it introduces complications primarily because NMR relies on a previously known, similar structure as a template (the search model) to determine an unknown structure (Abergel, 2013; Scapin, 2013).

There has been a recent protein structural revolution, driven by cryo-EM. About 40% of the new structures deposited into the PDB from 2024 to 2025 were obtained using this technique (Rubach et al., 2025; Berman and Burley, 2025; RCSB PDB, 2026). Similarly, NMR structures and cryo-EM-based structures have also been instrumental in creating or training using AF. Therefore, the combined impact of AI-driven modeling (such as AF) and experimental techniques, including cryo-EM, X-ray crystallography, and NMR. It has led to a record-breaking number of new structures in 2025–2026.

TM-score and GDT are related but distinct metrics for protein structure similarity. The TM-score ranges from 0 to 1. It is widely used for pairwise structural comparison and is less sensitive to protein length (Xu and Zhang, 2010). GDT, most often reported as GDT_TS (%), is the standard metric in CASP experiments for predicting structure accuracy (Moult et al., 2003). While TM-score often measures overall agreement, AlphaFold2’s CASP14 performance was measured using GDT_TS. AF2 achieved a median of about 92.4, indicating near-experimental accuracy (Jumper et al., 2021b). We shift from TM-score to GDT here to align with the official CASP14 evaluation, not to introduce inconsistency.

2.3. AlphaFold 3: universal prediction of biomolecular interactions

Google DeepMind and Isomorphic Labs released AlphaFold 3 (AF3) on May 8, 2024. It represents a substantial advancement in both its design and scope. AF3 can model the structures of complexes that include not only proteins but also small molecules, ions, DNA/RNA, and modified residues (Fang et al., 2025; Hennig, 2025; Krokidis et al., 2025). This shift toward a comprehensive model for all molecular entities underscores the idea that molecular function depends on interactions rather than solely on individual protein structures. In the architecture, AF2 uses the Evoformer module. At the same time, in the architecture, AF3 uses a simplified module called Pairformer. This modification facilitates the processing of extensive MSAs while maintaining or enhancing prediction accuracy, particularly for complex structures (Abramson et al., 2024; Krokidis et al., 2025). By implementing token-based spatial reasoning across various molecular types, the system achieves greater generality and efficiency in modelling interactions. Although tokens are not a major novelty of AF3, they were already used in AF2. The novelty of AF3 is Pairforme. It replaces the Evoformer with a “Pairformer” module. It operates exclusively on token pairs and employs an “Atom Transformer” (also known as Atom Attention Encoder) to manage interactions between atoms. The advancement AF3 has led to the discovery of novel pharmaceuticals by utilizing the unsolved structure of drug targets. Actually, at present, drug discovery is increasingly using drug targets and their structures. If the drug target’s structure is unavailable, the drug discovery process is hindered, and the researcher may use molecular modeling to build a model of the drug target. In this case, AF3 has solved several previously unsolved protein structures, which can serve as drug targets. AF presents a big promise, but it has not led to new drugs to date. Several AF-inspired projects have been developed in this direction (pre-clinical, clinical) (Table 1). Similarly, AF3 demonstrates exceptional proficiency in predicting drug interactions, including the binding of proteins to other molecules and the binding of antibodies to target proteins (Desai et al., 2024; Shi, 2024). A PoseBusters assessment is an evaluation of predicted protein-ligand binding positions. It understands the priority of binding positions in terms of physical and chemical feasibility, as well as geometric accuracy (Buttenschoen et al., 2024). In the challenging PoseBusters assessment, AF3 demonstrated a 50% improvement in accuracy over the most advanced traditional physics-based methods. Without any structural input, this was performed (Krokidis et al., 2025). PoseBusters is a Python package. It performs a series of standard quality checks. These checks use the well-established cheminformatics toolkit RDKit. The test suite confirms the chemical and geometric consistency of a ligand. It includes its stereochemistry (Buttenschoen et al., 2024). In this experiment, PoseBusters was used. Without any structural input, this was performed. The protein-ligand complex was predicted solely from the protein and ligand sequences and chemical SMILES strings. It was not predicted by docking into a known receptor structure (Buttenschoen et al., 2024; Lee et al., 2025a). It was the first example of an AI using physics-based tools to predict biomolecular structures in this domain, demonstrating its ability to predict protein-ligand binding positions and achieve geometric accuracy. The progression of AlphaFold across its three major versions is summarized in Table 2.

Table 1.

AlphaFold-inspired projects related to pharmaceuticals and their status.

Sl. No Project (pharmaceuticals) Stage/status Reference
1. Insilico Medicine – CDK20 inhibitor Pre-clinical (Ren et al., 2023)
2. Viral target screening (e.g., NSP6, SARS-CoV-2) Pre-clinical (Desai et al., 2024)
3. Academic/de novo design tools (PCMol, generative methods) Early discovery (Bernett et al., 2024)

Table 2.

Comparative evolution and benchmarks of AlphaFold systems.

AlphaFold version CASP participation Key architectural innovation Primary prediction scope Benchmark achievement
AlphaFold 1
(AF1)
CASP13 (2018) DNN, Co-evolutionary couplings, contact maps Single protein chains (initial fold modeling) Ranked first in free modeling (FM)
AlphaFold 2
(AF2)
CASP14 (2020) End-to-end DL, evoformer, high-resolution refinement Single protein chains, multimers Highest score achieved (92.4 GDT)
AlphaFold 3
(AF3)
Released 2024 Pairformer, unified model of life’s molecules Proteins, DNA, RNA, ligands, ions, modified residues 50% more accurate than best traditional methods (PoseBusters)

3. Comparison between AlphaFold 1 (AF1), AlphaFold 2 (AF2), and AlphaFold 3 (AF3)

AlphaFold has evolved into three versions: AF1, AF2, and AF3. To trace their progression, AF1 was demonstrated at CASP13 in 2018, AF2 at CASP14 in 2020, and AF3 in 2024. Starting with AF1, deep learning was used to predict protein structures with moderate accuracy. In 2018, AF1 introduced a deep learning approach using convolutional neural networks to predict inter-residue distances and torsion angles. Although it achieved high accuracy in the CASP13 competition, it remained limited (AlQuraishi, 2019; Peng et al., 2025).

Building on advances in AF1, AF2 achieved greater accuracy and solved the protein folding problem for single proteins. In 2020, it revolutionized the field with its transformer-based architecture and a median GDT score above 90. This breakthrough earned recognition in structural biology (Jumper et al., 2021a). AF3 further expanded the technology to predict complex interactions among proteins, DNA, RNA, and small molecules. In 2024, AF3 was extended to predict the structures of biomolecular assemblies, including protein–DNA, protein–RNA, protein–ligand interactions, and post-translational modifications. AF3 also uses a diffusion-based model, directly generating atomic coordinates to determine exact atom positions (Abramson et al., 2024).

In summary, each AlphaFold version is built on its predecessor. AF1 used deep neural networks. AF2 employed the Evoformer, designed to model related protein sequences. AF3 used the Pairformer, a model developed to capture pairwise interactions between amino acid residues. All AlphaFold versions applied deep neural networks to generate structures from amino acid sequences; their main difference was architecture. AF1 showed deep learning could predict protein structures. AF2 improved on AF1’s accuracy with the same approach. AF3 further extended these capabilities to predict interactions between different molecules (Table 2).

4. The post-AlphaFold landscape and global impact

AlphaFold, particularly AF2, has rapidly transformed the field of structural biology. During the era of AlphaFold (AF1/AF2/AF3), several AlphaFold-associated tools have been developed and applied in various areas of biological science, particularly in structural biology, by researchers across different fields (Table 3). It has significantly influenced the way we associate sequences with structures.

Table 3.

AlphaFold and different AlphaFold-associated tools and their different applications used by researchers for structural biology from time to time.

Sl. No. Name of the tool Category/type Remarks Reference
1. ModelArchive Database The public repository for depositing 3D protein structure models Tauriello et al. (2025)
2. AlphaBridge Evaluation and visualization It designed to analyze and post-process predicted macromolecular complex models to deduce and visualize interaction interfaces Álvarez-Salmoral et al.(2024)
3. SubtiWiki Structure comparison The curated, community-oriented wiki/database focused on the Bacillus subtilis model organism Elfmann et al. (2025)
4. ColabFold Structure prediction The accessible interface (Google Colab notebook + command-line tool) for running AlphaFold2 (and related models) more easily and efficiently Mirdita et al. (2022)
5. AlphaFold clusters Structure comparison The sets of protein structures from the AlphaFold Protein Structure Database that share similar folds or overall 3D shapes Barrio-Hernandez et al. (2023)
6. PAE viewer Visualization and evaluation The interactive visualization tool used to explore the confidence and relative accuracy of AlphaFold protein structure predictions Elfmann and Stulke (2023)
7. AlphaFold server Structure prediction The public web platform that allows researchers to generate protein structure predictions using DeepMind’s AlphaFold2 system Abramson et al. (2024)
8. Foldseek search Structure comparison The fast, structure-based search tool that allows to compare 3D protein structures — similar to how BLAST compares sequence van Kempen et al. (2024)
9. AlphaFold DB Database The public database that provides AI-predicted 3D structures of proteins for nearly all known sequences in nature Varadi et al. (2024)

4.1. Structural coverage expansion

AlphaFold has changed the structural landscape of the human proteome, increasing structural coverage. Research demonstrates that the structural coverage of all human protein residues increased from 48% before AlphaFold’s implementation to 76% thereafter (Porta-Pardo et al., 2022). Structural coverage is the key finding of this research work. Here, using three-dimensional coordinate files from the PDB, the coverage of the human proteome was evaluated. In this context, structural coverage refers to the percentage of the human proteome for which we have known or predicted three-dimensional protein structures. More specifically, it measures the proportion of all human proteins (or protein residues) that have available structural information, estimated either from experimental structures determined by techniques such as X-ray crystallography or cryo-EM, or from computational models (homology modelling) or AI predictions (from AF). The study’s key finding is that structural coverage increased from 48 to 76% when AF predictions were included. While the “dark proteome” (proteins without any structural information) decreased from 26% to just 10%. At the protein level, the number of human proteins completely lacking structural coverage dropped from 5,027 to just 29 proteins after AF’s release. Essentially, structural coverage is a measure of how much of the proteome we can visualize and understand at the three-dimensional level. It is critical for understanding protein function, disease mechanisms, and drug design. This augmentation represents a significant advancement in fundamental biological knowledge. The number of high-quality models has significantly increased. Models exhibiting a sequence identity of 50% or greater with a Protein Data Bank (PDB) chain have risen from 31 to 50%. AlphaFold has nearly eliminated the “dark proteome.” The “dark proteome” refers to the collection of protein regions and entire proteins whose three-dimensional (3D) structures are largely unknown and cannot be determined experimentally or computationally, such as through homology modeling.

In structural biology, the number of unsolved human proteins has dropped dramatically, representing a revolution in the field. Obtaining, interpreting, and validating a structure once took years, but now researchers can focus more on protein function, interactions, and dynamics (Alderson et al., 2023).

4.2. Data democratization and the AlphaFold database (AFDB)

Data democratization is the process of making data accessible to everyone, so it can be utilized effectively and efficiently. This approach empowers individuals to make informed decisions (Lefebvre et al., 2021; Marcu et al., 2022; Mseer and Al Makhzoumi, 2025), highlighting an urgent need for data and knowledge democratization (Dessimoz and Thomas, 2024). AlphaFold exemplifies this by making structural data more accessible, with data availability as a core component. The AlphaFold Protein Structure Database (AFDB) began in 2021 with 300,000 structures and has grown to over 214 million predicted protein structures (Varadi et al., 2022; Varadi et al., 2024), making it 500 times larger and comprising a significant portion of the UniProt repository. AFDB is an open-access online repository that provides detailed protein structures to scientists worldwide, fostering a sustainable and innovative research landscape. For instance, Nji et al. (2025) indicate it has a positive impact on African healthcare research, supporting researchers in evaluating model accuracy.

Additionally, AFDB offers atomic-level details that greatly benefit drug development and disease research (Varadi and Velankar, 2023; Chang et al., 2024; Desai et al., 2024). Used by over 2 million individuals in 190 countries, the library saves researchers significant time and financial resources. This diverse information accelerates scientific discoveries, especially given the dramatic increase in data availability observed from the pre- to post-AlphaFold eras, as shown in Table 4.

Table 4.

Quantitative impact on human structural coverage during the pre-AlphaFold era and the post-AlphaFold era.

Metric Pre-AlphaFold (Approximate) Post-AlphaFold (AFDB 2024) Significance
Human proteome coverage (All residues) 48% 76% Major expansion of structural knowledge base.
Proteins with no structural coverage 5,027 29 Near elimination of “dark” disease targets.
AFDB structure count 300 k (2021 initial release) >214 million 500-fold expansion, enabling global access.
High-quality coverage (≥50% Seq. ID) 31% 50% Increase in clinically relevant, accurate models.
Reference Porta-Pardo et al. (2022), Varadi and Velankar (2023), and Varadi et al. (2024) Porta-Pardo et al. (2022), Varadi and Velankar (2023), and Varadi et al. (2024) -

4.3. The association between the AlphaFold, AFDB, and UniProt

While calling AlphaFold “democratizing access to structural data” captures its broad impact, it also obscures the specific methodological advances behind this shift. The AlphaFold Protein Structure Database (AFDB) provides open access to hundreds of millions of high-accuracy predicted protein structures (Jumper et al., 2021b). It is now more closely aligned with UniProt sequence releases. This change embeds structural models within the primary sequence knowledgebase used across biology and improves integration with common data resources (Varadi et al., 2024). AFDB predictions are now cross-referenced with UniProt and related resources, improving their usability for functional annotation and bioinformatics workflows. The open-source release of the AlphaFold code also allows users to run predictions on novel custom sequences beyond those already in AFDB. This has greatly expanded practical access to structure prediction (Lang et al., 2025).

4.4. The 2024 Nobel prize in chemistry was given to AlphaFold

AlphaFold’s scientific contribution was acknowledged when its developers, Demis Hassabis and John Jumper of Google’s DeepMind, were awarded half of the Nobel Prize (Chemistry) for their work in “predicting protein structures.” The award, shared with David Baker for his work in “designing proteins using computers” (Callaway, 2024; Graham, 2024; Najar Najafi et al., 2025), highlights AlphaFold’s capabilities and innovative protein design approach. AlphaFold’s accurate predictions help researchers design novel proteins efficiently and demonstrate the importance of integrating extensive public data resources, such as the PDB, with new AI developments. This recognition may trigger the evolution of biology with AI models (Abriata, 2024).

In addition, David Baker was recognized for his achievements in protein design, a computational method that is part of AlphaFold. He developed a competing approach, called RoseTTAFold, for computational protein design (Mansoor et al., 2023).

5. Alphafold’s translational research applications

AlphaFold serves as an essential platform that facilitates biological and medical discoveries across a wide range of applied fields (Figure 2).

Figure 2.

Diagram illustrating AI-enabled AlphaFold at the center of a neural network, with arrows pointing to six applications: drug discovery and design, peptides, protein dynamics, clinical study, mutation study, protein complex, and ligand and cofactor.

AlphaFold applications in different translational research in the field of biological and medical discovery.

5.1. AlphaFold helps in the advancements in structure-based drug discovery (SBDD)

The detailed shape of a protein is key to designing drugs. AlphaFold has quickly given high-quality structure predictions for many hard-to-study targets. Now, these targets can be used for structure-based drug research (Jumper et al., 2021b; Nussinov et al., 2023; Guo et al., 2024).

Recently, researchers found that AlphaFold achieved a 60% hit rate when predicting protein structures for TAAR1 agonists, outperforming conventional methods. In this context, a hit was defined as a tested compound that showed agonist activity. AlphaFold’s virtual screen identified 30 top-ranked compounds, of which 18 displayed agonist activity, with potencies ranging from 12 to 0.03 μM. In contrast, traditional homology modeling identified 32 top compounds, but only 7 were active, yielding a 22% hit rate. These results indicate AlphaFold’s clear advantage in hit identification. However, while AF2 models predict binding pockets with high accuracy, their ability to predict ligand-binding poses via computational docking is comparable to that of traditional methods and lower than that of docking to experimentally determined structures. This poses challenges for structure–activity relationship (SAR) studies and lead optimization, as accurately predicting subtle structural modifications remains difficult. Additionally, AlphaFold models are limited by their lack of structural plasticity, typically offering only one conformational state, which may not reflect ligand-bound forms. Therefore, further structure refinement is needed. Despite these limitations, AlphaFold remains a valuable tool because it generates predictions even for proteins with low sequence identity to known templates, particularly for those with less than 20% sequence similarity.

AlphaFold3 enriches the drug design landscape by predicting interactions among ligands, ions, and antibodies. It is used to assess variations in binding energy and evaluate the impact of mutations on protein interactions, which are helpful in developing novel therapeutic approaches such as targeted protein degradation (He et al., 2025; Szczepski and Jaremko, 2025; Zhong et al., 2022; Mathur et al., 2025; Xue et al., 2025).

AF3 can predict interactions among proteins, ligands, antibodies, and other biomolecules for drug design. In benchmark datasets, AF3 reached 76.4% accuracy in protein-ligand docking—a 1.8-fold improvement over prior approaches. It also showed 50% greater accuracy than physics-based methods. However, key limitations remain. Evaluations show AF3 struggles to predict ligand-binding poses for ions and peptides in targets such as GPCRs, and it displays minimal correlation between predicted and experimental binding affinities. AF3 also performs poorly on test sets post-training, suggesting memorization rather than genuine modeling. For targeted protein degradation, such as in PROTACs, AF3 can predict ternary complex structures if given explicit ligand details, with recent studies showing 33 complexes predicted to sub-Ångström accuracy. Yet it still struggles to accurately model small protein–protein interfaces typical of such interactions. These findings show AF3’s strength in developing structural hypotheses and visualizing validated binding pairs, while also emphasizing key areas needing improvement.

5.2. Resolving large, challenging macromolecular assemblies

AF2-Multimer and AF3 have advanced the study of large, complex assemblies (Ibrahim et al., 2023; Yang et al., 2023; Abramson et al., 2024), which have long challenged experimental techniques due to their size, flexibility, or diversity.

Building on these advances, the use of AlphaFold has brought a significant change to the structural study of the human nuclear pore complex (NPC). This substantial structure is crucial for the nucleus in eukaryotic cells and comprises approximately 1,000 proteins of around 30 distinct types. With AlphaFold structures, researchers have explained approximately 90% of the human NPC structure, providing insights into cellular machinery (Lin and Hoelz, 2019; Fontana et al., 2022; Mosalaganti et al., 2022; Fleming et al., 2025). Furthermore, AlphaFold has facilitated the determination of the Mce1 structure, a pivotal protein used by the tuberculosis bacterium to acquire nutrients from host cells, increasing the understanding of disease mechanisms (Chen et al., 2023; Kovalevskiy et al., 2024; Rennie and Oliver, 2025). Its rapid modeling capability accelerates research in critical health domains and is now referred to as “science at digital speed” (Elfmann and Stulke, 2025; Rennie and Oliver, 2025). Additionally, AlphaFold has enabled researchers to advance investigations beyond static structures by modeling dynamic protein conformations (Cui et al., 2025). Ultimately, AlphaFold represents a transformative leap in structural biology, driving new discoveries and redefining our approach to understanding complex biological assemblies.

5.3. Protein engineering and synthetic biology

In the domain of protein engineering, AlphaFold provides high-precision and rapid structural validation. Therefore, it significantly reduced the design-build-test cycle duration (Kovalevskiy et al., 2024; Matzko and Konur, 2024; Uzoeto et al., 2024). By generating testable hypotheses about how sequence modifications translate into altered three-dimensional conformations, researchers can rationally design novel protein functions. Applications encompass the development of more effective pharmaceuticals and the creation of novel enzymes capable of degrading plastic pollution. AF3 has specific capabilities, such as predicting physiologically relevant metal-binding sites and modelling complex β-solenoid structures. Furthermore, it enhances the capacity to engineer novel proteins with precise functional requirements (Marcu et al., 2022; Krokidis et al., 2025).

Collectively, these advances signal a foundational shift: instead of relying solely on empirical trial-and-error, researchers now leverage predictive, computationally guided strategies for protein and enzyme design. This integrated approach—with generative AI, structural insights, and experimental validation—unlocks the creation of programmable biocatalysts and broadens possibilities in medicine, energy, and environmental applications, marking a new era in synthetic biology.

5.4. AlphaFold-predicted structures and their other downstream applications

Presently, AlphaFold (AF) structures are used in many downstream assessments and applications, including protein function, druggability, stability, protein interfaces, and evolutionary relationships. Researchers use AF structures in different orientations to map protein function. This includes identifying active sites, binding pockets, and functional domains (Castillo and Ollila, 2025). In drug discovery, AF models help assess druggability and enable virtual screening for candidates (Alhumaid and Tawfik, 2024). Predicted structures are also useful for evaluating protein stability and detecting destabilizing mutations (Pak et al., 2023). AF structures provide insight into protein–protein interaction networks and their interfaces (Lee et al., 2024). They also help researchers understand signaling pathways and complex formations (Li et al., 2025a). In addition, AF predictions reveal evolutionary relationships among proteins (Jumper et al., 2021b). Comparative analyses show conserved structural motifs and functional adaptations in homologous proteins (Ruperti et al., 2023).

5.5. Several AlphaFold variants and alternatives in protein structure prediction

Several AlphaFold variants and alternative methods have emerged for protein structure prediction, each serving distinct use cases. ColabFold is a free platform for rapid structure prediction that employs AlphaFold2 and MMseqs2 to accelerate multiple sequence alignment and structure modeling, accessible via Google Colaboratory or the command line. It offers predictions about 5 times faster than standard AlphaFold2 (Kim, Gyuri et al., 2025; Mirdita et al., 2022). In contrast, the AlphaFold Server provides free, non-commercial access to AlphaFold 3, with a limit of 30 jobs per day, to promote accessibility. AlphaFold-Multimer (AFM) extends AlphaFold2 by enabling the prediction of protein complexes and multi-chain assemblies (Zhu et al., 2023; Homma et al., 2023). OmegaFold differs by predicting high-resolution protein structures from a single sequence through protein language models, without requiring evolutionary information (Jing et al., 2024; Hýskova et al., 2025; Wu et al., 2022). RoseTTAFold uses a neural network to achieve high-accuracy structure prediction comparable to AlphaFold2 and provides an alternative method for researchers (Krishna et al., 2024; Lisanza et al., 2025; Liang et al., 2022; Barbarin-Bocahu and Graille, 2022).

6. Limitations and next-generation challenges

AlphaFold has addressed the problem of protein folding. However, it continues to pose obstacles in the field of structural biology research. These challenges arise from the PDB training data, which primarily consists of standard structures. Consequently, while AlphaFold excels at analyzing known structures, it struggles with novel or dynamic structures.

6.1. The challenge of protein dynamics and multiple conformations

A significant limitation of AF2, and to a lesser extent AF3, is their tendency to predict a singular, thermodynamically stable ground state (Laurents, 2022; Chakravarty et al., 2024). This makes it difficult to assess protein structures with different conformational states. Many proteins exist in multiple conformational states (Cui et al., 2025). Proteins often change conformation during ligand binding, catalysis, or interactions with biomolecules (Plattner and Noé, 2015). These changes are critical for biological function (Liu et al., 2008). However, AF2 or AF3 predictions cannot capture these dynamic states (Perkins-Jechow et al., 2025). This fixed state is challenging to understand. Various biological processes, such as enzyme catalysis, signal transduction, and membrane transport, rely on fold-switching or substantial conformational changes. These processes involve transitions among distinct stable, metastable, and transient states (Bryan and Orban, 2010; Chakravarty et al., 2023). For example, AlphaFold accurately predicts only the closed state of Adenylate kinase. It thereby neglects the open state, which is functionally critical and has been experimentally observed (Guan et al., 2024; Lee et al., 2025b). Moreover, AlphaFold faces several challenges when predicting fine-grained functional effects, such as those caused by point mutations. It cannot model transient or excited states. Researchers can assess model quality, but high-confidence metrics (pLDDT > 90) suggest only a high probability of accurate local coordinate positioning (pLDDT, 2024). This does not guarantee concordance with the native protein conformation if that conformation is dynamic, involves multiple states, or belongs to a non-globular class. Confidence metrics serve as estimators of coordinate likelihood. They are not predictors of native functional relevance for flexible systems (Sala et al., 2023).

6.2. Noncanonical chemistry and unseen residues

AlphaFold, when it predicts protein structures based on evolutionary principles, encounters significant challenges. This is especially true for proteins that incorporate Noncanonical Amino Acids (NCAs) or exhibit complex Post-Translational Modifications (PTMs) (Yang et al., 2023; Agarwal and McShan, 2024). It often fails to distinguish NCAAs from the input data and MSAs. Here, generic “X” tokens were represented in sequence databases. In biological sequence databases, the generic “X” token typically represents an unknown or undetermined residue (Kemena and Notredame, 2009; Castro et al., 2023; Agarwal and McShan, 2024; Dotan et al., 2024; Testagrose and Boucher, 2025). This representation obscures their identity and hinders the model’s ability to extract relevant co-evolutionary signals.

Challenges with evolutionary-based models arise when achieving optimal performance for complex and novel chemical structures.

Specifically, for peptides such as sactipeptides, which are characterized by unique sulfur-to-alpha-carbon linkages, these models yield low predictive scores ranging from 0.0 to 19.2%. This suggests their inability to accurately predict structures in some cases (Fluhe and Marahiel, 2013; Chen et al., 2021; Agarwal and McShan, 2024). Consequently, it highlights the need for models that incorporate physical principles to address the complexities of chemical systems, surpassing the capabilities of evolution-based models (Wee and Wei, 2024a).

6.3. Accuracy in complex assemblies and stereochemistry

AF3 can develop complex molecular structures, but the resulting structures sometimes contain errors. For example, compared to original Protein Data Bank (PDB) complexes, AF3 predictions show an 8.6% error in binding free energy changes (ΔΔG) (Wee and Wei, 2024b). AF3 can also cause overlapping atoms, especially in proteins with multiple identical chains, leading to inaccuracies. Additionally, the server struggles to maintain the correct three-dimensional (3D) conformation of small molecules in drug research (Schreiner et al., 2011; Steinkellner et al., 2025). According to the PoseBusters test, chirality errors occur at a rate of 4.4%. Thus, even with precise chemical input, AF3’s generalized approach may not match the accuracy of specialized methods.

For complex assemblies, AF3 shows moderate performance, with success rates ranging from 40 to 60% across different types of complexity (AlphaFold, 2025). Accuracy decreases for larger heteromeric assemblies. The model achieved approximately 76% accuracy in predicting protein complexes that involve intrinsically disordered regions (Gopalan and Narayanan, 2025). However, performance drops significantly when evaluated using interface-specific metrics, such as DockQ (which measures the quality of predicted protein–protein interfaces), compared to global structural metrics (which assess the overall similarity between predicted and experimental structures). For stereochemistry, AF3 exhibits a 4.4% chirality violation rate on the PoseBusters benchmark (Gopalan and Narayanan, 2025; Childs et al., 2025; Abramson et al., 2024). This occurs despite the incorporation of penalty mechanisms for chiral errors. The model also produces significant errors in ligand bond lengths and bond angles (Ishitani and Moriwaki, 2025). Systematic issues include incorrect chirality assignments even when provided with correct reference structures. These stereochemical errors particularly affect small-molecule predictions, which are critical for drug design applications (Ishitani and Moriwaki, 2025).

6.4. Intrinsically disordered proteins

AlphaFold also does not handle several issues appropriately, including the structures of intrinsically disordered proteins (IDPs). In higher organisms, IDPs account for approximately one-third of proteins (Pancsa and Tompa, 2012). Because IDPs lack stable three-dimensional structures and exist as dynamic conformational changes, they cannot be modeled by AF (Ruff and Pappu, 2021). This incompatibility stems from AF’s focus on stable structures. Nevertheless, AF’s inability to reliably predict a single structure may itself indicate intrinsic disorder, effectively making AF an excellent predictor of structural disorder (Akdel et al., 2022). As a result, this characteristic might prove significant for AF.

6.5. Data leakage

AlphaFold performs very well at structure prediction. Because of this, many researchers use AlphaFold for applications beyond simple structure prediction. However, they often neglect a significant statistical error. This error occurs when the same structures are included in both training and analysis (Dobson et al., 2025).

Data leakage occurs when there is overlap between AlphaFold’s training and test datasets. More specifically, in AF, data leakage typically occurs when the training data for PDB overlaps with the validation or test sets. As a result, it causes inflated performance metrics due to high sequence similarity (Verburgt et al., 2025; Bernett et al., 2024). When test proteins are structurally similar to those in the training set, this overlap can further exaggerate performance. Additionally, AF3’s test set uses time-based splits without ligand similarity filtering, which makes it difficult to assess performance on truly novel ligands and binding pockets. Consequently, this raises concerns about whether AlphaFold relies on memorization of training data rather than understanding protein-ligand chemistry (Masters et al., 2025; Outeiral et al., 2022; Škrinjar et al., 2025).

6.6. An overview of the technical limitations of AlphaFold (AF1/AF2/AF3) affects biological interpretation and translational workflows, addressing challenges related to intrinsically disordered regions, post-translational modifications, and transient interactions

There are limitations to AlphaFold (AF1/AF2/AF3) that significantly affect biological interpretation and translational workflows, so users must carefully consider their appropriate use. Notably, all AlphaFold versions encounter difficulties with intrinsically disordered regions (IDRs), which comprise 30–40% of the human proteome (Gopalan and Narayanan, 2025) and play critical roles in cellular signaling and disease. For example, pLDDT confidence scores of AF2 correlate with intrinsic disorder (Alderson et al., 2023), enabling the server to identify IDRs; however, these models cannot evaluate the dynamic nature and conformational heterogeneity of these regions, and, in some cases, AF3 incorrectly predicts ordered structures for 22% of disordered residues (Gopalan and Narayanan, 2025). Furthermore, post-translational modifications (PTMs) represent another critical limitation relevant to IDRs.

Challenges extend to PTMs as well: AF2 was not designed to model PTMs and cannot predict their structural impact. Recent evidence suggests PTMs may be the key factor influencing IDR folding (Bah and Forman-Kay, 2016). AlphaFold also struggles with protein–protein interactions; its training may favor high-affinity, homogeneous binding, potentially hindering predictions of cellular assembly. Furthermore, user education on confidence metrics is essential. For example, a high pLDDT score does not always reflect structural accuracy. Membrane proteins may receive very high confidence scores, yet such predictions can deviate from experimental structures. Systematic data bias further complicates structure determination and downstream applications. PDB training data, for instance, skews toward soluble proteins—less than 3% are membrane proteins (Ahram et al., 2006)—and toward well-studied families and static conformations. AlphaFold predictions tend to regularize regions of conformation, yet conformational diversity may differ greatly from experimentally determined structures. Thus, AlphaFold predictions should supplement, not supplant, experimental validation. This is especially important for drug discovery, protein engineering, and studies of disease mechanisms, where structural dynamics, PTMs, and transient interactions are critical.

7. The future trajectory: integrating AI with modelling

AlphaFold exhibits certain limitations, reflecting the current state of structural biology. Presently, scientists employ a combination of methodologies that integrate rapid and precise deep learning techniques with realistic physical models to investigate biological processes.

7.1. Predictions of single-chain structures

It has been claimed that single-chain structure prediction is mostly solved by AF3. However, this needs qualification. While AF3 shows better local structural accuracy than AlphaFold2, its global accuracy improvements for protein monomers are limited. For example, AF3 outperformed AF2 in only 57.5% of cases on orphan protein benchmarks (Peng et al., 2025). The most challenging cases are orphan proteins without detectable multiple sequence alignments. In these instances, AF2 actually performs better than AF3 in both global and local predictions. Thus, AF3’s accuracy has not increased significantly for protein monomer structure prediction compared to AF2 (Peng et al., 2025), especially for orphan proteins lacking sequence homology. As a result, predictions for these proteins are often unsatisfactory (Singh, 2023). In summary, the persistent shortcomings of current models, particularly regarding orphan proteins, underscore that single-chain structure prediction is far from solved and remains a critical challenge that demands further advances.

7.2. Methods for conformational sampling and ensemble generation

Recent research shows that protein sequences alone carry the information for conformational changes. This can be seen using MSA (multiple sequence alignment), a method for comparing protein sequences. Researchers have developed new techniques—MSA masking (hiding parts of the alignment to test their importance), subsampling (analyzing random subsets), and clustering (grouping similar sequences)—to generate diverse sub-MSAs with minor changes (Genc and McGuffin, 2025; Kalakoti and Wallner, 2025). These help AlphaFold generate a range of possible protein shapes, not just a single static structure. Also, new models use diffusion (iterative denoising) and flow-matching (aligning distributions of structures) techniques. These methods are transforming protein structure prediction (Sala et al., 2023). As a result, models can now predict multiple molecular shapes, helping us understand more than a single state (Zheng et al., 2023; Genc and McGuffin, 2025; Kalakoti and Wallner, 2025; Li et al., 2025b).

7.3. Post-prediction refinement using molecular dynamics (MD) simulations

AI and physics-based computational methods are crucial for studying molecular energy states. AlphaFold efficiently identifies the lowest-energy state of molecular structures (Perrakis and Sixma, 2021; Zhang et al., 2024). In contrast, Molecular Dynamics (MD) simulations provide essential insights into the energy landscape and molecular conformations. Studies show that combining AlphaFold with MD simulations improves accuracy, demonstrating that this synergy outperforms AI alone (Heo and Feig, 2020; Elia Venanzi et al., 2024). This approach is particularly useful for modeling conformational changes affecting molecular function. By starting MD simulations with AlphaFold predictions, researchers can explore alternative conformations; specific transporter structures have been examined using this strategy (Heo and Feig, 2020; Elia Venanzi et al., 2024; Guo et al., 2024). These methods also help model transient states, intermediates between two functional conformations. Thus, integrating AI predictions with MD simulations greatly enhances our understanding of protein conformational changes.

7.4. Advanced benchmarking for multi-molecular systems

There is a need for more refined methods with thoroughly evaluated performance. In complex prediction, innovative models—such as combining AF3 with robust predictors like Protenix—have achieved 70–80% success rates under strict conditions (Fnat ≥ 0.8), much higher than AlphaFold2-multimer’s 53% (Varga et al., 2025; Zhou et al., 2025). Researchers are also working to improve model quality assessment beyond traditional metrics like iPTM and RMSD. Evaluating concordance among models has become an effective alternative (Varga et al., 2025). Defining these measures is crucial, especially for flexible or complex targets, since metrics like iPTM may not capture structural misalignments.

Building on recent advances and ongoing challenges, we outline current limitations and mitigation or hybrid strategies to enhance dynamic modelling and multi-molecular systems, as shown in Table 5. These insights highlight pivotal directions for overcoming limitations and advancing the field.

Table 5.

Current limitations and strategies for dynamic modelling and multi-molecular systems.

Challenge area Specific limitation/artifact AlphaFold context Mitigation/hybrid strategy
Protein dynamics Multiple conformations, transient states, fold-switching AF2 and AF3 are biased toward a static ground state MD simulations, MSA subsampling/enhanced sampling, generative models
Non-canonical chemistry NCAAs, complex PTMs (e.g., sactipeptides) Evolution-based models fail where MSA signals are masked or absent Physics-informed models, explicit chemical tokenization (e.g., RareFold)
Small molecule accuracy Stereochemistry/Chirality Violations AF3 exhibits 4.4% chirality violation rate Required physics-based refinement or customized chemical modules
Model quality assessment High confidence (pLDDT) deviation from non-globular native structure Confidence metrics do not guarantee functional relevance for highly flexible regions Updated error categorization; focus on PAE for relative domain orientation

8. Conclusion

AlphaFold has fundamentally revolutionized structural biology, transforming how scientists predict and decipher protein structures with unprecedented accuracy. In recognition of this extraordinary breakthrough, the Nobel Prize was awarded to Demis Hassabis and John Jumper (Callaway, 2024; Graham, 2024). AlphaFold has democratized access to structural data, expanding the AlphaFold Database (AFDB) to an astonishing 214 million+ entries (Varadi et al., 2024). Moreover, structural coverage of the human proteome increased from 48 to 76% (Porta-Pardo et al., 2022; Varadi and Velankar, 2023).

Building on AlphaFold’s transformative impact, advancements in this domain have shifted the field’s focus. The prediction of single-chain structures is largely resolved. The current significant challenge lies in modelling biological functions under real-life conditions.

To address these ongoing challenges, it is crucial to apply AI models by investigating various properties of proteins, such as dynamics, conformational changes, and molecular interactions. Accordingly, the focus of future research is on integrating DL, which excels at rapid identification of protein conformations, with physics-based methodologies such as Molecular Dynamics. These approaches can explain various aspects of protein behavior using distinct parameters. The strategy is expected to yield significant advances in protein structure and is of great importance to both the field of biotechnology and medicine.

Funding Statement

The author(s) declared that financial support was received for this work and/or its publication. This study was supported by the National Research Foundation of Korea (NRF) grant funded by the MSIT (RS-2025-00513909).

Footnotes

Edited by: Marco Masseroli, Polytechnic University of Milan, Italy

Reviewed by: Michael Rock Goldsmith, Congruence Therapeutics, United States

Peter Tompa, Vrije University Brussels, Belgium

Author contributions

CC: Project administration, Writing – original draft, Data curation, Conceptualization, Investigation, Supervision, Writing – review & editing. MB: Writing – review & editing, Formal analysis, Methodology, Resources, Investigation. S-SL: Validation, Writing – review & editing, Funding acquisition, Project administration.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The author(s) CC and MB declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Generative AI statement

The author(s) declared that Generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  1. Abergel C. (2013). Molecular replacement: tricks and treats. Biol. Crystallograp. 69, 2167–2173. doi: 10.1107/S0907444913015291 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Abramson J., Adler J., Dunger J., Evans R., Green T., Pritzel A., et al. (2024). Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493–500. doi: 10.1038/s41586-024-07487-w, [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Abriata L. A. (2024). The Nobel prize in chemistry: past, present, and future of AI in biology. Commun Biol 7:1409. doi: 10.1038/s42003-024-07113-5, [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Agarwal V., McShan A. C. (2024). The power and pitfalls of AlphaFold2 for structure prediction beyond rigid globular proteins. Nat. Chem. Biol. 20, 950–959. doi: 10.1038/s41589-024-01638-w, [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Ahram M., Litou Z. I., Fang R., Al-Tawallbeh G. (2006). Estimation of membrane proteins in the human proteome. In Silico Biol. 6, 379–386. doi: 10.3233/isb-00251, [DOI] [PubMed] [Google Scholar]
  6. Akdel M., Pires D. E. V., Pardo E. P., Jänes J., Zalevsky A. O., Mészáros B., et al. (2022). A structural biology community assessment of AlphaFold2 applications. Nat. Struct. Mol. Biol. 29, 1056–1067. doi: 10.1038/s41594-022-00849-w, [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Alderson T. R., Pritišanac I., Kolarić D., Moses A. M., Forman-Kay J. D. (2023). Systematic identification of conditionally folded intrinsically disordered regions by AlphaFold2. Proc. Natl. Acad. Sci. USA 120:e2304302120. doi: 10.1073/pnas.2304302120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Alhumaid N. K., Tawfik E. A. (2024). Reliability of AlphaFold2 models in virtual drug screening: a focus on selected class a GPCRs. Int. J. Mol. Sci.. l 25:10139: 21. doi: 10.3390/ijms251810139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. AlphaFold (2025) “How have AlphaFold 3’s predictions been validated? AlphaFold”. Available online at: www.ebi.ac.uk/training/online/courses/alphafold/alphafold-3-and-alphafold-server/introducing-alphafold-3/how-have-alphafold-3s-predictions-been-validated/
  10. AlQuraishi M. (2019). AlphaFold at CASP13. Bioinformatics 35, 4862–4865. doi: 10.1093/bioinformatics/btz422, [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Álvarez-Salmoral D., Borza R., Maiella C., Kwee B. P., Xie R., Joosten R. P., et al. (2024). AlphaBridge: tools for the analysis of predicted macromolecular complexes. bioRxiv, 2024–10. doi: 10.1101/2024.10.23.619601 [DOI] [Google Scholar]
  12. Anishchenko I., Kundrotas P. J., Tuzikov A. V., Vakser I. A. (2014). Protein models: the grand challenge of protein docking. Proteins 82, 278–287. doi: 10.1002/prot.24385 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bah A., Forman-Kay J. D. (2016). Modulation of intrinsically disordered protein function by post-translational modifications. J. Biol. Chem. 291, 6696–6705. doi: 10.1074/jbc.R115.695056, [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Barbarin-Bocahu I., Graille M. (2022). The X-ray crystallography phase problem solved thanks to AlphaFold and RoseTTAFold models: a case-study report. Acta Crystallogr D Struct Biol. 78, 517–531. doi: 10.1107/S2059798322002157 [DOI] [PubMed] [Google Scholar]
  15. Barrio-Hernandez I., Yeo J., Jänes J., Mirdita M., Gilchrist C. L. M., Wein T., et al. (2023). Clustering predicted structures at the scale of the known protein universe. Nature 622, 637–645. doi: 10.1038/s41586-023-06510-w, [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Berman H. M., Burley S. K. (2025). Protein data Bank (PDB): fifty-three years young and having a transformative impact on science and society. Q. Rev. Biophys. 58:e9. doi: 10.1017/S0033583525000034, [DOI] [PubMed] [Google Scholar]
  17. Bernett J., Blumenthal D. B., List M. (2024). Cracking the black box of deep sequence-based protein-protein interaction prediction. Brief. Bioinform. 25:bbae076. doi: 10.1093/bib/bbae076, [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Bieri O., Kiefhaber T. (1999). Elementary steps in protein folding. Biol. Chem. 380, 923–929. doi: 10.1515/bc.1999.114, [DOI] [PubMed] [Google Scholar]
  19. Bryan P. N., Orban J. (2010). Proteins that switch folds. Curr. Opin. Struct. Biol. 20, 482–488. doi: 10.1016/j.sbi.2010.06.002, [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Burley S. K., Bhikadiya C., Bi C., Bittrich S., Chen L., Crichlow G. V., et al. (2022). RCSB protein data Bank: celebrating 50 years of the PDB with new tools for understanding and visualizing biological macromolecules in 3D. Protein Sci. 31, 187–208. doi: 10.1002/pro.4213, [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Buttenschoen M., Morris G. M., Deane C. M. (2024). Posebusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences. Chem. Sci. 15, 3130–3139. doi: 10.1039/d3sc04185a, [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Callaway E. (2022). What's next for AlphaFold and the AI protein-folding revolution. Nature 604, 234–238. doi: 10.1038/d41586-022-00997-5, [DOI] [PubMed] [Google Scholar]
  23. Callaway E. (2024). Chemistry Nobel goes to developers of AlphaFold AI that predicts protein structures. Nature 634, 525–526. doi: 10.1038/d41586-024-03214-7, [DOI] [PubMed] [Google Scholar]
  24. Castillo S., Ollila O. H. S. (2025). Actseek: fast and accurate search algorithm of active sites in alphafold database. Bioinformatics 41:btaf424. doi: 10.1093/bioinformatics/btaf424, [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Castro T. G., Melle-Franco M., Sousa C. E. A., Cavaco-Paulo A., Marcos J. C. (2023). Non-canonical amino acids as building blocks for peptidomimetics: structure, function, and applications. Biomolecules 13:981. doi: 10.3390/biom13060981, [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Chakravarty D., Schafer J. W., Chen E. A., Thole J. F., Ronish L. A., Lee M., et al. (2024). AlphaFold predictions of fold-switched conformations are driven by structure memorization. Nat. Commun. 15:7296. doi: 10.1038/s41467-024-51801-z, [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Chakravarty D., Schafer J. W., Porter L. L. (2023). Distinguishing features of fold-switching proteins. Protein Sci. 32:e4596. doi: 10.1002/pro.4596, [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Chang L., Mondal A., Singh B., Martínez-Noa Y., Perez A. (2024). Revolutionizing peptide-based drug discovery: advances in the post-AlphaFold era. Wiley Interdiscip Rev Comput Mol Sci 14:e1693. doi: 10.1002/wcms.1693, [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Chen J., Fruhauf A., Fan C., Ponce J., Ueberheide B., Bhabha G., et al. (2023). Structure of an endogenous mycobacterial MCE lipid transporter. Nature 620, 445–452. doi: 10.1038/s41586-023-06366-0, [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Chen Y., Wang J., Li G., Yang Y., Ding W. (2021). Current advancements in Sactipeptide natural products. Front. Chem. 9:595991. doi: 10.3389/fchem.2021.595991, [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Childs H., Zhou P., Donald B. R. (2025). Has AlphaFold 3 solved the protein folding problem for D-peptides? bioRxiv 2025 3:643307. doi: 10.1101/2025.03.14.643307 [DOI] [Google Scholar]
  32. Cui X., Ge L., Chen X., Lv Z., Wang S., Zhou X., et al. (2025). Beyond static structures: protein dynamic conformations modeling in the post-AlphaFold era. Brief. Bioinform. 26:bbaf340. doi: 10.1093/bib/bbaf340, [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Desai D., Kantliwala S. V., Vybhavi J., Ravi R., Patel H., Patel J. (2024). Review of AlphaFold 3: transformative advances in drug design and therapeutics. Cureus 16:e63646. doi: 10.7759/cureus.63646, [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Dessimoz C., Thomas P. D. (2024). AI and the democratization of knowledge. Sci. Data 11:268. doi: 10.1038/s41597-024-03099-1, [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Dobson L., Tusnády G. E., Tompa P. (2025). Regularly updated benchmark sets for statistically correct evaluations of AlphaFold applications. Brief. Bioinform. 26:bbaf104. doi: 10.1093/bib/bbaf104, [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Dotan E., Jaschek G., Pupko T., Belinkov Y. (2024). Effect of tokenization on transformers for biological sequences. Bioinformatics 40:btae196. doi: 10.1093/bioinformatics/btae196, [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Editorial (2022). How AlphaFold can realize AI's full potential in structural biology. Nature 608:8. doi: 10.1038/d41586-022-02088-x [DOI] [PubMed] [Google Scholar]
  38. Elfmann C., Dumann V., van den Berg T., Stülke J. (2025). A new framework for SubtiWiki, the database for the model organism Bacillus subtilis. Nucleic Acids Res. 53, D864–D870. doi: 10.1093/nar/gkae957, [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Elfmann C., Stulke J. (2023). PAE viewer: a webserver for the interactive visualization of the predicted aligned error for multimer structure predictions and crosslinks. Nucleic Acids Res. 51, W404–W410. doi: 10.1093/nar/gkad350, [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Elfmann C., Stulke J. (2025). Cutting-edge tools for structural biology: bringing AlphaFold to the people. Trends Microbiol. 33, 819–822. doi: 10.1016/j.tim.2025.03.014, [DOI] [PubMed] [Google Scholar]
  41. Elia Venanzi N. A., Basciu A., Venanzi N. A. E., Vargiu A. V., Kiparissides A., Dalby P. A., et al. (2024). Machine learning integrating protein structure, sequence, and dynamics to predict the enzyme activity of bovine enterokinase variants. J. Chem. Inf. Model. 64, 2681–2694. doi: 10.1021/acs.jcim.3c00999, [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Fang Z., Ran H., Zhang Y., Chen C., Lin P., Zhang X., et al. (2025). AlphaFold 3: an unprecedent opportunity for fundamental research and drug development. Precis. Clin. Med. 8:pbaf015. doi: 10.1093/pcmedi/pbaf015, [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Fleming J., Magana P., Nair S., Tsenkov M., Bertoni D., Pidruchna I., et al. (2025). AlphaFold protein structure database and 3D-beacons: new data and capabilities. J. Mol. Biol. 437:168967. doi: 10.1016/j.jmb.2025.168967, [DOI] [PubMed] [Google Scholar]
  44. Fluhe L., Marahiel M. A. (2013). Radical S-adenosylmethionine enzyme catalyzed thioether bond formation in sactipeptide biosynthesis. Curr. Opin. Chem. Biol. 17, 605–612. doi: 10.1016/j.cbpa.2013.06.031 [DOI] [PubMed] [Google Scholar]
  45. Fontana P., Dong Y., Pi X., Tong A. B., Hecksel C. W., Wang L., et al. (2022). Structure of cytoplasmic ring of nuclear pore complex by integrative cryo-EM and AlphaFold. Science 376:eabm9326. doi: 10.1126/science.abm9326, [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Genc A. G., McGuffin L. J. (2025). “Beyond AlphaFold2: the impact of AI for the further improvement of protein structure prediction,” in Methods Mol Biol. 2867:121–139. [DOI] [PubMed] [Google Scholar]
  47. Gopalan S., Narayanan S. (2025). Hallucinations in AlphaFold3 for intrinsically disordered proteins with disorder in biological process residues. arXiv preprint arXiv:2510.15939. doi: 10.48550/arXiv.2510.15939 [DOI] [Google Scholar]
  48. Graham F. (2024). Daily briefing: AlphaFold developers share Nobel prize in chemistry. Nature. [Online ahead of print]. doi: 10.1038/d41586-024-03325-1 [DOI] [PubMed] [Google Scholar]
  49. Guan X., Tang Q. Y., Ren W., Chen M., Wang W., Wolynes P. G., et al. (2024). Predicting protein conformational motions using energetic frustration analysis and AlphaFold2. Proc. Natl. Acad. Sci. USA 121:e2410662121. doi: 10.1073/pnas.2410662121, [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Guo S. B., Meng Y., Lin L., Zhou Z. Z., Li H. L., Tian X. P., et al. (2024). Artificial intelligence alphafold model for molecular biology and drug discovery: a machine-learning-driven informatics investigation. Mol. Cancer 23:223. doi: 10.1186/s12943-024-02140-6, [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Hameduh T., Haddad Y., Adam V., Heger Z. (2020). Homology modeling in the time of collective and artificial intelligence. Comput. Struct. Biotechnol. J. 18, 3494–3506. doi: 10.1016/j.csbj.2020.11.007, [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. He X. H., Li J. R., Shen S. Y., Xu H. E. (2025). AlphaFold3 versus experimental structures: assessment of the accuracy in ligand-bound G protein-coupled receptors. Acta Pharmacol. Sin. 46, 1111–1122. doi: 10.1038/s41401-024-01429-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Hennig J. (2025). Structural biology of RNA and protein-RNA complexes after AlphaFold3. Chembiochem 26:e202401047. doi: 10.1002/cbic.202401047, [DOI] [PubMed] [Google Scholar]
  54. Heo L., Feig M. (2020). High-accuracy protein structures by combining machine-learning with physics-based refinement. Proteins 88, 637–642. doi: 10.1002/prot.25847, [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Homma F., Huang J., van der Hoorn R. A. L. (2023). AlphaFold-multimer predicts cross-kingdom interactions at the plant-pathogen interface. Nat. Commun. 14:6040. doi: 10.1038/s41467-023-41721-9, [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Hou J., Wu T., Cao R., Cheng J. (2019). Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13. Proteins 87, 1165–1178. doi: 10.1002/prot.25697, [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Hýskova A., Maršálková E., Šimeček P. (2025). Balancing speed and precision in protein folding: a comparison of AlphaFold2, ESMFold, and OmegaFold. Front. Genet. 16:1715037. doi: 10.3389/fgene.2025.1715037, [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Ibrahim T., Khandare V., Mirkin F. G., Tumtas Y., Bubeck D., Bozkurt T. O. (2023). AlphaFold2-multimer guided high-accuracy prediction of typical and atypical ATG8-binding motifs. PLoS Biol. 21:e3001962. doi: 10.1371/journal.pbio.3001962, [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Ishitani R., Moriwaki Y. (2025). Improving stereochemical limitations in protein-ligand complex structure prediction. ACS Omega 10, 56075–56084. doi: 10.1021/acsomega.5c07675, [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Jaskolski M., Dauter Z., Wlodawer A. (2014). A brief history of macromolecular crystallography, illustrated by a family tree and its Nobel fruits. FEBS J. 281, 3985–4009. doi: 10.1111/febs.12796, [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Jing X., Wu F., Luo X., Xu J. (2024). Single-sequence protein structure prediction by integrating protein language models. Proc. Natl. Acad. Sci. USA 121:e2308788121. doi: 10.1073/pnas.2308788121, [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., et al. (2021a). Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589. doi: 10.1038/s41586-021-03819-2, [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Jumper J., Evans R., Pritzel A., Green T., Figurnov M., Ronneberger O., et al. (2021b). Applying and improving AlphaFold at CASP14. Proteins 89, 1711–1721. doi: 10.1002/prot.26257, [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Kalakoti Y., Wallner B. (2025). AFsample2 predicts multiple conformations and ensembles with AlphaFold2. Commun Biol 8:373. doi: 10.1038/s42003-025-07791-9, [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Kemena C., Notredame C. (2009). Upcoming challenges for multiple sequence alignment methods in the high-throughput era. Bioinformatics 25, 2455–2465. doi: 10.1093/bioinformatics/btp452, [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Kim G., Lee S., Levy Karin E., Kim H., Moriwaki Y., Ovchinnikov S., et al. (2025). Easy and accurate protein structure prediction using ColabFold. Nat. Protoc. 20 2025, 620–642. doi: 10.1038/s41596-024-01060-5 [DOI] [PubMed] [Google Scholar]
  67. Kovalevskiy O., Mateos-Garcia J., Tunyasuvunakool K. (2024). AlphaFold two years on: validation and impact. Proc. Natl. Acad. Sci. USA 121:e2315002121. doi: 10.1073/pnas.2315002121, [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Krishna R., Wang J., Ahern W., Sturmfels P., Venkatesh P., Kalvet I., et al. (2024). Generalized biomolecular modeling and design with RoseTTAFold all-atom. Science (N. Y.). 384:eadl2528. doi: 10.1126/science.adl2528, [DOI] [PubMed] [Google Scholar]
  69. Krokidis M. G., Koumadorakis D. E., Lazaros K., Ivantsik O., Exarchos T. P., Vrahatis A. G., et al. (2025). AlphaFold3: an overview of applications and performance insights. Int. J. Mol. Sci. 26:3671. doi: 10.3390/ijms26083671, [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Kryshtafovych A., Schwede T., Topf M., Fidelis K., Moult J. (2019). Critical assessment of methods of protein structure prediction (CASP)-round XIII. Proteins 87, 1011–1020. doi: 10.1002/prot.25823 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Kurisu G. (2022). Fifty years of protein data Bank in the Journal of biochemistry. J. Biochem. 171, 3–11. doi: 10.1093/jb/mvab133, [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Lang B., Mészáros B., Sejdiu B. I., Patel J., Babu M. M. (2025). AlphaSync is an enhanced AlphaFold structure database synchronized with UniProt. Nat Struct Mol Biol. 32, 2628–2632. doi: 10.1038/s41594-025-01719-x, [DOI] [PubMed] [Google Scholar]
  73. Laurents D. V. (2022). AlphaFold 2 and NMR spectroscopy: partners to understand protein structure, dynamics and function. Front. Mol. Biosci. 9:906437. doi: 10.3389/fmolb.2022.906437, [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Lee J., Hao Nguyen C., Mamitsuka H. (2025a). Beyond rigid docking: deep learning approaches for fully flexible protein-ligand interactions. Brief. Bioinform. 26:bbaf454. doi: 10.1093/bib/bbaf454, [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Lee C. Y., Hubrich D., Varga J. K., Schäfer C., Welzel M., Schumbera E., et al. (2024). Systematic discovery of protein interaction interfaces using AlphaFold and experimental validation. Mol. Syst. Biol. 20, 75–97. doi: 10.1038/s44320-023-00005-6, [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Lee M., Schafer J. W., Prabakaran J., Chakravarty D., Clore M. F., Porter L. L. (2025b). Large-scale predictions of alternative protein conformations by AlphaFold2-based sequence association. Nat. Commun. 16:5622. doi: 10.1038/s41467-025-60759-5, [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Lefebvre H., Legner C., et al. (2021). “Data democratization: toward a deeper understanding,” in ICIS (Forty-Second International Conference on Information Systems), (Austin, Texas: Conference: International Conference on Information Systems (ICIS 2021)), 1–17. Available online at: https://aisel.aisnet.org/icis2021/gen_topics/gen_topics/7 [Google Scholar]
  78. Li W.-r., Cadet X. F., Medina-Ortiz D., Davari M. D., Sowdhamini R., Damour C., et al. (2025b). From thermodynamics to protein design: diffusion models for biomolecule generation towards autonomous protein engineering. arXiv preprint arXiv:2501.02680. doi: 10.48550/arXiv.2501.02680 [DOI] [Google Scholar]
  79. Li W., Schaeffer R. D., Otwinowski Z., Grishin N. V. (2016). Estimation of uncertainties in the global distance test (GDT_TS) for CASP models. PLoS One 11:e0154786. doi: 10.1371/journal.pone.0154786, [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Li M. Q. C., Wang S., Lin S.-R., Ting L. E. N., Wan Z.-H., Xie G., et al. (2025a). Advantages and limitations of AlphaFold in structural biology: insights from recent studies. Protein J. 45, 22–38. doi: 10.1007/s10930-025-10310-8, [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Liang T., Jiang C., Yuan J., Othman Y., Xie X. Q., Feng Z. (2022). Differential performance of RoseTTAFold in antibody modeling. Brief. Bioinform. 23:bbac152. doi: 10.1093/bib/bbac152, [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Lin D. H., Hoelz A. (2019). The structure of the nuclear pore complex (an update). Annu. Rev. Biochem. 88, 725–783. doi: 10.1146/annurev-biochem-062917-011901, [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Lisanza S. L., Gershon J. M., Tipps S. W. K., Sims J. N., Arnoldt L., Hendel S. J., et al. (2025). Multistate and functional protein design using RoseTTAFold sequence space diffusion. Nat. Biotechnol. 43, 1288–1298. doi: 10.1038/s41587-024-02395-w, [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Liu H., Dastidar S. G., Lei H., Zhang W., Lee M. C., Duan Y. (2008). Conformational changes in protein function. Methods Mol. Biol. 443, 258–275. doi: 10.1007/978-1-59745-177-2_14, [DOI] [PubMed] [Google Scholar]
  85. Mansoor S., Baek M., Juergens D., Watson J. L., Baker D. (2023). Zero-shot mutation effect prediction on protein stability and function using RoseTTAFold. Protein Sci. 32:e4780. doi: 10.1002/pro.4780, [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Marcu Ş.-B., Tăbîrcă S., Tangney M. (2022). An overview of Alphafold's breakthrough. Front Artificial Intelligence 5:875587. doi: 10.3389/frai.2022.875587, [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Masters M. R., Mahmoud A. H., Lill M. A. (2025). Investigating whether deep learning models for co-folding learn the physics of protein-ligand interactions. Nat. Commun. 16:8854. doi: 10.1038/s41467-025-63947-5, [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Mathur V., Jha M., Zai I., Mahajan M., Nazar S., Ali S., et al. (2025). Design and development of PROTACs: a new paradigm in anticancer drug discovery. Med. Drug Discov. 27:100221. doi: 10.1016/j.medidd.2025.100221 [DOI] [Google Scholar]
  89. Matzko R., Konur S. (2024). Technologies for design-build-test-learn automation and computational modelling across the synthetic biology workflow: a review. Netw. Model. Anal. Health Inform. Bioinform. 13:22. doi: 10.1007/s13721-024-00455-4 [DOI] [Google Scholar]
  90. Mirdita M., Schutze K., Moriwaki Y., Heo L., Ovchinnikov S., Steinegger M. (2022). ColabFold: making protein folding accessible to all. Nat. Methods 19, 679–682. doi: 10.1038/s41592-022-01488-1, [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Mosalaganti S., Obarska-Kosinska A., Siggel M., Taniguchi R., Turoňová B., Zimmerli C. E., et al. (2022). AI-based structure prediction empowers integrative structural analysis of human nuclear pores. Science 376:eabm9506. doi: 10.1126/science.abm9506, [DOI] [PubMed] [Google Scholar]
  92. Moult J., Fidelis K., Zemla A., Hubbard T. (2003). Critical assessment of methods of protein structure prediction (CASP)-round V. Proteins 53, 334–339. doi: 10.1002/prot.10556 [DOI] [PubMed] [Google Scholar]
  93. Mseer I., Al Makhzoumi O. (2025). “Data democratization: empowering individuals with access to insights derived from data,” In The Paradigm Shift from a Linear Economy to a Smart Circular Economy. Studies in Systems, Decision and Control, eds. M. Alaali, A. M. A. Musleh Al-Sartawi,and A. S. Aydiner. (Springer, Cham). 586. doi: 10.1007/978-3-031-87550-2_90 [DOI] [Google Scholar]
  94. Najar Najafi N., Karbassian R., Hajihassani H., Azimzadeh Irani M. (2025). Unveiling the influence of fastest nobel prize winner discovery: alphafold's algorithmic intelligence in medical sciences. J. Mol. Model. 31:163. doi: 10.1007/s00894-025-06392-x, [DOI] [PubMed] [Google Scholar]
  95. Nji E., Cramer K. C., Rüffin N. V., Fofana F. G., Heiba W., Sankhe S. (2025). Leveraging AlphaFold for innovation and sustainable health research in Africa. Nat. Commun. 16:1334. doi: 10.1038/s41467-025-56545-y, [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Nussinov R., Zhang M., Liu Y., Jang H. (2023). AlphaFold, allosteric, and orthosteric drug discovery: ways forward. Drug Discov. Today 28:103551. doi: 10.1016/j.drudis.2023.103551, [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Outeiral C., Nissley D. A., Deane C. M. (2022). Current structure predictors are not learning the physics of protein folding. Bioinf. (Oxf.) 38, 1881–1887. doi: 10.1093/bioinformatics/btab881, [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Pak M. A., Markhieva K. A., Novikova M. S., Petrov D. S., Vorobyev I. S., Maksimova E. S., et al. (2023). Using AlphaFold to predict the impact of single mutations on protein stability and function. PLoS One 18:e0282689. doi: 10.1371/journal.pone.0282689, [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Pancsa R., Tompa P. (2012). Structural disorder in eukaryotes. PLoS One 7:e34687. doi: 10.1371/journal.pone.0034687 [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Peng C., Ni W., Liu Q., Hu G., Zheng W. (2025). A comprehensive benchmarking of the AlphaFold3 for predicting biomacromolecules and their interactions. Brief. Bioinform. 26:bbaf616. doi: 10.1093/bib/bbaf616, [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Perkins-Jechow B. H., Iglesias Ahualli J. P., Nhu H. T., Omidi A., Li C., Holguin-Cruz J. A., et al. (2025). Challenging AlphaFold in predicting proteins with large-scale allosteric transitions. Commun Chem. 8:378. doi: 10.1038/s42004-025-01763-0, [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Perrakis A., Sixma T. K. (2021). AI revolutions in biology: the joys and perils of AlphaFold. EMBO Rep. 22:e54046. doi: 10.15252/embr.202154046, [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Plattner Nuria, Noé Frank. (2015). “Protein conformational plasticity and complex ligand-binding kinetics explored by atomistic simulations and Markov models.” Nat. Commun. 6:7653. 2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. pLDDT. Understanding local confidence. AlphaFold. (2024). Available online at: https://www.ebi.ac.uk/training/online/courses/alphafold/inputs-and-outputs/evaluating-alphafolds-predicted-structures-using-confidence-scores/plddt-understanding-local-confidence/ (Accessed September 02, 2025).
  105. Porta-Pardo E., Ruiz-Serra V., Valentini S., Valencia A. (2022). The structural coverage of the human proteome before and after AlphaFold. PLoS Comput. Biol. 18:e1009818. doi: 10.1371/journal.pcbi.1009818, [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Pratt O. S., Elliott L. G., Haon M., Mesdaghi S., Price R. M., Simpkin A. J., et al. (2025). AlphaFold 2, but not AlphaFold 3, predicts confident but unrealistic beta-solenoid structures for repeat proteins. Comput. Struct. Biotechnol. J. 27, 467–477. doi: 10.1016/j.csbj.2025.01.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Ren F., Ding X., Zheng M., Korzinkin M., Cai X., Zhu W., et al. (2023). AlphaFold accelerates artificial intelligence powered drug discovery: efficient discovery of a novel CDK20 small molecule inhibitor. Chem. Sci. 14, 1443–1452. doi: 10.1039/D2SC05709C, [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Rennie M. L., Oliver M. R. (2025). Emerging frontiers in protein structure prediction following the AlphaFold revolution. J. R. Soc. Interface 22:20240886. doi: 10.1098/rsif.2024.0886, [DOI] [PMC free article] [PubMed] [Google Scholar]
  109. RCSB PDB. (2026). Number of Released PDB Structures per Year. Berkeley, California, USA. Available online at: https://www.rcsb.org/stats/all-releasedstructures#:~:text=Table_title:%20Number%20of%20Released%20PDB%20Structures%20per,EM:%20563%20%7C%20Multiple%20Methods:%208%20%7C (Accessed September 02, 2025).
  110. Rubach P., Majorek K. A., Gucwa M., Murzyn K., Wlodawer A., Minor W. (2025). Advances in cryo-electron microscopy (cryoEM) for structure-based drug discovery. Expert Opin. Drug Discov. 20, 163–176. doi: 10.1080/17460441.2025.2450636, [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Ruff K. M., Pappu R. V. (2021). AlphaFold and implications for intrinsically disordered proteins. J. Mol. Biol. 433:167208. doi: 10.1016/j.jmb.2021.167208, [DOI] [PubMed] [Google Scholar]
  112. Ruperti F., Papadopoulos N., Musser J. M., Mirdita M., Steinegger M., Arendt D. (2023). Cross-phyla protein annotation by structural prediction and alignment. Genome Biol. 24:113. doi: 10.1186/s13059-023-02942-9, [DOI] [PMC free article] [PubMed] [Google Scholar]
  113. Sala D., Engelberger F., Mchaourab H. S., Meiler J. (2023). Modeling conformational states of proteins with AlphaFold. Curr. Opin. Struct. Biol. 81:102645. doi: 10.1016/j.sbi.2023.102645, [DOI] [PubMed] [Google Scholar]
  114. Scapin G. (2013). Molecular replacement then and now. Acta Crystallogr. D Biol. Crystallogr. 69, 2266–2275. doi: 10.1107/S0907444913011426 [DOI] [PMC free article] [PubMed] [Google Scholar]
  115. Schlievert P. M., Assimacopoulos A. P., Cleary P. P. (1996). Severe invasive group a streptococcal disease: clinical description and mechanisms of pathogenesis. J. Lab. Clin. Med. 127, 13–22. doi: 10.1016/S0022-2143(96)90161-4, [DOI] [PubMed] [Google Scholar]
  116. Schreiner E., Trabuco L. G., Freddolino P. L., Schulten K. (2011). Stereochemical errors and their implications for molecular dynamics simulations. BMC Bioinformatics 12:190. doi: 10.1186/1471-2105-12-190, [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Senior A. W., Evans R., Jumper J., Kirkpatrick J., Sifre L., Green T., et al. (2020). Improved protein structure prediction using potentials from deep learning. Nature. 577:706–710. doi: 10.1038/s41586-019-1923-7, [DOI] [PubMed] [Google Scholar]
  118. Shen C., Zhang X., Gu S., Zhang O., Wang Q., Du G., et al. (2025). Unlocking the application potential of AlphaFold3-like approaches in virtual screening. Chem. Sci. 17:2858–2879. doi: 10.1039/d5sc06481c [DOI] [PMC free article] [PubMed] [Google Scholar]
  119. Shi Y. (2014). A glimpse of structural biology through X-ray crystallography. Cell 159, 995–1014. doi: 10.1016/j.cell.2014.10.051, [DOI] [PubMed] [Google Scholar]
  120. Shi Y. (2024). Drug development in the AI era: AlphaFold 3 is coming! Innovation 5:100685. doi: 10.1016/j.xinn.2024.100685, [DOI] [PMC free article] [PubMed] [Google Scholar]
  121. Singh A. (2023). Structure prediction for orphan proteins. Nat. Methods 20:176. doi: 10.1038/s41592-023-01795-1, [DOI] [PubMed] [Google Scholar]
  122. Skolnick J., Gao M., Zhou H., Singh S. (2021). AlphaFold 2: why it works and its implications for understanding the relationships of protein sequence, structure, and function. J. Chem. Inf. Model. 61, 4827–4831. doi: 10.1021/acs.jcim.1c01114, [DOI] [PMC free article] [PubMed] [Google Scholar]
  123. Škrinjar P., Eberhardt J., Durairaj J., Schwede T. (2025). Have protein-ligand co-folding methods moved beyond memorisation? BioRxiv, 2025–2002. doi: 10.1101/2025.02.03.636309 [DOI] [Google Scholar]
  124. Staples M., Chan L., Si D., Johnson K., Whyte C., Cao R. (2019). “Artificial intelligence for bioinformatics: applications in protein folding prediction,” in 2019 IEEE Technology & Engineering Management Conference (TEMSCON), (Atlanta, GA, USA: IEEE; ), 1–8. doi: 10.1109/TEMSCON.2019.8813656 [DOI] [Google Scholar]
  125. Steinkellner G., Kroutil W., Gruber K., Gruber C. C. (2025). AlphaFold 3 is great - but it still needs human help to get chemistry right. Nature 637:548. doi: 10.1038/d41586-025-00111-5, [DOI] [PubMed] [Google Scholar]
  126. Szczepski K., Jaremko L. (2025). AlphaFold and what is next: bridging functional, systems and structural biology. Expert Rev. Proteomics 22, 45–58. doi: 10.1080/14789450.2025.2456046 [DOI] [PubMed] [Google Scholar]
  127. Tauriello G., Waterhouse A. M., Haas J., Behringer D., Bienert S., Garello T., et al. (2025). ModelArchive: a deposition database for computational macromolecular structural models. J. Mol. Biol. 437:168996. doi: 10.1016/j.jmb.2025.168996, [DOI] [PubMed] [Google Scholar]
  128. Testagrose C., Boucher C. (2025). Tokenization and deep learning architectures in genomics: a comprehensive review. Comput. Struct. Biotechnol. J. 27, 3547–3555. doi: 10.1016/j.csbj.2025.07.038, [DOI] [PMC free article] [PubMed] [Google Scholar]
  129. Uzoeto H. O., Cosmas S., Bakare T. T., Durojaye O. A. (2024). AlphaFold-latest: revolutionizing protein structure prediction for comprehensive biomolecular insights and therapeutic advancements. Beni-Suef Univ. J. Basic Appl. Sci. 13:46. doi: 10.1186/s43088-024-00503-y [DOI] [Google Scholar]
  130. van Kempen M., Kim S. S., Tumescheit C., Mirdita M., Lee J., Gilchrist C. L. M., et al. (2024). Fast and accurate protein structure search with Foldseek. Nat. Biotechnol. 42, 243–246. doi: 10.1038/s41587-023-01773-0, [DOI] [PMC free article] [PubMed] [Google Scholar]
  131. Varadi M., Anyango S., Deshpande M., Nair S., Natassia C., Yordanova G., et al. (2022). AlphaFold protein structure database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 50, D439–D444. doi: 10.1093/nar/gkab1061, [DOI] [PMC free article] [PubMed] [Google Scholar]
  132. Varadi M., Bertoni D., Magana P., Paramval U., Pidruchna I., Radhakrishnan M., et al. (2024). AlphaFold protein structure database in 2024: providing structure coverage for over 214 million protein sequences. Nucleic Acids Res. 52, D368–D375. doi: 10.1093/nar/gkad1011, [DOI] [PMC free article] [PubMed] [Google Scholar]
  133. Varadi M., Velankar S. (2023). The impact of AlphaFold protein structure database on the fields of life sciences. Proteomics 23:e2200128. doi: 10.1002/pmic.202200128 [DOI] [PubMed] [Google Scholar]
  134. Varga J. K., Ovchinnikov S., Schueler-Furman O. (2025). actifpTM: a refined confidence metric of AlphaFold2 predictions involving flexible regions. Bioinformatics 41:btaf107. doi: 10.1093/bioinformatics/btaf107, [DOI] [PMC free article] [PubMed] [Google Scholar]
  135. Verburgt J., Zhang Z., Kihara D. (2025). AlphaFold model quality self-assessment improvement via deep graph learning. Protein Sci. 34:e70274. doi: 10.1002/pro.70274, [DOI] [PMC free article] [PubMed] [Google Scholar]
  136. Wee J., Wei G. W. (2024a). Benchmarking AlphaFold3's protein-protein complex accuracy and machine learning prediction reliability for binding free energy changes upon mutation. ArXiv. arXiv:2406.03979v1. doi: 10.48550/arXiv.2406.03979 [DOI] [Google Scholar]
  137. Wee J., Wei G. W. (2024b). Evaluation of AlphaFold 3's protein-protein complexes for predicting binding free energy changes upon mutation. J. Chem. Inf. Model. 64, 6676–6683. doi: 10.1021/acs.jcim.4c00976, [DOI] [PMC free article] [PubMed] [Google Scholar]
  138. Wu R., Ding F., Wang R., Shen R., Zhang X., Luo S., et al. (2022). High-resolution de novo structure prediction from primary sequence. BioRxiv, 2022–2007. doi: 10.1101/2022.07.21.500999 [DOI] [Google Scholar]
  139. Wuyun Q., Chen Y., Shen Y., Cao Y., Hu G., Cui W., et al. (2024). Recent Progress of protein tertiary structure prediction. Molecules 29:832. doi: 10.3390/molecules29040832, [DOI] [PMC free article] [PubMed] [Google Scholar]
  140. Xu J., Zhang Y. (2010). How significant is a protein structure similarity with TM-score = 0.5? Bioinformatics 26, 889–895. doi: 10.1093/bioinformatics/btq066 [DOI] [PMC free article] [PubMed] [Google Scholar]
  141. Xue F., Zhang M., Li S., Gao X., Wohlschlegel J. A., Huang W., et al. (2025). SE(3)-equivariant ternary complex prediction towards target protein degradation. Nat. Commun. 16:5514. doi: 10.1038/s41467-025-61272-5, [DOI] [PMC free article] [PubMed] [Google Scholar]
  142. Yang Z., Zeng X., Zhao Y., Chen R. (2023). AlphaFold2 and its applications in the fields of biology and medicine. Signal Transduct. Target. Ther. 8:115. doi: 10.1038/s41392-023-01381-z, [DOI] [PMC free article] [PubMed] [Google Scholar]
  143. Zemla A. (2003). LGA: a method for finding 3D similarities in protein structures. Nucleic Acids Res. 31, 3370–3374. doi: 10.1093/nar/gkg571, [DOI] [PMC free article] [PubMed] [Google Scholar]
  144. Zhang H., Lan J., Wang H., Lu R., Zhang N., He X., et al. (2024). AlphaFold2 in biomedical research: facilitating the development of diagnostic strategies for disease. Front. Mol. Biosci. 11:1414916. doi: 10.3389/fmolb.2024.1414916, [DOI] [PMC free article] [PubMed] [Google Scholar]
  145. Zhang Y., Skolnick J. (2005). TM-align: a protein structure alignment algorithm based on the TM-score. Nucleic Acids Res. 33, 2302–2309. doi: 10.1093/nar/gki524, [DOI] [PMC free article] [PubMed] [Google Scholar]
  146. Zheng L. E., Barethiya S., Nordquist E., Chen J. (2023). Machine learning generation of dynamic protein conformational ensembles. Molecules 28:4047. doi: 10.3390/molecules28104047, [DOI] [PMC free article] [PubMed] [Google Scholar]
  147. Zhong Y., Chi F., Wu H., Liu Y., Xie Z., Huang W., et al. (2022). Emerging targeted protein degradation tools for innovative drug discovery: from classical PROTACs to the novel and beyond. Eur. J. Med. Chem. 231:114142. doi: 10.1016/j.ejmech.2022.114142, [DOI] [PubMed] [Google Scholar]
  148. Zhou F., Guo S., Peng X., Zhang S., Men C., Duan X., et al. (2025). Benchmarking AlphaFold3-like methods for protein-peptide complex prediction. BioRxiv:642277. doi: 10.1101/2025.03.09.642277 [DOI] [Google Scholar]
  149. Zhu W., Shenoy A., Kundrotas P., Elofsson A. (2023). Evaluation of AlphaFold-multimer prediction on multi-chain protein complexes. Bioinformatics 39:btad424. doi: 10.1093/bioinformatics/btad424, [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Frontiers in Artificial Intelligence are provided here courtesy of Frontiers Media SA

RESOURCES