Abstract
Proteins perform diverse functions critical to cellular processes. Transitions between functional states are often regulated by post‐translational modifications (PTMs) such as phosphorylation, which dynamically influence protein structure, function, folding, and interactions. Dysregulation of PTMs can therefore contribute to diseases such as cancer and Alzheimer's. However, the structure–function relationship between proteins and their modifications remains poorly understood due to a lack of experimental structural data, the inherent diversity of PTMs, and the dynamic nature of proteins. Recent advances in deep learning, particularly AlphaFold, have transformed protein structure prediction with near‐experimental accuracy. However, it remains unclear whether these models can effectively capture PTM‐driven conformational changes, such as those induced by phosphorylation. Here, we systematically evaluated AlphaFold models (AF2, AF3‐non phospho, and AF3‐phospho) to assess their ability to predict phosphorylation‐induced structural diversity. By analyzing experimentally derived conformational ensembles, we found that all models predominantly aligned with dominant structural states, often failing to capture phosphorylation‐specific conformations. Despite its phosphorylation‐aware design, AF3‐phospho predictions provided only modest improvement over AF2 and AF3‐non phospho predictions. Our findings highlight key challenges in modeling PTM‐driven structural landscapes and underscore the need for more adaptable structure prediction frameworks capable of capturing modification‐induced conformational variability.
Keywords: AlphaFold3, conformational diversity, phosphorylation, post‐translational modifications (PTMs), protein structures
1. INTRODUCTION
Proteins are essential biomolecules that play a myriad of roles in cellular processes, serving as enzymes, structural components, signaling molecules, and more. Their function is intricately linked to their three‐dimensional structure, which is determined by the sequence of amino acids and stabilized by various interactions such as hydrogen bonds, disulfide bridges, hydrophobic effects, and ionic interactions (Anfinsen, 1973; Maynard Smith, 1970; Redfern et al., 2008). This precise structure–function relationship is fundamental for the protein's biological activity (Bordin et al., 2023). Post‐translational modifications (PTMs), including phosphorylation, ubiquitination, glycosylation, and methylation, further diversify protein function and regulation (Goldtzvik et al., 2023). PTMs are critical in modulating protein activity, stability, localization, and interaction with other molecules, thereby fine‐tuning cellular processes and responses (Goldtzvik et al., 2023; Hunter, 1995; Xin & Radivojac, 2012). Among these, phosphorylation is one of the most extensively studied due to its pivotal role in regulating cellular processes (Hunter, 1995; Ramasamy et al., 2022), such as the MAPK/ERK pathway, where sequential phosphorylation events propagate signals that control cell growth, differentiation, and apoptosis (Zhang & Liu, 2002). Dysregulation of phosphorylation is implicated in numerous diseases, including cancer, diabetes, and neurodegenerative disorders (Cohen, 2001; Shu et al., 2023). Phosphorylation can induce significant conformational changes in the protein, acting as a molecular switch that regulates protein activity (Goldtzvik et al., 2023; Xin & Radivojac, 2012).
Understanding the mechanisms of PTMs, particularly phosphorylation, is crucial for fully comprehending protein function and, by extension, for the development of targeted therapies. Though 60% of the human proteome is known to be phosphorylated (P) (Ramasamy et al., 2022), only a small fraction of these known phosphosites are associated with functional relevance (Beltrao et al., 2012). Computational and experimental approaches have tried to identify such functionally important phosphosites (Ochoa et al., 2020; Viéitez et al., 2022). However, these methods cannot identify the structural impact of phosphorylation, which is pivotal in how signal transduction, enzyme regulation, and other cellular processes (Johnson, 2009; Orioli et al., 2022; Xin & Radivojac, 2012) are mediated. Our incomplete understanding of how phosphorylation modulates protein conformation especially impacts analysis of large‐scale proteomics datasets, given the dearth of experimentally available structural information (Goldtzvik et al., 2023; Ramasamy et al., 2022).
AlphaFold2, a deep learning‐based structure prediction algorithm, has demonstrated remarkable performance in protein structure prediction from amino acid sequence (Jumper et al., 2021). This tool has been instrumental in elucidating the structures of many previously unexplored or challenging proteins, resulting in for example a dramatic increase in the structural coverage of the human proteome and interactome (Kosoglu et al., 2024; Porta‐Pardo et al., 2022; Varadi et al., 2022). AlphaFold2 excels at predicting static ground‐state structures but often misses dynamic changes from modifications like phosphorylation (Bertoline et al., 2023; Perrakis & Sixma, 2021; Saldaño et al., 2022). AlphaFold3 improves on this by modeling ligands, complexes, and modifications, offering better insight into functional protein states (Abramson et al., 2024).
However, proteins are not static entities; they exist as ensembles of interchanging conformational states (Saldaño et al., 2022). This conformational diversity, especially in P proteins, presents a challenge for AlphaFold and similar computational models, which do not capture gradations in dynamical properties (Gavalda‐Garcia et al., 2025). Trained predominantly on experimental structures from the Protein Data Bank (PDB) determined by x‐ray diffraction, with the protein in crystallized form at typically cryogenic temperatures, AlphaFold essentially excels at predicting the most stable conformations in these conditions. How well AlphaFold can sample multiple protein conformations beyond these dominant structural states remains unclear (Chakravarty & Porter, 2022; Terwilliger et al., 2024); its ability to model conformational diversity is not fully understood and was only validated on a limited number of well‐studied experimentally determined cases. Its predictions may therefore be guided more by pattern recognition for the most probable conformer than by an intrinsic understanding of the energetics underlying protein ensembles (Chakravarty et al., 2024; Chakravarty & Porter, 2022).
These open questions on AlphaFold's performance on highly dynamic proteins (Chakravarty et al., 2024; Chakravarty & Porter, 2022) are particularly relevant for phosphorylation, which is known to drive distinct structural states critical for regulatory functions. To date, no large‐scale studies have systematically assessed AlphaFold's performance on P proteins in light of their complex conformational landscapes. The distinct structural states introduced by phosphorylation are often functionally critical, yet transient or less abundant—can AlphaFold accurately model such structural changes, or does it merely default to predicting the dominant, non‐phosphorylated (NP) conformation?
Here, we provide a large‐scale, detailed assessment of the conformational diversity of P proteins by analyzing multiple experimentally observed conformational states of eukaryotic proteins (human, mouse, and rat) in the PDB (Burley et al., 2023). In addition, we evaluate the ability of AlphaFold models (AF2, AF3) to predict these phosphorylation‐induced conformational states, focusing on both global and local structural changes. Although AF2 and AF3 were trained on PDB structures that capture only limited conformational variability, in this study we leverage the same structural information in a different way: to evaluate how well AlphaFold predictions can actually reproduce conformational diversity and phosphorylation‐induced structural transitions, rather than simply replicating their original training data for those proteins. Using metrics such as conformational diversity score (CDS), phospho‐adjusted CDS, co‐evolutionary signals and clustering approaches, we analyze how phosphorylation drives conformational shifts and assess whether AlphaFold models (AF2, AF3) capture these transitions. Our analysis highlights both the strengths and limitations of AlphaFold (AF2, AF3) in modeling P proteins, revealing biases toward dominant structural ensembles and reduced sensitivity to phosphorylation‐induced conformational changes. In contrast to previous studies that examined general conformational diversity or AlphaFold's limitations in ensemble prediction, our work provides the first large‐scale, phosphorylation‐specific benchmark of AF2 and AF3. This focus on phosphorylation distinguishes our study and advances current understanding of how AlphaFold models capture, or fail to capture, modification‐driven conformational changes. Notably, we observed that AF models prioritize evolutionarily stable conformations, favoring those reinforced by co‐evolutionary constraints rather than modeling the full spectrum of structural variability. Our findings provide a foundation for improving predictive algorithms by emphasizing the need to capture local structural variations and rare conformations. Future refinements of AlphaFold architectures—such as incorporating context‐specific embeddings or hybrid frameworks—may enable more accurate modeling of post‐translationally modified protein states.
2. RESULTS
2.1. Structural diversity of phosphoproteins in the PDB
We retrieved all available protein structures from the PDB for human, mouse, and rat to investigate conformational changes upon phosphorylation. Of the 10,686 proteins with at least one structure in the PDB, only 8.5% had at least one P canonical site, identified by residue codes for phosphoserine (SEP), phosphothreonine (TPO), or phosphotyrosine (PTR). Filtering for P structures with ≥50 amino acids reduced this to 2.2%, indicating that most phospho‐containing entries are short peptides. When restricting to proteins with both P and NP structures, the number dropped further to 1%, underscoring the limited availability of full‐length P proteins (Figure 1a). Most P structures belong to the kinase superfamily, followed by small GTPases and heat shock proteins (Figure 1b). Singly P proteins are most common, with TPO being the most frequent modification, followed by PTR and SEP. For dual phosphorylation, PTR‐PTR combinations dominate, followed by SEP‐TPO pairs.
FIGURE 1.

Overview of phosphorylated (P) protein structures in the PDB. (a) Workflow for collecting, filtering, and clustering P and non‐phosphorylated (NP) protein conformers. The process involves retrieving structures from UniProt and PDB, filtering for complete residues surrounding phosphosites, and clustering based on global RMSD values. (b) Protein family distribution of experimentally determined P structures in the PDB, dominated by kinase superfamilies, with smaller contributions from other families. (c) Sunburst plot showing the number of distinct clusters per protein based on global RMSD clustering, with proteins classified by cluster count. (d) Dumbbell plot illustrating cluster size statistics for each protein. The x‐axis represents the number of conformers per cluster, with blue and orange markers indicating the smallest and largest clusters, respectively. (e) Density plot showing the distribution of the fraction of P conformers per protein, highlighting the prevalence of phosphorylation across the dataset. RMSD, root mean square deviation.
To ensure reliable structural comparison, we curated a high‐quality dataset by filtering for substantial fragments without missing residues. The final dataset includes 109 proteins with 253 phosphorylation sites, spanning all phosphorylation states‐single, dual, and multiple‐as well as corresponding NP structures (see Methods). Recruitment of conformers was further validated by pairwise sequence alignment, which confirmed very high sequence conservation. Sequence identity was 90%–100% for 108 of 109 proteins, with only one protein in the 70%–80% range. Sequence similarity showed a comparable trend, with 107 proteins in the 95%–100% range and two between 80% and 95%. Detailed information on the protein, structure, and phosphorylation information is provided in Tables S1 and S2, Supporting Information.
2.2. Conformational diversity across phosphoproteins
To assess conformational diversity across phosphoproteins, structural conformers of each protein were clustered based on root mean square deviation (RMSD) values (Figure 1a). Conformers are alternative structural states within a protein's native ensemble that can range from large domain shifts to subtle local rearrangements, often arising from conditions such as substrate binding, PTMs, or pH changes (Saldaño et al., 2022). Here, we specifically define conformational diversity in the context of protein phosphorylation. We used the number of clusters per protein as a proxy for structural variability, where a higher cluster count indicates greater conformational diversity. As shown in the circular bar plot (Figure 1c), cluster counts varied across proteins, suggesting a broad range of structural flexibility in the dataset. Most proteins (66%) exhibited two distinct conformational states, followed by 26% with three states. Larger protein families, such as kinases, tended to show more clusters than smaller families like small GTPases and heat shock proteins.
To examine the composition of these clusters, we analyzed the distribution of conformers within each. Figure 1d presents a dumbbell plot comparing the minimum and maximum cluster sizes for each protein, revealing differences in the representation of structural states within the PDB. Proteins with similar maximum and minimum conformer counts had balanced structural sampling, while those with wide disparities showed preferential representation of certain states—potentially P forms—over others. This suggests a bias in structural deposition, favoring stable or experimentally tractable conformations.
We next explored whether phosphorylation contributes to this structural diversity. Almost 50% of proteins in our dataset contained at least 25% P conformers (Figure 1e). To further investigate, we analyzed the proportion of P and NP conformers within each cluster (Figure 2a). The heatmap shows clusters enriched for either P or NP conformers, as well as mixed clusters, indicating that phosphorylation can both influence and coexist within the structural landscape. Interestingly, 65% of proteins had at least half of their clusters containing both P and NP conformers (Figure 2a, inset). This overlap suggests that phosphorylation often stabilizes conformations already accessible in the NP ensemble, rather than inducing entirely new folds. In such cases, phosphorylation acts as a modulator—favouring specific pre‐existing states rather than drastically altering structure—thereby enhancing the functional adaptability of the protein. Overall, these findings imply that the NP structural landscape is inherently flexible, encompassing conformations compatible with the P state. Phosphorylation may therefore fine‐tune protein function by shifting the equilibrium toward particular structural states in response to regulatory cues.
FIGURE 2.

Mapping conformational diversity across phosphorylated proteins. (a) Heatmap illustrating the number of clusters (x‐axis) for each protein (y‐axis), with each cell representing a cluster. The proportion of P conformers is shown in cyan, and NP conformers in red. The inset highlights the fraction of clusters containing P and NP conformers across all proteins. (b) Example from the dataset for Mitogen‐activated protein kinase 1, which has three clusters. Cluster IDs 1 and 3 contain both P and NP conformers, while Cluster ID 2 consists exclusively of NP conformers. Representative structures for each cluster are shown, with P conformers in gold and NP conformers in blue. Phosphosites are highlighted in red and displayed as sticks where visible (e.g., in Cluster 1, the phosphorylated site in 6OPI‐A is missing due to unresolved crystal structure regions).
2.3. Phosphorylation induced conformational shifts in experimental structures
To evaluate the structural impact of phosphorylation, we calculated pairwise RMSD values across three conformer pair types within each protein: P–P, NP–NP, and P–NP. RMSD values between P–P and NP–NP conformers were comparable, as shown by their proximity to the diagonal in Figure 3a, indicating similar levels of structural variability within each state. In contrast, P–NP comparisons exhibited higher RMSD values, clustering above the diagonal, suggesting that phosphorylation introduces greater structural divergence between P and NP states (Figure 3a).
FIGURE 3.

Conformational diversity and protein phosphorylation. (a) Scatter plot (top) showing the median global RMSD values between P conformers (x‐axis) and between P and NP conformers (y‐axis) for each protein. Data points closer to the diagonal indicate less variability, while those further from the diagonal indicate higher diversity. The bottom scatter plot compares the median global RMSD between P and NP conformers (x‐axis) and between all non‐phospho conformers (y‐axis) within each protein. (b) Bar plot showing the difference between conformational diversity scores (CDS) and adjusted phospho CDS. Negative values indicates that phosphorylation induces greater conformational drift in those proteins. (c) Violin plots showing the distribution of RMSD values for conformer pairs (P and NP) with the maximal RMSD in a protein. Global RMSD values are shown in purple, while segment RMSD values are in gold. The x‐axis indicates the number of phosphosites in the phospho conformer. (d) Residue‐level modification analysis highlighting the most common single phosphorylation site SEP, followed by TPO and PTR. Co‐occurring phosphorylation sites are dominated by PTR‐associated phosphosites. (e) Boxplot showing RMSD values for the same dataset as in C, but with the x‐axis representing the phosphosite combinations. RMSD, root mean square deviation.
To assess whether molecular interactors contribute to phosphorylation‐associated structural diversity, we compared RMSD distributions across P–P, P–NP, and NP–NP conformer pairs under different ligand‐binding conditions. In this analysis, ligands were defined broadly as any non‐polymeric molecules annotated in the PDB entry, excluding waters. This general definition allowed us to assess whether the presence of a bound molecule, regardless of type, modulated conformational variability, although ligand‐specific effects could not be resolved. Ligand binding increased structural variability in NP conformers but had a stabilizing effect in P conformers. Correlation analyses further confirmed that phosphorylation was the primary driver of conformational divergence, while ligand effects acted as secondary modulators in a phosphorylation‐dependent manner (see Data S1 and Figure S1 for detailed analysis).
2.4. Conformational diversity score and adjusted CDS analysis
To quantify phosphorylation's effect on structural diversity, we calculated the CDS and its phosphorylation‐adjusted version (pCDS) for each protein (see Methods). While CDS measures overall structural variability, pCDS normalizes this value by the proportion of P and NP conformers, isolating the contribution of phosphorylation. The difference between CDS and pCDS reflects phosphorylation's structural impact. Positive values indicate that NP conformers drive most variability, with phosphorylation stabilizing pre‐existing states (Figure 3b, S2A). Negative values suggest phosphorylation introduces novel conformations, expanding structural diversity (Figure 3b, S2C). Near‐zero differences imply minimal structural shifts between P and NP conformers, pointing to a subtle regulatory effect (Figure 3b, S2B).
Overall, only 23% of proteins showed notable conformational drift upon phosphorylation, indicating that most proteins maintain similar structures across P and NP states under experimental conditions. Combined with the observed distributions of conformer fractions and median RMSD values (Figure 3a), these results suggest phosphorylation typically induces minor structural adjustments—stabilizing or subtly modulating NP conformations—rather than creating entirely new structural states.
2.5. Conformational variability by phosphorylation state: Global versus local effects
We evaluated how different phosphorylation states (single, dual, or multiple sites) affect structural variability by comparing phosphosites in experimental structures to their associated conformational shifts. For each protein, we selected the phospho state (single, dual, or multi) that exhibited the highest RMSD relative to the NP conformer. For instance, if a single modification (e.g., SEP) yielded higher RMSD than a dual‐modified form (e.g., SEP + TPO), the single‐modified conformer was used.
Conformers with a single phosphosite displayed the greatest global RMSD variability, with some showing extreme conformational shifts (~20 Å) (Figure 3c). This higher variability may reflect both the limited availability of multi‐modified structures (Figure 3d) and the filtering strategy, where the phospho state showing the largest conformational shift relative to the NP state was selected. Dual‐modified structures exhibited narrower global RMSD distributions, suggesting a stabilizing effect, while triple‐modified proteins showed a broader range and slightly elevated medians, indicating a possible interplay between stabilizing and destabilizing forces (Figure 3c).
Segment‐level RMSD remained tightly distributed across all modification states, indicating that conformational changes were localized—likely near the modification sites, possibly within intrinsically disordered regions. Interestingly, while segment RMSD was lower than global RMSD in single‐modified proteins, dual‐ and multi‐modified states often showed higher segment‐level median RMSD than their global values. This suggests that although global structures become more stable with increasing modification complexity, the local environment remains flexible and undergoes fine‐tuned adjustments in response to multi‐site phosphorylation.
2.6. Role of single and multi‐site phosphorylation in conformational diversity
We analyzed global and segment RMSD values to assess residue‐specific conformational variability driven by phosphorylation (Figure 3d,e). Segment RMSD was consistently higher than global RMSD, indicating increased local flexibility near modification sites, in line with previous observations of enhanced flexibility at P residues (Ramasamy et al., 2022). Among single modifications, SEP showed the most stable conformations, with narrow RMSD distributions. TPO displayed intermediate variability, while PTR induced the largest segment‐level shifts, reflecting pronounced localized reorganization (Figure 3e). In multi‐site contexts, TPO–TPO and PTR‐containing pairs (e.g., PTR–PTR, PTR–TPO) showed broader RMSD distributions and higher segment variability, whereas SEP‐SEP pairs remained relatively stable. Overall, PTR drives the most substantial conformational changes, followed by TPO, while SEP is associated with minimal structural shifts (Figure 3e).
2.7. Conformational diversity and structural features
To understand the structural basis of phosphorylation‐induced variability, we analyzed domain organization, secondary structure transitions, and solvent accessibility in conformer pairs with maximal RMSD differences. Proteins with phosphorylation sites located within structured domains exhibited greater conformational diversity, especially in multi‐domain proteins. Phosphorylation‐induced structural shifts were most prominent in flexible coil regions and were associated with increased solvent accessibility at the modification site. A detailed analysis is provided in Data S1 and Figures S3, S4.
2.8. Residue‐specific conformational changes upon phosphorylation
Residue‐specific domain analysis further confirmed the preference for modification sites, including both P and NP counterparts Tyrosine (TYR/PTR), Serine (SER/SEP), Threonine (THR/TPO), within structured regions. Modifications were predominantly observed in domain regions (PTR: n = 47; SEP: n = 41; TPO: n = 42), with fewer instances in non‐domain regions (PTR: n = 6; SEP: n = 20; TPO: n = 2) (Figure S3C). Notably, SEP/SER displayed a higher occurrence in nondomain regions compared to PTR/TYR and TPO/THR. Mixed cases, where one conformer had a modification in a domain and the other in a nondomain region, were rare, with only one instance each for SEP/SER and TPO/THR and none for PTR/TYR (Figure S3C).
Our analysis suggests that sites corresponding to both P and NP states in conformers with maximal diversity pairs are predominantly localized in structured regions. However, SEP and SER exhibit greater flexibility, with a higher proportion occurring in nondomain regions compared to TYR, THR, and their P counterparts.
Residue‐specific secondary structural analysis revealed that SER to SEP modifications most frequently maintained coil to coil states (n = 36) but were also observed in helix to helix states and occasionally transitioned from coil in the NP state to helix in the P state (Figure S4B). THR to TPO modifications exhibited transitions from helix in the NP state to coil in the P state, consistent with findings from a recent molecular dynamics study (Bickel & Vranken, 2024), while PTR modifications were predominantly associated with strand states, either maintaining strand to strand transitions or adopting a strand conformation in the P state from a coil in the NP state. PTR and TPO modifications demonstrated larger changes in RMSD values and solvent accessibility compared to SEP, indicating a higher degree of structural reorganization in these residues (Figure S4C, D).
2.9. AlphaFold predictions converge on dominant states regardless of phosphorylation
AF's ability to predict the effect of phosphorylation on proteins was tested on 109 protein pairs that showed little to large conformational differences upon phosphorylation. Most of these protein pairs are represented by kinase families, and most of these structural pairs are likely in the AF2 and AF3 training set. To evaluate the accuracy of AF2, AF3, and AF3‐p predictions in capturing P and NP conformations, a systematic approach was undertaken (Figure 5). Pairwise RMSD values were calculated for all P–NP conformers, and the pair with the highest RMSD was selected to represent the most structurally diverse conformations for each protein. The predicted models (AF2, AF3, and AF3‐p) were then aligned to these high‐RMSD pairs to determine whether they most closely resembled the P or NP state (Figure 5). This approach provided a stringent initial benchmark, as the chosen conformers captured the maximal experimentally observed divergence between the two functional states. By focusing on these extremes, the analysis avoided dilution by redundant or similar conformers. To prevent bias from considering only one pair, we complemented this with a full ensemble analysis that included all available conformers per protein (see Section Structural fidelity of AlphaFold models across conformational ensembles).
FIGURE 5.

Workflow for evaluating structural fidelity of AlphaFold predictions across conformational ensembles. All available phosphorylated (P) and non‐phosphorylated (NP) conformers for each protein were compared against AlphaFold models (AF2, AF3, AF3‐p) using pairwise RMSD and hierarchical clustering. To identify the phosphorylation state driving conformational variability, we first selected the P–NP conformer pair with the highest RMSD value. This phosphorylation state was then used for AF3‐p prediction. For each model, the closest experimental conformer from the highest RMSD pair was identified, and its cluster ID, size, and phosphorylation state were recorded. To extend beyond the highest RMSD pair, we further aligned the models to all conformers available for the protein and recorded the closest experimental conformer and cluster information. Predictions were then classified as aligning to the same or different cluster as the closest conformer, and further evaluated for phospho‐state agreement. This workflow allowed us to assess whether AlphaFold models reproduced structural similarity alone or also captured phosphorylation‐dependent conformational differences. RMSD, root mean square deviation.
The results demonstrated that AF3‐p predictions aligned with P conformers in 73% of cases (80/109 proteins), slightly higher than AF3 (70%) and AF2 (64%) (Figure 4c). Our results suggest that AF3‐p, which is tailored for phospho‐specific predictions, exhibits a modest enrichment toward P conformations compared to the general‐purpose AF2 and AF3 models. However, the similarity in alignment proportions between the models indicates that AF2 and AF3 have comparable predictive preferences for P and NP states. Importantly, a chi‐square test of independence (χ2 = 2.28, p = 0.32) revealed no statistically significant differences in P/NP proportions across the models, indicating that all models perform similarly in aligning with P and NP conformers.
FIGURE 4.

Evaluating AlphaFold predictions for phosphorylated and non‐phosphorylated states. (a) Workflow outlining the integration of PDB conformers with AlphaFold predictions (AF2, AF3, and AF3‐p) for clustering based on global and local RMSD analyses. (b) Hierarchical clustering of pairwise RMSD values for the Proto‐oncogene tyrosine‐protein kinase receptor (RET) reveals two distinct conformational clusters, suggesting the existence of two structural states. One cluster consists exclusively of P conformers, while the dominant cluster contains a mix of P and NP conformers. Representative structures from each cluster are displayed on the right, with phosphosites highlighted in red. (c) Proportion of predicted conformational states aligning with the closest P or NP conformers for AF2, AF3, and AF3‐p predictions. AF3‐p shows a slight enrichment toward phosphorylated conformers compared to AF3 and AF2. However, all models exhibit a comparable tendency to predict P states over NP ones. (d) Cluster consistency analysis reveals that 81.65% of AlphaFold predictions align with the same cluster across models, while 18.35% diverge into different clusters. (e) Alignment of AlphaFold predictions with same dominant structural clusters indicates a strong preference for dominant conformational states across all models, regardless of phosphorylation specificity. RMSD, root mean square deviation.
2.10. Structural convergence and dominance in AlphaFold predictions
We evaluated the alignment of AlphaFold predictions (AF2, AF3, and AF3‐p) to structural clusters corresponding to the closest P and NP conformational states. Hierarchical clustering of pairwise RMSD data grouped all conformers of a protein into structural clusters, with the cluster ID, size, and phosphorylation labels (see Methods, Figure 5) annotated for the conformer closest to each predicted model. This allowed us to assess whether the predictions from AF2, AF3, and AF3‐p consistently aligned with the same cluster or spanned multiple distinct clusters. Across 109 proteins, we found that AlphaFold predictions aligned into the same cluster for 81.65% of cases, with predictions diverging into different clusters in only 18.35% of cases (Figure 4d,e). This high degree of alignment suggests a strong structural convergence across all three models, indicating that AlphaFold predictions are largely consistent with similar conformational states, regardless of phosphorylation status.
To further examine this tendency, we analyzed the alignment of predictions with dominant clusters, defined as the largest structural clusters for each protein. In cases where multiple clusters were of equal size, these clusters were categorized as “equal.” The results showed that all three models aligned with dominant clusters in 98% of cases. The Z‐test for proportions revealed no significant differences in the proportion of dominant predictions among the three models (AF2, AF3, and AF3‐p; p > 0.05 for all pairwise comparisons), indicating that all models exhibit similar preferences for predicting dominant states, regardless of their specific design or focus on phosphorylation.
This consistent alignment with dominant clusters likely reflects patterns inherent to the training data, which emphasize the most stable or frequently observed conformations aligning well with the previous studies on fold switching proteins without modifications (Chakravarty et al., 2024; Chakravarty & Porter, 2022). AlphaFold models thus appear more likely to favor dominant structural states rather than explore alternative or less‐populated conformations, irrespective of phosphorylation specificity. This inherent bias highlights the model's tendency to prioritize the most stable and frequently observed conformations, potentially limiting their ability especially AF3 to capture subtle structural variations and conformational shifts introduced by PTMs such as phosphorylation. This pattern is further illustrated through a case study on the RET kinase receptor, where hierarchical clustering and heatmap analysis of RMSD values were used to evaluate the alignment of AlphaFold predictions with experimental conformers (see Data S1 and Figures S5 and 6c).
FIGURE 6.

AlphaFold predictions and co‐evolutionary constraints. (a) Bar plot showing Δ (Jco‐evolving pairs − Jall contacts) for each protein across AF2 (green), AF3 (orange), and AF3‐p (blue). Positive values indicate stronger alignment of predicted models with co‐evolutionary constraints, while negative values indicate stronger alignment with generic contact networks. This representation highlights model‐specific differences and protein‐level variation in the extent to which evolutionary couplings shape predicted conformations. Proteins are grouped by imbalance category (balanced → moderate → skewed) and x‐axis labels are shaded from dark to light gray accordingly. (b) P04406: structural alignment shows strong similarity between P (in gold, phosphosites in red) and NP states (in blue). All three models predicted the coexisting P conformer. (c) P07949: alignment of P (in gold) and NP conformers (in blue) shows certain phospho states that exist in a distinct conformational state different from the NP state. AF2 and AF3 predicted the P conformer driving the variability while AF3 predicts the co‐existing NP conformer. (d) P28482: structural alignment of P (in gold, phosphosites in red) and NP conformers (in blue and cyan) shows that the NP state exists in multiple conformational ensembles, with the P conformer being part of these pre‐existing states. AF2 and AF3‐p predicts the co‐existing state while AF3 predicted the NP conformer driving variability. For clarity, only selected P and NP conformers are shown in all examples, with the fragments of interest highlighted.
2.11. Structural fidelity of AlphaFold models across conformational ensembles
To address the potential bias of analyzing only a single pair of conformers, we expanded the analysis to include all available conformers for each protein that passed the filtering criteria (see Methods). This approach ensured that the structural diversity of both P and NP states was comprehensively represented, allowing for a more robust evaluation of AlphaFold model predictions against the full range of experimentally observed conformations (Figure 5). Pairwise RMSD calculations were performed between all conformers and the predicted models to identify the conformer closest to each prediction, with its phosphorylation status recorded. This expanded alignment offered a more comprehensive evaluation of how effectively the models captured the full range of structural diversity observed in the PDB, rather than focusing solely on a single pair exhibiting high conformational variability.
For example, for each model type (AF2, AF3, AF3‐p), we first identified the closest experimental conformer (P or NP) from the pair showing the largest conformational difference for each protein. To expand beyond this single high‐variability pair, we included all available P and NP conformers for a protein, along with the predicted models, in hierarchical clustering (Figure 5). From the resulting clusters, we obtained the closest experimental conformer for each model type and extracted its cluster information, including ID, size, and phosphorylation state.
We then compared the closest conformers identified from the high‐RMSD pair with those identified through comprehensive full conformer clustering. If both the cluster ID and phosphorylation state matched, the prediction was considered to have captured the same conformational and functional state (Figure 5). If the cluster ID matched but the phosphorylation state differed, it suggested that the model predicted a similar structural state, independent of phosphorylation status. We also examined the cluster size of the closest conformer to assess whether the models aligned with the most dominant structural state (Figure 5).
Predictions were categorized based on whether they aligned with the “same” or “different” clusters, and whether the phosphorylation states matched or differed (Figure 5). This allowed us to evaluate the AlphaFold model's ability to reproduce both structural similarity and phosphorylation‐dependent conformational differences. The analysis quantified how often models aligned with similar structural clusters, and to what extent they sampled structurally diverse or functionally distinct states.
Across all AlphaFold models (AF2, AF3, AF3‐p), predictions predominantly aligned with the same cluster as the closest experimental conformer. AF3‐p showed the highest cluster alignment (81%), followed by AF3 (80%) and AF2 (71%). A Kruskal–Wallis test based on binary classification (“same” vs. “different” cluster) revealed no significant differences among the models (H = 4.20, p = 0.12), indicating comparable performance in structural cluster matching across AF2, AF3, and AF3‐p.
When phosphorylation state was considered, predictions that aligned with the same structural cluster generally also matched in phospho‐state. AF3 had the highest number of matches (n = 50), followed by AF3‐p (n = 49) and AF2 (n = 43). In contrast, predictions aligning with different clusters were more likely to diverge in phosphorylation state, with AF2 showing the highest number of mismatches (n = 21).
Binomial tests assessing enrichment for phospho‐state matches revealed that none of the models significantly favored matching phosphorylation states beyond random expectation (AF3‐p: p = 0.44; AF3: p = 0.70; AF2: p = 0.84). Similarly, a chi‐squared test comparing the overall distribution of matching versus non‐matching phospho‐states across models showed no significant difference (χ2 = 0.68, p = 0.71). Overall, while AF3‐p showed a slight trend toward better phospho‐state agreement, all models performed similarly in both structural cluster alignment and phosphorylation state matching. These findings highlight the need for further optimization to improve model sensitivity to phosphorylation‐dependent conformational changes.
2.12. Co‐evolutionary constraints bias AlphaFold toward dominant conformations
To investigate why AF models favor specific conformational states, we examined structural and evolutionary interaction patterns, focusing on global residue interaction networks, localized contacts around P and NP sites, and co‐evolutionary relationships inferred from co‐evolving residue pairs. We analyzed 44 proteins with available co‐evolutionary coupling data. For each protein, we quantified Δ, defined as the difference between Jaccard similarity based on co‐evolving residue pairs and all residue‐residue contacts, as a measure of how strongly AF predictions capture co‐evolutionary constraints relative to generic packing interactions (Figure 6a).
Across the dataset, Δ values declined progressively with increasing imbalance between P and NP conformers. Proteins with balanced ensembles (imbalance <0.15) exhibited the highest alignment (mean Δ = 0.306, n = 30 predictions), while moderate imbalance (0.15–0.35) showed weaker alignment (mean Δ = 0.062, n = 144). Skewed proteins (>0.35) displayed negligible signal (mean Δ = −0.029, n = 90). Pairwise comparisons confirmed that balanced proteins displayed significantly higher Δ values than either moderate (p = 8.5 × 10−8) or skewed proteins (p = 6.7 × 10−10), and moderate exceeded skewed (p = 0.012). Across all proteins, imbalance correlated negatively with Δ (Spearman ρ = −0.304, p = 4.7 × 10−7), demonstrating a graded loss of co‐evolutionary alignment as ensemble coverage became increasingly skewed (Figure 6a).
Importantly, this effect depended on which conformer dominated the ensemble. Δ values were uniformly low in NP‐dominant proteins (mean Δ ≈0.04 across AF2, AF3, and AF3‐p), reflecting convergence onto generic NP contacts. By contrast, P‐dominant proteins displayed substantially higher Δ values (mean Δ = 0.16), with AF3‐p showing the strongest alignment (Δ = 0.175). Consistent with this, Δ values were also highest in proteins where all three AF models converged on P conformers (mean Δ = 0.103) compared to NP (0.031) or mixed predictions (0.017; Kruskal–Wallis p < 0.05), reinforcing that phosphorylation‐stabilized conformers exert stronger evolutionary constraints. These findings demonstrate that AlphaFold predictions capture evolutionary couplings most strongly when phosphorylation stabilizes the dominant conformation.
As illustrative case studies, we highlight three proteins that exemplify these patterns with distinct conformational landscapes (Figure 6b–d): P04406, where P and NP states are nearly identical; P07949, which exhibits multiple P conformers, with one driving structural diversity while others coexist with NP states; and P24842, where NP states drive conformational variability while P states remain stable.
Across all cases, we observed that AlphaFold predictions favored evolutionarily stable conformations, those reinforced by co‐evolutionary constraints rather than sampling the full spectrum of conformational variability (Figure 6b–d). AF3‐p consistently aligned with dominant P states that coexist with NP conformers, while AF2 and AF3 more frequently predicted NP conformers, especially in cases where NP states were responsible for the observed conformational diversity.
In P04406, P and NP conformers are nearly indistinguishable, forming a coexisting ensemble with little structural diversity (3 P vs. 86 NP). All three models aligned with the P conformer sampling from the dominant cluster (3 P vs. 84 NP). Δ values from all contacts were close to zero, and subgraph values were uniformly negative, consistent with the weak evolutionary signal expected for skewed ensembles (Figure 6b). This suggests that the models reproduced a P conformer in a state that coexists with NP but is not strongly reinforced by co‐evolutionary couplings.
P07949 presents a more balanced case (22 P vs. 9 NP), with multiple P conformers, one driving structural diversity while others cluster closely with NP states (Figure 6c). AF2 and AF3 aligned with a P conformer sampling from the sample cluster (5 P vs. 0 NP), whereas AF3‐p predicted an NP conformer within the coexisting cluster, sampling from a different cluster other than AF2 and AF3 (17 P vs. 9 NP). Both P and NP conformers showed strongly positive Δ values (≈0.38–0.44), highlighting the robust co‐evolutionary reinforcement expected in balanced ensembles (Figure 6a). At the subgraph level, NP conformers carried a slightly stronger signal, consistent with AF3‐p's NP alignment. This demonstrates how models converge on evolutionary constraints when structural diversity is balanced across P and NP states.
In P28482, conformational variability is primarily driven by NP states, while P states remain comparatively stable (151 NP vs. 11 P). AF2 and AF3‐p sampled from the same cluster (9 P vs. 45 NP) predicting the co‐existing NP/P conformer, whereas AF3 aligned with a NP conformer within a more stable cluster (2 P vs. 106 NP). Δ values from all contacts were negative for both P and NP (≈−0.10 to −0.21), consistent with a collapse toward generic contact networks in heavily skewed ensembles (Figure 6a,d). Subgraph scores showed NP states strongly negative for AF3 but closer to neutral for AF2 and AF3‐p, while AF3‐p's P conformer displayed a large negative shift. This suggests that when ensembles are skewed, AlphaFold models reproduce dominant conformers with little evolutionary reinforcement.
Together, our findings suggest that AF models, rather than learning all viable conformations equally, are optimized to reproduce the most frequently observed or evolutionarily stable states. They align closely with co‐evolutionary constraints when P and NP conformers are balanced, yet collapse onto dominant folds when ensemble coverage is skewed, particularly in NP‐dominated proteins. This behavior is consistent with recent observations by Bryant and Noé (2024) and Kalakoti and Wallner (2025), who reported that AF2 predictions are inherently tied to a single set of coevolutionary constraints. AF3‐p's preference for dominant P conformers suggests that P residues within stable environments exert a strong influence on model predictions, whereas AF2 and AF3 remain more sensitive to NP conformers, particularly when these dominate the conformational landscape.
3. DISCUSSION
Our study provides an integrated perspective on how phosphorylation modulates protein structure and evaluates the ability of AlphaFold models (AF2, AF3, and AF3‐p) to capture these changes. By combining a systematic analysis of experimentally resolved P and NP structures with an assessment of AlphaFold predictions, we uncover critical insights into the conformational landscape, model biases, and implications for the broader research community.
Phosphorylation predominantly stabilizes or subtly adjusts pre‐existing structural states rather than inducing novel conformations (Xin & Radivojac, 2012). Approximately 65% of P and NP conformers share the same structural cluster, suggesting that the NP ensemble often includes conformations accessible to the P state. These findings highlight phosphorylation's role as a fine‐tuner of protein function, favoring specific structural states that support regulatory or signaling processes. However, distinct conformational shifts upon phosphorylation were observed in 23% of proteins, underscoring the context‐dependent nature of these modifications. These results challenge the notion of phosphorylation as a driver of dramatic structural transitions and instead position it as a modulator of existing flexibility, which aligns with the previous study (Xin & Radivojac, 2012).
Our results reveal that ligand binding modulates conformational diversity differently depending on phosphorylation status. Ligand binding reduced diversity in P conformer pairs (P–P, P–NP) but increased it in NP pairs (NP–NP), suggesting that phosphorylation constrains structural flexibility while NP states remain more ligand‐responsive. Weak correlations between P–P and NP–NP pairs indicate that phosphorylation primarily drives structural divergence, while strong correlations in P–NP versus NP–NP pairs suggest ligand binding amplifies structural overlap in mixed phosphorylation states. These findings highlight phosphorylation as the primary determinant of conformational diversity, with ligand binding playing a secondary, context‐dependent role.
Residue‐specific analyses reveal that structures with single phosphorylation sites exhibit the highest global conformational variability, whereas multi‐site modifications tend to balance localized flexibility with overall structural stability, highlighting a nuanced interplay between stabilization and disruption. Phosphotyrosine (PTR) and phosphothreonine (TPO) induce the most pronounced structural shifts, both globally and locally, reflecting their dynamic roles in functional regulation. As a single modification, TPO drives greater global structural changes, while phosphoserine (SEP) shows intermediate variability and minimal impact, aligning with its role in stabilization. Notably, dual modifications involving PTR and TPO, either as identical pairs (e.g., TPO–TPO or PTR–PTR) or combinations (e.g., PTR–TPO), lead to pronounced shifts, particularly in localized protein conformations.
The PDB data underpinning AlphaFold's training exhibits pronounced biases, with P structures disproportionately represented in kinase superfamilies and limited diversity in multi‐site modifications. This imbalance might skew AlphaFold predictions toward dominant, well‐represented conformational states, at the expense of rare or functionally significant phosphorylation‐induced structures. The overlap between P and NP conformers within the same cluster further reflects the constrained diversity of experimental data, limiting the ability of models to capture the broader functional implications of phosphorylation. These findings emphasize the need for more diverse datasets to enhance the fidelity of predictive models. Our study also highlights the essential dearth of such data, severely limiting model training and validation. This limited availability of experimentally resolved P structures not only biases AlphaFold training but also constrains our own analyses, as conclusions can only be drawn from a small subset of proteins where both P and NP conformers exist.
Pairwise sequence alignments confirmed very high similarity and identity across conformers (≥90% for nearly all proteins), indicating that sequence variation is minimal and any residual differences are random with respect to phosphorylation status, thus contributing only to background noise. In addition, some methodological approximations in our analysis warrant consideration. First, our use of a dynamic distance threshold (70% of the maximum linkage distance) to define conformational clusters avoids the pitfalls of applying a single absolute cutoff across proteins with widely varying RMSD ranges, but it can lead to relative over‐splitting in proteins with low conformational variability and under‐splitting in those with high variability. Second, our analysis did not formally distinguish between apo and holo sub‐states within the NP ensemble, even though ligand binding can shift conformational equilibria toward open or closed forms and phosphorylation itself may act as an allosteric modulator of these pre‐equilibria. While we stratified conformer pairs into ligand‐bound, mixed, and no‐ligand conditions, this treatment does not fully capture the complexity of apo–holo dynamics. Indeed, the scope of this study was to benchmark AlphaFold predictions against experimentally observed conformers, rather than to derive detailed mechanistic conclusions on ligand‐ or phosphorylation‐driven equilibria. Nonetheless, our finding that ligand binding increases variability in NP conformers but stabilizes P conformers is consistent with phosphorylation fine‐tuning pre‐existing equilibria. Importantly, our main conclusions, including comparisons between P and NP states and the alignment of AlphaFold models, are based on relative patterns of cluster distribution and the broad influence of phosphorylation and ligand binding, and are therefore robust to these methodological choices. Future studies with expanded structural coverage will be required to systematically disentangle the interplay between phosphorylation and ligand binding in shaping conformational landscapes.
Despite the inclusion of PDB‐derived structural information in their training, AF2, AF3, and AF3‐p struggle to differentiate between P and NP conformers effectively. Our results show that while AF3‐p demonstrates a slight enrichment toward P states, the differences in predictive performance across models are not statistically significant. All models exhibit a strong bias toward dominant conformational states, aligning with the largest clusters irrespective of phosphorylation status. This tendency reflects a structural memorization effect, where AF models prioritize frequently observed conformations in the training data rather than capturing phosphorylation‐induced structural diversity (Chakravarty et al., 2024; Chakravarty & Porter, 2022). Consistent with this, Lazou et al. (2024) demonstrated that AF2 predicts multiple conformations only when the PDB contains comparably sized clusters of open and closed states, whereas rarely observed conformations remain inaccessible.
Furthermore, AF3‐p's inability to substantially outperform AF2 and AF3 raises questions about its phospho‐specific design. While AF3‐p aligns more closely with P conformers in some cases, these instances remain exceptions rather than the rule. This limitation underscores the challenge of overcoming biases in training data and highlights the need for model refinement to improve sensitivity to phosphorylation‐specific structural changes.
The biases and limitations identified in AlphaFold predictions have significant implications for researchers studying PTMs. While AlphaFold models excel at predicting dominant conformations, their inability to capture rare or biologically significant phosphorylation‐induced transitions necessitates cautious interpretation. These findings reinforce the importance of integrating computational predictions with experimental validation, particularly for dynamic modifications like phosphorylation.
Addressing the limitations identified in this study requires a two‐pronged approach: improving training datasets and refining model architectures. Expanding structural datasets to include more P protein structures with diverse modification states will provide the foundation for better model training. In parallel, enhancing model architectures to prioritize local structural changes and rare conformations over dominant ensembles may improve their ability to predict phosphorylation‐induced diversity.
Additionally, integrating advanced techniques such as transfer learning, where models can adapt to phosphorylation‐specific tasks using high‐quality, curated datasets, may enhance their performance. Complementary approaches, such as combining AF predictions with molecular dynamics simulations and experimental validation, could further bridge the gap between predicted and experimentally observed conformations.
Refining AlphaFold architectures to prioritize local structural changes and rare conformations over dominant ensembles represents a promising avenue for improvement. Incorporating advanced techniques such as context‐specific embeddings or hybrid predictive frameworks may enable more accurate modeling of phosphorylation‐induced structural landscapes.
In conclusion, phosphorylation acts as a nuanced regulator of protein structure, favoring functional adaptability through subtle modifications rather than dramatic shifts. While AlphaFold models remain invaluable tools for structural biology, their biases toward well‐sampled states limit their applicability to phosphorylation‐specific studies. By addressing these challenges, the research community can drive the development of more accurate and functionally relevant predictions, unlocking deeper insights into the dynamic roles of PTMs in cellular regulation and disease.
4. METHODS
4.1. Phosphoprotein structure selection and curation
We obtained all available PDB structures, chains, and modeled segments for human, mouse and rat canonical proteins from UniProtKB (The UniProt Consortium, 2023). For each protein we retrieved all available protein structure information from PDB (Burley et al., 2023). The modified amino acid information was obtained to get the P structures and their non‐phospho counterparts (where the same residue is P in one and NP in the other) by querying the PDB database. To identify structures crystallized with phosphorylation modifications, we conducted a comprehensive API screening of the PDB repository (https://data.rcsb.org/#rest-api) (19 September 2024) and retrieved modified residue information for all structures per protein. We selected phosphoproteins crystallized with one or more canonical phosphosites: phosphoserine (SEP), phosphothreonine (TPO), and phosphotyrosine (PTR). For the non‐phospho counterparts, we ensured the phospho and non‐phospho PDB IDs are mapped to the same UniProt ID, and the residues are NP (SER, THR, and TYR). This resulted in 185 proteins covering a total of 2015 phosphosites from 1327 phospho structures and 3867 non‐phospho structures. All the PDB files were downloaded on 20 September 2024.
All PDB chains were mapped to UniProt residue coordinates using SIFTS (Choudhary et al., 2023), which provides residue‐level correspondence between PDB chains and UniProt sequences. To ensure the accuracy of conformational details and the completeness of modeled segments, we included only structures where the modeled fragments of the protein contained at least 50 amino acids. Only chains where the phosphosite or the corresponding non‐phosphosite could be mapped to UniProt coordinates were retained. Conformers were classified as P if at least one mapped site was P and passed the filtering, and as NP if the same UniProt positions were available without modification. This ensured that P and NP conformers always overlapped in UniProt sequence space.
For each modified amino acid in the structure, we assessed missing residues and prioritized structures with no missing residues. If no such structures were available, we minimized the number of missing residues around both phosphorylated (P data) and non‐phosphorylated sites (NP data). For P structures, we selected one representative structure per modification state (single, dual, or multi‐phosphorylated) based on minimal missing residues and better resolution. For NP structures, all entries meeting the filtering criteria were retained. This approach ensured the inclusion of PDB structures covering sufficiently large protein fragments with complete information around the modified/unmodified residues.
In addition, we performed pairwise sequence alignment using MAFFT (Katoh & Standley, 2013) across all conformers per protein as a validation step. These alignments showed very high sequence conservation, with nearly all proteins showing average pairwise identity and similarity above 90% (see Results for detailed breakdown). This high level of sequence conservation corroborates the robustness of our mapping‐based recruitment strategy. The final filtered data resulted in 109 different proteins covering a total of 253 phosphosites from 159 phospho structures and 1695 non‐phospho structures.
To cover diverse phosphosite and conformational preferences, proteins were selected such that the number of phosphosites in a structure ranged from one to five, encompassing a combination of SEP, TPO, and PTR. Details of all phosphoproteins, including the number of PDB structures modeled with and without phosphorylation and phosphosite information are presented in Tables [Link], [Link].
4.2. Protein conformers and conformational variability in protein structures
We assessed the conformational diversity of proteins in the context of PTMs, with a specific focus on phosphorylation. We filtered protein chains according to the criteria previously outlined (section Phosphoprotein structure selection and curation) and labeled each chain as P or NP based on its phosphorylation status. This allowed us to quantify and compare the number of P and NP conformers available for each protein, providing insights into the structural variability associated with phosphorylation.
Ligand‐binding status for each conformer was determined from PDB annotations. “Ligands” were defined as any non‐polymeric molecules co‐crystallized with the protein, including substrates, substrate analogs, inhibitors, cofactors, and metal ions, but excluding water molecules. Conformers were categorized as “ligand‐bound” if at least one such molecule was present, or “no‐ligand” otherwise.
To investigate conformational diversity between P and NP protein conformers, we categorized structural changes by assessing pairwise RMSD values. Both global and segment‐based RMSD values were analyzed to capture variations at different structural scales. We performed structural alignments using the PDB Superimposer module in BioPython (Cock et al., 2009). For global alignment, all C‐alpha atoms common to both structures were used to calculate the RMSD. For segment alignment, only the residues within a defined region (e.g., ±15 residues around phospho‐sites) were included. The alignment utilized the Kabsch algorithm to determine the optimal rotation and translation matrices that minimized the RMSD between the selected atoms. This approach ensures sensitivity to both global conformational changes and localized structural deviations.
In order to determine the distinct conformational state of a protein, we clustered all P and NP conformers for a specific protein based on pairwise structural alignment. We performed both the global alignment (on full chain) and segment alignment (spanning 15 amino acids upstream and downstream in sequence space) using the PDB Superpose method. The RMSD values obtained between each pair were used for clustering. Hierarchical clustering was performed using the average linkage method, with clusters defined based on a dynamic distance threshold set at 70% of the maximum linkage distance to ensure appropriate separation of structural conformations. This adaptive threshold accounts for the wide range of RMSD values across proteins, potentially leading to finer splitting in low‐variability cases and more conservative clustering in highly variable cases. A dendrogram was generated to visualize the clustering results, with distinct branch colors indicating major structural clusters and the cluster threshold visually marked to highlight structural variation patterns. Each cluster therefore represents a distinct conformational state of the protein. We considered the number of clusters per protein as a quantitative measure of conformational diversity. To assess the influence of phosphorylation on conformational diversity, we annotated each cluster with the number of P and NP conformers. This allowed us to explore whether phosphorylation affects conformational states by stabilizing or favoring specific clusters.
To further quantify conformational diversity, we calculated a CDS and an adjusted phospho conformational diversity score (pCDS). The CDS provided a baseline measure of structural variability, while the adjusted pCDS accounted for the relative proportions of P and NP conformers, normalizing the score to evaluate the specific contribution of phosphorylation to structural diversity.
4.3. Conformational diversity score
The CDS is used to quantify the structural diversity of protein conformations within a set of structural clusters. The CDS considers both the number of clusters a protein adopts and the relative size of each cluster, thus providing an overall measure of conformational flexibility.
The CDS for a protein is calculated as follows:
where:
, the total number of unique structural clusters identified for a given protein.
, the total number of clusters.
, the number of structures in cluster.
, the sum of all conformers across all clusters for the protein.
This formula captures the spread of conformers across clusters, with higher CDS values indicating a more even distribution across clusters (greater diversity), and lower CDS values suggesting concentration within fewer clusters (less diversity).
4.4. Adjusted phospho‐CDS
To investigate the impact of phosphorylation on conformational diversity, an adjusted phospho‐CDS (pCDS) was introduced. This score incorporates information about the distribution of P and NP conformers within each cluster, assigning higher weight to clusters where both phosphorylation (P and NP) states are present in a balanced manner. pCDS aims to highlight clusters where phosphorylation induces distinct conformational states, while still accounting for structural diversity across the dataset.
The adjusted phospho‐CDS is calculated as:
where:
the proportion of phospho structures within cluster , calculated as
, serves as a weighting term that increases when clusters have balanced P and NP structures, emphasizing clusters where phosphorylation may influence conformational shifts.
threshold is set as a minimum value (e.g., 0.3) to ensure that clusters with high P or NP dominance still contribute meaningfully to the score.
This formula captures the diversity upon phosphorylation by emphasizing clusters that exhibit mixed modification states, providing a nuanced view of structural adaptability in response to phosphorylation.
4.5. Domain regions, secondary structural states and solvent accessibility
Domain information for the proteins was obtained using the InterPro (Blum et al., 2025) API, which provides annotations for protein domains and their coordinates within UniProt sequences. The domain regions were then mapped onto modeled PDB structures; the PDBe API was used to retrieve residue ranges for specific PDB chains. The InterPro API outputs were filtered to include only domain annotations, while the PDBe API results were cross‐referenced to match PDB IDs and chains corresponding to the modeled structures. These data were integrated to identify and compare domain regions across P and NP conformers. Secondary structural information for proteins was derived using STRIDE (Heinig & Frishman, 2004), which annotates eight secondary structure classes based on PDB files. These classes were subsequently mapped into three categories: helix, strand, and coil. Relative solvent accessibility (RSA) was calculated as a percentage of their maximum solvent‐accessible area, using established values for standard amino acids (Heinig & Frishman, 2004; Miller et al., 1987). STRIDE was run on modified PDB files, ensuring P residues were correctly processed, and the results were matched to specific residues in both P and NP states.
4.6. Protein structures (PDB, AlphaFold2, AlphaFold3 predictions)
All PDB structures used in this analysis were obtained from the RCSB PDB (https://www.rcsb.org/) on 20 September 2024. AlphaFold2 structures were sourced from the AlphaFold2 Database (https://alphafold.ebi.ac.uk/) on 08 October 2024 and were labeled AF2.
AlphaFold3 structures were predicted using the AlphaFold3 server (https://alphafoldserver.com/). A single Json file was used for batch predictions as described in the server (https://github.com/google-deepmind/alphafold/blob/main/server/README.md). AF3 prediction jobs were submitted between 02 October 2024 and 08 October 2024.
4.7. AlphaFold3 predictions
4.7.1. Unphosphorylated structure predictions
AlphaFold3 predictions were generated from the corresponding canonical UniProtKB protein sequences without any modifications. These predictions were labeled as AF3.
4.7.2. Phosphorylated structure predictions
Due to the server limitations of 20 structures per day, we only predicted the phosphorylation status of the conformer that showed the maximum conformational variability in a protein. These predictions were labeled as AF3‐p.
We used the MAXIT suite (https://sw-tools.rcsb.org/apps/MAXIT/index.html) to convert the PDBx/mmCIF format to PDB format.
4.8. Structure comparison and evaluation metrics
To evaluate the performance of predicted models; AlphaFold2 (AF2), AlphaFold3 (AF3), and AlphaFold3 phosphorylation‐specific (AF3‐p) various metrics and comparison strategies were employed. Experimental PDB structures, including both P and NP forms, were used to cross‐validate the predictions. Key validation metrics included the consistency of RMSD values, structural integrity, and conformational accuracy, particularly at phosphorylation sites and surrounding residues.
To analyze local conformational changes around phosphorylation sites, residue contacts of P residues (in P conformers) and their corresponding NP residues (in wild‐type NP conformers) were measured within an 8 Å radius (Chen et al., 2024; Kryshtafovych et al., 2019; Madani et al., 2022).
4.9. Hierarchical clustering and heatmap visualization
To evaluate the structural alignment of AF2, AF3, and AF3‐p predictions with experimental P and NP conformations, hierarchical clustering was performed using distance matrices derived from three metrics:
global RMSD: RMSD calculated across the entire protein structure.
segment RMSD: RMSD values calculated for the ±15‐residue region surrounding phosphorylation sites (N‐ and C‐terminal extensions).
contact map differences: Residue contact differences within an 8 Å radius (Chen et al., 2024; Kryshtafovych et al., 2019; Madani et al., 2022) around P and NP sites.
For each protein, representative structures were chosen for each distinct phosphorylation state from PDB entries as described in section Phosphoprotein structure selection and curation. Predictions from AF2, AF3, and AF3‐p were aligned to these representative P and NP conformers to calculate pairwise distances, which were used to generate condensed distance matrices. Hierarchical clustering was then performed using the average linkage method, and the resulting dendrograms were visualized as heatmaps, where clustering patterns were used to evaluate the model's proximity to structural ensembles.
Heatmaps were generated to illustrate the relative RMSD and contact differences for selected proteins, highlighting how model predictions align with experimentally observed conformers. These visualizations provide insights into the structural tendencies of each model, particularly in capturing phosphorylation‐related conformations.
4.10. Residue interaction networks and co‐evolutionary analysis
To evaluate how AF models capture P and NP conformers, we constructed Residue Interaction Networks (RINs) and inferred co‐evolutionary constraints from EvCouplings (Hopf et al., 2019). Global RINs were built from PDB structures, where residue‐residue contacts were defined within 8.0 Å, including P residues (SEP, TPO, PTR). Localized subgraphs captured P and NP residues along with their immediate neighbors. Network properties (degree centrality, edge betweenness, clustering coefficient, and eigenvector centrality) were computed to assess residue contributions to structural stability. Co‐evolving residue pairs were inferred from EvCouplings and mapped onto RINs as weighted edges based on evolutionary coupling probabilities. The analysis was performed for global structures and localized subgraphs, capturing both large‐scale and site‐specific constraints. AF model predictions were compared to PDB conformers using Jaccard similarity, assessing overlap in residue interaction and co‐evolutionary networks:
where A and B represent residue contacts in AF and PDB networks. Mean similarity scores were computed for global RINs and subgraphs to quantify model alignment with PDB conformers.
To quantify the relative contribution of co‐evolutionary versus generic structural contacts, we defined Δ as:
where J co‐evol is the Jaccard similarity based on co‐evolving residue pairs and J all‐contact is the Jaccard similarity based on all residue‐residue contacts. Δ was calculated for 44 proteins with available EVcoupling data.
4.11. Statistical tests
Statistical analyses were performed to assess the effect of ligand binding on conformational diversity; a Tukey honest significant difference (HSD) test was performed separately for P–P, P–NP, and NP–NP pairs. RMSD distributions across the three ligand categories (ligand‐bound, mixed, no‐ligand) were compared within each pair type. Statistical significance was determined based on an adjusted significance threshold (α = 0.05) to account for multiple comparisons. To evaluate whether ligand binding altered the structural relationship between P and NP conformers, Pearson correlation coefficients were calculated between RMSD values of P–P, P–NP, and NP–NP pairs within each ligand category. Correlation strength was assessed using the Pearson correlation coefficient (r), and statistical significance was determined by the corresponding p‐value. Correlation analyses were conducted only for pairs where RMSD values were available for both pair types.
To evaluate the performance of AF2, AF3, and AF3‐p in predicting P and NP conformations, as well as their structural and label alignment tendencies, chi‐square test and Kruskal–Wallis test were performed. To determine whether the proportions of predictions aligning with P versus NP conformers differed significantly across models, a chi‐square test of independence was applied, testing for overall distribution differences.
The Kruskal–Wallis test was used to compare the proportions of predictions aligning with the same cluster as the closest conformer, providing a non‐parametric measure of structural consistency across the models. For label alignment (P vs. NP), binomial tests were conducted to assess whether the proportion of “same label” matches for each model exceeded a random baseline (p = 0.5), highlighting each model's ability to differentiate between P and NP states. Additionally, a chi‐square test was performed to compare the overall distribution of “Same Label” and “different label” cases across models to assess inter‐model differences in label alignment performance.
To evaluate differences in Δ across conformational ensembles, proteins were stratified into imbalance categories (balanced, moderate, skewed) based on the relative numbers of P and NP conformers observed in the PDB. Δ values were compared across categories using Mann–Whitney U tests, and a Spearman correlation was used to test for a monotonic trend between imbalance and Δ. Predictions were further grouped by model consensus, defined as whether all three AF models (AF2, AF3, AF3‐p) aligned with P conformers, all aligned with NP conformers, or produced mixed predictions. Differences across consensus groups were tested with Kruskal–Wallis.
All statistical analyses were conducted using Python, primarily utilizing the scipy library. A p‐value threshold of 0.05 was used to determine statistical significance, ensuring robustness in identifying meaningful differences among the models.
AUTHOR CONTRIBUTIONS
Pathmanaban Ramasamy: Conceptualization; data curation; formal analysis; visualization; writing – original draft; methodology; writing – review and editing; funding acquisition. Jasper Zuallaert: Methodology. Lennart Martens: Writing – review and editing; supervision; resources; funding acquisition. Wim F. Vranken: Writing – review and editing; supervision; writing – original draft; investigation; resources; funding acquisition.
CONFLICT OF INTEREST STATEMENT
The authors declare no competing interests.
Supporting information
Figure S1: Conformational diversity, molecular interactors, and phosphorylation.
Figure S2: Structural adaptations of phosphorylated protein conformers.
Figure S3: Domain architecture and protein phosphorylation.
Figure S4: Secondary structural state transitions and solvent accessibility changes upon phosphorylation.
Figure S5: Conformer alignment and hierarchical clustering of AlphaFold predictions with P and NP conformers.
Data S1: Detailed analysis and supplementary figures.
Table S1: Detailed information on proteins, number of phospho and non‐phospho conformers.
Table S2: Detailed information on proteins, conformers, modified residues, positions and phosphorylation status.
ACKNOWLEDGMENTS
Pathmanaban Ramasamy acknowledges BOF grant from Ghent University under grant agreement number BOF.PDO.2023.0024.01. Pathmanaban Ramasamy, Jasper Zuallaert, Lennart Martens, and Wim F. Vranken acknowledge Research Foundation Flanders (FWO) through grant G028821N.
Ramasamy P, Zuallaert J, Martens L, Vranken WF. Assessing the relation between protein phosphorylation, AlphaFold3 models, and conformational variability. Protein Science. 2026;35(1):e70376. 10.1002/pro.70376
Review Editor: Nir Ben‐Tal
DATA AVAILABILITY STATEMENT
All data and Python scripts used in this study are freely available on GitHub at https://github.com/Pathmanaban/AlphaFold‐phospho‐conformation, with a corresponding DOI: https://doi.org/10.5281/zenodo.15365572. All analyses and visualizations were performed using Python (version 3.10.12). Protein structure figures were generated using PyMol (Schrödinger, LLC, 2015) (version 3.0.5).
REFERENCES
- Abramson J, Adler J, Dunger J, Evans R, Green T, Pritzel A, et al. Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature. 2024;630:493–500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Anfinsen CB. Principles that govern the folding of protein chains. Science. 1973;181:223–230. [DOI] [PubMed] [Google Scholar]
- Beltrao P, Albanèse V, Kenner LR, Swaney DL, Burlingame A, Villén J, et al. Systematic functional prioritization of protein posttranslational modifications. Cell. 2012;150:413–425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bertoline LMF, Lima AN, Krieger JE, Teixeira SK. Before and after AlphaFold2: an overview of protein structure prediction. Front Bioinform. 2023;3:1120370. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bickel D, Vranken W. Effects of phosphorylation on protein backbone dynamics and conformational preferences. J Chem Theory Comput. 2024;20:4998–5011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blum M, Andreeva A, Florentino LC, Chuguransky SR, Grego T, Hobbs E, et al. InterPro: the protein sequence classification resource in 2025. Nucleic Acids Res. 2025;53:D444–D456. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bordin N, Lau AM, Orengo C. Large‐scale clustering of AlphaFold2 3D models shines light on the structure and function of proteins. Mol Cell. 2023;83:3950–3952. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bryant P, Noé F. Structure prediction of alternative protein conformations. Nat Commun. 2024;15:7328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burley SK, Bhikadiya C, Bi C, Bittrich S, Chao H, Chen L, et al. RCSB protein data bank (RCSB.org): delivery of experimentally‐determined PDB structures alongside one million computed structure models of proteins from artificial intelligence/machine learning. Nucleic Acids Res. 2023;51:D488–D508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chakravarty D, Porter LL. AlphaFold2 fails to predict protein fold switching. Protein Sci. 2022;31:e4353. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chakravarty D, Schafer JW, Chen EA, Thole JF, Ronish LA, Lee M, et al. AlphaFold predictions of fold‐switched conformations are driven by structure memorization. Nat Commun. 2024;15:7296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen L, Li Q, Nasif KFA, Xie Y, Deng B, Niu S, et al. AI‐driven deep learning techniques in protein structure prediction. Int J Mol Sci. 2024;25:8426. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Choudhary P, Anyango S, Berrisford J, Tolchard J, Varadi M, Velankar S. Unified access to up‐to‐date residue‐level annotations from UniProtKB and other biological databases for PDB data. Sci Data. 2023;10:204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cock PJA, Antao T, Chang JT, Chapman BA, Cox CJ, Dalke A, et al. Biopython: freely available Python tools for computational molecular biology and bioinformatics. Bioinformatics. 2009;25:1422–1423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cohen P. The role of protein phosphorylation in human health and disease. Eur J Biochem. 2001;268:5001–5010. [DOI] [PubMed] [Google Scholar]
- Gavalda‐Garcia J, Dixit B, Díaz A, Ghysels A, Vranken W. Gradations in protein dynamics captured by experimental NMR are not well represented by AlphaFold2 models and other computational metrics. J Mol Biol. 2025;437:168900. [DOI] [PubMed] [Google Scholar]
- Goldtzvik Y, Sen N, Lam SD, Orengo C. Protein diversification through post‐translational modifications, alternative splicing, and gene duplication. Curr Opin Struct Biol. 2023;81:102640. [DOI] [PubMed] [Google Scholar]
- Heinig M, Frishman D. STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res. 2004;32:W500–W502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hopf TA, Green AG, Schubert B, Mersmann S, Schärfe CPI, Ingraham JB, et al. The EVcouplings Python framework for coevolutionary sequence analysis. Bioinformatics. 2019;35:1582–1584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hunter T. Protein kinases and phosphatases: the yin and yang of protein phosphorylation and signaling. Cell. 1995;80:225–236. [DOI] [PubMed] [Google Scholar]
- Johnson LN. The regulation of protein phosphorylation. Biochem Soc Trans. 2009;37:627–641. [DOI] [PubMed] [Google Scholar]
- Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, et al. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596:583–589. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kalakoti Y, Wallner B. AFsample2 predicts multiple conformations and ensembles with AlphaFold2. Commun Biol. 2025;8:373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kosoglu K, Aydin Z, Tuncbag N, Gursoy A, Keskin O. Structural coverage of the human interactome. Brief Bioinform. 2024;25:bbad496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kryshtafovych A, Schwede T, Topf M, Fidelis K, Moult J. Critical assessment of methods of protein structure prediction (CASP)—round XIII. Proteins. 2019;87:1011–1020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lazou M, Khan O, Nguyen T, Padhorny D, Kozakov D, Joseph‐McCarthy D, et al. Predicting multiple conformations of ligand binding sites in proteins suggests that AlphaFold2 may remember too much. Proc Natl Acad Sci. 2024;121:e2412719121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Madani M, Mahdi Behzadi M, Song D, Ilies HT, Tarakanova A. Improved inter‐residue contact prediction via a hybrid generative model and dynamic loss function. Comput Struct Biotechnol J. 2022;20:6138–6148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maynard Smith J. Natural selection and the concept of a protein space. Nature. 1970;225:563–564. [DOI] [PubMed] [Google Scholar]
- Miller S, Janin J, Lesk AM, Chothia C. Interior and surface of monomeric proteins. J Mol Biol. 1987;196:641–656. [DOI] [PubMed] [Google Scholar]
- Ochoa D, Jarnuczak AF, Viéitez C, Gehre M, Soucheray M, Mateus A, et al. The functional landscape of the human phosphoproteome. Nat Biotechnol. 2020;38:365–373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Orioli S, Henning Hansen CG, Lindorff‐Larsen K. Transient exposure of a buried phosphorylation site in an autoinhibited protein. Biophys J. 2022;121:91–101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perrakis A, Sixma TK. AI revolutions in biology. EMBO Rep. 2021;22:e54046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Porta‐Pardo E, Ruiz‐Serra V, Valentini S, Valencia A. The structural coverage of the human proteome before and after AlphaFold. PLoS Comput Biol. 2022;18:e1009818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramasamy P, Vandermarliere E, Vranken WF, Martens L. Panoramic perspective on human phosphosites. J Proteome Res. 2022;21:1894–1915. [DOI] [PubMed] [Google Scholar]
- Redfern O, Dessailly B, Orengo C. Exploring the structure and function paradigm. Curr Opin Struct Biol. 2008;18:394–402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saldaño T, Escobedo N, Marchetti J, Zea DJ, Mac Donagh J, Velez Rueda AJ, et al. Impact of protein conformational diversity on AlphaFold predictions. Bioinformatics. 2022;38:2742–2748. [DOI] [PubMed] [Google Scholar]
- Schrödinger, LLC . The PyMOL Molecular Graphics System, Version 1.8. 2015.
- Shu L, Du C, Zuo Y. Abnormal phosphorylation of protein tyrosine in neurodegenerative diseases. J Neuropathol Exp Neurol. 2023;82:826–835. [DOI] [PubMed] [Google Scholar]
- Terwilliger TC, Liebschner D, Croll TI, Williams CJ, McCoy AJ, Poon BK, et al. AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination. Nat Methods. 2024;21:110–116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- The UniProt Consortium . UniProt: the universal protein knowledgebase in 2023. Nucleic Acids Res. 2023;51:D523–D531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Varadi M, Anyango S, Deshpande M, Nair S, Natassia C, Yordanova G, et al. AlphaFold protein structure database: massively expanding the structural coverage of protein‐sequence space with high‐accuracy models. Nucleic Acids Res. 2022;50:D439–D444. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Viéitez C, Busby BP, Ochoa D, Mateus A, Memon D, Galardini M, et al. High‐throughput functional characterization of protein phosphorylation sites in yeast. Nat Biotechnol. 2022;40:382–390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xin F, Radivojac P. Post‐translational modifications induce significant yet not extreme changes to protein structure. Bioinformatics. 2012;28:2905–2913. [DOI] [PubMed] [Google Scholar]
- Zhang W, Liu HT. MAPK signal pathways in the regulation of cell proliferation in mammalian cells. Cell Res. 2002;12:9–18. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figure S1: Conformational diversity, molecular interactors, and phosphorylation.
Figure S2: Structural adaptations of phosphorylated protein conformers.
Figure S3: Domain architecture and protein phosphorylation.
Figure S4: Secondary structural state transitions and solvent accessibility changes upon phosphorylation.
Figure S5: Conformer alignment and hierarchical clustering of AlphaFold predictions with P and NP conformers.
Data S1: Detailed analysis and supplementary figures.
Table S1: Detailed information on proteins, number of phospho and non‐phospho conformers.
Table S2: Detailed information on proteins, conformers, modified residues, positions and phosphorylation status.
Data Availability Statement
All data and Python scripts used in this study are freely available on GitHub at https://github.com/Pathmanaban/AlphaFold‐phospho‐conformation, with a corresponding DOI: https://doi.org/10.5281/zenodo.15365572. All analyses and visualizations were performed using Python (version 3.10.12). Protein structure figures were generated using PyMol (Schrödinger, LLC, 2015) (version 3.0.5).
