Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Aug 1.
Published in final edited form as: Methods. 2022 May 21;204:55–63. doi: 10.1016/j.ymeth.2022.05.006

Multi-Level Analysis of Intrinsically Disordered Protein Docking Methods

Jacob Verburgt 1, Zicong Zhang 2, Daisuke Kihara 1,2,3,*
PMCID: PMC9701586  NIHMSID: NIHMS1849015  PMID: 35609776

Abstract

Intrinsically Disordered Proteins (IDPs) are a class of proteins in which at least some region of the protein does not possess any stable structure in solution in the physiological condition but may adopt an ordered structure upon binding to a globular receptor. These IDP-receptor complexes are thus subject to protein complex modeling in which computational techniques are applied to accurately reproduce the IDP ligand-receptor interactions. This often exists in the form of protein docking, in which the 3D structures of both the subunits are known, but the position of the ligand relative to the receptor is not. Here, we evaluate the performance of three IDP-receptor modeling tools with metrics that characterize the IDP-receptor interface at various resolutions. We show that all three methods are able to properly identify the general binding site, as identified by lower resolution metrics, but begin to struggle with higher resolution metrics that capture biophysical interactions.

Introduction

Intrinsically Disordered Proteins (IDPs) are a class of proteins in which at least some region of the protein does not possess any stable structure in solution in their physiological condition. Current predictions show that 33.0% of all proteins in the human proteome contain at least some residues in disordered conformations [1]. IDPs are prominent in liquid-liquid phase separation [2], [3], cell signaling and regulation [4], such as with the P53 Tumor Suppressor [5] , as well as other cellular functionality. This functionality is largely mediated through interactions that the disordered regions make with their receptor molecules, which may take a wide variety of forms.

Through both experimental [3] and computational [6] means, certain biophysical interactions have been shown to be prominent in the binding of IDPs to receptors. These receptors may be other IDPs, nucleic acids, or globular proteins. It is fairly common for these interactions to be non-specific or “fuzzy” to where, although the IDP may be interacting with specific regions on the receptor, a singular conformation between the IDP region and receptor does not exist [7]. Molecular Dynamics (MD) simulations of these interactions have shown that specific forms and designs are commonly responsible for these interactions [6]. So-called “patterning” of charged residues sequences of IDPs [8], [9] has also been attributed as a contributing factor in phase separation.

Despite frequent fuzziness, it is possible for IDPs to adopt a sufficiently stable conformation upon binding to a receptor, to the point that their complex structures may be experimentally determined by X-ray crystallography, Nuclear Magnetic Resonance (NMR), or Cryo-Electron Microscopy (Cryo-EM) [5], [10]. This brings IDP-receptor modeling into the domains of computational protein structure modeling and docking.

Due to their prevalence in therapeutic and cellular functionality, accurate modeling of IDP-receptor like complexes has been long sought after in the Protein docking community. Protein docking consists of finding a 3D complex structure from the known 3D structures of two subunit molecules, termed the ligand and receptor. The primary goal of this process is correct placement of the ligand subunit on the surface of the receptor subunit, while adhering to the chemical and biophysical constraints of protein structures. Protein docking is typically evaluated through the Critical Assessment of the Prediction of Interactions (CAPRI) [11] community, which has conducted over 50 rounds of assessment [12]. Due to their prevalence in cellular signaling and disease, highly accurate prediction of IDP-receptor complexes will thus largely aid in therapeutic design.

The disordered nature of IDPs alters the protein docking problem. Instead of placing a 3D structure of a ligand subunit onto a globular receptor surface, an unstructured sequence of an IDP (or region thereof) must have its structure modeled on the surface of the receptor. Previous software packages have attempted to model these IDP-receptor interactions [13], [14], and can often provide a low-precision view of potential ligand placement on the receptor where the general binding site may be identified (often in grooves or crevices in the protein surface). However, residue and atomic level contacts are largely ignored in the context of global IDP docking. The advent of AlphaFold [15], along with its variant AlphaFold Multimer [16], have propelled the field of protein structure prediction, as well as protein complex prediction, to a level of never before seen accuracy and consistency at an atomic level. Previous studies have shown that AlphaFold is capable of modeling peptide-protein complexes [17], [18]. It is unclear how well AlphaFold is able to handle fuzzy interactions between IDPs and globular receptors.

In this study, we investigate the abilities of docking and modeling of the disordered regions of IDPs, along with other sufficiently long peptides, to their respective receptors at various levels of meticulousness. We specifically test our own IDP-LZerD docking software, the coarse-grained CABS-Dock docking software, and the AlphaFold-Multimer docking software. We start with more low-resolution metrics that are primarily concerned with the ability to identify and dock the ligands to the proper binding side on the receptor. Low resolution metrics should not be immediately disregarded, as “fuzzy” interactions, in which the IDP retains some degree of disorder while still specifically binding to a region on the receptor, may still be identified by them. From here, we explore more moderate resolution metrics that focus on the orientation of the flexible ligand in the receptor, as well as proximity of residues between the ligand and the receptor. Finally, we explore the ability to model high-resolution interactions between the ligand and the receptor. These high-quality metrics are derived directly from the biophysical interactions that drive IDP binding in reality. All methods performed acceptably well with identifying binding sites on the receptor surface for many cases. However, as the resolution of the metrics increased, AlphaFold was able to outperform the other two methods, particularly in the residue and high-resolution metrics.

Materials and Methods

IDP Dataset

To create a dataset of IDP-receptor complexes to test modeling ability, the PEPtide Binding Database (pepBDB) [19] was clustered by any available sequences via a pre-computed MMseqs2 [20] from the Protein Data Bank (PDB). Structures that only contain protein chains, have an experimental resolution <=3.0 Å or are NMR structures, have a minimum receptor chain length of 100 residues, have a minimum IDP chain length of 20 residues, and only have a single ligand chain binding to the receptor were included. From this set, we cross-referenced the PDB-IDs against structures listed in the DisProt database [21]. We removed a case (PDB-ID: 1S6C) where the peptide chain was covalently bound to the receptor via a peptide bond, and another case (PDB-ID: 5KY7) where the ligand was determined to be globular, despite being included in the PEP-BDB. Two more entries were also removed (PDB-IDs: 3HPW and 5FMK) because models of these targets were unable to be refined through CHARMM [22], which is a part of the modeling procedure with IDP-LZerD. The resultant test set contains a total of 35 entries, with details in Supplementary Table 1.

Modeling Methods

We used the following three modeling methods in this study. Here we briefly describe their algorithms.

IDP-LZerD

IDP-LZerD [14], [23], which was specifically designed to handle longer IDP Sequences, works by first generating overlapping 9 residue fragments of the IDP using the Rosetta Fragment Picker [24], in which the predicted secondary structure is considered. Here, we consider secondary structure predictions from PSIPRED 4.0 [25], DeepCNF [26], DNSS [27], and SPOT1D [28]. Using the LZerD docking program [29], [30], several predicted conformations of each of these fragments are docked onto the receptor structure. These docked fragments are then stitched together into the full IDP using the GOAP [31], D-FIRE [32] and IT-SCORE [33] scoring functions, as well as the complementarity of overlapping regions between full fragments, producing up to 100 full length IDP-receptor structures. A restrained relaxation via CHARMM [22] is then used to clear any poorly modeled geometry. A custom scoring function that considers GOAP, D-FIRE, and IT-Score, as well as a molecular-mechanics scoring function commonly used with LZerD applications [34] is again used to rank the refined models. IDP-LZerD is available at https://kiharalab.org/proteindocking/idplzerd.php as a part of the LZerD protein docking suite.

CABS-Dock

CABS-Dock [13] is a coarse-grained peptide-protein docking program that has previously been used for docking of IDP fragments [14], as it allows for large flexibility of the ligand chain. CABS is a simplified representation of proteins in which each residue is modeled only as up to four beads [35]. CABS-Dock initially selects 1000 frames from 10 separate trajectories. These 10,000 frames are then filtered down to 1000 separate models, which are then subsequently clustered into 10 final models. These 10 models are then reconstructed to an all-atom representation via MODELLER [36]. We ran the stand-alone CABS-Dock program downloaded from https://bitbucket.org/lcbio/cabsdock/downloads/.

AlphaFold-Multimer

AlphaFold-Multimer [16] predicts protein complexes from amino acid sequences of component proteins. The algorithm is based on AlphaFold [15], which predicts single chain protein structures, with modifications made for handling multiple input proteins as input and for the scoring function to make accurate protein interactions. AlphaFold-Multimer first makes a Multiple Sequence Alignment (MSA) for each input protein sequence and concatenates them into a single MSA. The MSA is input to deep neural networks, which embed pairwise residue features and builds structure models of the protein complex. By default, five models will be constructed. The models are ranked by a model confidence score. Each target takes approximately 1.5 hours for full pipeline inference on a Titan RTX GPU with 24GB memory, dependent on the protein size. We used the AlphaFold-Multimer code that was downloaded from the Github repository, https://github.com/deepmind/alphafold, on February 21st 2022. AlphaFold-Multimer needs two databases, a protein sequence database from which similar sequences to the input sequences are selected and aligned and a protein structure template database which includes known protein structures which will be used as templates in the modeling. For a protein sequence database, we used Uniref90 v.2021_03 [37], BFD, Uniclust30 v.2018_08 [38], and MGnify v.2018_12 [39]. All databases were downloaded on July 22nd 2021. We used the same template database as used in the original paper [16]. For template search, we used structures from the Protein Data Bank (PDB), which are dated on 2022/02/01 or earlier. The default parameters of the program were used. Below, we refer to AlphaFold-Multimer as AlphaFold for simplicity.

Evaluation Metrics

We present a variety of evaluation metrics that span various levels of precision. We start with low-resolution metrics that only concern which residues of the interface are interacting with the IDP. These methods do not relate to any specific contacting pairs, nor the orientation of the IDP. We then investigate intermediate-resolution metrics, which penalize the incorrect orientation of the IDP, as well as receptor-ligand contacting residue pairs. Lastly, we evaluated high-level metrics, derived from the biophysical interactions that drive IDP Docking.

Interface Precision and Recall

The interface precision and recall are a group of metrics used to gauge the ability of docking methods to identify the binding interface on the receptor. We describe four related metrics that measure similarity between the interfaces of the true and modeled complexes. Receptor Interface Precision and Receptor Interface Recall measure the number of the true receptor interface residues that exist at the model complex interface as a proportion to either the total number of predicted receptor interface residues or true receptor interface residues. A receptor residue is considered an interface residue if it is within 5 Å of any ligand residues.

ReceptorPrecision=(TrueReceptorInterfaceResdiues)(PredictedReceptorInterfaceResidues)(PredictedReceptorInterfaceResidues) (1)
ReceptorRecall=(TrueReceptorInterfaceResdiues)(PredictedReceptorInterfaceResidues)(TrueReceptorInterfaceResidues) (2)

Figure 1A illustrates these two metrics.

Figure 1:

Figure 1:

Diagrams depicting the interface residue metrics. A, Visualization of interfaces used in the Receptor Precision and Receptor Recall metrics. B, Visualization of the ligand and receptor residues used in the Ligand Precision metric. C, Visualization of the ligand and receptor residues used in the Ligand Recall metric.

We also defined recall and precision on the ligand (IDP) side. Since most of ligand residues in a model interact with the receptor, we defined Ligand Precision and Recall considering if ligand residues interact with predicted or true interface residues of the receptor. Ligand Precision quantifies the proportion of the predicted ligand interface residues that are in interaction with any of the true receptor interface residues (Figure 1B). On the other hand, Ligand Recall quantifies the fraction of true ligand interface residues that are in contact with predicted receptor interface residues. Ligand residues are considered to be at interface if they are within 5 Å of any of the receptor interface residues. Particularly, ligand residues are considered as:

LigandPrecision=(PredictedLigandInterfaceResiduesinContactwithTrueReceptorInterfaceResidues)(PredictedLigandInterfaceResidues) (3)
LigandRecall=(TrueLigandInterfaceResiduesinContactwithPredictedReceptorInterfaceResidues)(TrueLigandInterfaceResidues) (4)

Intermediate-Resolution Metrics

Ligand-RMSD (LRMSD)

The Ligand RMSD (LRMSD) is a CAPRI metric that measures the Cα RMSD of the ligand chain (in this case the docked IDP) when the receptor chains are superimposed [11].

Fnat

The Fraction of Native Contacts (Fnat) is a metric used to assess docking performance in CAPRI [11]. The metric measures the number of native contacts found in the computational model, divided by the total number of contacts. Here, a contact is defined as a ligand-receptor residue pair within 5 Å of each other.

High-Resolution Metrics

Fhbond

The criteria for defining a hydrogen bond via atom coordinates were derived from constraints described previously [40]. This definition considers both the distance and angle of a potential hydrogen bond, and thus requires the coordinates of the donor and acceptor heavy atoms, as well as the hydrogen atom. Hydrogens were added to target and model PDB files with PHENIX reduce [41].

From all modeled hydrogen bonds, we created an Fhbond metric, which follows the same functional form of the Fnat metric described above but considers pairs of intermolecular hydrogen bonding residues instead of contacts.

Fhbond=HydrogenBondingPairsCorrectlyModeledTotalHydrogenBondingPairs (5)

Fhphobe

The criteria for defining a hydrophobic interactions via atom coordinates were derived from constraints described previously [40], with definitions of hydrophobes being simplified to include only elements found naturally in proteins. This simple definition only considers distances between two hydrophobic atoms.

Fhphobe=HydrophobicPairsCorrectlyModeledTotalHydrophobicPairs (6)

Fionic

The criteria for defining short range ionic interactions via atom coordinates were derived from constraints described previously [40], with definitions of cations and anions modified to fit proteins, and will thus only identify salt bridges. This definition only considers distances between the cation and anion.

Fionic=IonicPairsCorrectlyModeledTotalIonicPairs (7)

Results

Global Positioning of an IDP Ligand on the Receptor Surface

First, we gauged the ability of the methods to identify the correct global location of the IDP-receptor interface (Figure 2). The modeled ligand at the interface can be evaluated between either the true and predicted receptor surfaces (Figure 2AB), or the ligand residues (Figure 2CD), as well as with respect to the predicted interface or the true interface, resulting in 4 possible precision and recall metrics. The top ranked, best within the top 5, and best of all models of IDP-LZerD were able to successfully model over 70% of the true interface for 13 (37.14%), 22 (62.86%), and 27 (77.14%) models, respectively (Figure 2A). These numbers drop to 8 (22.86%), 16 (45.71%), and 17 (48.57%) for top ranked, best within the top 5, and best of all models for CABS-Dock, and increased to 31 (88.57%) for both the top ranked and best within the top 5 for AlphaFold. Comparing Figure 2A and 2B, the precision was generally lower than the recall in Figure 2B for models by IDP-LZerD and CABS-Dock. A typical reason for this is that these two methods tend to bind all residues in the ligand at the receptor surface, making a larger interface region than the true one.

Figure 2:

Figure 2:

Recall and precision distributions for modeled complexes. Swarm plots of recall and precision distributions of the top ranked model (blue), best of top 5 ranked models (orange), and best of all models (green) for each method. Each dot represents the result from a target complex in the IDP dataset. A, the recall of receptor residues in modeled complexes. B, the precision of receptor residues in the modeled complexes. C, the recall of the true ligand (IDP) residues on the predicted receptor surface. D, the precision of modeled ligand residues on the true receptor surface. For AlphaFold only results for the top ranked and best of the top 5 ranked models were shown because it only outputs five models in the default setting.

Evaluation from the ligand shows a similar trend (Figure 2CD). Using the same 70% cutoff for the Ligand Precision (that is to say that 70% of the contacting ligand residues are contacting at the correct interface), the precision of the top ranked, best within the top 5, and best of all models for IDP-LZerD are 13 (37.14%), 25 (71.43%), and 31 (88.57%), respectively. These numbers drop to 11 (31.43%), 23 (65.71%), and 27 (77.14%) for top ranked, best within the top 5, and best of all models for CABS-Dock, and increase slightly to 34 (97.14%) and 35 (100.00%) for top ranked and best within the top 5 for AlphaFold.

We visually inspected four targets in Figure 3. In Figure 3A, (PDB ID: 1WLP), the Receptor Recall was well captured by all three methods (0.93, 0.89 and 0.93 for IDP-LZerD, CABS-Dock, and AlphaFold, respectively). The target with the longest IDP (49 residues) in the dataset, 1JPW, had an elongated IDP structure which all three methods were able to generally follow (Figure 3B). Within the resolved regions, it resulted in Receptor Recall values of 0.54, 0.46, and 0.98 for IDP-LZerD, CABS-Dock, and AlphaFold, respectively. It was noted that for the best ranking models of target 2PHG (Figure 3C), both IDP-LZerD and CABS-Dock were able to generally trace the ligand, which does not leave the receptor surface. However, AlphaFold begins to follow the same general path as IDP-LZerD did, but veers away from the receptor surface midway. This caused the Receptor Recall values for IDP-LZerD and CABS-Dock to be 0.73 and 0.77 respectively, and the Receptor Recall for AlphaFold to be 0.50, despite exceptional modeling in the contacting region. Lastly, target 3PK1 (Figure 3D), shows that both IDP-LZerD and CABS-Dock have Receptor Recall values (0.84 and 0.96, respectively) despite the CABS-Dock model being modeled in backwards. In contrast, AlphaFold modeled this target very well, resulting in a Receptor Recall value of 1.00.

Figure 3:

Figure 3:

Modeling examples. The receptor and target IDP are colored green and translucent red respectively. IDP-LZerD, CABS, and AlphaFold models are colored blue, orange, and pink, respectively. The best in Top 5 by Receptor Recall are shown for A. The best in Top 5 by LRMSD of each method is shown in B, C, and D. A, Strong performance by Receptor Recall for target 1WLP (Cytochrome b-245 light chain and Neutrophil cytosol factor 1) by all three methods. The length of the IDP is 25 residues. B, Docking a long IDP of 49 residues for target 1JPW (Transcription factor 7-like 2 and Beta-catenin). In this target, the IDP structure was only resolved for 24 residues in the IDP (residues 13 to 25; and 40 to 50), which are shown as two fragments in red. C, Mis-modeling of target 2PHG (Alpha trans-inducing protein and Transcription initiation factor IIB) by AlphaFold, better modeled by CABS-Dock and IDP-LZerD. The length of the IDP is 26 residues. D, High performance of AlphaFold on target 3PK1 (Apoptosis regulator BAX and Induced myeloid leukemia cell differentiation protein Mcl-1). The length of the IDP is 34 residues.

Intermediate-Resolution Accuracy

Next, we evaluated the models in terms of LRMSD and Fnat. These two metrics concern the ability of the methods to model the IDP ligand in the correct overall orientation and binding site (Figure 4A). Out of all resultant models, AlphaFold was able to consistently identify the binding site of the ligand, and placed the ligands of all 5 models for any given target in the same general location on the receptor. AlphaFold models have 31 targets where all models have an LRMSD less than 10.0 Å, and 25 targets where all models have an LRMSD less than 5.0 Å. The average standard deviation of AlphaFold across all model LRMSDs is 0.59 Å.

Figure 4:

Figure 4:

Intermediate Resolution Modeling Methods. A, Swarm plots of LRMSD distribution of the top ranked model, best of top 5 ranked models, and best of all models by LRMSD. B, Swarm plots of Fnat distribution of the top ranked model, best of top 5 ranked models, and best of all models by Fnat. C, Correlation between Receptor Recall and LRMSD of the best of top 5 models by Receptor Recall for all methods.

With CABS-Dock and IDP-LZerD, binding locations of the IDP ligand were sometimes localized or clustered in grooves on the receptor surface, but were generally more diverse in the placement of the IDP on the receptor surface compared to AlphaFold. As a result, CABS-Dock and IDP-LZerD have average LRMSD standard deviations of 5.67 Å and 5.98 Å within the top 5 ranked models respectively. In Figure 4C, we see that models that have a high interface residue accuracy often have a large LRMSD, which correspond to the models that have different binding poses of IDPs in the correct binding sites.

This tendency to place the ligand in grooves and concave places in the receptor was particularly problematic for IDP-LZerD on target 1IK9, in which IDP-LZerD placed all models in ridges near the globular domain, instead of on the coiled-coil stem where it was crystalized (Figure 5A). This resulted in a minimum LRMSD of 69.64 Å for this model. CABS-Dock had a similar problem with this target, but was able to place a single model near the binding site (albeit backwards, and distorting the receptor), resulting in a minimum LRMSD of 28.13 Å. AlphaFold was able to model this structure with a minimum LRMSD of 6.03 Å. It was noticed that in several cases where AlphaFold also reported a poor LRMSD, the ligand was placed in the correct receptor site, but backwards (Figure 5B). This has been noted previously with AlphaFold [18], as well as other IDP docking studies [23]. This is by no means specific to AlphaFold, as CABS-Dock also docked the ligand backwards in target 3PK1 (Figure 3D). This caused the LRMSD of the CABS-Dock model to be significantly higher than the IDP-LZerD model, which placed the ligand in the correct direction (20.10 Å vs 6.98 Å, respectively) despite having similar Receptor Precision and Recall.

Figure 5:

Figure 5:

Examples of intermediate resolution modeling. A, IDP-LZerD models of 1IK9 (DNA ligase IV and DNA repair protein xrcc4). The receptor is colored gray, the 100 modeled ligands magenta, and the true ligand red. B, the top-ranked AlphaFold model of target 2B3G (Cellular tumor antigen p53 and Replication protein A 70 kDa DNA-binding subunit), in which the IDP was modeled backwards. Receptor coloring is in gray, and true and modeled IDPs have a blue-red spectrum from N-terminus to C-terminus. This model has a Receptor Recall of 0.57, but an LRMSD of 23.12 Å. The true model is modeled from left to right. C, the best in top 5 models for target 1JSP (Tumor protein p53 and CREB-binding protein). The receptor and target IDP are colored green and translucent red, respectively. IDP-LZerD, CABS-Dock, and AlphaFold models are colored blue, orange, and pink, respectively. These models have Receptor Recall values of 0.11, 0.11 and 0.78 respectively, and LRMSD values of 34.18, 33.80 and 19.01 respectively.

We further examined the Fnat across the best and top ranked models for each method, to determine if the methods are able to place residues within the proper vicinity on the receptor surface (Figure 4B). AlphaFold managed to maintain the highest average Fnat of 0.78 across all best within top 5 models, with 30 models recording an Fnat of above 0.5. CABS-Dock scored the lowest of the three methods with an average Fnat of 0.13, and 0 models with an Fnat above 0.5. IDP-LZerD had an average Fnat of 0.23 across all best within top 5 models, with 5 models recording an Fnat of above 0.5.

AlphaFold did score 5 of its top models below an Fnat of 0.5. As expected, these are largely the same models in which the LRMSD was relatively large, often due to the flipping of the ligand within the correct binding site. IDP-LZerD did score relatively high Fnat values for several targets (also targets where the LRMSD was high). However, it was typical for the ranking of generated models, particularly for IDP-LZerD and CABS-Dock, to have sub-optimal ranking of the models. Thus, the quality generally increases from the top ranked model, best in the top, and best of all models (Figure 4B). For IDP-LZerD, the average Fnat values increased from 0.12, 0.23, and 0.33 for the top ranked, best in the top 5, and best of all models, respectively. For CABS-Dock, the average Fnat values increased from 0.05, 0.13, and 0.14 for the top ranked, best in the top 5, and best of all models, respectively.

Several of the models available in the PepBDB are derived from NMR targets or have unresolved regions in which the IDP is not completely contacting the receptor. This is a recognized weakness in the methodology of IDP-LZerD and CABS, since these two methods tend to bind all the residues in an IDP to the receptor surface by their algorithm design. This is the case with target 1JSP, in which only the N-terminal region is making contact with the receptor, while the C-terminal region is not making any contacts (Figure 5C). Both the best CABS-Dock and IDP-LZerD models attempted to keep the entire fragment in contact with the receptor (resulting in LRMSDs of 34.18 Å and 33.79 Å for the best in top 5 respectively), whereas the AlphaFold model was able to leave certain portions of the ligand out of contact with the receptor. While AlphaFold’s LRMSD was still lower than average (19.01 Å), the models were still able to keep some proper contacts with the receptor (Fnat: 0.29).

Atomic Level Interactions

Due to the remarkable performance of AlphaFold in the traditional protein docking metrics as discussed above, we decided to re-evaluate the modeling performance using more stringent, biophysical interaction-based metrics only for AlphaFold models. These are derived from geometric constraints for hydrogen bonds, hydrophobic contacts, and short-range ionic interactions. We compared the ability of AlphaFold to model these interactions (in the form of fractions of true interactions) against the Fnat values of the same target (Figure 6A). Despite the overwhelmingly high quality Fnat values of AlphaFold in Figure 4, the fractions of individual interactions often fall short of their relative Fnat values, even for more prominent interaction types, such as hydrogen bonds. Top ranked and best AlphaFold Interactions have average Fnat values of 0.76 and 0.78 respectively, but have average Fhbond values of 0.59 and 0.62, Fhphobe values of 0.72 and 0.73, and Fionic values of 0.75 and 0.77. The discrepancy between Fnat values and the interaction metrics is most apparent in the Fhbond values. In the plot in Figure 6A, we observe many cases where Fhbond is approximately half of Fnat.

Figure 6:

Figure 6:

Interaction Fractions of AlphaFold Models. A, scatterplots of Fnat against the fractions of the three interaction types, Fhbond, Fhyrdophobic, and Fionic. The top ranked AlphaFold Model (orange), as well as the best of all 5 models (blue) are shown. B, A missed hydrogen bond in the top ranked AlphaFold model of 3JUA (Yes-associated protein and TEF-3). Cyan and deepblue are the ligand chains of the ground truth and AlphaFold model, respectively, and lime and dark green are the receptor chains of the ground truth and AlphaFold model, respectively. C, AlphaFold model of 5OHG (Ribonuclease E and Enolase). Pink and magenta are the ligand chains of the ground truth and AlphaFold model respectively, and Cyan and deep blue are the receptor chains of the ground truth and AlphaFold model respectively. D, General sidechain mismodeling in target 5FCG (Protein X and Apoptosis regulator Bcl-2). Cyan and deepblue are the ligand chains of the ground truth and AlphaFold model, respectively, and lime and dark green are the receptor chains of the ground truth and AlphaFold model, respectively.

We visualize how AlphaFold sometimes misses making potential interactions in Figure 6. In Figure 6B, we see that in target 3JUA (Fnat: 0.88, Fhbond: 0.87) a missed hydrogen bond is purely due to sidechain rotamer modeling, in which the backbone of the receptor is well aligned, but the sidechains of both the donor and acceptor are rotated away from each other. In another example (Figure 6C), 5OHG (Fnat:0.87, Fionic:0.50) a missed ionic interaction is caused by the terminal region of the ligand chain to continue forward in an alpha helix, pushing the cation away from the anion. In the last example in Figure 6D, we observed a large discrepancy between Fnat and values of the three atomic level metrics. This target (5FCG) manages to maintain an Fnat value of 0.63, while the Fhbond, Fhphobe, and Fionic values are 0.17, 0.60, and 0.33, respectively. Many of the sidechains in this model have rotamers that place sidechains in wrong orientations with respect to the receptor. There errors are small enough to maintain a relatively large Fnat but fail to meet the criteria for the more stringent interaction based metrics.

Discussion

We evaluated models from three different modeling software packages for their ability to accurately model IDP-receptor interactions known from experimental crystal structures. We investigated modeling with metrics of various resolutions in ascending order: the Interface Precision and Recall, LRMSD, Fnat, and high-resolution fractions of interactions (Fhbond, , Fhphobe, , and Fionic) for intermolecular hydrogen bonds, hydrophobic contacts, and ionic interactions. We evaluated the top ranked, best within the top 5, and best of all generated models for each method.

In general, IDP-LZerD, CABS-Dock, and AlphaFold were all able to perform acceptably with low resolution metrics such as the fractions at the interface. However, as the resolution of the analysis metrics increases, AlphaFold does manage to outperform IDP-LZerD and CABS-Dock, with the difference in performance being particularly noticeable in the Fnat metric. Despite the tremendous leap forward AlphaFold has made to protein structure and complex prediction, there is still progress to be made on the atomic-interaction level at the interface of IDP-receptor complexes, as the fraction of interactions are often below the Fnat values. Particularly, Fhbond are often substantially lower than Fnat values. Future AlphaFold derivative modeling methods may aim to take advantage of this. By explicitly considering these types of biophysical interactions, one may be able to better model these interactions, but also may aid in other modeling errors of AlphaFold, such as flipping of the IDP in the receptor site.

It should be kept in mind that many known IDP-receptor complexes are fuzzy in nature and deviate significantly on the receptor surface. IDP-LZerD and CABS both exhibit significant conformational deviations in their generated models, yet are often able to place their models in the general vicinity of the binding site, perhaps making their modeling techniques better suited for these types of interactions. Although it is clear that AlphaFold maintains a strong performance lead with the well-ordered crystal structures tested here, the fuzzy nature of IDPs may still prove traditional physics-based methods such as IDP-LZerD and CABS-Dock useful. Future approaches in IDP-receptor modeling, particularly with those that exhibit dynamic interactions, are likely to include hybrid approaches between machine learning and traditional physics and MD based methods.

Supplementary Material

Supplementary Table 1

Acknowledgments

Jacob Verburgt is supported by NIGMS-funded predoctoral fellowship (T32 GM132024). Daisuke Kihara acknowledges support from the National Institutes of Health (R01GM133840, R01GM123055, and 3R01GM133840-02S1), the National Science Foundation (DBI2003635, DBI2146026, CMMI1825941, and MCB1925643).

Footnotes

Conflicts of Interest

The authors declare no conflict of interest.

References

  • [1].Ward JJ, Sodhi JS, McGuffin LJ, Buxton BF, and Jones DT, “Prediction and Functional Analysis of Native Disorder in Proteins from the Three Kingdoms of Life,” J. Mol. Biol, vol. 337, no. 3, pp. 635–645, Mar. 2004, doi: 10.1016/j.jmb.2004.02.002. [DOI] [PubMed] [Google Scholar]
  • [2].Hazra MK and Levy Y, “Biophysics of Phase Separation of Disordered Proteins Is Governed by Balance between Short- And Long-Range Interactions,” J. Phys. Chem. B, vol. 125, no. 9, pp. 2202–2211, Mar. 2021, doi: 10.1021/acs.jpcb.0c09975. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [3].Gabryelczyk B et al. , “Hydrogen bond guidance and aromatic stacking drive liquid-liquid phase separation of intrinsically disordered histidine-rich peptides,” Nat. Commun, vol. 10, no. 1, Art. no. 1, Nov. 2019, doi: 10.1038/s41467-019-13469-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Wright PE and Dyson HJ, “Intrinsically disordered proteins in cellular signalling and regulation,” Nat. Rev. Mol. Cell Biol, vol. 16, no. 1, Art. no. 1, Jan. 2015, doi: 10.1038/nrm3920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Wells M et al. , “Structure of tumor suppressor p53 and its intrinsically disordered N-terminal transactivation domain,” Proc. Natl. Acad. Sci, vol. 105, no. 15, pp. 5762–5767, Apr. 2008, doi: 10.1073/pnas.0801353105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].“Unraveling Molecular Interactions in Liquid–Liquid Phase Separation of Disordered Proteins by Atomistic Simulations | The Journal of Physical Chemistry B.” 10.1021/acs.jpcb.0c06288 (accessed Mar. 21, 2022). [DOI] [PubMed]
  • [7].Olsen JG, Teilum K, and Kragelund BB, “Behaviour of intrinsically disordered proteins in protein–protein complexes with an emphasis on fuzziness,” Cell. Mol. Life Sci, vol. 74, no. 17, pp. 3175–3183, 2017, doi: 10.1007/s00018-017-2560-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Lin Y-H and Chan HS, “Phase Separation and Single-Chain Compactness of Charged Disordered Proteins Are Strongly Correlated,” Biophys. J, vol. 112, no. 10, pp. 2043–2046, May 2017, doi: 10.1016/j.bpj.2017.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Das S, Eisen A, Lin Y-H, and Chan HS, “A Lattice Model of Charge-Pattern-Dependent Polyampholyte Phase Separation,” J. Phys. Chem. B, vol. 122, no. 21, pp. 5418–5431, May 2018, doi: 10.1021/acs.jpcb.7b11723. [DOI] [PubMed] [Google Scholar]
  • [10].Mészáros B, Tompa P, Simon I, and Dosztányi Z, “Molecular Principles of the Interactions of Disordered Proteins,” J. Mol. Biol, vol. 372, no. 2, pp. 549–561, Sep. 2007, doi: 10.1016/j.jmb.2007.07.004. [DOI] [PubMed] [Google Scholar]
  • [11].Méndez R, Leplae R, Maria LD, and Wodak SJ, “Assessment of blind predictions of protein–protein interactions: Current status of docking methods,” Proteins Struct. Funct. Bioinforma, vol. 52, no. 1, pp. 51–67, 2003, doi: 10.1002/prot.10393. [DOI] [PubMed] [Google Scholar]
  • [12].Lensink MF et al. , “Prediction of protein assemblies, the next frontier: The CASP14-CAPRI experiment,” Proteins Struct. Funct. Bioinforma, vol. 89, no. 12, pp. 1800–1823, 2021, doi: 10.1002/prot.26222. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [13].Kurcinski M et al. , “CABS-dock standalone: a toolbox for flexible protein-peptide docking,” Bioinforma. Oxf. Engl, vol. 35, no. 20, pp. 4170–4172, Oct. 2019, doi: 10.1093/bioinformatics/btz185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [14].Christoffer C and Kihara D, “IDP-LZerD: Software for Modeling Disordered Protein Interactions,” Methods Mol. Biol. Clifton NJ, vol. 2165, pp. 231–244, 2020, doi: 10.1007/978-1-0716-0708-4_13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Jumper J et al. , “Highly accurate protein structure prediction with AlphaFold,” Nature, vol. 596, no. 7873, pp. 583–589, Aug. 2021, doi: 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Evans R et al. , “Protein complex prediction with AlphaFold-Multimer,” Oct. 2021. doi: 10.1101/2021.10.04.463034. [DOI]
  • [17].Tsaban T, Varga JK, Avraham O, Ben-Aharon Z, Khramushin A, and Schueler-Furman O, “Harnessing protein folding neural networks for peptide–protein docking,” Nat. Commun, vol. 13, no. 1, Art. no. 1, Jan. 2022, doi: 10.1038/s41467-021-27838-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Johansson-Åkhe I and Wallner B, “Benchmarking Peptide-Protein Docking and Interaction Prediction with AlphaFold-Multimer.” bioRxiv, p. 2021.11.16.468810, Nov. 18, 2021. doi: 10.1101/2021.11.16.468810. [DOI] [Google Scholar]
  • [19].Wen Z, He J, Tao H, and Huang S-Y, “PepBDB: a comprehensive structural database of biological peptide–protein interactions,” Bioinformatics, vol. 35, no. 1, pp. 175–177, Jan. 2019, doi: 10.1093/bioinformatics/bty579. [DOI] [PubMed] [Google Scholar]
  • [20].Steinegger M and Söding J, “MMseqs2 enables sensitive protein sequence searching for the analysis of massive data sets,” Nat. Biotechnol, vol. 35, no. 11, Art. no. 11, Nov. 2017, doi: 10.1038/nbt.3988. [DOI] [PubMed] [Google Scholar]
  • [21].Vucetic S et al. , “DisProt: a database of protein disorder,” Bioinformatics, vol. 21, no. 1, pp. 137–140, Jan. 2005, doi: 10.1093/bioinformatics/bth476. [DOI] [PubMed] [Google Scholar]
  • [22].Brooks BR et al. , “CHARMM: The biomolecular simulation program,” J. Comput. Chem, vol. 30, no. 10, pp. 1545–1614, 2009, doi: 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Peterson LX, Roy A, Christoffer C, Terashi G, and Kihara D, “Modeling disordered protein interactions from biophysical principles,” PLOS Comput. Biol, vol. 13, no. 4, p. e1005485, Apr. 2017, doi: 10.1371/journal.pcbi.1005485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [24].Gront D, Kulp DW, Vernon RM, Strauss CEM, and Baker D, “Generalized Fragment Picking in Rosetta: Design, Protocols and Applications,” PLOS ONE, vol. 6, no. 8, p. e23294, Aug. 2011, doi: 10.1371/journal.pone.0023294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [25].Buchan DWA and Jones DT, “The PSIPRED Protein Analysis Workbench: 20 years on,” Nucleic Acids Res, vol. 47, no. W1, pp. W402–W407, Jul. 2019, doi: 10.1093/nar/gkz297. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [26].Wang S, Peng J, Ma J, and Xu J, “Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields,” Sci. Rep, vol. 6, no. 1, Art. no. 1, Jan. 2016, doi: 10.1038/srep18962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Spencer M, Eickholt J, and Cheng J, “A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction,” IEEE/ACM Trans. Comput. Biol. Bioinform, vol. 12, no. 1, pp. 103–112, Jan. 2015, doi: 10.1109/TCBB.2014.2343960. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Hanson J, Paliwal K, Litfin T, Yang Y, and Zhou Y, “Improving prediction of protein secondary structure, backbone angles, solvent accessibility and contact numbers by using predicted contact maps and an ensemble of recurrent and residual convolutional neural networks,” Bioinforma. Oxf. Engl, vol. 35, no. 14, pp. 2403–2410, Jul. 2019, doi: 10.1093/bioinformatics/bty1006. [DOI] [PubMed] [Google Scholar]
  • [29].Venkatraman V, Yang YD, Sael L, and Kihara D, “Protein-protein docking using region-based 3D Zernike descriptors,” BMC Bioinformatics, vol. 10, no. 1, p. 407, Dec. 2009, doi: 10.1186/1471-2105-10-407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [30].Christoffer C et al. , “LZerD webserver for pairwise and multiple protein-protein docking,” Nucleic Acids Res, May 2021, doi: 10.1093/nar/gkab336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [31].Zhou H and Skolnick J, “GOAP: A Generalized Orientation-Dependent, All-Atom Statistical Potential for Protein Structure Prediction,” Biophys. J, vol. 101, no. 8, pp. 2043–2052, Oct. 2011, doi: 10.1016/j.bpj.2011.09.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Zhou H and Zhou Y, “Distance-scaled, finite ideal-gas reference state improves structure-derived potentials of mean force for structure selection and stability prediction,” Protein Sci, vol. 11, no. 11, pp. 2714–2726, 2002, doi: 10.1110/ps.0217002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [33].Huang S-Y and Zou X, “Statistical mechanics-based method to extract atomic distance-dependent potentials from protein structures,” Proteins, vol. 79, no. 9, pp. 2648–2661, Sep. 2011, doi: 10.1002/prot.23086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [34].Esquivel-Rodríguez J, Yang YD, and Kihara D, “Multi-LZerD: Multiple protein docking for asymmetric complexes,” Proteins Struct. Funct. Bioinforma, vol. 80, no. 7, pp. 1818–1833, 2012, doi: 10.1002/prot.24079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35].Kmiecik S, Gront D, Kolinski M, Wieteska L, Dawid AE, and Kolinski A, “Coarse-Grained Protein Models and Their Applications,” Chem. Rev, vol. 116, no. 14, pp. 7898–7936, Jul. 2016, doi: 10.1021/acs.chemrev.6b00163. [DOI] [PubMed] [Google Scholar]
  • [36].Webb B and Sali A, “Protein structure modeling with MODELLER,” Methods Mol. Biol. Clifton NJ, vol. 1137, pp. 1–15, 2014, doi: 10.1007/978-1-4939-0366-5. [DOI] [PubMed] [Google Scholar]
  • [37].Suzek BE, Wang Y, Huang H, McGarvey PB, Wu CH, and the UniProt Consortium, “UniRef clusters: a comprehensive and scalable alternative for improving sequence similarity searches,” Bioinformatics, vol. 31, no. 6, pp. 926–932, Mar. 2015, doi: 10.1093/bioinformatics/btu739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [38].Mirdita M, von, den Driesch L, Galiez C, Martin MJ, Söding J, and Steinegger M, “Uniclust databases of clustered and deeply annotated protein sequences and alignments,” Nucleic Acids Res, vol. 45, no. D1, pp. D170–D176, Jan. 2017, doi: 10.1093/nar/gkw1081. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [39].Mitchell AL et al. , “MGnify: the microbiome analysis resource in 2020,” Nucleic Acids Res, vol. 48, no. D1, pp. D570–D578, Jan. 2020, doi: 10.1093/nar/gkz1035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [40].Marcou G and Rognan D, “Optimizing fragment and scaffold docking by use of molecular interaction fingerprints,” J. Chem. Inf. Model, vol. 47, no. 1, pp. 195–207, Feb. 2007, doi: 10.1021/ci600342e. [DOI] [PubMed] [Google Scholar]
  • [41].Word JM, Lovell SC, Richardson JS, and Richardson DC, “Asparagine and glutamine: using hydrogen atom contacts in the choice of side-chain amide orientation11Edited by J. Thornton,” J. Mol. Biol, vol. 285, no. 4, pp. 1735–1747, Jan. 1999, doi: 10.1006/jmbi.1998.2401. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Table 1

RESOURCES