Abstract
The Nedd4 family contains several structurally related but functionally distinct HECT-type ubiquitin ligases. The members of the Nedd4 family are known to recognize substrates through their multiple WW domains, which recognize PY motifs (PPxY, LPxY) or phospho-threonine or phospho-serine residues. To better understand protein interactor recognition mechanisms across the Nedd4 family, we report the development and implementation of a python-based tool, PxYFinder, to identify PY motifs in the primary sequences of previously identified interactors of Nedd4 and related ligases. Using PxYFinder, we find that, on average, half of Nedd4 family interactions are likely PY-motif mediated. Further, we find that PPxY motifs are more prevalent than LPxY motifs and are more likely to occur in proline-rich regions and that PPxY regions are more disordered on average relative to LPxY-containing regions. Informed by consensus sequences for PY motifs across the Nedd4 interactome, we rationally designed a focused peptide library and employed a computational screen, revealing sequence- and biomolecular interaction-dependent determinants of WW-domain/PY-motif interactions. Cumulatively, our efforts provide a new bioinformatic tool and expand our understanding of sequence and structural factors that contribute to PY-motif mediated interactor recognition across the Nedd4 family.
Introduction
Neuronal precursor cell-expressed developmentally downregulated 4 (Nedd4) is the founding member of a family of HECT-type E3 ubiquitin ligases that share a common architecture but have distinct cellular functions. The Nedd4 family is characterized by a multi-domain architecture comprised, from N- to C-terminus respectively, of a C2 domain for membrane localization, two to four WW domains for substrate recognition, and a catalytic HECT domain (Fig 1A) [1–5]. As the final enzyme in the ubiquitin signaling cascade, the Nedd4 family of HECT-type E3 ubiquitin ligases receives ubiquitin from a ubiquitin-E2 conjugating enzyme thioester adduct. The ubiquitin-HECT E3 conjugate then passes ubiquitin to a substrate protein via isopeptide bond formation at target lysine residues. Nedd4 and related HECT-type ligases are thus responsible for conferring substrate specificity in the ubiquitin signaling pathway. Understanding the specificity of the Nedd4 family of ubiquitin ligases is of particular interest due to the role of Nedd4 in the regulation of proteostasis in various conditions including cancers [6–8] and neurodegenerative disorders [9–16] and with recent insights into the potential of Nedd4 to serve as a therapeutic target [17–25].
It has been established that the Nedd4 ligases recognize substrates primarily through their WW domains, small structural domains characterized by a three-strand, anti-parallel β-sheet with two conserved tryptophan residues ~20 amino acids apart (Fig 1B) [8, 26–31]. WW domains are found in a variety of proteins and bind primarily to proline-rich regions of target proteins. In the Nedd4 family, WW domain-mediated interactor recognition occurs via binding of the WW domain to a substrate PY motif (PPxY or LPxY, where x can be any amino acid; Fig 1C), or phospho-threonine or phospho-serine residues (pT and pS, respectively) [26–28, 30–36]. There have been extensive efforts to characterize the nature of Nedd4 family ligases and their interactors, from solution state NMR [26, 30, 33, 37–40] and x-ray crystallography [29] characterization of WW domain-PY motif complexes and studies of WW domain-PY motif affinities [41], to affinity-based pull-down assays [42] and high-throughput microarray studies of Nedd4 binders. Through these efforts, significant information about the interactions of Nedd4 family has become available with ~100 to over 700 interactors annotated for different members of the ligase family (Table 1). Comparative analysis of the annotated interactors across the Nedd4 family indicates little overlap between the interactomes of the ligases, indicating that the members of the Nedd4 family are functionally distinct despite high structural conservation (Fig 2A). Gene ontology annotation [43, 44] of the interactomes furthers this, revealing that there are similar trends amongst affected biological processes but distinct patterns in protein classes that interact with each Nedd4 type ligase (Fig 2B).
Table 1. Number of previously identified interactors for each Nedd4 family ligase in the BioGrid protein-protein interaction database.
Ligase | # of annotated interactors |
---|---|
Nedd4 | 348 |
ITCH | 233 |
SMURF1 | 425 |
SMURF2 | 150 |
WWP1 | 112 |
WWP2 | 763 |
HECW1 | 26 |
HECW2 | 308 |
Using this available interactome data, we sought to analyze the features defining Nedd4 substrate specificity and the PY-dependent substrates of the ligase family. Through this effort, we aimed to characterize the prevalence of canonical PY motifs (both PPxY and LPxY) in the known Nedd4 family interactome to determine the frequency of PY-mediated interactor recognition. Further, we sought to determine the preferred amino acid identity at the x position and the sequence context of the PPxY and LPxY motifs to provide insight into the nature of the protein regions where these domains occur. To this end, we developed a python-based tool, termed PxYFinder, for rapid sequence-based analysis of the Nedd4 family interactome. Analysis of the primary sequences of known Nedd4 family interactors using PxYFinder revealed that PY-motifs occur in ~50% of the Nedd4 family interactome, with PPxY motifs occurring more frequently than LPxY motifs. Further bioinformatic analysis reveals that PPxY motifs are both more likely to occur in disordered and in solvent accessible regions than LPxY regions. Next, using consensus sequence data from the PY motif-containing interactors of the Nedd4 interactome, we conducted a computational analysis of PY peptide affinity using a rationally designed peptide library. Specifically, we screened combinations of the most common residues at the x–1 and x position (where x–1 denotes the residue immediately preceding PPxY or LPxY) using a combination of template-based peptide docking [45] and molecular mechanics-based binding affinity prediction [46] to identify residue-dependent trends in peptide binding affinity. Finally, to gain insight into PY-independent Nedd4/substrate interactions, we conducted an analysis of the non-PY motif containing Nedd4 substrates to identify possible alternative modes of interaction with the ligase. To this end, we screened non-PY substrates against the PhosphoSite database [47] to identify phospho-proteoforms that may drive Nedd4 recognition. Cumulatively, these analyses provide insight into the predominance and nature of PY-motif dependent protein-protein interactions versus PY-independent interactions in the Nedd4 family interactome and establishes a platform for further experimental interrogation of interaction specificity and affinity in the Nedd4 family of ubiquitin ligases.
Results
Identification and analysis PY motif sequences in the Nedd4 family interactome
To begin our analysis of PY motif-mediated interactions in the Nedd4 family, we first sought to determine the prevalence of PY motifs amongst interactors of the family. To this end, we retrieved interactome data for the Nedd4 family ligases (Nedd4-1, Nedd4-2, ITCH, WWP1, WWP2, SMURF1, SMURF2, HECW1, HECW2) from BioGrid [48, 49] using Homo sapiens as an organismal filter. Across the BioGrid database, the reported interactors have been identified via various means (affinity capture mass spectrometry, affinity capture western blotting, two hybrid, co-localization, biochemical activity assays, etc.). It is important to note that these techniques may also result in identification of indirect interactors (i.e. proteins identified through association with a multi-protein complex), and this idea is discussed further below. Across the Nedd4 ligase family, there are ~100 to over 700 annotated interactors for each of the ligases (Table 1; Fig 2), so we sought a rapid method to screen the interactor sequences for the presence or absence of PY motifs. Since PY motifs can be identified from the protein primary sequence and do not rely on predicted or annotated protein secondary structure or conformation, we developed a python-based script for rapid analysis of protein primary sequence to identify PY motifs (S1A Fig). This script, referred to as PxYFinder, was first validated by analyzing a published dataset in which PY motifs were annotated amongst a pool of proteins. To this end, we chose to employ a dataset by Persaud and co-workers [34] in which Nedd4 interactors were characterized via proteome array and PY motifs in identified Nedd4 interactors were annotated. This dataset included binding partners and ubiquitinated substrates of human Nedd4-1 and Nedd4-2 as well as rat Nedd4-1. Using the published dataset, available as S1 Table (Persaud et al., Mol Syst Biol, 2009, S1 Table at DOI: 10.1038/msb.2009.85) [34], we compiled a list of all PY-motif containing binding partners or substrates identified in screens of all three Nedd4 forms (human Nedd4-1, human Nedd4-2, and rat Nedd4-1). This list was subsequently analyzed with PxYFinder, which revealed that 69 of the 82 reported PY-containing proteins were identified as PY containing proteins with PxYFinder (S1B Fig, S1 Table). Of the identified interactors screened, 13 proteins were annotated as PY motif containing proteins by Persaud and co-workers but were not identified as PY-containing in our analysis (S1B Fig). To understand this discrepancy, the 13 proteins (canonical and all isoforms) were analyzed manually, revealing that the 13 proteins did not contain the PY motifs that they were reported to contain in the Persaud dataset (S2 Table) [34]. We further manually confirmed that the reported PY motifs were present in the PY-containing proteins that PxYFinder identified (S1 Table). This revealed that all hits (PY containing) and non-hits (no PY motif) as determined with PxYFinder were correctly classified. With this validation, we feel confident that our tool can rapidly identify PY motifs across primary sequences of a large set of proteins.
Using PxYFinder, we identified the prevalence of PPxY and LPxY motifs in the previously interactors of the Nedd4 family ligases as annotated in the BioGrid database (Fig 3A). To this end, we found that that, on average 33.3% of Nedd4 family interactors contain PPxY motifs while 15.7% contain LPxY and 51.0% contain neither PPxY or LPxY motifs. PY motifs appear more prevalent in the Nedd4 family interactomes relative to the annotated Homo sapiens proteome. Our analysis revealed that for all ligases studied, PPxY motifs were more prevalent in the interactomes than LPxY motifs. Interestingly, Nedd4-1, Nedd4-2, and ITCH showed similar trends wherein their interactomes were distributed with approximately 40% containing PPxY motifs, 20% containing LPxY motifs, and 40% containing no canonical PY motif. WWP1 showed similar trends with its distribution skewed slightly toward PPxY motif prevalence. For WWP2, SMURF1, SMURF2, HECW1 and HECW2, however, 50% or greater of the interactome lack canonical PY motifs. These results show that on average, approximately 50% of the known Nedd4 family interactome contains a canonical PY motif, providing a sequence-based evidence of the likely mode of interaction between Nedd4 and these substrates.
To further understand PY-dependent interactor recognition in the Nedd4 family interactome, we sought to characterize sequences of identified PY motifs to determine 1) if there was conservation of amino acid identity at the x position in the motif and 2) if there are characteristic features of the protein sequences up and downstream of the PY motif. To this end, we used PxYFinder to extract a slice of the FASTA string of each PY motif-containing interactor that included the identified motif and the 10 amino acids before and after the PY motif. After collecting these extracted sequences across the interactome, we looked for consensus sequences using the WebLogo tool [50] (Fig 3B, S2 Fig). Here, we see moderate conservation of residue identity at the x position of the PPxY motif across the Nedd4 family, with all but SMURF2, HECW1 and HECW2 having proline, serine, and glycine as the three highest probability amino acids in the x position in the consensus sequences. For LPxY containing proteins, the highest probability residue for the x position is shown to be serine or proline for all members of the Nedd4 family except for SMURF1, HECW1 and HECW2. Interestingly, the WebLogo analysis indicates that PPxY motifs are more likely to occur in proline rich regions of the substrate protein than LPxY motifs. In fact, proline is the highest probability residue at almost all of the up and downstream positions for PPxY motifs in substrates of Nedd4–1, ITCH, WWP1, WWP2, SMURF2 and HECW2. On the other hand, there is little sequence consensus up- and downstream of LPxY across the Nedd4 family, with a distribution of charged, polar, and non-polar residues present as highest probability residues across the consensus sequences.
To better understand the propensity of PY motifs to occur in proline-rich regions, we next sought to determine if proline-rich regions are enriched for PPxY motifs relative to chance. To this end, we queried the UniProt database for reviewed Homo sapiens protein sequences with annotated compositional bias for proline residues and analyzed these proteins using PxYFinder. This analysis revealed that, in a random sample of ~1300 proteins with annotated proline-rich regions, 16.8% contain PPxY motifs and 11.3% contain LPxY motifs. This analysis reveals that PY motifs (both PPxY and LPxY) occur at lower rates in proline rich regions in general compared in the Nedd4 family interactome (average prevalence in the Nedd4 family was found to 33.3% and 15.7% for PPxY and LPxY, respectively). The prevalence of PPxY in these proline-rich regions is higher than in the annotated Homo sapiens proteome, where 11.7% of proteins contained PPxY sequences (Fig 3A).
To gain further insight into the structural context of PY motifs in the Nedd4 family interactomes, we sought to characterize the accessibility of PY motifs and the likelihood that these motifs occur in ordered or disordered protein regions. To this end, a series of bioinformatic algorithms were employed to analyze the relative solvent accessible surface area (RSA) [51], disorder [52], and polyproline helix propensity [53] of the PY motifs identified across the Nedd4 interactome. As a case study, these analyses were conducted with the PY motifs identified in the interactomes of Nedd4-1, WWP2, and SMURF1 as these members of the Nedd4 family have the largest annotated interactome datasets (Fig 2A) and show distinct patterns in the types of protein classes with which they interact (Fig 2B).
For a ligase to recognize an interacting protein through a WW domain/PY motif interaction, it is necessary for the PY motif to be accessible for binding to the WW domain. As a measure of relative accessibility of PY motifs, we sought to characterize the RSA of the PY motifs across the Nedd4-1, WWP2 and SMURF1 interactomes. To this end, bioinformatic tool NetSurfP-2.0 was employed [51]. This platform employs a deep learning-based algorithm for high throughput prediction of various protein features from primary sequences including propensity for secondary structure formation (helix, coil, or sheet) and solvent accessible surface area. For this analysis, the full primary sequence of each PY-containing interactor was used as we anticipated that analysis of the extracted PY sequence alone may provide insufficient sequence and structural context. Interactors were further delineated by PPxY containing versus LPxY containing. As a measure of accessibility, RSA is calculated as a ratio of the total accessible surface area (ASA) of each residue in a protein structure to the maximum possible solvent accessible surface area of that residue [51, 54, 55]. Calculated RSA, therefore, is value between 0 and 1 where a larger value indicates a higher degree of relative solvent accessibility of a residue or protein region and a lower value indicates a less accessible or more “buried” residue. The NetSurfP-2.0 algorithm delineates accessible from buried residues with a threshold RSA value of 0.25 [51], such that a residue with RSA < 0.25 is considered buried in the core of the protein and therefore would not be accessible for participation in protein-protein interactions.
Calculation of the predicted RSA of PY motifs across the Nedd4-1 interactome revealed that identified LPxY motifs are more buried than PPxY motifs (Fig 4), with greater than 25% of LPxY motifs having average RSA values less than 0.25 across the motif. Over half of the LPxY motifs have calculated RSA values between 0.25 and 0.5, and just under 25% of the LPxY motifs have high accessibility scores (RSA > 0.5). In contrast, a majority of PPxY motifs have calculated RSA values over 0.5, indicating that a majority of PPxY motifs have higher degrees of solvent accessibility compared to LPxY motifs. This trend is consistent across the motifs analyzed in interactors of SMURF1 and WWP2. This result provides insight into the availability of PY motifs to engage in WW domain mediated interactions and indicates that the presence alone of a PY motif in the primary sequence does not guarantee that the PY motif is accessible for substrate recognition. Therefore, it is important to consider the context of secondary structure in complement with the primary sequence of putative WW domain substrates.
To further explore sequence context, we next employed the IUPred2A [52] tool to calculate the relative order of interactor sequences containing PPxY and LPxY motifs in Nedd4-1, WWP2, and SMURF1. Comparison of the average relative order (wherein a score > 0.5 indicates disorder) of these sequences reveals that, on average across the Nedd4-1 interactome, PPxY motifs occur in more disordered regions that LPxY motifs (Fig 5A). In the case of WWP2 and SMURF1, this trend was retained (S3A Fig). This result is also consistent with the observed trends in RSA in PPxY containing proteins relative to LPxY, where PPxY motifs are on average more solvent accessible than LPxY motifs. We anticipate that this may be a result of PPxY motifs occurring in more proline-rich regions than LPxY motifs (Fig 3B; S2 Fig) as proline-rich regions are associated with intrinsic disorder due to the geometric constraints imposed by the backbone of proline residues [56, 57]. To explore this further, we then computationally flipped the identity of the first residue in the PY motif (i.e. P in PPxY substituted with L to afford LPxY) and re-analyzed the sequence disorder using IUPred2A (S3B Fig). In this control study, we observed that the difference in observed disorder is decreased upon substitution of proline for leucine or vice versa. This result indicates that, while the PPxY motifs appear to occur in more disordered regions, the presence of the first proline residue in the motif is likely a large contributor to that disorder.
We then sought to analyze the effect of proline prevalence on the predicted order of the PY motif-containing regions via PPIIPred [53], a bioinformatic tool for identification of polyproline II (PPII) secondary structure, an extended helix-like structure that can occur in the presence of polyproline stretches. PPIIPred analysis reveals that PPxY motifs are more likely to display PPII structure immediately before or at the PY motif (residues 20–24 in extracted sequence slices) as compared to LPxY motifs (Fig 5B). The sequences up and downstream of the PY motifs, however, show similarly low propensities for PPII structure on average. We anticipate that the increased prevalence of proline residues in PPxY-containing regions contributes to relative disorder but does not induce PPII structure on average.
Rational design of PY motif peptide library for computational analysis
Analysis of previously resolved PY motif/WW domain complex structures from Nedd4 show moderate conservation of PY peptide backbone conformation regardless of primary sequence (Fig 6A). To better understand the effect of PY sequence on WW domain binding, we sought to determine if the sequence variants of the PY motif affected the predicted affinity with which the substrate of interest binds to Nedd4. To this end, we began with a previously resolved structure of a Nedd4 WW domain bound to a PY-motif peptide from a known substrate, sodium channel ENaC (PDB ID: 2M3O) as our model complex [30]. Using the ENaC peptide (sequence: TAPPPAYATLG, with PY motif in bold) as a template, we designed a peptide library based on the previously described consensus sequences. We chose to vary the residues at the x and x–1 position of the PY motifs (where x–1 is the residue immediately preceding PPxY or LPxY) as these are the residues which span the binding interface between the PY peptide and WW domain (Fig 6B). Based on the consensus sequences (Fig 3B, S2 Fig), we generated 15 variants each for PPxY and LPxY peptides using the template peptide (TAx–1PPxYATLG or TAx–1LPxYATLG) with all combinations of the three and five highest probability residues at the x–1 and x positions, respectively (Fig 6C). It should be noted that, as there is not a previously characterized complex of a hNedd4 WW domain bound to a PY-peptide with the LPxY motif, we opted to use the same template peptide for screening of both PPxY and LPxY motifs to allow for direct comparison across the suite without variation outside of the peptide core. Cumulatively, this design afforded 30 peptides in total for computational screening against a Nedd4 WW domain.
Docking and molecular mechanics analysis of WW domain/PY motif interactions
Prior to computational analysis of our rationally designed peptide library, we sought to determine the WW domain scope required to capture any sequence-dependent variation in WW domain binding across the Nedd4 family. To this end, we compared the conservation of WW domain sequences across the family of ligases. Each ligase contains 2–4 WW domains (Fig 1A), with moderate sequence similarity across the family (Fig 1B and S4A and S4B Fig). Analysis of key residues that interact with peptide substrates shows moderate conservation of the binding interface (S4C Fig). These residues, which are primarily located in the concave peptide binding cleft of the three-stranded β-sheet structure, drive the direct interaction of the WW domain with the PY motif (S4C Fig). Alignment of three representative WW domain structures with varied residue identity in the binding interface (Nedd4–1 (WW3) and ITCH (WW3 and WW4), all of which have been shown to bind ENaC, the substrate from which the peptide library was derived) shows conservation of secondary structure (S4C Fig). Additionally, analysis with MolProbity [58] and KiNG [59] indicate that the interactions between the peptide substrate and WW domain are predominantly mediated by van der Waals contacts with few inter-peptide and peptide-WW domain hydrogen bonds. Therefore, we anticipate that trends in binding affinity across the peptide library from screening against our model WW domain (PDB: 2M3O) will be representative as the electrostatic nature of the binding interface is largely conserved. Instead, we anticipate that sequence-dependent changes in the peptide-protein interactions and peptide conformation will have a greater effect on binding affinity than the identity of WW domain residues.
To begin our computational analysis of the rationally designed PY motif library against our suite of WW domain structures, we first sought a docking method that was amenable to docking peptides to protein targets rather than small molecule ligands. We determined that use of a template-based docking method was the most appropriate approach for our analysis as there are a number of PY peptide/WW domain complexes that have been reported. Therefore, known PY/WW complex structures can be used to guide the docking, improving efficiency and minimizing computational expense compared to a global docking approach. To this end, we employed GalaxyPepDock [45] to dock our library against a Nedd4 WW domain (PDB ID: 2M3O) [30]. Each GalaxyPepDock docking result provided 10 predicted poses for the peptide of interest. From these 10 poses, we selected the pose with the most similarity to the native substrate in the PY/WW complex (PDB ID: 2M3O) (S5 Fig). To further refine the docking result, we then used the Glide ligand docking tool with SP-Peptide precision setting from the Schrödinger suite [60–62] to optimize the conformation of the peptide backbone and side chains in the binding pocket. Finally, to obtain thermodynamic measurement of predicted peptide binding affinities, we employed molecular mechanics-based binding affinity prediction using the Generalized Born and surface area continuum solvation method (MM/GBSA, Schrödinger) [46], which considers the effect of solvation on binding energies using an implicit solvation model. From this calculation, we generated a total measurement of affinity as ΔGbinding in addition to various contributing energy terms, enabling analysis of biomolecular interactions that serve as driving forces in the peptide/protein interaction.
The predicted ΔGbinding values provide a relative measure of affinity across the peptide suite wherein a more negative number indicates a stronger predicted peptide/protein interaction. Docking results were analyzed by comparison to the predicted binding affinity of the native peptide (Fig 6D). Our docking analysis reveals that, in general, substitutions at the x–1 or x position in the PPxY peptide scaffold weaken the predicted binding affinity (indicated by a less negative ΔGbinding) with the exception of derivatives APPSY, EPPPY, and EPPGY. We anticipate that there is a significant deal of pre-organization in the native ligand around the tri-proline core (TAPPPAYATLG), and we hypothesize that alteration of the steric or electrostatic nature at the x position with retention of the tri-peptide core (PPPxY) is unfavorable as the peptide lacks flexibility to compensate for altered interactions with the WW domain. Screening of derivatives with alanine or glutamic acid at the x–1 position was slightly unfavorable, but derivatives APPSY, EPPPY demonstrated improved affinity, likely through an increased number of intramolecular interactions due to the bent conformation adopted by the optimized docked ligand (S6 Fig).
In the LPxY peptide library, the derivatives generally had stronger predicted binding affinities than the PPxY library members. We anticipate that this is a result of greater ligand flexibility resulting from the lessened conformational strain induced by the core proline-proline dipeptide. We also anticipate that the greater hydrophobicity of leucine relative to proline may drive binding of the PY peptide to the WW domain pocket, contributing to the trend observed. Several members of this peptide class that have strong predicted binding affinities adopted a bent conformation, increasing the number of intramolecular contacts. Further, we hypothesize that the increased polarity with substitutions of serine or threonine at the x–1 or x positions increases either dipole-mediated intramolecular interactions or stabilizes the peptide/WW domain complex by presenting the polar residue to the solvent accessible side of the peptide and promoting burial of the lipophilic residues in the WW domain binding pocket.
We then analyzed individual energetic contributions to overall binding affinity across the library of peptide analogues. This includes energy components of the free ligand or receptor, the optimized complex, or sub-components of the ΔGbinding measurements (i.e. contributions of individual interaction types). In general, van der Waals and Coulombic interactions contributed most strongly to binding affinity, while solvation energy accounted for the most disfavorable (positive ΔG) component (Fig 7A and S7 Fig). We next correlated all individual energy components to total ΔGbinding (Fig 7B and 7C and S8 Fig). Analysis of energy components from complex, ligand, and receptor showed that receptor energies had the lowest correlation with overall ΔGbinding while ligand and complex energy components had higher correlations (Fig 7B). Further analysis showed that ligand efficiency, a function of binding affinity relative to total non-hydrogen atoms, correlated most strongly with ΔGbinding (Fig 7C). Finally, this analysis reveals that van der Waals are most correlated with ΔGbinding, followed by Coloumbic and lipophilic interactions.
Analysis of non-PY motif substrates in the Nedd4 interactome
While we have extensively discussed the nature of PY motif-mediated interactions with WW domains, the nature of interactions that guide the remaining half of the interactome remain unclear from our analysis. It is likely that these interactions are guided by interactions at other sites in the ubiquitin ligase, such as is the case for E2 conjugating enzymes [63–65], which interact with the HECT domain, or for proteins like α-synuclein [11–14, 16], which has been shown to interact with the C2 domain and HECT domain of the ligase. Additionally, there is evidence for WW domain interactions with phospho-threonine or phospho-serine (pT, pS) [26]. In these cases, the Nedd4 interaction would be dependent upon specific phospho-proteoforms, the presence of which are regulated by other cellular pathways and is discussed further below.
To complement our analysis of PY motif-mediated protein-protein interactions in the Nedd4 family interactome, we sought to further analyze the pool of non-PY motif interactors in the annotated dataset. We first performed a functional analysis of non-PY interactors using the PANTHER GO annotation database [57, 58] to determine how many proteins in the interactome were involved in the ubiquitination process (for example, E2 conjugating enzymes that would bind to the ligases through the E2 interaction site on the catalytic HECT domain). From this annotation, we identified that the non-PY containing interactome contained a range from 2.19% (WWP2) to 11.63% (WWP1) across the Nedd4 family of Nedd4 (Table 2). This indicates that nearly all of the non-PY interactome is not comprised of upstream members of the ubiquitin signaling cascade but rather contains substrates or regulatory partners that interact with Nedd4 in a PY-independent manner (i.e. through phosphorylated residues or through C2, HECT, or linker interactions).
Table 2. Functional analysis of non-PY containing interactors involved in ubiquitination.
Ligase | # non-PY motif interactors | % of non-PY interactome involved in ubiquitination |
---|---|---|
Nedd4 | 144 | 3.73 |
ITCH | 96 | 11.46 |
SMURF1 | 312 | 5.36 |
SMURF2 | 81 | 10.94 |
WWP1 | 34 | 11.63 |
WWP2 | 483 | 2.19 |
To characterize the remainder of the non-PY containing proteins in the Nedd4 interactome, we next screened non-PY interactors for the presence of pT or pS residues as reported in the PhosphoSite protein phosphorylation database. Of the 153 non-PY interactors identified in the Nedd4 interactome, there are 128 proteins that are annotated in PhosphoSite database [47] to contain both pT and pS post-translational modifications (PTMs) while 17 proteins have been detected with either pT or pS and eight proteins have no reported pT or pS residues. Based on these previous reports, experimentally detected phosphorylation on threonine and/or serine residues occurs in 94.8% of the non-PY interactome. Thus, this provides putative evidence that phosphorylation at serine or threonine may be the driving force for Nedd4 recognition of non-PY interactors. Though experimental validation of putative WW domain/phospho-protein interaction specificity would be required, it is beyond the scope of this investigation.
As a final investigation of potential mechanisms underlying protein recognition by the Nedd4 family, we sought to compare the interactome of Nedd4 and those of non-PY, non-pS or pT interactors to determine if there are shared interactors between the identified protein and Nedd4. To this end, annotated protein-protein interactions of a small sampling of non-PY, non-phosphorylated Nedd4 interactors were cross referenced with the Nedd4 interactome (S9 Fig). This analysis revealed a number of shared secondary interactors that contain PY motifs or are known to have pT or pS proteoforms. This evidence indicates a potential role of protein complex formation in the identification of Nedd4 interactors. We hypothesize that the non-PY, non-phosphorylated interactors may be part of a larger protein complex that is recognized by Nedd4 and, thus, these proteins were enriched in affinity capture methods and were subsequently annotated in the Nedd4 interactome. Therefore, their recognition by Nedd4 may not be through direct formation of a protein-protein interaction with Nedd4 but is instead scaffolded by other Nedd4 interactors in a larger complex. There is evidence for this idea of substrate clustering, as demonstrated by Mund and Pelham [66], who determined that Nedd4 more efficiently recognized polymerized or clustered substrates relative to the monomeric or isolated forms.
Finally, it is important to note that Nedd4 has a number of disordered linker regions that may influence the interaction specificity of the enzyme. It is known that the Nedd4 linker domains contribute to autoregulation of Nedd4 activity by forming or facilitating intramolecular interactions [67–69], but increasing knowledge of protein interactions has revealed that disordered linker regions also participate in protein-protein interaction events [70–73]. Therefore, it is possible that non-PY motif interactions may be mediated through binding in disordered regions of the Nedd4 family enzymes.
Discussion
We have employed a combination of bioinformatic and computational analyses to gain insight into the sequence and structural properties that drive interaction specificity in the Nedd4 interactome. We began our analysis with the development and implementation of PxYFinder to rapidly identify the presence or absence of canonical PY motifs in a library of FASTA sequences. Using this tool in combination with interactome data available through the BioGrid database, we determined that, on average, 33.8% of Nedd4 family interactors contain PPxY motifs while 15.5% contain LPxY and 50.6% contain neither PPxY or LPxY motifs. This demonstrates that canonical PY motifs drive only half of the WW-domain mediated interactions in the known interactome on average, and that screening for PY motifs is not sufficient on its own for identification of putative Nedd4 family substrates. In general, all members of the Nedd4 family that we analyzed have more interactors that contain PPxY motifs than LPxY motifs, and consensus sequence analysis reveals that PPxY motifs are more likely to occur in proline-rich, disordered, and solvent accessible regions than LPxY motifs. While this analysis expands our understanding of the prevalence of PY motifs in these interactome, it should be noted that the presence of a PY motif sequence in an interactor does not guarantee recognition by a Nedd4 family ligase. For instance, our bioinformatic analysis revealed that some of these sequences may be buried in the protein core and are thus inaccessible for WW domain/PY motif-mediated recognition. Therefore, future efforts may focus on experimental validation of these specific recognition modes across the Nedd4 family. Despite this, the combination of bioinformatic analyses employed reveals the prevalence as well as the sequence and structural context in which these motifs occur.
Using the information obtained from PxYFinder and consensus sequence analysis of the Nedd4 interactome, we then sought to computationally analyze sequence-dependent effects on PY peptide/WW domain binding. To this end, we employed a multi-step computational analysis of peptide binding affinity using a previously resolved structure of the Nedd4 WW domain. Specifically, we designed a library of PY peptides (both PPxY and LPxY motifs) which contain all combinations of the three and five most commonly occurring residues at the x–1 and x positions based on our bioinformatics-derived consensus sequences. We then employed a multi-step computational analysis wherein we began with template-based docking of the peptide substrate to the WW domain structure, followed by refinement of the complex and analysis of thermodynamic binding parameters via MM-GBSA. From this effort, we determined that the PPxY scaffold is less tolerant to substitutions than the LPxY scaffold. We hypothesize that this is a result of pre-organization in the poly-proline backbone of the PPxY peptides. Therefore, incorporation of residues that increase peptide flexibility or polarity tend to improve binding affinity. As predicted, our analysis reveals that binding affinity is most strongly driven by van der Waals interactions, with positive though lesser correlations to Coulombic and lipophilic interactions.
To gain further insight into the role of WW-domain binding in Nedd4 family substrate recognition, we analyzed the non-PY containing interactome of Nedd4 as a case study. This analysis reveals that nearly all of the non-PY substrates have been previously annotated to have phosphorylation at threonine and/or serine residues, providing a putative indication of WW-domain recognition independent of canonical PY motifs. While experimental validation of these hypotheses would be necessary to confirm the mechanism of Nedd4 recognition, our bioinformatic analysis provides valuable insight into possible modes of binding.
Cumulatively, the results presented herein provide insight into the prevalence and nature of PY motifs in the Nedd4 interactome. We anticipate that PxYFinder will be useful in screening large datasets for putative WW-domain interactors (both in the Nedd4 family and for other WW domain-containing proteins) and addresses a gap in current bioinformatic tools for which there is not an established method for identification of PY motifs in a large dataset. Further, our analysis of identified PY motifs expanded our understanding of the conservation of residues in and around the motif. Specifically, we demonstrated that, despite differences in interactor specificity that cause the Nedd4 family ligases to be functionally distinct, trends in sequence and structural context of the PY motifs are largely conserved across the family. This indicates that the specificity is driven not by protein structure alone but to a higher level of regulation. Finally, our efforts informed a computational analysis of sequence-dependent changes in PY peptide binding affinity. While the binding parameters obtained in this computational analysis are relative, we anticipate that our results will be useful in informing experimental design of PY peptide libraries either for interrogating the nature of the peptide/protein interaction or for designing inhibitors that target PY peptide/WW domain complexes.
Methods
Development and use of PxYFinder script
A python script, termed PxYFinder, was developed in Python 3.8 to perform the following workflow: PxYFinder imports FASTA sequences and iterates through primary sequences to identify PPY, PPxY, or LPxY. If a PY motif is identified, PxYFinder extracts a slice of the FASTA string that contains the PY motif and x (user-specified) amino acids up and down stream, copying this slice to a new.csv file. Code and documentation for PxYFinder is available as S1 File.
For analysis of Nedd4 family interactors, interactome data for each Nedd4 family member of interest (Nedd4, ITCH, WWP1, WWP2, SMURF1, SMURF2, HECW1, HECW2) was retrieved from BioGrid using Homo sapiens as an organismal filter. Gene names were converted to UniprotIDs and were used to retrieve FASTA sequences from the Uniprot database. PxYFinder was used to identify and extract PY motifs in the interactome of each ligase and calculate prevalence of PY motifs in each interactome. Graphs were generated using Prism (GraphPad). PY motif consensus sequences were determined by analysis with WebLogo (http://weblogo.threeplusone.com/) [50] with probability as the y-axis measure.
Prediction of protein order in PY-motif containing segments
Interactor sequences were subsequently analyzed using the IUPred2A and PPIIPred tools for prediction of overall disorder (IUPred2A) [52] and propensity to form polyproline secondary structures (PPIIPred) [53]. Data was visualized as mean ± S.D. and analyzed using paired and unpaired (Welch’s) t-test to determine statistical differences between specific residue positions (numbered 1–44) and across the full sequence, respectively. Data visualization and analysis was performed in Prism (GraphPad).
Relative solvent accessible surface area was calculated across the full primary sequence of identified PY-motif containing proteins using NetSurfP-2.0 [51], and calculated RSA of the four PY motif residues in each protein was extracted and averaged across the motif. Data visualization performed in Prism GraphPad.
Docking and molecular mechanics analysis of PY peptide library
A 30-member peptide library was generated based on the consensus sequences in the PY motif interactome of Nedd4. The top three and five residues at the x–1 and x positions respectively were paired in all possible combinations to generate 15 peptides each for PPxY and for LPxY libraries using a previously characterized substrate peptide bound to a Nedd4 WW domain (PDB ID: 2M3O) [30] as a template. All peptides were first docked via GalaxyPepDock [45] using 2M3O as a template structure. Docked complexes were further refined using the Schrödinger suite (Schrödinger Release 2020–3, Schrödinger, LLC, New York, NY, 2020). Specifically, the complex generated using GalaxyPepDock was prepared using the Protein Preparation Wizard and LigPrep tools [74]. Using the Glide tool [60], a docking grid for the WW domain was generated using (Glide Receptor Grid Generation), and the ligand was docked as a flexible ligand to the generated grid using SP-Peptide function with retention of amide bond conformation and restriction of docking poses to 0.50 Å tolerance for core pattern comparison relative to the native ligand conformation. Following docking, the Schrödinger Prime MM-GBSA [46, 61, 62] tool was used to analyze the PoseView (PV) docking output file with the VSGB solvation model and OPLSe3 force field to generate ΔGbinding for each generated pose. For each peptide, the pose with most negative ΔGbinding was selected for comparison of predicted binding affinities across the peptide library. Graphs of binding data were generated using Prism GraphPad. Structural analysis and visualization were performed with PyMol [75], MolProbity [58], and/or KiNG [59].
Supporting information
Acknowledgments
The authors would like to thank the Duke University Department of Chemistry Computing Services for access to the Schrödinger software suite and computational servers. They also thank the department for support for M.D.P. through the summer undergraduate research fellowship program. Finally, the authors thank the members of the McCafferty lab for their thoughtful feedback on the project and manuscript.
Data Availability
All relevant data are within the paper and its Supporting information files.
Funding Statement
This work was kindly supported by Duke University, National Institutes of Health National Institute of Neurological Disorders and Stroke Grant 1R21NS112927-01 to D.G.M. (https://www.ninds.nih.gov), Michael J. Fox Foundation Grant 16250 to D.G.M. (https://www.michaeljfox.org), and National Science Foundation Graduate Research Fellowship GRFP 2017248946 to A.K.H. (https://www.nsfgrfp.org/). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Boase NA, Kumar S (2015) NEDD4: The founding member of a family of ubiquitin-protein ligases. Gene 557:113–122. doi: 10.1016/j.gene.2014.12.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Ingham RJ, Gish G, Pawson T (2004) The Nedd4 family of E3 ubiquitin ligases: functional diversity within a common modular architecture. Oncogene 23:1972–1984. doi: 10.1038/sj.onc.1207436 [DOI] [PubMed] [Google Scholar]
- 3.Rotin D, Kumar S (2009) Physiological functions of the HECT family of ubiquitin ligases. Nat. Rev. Mol. Cell Biol. 10:398–409. doi: 10.1038/nrm2690 [DOI] [PubMed] [Google Scholar]
- 4.Mari S, Ruetalo N, Maspero E, Stoffregen MC, Pasqualato S, Polo S, et al. (2014) Structural and functional framework for the autoinhibition of nedd4-family ubiquitin ligases. Structure 22:1639–1649. doi: 10.1016/j.str.2014.09.006 [DOI] [PubMed] [Google Scholar]
- 5.Anan T, Nagata Y, Koga H, Honda Y, Yabuki N, Miyamoto C, et al. (1998) Human ubiquitin-protein ligase Nedd4: Expression, subcellular localization and selective interaction with ubiquitin-conjugating enzymes. Genes Cells 3:751–763. doi: 10.1046/j.1365-2443.1998.00227.x [DOI] [PubMed] [Google Scholar]
- 6.Zou X, Levy-Cohen G, Blank M (2015) Molecular functions of NEDD4 E3 ubiquitin ligases in cancer. Biochim. Biophys. Acta BBA—Rev. Cancer 1856:91–106. doi: 10.1016/j.bbcan.2015.06.005 [DOI] [PubMed] [Google Scholar]
- 7.Weber J, Polo S, Maspero E (2019) HECT E3 Ligases: A Tale With Multiple Facets. Front. Physiol. 10:370–370. doi: 10.3389/fphys.2019.00370 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Scheffner M, Kumar S (2014) Mammalian HECT ubiquitin-protein ligases: Biological and pathophysiological aspects. Biochim. Biophys. Acta—Mol. Cell Res. 1843:61–74. doi: 10.1016/j.bbamcr.2013.03.024 [DOI] [PubMed] [Google Scholar]
- 9.Kwak YD, Wang B, Li JJ, Wang R, Deng Q, Diao S, et al. (2012) Upregulation of the E3 ligase NEDD4-1 by oxidative stress degrades IGF-1 receptor protein in neurodegeneration. J. Neurosci. 32:10971–10981. doi: 10.1523/JNEUROSCI.1836-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kim E, Wang B, Sastry N, Masliah E, Nelson PT, Cai H, et al. (2016) NEDD4-mediated HSF1 degradation underlies α-synucleinopathy. Hum. Mol. Genet. 25:211–222. doi: 10.1093/hmg/ddv445 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Sugeno N, Hasegawa T, Tanaka N, Fukuda M, Wakabayashi K, Oshima R, et al. (2014) Lys-63-linked ubiquitination by E3 ubiquitin ligase Nedd4-1 facilitates endosomal sequestration of internalized α-synuclein. J. Biol. Chem. 289:18137–51. doi: 10.1074/jbc.M113.529461 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Davies SE, Hallett PJ, Moens T, Smith G, Mangano E, Kim HT, et al. (2014) Enhanced ubiquitin-dependent degradation by Nedd4 protects against α-synuclein accumulation and toxicity in animal models of Parkinson’s disease. Neurobiol. Dis. 64:79–87. doi: 10.1016/j.nbd.2013.12.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Tofaris GK, Kim HT, Hourez R, Jung J-W, Kim KP, Goldberg AL (2011) Ubiquitin ligase Nedd4 promotes alpha-synuclein degradation by the endosomal-lysosomal pathway. Proc. Natl. Acad. Sci. U. S. A. 108:17004–9. doi: 10.1073/pnas.1109356108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Mund T, Masuda-Suzukake M, Goedert M, Pelham HR (2018) Ubiquitination of alpha-synuclein filaments by Nedd4 ligases Massiah M, editor. PLOS ONE 13:e0200763–e0200763. doi: 10.1371/journal.pone.0200763 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Perrett RM, Alexopoulou Z, Tofaris GK (2015) The endosomal pathway in Parkinson’s disease. Mol. Cell. Neurosci. 66:21–28. doi: 10.1016/j.mcn.2015.02.009 [DOI] [PubMed] [Google Scholar]
- 16.Rott R, Szargel R, Shani V, Hamza H, Savyon M, Elghani FA, et al. (2017) SUMOylation and ubiquitination reciprocally regulate α-synuclein degradation and pathological aggregation. 114:13176–13181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Ye X, Wang L, Shang B, Wang Z, Wei W (2014) NEDD4: A Promising Target for Cancer Therapy. Curr. Cancer Drug Targets 14:549–556. doi: 10.2174/1568009614666140725092430 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Quirit JG, Lavrenov SN, Poindexter K, Xu J, Kyauk C, Durkin KA, et al. (2017) Indole-3-carbinol (I3C) analogues are potent small molecule inhibitors of NEDD4-1 ubiquitin ligase activity that disrupt proliferation of human melanoma cells. Biochem. Pharmacol. 127:13–27. doi: 10.1016/j.bcp.2016.12.007 [DOI] [PubMed] [Google Scholar]
- 19.Tardiff DF, Jui NT, Khurana V, Tambe MA, Thompson ML, Chung CY, et al. (2013) Yeast Reveal a “Druggable” Rsp5/Nedd4 Network that Ameliorates α-Synuclein Toxicity in Neurons. Science 342:979–983. doi: 10.1126/science.1245321 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hatstat AK, Ahrendt HD, Foster MW, Mayne L, Moseley MA, Englander SW, et al. (2021) Characterization of Small-Molecule-Induced Changes in Parkinson’s-Related Trafficking via the Nedd4 Ubiquitin Signaling Cascade. Cell Chem. Biol. 28:14–25.e9. doi: 10.1016/j.chembiol.2020.10.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Zinzalla G (2015) Paving the way to targeting HECT ubiquitin ligases. Future Med. Chem. 7:2107–2111. doi: 10.4155/fmc.15.141 [DOI] [PubMed] [Google Scholar]
- 22.Han Z, Lu J, Liu Y, Davis B, Lee MS, Olson MA, et al. (2014) Small-molecule probes targeting the viral PPxY-host Nedd4 interface block egress of a broad range of RNA viruses. J. Virol. 88:7294–306. doi: 10.1128/JVI.00591-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Tian M, Zeng T, Liu M, Han S, Lin H, Lin Q, et al. (2019) A cell-based high-throughput screening method based on a ubiquitin-reference technique for identifying modulators of E3 ligases. J. Biol. Chem. 294:2880–2891. doi: 10.1074/jbc.RA118.003822 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Fajner V, Maspero E, Polo S (2017) Targeting HECT-type E3 ligases–insights from catalysis, regulation and inhibitors. FEBS Lett. 591:2636–2647. doi: 10.1002/1873-3468.12775 [DOI] [PubMed] [Google Scholar]
- 25.Chen D, Gehringer M, Lorenz S (2018) Developing Small-Molecule Inhibitors of HECT-Type Ubiquitin Ligases for Therapeutic Applications: Challenges and Opportunities. ChemBioChem 19:2123–2135. doi: 10.1002/cbic.201800321 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Aragón E, Goerner N, Zaromytidou A-I, Xi Q, Escobedo A, Massagué J, et al. (2011) A Smad action turnover switch operated by WW domain readers of a phosphoserine code. Genes Dev. 25:1275–1288. doi: 10.1101/gad.2060811 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Aragón E, Goerner N, Xi Q, Gomes T, Gao S, Massagué J, et al. (2012) Structural Basis for the Versatile Interactions of Smad7 with Regulator WW Domains in TGF-β Pathways. Structure 20:1726–1736. doi: 10.1016/j.str.2012.07.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Escobedo A, Gomes T, Aragón E, Martín-Malpartida P, Ruiz L, Macias MJ (2014) Structural basis of the activation and degradation mechanisms of the E3 ubiquitin ligase Nedd4L. Structure 22:1446–1457. doi: 10.1016/j.str.2014.08.016 [DOI] [PubMed] [Google Scholar]
- 29.Qi S, O’Hayre M, Gutkind JS, Hurley JH (2014) Structural and Biochemical Basis for Ubiquitin Ligase Recruitment by Arrestin-related Domain-containing Protein-3 (ARRDC3). J. Biol. Chem. 289:4743–4752. doi: 10.1074/jbc.M113.527473 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bobby R, Medini K, Neudecker P, Lee TV, Brimble MA, McDonald FJ, et al. (2013) Structure and dynamics of human Nedd4-1 WW3 in complex with the αENaC PY motif. Biochim. Biophys. Acta BBA—Proteins Proteomics 1834:1632–1641. doi: 10.1016/j.bbapap.2013.04.031 [DOI] [PubMed] [Google Scholar]
- 31.Panwalkar V, Neudecker P, Schmitz M, Lecher J, Schulte M, Medini K, et al. (2016) The Nedd4-1 WW Domain Recognizes the PY Motif Peptide through Coupled Folding and Binding Equilibria. Biochemistry 55:659–674. doi: 10.1021/acs.biochem.5b01028 [DOI] [PubMed] [Google Scholar]
- 32.Chen HI, Sudol M (1995) The WW domain of Yes-associated protein binds a proline-rich ligand that differs from the consensus established for Src homology 3-binding modules. Proc. Natl. Acad. Sci. U. S. A. 92:7819–7823. doi: 10.1073/pnas.92.17.7819 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Wahl LC, Watt JE, Yim HTT, De Bourcier D, Tolchard J, Soond SM, et al. (2019) Smad7 Binds Differently to Individual and Tandem WW3 and WW4 Domains of WWP2 Ubiquitin Ligase Isoforms. Int. J. Mol. Sci. 20:4682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Persaud A, Alberts P, Amsen EM, Xiong X, Wasmuth J, Saadon Z, et al. (2009) Comparison of substrate specificity of the ubiquitin ligases Nedd4 and Nedd4-2 using proteome arrays. Mol. Syst. Biol. 5:333–333. doi: 10.1038/msb.2009.85 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Iconomou M, Saunders DN (2016) Systematic approaches to identify E3 ligase substrates. Biochem. J. 473:4083–4101. doi: 10.1042/BCJ20160719 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Edwin F, Anderson K, Patel TB (2009) HECT Domain-containing E3 Ubiquitin Ligase Nedd4 Interacts with and Ubiquitinates Sprouty2. 285:255–264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kanelis V, Bruce MC, Skrynnikov NR, Rotin D, Forman-Kay JD (2006) Structural determinants for high-affinity binding in a Nedd4 WW3* domain-comm PY motif complex. Structure 14:543–553. doi: 10.1016/j.str.2005.11.018 [DOI] [PubMed] [Google Scholar]
- 38.Kanelis V, Rotin D, Forman-Kay JD (2001) Solution structure of a Nedd4 WW domain-ENaC peptide complex. Nat. Struct. Biol. 8:407–412. doi: 10.1038/87562 [DOI] [PubMed] [Google Scholar]
- 39.Kanelis V, Farrow NA, Kay LE, Rotin D, Forman-Kay JD (1998) NMR studies of tandem WW domains of Nedd4 in complex with a PY motif-containing region of the epithelial sodium channel. Biochem. Cell Biol. Biochim. Biol. Cell. 76:341–350. doi: 10.1139/bcb-76-2-3-341 [DOI] [PubMed] [Google Scholar]
- 40.Chong PA, Lin H, Wrana JL, Forman-Kay JD (2006) An expanded WW domain recognition motif revealed by the interaction between Smad7 and the E3 ubiquitin ligase Smurf2. J. Biol. Chem. 281:17069–17075. doi: 10.1074/jbc.M601493200 [DOI] [PubMed] [Google Scholar]
- 41.Henry PC, Kanelis V, O’Brien MC, Kim B, Gautschi I, Forman-Kay J, et al. (2003) Affinity and specificity of interactions between Nedd4 isoforms and the epithelial Na+ channel. J. Biol. Chem. 278:20019–20028. doi: 10.1074/jbc.M211153200 [DOI] [PubMed] [Google Scholar]
- 42.Milkereit R, Rotin D (2011) A Role for the Ubiquitin Ligase Nedd4 in Membrane Sorting of LAPTM4 Proteins. PLOS ONE 6:e27478. doi: 10.1371/journal.pone.0027478 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Mi H, Thomas P (2009) PANTHER pathway: an ontology-based pathway database coupled with data analysis tools. Methods Mol. Biol. Clifton NJ 563:123–140. doi: 10.1007/978-1-60761-175-2_7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Mi H, Lazareva-Ulitsky B, Loo R, Kejariwal A, Vandergriff J, Rabkin S, et al. (2005) The PANTHER database of protein families, subfamilies, functions and pathways. Nucleic Acids Res. 33:D284–D288. doi: 10.1093/nar/gki078 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Lee H, Seok C (2017) Template-Based Prediction of Protein-Peptide Interactions by Using GalaxyPepDock. Methods Mol. Biol. Clifton NJ 1561:37–47. doi: 10.1007/978-1-4939-6798-8_4 [DOI] [PubMed] [Google Scholar]
- 46.Genheden S, Ryde U (2015) The MM/PBSA and MM/GBSA methods to estimate ligand-binding affinities. Expert Opin. Drug Discov. 10:449–461. doi: 10.1517/17460441.2015.1032936 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Hornbeck PV, Zhang B, Murray B, Kornhauser JM, Latham V, Skrzypek E (2015) PhosphoSitePlus, 2014: mutations, PTMs and recalibrations. Nucleic Acids Res. 43:D512–520. doi: 10.1093/nar/gku1267 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Oughtred R, Stark C, Breitkreutz B-J, Rust J, Boucher L, Chang C, et al. (2019) The BioGRID interaction database: 2019 update. Nucleic Acids Res. 47:D529–D541. doi: 10.1093/nar/gky1079 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Stark C, Breitkreutz B-J, Reguly T, Boucher L, Breitkreutz A, Tyers M (2006) BioGRID: a general repository for interaction datasets. Nucleic Acids Res. 34:D535–9. doi: 10.1093/nar/gkj109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Crooks GE, Hon G, Chandonia J-M, Brenner SE (2004) WebLogo: a sequence logo generator. Genome Res. 14:1188–1190. doi: 10.1101/gr.849004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Klausen MS, Jespersen MC, Nielsen H, Jensen KK, Jurtz VI, Sønderby CK, et al. (2019) NetSurfP-2.0: Improved prediction of protein structural features by integrated deep learning. Proteins 87:520–527. doi: 10.1002/prot.25674 [DOI] [PubMed] [Google Scholar]
- 52.Mészáros B, Erdős G, Dosztányi Z (2018) IUPred2A: context-dependent prediction of protein disorder as a function of redox state and protein binding. Nucleic Acids Res. 46:W329–W337. doi: 10.1093/nar/gky384 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.O’Brien KT, Mooney C, Lopez C, Pollastri G, Shields DC. Prediction of polyproline II secondary structure propensity in proteins. R. Soc. Open Sci. 7:191239. doi: 10.1098/rsos.191239 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Marsh JA, Teichmann SA (2011) Relative Solvent Accessible Surface Area Predicts Protein Conformational Changes upon Binding. Struct. England1993 19:859–867. doi: 10.1016/j.str.2011.03.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Samanta U, Bahadur RP, Chakrabarti P (2002) Quantifying the accessible surface area of protein residues in their local environment. Protein Eng. Des. Sel. 15:659–667. doi: 10.1093/protein/15.8.659 [DOI] [PubMed] [Google Scholar]
- 56.Theillet F-X, Kalmar L, Tompa P, Han K-H, Selenko P, Dunker AK, et al. (2013) The alphabet of intrinsic disorder. Intrinsically Disord. Proteins 1:e24360. doi: 10.4161/idp.24360 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Morgan AA, Rubenstein E (2013) Proline: The Distribution, Frequency, Positioning, and Common Functional Roles of Proline and Polyproline Sequences in the Human Proteome. PLOS ONE 8:e53785. doi: 10.1371/journal.pone.0053785 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Williams CJ, Headd JJ, Moriarty NW, Prisant MG, Videau LL, Deis LN, et al. (2018) MolProbity: More and better reference data for improved all-atom structure validation. Protein Sci. Publ. Protein Soc. 27:293–315. doi: 10.1002/pro.3330 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Chen VB, Davis IW, Richardson DC (2009) KING (Kinemage, Next Generation): A versatile interactive molecular and scientific visualization program. Protein Sci. Publ. Protein Soc. 18:2403–2409. doi: 10.1002/pro.250 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Friesner RA, Murphy RB, Repasky MP, Frye LL, Greenwood JR, Halgren TA, et al. (2006) Extra Precision Glide: Docking and Scoring Incorporating a Model of Hydrophobic Enclosure for Protein−Ligand Complexes. J. Med. Chem. 49:6177–6196. doi: 10.1021/jm051256o [DOI] [PubMed] [Google Scholar]
- 61.Jacobson MP, Pincus DL, Rapp CS, Day TJF, Honig B, Shaw DE, et al. (2004) A hierarchical approach to all-atom protein loop prediction. Proteins 55:351–367. doi: 10.1002/prot.10613 [DOI] [PubMed] [Google Scholar]
- 62.Jacobson MP, Friesner RA, Xiang Z, Honig B (2002) On the Role of the Crystal Environment in Determining Protein Side-chain Conformations. J. Mol. Biol. 320:597–608. doi: 10.1016/s0022-2836(02)00470-9 [DOI] [PubMed] [Google Scholar]
- 63.Huang L (1999) Structure of an E6AP-UbcH7 Complex: Insights into Ubiquitination by the E2-E3 Enzyme Cascade. Science 286:1321–1326. doi: 10.1126/science.286.5443.1321 [DOI] [PubMed] [Google Scholar]
- 64.Das R, Liang Y-H, Mariano J, Li J, Huang T, King A, et al. (2013) Allosteric regulation of E2:E3 interactions promote a processive ubiquitination machine. EMBO J. 32:2504–2516. doi: 10.1038/emboj.2013.174 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Chakrabarti KS, Li J, Das R, Byrd RA (2017) Conformational Dynamics and Allostery in E2:E3 Interactions Drive Ubiquitination: gp78 and Ube2g2. 25:794–805.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Mund T, Pelham HR (2018) Substrate clustering potently regulates the activity of WW-HECT domain–containing ubiquitin ligases. J. Biol. Chem. 293:5200–5209. doi: 10.1074/jbc.RA117.000934 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Grimsey NJ, Narala R, Rada CC, Mehta S, Stephens BS, Kufareva I, et al. (2018) A Tyrosine Switch on NEDD4-2 E3 Ligase Transmits GPCR Inflammatory Signaling. Cell Rep. 24:3312–3323.e5. doi: 10.1016/j.celrep.2018.08.061 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Huang X, Chen J, Cao W, Yang L, Chen Q, He J, et al. (2019) The many substrates and functions of NEDD4-1. Cell Death Dis. 10:1–12. doi: 10.1038/s41419-019-2142-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Jiang H, Thomas SN, Chen Z, Chiang CY, Cole PA (2019) Comparative analysis of the catalytic regulation of NEDD4-1 and WWP2 ubiquitin ligases. J. Biol. Chem. 294:17421–17436. doi: 10.1074/jbc.RA119.009211 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Wright PE, Dyson HJ (2015) Intrinsically disordered proteins in cellular signalling and regulation. Nat. Rev. Mol. Cell Biol. 16:18–29. doi: 10.1038/nrm3920 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Hsu W-L, Oldfield C, Meng J, Huang F, Xue B, Uversky VN, et al. (2012) Intrinsic protein disorder and protein-protein interactions. Pac. Symp. Biocomput. Pac. Symp. Biocomput.:116–127. [PubMed] [Google Scholar]
- 72.Uversky VN (2018) Intrinsic Disorder, Protein-Protein Interactions, and Disease. Adv. Protein Chem. Struct. Biol. 110:85–121. doi: 10.1016/bs.apcsb.2017.06.005 [DOI] [PubMed] [Google Scholar]
- 73.Dunker AK, Cortese MS, Romero P, Iakoucheva LM, Uversky VN (2005) Flexible nets. FEBS J. 272:5129–5148. doi: 10.1111/j.1742-4658.2005.04948.x [DOI] [PubMed] [Google Scholar]
- 74.Madhavi Sastry G, Adzhigirey M, Day T, Annabhimoju R, Sherman W (2013) Protein and ligand preparation: parameters, protocols, and influence on virtual screening enrichments. J. Comput. Aided Mol. Des. 27:221–234. doi: 10.1007/s10822-013-9644-8 [DOI] [PubMed] [Google Scholar]
- 75.Schrodinger (2015) The PyMOL Molecular Graphics System, Version 1.8.
- 76.Lex A, Gehlenborg N, Strobelt H, Vuillemot R, Pfister H (2014) UpSet: Visualization of Intersecting Sets. IEEE Trans. Vis. Comput. Graph. 20:1983–1992. doi: 10.1109/TVCG.2014.2346248 [DOI] [PMC free article] [PubMed] [Google Scholar]