Abstract
The WAVE regulatory pentameric complex regulates actin remodeling. Two components of it (CYFIP2 and NCKAP1) are encoded by genes whose genetic mutations increase the risk for autism spectrum disorder (ASD) and related neurodevelopmental disorders. Here, we use a newly developed computational protocol and hotspot analysis to uncover the functional impact of these mutations at the interface of the correct isoforms of the two proteins into the complex. The mutations turn out to be located on the surfaces involving the largest number of hotspots of the complex. Most of them decrease the affinity of the proteins for the rest of the complex, but some have the opposite effect. The results are fully consistent with the available experimental data. The observed changes in the WAVE regulatory complex stability might impact on complex activation and ultimately play a role in the aberrant pathway of the complex, leading to the cell derangement associated with the disease.
Keywords: autism‐linked mutations, hotspot analysis, protein isoforms, protein–protein docking, WAVE regulatory complex
1. INTRODUCTION
Autism spectrum disorder (ASD) is a neurodevelopmental disorder characterized by deficits in social interaction and communication and restricted or repetitive patterns of interests and behaviors. ASD has a strong genetic component (Gaugler et al. 2014; Willsey et al. 2022), and hundreds of new ASD risk genes have been discovered with the advent of genomics (De Rubeis et al. 2014; Fu et al. 2022; Iossifov et al. 2014; Satterstrom et al. 2020; Zhou et al. 2022).
Among the genetic loci that have been recently associated with increased ASD liability are CYFIP2 (Cytoplasmic FMR1‐Interacting protein 2) and NCKAP1 (Non‐Catalytic region of tyrosine Kinase Associated Protein 1 (Fu et al. 2022)). In fact, based on rare de novo heterozygous single‐nucleotide variation, CYFIP2 and NCKAP1 both meet exome‐wide significance for association with ASD (Fu et al. 2022). Further, de novo heterozygous missense mutations in CYFIP2 have been related to a form of developmental and epileptic encephalopathy characterized by early‐onset intractable seizures, severe to profound developmental delay, hypotonia, and mild facial dysmorphisms (MIM #618008) (Nakashima et al. 2018; Zweier et al. 2019). De novo or inherited variants likely leading to NCKAP1 haploinsufficiency have been reported in individuals with ASD or autistic features, language and motor delays, variable degrees of intelligence disability, or learning disabilities (Guo et al. 2020). These clinical associations are compatible with the observation that both genes are under significant genomic constraints, as evidenced by their intolerance to missense or loss‐of‐function genetic variance in the general population (Karczewski et al. 2021).
The proteins encoded by CYFIP2 (also known as PIR121) and NCKAP1 (also known as NAP1) are part of a highly conserved hetero‐pentameric complex that regulates actin remodeling, the so‐called WAVE regulatory complex (WRC). The latter is formed by an elongated pseudo‐symmetric dimer formed by CYFIP1/2 and NCKAP1 (Chen et al. 2010) and then a trimer (Chen et al. 2010) consisting of the Abl interactor 2 (ABI2; or the orthologues ABI1 or ABI3), the hematopoietic stem progenitor cell 300 (HSPC300), and the Actin‐binding protein WASF1 (WAVE1; or the orthologues WAVE2 or WAVE3) (Rottner et al. 2021) (Figure 1). Combinations of different orthologs of each of these components can yield diverse WRC isoforms, likely with tissue, cellular, or developmental specificity. In particular, three CYFIP2 and two NCKAP1 isoforms are associated with ASD (CYFIP2iso1, CYFIP2iso2, and CYFIP2iso3 (a computationally mapped potential isoform (Consortium TU 2022); Table S1; NCKAP1iso1 and NCKAP1iso2, Table S2)).
WRC is critical for modulating actin remodeling: indeed, the VCA region of WAVE1 (WCA; Figure 1) binds to and activates the Arp2/3 complex to promote actin filament (Figure S1). In the basal state, the functional role of WCA is inhibited by interacting with CYFIP1/2 (Chen et al. 2010). When CYFIP1/2 binds two Rac1 proteins in specific sites (A‐site and D‐site in Figure S1a) (Chen et al. 2017; Schaks et al. 2018; Ding et al. 2022), it releases the WCA (“activation” process) (Figure S1). This process is essential for brain development and function, including the formation and maturation of synapses (De Rubeis et al. 2013; Hsiao et al. 2016; Oguro‐Ando et al. 2015; Pathania et al. 2014).
Studies have begun to examine the structural and functional impact of some disease‐associated mutations located in CYFIP2 or NCKAP1 (WAVE1, HSPC300, and ABI2 variants play no role in ASD (Fu et al. 2022); see Tables S1 and S2). The Arg87 (R87) of CYFIP2 is a mutational hotspot, and p.Arg87Cys/Ser/His/Pro/Lys pathogenic substitutions have been identified as associated with neurodevelopmental manifestations (Begemann et al. 2021; Zweier et al. 2019). R87 is located at the CYFIP2 interface with WAVE1 (Figure 1), as shown by an X‐ray structure of the entire WRC (Chen et al. 2010). The affinity of wild‐type (WT) CYFIP2 for WCA is larger than that of the R87C/L/P CYFIP2 ASD‐linked variants (Nakashima et al. 2018). Such high affinity is essential to keep WRC in an inactive state (Chen et al. 2010; Takenawa and Suetsugu 2007), and indeed, R87C (along with p.Ile664Met, p.Glu665Lys, p.Asp724His, and p.Gln725Arg, all of these ASD‐associated mutations involving residues located at the CYFIP2 interface with WAVE1 (Chen et al. 2010)) activate WRC abnormally, leading to aberrant lamellipodia (Schaks et al. 2020). These phenomena might be caused, at least in part, by a decrease in stability of the WAVE1/CYFIP2 interface (Figure 1) and/or by structural changes in the variants (Biembengut et al. 2022; Schaks et al. 2020). The p.Ala455Pro(CYFIP2) ASD‐linked variant leads to lamellipodia (Schaks et al. 2020). A455(CYFIP2) is very close to the CYFIP2/NCAKP1 interface (Chen et al. 2010). This may perturb the latter, perhaps via an allosteric mechanism (Schaks et al. 2020).
Here, we sought to systematically assess the structural impact of de novo missense mutations in CYFIP2 and NCKAP1 associated with ASD and co‐morbid neurodevelopmental manifestations in the assembly and function of the WRC, by prioritizing those hitting residues located at or closed to the protein/protein interface. These are almost all the mutations in CYFIP2 (14 out of 18, Tables 1 and S1), and very few in NCKAP1 (2 out of 15, Tables 1 and S2). For the first time, we map these disease‐linked mutations on the appropriate isoforms of CYFIP2 and NCKAP1 (Tables S1 and S2). It is essential to consider the different isoforms of the same protein because isoforms are diversely expressed in tissues. For instance, CYFIP2iso2 is mainly present in the human fetal brain (Patowary et al. 2024). First, we use a combination of homology modeling, blind protein–protein docking, all‐atom post‐docking refinement, and free energy‐based re‐ranking to predict the structural determinants of the WT WRC. The calculations are based on the X‐ray structure of the complex (Chen et al. 2010) and consider the correct isoforms for CYFIP2 and NCKAP1.1 Next, we perform a hotspot prediction on the predicted complexes to get qualitative insights on the relative stability of the eight protein/protein contact surfaces of the complex (Figure 1). Strikingly, we find that the ASD‐linked mutations in Table 1 affect the two interfaces of WRC with the largest number of hotspots by far, possibly impacting WRC stability. The impact of the ASD‐linked variants on the structural determinants of the complexes is finally investigated by our set of tools. These results could pave the way for a detailed blue map of WRC‐mediated biological pathways.
TABLE 1.
CYFIP2 | Mutation(s) | NCKAP1 | Mutation |
---|---|---|---|
Isoform 1 | p.Arg87Cys/Leu/Pro | Isoform 2 | p.Pro372Thr |
p.Ala455Pro | |||
p.Ile664Met | |||
p.Glu665Lys | |||
p.Asp724His | |||
p.Gln725Arg | |||
Isoform 2 | p.Arg87Cys | ||
Isoform 3 | p.Ile613Met | p.Arg675Thr | |
p.Glu614Lys | |||
p.Tyr639Cys | |||
p.Gln674Arg | |||
p.Arg744Cys |
2. RESULTS AND DISCUSSION
2.1. CYFIP2iso1– 3 ASD‐linked variants
We first predicted the structure of the WT complexes with the three isoforms of CYFIP2, and then, based on this structural information, we investigated the effect of mutations associated with ASD.
2.1.1. Structural predictions of wild‐type CYFIP2iso1– 3·NCKAP1iso1 ·ABI2·HSPC300·WAVE1 complex (C2iso1–3,WT ·WRC hereafter)
We predicted the structural determinants of the three isoforms using homology modeling based on the WT CYFIP1iso1·NCKAP1iso1·ABI2·HSPC300·WAVE1 (C1iso1,WT·WRC hereafter) X‐ray structure (PDB ID: 3P8C (Chen et al. 2010)). The RMSD on the Cα atoms is 0.9 Å or less compared to the latter (Table S3), suggesting that the three CYFIP2 isoforms are accommodated in the complex similarly to CYFIP1iso1 (Figure 2a). Indeed, the isoforms differ only by the following: (i) The residues 558–582 (hereafter called MR1) are present only in CYFIP2iso1. They interact with WAVE1 N‐terminus. (ii) Residues 70–95 (hereafter called MR2) are present only in CYFIP2iso1–2. (iii) The first 50 residues of CYFIP2iso3 are unfolded. The sequence differences between CYFIP1iso1 and three CYFIP2 isoforms are shown in Figure S2a and Table S4. The structure of CYFIP1iso1 is the most similar to the CYFIP2iso2, without MR1 and having the MR2 (Figure 2a).
The three CYFIP2 isoforms interact with NCKAP1, WAVE1 (Figure 2b,c), HSPC300, ABI2 (Figures S2–S4 and Tables S5–S7) as CYFIP1iso1 does in the C1iso1,WT·WRC X‐ray structure (Chen et al. 2010) (Figures 1 and 2a). MR1 and MR2 are located at the CYFIP2/WAVE1 interface (Figure 2b).
The CYFIP2iso1–3/NCKAP1iso1 and CYFIP2iso1–3/WAVE1 feature the highest number of hotspots by far, irrespectively of the method used (Table 2 and Figures 3 and S5–S7), suggesting that quite tight interactions are formed at these interfaces. Thus, the ASD‐linked mutations of Table 1 perturb the two most critical protein/protein interactions of the complex in terms of hotspots, either directly by removing (R87(CYFIP2iso1–2), Q674(CYFIP2iso3), or D724(CYFIP2iso1), see Figures 3 and S5) or indirectly by altering the protein/protein interactions at the interfaces.
TABLE 2.
Interfaces | mCSM | BeAtMuSiC | FoldX |
---|---|---|---|
CYFIP2/NCKAP1 | 58/54/60 | 47/53/54 | 93/95/111 |
CYFIP2/WAVE1 | 51/45/34 | 40/36/26 | 79/66/55 |
CYFIP2/HSPC300 | 21/23/19 | 11/9/10 | 24/15/19 |
CYFIP2/ABI2 | 8/7/10 | 8/6/6 | 16/9/13 |
NCKAP1/ABI2 | 26/19/23 | 11/8/11 | 16/26/18 |
WAVE1/HSPC300 | 17/13/12 | 14/15/11 | 40/33/33 |
WAVE1/ABI2 | 10/17/21 | 18/16/15 | 28/32/30 |
HSPC300/ABI2 | 3/4/5 | 11/11/10 | 21/21/18 |
2.1.2. Structural predictions of C2iso1–3·WRC variants
A quick way to model variants is to replace the amino acids at the protein/protein interface involved in the mutations. However, this approach does not consider the potential rearrangement of the interface caused by the mutation. This may be particularly important when dealing with homology models and not experimental structures (Fiser 2010). Here, to address this issue, we propose a docking‐based strategy by combining blind docking, refinement, and energy‐based re‐ranking (Figure 4). Specifically, we use the relative affinity index (see section 4 for a definition) to qualify the impact of mutations and isoforms on the stability of the complexes. Of course, this does not consider entropy effects and is used strictly for qualitative comparisons. If available, we report experimental findings on binding affinities.
CYFIP2iso1 /WAVE1 interface
For all the variants at the interface, the contact maps hardly change on passing from the WT to the ASD‐linked variants (Figures S10–S13), except, of course, the region of the mutations.
C2 iso1,R87C ·WRC: C87(CYFIP2iso1) forms a weaker hydrogen bond with Y2557(WAVE1) (Figure 5b) than that formed by R87(CYFIP2iso1), a hotspot, with the same residue in the WT complex. These observations are consistent with the decrease in experimental binding affinity (Nakashima et al. 2018) and in the calculated affinity index between the two proteins on passing from the WT to the mutant (Figure 5a). As a functional region of WAVE1, the disrupted binding between WCA and CYFIP2 could activate the WRC (Schaks et al. 2018). Moreover, decreased interactions between the Stem region/α‐6 helix of WAVE1 and CYFIP2 have a similar effect on WRC activation (Chen et al. 2010). Therefore, by affecting the CYFIP2iso1/WAVE1 interface, the mutation can activate the WRC aberrantly (Schaks et al. 2018; Schaks et al. 2020), matching what observed in sub‐cellular experiments (Schaks et al. 2020).
C2 iso1,R87L/R87P ·WRC: L87(CYFIP2iso1) and P87(CYFIP2iso1) form weak hydrophobic interactions with Y2557(WAVE1) (Figure 5d,e). The mutations remove a hotspot. Consistently, these mutations decrease the calculated (Figure 5a) and experiment binding affinities (Nakashima et al. 2018).
C2 iso1,I644M ·WRC: M664(CYFIP2iso1) in the mutant and I664(CYFIP2iso1) in the WT complex form hydrophobic interactions with W2567(WAVE1), a hotspot (Figure 3): The latter interactions are stronger than the first (Figure 5f,g). Consistently, the affinity index decreases upon mutation slightly (Figure 5a). This variant, like C2iso1,R87C·WRC, activates WRC aberrantly (Schaks et al. 2020). Similar arguments apply to the E665K, D724H, and Q725R variants (see below).
C2 iso1,E665K ·WRC: K665(CYFIP2iso1) does not interact with WAVE1 (Figure 5h). In contrast, E665(CYFIP2iso1) forms a salt bridge with K2570(WAVE1) (Figure 5i) in the WT complex. Consistently, the affinity index decreases upon mutation (Figure 5a), supporting the hypothesis that this mutation could activate WRC aberrantly in the sub‐cellular experiment (Schaks et al. 2020). In contrast with the other mutation sites in CYFIP2iso1, E665(CYFIP2iso1) is neither a hotspot nor close to it (Figure 3).
C2 iso1,D724H ·WRC: H724(CYFIP2iso1) forms hydrogen bonds with Q2516(WAVE1) and D2656(WAVE1) (Figure 5j) and disrupts the H‐bond between R795(CYFIP2iso1) (a hotspot) and Q2515(WAVE1) (Figures 3 and 5j,k). In the WT complex, hotspot residue D724(CYFIP2iso1) forms H‐bonds with Q2515(WAVE1) and Q2516(WAVE1) (Figures 3 and 5k). The affinity index decreases (Figure 5a) consistently with removing a hotspot and with the experimentally observed aberrant C2iso1,D724H·WRC activation (Schaks et al. 2020).
C2 iso1,Q725R ·WRC: R725(CYFIP2iso1) does not interact with WAVE1 (Figure 5l). In the WT complex, instead, Q725 forms a hydrogen bond with W2567(WAVE1) (Figure 5m), a hotspot (Figure 3). Thus, the disruption of the interactions with the latter, a hotspot, might significantly affect the binding, as observed in the decrease of the affinity index (Figure 5a). This supports the findings that this mutation aberrantly activates WRC in the subcellular experiments (Schaks et al. 2020).
CYFIP2iso1 /NCKAP1iso1 interface
C2 iso1,A455P ·WRC: The residue at position 455 is not located at the interface, but it is close to it. The mutation turns out to break H2395(NCKAP1iso1)/Y687(CYFIP2iso1) π–π stacking interactions (Figure 6b,c). The affinity index decreases (Figure 6a) in accordance with the fact that Y687 is a hotspot (Figure 3). The mutation can activate the WRC through potential allosteric regulation (Schaks et al. 2020). Our predicted models offer structural insights into this potential mechanism: the A455P mutation could impair the binding between CYFIP2iso1 and NCKAP1iso1 indirectly by weakening the interactions of the hotspot, possibly resulting in the disruption of CYFIP2iso1/WAVE1 interactions and consequent unwanted activation of the WRC (Schaks et al. 2020).
CYFIP2iso2 /WAVE1 interface
C2 iso2,R87C ·WRC: The interaction patterns for both the mutant and WT (Figure 7c,d) complexes are highly similar to those of WT and R87C C2iso1·WRC (Figure 5b,c). The mutation removes the hotspot R87 (Figure S5a), and accordingly, the affinity index also decreases (Figure 7a). Therefore, this mutation should have similar results to the C2iso1,R87C·WRC variant. Our results are consistent with previous molecular dynamics (MD) simulations carried out on CYFIP2iso2/NCKAP1iso1/WAVE1 complex (thus lacking HSPC300 and ABI2 proteins in Figure 2), which also suggest that R87C mutations destabilize CFYIP2iso2/WAVE1 interactions (Biembengut et al. 2022).
CYFIP2iso3 /WAVE1 interface
C2 iso3,I613M ·WRC: For this and the two mutants below, the interaction pattern differs from that of C2iso1,WT·WRC. Here, M613(CYFIP2iso3) forms hydrophobic interactions with F2512(WAVE1) and W2516(WAVE1) (Figure 7e), two hotspots (Figure S5b), while in the WT I613(CYFIP2iso3) interacts only with W2516(WAVE1) (Figure 7f). Consistently, the affinity index increased (Figure 7b).
C2 iso3,E614K ·WRC: K614(CYFIP2iso3) forms a salt bridge with D2523(WAVE1) and van der Waals interactions with M2520(WAVE1) (Figure 7g). In isoform 1, the correspondent residue (K665) does not interact with any WAVE1 residue (Figure 5h). In the WT complex, E614(CYFIP2iso3) is not a hotspot, and there are no other hotspots within 4 Å. It forms only a salt bridge with K2519(WAVE1) (Figure 7h). The affinity index increases slightly (Figure 7b).
C2 iso3,Q674R ·WRC: R674(CYFIP2iso3) forms a cation–π interaction with W2516(WAVE1) (Figure 7i), a hotspot, and van der Waals interactions with M2520(WAVE1).2 Q674(CYFIP2iso3), albeit a hotspot (Figure S5b), forms only a hydrogen bond with W2516(WAVE1) (Figure 7j). Consistently, the affinity index increases upon mutation, albeit only slightly (Figure 7b).
The observed structural and affinity index (Figures 5 and 7) differences between the above three variants and their corresponding mutants in CYFIP2iso1 may be attributed, at least in part, to the presence of the MR2 region. The latter interacts with the α4‐ and α5‐helices immediately adjacent to the α6‐ and C‐helices of WAVE1. The α6‐ and C‐helices are the interacting regions for these mutations aforementioned in this section (Figure 2).
The increased binding between WCA and CYFIP2 upon mutations I613M(CYFIP2iso3), E614K(CYFIP2iso3), Q674R(CYFIP2iso3) might also affect WRC activation, though no direct experimental evidence is currently available.
C2 iso3,R744C ·WRC: C744(CYFIP2iso3) forms a hydrogen bond with Q2462(WAVE1) (Figure 7k). In the WT complex, R744(CYFIP2iso3) forms a stronger hydrogen bond with the same residue (Q2462(WAVE1)) (Figure 7l) and a cation–π interaction with Y677(CYFIP2iso3), which is a hotspot (Figure S5b). In addition, the R744C mutation weakens the hydrogen bond between Y677(CYFIP2iso3) and S2611(WAVE1). Accordingly, the affinity index decreased, albeit slightly (Figure 7b).
CYFIP2iso3 /NCAKP1iso1 interface
C2 iso3,Y639C ·WRC: C639(CYFIP2iso3) does not interact with NCKAP1iso1 (Figure 8b). In the WT complex, Y639(CYFIP2iso3) forms a hydrogen bond with a hotspot of NCKAP1iso1, R2340 (Figures 8c and S5b). Consistently, the affinity index decreases (Figure 8a).
2.2. N1iso2·WRC variants
We constructed the N1iso1,WT·WRC and N1iso2,WT·WRC complexes, starting from the same X‐ray structure as above (PDB ID: 3P8C (Chen et al. 2010)), and following the same protocol (Table S8). As expected, also here the CYFIP2/NCKAP1 and CYFIP2/WAVE1 contact surfaces feature the largest number of hotspots (by far) among the eight protein/protein surfaces of WRC (Figure S14). We constructed the mutants in Table 1 following the same scheme as above.
The only difference between N1iso1,WT·WRC and N1iso2,WT·WRC is an additional loop (36KQGQVW41) in the latter, which is far away from the CYFIP2/NCKAP1 interface involved in the 2 NCKAP1iso2 mutations studied here (Table 1 and Figure 2c). As expected, the contact maps are similar to those of C2iso1–3,WT·WRC (Figures S15–S17).
N1 iso2,P372T ·WRC: T372(NCKAP1iso2) forms hydrophobic interactions with F2400(CYFIP2iso1), a hotspot at the CYFIP2iso1/NCKAP1iso2 interface (Figure 9b). In the WT complex, P372(NCKAP1iso2) forms hydrophobic interactions with the same residue (F2400(CYFIP2iso1)) (Figure 9c). The affinity index does not change (Figure 9a).
N1 iso2,R675T ·WRC: T675(NCKAP1iso2) forms a hydrogen bond with E2308(CYFIP2iso1) (Figure 9d). In the WT complex (Figure 9e), R675(NCKAP1iso2) forms a salt bridge with E2308(CYFIP2iso1), one with D2313(CYFIP2iso1), and a hydrogen bond with N2316(CYFIP2iso1). R675(NCKAP1iso2) is a hotspot (Figure S14a); hence, its replacement with threonine may affect the interface significantly. Consistently, the affinity decreases (Figure 9a).
As expected, the correspondent results for the N1iso2·WRC variants with the other two isoforms of CYFIP2 are very similar (Figures S18 and S19).
3. CONCLUSIONS
Our calculations show that the mutations in Table 1 are located in the protein/protein interfaces tightly bound to each other by the largest number of hotspots. For the first time, we mapped these mutations on the correct isoforms of CYFIP2 and NCKAP1, which is critical given the different distribution of the isoforms in human tissues (Patowary et al. 2024). The mutants affect the stability of the CYFIP2/WAVE1 or /NCKAP1 interface by removing hotspots (R87C/L/P(CYFIP2iso1–2), D724H(CYFIP2iso1), and Q674R(CYFIP2iso3)) or indirectly (A455P(CYFIP2iso1), Y639C(CYFIP2iso3), I613M(CYFIP2iso3), I664M(CYFIP2iso1), Q725R(CYFIP2iso1), R744C(CYFIP2iso3), R675T(NCKAP1iso2)), by affecting the interaction networks of other hotspots. Furthermore, our work provides insight into their effect at the molecular level. Our approach, albeit approximate, goes beyond simple homology modeling, and it is fully consistent with the experiment: in particular, we predict that most mutations in Table 1 weaken the interactions of CYFIP2iso1–3 with their cellular partners WAVE1 and NCKAP1iso1–2. Specifically, R87C(CYFIP2iso1–2), R87L(CYFIP2iso1), R87P(CYFIP2iso1), I664M(CYFIP2iso1), E665K(CYFIP2iso1), D724H(CYFIP2iso1), Q725R(CYFIP2iso1), and R744C(CYFIP2iso3), affect CYFIP2/WAVE1 binding, while A455P(CYFIP2iso1), Y639C(CYFIP2iso3), and R675T(NCKAP1iso2) mutations affect the CYFIP2/NCKAP1 interface. These results provide molecular insights for rationalizing subcellular experiments, which demonstrated that R87C(CYFIP2iso1), A455P(CYFIP2iso1), I664M(CYFIP2iso1), E665K(CYFIP2iso1), D724H(CYFIP2iso1), and Q725R(CYFIP2iso1) variants cause abnormal WRC activation (Schaks et al. 2020). We predict here that R87C(CYFIP2iso2), R87L(CYFIP2iso1), R87P(CYFIP2iso1), Y639C(CYFIP2iso3), R744C(CYFIP2iso3), and R675T(NCKAP1iso2) could also activate WRC aberrantly. Interestingly, a few mutations turn out to strengthen the interactions between the CYFIP2iso3 and WAVE1 (Table 1). We assume that this reinforcement of the interactions may also impair WRC activation by making the release of the WCA region more difficult, potentially affecting downstream actin remodeling. Interestingly, the effect of mutations on the strength of the interaction changes, at times, when passing from CYFIP2iso3 to CYFIP2iso1 (Table 1 and Figures 5 and 7): according to our data, the mutations I613M, E614K, and Q674R in CYFIP2iso3 strengthen the interaction networks of the CYFIP2/WAVE1 interface while the corresponding mutations in CYFIP2iso1 (E664M, E665K, and Q725R) weaken them. Moreover, CYFIP2iso1 exhibits a higher predicted affinity for WAVE1 than the other two isoforms (Figures S20 and S21; see Data S1 for details). Thus, the three isoforms of CYFIP2 might exhibit different efficiencies in the WRC pathway (Figure S1). Our results suggest that it is key to consider the correct isoforms of the proteins in terms of the formation of complexes and the impact of disease‐related mutations.
4. METHODS
4.1. CYFIP2iso1–3 variants
These were predicted for the CYFIP2iso1–3 and NCKAP1iso1 in the WRC (C2iso1–3·WRC hereafter). We followed a multistep strategy. First, we predicted the structural determinants of the WT complex based on the X‐ray structure at 2.3 Å resolution of the structurally similar CYFIP1iso1 protein (sequence identity as high as 88%, Table S4) in the WRC (PDB ID: 3P8C) (Chen et al. 2010). The SWISS‐MODEL (Waterhouse et al. 2018) and MODELLER V9.19 (Webb and Sali 2021) codes were used. Notice that we do not know which isoforms are involved in the ASD‐linked variants of WRC at present. Thus, we used the canonical isoforms of ABI2, HSPC300, WAVE1, and NCKAP1 in the 3P8C structure (Table S9). Next, we established a protocol for building the variants using docking procedures between WT CYFIP2iso1–3 and the rest of the complex (Figure 4).
Blind docking of CYFIP2 iso1–3 onto the rest of the complex was performed by the GRAMM web server (Singh et al. 2024). The number of scans that match the output was set to 300,000, and the top 10 predicted structures were considered for successive refinement.
Refinement of the 10 structures by reducing atomic clashes and optimizing binding poses, followed by minimization using the relax function with InterfaceRelax 2019 parameters. This step was performed with Rosetta 2021.16 (Khatib et al. 2011; Maguire et al. 2021; Tyka et al. 2011).
Energy‐based re‐ranking based on Molecular Mechanics‐Generalized Born Surface Area (MM‐GBSA) (Wang et al. 2019) via the sander and pmemd engines of AMBER22 software (Case et al. 2022). The ff02 force field (Cieplak et al. 2001) and the GBOBC1 model (Onufriev et al. 2004) were used to predict protein–protein affinity, as previous benchmark studies demonstrated they have the best performance in predicting protein–protein complexes (Chen et al. 2016).
In this protocol, we use MM‐GBSA calculations to evaluate the affinity of the protein–protein interactions in the C2iso1–3,WT·WRC and N1iso2,WT·WRC and their ASD‐linked variants. In particular, we use the enthalpy component calculated in this approach as affinity index (which is here taken in its absolute value and normalized to the WT), ΔH MM‐GBSA the value 1.0 refers to the best‐docked conformation of the WT complexes regarding ΔH MM‐GBSA. Besides being very approximate, ΔH MM‐GBSA does not include entropy. Assuming that this contribution does not vary mainly on passing from one complex to the other, we then use this quantity as a qualitative index of stability of protein/protein interface: namely, we use ΔH MM‐GBSA values to establish qualitatively the stability of the mutated complexes relative to the WT structures.
Our protocol was first validated with the WT complexes. It reproduced the structural determinants of the WT WRC, which have been built as described in the previous section (see Data S1 for details). Then, we applied the protocol to the ASD‐linked variants of the protein (Table S10). Only residues at the interface in WT WRC were considered, except for A455(CYFIP2iso1), as this causes aberrant lamellipodia, which may have an apparent effect on the stabilization at the interface (Table 1).
The CPPTRAJ (Roe and Cheatham III 2013), a module of AMBER software, was used to calculate the Cα‐atom root‐mean‐square deviation (RMSD) values and contact map. The interface residues are listed in Tables S5–S7 and S11–S13. The residues within 4 Å are considered as interacting or close to each other. The hydrogen bonding interactions are defined as if the distance between the donor atom (D) and the acceptor atom (A) is less than 3.5 Å and the D‐H‐A angle is larger than 135°. The cation–π interactions are identified by the following criterion: the distance between the cation and the center of the centroid of the π‐system is 5 Å or lower.
4.2. NCKAP1iso2 variants
The ASD‐linked mutations at the CYFIP2/NCKAP1 interface are only mapped on NCKAP1iso2. NCKAP1iso2 variants were predicted for the complex of this protein with CYFIP2iso1–3, along with the other proteins in the WRC (N1iso2·WRC hereafter). We followed the same multistep strategy as above, except that in the homology modeling, we also predicted the structural determinants of NCKAP1iso2, which is very similar to NCKAP1iso1 (99.5% identity), being the only difference K36 replaced by the 36KQGQVWK42 loop. In addition, we validated the procedure to generate the WT complex again using NCKAP1iso2.
4.3. Hotspot predictions
These were performed on the C2iso1–3,WT·WRC and N1iso2,WT·WRC complexes using the mCSM (Pires et al. 2013), BeAtMuSiC (Dehouck et al. 2013), and FoldX (Schymkowitz et al. 2005) servers. The hotspots are defined as the interface residues whose substitution with an alanine leads to a loss of protein/protein binding free energy of 2 kcal/mol or larger (Cukuroglu et al. 2014).
AUTHOR CONTRIBUTIONS
Song Xie: Methodology; writing – original draft; writing – review and editing; visualization; formal analysis. Ke Zuo: Formal analysis; writing – original draft; writing – review and editing. Silvia De Rubeis: Conceptualization; writing – review and editing. Paolo Ruggerone: Conceptualization; funding acquisition; supervision; formal analysis; writing – review and editing. Paolo Carloni: Conceptualization; writing – original draft; writing – review and editing; investigation; funding acquisition.
Supporting information
ACKNOWLEDGMENTS
S.X. acknowledges the support of the China Scholarship Council program (Project ID: 202306650006). K.Z. acknowledges financial support under the National Recovery and Resilience Plan (NRRP), Mission 4, Component 2, Investment 1.1, Call for tender No. 104 published on 2.2.2022 by the Italian Ministry of University and Research (MUR), funded by the European Union–NextGenerationEU–Project Title PpiAsd (Protein–protein interactions in Autism Spectrum Disorder)–CUP F53D23001170006–Grant Assignment Decree adopted on June 30, 2023 by the Italian Ministry of University and Research (MUR). The work at the University of Cagliari was also funded by Health Extended ALliance for Innovative Therapies, Advanced Lab‐research, and Integrated Approaches of Precision Medicine partnership (HEAL ITALIA), foundation by the Italian Ministry of University and Research, PNRR, mission 4, component 2, investment 1.3, project number PE00000019 (University of Cagliari) and under the National Recovery and Resilience Plan (NRRP), Mission 4 Component 2 Investment 1.5–Call for tender No. 3277 published on December 30, 2021 by the Italian Ministry of University and Research (MUR) funded by the European Union–NextGenerationEU. Project Code ECS0000038–Project Title eINS Ecosystem of Innovation for Next Generation Sardinia–CUP J85B17000360007–Concession Decree No. 1056 adopted on June 23, 2022 by the Italian Ministry of University and Research (MUR). Open Access funding enabled and organized by Projekt DEAL.
Xie S, Zuo K, De Rubeis S, Ruggerone P, Carloni P. Molecular basis of the CYFIP2 and NCKAP1 autism‐linked variants in the WAVE regulatory complex. Protein Science. 2025;34(1):e5238. 10.1002/pro.5238
Review Editor: Lynn Kamerlin
Footnotes
Contributor Information
Ke Zuo, Email: k.zuo@stimulate-ejd.eu.
Paolo Ruggerone, Email: paolo.ruggerone@dsf.unica.it.
Paolo Carloni, Email: p.carloni@fz-juelich.de.
REFERENCES
- Begemann A, Sticht H, Begtrup A, Vitobello A, Faivre L, Banka S, et al. New insights into the clinical and molecular spectrum of the novel CYFIP2‐related neurodevelopmental disorder and impairment of the WRC‐mediated actin dynamics. Genet Med. 2021;23:543–554. 10.1038/s41436-020-01011-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Biembengut ÍV, Shigunov P, Frota NF, Lourenzoni MR, de Souza TACB. Molecular dynamics of CYFIP2 protein and its R87C variant related to early infantile epileptic encephalopathy. Int J Mol Sci. 2022;23:8708. 10.3390/ijms23158708 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Case DA, Aktulga HM, Belfon K, Ben‐Shalom I, Berryman JT, Brozell SR, et al. Amber 2022. San Francisco, CA: University of California; 2022. http://ambermd.org/doc12/Amber22.pdf [Google Scholar]
- Chen B, Chou H‐T, Brautigam CA, Xing W, Yang S, Henry L, et al. Rac1 GTPase activates the WAVE regulatory complex through two distinct binding sites. eLife. 2017;6:e29795. 10.7554/eLife.29795 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen F, Liu H, Sun H, Pan P, Li Y, Li D, et al. Assessing the performance of the MM/PBSA and MM/GBSA methods. 6. Capability to predict protein–protein binding free energies and re‐rank binding poses generated by protein–protein docking. Phys Chem Chem Phys. 2016;18:22129–22139. 10.1039/C6CP03670H [DOI] [PubMed] [Google Scholar]
- Chen Z, Borek D, Padrick SB, Gomez TS, Metlagel Z, Ismail AM, et al. Structure and control of the actin regulatory WAVE complex. Nature. 2010;468:533–538. 10.1038/nature09623 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cieplak P, Caldwell J, Kollman P. Molecular mechanical models for organic and biological systems going beyond the atom centered two body additive approximation: aqueous solution free energies of methanol and N‐methyl acetamide, nucleic acid base, and amide hydrogen bonding and chloroform/water partition coefficients of the nucleic acid bases. J Comput Chem. 2001;22:1048–1057. 10.1002/jcc.1065 [DOI] [Google Scholar]
- Consortium TU . UniProt: the universal protein knowledgebase in 2023. Nucleic Acids Res. 2022;51:D523–D531. 10.1093/nar/gkac1052 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cukuroglu E, Engin HB, Gursoy A, Keskin O. Hot spots in protein–protein interfaces: towards drug discovery. Prog Biophys Mol Biol. 2014;116:165–173. 10.1016/j.pbiomolbio.2014.06.003 [DOI] [PubMed] [Google Scholar]
- De Rubeis S, He X, Goldberg AP, Poultney CS, Samocha K, Ercument Cicek A, et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature. 2014;515:209–215. 10.1038/nature13772 [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Rubeis S, Pasciuto E, Li Ka W, Fernández E, Di Marino D, Buzzi A, et al. CYFIP1 coordinates mRNA translation and cytoskeleton remodeling to ensure proper dendritic spine formation. Neuron. 2013;79:1169–1182. 10.1016/j.neuron.2013.06.039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dehouck Y, Kwasigroch JM, Rooman M, Gilis D. BeAtMuSiC: prediction of changes in protein–protein binding affinity on mutations. Nucleic Acids Res. 2013;41:W333–W339. 10.1093/nar/gkt450 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ding BJ, Yang S, Schaks M, Liu YJ, Brown AJ, Rottner K, et al. Structures reveal a key mechanism of WAVE regulatory complex activation by Rac1 GTPase. Nat Commun. 2022;13:5444. 10.1038/s41467-022-33174-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiser A. Template‐based protein structure modeling. In: Fenyö D, editor. Computational biology. Totowa, NJ: Humana Press; 2010. p. 73–94. 10.1007/978-1-60761-842-3_6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fu JM, Satterstrom FK, Peng M, Brand H, Collins RL, Dong S, et al. Rare coding variation provides insight into the genetic architecture and phenotypic context of autism. Nat Genet. 2022;54:1320–1331. 10.1038/s41588-022-01104-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gaugler T, Klei L, Sanders SJ, Bodea CA, Goldberg AP, Lee AB, et al. Most genetic risk for autism resides with common variation. Nat Genet. 2014;46:881–885. 10.1038/ng.3039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guo H, Zhang Q, Dai R, Yu B, Hoekzema K, Tan J, et al. NCKAP1 disruptive variants lead to a neurodevelopmental disorder with core features of autism. Am J Human Gen. 2020;107:963–976. 10.1016/j.ajhg.2020.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hsiao K, Harony‐Nicolas H, Buxbaum JD, Bozdagi‐Gunal O, Benson DL. Cyfip1 regulates presynaptic activity during development. J Neurosci. 2016;36:1564–1576. 10.1523/jneurosci.0511-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Iossifov I, O'Roak BJ, Sanders SJ, Ronemus M, Krumm N, Levy D, et al. The contribution of de novo coding mutations to autism spectrum disorder. Nature. 2014;515:216–221. 10.1038/nature13908 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. Author correction: the mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2021;590:E53. 10.1038/s41586-020-03174-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Khatib F, Cooper S, Tyka MD, Xu K, Makedon I, Popović Z, et al. Algorithm discovery by protein folding game players. Proc Natl Acad Sci. 2011;108:18949–18953. 10.1073/pnas.1115898108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maguire JB, Haddox HK, Strickland D, Halabiya SF, Coventry B, Griffin JR, et al. Perturbing the energy landscape for improved packing during computational protein design. Proteins Struct Func Bioinf. 2021;89:436–449. 10.1002/prot.26030 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nakashima M, Kato M, Aoto K, Shiina M, Belal H, Mukaida S, et al. De novo hotspot variants in CYFIP2 cause early‐onset epileptic encephalopathy. Ann Neurol. 2018;83:794–806. 10.1002/ana.25208 [DOI] [PubMed] [Google Scholar]
- Oguro‐Ando A, Rosensweig C, Herman E, Nishimura Y, Werling D, Bill BR, et al. Increased CYFIP1 dosage alters cellular and dendritic morphology and dysregulates mTOR. Mol Psychiatry. 2015;20:1069–1078. 10.1038/mp.2014.124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Onufriev A, Bashford D, Case DA. Exploring protein native states and large‐scale conformational changes with a modified generalized born model. Proteins Struct Func Bioinf. 2004;55:383–394. 10.1002/prot.20033 [DOI] [PubMed] [Google Scholar]
- Pathania M, Davenport EC, Muir J, Sheehan DF, López‐Doménech G, Kittler JT. The autism and schizophrenia associated gene CYFIP1 is critical for the maintenance of dendritic complexity and the stabilization of mature spines. Transl Psychiatry. 2014;4:e374. 10.1038/tp.2014.16 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Patowary A, Zhang P, Jops C, Vuong CK, Ge X, Hou K, et al. Developmental isoform diversity in the human neocortex informs neuropsychiatric risk mechanisms. Science. 2024;384:eadh7688. 10.1126/science.adh7688 [DOI] [PubMed] [Google Scholar]
- Pires DEV, Ascher DB, Blundell TL. mCSM: predicting the effects of mutations in proteins using graph‐based signatures. Bioinformatics. 2013;30:335–342. 10.1093/bioinformatics/btt691 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Roe DR, Cheatham TE III. PTRAJ and CPPTRAJ: software for processing and analysis of molecular dynamics trajectory data. J Chem Theory Comput. 2013;9:3084–3095. 10.1021/ct400341p [DOI] [PubMed] [Google Scholar]
- Rottner K, Stradal TEB, Chen B. WAVE regulatory complex. Curr Biol. 2021;31:R512–R517. 10.1016/j.cub.2021.01.086 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Satterstrom FK, Kosmicki JA, Wang J, Breen MS, De Rubeis S, An J‐Y, et al. Large‐scale exome sequencing study implicates both developmental and functional changes in the neurobiology of autism. Cell. 2020;180:568–584.e23. 10.1016/j.cell.2019.12.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schaks M, Reinke M, Witke W, Rottner K. Molecular dissection of neurodevelopmental disorder‐causing mutations in CYFIP2. Cells. 2020;9:1355. 10.3390/cells9061355 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schaks M, Singh SP, Kage F, Thomason P, Klünemann T, Steffen A, et al. Distinct interaction sites of Rac GTPase with WAVE regulatory complex have non‐redundant functions in vivo. Curr Biol. 2018;28:3674–3684.e6. 10.1016/j.cub.2018.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schymkowitz J, Borg J, Stricher F, Nys R, Rousseau F, Serrano L. The FoldX web server: an online force field. Nucleic Acids Res. 2005;33:W382–W388. 10.1093/nar/gki387 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Singh A, Copeland MM, Kundrotas PJ, Vakser IA. GRAMM web server for protein docking. In: Gore M, Jagtap UB, editors. Computational drug discovery and design. New York, NY: Springer; 2024. p. 101–112. 10.1007/978-1-0716-3441-7_5 [DOI] [PubMed] [Google Scholar]
- Takenawa T, Suetsugu S. The WASP–WAVE protein network: connecting the membrane to the cytoskeleton. Nat Rev Mol Cell Biol. 2007;8:37–48. 10.1038/nrm2069 [DOI] [PubMed] [Google Scholar]
- Tyka MD, Keedy DA, André I, DiMaio F, Song Y, Richardson DC, et al. Alternate states of proteins revealed by detailed energy landscape mapping. J Mol Biol. 2011;405:607–618. 10.1016/j.jmb.2010.11.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang E, Sun H, Wang J, Wang Z, Liu H, Zhang JZH, et al. End‐point binding free energy calculation with MM/PBSA and MM/GBSA: strategies and applications in drug design. Chem Rev. 2019;119:9478–9508. 10.1021/acs.chemrev.9b00055 [DOI] [PubMed] [Google Scholar]
- Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, et al. SWISS‐MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46:W296–W303. 10.1093/nar/gky427 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Webb B, Sali A. Protein structure modeling with modeller. In: Chen YW, Yiu C‐PB, editors. Structural genomics: general applications. New York, NY: Springer; 2021. p. 239–255. 10.1007/978-1-0716-0892-0_14 [DOI] [Google Scholar]
- Willsey HR, Willsey AJ, Wang B, State MW. Genomics, convergent neuroscience and progress in understanding autism spectrum disorder. Nat Rev Neurosci. 2022;23:323–341. 10.1038/s41583-022-00576-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhou X, Feliciano P, Shu C, Wang T, Astrovskaya I, Hall JB, et al. Integrating de novo and inherited variants in 42,607 autism cases identifies mutations in new moderate‐risk genes. Nat Genet. 2022;54:1305–1319. 10.1038/s41588-022-01148-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zweier M, Begemann A, McWalter K, Cho MT, Abela L, Banka S, et al. Spatially clustering de novo variants in CYFIP2, encoding the cytoplasmic FMRP interacting protein 2, cause intellectual disability and seizures. Eur J Hum Genet. 2019;27:747–759. 10.1038/s41431-018-0331-z [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.