Abstract
We have investigated covalent conjugation of VPPPVPPRRRX′ peptide (where X′ denotes Nε-chloroacetyl lysine) to N-terminal SH3 domain from adapter protein Grb2. Our experimental results confirmed that the peptide first binds to the SH3 domain noncovalently before establishing a covalent linkage through reaction of X′ with the target cysteine residue C32. We have also confirmed that this reaction involves a thiolate-anion form of C32 and follows the SN2 mechanism. For this system, we have developed a new MD-based protocol to model the formation of covalent conjugate. The simulation starts with the known coordinates of the noncovalent complex. When two reactive groups come into contact during the course of the simulation, the reaction is initiated. The reaction is modeled via gradual interpolation between the two sets of force field parameters that are representative of the noncovalent and covalent complexes. The simulation proceeds smoothly, with no appreciable perturbations to temperature, pressure or volume, and results in a high-quality MD model of the covalent complex. The validity of this model is confirmed using the experimental chemical shift data. The new MD-based approach offers a valuable tool to explore the mechanics of protein-peptide conjugation and build accurate models of covalent complexes.
Subject terms: Peptides, Proteins, Structural biology, Molecular modelling, NMR spectroscopy, Molecular biophysics
Introduction
Various natural and modified peptides are broadly used in modern clinical practice. For example, gramicidin (topical antibiotic), insulin (life-saving diabetes treatment), oxytocin (used to induce and support labor), goserelin and degarelix (prostate cancer drugs) all fall in this category1. The advantages of peptide therapy are well recognized. Peptide drugs are highly selective, which minimizes their off-target interactions and lowers their toxicity. They are also affordable compared to some other classes of drugs, such as antibodies. At the same time, there is a number of limiting factors. As a rule, peptide therapeutics are not orally bioavailable. They can only be aimed at extracellular targets (the problem of effective peptide delivery into cancer cells or other target cells remains unresolved). They are also rapidly cleared from the blood through kidneys and liver. In this situation, successful peptide therapeutics are mainly analogues of natural hormones or slightly modified versions thereof2.
One interesting direction in this area is development of peptides that are capable of covalently binding to their targets (and thus belong to a broad class of covalent drugs). Ideally, this can be accomplished via structure-based design. Using crystallographic or NMR coordinates of (noncovalent) peptide-protein complex, one can determine the site where a covalent bond can be engineered. Subsequently, the peptide needs to be re-designed such as to incorporate a reactive group in the selected position. This group should be capable of forming a bond with one of the protein moieties that are located nearby in the structure of the complex.
The key to successful design is a proper choice of reactive group3. If its reactivity is too high, the peptide will bond with many randomly encountered proteins, i.e. will be wasted through off-target binding. Conversely, if the reactivity is too low, the peptide will be cleared before it has a chance to react with its target, i.e. it will not serve its intended purpose as a covalent ligand.
For a properly chosen reactive group, one expects the following scenario. First, the peptide forms a noncovalent complex with its protein target. This brings the peptide’s reactive group in close proximity to its target moiety on the protein surface, thus achieving high “local concentration of the reactants”. Under these favorable conditions, the reaction runs its course, linking the peptide to the protein.
What can be accomplished by implementing a covalent peptidic ligand for the biomedically relevant target protein? Clearly, this improves the binding affinity (formally speaking, the dissociation constant Kd of the covalent ligand approaches zero). It should also alleviate the situation with rapid peptide clearance. In practice, however, one may expect only a limited effect. For instance, consider a covalent peptide targeted at a certain transmembrane receptor. Prior to covalent bonding, the peptide spends much of its lifetime as noncovalent ligand (as indicated above, the conjugation chemistry must be of necessity slow). After bonding, its remaining lifespan is limited by receptor internalization and degradation. Therefore, potential gains from the use of reactive peptides are likely less than can be (naively) expected.
Nevertheless, we believe that reactive peptides can elicit a significantly different response compared to their conventional analogues. This includes changes in binding efficiency, changes in receptor-mediated endocytosis and even structural changes in the receptor triggered by covalent bonding. Along these lines, it is entirely possible that hypothetical reactive versions of goserelin or degarelix will have more favorable properties than their prototypes. Broadly speaking, development of covalent peptides adds a certain new dimension to the field of peptide therapeutics.
Recently there has been a significant amount of activity in this direction. In many cases, peptides were designed to react with cysteine thiols. The repertoire of reactive groups used in these studies included acrylamide4–6, chloroacetamide7,8 and fluoromethyl ketone9. However, free surface cysteines are relatively rare. Even more importantly, they are only encountered in intracellular proteins. Given the current difficulties with efficient peptide delivery into specific type cells10,11, this strategy may not be readily translatable into clinical practice. The attractive alternative is to employ conjugation chemistry aimed at lysine amine group. Using this line of approach, the peptides have been equipped with sulfonyl fluoride groups12–14 and various other types of reactive groups12,15. This strategy, however, also has its shortcomings – for example, sulfonyl fluoride forms unstable adducts with histidine and cysteine.
All of the studies cited above make use of intracellular protein targets (in this sense, they provide a proof of principle rather than a practical pharmacological solution). Separately, one should mention the work of Marquez et al.16 who used a peptide with dinitrofluorobenzene reactive group targeted at vascular endothelial growth factor (VEGF). VEGF circulates in blood and is a bona fide drug target. Furthermore, Assefa et al. designed a covalent version of peptidic antagonist for gonadotropin-releasing hormone receptor (GNRHR), the same receptor as targeted by goserelin and degarelix17. Their study, however, makes use of photoreactive azidobenzoyl group that requires ultraviolet irradiation, which is highly harmful for cells. In future, we anticipate further efforts to design covalent peptides targeted at extracellular domains of various transmembrane receptors. It can be argued that such peptides have the best chance to be successfully developed into therapeutics.
In this study, we seek to develop new modeling tools to assist in design of covalent peptide ligands. As a model system, we have chosen the complex between N-terminal SH3 domain of adaptor protein Grb2 and the peptide corresponding to the Grb2-binding sequence in the Ras guanine nucleotide exchange factor Sos1. Recently, this system has been used by Yu et al. to demonstrate covalent peptide binding via chloroacetamide group8. For the purpose of our study, we have chosen 11-residue peptide VPPPVPPRRRX′, where X′ denotes Nε-chloroacetyl lysine (in what follows this peptide is abbreviated Sos1-X′). We have started with a comprehensive experimental characterization of the Grb2 N-SH3 interaction with Sos1-X′. Heteronuclear NMR spectroscopy was used to confirm the noncovalent binding of this modified peptide and determine the corresponding binding constant. NMR experiments were further used to monitor the process of covalent bonding and find half-life of the reaction. As a next step, tandem mass spectrometry with collision induced dissociation was employed to confirm selective bonding of Sos1-X′ with C32 residue in Grb2 N-SH3, resulting in formation of thioether linkage. Finally, the pH dependence of the conjugation rate has been assessed by means of gel electrophoresis, confirming that the reaction proceeds through the ionic form of the thiol group.
With this information in mind, we set out to develop an MD model of Sos1-X′/Grb2 N-SH3 conjugation. As a starting point, we used the structure of noncovalent complex between Grb2 N-SH3 and Sos1 peptide18. In this structure, the X′ residue (Nε-chloroacetyl lysine) was added to the C-terminus of the peptide. The force-field parameters for X′ have been calculated using the appropriate computational tools, including quantum-chemical calculations. The obtained system is then used to conduct an MD simulation.
During this simulation we continuously monitor the distance between the chloroacetyl group in the Sos1-X′ peptide and the thiolate-anion group of residue C32 in Grb2 N-SH3. When the two groups approach each other to within a certain distance, the algorithm “flips a coin” in order to decide whether to initiate the chemical reaction.
The reaction is modeled by interpolating from one set of force field parameters to another. Initially, the simulation uses a set of force field constants pertaining to X′ and C32 (thiolate anion) residues representing two separate entities. In the end, it uses a set of constants pertinent to X′ and C32 coupled via the thioether linkage. In between the initial and the final point, the algorithm smoothly interpolates between these two parameter sets. In essence, the algorithm imitates gradual dissipation of old chemical bonds and emergence of the new ones.
The above procedure is clearly empirical. However, two aspects should be emphasized here. (1) The transition is executed sufficiently slowly such as to avoid any significant perturbations to the simulated system. MD thermostat and barostat successfully cope with the incremental changes in force field constants, so that the transition can be considered adiabatic. (2) While our protocol disregards the actual characteristics of the energy barrier separating the two states, it offers a smooth path from the known initial state (noncovalent complex) to the unknown final state (covalent complex). As a point of comparison, several well-established methods such as Empirical Valence Bond (EVB)19,20 or Adiabatic Reactive Molecular Dynamics (ARMD)21,22 usually rely on the known coordinates of the initial and final states, focusing their attention on the transition state.
Using the above protocol, we have recorded 11 MD trajectories, each of which is comprised of: (i) 0.5 μs conventional unrestrained simulation for the noncovalent complex, (ii) additional variable-length period of conventional simulation when the reaction can be initiated, (iii) 2 ns steered adiabatic transition to the covalent complex, (iv) 0.5 μs conventional unrestrained simulation for the covalent complex. In this manner, we have obtained what we believe to be a high-quality MD model of the covalent Sos1-X′/Grb2 N-SH3 complex. To validate this model, we have analyzed the NMR chemical shifts in this covalent complex. The comparison of the MD-based predictions with the experimental results confirmed (within the uncertainty margin of the predictor) the validity of the reported model.
Results
Noncovalent binding between Sos1-X′ and Grb2 N-SH3
Grb2 is an adaptor protein that consists of a central SH2 domain, which preferentially binds to phosphotyrosine-containing motifs, and two flanking SH3 domains, which bind to proline-rich motifs. It is an important hub in growth-factor signaling, perhaps best known for its role in Ras/MAPK pathway23,24. Briefly, stimulation of cell by epidermal growth factor (EGF) causes dimerization of EGF receptor (EGFR), further leading to activation of cytoplasmic tyrosine kinase domains and autophosporylation of EGFR cytoplasmic tails. Grb2 binds through its SH2 domain to two of the phosphotyrosine-containing sequences within the EGFR tails, aided by a fellow adaptor protein Shc25. At the same time, Grb2 binds through its two SH3 domains to proline-rich sequences within the disordered tail of the guanine nucleotide exchange factor Sos1, thus recruiting Sos1 to cell membrane and facilitating its interaction with membrane-anchored small GTPase Ras26. This interaction induces GDP to GTP exchange in Ras, initiating downstream signaling events that have fundamental importance for cell growth and proliferation.
Binding of Sos1 to Grb2 N-SH3 occurs through short linear proline-rich motifs27–29. Therefore, it can be readily characterized by solving a structure of an isolated SH3 domain in complex with the corresponding peptides. Several such structures have been determined by means of solution-state NMR18,30,31. It was recognized early on that these structures provide a potential basis to design therapeutics aimed at Grb2 SH3 domains32. However, somewhat generic character of SH3 module and the lack of high-affinity binding pockets makes it an extremely difficult target. Small-molecule inhibitors were reported for Grb2 SH3 domain, but showed poor selectivity and proved cytotoxic33,34. Certain progress has been achieved in designing peptidic inhibitors of Grb2 SH3 interaction with Sos135,36. The most advanced peptidic constructs were equipped with protein transduction domain to enable cell entry and have had some success in preclinical studies on mice37.
Very recently, Yu and co-workers developed a family of covalent peptidic ligands targeted at Grb2 N-SH38. In doing so, they took advantage of the sole cysteine residue C32, which is favorably positioned on the surface of Grb2 N-SH3 near the peptide-binding site, and relied on chloroacetamide group to conjugate the peptide to the corresponding thiol. Taking a cue from their work, we used a slightly longer peptide sequence, VPPPVPPRRR, for which the PDB coordinates of the complex with Grb2 N-SH3 are available18. We further added Nε-chloroacetyl lysine residue (X′) in the C-terminal position, arriving at the peptide termed Sos1-X′.
First, we set out to characterize the interaction between Sos1-X′ and Grb2 N-SH3 experimentally. This interaction involves rapid formation of noncovalent complex (denoted SH3·Sos1-X′), followed by its gradual progression to covalent complex (denoted SH3:Sos1-X′). To separate the former process from the latter, we have engineered the C32S mutant of Grb2 N-SH3, which is incapable of covalent bonding with Sos1-X′. Previously it has been shown that this conservative cysteine-to-serine mutation, which involves the surface residue outside the ligand-binding interface, has no effect on structural integrity and ligand binding properties of Grb2 N-SH338.
To determine Kd constant of noncovalent binding between Sos1-X′ and C32S Grb2 N-SH3, we conducted 1HN,15N-HSQC titration experiment. Adding the peptide caused moderate shifts for a number of SH3 peaks corresponding to the residues on peptide-binding interface (see Fig. 1C)18. The data are consistent with fast-to-intermediate exchange between the two states of the SH3 domain (free and peptide-bound). For quantitative analysis, we have selected a group of 8 well-resolved peaks showing substantial titration effects. The two-dimensional HSQC titration data for these peaks have been fitted using the program TITAN39 on per-residue basis, as well as collectively (illustrated in Fig. 1D; for complete summary see Fig. S1). The collection of per-residue fits produced the average Kd value of 4.6 ± 2.3 μM, while the global fit yielded 4.9 μM. These two results are obviously consistent with each other. More importantly, they are similar to the dissociation constant previously determined for the unmodified Sos1 peptide, Kd = 3.5 μM. Indeed, the reactive residue X′ has been appended to the C-terminus of the Sos1 peptide, where it is projected into solvent and therefore should not interfere with the binding (see Fig. 1A,B). We conclude that Sos1-X′ forms a bona fide noncovalent complex with Grb2 N-SH3, in agreement with the predicted mechanism of covalent conjugation. Considering this result, as well as the chemical shift mapping data, it is safe to suggest that SH3·Sos1-X′ complex is identical to the SH3·Sos1. Thus, the existing structure of SH3·Sos1 provides a good starting point to build a model of the covalent complex SH3:Sos1-X′ (described in what follows).
It is also worth noting that addition of the peptide considerably improves the quality of the spectral map, leading to more uniform peak intensities. The improvement is particularly significant at higher sample concentration, 1 mM (see Fig. S2A). This suggests that apo SH3 domain suffers from weak self-association, similar to what has been described before for Crk SH3 domain40. It has also been shown that peptide binding leads to reduction in μs-ms dynamics, which is detectable at several sites in Src SH3 domain, and improves protection against solvent exchange41. Finally, it is worth mentioning that in the case of α-spectrin SH3 domain, peptide binding leads to a moderate increase in thermodynamic stability of the domain42.
Covalent binding between Sos1-X′ and Grb2 N-SH3
To investigate the formation of covalent complex SH3:Sos1-X′, we have used the sample containing 1 mM of 15N-labeled wt-SH3 and 2 mM of Sos1-X′. The progression of the conjugation reaction was monitored by recording a series of back-to-back 1HN,15N-HSQC spectra. The first spectral map (red spectrum in Fig. 2A) contains a set of strong signals corresponding to noncovalent complex SH3·Sos1-X′. It also contains a set of weak peaks from covalent complex SH3:Sos1-X′ which has formed during the course of the NMR experiment. The last map acquired 8 h after the start of the reaction (green spectrum in Fig. 2A) contains solely SH3:Sos1-X′ signals. If any amount of unreacted SH3 domain remains in the sample, it is too small to be reliably detected in the spectrum.
The inspection of Fig. 2A immediately shows that the perturbations to the spectrum due to covalent binding are minimal. This leads us to conclude that engineering of the covalent bond in the SH3·Sos1-X′ complex leaves its structure essentially intact. This is not surprising given that X′ residue is positioned opposite to C32 so that they can form a thioether linkage without causing any significant distortions to the structure. It appears that only minimal structural adjustments are needed to accommodate the newly formed covalent bond.
To localize small structural changes caused by peptide conjugation, we analyzed chemical shift differences between SH3·Sos1-X′ and SH3:Sos1-X′ spectra. For this purpose, we have combined 1HN and 15N shifts, as often done in the context of chemical shift mapping43:
1.1 |
1.2 |
Δnoncov and Δcov were further determined under the identical sample conditions, cf. Fig. S2. The summary of the results is shown in Fig. 2B. The inspection of this graph suggests that larger values of Δcov are found near the attachment point, i.e. in the n-Src loop housing residue C32. This is understandable, since peptide conjugation alters the covalent bonding pattern at the attachment site and, furthermore, is likely to have some effect on the conformation (and conformational dynamics) of the n-Src loop. Smaller shifts Δcov are also observed in many of the same sites where Δnoncov were detected. Indeed, we expect that covalent bonding entails certain subtle adjustments at the protein-peptide interface. This is reflected in the chemical shifts of the same residues that previously proved sensitive to (noncovalent) peptide binding. The effect is additionally illustrated in Fig. S3 that shows the mapping of Δcov and Δnoncov onto the structure of the complex.
The series of back-to-back HSQC spectra, two of which are shown in Fig. 2A, can also be used to characterize the kinetics of the conjugation reaction. Given the experimental sample conditions and the Kd constant reported above, it follows that SH3 domain is fully loaded with the peptide (the proportion of the apo SH3 does not exceed 0.4%). Therefore, the reaction can be described simply as a gradual transformation of SH3·Sos1-X′ into SH3:Sos1-X′, obeying the first-order exponential kinetics. To quantify this process, we have selected 3 residues which give rise to well-resolved peak pairs assignable to the noncovalent and covalent complex. Fitting these data on per-residue basis leads to the average value τ1/2 = 41.6 ± 1.9 min for half-life of the reaction, while the global fitting yields 41.5 min (illustrated in Fig. 2C, summarized in Fig. S4). If viewed from the perspective of potential diagnostic or therapeutic uses, this is a favorable outcome. Indeed, as commented in the Introduction, the conjugation chemistry should not be too fast or too slow. The timeframe of ca. 1 h appears to be the desired middle ground.
While NMR data strongly support the proposed conjugation mechanism, they offer no definitive proof that the reaction occurs as intended. For such proof, we turn to liquid chromatography/tandem mass spectrometry. The sample of SH3:Sos1-X′ for LC-MS/MS analysis was prepared in a similar manner to the NMR study, then transferred to ammonium bicarbonate buffer and subjected to trypsinolysis. The trypsin treatment is expected to cleave between residue R10 in the peptide and the X′ derivate that is conjugated to the protein (see Fig. 1B). Consequently, in the tryptic digest C32 residue should carry a modification with the mass 186.15 (acetyllysine). Detection of this particular modification should prove that the reaction has occurred as expected.
The fragmentation spectra of SH3:Sos1-X′ produced a reasonably good coverage of the SH3 sequence, see Fig. 2E. The expected acetyllysine modification is consistently found in the position C32, thus confirming the presence of the intended thioether linkage. Note that in some of the tryptic peptides C32 remains unmodified, while in others it carries a dithiothreitol modification (the consequence of 2 mM DTT in the reaction medium). Bear in mind, however, that these data should not be interpreted as a measure of reaction efficiency since LC-MS/MS method is not suitable for product quantification. In fact, NMR data indicate that the efficiency of the conjugation reaction approaches 100% (see above). Similarly, the presence of oxidized Met, His and Trp residues, deamidated Gln and Asn residues, etc. in the tryptic digest does not reflect the true state of the sample, but largely stems from the experimental artefacts44.
We have also conducted an experiment to shed light on the mechanism of the conjugation reaction. Specifically, we carried out the reaction under different pH conditions, employing SDS-PAGE to detect the adduct (see Fig. 2D). The observed increases in the reaction rate with increasing pH confirm that the reaction proceeds via thiolate-anion form of C32, in agreement with the well-established mechanism of a small-molecule reaction involving chloroacetamide and cysteine45. This observation proved very important for our subsequent modelling efforts (see below). Generally, chloroacetamide chemistry is known for its high degree of specificity toward cysteine residue46. Indeed, our mass spectra showed no evidence of off-target modifications, involving e.g. proximal lysine sites. Our NMR spectra, featuring one set of intense spectral peaks assignable to the SH3 domain, also confirm that the covalent complex is formed uniquely through the target C32 residue.
Finally, we were interested in folding properties of SH3:Sos1-X′ vs. SH3·Sos1-X′. It is a common knowledge that small domains, such as Grb2 N-SH3, have an ability to fold spontaneously. On a fundamental level, this property is a product of evolutionary pressure – simple structural motifs have evolved to fold independently (without help from chaperonins). On the other hand, this argument does not apply to the covalent complex of SH3:Sos1-X′. Indeed, its polypeptide chain, featuring the thioether isopeptide linkage, is a wholly artificial construct. Therefore, it is not a priori clear whether SH3:Sos1-X′ retains the ability to spontaneously fold similar to the apo SH3 domain.
In order to answer this question, we have characterized the ability of SH3:Sos1-X′ to refold following thermal denaturation. The sample of noncovalent SH3·Sos1-C complex was used as a reference. Our results demonstrate that both covalent and noncovalent complexes can successfully refold after 30 minutes at 70 °C with only moderate losses (see Fig. S5). Similarly, the covalent complex refolds after 30 minutes at 90 °C, which is well above its melting temperature47. While non-native covalent linkages can affect protein folding, especially in homodimers48, this is apparently not the case in this particular peptide-protein conjugate.
MD simulation of covalent binding between Sos1-X′ and Grb2 N-SH3
In this section, our objective is to start with the known structure of the noncovalent SH3·Sos1-X′ complex and by means of the specially designed MD algorithm transform it into covalent complex SH3:Sos1-X′. In doing so, we seek to obtain a maximally faithful model of SH3:Sos1-X′.
As a first step, we calculated Amber ff14SB force-field49 parameters for non-native residues in this system: Nε-chloroacetyl lysine (X′) and its adduct with cysteine. For this purpose, a standard procedure has been employed (see Materials & Methods). In brief, small fragments have been chosen to represent the residues of interest and geometry-optimized by means of DFT calculations. The obtained geometries were used to calculate electrostatic potentials by means of Hartree-Fock method; these potentials were in turn used to fit partial charges via the RESP scheme50. Other lacking force-field parameters have been taken from GAFF2 force field51. In this manner, we have parameterized Nε-chloroacetyl lysine (termed LYC), C-terminal Nε-chloroacetyl lysine (CLYC), adduct of Nε-chloroacetyl lysine with cysteine (LYZ), adduct of C-terminal Nε-chloroacetyl lysine with cysteine (CLYZ) and adduct of cysteine with Nε-chloroacetyl lysine (CYZ). The boundary between the covalently bonded CYZ and LYZ (or CLYZ) residues is across the Sγ-Cθ bond (see Fig. 1B). Given the C-terminal position of Nε-chloroacetyl lysine in the Sos1-X′ peptide, only CLYC, CLYZ and CYZ residues are required for further simulations.
To build the initial model, we begin with the structure PDB ID 1GBQ representing noncovalent complex between Sos1 and Grb2 N-SH318. 1GBQ is a minimized average NMR structure and, as such, is expected to be less accurate than crystallographic structures52. Nevertheless, comparing it with x-ray coordinates of apo Grb2 N-SH3 PDB ID 6SDF indicates a very good level of agreement (rmsd 0.57 Å for the backbone atoms, 0.72 Å overall). Beginning with 1GBQ, we appended X′ (CLYC) residue to the extended C-terminus of the Sos1 peptide using the standard facilities of Amber 1653. Finally, we assigned the (standard) type CYM to residue C32 in the SH3 domain to simulate the ionized form of cysteine. As discussed in the previous section, conjugation of Nε-chloroacetyl lysine to cysteine occurs through the thiolate-anion form of cysteine.
The resulting structure has been placed in TIP3P water box, energy-minimized and heated to the temperature 293 K. This configuration was then used to launch 11 MD trajectories (starting with different initial velocity distributions). Over the first 500 ns, each trajectory was recorded as a regular, conventional MD simulation. Beginning from this point, the system is deemed reactive: i.e. it is assumed that X′ and C32 may form a bond if the two reactive groups come sufficiently close to each other. Accordingly, the algorithm starts to monitor the distance between Cθ (X′) and Sγ (C32) atoms. This distance varies significantly during the course of the simulation, reflecting conformational mobility of (i) flexible arginine-rich peptide tail and (ii) long Nε-chloroacetyl lysine side chain. When this distance falls below rc = 3.3 Å, it is assumed that the two reactive groups are within the capture radius of each other and, therefore, the conditions exist for the chemical reaction to occur.
This does not mean, however, that the reaction is triggered automatically upon crossing the rc threshold. Instead, the algorithm makes a decision on whether to initiate the reaction by means of rejection sampling (i.e. essentially by flipping a coin, see Materials & Methods). The goal of this scheme is to lower the reaction rate. While it is impossible to reproduce the correct time scale of the reaction, ca. 1 h, this approach is meant to ensure that the simulated reaction is slow on the scale of conformational dynamics in SH3·Sos1-X′.
At the moment when the reaction is initiated, a special algorithm takes control. This algorithm makes use of two topologies and their associated sets of force-field parameters: one pertaining to the noncovalent complex SH3·Sos1-X′ (involving residues CLYC and CYM), and the other pertaining to the covalent complex SH3:Sos1-X′ (involving covalently bonded CLYZ and CYZ). The algorithm interpolates between these two parameter sets. Specifically, the force constants from the first set are gradually reduced to zero, while the force constants from the second set are gradually ramped up to their full value. As an example, we illustrate the scaling of the force constants associated with Cθ-Cl bond in the chloroacetyl group and Cθ-Sγ bond in the thioether bridge (red and green profiles in Fig. 3A, respectively). It can be said that the first bond is gradually dissolved, while the second is gradually materialized.
The transition between the initial SH3·Sos1-X′ state and the final SH3:Sos1-X′ state is conducted over the time interval Trxn = 2 ns. During this period, the linear interpolation between the two states is accomplished through (i) consecutive regeneration of topology files and (ii) using scalable artificial restraints that emulate certain force field terms (see Materials & Methods for details). By the end of the transition period, the artificial restraints are replaced with the equivalent force field terms for SH3:Sos1-X′.
After the formation of SH3:Sos1-X′ is completed, it is simulated in a conventional fashion under the control of the regular force field. In this final stage of our protocol, we record 500 ns of such conventional MD trajectory.
In what follows, we discuss the behavior of the 11 MD models that have been generated in this manner. The initial 500-ns-long portion of each trajectory corresponds to the conventional MD simulation of the SH3·Sos1-X′ complex. The dynamics of the complex is in line with expectations. The polyproline-II segment of Sos1 (VPPPVPP) remains wedged into two shallow hydrophobic grooves on the surface of the SH3 domain, maintaining key hydrogen bonds to the side chains of the conserved W36 and Y52 residues (these and other essential residues are visualized in Fig. S7)40,54. The first arginine following the PPII element, R8, also maintains a strong connection to the SH3 domain through two concurrent salt bridges – to D15 and E16. The following two arginines, R9 and R10, are highly dynamic, with side chains mostly projected into solvent. However, occasionally these arginines also form salt bridges with different anionic residues in the RT and n-Src loops of the SH3 domain; these salt bridges often turn out to be rather long-lived (100 ns or longer). Finally, the long side chain of Nε-chloroacetyl lysine (X′) shows a similarly diverse behavior – most of the time it is extended into solvent, but occasionally it inserts itself between the RT and n-Src loops and remains in this configuration for extended periods of time (100 ns or longer). Accordingly, the distance between Cθ atom in X′ chloroacetyl group and Sγ atom in C32 thiolate group shows a wide range of variation during the course of the simulations (between 3.1 and 32.8 Å).
After the time point 500 ns, the simulation of SH3·Sos1-X′ is continued in the same conventional fashion up to the moment when the reaction is initiated. This is illustrated for one of the trajectories in Fig. 3A, where the reaction is started at time point 649.3 ns. Considering all trajectories, the conjugation occurs in the time window between 513 and 1,047 ns.
We will now discuss in more detail the events that happen during the course of the “reaction”, i.e. during the time Trxn = 2 ns. It begins with a configuration where Cθ atom is maximally close to Sγ, see Fig. 3B. Initially, the system evolves similar to a noncovalent complex, with only minimal extraneous restraints. However, after 1 ns a “transition state” is formed, where the force field terms representing separate X′ and C32 residues are balanced with the other force field terms representing the thioether linker. This configuration, illustrated in Fig. 3C, involves pentavalent carbon Cθ and appears curiously realistic. Indeed, our algorithm, which involves gradual dissolution of old bonds and simultaneous emergence of new bonds, is expected to produce a reasonably looking “transition state”. Of course, the similarity to the actual transition state is rather superficial (building a realistic model of transition state would require quantum-chemical calculations, which are not a part of our procedure).
From the “transition state” the system evolves toward the bona fide thioether bond, see Fig. 3D. The chlorine atom leaves and diffuses into solvent (in a form of chloride anion). At the end of the Trxn period the force field terms corresponding to the old covalent geometry are reduced to nil and the terms corresponding to the new covalent geometry are brought to their full intensity. Beginning from this point the simulation of the newly formed SH3:Sos1-X′ complex is continued in a conventional fashion. The length of this final portion of the trajectory is 500 ns (same as the initial portion of the trajectory dedicated to SH3·Sos1-X′). The entire sequence of events is visualized in Video S1.
At this point it is appropriate to discuss the setting of the two parameters used in our computational scheme, rc = 3.3 Å and Trxn = 2 ns. Both values represent a heuristic compromise. In the case of rc, the value smaller than 3 Å would mean that prohibitively long simulations are required to achieve the bonding. Conversely, large values that are greater than ca. 5–10 Å would mean that the system is actively steered to form a bond, even when its current conformation is not conducive to bond formation. In the case of reaction time, a short Trxn period would result in an explosive growth of potential energy and termination of the trajectory. (The limiting case Trxn = 0 corresponds to a scenario when one imposes a full-strength bond on a pair of distant atoms, which causes the simulation to immediately “blow up”). On the other hand, using long Trxn would mean that the system has a long time to evolve while in a somewhat artificial state, such as the “transition state” described above. This is also undesirable.
The force field terms that are ramped up/down in our protocol involve a small part of the simulated system (specifically, just a few atoms). Furthermore, these terms are ramped up/down sufficiently slowly over the time interval 2 ns. Consequently, the potential energy of the system increases only modestly during the transition. This is illustrated in Fig. 4A – the transient increase in potential energy corresponds to only 0.5% of the total. This is of similar magnitude to random fluctuations of potential energy (represented in the graph by a pale blue band). From this perspective the transformation can be characterized as almost adiabatic (i.e. no heat is exchanged with the environment)55.
On a more quantitative level, changes in potential energy should, of course, be reflected in kinetic energy of the system. However, the situation is successfully handled by the thermostat so that the temperature remains stable throughout the transition period, see Fig. 4C (i.e. the thermostat removes the excess heat). Similarly, the barostat successfully maintains constant pressure, Fig. 4D. The only macroscopic parameter that becomes noticeably perturbed in this NPT simulation is the volume of the simulation cell, Fig. 4E. However, the observed transient increase in volume does not exceed 0.15% of the total. This is similar to volume fluctuations during the course of the regular trajectory. The observed spike is short-lived – the volume returns to its original value by the end of the Trxn period. Therefore, we conclude that our scheme avoids any appreciable perturbations to the essential macroscopic parameters: temperature, pressure and volume. This outcome validates our choice of Trxn and, more broadly, the entire scheme used to model Grb2 N-SH3 conjugation with Sos1-X′.
Evaluation and validation of the model
Each of our 11 simulations described in the previous section includes the 500-ns final segment that represents the covalent complex SH3:Sos1-X′. Immediately following the bond formation, the system may need some time to relax and dissipate the residual structural strain. Therefore, we choose to ignore the first 50 nanoseconds subsequent to Trxn and focus on the final 450-ns interval. Thus, we have 11 × 450 ns = 4.95 μs of conventional MD simulations that can be viewed as an MD model of the SH3:Sos1-X′ complex.
Similarly, the initial part of each simulation is representative of the noncovalent SH3·Sos1-X′ complex. Ignoring the first 50 ns, the net duration of the relevant MD “footage” is 4.95 μs + 2.27 μs (the latter contribution is from the period prior to bond formation, when we monitor the Cθ-Sγ distance, but aside from that record the trajectory in a conventional manner). Therefore, we have a total of 7.22 μs of conventional MD simulations that can be regarded as an MD model of the SH3·Sos1-X′ complex. This presents an opportunity to compare the two MD models, SH3·Sos1-X′ and SH3:Sos1-X′, and thereby assess the effect of conjugation on protein structure.
Figure 5A shows the superposition of the backbone traces from the average MD coordinates of SH3·Sos1-X′ and SH3:Sos1-X′ complexes (painted red and green, respectively). Clearly, there is very little difference between the two models. In fact, the differences are localized in the n-Src loop region of the SH3 domain and the C-terminus of the Sos1-X′ peptide – i.e. precisely at the conjugation site. In particular, the Cα atom of residue C32 is displaced by 1.7 Å, while Cα atom of residue X′ is displaced by 3.1 Å. We conclude that the structure adapts to covalent bonding through local conformational changes involving only a few residues proximal to the conjugation site.
Not surprisingly, the key peptide-protein interactions also remain intact, e.g. hydrogen bond between Y52 and P3 (present in 97% of all MD frames in both SH3·Sos1-X′ and SH3:Sos1-X′ models) as well as several other conserved hydrogen bonds. Similarly, the “fuzzy” electrostatic interactions between the arginine-rich tail of Sos-1X′ peptide and the negatively charged SH3 patches40 are maintained as well. However, the pattern of these transient contacts is somewhat changed. In particular, the salt bridges between R8 and D15, E16 are slightly weakened as a result of covalent linking. R9 tends to interact less with D15, E31, C32 and more with D33, Q34, whereas R10 interacts less with E16 and more with D15. Finally, X′ residue shows some hydrogen bonding to carbonyl oxygen of C32, with which it is linked via thioether bridge. The hydrogen bonds are through backbone N atom (24%), as well as Nζ atom in the linker (12%).
In addition, we have also analyzed the amplitudes of backbone motion in the MD models of SH3·Sos1-X′ and SH3:Sos1-X′. The results are shown Fig. 5B. The observed differences are minor and limited to the C-terminal tail of the peptide, plus essentially a single residue in the n-Src loop, viz. E31. Interestingly, the loop remains mobile despite covalent linking – apparently, long and flexible linkage allows it to retain a significant level of motional freedom. Thus, both structure and dynamics of the complex are only minimally affected by the covalent linking.
We have also sought to validate our SH3:Sos1-X′ model using experimental data. The optimal strategy would be to compare our model to a crystallographic structure of the covalent complex. We attempted to obtain such structure in collaboration with S. Korban in I. Bezprozvanny laboratory. This work led to a crystal structure of apo Grb2 N-SH3 (PDB ID 6SDF). However, our efforts to crystallize SH3:Sos1-X′ complex proved unsuccessful.
An alternative would be to obtain NMR structure of the covalent complex. However, as already pointed out, NMR structures are generally less accurate than their crystallographic counterparts56. In particular, it may be difficult to reliably detect the repositioning of several residues in the n-Src loop by ca. 1–2 Å based on NMR coordinates. Moreover, standard structure-solving methods would require isotopic labeling of the (chemically modified) Sos1-X′ peptide, which is prohibitively expensive. Without the labeled peptide, it is difficult to obtain a good structural model of the flexible polyarginine tail and the thioether linker.
In the absence of high-quality structural data, we used NMR chemical shifts to validate the obtained SH3:Sos1-X′ model. This approach has been introduced following the development of structure-based methods for calculation of protein chemical shifts57–61. Like others, we use secondary chemical shifts, , that are supposedly free from the effect of primary structure and sensitive only to higher-order (secondary, tertiary, etc.) structure. Note that our definition of the secondary shift is analogous to the one reported originally62,63. The calculations were conducted using the chemical shift prediction program SPARTA+64. Specifically, MD frames were extracted from the SH3:Sos1-X′ trajectory with a step of 1 ns and processed by the program; the results were subsequently averaged and corrected for the random-coil shifts δrc as catalogued in SPARTA+. The experimental shifts were corrected using the same δrc values.
The correlation between the experimental and predicted (trajectory-based) δsec is illustrated in Fig. 6. The obtained correlation coefficients, 0.88–0.92, suggest that the model is in agreement with the experimental observations. In principle, chemical shifts are exquisitely sensitive to fine details of protein structure. However, their usefulness in this context is limited because of the relatively low accuracy of the available predictor programs. For instance, SPARTA+ claims the accuracy (i.e. rmsd between the predicted and experimentally measured shifts) of 1.09, 0.94 and 1.14 ppm for 13C′, 13Cα and 13Cβ spins, respectively. In our analysis, we have obtained the rmsd of 0.84 ppm for the combined data from these carbon spins, Fig. 6A. A significant uncertainty margin associated with chemical shift predictions essentially limits the “resolution” of this method as a structure-validation tool (in other words, δsec rmsd is not nearly as informative as, for example, crystallographic Rfree65).
To advance further, we choose to compare the results in Fig. 6 with other similar analyses. As a first step, we used the NMR structure of noncovalent complex SH3·Sos1-X′ (1GBQ) as a model for covalent complex SH3:Sos1-X′. This is not unreasonable, given that peptide conjugation leads only to minimal structural changes in the complex, see Fig. 5. Moreover, one may expect that the experimentally determined structure (SH3·Sos1-X′) is in certain ways superior to the MD-based model (SH3:Sos1-X′). However, the results have not borne out these expectations. 1GBQ proved to be a markedly worse model of the covalent complex, as indicated by lower Pearson coefficients and significantly increased rmsds: 1.05 vs. 0.84 ppm for 13C, 2.71 vs. 1.92 ppm for 15N, and 0.53 vs. 0.33 ppm for 1HN spins (cf. Figs. 6 and S8). This outcome reflects favorably on our MD-based model of SH3:Sos1-X′. Specifically, it suggests that our model is not simply a mechanically altered version of 1GBQ, but rather a distinctly better representation of the actual covalent complex.
It is also instructive to compare the results in Fig. 6 with benchmarks from one of the popular model proteins, such as ubiquitin. To this end, we correlated the experimental chemical shifts of ubiquitin with SPARTA+ predictions using the crystallographic structure 1UBQ66. It is anticipated that this example should yield the best correlation obtainable with SPARTA+ software. Indeed, SPARTA+ has been trained on high-resolution x-ray structures and, accordingly, produces best results when used with such structures67. Furthermore, 1UBQ was actually a part of the training set, which should additionally improve the quality of predictions. It should also be noted that 1UBQ has been extensively validated using various solution NMR data and proved to be a good model for ubiquitin in solution68. The results from 1UBQ calculations are presented in Fig. S9A–C. The obtained degree of correlation is comparable to the one in Fig. 6. Specifically, the Pearson coefficients are the same for 13C, r = 0.92 vs. 0.92, and nearly the same for 15N, r = 0.90 vs. 0.89, while for 1HN our results using SH3:Sos1-X′ model are somewhat better, r = 0.88 vs. 0.80. In terms of rmsd, 1UBQ calculations show slightly better agreement for 13C (0.80 vs. 0.84 ppm) and 15N (1.87 vs. 1.92 ppm), while SH3:Sos1-X′ calculations prove a little more accurate for 1HN (0.33 vs. 0.39 ppm). Of note, the latter result does not depend on proton optimization in the crystal structure of ubiquitin.
In addition to the crystallographic structure 1UBQ, we have also considered the recently reported ultra-accurate solution structure 2MJB68, as well as 1.23-μs MD trajectory of ubiquitin recorded in house under Amber ff14SB force field. The former led to chemical shift predictions on par with 1UBQ, see Fig. S9D–F. The latter produced the results that appeared to be slightly worse, Fig. S9G,H. This last observation is in agreement with the findings of Li and Brüschweiler, who concluded that the application of SPARTA+ and other similar predictors to MD trajectories do not lead to any improvement compared to high-resolution crystallographic structures and, in fact, tends to slightly degrade the accuracy of the predictions67.
In conclusion, the comparison with ubiquitin suggests that our SH3:Sos1-X′ model is of very good quality. As a template for chemical shift predictions, the SH3:Sos1-X′ model shows the same level of performance as a high-resolution x-ray structure, i.e. the highest level of performance attainable with SPARTA+ (as well as other empirical predictors). It is difficult, however, to make any more specific claims in this regard because of the limitations of the chemical-shift-based validation procedure.
Concluding Remarks
It should be instructive to compare our approach with some of established MD-based modeling tools that, in principle, could be used to model peptide-protein conjugation. Of special interest are those methods that use empirical reactive force fields (ERFF), such as Empirical Valence Bond approach or Adiabatic Reactive Molecular Dynamics19–22.
In ERFF methods, it is normally assumed that both initial state of the system (i.e. in our case noncovalent complex) and its final state (covalent complex) are known. The main emphasis is on reconstruction of the potential energy surface on the path from the initial to the final state. This is achieved by means of ab initio calculations and may involve calibration against experimental data. For reactions with low barriers, ERFF methods can simulate the transition of the system from the initial to the final state21. More typically, however, the reactions of interest feature high barriers. In this case, ERFF methods rely on various biasing schemes, such as umbrella sampling69. Such approach allows one to estimate the magnitude of free-energy barrier ΔG, explore the details of transition state, and investigate the effect of protein environment on ΔG. All of this is the focus and the stated goal of the ERFF methods.
In our approach, we know the structure of the initial state (noncovalent complex), but not the final state (covalent complex). Unlike ERFF methods, we are not interested in the details of the transition state, but instead seek to build a realistic model of the final state. The reaction is initiated when the two reactive groups approach each other to within a certain minimum distance (which is a reasonable approximation for the starting point). The reaction is driven by switching the simulation from one potential energy surface to another. In principle, this approach is fraught with a risk of large perturbations to energy and temperature. However, in our implementation the switching is conducted over a relatively long period of time, as controlled by the appropriate switching function. Under these conditions, the excess energy is thermalized and successfully absorbed by the thermostat, thus avoiding any significant perturbations to the system. Previously, such switching functions have been employed in the context of different ERFF schemes22,70.
In summary, our method can be viewed as a modification of ERFF strategy, where we execute the transition from the initial to the final state with no interest in the details of the intervening barrier (cf. Fig. 4B). The adiabatic character of the transition is achieved by slow switching from one potential energy surface to the other. In this sense, our method can also be likened to a steered simulation. The distinction is that our algorithm is driven by structure-informed force field terms rather than a preselected geometric variable (e.g. certain characteristic distance)71,72.
While the transition state obtained in our method is broadly reasonable, we make no effort to reproduce the correct barrier height, etc. Consequently, our method has only limited ability to predict the efficiency of the reaction. Specifically, it can address steric factors controlling the approach of the reactive groups, but not the chemistry per se. At the same time, our method demonstrated its ability to produce a reasonable model for the final state (covalent complex), which is duly relaxed and structure-optimized under the appropriate force-field potential. Note that such model building is not a part of the original ERFF mandate.
In addition to ERFF methods, it is also worth mentioning an alternative modeling technique known as covalent docking73. In principle, this technique can also be used to build a model of covalent peptide-protein complex. However, in practice the existing programs can only handle small-molecule ligands and not peptidic ligands. Furthermore, this approach suffers from limitations that are common to docking programs, e.g. it restricts the flexibility of the system (treating the scaffold of the target protein as rigid). This makes it unsuitable for the purpose of the current study, where we seek to capture (subtle) structural changes caused by covalent bonding of the peptide ligand.
Of particular note, the described method evolves the system under the control of native, unaltered force field (with the exception of 2-ns period, representative of the chemical reaction). Accordingly, the speed of the simulations is essentially the same as in conventional simulations. The proposed MD protocol should be a useful complement to experimental studies of peptide-protein conjugates. First, MD modeling sheds light on the likelihood of the peptide reactive group making contact to its target residue on the surface of the protein. Many reactive groups can bond with more than one type of amino acid; some of these reactions are undesirable since they lead to unstable conjugates. MD modeling allows one to estimate the likelihood of generating such off-target products. This kind of in silico identification of possible conjugation sites can be of practical interest15. Second, our method permits building of high-quality models for peptide-protein conjugates. Such models can be helpful in numerous applications involving imaging, therapeutics, biotechnology, etc., especially when the conjugates do not easily lend themselves to structural characterization (such as the case with SH3:Sos1-X′). This is particularly true in situations where the initial models are incomplete, e.g. the coordinates of a flexible peptide tail or a protein loop are not accurately known (which is the case with SH3·Sos1-X′, where the peptide is partially disordered). In this situation, MD-based methods, such as described in this paper, can help to model the missing fragments and thus augment the existing structural data. Third, our method can offer some insight into stability of peptide-protein conjugates. Experimental results suggest that various covalent modifications can have a destabilizing effect on protein structure48,74–76. This has far-reaching implications for therapeutic applications of covalent peptidic ligands. It is anticipated that MD models can shed light on structural and dynamic origins of such effects.
Materials and Methods
Sample preparation
pET-28 vectors for Grb2 N-SH3 (WT and C32S) encoding residues 2–59 of human Grb2 have been purchased from GenScript. The protein additionally containing N-terminal His6-tag was expressed in Rosetta DE3 cells (Novagen) using minimal M9 media. 15N-NH4Cl and 13C-glucose supplements have been used for isotope labeling. Protein expression was induced when OD600 reached 0.8. After 16 hours of incubation at 37 °C, cells were harvested, suspended in the standard lysis buffer (containing 2 mM of DTT) and then homogenized by SPEX SamplePrep 6870 Freezer/Mill followed by three rounds of sonication on ice. The sample was purified using Ni-affinity and size-exclusion chromatography (columns GE HisTrap HP and GE Sephacryl S-200 HR). Sos1-X′ and Sos1-C peptides were synthesized by Pepmic Inc. The buffer composition was 20 mM sodium phosphate, 10 mM DTT, 10% D2O, 0.01% NaN3, pH 7.2 (unless indicated otherwise).
NMR experiments
All measurements were conducted at 25 °C using 500 MHz Bruker Avance III spectrometer equipped with TBI room temperature probe. For spectral assignment of noncovalent (covalent) complex we used the sample containing 1.5 mM of 15N,13C-labeled protein and two-fold excess of Sos1-C (Sos1-X′) peptide. The buffer additionally contained 150 mM NaCl. The covalent complex was obtained by overnight incubation of Grb2 N-SH3 with Sos1-X′ at room temperature; the completion of the conjugation reaction was confirmed by SDS-PAGE. Spectral assignment was obtained using the standard suite of triple-resonance experiments: HNCO, HNCACB, HN(CA)CO and HN(CO)CA77; the results were found to be in agreement with the data of Wittekind et al.18.
Binding affinity of Sos1-X′ to C32S Grb2 N-SH3 has been determined using the sample with protein concentration 100 μM. The peptide concentration was incremented from 0 to 160 μM in a step of 20 μM and then brought to 240 μM. The spectra were acquired using 1H,15N-BEST-HSQC sequence78. The results were analyzed using the program TITAN39. Conjugation kinetics was investigated using the sample containing 1 mM Grb2 N-SH3. Following the addition of 2 mM Sos1-X′, a series of back-to-back 1H,15N-BEST-HSQC experiments was recorded (24 mins per spectrum). Peak volumes were obtained using nlinLS79 and then fitted assuming pseudo first order (exponential) kinetics.
The refolding of noncovalent complex was investigated using the sample containing 100 μM Grb2-N SH3 and 300 μM Sos1-C. Covalent complex was obtained by overnight incubation of 100 μM of Grb2-N SH3 with 300 μM Sos1-X′. The buffer additionally contained 150 mM NaCl. The samples were denatured by 30-min incubation at 70 or 90 °C and then returned to room temperature. 1H,15N-BEST-HSQC spectra were recorded before and after this procedure.
SDS-PAGE
The freshly prepared protein material was reduced by 10 mM DTT, divided into several portions and transferred to 50 mM sodium acetate buffer (pH 4.0 and 5.0), 50 mM sodium phosphate buffer (pH 6.0 and 7.0) and 50 mM Tris-HCl buffer (pH 8.0). The obtained samples, containing 100 μM Grb2 N-SH3 and 1 mM DTT, were incubated for 20 mins with 200 μM Sos1-X′ and then loaded on non-reducing tris-glycine 14% acrylamide gel as is (i.e. without quenching the reaction or boiling). The gel was stained using Coomassie Blue.
LC-MS/MS analysis
Samples of apo Grb2 N-SH3 and Grb2 N-SH3 conjugated with Sos1-X′ were dissolved in 50 mM ammonium bicarbonate buffer and digested with trypsin (10 ng/μl) for 16 h at 37 °C. The reaction was stopped with 0.5% formic acid. The obtained tryptic peptide fractions (injection volume 2 uL) were analyzed in triplicates on a nano-HPLC Agilent 1100 system (Agilent Technologies, Inc.) coupled to a 7 T LTQ-FT Ultra mass-spectrometer (Thermo Electron Bremen GmbH) using a nanospray ion source, as described previously80,81. HPLC separation was performed on a capillary column packed in-house (75 μm i.d. × 12 cm fused silica capillary filled with Reprosil-Pur Basic C18, 3 μm/100 Å, Dr. Maisch HPLC GmbH). Gradient elution was carried out at a flow rate of 0.3 μL/min with the mobile phase A being 0.1% formic acid in water and mobile phase B – 0.1% formic acid in acetonitrile. After pre-equilibration with 3% (v/v) solvent B, a 30 min linear gradient from 3% to 50% was applied, followed by a 5 min gradient from 50% to 90% and then a 10 min isocratic elution with 90% solvent B. MS and MS/MS data were obtained in data-dependent mode using Xcalibur (Thermo Finnigan LLC) software. The precursor ion scan MS spectra (m/z range 300–1600) were acquired in the FTICR cell with resolution R = 50,000 at m/z 400 (number of accumulated ions: 5·106). Five most intense ions from each parent scan were isolated and fragmented by collision-induced dissociation (CID), electron-capture dissociation (ECD) or ECD with in-source decay. Dynamic exclusion was used with 30 s duration80,81.
MS data were searched using PEAKS Studio 8.5 software against the SwissProt human database (augmented with user-supplied sequences of the expected tryptic peptides). The initial mass tolerance was set to 50 ppm for full scans and 0.1 Da for MS/MS. Most common natural or artifactual modifications, such as oxidation of methionine, histidine and tryptophan, deamidation of asparagine and glutamine, and acetylation of N-terminus, were used as variable modifications in the database search. In addition, possible preparation-related modifications, such as chloroacetylation of lysine and DTT modifications of cysteine, were included in the search. Finally, the expected target modification, i.e. modification of cysteine with acetyllysine, was also added. Separately, a modification search with a wide range of possible modifications has been carried out, allowing for up to five variable modifications per peptide. The cutoff false discovery rate was set to 0.1%. At least one unique peptide identification per protein was required. Using the described LC-MS/MS protocol, the target modification was identified in the SH3:Sos1-X′ sample, but not in the control apo sample.
Parameterization of non-native residues for Amber ff14SB force field
We have obtained force field parameters for several small molecules and modified amino-acid residues relevant for the problem at hand. To this end, we have used the following standard algorithm. First, appropriate small-molecule models were chosen and geometry-optimized using B3LYP functional82,83 with 6–31 G(d) basis set84,85. Second, electrostatic potential of the optimized models was calculated using Hartree-Fock method with the same basis set86. Third, the corresponding electrostatic potential (ESP) atomic charges87 and restrained electrostatic potential (RESP) atomic charges88 were derived using the RESP module in Amber 16. Based on the similarity of ESP charges, we transfer (previously reported or newly calculated) RESP charges from one molecule to another. Fourth, other necessary parameters that are absent from the standard ff14SB force field have been adopted from GAFF251,89. All quantum chemical calculations in the above procedure have been performed using Gaussian90.
Small-molecule models and amino-acid residues that have been treated in this manner are listed below. The chemical structures of the amino-acid residues and the respective atomic charges are summarized in Fig. S10.
-
(i)
Non-terminal deprotonated lysine (LYN). Used for method calibration (the corresponding force-field parameters are available in ff14SB).
-
(ii)
C-terminal deprotonated lysine (CLYN). Atomic charges for side-chain atoms beginning with δ-methylene group are the same as in LYN.
-
(iii)
2-chloro-N-propylacetamide. A partial model for Nε-chloroacetyl lysine side chain.
-
(iv)
Non-terminal Nε-chloroacetyl lysine (LYC). Atomic charges for backbone atoms, as well as side-chain atoms up to and including γ-methylene group, are the same as in LYN. Atomic charges for side-chain atoms beginning with δ-methylene group are the same as RESP charges from 2-chloro-N-propylacetamide.
-
(v)
C-terminal Nε-chloroacetyl lysine (CLYC). Atomic charges for backbone atoms, as well as side-chain atoms up to and including γ-methylene group, are the same as in CLYN. Atomic charges for side-chain atoms beginning with δ-methylene group are the same as RESP charges from 2-chloro-N-propylacetamide.
-
(vi)
S-(2-propylamino-2-oxoethyl)-cysteine capped with acetyl and N-methylamide in N- and C-terminal positions, respectively. A partial model for the cysteine residue conjugated to Nε-chloroacetyl lysine.
-
(vii)
Non-terminal cysteine residue modified by conjugation with Nε-chloroacetyl lysine (CYZ). Atomic charges for N, HN, C and O atoms are the same as in CYS. Other atomic charges are the same as RESP charges from S-(2-propylamino-2-oxoethyl)-cysteine.
-
(viii)
Non-terminal Nε-chloroacetyl lysine residue modified by conjugation with cysteine (LYZ). Atomic charges for backbone atoms, as well as side-chain atoms up to and including γ-methylene group, are the same as in LYN. Other atomic charges are the same as RESP charges from S-(2-propylamino-2-oxoethyl)-cysteine. The boundary between LYZ and CYZ residues is across the Cθ-Sγ bond.
-
(ix)
C-terminal Nε-chloroacetyl lysine residue modified by conjugation with cysteine (CLYZ). Atomic charges for backbone atoms, as well as side-chain atoms up to and including γ-methylene group, are the same as in CLYN. Other atomic charges are the same as RESP charges from S-(2-propylamino-2-oxoethyl)-cysteine. The boundary between CLYZ and CYZ residues runs across the Cθ-Sγ bond.
Note that entries (vii-ix) do not involve any new calculations compared to (i-vi). Of the above amino-acid residues, CLYC, CLYZ and CYZ have been used in our MD simulations of Sos1-X′ conjugation with Grb2 N-SH3.
MD simulations
Initial coordinates to simulate the complex of Grb2 N-SH3 with Sos1-X′ were from PDB ID 1GBQ. To bring the structure in line with our experimental design, we appended Nε-chloroacetyl lysine residue (X′, CLYC) to the C-terminus of the Sos1 peptide and treated the reactive cysteine residue C32 in the Grb2 N-SH3 as a thiolate anion (CYM), cf. the discussion of Fig. 2D. The protonation state of the system in the absence of histidine residues was standard for the target pH 7.2 (verified by PROPKA91 calculations). The structure was solvated with TIP3P water by building a truncated octahedron box with minimal 10 Å separation between the protein/peptide atoms and the boundary. The solvated system was neutralized by adding a single Na+ ion, subjected to energy minimization (500 steps using harmonic restraints with force constant 200 kcal/mol·Å2, followed by 100 steps with no restraints), then heated from 0 to 293 K and equilibrated for 1 ns at 293 K. The subsequent simulations were conducted in Amber 16 under ff14SB force field (the nonstandard amino acid types added to the force field are described above, alterations to the force field during the 2-ns reaction period are described below). The simulations were carried out in the NPT ensemble using Langevin thermostat with collision frequency 2 ps−1. A cutoff of 10.5 Å was used for (nonbonded) van der Waals and short-range electrostatic interactions; long-range electrostatic interactions were treated using the particle mesh Ewald scheme. All bonds involving hydrogen atoms were constrained using SHAKE algorithm. The integration step was 2 fs. The production rate using workstations equipped with Tesla K40 m GPU cards was 110 ns/card/day.
A special master script, written in python, controls the entire simulation process (see Fig. S6). The initial 500-ns portion of each SH3·Sos1-X′ trajectory is recorded in a conventional fashion. After that, the script continues recording the trajectory in 1-ns segments. Atomic coordinates and velocities (restart files) are saved at 1-ps intervals. After the segment is completed, the script loops over the frames and extracts the distances between Cθ (X′) and Sγ (C32) atoms. If during 1-ns time interval the distance never drops below 3.3 Å, the simulation is continued and the next 1-ns fragment is recorded. Otherwise, if the frame n is found where this distance is lower than 3.3 Å, the algorithm makes a decision on whether to initiate the reaction. This decision is made using the rejection sampling strategy with p0 set to 0.1 (i.e. the decision is random with the probability of positive outcome 0.1)92. If the outcome is negative, the scanning of the frames continues. Otherwise, the restart file n is invoked; the state of the system stored in this file is taken to be the initial state of the reaction.
During the subsequent reaction period Trxn = 2 ns we use the force-field parameters interpolated between the initial state of the system, SH3·Sos1-X′, and its final state, SH3:Sos1-X′. This concept is illustrated in Fig. 3 for the force constants associated with Cθ-Cl bond (decremented from its nominal value in the chloroacetyl group to zero) and Cθ-Sγ bond (incremented from zero to its nominal value in the thioether linkage). The list of parameters subject to interpolation is given in Table S1 (the key to this table is shown in Fig. S11). The information on atomic charges is provided separately in Fig. S10.
The interpolation procedure is implemented as follows. The 2-ns period is divided into 10-ps intervals. At the beginning of each interval, the relevant force constants are incremented/decremented and the temporary parameter-topology (prmtop) file is created. The 10-ps trajectory segment is then recorded under the control of this temporary prmtop file. At the end of the Trxn period, the temporary prmtop file is strictly equivalent to the generic prmtop file for the SH3:Sos1-X′ complex under ff44SB force field (augmented with the parameters for CYZ and CLYZ residues). Therefore, the simulation can be seamlessly continued further in a fully conventional manner. In doing so, we record 500 ns of such conventional trajectory representing the covalent SH3:Sos1-X′ complex.
The above procedure involves one technicality, which deserves a separate comment. For example, consider the pair of atoms Cθ and Cl that are initially bonded in the SH3·Sos1-X′ complex and then become separated in SH3:Sos1-X′ (chlorine atom moves into solvent in a form of Cl– ion). Amber convention is such that van der Waals and electrostatic interactions between the two bonded atoms are nullified. This makes it impossible to implement the interpolation strategy for these two interactions. To circumvent this problem, we disable the Cθ-Cl bond and replace it with an equivalent harmonic NMR restraint. Then we redefine Cθ and Cl as 1–4 atoms (by introducing a fictitious Cθ-X-Y-Cl angle). The van der Waals and electrostatic interactions are allowed between 1–4 atoms and the corresponding force constants can be edited using ParmEd facilities93. This workaround has been used for several energy terms. Specifically, during the Trxn period we disabled the original potentials associated with Cθ-Cl and Cθ-Sγ bonds, as well as Cβ-Sγ-Cθ, Cl-Cθ-Cη, Cl-Cθ-Hθ1, Cl-Cθ-Hθ2, Cl-Cθ-Sγ, Sγ-Cθ-Cη, Sγ-Cθ-Hθ1 and Sγ-Cθ-Hθ2 angles, and replaced them with the appropriate artificial restraints. At the end of the Trxn period, all these restraints were seamlessly replaced with the equivalent default potentials.
In an alternative version of this protocol, we used sigmoidal functions instead of linear functions to interpolate between the two sets of the force field parameters22. The results were unchanged (not shown).
MD trajectories were analyzed using the program PYTRAJ94, as well as PYXMOLPP2 package written in-house. Chemical shifts of Grb2 N-SH3 domain in SH3:Sos1-X′ complex were calculated on per-frame basis using the program SPARTA+64. For this purpose we have used the final 450 ns of each trajectory sampled with the step of 1 ns. The calculated shifts were averaged over 450 × 11 = 4,950 frames from eleven trajectories. To calibrate chemical shift calculations, we also used a separate 1.23-μs trajectory of ubiquitin. This simulation was conducted in the NVE ensemble using TIP4P-Ew water model95.
Supplementary information
Acknowledgements
This work was supported by the Russian Science Foundation grant 15-14-20038. We acknowledge the Center for Magnetic Resonance in the Research Park of St. Petersburg State University, where we conducted the NMR measurements, and the university Computer Center, where we performed most of the computations. HPLC-MS/MS analysis was performed at the Core Facility of the IBCP RAS “New Materials and Technologies”. We are thankful to Kris Liang at Pepmic Inc. for synthesizing Sos1-X′ and to Sevastyan Rabdano for his advice regarding spectral assignments. The cost of the publication was defrayed by St. Petersburg State University grant 15.61.2221.2013.
Author contributions
N.R.S. and D.A.L. conceived the project. D.A.L. expressed and purified protein material, collected NMR data, as well as other experimental data, and analyzed them. M.I.I. acquired and analyzed LC-MS/MS data, with advice from A.S.K. O.N.R. devised and implemented the new MD protocol based on the platform developed by S.A.I., recorded MD trajectories and analyzed the results. N.R.S. wrote the manuscript with input from all authors.
Data availability
Leap library files for all modified residues described in this paper, force-field parameter modification file and python script to run reactive MD simulations have been deposited to github.com/bionmr-spbu-projects/2019-GRB2-Sos1. The latter script is dependent on MD-control library PYRUN, which is available from github.com/bionmr-spbu/pyrun. PYXMOLPP2 package to process MD trajectories can be downloaded from github.com/bionmr-spbu/pyxmolpp2. The backbone NMR assignments of Grb2 N-SH3 have been deposited in BMRB under accession numbers 50104 (noncovalent complex with Sos1-C) and 50106 (covalent complex with Sos1-X′). All other data reported in this paper are available from the authors upon request.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Dmitrii A. Luzik and Olga N. Rogacheva.
Supplementary information
is available for this paper at 10.1038/s41598-019-56078-7.
References
- 1.Fosgerau K, Hoffmann T. Peptide therapeutics: current status and future directions. Drug Discovery Today. 2015;20:122–128. doi: 10.1016/j.drudis.2014.10.003. [DOI] [PubMed] [Google Scholar]
- 2.Lau JL, Dunn MK. Therapeutic peptides: historical perspectives, current development trends, and future directions. Bioorg Med Chem. 2018;26:2700–2707. doi: 10.1016/j.bmc.2017.06.052. [DOI] [PubMed] [Google Scholar]
- 3.Hermanson, G. T. Bioconjugate Techniques. 2-nd edn (Academic Press, 2008).
- 4.Stebbins JL, et al. Structure-based design of covalent Siah inhibitors. Chem Biol. 2013;20:973–982. doi: 10.1016/j.chembiol.2013.06.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Huhn AJ, Guerra RM, Harvey EP, Bird GH, Walensky LD. Selective covalent targeting of anti-apoptotic BFL-1 by cysteine-reactive stapled peptide inhibitors. Cell Chem Biol. 2016;23:1123–1134. doi: 10.1016/j.chembiol.2016.07.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Harvey EP, et al. Crystal structures of anti-apoptotic BFL-1 and its complex with a covalent stapled peptide inhibitor. Structure. 2018;26:153–160. doi: 10.1016/j.str.2017.11.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Barile E, et al. hBfl-1/hNOXA interaction studies provide new insights on the role of Bfl-1 in cancer cell resistance and for the design of novel anticancer agents. ACS Chem Biol. 2017;12:444–455. doi: 10.1021/acschembio.6b00962. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Yu YS, et al. Targeted covalent inhibition of Grb2-Sos1 interaction through proximity-induced conjugation in breast cancer cells. Mol Pharm. 2017;14:1548–1557. doi: 10.1021/acs.molpharmaceut.6b00952. [DOI] [PubMed] [Google Scholar]
- 9.Hatcher JM, et al. Peptide-based covalent inhibitors of MALT1 paracaspase. Bioorg Med Chem Lett. 2019;29:1336–1339. doi: 10.1016/j.bmcl.2019.03.046. [DOI] [PubMed] [Google Scholar]
- 10.Yang NJ, Hinner MJ. Getting across the cell membrane: an overview for small molecules, peptides, and proteins. Methods Mol Biol. 2015;1266:29–53. doi: 10.1007/978-1-4939-2272-7_3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Palm C, Jayamanne M, Kjellander M, Hallbrink M. Peptide degradation is a critical determinant for cell-penetrating peptide uptake. Biochim Biophys Acta Biomembranes. 2007;1768:1769–1776. doi: 10.1016/j.bbamem.2007.03.029. [DOI] [PubMed] [Google Scholar]
- 12.Baggio C, et al. Design of potent pan-IAP and Lys-covalent XIAP selective inhibitors using a thermodynamics driven approach. J Med Chem. 2018;61:6350–6363. doi: 10.1021/acs.jmedchem.8b00810. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Gambini L, et al. Covalent inhibitors of protein-protein interactions targeting lysine, tyrosine, or histidine residues. J Med Chem. 2019;62:5616–5627. doi: 10.1021/acs.jmedchem.9b00561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Charoenpattarapreeda J, et al. Targeted covalent inhibitors of MDM2 using electrophile-bearing stapled peptides. Chem Commun. 2019;55:7914–7917. doi: 10.1039/c9cc04022f. [DOI] [PubMed] [Google Scholar]
- 15.Chandra K, et al. Covalent inhibition of HIV-1 integrase by N-succinimidyl peptides. ChemMedChem. 2016;11:1987–1994. doi: 10.1002/cmdc.201600190. [DOI] [PubMed] [Google Scholar]
- 16.Marquez BV, et al. Enhancing peptide ligand binding to Vascular Endothelial Growth Factor by covalent bond formation. Bioconjugate Chem. 2012;23:1080–1089. doi: 10.1021/bc300114d. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Assefa D, et al. A new photoreactive antagonist cross-links to the N-terminal domain of the gonadotropin-releasing hormone receptor. Mol Cell Endocrinol. 1999;156:179–188. doi: 10.1016/s0303-7207(99)00123-9. [DOI] [PubMed] [Google Scholar]
- 18.Wittekind M, et al. Solution structure of the Grb2 N-terminal SH3 domain complexed with a ten-residue peptide derived from SOS: Direct refinement against NOEs, J-couplings and 1H and 13C chemical shifts. J Mol Biol. 1997;267:933–952. doi: 10.1006/jmbi.1996.0886. [DOI] [PubMed] [Google Scholar]
- 19.Warshel A, Weiss RM. An empirical valence bond approach for comparing reactions in solutions and in enzymes. J Am Chem Soc. 1980;102:6218–6226. doi: 10.1021/ja00540a008. [DOI] [Google Scholar]
- 20.Shurki A, Derat E, Barrozo A, Kamerlin SCL. How valence bond theory can help you understand your (bio) chemical reaction. Chem Soc Rev. 2015;44:1037–1052. doi: 10.1039/c4cs00241e. [DOI] [PubMed] [Google Scholar]
- 21.Nutt DR, Meuwly M. Studying reactive processes with classical dynamics: Rebinding dynamics in MbNO. Biophys J. 2006;90:1191–1201. doi: 10.1529/biophysj.105.071522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Danielsson J, Meuwly M. Atomistic simulation of adiabatic reactive processes based on multi-state potential energy surfaces. J Chem Theory Comput. 2008;4:1083–1093. doi: 10.1021/ct800066q. [DOI] [PubMed] [Google Scholar]
- 23.Lowenstein EJ, et al. The SH2 and SH3 domain-containing protein GRB2 links receptor tyrosine kinases to ras signaling. Cell. 1992;70:431–442. doi: 10.1016/0092-8674(92)90167-b. [DOI] [PubMed] [Google Scholar]
- 24.Bisson N, et al. Selected reaction monitoring mass spectrometry reveals the dynamics of signaling through the GRB2 adaptor. Nat Biotechnol. 2011;29:653–658. doi: 10.1038/nbt.1905. [DOI] [PubMed] [Google Scholar]
- 25.Batzer AG, Rotin D, Urena JM, Skolnik EY, Schlessinger J. Hierarchy of binding sites for Grb2 and Shc on the epidermal growth factor receptor. Mol Cell Biol. 1994;14:5192–5201. doi: 10.1128/mcb.14.8.5192. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Boriack-Sjodin PA, Margarit SM, Bar-Sagi D, Kuriyan J. The structural basis of the activation of Ras by Sos. Nature. 1998;394:337–343. doi: 10.1038/28548. [DOI] [PubMed] [Google Scholar]
- 27.Cussac D, Frech M, Chardin P. Binding of the Grb2 SH2 domain to phosphotyrosine motifs does not change the affinity of its SH3 domains for Sos proline‐rich motifs. EMBO J. 1994;13:4011–4021. doi: 10.1002/j.1460-2075.1994.tb06717.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kay BK, Williamson MP, Sudol P. The importance of being proline: the interaction of proline-rich motifs in signaling proteins with their cognate domains. FASEB Journal. 2000;14:231–241. doi: 10.1096/fasebj.14.2.231. [DOI] [PubMed] [Google Scholar]
- 29.Bartelt RR, et al. Regions outside of conserved PxxPxR motifs drive the high affinity interaction of GRB2 with SH3 domain ligands. Biochim Biophys Acta Mol Cell Res. 2015;1853:2560–2569. doi: 10.1016/j.bbamcr.2015.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Goudreau N, et al. NMR structure of the N-terminal SH3 domain of GRB2 and its complex with a proline-rich peptide from Sos. Nat Struct Biol. 1994;1:898–907. doi: 10.1038/nsb1294-898. [DOI] [PubMed] [Google Scholar]
- 31.Kohda D, et al. Solution structure and ligand-binding site of the carboxy-terminal SH3 domain of GRB2. Structure. 1994;2:1029–1040. doi: 10.1016/s0969-2126(94)00106-5. [DOI] [PubMed] [Google Scholar]
- 32.Cussac D, et al. A Sos-derived peptidimer blocks the Ras signaling pathway by binding both Grb2 SH3 domains and displays antiproliferative activity. FASEB Journal. 1999;13:31–39. doi: 10.1096/fasebj.13.1.31. [DOI] [PubMed] [Google Scholar]
- 33.Oneyama C, Nakano H, Sharma SV. UCS15A, a novel small molecule, SH3 domain-mediated protein-protein interaction blocking drug. Oncogene. 2002;21:2037–2050. doi: 10.1038/sj/onc/1205271. [DOI] [PubMed] [Google Scholar]
- 34.Oneyama C, et al. Synthetic inhibitors of proline-rich ligand-mediated protein-protein interaction: Potent analogs of UCS15A. Chem Biol. 2003;10:443–451. doi: 10.1016/s1074-5521(03)00101-7. [DOI] [PubMed] [Google Scholar]
- 35.Nguyen JT, Turck CW, Cohen FE, Zuckermann RN, Lim WA. Exploiting the basis of proline recognition by SH3 and WW domains: design of n-substituted inhibitors. Science. 1998;282:2088–2092. doi: 10.1126/science.282.5396.2088. [DOI] [PubMed] [Google Scholar]
- 36.Vidal M, et al. Design of peptoid analogue dimers and measure of their affinity for Grb2 SH3 domains. Biochemistry. 2004;43:7336–7344. doi: 10.1021/bi030252n. [DOI] [PubMed] [Google Scholar]
- 37.Gril B, et al. Grb2-SH3 ligand inhibits the growth of HER2+ cancer cells and has antitumor effects in human cancer xenografts alone and in combination with docetaxel. Int J Cancer. 2007;121:407–415. doi: 10.1002/ijc.22674. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Vidal M, et al. Molecular and cellular analysis of Grb2 SH3 domain mutants: interaction with Sos and dynamin. J Mol Biol. 1999;290:717–730. doi: 10.1006/jmbi.1999.2899. [DOI] [PubMed] [Google Scholar]
- 39.Waudby CA, Ramos A, Cabrita LD, Christodoulou J. Two-dimensional NMR lineshape analysis. Sci Rep. 2016;6:24826. doi: 10.1038/srep24826. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Xue Y, Yuwen TR, Zhu FQ, Skrynnikov NR. Role of electrostatic interactions in binding of peptides and intrinsically disordered proteins to their folded targets. 1. NMR and MD characterization of the complex between the c-Crk N-SH3 domain and the peptide Sos. Biochemistry. 2014;53:6473–6495. doi: 10.1021/bi500904f. [DOI] [PubMed] [Google Scholar]
- 41.Wang CY, Pawley NH, Nicholson LK. The role of backbone motions in ligand binding to the c-Src SH3 domain. J Mol Biol. 2001;313:873–887. doi: 10.1006/jmbi.2001.5083. [DOI] [PubMed] [Google Scholar]
- 42.Martin-Sierra FM, et al. A binding event converted into a folding event. FEBS Lett. 2003;553:328–332. doi: 10.1016/s0014-5793(03)01038-x. [DOI] [PubMed] [Google Scholar]
- 43.Williamson MP. Using chemical shift perturbation to characterise ligand binding. Prog NMR Spectrosc. 2013;73:1–16. doi: 10.1016/j.pnmrs.2013.02.001. [DOI] [PubMed] [Google Scholar]
- 44.Klont F, et al. Assessment of sample preparation bias in mass spectrometry-based proteomics. Anal Chem. 2018;90:5405–5413. doi: 10.1021/acs.analchem.8b00600. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Lindley H. Study of the kinetics of the reaction between thiol compounds and chloroacetamide. Biochem J. 1960;74:577–584. doi: 10.1042/bj0740577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Hains PG, Robinson PJ. The impact of commonly used alkylating agents on artifactual peptide modification. J Proteome Res. 2017;16:3443–3447. doi: 10.1021/acs.jproteome.7b00022. [DOI] [PubMed] [Google Scholar]
- 47.Wales TE, Engen JR. Partial unfolding of diverse SH3 domains on a wide timescale. J Mol Biol. 2006;357:1592–1604. doi: 10.1016/j.jmb.2006.01.075. [DOI] [PubMed] [Google Scholar]
- 48.Rabdano, S. O. et al. Onset of disorder and protein aggregation due to oxidation-induced intermolecular disulfide bonds: case study of RRM2 domain from TDP-43. Sci Rep7, 11161 10.1038/s41598-017-10574-w (2017). [DOI] [PMC free article] [PubMed]
- 49.Maier JA, et al. ff14SB: improving the accuracy of protein side chain and backbone parameters from ff99SB. J Chem Theory Comput. 2015;11:3696–3713. doi: 10.1021/acs.jctc.5b00255. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Bayly CI, Cieplak P, Cornell WD, Kollman PA. A well-behaved electrostatic potential based method using charge restraints for deriving atomic charges: the RESP model. J Phys Chem. 1993;97:10269–10280. doi: 10.1021/j100142a004. [DOI] [Google Scholar]
- 51.Wang JM, Wang W, Kollman PA, Case DA. Automatic atom type and bond type perception in molecular mechanical calculations. J Mol Graph Model. 2006;25:247–260. doi: 10.1016/j.jmgm.2005.12.005. [DOI] [PubMed] [Google Scholar]
- 52.Simon K, Xu J, Kim C, Skrynnikov NR. Estimating the accuracy of protein structures using residual dipolar couplings. J Biomol NMR. 2005;33:83–93. doi: 10.1007/s10858-005-2601-7. [DOI] [PubMed] [Google Scholar]
- 53.Case, D. A. et al. AMBER 16. (University of California, 2016).
- 54.Wu XD, et al. Structural basis for the specific interaction of lysine-containing proline-rich peptides with the N-terminal SH3 domain of c-Crk. Structure. 1995;3:215–226. doi: 10.1016/s0969-2126(01)00151-4. [DOI] [PubMed] [Google Scholar]
- 55.Marchi M, Ballone P. Adiabatic bias molecular dynamics: a method to navigate the conformational space of complex molecular systems. J Chem Phys. 1999;110:3697–3702. doi: 10.1063/1.478259. [DOI] [Google Scholar]
- 56.Spronk C, Nabuurs SB, Krieger E, Vriend G, Vuister GW. Validation of protein structures derived by NMR spectroscopy. Prog NMR Spectrosc. 2004;45:315–337. doi: 10.1016/j.pnmrs.2004.08.003. [DOI] [Google Scholar]
- 57.Williamson MP, Kikuchi J, Asakura T. Application of 1H NMR chemical shifts to measure the quality of protein structures. J Mol Biol. 1995;247:541–546. doi: 10.1016/S0022-2836(05)80135-4. [DOI] [PubMed] [Google Scholar]
- 58.Vila JA, Villegas ME, Baldoni HA, Scheraga HA. Predicting 13Cα chemical shifts for validation of protein structures. J Biomol NMR. 2007;38:221–235. doi: 10.1007/s10858-007-9162-x. [DOI] [PubMed] [Google Scholar]
- 59.Sahakyan AB, Vranken WF, Cavalli A, Vendruscolo M. Using side-chain aromatic proton chemical shifts for a quantitative analysis of protein structures. Angew Chem, Int Ed. 2011;50:9620–9623. doi: 10.1002/anie.201101641. [DOI] [PubMed] [Google Scholar]
- 60.Berjanskii M, Zhou JJ, Liang YJ, Lin GH, Wishart DS. Resolution-by-proxy: a simple measure for assessing and comparing the overall quality of NMR protein structures. J Biomol NMR. 2012;53:167–180. doi: 10.1007/s10858-012-9637-2. [DOI] [PubMed] [Google Scholar]
- 61.Koes DR, Vries JK. Evaluating amber force fields using computed NMR chemical shifts. Proteins. 2017;85:1944–1956. doi: 10.1002/prot.25350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Spera S, Bax A. Empirical correlation between protein backbone conformation and Cα and Cβ 13C nuclear magnetic resonance chemical shifts. J Am Chem Soc. 1991;113:5490–5492. doi: 10.1021/ja00014a071. [DOI] [Google Scholar]
- 63.Dalgarno DC, Levine BA, Williams RJP. Structural information from NMR secondary chemical shifts of peptide α C-H protons in proteins. Biosci Rep. 1983;3:443–452. doi: 10.1007/bf01121955. [DOI] [PubMed] [Google Scholar]
- 64.Shen Y, Bax A. SPARTA+: a modest improvement in empirical NMR chemical shift prediction by means of an artificial neural network. J Biomol NMR. 2010;48:13–22. doi: 10.1007/s10858-010-9433-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Brunger AT. Free R value: cross-validation in crystallography. Methods Enzymol. 1997;277:366–396. doi: 10.1016/s0076-6879(97)77021-6. [DOI] [PubMed] [Google Scholar]
- 66.Vijay-Kumar S, Bugg CE, Cook WJ. Structure of ubiquitin refined at 1.8 Å resolution. J Mol Biol. 1987;194:531–544. doi: 10.1016/0022-2836(87)90679-6. [DOI] [PubMed] [Google Scholar]
- 67.Li DW, Bruschweiler R. PPM: a side-chain and backbone chemical shift predictor for the assessment of protein conformational ensembles. J Biomol NMR. 2012;54:257–265. doi: 10.1007/s10858-012-9668-8. [DOI] [PubMed] [Google Scholar]
- 68.Maltsev AS, Grishaev A, Roche J, Zasloff M, Bax A. Improved cross validation of a static ubiquitin structure derived from high precision residual dipolar couplings measured in a drug-based liquid crystalline phase. J Am Chem Soc. 2014;136:3752–3755. doi: 10.1021/ja4132642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Rosta E, Klahn M, Warshel A. Towards accurate ab initio QM/MM calculations of free-energy profiles of enzymatic reactions. J Phys Chem B. 2006;110:2934–2941. doi: 10.1021/jp057109j. [DOI] [PubMed] [Google Scholar]
- 70.Smith KD, Stoliarov SI, Nyden MR, Westmoreland PR. RMDff: A smoothly transitioning, forcefield-based representation of kinetics for reactive molecular dynamics simulations. Mol Simul. 2007;33:361–368. doi: 10.1080/08927020601156392. [DOI] [Google Scholar]
- 71.Paci E, Karplus M. Forced unfolding of fibronectin type 3 modules: an analysis by biased molecular dynamics simulations. J Mol Biol. 1999;288:441–459. doi: 10.1006/jmbi.1999.2670. [DOI] [PubMed] [Google Scholar]
- 72.Izrailev S, Stepaniants S, Balsera M, Oono Y, Schulten K. Molecular dynamics study of unbinding of the avidin-biotin complex. Biophys J. 1997;72:1568–1581. doi: 10.1016/s0006-3495(97)78804-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Scarpino A, Ferenczy GG, Keseru GM. Comparative evaluation of covalent docking tools. J Chem Inf Model. 2018;58:1441–1458. doi: 10.1021/acs.jcim.8b00228. [DOI] [PubMed] [Google Scholar]
- 74.Draper SRE, et al. Polyethylene glycol based changes to β-sheet protein conformational and proteolytic stability depend on conjugation strategy and location. Bioconjugate Chem. 2017;28:2507–2513. doi: 10.1021/acs.bioconjchem.7b00281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Morimoto, D., Walinda, E., Fukada, H., Sugase, K. & Shirakawa, M. Ubiquitylation directly induces fold destabilization of proteins. Sci Rep6, 10.1038/srep39453 (2016). [DOI] [PMC free article] [PubMed]
- 76.Foulkes Daniel M., Byrne Dominic P., Yeung Wayland, Shrestha Safal, Bailey Fiona P., Ferries Samantha, Eyers Claire E., Keeshan Karen, Wells Carrow, Drewry David H., Zuercher William J., Kannan Natarajan, Eyers Patrick A. Covalent inhibitors of EGFR family protein kinases induce degradation of human Tribbles 2 (TRIB2) pseudokinase in cancer cells. Science Signaling. 2018;11(549):eaat7951. doi: 10.1126/scisignal.aat7951. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Sattler M, Schleucher J, Griesinger C. Heteronuclear multidimensional NMR experiments for the structure determination of proteins in solution employing pulsed field gradients. Prog NMR Spectrosc. 1999;34:93–158. doi: 10.1016/s0079-6565(98)00025-9. [DOI] [Google Scholar]
- 78.Schanda P, Van Melckebeke H, Brutscher B. Speeding up three-dimensional protein NMR experiments to a few minutes. J Am Chem Soc. 2006;128:9042–9043. doi: 10.1021/ja062025p. [DOI] [PubMed] [Google Scholar]
- 79.Delaglio F, et al. NMRPipe: a multidimensional spectral processing system based on unix pipes. J Biomol NMR. 1995;6:277–293. doi: 10.1007/bf00197809. [DOI] [PubMed] [Google Scholar]
- 80.Muranov KO, et al. The mechanism of the interaction of α-crystallin and UV-damaged βL-crystallin. Int J Biol Macromol. 2019;140:736–748. doi: 10.1016/j.ijbiomac.2019.08.178. [DOI] [PubMed] [Google Scholar]
- 81.Kravchuk OI, et al. Characterization of the 20S proteasome of the lepidopteran, Spodoptera frugiperda. Biochim Biophys Acta Proteins Proteom. 2019;1867:840–853. doi: 10.1016/j.bbapap.2019.06.010. [DOI] [PubMed] [Google Scholar]
- 82.Becke AD. Density-functional thermochemistry. III. The role of exact exchange. J Chem Phys. 1993;98:5648–5652. doi: 10.1063/1.464913. [DOI] [Google Scholar]
- 83.Stephens PJ, Devlin FJ, Chabalowski CF, Frisch MJ. Ab initio calculation of vibrational absorption and circular dichroism spectra using density functional force fields. J Phys Chem. 1994;98:11623–11627. doi: 10.1021/j100096a001. [DOI] [Google Scholar]
- 84.Hehre WJ, Ditchfield R, Pople JA. Self-consistent molecular orbitals methods. XII. Further extensions of Gaussian-type basis sets for use in molecular orbital studies of orgnaic molecules. J Chem Phys. 1972;56:2257–2261. doi: 10.1063/1.1677527. [DOI] [Google Scholar]
- 85.Hariharan PC, Pople JA. Influence of polarization functions on molecular orbital hydrogenation energies. Theor Chim Acta. 1973;28:213–222. doi: 10.1007/bf00533485. [DOI] [Google Scholar]
- 86.Arnold WD, et al. Experimental, Hartree-Fock, and density functional theory investigations of the charge density, dipole moment, electrostatic potential, and electric field gradients in L-asparagine monohydrate. J Am Chem Soc. 2000;122:4708–4717. doi: 10.1021/ja000386d. [DOI] [Google Scholar]
- 87.Momany FA. Determination of partial atomic charges from ab initio molecular electrostatic potentials. Application to formamide, methanol, and formic acid. J Phys Chem. 1978;82:592–601. doi: 10.1021/j100494a019. [DOI] [Google Scholar]
- 88.Cornell WD, Cieplak P, Bayly CI, Kollman PA. Application of RESP charges to calculate conformational energies, hydrogen bond energies, and free energies of solvation. J Am Chem Soc. 1993;115:9620–9631. doi: 10.1021/ja00074a030. [DOI] [Google Scholar]
- 89.Wang JM, Wolf RM, Caldwell JW, Kollman PA, Case DA. Development and testing of a general Amber force field. J Comput Chem. 2004;25:1157–1174. doi: 10.1002/jcc.20035. [DOI] [PubMed] [Google Scholar]
- 90.Gaussian 16 Rev. B.01 (Wallingford, CT, 2016).
- 91.Bas DC, Rogers DM, Jensen JH. Very fast prediction and rationalization of pKa values for protein-ligand complexes. Proteins. 2008;73:765–783. doi: 10.1002/prot.22102. [DOI] [PubMed] [Google Scholar]
- 92.Izmailov SA, Podkorytov IS, Skrynnikov NR. Simple MD-based model for oxidative folding of peptides and proteins. Sci Rep. 2017;7:9293. doi: 10.1038/s41598-017-09229-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Shirts MR, et al. Lessons learned from comparing molecular dynamics engines on the SAMPL5 dataset. J Comput Aid Mol Des. 2017;31:147–161. doi: 10.1007/s10822-016-9977-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Roe DR, Cheatham TE. PTRAJ and CPPTRAJ: software for processing and analysis of Molecular Dynamics trajectory data. J Chem Theory Comput. 2013;9:3084–3095. doi: 10.1021/ct400341p. [DOI] [PubMed] [Google Scholar]
- 95.Horn HW, et al. Development of an improved four-site water model for biomolecular simulations: TIP4P-Ew. J Chem Phys. 2004;120:9665–9678. doi: 10.1063/1.1683075. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Leap library files for all modified residues described in this paper, force-field parameter modification file and python script to run reactive MD simulations have been deposited to github.com/bionmr-spbu-projects/2019-GRB2-Sos1. The latter script is dependent on MD-control library PYRUN, which is available from github.com/bionmr-spbu/pyrun. PYXMOLPP2 package to process MD trajectories can be downloaded from github.com/bionmr-spbu/pyxmolpp2. The backbone NMR assignments of Grb2 N-SH3 have been deposited in BMRB under accession numbers 50104 (noncovalent complex with Sos1-C) and 50106 (covalent complex with Sos1-X′). All other data reported in this paper are available from the authors upon request.