Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Jul 1.
Published in final edited form as: J Am Soc Mass Spectrom. 2020 Jun 22;31(7):1544–1553. doi: 10.1021/jasms.0c00131

Protein-ligand affinity determinations using covalent labeling-mass spectrometry

Tianying Liu 1, Tyler M Marcinko 1, Richard W Vachet 1,*
PMCID: PMC7332385  NIHMSID: NIHMS1603717  PMID: 32501685

Abstract

Determining binding affinity is an important aspect of charactering protein-ligand complexes. Here, we describe an approach based on covalent labeling (CL)-mass spectrometry (MS) that can accurately provide protein-ligand dissociation constants (Kd values), using diethylpyrocarbonate (DEPC) as the labeling reagent. Even though DEPC labeling reactions occur on a timescale that is similar to the dissociation/re-association rates of many protein-ligand complexes, we demonstrate that relatively accurate binding constants can still be obtained as long as the extent of protein labeling is kept below 30%. Using two well-established model systems and one insufficiently characterized system, we find that Kd values can be determined that are close to values obtained in previous measurements. The CL-MS based strategy that is described here should serve as an alternative for characterizing protein-ligand complexes that are challenging to measure by other methods. Moreover, this method has the potential to provide simultaneously affinity and binding site information.

Graphical Abstract

graphic file with name nihms-1603717-f0001.jpg

Covalent labeling/mass spectrometry along with ligand titrations can be used to determine protein-ligand dissociation constants while at the same time identifying protein-ligand binding sites.

Introduction

Understanding the details of protein-ligand interactions has implications for drug discovery, design, and development.1,2 As a part of characterizing protein-ligand complexes, binding affinity determinations are important. To acquire the binding affinity of a ligand for a protein, mass spectrometry (MS) based methods have been explored as an option because of the advantages of the technique, which include high specificity, small sample consumption, and simplicity (e.g. no protein or ligand immobilization is needed).3 Native electrospray ionization mass spectrometry (ESI-MS) allows noncovalent protein-ligand complexes to be transferred intact from solution to the gas phase,4-6 which makes thermochemical7,8 and ESI-MS-based titration experiments possible.9,10 While such titration experiments have been used successfully for many different protein-ligand systems, some challenges can exist, such as differences in protein and protein-ligand complex ionization efficiencies, in-source dissociation of the complex, or false positives caused by non-specific ligand binding.3

Other MS-based methods that encode protein-ligand binding information into the mass of the protein have also been developed. Hydrogen deuterium exchange (HDX)-based methods have been used to measure the dissociation constants (Kd) for protein-ligand complexes. One of the first examples was the method known as SUPREX (or stability of unpurified proteins from rates of H/D exchange),11 which utilizes denaturants to monitor the relative stabilities of a protein and its ligand complex to obtain a Kd value.10 Another HDX-MS method to determine Kd values is PLIMSTEX,13,14 which relies on labeling in the context of ligand titration. PLIMSTEX also offers information about protein-ligand stoichiometries and dynamics at the same time. When peptide-level information is obtained during these experiments, the binding site can also often be acquired, although sometimes the obtained binding site information can be ambiguous due to changes in protein dynamics that can occur distant from the binding site.

For HDX-based methods, back exchange is a potential concern, and the need to measure multiple exchange time points can make the measurements time consuming. Covalent labeling (CL) methods can avoid these challenges, as the covalent bond that is formed limits label loss, multiple time-point measurements are not needed, and residue-level binding site information can be obtained.15-19 An early example of a CL-MS approach that was used to determine protein-ligand affinities is SPROX (or stability of proteins from rates of oxidation), which uses hydrogen peroxide in the presence of increasing concentrations of denaturant to oxidize proteins with and without ligand present as a means of determining Kd.20 More recently, a method based on fast photochemical oxidation of proteins (FPOP) called LITPOMS (ligand titration, fast photochemical oxidation of proteins and mass spectrometry) was reported.21,22 LITPOMS uses hydroxyl radicals to modify the protein of interest, and then relies on decreased protein oxidative modifications at the ligand-binding site as the ligand concentration is increased. The method yields information about ligand binding affinities, binding sites, and even ligand-induced structural changes when combined with bottom-up sequencing. As compared to HDX-based methods, CL methods like LITPOMS require little sample dilution, making it easier to study weakly binding protein-ligand complexes and identify binding sites.

Given the potential advantages of CL for determining Kd values, we were interested in assessing if other CL methods and reagents could provide reliable protein-ligand binding constants. In particular, we were interested in investigating CL methods that do not require sophisticated equipment such as lasers or flow cells. It is known that CL-MS, including non-radical CL methods, are able to identify the ligand binding sites of proteins by identifying residues that undergo decreases in labeling due to ligand-induced decreases in solvent accessibility of the residues involved in binding.15,23-26 If such CL methods are applied in a ligand titration context, Kd values should be accessible.

A key concern with non-radical based labeling reagents is their relatively slow reaction kinetics, which are on the order of milliseconds to minutes16,27 as compared to radical-based reagents that react on the sub-millisecond timescale.28,29 The slower reaction timescale might mean that the ligand dissociation and re-association processes that are a normal part of the equilibrium could occur on comparable timescales, yielding incorrect Kd information because the labeling could distort the original binding equilibrium. Such a concern is presumably not present with radical-based methods such as LITPOMS, because labeling happens on a faster timescale than ligand dissociation/re-association. We predict, though, that relatively accurate binding constants can still be obtained using slower labeling reagents as long as the extent of protein labeling at the binding site is minimized enough to prevent significant label-induced changes to the equilibrium. In this work, we use several model protein-ligand complexes to demonstrate this idea. We find that reasonably accurate Kd values can be determined when the extent of protein labeling is kept relatively low.

Materials and Methods

Materials.

The maltose binding protein (MBP) was obtained from MyBioSource (San Diego, CA). Human full-length β-2-microglobulin (β2m) was purchased from Lee Biosolutions (Maryland Heights, MO). Lysozyme from chicken egg white and the following chemicals were obtained from MilliporeSigma (St.Louis, MO): DL-dithiothreitol (DTT), diethylpyrocarbonate (DEPC), dimethyl(2-hydroxy-5-nitrobenzyl)sulfonium bromide (HNSB), epigallocatechin gallate (EGCG), guanidine hydrochloride (GuHCl), imidazole, iodoacetamide, maltose monohydrate, MOPS, MOPS sodium salt, N,N’,N” triacetylchitotriose (NAG3), L-tryptophan, and urea. Acetonitrile, copper sulfate (CuSO4), formic acid, potassium acetate, sodium phosphate, sodium phosphate monobasic monohydrate, HPLC grade water, and a 1 M Tris buffer (pH 8) stock solution were all purchased from Fisher Scientific (Fair Lawn, NJ). Centricon molecular weight cutoff (MWCO) filters were obtained from Millipore (Burlington, MA). Sequencing grade modified trypsin and sequencing grade chymotrypsin were obtained from Promega (Madison, WI).

Sample Preparation.

Samples containing 30 μM lysozyme were prepared in a 20 mM MOPS (pH 7.4) buffer, and stock solutions of NAG3, which binds lysozyme, were prepared in HPLC grade water. Samples containing 24 μM MBP were prepared in 20 mM sodium phosphate buffer (pH 7.5), and stock solutions of maltose were prepared in the same buffer. Samples containing 30 μM β2m were prepared in 25 mM MOPS buffer (pH 7.4), 60 μM Cu(II) (CuSO4), 500 mM urea, and 200 mM potassium acetate, which are conditions under which the protein can aggregate to form amyloid fibrils. EGCG stock solutions were prepared in water. Different concentrations of the small molecule ligands were mixed with the corresponding protein to conduct the titration experiments. All samples were equilibrated at room temperature (22 °C) for 1 to 5 min after the protein was mixed with the ligand and before performing the CL reactions.

HNSB Labeling.

HNSB stock solutions were prepared in water immediately before use. The protein labeling reactions were initiated by adding an aliquot of the HNSB stock solution to the buffered protein-ligand sample. The reactions were allowed to proceed at room temperature for a defined time period before quenching them via the addition of tryptophan to a final concentration of 5 mM. The concentrations of HNSB and reaction times were varied to examine the effect of different labeling extents on the measured Kd values

DEPC Labeling.

DEPC stock solutions were freshly prepared in acetonitrile for each experiment. The protein labeling reactions were initiated by adding an aliquot of the DEPC stock solution to the buffered protein-ligand sample, with the final DEPC concentration depending on the protein and the experiment. The final volume percentage of acetonitrile was less than 1% in all experiments. The reactions were allowed to proceed at 37 °C for 1 min, before being quenched by the addition of imidazole at a final concentration of 10 mM. These labeling conditions were similar to our group’s previous studies with this CL reagent.27,30

Proteolytic Digestion.

CL-modified proteins that were subjected to proteolytic digestion and LC/MS/MS analysis were prepared by the following procedure. Before denaturing the protein, samples were diluted with water to a volume of 400 μL and then were concentrated using a 10,000 MWCO filter to a final volume of 40 μL. Samples containing MBP were reconstituted with 0.25 M Tris (pH 8.0) and 6 M GuHCl to a final volume of 100 μL. After incubating at 55 °C for 1 h to denature the protein, the samples were allowed to cool before being diluted again by 300 μL of water. Then, the samples were concentrated by a 10,000 MWCO filter to a final volume of 40 μL. The dilute-concentrate procedures were repeated one more time to reduce the concentration of GuHCl and increase the concentration of the protein. The 40 μL samples were then reconstituted in a 0.1 M Tris (pH 8.0) buffer and digested by trypsin for 2 h at 37 °C, if the protein was labeled by HNSB, or by chymotrypsin for 2 h at 25 °C, if the protein was labeled by DEPC.

The samples containing β2m were reconstituted to a final concentration of 0.1 M Tris, 13% (v/v) acetonitrile, and 10 mM DTT. After incubating at 55 °C for 1 h, the sample was allowed to cool, and the reduced disulfide bond was alkylated by 14 mM iodoacetamide (prepared in a Tris buffer, pH 8.0) in the dark at room temperature for 15 min. The denatured and reduced protein was then digested by chymotrypsin for 2 h at 25 °C.

Liquid Chromatography–Mass Spectrometry (LC/MS).

For intact protein analyses, measurements were performed on a Bruker (Billerica, MA) AmaZon quadrupole ion trap mass spectrometer equipped with an electrospray ionization source. Typically, the electrospray needle voltage was kept at ~4 kV, and the capillary temperature was set at 220 °C. HPLC separations were performed using a Thermo Scientific (Waltham, MA) Ultimate 3000 HPLC with an OPTI-TRAP MICRO column (1 mm x 12 mm, Optimize Technologies Inc., Oregon City, OR). The proteins were eluted using a gradient of acetonitrile containing 0.1% formic acid that increased from 5 to 99% for 10 min at the flow rate of 50 μL/min.

For peptide mixture analyses, measurements were performed by LC-MS on a Thermo Scientific Orbitrap Fusion mass spectrometer equipped with a nano-electrospray ionization source. The needle voltage was set at 2.1 kV, and the ion transfer tube temperature was set at 325 °C. The resolution of the Orbitrap was set to 60,000, the MSI AGC target was set at 4 × 105 ions with a maximum injection time of 50 ms. Collision-induced dissociation (CID) with a normalized collision energy of 35% was used for tandem mass spectrometry (MS/MS). Data-dependent selection for precursor ions with ion abundances above 5,000 was used, and a dynamic exclusion of 30 s was activated after 3 spectra were acquired for any given precursor ion within 5 s. The MS/MS AGC target and maximum injection time were set to 5 × 104 ions and 100 ms, respectively. HPLC separations were conducted by a Thermo Scientific Easy-NanoLC 1000 system with a Thermo Scientific Acclaim PepMap C18 nanocolumn (15 cm x 75 μm ID, 2 μm, 100 Å). Peptides were eluted using a gradient of acetonitrile containing 0.1% formic acid that increased from 0 to 50% for 60 min at the flow rate of 0.3 μL/min.

Determination of Covalent Labeling Modification Percentages.

Residue level CL modification percentages (% labeling) were calculated by integrating the peak area of eluting peptides (i.e., modified or unmodified) using extracted ion chromatograms, as described in our previous work.26,30 The % labeling was calculated based on equation 1.

%labeling=i=1nz=1mAi,zmodifiedi=1nz=1mAi,zmodified+i=1nz=1mAi,zunmodified×100 (1)

Ai,z represents the peak area of the peptide of interest. Peak areas from all of the measurable peptides (i) that contain the residue of interest, labeled or not, and all detectable charge states (z) are included. The determined modification percentages are relative rather than absolute values because the modified and unmodified peptides have different ionization efficiencies and LC elution times. It should be noted that the % labeling in equation 1 was only used for peptides in which all the charge states that were summed were present in all samples.

To determine intact protein CL modification percentages, the calculation is similar but simplified. Averaged ion abundances during protein elution were used instead (equation 2), and the ion abundance of multiple charge states were summed.

%labeling=zImodifiedzImodified+zIunmodified×100 (2)

In determining whether two labeling results were different, an unpaired student t-test was applied with a 99% confidence interval.

Calculation of Kd.

In this work, we focus on the most frequently encountered case where only one ligand (L) binds to one protein (P), and only one kind of protein-ligand complex (PL) is formed. The dissociation constant (Kd) in such a system can be described by equation 3.

Kd=[P][L][PL] (3)

where [P], [L], and [PL] are the equilibrium concentrations of P, L, and the complex PL, respectively. The total concentration of ligand ([L]0) and protein ([P]0) has the following concentration relationship with related fractions in the system, respectively (equations 4 and 5):

[L]0=[L]+[PL] (4)
[P]0=[P]+[PL] (5)

Because ligand binding decreases the extent of labeling of the residues at the binding interface, we assume there is a relationship between the covalent labeling result (represented by % labeling) and the concentration of ligand-free protein in the system ([P]). This relationship can be normalized to the extent of labeling when the protein is 100% bound to the ligand (i.e. maximum ligand) and when no ligand is present. The resulting relationship is represented by equation 6:

[P][P]0=Δ%labeling%labelingmaximunligand%labelingnoligand (6)

By algebraically combining equations 3-6, the following relationship that relates the relative change in labeling to Kd in terms that include the total ligand and protein concentrations (equation 7) can be derived.

relativelabelingfractionalchange=Δ%labeling%labelingmaximunligand%labelingnoligand=(([L]0+[P]0+Kd)(([L]0+[P]0+Kd)24×[P]0×[[L]0])0.5)2[P]0 (7)

A plot of the relative labeling fractional change, based on labeling percentage measurements, as a function of total ligand concentration used during the titration experiments can be used to determine Kd. Fitting of equation 7 was performed using a customized non-linear curve fit in Origin 8 (OriginLab Corporation, Northampton, MA). The results from each experimental replicate were fit individually, and then an averaged Kd value from at least three experimental replicates is reported.

Solvent accessible surface area (SASA) calculations.

Solvent accessible surface areas (SASA) for individual residues were calculated from a given protein’s PDB structure using GETAREA.31 The probe radius was set to 1.4 Å with no gradient calculation applied. The reported SASA percentage is the ratio of side-chain surface area in the protein to the random coil value for that same residue in the tripeptide Gly-X-Gly.

Results and Discussion

Effect of Labeling Extent on the Apparent Kd Value.

We predict that CL-MS based on slow labeling reactions is capable of determining the Kd values of a protein-ligand complex as long as the overall fraction of labeled protein is kept at low levels such that the perturbation to the equilibrium is limited. To illustrate this idea, consider a hypothetical protein-ligand system with a real Kd value of 10 μM (Kd1) that upon CL of the free protein results in a significantly weaker interaction with the ligand and a new Kd value of 1000 μM (Kd2). The concentration relationship between each fraction in the system can be described by equations 8-11, where [L] is the free ligand concentration at equilibrium, [P] is the unlabeled protein concentration, [P’] is the labeled protein concentration, [PL] is the concentration of the unlabeled protein-ligand complex, [P’L] is the concentration of the labeled protein-ligand complex, [L0] is total ligand concentration, and [P0] is the initial total protein concentration.

[L]+[PL]+[PL]=[L0] (8)
[P]+[P]+[PL]+[PL]=[P0] (9)
Kd1=[P][L][PL] (10)
Kd2=[P][L][PL] (11)

If one considers the percentage of the labeled protein as described in equation 12, then the apparent Kd will be defined by equation 13.

%labeling=[P]+[PL][P0] (12)
AppKd=[P+P]×[L][PL+PL] (13)

The relationship between the apparent Kd and the percent labeling can then be plotted for any labeling extent between 0 and 100% (Figure 1). Not surprisingly, when there is no labeling, the apparent Kd is equal to the real Kd (i.e. 10 μM in this hypothetical situation), whereas at 100% labeling the apparent Kd is equal to the new Kd of the more weakly binding labeled protein (i.e. 1000 μM in this hypothetical situation). More interesting is the observation that when the percent labeling is kept low (e.g. below 30%), the apparent Kd differs from the real Kd by less than a factor of 2, meaning that as long as the extent of labeling is kept low, reasonably accurate Kd values can be obtained. It should be pointed out that often the variation in measured Kd values for protein-ligand complexes differ by more than a factor of 2 when different measurement methods are compared.32 Of course, if the percent labeling is much higher (e.g. above 70%), the apparent Kd value is more than an order of magnitude higher than the real Kd value, which is typically unacceptably inaccurate.

Figure 1:

Figure 1:

(Top) Relationship between the apparent Kd and the real Kd for a hypothetical model in which the real Kd is 10 μM and the Kd upon 100% labeling is 1000 μM, upon considering equations 8-13 in the text. (Bottom) Expanded view of the theoretical plot, illustrating that when the percent labeling is about below 60%, the apparent Kd is within 5-fold of the real Kd, and is within 2-fold of the real Kd when the labeling percent is below 30%.

Lysozyme-NAG3 Binding.

To test our idea, the first model system that we selected was the NAG3-lysozyme complex. Because the NAG3 binding site includes two Trp residues, Trp62 and Trp63 (PDB 1HEW),33,34 we used HNSB, which is a Trp-specific labeling reagent. Reactions of lysozyme with HNSB in the presence and absence of NAG3 result in only Trp62 being labeled (Figure 2a) as indicated by LC/MS/MS data (Figure S1). The solvent accessible surface area (SASA) of Trp63 is very low, hindering its reaction with HNSB. Because only Trp62 is labeled, the labeling ratio of the intact protein could be used during the ligand titration experiments.

Figure 2.

Figure 2.

HNSB covalent labeling of intact lysozyme (30 μM) in the presence of (NAG)3 using a 1.5 mM concentration of HNSB and 10 s reaction time. (a) Lysozyme surface structure with the (NAG)3 binding site shown from PDB 1HEW. Trp62 is indicated in blue, and (NAG)3 is in orange. (b) HNSB covalent labeling percentage of intact lysozyme as a function of (NAG)3 concentrations. (c) HNSB labeling results fit using equation 7, resulting in a lysozyme-(NAG)3 Kd of 14 ± 2 μM. These experiments were done in triplicate. Fitting statistics for are found in Table S1 in the Supporting Information.

To determine the Kd value using CL-MS, a titration curve was generated in which the NAG3 concentration was varied between 0 and 600 μM at a fixed lysozyme concentration of 30 μM. HNSB was added at a concentration of 1.5 mM and allowed to react with the protein for 10 s at each NAG3 concentration. This titration experiment resulted in a decrease in the percent labeling with increased total ligand concentration (Figure 2b). As expected, increasing NAG3 concentrations lead to more protein molecules in which Trp62 is protected from labeling, leading to a decrease in the percent labeling. The HNSB labeling data were then plotted and fit using equation 7 (Figure 2c), and a Kd of 14 ± 2 μM is obtained. Previous measurements of the lysozyme-NAG3 Kd by methods such as fluorescence, UV, and ESI-MS provide values that range from 6 to 60 μM.32,35-39 Our solution conditions are most similar to those reported by Zenobi and co-workers32 in which they obtained a Kd value of 20 ± 4 μM. Clearly, our CL approach is able to afford a value that is consistent with prior studies.

To investigate if the CL modification extent significantly perturbs the ligand-binding equilibration (leading to variations in the apparent Kd), additional HNSB labeling conditions were explored in the context of the same titration experiment. Labeling with 1.5 mM HNSB for 30 s and 0.75 mM HNSB for 10 s resulted in more and less extensive labeling, respectively, than the conditions used to generate Figure 2b. These resulting titration curves are shown in Figure 3a, and the corresponding fits to equation 7 are shown in Figures 3b and 3c. Comparing the Kd values from the three labeling conditions, the results are not significantly different from each other at 95% confidence interval. Moreover, there is no clear trend in the Kd value with the extent of labeling, indicating that the labeling extent does not have a strong influence on the resulting Kd value at these low labeling extents. The reason for this is likely the fact that the extent of the protein labeling at higher ligand concentrations is very low (< 10% for almost all ligand concentrations), resulting in very minimal perturbation to the protein-ligand binding equilibrium. Indeed, HNSB concentrations 10 times higher and reaction times twice as long do not lead to labeling levels beyond 20% when the ligand is present at high concentrations (data not shown).

Figure 3.

Figure 3.

HNSB covalent labeling of intact lysozyme (30 μM) in the presence of (NAG)3 at different extents of labeling. (a) HNSB labeling extent for intact lysozyme at three different labeling conditions as a function of (NAG)3 concentration. (b) HNSB labeling at 1.5 mM HNSB and a 30 s reaction time fit to equation 7, resulting in a lysozyme-(NAG)3 Kd of 19 ± 2 μM. (c) HNSB labeling at 0.75 mM HNSB and a 10 s reaction time fit to equation 7, resulting in a lysozyme-(NAG)3 Kd of 17 ± 4 μM. The experiments in each case were done in triplicate. Fitting statistics for are found in Table S1 in the Supporting Information.

Maltose-Binding Protein-Maltose Binding.

The second model system that we investigated was the maltose binding protein (MBP) bound to maltose (Figure 4a). This system allowed us to investigate if accurate Kd values can be acquired by using the extent of labeling at multiple residues. HNSB (1.0 mM) and DEPC (300 μM) were separately used to label the protein to an average labeling extent of 1 to 1.2 labels per protein in the absence of the ligand. Upon comparing MBP labeling in the presence and absence of maltose, we find that Trp340 from the HNSB labeling and Lys297 from the DEPC labeling show statistically significant decreases upon ligand binding (Figure S2a). These residues are expected to decrease in labeling because based on the crystal structures for the protein and its complex with maltose, their SASA values all decrease upon ligand binding.40 The SASAs of Trp230 and Trp232 also decrease upon ligand binding (Table S2) and are labeled by HNSB, but it was difficult to get reliable labeling extents for these residue because the peptides containing these residue had significant day-to-day variations in the degree of oxidation at Met224, which is present in the same peptide as these residues. The reasons for these variations in Met224 oxidation are unclear.

Figure 4.

Figure 4.

(a) Surface structure of MBP in the ligand-free “open” state (PDB 1OMP) and the ligand-bound “closed” state (PDB 1ANF). (b) HNSB labeling extent of Trp340 in MBP as a function of maltose concentration. (c) DEPC labeling extent of Lys297 in MBP as a function of maltose concentration. (d) HNSB labeling results for Trp340 fit using equation 7, resulting in a MBP-maltose Kd of 7 ± 4 μM. (e) DEPC labeling results for Lys297 fit using equation 7, resulting in a MBP-maltose Kd of 4 ± 2 μM. The concentration of MBP was 24 μM in each experiment, and the experiments were done in triplicate. Fitting statistics for are found in Table S1 in the Supporting Information.

Separate titrations were conducted with HNSB labeling (Figure 4b) and DEPC labeling (Figure 4c) with a fixed MBP concentration of 24 μM, while the concentration of maltose was varied from 0 to 285 μM. As was observed for the lysozyme-NAG3 system, the extents of labeling at Trp340 from HNSB labeling and Lys297 from DEPC labeling decrease as the maltose concentration is increased. Although, because the MBP measurements are from peptide fragments after digestion and LC/MS, the errors are larger than observed for the lysozyme system, particularly for Lys297, which undergoes a very low extent (< 1%) of DEPC labeling.

From the HNSB labeling data of Trp340, a Kd value of 7 ± 4 μM is obtained (Figure 4b and d). This value is comparable to a Kd value of 1 to 3 μM that was measured previously by fluorescence.41-44 Interestingly, other Trp residues (e.g. Trp6, Trp10, Trp94, Trp129, and Trp158) do not change in SASA upon ligand binding (Table S2) and therefore do not undergo any significant changes in labeling extent. As examples, Trp10 and Trp158 are modified by HNSB, but because these residues are distant from the ligand-binding site, their extents of labeling do not change significantly as the maltose concentration is increased (Figure S2c). Residues like Trp10 and Trp158 serve as useful controls for residues like Trp340 that undergo changes in labeling upon ligand binding.

We also explored titrations with DEPC because it can modify up to six different types of amino acids, including Lys, His, Tyr, Ser, Thr, and Cys residues.16 Even though DEPC can label so many residues, only Lys297 undergoes a significant decrease in labeling extent in the presence of maltose. Indeed, of the 25 residues in MBP that can be labeled by DEPC, Lys297 undergoes the greatest percent change in SASA value (from 38% to 23%) upon maltose binding (Table S2). Upon plotting the labeling data for Lys297, a Kd value of 4 ± 2 μM is obtained (Figure 4e). This value is similar to the value obtained from HNSB labeling of Trp340 and is comparable to the literature value for the MBP-maltose complex.41-44 It is worth noting that the data for Lys297 provides a reasonable measure of the Kd even though it is not a residue directly interacting with maltose. This observation demonstrates that residues in regions that undergo a structural change upon ligand binding can also be used for Kd determination. Several other residues, such as Lys26, Lys239, and Lys295 (Figure S2d), undergo no change in labeling upon maltose binding, and thus serve as useful controls.

Like the HNSB labeling experiments of the NAG3-lysozyme complex, the low extent of labeling observed at MBP residues that undergo changes in SASA (i.e. Trp340 and Lys297) upon ligand binding facilitates determination of a Kd value that is close to the literature value. In the case of MBP, the extents labeling of Trp340 and Lys297 are below 25% and 0.5%, respectively. Because labeling by both HNSB and DEPC is limited to about one label on average per protein molecule, which is spread across multiple modifiable residues all over the protein, the extent of modification at any given residue, including binding residues, is low (see Figure S2a). Thus, the number of modified protein molecules that might perturb the protein-ligand equilibrium is expected to be low.

β-2-microglobulin-EGCG Binding.

We next applied CL-MS to determine the Kd for epigallocatechin gallate (EGCG) bound to β-2-microglobulin (β2m). β2m forms amyloid fibrils in patients who undergo long term dialysis due to kidney failure,45 and EGCG (Figure 5a) has recently been found by our group to prevent Cu(II)-induced β2m amyloid formation in vitro by redirecting β2m aggregation toward amorphous, re-dissolvable aggregates.46 Thus, we decided to apply a CL-MS based method to determine the Kd of this complex.

Figure 5.

Figure 5.

EGCG binding to Cu(II)-β2m results in decreases and increases in DEPC labeling at several residues. (a) Structure of EGCG. (b) Residues undergoing covalent labeling with DEPC mapped onto the Cu(II)-free structure of β2m (PDB 1JNJ). Note that there is no structure is available for Cu(II)-β2m. DEPC modified residues that undergo no significant change upon ligand binding are shown in cyan. Those residues that decrease or increase in labeling extent are shown in blue and red, respectively. The proposed EGCG binding site is indicated by the black circle.

EGCG concentrations ranging from 0 to 115 μM, a β2m concentration of 30 μM, and a Cu(II) concentration of 60 μM were used in a titration experiment along with DEPC labeling, using 120 μM of the labeling reagent. DEPC labeling at high EGCG concentrations indicate that several residues undergo both increased and decreased extents of labeling upon ligand binding, including the N-terminal amine, Thr4, Lys6, His13, His31, and Lys91 (Figure S3). Most of these residues are near the N-terminus region of the protein (Figure 5b), suggesting EGCG’s binding site is between one β sheet that includes residues from Lys6 to Ser11 and another that includes residues from Lys91 to Asp96.

Interestingly, some of the residues that undergo increased labeling, specifically the N-terminal amine and His31, are residues that comprise the Cu(II) binding site in the Cu(II)-β2m complex.47,48 The increased reactivity of these residues with DEPC as the EGCG concentration is increased suggests that EGCG is disrupting the Cu(II) binding site to some extent. Indeed, the Kd for Cu(II) bound to β2m was found to increase from 3 μM to 49 μM in the presence of EGCG,46 which decreases the concentration of the β2m-Cu(II) complex. The reduced concentration of the metal-bound protein means that when EGCG is present the N-terminus and His31 are free in more protein molecules to react with DEPC.

DEPC labeling as a function of EGCG concentration for residues that both increase and decrease in labeling in the presence of EGCG are shown in Figure 6. The Kd values that are acquired from the labeling results of the N-terminal amine, Lys6, and Lys91 are 12 ± 5 μM, 3 ± 2 μM, and 11 ± 5 μM, respectively. These values are reasonably similar, considering the error bars, and they indicate an EGCG-β2m Kd value of between 3 and 12 μM. It is interesting to note that although the increase in the labeling of the N-terminal amine is likely due to the release of Cu(II) caused by ligand binding, the resulting value is consistent with the Kd value obtained from the residues that are protected from DEPC labeling. The Kd values acquired by CL-MS are also consistent with the Kd value of 6 μM that was obtained from size exclusion chromatography data in previous work.46

Figure 6.

Figure 6.

(a) DEPC labeling extent of N-terminus, Lys6, and Lys91 in Cu(II)-β2m as a function of EGCG concentration. (b) DEPC labeling results for the N-terminus, fit using equation 7, resulting in a Cu(II)-β2m and EGCG Kd of 12 ± 5 μM. (c) DEPC labeling results for Lys6, fit using equation 7, resulting in a Cu(II)-β2m and EGCG Kd of 3 ± 2 μM. The concentration of β2m was 24 μM in each experiment, and the experiments were done in triplicate. Fitting statistics for are found in Table S1 in the SI.

Conclusions

We have demonstrated that CL-MS can be used to obtain reasonably accurate Kd values for protein-ligand complexes. In particular, we find that slow-reacting labeling reagents like DEPC and HNSB can provide this information even when labeling from these reagents might be expected to perturb the equilibrium of the protein-ligand complex. We find that accurate Kd values can be obtained when the extent of protein labeling is kept low. The experimental results from two model systems and one insufficiently characterized protein-ligand system support this conclusion and show that our CL-MS based strategy is able to determine protein-ligand binding affinity with reasonable accuracy. An intriguing prospect of using CL-MS to determine Kd values is the ability to simultaneously identify the ligand binding site, if the residue level labeling results contain sufficiently detailed information. This CL-MS based ligand titration strategy might serve as an alternative for characterizing protein-ligand complexes that are difficult to measure by other methods, such as fluorescence spectroscopy, surface plasmon resonance spectroscopy, or nuclear magnetic resonance spectroscopy.

Supplementary Material

Supporting Information file

Acknowledgements

This work was supported by National Institutes of Health (NIH) grant R01 GM075092. Some of the data described herein was acquired on a Thermofisher Orbitrap Fusion Tribrid mass spectrometer funded by NIH grant 1S10OD010645-01A1.

Footnotes

Supporting Information

Tandem mass spectrum of the peptide from lysozyme; fitting statistics for the plots shown in the main text; covalent labeling/MS results from the maltose binding protein reacting with DEPC and HNSB; solvent accessible surface area values for the covalently-modified residues of the maltose binding protein; and DEPC labeling results for the Cu(II)-β2m complex.

References

  • [1].Babine RE; Bender SL Molecular Recognition of Protein–Ligand Complexes: Applications to Drug Design. Chem. Rev 1997, 97, 1359–1472. [DOI] [PubMed] [Google Scholar]
  • [2].Chaires JB Calorimetry and thermodynamics in drug design. Annu. Rev. Biophys 2008, 37, 135–151. [DOI] [PubMed] [Google Scholar]
  • [3].Kitova EN; El-Hawiet A; Schnier PD; Klassen JS Reliable Determinations of Protein–Ligand Interactions by Direct ESI-MS Measurements. Are We There Yet? J. Am. Soc. Mass Spectrom 2012, 23, 431–441. [DOI] [PubMed] [Google Scholar]
  • [4].Ganem B; Li YT; Henion JD Observation of noncovalent enzyme-substrate and enzyme-product complexes by ion-spray mass spectrometry. J. Am. Chem. Soc 1991, 113, 7818–7819. [Google Scholar]
  • [5].Katta V; Chait BT Observation of the Heme-Globin Complex in Native Myoglobin by Electrospray-Ionization Mass Spectrometry. J. Am. Chem. Soc 1991, 113, 8534–8535. [Google Scholar]
  • [6].Loo RRO; Goodlett DR; Smith RD; Loo JA Observation of a Noncovalent Ribonuclease S-Protein/S-Peptide Complex by Electrospray Ionization Mass Spectrometry. J. Am. Chem. Soc 1993, 115, 4391–4392. [Google Scholar]
  • [7].Daneshfar R; Kitova EN; Klassen JS Determination of Protein-Ligand Association Thermochemistry Using Variable-Temperature Nanoelectrospray Mass Spectrometry. J. Am. Chem. Soc 2004, 126, 4786–4787. [DOI] [PubMed] [Google Scholar]
  • [8].Cong X; Liu Y; Liu W; Liang X; Russell DH; Laganowsky A Determining Membrane Protein-Lipid Binding Thermodynamics Using Native Mass Spectrometry. J. Am. Chem. Soc 2016, 138, 4346–4349. [DOI] [PubMed] [Google Scholar]
  • [9].Wang W; Kitova EN; Klassen JS Influence of Solution and Gas Phase Processes on Protein–Carbohydrate Binding Affinities Determined by Nanoelectrospray Fourier Transform Ion Cyclotron Resonance Mass Spectrometry. Anal. Chem 2003, 75, 4945–4955. [DOI] [PubMed] [Google Scholar]
  • [10].Daniel JM; Friess SD; Rajagopalan S; Wendt S; Zenobi R Quantitative Determination of Noncovalent Binding Interactions Using Soft Ionization Mass Spectrometry. Int. J. Mass Spectrom 2002, 216, 1–27. [Google Scholar]
  • [11].Ghaemmaghami S; Fitzgerald MC; Oas TG A Quantitative, High-Throughput Screen for Protein Stability. Proc. Natl. Acad. Sci. U. S. A 2000, 97, 8296–8301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Powell KD; Ghaemmaghami S; Wang MZ; Ma L; Oas TG; Fitzgerald MC A General Mass Spectrometry-Based Assay for the Quantitation of Protein–Ligand Binding Interactions in Solution. J. Am. Chem. Soc 2002, 124, 10256–10257. [DOI] [PubMed] [Google Scholar]
  • [13].Zhu MM; Rempel DL; Du Z; Gross ML Quantification of Protein–Ligand Interactions by Mass Spectrometry, Titration, and H/D Exchange: PLIMSTEX. J. Am. Chem. Soc 2003, 125, 5252–5253. [DOI] [PubMed] [Google Scholar]
  • [14].Zhu MM; Rempel DL; Gross ML Modeling Data from Titration, Amide H/D Exchange, and Mass Spectrometry to Obtain Protein-Ligand Binding Constants. J. Am. Soc. Mass Spectrom 2004, 15, 388–397. [DOI] [PubMed] [Google Scholar]
  • [15].Mendoza VL; Vachet RW Probing Protein Structure by Amino Acid-Specific Covalent Labeling and Mass Spectrometry. Mass Spectrom. Rev 2009, 28, 785–815. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Limpikirati P; Liu T; Vachet RW Covalent Labeling-Mass Spectrometry with Non-Specific Reagents for Studying Protein Structure and Interactions. Methods 2018, 144, 79–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Xu G; Chance MR Hydroxyl Radical-Mediated Modification of Proteins as Probes for Structural Proteomics. Chem. Rev 2007, 107, 3514–3543. [DOI] [PubMed] [Google Scholar]
  • [18].Zhang B; Cheng M; Rempel D; Gross ML Implementing Fast Photochemical Oxidation of Proteins (FPOP) as a Footprinting Approach to Solve Diverse Problems in Structural Biology. Methods 2018, 144, 94–103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [19].Liu XR; Zhang MM; Gross ML Mass Spectrometry-Based Protein Footprinting for Higher-Order Structure Analysis: Fundamentals and Applications. Chem. Rev 2020, in press. 10.1021/acs.chemrev.9b00815 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [20].West GM; Tang L; Fitzgerald MC Thermodynamic Analysis of Protein Stability and Ligand Binding Using a Chemical Modification- and Mass Spectrometry-Based Strategy. Anal. Chem 2008, 80, 4175–4185. [DOI] [PubMed] [Google Scholar]
  • [21].Liu XR; Zhang MM; Rempel DL; Gross ML Protein-Ligand Interaction by Ligand Titration, Fast Photochemical Oxidation of Proteins and Mass Spectrometry: LITPOMS. J. Am. Soc. Mass Spectrom 2019, 30, 213–217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [22].Liu XR; Zhang MM; Rempel DL; Gross ML A Single Approach Reveals the Composite Conformational Changes, Order of Binding, and Affinities for Calcium Binding to Calmodulin. Anal. Chem 2019, 91, 5508–5512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Hambly D; Gross ML Laser Flash Photochemical Oxidation to Locate Heme Binding and Conformational Changes in Myoglobin. Int. J. Mass Spectrom 2007, 259, 124–129. [Google Scholar]
  • [24].Guan J-Q; Vorobiev S; Almo SC; Chance MR Mapping the G-Actin Binding Surface of Cofilin Using Synchrotron Protein Footprinting. Biochemistry 2002, 41, 5765–5775. [DOI] [PubMed] [Google Scholar]
  • [25].Li Z; Moniz H; Wang S; Ramiah A; Zhang F; Moremen KW; Linhardt RJ; Sharp JS High Structural Resolution Hydroxyl Radical Protein Footprinting Reveals an Extended Robo1-Heparin Binding Interface. J. Biol. Chem 2015, 290, 10729–10740. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [26].Liu T; Marcinko TM; Kiefer PA; Vachet RW Using Covalent Labeling and Mass Spectrometry To Study Protein Binding Sites of Amyloid Inhibiting Molecules. Anal. Chem 2017, 89, 11583–11591. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Mendoza VL; Vachet RW Protein Surface Mapping Using Diethylpyrocarbonate with Mass Spectrometric Detection. Anal. Chem 2008, 80, 2895–2904. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Hambly DM; Gross ML Laser Flash Photolysis of Hydrogen Peroxide to Oxidize Protein Solvent-Accessible Residues on the Microsecond Timescale. J. Am. Soc. Mass Spectrom 2005, 16, 2057–2063. [DOI] [PubMed] [Google Scholar]
  • [29].Chen J; Rempel DL; Gau BC; Gross ML Fast Photochemical Oxidation of Proteins and Mass Spectrometry Follow Submillisecond Protein Folding at the Amino-Acid Level. J. Am. Chem. Soc 2012, 134, 18724–18731. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [30].Zhou Y; Vachet RW Increased Protein Structural Resolution from Diethylpyrocarbonate-based Covalent Labeling and Mass Spectrometric Detection. J. Am. Soc. Mass Spectrom 2012, 23, 708–717. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [31].Fraczkiewicz R; Braun W Exact and Efficient Analytical Calculation of the Accessible Surface Areas and Their Gradients for Macromolecules. J. Comp. Chem 1998, 19, 319–333. [Google Scholar]
  • [32].Jecklin MC; Touboul D; Bovet C; Wortmann A; Zenobi R Which Electrospray-Based Ionization Method Best Reflects Protein-Ligand Interactions Found in Solution? A Comparison of ESI, NanoESI, and ESSI for the Determination of Dissociation Constants with Mass Spectrometry. J. Am. Soc. Mass Spectrom 2008, 19, 332–343. [DOI] [PubMed] [Google Scholar]
  • [33].Cheetham JC; Artymiuk PJ; Phillips DC Refinement of an Enzyme Complex with Inhibitor Bound at Partial Occupancy: Hen Egg-White Lysozyme and Tri-N-Acetylchitotriose at 1.75 Å Resolution. J. Mol. Biol 1992, 224, 613–628. [DOI] [PubMed] [Google Scholar]
  • [34].Ishikawa T; Reddy Burri R; Kamatari YO, ; Sakuraba S; Matubayasi N; Kitao A; Kuwata K A Theoretical Study of the Two Binding Modes between Lysozyme and Tri-NAG with an Explicit Solvent Model Based on the Fragment Molecular Orbital Method. Chem. Phys 2013, 15, 3646–3654. [DOI] [PubMed] [Google Scholar]
  • [35].Imoto T; Johnson LN; North ACT ; Phillips DC; Rupley JA Vertebrate Lysozymes, 3rd ed.; Academic: New York: 665–868 (1972) [Google Scholar]
  • [36].Schindler M; Assaf Y; Sharon N; Chipman DM Mechanism of Lysozyme Catalysis: Role of Ground-State Strain in Subsite D in Hen Egg-White and Human Lysozymes. Biochemistry. 1977, 16, 423–431. [DOI] [PubMed] [Google Scholar]
  • [37].Maenaka K; Matsushima M; Song H; Sunada F; Watanabe K; Kumagai I Dissection of Protein–Carbohydrate Interactions in Mutant Hen Egg-White Lysozyme Complexes and Their Hydrolytic Activity. J. Mol. Biol 1995, 247, 281–293. [DOI] [PubMed] [Google Scholar]
  • [38].Clark SM; Konermann L: Determination of Ligand–Protein Dissociation Constants by Electrospray Mass Spectrometry-Based Diffusion Measurements. Anal. Chem 2004, 76, 7077–7083. [DOI] [PubMed] [Google Scholar]
  • [39].Dennhart N; Letzel T Mass Spectrometric Real-Time Monitoring of Enzymatic Glycosidic Hydrolysis, Enzymatic Inhibition and Enzyme Complexes. Anal. Bioanal. Chem 2006, 386, 689–698. [DOI] [PubMed] [Google Scholar]
  • [40].Quiocho FA; Spurlino JC; Rodseth LE Extensive Features of Tight Oligosaccharide Binding Revealed in High-Resolution Structures of the Maltodextrin Transport/Chemosensory Receptor. Structure 1997, 5, 997–1015. [DOI] [PubMed] [Google Scholar]
  • [41].Walker IH; Hsieh P; Riggs PD Mutations in Maltose-Binding Protein That Alter Affinity and Solubility Properties. Appl. Microbiol. Biotechnol 2010, 88, 187–197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [42].Miller DM; Olson JS; Pflugrath JW; Quiocho FA Rates of ligand binding to periplasmic proteins involved in bacterial transport and chemotaxis. J. Biol. Chem 1983, 258, 13665–13672. [PubMed] [Google Scholar]
  • [43].Seo MH; Park J, Kim E; Hohng S; Kim HS Protein conformational dynamics dictate the binding affinity for a ligand. Nat. Commun 2014, 5, 3724. [DOI] [PubMed] [Google Scholar]
  • [44].Telmer PG; Shilton BH Insights into the conformational equilibria of maltose-binding protein by analysis of high affinity mutants. J. Biol. Chem 2003, 278, 34555–34567. [DOI] [PubMed] [Google Scholar]
  • [45].Floege J; Ketteler M B2-Microglobulin–Derived Amyloidosis: An Update. Kidney Int. 2001, 59, S164–S171. [DOI] [PubMed] [Google Scholar]
  • [46].Marcinko TM; Drews T; Liu T; Vachet RW Epigallocatechin-3-gallate Inhibits Cu(II)-induced β-2-microglobulin Amyloid Formation by Binding to the Edge of its β-sheets. Biochemistry 2020, 59, 1093–1103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [47].Lim J; Vachet RW: Using Mass Spectrometry To Study Copper–Protein Binding under Native and Non-Native Conditions: β-2-Microglobulin. Anal. Chem 2004, 76, 3498–3504. [DOI] [PubMed] [Google Scholar]
  • [48].Srikanth R; Mendoza VL; Bridgewater JD; Zhang G; Vachet RW Copper Binding to β-2-Microglobulin and its Pre-Amyloid Oligomers. Biochemistry, 2009, 48, 9871–9881. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information file

RESOURCES