Abstract
Proteins are undoubtedly some of the most essential molecules of life. While much is known about many proteins, some aspects still remain mysterious. One particularly important aspect of understanding proteins is determining how structure helps dictate function. Continued development and implementation of biophysical techniques that provide information about protein conformation and dynamics is essential. In this review, we discuss hydrogen exchange mass spectrometry and how this method can be used to learn about protein conformation and dynamics. The basic concepts of the method are described, the workflow illustrated, and a few examples of its application are provided.
Keywords: Deuterium, Protein mass spectrometry, Protein dynamics, Protein conformation
Introduction
Proteins play a pivotal role in most biological processes, including DNA replication, cell division, cell death, immune response, and cellular signal transduction. Many of these processes are thought to be carried out by protein machines containing at least 10 or more proteins [1]. To fully understand how proteins drive and contribute to basic biological and biochemical events, techniques that probe the fundamental properties of proteins are necessary. Not only must these methods provide information about protein function, they must also help reveal how function is tied to protein conformation and dynamics. In the past 60 years or so, there have been profound advances in the techniques for protein analysis, including such Nobel Prize-winning methods as nuclear magnetic resonance (NMR), X-ray crystallography, small angle X-ray scattering, and cryo-electron microscopy. As with everything, each of these techniques has advantages and disadvantages for studying protein conformation and dynamics, and often a combination of methods is required for full protein understanding. An orthogonal biophysical technique that has also seen profound advances in the last 20 or so years is Hydrogen Exchange monitored by Mass Spectrometry (HX MS). HX MS probes solution conformation so crystallization is not required, it requires very little sample (500–1,000 picomoles for an entire experiment), it is amenable to studying proteins that are hard to purify or that can only be handled at low concentrations (as low as 0.1 μM), and it can reveal changes to conformation and dynamics on a wide timescale (for a recent review and perspective, see Ref. [2]).
Topics
Overview of hydrogen exchange: what is it?
Hydrogen exchange mass spectrometry exploits a fundamental chemical reaction unique to hydrogens found in proteins (for a historical perspective, see Ref. [3]). Certain hydrogens in proteins are in continuous exchange with the hydrogens in solution. If an aqueous, all-H2O solvent is replaced with an isotope of hydrogen that has distinctive spectroscopic properties, then one can follow this exchange process. For most modern HX experiments, deuterated or “heavy” water (D2O) is used. In particular, the hydrogen bonded to the backbone nitrogen (also referred to as the backbone amide hydrogen) is useful for probing protein conformation. Figure 1a shows the location of the backbone amide hydrogens relative to other chemical groups within the protein. Note that every amino acid except for proline has a backbone amide hydrogen, meaning that there is a sensor at every amino acid along the length of the protein chain. While other hydrogens on side chains can also undergo exchange, the rate of side chain exchange is generally much faster than that of the backbone positions and any deuterium that is incorporated in non-backbone positions reverts back to hydrogen later in the experiment (described below).
There are various factors that affect hydrogen exchange in proteins [4]. The four primary factors are pH, temperature, solvent accessibility, and hydrogen bonding. Temperature and pH can be controlled experimentally meaning that the rate and location of exchange then becomes a function of hydrogen bonding and solvent accessibility. While amide hydrogens of fully solvent-exposed peptides at pH 7 exchange very rapidly with rates of 10–1,000 s−1 (depending on solvent conditions, etc.) [5], in folded proteins amide hydrogens display a variety of exchange rates depending on their position within the protein and whether they are involved in intramolecular hydrogen bonding. Figure 1b illustrates how solvent accessibility and hydrogen bonding relate to exchange in proteins. Regions that are highly dynamic and solvent-exposed (like the loops connecting the alpha helices) will exchange rapidly while regions that are less dynamic (i.e., “rigid”) and/or involved in hydrogen bonding networks or buried within the interior of the protein (such as β-sheets or α-helices) will exchange slower. In folded proteins some amide hydrogens exchange quickly while others exchange much slower on timescales from minutes to months [4, 6]. Note also that the backbone amide hydrogens are the ones participating in the hydrogen bonds that hold α-helices and β-sheets together. If there is a change to the solvent accessibility or the hydrogen bonding network of a protein, the rate and location of deuterium incorporation can be altered.
Monitoring hydrogen exchange
The different chemical properties of hydrogen and deuterium allow several biophysical techniques to distinguish between the two isotopes. If deuterium incorporation into proteins is monitored with NMR, for example, the amide proton peaks (from hydrogen) disappear as the protein becomes deuterated because deuterium is NMR silent. As the mass of hydrogen is 1.0078 Da and the mass of deuterium is 2.0141 Da, deuterated proteins will have a larger mass than non-deuterated proteins [7]. All that remains to convert deuteration information into conformational information is to determine how fast a protein is deuterated and where the deuterium goes.
A general workflow for a hydrogen exchange experiment monitored with mass spectrometry is shown in Fig. 2. This figure shows the continuous labeling experiment, which is the most common [8]. Protein solutions are initially equilibrated at room temperature, physiological pH, and in all-H2O buffer. A dilution of 10- to 20-fold into the identical buffer except with all D2O initiates the labeling process. The excess deuterium insures the exchange kinetics favor labeling from hydrogen to deuterium [9]. The labeling is allowed to proceed for various amounts of time (e.g., 10 s, 1 min, 20 min, 8 h) and then the labeling is quenched by adjusting the pH of the sample to 2.5 and lowering the temperature to 0 °C. These quench conditions ensure retention of the deuterium label for MS analysis by decreasing the rate of amide exchange approximately five orders of magnitude [6, 10]. The quenched protein sample can then be sprayed directly into a mass spectrometer (using liquid chromatography as the interface) to determine the mass of the intact protein. Measuring the mass of the whole protein gives global information about the protein(s) being studied but does not locate where the deuterium is within the protein. Proteins can also be digested post-quench but prior to chromatography. This digestion must be conducted under quench conditions to ensure retention of the deuterium label; therefore, acid proteases such as pepsin must be used [10, 11]. Note that all peptic peptides must also be identified, typically done in separate MS/MS experiments. With a digestion experiment, deuterium can be localized within the short peptides produced by the digestion. The location cannot be refined to single amino acids without the use of more sophisticated mass spectrometry methods [12]
What can hydrogen exchange tell us?
HX MS can provide very useful information, including: how conformation changes upon binding, protein folding and unfolding pathways, glimpses into the structure of proteins that will not crystallize or are not amenable to NMR, which regions of a protein are solvent-exposed and dynamic, location and properties of binding sites and surfaces, dissociation constants and measures of protein stability under various conditions. The raw mass spectra of the mass of the protein, or each peptide from the digestion, are converted into a deuterium uptake plot where the level of deuterium is plotted versus the labeling time. Deuterium uptake plots for both intact analysis and peptide analysis contain information about how solvent-exposed and dynamic a protein is in solution. For a detailed explanation of all the information that can be derived from deuterium uptake plots see Ref. [13].
To illustrate how HX MS data can be revealing, consider Fig. 3 which shows examples of deuterium uptake plots for two proteins under different hypothetical situations. In Fig. 3a, the use of HX MS to probe structural effects as a result of small molecule/inhibitor binding is represented; Fig. 3b illustrates results for protein–protein interactions in a hexameric protein assembly. In the intact analysis in Fig. 3a(i), when the protein is incubated with an inhibitor, less deuterium is incorporated over the course of the labeling experiment. The differences in deuteration appear at later times in the exchange process, which indicates changes in solution dynamics as a result of inhibitor binding [14]. The overall motions of the protein were decreased, that is its structure was rigidified and it could not flex as well in solution, hence less deuterium was incorporated. In order to identify which specific parts of the protein were changed upon binding, the pepsin digestion experiment can be performed prior to MS analysis. HX MS data from two peptides are shown in Fig. 3a(ii and iii). The deuterium incorporation for the peptide in Fig. 3a(ii) (residues 20–35) shows a similar shape to that of the intact protein in Fig. 3a(i), meaning this is a region that has become less dynamic as a result of inhibitor binding. As the location of the residues is known from the crystal structure of the protein, blue in the structure in Fig. 3a(ii), an interpretation of the meaning of less deuteration in this region of this protein can be made. In other regions (Fig. 3a(iii)), no changes are observed. Deuterium uptake curves that differ in the level of deuterium incorporated but remain parallel are more indicative of a change of solvent accessibility whereas those that have different slopes usually report on changes in protein dynamics and breathing (reviewed in Ref. [13]).
In addition to probing structural changes in proteins as a result of ligand/inhibitor binding, HX MS can be used to study many other types of protein interactions, including protein–protein interactions. Since the majority of cellular processes are orchestrated by multiple proteins and large macromolecular protein complexes, being able to study the structural consequences of these interactions is extremely important. Figure 3b illustrates an example of how HX MS could be used to study protein–protein interactions. In this hexameric protein, exchange can be compared in the isolated monomer versus the monomer in the assembled hexamer (Fig. 3b(i)). Then, with the use of pepsin digestion, the specific regions affected by complex formation can be determined (residues 10–28 are involved in hexamer formation, residues 50–68 are not; Fig. 3b(ii and iii)).
One could imagine many protein–protein interactions in which determining the effects and location(s) of changes in HX upon binding would be useful. For example, HX MS has been used to study the structural changes induced by pH changes in the capsid protein of the brome mosaic virus [15] and it has been used to investigate conformational changes in the HIV-1 capsid protein as a result of HIV assembly and maturation [16]. The large size and complexity of viral capsids make them challenging to study by many structural means, particularly when detailed information about conformational changes is desired. HX MS, however, can allow access to such information and has therefore become important as a tool for probing large proteins and protein complexes.
HX MS is also sensitive to populations of molecules in solution. If multiple populations exist that are different structurally, different amounts of deuterium will be incorporated into each population and this can be observed in the raw mass spectra [17], as illustrated in Fig. 4. If the refolding rate (k−1) of unfolded and exposed regions of a protein is slower than the deuterium labeling rate (Fig. 4a), the unfolded species will have sufficient time to get highly deuterated and therefore have a higher mass than the folded species [4, 18, 19]. D2O is present in vast excess such that once an individual protein molecule has transitioned to the unfolded form and become deuterated, it is a permanent member of the unfolded species peak (red, bottom of Fig. 4a). The conversion of the folded to the unfolded species occurs at a rate indicative of the rate of protein unfolding in solution. This type of exchange behavior is rare for proteins, but when observed, can serve as an extremely powerful tool for studying protein–protein interactions. A good example comes from the SH3 domains of the Src family of tyrosine kinases [20]. Data from the HX of the SH3 domain of the Lyn kinase alone (Fig. 4b) and in complex with the HIV accessory protein Nef (Fig. 4c) indicate multiple populations existing in solution. The Nef protein has been shown to interact with different members of the Src family of tyrosine kinases, including the Lyn kinase, and to disrupt cellular signaling to enhance viral infection [21]. In HX MS of Lyn SH3 alone, the unfolding rate occurs with a half-life of approximately 12 min but when the Lyn SH3 domain is bound to Nef, the unfolding dynamics of Lyn SH3 are slowed to a half-life of longer than 30 min. From this type of data, one can extract relative binding affinities [22], or determine when and if certain sequences are able to associate intramolecularly [23, 24].
Outlook
Because the function of proteins is dictated by their structure and movements in solution, any biophysical technique which allows for the molecular investigation of proteins will be invaluable to aiding in our understanding of these molecular machines. Amide hydrogen exchange monitored by mass spectrometry is just one such technique for studying proteins. To become robust and high throughput, HX MS faces some challenges. Development of robotics and total automation of sample handling and data processing will make the method more amenable to high-throughput types of studies. Advances in liquid chromatography and mass spectrometry will also continue to improve the HX MS experiment. A major limitation in terms of chromatography during HX MS experiments is that separation must be done at 0 °C where chromatographic efficiency in traditional HPLC is relatively poor. New separation media and the use of UPLC [25] are addressing this issue and also allowing for analysis of bigger and bigger proteins and protein systems [26]. The large amount of data produced during hydrogen exchange experiments have historically made data processing time-consuming. Reducing the data to a form that is easily understood in terms of conformation for a given protein can be arduous. Recent software developments (e.g., HYDRA [27], HD Desktop [28], and TOF2H [29]) significantly reduce the burden of HX MS data analysis although much more work needs to be done in this area. More sensitive mass spectrometers will also improve HX MS. One important parameter of an HX MS experiment is obtaining the protein to be studied. If proteins are rare, difficult to overexpress or obtain in the concentrations needed for some biophysical methods, or just otherwise uncooperative, there may be significant problems obtaining suitable material for analysis. Luckily, HX MS has some of the lowest requirements of any of the biophysical techniques that can provide conformational data, thereby providing access to some of these proteins that are difficult to deal with, particularly the estimated 30% of eukaryotic proteins that contain unstructured regions [30]. We envision that conformational data on many proteins that were considered impossible to analyze will be provided by HX MS in the future.
Acknowledgments
We would like to thank Dr. Thomas Wales and Dr. Thomas Smithgall for the Lyn SH3:Nef binding data. This work was supported in part by the National Institutes of Health (R01-GM070590 and R01-GM086507). This is contribution 959 from the Barnett Institute.
References
- 1.Alberts B. Cell. 1998;92:291–294. doi: 10.1016/s0092-8674(00)80922-8. [DOI] [PubMed] [Google Scholar]
- 2.Engen JR. Anal Chem. 2009;81:7870–7875. doi: 10.1021/ac901154s. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Englander SW, Mayne L, Bai Y, Sosnick TR. Protein Sci. 1997;6:1101–1109. doi: 10.1002/pro.5560060517. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Englander SW, Kallenbach NR. Q Rev Biophys. 1983;16:521–655. doi: 10.1017/s0033583500005217. [DOI] [PubMed] [Google Scholar]
- 5.Bai Y, Milne JS, Mayne L, Englander SW. Proteins. 1993;17:75–86. doi: 10.1002/prot.340170110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Smith DL, Deng Y, Zhang Z. J Mass Spectrom. 1997;32:135–146. doi: 10.1002/(SICI)1096-9888(199702)32:2<135::AID-JMS486>3.0.CO;2-M. [DOI] [PubMed] [Google Scholar]
- 7.Katta V, Chait BT. Rapid Commun Mass Spectrom. 1991;5:214–217. doi: 10.1002/rcm.1290050415. [DOI] [PubMed] [Google Scholar]
- 8.Wales TE, Engen JR. Mass Spectrom Rev. 2006;25:158–170. doi: 10.1002/mas.20064. [DOI] [PubMed] [Google Scholar]
- 9.Englander JJ, Rogero JR, Englander SW. J Mol Biol. 1983;169:325–344. doi: 10.1016/s0022-2836(83)80186-7. [DOI] [PubMed] [Google Scholar]
- 10.Zhang Z, Smith DL. Protein Sci. 1993;2:522–531. doi: 10.1002/pro.5560020404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Cravello L, Lascoux D, Forest E. Rapid Commun Mass Spectrom. 2003;17:2387–2393. doi: 10.1002/rcm.1207. [DOI] [PubMed] [Google Scholar]
- 12.Rand KD, Zehl M, Jensen ON, Jorgensen TJ. Anal Chem. 2009;81:5577–5584. doi: 10.1021/ac9008447. [DOI] [PubMed] [Google Scholar]
- 13.Morgan CR, Engen JR. Curr Protoc Protein Sci. 2009;58(17.16):11–17. doi: 10.1002/0471140864.ps1706s58. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Maier CS, Deinzer ML. Methods Enzymol. 2005;402:312–360. doi: 10.1016/S0076-6879(05)02010-0. [DOI] [PubMed] [Google Scholar]
- 15.Wang L, Lane LC, Smith DL. Protein Sci. 2001;10:1234–1243. doi: 10.1110/ps.100101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lanman J, Lam TT, Emmett MR, Marshall AG, Sakalian M, Prevelige PE., Jr Nat Struct Mol Biol. 2004;11:676–677. doi: 10.1038/nsmb790. [DOI] [PubMed] [Google Scholar]
- 17.Miranker A, Robinson CV, Radford SE, Aplin RT, Dobson CM. Science. 1993;262:896–900. doi: 10.1126/science.8235611. [DOI] [PubMed] [Google Scholar]
- 18.Weis DD, Wales TE, Engen JR, Hotchko M, Ten Eyck LF. J Am Soc Mass Spectrom. 2006;17:1498–1509. doi: 10.1016/j.jasms.2006.05.014. [DOI] [PubMed] [Google Scholar]
- 19.Engen JR, Smith DL. Methods Mol Biol. 2000;146:95–112. doi: 10.1385/1-59259-045-4:95. [DOI] [PubMed] [Google Scholar]
- 20.Wales TE, Engen JR. J Mol Biol. 2006;357:1592–1604. doi: 10.1016/j.jmb.2006.01.075. [DOI] [PubMed] [Google Scholar]
- 21.Trible RP, Emert-Sedlak L, Smithgall TE. J Biol Chem. 2006;281:27029–27038. doi: 10.1074/jbc.M601128200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Trible RP, Emert-Sedlak L, Wales TE, Ayyavoo V, Engen JR, Smithgall TE. J Mol Biol. 2007;374:121–129. doi: 10.1016/j.jmb.2007.09.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Chen S, Dumitrescu TP, Smithgall TE, Engen JR. Biochemistry. 2008;47:5795–5803. doi: 10.1021/bi800446b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Hodkinson JP, Jahn TR, Radford SE, Ashcroft AE. J Am Soc Mass Spectrom. 2009;20:278–286. doi: 10.1016/j.jasms.2008.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Wales TE, Fadgen KE, Gerhardt GC, Engen JR. Anal Chem. 2008;80:6815–6820. doi: 10.1021/ac8008862. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Houde D, Arndt J, Domeier W, Berkowitz S, Engen JR. Anal Chem. 2009;81:5966. doi: 10.1021/ac9009287. [DOI] [PubMed] [Google Scholar]
- 27.Slysz GW, Baker CA, Bozsa BM, Dang A, Percy AJ, Bennett M, Schriemer DC. BMC Bioinformatics. 2009;10:162. doi: 10.1186/1471-2105-10-162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Pascal BD, Chalmers MJ, Busby SA, Griffin PR. J Am Soc Mass Spectrom. 2009;20:601–610. doi: 10.1016/j.jasms.2008.11.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Nikamanon P, Pun E, Chou W, Koter MD, Gershon PD. BMC Bioinformatics. 2008;9:387. doi: 10.1186/1471-2105-9-387. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Dyson HJ, Wright PE. Curr Opin Struct Biol. 2002;12:54–60. doi: 10.1016/s0959-440x(02)00289-0. [DOI] [PubMed] [Google Scholar]