Abstract
Reduction and alkylation of cysteine residues is part of virtually any proteomics workflow. Despite its frequent use, up to date no systematic investigation of the impact of different conditions on the outcome of proteomics studies has been performed. In this study, we compared common reduction reagents (dithiothreitol, tris-(2-carboxyethyl)-phosphine, and β-mercaptoethanol) and alkylation reagents (iodoacetamide, iodoacetic acid, acrylamide, and chloroacetamide). Using in-gel digests as well as SAX fractionated in-solution digests of cytosolic fractions of HeLa cells, we evaluated 13 different reduction and alkylation conditions resulting in considerably varying identification rates. We observed strong differences in offsite alkylation reactions at 7 amino acids as well as at the peptide N terminus, identifying single and double adducts of all reagents. Using dimethyl labeling, mass tolerant searches, and synthetic peptide experiments, we identified alkylation of methionine residues by iodine-containing alkylation reagents as one of the major factors for the differences. We observed differences of more than 9-fold in numbers of identified methionine-containing peptide spectral matches for in-gel digested samples between iodine- and noniodine-containing alkylation reagents. This was because of formation of carbamidomethylated and carboxymethylated methionine side chains and a resulting prominent neutral loss during ESI ionization or in MS/MS fragmentation, strongly decreasing identification rates of methionine-containing peptides. We achieved best results with acrylamide as alkylation reagent, whereas the highest numbers of peptide spectral matches were obtained when reducing with dithiothreitol and β-mercaptoethanol for the in-solution and the in-gel digested samples, respectively.
Mass spectrometry (MS) has become one of the major techniques for the qualitative and quantitative analysis of proteins. Because the introduction of electrospray ionization (ESI) and matrix-assisted laser desorption ionization (MALDI)1 (1, 2), which paved the way for modern mass spectrometry-based proteomics, the power of mass spectrometric analyses has improved continuously. The performance of chromatography systems and mass spectrometers, as well as the development of novel sample preparation protocols are the main contributing reasons. Bottom-up proteomics is the most common method for protein identification and characterization by mass spectrometry. It is based on the enzymatic digestion of proteins followed by the subsequent analysis of proteolytic peptides.
In current bottom-up proteomics strategies, all sample preparation protocols have several key elements in common: (1) cells or tissues are lyzed and proteins are extracted; (2) proteins are reduced in order to break disulfide bonds; (3) proteins are alkylated to covalently modify cysteine SH-groups, preventing them from forming unwanted novel disulfide bonds; and (4) proteins are enzymatically digested to peptides. Dependent on the complexity of the sample and the goal of the analysis, samples are frequently fractionated on the protein- or the peptide-level. One common approach is the separation of proteins according to their mass using SDS-PAGE followed by in-gel digestion (3). Alternatively, proteins are digested in-solution followed by the fractionation of peptides using isoelectric focusing or chromatography-based approaches (4). If specific modifications are to be analyzed, enrichment methods are commonly used; for example, phosphorylated peptides are enriched using antibodies or metal-ion affinity resins (5).
Cell lysis, protein purification, proteolytic digestion, and peptide fractionation have been optimized in several studies (e.g. (4, 6–8)). The reduction and alkylation step of proteins, however, has surprisingly not been systematically investigated up to date, despite its use in virtually any proteomic experiment (excluding experiments investigating disulfide bridges or cysteine modifications, for example (9)).
The most common reducing reagent used in proteomics experiments, dithiothreitol (DTT), was introduced in 1964, as a more stable and less toxic alternative to commonly used reducing reagents such as glutathione or β-mercaptoethanol (10). The second most common reducing reagent used by the community, tris-2(-carboxyethyl)-phosphine (TCEP), was introduced in 1999 (11). In this study, the authors showed that TCEP is less susceptible to oxidation than DTT and superior in conserving enzymatic activity using myosin as an example. Finally, β-mercaptoethanol (BME) is frequently used as reducing reagent among molecular biologists, for example, as part of the standard buffer for SDS-PAGE sample preparation (Laemmli buffer (12)), whereas it is rarely used in sample preparation for mass spectrometry-based proteomics.
After reduction of disulfide bridges, alkylation of the free SH-groups is usually performed by iodoacetamide (IAA, (3)). Because of a variety of side reactions that have been observed for IAA (13) several structurally related alternatives have been introduced, including iodoacetic acid (IAC), chloroacetamide (CAA), and acrylamide (AA), as well as the structurally not related reagents N-ethylmaleimide (NEM), methyl methanethiosulfonate (MMTS), and 4-vinylpyridine (14–16). Early studies performed with standard protein digests and MALDI mass spectrometry did not identify differences concerning the performance of the different alkylation reagents (14), whereas later studies identified several side products of the carbamylation reaction using IAA (13, 17–19). Based on these studies, one would expect that the abovementioned alternative alkylation reagents (IAC, AA, and CAA) should be used frequently to minimize the number of chemical artifacts. The authors have the impression, however, that the combination of DTT and IAA is nevertheless the most frequently used combination in the field of mass spectrometry-based proteomics (supplemental Fig. S1 and supplemental Table S1).
In this study, we systematically evaluated reduction and alkylation of proteins using three reducing agents (DTT, TCEP, and BME) in combination with four alkylation agents (IAA, IAC, AA, and CAA). We tested these conditions using strong anion exchange (SAX) fractionated in-solution digests and in-gel digested fractions of samples enriched for cytosolic proteins of HeLa cells.
EXPERIMENTAL PROCEDURES
Cell Culture/Preparation of Cytosolic Fraction
HeLa cells were grown on 15 cm cell culture dishes in DMEM, supplemented with 10% FCS, 100 U/ml penicillin, 100 μg/ml streptomycin, and 2 mm l-glutamine, until confluency. Cells were washed 3 times with ice cold PBS, harvested by scraping in 1 ml ice cold sucrose buffer (250 mm sucrose, 15 mm KCl, 1.5 mm MgAc, 10 mm HEPES, 1× Complete, Roche Diagnostics, Berlin, Germany), and homogenized using a dounce homogenizer on ice (30 strokes). The solution was centrifuged at 1000 × g, 4 °C for 10 min, and the supernatant was transferred to a new 1.5 ml tube, whereas the post nuclear pellet was re-suspended in 1 ml sucrose buffer for a second round of homogenization and centrifugation. The supernatants from both homogenization steps were combined and centrifuged at 10,000 × g, 4 °C for 10 min. The supernatant was transferred to a new tube and another centrifugation step at 100,000 × g, 4 °C for 1 h was performed. The supernatant (cytosolic fraction) was mixed with acetone (−20 °C) in a ratio of 1 to 4 (v/v), vortexed, and incubated at −20 °C overnight. The next day, samples were centrifuged at 20,000 × g, 4 °C for 30 min, the supernatant discarded, and the pellet air dried at 23 °C. The pellet was re-solubilized in 0.1 m TRIS-HCl/4% SDS by heating at 95 °C for 5 min, centrifuged at 20,000 × g, 23 °C for 30 min, and the clear supernatant transferred to a new tube. The protein concentration was determined using the DC protein assay (Bio-Rad, Hercules, CA). We generated enough cytosolic fraction in one batch to perform all experiments. The resulting sample was partitioned in several aliquots and stored at −20 °C until they were used for the individual experiments.
In-solution Reduction, Alkylation, and Digestion
For each experimental condition, 100 μg of protein were used. Protein reduction was performed for 30 min at 56 °C with 5 mm DTT, 5 mm TCEP, or 10 mm BME, respectively. Subsequent alkylation was performed in the dark for 30 min at 23 °C with 20 mm IAA, IAC, AA, or CAA, respectively. Samples were quenched by addition of DTT, TCEP, or BME, and digested in-solution by filter aided sample preparation (FASP), using 30 kDa molecular weight cutoff filters (Microcons, Merck Millipore, Billerica, MA) as described elsewhere (20). Briefly, samples were mixed with 200 μl of 8 m urea in TRIS-HCl, pH 8.5, loaded in the filtration devices, and centrifuged at 14,000 × g for 15 min. Four additional centrifugation steps were performed using buffer containing 8 m urea followed by 3 steps with 100 μl 0.05 m NH4HCO3 pH 7.8. Finally, 1 μg trypsin (Promega, Madison, WI) was added to each sample in 60 μl 0.05 m NH4HCO3 and incubated in a wet chamber overnight at 37 °C (enzyme/protein ratio 1:100). Peptides were eluted from the filtration devices and desalted using Oasis HLB 1 cc 10-mg cartridges (Waters, Milford, MA). The eluate was dried using a vacuum centrifuge, resuspended in 50 μl of water, and dried again. Subsequently SAX fractionation was performed as described elsewhere (8). Briefly, anion exchange microcolumns were manufactured by stacking 12 layers of 3 m Empore anion exchange disks (3M, Maplewood, MN) into a 200 μl micropipette tip. Microcolumns were equilibrated by consecutive centrifugation steps at 5000 × g for 3 min using 100 μl MeOH, 100 μl 1 m NaOH, and 100 μl Britton & Robinson buffer (BR buffer; 20 mm acetic acid, 20 mm phosphoric acid, and 20 mm boric acid) adjusted to pH 11. Peptides were dissolved in 100 μl BR buffer, pH 11, loaded on the microcolumns, and eluted stepwise with 100 μl of BR buffer adjusted to pH 8, 6, 5, 4, and 3. Fractions were collected on Stage tips (21) containing 6 layers of 3 m Empore C18 membrane. Stage tips were washed with 50 μl 0.1% trifluoroacetic acid (TFA) and peptides eluted with 40 μl 60% acetonitrile (ACN), dried in a vacuum centrifuge, and resuspended in 20 μl 5% ACN, 5% formic acid (FA).
In-gel Reduction, Alkylation, and Digestion
For each condition 50 μg of protein, mixed in a 1:4 (v/v) ratio with sample loading buffer (0.25 m TRIS-HCl pH 6.8, 8% SDS, 40% Glycerol, and 0.004% Bromphenol blue), was loaded on a 10% SDS-PAGE gel. Electrophoresis was performed, the gels washed 2 times with ddH2O, and stained using Coomassie brilliant blue (Page Blue Protein Staining Solution, Thermo Fisher Scientific, Waltham, MA) overnight. Gels were destained using ddH2O, the same gel section excised for each sample, cut into cubes of ∼1 mm3, and transferred to 1.5 ml micro tubes. Gel pieces were destained twice using 1 ml of 30% ACN, 0.07 m NH4HCO3 at 25 °C, 800 rpm for 30 min, and reduced using 100 μl of either 20 mm DTT, 20 mm TCEP, or 40 mm BME in 0.1 m NH4HCO3 at 56 °C for 45 min. After removal of the reducing reagent, 100 μl of 55 mm IAA, IAC, AA, or CAA in 0.1 m NH4HCO3 were added and samples incubated at 23 °C for 30 min in the dark followed by washing with 0.1 m NH4HCO3 at 800 rpm, 23 °C for 15 min. Subsequently, gel pieces were dehydrated by ACN and dried using a vacuum centrifuge. To each sample, 1 μg of trypsin (Promega) in 0.1 m NH4HCO3 was added and incubated overnight at 37 °C. Peptides were extracted from the gel pieces using an optimized protocol (6). Briefly, gel pieces were incubated consecutively with (1) 50 μl 0.1% TFA, 50% ACN, (2) 50 μl 0.1 m NH4HCO3, and (3) 100 μl 100% ACN at 25 °C, 800 rpm for 15 min. Supernatants were combined, dried using a vacuum centrifuge, resuspended in 100 μl 0.01% acetic acid, 3% ACN, and desalted using C18 stage tips (21). Peptides were eluted twice using 20 μl 0.5% acetic acid, 80% ACN, eluate fractions dried using a vacuum centrifuge, and resuspended in 20 μl 5% ACN, 5% FA.
LC-MS/MS Analyses
Analyses were performed using an EASY-nLC 1000 UHPLC system in combination with an Orbitrap Velos mass spectrometer (both Thermo Fisher Scientific). Five-microliter samples were loaded directly on in-house manufactured analytical columns (spray tips generated from 360 μm outer diameter and 100 μm inner diameter fused silica capillaries using a P-2000 laser puller (Sutter Instruments, Novato, CA) packed for 20 cm with 5 μm Reprosil AQ C18 particles (Dr. Maisch, Ammerbuch, Germany) at a flow rate of 1 μl/min using 100% solvent A (0.1% FA in water). Peptides were eluted using linear gradients from 99% solvent A, 1% solvent B (ACN with 0.1% FA) to 65% solvent A, 35% solvent B in 30 min for the SAX fractions and 60 min for the in-gel digests. Eluting peptides were ionized in the positive ion mode at 1.6 kV and MS survey scans from m/z 400 to m/z 1200 acquired at a resolution of 30,000 in the Orbitrap analyzer. CID-MS/MS spectra of the top 10 most abundant ions were performed in the ion trap and precursor ions excluded from further fragmentation for 30 s. For MSA scans, neutral losses of 105 (IAA and CAA), 106 (IAC), and 119 (AA) were defined.
Dimethyl Labeling
The same regions of the SDS-PAGE gel were cut as for the other analyses, proteins reduced, alkylated, and digested in-gel using trypsin and triethylammonium bicarbonate (TEAB) buffer. Extracted peptides were dried using a vacuum centrifuge, re-suspended in 0.1 m TEAB, and either labeled with light-, intermediate-, or heavy-formaldehyde as described elsewhere (22). Briefly, dried samples were re-suspended in 100 μl 0.1 m TEAB, 4 μl of 4% formaldehyde (CH2O, CD2O or 13CD2O), and 4 μl 0.6 m cyanoborohydride (NaBH3CN or NaBD3CN) added and the labeling reaction could proceed for 1h at 23 °C. After addition of 16 μl 1% ammonia and 8 μl FA samples were mixed, desalted using Oasis cartridges, and dried using a vacuum centrifuge.
Data Analysis
Thermo *.raw files were searched against Uniprot human (release 2016_01, 20121 entries) using Mascot 2.5.1 (Matrix Science, London, UK) with Proteome Discoverer 2.1.0.81 (Thermo Fisher Scientific). Depending on the searches different combinations of modifications were used (supplemental Fig. S2). The precursor mass tolerance was set to 10 ppm and the fragment ion tolerance to 0.8 Da. For the initial searches, with alkylation specified only for cysteine residues, a second search of unidentified spectra was performed with 50 ppm precursor mass tolerance and the data were combined. Trypsin was selected as enzyme and 2 missed cleavage sites were allowed. Peptides were exported with a false discovery rate of 1% at the peptide level and further processed using MS Excel. For error tolerant searches, Mascot result files were opened in a browser window in the peptide summary format and the top 20 protein hits were selected for error tolerant searching. For mass tolerant searches, *.raw files were converted to *.mgf files using Proteome Discoverer. The header of the *.mgf files was modified as follows: peptide mass tolerance: 250 Da; fragment ion tolerance: 0.8 Da; search type: SQ; fixed modifications: carbamidomethyl, carboxymethyl, or propionamide, respectively, at cysteine residues depending on the alkylation reagent. Searches were performed with and without oxidized methionine as variable modification. MS/MS ion searches using the modified *.mgf files were performed using Mascot with the following parameters: Uniprot human, trypsin with 2 missed cleavage sites, and ESI-TRAP as instrument. For error and mass tolerant searches, data were exported to MS Excel using the export function of Mascot in the browser window applying a peptide score cutoff of 30.
Synthetic Peptide Experiments
The synthetic peptide APEIMLNSK was a kind gift from Wolf Dieter Lehmann, German Cancer Research Center Heidelberg. It was synthesized using FMOC chemistry, purified using C18 chromatography, and lyophilized. From the dried peptide, a 10-mm stock solution was generated using 50% ACN, diluted to 10 pmol/μl, and the amount needed transferred to microtubes followed by drying in a vacuum centrifuge. 200 pmol of synthetic peptide were incubated with 20 μl 1 m DTT or 20 μl 0.1 m NH4HCO3, respectively, at 56 °C for 30 min. Subsequently, 100 μl alkylation reagent (1 m AA, IAA, IAC, or CAA) or 0.1 m NH4HCO3 was added, followed by incubation at 23 °C for 30 min in the dark. The solutions were desalted using C18 stage tips (21), the elution fractions dried using a vacuum centrifuge, and re-suspended in 30% ACN, 0.1% FA. For oxidation experiments, 70 μl of the peptide solution were incubated with 14 μl hydrogen peroxide (H2O2) for 30 min at 23 °C followed by reduction and alkylation.
MALDI Analyses
One microliter of matrix solution (20 mm α-Cyano-4-hydroxycinnamic acid (CHCA) in 70% ACN, 0.1% TFA) was applied on a ground-steel target, mixed with 1 μl of sample, and dried at 23 °C. Samples were analyzed using an Autoflex III MALDI-TOF/TOF (Bruker Daltonics, Bremen, Germany). 3 × 1000 shots per spectrum were acquired at a laser frequency of 200 Hz. The instrument was operated in linear mode with matrix suppression set to m/z 600. Spectra were acquired in positive ion mode in a mass to charge range of m/z 700 to m/z 2000.
RESULTS
Analysis of SAX Fractionated In-solution Digestion Data Sets
To minimize undersampling caused by random precursor selection in the data dependent acquisition mode, we chose a sample of medium complexity: the cytosolic fraction of HeLa cells (see Material and Methods). The workflow for sample preparation is shown in Fig. 1A. We reduced and alkylated 100 μg of protein with 12 different conditions covering the combination of each of the respective reagents, as well as one control sample, which was neither reduced nor alkylated (Fig. 1C). Reduction was performed using dithiothreitol (DTT, (10)), tris-(2-chlorethyl)-phosphate (TCEP, (11)), and β-mercaptoethanol (BME, (12)). For alkylation we used iodoacetamide (IAA, (23)), iodoacetic acid (IAC, (13)), acrylamide (AA, (14)), and chloroacetamide (CAA, (19)) (Fig. 1B). To minimize unwanted side reactions alkylation was always performed in the dark and the reaction quenched immediately by addition of the respective reducing reagent. The control sample was treated even-handedly but without any reduction and alkylation reagents. We then proceeded to tryptic in-solution digestion using FASP and SAX-fractionation into 6 fractions (8). Finally, we analyzed the fractions using LC-MS/MS. For each experimental condition, we performed 2 independent replicates and combined the data for Mascot database searching using Proteome Discoverer. We defined products of the different alkylation reactions at cysteine as variable modification and calculated alkylation efficiencies, which were high for all conditions, ranging between 97.01% (TCEP/CAA) and 99.84% (DTT/IAC), respectively (supplemental Table S2). Despite the low variation in numbers of acquired MS/MS spectra in the single experiments (coefficient of variation: 0.039, supplemental Table S2), we observed strong differences in the number of identified spectra between the different conditions. The worst performance resulted from DTT in combination with IAC (on average 19,054 peptide spectral matches, PSMs), whereas DTT in combination with AA gave the best performance (29,390 PSMs) with an increase of 54% compared with DTT/IAC (Fig. 2A, supplemental Table S2). We observed a similar trend on the level of identified unique peptides (DTT/IAC on average 8508 identifications and DTT/AA 11,690 identifications; corresponding to an increase of 37% Fig. 2B, supplemental Table S2) that was, however, less profound compared with the PSM levels. In general, we observed similar trends for the individual alkylation reagents irrespective of the reducing reagent used. When we compared the best alkylation conditions for each set, the different reducing reagents only resulted in changes of up to 5% on the PSM level. Samples alkylated with IAA and IAC resulted in the lowest number of identified spectra with DTT/IAC performing the worst. For DTT-reduced samples, AA outperformed CAA, whereas the opposite was observed for TCEP-reduced samples. For BME both AA and CAA resulted in virtually the same number of identified PSMs. Surprisingly, the control sample without any reduction and alkylation yielded competitive results, identifying 84% of the PSMs of the top condition DTT/AA (Fig. 2, supplemental Table S2). Based on these results the best condition for in-solution digestion of proteins is reduction of samples with DTT in combination with alkylation using AA as it resulted in the highest number of identified PSMs.
Fig. 1.
A, Experimental workflow for sample generation and analysis; B, reduction and alkylation reagents used; and C, combinations of reduction and alkylation reagents for generation of the different data sets. DTT: dithiothreitol, TCEP: tris-(2-chlorethyl)-phosphate, BME: β-mercaptoethanol, IAA: iodoacetamide, IAC: iodoacetic acid, AA: acrylamide, CAA: chloroacetamide.
Fig. 2.
Results of the analyses of 2 independent replicates of in-solution digested and SAX fractionated as well as 6 replicates of in-gel digested cytosolic fractions of HeLa cells using standard modifications for database searching. A, peptide spectral matches (PSMs) for in-solution digests; B, identified unique peptide sequences (UPSs) for in-solution digests; C, peptide spectral matches (PSMs) for in-gel digests; D, identified unique peptide sequences (UPSs) for in-gel digests. DI: DTT+IAA, DIA: DTT+IAC, DA: DTT+AA, DC: DTT+CAA, TI: TCEP+IAA, TIA: TCEP+IAC, TA: TCEP+AA, TC: TCEP+CAA, MI: BME+IAA, MIA: BME+IAC, MA: BME+AA, MC: BME+CAA.
Analysis of In-gel Digestion Data Sets
One possible reason for the discrepancies in identification rates are differences in SAX fractionation performance of the in-solution digested samples. We therefore generated another data set applying the second strategy commonly used in proteomics sample preparation: separation of proteins by SDS-PAGE, followed by in-gel digestion (23). The sample buffer was omitting a reducing reagent to exclude differential running behavior of the proteins during electrophoresis associated to varying reduction efficiencies. We ran 50 μg of the cytosolic fraction on SDS-PAGE gels (2 lanes for each condition) and cut the same section of the gel, using ∼20% of the lane, covering a region between 30 to 50 kDa (supplemental Fig. S3). After reduction and alkylation in the gel, samples were in-gel digested, analyzed by LC-MS/MS in 3 technical replicates, and the data processed in the same way as the in-solution digested samples. We performed initial searches allowing for propionamide at cysteine for all conditions to investigate for possible unwanted side reactions with free acrylamide in the gel. In all conditions, except for AA as alkylating reagent, less than 1% of the cysteine containing peptides were found to be propionamide-modified. Therefore, propionamide at cysteine was omitted in the following searches of samples not alkylated by AA. Alkylation efficiencies were excellent for samples reduced with DTT (for all alkylation reagents on average above 98.6%), whereas samples reduced with BME and TCEP were slightly lower. CAA in combination with TCEP, which resulted in average alkylation yields of only 95.2%, was the only exception (supplemental Table S3). In contrast to the in-solution samples, the reduction reagent had a stronger influence on the number of identified spectra. For IAA, AA, and CAA, BME clearly outperformed the other reduction reagents by almost doubling the number of PSMs identified on average in the case of IAA (DTT with 3193 ± 270 PSMs versus BME with 6281 ± 596 PSMs, Fig. 2C, supplemental Table S3). For the alkylating reagents, the iodine-containing reagents IAA and IAC performed worse than AA and CAA in all cases, irrespective of the reducing reagent. The biggest differences were observed in combination with DTT, where IAA treated samples resulted in 3193 ± 270 PSMs and CAA treated samples in 7365 ± 655 PSMs, respectively, resulting in an increase of 131%. Despite these large changes in PSMs, the number of acquired MS/MS spectra was roughly constant among all conditions (coefficient of variation: 0.028, supplemental Table S3). We observed similar trends for identified unique peptides (DTT/IAA 2435 ± 141 identifications and BME/CAA 4717 ± 637 identifications, corresponding to an increase of 94%, Fig. 2D). Best results were obtained for samples reduced with BME and alkylated with AA (8494 ± 767 PSMs). For the in-gel digests, the control sample, with neither reduction nor alkylation, performed worse than in the in-solution digested samples, identifying on average only 72% of PSMs resulting from the best condition BME/AA. It, however, still outperformed most of the conditions including the iodine-containing alkylation reagents (Fig. 2C, 2D, supplemental Table S3). We therefore recommend to reduce and alkylate samples for in-gel digestion by BME and AA.
Differences in Identification Efficiency
Next, we investigated possible reasons for the observed differences. Since the number of acquired MS/MS spectra did not change dramatically between conditions, we ruled out influences on protein digestion as possibility. We then investigated the efficiency of peptide identification (supplemental Fig. S4). For in-solution and in-gel digestion, the percentage of identified spectra varied considerably among the conditions, like the trends observed for PSMs. DTT/IAC resulted in the lowest efficiencies for both in-gel and in-solution digests (23 ± 2% identified spectra for in-gel digests and 37.5% identified spectra on average for in-solution digests). Highest efficiencies were obtained with BME/AA in-gel (50 ± 2% of the spectra identified) and BME/CAA in-solution (57.3% of the spectra identified on average). Among the evaluated alkylation reagents, IAA and IAC resulted in the lowest efficiencies of peptide identification across all conditions; indicating that the iodine-containing reagents lead to the highest numbers of non-specifically modified peptides. These results show that the reduction and alkylation conditions can have a strong effect on the identification efficiency of peptides.
Investigation of Cysteine-containing PSMs
As the alkylation reaction targets cysteine residues, we first investigated whether a change in the number of cysteine-containing peptides was the reason for the observed differences. For the in-solution digested samples (Fig. 3A left panel, supplemental Table S2), surprisingly, DTT in combination with IAA resulted in the highest number of identified cysteine-containing PSMs (8, 178) despite its low performance related to the total number of identified PSMs. IAA also performed excellent for the other reducing agents (6601 PSMs for TCEP and 6694 PSMs for BME), whereas only TCEP/AA was better (7431 PSMs). For DTT and TCEP, CAA resulted in the lowest yields (5064 and 3789 PSMs, respectively), whereas IAC performed worst for the BME samples (2895 PSMs). With only 32 PSMs (out of 49,242 in total), the control sample resulted in virtually no identified cysteine-containing PSMs at all, indicating that all cysteine residues are present as disulfide bonds if the samples are not reduced and alkylated. The in-gel digested samples presented an entirely different result (Fig. 3A right panel, supplemental Table S3). Compared with DTT and BME, TCEP did not perform well with any of the alkylation reagents. Only for IAC, all 3 reduction reagents resulted in similar numbers of cysteine containing PSMs. For DTT and BME, AA and CAA clearly outperformed IAA and IAC leading to an increase in cysteine-containing PSMs of up to 3-fold for DTT (1301 ± 156 PSMs with CAA compared with 358 ± 35 for IAC) and BME (1496 ± 147 PSMs with AA compared with 495 ± 43 with IAC). Across all reducing reagents, IAC resulted in the lowest number of PSMs. As for the in-solution digested samples, we were only able to find negligible amounts of cysteine-containing PSMs in the control sample (8 ± 7 out of 6078 ± 399 PSMs). Despite the large differences observed in cysteine-containing PSMs, they failed to explain the differences in the total numbers of PSMs for both, the in-solution an in-gel digests.
Fig. 3.
A, Cysteine-containing PSMs identified in the in-solution and in-gel digested samples using normal search parameters; B, PSMs identified allowing for offsite alkylation at Y, S, D, T, H, E, K, and the peptide N terminus (single blots showing the abundance at each amino acid can be found in supplemental Figs. S6, S7); C, PSMs identified allowing for alkylation reagent dimers at Y, S, D, T, H, E, K, and the peptide N terminus (single blots showing the abundance at each amino acid can be found in supplemental Figs. S8, S9). Shown are the combined results from 2 independent replicates for the in-solution digested and 6 replicates for the in-gel digested samples, respectively. DI: DTT+IAA, DIA: DTT+IAC, DA: DTT+AA, DC: DTT+CAA, TI: TCEP+IAA, TIA: TCEP+IAC, TA: TCEP+AA, TC: TCEP+CAA, MI: BME+IAA, MIA: BME+IAC, MA: BME+AA, MC: BME+CAA, N-term: peptide N terminus.
Investigation of Offsite Alkylation, Double Alkylation, and Iodination
We therefore extended our search strategy to alkylation of residues other than cysteine, as this was shown to occur on synthetic peptides with high efficiency (13, 24, 25). Using the in-gel data, we performed initial searches using 1 technical replicate of the 2 individual experiments. We allowed for alkylation at all possible residues with a Proteome Discoverer workflow consisting of several parallel Mascot searches. Search results were combined and for each spectrum the identification with the best Percolator PEP value was selected. This resulted in assignment of alkylation at almost all amino acids as well as the peptide N terminus, whereas the latter was alkylated most frequently (supplemental Fig. S5, supplemental Table S4). At methionine, tryptophan, and glutamine virtually no alkylation was assigned. As it is unlikely that the alkylation reagents modify amino acids without functional groups such as glycine, valine, or alanine, which were found to be alkylated frequently, we evaluated the data manually. Almost all modifications of these amino acids were in the first two N-terminal amino acids. These amino acids form the b2 ion during peptide fragmentation resulting in a single molecule (26) and are therefore only distinguishable when investigating b2/a2 ion fragmentation patterns (27), which is not considered in Mascot searches. We therefore assigned these alkylation products to the peptide N terminus, because it was most likely that the alkylation occurred at the N-terminal amino group, which has been shown to be efficiently alkylated (13). This resulted in a short-list of possibly modified residues (cysteine, the peptide N terminus, and 7 other amino acids), which were searched subsequently in single Mascot searches for all 6 replicates of the in-gel digested samples and both replicates of the in-solution digests. We again detected alkylation at all specified amino acids with considerable differences between the alkylation reagents (Fig. 3B, supplemental Figs. S6, S7 as well as supplemental Tables S5, S6 for in-solution and in-gel digests, respectively). Irrespective of the digestion strategy, or the reagents used for reduction and alkylation, the peptide N terminus was modified most frequently and tyrosine least frequently. For in-solution digests, we identified the highest number of offsite alkylation with BME/IAC (3655 PSMs), followed by TCEP/AA (3132 PSMs) and DTT/IAC (2567 PSMs), whereas CAA resulted in the lowest number of unspecific modifications irrespective of the reducing reagent. Relative to the total number of assigned PSMs in each experiment, the in-gel digested samples resulted in significantly higher numbers of offsite alkylated residues compared with the in-solution digests. Particularly the iodine-containing reagents resulted in high numbers of offsite alkylation, which were on average between ∼3- and ∼10-fold higher when comparing individual iodine-containing to non-iodine-containing reagents. The lowest numbers were observed for TCEP in combination with AA (239 ± 84 PSMs), whereas DTT/IAC resulted in the highest numbers (2888 ± 375 PSMs). The highest occurrence of alkylation was found at the peptide N terminus followed by lysine, glutamic acid, and histidine residues for the iodine-containing alkylation reagents. Both, AA and CAA resulted in roughly similar numbers of offsite alkylation, whereas the N terminus, serine, threonine, and glutamic acid were modified most frequently. We mainly identified singly and doubly alkylated peptides, but also up to 6 times alkylated species were found (supplemental Fig. S10, supplemental Tables S5, S6). The identified offsite alkylation was not able to fully compensate for the discrepancies observed between the conditions. Therefore, we argued that additional modifications must occur during the alkylation process which were not identified with our current search strategy. It was shown that IAA can result in dimers at lysine residues mimicking the GlyGly tag found after tryptic digestion of ubiquitinated samples (19). We therefore performed searches for dimers of all 4 alkylation reagents at the 7 amino acids and the peptide N terminus analogous to the searches performed for the monomers. We identified double adducts at all amino acids for all alkylation reagents (Fig. 3C, supplemental Figs. S8, S9 and supplemental Tables S7, S8 for in-solution and in-gel digests, respectively). We also performed searches with iodination at both histidine and tyrosine (28), as it is possible that this reaction may occur when using iodine-containing reagents, but virtually no iodination was identified (data not shown). The high number of identified offsite alkylation shows impressively the reactivity of the chemicals used for alkylation at cysteine residues with basically every possible functional group present within a protein. Therefore, reduction and alkylation conditions should be chosen carefully employing minimal reagent concentrations and incubation times to minimize side reactions as efficiently as possible.
Quantification of Individual Peptide Losses by Dimethyl Labeling
The observed PSMs for each sample give an estimate how much of the sample was modified in general. They do, however, not allow calculating actual ratios for individual peptides. Since we frequently observed the modified and unmodified version of the peptides, we wondered to which degree those peptides were unintentionally modified. To investigate this, we applied a quantitative approach using dimethyl labeling (22). In-gel digests reduced with DTT and alkylated with either IAA or AA, as well as an untreated control sample, were labeled with light, medium, and heavy dimethyl reagents, respectively, in two independent replicates. Samples were analyzed by LC-MS/MS and quantified using Proteome Discoverer. We excluded all peptides containing cysteine from the results, as they exhibited different masses and chromatographic retention times because of the different alkylation groups, as well as samples with missing values in one of the channels. We quantified ∼1700 peptide groups, the normalized ratios in log2-scale are shown in Figs. 4A, 4B (supplemental Table S9). Applying a cutoff of 2-fold, we observed several hundred significantly up- or down-regulated peptides (indicated in green or red, respectively). For the IAA treated samples we observed 315 peptides (representing 19% of the data set) with reduced intensity. The highest fold-change observed was 63-fold whereas 100 peptides were down-regulated by more than 10-fold and 200 more than 5-fold. For the AA treated samples, 70 peptides were reduced in intensity up to 90-fold whereas in total only ∼20 peptides were down-regulated more than 5-fold. Interestingly, we observed some overlap between both alkylation strategies but not all peptides affected by unspecific alkylation by one strategy were also affected by the other (supplemental Table S9). We also observed peptides with increased intensity, however to a much lower extent. For IAA, 65 peptides showed up to 33-fold higher intensities compared with the untreated control sample and for AA 40 peptides showed an increase of up to 14-fold. The ratios obtained for the dimethyl labeled samples show that certain peptides get modified very efficiently as their unmodified version is practically absent in the sample. They also show, however, that alkylation can have positive effects on the detectability of peptides.
Fig. 4.
Quantification by dimethyl labeling of samples reduced and alkylated in the gel. Samples were reduced with DTT and alkylated with IAA (light label), AA (medium label) or nothing (control, heavy label). Shown are the combined results from 2 independent technical replicates. A, Log2 values of peptide ratios obtained for the IAA treated samples relative to the control samples, B, Log2 values of peptide ratios obtained for the AA treated samples relative to the control samples. Peptide ratios with log2 values ranging from 1 to 2 are indicated in light green, above 2 in dark green, from −1 to −2 in light red, and below −2 in dark red.
Analysis of Amino Acid Composition
Because both alkylation strategies add groups of differing molecular weight and the control sample is not alkylated at all, it is not possible to quantify the degree of unspecific alkylation at certain amino acids by investigating the offsite-alkylated peptides. Therefore, to investigate if a specific amino acid was most frequently affected, we analyzed the amino acid composition of peptides which changed in abundance more than 2-fold. We determined how frequently each amino acid occurred in the up- and down-regulated peptide groups of the AA and IAA treated sample relative to the control sample and normalized the determined amino acid composition to the whole data set (Fig. 5A, supplemental Table S9). In the data sets of AA alkylated samples for both the up- and down-regulated peptides we were not able to find any significant differences. For IAA, surprisingly, the only amino acid strongly affected was methionine, being 3.8-fold over-represented in the downregulated peptides relative to the rest of the data set. Further analysis revealed that methionine was present in 252 out of the 315 peptides with a change of more than 2-fold in abundance. In the previous search for unspecific offsite alkylation, however, methionine was not found to be alkylated at all (supplemental Fig. S5). As the change in methionine-containing peptides may also be because of unexpected side reactions during dimethyl labeling, we investigated the in-solution and in-gel data sets for the abundance of each amino acid. We observed a depletion of methionine in both data sets when investigating the normalized amino acid composition across all identified peptides (Fig. 5B, 5C, supplemental Tables S2, S3), confirming the trend observed in the dimethyl data. We then plotted the numbers of methionine-containing PSMs to compare between the different conditions (Fig. 5D, supplemental Tables S2, S3). For both in-gel and in-solution digests, the iodine-containing alkylation reagents caused a strong reduction in the number of methionine-containing peptides with the lowest numbers of PSMs observed for the DTT/IAC treated sample (on average 2112 and 275 ± 25 methionine-containing PSMs for the in-solution and in-gel digested samples, respectively) and the highest for DTT/AA for the in-solution digested samples (on average 8852 PSMs) and BME/AA for the in-gel digested samples (2632 ± 209 PSMs). These data clearly indicate that methionine became efficiently modified by the iodine-containing alkylation reagents IAA and IAC, but not by the non-iodine reagents AA and CAA. Therefore, a covalent modification induced by the presence of iodine-containing alkylation reagents, should have been identified in the MASCOT searches at the side chain of methionine.
Fig. 5.
Investigation of the abundance of methionine in peptides observed in the data sets. A, abundance of amino acids in peptides which are up- or down-regulated by at least 2-fold in the IAA (blue circles) or AA (red triangles) treated sample relative to the control sample in the dimethyl data set; b/c) abundance of amino acids in all data sets of the in-solution digests (B) and the in-gel digests (C). For each data set all amino acids of the individual peptides were counted, summed, and normalized to the number of all amino acids observed in the whole data set. The resulting ratios are plotted as log2 values and indicate if a certain amino acid is over- or underrepresented in the respective data set; D, number of spectra matched to peptide sequences containing methionine in the in-solution digests (upper panel) and in-gel digests (lower panel). DI: DTT+IAA, DIA: DTT+IAC, DA: DTT+AA, DC: DTT+CAA, TI: TCEP+IAA, TIA: TCEP+IAC, TA: TCEP+AA, TC: TCEP+CAA, MI: BME+IAA, MIA: BME+IAC, MA: BME+AA, MC: BME+CAA.
Error and Mass Tolerant Searches
Because we were not able to identify any methionine-containing peptides with alkylated side chains (supplemental Fig. S5) we performed error tolerant (29) and mass tolerant (30) searches to investigate for the missing methionine-containing peptides using two independent replicates of the in-gel digested samples (supplemental Table S10). In the error tolerant searches, no modifications that have not already been found in our other database searches were identified in significant numbers (data not shown). For the mass tolerant searches, we initially also did not observe any striking differences between the treatments despite a higher incidence rate of identified peptides with a mass shift of +57 Da, which is because of the offsite alkylation mentioned above (Fig. 6A). When we repeated the mass tolerant searches allowing for oxidation of methionine, however, we observed more than 1000 PSMs for the samples alkylated with iodine-containing alkylation reagents at −64 Da (Fig. 6B), which were not present in the samples alkylated with AA and CAA. These PSMs contained methionine. The neutral loss of 64 Da from the molecular peptide ion is defined as the loss of the side chain from oxidized methionine residues (www.unimod.org and (31, 32)). When oxidation at methionine is allowed as variable modification in the Mascot searches, the neutral loss of 64 Da is considered for annotation of the fragment ions. This explains why these spectra were only identified in searches with oxidation at methionine and not already in the other searches at a mass deficiency of 48 Da, as it should make no difference in the mass tolerant searches whether a mass error of 64 or 48 Da is assigned. Without the modification and the assignment of a neutral loss of 64 Da to fragment ions, however, ∼50% of the fragment ions in the MS/MS spectra could not be matched by Mascot resulting in low scoring peptide identifications falling below our applied score cutoff of 30. These searches showed that there was indeed a modification at methionine residues which results in a loss of the side chain of the amino acid.
Fig. 6.
Search results of mass tolerant searches for the in-gel digested samples. Shown are the combined results for 2 replicates of all reducing reagents in combination with either iodine or non-iodine-containing alkylation reagents. Alkylation at cysteine residues was set as fixed modification dependent on the alkylation reagent in the individual searches, a search window of ±250 Da applied to the precursor mass and mass errors were grouped in bins of 5 Da. The occurrence of peptides in the respective bins was counted for iodine-containing alkylation reagents (IAA and IAC, plotted as positive values) and non-iodine-containing reagents (AA and CAA, plotted as negative values). Displayed are the results of searches without variable modification (A) and with oxidized methionine (oxM) as variable modification (B). Highlighted in the lower panel is the bin from −65 to −60 Da which almost exclusively consists of peptides assigned with a mass error of −64 Da. These PSMs contain methionine. Bins referring to mass errors of 0 to 5 Da are not shown as they almost exclusively contain unmodified peptides and contain several tens of thousands of spectra.
Investigation of Methionine Side Chain Loss Using a Synthetic Peptide
To further investigate the side chain loss induced by iodine-containing alkylation reagents, we incubated the synthetic peptide APEIMLNSK with DTT, followed by IAA, IAC, AA, and CAA, and analyzed the samples with MALDI-MS (Fig. 7A). For the samples treated with IAA and IAC a strong loss of the methionine side chain from the molecular ion (-48 Da) could be observed. AA and CAA did not result in any losses, though we noticed offsite alkylation of a fraction of the peptides (∼25% for the AA treated and ∼10% for the CAA treated sample). Fragmentation of the [M-48+H]+ of the IAA-derivatized peptide clearly proved that the loss indeed originated from the methionine residue (Fig. 7B). To determine if the loss of the side chain exclusively depends on the alkylation reagent we also tested DTT and the alkylation reagents alone. DTT did not influence the peptide at all (supplemental Fig. S11). Incubation with the alkylation reagents alone led to the same results as in combination with DTT (supplemental Fig. S12) as well as additional offsite alkylation at other residues, because we also observed ions at the mass of the carbamidomethylated version of the [M-48+H]+. Titration of the synthetic peptide with IAA revealed a linear relationship for alkylation of methionine dependent on the IAA concentration (supplemental Fig. S13), indicating that the efficiency of the reaction is concentration-dependent. As it may also be possible that the loss of the methionine side chain results from its oxidation followed by fragmentation in the gas phase (31, 32) we oxidized the peptide with H2O2, and reduced and alkylated it (supplemental Fig. S14). We observed an abundant signal for the oxidized peptide but only a minor loss of the oxidized side chain (-64 Da) with similar intensity for all alkylation reagents. Furthermore, we detected the alkylated version of the oxidized peptide but no loss of the side chain from the respective signal, implying that the oxidized side chain of methionine is not further modified and the offsite alkylation takes place at another residue. Therefore, the analysis with the synthetic peptide clearly showed that the iodine-containing alkylation reagents led to the efficient loss of the methionine side chain which we observed in our data sets.
Fig. 7.
A, MALDI-MS spectra of the synthetic peptide with the amino acid sequence APEIMLNSK reduced by DTT and alkylated by IAA, IAC, AA, and CAA. A prominent loss of the side chain of methionine can be observed from the molecular ion (M+H) for the peptides alkylated using iodine-containing reagents; B, MS/MS spectrum of the [M+H]+ of the unmodified peptide (upper panel) and the signal at M-48Da (lower panel). The fragment ions show that the loss of 48 Da originates from the methionine side chain.
Investigation of Methionine Side Chain Loss in LC-MS/MS Measurements
We observed the peptide lacking the methionine side chain in the MALDI measurements of the alkylated peptides, but not its alkylated form. Furthermore, we only observed peptides in the mass tolerant searches, which already had lost the side chain of methionine. Therefore, the loss of the methionine side chain, in principle, can be either because of an elimination in-solution, or because of an efficient in-source fragmentation of carbamidomethylated (IAA) or carboxymethylated (IAC) methionine. It was previously described, that peptides containing carbamidomethylated methionine can undergo in-source fragmentation at high cone voltages, while staying intact at lower values (17). In another study, it was shown that carbamidomethylated methionine can cause a neutral loss of 105 Da when fragmented in the parent ion scanning mode further indicating the presence of intact carbamidomethylated methionine in ESI ionization (18). As the searches allowing for carbamidomethyl at methionine did not yield any PSMs in our data sets (supplemental Fig. S5) we defined a modification accounting for the loss of the side chain (-SHCH3) for identification of MS/MS spectra from in-source fragmented peptides, and performed Mascot searches. For the in-solution digested samples we identified up to 703 PSMs lacking the side chain of methionine (DTT/IAC) and up to 198 ± 101 (BME/IAC) for the in-gel digested samples (supplemental Fig. S15, supplemental Tables S11 and S12). When we investigated the *.raw files we observed a prominent peak at +105 (for IAA) and +106 (for IAC) Da, which coeluted with the peptides annotated with a loss of the methionine side chain (Supplemental Figs. S16, S17). The MS/MS fragment ion spectra of these peaks showed a prominent peak corresponding to the neutral loss of the alkylated methionine side chain but otherwise no fragment ions were observed from which any sequence information could be derived (supplemental Fig. S18). We therefore defined multi stage activation (MSA, (33)) methods defining the loss of the modified side chain for the individual alkylation reagents as a neutral loss for triggering the MSA scan. We then remeasured the samples of one replicate of the in-gel digests treated with DTT and all alkylation reagents with the respective MSA method. Subsequently, we performed database searches allowing for alkylation of methionine (MS/MS or MSA of intact peptides), loss of the methionine side chain (MS/MS of in-source fragmented peptides), carbamylation of methionine (MS/MS of intact peptides) and methionine oxidation (Fig. 8, supplemental Table S13). For IAA treated samples, 19% of the identified methionine-containing PSMs were unmodified, 31% oxidized, and 50% carbamidomethylated (sum of PSMs identified for MSA and MS/MS of in-source fragmented peptides), whereas for IAC only 6% were unmodified, 43% oxidized, and 51% carboxymethylated (sum of PSMs identified for MSA and MS/MS of in-source fragmented peptides). For AA and CAA 63 and 64% of the methionine-containing peptides were unmodified, respectively, whereas 32 and 33% were oxidized and only negligible amounts were annotated to be alkylated. We conclude therefore that the alkylated side chains of methionine were predominantly formed by iodine-containing reagents (as already indicated by the MALDI measurements) and that they were stable in-solution. In the gas phase, however, they fragmented efficiently either in the source of the mass spectrometer or during targeted MS/MS fragmentation resulting in an abundant neutral loss and preventing identification of the peptide.
Fig. 8.
Identification of peptides containing methionine (M) residues in in-gel digested samples reduced with DTT and alkylated with all 4 alkylation reagents. Data were acquired using a multi stage activation (MSA) method triggered by the neutral loss of the alkylated side chain of M in the MS/MS spectrum. M-containing peptides were identified in their unmodified form, being oxidized at M, containing alkylated M (identified by MSA), and with loss of the M side chain (in-source fragmented alkylated peptides). Displayed is the percentage of PSMs containing unmodified, oxidized, and alkylated methionine relative to the total amount of M-containing PSMs in the respective analysis.
DISCUSSION
In the current study, we show that reduction and alkylation conditions can have a strong impact on the identification of peptides because of unspecific alkylation using an acetone precipitated cytosolic fraction of HeLa cells, re-solubilized in 0.1 m TRIS-HCl/4% SDS at 95 °C. Interestingly, for in-gel and in-solution digestion of proteins, different combinations of reduction and alkylation reagent were found to give the best results. For the reducing reagents, surprisingly, TCEP and BME, the substances used least frequently by the community (in 21 and 0% of articles investigated, respectively, supplemental Fig. S1, supplemental Table S1), resulted in highest numbers of peptide and protein identifications for the in-gel digested samples. A likely reason is that they induced the lowest number of unspecific modifications in combination with the respective alkylation reagents. This could be because of DTT making either the side chains of the amino acids more susceptible to modification or by generating more reactive intermediates with the alkylation reagents as compared with TCEP or BME. We were, however, not able to detect any evidence that the reducing reagent itself resulted in unspecific modifications of peptides (e.g. by finding adducts matching its mass). For the in-gel digests, TCEP did not perform well for the identification of cysteine-containing peptides. This may be because of the removal of TCEP before adding the alkylation reagents. It is common practice to perform reduction and alkylation in the same step when using TCEP as it was the case in our in-solution digests (because we did not remove TCEP before addition of the alkylation reagent). It may therefore increase performance for in-gel digested samples if a mixture of TCEP and the alkylation reagent is used. For the experimental setup used in this study, we recommend BME for reducing proteins in SDS-PAGE gels. For in-solution digests, DTT, BME, and TCEP performed equally well and the choice should be made based on potential preferences in the alkylation reagents as AA seems to work better with DTT and BME, whereas CAA performs better in combination with TCEP. This was especially the case regarding the alkylation efficiency of cysteine-containing peptides, which may introduce inconsistencies, for example, in peptide quantification. In general, the iodine-containing alkylation reagents (IAA and IAC) resulted in the worst identification rates irrespective of the digestion condition, or reducing reagent, and should therefore be avoided. We compared in this study alkylation reagents that are structurally related. There are, however, also other iodine-free reagents, such as N-ethylmaleimide (NEM), methyl methanethiosulfonate (MMTS), or 4-vinylpyridine, which are structurally not related. These compounds will probably also result in lower numbers of side reactions compared with the iodine-containing reagents.
Surprisingly, for both digestion strategies, the samples that were neither reduced nor alkylated resulted in good numbers of spectra and protein identifications despite the total lack of cysteine-containing peptides. This shows that disulfide bridges do not seem to hamper the proteolytic cleavage by trypsin significantly as we also could not see an increase in missed cleavage sites in these samples (data not shown). The reproducibility between the single replicates for the untreated samples ranged among the highest compared with the other conditions, further exemplifying the negative effect of random unspecific modifications on the quality of sample preparation. The complete lack of cysteine-containing peptides in these data sets indicated an efficient formation of disulfide bridges among proteins or peptides and underlines the necessity of efficient reduction and alkylation in sample preparation for proteomics experiments.
For the in-solution digested samples the percentage of identified, nonspecifically modified peptides was lower and alkylation efficiencies better compared with the in-gel digested samples. This is most likely because of the more efficient removal of chemicals by FASP and the fact that the reduction and alkylation chemicals must diffuse into the gel pieces, which may alter accessibility of the proteins. Reduction and alkylation before gel electrophoresis would therefore most likely improve reproducibility, as it allows for better accessibility of the proteins and immediate separation of the proteins from the reactive chemicals during electrophoresis. Over all, we observed significantly higher differences in spectra identification rates between the alkylation reagents in the in-gel digests compared with the in-solution digests. There are several factors that may be responsible for this: (1) we used higher concentrations for the alkylation reagents in the gel (20 mm for the in-solution digests and 55 mm for the in-gel digests); (2) the alkylation reactions were quenched by addition of the respective reducing reagents for the in-solution digests, but not the in-gel digests; (3) because of the FASP strategy applied for the in-solution digests reactive chemicals were removed immediately after performing the reaction. Therefore, for the in-gel digests, unspecific reactions could proceed for a longer time, which also explains the higher degree of modified peptide N termini in the data set as this can only happen after tryptic cleavage. These factors influence the reaction kinetics of the alkylation reactions which is probably a major reason for the differences observed. If other protocols are applied (e.g. by different reagent conditions, altered incubation times, or different buffer systems) the differences observed may differ significantly. Additionally, the in-solution digests were analyzed with 30 min gradients whereas the in-gel digests were analyzed with 60 min gradients. This resulted in ∼100,000 spectra for the whole in-solution data set and ∼35,000 spectra for the in-gel data set, respectively. As the in-gel data set was only derived from ∼1/5 of the sample the average instrument time per number of peptides contained in the sample was higher by a factor of almost 2-fold, making it more likely to fragment and identify low abundant species in the in-gel digests. For SAX on the other hand, because of incomplete separation of highly abundant unmodified peptides between the different fractions, the effect of the alkylation reagents might have been masked because the low abundant derivatized peptide species were not fragmented at all.
When we investigated the data sets for unwanted side reactions, searches of the in-solution digests, considering multiple variable modifications, produced in almost all cases lower numbers of PSMs compared with the searches without offsite alkylation, which we performed initially. This was most likely because of the increased cutoff for peptide acceptance in order to maintain an FDR of 1% with the higher numbers of variable modifications (and therefore a strong increase in search space). This leads to lower total numbers of PSMs, even though a high number of modified peptides is identified. Therefore, it is difficult to assess how far the identified offsite alkylations can compensate for the lack of identified PSMs in the initial data sets of the in-solution digests. For the in-gel digests, searching for offsite alkylation resulted in greatly increased numbers of peptide identifications for the iodine-containing reagents almost reaching PSM numbers of the iodine-free alkylation reagents. For the latter, however, including the offsite alkylation in the searches always reduced the number of identified peptides presenting a similar picture as for the in-solution digests. For the samples alkylated with iodine-containing reagents, the searches including offsite alkylation where never able to match the numbers of unique peptide sequences obtained in the searches without any offsite alkylation. Even though resulting in almost twice the numbers of PSMs they stayed significantly below the iodine-free alkylation reagents. This is probably because of high abundant modified peptides preventing fragmentation of lower abundant unmodified species. It is therefore in our eyes not reasonable to continue using iodine-containing alkylation reagents and trying to compensate for their unspecific side reactions by altering the database search strategy.
When we investigated identification efficiencies in our data sets, we realized that it was possible to increase identification rates tremendously for the in-gel digests (from 24% in case of DTT/IAA to 54% for BME/CAA). Given that currently the majority of laboratories still work with DTT/IAA and it was recently estimated that on average only 25% of the spectra are identified in proteomics experiments (34) it may be sufficient to simply change reduction and alkylation conditions to increase identification efficiency in proteomics experiments considerably.
Surprisingly, one of the amino acids which was affected most in our data sets was methionine. So far, in studies which dealt with the investigation of unspecific modifications by alkylation in proteomic sample preparation, alkylated methionine was not detected at all. In Unimod (www.unimod.org), which is a database containing modifications used for database searching in proteomics experiments, carbamidomethylation or carboxymethylation at methionine is not even defined. To our knowledge, there are only 2 studies which mentioned alkylation at methionine and in both the alkylation was performed with IAA. One dealt with the analysis of an intact protein (17) and determined the mass of the carbamidomethylation at methionine to be 58 Da, whereas the second study was focused on the analysis of peptides (18) and identified the modification to be 57 Da. We confirmed in our experiments the mass value of 57 Da. Interestingly, we had the impression that the iodine-containing alkylation reagents seem to preferentially alkylate methionine in the synthetic peptide we used rather than the peptide N terminus, which we found to be alkylated most frequently in other peptides. In the experiments with the synthetic peptide, we only found the methionine alkylated peptide (resulting in a neutral loss of 105 Da or 48 Da relative to the unmodified molecular ion) but no additional offsite alkylation on this peptide at all. Neither on the one alkylated at the methionine, nor on the non-alkylated in the same sample. If we, however, oxidized the methionine in the peptide with H2O2 before performing the alkylation reaction, we observed prominent offsite alkylation at other residues. It therefore seems that the alkylation at methionine prevents the other residues in the peptide to be alkylated because we applied the alkylation reagent in a vast excess for this experiment and depletion of the reagent by the methionine residues is therefore highly unlikely. Along this line, also in the general data sets, we observed in the majority singly offsite alkylated peptides. This could be because of a change in the charge of the peptide or maybe interactions of the alkylated residue with other side chains that would usually be susceptible to modification.
CONCLUSION
Our systematic comparison of reduction and alkylation procedures shows that the reagents used can have a strong impact on the outcome of a proteomic experiment. Reduction reagents do have an effect; it is, however, rather moderate compared with the alkylation reagents as only the latter result in unspecific modifications of amino acids. Basically, all functional groups can react with the used alkylation reagents with different efficiencies. Especially iodine-containing reagents are prone to unspecific side reactions and should be completely avoided. The non-iodine-containing reagents result in similar cysteine alkylation efficiencies but markedly increased performance in the identification of peptides. To maximize performance and minimize unwanted side reactions, we suggest the usage of DTT and AA for in-solution digests and BME with AA for in-gel digests.
DATA AVAILABILITY
The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE (35) partner repository (https://www.ebi.ac.uk/pride/archive/) with the data set identifier PXD005183.
Supplementary Material
Acknowledgments
We thank the Marie Curie Sklodowska program of the European Union for financial support, Wolf Dieter Lehmann for the synthetic peptide and valuable discussions, Sven Milker for computational support, and Edgar Kaade for help with MALDI-MS analyses.
Footnotes
Author contributions: D.W. designed research; T.M. and D.W. performed research; T.M. and D.W. analyzed data; T.M. and D.W. wrote the paper.
* This work was supported by the Marie Curie Sklodowska program of the European Union.
This article contains supplemental material.
1 The abbreviations used are:
- MALDI
- matrix-assisted laser desorption ionization
- IAA
- iodoacetamide
- IAC
- iodoacetic acid
- AA
- acrylamide
- CAA
- chloroacetamide
- DTT
- dithiothreitol
- TCEP
- tris-2(-carboxyethyl)-phosphine
- BME
- β-mercaptoethanol
- NEM
- N-ethylmaleimide
- MMTS
- methyl methanethiosulfonate
- PSM
- peptide spectrum matches
- FASP
- filter aided sample preparation
- can
- acetonitrile
- FA
- formic acid
- SAX
- strong anion exchange
- TEAB
- triethylammoniumbicarbonate.
REFERENCES
- 1. Karas M., and Hillenkamp F. (1988) Laser desorption ionization of proteins with molecular masses exceeding 10000 daltons. Anal. Chem. 60, 2299–2301 [DOI] [PubMed] [Google Scholar]
- 2. Fenn J. B., Mann M., Meng C. K., Wong S. F., and Whitehouse C. M. (1989) Electrospray ionization for mass-spectrometry of large biomolecules. Science 246, 64–71 [DOI] [PubMed] [Google Scholar]
- 3. Shevchenko A., Wilm M., Vorm O., and Mann M. (1996) Mass spectrometric sequencing of proteins from silver stained polyacrylamide gels. Anal. Chem. 68, 850–858 [DOI] [PubMed] [Google Scholar]
- 4. Manadas B., Mendes V. M., English J., and Dunn M. J. (2010) Peptide fractionation in proteomics approaches. Expert Rev. Proteomics 7, 655–663 [DOI] [PubMed] [Google Scholar]
- 5. Thingholm T. E., Jensen O. N., and Larsen M. R. (2009) Analytical strategies for phosphoproteomics. Proteomics 9, 1451–1468 [DOI] [PubMed] [Google Scholar]
- 6. Winter D., and Steen H. (2011) Optimization of cell lysis and protein digestion protocols for the analysis of HeLa S3 cells by LC-MS/MS. Proteomics 11, 4726–4730 [DOI] [PubMed] [Google Scholar]
- 7. Leon I. R., Schwammle V., Jensen O. N., and Sprenger R. R. (2013) Quantitative Assessment of In-solution Digestion Efficiency Identifies Optimal Protocols for Unbiased Protein Analysis. Mol. Cell. Proteomics 12, 2992–3005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Wisniewski J. R., Zougman A., and Mann M. (2009) Combination of FASP and StageTip-Based Fractionation Allows In-Depth Analysis of the Hippocampal Membrane Proteome. J. Proteome Res. 8, 5674–5678 [DOI] [PubMed] [Google Scholar]
- 9. Sokolowska I., Gawinowicz M. A., Ngounou Wetie A. G., and Darie C. C. (2012) Disulfide proteomics for identification of extracellular or secreted proteins. Electrophoresis 33, 2527–2536 [DOI] [PubMed] [Google Scholar]
- 10. Cleland W. W. (1964) Dithiothreitol new protective reagent for SH groups. Biochemistry 3, 480–482 [DOI] [PubMed] [Google Scholar]
- 11. Getz E. B., Xiao M., Chakrabarty T., Cooke R., and Selvin P. R. (1999) A comparison between the sulfhydryl reductants tris(2-carboxyethyl)phosphine and dithiothreitol for use in protein biochemistry. Anal. Biochem. 273, 73–80 [DOI] [PubMed] [Google Scholar]
- 12. Laemmli U. K. (1970) Cleavage of structural proteins during assembly of head of bacteriophage-T4. Nature 227, 680–685 [DOI] [PubMed] [Google Scholar]
- 13. Boja E. S., and Fales H. M. (2001) Overalkylation of a protein digest with iodoacetamide. Anal. Chem. 73, 3576–3582 [DOI] [PubMed] [Google Scholar]
- 14. Sechi S., and Chait B. T. (1998) Modification of cysteine residues by alkylation. A tool in peptide mapping and protein identification. Anal. Chem. 70, 5150–5158 [DOI] [PubMed] [Google Scholar]
- 15. Liu F., and Fitzgerald M. C. (2016) Large-Scale Analysis of Breast Cancer-Related Conformational Changes in Proteins Using Limited Proteolysis. J. Proteome Res. 15, 4666–4674 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Paulech J., Solis N., and Cordwell S. J. (2013) Characterization of reaction conditions providing rapid and specific cysteine alkylation for peptide-based mass spectrometry. BBA-Proteins Proteomics 1834, 372–379 [DOI] [PubMed] [Google Scholar]
- 17. Lapko V. N., Smith D. L., and Smith J. B. (2000) Identification of an artifact in the mass spectrometry of proteins derivatized with iodoacetamide. J. Mass Spectrom. 35, 572–575 [DOI] [PubMed] [Google Scholar]
- 18. Kruger R., Hung C. W., Edelson-Averbukh M., and Lehmann W. D. (2005) Iodoacetamide-alkylated methionine can mimic neutral loss of phosphoric acid from phosphopeptides as exemplified by nano-electrospray ionization quadrupole time-of-flight parent ion scanning. Rapid Commun. Mass Spectrom. 19, 1709–1716 [DOI] [PubMed] [Google Scholar]
- 19. Nielsen M. L., Vermeulen M., Bonaldi T., Cox J., Moroder L., and Mann M. (2008) Iodoacetamide-induced artifact mimics ubiquitination in mass spectrometry. Nat. Methods 5, 459–460 [DOI] [PubMed] [Google Scholar]
- 20. Wisniewski J. R., Zougman A., Nagaraj N., and Mann M. (2009) Universal sample preparation method for proteome analysis. Nat. Methods 6, U359–U362 [DOI] [PubMed] [Google Scholar]
- 21. Rappsilber J., Mann M., and Ishihama Y. (2007) Protocol for micro-purification, enrichment, pre-fractionation and storage of peptides for proteomics using StageTips. Nat. Protoc. 2, 1896–1906 [DOI] [PubMed] [Google Scholar]
- 22. Boersema P. J., Raijmakers R., Lemeer S., Mohammed S., and Heck A. J. (2009) Multiplex peptide stable isotope dimethyl labeling for quantitative proteomics. Nat. Protoc. 4, 484–494 [DOI] [PubMed] [Google Scholar]
- 23. Shevchenko A., Tomas H., Havlis J., Olsen J. V., and Mann M. (2006) In-gel digestion for mass spectrometric characterization of proteins and proteomes. Nat. Protoc. 1, 2856–2860 [DOI] [PubMed] [Google Scholar]
- 24. Woods A. G., Sokolowska I., and Darie C. C. (2012) Identification of consistent alkylation of cysteine-less peptides in a proteomics experiment. Biochem. Biophys. Res. Commun. 419, 305–308 [DOI] [PubMed] [Google Scholar]
- 25. Yang Z., and Attygalle A. B. (2007) LC/MS characterization of undesired products formed during iodoacetamide derivatization of sulfhydryl groups of peptides. J. Mass Spectrom. 42, 233–243 [DOI] [PubMed] [Google Scholar]
- 26. Harrison A. G., Csizmadia I. G., and Tang T. H. (2000) Structure and fragmentation of b(2) ions in peptide mass spectra. J. Am. Soc. Mass Spectrom. 11, 427–436 [DOI] [PubMed] [Google Scholar]
- 27. Winter D., and Lehmann W. D. (2009) Sequencing of the thirteen structurally isomeric quartets of N-terminal dipeptide motifs in peptides by collision-induced dissociation. Proteomics 9, 2076–2084 [DOI] [PubMed] [Google Scholar]
- 28. Salek M., and Lehmann W. D. (2005) Analysis of thyroglobulin iodination by tandem mass spectrometry using immonium ions of monoiodo- and diiodo-tyrosine. Proteomics 5, 351–353 [DOI] [PubMed] [Google Scholar]
- 29. Creasy D. M., and Cottrell J. S. (2002) Error tolerant searching of uninterpreted tandem mass spectrometry data. Proteomics 2, 1426–1434 [DOI] [PubMed] [Google Scholar]
- 30. Chick J. M., Kolippakkam D., Nusinow D. P., Zhai B., Rad R., Huttlin E. L., and Gygi S. P. (2015) A mass-tolerant database search identifies a large proportion of unassigned spectra in shotgun proteomics as modified peptides. Nat. Biotechnol. 33, 743–749 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Pilo A. L., and McLuckey S. A. (2014) Oxidation of Methionine Residues in Polypeptide Ions Via Gas-Phase Ion/Ion Chemistry. J. Am. Soc. Mass Spectrom. 25, 1049–1057 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Lagerwerf F. M., van de Weert M., Heerma W., and Haverkamp J. (1996) Identification of oxidized methionine in peptides. Rapid Commun. Mass Spectrom. 10, 1905–1910 [DOI] [PubMed] [Google Scholar]
- 33. Schroeder M. J., Shabanowitz J., Schwartz J. C., Hunt D. F., and Coon J. J. (2004) A neutral loss activation method for improved phosphopeptide sequence analysis by quadrupole ion trap mass spectrometry. Anal. Chem. 76, 3590–3598 [DOI] [PubMed] [Google Scholar]
- 34. Griss J., Perez-Riverol Y., Lewis S., Tabb D. L., Dianes J. A., del-Toro N., Rurik M., Walzer M., Kohlbacher O., Hermjakob H., Wang R., and Vizcaino J. A. (2016) Recognizing millions of consistently unidentified spectra across hundreds of shotgun proteomics data sets. Nat. Meth. 13, 651–656 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Vizcaino J. A., Csordas A., del-Toro N., Dianes J. A., Griss J., Lavidas I., Mayer G., Perez-Riverol Y., Reisinger F., Ternent T., Xu Q. W., Wang R., and Hermjakob H. (2016) 2016 update of the PRIDE database and its related tools. Nucleic Acids Res. 44, D447–D456 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE (35) partner repository (https://www.ebi.ac.uk/pride/archive/) with the data set identifier PXD005183.








