Abstract
Chlorpyrifos oxon catalyzes the crosslinking of proteins via an isopeptide bond between lysine and glutamic acid or aspartic acid in studies with purified proteins. Our goal was to determine the crosslinking activity of the organophosphorus pesticide, dichlorvos. We developed a protocol for examining crosslinks in a complex protein mixture consisting of human SH-SY5Y cells exposed to 10 μM dichlorvos. The steps in our protocol included immunopurification of crosslinked peptides by binding to anti-isopeptide antibody 81D1C2, stringent washing of the immobilized complex, release of bound peptides from Protein G agarose with 50% acetonitrile 1% formic acid, liquid chromatography tandem mass spectrometry on an Orbitrap Fusion Lumos mass spectrometer, Protein Prospector searches of mass spectrometry data, and manual evaluation of candidate crosslinked dipeptides. We report a low quantity of dichlorvos-induced KD and KE crosslinked proteins in human SH-SY5Y cells exposed to dichlorvos. Cells not treated with dichlorvos had no detectable KD and KE crosslinked proteins. Proteins in the crosslink were low abundance proteins. In conclusion, we provide a protocol for testing complex protein mixtures for the presence of crosslinked proteins. Our protocol could be useful for testing the association between neurodegenerative disease and exposure to organophosphorus pesticides.
Keywords: Isopeptide, Crosslinks, SH-SY5Y human Neuroblastoma cells, Dichlorvos, Protein Prospector, Mass spectrometry, Crosslink, Manual evaluation
1. Introduction
Formation of γ-glutamyl-ε-lysine isopeptide bonds between the ε-amine of lysine and γ-glutamyl of glutamine is mediated by the transamidase activity of transglutaminase, a family of enzymes that includes fibrin stabilizing factor XIII [1].
The earliest method for detecting isopeptide crosslinks used extensive proteolysis followed by amino acid composition analysis [1,2] or high-pressure liquid chromatography [3] to identify the γ-glutamyl-ε-lysine product. More recently, isopeptide crosslinks were detected by anti-isopeptide antibodies in immunohistochemically stained brain sections and in Western blots. None of these methods was capable of identifying the peptides that gave rise to the reactive lysine and glutamine or the specific residues that were crosslinked. Using amino acid sequencing after reaction with a radiolabeled glutamine substrate and limited proteolysis provided a means for identifying the labeled protein/peptide. However, a variety of artifacts limits this technique to proteins that have already been shown to be substrates for transglutaminase [3]. Mass spectrometry offers a more robust means for studying isopeptide crosslinked peptides because it can identify the specific amino acids, peptides, and proteins involved. Nemes and coworkers demonstrated the power of mass spectrometry by identifying 2 proteins from brain cortex that were crosslinked to ubiquitin via lysine-glutamine isopeptide bonds and an intramolecular crosslink between Gln99 and Lys58 of α-synuclein [4]. The amount of isopeptide crosslinked peptides was so low that to detect them the preparations had to be highly enriched, by immunopurification with anti-isopeptide antibodies [4].
Mass spectrometry has been used to explore lysine-glutamate and lysine-aspartate isopeptide crosslinks induced by the organophosphorus pesticide chlorpyrifos oxon. Crosslinked peptides were found for tubulin, and butyrylcholinesterase [5,6]. The organophosphorus nerve agent VX induced isopeptide crosslinks on ubiquitin [7]. A proposed mechanism for isopeptide bond formation catalyzed by organophosphate chemicals is shown in Fig. 1.
Fig. 1.
Mechanism for organophosphate-induced isopeptide bond formation. The structure on the left shows organophosphate covalently attached to the epsilon amino group of lysine. Attack of the lysine amine on the Glu/Asp carbonyl carbon is catalyzed by a vicinal acidic group. The middle structure shows an intermediate between the organophosphate-modified lysine and the side chain of glutamic or aspartic acid. In the last step, the organophosphate is released and a covalent isopeptide bond forms between lysine and glutamic or aspartic acid. A nearby acidic residue stabilizes the crosslink. This mechanism is analogous to that for spontaneous formation of isopeptide bonds proposed by Kang et al. [8]. Mass spectrometry distinguishes chemically induced crosslinks from transglutaminase catalyzed crosslinks by identifying the crosslinked residues and the peptides in the crosslink. Reproduced by permission [9].
Mass spectrometry can distinguish chemically-induced cross-links from transglutaminase-mediated cross-links because the identity of the cross-linked residues can be determined. The sequence of the peptides is part of the result. Thus the residues involved in the crosslink can be identified, making it possible to distinguish a Gln-Lys crosslink mediated by transglutaminase from a Glu-Lys crosslink induced by organophosphate. In addition, the Protein Prospector/Batch Tag database search parameter “link aa” stipulates that the allowed amino acids in the cross-link are “E, D, protein C-term → K, protein N-term” thereby excluding Gln, and the search parameter “bridge comp” stipulates that the allowed delta mass for the crosslink is H2O (−18 Da), where transglutaminase-mediated crosslinks would require “link aa” to be Q → K with a “bridge comp” of NH3 (−17 Da).
Previous studies were performed on pure proteins. A study of organophosphorus-induced isopeptide crosslinking in microtubule-associated protein complex demonstrated that complex protein preparations are amenable to analysis [10].
In the present report, we have expanded our use of mass spectrometry to identify isopeptide crosslinked peptides from a more complex source. Human SH-SY5Y neuroblastoma cells were treated with the organophosphorus pesticide dichlorvos. Cells were lysed and digested with trypsin. Isopeptide crosslinked peptides were enriched with the anti-isopeptide antibody 81D1C2 and subjected to liquid chromatography-tandem mass spectrometry. Mass spectral data were analyzed with Protein Prospector/Batch Tag Web and Xcalibur/Qual Browser. In the course of analyzing these data, we have developed a generalized protocol.
2. Experimental
2.1. Chemicals
Human neuroblastoma SH-SY5Y cells (American Type Culture Collection #CRL-2266).
Trypsin (Promega, Sequencing grade, #V5113) stored at −80 °C.
Dichlorvos CAS 62-73-7 (Chem Service Inc. #N11675) 0.01 M stock solution in acetonitrile stored at −80 °C.
DMEM/F12 GlutaMAX (Gibco #10565–108).
Fetal bovine serum (Life Tech #16000044).
Penicillin & streptomycin (Gibco #15140–122).
Trans retinoic acid (Sigma-Aldrich #554720).
Pierce IP lysis buffer (25 mM TrisCl pH 7.4, 150 mM NaCl, 1% NP-40, 1 mM EDTA, 5% glycerol) (Thermo Scientific #87787).
Halt protease inhibitor cocktail (Thermo Fisher Scientific #78430); containing AEBSF, aprotinin, bestatin, E–64, leupeptin and pepstatin A.
RIPA buffer (25 mM Tris-HCl pH 7.6, 1% NP-40, 1% sodium deoxycholate, 0.1% SDS, 140 mM NaCl) (Thermo Fisher Scientific #89900).
Mouse anti-isopeptide monoclonal 81D1C2 (LS Bio #LS-C153331); reconstituted with water to 1 mg/mL.
Protein G agarose (Protein Mods LLC #PGGH); binds 20 mg antibody per ml beads.
Bicinchoninic acid protein assay kit (Thermo Scientific #23228).
2.2. Cell culture
SH-SY5Y cells (ATCC CRL #2266) in T75 flasks were grown in DMEM/F12 GlutaMAX supplemented with 15% fetal bovine serum, penicillin, and streptomycin, in a humidified atmosphere of 5% carbon dioxide, at 37 °C. After 5 days, when cells were 70–80% confluent, cells were washed with phosphate buffered saline (PBS). Cells received 15 mL per flask of DMEM/F12 GlutaMAX (no serum) supplemented with 10 μM trans retinoic acid plus-or-minus 10 μM dichlorvos. After 2 days at 37 °C, cells were harvested.
2.3. Cell lysis and protein concentration
Cells were harvested, washed with PBS, and lysed with 100 μL IP lysis buffer (25 mM TrisCl pH 7.4, 150 mM NaCl, 1% NP-40, 1 mM EDTA, 5% glycerol) containing Halt protease inhibitor cocktail. Halt protease inhibitor cocktail contains AEBSF (4-(2-aminoethyl)-benzene sulfonyl fluoride), aprotinin, bestatin, E–64 (epoxysuccinyl-l-leucylamido (4-guandidino) butane), leupeptin and pepstatin A, that inhibit serine-proteases, cysteine proteases, aspartic acid proteases, and aminopeptidases. Cell debris was removed by centrifugation at 14,000×g for 20 min at 4 °C. The protein concentration in 130 μL of supernatant was 13.2 mg/mL as determined by the bicinchoninic acid protein assay (Thermo Scientific #23228).
Three cell lysates were prepared for the plus dichlorvos cells and two for the minus dichlorvos cells. Each lysate was digested with trypsin, immunopurified with anti-isopeptide antibody 81D1C2, and subjected to liquid chromatography tandem mass spectrometry.
2.4. Trypsin digestion
Cell lysate supernatant containing 200 μg protein (15 μL) was diluted with 185 μL of 20 mM ammonium bicarbonate pH 8. Proteins were denatured in a boiling water bath for 3 min. The denatured proteins were digested with 4 μg trypsin (8 μL) at 37 °C for 16 h without reduction and alkylation. Trypsin was inactivated by heating the digest in a boiling water bath for 3 min.
2.5. Immunopurification of tryptic peptides linked through an isopeptide bond
The heat-treated digest was incubated with 8 μg (8 μL) of anti-isopeptide monoclonal 81D1C2 at room temperature for 8 h to capture isopeptide crosslinked peptides. Antibody-peptide complexes were immobilized by adding 0.1 mL of a 1:1 suspension of Protein G agarose beads, in PBS. The sample was rotated overnight at room temperature.
The beads and liquid were transferred to a 0.45 μm Durapore spin filter (Millipore UFC30HV00). Use of the spin filter maximized recovery because beads were not lost in the wash steps. Beads were washed with 0.4 mL of RIPA buffer (25 mM Tris-HCl pH 7.6, 1% NP-40, 1% sodium deoxycholate, 0.1% SDS, 140 mM NaCl) 5 times followed by 5 washes with water. Salts and detergents were washed off with water. The flow through in each wash step was discarded.
The basket of washed beads was transferred to a new microfuge tube. Bound peptides were released from the washed beads by incubating the basket of beads with 0.1 mL of 50% acetonitrile, 1% formic acid for 0.5–1 h at room temperature. The released peptides were collected in the flow through by brief centrifugation. The extraction step was repeated twice. The combined flow through was dried by vacuum centrifugation.
2.6. Sample preparation for mass spectrometry
The dry sample was dissolved in 20 μL water. The sample was centrifuged for 30 min at 14,000×g and 4 °C. The top 10 μL were transferred to an autosampler vial.
2.7. Mass spectral data acquisition
Peptide separation was performed with a Thermo RSLC Ultimate 3000 ultra-high pressure liquid chromatography system (Thermo Scientific) at 36 °C. Solvent A was 0.1% formic acid in water, and solvent B was 0.1% formic acid in 80% acetonitrile. Peptides were loaded onto an Acclaim PepMap 100C18 trap column (75 μm × 2 cm; Thermo Scientific cat# 165535) at a flow rate of 4 μL/min and washed with 98% solvent A/2% solvent B for 10 min. Then, they were transferred to a Thermo Easy-Spray PepMap RSLC C18 column (75 μm × 50 cm with 2 μm particles, Thermo Scientific cat# ES803) and separated at a flow rate of 300 nL/min using a gradient of 9–50% solvent B in 30 min, 50–99% solvent B in 40 min, hold at 99% solvent B for 10 min, 99 to 9% solvent B in 4 min, hold at 9% solvent B for 16 min.
Eluted peptides were sprayed directly into a Thermo Orbitrap Fusion Lumos Tribrid mass spectrometer (Thermo Scientific). Data were collected using data dependent acquisition. A survey full scan MS (from m/z 350–1800) was acquired in the Orbitrap with a resolution of 120,000. The AGC target (Automatic Gain Control for setting the ion population in the Orbitrap before collecting the MS) was set at 4 × 105 and the ion filling time was set at 50 msec. The 25 most intense ions with charge state of 2–6 were isolated in a 3 s cycle and fragmented using high-energy collision induced dissociation with 35% normalized collision energy. Fragment ions were detected in the Orbitrap with a mass resolution of 30,000 at m/z 200. The AGC target for MS/MS was set at 5 × 104, and dynamic exclusion was set at 30 s with a 10 ppm mass window. Data were reported in *.raw format.
The *.raw data files were converted to *.mgf files using MSConvert (ProteoWizard Tools from SourceForge).
2.8. Batch Tag Web search for crosslinked peptide candidates
The *.mgf files were subjected to a database search using the Batch Tag Web algorithm in Protein Prospector version 6.2.1. Searches were performed on the Protein Prospector website https://prospector.ucsf.edu [prospector.ucsf.edu] (14May2022). Database search parameters included: database—SwissProt 2021.0618; Species—Homo sapiens; enzyme—trypsin, missed cleavages—3; expect calc method—none; protein N-term—unchecked; protein C-term—unchecked; uncleaved—checked; parent mass tolerance—20 ppm; fragment mass tolerance—30 ppm; precursor charge state—2, 3, 4, 5; parent ion conversion—monoisotopic; modification defect—0.0048 Da; instrument—ESI Q high res; link search type—user defined link; link aa—E, D, protein C-term→ K, protein N-term; mod comp ion—K, D, and E; mod range– −18 to 3883 Da; bridge comp—H-2O-1; mod uncleaved—checked; msms mass peaks—80; msms max modifications—2; variable modification—oxidation methionine; fixed modification—none. This database search created a list of peptides that Protein Prospector considered to be crosslinked. The list of potentially crosslinked peptides, along with parameters indicating the level of confidence in the assignment, were displayed in Protein Prospector/Search Compare.
Note that using smaller mass tolerances for parent mass and fragment mass did not improve the detection of isopeptide crosslinked peptides.
2.9. Search Compare screening of crosslinked peptide candidates
To reduce the number of crosslink peptide candidates and aid in the identification of crosslinked peptides, the Search Compare list was screened using the Protein Prospector output parameters. Parameters indicating a crosslinked peptide were taken to be: charge state 3, 4, 5; Score >25; score difference >1; % matched intensity >45%; and at least 5 amino acids in each peptide. Choice of these parameters is empirical and was based on experience.
2.10. Manual evaluation of crosslinked peptide candidates
Ultimately, crosslinked peptides were confirmed by manual evaluation. Discussion of the manual evaluation that we employed requires an understanding of the following terms:
A crosslink candidate is a pair of crosslinked peptides selected by Protein Prospector from a database search.
A crosslink specific mass is a mass in the MS/MS spectrum from a crosslink candidate that includes residues from both peptides.
A crosslink specific amino acid is an interval in the MS/MS spectrum from a crosslink candidate that is defined by two crosslink specific masses and corresponds to an amino acid that is part of the crosslink candidate sequence.
A ladder sequence is a term used by Protein Prospector to describe neutral loss of amino acids from the N-terminus of the parent ion. Ladder sequencing losses can occur from both peptides in the same MS/MS spectrum.
A peeling sequence is a term used by Protein Prospector to describe the neutral loss of an amino acid from the C-terminus of the parent ion. This is otherwise referred to as a [bn-1 + 18] fragment [11]. Any C-terminal residue can be lost, provided that a basic residue such as arginine, lysine or histidine is present in the sequence [12].
Mixed fragmentation refers to a series of crosslink specific masses that correspond to sequential removal of amino acids from both peptides in the crosslink candidate.
Peptide rearrangement consists of the transfer of amino acid(s) from one terminus of a peptide to the other. Rearrangement sometimes was helpful in mixed fragmentation scenarios. Two mechanisms have been described to explain peptide rearrangements. One is protease induced cyclization and ring re-opening during proteolysis. The rearrangement is believed to require a missed endo-proteinase cleavage site (i.e., an arginine or lysine for trypsinolysis) within 2-residues of either terminus [13]. The other mechanism for peptide rearrangement occurs in the mass spectrometer. Rearrangement occurs from linear b-ions by cyclization and subsequent ring opening. The cyclic form can open at various amide bonds effectively shifting residues from one terminus to the other [14].
2.11. Manual evaluation criteria that support the presence of crosslinked peptides
For a crosslink candidate to be accepted as a crosslinked peptide there must be amino acid sequence support for both peptides and there must be at least one crosslink specific amino acid, defined by two crosslink specific ions. Sequence support consists of the following features.
A series of non-crosslink specific masses in the MS/MS spectrum must correspond to an amino acid sequence from one or the other peptide in a crosslink candidate. Suitable sequences include an N-terminal sequence, a C-terminal sequence, or an internal fragment. Sequences must be at least 3 amino acids long (for example green AVNKV and blue KGV in Fig. 2 panel A and green REDLLIN in Fig. 3 upper panel).
- At least one crosslink specific amino acid is essential. A series of crosslink specific amino acids is frequently encountered.
-
2a)Sometimes the entire series consists of crosslink specific amino acids from one peptide. This is taken as strong evidence for a crosslinked peptide (for example blue KSEA in Fig. 3 lower panel).
-
2b)Sometimes a series of crosslink specific amino acids will be appended by a single amino acid that is not part of the crosslink candidate. Such a sequence is still accepted as support for a crosslinked peptide (for example an unassigned V residue is appended to blue KV in Fig. 7 upper panel).
-
2c)Sometimes a series of crosslink specific amino acids will be appended by more than one amino acid that is not part of the crosslink candidate. Such a sequence is not accepted (for example unassigned KV is appended to blue V in Fig. 6 upper panel).
-
2d)Sometimes the entire series consists of a mixture of crosslink specific amino acids from both peptides, or of a mixture of amino acids from both the N- and C-terminals of one peptide. We refer to this as mixed fragmentation. This is taken as support for a crosslinked peptide (see Fig. 2, panels B and C, Fig. 6 lower panel, Fig. 7 lower panel for examples). Occasionally, mixed fragmentation can require rearrangement of a peptide sequence by shifting residues from the N-terminus to the C-terminus, or vice-versa (see Fig. 2, panel B for an example).
-
2e)Sometimes a sequence of amino acids can be identified that is unrelated to the crosslinked candidate even though it may contain a few crosslinked specific amino acids. Such a sequence is rejected as support for the crosslink (for example QFLELY in Fig. 5 lower panel).
-
2a)
Neutral loss of amino acids from the parent ion. Neutral losses can be N-terminal amino acids (ladder sequence) from one peptide (for example blue VGK in Fig. 2 panel A, blue EALH in Fig. 3 lower panel, green VGK in Fig. 7 upper panel), C-terminal amino acids (peeling sequence) from one peptide, a combination of N-terminal and C-terminal amino acids from one peptide, or a mixture of N-terminal and C-terminal amino acids from both peptides. By definition, the amino acids that are neutral losses from the parent ion contain residues from both peptides and are therefore crosslink specific amino acids.
Fig. 2.
The MS/MS spectrum for the isopeptide crosslinked peptide AVNKVKDTPGLGK450VK/KGVDIE4391ISHR, where the subscripted residues indicate the site of the isopeptide crosslink. The parent ion is at m/z 896.85, +3 charged. Panel A shows the mass range from m/z 180 to 900. This includes a 5-amino acid, +1, b-ion sequence (AVNKV) from the N-terminal of the green peptide, a 3-amino acid, +1, b-ion sequence (KGV) from the N-terminal of the blue peptide, and a 3-amino acid, +3, ladder sequence (KGV) from the parent ion. Panel B shows the mass range m/z 1010 to 1260. This illustrates the first scenario for the mixed fragmentation. Structures of the proposed fragments are shown. Panel C also shows the mass range m/z 1010 to 1260. This illustrates the second scenario for the mixed fragmentation. Structures of the proposed fragments are shown. Unlabeled, red masses mostly represent loss of water, amine, or CO.
Fig. 3.
The MS/MS spectrum for the isopeptide crosslinked peptide DVK245PSNILLDER/HLAESKE927K, where the subscripted residues indicate the site of the isopeptide crosslink. The parent ion is at m/z 774.41, +3 charge. The upper panel shows the complete mass range m/z 220 to 1300. This shows a 7-amino acid, +1, y-ion sequence (NILLDER) from the green peptide and a 4-amino acid, +1, b-ion crosslink specific sequence (NILL) from the green peptide. The lower panel shows an expanded mass range from m/z 610 to 1060. This emphasizes a 4-amino acid, +3, ladder sequence (HLAE) from the blue peptide, and a 4-amino acid, +2 y-ion crosslink specific sequence (AESK) from the blue peptide. Unlabeled, red masses mostly represent loss of water, amine, or CO.
Fig. 7.
The MS/MS spectrum for the isopeptide crosslinked peptide KGVVKAE179K/ISLVKTEFHAK113EQYR. The parent ion is at m/z 672.894, +4 charged. The upper panel shows the mass range from m/z 150 to 870. This emphasizes a 5-amino, +1, b-ion sequence (KGVVK) from the N-terminus of the green peptide; a 3-amino acid, +4, ladder sequence from the N-terminus of the green peptide; and a 2-amino acid, +3, y-ion crosslink specific sequence (VK) from the blue peptide appended by an unassigned residue V. The lower panel shows the mass range from m/z 680 to 840. This emphasizes the mixed fragmentation sequence (VKVK) from the green peptide. Mixed fragmentation peaks are marked with arrows. Unlabeled, red masses mostly represent loss of water, amine, or CO.
Fig. 6.
The MS/MS spectrum for the isopeptide crosslinked peptide VKGKTIK89EVAEAYR/VALEVE784DGRK. The parent ion is at m/z 538.517, +5 charged. The upper panel shows the mass range from m/z 200 to 800. This emphasizes a 4-amino acid, +1, b-ion sequence (VKGK) from the N-terminus of the green peptide and a 3-amino acid sequence (VKV) that contains 2 residues that cannot be assigned to the crosslink peptide sequence thereby disqualifying this feature as support for the crosslinked peptide. The lower panel shows a mass range from m/z 660 to 840. This emphasizes a 3-amino acid, +3, mixed fragmentation sequence (EVK) from the blue peptide. Mixed fragmentation peaks are marked with arrows. Unlabeled, red masses mostly represent loss of water, amine, or CO.
Fig. 5.
The MS/MS spectrum for the rejected isopeptide crosslinked peptide DPDYVRK1062GDAR/LVGILDILDEE532NR. The parent ion is at m/z 693.61, +4 charged. The upper panel shows the complete mass range from m/z 160 to 1100. There are no prominent crosslink features present. The lower panel shows an expanded mass range from 650 to m/z 1050. This emphasizes a 3-amino acid, +3, y-ion crosslinked specific sequence (PDY), and a 6-amino acid, +2 sequence QFLELY. The latter is inconsistent with the crosslink peptide but is consistent with kinesin family member 21A isoform CRA. C (A0A024R0V9). Unlabeled, red masses mostly represent loss of water, amine, or CO.
3. Results
3.1. Isopeptide crosslinked peptides
Three separate preparations of dichlorvos-treated SH-SY5Y cells were analyzed for isopeptide crosslinks between lysine and glutamate or lysine and aspartate. Forty isopeptide crosslinks were confirmed by manual evaluation. The results are given in Table 1. Two separate preparations of SH-SY5Y cells were made without exposure to dichlorvos. No isopeptide crosslinks between lysine and glutamate or lysine and aspartate were detected in cells not treated with dichlorvos.
Table 1.
Confirmed crosslinked peptides.
| # | Protein | UniProt ID | Mass m/z | Elute Time |
Z | Score + Score Diffa |
% Matchb |
Peptidesc |
|---|---|---|---|---|---|---|---|---|
| First Replicate | ||||||||
| 1 | Kinesin-like protein KIF26B | Q2KJY2 | 672.8934 | 22.88 | 4 | 47.4 + 1.9 | 82.7 | AVNKVKDTPGLGK450VK |
| Hemicentin-1 | Q96RW7 | KGVDIE4391ISHR | ||||||
| 2 | Kinesin-like protein KIF26B | Q2KJY2 | 896.8553 | 22.95 | 3 | 34.2 + 2.4 | 76.8 | AVNKVKDTPGLGK450VK |
| Hemicentin-1 | Q96RW7 | KGVDIE4391ISHR | ||||||
| 3 | Dual specificity mitogen-activated protein kinase kinase 7 | O14733 | 774.4123 | 26.84 | 3 | 29.2 + 8.4 | 51.8 | DVK245PSNILLDER |
| Ankyrin repeat domain-containing protein 12 | Q6UB98 | HLAESKE927K | ||||||
| 4 | Supervillin | O95425 | 835.1508 | 24.16 | 3 | 22.9 + 1.7 | 67.7 | VKVRE308K |
| UDP-GlcNAc:betaGal beta-1,3-N-acetylglucosaminyl transferase-like protein 1 | Q67FW5 | KVVAFCDVDENK298IRK | ||||||
| 5 | Transcription factor 20 | Q9UGU0 | 626.6151 | 24.08 | 4 | 56.6 + 9.1 | 84.6 | VKVRHK1754SASNGSK |
| Myosin-4 | Q9Y623 | VKKQLDHE1540K | ||||||
| 6 | Serine/threonine-protein kinase WNK2 | Q9Y3S1 | 831.2604 | 24.25 | 4 | 43.2 + 3.4 | 82.5 | TVGAAQLK2180PTLNQLKQTQK |
| Oxysterol-binding protein-related protein 5 | Q9H0X9 | VTKKE97TLKAQK | ||||||
| 7 | Tubulin polyglutamylase TTLL7 | Q6ZT98 | 673.1405 | 23.27 | 4 | 34.2 + 4.1 | 57.0 | QRKVLDIVK742TSIR |
| Transcription elongation factor A protein-like 2 | Q9H3H9 | KGKSE83MoxQGGSK | ||||||
| 8 | Gem-associated protein 4 | P57678 | 558.1219 | 23.35 | 5 | 54.6 + 1.6 | 67.9 | NK497VLAGILRSWGRK |
| Lysosomal-trafficking regulator | Q99698 | QD199HPKAKLDR | ||||||
| Second Replicate | ||||||||
| 9 | Kinesin-like protein KIF26B | Q2KJY2 | 672.8947 | 24.04 | 4 | 52.6 + 4.4 | 81.1 | AVNKVKDTPGLGK450VK |
| Hemicentin-1 | Q96RW7 | KGVDIE4391ISHR | ||||||
| 10 | Microcephalin | Q8NEM0 | 538.5172 | 24.111 | 5 | 39.1 + 1.9 | 69.0 | GKKPTRTLVMoxTSMPSE658K |
| Junction-mediating and -regulatory protein | Q8N9B5 | KGVKLK936K | ||||||
| 11 | Centromere protein V-like protein 3 | A0A0U1RRI6 | 626.6141 | 24.54 | 4 | 52.5 + 12.5 | 83.1 | VQVGVGSHAAAK67RWLGK |
| Fibroblast growth factor receptor-like 1 | Q8N441 | VKQVE90R | ||||||
| 12 | Elongation factor 2 | P13639 | 704.9792 | 26.01 | 5 | 50.5 + 16.7 | 64.3 | K16ANIRNMSVIAHVDHGK |
| Protein eva-1 homolog A | Q9H8M9 | NVFTSAE116ELERAQR | ||||||
| 13 | Proteasome subunit alpha type-2 | P25787 | 691.3914 | 21.50 | 5 | 54.2 + 24.6 | 65.1 | QK53SILYDERSVHKVEPITK |
| GTPase-activating protein and VPS9 domain-containing protein 1 | Q14C86 | LNADVLKTAE39K | ||||||
| 14 | Transcriptional regulator ATRX | P46100 | 820.4745 | 26.86 | 4 | 44.2 + 4.2 | 65.0 | ENLLGSIK1774EFR |
| Isoleucine-tRNA ligase, cytoplasmic | P41252 | YPLKEIVVIHQD856PEALK | ||||||
| 15 | Prohibitin-2 | Q99623 | 811.0315 | 24.93 | 5 | 36.0 + 8.6 | 46.0 | QK224IVQAEGEAEAAK |
| Probable 2-oxoglutarate dehydrogenase E1 component DHKTD1, mitochondrial | Q96HY7 | TLVFCSGKHFYSLVKQRE821SLGAK | ||||||
| 16 | Oxysterol-binding protein-related protein 5 | Q9H0X9 | 831.0162 | 26.55 | 4 | 36.1 + 2.1 | 68.3 | VTKKE97TLKAQK |
| Serine/threonine-protein kinase WNK2 | Q9Y3S1 | TVGAAQLK2180PTLNQLKQTQK | ||||||
| Third Replicate | ||||||||
| 17 | Kinesin-like protein KIF26B | Q2KJY2 | 538.5171 | 100.77 | 5 | 37.7 + 2.8 | 65.4 | AVNKVKDTPGLGK450VK |
| Hemicentin-1 | Q96RW7 | KGVDIE4391ISHR | ||||||
| 18 | Mitochondrial 10-formyltetrahydrofolate dehydrogenase | Q3SY69 | 538.5169 | 29.11 | 5 | 29.0 + 7.4 | 57.2 | VKGKTIK89EVAEAYR |
| Lysine-specific demethylase 5C | P41229 | VALEVE784DGRK | ||||||
| 19 | DnaJ homolog subfamily C member 9 | Q8WXX5 | 538.5166 | 29.63 | 5 | 43.5 + 1.0 | 69.0 | KRRAQEE196AK |
| Mitochondrial 10-formyl tetrahydrofolate dehydrogenase | Q3SY69 | VKGKTIK89EVAEAYR | ||||||
| 20 | Coatomer subunit delta | P48444 | 538.5165 | 100.27 | 5 | 26.5 + 0 | 58.7 | KGVQLQTHPNVDK335K |
| Kelch domain-containing protein 4 | Q8TBB5 | QRVKKDVD260K | ||||||
| 21 | Kinesin-like protein KIF26B | Q2KJY2 | 672.8947 | 31.55 | 4 | 31.7 + 4.1 | 69.3 | AVNKVKDTPGLGK450VK |
| Hemicentin-1 | Q96RW7 | KGVDIE4391ISHR | ||||||
| 22 | Ankyrin −2 | Q01484 | 672.8947 | 100.84 | 4 | 24.3 + 1.6 | 73.1 | GKVRVEK2031EK |
| C─C motif chemokine 17 | Q92583 | VKNAVKYLQSLE92RS | ||||||
| 23 | Putative high mobility group protein B1-like 1 | B2RPK0 | 672.8946 | 29.11 | 4 | 40.0 + 10.2 | 53.3 | KGVVKAE179K |
| Protein Lines homolog 1 | Q8NG48 | ISLVKTEFHAK113EQYR | ||||||
| 24 | Osteocrin | P61366 | 672.8939 | 29.61 | 4 | 48.6 + 10.5 | 77.9 | QRKVVKHPK113R |
| Helicase-like transcription factor | Q14527 | GKLKNVQSE422TKGLR | ||||||
| 25 | Titin | Q8WZ42 | 626.6149 | 29.44 | 4 | 43.1 + 10.8 | 78.6 | VKVQD28977TPGK |
| Alpha-1,3-mannosyl-glycoprotein 4-beta-N-acetylglucosaminyl transferase-like protein MGAT4D | A6NG13 | VLNRMK71YEITKR | ||||||
| 26 | StAR-related lipid transfer prot 9 | Q9P1P6 | 671.7452 | 25.80 | 3 | 33.5 + 3.0 | 61.8 | RKKVSFQLE773R |
| Collagen alpha-5(VI) chain | A8TX70 | KGVK1667GPR | ||||||
| 27 | Zinc finger protein 292 | O60281 | 896.8561 | 29.75 | 3 | 32.5 + 4.7 | 69.1 | KRKVE2548K |
| Arginase-2, mitochondrial | P78540 | KGVEHGPAAIREAGLMK56R | ||||||
| 28 | WD repeat-containing protein 1 | O75083 | 872.0002 | 28.52 | 4 | 29.9 + 0.2 | 62.8 | VFASLPQVE16RGVSKIIGGDPK |
| DNA repair protein RAD50 | Q92878 | STESELKKK722EK | ||||||
| 29 | Zinc finger protein 728 | P0DKX0 | 928.9985 | 28.54 | 4 | 46.4 + 6.6 | 65.1 | AIHAGEKLYK481CEECGK |
| Uncharacterized protein C9orf43 | Q8TAL5 | MFLSIHRLTLE269RPALR | ||||||
| 30 | Lysine-specific demethylase 7A | Q6ZMT4 | 736.4008 | 33.85 | 5 | 33.6 + 5.4 | 63.5 | FNKHLQPSSTVPE569WRAKDNDLR |
| Inter-alpha-trypsin inhibitor heavy chain H2 | P19823 | VQISVKK810EK | ||||||
| 31 | Complement receptor type 2 | P20023 | 825.7779 | 26.33 | 3 | 28.0 + 4.9 | 51.3 | LIGEKSLLCITK68DK |
| Oxysterol-binding protein-related protein 9 | Q96SU4 | D680IDAATEAK | ||||||
| 32 | 14-3-3 protein theta | P27348 | 758.4285 | 33.48 | 4 | 44.0 + 1.0 | 66.3 | NLLSVAYK49NVVGGRR |
| Interleukin-10 receptor subunit beta | Q08334 | NILQWE41SPAFAK | ||||||
| 33 | 14-3-3 protein theta | P27348 | 916.2504 | 32.67 | 4 | 39.1 + 5.7 | 56.8 | NLLSVAYK49NVVGGRRSAWR |
| Serologically defined colon cancer antigen 8 | Q86SQ7 | QVLQISEE349ANFEK | ||||||
| 34 | von Willebrand factor A domain-containing protein 3B | Q502W6 | 591.5668 | 27.80 | 4 | 27.2 + 9.0 | 50.2 | D794GLSNASSRR |
| Phosphatidylinositol 4-kinase alpha | P42356 | IHNELSPLK432LR | ||||||
| 35 | Coatomer subunit alpha | P53621 | 606.9457 | 33.54 | 5 | 41.3 + 6.1 | 61.1 | VTTVTEIGK1211DVIGLR |
| Teneurin-4 | Q6N022 | NLLSLDFD1844RVTR | ||||||
| 36 | Microfibrillar-associated protein 1 | P55081 | 835.1493 | 29.59 | 3 | 27.6 + 3.4 | 61.6 | VKVK37RYVSGK |
| Serine/threonine-protein kinase SMG1 | Q96Q15 | KVVD1635NASQGEGVR | ||||||
| 37 | 14-3-3 protein zeta/delta | P63104 | 916.0831 | 34.06 | 5 | 61.4 + 5.3 | 80.9 | SVTEQGAELSNEERNLLSVAY |
| Striated muscle preferentially expressed protein kinase | Q15772 | K49NVVGAR RLFQQKAASLDE538R |
||||||
| 38 | Staphylococcal nuclease domain -containing protein 1 | Q7KZF4 | 847.6662 | 36.73 | 5 | 58.8 + 18.8 | 65.7 | QFQK873VITEYLNAQESAK |
| Zinc finger protein 777 | Q9ULD5 | LAVWAAVQAVE196RKLEAQA MoxR | ||||||
| 39 | C-type lectin domain family 4 member M | Q9H2X3 | 537.6976 | 27.08 | 5 | 37.0 + 2.9 | 56.8 | LQEIYQELTE210LK |
| Betaine-homocysteine S-methyltransferase 1 | Q93088 | ELFEKQK402FK | ||||||
| 40 | Teneurin-3 | Q9P273 | 762.4280 | 31.48 | 4 | 50.0 + 2.2 | 74.4 | NLLSVD1778FDRTTKTEK |
| Rho guanine nucleotide exchange factor 26 | Q96DR7 | RALKEVSK615LVR | ||||||
Score and score difference values are assigned by Protein Prospector. The higher the value, the better the match to a theoretical peptide. In Table 1, values range from 24.3 to 61.4.
% match is the matched intensity of the assigned peaks compared to the observed peaks. Confidence in the assignment is associated with high % matched intensity.
Crosslinked residues are suffixed by a subscripted number identifying the crosslinked residue.
All data were obtained from the soluble fraction of the cell lysate, therefore proteins in the aggregated pellet were not evaluated. Though the pellet could contain aggregated proteins that might be of interest, the complexity of the pellet made working with it unappealing. In our experience, aggregated proteins are soluble and remain in the cell lysate supernatant.
3.2. Peptide abundance
Only two crosslinked peptides appeared more than once. The crosslinked peptide AVNKVKDTPGLGK450VK/KGVDIE4391ISHR from protein kinesin-like protein KIF26B and hemicentin-1 appeared 5-times: twice in the first repetition (charge states 3 and 4), once in the second repetition, and twice in the third repetition (charge states 4 and 5). See Table 1.
The crosslinked peptide VTKKE97TLKAQK/TVGAAQLK2180PTLNQ LKQTQK from oxysterol-binding protein-related protein 5 and serine/threonine-protein kinase WNK2 appeared twice, once in the first repetition and once in the second repetition.
We attribute this low degree of reproducibility to crosslinking between low abundance proteins. To test this assumption, we compared the crosslinked proteins to two databases where the cellular concentration of the proteins was determined, see Supplementary Material Table S2.
Beck and coworkers determined the protein abundance for 7311 proteins from the human osteosarcoma cell line U20S [15]. Protein abundance ranged from <5 × 102 to 6.53 × 106 copies per cell. Abundance determination was based on 144 heavy isotope labeled reference peptides. The abundance of individual proteins in our crosslinked pairs was similar, ranging from <5 × 102 to 4.42 × 106 (see Supplementary Material Table S2). However, one member of each crosslinked pair was always from a low abundance protein (abundance less than 8 × 103 or 0.1% of the maximum).
Geiger and coworkers determined the protein abundance for eleven cell lines (A549, GAMG, HEK293, HeLa, HepG2, K562, Jurkat, LnCap, MCF7, RKO, and U20S) [16]. A total of 11,732 proteins were detected. Proteins were reported as iBAQ values (intensity based absolute quantitation). iBAQ values are obtained by summing the peak intensities for all peptides matching to a specific protein and dividing by the number of theoretically observable peptides in the sample. iBAQ values are reported as log10 [17]. Protein abundance in the database ranged from 2.41 to 9.13. The abundance of individual proteins for our crosslinked pairs spanned a similar range, 2.42 to 8.78 (see Supplementary Material Table S2). However, one member of each crosslinked pair was always from a low abundance protein (abundance of less than 5 or 0.01% of the maximum).
The fact that most of the crosslinked peptides are from low abundance proteins illustrates one difficulty in obtaining isopeptide crosslinking data, low abundance proteins usually give low signals in the mass spectrometer. This also makes the frequency at which the crosslink occurs difficult to determine. Normally, we would use the number of times a given spectrum appears in the data (spectral count) as a measure of frequency. Most of the crosslinks in this data appear only once.
3.3. Examples of MS/MS spectra for isopeptide crosslinked peptides
Figs. 2-5 illustrate the important features in the MS/MS spectra of isopeptide crosslinked peptides. The important features are detailed in Section 2.11. The fragmentation patterns in Figs. 2 and 3 show strong evidence to support a crosslinked peptide. Fig. 4 shows a minimally acceptable spectrum. Fig. 5 illustrates an unacceptable spectrum. Three additional MS/MS spectra with strong support for a crosslinked peptide are shown in the Supplemental material (Figs. S1, S2, & S3).
Fig. 4.
The MS/MS spectrum for the isopeptide crosslinked peptide QK1062SILYDERSVHKVEPITK/LNADVLKTAE39K, where the subscripted residues indicate the site of the isopeptide crosslink. The parent ion is at m/z 691.38, +5 charged. The upper panel shows the complete mass range m/z 180 to 1700. This shows an 8 amino acid, +1, y-ion sequence (HKVEIPTK) from the C-terminus of the green peptide; a 2-amino acid, +4, y-ion crosslink specific sequence (LV) from the blue peptide; and a repeat of that 2-amino acid crosslink specific sequence (LV) from the blue peptide in the +3 charge state. The lower panel shows an expanded mass range from m/z 850 to 1300. This emphasizes a 7-amino acid, +2 sequence (WQQQSIL) that is not consistent with the crosslinked peptide. Positions of the WQQQ masses are marked with arrows. Unlabeled, red masses mostly represent loss of water, amine, or CO.
Fig. 2 shows the MS/MS spectrum of isopeptide crosslinked peptide AVNKVKDTPGLGK450VK/KGVDIE4391ISHR from the first replicate. Peptides are from kinesin-like protein KIF26B and hemicentin-1, respectively. Four crosslinked features are present that qualify the crosslink candidate to be a crosslinked peptide. Details of the fragmentation pattern are described in the figure legend.
Fig. 2 Panel A illustrates the N-terminal sequences from the blue and green peptides, and a ladder sequence, KGV, from the shorter, blue peptide. The ladder sequence fragments are crosslink specific ions. Fig. 2 Panels B and C illustrate mixed fragmentation. This version of mixed fragmentation can be envisioned to occur in two ways.
The first scenario is illustrated in Fig. 2 Panel B. Fragmentation is initiated by formation of y8+2 from the blue peptide AVNKVKDTPGLGK450VK/VDIE4391ISHR, m/z 1252.21. This fragment loses the N-terminal V from the blue peptide to make y7+2 AVNKVKDTPGLGK450VK/DIE4391ISHR, m/z 1202.68. That fragment undergoes a rearrangement that moves AVN from the N-terminal of the green peptide to the C-terminal. This results in the fragment KVKDTPGLGK450VKAVN/DIE4391ISHR that still has a molecular weight of m/z 1202.68. That is, there are two equivalent structure for the m/z 1202.68 mass. Sequential removal of KVK from the N-terminal of the green peptide would yield the remaining fragments VKDTPGLGK450VKAVN/DIE4391ISHR, m/z 1138.64, KDTPGLGK450VKAVN/DIE4391ISHR, m/z 1089.10, and DTPGLGK450VKAVN/DIE4391ISHR, m/z 1025.06.
The second scenario is illustrated in Fig. 2, Panel C. Fragmentation is initiated by formation of fragments y8+2 and y7+2, in the way described for the first scenario. The blue y7+2 mass at m/z 1202.68 for fragment AVNKVKDTPGLGK450VK/DIE4391ISHR is coincidently the same mass as the green y12+2 KVKDTPGLGK450VK/KGVDIE4391ISHR. This makes it appear as if green KVK is contiguous with the blue V, when it is not. Rather there are two separate fragmentation processes. Sequential removal of KVK from the N-terminal of the green peptide would yield the remaining fragments y11+2 VKDTPGLGK450VK/KGVDIE4391ISHR, y10+2 m/z 1138.64, KDTPGLGK450VK/KGVDIE4391ISHR, m/z 1089.10, and y9+2 DTPGLGK450VK/KGVDIE4391ISHR m/z 1025.06.
Either of these mixed fragmentation scenarios supports the same isopeptide crosslinked peptide.
Assignment of the crosslink to K450-E4391 is based on the following logic. The crosslinks we are investigating are limited to KE and KD. The crosslinked peptides AVNKVKDTPGLGK450VK (green peptide)/KGVDIE4391ISHR (blue peptide) together contain one E, two D, and five K residues. K4386 in the smaller, blue peptide is a b1 ion and is not a crosslink specific ion. Because there are no other K residues in the blue peptide, K in the isopeptide crosslink cannot come from the blue peptide. Four K residues in the green peptide are potential partners. K441 and K443 are released from the green peptide in panel B to make crosslinked fragments at 1138.64 and 1025.06. Since the peptides remain crosslinked when K441 and K443 are gone, K441 and K443 cannot be crosslink partners. K452 is at the C terminus, which was the cleavage site for trypsin. As a general rule, trypsin does not cleave at modified lysine residues therefore K452 cannot be a crosslink partner. This leaves K450 as the only possible lysine crosslink partner in the green peptide.
Fig. 3 shows the MS/MS spectrum for the isopeptide crosslinked peptide DVK245PSNILLDER/HLAESKE927K. Peptides are from dual specificity mitogen-activated protein kinase kinase 7 and ankyrin repeat domain-containing protein 12, respectively. Four crosslinked features are present that qualify the crosslink candidate to be a crosslinked peptide. Details of the fragmentation pattern are described in the figure legend.
Fig. 4 shows the MS/MS spectrum for the isopeptide crosslinked peptide QK1062SILYDERSVHKVEPITK/LNADVLKTAE39K. Peptides are from proteasome subunit alpha type-2 and GTPase-activating protein and VPS9 domain-containing protein 1, respectively. Three crosslinked features are present that qualify the crosslink candidate to be a crosslinked peptide. Plus, there is a 7-amino acid, +2 sequence (LISQQQW) that is not consistent with the crosslinked peptide. Peptide LISQQQW does not disqualify the crosslink because the crosslink is adequately supported by the green KTIPEVKH peptide and the blue crosslink specific ions for peptide LV. Details of the fragmentation pattern are described in the figure legend.
Fig. 5 shows the MS/MS spectrum for dipeptide DPDYVRK1062GDAR/LVGILDILDEE532NR that does not qualify as an isopeptide crosslinked peptide. Peptides are from vascular endothelial growth factor receptor 2 and unconventional myosin-VI, respectively. There is only one crosslinked feature in the spectrum, the green PDY fragment. The 6-amino acid, +2 sequence (QFLELY) is not consistent with the crosslink candidate. Unassigned amino acids appended to the blue E in peptide QFLELY disqualify E as support for the blue peptide. This removes support for the blue peptide. Without support for both peptides, the candidate crosslink in Fig. 5 is rejected. Details of the fragmentation pattern are described in the figure legend.
4. Discussion
4.1. Isopeptide crosslinks induced by other organophosphates
In previous studies we have used the organophosphate pesticide chlorpyrifos oxon to induce isopeptide crosslinks in the pure proteins butyrylcholinesterase, casein, serum albumin and tubulin [5,6]. Schmidt et al. found that the organophosphorus nerve agent VX induced crosslinks in ubiquitin [7]. The results from the current work demonstrate that the organophosphate pesticide dichlorvos can also induce isopeptide crosslinks. From these observations it is tempting to suggest that any organophosphylate may be capable of inducing isopeptide crosslinks in a variety of proteins.
4.2. Mechanism for the reaction of organophosphates with lysine
In a previous study [6] we showed that the organophosphate chlorpyrifos oxon reacted with lysine residues in bovine casein, human serum albumin, mouse serum albumin, human butyrylcholinesterase, and porcine tubulin to form diethylphospho-adducts. Only selected lysine residues were labeled, suggesting that reactive lysines were activated. It was proposed that activation involved through-space, charge-charge interactions with nearby negatively charged residues. Consistent with this proposal, half of the reactive lysine residues were within two residues of an acidic residue in the linear sequence. Support for subsequent OP-induced isopeptide bond formation was the observation that 77% of the crosslinks detected involved a lysine that had been labeled with diethylphosphate.
4.3. Organophosphate induced isopeptide crosslinks and disease
Epidemiological evidence suggests that there is an increased incidence of Alzheimer’s disease [18,19] and Parkinson’s [20] disease in agricultural workers who are exposed to organophosphorus pesticides. A hallmark for both diseases is accumulation of aggregated protein. Previously, we demonstrated that chlorpyrifos oxon, the activated form of the pesticide chlorpyrifos, can crosslink proteins [6]. Of the proteins we have studied in vitro, tubulin is the most sensitive to reaction with chlorpyrifos oxon. Reaction leads to formation of aggregates [9]. Treatment of mice with nonlethal levels of chlorpyrifos resulted in disruption of microtubule structures in the brain [21]. In light of the critical role that microtubules play, disruption of their structure would be expected to presage neurological problems. More recently, we have shown that chlorpyrifos oxon can induce isopeptide dimerization of amyloid beta (1–42) [22]. The covalent amyloid beta dimer is considered to be the toxic form of amyloid beta that leads to protein aggregation and disease. It follows that exposure to organophosphorus pesticides could contribute to the progression of diseases associated with aggregation. At this point, we have no data on how common protein crosslinking is upon exposure to organophosphate levels encountered in the environment, however, because development of Alzheimer’s and Parkinson’s disease appears to require years, slow accumulation of organophosphorus-induced aggregates could be a causative factor.
4.4. Compare isopeptide crosslinks created by transglutaminase to those induced by organophosphates
Transglutaminase is well known for creating γ-glutamyl-ε-lysine isopeptide crosslinks between an α-carboxy group from glutamine on one protein and an ε-amino group from lysine on another. In 2007 Kang et al. reported that isopeptide bonds could form spontaneously between the side chains of lysine and asparagine on Spy0128 in the polymeric shaft of pili expressed by S. pyogenes [8,23]. More recently, it has been reported that organophosphylates can promote spontaneous isopeptide formation between the side chains of lysine and glutamate or lysine and aspartate [6,7]. The consequence of isopeptide bond formation by any of these methods is the creation of protein dimers and higher oligomers. Dimers and molecular aggregates created by transglutaminase are associated with neurodegenerative diseases [24,25].
4.5. The process of identifying isopeptide crosslinks
Identifying isopeptide crosslinks in mass spectral data is technically challenging and somewhat subjective. In the Methods section we outlined a procedure for selecting isopeptide crosslinked peptides from raw mass spectral data. The process begins with a database search of the mass spectral data against a SwissProt database using the Protein Prospector/Batch Tag Web algorithm (see the Methods section Batch Tag Web search for crosslinked peptide candidates). Such searches generate large numbers of crosslink candidates that can be listed using Protein Prospector/Search Compare. This list is then reduced by manually screening the crosslink candidates against parameters generated by Protein Prospector that reflect the probability that crosslinked peptide assignment is correct. Parameters used in this work are charge state 3, 4, or 5; Score >25; score difference >1; % matched intensity >45%; and a minimum of 5 amino acids in each peptide. These parameters were arrived at through experience. They reduce the number of crosslink candidates to a manageable value. MS/MS spectra for the resultant crosslink candidates are manually evaluated for features that are indicative of an isopeptide crosslink (see the Methods section Manual evaluation of crosslinked peptides).
Though this process works, it relies on subjective decision. For example, we found that Protein Prospector can assign different crosslink candidates to the same MS/MS spectrum. Parent ion 538.517 mass (+5) appeared four times in the third replicate with elution times of 29.12, 29.63, 100.27, and 100.77 min. The MS/MS pattern was essentially the same for all four. Protein Prospector assigned these candidates to peptide sequences VKGKTIK89EVAEAYR/VALEVE784DGRK (29.12 min); KRRAQEE196AK/VKGKTIK89EVAEAYR (29.63 min); KGVQLQT HPNVDK335K/QRVKKDVD260K (100.27 min) AVNKVKDTPGLGK450VK/KGVDIE4391ISHR (100.77 min). The first, second, and third sequences each have two crosslink features in their MS/MS spectra, one for each peptide. The fourth sequence has four crosslink features in its MS/MS spectrum. Therefore, all four candidates have enough crosslink features to qualify as crosslinked peptides. Fig. 6 shows the MS/MS spectrum for the m/z 538.517 isopeptide crosslinked peptide VKGKTIK89EVAEAYR/VALEVE784DGRK. Details of the fragmentation pattern are described in the figure legend.
The issue is further complicated by the fact that there is an m/z mass that is the +4 version of m/z 538.517. The m/z 672.894 mass also appeared four times in the third replicate with elution times of 29.11, 29.61, 31.56, and 100.80 min. Again, the MS/MS spectral pattern was essentially the same for all four, and the pattern was very similar to that for the m/z 538.517 masses. One might expect that Protein Prospector would assign the same crosslink candidate sequences to the m/z spectra that it did to the m/z 538.517 spectra. This is not the case. Protein Prospector assigned these candidates to peptide sequences KGVVKAE179K/ISLVKTEFHAK113EQYR (29.11 min); QRKVVKHP K113R/GKLKNVQSE422TKGLR (29.61 min); AVNKVKDTPGLGK450VK/KGVDIE4391ISHR (31.56 min); and GKVRVEK2031EK/VKNAVKYLQSLE92RS (100.80 min). Of these, only the third candidate is present in the m/z 538.517 group. The first and fourth sequences have four crosslinked features and therefore qualify as crosslinked peptides. The second and third sequences have five crosslink features and as such also qualify as crosslinked peptides. Fig. 7 shows the MS/MS spectrum for the m/z 672.894 isopeptide crosslinked peptide KGVVKAE179K/ISLVKTEFHAK113EQYR. Details of the fragmentation pattern are described in the figure legend.
Deciding which crosslinked sequence is correct is rather subjective. We chose AVNKVKDTPGLGK450VK/KGVDIE4391ISHR because it appeared in both the m/z 538.517 and 672.894 data sets, and because it appeared in the first and second replicates as well. It is noteworthy that the same sequence was assigned to the MS/MS spectrum for the m/z 896.86 (+3) parent ion in Fig. 2.
5. Conclusion
A protocol is presented for identifying zero-length isopeptide crosslinks in a complex protein mixture. Enrichment of tryptic peptides by binding to anti-isopeptide antibody is a key first step. Stringent washing of the immunopurified complex with a detergent-containing buffer minimizes the number of false positives. Searches of mass spectrometry data with Protein Prospector software provides a list of candidate dipeptide crosslinks. Manual evaluation of candidate crosslinked peptides is laborious but critical.
Our protocol identified isopeptide crosslinked proteins in cultured cells that had been exposed to 10 μM dichlorvos. Isopeptide crosslinks induced by organophosphate pesticides are distinct from crosslinks induced by transglutaminase. The chemically induced crosslinks are between lysine and glutamic acid or lysine and aspartic acid with release of a molecule of water. The transglutaminase induced crosslinks are between lysine and glutamine with release of a molecule of ammonia.
Protein aggregates in neurodegenerative diseases are thought to be produced in part by the action of transglutaminase. Our results suggest that exposure to organophosphorus pesticides may also be implicated.
Supplementary Material
Acknowledgments
Supported by NIH grant 1R21ES030132-01A1 (to OL) and Fred & Pamela Buffet Cancer Center Support Grant P30CA036727.
Mass spectrometry data were obtained by the Mass Spectrometry and Proteomics Core Facility at the University of Nebraska Medical Center, which is supported by state funds from the Nebraska Research Institute. Protein Prospector programs are available at no cost, https://prospector.ucsf.edu. (14May2022) Protein Prospector was developed in the University of California San Francisco Mass Spectrometry Facility, directed by Dr. Alma Burlingame, funded by the NIH National Institute for General Medical Sciences. The Proteomics Toolkit http://db.systemsbiology.net:8080/proteomicsToolkit/(14May2022) and Xcalibur Qual Browser (Thermo Scientific) were used to identify ions in MS/MS spectra.
Abbreviations
- LC-MS/MS
liquid chromatography tandem mass spectrometry
- MS/MS
tandem mass spectral fragmentation
- DCV
dichlorvos
- iBAQ
intensity based absolute quantitation
- PBS
phosphate buffered saline
- SDS
sodium dodecyl sulfate
- SH-SY5Y
human neuroblastoma cells
Footnotes
Declaration of competing interest
The authors declare no conflict of interest.
CRediT authorship contribution statement
Oksana Lockridge: Performed the research, reviewed and edited the manuscript. Lawrence M Schopfer: Analyzed the data and wrote the original draft.
Research data for this article
The mass spectrometry proteomics data have been deposited to ProteomeXchange Consortium [26] via PRIDE [27] partner repository with the dataset identifier PXD033593.
Appendix A. Supplementary data
Supplementary data to this article can be found online at https://doi.org/10.1016/j.ab.2022.114844.
Data availability
Data are deposited in the PRIDE database. Identifier PXD033593. Reviewer account details: Username reviewer_pxd033593@ebi.ac.uk; password Fk7nQocd.
References
- [1].Lorand L, Downey J, Gotoh T, Jacobsen A, Tokura S, The transpeptidase system which crosslinks fibrin by gamma-glutamyle-episilon-lysine bonds, Biochem. Biophys. Res. Commun 31 (1968) 222–230, 10.1016/0006-291x(68)90734-1. [DOI] [PubMed] [Google Scholar]
- [2].Tarcsa E, Fesus L, Determination of epsilon (gamma-glutamyl)lysine crosslink in proteins using phenylisothiocyanate derivatization and high-pressure liquid chromatographic separation, Anal. Biochem 186 (1990) 135–140, 10.1016/0003-2697(90)90586-x. [DOI] [PubMed] [Google Scholar]
- [3].Nemes Z, Petrovski G, Fesus L, Tools for the detection and quantitation of protein transglutamination, Anal. Biochem 342 (2005) 1–10, 10.1016/j.ab.2004.10.015. [DOI] [PubMed] [Google Scholar]
- [4].Nemes Z, Petrovski G, Aerts M, Sergeant K, Devreese B, Fesus L, Transglutaminase-mediated intramolecular cross-linking of membrane-bound alpha-synuclein promotes amyloid formation in Lewy bodies, J. Biol. Chem 284 (2009) 27252–27264, 10.1074/jbc.M109.033969. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].Biberoglu K, Tacal O, Schopfer LM, Lockridge O, Chlorpyrifos oxon-induced isopeptide bond formation in human butyrylcholinesterase, Molecules 25 (2020), 10.3390/molecules25030533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Schopfer LM, Lockridge O, Mass spectrometry identifies isopeptide cross-links promoted by diethylphosphorylated lysine in proteins treated with chlorpyrifos oxon, Chem. Res. Toxicol 32 (2019) 762–772, 10.1021/acs.chemrestox.9b00001. [DOI] [PubMed] [Google Scholar]
- [7].Schmidt C, Breyer F, Blum MM, Thiermann H, Worek F, John H, V-type nerve agents phosphonylate ubiquitin at biologically relevant lysine residues and induce intramolecular cyclization by an isopeptide bond, Anal. Bioanal. Chem 406 (2014) 5171–5185, 10.1007/s00216-014-7706-y. [DOI] [PubMed] [Google Scholar]
- [8].Kang HJ, Coulibaly F, Clow F, Proft T, Baker EN, Stabilizing isopeptide bonds revealed in gram-positive bacterial pilus structure, Science 318 (2007) 1625–1628, 10.1126/science.1145806. [DOI] [PubMed] [Google Scholar]
- [9].Schopfer LM, Lockridge O, Chlorpyrifos oxon promotes tubulin aggregation via isopeptide cross-linking between diethoxyphospho-Lys and Glu or Asp: implications for neurotoxicity, J. Biol. Chem 293 (2018) 13566–13577, 10.1074/jbc.RA118.004172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Schopfer LM, Onder S, Lockridge O, Evaluation of mass spectrometry MS/MS spectra for the presence of isopeptide crosslinked peptides, PLoS One 16 (2021), e0254450, 10.1371/journal.pone.0254450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Thorne GC, Ballard KC, Gaskell SJ, Mestable decomposition of peptide [M + H]+ ions via rearragnement involving loss of the C-terminal amino acid residue, J. Am. Soc. Mass Spectrom 1 (1990) 249–257, 10.1016/1044-0305-(90)85042-K. [DOI] [Google Scholar]
- [12].Dupre M, Cantel S, Martinez J, Enjalbal C, Occurrence of C-terminal residue exclusion in peptide fragmentation by ESI and MALDI tandem mass spectrometry, J. Am. Soc. Mass Spectrom 23 (2012) 330–346, 10.1007/s13361011-0254-1. [DOI] [PubMed] [Google Scholar]
- [13].Fodor S, Zhang Z, Rearrangement of terminal amino acid residues in peptides by protease-catalyzed intramolecular transpeptidation, Anal. Biochem 356 (2006) 282–290, 10.1016/j.ab.2006.06.023. [DOI] [PubMed] [Google Scholar]
- [14].Harrison AG, Young AB, Bleiholder C, Suhai S, Paizs B, Scrambling of sequence information in collision-induced dissociation of peptides, J. Am. Chem. Soc 128 (2006) 10364–10365, 10.1021/ja062440h. [DOI] [PubMed] [Google Scholar]
- [15].Beck M, Schmidt A, Malmstroem J, Claassen M, Ori A, Szymborska A, Herzog F, Rinner O, Ellenberg J, Aebersold R, The quantitative proteome of a human cell line, Mol. Syst. Biol 7 (2011) 549, 10.1038/msb.2011.82. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Geiger T, Wehner A, Schaab C, Cox J, Mann M, Comparative proteomic analysis of eleven common cell lines reveals ubiquitous but varying expression of most proteins, Mol. Cell. Proteomics 11 (2012), 10.1074/mcp.M111.014050. M111 014050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].Schwanhausser B, Busse D, Li N, Dittmar G, Schuchhardt J, Wolf J, Chen W, Selbach M, Global quantification of mammalian gene expression control, Nature 473 (2011) 337–342, 10.1038/nature10098. [DOI] [PubMed] [Google Scholar]
- [18].Hayden KM, Norton MC, Darcey D, Ostbye T, Zandi PP, Breitner JC, Welsh-Bohmer KA, Cache County Study I, Occupational exposure to pesticides increases the risk of incident AD: the Cache County study, Neurology 74 (2010) 1524–1530, 10.1212/WNL.0b013e3181dd4423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Tang BL, Neuropathological mechanisms associated with pesticides in Alzheimer’s disease, Toxics 8 (2020), 10.3390/toxics8020021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Sanchez-Santed F, Colomina MT, Herrero Hernandez E, Organophosphate pesticide exposure and neurodegeneration, Cortex 74 (2016) 417–426, 10.1016/j.cortex.2015.10.003. [DOI] [PubMed] [Google Scholar]
- [21].Jiang W, Duysen EG, Hansen H, Shlyakhtenko L, Schopfer LM, Lockridge O, Mice treated with chlorpyrifos or chlorpyrifos oxon have organophosphorylated tubulin in the brain and disrupted microtubule structures, suggesting a role for tubulin in neurotoxicity associated with exposure to organophosphorus agents, Toxicol. Sci 115 (2010) 183–193, 10.1093/toxsci/kfq032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].Onder S, Biberoglu K, Tacal O, Schopfer LM, Chlorpyrifos oxon crosslinking of amyloid beta 42 peptides is a new route for generation of self-aggregating amyloidogenic oligomers that promote Alzheimer’s disease, Chem. Biol. Interact 363 (2022), 110029, 10.1016/j.cbi.2022.110029. [DOI] [PubMed] [Google Scholar]
- [23].Kang HJ, Baker EN, Intramolecular isopeptide bonds: protein crosslinks built for stress? Trends Biochem. Sci 36 (2011) 229–237, 10.1016/j.tibs.2010.09.007. [DOI] [PubMed] [Google Scholar]
- [24].Brinkmalm G, Hong W, Wang Z, Liu W, O’Malley TT, Sun X, Frosch MP, Selkoe DJ, Portelius E, Zetterberg H, Blennow K, Walsh DM, Identification of neurotoxic cross-linked amyloid-beta dimers in the Alzheimer’s brain, Brain 142 (2019) 1441–1457, 10.1093/brain/awz066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].O’Malley TT, Witbold WM 3rd, Linse S, Walsh DM, The aggregation paths and products of Abeta42 dimers are distinct from those of the Abeta42 monomer, Biochemistry 55 (2016) 6150–6161, 10.1021/acs.biochem.6b00453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Deutsch EW, Bandeira N, Sharma V, Perez-Riverol Y, Carver JJ, Kundu DJ, Garcia-Seisdedos D, Jarnuczak AF, Hewapathirana S, Pullman BS, Wertz J, Sun Z, Kawano S, Okuda S, Watanabe Y, Hermjakob H, MacLean B, MacCoss MJ, Zhu Y, Ishihama Y, Vizcaino JA, The ProteomeXchange consortium in 2020: enabling ‘big data’ approaches in proteomics, Nucleic Acids Res. 48 (2020) D1145–D1152, 10.1093/nar/gkz984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Perez-Riverol Y, Bai J, Bandla C, Garcia-Seisdedos D, Hewapathirana S, Kamatchinathan S, Kundu DJ, Prakash A, Frericks-Zipper A, Eisenacher M, Walzer M, Wang S, Brazma A, Vizcaino JA, The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences, Nucleic Acids Res. 50 (2022) D543–D552, 10.1093/nar/gkab1038. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data are deposited in the PRIDE database. Identifier PXD033593. Reviewer account details: Username reviewer_pxd033593@ebi.ac.uk; password Fk7nQocd.







