Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Feb 11.
Published in final edited form as: Chembiochem. 2019 Nov 28;21(1-2):103–107. doi: 10.1002/cbic.201900611

A Click-chemistry based enrichable cross-linker for structural and protein interaction analysis by mass spectrometry

Michael Stadlmeier a,+, Leander Simon Runtsch a,+, Filipp Streshnev a, Martin Wühr b, Thomas Carell a,*
PMCID: PMC6980279  NIHMSID: NIHMS1054792  PMID: 31593346

Abstract

Mass spectrometry (MS) is the method of choice for the characterization of proteomes. Most proteins operate in protein complexes, where their close association modulates their function. However, with standard MS analysis the information of protein-protein interactions is lost and no structural information is retained. In order to gain structural and interactome data, new cross-linking reagents are needed that freeze inter- and intramolecular interactions. Here, we report the development of a new reagent, which has several features that enable highly sensitive cross-linking MS. The reagent enables enrichment of cross-linked peptides from the majority of background peptides to facilitate efficient detection of low abundant cross-linked peptides. Due to the special cleavable properties, the reagent can be utilised for MS2- and potentially for MS3-experiments. Thus our new cross-linking reagent, in combination with high end MS, should enable sensitive analysis of interactomes, which will help researchers to obtain important insights into cellular states in health and diseases.

Keywords: Click reaction, cross-linking mass spectrometry, Huisgen reaction, interactome analysis, proteomics, structural proteomics


Proteins need to interact with other proteins in order to form functional complexes. In many cases, protein function inside cells can not be understood without the information about the protein structure and the knowledge in which complex the protein is situated.[12] This is for example of paramount importance for proteins that modulate epigenetic information on DNA. Almost all of these chromatin modifying proteins require intensive interaction with metabolic enzymes, which provide the cofactors needed for histone acetylation, deacetylation or methylation and demethylation.[35] This showcases the need to study the complex environment of a given protein in order to analyse its function and activity state. Protein cross-linking in combination with analysis by mass spectrometry (XL-MS) is ideally suited as a method to gain information about protein structure as well as the composition of protein complexes.[69] For XL-MS, specialised chemical reagents, so called cross-linkers, are required, which are able to covalently connect protein residues that are in close proximity, e. g. interacting in a complex.[1011] Characterization of the cross-linked peptides by mass spectrometry, however, poses a formidable challenge for two reasons. First, cross-link-identification has to be achieved by analyzing fragment ions of not only one, but two connected peptides, thereby massively complicating MS2-spectra.

Second, the cross-linked peptide species have only a very low abundance and are part of a peptide mixture that is overwhelmingly dominated by non-cross-linked peptides. In many cases this leads to a dramatic information loss that hampers accurate cross-link identification and therefore interactome analysis. A few cross-linking reagents were developed, which introduce MS-cleavable groups and the possibility for XL-enrichment to tackle these problems.[1218] Here, we report the development of a new cross-linking reagent that is enrichable and which has cleavage properties that allow accurate cross-link identification. The designed cliXlink (1) is depicted in Fig. 1. The cell permeable reagent (Fig 1A, Supp. Fig. 1) features two succinimidylester units that can react with nucleophiles in proteins and are spaced roughly 9 Å apart, thus enabling the fixation of short distances to gain structural information of proteins and to capture proteins in close proximity to each other. Efficient MS-cleavability is ensured by the well-established sulfoxide group, which can be cleaved with low-energy collision-induced dissociation (CID) conditions prior to the peptide fragmentation. In addition, an alkyne unit allows to attach an enrichment moiety, e.g. biotin, to the cross-linked sites with the help of a Cu(l) catalysed Huisgen reaction.[19]

Figure 1. The novel cross-linker cliXlink (1).

Figure 1.

A) Structure of the cliXlink-reagent (1). B) Workflow for Cross-linking-MS-Experiments. After addition of 1, protein sites in close proximity are covalently linked. The generated cross-links can be functionalised with a copper-catalysed Huisgen reaction (CuAAC). Excess reagents are removed by acetone-precipitations. After enzymatic digestion and enrichment via magnetic streptavidin-beads, cross-linked peptides were cleaved off under mild conditions and subsequently analysed by LC-MS2. C) Fragmentation pathways a) and b) of the cross-linked peptides, forming two peptide pairs with a characteristic Δmrep of 31.9721 u.

Application of the crosslinker can follow a workflow as depicted in Fig. 1B. It involves addition of reagent 1 to a complex proteome, fixating the native state distance information of peptides in close proximity and preserving them throughout the further workflow for MS-analysis. Attachment of the affinity group for enrichment is carried out after cross-linking to avoid interference of bulky enrichment groups. By modification of the cross-linked peptides on the protein level, excess small molecule reagents can readily be removed by acetone precipitation. This is followed by enzymatic digestion of the proteins. The cross-linked peptides are in this way labelled, e.g. with biotin, which allows for their enrichment. Afterwards, they are analysed by mass spectrometry. Because the individual peptide masses in a cross-link are not immediately accessible, cross-link assignment is difficult, especially in complex samples. This requires to use MS-cleavable reagents, which facilitate MS2-identification by forming specific fragment ions.[2021] At best, the reagent should also enable MS3-experiments, which requires that the cross-linker cleaves before the peptides. This allows to separate the peptides for individual MS3-based identification. Our reagent 1 contains (B-hydrogens on both sides of the sulfoxide, so that the fragmentation occurs in two directions, as depicted in Fig. 1C. This leads to a clean separation of the peptides (α, β) and it generates two fragments for each peptide; an alkene and a sulfenic acid fragment, with the latter forming a thial upon water loss. This provides two mass pairs with a characteristic ∆m/z ≈ 32. Importantly, we noticed that the fragmentation pathway a) (Fig. 1C) predominates. Therefore, for the α-peptide the alkene- and for the (β-peptide the thial-fragments are the main cleavage products. Due to the asymmetric cleavage properties, most of the signal intensity is retained on one fragment per peptide, which should provide excellent sensitivity in MS3-experiments.

The synthesis of the reagent 1 is straight forward as shown in Scheme 1. Starting point is the oxidised disulfide dimer of homocystein 2, which is reacted with 4-pentynoic acid to disulfide 3. Reductive cleavage of the disulfide and alkylation of the generated thiols with methyl 3-bromopropionate generates sulfide compound 4. Saponification of both methyl esters to 5 and conversion of 5 into activated bissuccinimidyl-ester 6, followed by oxidation of the thioether to the sulfoxide furnishes cliXlink reagent 1 in a total yield of 16%. Of particular importance is that the reagent is very pure and that the reactive ester units are in place at both sides. Partial hydrolysis needs to be avoided. This was ensured by a final precipitation purification. Reagent 1 was dissolved in a mixture of ethyl acetate and dichloromethane and precipitated upon addition of hexanes.

Scheme 1. Synthesis of cliXlink (1).

Scheme 1.

a) 4-pentynoic acid, EDC HCI, HOBt H2O, NEt3, DMF, rt, 15 h, 71%. b) over two steps: 1)TCEP HCI, NaHCO3, DMF, H2O, rt, 3 h; 2) Methyl 3-bromopropionate, 45 °C, 44 h, 55%. c) LiOH, THF, H2O, 0 °C → rt, on, 91%. d) NHS, pyridine, trifluoroacetic anhydride, MeCN, 0 °C → rt, on, 78%. e) mCPBA, AcOEt, rt, 30 min, 57%.

We next investigated the mass spectrometric properties of the new reagent 1. To this end, we added the reagent to the commonly used model protein bovine serum albumin (BSA). We followed the workflow depicted in Fig. 1B without performing the enrichment step. In short, after addition of the reagent to the protein solution and reaction for 1 h at room temperature and physiological pH, we precipitated the protein with acetone, resuspended in buffer (see SI) and digested the protein subsequently with a mixture of trypsin and Lys-C.

The obtained peptide mixture was desalted with C18-tips and analysed by HPLC-MS2. The resulting data are depicted in Fig. 2. Fig. 2A shows two cross-linked BSA peptides (α + β) as an example. After determining the exact mass of the intact cross-link (m/z 677.1, Fig. 2B), we performed both CID and HCD fragmentation on the precursor to evaluate the fragmentation properties (Fig 2C). The more selective CID fragmentation (resonance excitation of the cross-link ion, top spectrum) at a low normalised collision energy of 25% provides as expected just a small set of intense signals. The two prominent signal pairs with a ∆m/z of 32 (Amrep) are obtained due to cross-linker fragmentation which results in a thial (α/β thial) and alkene (α/β alkene) fragment for each peptide. As mentioned before, fragmentation pathway a) is preferred, leading to an asymmetric intensity distribution. The dominant formation of the expected intact, separated peptides makes the reagent consequently amenable for more sophisticated MS3 experiments.

Figure 2. Analysis of cross-linked BSA prior to enrichment.

Figure 2.

A) Sequences of a representative cross-link within BSA: Peptide α: VHKECCHGDLLECADDR (lAA-modified on Cys) and β: ALKAWSVAR. B) The isotope envelope of the 5+− charged precursor is shown. C) Fragmentation of the cross-linked peptides under CID- (25%, top spectrum) and stepped HCD-conditions (25%/30%/32%, bottom spectrum) showing the product ions of the sulfoxide-cleavage. The different fragmentation pathways are shown in orange and purple (see Fig. 1). D) MS2-identification of BSA cross-link-sites prior to enrichment. The xVis-representation shows the positions of cross-linked amino acids as red lines (left). The corresponding Cα-Cα-distances are depicted in the BSA crystal structure (pdb: 4F5S[22]). E) Histogram of the measured euclidean Cα-Cα-distances in the crystal structure with the majority being below 40 Å and a mean measured distance of 22.1 Å. F) Distribution of the identified cross-linking sites between Lys-Lys, Lys-Thr, Lys-Ser and Lys-Tyr.

For our study we used HCD fragmentation. This method provides simultaneous cross-linker cleavage and formation of the peptide fragments required for identification (Fig 2C, bottom spectrum). To help with cross-link/peptide recognition, it is important that the HCD fragmentation still provides the alkene and thial fragments. The HCD data therefore allow to identify the peptides already by MS2.

Using this method, we analysed the data with the freely available MeroX-software, strictly filtering all cross-link identifications (score > 50, FDR 1%) and requiring the presence of at least 3 out of 4 ions from the two characteristic mass pairs.[2324] Without eploiting the possibility for CuAAC-based enrichment, we were able to identify 61 unique BSA cross-links in a single measurement, shown as an xVis representation (Fig. 2D).[25] This result is representative for measurements conducted with purified BSA in our lab. Additionally, we depicted the found cross-links in the BSA crystal stucture and noted that the majority of measured distances are reasonable. Importantly, the cross-links show a mean Ca-Ca distance between the cross-linked amino acids of 22.1 Å and a distance distribution which is in perfect agreement of what is expected for the cross-linking with a reagent that spaces the active esters by about 9 Å (Fig. 2E, Supp. Fig. 2)[26] and taking into account side chain lengths and protein molecular dynamics.[27] Cross-linking occurred mostly between two Lys residues (49%), but we also detected Lys-Thr (30%), Lys-Ser (13%) and Lys-Tyr (8%) connections in agreement with the reactivity of the succinimidyl esters (Fig. 2F).[28]

This success allowed us to validate the new cross-linker in a more complex environment. We particularly wanted to show the additional value of the CuAAC-based enrichment possibility. For the experiment we again cross-linked the BSA protein, precipitated with acetone, redissolved the cross-linked protein and subsequently performed the click reaction (Supp. Fig. 3A) by adding biotin-disulfide-azide 7 (Fig. 3A) and CuSO4/THPTA followed by reduction of Cu(ll) to Cu(l) upon addition of sodium ascorbate. After 1 h at room temperature, we performed another acetone precipitation to remove excess biotin azide. We then added trypsin and Lys-C for digestion.

Figure 3. Enrichment of cross-linked BSA from a complex sample.

Figure 3.

A) Structure of the utilised biotin-azide 7, with the biotin group (pink), the disulfide-bond (green) and the azide-moiety (blue). Upon reduction of the disulfide, the biotin moiety is removed from the cross-link. B) Depiction of the spike-in-experiment. 10 μg of cross-linked and CuAAC-modified BSA digest were added to 435 μg of HEK digest as a complex background. Magnetic streptavidin-beads were used to enrich the cross-linked peptides. C) Result of the HPLC-MS2-analysis showing effect of enrichment. The experiment was done in technical triplicates, numbers show the mean value and the standard deviation is indicated by the error bar.

These peptides we next combined with a protein digest obtained from a HEKcell extract (435 μg of protein, Fig. 3B). This creates a massive background of non-cross-linked peptides. When we now analysed the cross-links within this large background (Fig. 3C), we identified only a very small number of cross-link spectral matches (mean CSMs = 5.0), unique cross-linked sites (mean unique XLs = 3.7) as well as monolinks (mean = 5.3), where one succinimidylester hydrolysed before reaction with the protein. However, when we performed the enrichment using streptavidin coated magnetic beads, the number of cross-link identifications improved dramatically. Following incubation of the peptide mixture with the magnetic particles, extensive washing (6x) of the beads with buffer (see SI) and mild, reductive cleavage of the disulfide to liberate the “captured” cross-links and thereby removing the biotin moiety (Supp. Fig. 3B), we now on average detected 95.0 CSMs, 37.0 unique XLs together with 184.3 monolinks. The number of identified unique XLs is lower than for purified BSA (Fig. 2D), which could be explained by inefficiences in the CuAAC reaction or interference of the non-crosslinked complex background. However, this result shows that our new cross-linker 1 is able to enrich cross-linked peptides in a vast excess of unmodified peptides, facilitating the analysis of complex samples with low abundant cross-links. Originally, we were sceptical about the asymmetric structure of cliXlink (1). The data, however, show that despite this, a large number of cross-links is identified. In summary, our new cross-linker 1 combines the following advantages: first, the size of the reagent is ideally suited to gain valuable distance information for structural proteomics. In addition, the molecule is cell permeable and the alkyne unit does not cause major interference with the cross-linking reaction. Furthermore, the robust, efficient sulfoxide fragmentation simplifies cross-link identification. The possibility to functionalise peptides cross-linked with 1 via Click chemistry allows to employ diverse modifications and enrichment strategies enabling analysis of low abundant cross-links. To conclude, our reagent cliXlink (1) now paves the way for complex interactome analysis using MS2 and also MS3 experiments.

Supplementary Material

Supplement

Acknowledgements

We thank the Deutsche Forschungsgemeinschaft for financial support via SFB1309 (TP-A4) and SFB1032 (TP-A5) and the SPP1784. Further support was obtained from the Einzelverfahren CA275-11/1 and from the European Union through the Marie Curie International Training and Mobility Network “LightDynamics” (grant No. 765866). Research was furthermore supported by a European Research Council (ERC) advanced grant under the European Union’s Horizon 2020 research and innovation programme (grant agreement n° EPiR 741912) and the Volkswagen foundation (Initiative “Life”: EcoRib). M.W. was supported by NIH grant R35GM128813. M.S. and L.S.R. thank the Fonds der Chemischen Industrie for predoctoral fellowships. The mass spectrometry proteomics data are deposited to the ProteomeXchange Consortium via the PRIDE[29] partner repository with the accession number PXD015080.

Footnotes

Supporting information for this article is given via a link at the end of the document.

Publisher's Disclaimer: This manuscript has been accepted after peer review and appears as an Accepted Article online prior to editing, proofing, and formal publication of the final Version of Record (VoR). This work is currently citable by using the Digital Object Identifier (DOI) given below. The VoR will be published online in Early View as soon as possible and may be different to this Accepted Article as a result of editing. Readers should obtain the VoR from the journal website shown below when it is published to ensure accuracy of information. The authors are responsible for the content of this Accepted Article.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement

RESOURCES