Identification of Isopeptides Between Human Tissue Transglutaminase and Wheat, Rye, and Barley Gluten Peptides

Barbara Lexhaller; Christina Ludwig; Katharina Anne Scherf

doi:10.1038/s41598-020-64143-9

. 2020 May 4;10:7426. doi: 10.1038/s41598-020-64143-9

Identification of Isopeptides Between Human Tissue Transglutaminase and Wheat, Rye, and Barley Gluten Peptides

Barbara Lexhaller ¹, Christina Ludwig ², Katharina Anne Scherf ^1,^3,^✉

PMCID: PMC7198585 PMID: 32367038

Abstract

Celiac disease (CD) is a chronic immune-mediated enteropathy of the small intestine, which is triggered by the ingestion of storage proteins (gluten) from wheat, rye, and barley in genetically predisposed individuals. Human tissue transglutaminase (TG2) plays a central role in the pathogenesis of CD, because it is responsible for specific gluten peptide deamidation and covalent crosslinking, resulting in the formation of N^ε-(γ-glutamyl)-lysine isopeptide bonds. The resulting TG2-gluten peptide complexes are assumed to cause the secretion of anti-TG2 autoantibodies, but the underlying mechanisms are only partly known. To gain more insight into the structures of these complexes, the aim of our study was to identify TG2-gluten isopeptides. With the use of discovery-driven as well as targeted nanoscale liquid chromatography tandem mass spectrometry, we detected 29 TG2-gluten isopeptides in total, involving seven selected TG2 lysine residues (K205, K265, K429, K468, K590, K600, K677). Several gluten peptides carried known B-cell epitopes and/or T-cell epitopes, either intact 9-mer core regions or partial sequences, as well as sequences bearing striking similarities to already known epitopes. These novel insights into the molecular structures of TG2-gluten peptide complexes may help clarify their physiological relevance in the initiation of CD autoimmunity and the role of anti-TG2 autoantibodies.

Subject terms: Proteomics, Coeliac disease

Introduction

Celiac disease (CD) is defined as a chronic immune-mediated inflammatory disorder of the small intestine initiated by the storage proteins (gluten) of wheat, rye and barley in genetically predisposed subjects¹. The ingestion of gluten causes villous atrophy, lymphocyte infiltration and the stimulation of CD4⁺ T cells against gluten epitopes in CD patients. These epitopes are presented by the human leukocyte antigen (HLA) class II alleles HLA-DQ2.5, HLA-DQ2.2 and HLA-DQ8 of the major histocompatibility complex (MHC) expressed on B cells and antigen-presenting cells. The presentation of gluten peptides leads to the activation of CD4⁺ T cells, which are the main effector cells for immunologic processes^2,3.

Human tissue transglutaminase (TG2), a Ca²⁺-dependent protein-glutamine γ-glutamyltransferase (EC 2.3.2.13), is ubiquitously expressed and catalyses the deamidation of glutamine residues or the crosslinking reaction (transamidation) between a glutamine and a lysine residue to form a covalent N^ε-(γ-glutamyl)-lysine isopeptide bond⁴. The TG2-mediated deamidation converts certain glutamine residues to glutamic acid residues by releasing ammonia and incorporating water. This leads to an introduction of negative charges in gluten peptides following a distinct pattern, e.g., the glutamine residues in the sequences QXP, QXXF(Y/W/M/L/I/V) or QXPF(Y/W/M/L/I/V), where X designates any other amino acid except P, are preferentially targeted⁵. This introduction of negatively charged amino acids increases the binding affinity of gluten peptides to the HLA molecules and enhances their antigenicity in CD patients⁶. During transamidation, the γ-carboxamide group of a protein-bound glutamine serves as acyl donor that is transferred to an acyl acceptor, such as small, biogenic amines or an ε-amino group of protein-bound lysine to form a crosslink^7,8. The modification of gluten peptides by TG2 is known as a critical event in the pathomechanism of CD, particularly as TG2-gluten peptide complexes are formed⁸. Patients with active CD have specific anti-TG2 IgA (and IgG or IgM) antibodies⁹ and the formation of these antibodies is dependent on the ingestion of gluten. In previous studies, TG2 was identified as the predominant autoantigen of CD¹⁰. At the moment, there are different models to explain the formation of autoantibodies against TG2. It has been assumed that gluten-specific CD4⁺ T cells presented in the context of HLA-DQ2.5 or -DQ8 provide help to TG2-specific B cells¹¹. After this initiation, different ways for gluten uptake by B cells and the role of B cell receptors (BCR) are possible: (i) BCR take up TG2-gluten peptide complexes and present them to gluten-specific CD4⁺ T cells, which provide help to B cells for the formation of anti-TG2 antibodies (original hapten-carrier-model¹²). In addition, (ii) the BCR may be crosslinked to neighboring BCRs by TG2 and this process contributes to B-cell activation. Evidence for these models comes from previous studies that TG2 can crosslink TG2 molecules into multimeric complexes, which can additionally incorporate gluten peptides. These multimers stimulate TG2-specific B cells and are presented to gluten-specific T cells¹³. (iii) The BCR might be crosslinked to gluten peptides through TG2 activity and thus be directly involved in uptake and presentation either in a single TG2-BCR complex or (iv) with a neighboring BCR¹⁴. After endocytosis of the BCR-gluten peptide complexes and TG2 by the receptor, the isopeptide bond between gluten peptide and BCR may again be hydrolyzed by TG2. This step releases the deamidated gluten peptide that will be subsequently linked to HLA-DQ and presented to CD4⁺ T cells.

In-depth studies about the formation of covalent TG2-gluten peptide complexes showed that six lysine residues of TG2 were involved in crosslinking with two different gluten peptides⁸ or even with TG2 molecules to create covalent TG2-TG2-multimers¹³. In addition, a reciprocal proteomics strategy using an α-gliadin-derived model peptide recently allowed the identification of 34 isopeptides involving 20 different lysine residues of TG2¹⁵. It is also known that, when confronted with a complex gluten peptide mixture, TG2 preferentially crosslinks peptides containing known CD-active T-cell epitopes to an acyl acceptor substrate such as 5-biotinamido-pentylamine¹⁶. However, there is no information as to which gluten peptides are good substrates for crosslinking to TG2, because all studies so far have worked only with selected gluten-derived model peptides and not with physiologically relevant enzymatic gluten hydrolysates due to their extreme heterogeneity^8,15.

Therefore, the aim of our study was to apply our recently developed reciprocal mass spectrometric approach, including discovery-driven mass spectrometry¹⁵ and additional targeted proteomics, to complex gluten hydrolysates that had been incubated with TG2 and identify TG2-gluten isopeptides. We used well-characterized gluten protein types (GPTs) of wheat, rye and barley¹⁷ and extended our analysis strategy with additional confirmation of isopeptide identities by parallel reaction monitoring (PRM) LC-MS/MS as follow-up measurements.

Results

Experimental approach to identify TG2-gluten isopeptides

To reduce complexity compared to a total gluten hydrolysate, our experimental approach to identify TG2-gluten isopeptides started with the preparation of the following GPTs: α-gliadins, γ-gliadins, ω5-gliadins, ω1,2-gliadins, high- (HMW-GS) and low-molecular-weight glutenin subunits (LMW-GS) of wheat, ω-secalins, HMW-secalins, γ-75k-secalins and γ-40k-secalins of rye and C-hordeins, γ-hordeins, B-hordeins and D-hordeins of barley (Fig. 1a)^17,18. The individual GPTs were hydrolysed using a combination of pepsin and chymotrypsin/trypsin to mimic the main enzymatic processes during gastrointestinal digestion^16,19. Then, the resulting GPT hydrolysates were incubated with TG2, leading to the formation of TG2-gluten peptide complexes. These complexes were hydrolysed with trypsin followed by solid phase extraction (SPE) for clean-up of the isopeptide/peptide mixture and subsequent discovery-driven nanoscale liquid chromatography tandem mass spectrometry (nLC-MS/MS) analysis (Fig. 1b)¹⁵. The GPT blank controls without addition of TG2 were used to create customized protein databases (Table S1) for each GPT that were applied in the proteomics software MaxQuant (MQ)²⁰. In order to identify TG2-gluten isopeptides, MQ searches for gluten peptides (α-side of the isopeptide) were performed against the appropriate GPT-database with each of seven selected TG2-peptides (β-side of the isopeptide) as modifications. These seven peptides (FLKNAGR, WKNHGCQR, ISTKSVGR, LAEKEETGMAMR, DLYLENPEIKIR, QKR, AVKGFR, lysine residue involved in crosslink formation highlighted in bold) containing the lysine residues K205, K265, K429, K468, K590, K600 and K677 from the TG2 amino acid sequence were selected as possible crosslinking sites. The lysine residues K590, K600 and K677 had previously been identified by Fleckenstein et al.⁸ and the lysine residues K205, K265, K429 and K468 additionally by Lexhaller et al.¹⁵. K590, K600 and K677 were known as preferred TG2 crosslinking sites also for TG2 self-multimerization¹³, while K205, K265, K429 and K468 were involved in the formation of isopeptides with high identification scores¹⁵. The tryptic TG2 peptides were chosen to contain only one lysine residue to reduce potential variability on the TG2-side. The identities of the isopeptides were confirmed by annotating the b- and y-fragments as well as internal fragment ions (double fragmentation on both crosslinked peptide sequences) calculated with the MS-Product feature of ProteinProspector²¹. The identities of the isopeptides as well as the crosslinking site localisations were verified by re-analysing all samples using targeted parallel reaction monitoring (PRM) nLC-MS/MS. Data analysis was performed with Skyline²² and additional manual curation. PRM analysis yields higher ion intensities, because it focuses on monitoring the predefined transitions from precursor to fragment ions. This higher overall intensity provided more fragments, especially around the crosslinking sites.

Workflow to identify isopeptides between gluten protein types of wheat, rye and barley and human TG2. (a) Extraction and separation procedure to obtain gluten protein types from wheat, rye and barley flours, respectively, (b) Proteomics workflow combining a reciprocal search strategy to identify isopeptides using discovery-driven mass spectrometry, MaxQuant, Skyline and parallel reaction monitoring (PRM). SPE: solid phase extraction; TG2: recombinant human tissue transglutaminase.

Identification of isopeptides in wheat GPTs

Altogether, 13 isopeptides were identified in the wheat GPTs. Table 1 shows the identified isopeptides (sorted by TG2-modification site) in each GPT, the gluten protein corresponding to the identified gluten peptide with UniProtKB accession number, name and organism, the MQ identification score, as well as the numbers of characteristic fragments identified in discovery-driven nLC-MS/MS experiments and of those that were confirmed using PRM. The γ-gliadin-GPT hydrolysate contained five isopeptides (W2, W3, W6, W7, W9) with four different TG2-crosslinking sites. Four isopeptides with three different TG2 peptides were identified in the α-gliadin-GPT hydrolysate (W1, W8, W11, W12), two isopeptides with two different TG2 peptides in the LMW-GS-GPT hydrolysate (W4, W10) and one isopeptide each in the HMW-GS-GPT hydrolysate (W13) and the ω1,2-gliadin-GPT hydrolysate (W5). No isopeptides were identified in the hydrolysate of ω5-gliadin-GPT. The structures of the isopeptides as well as the localization probabilities for the crosslinks and the deamidation are shown in Fig. 2.

Table 1.

Isopeptides between TG2 and peptides derived from wheat gluten protein types.

Abb.^a	TG2 lysine^b	Gluten peptide^c	GPT^d	UniProtKB accession	UniProtKB name	Organism	MQ score^e	Fragments (discovery)^f		Manually checked (discovery)^g		Fragments (targeted)^h
Abb.^a	TG2 lysine^b	Gluten peptide^c	GPT^d	UniProtKB accession	UniProtKB name	Organism	MQ score^e	α ⁱ	β^j	α	β	α	β
W1	K205	WQIPEQSR	α	P04726	alpha/beta-gliadin clone PW1215	T. aestivum	49.37	10	4	18	13	9	—
W2	K205	AQIPQQL	γ	A0A290XYW2	gamma-gliadin	T. aestivum	66.99	9	4	22	6	8	4
W3	K205	VQGQGIIQPQQPAQL	γ	P08453	gamma-gliadin	T. aestivum	91.31	24	7	67	8	15	7
W4	K265	PYSQPQPF	LMW	X2KVH9	alpha-gliadin	T. aestivum	59.21	9	1	16	2	7	—
W5	K429	PQQTFPQQPLF	ω1,2	R9XUE1	LMW-GS	T. aestivum	84.57	14	1	34	1	11	—
W6	K468	PQPPQQPF^k	γ	A0A290XYS8	omega-gliadin	T. aestivum	75.31	11	2	24	14	9	—
W7	K590	VQGQGIIQPQQPAQL	γ	P08453	gamma-gliadin	T. aestivum	61.21	19	8	37	28	11	8
W8^l	K590	QEQQIGQEQQPGQW	α	B2LS24	HMW glutenin subunit type-2	T. timopheevii	55.26	12	2	28	—	14	2
W9^l	K600	PQQSEQVIPQQPQQPF	γ	A0A3B6UD61	uncharacterized protein	T. aestivum	104.05	17	—	39	—	14	—
W10^l	K600	QQQPPFWQQQPPF	LMW	I3QPH0	low molecular weight glutenin subunit t128	T. aestivum	70.94	14	—	35	—	19	—
W11	K677	RPQQPYPQPQPQY	α	A0A023WGB8	alpha-gliadin	T. aestivum	63.73	13	1	38	4	7	1
W12	K677	WQTPEQSR	α	I0IT59	alpha/beta-gliadin	T. aestivum	64.42	13	3	13	3	19	3
W13	K677	VYYPTSPQQPGQL	HMW	A0A1G4P1W4	HMW glutenin x-type subunit 1Bx6	T. aestivum	69.45	16	2	43	9	13	1

Open in a new tab

^aAbb., abbreviation, ^bLysine residue in the TG2 sequence, K205: peptide FLKNAGR, K265: peptide WKNHGCQR, K429: ISTKSVGR, K468: LAEKEETGMAMR, K590: DLYLENPEIKIR, K600: QKR, K677: AVKGFR, ^cGlutamine residues involved in crosslinking to TG2 are highlighted in bold, deamidation sites underlined, ^dGPT, gluten protein type, α, α-gliadins, γ, γ-gliadins, ω1,2, ω1,2-gliadins, HMW, high-molecular-weight glutenin subunits, LMW, low-molecular-weight glutenin subunits, ^eMQ, MaxQuant, ^fNumber of fragments identified by discovery-driven nLC-MS/MS and MaxQuant data analysis, ^gNumber of fragments identified by discovery-driven nLC-MS/MS and manual inspection of full scan spectra considering additional internal fragments calculated by ProteinProspector, ^hNumber of fragments identified by PRM analysis, ⁱα, α-side of the isopeptide (gluten peptide), ^jβ, β-side of the isopeptide (TG2 peptide), ^kUnspecific cleavage at the C-terminal end (IP), ^lCrosslinking site identified by PRM analysis.

Schematic illustration of isopeptides between TG2 and wheat gluten proteins. (W1)–(W7) and (W11)–(W13), Isopeptides with localization probabilities >75%. (W8)–(W10), Isopeptides with crosslinking sites additionally confirmed by parallel reaction monitoring. The binding glutamine residues are given in bold, the binding probabilities for the crosslinks (MaxQuant) in the grey box and the deamidation probabilities for the glutamine residues in colours. Specific fragments used to confirm the binding sites are given in blue (b-fragments) and pink (y-fragments).

As an example, a very high MQ score (91.31) was obtained for the isopeptide VQGQGIIQPQQPAQL/FLKNAGR (W3, Q and K involved in the isopeptide bond highlighted in bold, deamidation site underlined) based on the identification of 24 b- and y-fragments of the α-side (some fragments were identified without or with water- and ammonia-loss). First, the MQ search result of VQGQGIIQPQQPAQL carrying the TG2 isopeptide modification “fl” (= FLKNAGR) at Q₁₀ and a deamidation “de” at Q₄ was loaded into MQ Viewer to have all b- and y-fragments annotated. These fragments were by default decharged by MQ Viewer to show them as single charged fragments (Fig. 3a)²³. Additionally, in Fig. 3b, the annotation was done manually in the MS/MS spectrum by combining the information from the spectral annotation of MQ Viewer and 35 internal fragments calculated by ProteinProspector for confident localization of the deamidation and crosslinking sites in the isopeptides. The correct detection of isopeptide W3 was confirmed by targeted MS analysis using PRM^24,25. The PRM data revealed high quality chromatographic peaks for 15 characteristic fragment ions, including b_6α⁺ to b_8α⁺ as consecutive series²⁶ within the α-side, and seven fragments for the β-side modified at K with the deamidated VQGQGIIQPQQPAQL peptide. Q₁₀ was identified as the crosslinking site with a localization probability of 94.4% and the deamidation at Q₄ was detected with a probability of 99.9%.

MS/MS spectrum of the isopeptide between VQGQGIIQPQQPAQL (γ-gliadin) and FLKNAGR (TG2). (a) Spectrum of the isopeptide annotated with fragments of the γ-gliadin peptide with TG2-peptide as modification as annotated by MQ Viewer (spectrum is shown decharged with fragments only single charged). The fragments are marked in different colours as follows: y-fragments in red, b-fragments in blue, a- and c-fragments in turquoise, fragments with losses of NH₃ or CO marked in orange. (b) Spectrum of the isopeptide annotated manually with fragments of both sides of the isopeptides, calculated with ProteinProspector. The insert amplifies the range between *m/z* 100 to 400. The fragments are marked in different colours as follows: y-fragments of the γ-gliadin peptide in pink, b-fragments of the γ-gliadin peptide in blue, y-fragments of TG2 peptide in violet, a- and internal fragments in turquoise, fragments with losses of NH₃ or CO marked in orange.

The isopeptides W1-W7 and W11-W13 were already identified unambiguously by discovery-driven nLC-MS/MS and application of the confirmation parameters (at least seven identified b- or y-fragments, at least three fragments in a consecutive series and a crosslink localization probability ≥75%¹⁵). The additional PRM analysis confirmed these 10 identified isopeptides and their crosslinking and deamidation sites. However, the PRM data was essential to unambiguously localize the crosslinking site or some deamidation sites for the three isopeptides W8-W10. For this purpose, specific transitions around these sites were used to confirm the localization of the crosslinking or deamidation sites as shown in Fig. 2.

Identification of isopeptides in rye GPTs

Overall, six isopeptides were identified in the GPTs of rye (ω-secalins, HMW-secalins, γ-75k-secalins and γ-40k-secalins) (Table 2). Three isopeptides (R2-R4) crosslinked with three different TG2 peptides were detected in the γ-75k-secalin-GPT hydrolysate (Fig. 4). In the hydrolysate of the γ-40k-secalin-GPT, two isopeptides (R1, R6) with two different TG2 peptides were identified. One isopeptide (R5) with a gluten peptide derived from barley C-hordeins was identified in the ω-secalin-GPT hydrolysate, most likely due to high sequence homologies between rye ω-secalins and barley C-hordeins. No isopeptides were identified in the HMW-secalin-GPT hydrolysate.

Table 2.

Isopeptides between TG2 and peptides derived from rye gluten protein types.

Abb.^a	TG2 lysine^b	Gluten peptide^c	GPT^d	UniProtKB accession	UniProtKB name	Organism	MQ score^e	Fragments (discovery)^f		Manually checked (discovery)^g		Fragments (targeted)^h
Abb.^a	TG2 lysine^b	Gluten peptide^c	GPT^d	UniProtKB accession	UniProtKB name	Organism	MQ score^e	α ⁱ	β^j	α	β	α	β
R1^l	K205	IVQGQSIIQQQPAQL	γ40k	H8Y0N7	gamma prolamin	S. cereale ssp. afghanicum	68.97	17	4	34	4	13	3
R2^l	K429	AQVQGIIQPQQL	γ75k	A4GU91	75k gamma secalin	S. sylvestre	59.25	9	—	10	3	12	—
R3^l	K600	QPQQPFPQQPQQSF	γ75k	H8Y0K1	gamma prolamin	P. juncea	80.75	14	—	39	2	10	—
R4^l	K677	AQVQGIIQPQQL	γ75k	A4GU91	75k gamma secalin	S. sylvestre	90.05	16	2	36	5	19	2
R5	K677	QIPTPLQPQQPF	ω	Q41210	C-hordein	H. vulgare	57.18	14	1	41	3	10	1
R6	K677	AQIPQHL	γ40k	H8Y0N7	gamma prolamin	S. cereale ssp. afghanicum	62.98	10	3	24	8	9	3

Open in a new tab

^aAbb., abbreviation, ^bLysine residue in the TG2 sequence, K205: peptide FLKNAGR, K429: ISTKSVGR, K600: QKR, K677: AVKGFR, ^cGlutamine residues involved in crosslinking to TG2 are highlighted in bold, deamidation sites underlined, ^dGPT, gluten protein type, γ40k, γ-40k-secalins, γ75k, γ-75k-secalins, ω, ω-secalins, ^eMQ, MaxQuant, ^fNumber of fragments identified by discovery-driven nLC-MS/MS and MaxQuant data analysis, ^gNumber of fragments identified by discovery-driven nLC-MS/MS and manual inspection of full scan spectra considering additional internal fragments calculated by ProteinProspector, ^hNumber of fragments identified by PRM analysis, ⁱα, α-side of the isopeptide (gluten peptide), ^jβ, β-side of the isopeptide (TG2 peptide), ^lCrosslinking site identified by PRM analysis.

Schematic illustration of isopeptides between TG2 and rye gluten proteins. (R5)-(R6), Isopeptides with localization probabilities >75%. (R1)-(R4), Isopeptides with crosslinking sites additionally confirmed by parallel reaction monitoring. The binding glutamine residues are given in bold, the binding probabilities for the crosslinks (MaxQuant) in the grey box and the deamidation probabilities for the glutamine residues in colours. Specific fragments used to confirm the binding sites are given in blue (b-fragments) and pink (y-fragments).

The isopeptides R5 and R6 were already identified unambiguously by discovery-driven nLC-MS/MS, because they fulfilled the confirmation parameters and the crosslinking sites were identified with localization probabilities of 99.3% and 100%, respectively. The PRM data from these isopeptides were used as confirmation. To identify the crosslinking sites in the other rye isopeptides (R1-R4), the identification and confirmation of specific fragments by PRM analysis was needed. Figure 4 shows the structures of these isopeptides as well as the MQ localization probabilities and the specific fragments used to confirm the crosslinking site.

Identification of isopeptides in barley GPTs

In total, ten isopeptides were identified in the GPTs of barley (C-hordeins, γ-hordeins, D-hordeins and B-hordeins) (Table 3). Five isopeptides (B1, B3, B5, B6, B10) with four different TG2 peptides were identified in the D-hordein-GPT hydrolysate and four isopeptides (B4, B7-B9) in the γ-hordein-GPT hydrolysate. The B-hordein-GPT hydrolysate contained one isopeptide (B2) with a gluten peptide derived from wheat LMW-GS, most likely again due to high sequence homologies between B-hordeins from barley and LMW-GS from wheat (Fig. 5). No isopeptides were detected in the hydrolysate of the C-hordein-GPT itself. However, one isopeptide identified within the ω-secalin-GPT was assigned to a C-hordein.

Table 3.

Isopeptides between TG2 and peptides derived from barley gluten protein types.

Abb.^a	TG2 lysine^b	Gluten peptide^c	GPT^d	UniProtKB accession	UniProtKB name	Organism	MQ score^e	Fragments (discovery)^f		Manually checked (discovery)^g		Fragments (targeted)^h
Abb.^a	TG2 lysine^b	Gluten peptide^c	GPT^d	UniProtKB accession	UniProtKB name	Organism	MQ score^e	α ⁱ	β^j	α	β	α	β
B1 ^l	K205	QGQQGQQLGQGQQGYY	D	A0A2C9PIB7	high-molecular-weight glutenin subunit protein	Ae. umbellulata	59.90	13	—	25	4	16	—
B2	K265	VQQQQPPF	B	V9P6N2	LMW-i glutenin subunit 1	T. aestivum	85.85	8	2	9	1	8	2
B3	K590	PQQPGQW	D	I6TRS8	D-hordein	H. vulgare	44.97	7	13	11	21	7	13
B4	K590	IIPQQPQQPFPLQPHQPY^k	γh	P17991	C-hordein	H. vulgare	44.20	10	7	17	10	11	7
B5	K600	PQQPGQGQQPGQR	D	I6TRS8	D-hordein	H. vulgare	121.20	19	—	31	2	14	—
B6	K600	PQQPGQGQGQQGYYPGATSL^k	D	I6TRS8	D-hordein	H. vulgare	82.36	18	—	35	—	24	—
B7^l	K677	PLQPQQPFPW	γh	Q41210	C-hordein	H. vulgare	72.55	9	1	23	3	8	1
B8	K677	PQQQFPQQQFHQQQL	γh	A0A0B5JD29	omega-gliadin	T. aestivum	52.73	12	—	36	4	16	1
B9^l	K677	FPQYQIPTPL	γh	Q40053	Hor1–17 C-hordein	H. vulgare	47.94	11	2	25	7	10	2
B10^l	K677	PQQPGQGQGQQGYYPGATSL	D	I6TRS8	D-hordein	H. vulgare	106.36	23	2	41	3	24	2

Open in a new tab

^aAbb., abbreviation, ^bLysine residue in the TG2 sequence, K205: peptide FLKNAGR, K265: peptide WKNHGCQR, K590: DLYLENPEIKIR, K600: QKR, K677: AVKGFR, ^cGlutamine residues involved in crosslinking to TG2 are highlighted in bold, deamidation sites underlined, ^dGPT, gluten protein type, D, D-hordeins, B, B-hordeins, γh, γ-hordeins, ^eMQ, MaxQuant, ^fNumber of fragments identified by discovery-driven nLC-MS/MS and MaxQuant data analysis, ^gNumber of fragments identified by discovery-driven nLC-MS/MS and manual inspection of full scan spectra considering additional internal fragments calculated by ProteinProspector, ^hNumber of fragments identified by PRM analysis, ⁱα, α-side of the isopeptide (gluten peptide), ^jβ, β-side of the isopeptide (TG2 peptide), ^kBoth crosslinking sites are possible and could not be identified unambiguously due to missing fragments, ^lCrosslinking site identified by PRM analysis.

Schematic illustration of isopeptides between TG2 and barley gluten proteins. (B2), (B3) and (B5), Isopeptides with localization probabilities >75%. (B1), (B4) and (B6)–(B10), Isopeptides with crosslinking sites additionally confirmed by parallel reaction monitoring. The binding glutamine residues are given in bold, the binding probabilities for the crosslinks (MaxQuant) in the grey box and the deamidation probabilities for the glutamine residues in colours. Specific fragments used to confirm the binding sites are given in blue (b-fragments) and pink (y-fragments).

The isopeptides B2, B3 and B5 were already detected unambiguously by discovery-driven nLC-MS/MS experiments and the PRM analyses were only used for confirmation. The localization probabilities for the crosslinking sites were between 87.4% and 95.2% (Fig. 5). In comparison, PRM analyses were necessary to detect the specific fragments around the crosslinking sites in B1, B7, B9 and B10 and confirm the localization of the crosslinks (Fig. 5).

Regarding the isopeptide B4, the localization probability was 49.3% for the crosslink at Q₄ or Q₅, respectively. The PRM data also did not reveal the exact position of the crosslink, because the specific transitions for these two sites were not detectable. The isopeptide B6 was identified with two deamidation sites, one of which was detected clearly with a localization probability of 77.4% at Q₈. The positions of the second deamidation and the crosslinking site were ambiguous with localization probabilities of 51.0% at Q₁₀ or 40.2% at Q₁₁ for the deamidation and 46.8% at Q₁₀ or 39.8% at Q₁₁ for the crosslink. Even the PRM experiments did not provide any further information, so that the deamidation and crosslinking sites could not be assigned unequivocally within B6.

In the isopeptide B8, the crosslinking site was identified at various positions with various low localization probabilities by discovery-driven nLC-MS/MS: Q₈ with 27.3%, Q₉ with 36.4% and Q₁₂, Q₁₃, and Q₁₄ with 12.0%, respectively. The positions Q₂ (localization probability: 96.9%) and Q₃ (localization probability: 46.7%) of the two deamidated glutamine residues in the N-terminal part of the sequence were verified due to the specific transitions b_2α⁺ to b_4α⁺, and the position of the crosslinking site could be confirmed at Q₉ based on the detection of the characteristic b_6α⁺ to b_9α⁺ fragments after PRM analysis. Q₁₂ had a deamidation probability of 96.9%, so that only the exact positions of the fourth deamidation in the rear part (Q₁₃ or Q₁₄) could not be assigned unambiguously due to missing specific fragments.

Discussion

In this study, we applied a reciprocal proteomics strategy, including discovery-driven¹⁵ as well as targeted MS measurements, to complex gluten hydrolysates and identified isopeptides between TG2 and gluten peptides. To get well-defined gluten raw materials, GPTs were isolated by modified Osborne fractionation following preparative RP-HPLC and characterized as described before^17,18. In total, 13 isopeptides of wheat GPTs, six of rye GPTs and ten of barley GPTs were detected crosslinked to peptides containing any of the seven selected TG2-lysine residues (K205, K265, K429, K468, K590, K600, K677). The crosslinking sites were unambiguously identified by discovery-driven nLC-MS/MS with localization probabilities of >75% in 18 out of 29 isopeptides. The additional PRM analyses on the ambiguously identified crosslinks in 11 isopeptides were used to clearly assign the crosslinking site. This method enabled the identification of the exact crosslinking and deamidation sites in 8 of the remaining 11 isopeptides due to the detection of the characteristic fragments around the modified sites. Only one deamidation site (B8), one crosslinking site (B4) as well as one deamidation and one crosslinking site (B6) could not be assigned unambiguously. However, we were able to identify the subpart of the amino acid sequence, where the modified glutamines are located most likely.

No isopeptides were detected in the hydrolysates of ω5-gliadins, HMW-secalins and C-hordeins. This may have several causes, including poor digestibility of the proteins, especially for ω5-gliadins²⁷, comparatively low percentages of the respective GPT within the isolate, especially for C-hordeins¹⁷, isopeptide concentrations that were below the limit of detection or even no formation of isopeptides. Due to the multitude of potential pairings considering that the TG2 sequence contains 32 lysine residues in total, we decided to focus our data evaluation on the seven selected TG2-lysine residues within the specific peptides that had been reported as reactive sites in previous investigations^8,15. With the gluten peptide side also unknown prior to our investigation, including all 32 lysine residues would have dramatically increased the search space at the cost of decreasing confident isopeptide identification. However, our intentional limitation to these seven lysine residues also implies that we may have missed isopeptides, if they contained any other TG2-derived lysine peptide.

The gluten peptides involved in isopeptide formation were not always matched to the corresponding proteins that would be primarily expected in the respective GPT. Each isopeptide dataset was searched against the GPT-specific database that was generated during the discovery-driven experiment with the GPT blank controls. Nevertheless, these GPT-specific databases partly contained proteins from other closely related plant species due to incomplete or unannotated protein entries in the UniProtKB database²⁸. In some cases, the gluten peptides were derived from a different Triticum species like T. timopheevii (W8) or from different Secale species including Psathyrostachys juncea (Russian wild rye) (R1-R4, R6). One peptide present in the rye ω-secalin hydrolysate was matched to a protein sequence from H. vulgare (R5) and, vice versa, two peptides from the barley γ- and B-hordein hydrolysates corresponded to protein sequences from T. aestivum (B2, B8). This can be explained with the close phylogenetic relationship of wheat, rye and barley that causes extensive amino acid sequence homologies, especially in the repetitive domains^29,30. Several gluten peptides also contained missed peptic/tryptic/chymotryptic cleavages sites, as is known to occur frequently during gluten digestion^31,32. To enhance the quality of correct protein identifications, it might be useful to search in other, curated databases, which include more complete gluten entries³³.

The approach with TG2 and GPTs described here has to be seen as a two-component model system with simulated gastrointestinal digestion. The crosslinking reactions were performed using isolated fractions of wheat, rye and barley proteins and this is rather far away from the real conditions, where gluten proteins are part of a complex food matrix. The simulated digestion model is based on physiological conditions including the three gastrointestinal enzymes trypsin, pepsin and chymotrypsin, but without the action of other enzymes, e.g., brush-border enzymes. This design was chosen deliberately, because the additional action of several enzymes with different cleavage specificities would have made the MS data evaluation much more complicated and increased the peptide search space by several orders of magnitude. These limitations of the current study have to be considered carefully, because more gastrointestinal enzymes would produce more or maybe divergent peptides from a more complex matrix.

TG2 is known for its high reactivity with gluten peptides³⁴, especially those harboring T-cell epitopes¹⁶. Depending on the neighboring C-terminal amino acids, TG2 specifically deamidates glutamine residues in the QXP-, QXXF(Y/W/M/L/I/V)- or QXPF(Y/W/M/L/I/V)-motifs (where X designates any amino acid except P) resulting in increased binding affinity of the gluten peptides to the CD-associated HLA molecules³⁵. In contrast, the QXXP- or QP-motifs have been described as poor or no targets for TG2-mediated deamidation⁵. Thirteen of the 29 isopeptides carried a gluten peptide with at least one additional deamidation, of which five displayed the preferred QXP-motif (W9, R1, B2, B5, B9) and five the QXXF(Y/L/I)-motif (W3, R3, B1, B6, B8). Two gluten peptides were deamidated at the poor QXXP- (W10) and QP-motifs (R3), while the remaining deamidation sites were located in sequences with unknown effect on TG2-specificity. Non-enzymatic deamidation cannot be excluded in our experiments due to the slightly alkaline pH conditions during incubation with TG2 and tryptic digestion³⁶, but our intent was to focus on the identification of crosslinking sites, rather than deamidation sites.

Among the 29 isopeptides, 12 crosslinking sites to TG2 were located in the preferred QXP-motif (W1–W3, W7, W8, W12, W13, R5, R6, B3, B5, B7) and three in the QXX(Y/I)-motif (R1, B9, B10). Five isopeptides had the crosslink within the QP-motif (W4–W6, W10, W11) and one within the QXXP-motif (W9) that are either no or poor targets for TG2. The other crosslinks were either not localized unambiguously (B4, B6) or involved QXXX-motifs (R2-R4, B1, B2, B8) that may or may not have an effect on TG2 specificity. In case of W4 no preferred target was available, but these results point to the fact that TG2 might not necessarily follow the known specificity when it comes to crosslinking TG2 molecules to gluten peptides instead of deamidation. However, further experiments would be necessary to study the mechanisms of crosslinking versus deamidation in more detail.

A further limitation of the current study is that it does not allow a differentiation if one TG2 molecule carries several gluten peptides crosslinked to different lysine residues or if there are several TG2 molecules that carry one gluten peptide each. In view of the relative distance between the active site of TG2 and the crosslinked lysine residues, it appears most likely that TG2 crosslinked the gluten peptides to other independent neighbouring TG2 molecules. In our well-defined model system, there were no other acyl acceptor substrates present except for TG2. However, this situation is uncommon under physiological conditions, where other extracellular matrix proteins such as collagen or fibronectin and free amines are always present^37,38. To address this major limitation of the current study, further experiments would be necessary in the presence of other proteins or free amines as potential substrates for TG2.

Of the 26 different gluten peptides (the isopeptides W3/W7, R2/R4 and B6/B10 involve different TG2 lysine residues, but the same gluten peptide, respectively) identified as part of isopeptides, three contained three different complete 9-mer core regions of known T-cell epitopes³⁹: IQPQQPAQL (DQ2.5.glia-γ2⁴⁰) in W3 and W7, FRPQQPYPQ (DQ2.5-glia-α3⁴⁰) in W11 (F at the N-terminal end missing due to chymotryptic cleavage) and QQPFPQQPQ (DQ2.5-glia-γ5⁴¹) in R3. The crosslinked glutamine residue in W3, W7 and W11 was located within the core region, whereas R3 had the crosslink after the core region, but within the truncated motif of the epitope DQ2.5-glia-γ1 (PQQSFPQQQ⁴²) that contains a chymotryptic cleavage site. The DQ2.5-glia-α3 and DQ2.5.glia-γ2 epitopes had also been identified as preferred TG2 substrates by Dorum et al.¹⁶. Several of the other gluten peptides crosslinked to TG2 also show striking similarities with known T-cell epitopes. For example, B7 is identical to LQPQQPFPQ (DQ2.5-glia-γ4e⁴³) except for the C-terminal W, while also being identical to PQPQQPFPW (DQ2.5-glia-ω2⁴⁴) except for the N-terminal L. W9 and B4 contain seven and eight amino acids of QQPQQPFPQ (DQ2.5-glia-γ4c⁴¹), respectively. Multiple sequence alignment of all identified gluten peptides that were bound to TG2 revealed that the PQQP-motif was the most common feature in many gluten peptides. However, there were also variations such as PQQL and PQQS, while some peptides had a different sequence altogether (e.g., within B9 or B1) (Fig. S1). The alignment of the gluten peptides considering the deamidation sites essentially showed a similar picture (Fig. S2).

As the formation of stable gluten peptide-HLA complexes is the prerequisite for activating the gluten-reactive T-cell response³⁵ these TG2-bound gluten peptides carrying known T-cell epitopes may contribute to enhanced T-cell reactivity. In turn, gluten-reactive T cells provide help to gluten-specific B cells with both receptor repertoires sharing a preference for deamidated gluten peptides with overlapping or adjacent recognition sequences^45,46. Although eight of the gluten peptides we identified within the isopeptides were too short (only eight amino acids in five cases, or seven amino acids in three cases) to elicit binding to HLA-DQ2.5, -DQ2.2 or -DQ8 molecules, one (W1) did carry a sequence recognized by gluten-specific B-cells (IPEQ, WQIPEQ)⁴⁶. Furthermore, the peptides W9, R3, R6, B4 and B7 contained the QPQQPF-motif⁴⁶ and W11 the PXPQP-motif⁴⁵, that are reported as important sequences for B-cell receptor recognition. Regarding TG2-specific B cells, the most likely route is that TG2-gluten peptide complexes are taken up through the B-cell receptor¹², but our knowledge on the cooperation of gluten-reactive T cells and TG2-specific B cells in B-cell activation warrants further investigation¹¹. Our findings on isopeptide formation between TG2 and gluten peptides from a complex gluten hydrolysate may help shed some more light into the complex interactions between HLA-DQ2/8 molecules, gluten-reactive T cells, gluten-specific B cells and TG2-specific B cells. The workflow combining discovery-driven and PRM nLC-MS/MS could also be adapted to other related questions, because TG2 is also known to interact not only with gluten, but also with extracellular matrix proteins, such as fibronectin^37,38.

Conclusion

We identified 29 isopeptides of TG2 with peptides from gluten hydrolysates from wheat, rye and barley in vitro using a reciprocal proteomics strategy. The model system does not rely on model peptides, but uses gluten proteins extracted from the flours and hydrolysed by three different gastrointestinal enzymes to mimic physiological conditions in a simplified form. In addition to discovery-driven mass spectrometry, all isopeptides were verified by targeted proteomics (PRM) that allowed the localization of the respective crosslinking site. These results provide novel insights into preferred TG2 substrates and the molecular structures of TG2-gluten peptide complexes. Several gluten peptides carried known B-cell and T-cell epitopes, either intact 9-mer core regions or partial sequences, as well as sequences bearing striking similarities to already known epitopes. Further research combining in vitro and in vivo experiments on the extent and the activation of B cells are needed to get more insights on the immunological and physiological relevance of these complexes. With the proteomics strategy in place, it would be interesting to gradually move away from the well-defined model system to studying TG2-mediated crosslinking under physiologically relevant conditions, e.g., with additional action of brushborder enzymes and in the presence of other acyl acceptor substrates such as other extracellular matrix proteins or free amines.

Methods

Material

All chemicals and solvents were at least HPLC or LC-MS grade. Recombinant human TG2 was purchased from Zedira (Darmstadt, Germany) as a purified and lyophilized protein produced in sf9 insect cells. Trypsin (from bovine pancreas, TPCK-treated, ≥10,000 BAEE U/mg protein), pepsin (from porcine gastric mucosa, 10 FIP U/mg) and α-chymotrypsin (from bovine pancreas, TLCK-treated, ≥40 U/mg of protein) were purchased from Sigma-Aldrich (Steinheim, Germany). The Retention Time Standardize Kit PROCAL (Proteome Tools Calibration Standard) was from JPT (Berlin, Germany).

Grain Samples

Grains of wheat (cultivar (cv.) Akteur, harvest year 2011, I.G. Pflanzenzucht, Munich, Germany), rye (cv. Visello, harvest year 2013, KWS Lochow, Bergen, Germany), and barley (cv. Marthe, harvest year 2009, Nordsaat Saatzucht, Langenstein, Germany) were milled into white flour using a Quadrumat Junior Mill (Brabender, Duisburg, Germany) and sieved to a particle size of 200 μm. Then, the flours were allowed to rest for 2 weeks prior to the determination of moisture and protein contents (conversion factor N × 5.7) according to International Association for Cereal Science and Technology (ICC) Standards 110/1⁴⁷ and 167⁴⁸, respectively. The moisture contents were 14.59 ± 0.01% (wheat), 11.42 ± 0.01% (rye) and 12.09 ± 0.06% (barley) and the crude protein contents were 9.93 ± 0.14% (wheat), 5.81 ± 0.29% (rye) and 6.72 ± 0.04% (barley).

Preparation of GPTs

The GPTs α-gliadins, γ-gliadins, ω5-gliadins, ω1,2-gliadins, HMW-GS and LMW-GS of wheat, ω-secalins, HMW-secalins, γ-75k-secalins and γ-40k-secalins of rye, and C-hordeins, γ-hordeins, D-hordeins and B-hordeins of barley were isolated as reported in detail by Schalk et al.¹⁸ and Lexhaller et al.¹⁷. Briefly, the protein fractions were isolated stepwise by modified Osborne fractionation from wheat, rye and barley flours using salt solution (0.4 mol/l NaCl with 0.067 mol/l Na₂HPO₄/KH₂PO₄, pH 7.6) to obtain the albumins/globulins, ethanol/water (60/40, v/v) to obtain the prolamins and glutelin extraction solution (2-propanol/water (50/50, v/v)/0.1 mol/l Tris-HCl, pH 7.5, containing 2 mol/l (w/v) urea and 0.06 mol/l (w/v) dithiothreitol (DTT)) at 60 °C under nitrogen to obtain the glutelins. The supernatants of each prolamin and glutelin fraction were combined, concentrated, lyophilized and re-dissolved for preparative RP-HPLC. After filtration of the prolamin and glutelin solutions (0.45 μm), the GPTs were separated on a Jasco HPLC (Jasco, Gross-Umstadt, Germany) according to their retention times, collected from several runs, pooled, lyophilized and stored at −20 °C until use. Then, the GPTs were characterized by RP-HPLC, SDS-PAGE and discovery-driven mass spectrometry to verify their identities and purities as already reported in detail^17,18.

Enzymatic digestion of GPTs

Each GPT was suspended in 0.02 mol/l HCl (pH 2) and hydrolyzed with pepsin at an enzyme:substrate ratio of 1:20 (w/w) for 60 min at 37 °C. After adjusting the pH to 6.5 with sodium phosphate buffer (50 mmol/l), trypsin and chymotrypsin were added at an enzyme:substrate ratio of 1:40 (w/w), respectively and hydrolyzed for 120 min at 37 °C^16,19. The samples were heated for 10 min at 95 °C to stop proteolysis, centrifuged and filtered. For the following crosslinking reaction with TG2, the samples were dried using a vacuum centrifuge (37 °C, 4 h, 800 Pa), reconstituted in TRIS/HCl buffer (0.1 mol/l, pH 7.4, 10 mmol/l CaCl₂) and the resulting peptide concentrations were estimated with a NanoDrop Micro-UV/VIS spectrophotometer and the protein A205 application (NanoDrop One, Thermo Scientific, Madison, USA) at 205 nm, which can be used to determine peptide concentrations based on the absorption of the peptide bonds.

Crosslinking reaction of TG2 and GPT hydrolysates

The reaction of TG2 (0.16 nmol/l) with each GPT hydrolysate was performed in TRIS/HCl buffer (0.1 mol/l, pH 7.4, 10 mmol/l CaCl₂) at a molar ratio of TG2:GPT hydrolysate of 1:150 at 37 °C for 120 min.¹⁵ To inactivate TG2, all samples were heated at 95 °C for 10 min. The negative controls were prepared by adding the GPT hydrolysates after inactivation of TG2. Additional GPT blank controls contained only GPT in TRIS/HCl buffer and were treated as described above just without TG2. The samples and the negative controls were prepared in triplicates; the GPT blank controls were also prepared in triplicates, but pooled prior to tryptic hydrolysis.

Tryptic digestion and isopeptide clean-up

Enzymatic hydrolysis and peptide purification were carried out as described in detail by Lexhaller et al.¹⁵. Briefly, all samples, negative controls and GPT blank controls were hydrolyzed with trypsin at an enzyme:substrate ratio of 1:100 (w/w) at 37 °C for 24 h and the digestion was stopped with formic acid (FA, pH <2). Purification was done by solid phase extraction (SPE) using 50 mg Sep-Pak tC₁₈ cc cartridges (Waters, Eschborn, Germany). After activation with methanol (1 ml), equilibration with acetonitrile/water/FA (80:20:0.1; 1 ml), and washing with acetonitrile/water/FA (2:98:0.1; 5 × 1 ml), the cartridges were loaded with the samples and washed again. The isopeptides and peptides were eluted with acetonitrile/water/FA (40:60:0.1; 2 ×0.5 ml), dried and reconstituted in FA (0.1%, v/v). Prior to nLC-MS/MS analysis, the peptide concentrations of the reconstituted samples were estimated again with the NanoDrop Micro-UV/VIS spectrophotometer at 205 nm. All samples were spiked with the PROCAL Mix (33 fmol/µl) and diluted in 96 well plates to a concentration of 200 ng/µl with acetonitrile/water/FA (2:98:0.1).

Discovery-driven mass spectrometry

nLC-MS/MS analysis was performed on an Ultimate 3000 nanoHLPC system (Dionex, Idstein, Germany) coupled to a Q Exactive HF mass spectrometer (Thermo Fisher Scientific, Dreieich, Germany). The nanoscale LC system consisted of a trap column (75 µm × 2 cm, self-packed with Reprosil-Pur, C₁₈, ODS-3, 5 µm resin, Dr. Maisch, Ammerbuch, Germany) and an analytical column (75 µm × 40 cm, self-packed with Reprosil-Gold, C₁₈, 3 µm resin, Dr. Maisch). The injection volume was 2 µL (estimated peptide concentration: 0.16 µg/µL). The peptides were delivered to the trap column using solvent A0 (0.1% FA in water) at a flow rate of 5 µL/min and separated on the analytical column using a 60 min linear gradient from 4% to 32% solvent B at a flow rate of 300 nL/min (solvent A1, 5% DMSO, 0.1% FA in water; solvent B, 5% DMSO, 0.1% FA in acetonitrile)⁴⁹. The MS was operated in data-dependent acquisition mode, automatically switching between MS1 and MS2 spectra to acquire full scans. The mass-to-charge (m/z) range for the acquisition of MS1 spectra was 360–1,300 m/z at an Orbitrap full MS scan (resolution: 60,000, automatic gain control (AGC) target value: 3e6, maximum injection time: 50 ms). In the MS2, the Top18 peptide precursors were automatically selected for fragmentation by higher energy collision-induced dissociation (isolation width: 1.7 Th, maximum injection time: 25 ms, AGC value: 1e5). Analysis was performed using 25% normalized collision energy at a resolution of 15,000.

Preparation of GPT databases

Each GPT blank control was searched individually against a protein database containing all gliadin entries (January 2019; 5,958 entries), glutenin entries (January 2019; 4,488 entries), secalin entries (January 2019; 219 entries) and hordein entries (January 2019; 158 entries) of the UniProtKB database using MQ (software version 1.6.0.1)²⁰. The parameters were set as follows: digestion mode: specific, enzyme: trypsin, pepsin, chymotrypsin, maximum missed cleavage sites: 2, variable modifications: deamidation (NQ), oxidation (M), main search peptide tolerance: 4.5 ppm, mass tolerance for fragment ions: 0.5 Da. All other parameters were used as default settings. All identified proteins in the proteinGroups.txt file were used to create an appropriate database for each GPT.

Identification of TG2-gluten isopeptides

The Thermo Xcalibur full scan.raw files of each GPT (three samples and three negative controls) were directly used as input in MQ²⁰ and searched against the appropriate GPT database. Seven peptides containing lysine residues (K205, K265, K429, K468, K590, K600, K677) from the TG2 sequence were selected as possible crosslinking sites in the isopeptides. The elemental compositions of these tryptic TG2 peptides were calculated in silico to configure the TG2-sides of the isopeptides as modifications in MQ. A formal subtraction of NH₃ was necessary to use these peptides as modifications (TG2-modifications) in an isopeptide bond¹⁵. The parameters were set as follows for the individual search runs: digestion mode: specific, enzyme: trypsin, pepsin, chymotrypsin, maximum missed cleavage sites: 2, variable modifications: each TG2-modification in one single search run, deamidation (NQ), TG2-modifications: FLKNAGR, C₃₆H₅₇N₁₁O₉, WKNHGCQR, C₄₃H₆₂N₁₆O₁₁S, ISTKSVGR, C₃₅H₆₃N₁₁O₁₂, LAEKEETGMAMR, C₅₅H₉₃N₁₅O₂₀S₂, DLYLENPEIKIR, C₆₈H₁₀₈N₁₆O₂₁, QKR, C₁₇H₃₁N₇O₅, AVKGFR, C₃₁H₄₉N₉O₇, max. number of modifications per peptide: 5, fasta files: appropriate GPT and TG2 (UniProtKB accession no. P21980) fasta files, minimum score for modified peptides: 40. All other parameters were used as default settings.

Annotation of MS/MS fragments of the isopeptides

To confirm the identification and the respective crosslinking sites of the isopeptides, the b- and y-fragments of both sides were calculated with the MS-Product feature of the ProteinProspector webpage (v.5.22.1, University of California, San Francisco, CA, USA)²¹. The sequences of gluten peptides and the TG2-modifications were entered and the binding Q or K were replaced by “u” for the user-specified amino acid elemental composition of the other isopeptide site, respectively. ProteinProspector parameters were then set to calculate b-, y- and internal fragments and associated fragments due to water- and ammonia-loss. The charge states were calculated up to 5+ for the precursors and up to 3+ for the fragments.

Isopeptide confirmation and creation of PRM methods

Skyline-daily (version 19.0.9.149)²² was used to confirm the identities of all detected isopeptides, to compare negative controls and samples and to create isolation lists for the PRM methods. To confirm the identified isopeptides and reject false positives, the sequences of the GPT-peptides were loaded into Skyline as the targets and modified with the appropriate TG2-modifications, a deamidation (−17 Da) or both, according to the MQ output. To identify the isopeptides from both sides, the reverse isopeptide sequence, i.e., the sequence of the TG2 peptide, was also loaded into Skyline and modified with the previously identified GPT peptide via a crosslink. Then, Skyline generated the appropriate precursors of all sequences. Every isopeptide was manually checked to fulfill the following parameters: (a) the retention time had to match with the identified retention time of the MQ search (ID), (b) comparison of retention time and isotopic dot product scores (idotp: generated from comparing the expected precursor isotopic distribution to the observed distribution; scored from 0–1 with 1 being the highest) among the triplicates using the graphical tools²², (c) reproducible detection of the isopeptide in the three replicates and absence in the negative controls; false positive matches in the negative controls were rejected, (d) the idotp had to be >0.9, (e) the threshold for unambiguous localization was set to a localization probability of >75% (MQ search). MS/MS libraries were built to generate the isolation lists for the isopeptides of each GPT. Therefore, the MQ output tables “msms.txt” of the searches of every modification were imported into Skyline. All identified isopeptides of one GPT and their reversed isopeptides with the appropriate GPT-modifications were summarized in one PRM method. This method was exported as an isolation list for use in the nLC-MS/MS system. A single isolation list and a single PRM method were created for each GPT.

Targeted mass spectrometry

All PRM measurements were carried out using the exact same instrument and LC conditions as for the discovery-driven setup (see above). The MS was operated in unscheduled PRM mode with the following settings: MS1 resolution: 60,000, MS1 automatic gain control (AGC) target value: 3e6, MS1 maximum injection time: 100 ms, MS1 scan range 360–1300 m/z, quadrupole isolation window width: 1.7 Th, MS2 maximum injection time: 22 ms, MS2 AGC value: 1e6. High-energy collision-induced dissociation was performed using a normalized collision energy of 27.

PRM data analysis

The Xcalibur.raw files of the PRM data were imported into Skyline separately for each GPT. The transitions of each target were checked manually and in comparison to the negative controls. To confirm the identified isopeptides and reject false positives, the following parameters were checked: (a) the retention time in the PRM data had to match with the identified retention time of the MQ search (ID) and the full scan data, (b) the comparison of retention time and idotp of the precursors among the triplicates had to fit using the graphical tools and no detection of the signals in the negative controls had to be observed, (c) according to Chen et al.²⁶, at least seven identified b- or y-fragments had to match theoretical peptide fragments, (d) at least three fragments had to be consecutive in the peptide sequence. Every identified isopeptide was double-checked with the MQ search result in the MQ Viewer.

Multiple sequence alignment of gluten peptides

All gluten peptides identified as part of the isopeptides were compiled into a peptide fasta file, either without or with deamidation at the sites we had detected. The multiple sequence alignment was done using MAFFT online version 7.452 on January 16, 2020 (Computational Biology Research Center, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan) using the default settings and the L-INS-I algorithm⁵⁰.

Supplementary information

Supplementary information.^{(96.1KB, pdf)}

Acknowledgements

The authors would like to thank Ms. Hermine Kienberger and Ms. Nina Lomp (BayBioMS) for excellent technical assistance and help with LC-MS experiments, Prof. Dr. Peter Köhler and Dr. Herbert Wieser for helpful discussions as well as Matthew Chambers (MSRC Bioinformatics, Vanderbilt University) and Brendan MacLean (Department of Genome Sciences, University of Washington) for support with Skyline. This research project (No. 250645717) was funded by the German Research Foundation (Deutsche Forschungsgemeinschaft, DFG, Bonn). The publication of this article was funded by the Open Access Fund of the Leibniz Association.

Author contributions

Investigation, Visualization, Writing-original draft, B.L.; Methodology, B.L. and K.A.S.; Data acquisition and curation, B.L. and C.L.; Resources, Writing-review and editing, C.L. and K.A.S.; Conceptualization, Project administration, Supervision, K.A.S.

Data availability

The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium (http://proteomecentral.proteomexchange.org) with the dataset identifier PXD017693 and are publicly available on Panorama Public (https://panoramaweb.org/8QUQ5F.url).

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

is available for this paper at 10.1038/s41598-020-64143-9.

References

1.Ludvigsson JF, et al. The Oslo definitions for coeliac disease and related terms. Gut. 2013;62:43–52. doi: 10.1136/gutjnl-2011-301346. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Schuppan D, Junker Y, Barisani D. Celiac disease: From pathogenesis to novel therapies. Gastroenterology. 2009;137:1912–1933. doi: 10.1053/j.gastro.2009.09.008. [DOI] [PubMed] [Google Scholar]
3.Qiao S-W, Iversen R, Raki M, Sollid LM. The adaptive immune response in celiac disease. Semin Immunopathol. 2012;34:523–40. doi: 10.1007/s00281-012-0314-z. [DOI] [PubMed] [Google Scholar]
4.Sollid LM. Coeliac disease: Dissecting a complex inflammatory disorder. Nat Rev Immunol. 2002;2:647–655. doi: 10.1038/nri885. [DOI] [PubMed] [Google Scholar]
5.Vader LW. Specificity of tissue transglutaminase explains cereal toxicity in celiac disease. J Exp Med. 2002;195:643–649. doi: 10.1084/jem.20012028. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.van de Wal Y, et al. Cutting edge: Selective deamidation by tissue transglutaminase strongly enhances gliadin-specific T cell reactivity. J Immunol. 1998;161:1585–1588. [PubMed] [Google Scholar]
7.Folk FE. Mechanism and basis for specificity of transglutaminase-catalyzed epsilon-(gamma-glutamyl) lysine bond formation. Adv Enzymol Relat Areas Mol Biol. 1983;54:1–56. doi: 10.1002/9780470122990.ch1. [DOI] [PubMed] [Google Scholar]
8.Fleckenstein B, et al. Molecular characterization of covalent complexes between tissue transglutaminase and gliadin peptides. J Biol Chem. 2004;279:17607–17616. doi: 10.1074/jbc.M310198200. [DOI] [PubMed] [Google Scholar]
9.Volta U, Molinaro N, Fusconi M, Cassani F, Biachi FB. IgA antiendomysial antibody test: A step forward in celiac disease screening. Dig Dis Sci. 1991;36:752–756. doi: 10.1007/BF01311232. [DOI] [PubMed] [Google Scholar]
10.Dieterich W, et al. Identification of tissue transglutaminase as the autoantigen of celiac disease. Nat Med. 1997;3:797–801. doi: 10.1038/nm0797-797. [DOI] [PubMed] [Google Scholar]
11.du Pré MF, Sollid LM. T-cell and B-cell immunity in celiac disease. Best Pract Res Clin Gastroenterol. 2015;29:413–423. doi: 10.1016/j.bpg.2015.04.001. [DOI] [PubMed] [Google Scholar]
12.Sollid LM, Molberg Ø, McAdam S, Lundin KEA. Autoantibodies in coeliac disease: tissue transglutaminase - guilt by association? Gut. 1997;41:851–852. doi: 10.1136/gut.41.6.851. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Stamnaes J, Iversen R, du Pré MF, Chen X, Sollid LM. Enhanced B cell receptor recognition of the autoantigen transglutaminase 2 by efficient catalytic self-mutimerization. PLoS ONE. 2015;10:e0134922. doi: 10.1371/journal.pone.0134922. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Iversen R, du Pré MF, Di Niro R, Sollid LM. Igs as substrates for transglutaminase 2: Implications for autoantibody production in celiac disease. J Immunol. 2015;195:5159–5168. doi: 10.4049/jimmunol.1501363. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Lexhaller B, Ludwig C, Scherf KA. Comprehensive detection of isopeptides between human tissue transglutaminase and gluten peptides. Nutrients. 2019;11:2263. doi: 10.3390/nu11102263. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Dorum S, et al. The preferred substrates for transglutaminase 2 in a complex wheat gluten digest are peptide fragments harboring celiac disease T-cell epitopes. PLoS ONE. 2010;5:e14056. doi: 10.1371/journal.pone.0014056. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Lexhaller B., Colgrave M.L. & Scherf K.A. Characterization and relative quantitation of wheat, rye and barley gluten protein types by LC-MS/MS, Front Plant Sci, 10.3389/fpls.2019.01530 (2019) [DOI] [PMC free article] [PubMed]
18.Schalk K, Lexhaller B, Koehler P, Scherf KA. Isolation and characterization of gluten protein types from wheat, rye, barley and oats for use as reference materials. PLoS One. 2017;12:e0172819. doi: 10.1371/journal.pone.0172819. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Gessendorfer B, Koehler P, Wieser H. Preparation and characterization of enzymatically hydrolyzed prolamins from wheat, rye, and barley as references for the immunochemical quantitation of partially hydrolyzed gluten. Anal Bioanal Chem. 2009;395:1721–1728. doi: 10.1007/s00216-009-3080-6. [DOI] [PubMed] [Google Scholar]
20.Cox J, Mann M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol. 2008;26:1367–1372. doi: 10.1038/nbt.1511. [DOI] [PubMed] [Google Scholar]
21.Chu F, Baker PR, Burlingame AL, Chalkley RJ. Finding chimeras: a bioinformatics strategy for identification of cross-linked peptides. Mol Cell Proteomics. 2010;9:25–31. doi: 10.1074/mcp.M800555-MCP200. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Schilling B, et al. Platform independent and label-free quantitation of proteomic data using MS1 extracted ion chromatograms in Skyline. Application to protein acetylation and phosphorylation. Mol Cell Proteomics. 2012;11:202–214. doi: 10.1074/mcp.M112.017707. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Neuhauser N, Michalski A, Cox J, Mann M. Expert system for computer-assisted annotation of MS/MS spectra. Mol Cell Proteomics. 2012;11:1500–1509. doi: 10.1074/mcp.M112.020271. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Gallien S, Bourmaud A, Kim SY, Domon B. Technical considerations for large-scale parallel reaction monitoring analysis. J Proteomics. 2014;100:147–159. doi: 10.1016/j.jprot.2013.10.029. [DOI] [PubMed] [Google Scholar]
25.Peterson AC, Russell JD, Baile DJ, Westphall MS, Coon JJ. Parallel reaction monitoring for high resolution and high mass accuracy quantitative, targeted proteomics. Mol Cell Proteomics. 2012;11:1475–1488. doi: 10.1074/mcp.O112.020131. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Chen Y, Kwon SW, Kim SC, Zhao Y. Integrated approach for manual evaluation of peptides identified by searching protein sequence databases with tandem mass spectra. J Proteome Res. 2005;4:998–1005. doi: 10.1021/pr049754t. [DOI] [PubMed] [Google Scholar]
27.Schalk K, Koehler P, Scherf KA. Targeted liquid chromatography tandem mass spectrometry to quantitate wheat gluten using well-defined reference proteins. PLoS ONE. 2018;13:e0192804. doi: 10.1371/journal.pone.0192804. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Colgrave ML, Goswami H, Howitt CA, Tanner GJ. Proteomics as a tool to understand the complexity of beer. Food Res Int. 2013;54:1001–1012. doi: 10.1016/j.foodres.2012.09.043. [DOI] [Google Scholar]
29.Wicker T, et al. Frequent gene movement and pseudogene evolution is common to the large and complex genomes of wheat, barley, and their relatives. Plant Cell. 2011;23:1706–1718. doi: 10.1105/tpc.111.086629. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Colgrave ML, et al. Comparing multiple reaction monitoring and sequential window acquisition of all theoretical mass spectra for the relative quantification of barley gluten in selectively bred barley lines. Anal Chem. 2016;88:9127–9135. doi: 10.1021/acs.analchem.6b02108. [DOI] [PubMed] [Google Scholar]
31.Shan L, et al. Identification and analysis of multivalent proteolytically resistant peptides from gluten: implications for celiac sprue. J Proteome Res. 2005;4:1732–1741. doi: 10.1021/pr050173t. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Picotti P, Aebersold R, Domon B. The implications of proteolytic background for shotgun proteomics. Mol Cell Proteomics. 2007;6:1589–1598. doi: 10.1074/mcp.M700029-MCP200. [DOI] [PubMed] [Google Scholar]
33.Bromilow S, et al. A curated gluten protein sequence database to support development of proteomics methods for determination of gluten in gluten-free foods. J Proteomics. 2017;163:67–75. doi: 10.1016/j.jprot.2017.03.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Piper JL, Gray GM, Khosla C. High selectivity of human tissue transglutaminase for immunoactive gliadin peptides: implications for celiac sprue. Biochem. 2002;41:386–393. doi: 10.1021/bi011715x. [DOI] [PubMed] [Google Scholar]
35.Arentz-Hansen H, et al. The intestinal T cell response to alpha-gliadin in adult celiac disease is focused on a single deamidated glutamine targeted by tissue transglutaminase. J Exp Med. 2000;191:603–612. doi: 10.1084/jem.191.4.603. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Krokhin OV, Antonovici M, Ens W, Wilkins JA, Standing KG. Deamidation of -Asn-Gly- sequences during sample preparation for proteomics: Consequences for MALDI and HPLC-MALDI analysis. Anal Chem. 2006;78:6645–6650. doi: 10.1021/ac061017o. [DOI] [PubMed] [Google Scholar]
37.Cardoso I, et al. Transglutaminase 2 interactions with extracellular matrix proteins as probed with celiac disease autoantibodies. FEBS J. 2015;282:2063–2075. doi: 10.1111/febs.13276. [DOI] [PubMed] [Google Scholar]
38.Stamnaes J, Cardoso I, Iversen R, Sollid LM. Transglutaminase 2 strongly binds to an extracellular matrix component other than fibronectin via its second C-terminal beta-barrel domain. FEBS J. 2016;283:3994–4010. doi: 10.1111/febs.13907. [DOI] [PubMed] [Google Scholar]
39.Sollid L.M. et al. Update 2020: nomenclature and listing of celiac disease-relevant gluten epitopes recognized by CD4+ T cells, Immunogenetics10.1007/s00251-019-01141-w (2019) [DOI] [PubMed]
40.Vader LW, et al. The gluten response in children with celiac disease is directed toward multiple gliadin and glutenin peptides. Gastroenterology. 2002;122:1729–1737. doi: 10.1053/gast.2002.33606. [DOI] [PubMed] [Google Scholar]
41.Arentz-Hansen H, et al. Celiac lesion T cells recognize epitopes that cluster in regions of gliadins rich in proline residues. Gastroenterology. 2002;123:803–809. doi: 10.1053/gast.2002.35381. [DOI] [PubMed] [Google Scholar]
42.Sjöström H, et al. Identification of a gliadin T-cell epitope in coeliac disease: general importance of gliadin deamidation for intestinal T-cell recognition. Scand J Immunol. 1998;48:111–115. doi: 10.1046/j.1365-3083.1998.00397.x. [DOI] [PubMed] [Google Scholar]
43.Qiao S.W. & Sollid L.M. Two novel HLA-DQ2.5-restricted gluten Tcell epitopes in the DQ2.5-glia-γ4 epitope family, Immunogenetics10.1007/s00251-019-01138-5 (2019) [DOI] [PubMed]
44.Tye-Din JA, et al. Comprehensive, quantitative mapping of T cell epitopes in gluten in celiac disease. Sci Transl Med. 2010;2:41ra51. doi: 10.1126/scitranslmed.3001012. [DOI] [PubMed] [Google Scholar]
45.Bateman EA, et al. IgA antibodies of coeliac disease patients recognise a dominant T cell epitope of A-gliadin. Gut. 2004;53:1274–1278. doi: 10.1136/gut.2003.032755. [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Osman AA, et al. B cell epitopes of gliadin. Clin Exp Immunol. 2000;121:248–254. doi: 10.1046/j.1365-2249.2000.01312.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
47.ICC Standard No. 110/1. International Association for Cereal Science and Technology. Determination of the moisture content of cereals and cereal products (practical method). 1976.
48.ICC Standard No. 167. International Association for Cereal Science and Technology. Determination of crude protein in grain and grain products for food and feed by the Dumas combustion principle. 2000.
49.Hahne H, et al. DMSO enhances electrospray response, boosting sensitivity of proteomic experiments. Nat Methods. 2013;10:989–992. doi: 10.1038/nmeth.2610. [DOI] [PubMed] [Google Scholar]
50.Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2019;20:1160–1166. doi: 10.1093/bib/bbx108. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary information.^{(96.1KB, pdf)}

Data Availability Statement

[CR1] 1.Ludvigsson JF, et al. The Oslo definitions for coeliac disease and related terms. Gut. 2013;62:43–52. doi: 10.1136/gutjnl-2011-301346. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR2] 2.Schuppan D, Junker Y, Barisani D. Celiac disease: From pathogenesis to novel therapies. Gastroenterology. 2009;137:1912–1933. doi: 10.1053/j.gastro.2009.09.008. [DOI] [PubMed] [Google Scholar]

[CR3] 3.Qiao S-W, Iversen R, Raki M, Sollid LM. The adaptive immune response in celiac disease. Semin Immunopathol. 2012;34:523–40. doi: 10.1007/s00281-012-0314-z. [DOI] [PubMed] [Google Scholar]

[CR4] 4.Sollid LM. Coeliac disease: Dissecting a complex inflammatory disorder. Nat Rev Immunol. 2002;2:647–655. doi: 10.1038/nri885. [DOI] [PubMed] [Google Scholar]

[CR5] 5.Vader LW. Specificity of tissue transglutaminase explains cereal toxicity in celiac disease. J Exp Med. 2002;195:643–649. doi: 10.1084/jem.20012028. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.van de Wal Y, et al. Cutting edge: Selective deamidation by tissue transglutaminase strongly enhances gliadin-specific T cell reactivity. J Immunol. 1998;161:1585–1588. [PubMed] [Google Scholar]

[CR7] 7.Folk FE. Mechanism and basis for specificity of transglutaminase-catalyzed epsilon-(gamma-glutamyl) lysine bond formation. Adv Enzymol Relat Areas Mol Biol. 1983;54:1–56. doi: 10.1002/9780470122990.ch1. [DOI] [PubMed] [Google Scholar]

[CR8] 8.Fleckenstein B, et al. Molecular characterization of covalent complexes between tissue transglutaminase and gliadin peptides. J Biol Chem. 2004;279:17607–17616. doi: 10.1074/jbc.M310198200. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Volta U, Molinaro N, Fusconi M, Cassani F, Biachi FB. IgA antiendomysial antibody test: A step forward in celiac disease screening. Dig Dis Sci. 1991;36:752–756. doi: 10.1007/BF01311232. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Dieterich W, et al. Identification of tissue transglutaminase as the autoantigen of celiac disease. Nat Med. 1997;3:797–801. doi: 10.1038/nm0797-797. [DOI] [PubMed] [Google Scholar]

[CR11] 11.du Pré MF, Sollid LM. T-cell and B-cell immunity in celiac disease. Best Pract Res Clin Gastroenterol. 2015;29:413–423. doi: 10.1016/j.bpg.2015.04.001. [DOI] [PubMed] [Google Scholar]

[CR12] 12.Sollid LM, Molberg Ø, McAdam S, Lundin KEA. Autoantibodies in coeliac disease: tissue transglutaminase - guilt by association? Gut. 1997;41:851–852. doi: 10.1136/gut.41.6.851. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Stamnaes J, Iversen R, du Pré MF, Chen X, Sollid LM. Enhanced B cell receptor recognition of the autoantigen transglutaminase 2 by efficient catalytic self-mutimerization. PLoS ONE. 2015;10:e0134922. doi: 10.1371/journal.pone.0134922. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Iversen R, du Pré MF, Di Niro R, Sollid LM. Igs as substrates for transglutaminase 2: Implications for autoantibody production in celiac disease. J Immunol. 2015;195:5159–5168. doi: 10.4049/jimmunol.1501363. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Lexhaller B, Ludwig C, Scherf KA. Comprehensive detection of isopeptides between human tissue transglutaminase and gluten peptides. Nutrients. 2019;11:2263. doi: 10.3390/nu11102263. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Dorum S, et al. The preferred substrates for transglutaminase 2 in a complex wheat gluten digest are peptide fragments harboring celiac disease T-cell epitopes. PLoS ONE. 2010;5:e14056. doi: 10.1371/journal.pone.0014056. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Lexhaller B., Colgrave M.L. & Scherf K.A. Characterization and relative quantitation of wheat, rye and barley gluten protein types by LC-MS/MS, Front Plant Sci, 10.3389/fpls.2019.01530 (2019) [DOI] [PMC free article] [PubMed]

[CR18] 18.Schalk K, Lexhaller B, Koehler P, Scherf KA. Isolation and characterization of gluten protein types from wheat, rye, barley and oats for use as reference materials. PLoS One. 2017;12:e0172819. doi: 10.1371/journal.pone.0172819. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR19] 19.Gessendorfer B, Koehler P, Wieser H. Preparation and characterization of enzymatically hydrolyzed prolamins from wheat, rye, and barley as references for the immunochemical quantitation of partially hydrolyzed gluten. Anal Bioanal Chem. 2009;395:1721–1728. doi: 10.1007/s00216-009-3080-6. [DOI] [PubMed] [Google Scholar]

[CR20] 20.Cox J, Mann M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat Biotechnol. 2008;26:1367–1372. doi: 10.1038/nbt.1511. [DOI] [PubMed] [Google Scholar]

[CR21] 21.Chu F, Baker PR, Burlingame AL, Chalkley RJ. Finding chimeras: a bioinformatics strategy for identification of cross-linked peptides. Mol Cell Proteomics. 2010;9:25–31. doi: 10.1074/mcp.M800555-MCP200. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Schilling B, et al. Platform independent and label-free quantitation of proteomic data using MS1 extracted ion chromatograms in Skyline. Application to protein acetylation and phosphorylation. Mol Cell Proteomics. 2012;11:202–214. doi: 10.1074/mcp.M112.017707. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Neuhauser N, Michalski A, Cox J, Mann M. Expert system for computer-assisted annotation of MS/MS spectra. Mol Cell Proteomics. 2012;11:1500–1509. doi: 10.1074/mcp.M112.020271. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Gallien S, Bourmaud A, Kim SY, Domon B. Technical considerations for large-scale parallel reaction monitoring analysis. J Proteomics. 2014;100:147–159. doi: 10.1016/j.jprot.2013.10.029. [DOI] [PubMed] [Google Scholar]

[CR25] 25.Peterson AC, Russell JD, Baile DJ, Westphall MS, Coon JJ. Parallel reaction monitoring for high resolution and high mass accuracy quantitative, targeted proteomics. Mol Cell Proteomics. 2012;11:1475–1488. doi: 10.1074/mcp.O112.020131. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Chen Y, Kwon SW, Kim SC, Zhao Y. Integrated approach for manual evaluation of peptides identified by searching protein sequence databases with tandem mass spectra. J Proteome Res. 2005;4:998–1005. doi: 10.1021/pr049754t. [DOI] [PubMed] [Google Scholar]

[CR27] 27.Schalk K, Koehler P, Scherf KA. Targeted liquid chromatography tandem mass spectrometry to quantitate wheat gluten using well-defined reference proteins. PLoS ONE. 2018;13:e0192804. doi: 10.1371/journal.pone.0192804. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Colgrave ML, Goswami H, Howitt CA, Tanner GJ. Proteomics as a tool to understand the complexity of beer. Food Res Int. 2013;54:1001–1012. doi: 10.1016/j.foodres.2012.09.043. [DOI] [Google Scholar]

[CR29] 29.Wicker T, et al. Frequent gene movement and pseudogene evolution is common to the large and complex genomes of wheat, barley, and their relatives. Plant Cell. 2011;23:1706–1718. doi: 10.1105/tpc.111.086629. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR30] 30.Colgrave ML, et al. Comparing multiple reaction monitoring and sequential window acquisition of all theoretical mass spectra for the relative quantification of barley gluten in selectively bred barley lines. Anal Chem. 2016;88:9127–9135. doi: 10.1021/acs.analchem.6b02108. [DOI] [PubMed] [Google Scholar]

[CR31] 31.Shan L, et al. Identification and analysis of multivalent proteolytically resistant peptides from gluten: implications for celiac sprue. J Proteome Res. 2005;4:1732–1741. doi: 10.1021/pr050173t. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR32] 32.Picotti P, Aebersold R, Domon B. The implications of proteolytic background for shotgun proteomics. Mol Cell Proteomics. 2007;6:1589–1598. doi: 10.1074/mcp.M700029-MCP200. [DOI] [PubMed] [Google Scholar]

[CR33] 33.Bromilow S, et al. A curated gluten protein sequence database to support development of proteomics methods for determination of gluten in gluten-free foods. J Proteomics. 2017;163:67–75. doi: 10.1016/j.jprot.2017.03.026. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR34] 34.Piper JL, Gray GM, Khosla C. High selectivity of human tissue transglutaminase for immunoactive gliadin peptides: implications for celiac sprue. Biochem. 2002;41:386–393. doi: 10.1021/bi011715x. [DOI] [PubMed] [Google Scholar]

[CR35] 35.Arentz-Hansen H, et al. The intestinal T cell response to alpha-gliadin in adult celiac disease is focused on a single deamidated glutamine targeted by tissue transglutaminase. J Exp Med. 2000;191:603–612. doi: 10.1084/jem.191.4.603. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR36] 36.Krokhin OV, Antonovici M, Ens W, Wilkins JA, Standing KG. Deamidation of -Asn-Gly- sequences during sample preparation for proteomics: Consequences for MALDI and HPLC-MALDI analysis. Anal Chem. 2006;78:6645–6650. doi: 10.1021/ac061017o. [DOI] [PubMed] [Google Scholar]

[CR37] 37.Cardoso I, et al. Transglutaminase 2 interactions with extracellular matrix proteins as probed with celiac disease autoantibodies. FEBS J. 2015;282:2063–2075. doi: 10.1111/febs.13276. [DOI] [PubMed] [Google Scholar]

[CR38] 38.Stamnaes J, Cardoso I, Iversen R, Sollid LM. Transglutaminase 2 strongly binds to an extracellular matrix component other than fibronectin via its second C-terminal beta-barrel domain. FEBS J. 2016;283:3994–4010. doi: 10.1111/febs.13907. [DOI] [PubMed] [Google Scholar]

[CR39] 39.Sollid L.M. et al. Update 2020: nomenclature and listing of celiac disease-relevant gluten epitopes recognized by CD4+ T cells, Immunogenetics10.1007/s00251-019-01141-w (2019) [DOI] [PubMed]

[CR40] 40.Vader LW, et al. The gluten response in children with celiac disease is directed toward multiple gliadin and glutenin peptides. Gastroenterology. 2002;122:1729–1737. doi: 10.1053/gast.2002.33606. [DOI] [PubMed] [Google Scholar]

[CR41] 41.Arentz-Hansen H, et al. Celiac lesion T cells recognize epitopes that cluster in regions of gliadins rich in proline residues. Gastroenterology. 2002;123:803–809. doi: 10.1053/gast.2002.35381. [DOI] [PubMed] [Google Scholar]

[CR42] 42.Sjöström H, et al. Identification of a gliadin T-cell epitope in coeliac disease: general importance of gliadin deamidation for intestinal T-cell recognition. Scand J Immunol. 1998;48:111–115. doi: 10.1046/j.1365-3083.1998.00397.x. [DOI] [PubMed] [Google Scholar]

[CR43] 43.Qiao S.W. & Sollid L.M. Two novel HLA-DQ2.5-restricted gluten Tcell epitopes in the DQ2.5-glia-γ4 epitope family, Immunogenetics10.1007/s00251-019-01138-5 (2019) [DOI] [PubMed]

[CR44] 44.Tye-Din JA, et al. Comprehensive, quantitative mapping of T cell epitopes in gluten in celiac disease. Sci Transl Med. 2010;2:41ra51. doi: 10.1126/scitranslmed.3001012. [DOI] [PubMed] [Google Scholar]

[CR45] 45.Bateman EA, et al. IgA antibodies of coeliac disease patients recognise a dominant T cell epitope of A-gliadin. Gut. 2004;53:1274–1278. doi: 10.1136/gut.2003.032755. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR46] 46.Osman AA, et al. B cell epitopes of gliadin. Clin Exp Immunol. 2000;121:248–254. doi: 10.1046/j.1365-2249.2000.01312.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR47] 47.ICC Standard No. 110/1. International Association for Cereal Science and Technology. Determination of the moisture content of cereals and cereal products (practical method). 1976.

[CR48] 48.ICC Standard No. 167. International Association for Cereal Science and Technology. Determination of crude protein in grain and grain products for food and feed by the Dumas combustion principle. 2000.

[CR49] 49.Hahne H, et al. DMSO enhances electrospray response, boosting sensitivity of proteomic experiments. Nat Methods. 2013;10:989–992. doi: 10.1038/nmeth.2610. [DOI] [PubMed] [Google Scholar]

[CR50] 50.Katoh K, Rozewicki J, Yamada KD. MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 2019;20:1160–1166. doi: 10.1093/bib/bbx108. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Identification of Isopeptides Between Human Tissue Transglutaminase and Wheat, Rye, and Barley Gluten Peptides

Barbara Lexhaller

Christina Ludwig

Katharina Anne Scherf

Abstract

Introduction

Results

Experimental approach to identify TG2-gluten isopeptides

Figure 1.

Identification of isopeptides in wheat GPTs

Table 1.

Figure 2.

Figure 3.

Identification of isopeptides in rye GPTs

Table 2.

Figure 4.

Identification of isopeptides in barley GPTs

Table 3.

Figure 5.

Discussion

Conclusion

Methods

Material

Grain Samples

Preparation of GPTs

Enzymatic digestion of GPTs

Crosslinking reaction of TG2 and GPT hydrolysates

Tryptic digestion and isopeptide clean-up

Discovery-driven mass spectrometry

Preparation of GPT databases

Identification of TG2-gluten isopeptides

Annotation of MS/MS fragments of the isopeptides

Isopeptide confirmation and creation of PRM methods

Targeted mass spectrometry

PRM data analysis

Multiple sequence alignment of gluten peptides

Supplementary information

Acknowledgements

Author contributions

Data availability

Competing interests

Footnotes

Supplementary information

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases