Abstract
Celiac disease is caused by an uncontrolled immune response to gluten, a heterogeneous mixture of wheat storage proteins, including the α-gliadins. It has been shown that α-gliadins harbor several major epitopes involved in the disease pathogenesis. A major step towards elimination of gluten toxicity for celiac disease patients would thus be the elimination of such epitopes from α-gliadins. We have analyzed over 3,000 expressed α-gliadin sequences from 11 bread wheat cultivars to determine whether they encode for peptides potentially involved in celiac disease. All identified epitope variants were synthesized as peptides and tested for binding to the disease-associated HLA-DQ2 and HLA-DQ8 molecules and for recognition by patient-derived α-gliadin specific T cell clones. Several specific naturally occurring amino acid substitutions were identified for each of the α-gliadin derived peptides involved in celiac disease that eliminate the antigenic properties of the epitope variants. Finally, we provide proof of principle at the peptide level that through the systematic introduction of such naturally occurring variations α-gliadins genes can be generated that no longer encode antigenic peptides. This forms a crucial step in the development of strategies to modify gluten genes in wheat so that it becomes safe for celiac disease patients. It also provides the information to design and introduce safe gluten genes in other cereals, which would exhibit improved quality while remaining safe for consumption by celiac disease patients.
Introduction
Celiac Disease (CD) is an intestinal T cell-mediated disease caused by the gluten fraction of wheat or the homologous proteins from barley or rye. CD has prevalence between 0.5 and 2% in human populations [1] and is characterized by a chronic intestinal inflammation upon ingestion of gluten proteins. Recently, the molecular aspects have been comprehensively addressed in several review papers [2]–[5]. In short, in CD patients CD4+ T cells are present in the lamina propria that secrete interferon-gamma upon recognition of gluten-derived peptides bound to HLA-DQ2 or HLA-DQ8 molecules present on antigen presenting cells. Strikingly, most of the gluten peptides implicated in CD require modification by the enzyme tissue transglutaminase before they can bind to the disease-predisposing HLA-DQ molecules and trigger T cell responses [2]–[5]. In addition to the adaptive CD4+ T cell response to gluten, CD is characterized by the upregulation of IL-15, an intraepithelial T cell infiltrate expressing the NKG2D receptor, and the overexpression of a ligand for NKG2D (MICA) [6], [7].
Many gluten peptides with T cell stimulatory properties have now been identified. Such peptides have been found in wheat α-, γ-, and ω-gliadins as well as in low molecular weight (LMW) and high molecular weight (HMW) glutenins [8]–[14]. Several studies have demonstrated that peptides derived from α-gliadins induce strong T cell responses in the large majority of patients, while responses to the other peptides are less frequently found [8]–[10], [13]. An α-gliadin derived 33-mer peptide (amino acid sequence LQLQPFPQPQLPYPQPQLPYPQPQLPYPQPQPF) was identified that encodes six partially overlapping T cell epitopes and has very potent T cell stimulatory properties [13]. It harbours the p56–75 peptide (LQLQPFPQPQLPYPQPQLPY) that has been identified as the dominant gluten epitope [9], [10]. Furthermore, α-gliadins are the only gluten molecules that harbor the p31-43/49 peptide that has been implicated in the innate immune response induced by gluten [7].
The α-gliadins are a gene family encoded by the Gli-2 loci, Gli-A2, Gli-B2 and Gli-D2, located on the short arm of three homoeologous chromosomes (6AS, 6BS and 6DS) of hexaploid bread wheat (Triticum aestivum L.). These loci may contain from 25–35 to even 150 α-gliadin genes per haploid genome [15]–[17], although most of these (72–95%) are presumably pseudogenes [16], [17]. Sequencing of genomic α-gliadin clones from hexaploid bread wheat enabled to differentiate the sequences according to their loci Gli-A2, Gli-B2 and Gli-D2 based on genome-specific SNPs [16], [17]. Relevant for CD, the occurrence and frequency of the HLA-DQ2 epitopes DQ2-Glia-α1, DQ2-Glia-α2 and DQ2-Glia-α3 (previously designated Glia-α20, ref. 12] and the HLA-DQ8 T-cell epitope DQ8-Glia-α1 also differs between the loci [17]. This was corroborated by the observation that T cell clones specific for the DQ2-Glia-α2 epitope did not recognise gluten derived from diploid species carrying the S-genome, ancestrally related to the B genome of bread wheat, while gluten derived from diploid A- and D-genome species was recognized [18]. Variation in T cell stimulatory capacity of cereal-derived gluten was observed with other T cell clones as well [19]–[21]. Indeed, differences have been observed in the T cell stimulatory capacity of pasta and bread wheat varieties [22], [23], but none were safe for CD patients.
Given the overall importance of the α-gliadins in CD we set out to determine the naturally existing sequence variation in CD epitopes as deduced from α-gliadin transcripts from developing wheat grains. The immunogenic potential of these epitope variants was subsequently tested in T-cell proliferation assays. This produced insight in which key amino acid changes are sufficient to abolish T-cell recognition. Finally, we verified that the observed differences in antigenicity of α-gliadin peptides derived from diploid species corresponded to differences in the antigenicity of the gluten from these species, ancestrally related to bread wheat. Altogether the results offer a molecular basis for differential CD toxicity of the wheat genomes. Based upon these results we present a rational strategy to develop genes that encode α-gliadins that are safe for consumption by celiac disease patients.
Results
Genetic variation in α-gliadin (Gli-2) transcripts
In total 3022 expressed α-gliadin sequences (expressed sequence tags and mRNA sequences from NCBI and Unigene) originating from 11 different T. aestivum L. varieties were analyzed. These α-gliadin transcripts were grouped into 55 contigs with at least 90% sequence homology. Forty per cent of the α-gliadin transcripts clustered with A genomic sequences and were attributed to locus Gli-A2, 35% originated from Gli-D2 and only 25% came from Gli-B2 (Fig. S1). After tracing all non-synonymous DNA polymorphisms 83 different transcript contigs were obtained for the 3′ region of the gene that each contained at least four sequence equivalents. This indicates a high sequence diversity among expressed α-gliadin sequences in these 11 T. aestivum L. varieties.
Variants of T cell stimulatory and innate stimulatory sequences
The N-terminal region of α-gliadins contains the p31-43 epitope implicated in the innate immune response, and the immunodominant DQ2-Glia-α1 and DQ2-Glia-α2 epitopes as well as the DQ2-Glia-α3 T cell epitope. The carboxyl-terminal part encodes the immunodominant DQ8-Glia-α1 T cell epitope ( Table 1 ). To obtain information on the immunogenic potential of the various α-gliadins, the translated amino acid sequences for each locus were checked for the presence of canonical T cell epitopes and variants thereof. Tables 2 , 3 , 4 , 5 present the most frequently expressed epitope variants (>5 transcripts each).
Table 1. Amino acid sequences of α-gliadin derived peptides.
Immune response | Restriction element | Core-Sequence | |
DQ2-Glia-α1 | Adaptive | HLA-DQ2 | P{F/Y}PQPQLPY |
DQ2-Glia-α2 | Adaptive | HLA-DQ2 | PQPQLPYPQ |
DQ2-Glia- α3 | Adaptive | HLA-DQ2 | FRPQQPYPQ |
DQ8-Glia-α1 | Adaptive | HLA-DQ8 | QGSFQPSQQ |
P31-43 | Innate | not applicable | PGQQQPFPPQQPY |
Five antigenic peptides derived from α-gliadins are known to be involved in celiac decease. For the peptides that can provoke an adaptive immune response in CD patients (HLA-DQ2 restricted or HLA-DQ8 restricted) the minimal 9-mer “canonical” epitope cores are shown. One peptide, p31-43, is known to be involved in an innate immune response observed in CD. For each of the epitopes is specified the name, which immune response it evokes, the restriction element and the amino acid sequence.
Table 2. HLA-DQ2-Glia-α1 epitope variants expressed in bread wheat.
Gli-A2 | Gli-B2 | Gli-D2 | N | |
PFPQPQLPY | 466 | 520 | 986 | |
PYPQPQLPY | 93 | 93 | ||
PFLQPQLPY | 106 | 106 | ||
PFSQPQLPY | 94 | 94 | ||
PFPQPQLSY | 7 | 54 | 61 | |
PFPHPQLPY | 29 | 29 | ||
PFPQAQLPY | 6 | 6 | ||
Total | 673 | 702 | 1375 |
Expressed variants of the DQ2-Glia-α1 epitope represented by ≥5 transcripts and the number of transcripts per epitope variants per chromosomal locus (Gli-A2, Gli-B2 or Gli-D2). N = total transcript count per variant. In bold: amino acid variation.
Table 3. DQ2-Glia-α2 epitope variants expressed in bread wheat.
Gli-A2 | Gli-B2 | Gli-D2 | N | |
PQPQLPYPQ | 607 | 607 | ||
PQPQLPYSQ | 431 | 431 | ||
FPPQLPYPQ | 382 | 382 | ||
LQPQLPYSQ | 106 | 106 | ||
FLPQLPYPQ | 31 | 31 | ||
SQPQLPYSQ | 85 | 85 | ||
PQPQLSYPQ | 54 | 54 | ||
PQPHLPYPQ | 30 | 30 | ||
PHPQLPYPQ | 29 | 29 | ||
PQPQLSYSQ | 7 | 7 | ||
PQAQLPYSQ | 6 | 6 | ||
PQPQPQYPQ | 6 | 6 | ||
Total | 629 | 419 | 726 | 1774 |
Expressed variants of the DQ2-Glia-α2 epitope represented by ≥5 transcripts and the number of transcripts per epitope variants per chromosomal locus (Gli-A2, Gli-B2 or Gli-D2). N = total transcript count per variant. In bold: amino acid variation.
Table 4. DQ2-Glia-α3 epitope variants expressed in bread wheat.
Gli-A2 | Gli-B2 | Gli-D2 | N | |
FRPQQPYPQ | 650 | 449 | 1099 | |
FPPQQPYPQ | 732 | 687 | 606 | 2025 |
FPSQQPYPQ | 179 | 8 | 187 | |
FRPQQSYPQ | 157 | 157 | ||
FPPQQPYPH | 148 | 148 | ||
FPAQQPYPQ | 56 | 56 | ||
FPPQQSYPQ | 18 | 18 | ||
FPGQQPYPQ | 20 | 20 | ||
FQPQQPYPQ | 13 | 13 | ||
FLPQQPYPQ | 8 | 8 | ||
FRPQQQYPQ | 6 | 6 | ||
Total | 1395 | 1108 | 1234 | 3737 |
Expressed variants of the DQ2-Glia-α3 epitope represented by ≥5 transcripts and the number of transcripts per epitope variants per chromosomal locus (Gli-A2, Gli-B2 or Gli-D2). In italics: DQ2-Glia-α3 variants located on the position of the innate responsive element, p31-43. N = total transcript count per variant. In bold: amino acid variation.
Table 5. DQ8-Glia-α1 epitope variants expressed in bread wheat.
Gli-A2 | Gli-B2 | Gli-D2 | N | |
QGSFQPSQQN | 186 | 381 | 567 | |
QGSFRPSQQN | 409 | 409 | ||
QGFFQPSQQN | 114 | 114 | ||
QGSFRPFQQN | 103 | 103 | ||
QVSFQPSQLN | 112 | 112 | ||
QGSFQSSQQN | 111 | 111 | ||
QGSFQPFQQN | 4 | 27 | 31 | |
QGFFQPFQQN | 6 | 6 | ||
Total | 512 | 413 | 528 | 1453 |
Expressed variants of the DQ8-Glia-α1 epitope, represented by ≥5 transcripts and the number of transcripts per epitope variants per chromosomal locus (Gli-A2, Gli-B2 or Gli-D2). N = total transcript count per variant. In bold: amino acid variation.
We observed that canonical DQ2-Glia-α1 ( Table 2 ) and DQ2-Glia-α3 epitopes ( Table 4 ) were only present in Gli-A2 and Gli-D2 transcripts and that the canonical DQ2-Glia-α2 motif ( Table 3 ) was unique for Gli-D2 transcripts (72% of all Gli-D2 transcripts contained at least one DQ2-Glia-α2 epitope core). DQ8-Glia-α1 canonical epitopes were present in Gli-D2 and Gli-B2 transcripts only ( Table 5 ). The canonical p31-43 motif was not restricted to transcripts from a specific locus and was found in transcripts from all α-gliadin loci.
In addition to the canonical epitope motifs, a large series of sequence variants with one or two amino acid substitutions were detected ( Tables 2 , 3 , 4 , 5 ). The large majority of the Gli-B2 gliadins contained sequence variants of the DQ2-Glia-α1, DQ2-Glia-α2, and DQ2-Glia-α3 epitopes with two amino acid substitutions. Furthermore, Gli-A2 transcripts harbored a proline to serine substitution at position 8 (p8) of the DQ2-Glia-α2 epitope and a sequence variant (QGSFRP(S/F)QQN, amino acid substitution in bold) of the DQ8-Glia-α1 epitopes. Importantly, the 33-mer peptide (LQLQPFPQPQLPYPQPQLPYPQPQLPYPQPQPF) that is highly resistant to degradation in the gastrointestinal tract and contains six overlapping DQ2-Glia-α1 and DQ2-Glia-α2 epitopes, conferring superior T cell stimulatory properties [13], was only observed in a subset of the α-gliadins from the Gli-D2 locus and never in the α-gliadins from the Gli-A2 and Gli-B2 loci. The latter expressed substantially truncated versions of the 33-mer (Fig. S2).
T cell stimulatory capacity of α-gliadin derived peptides
Several of the amino acid variants that we found in the α-gliadin transcriptome have never been described before while some have been described but were never tested for their T cell stimulatory capacity. In order to determine which variants are capable of inducing T cell responses, the DQ2-Glia-α1, DQ2-Glia-α2, DQ2-Glia-α3, and DQ8-Glia-α1 variants were synthesized as 15- or 16-mer peptides and tested for their capacity to bind to HLA-DQ2 and induce in vitro proliferation of HLA DQ2- or DQ8-restricted T cell clones.
DQ2-Glia-α1 variants
Virtually all peptides that carry the canonical 9-mer Glia-α1 epitope core P1{F/Y}2P3Q4P5E6L7P8Y9 (in which the glutamic acid at p6 is introduced by TG2-deamidation of the original glutamine) were able to stimulate DQ2-Glia-α1 T cells. Only an arginine residue at the position preceding the epitope core diminished T cell stimulation. In the core sequence, several amino acid substitutions diminished or abolished the T cell stimulatory capacity, such as a proline to serine substitution at p3 or p8 and a proline to alanine substitution at p5 ( Table 6 , no. 3). Peptides in which an amino acid was deleted at p3 or p4 were not causing any proliferation of the T cells. Strikingly, such safe peptides are all from locus Gli-B2 ( Table 6 , Fig. S3).
Table 6. T cell proliferation and HLA-DQ2 binding capacity of DQ2-Glia-α variants.
No. | Peptide | locus | IC50DQ2-Glia-α1 | Glia-α1T cell clone | IC50DQ2-Glia-α2 | Glia-α2T cell clone |
1 | QLQPFPQPELPYPQPE | Gli-D2 | 5 | + | 18 | + |
2 | QLQPFPQPELPYPQPQ | Gli-D2 | 5 | + | 34 | + |
3 | QLQPFPQAELPYSQPQ | Gli-D2 | 8 | ± | 11 | - |
4 | QLQPFPQPELSYPQPQ | Gli-D2 | 12 | - | 24 | - |
5 | QLQRPFPQPELPYPQPE | Gli-D2 | 14 | ± | 19 | + |
6 | QLQPFPQPELPYTH | Gli-D2 | 21 | + | 21 | - |
7 | QLQPFPHPELPYPQPQ | Gli-D2 | 42 | + | 41 | ± |
8 | QLQPFSQPELPYSQPQ | Gli-A2 | 7 | ± | 32 | - |
9 | QLQPFPQPELPYSQPQ | Gli-A2 | 57 | + | 41 | - |
10 | QLQPFPQPELPYSQPE | Gli-A2 | 67 | + | 24 | - |
11 | QPQPFL-PELPYPQPE | Gli-B2 | 4 | - | 4 | ± |
12 | QPQPF-QPELPYPQPE | Gli-B2 | 8 | - | 8 | + |
13 | QPQEFP-PELPYPQPE | Gli-B2 | 45 | - | 45 | - |
14 | QPQQFP-PELPYPQPE | Gli-B2 | 86 | - | 86 | - |
15 | QPQPFP-PELPYPQTQP | Gli-B2 | nd | - | nd | - |
IC50 DQ2-Glia-α3 | Glia-α3 T cell clone | |||||
16 | QPFRPEQPYPQPQPQ | Gli-A2/D2 | 35 | + | ||
17 | QPFPPEQPYPQPQPQ | Gli-A2/D2/B2 | 3080 | - | ||
IC50 DQ8-Glia-α1 | DQ8-Glia-α1 T cell clone | |||||
18 | LGEGSFQPSQENP | Gli-D2/B2 | 16 | + | ||
19 | LGEGSFRPSQENP | Gli-A2 | 33 | - | ||
20 | LGEGFFQPSQENP | Gli-D2 | nd | - |
Variants of the DQ2-Glia-α1 and DQ2-Glia-α2 epitopes as encoded by the α-gliadin transcriptome were synthesized as deamidated 14- to 17-mer peptides (column 1, underlined: DQ2-Glia-α1/α2 epitope region) and tested for stimulation of DQ2-Glia-α1 and DQ2-Glia-α2 specific T cell clones in a proliferation assay. ± = 100 times reduced T cell stimulation compared to the ‘canonical’ epitope; - = 1000 times reduced T cell stimulation compared to the ‘canonical’ epitope. For each epitope shorter versions of the peptide variant, including the putative epitope core flanked by at least two amino acids, were tested in a cell free in vitro peptide binding assay for binding to HLA-DQ2 antigen presenting cells (lysates from HLA-DR3/DQ2 positive EBV-transfored B-cells). IC50 DQ2-Glia-α1/α2 = mean value of the half maximal inhibitory concentration (IC50) returned by the binding assays for respectively DQ2-Glia-α1 and DQ2-Glia-α2 epitope variants. IC50 values were calculated based on the observed competition between the tested peptides and biotin-labelled indicator peptides and indicate the concentration of the tested peptide required for half maximal inhibition of the binding of the indicator peptide.
DQ2-Glia-α2 variants
Full responses of DQ2-Glia-α2 T cells were only observed against peptides that carry the core P/F1Q2P3E4L5P6Y7P8Q9. A deletion of the glutamine at p2, indicative for α-gliadins from locus Gli-B2, or substitution of this glutamine by histidine diminished or abolished the stimulatory capacity. Furthermore, a single substitution of the proline for an serine residue at either p6 or p8 abolished the T cell stimulating capacity. The latter substitution is found in α-gliadins from locus Gli-A2 ( Table 6 ).
DQ2-Glia-α3 variants
Also for this epitope several amino acid substitutions were found to destroy T cell stimulatory capacity, including an arginine to proline substitution at p2, which is found in α-gliadins from Gli-B2 ( Table 6 ).
DQ8-Glia-α1 variants
While several amino acid substitutions were found to influence T cell recognition of the canonical sequence Q1G2S3F4Q5P6S7Q8Q9, a single serine to phenylalanine substitution at p3 and a single glutamine to arginine substitution at p5 were found to completely destroy T cell stimulatory properties ( Table 6 ). While the former T cell stimulatory variants are found in α-gliadins from Gli-D2 and Gli-B2, the glutamine to arginine variants are from Gli-A2 ( Table 5 , Fig. S2).
Thus, the α-gliadins encoded by Gli-A2 of bread wheat are marked with a specific single amino acid substitution (P to S at p8) and lack the capacity to stimulate Glia-α2 T cell clones, whereas α-gliadins encoded by Gli-B2 carry a deletion that prevents Glia-α1 T cell clone stimulation. Alpha-gliadins with the two intact epitopes, Glia-α1 and Glia-α2, are encoded by Gli-D2 ( Tables 2 and 3 ).
T cell stimulatory capacity of diploid wheat accessions
Previously, differential reactivity of α-gliadin specific T cell clones against gluten extracts from diploid wheat accessions has been reported [18], [19]. In order to link such differential reactivity to the presence of specific epitope variants, a panel of diploid wheat accessions containing either the A, S or D genome was tested for their reactivity of an α-gliadin specific monoclonal antibody (mAb) and T cells.
Pepsin-trypsin digests of gluten extracts from kernels of 29 diploid Triticum and Aegilops accessions were prepared and tested in a competition ELISA with a mAb specific for a sequence partially overlapping with the DQ2-Glia-α1 and DQ2-Glia-α2 epitopes ( Fig. 1A ) and, after treatment with TG2, with T cell clones specific for either the DQ2-Glia-α1 and DQ2-Glia-α2 epitope ( Fig. 1B and 1C ). The results indicate that the mAb reacts strongly with all extracts except three derived from diploids expressing the S genome ( Fig. 1A ). Similarly, and in agreement with previous results [18], extracts from A, S and D origin were capable of stimulating the DQ2-Glia-α1 specific T cell clone ( Fig. 1B ) while the extracts of the diploids expressing the A genome failed to stimulate the DQ2-Glia-α2 specific T cell clone ( Fig. 1C ).
Based on our observation that the α-gliadins expressed from locus Gli-A2 of the T. aestivum A genome carry a variant DQ2-Glia-α2 epitope in which the proline at p8 has been replaced by a serine, and our experimental result that introducing this substitution in a peptide leads to loss of DQ2-Glia-α2 T cell stimulatory properties ( Table 6 ), we wanted to determine if this amino acid substitution was indeed the cause of loss of immunogenicity. For this purpose we sequenced the α-gliadin locus of three diploid wheat (T. monococcum, A genome) accessions. In agreement with the results from hexaploid wheat transcripts (Fig. S2) we observed that the α-gliadin transcripts from T. monococcum contain only a single form of DQ2-Glia-α1 and DQ2-Glia-α2. Moreover, in the DQ2-Glia-α2 epitope the proline at p8 was consistently replaced by a serine (Table S1). Together these results establish that this naturally occurring single amino acid substitution is sufficient to completely eliminate the T cell stimulatory properties of the DQ2-Glia-α2 epitope in gluten.
Differential reactivity of the DQ2-Glia-α3 specific T cells towards the extracts of the diploids correlated with an arginine to proline replacement at p2 in α-gliadins derived from the S-genome, FRPQQPYPQ→ FPPQQPYPQ ( Table 6 ).
Elimination of α-gliadin toxicity by a naturally occurring single amino acid substitution
The large majority of known antigenic peptides derived from wheat gluten, as well as homologous peptides derived from the hordeins from barley and the secalins from rye, contain a proline at p8 [24]. Based on our observation that a single proline to serine substitution at p8 induced unresponsiveness of DQ2-Glia-α2 specific T cells and the previous observation that a similar substitution at p8 of the DQ2-Glia-α1 epitope induced T cell unresponsiveness ( Table 6 ; peptide no. 4), we investigated if a similar substitution would also eliminate the antigenic properties of the DQ2-Glia-α3 epitope. Wild type versions of the DQ2-Glia-α1, -α2 and -α3 epitopes as well as versions in which the proline at p8 was substituted by a serine were synthesized and tested in T cell proliferation studies. As expected neither the substituted DQ2-Glia-α1 epitope (QLQPFPQPELSYPQPQ) nor the substituted DQ2-Glia-α2 epitope (QLQPFPQPELPYSQPQ) induced T cell proliferation of respectively DQ2-Glia-α1 and DQ2-Glia-α2 specific T cells ( Table 6 ). Likewise, the substituted DQ2-Glia-α3 epitope failed to induce T cell activation ( Fig. 2A ).
We also analysed the effects of proline to serine substitutions in the 33-mer α-gliadin derived peptide that encodes 6 partially overlapping antigenic DQ2-Glia-α1 and -α2 sequences ( Fig. 2B ) as well as in an elongated version of the 33-mer which also encodes the DQ2-Glia-α3 epitope (Table S2). In all cases, the proline to serine substitutions completely abrogated the response of DQ2-Glia-α1, DQ2-Glia-α2 and DQ2-Glia-α3 specific T cell clones ( Fig. 2B , Table S2).
Finally, we tested the effect of systematic introduction of single, double, triple and quadruple P to S substitutions in the DQ2-Glia-α-1 peptide with five T cell clones isolated from small intestinal biopsies of 3 CD patients. T cell responses of single T cell clones were only observed for two of the single P to S substitutions at p3 and p5. In contrast, with a single P to S substitution at p8 and in all but one of the other cases the substitutions completely eliminated the T cell response of all five T cell clones ( Fig. 3 )
Thus, the naturally occurring proline to serine substitution constitutes a universal approach to remove the antigenic properties of HLA-DQ2 restricted α-gliadin peptides.
Discussion
Although T cell responses to peptides derived from α-, γ- and ω-gliadins as well as from HMW- and LMW-glutenins have been described, various studies have indicated that the α-gliadins are among the most immunogenic regarding CD [8]–[10], [13], [25]. A crucial step towards the elimination of gluten toxicity would thus be the elimination of T cell stimulatory α-gliadin sequences. Our extensive genetic analysis of over 3000 α-gliadin transcripts from different bread wheat accessions showed a high heterogeneity of the α-gliadins genes and considerable differences in the number of T cell stimulatory sequences encoded by the various α-gliadin genes. We identified three major factors determining these differences: i) the length of tandem repeats of antigenic sequences; ii) natural amino acid substitutions that affect the antigenicity of T cell epitopes, and iii) amino acid deletions that eliminate the antigenicity of T cell epitopes.
The α-gliadins from locus Gli-D2 generally encode several copies of both the DQ2-Glia-α1 and DQ2-Glia-α2 epitopes in addition to the DQ2-Glia-α3 and DQ8-Glia-α1 epitopes, Gli-A2 genes usually encode only the DQ2-Glia-α1 and DQ2-Glia-α3 epitopes while the Gli-B2 genes encode no or at most one DQ2-Glia-α1 epitope, next to the DQ8-Glia-α1 epitopes. The 33-mer sequence with 6 T cell stimulatory sequences [13] was only found in a minority of the α-gliadins analyzed, all of which are expressed from Gli-D2. Many natural occuring amino acid substitutions affecting the antigenicity of the canonical α-gliadin peptides were identified. Typical examples are the proline to serine substitution at p8 in the DQ2-Glia-α2 epitope and the arginine to proline substitution at p2 in the DQ2-Glia-α3 epitope, that both completely eliminate the T cell stimulatory properties of these peptides.
The analysis of gluten extracts from the diploid wheat varieties underscores these observations and provides a molecular basis for the previous observation that A-genome diploid T. monococcum varieties lack the DQ2-Glia-α2 epitope [18]. All α-gliadins from this genome encode an altered version of the DQ2-Glia-α2 epitope with a serine at p8 that fails to induce T cell responses. We observed that a similar substitution eliminates the T cell stimulatory properties of the DQ2-Glia-α1 and DQ2-Glia-α3 epitopes. The more variable reaction pattern of T-cells and antibodies towards gluten extracts of the S-genomes reflects the higher level of genetic variation in these outcrossing species. These α-gliadins contain another variant of the DQ2-Glia-α1 and DQ2-Glia-α2 region that does not incude the A-genome specific serine at p8. Furthermore, amino acid deletions in the canonical α-gliadin peptides prohibit binding to HLA-DQ2 and hence T cell recognition. A typical example is the deletion of the glutamine at p4 in the DQ2-Glia-α1 epitope which generates a peptide that no longer binds to HLA-DQ2, presumably due to defective docking of the anchor residues into their respective pockets in the HLA-DQ2 molecule. Our results confirm previous observations [18], [19] that the α-gliadin locus Gli-D2 encodes the most toxic α-gliadins while substantially less toxicity is associated with those from Gli-A2 and Gli-B2, but also provide the molecular basis for these differences.
Unfortunately, due to the complexity of both the Gli-2 gene family and the wheat genome, it will be a difficult task to generate tetraploid pasta and hexaploid bread wheat that is entirely safe for consumption by all CD patients by conventional breeding methods. Our results now provide a rationale for an alternative approach as we demonstrate that by the introduction of naturally occurring amino acid substitutions the toxicity of all four T cell epitopes in α-gliadins can be eliminated. Using novel methods such as zinc finger nucleases [26]–[28] we can introduce the underlying SNPs as specific mutations into the α-gliadin genes of wheat to eliminate toxicity completely. Technically this will not be easy, but our results indicate precisely the three complementary sets of actions that need to be performed.
i) In the Gli-B2-derived α-gliadins analyzed, none of the DQ2 epitopes are present due to a single amino acid deletion in the region encoding the DQ2-Glia-α1 and DQ2-Glia-α2 epitopes, which generates a peptide that has decreased binding affinity for HLA-DQ2 and is no longer recognized by T cells. Therefore, a single amino acid substitution in the DQ2-Glia-α3 epitope will result in a peptide that has completely lost HLA-DQ2 binding properties and T cell stimulatory properties. To eliminate the remaining DQ8-Glia-α1 epitope in some Gli-B2 α-gliadins, a single glutamine to arginine substitution, which naturally occurs in the α-gliadins from the A-genome, would suffice. Such a minimally genetically modified B-genome α-gliadin gene would thus no longer encode any T cell stimulatory peptides. Moreover, by starting with an α-gliadin gene in which the sequence of the p31-43 peptide is naturally altered (for example the gene encoding protein no. 9 in SI 2), the chance of innate immune stimulation by a protein derived from such a gene would also be minimized.
ii) For Gli-A2 α-gliadins, the approach to eliminate toxicity would be to introduce two proline to serine substitutions at p8 in the DQ2-Glia-α1 and DQ2-Glia-α3 epitopes present. As these α-gliadins genes encode a shorter version of the immunodominant 33-mer in which the DQ2-Glia-α-2 epitope is already non-functional, and contain a version of the DQ8-Glia-α1 epitope that has no T cell stimulatory properties, these two substitutions would completely remove toxicity in proteins encoded by such modified genes.
iii) Regarding the Gli-D2 α-gliadins, we found that the proline to serine substitutions at p8 completely abrogated the in vitro T cell stimulatory properties of the 33-mer peptide and of an elongated version of the 33-mer that also encodes the DQ2-Glia-α3 epitope. This result underscores our previous observation that most T cell stimulatory gluten peptides have a proline at p8 [24]. However, to render α-gliadins from the D-genome non-toxic, up to seven substitutions need to be introduced in a single gene.
An alternative approach is to design safe α-gliadin genes that can subsequently be introduced into celiac disease safe cereals such as rice or maize, for the production of gluten proteins. As such modified proteins will be very similar to existing α-gliadins they will most likely have indistinguishable technological properties. Thus, such gluten proteins could enhance the baking properties of these cereal crops, or they could be extracted from these crops and used as an ingredient to generate novel high quality foods for celiac disease patients. For the generation of high quality cereals that can replace wheat-based products the simple introduction of detoxified α-gliadins is unlikely to be sufficient, as baking quality is mostly determined by the HMW and LMW glutenin proteins. Therefore, additional studies will have to investigate how other gluten proteins can be detoxified as there is substantial evidence that these contain T cell stimulatory peptides as well. In previous studies we provided evidence that such epitopes can be found in the γ-gliadins as well as in the LMW- and HMW-glutenins [11], [12] and others have extended these observations [14], [25]. In particular, we found that T cell responses to LMW-glutenins were found in children while these are much less frequent in adults [12], [25]. Moreover, in a recent study a highly antigenic ω-gliadin peptide was described [25] that is identical to an antigenic hordein-derived peptide reported by us earlier [20]. In essence this hordein/omega peptide is a sequence variant of the DQ2-Glia-α1 peptide and also carries a proline at the p8 position [20]. It is therefore feasible that the toxicity of this peptide can be eliminated by a proline to serine substitution at p8 as well. Preliminary results show that amino acid substitutions similar to those that destroy the T cell stimulatory properties in α-gliadins might also be effective for γ-gliadin derived epitopes (Salentijn et al, in prep), but it is likely that for the LMW- and HMW-glutenins other approaches will be required. This will be the subject of further studies.
In conclusion, we have demonstrated that by utilizing naturally occurring amino acid substitutions the toxicity of the four T cell epitopes in α-gliadins can be eliminated. Such modified proteins will most likely display indistinguishable technological properties. Thus, our results provide a rational approach to eliminate CD related toxicity from α-gliadins, which represents a first but crucial step towards the realization of safe gluten containing food products for CD patients.
Materials and Methods
Analysis of α-gliadin transcripts from diploid wheat varieties
T. monococcum accessions CGN10500, CGN12035 and CGN10555 (CGN, Wageningen, The Netherlands) were used for cloning and sequencing of α-gliadin transcripts. Plants were grown under greenhouse conditions. Developing green kernels of single plants were harvested and used for RNA isolation according to Doyle and Doyle [29] but with 1% (w/v) poly-(vinylpyrrolidone)-10 in the extraction buffer. For the production of first strand cDNA 1 µg of total RNA was treated with DNAse I (Invitrogen, amplification grade; 18068-015) followed by RT PCR (Invitrogen SuperScript III First-Strand Synthesis System for RT-PCR; 18080-051) using random hexamer primers in a final reaction volume of 20 µl. Primers specific for α-gliadin genes, located on the conserved sequences at the 5′ and 3′ end of the coding region of the α-gliadin, were used to amplify α-gliadin transcripts from the cDNA samples (αF1: 5′-atgaaRaCmtttcYcatc and α5R: 5′-gttagtaccgaNgatgcc). The PCR conditions: 5 min. at 94°C, 30 cycles (94°C for 1 min., 49°C for 1 min. and 72°C for 2 min), 72°C for 10 min, 25 µl reaction volume. The PCR products were cloned and sequenced.
Characterization of expressed α-gliadin sequences
Over 3200 T. aestivum expressed sequence tags (ESTs) and mRNAs designated as α-gliadin or α/β-gliadin were downloaded from the NCBI UniGene library (Ta.15268, Ta.23792, Ta.24084, Ta.25210, Ta.27702, Ta.28482) (http://www.ncbi.nml.nih.gov/UniGene) on 13 April 2007. The sequences were from T. aestivum libraries of various tissues, treatments and cultivars (Chinese Spring 34.2%; Glenlea 20.8%; Cheyenne 10.4%; Recital 7.8%; Mercia 7.2%; unknown cultivar 8.9%; Wyuna 3.2%; HiLine 2.9%; Butte 86, 2.4%; Hartog 1.5%; Soleil 0.6%; Nostar 0.1%; Novobirskaya 67, 0.1%). The DNA sequences were aligned using the SeqMan II (DNAstar) and first assembled at a minimum match percentage of 60%, gap lengths 3200, maximum match size 50 bp. BLAST analysis of the contigs was performed to verify the α-gliadin identity of the contigs and short (<100 bp) and bad sequences were discarded. The transcript contigs of ≥60% homologous sequences were trimmed up to the start and stop codons. Next, the sequences were reassembled at 90% homology, which resulted in 55 α-gliadin transcript contigs containing 1 to 475 sequences. The 3′ end was covered by 50 contigs (2911 transcripts) whereas the 5′ end was present in 30 contigs (2753 transcripts). The consensus of these contigs were saved in separate files and used for phylogenetic studies to deduce the genome of origin of the sequences in each contig.
Phylogenic analysis
With the aim to deduce the locus, Gli-A2, Gli-B2 or Gli-D2, from which the transcripts were expressed, the 55 α-gliadin EST consensus nucleotide sequences obtained from clustering, were aligned using Clustal W, MEGA 4 [30], together with 56 genomic DNA sequences of known origin, i.e. derived from the diploid wheat species T. monococcum (A genome), T. speltoides (S genome) and Aegilops tauschii (D genome) [18] and DNA sequences that were previously assigned to a locus [31]. The sequences that covered the 5′ region of the α-gliadin sequences were trimmed up to the start and up to nucleotides coding for the conserved amino acid motif PIS, located just in front of the first glutamine repeat, to cover the same region and subsequently used to generate a Neighbor-Joining tree (bootstrap test of 1000 replicates, pairwise deletion of gaps and missing data, Kimura 2-parameter, Substitutions to Include Transitions + Transversions; Pattern among Lineages Homogeneous, Uniform rates among sites, number of sites = 750, in MEGA 4)(Fig. S1).
Sequence variation in epitope regions
To analyze all sequence variation, the 55 α-gliadin EST contigs, now assigned to a specific chromosome, were reassembled one by one at 99–100% match (SeqMan II, Lasergene, DNAstar). This yielded 717 different allelic variants. The consensus nucleotide sequences were translated (MEGA 4) and explored for epitopes and surrounding sequence regions using a text explorer program (PatternResearch, in house developed) after which the output file was analysed in Excel.
T cell clones, T cell proliferation and HLA-DQ2 binding assays
Gluten specific T cell clones were generated from small intestinal biopsies of celiac disease patients as described before [8], [11], [12]. All patients signed an informed consent form which was approved by the hospital ethics committee. Proliferation assays were performed in triplicate in 150 µl Iscove's Modified Dulbecco's Medium (Bio Whittaker, Verviers, Belgium) with 10% pooled normal human serum in 96 well flat-bottom plates using 2×104 gluten specific T cells stimulated with 105 irradiated HLA-DQ2-matched allogeneic peripheral blood mononuclear cells (3000 rad) in the presence or absence of antigen (1–10 µg/ml) [8], [11], [12]. After 2 days 3H-thymidine (0.5 µCi/well) was added to the cultures, and 18–20 hours thereafter the cells were harvested. 3H-thymidine incorporation in the T cell DNA was determined with a liquid scintillation counter (1205 Betaplate Liquid Scintillation Counter, LKB Instruments, Gaithersburg, MD). A binding assay was performed as described previously [24].
Supporting Information
Footnotes
Competing Interests: The authors have declared that no competing interests exist.
Funding: This study was supported by the Celiac Disease Consortium, an Innovative Cluster approved by The Netherlands Genomics Initiative and partially funded by the Dutch Government (BSIK03009), by the Center for Medical Systems Biology (CMSB), a center of excellence approved by The Netherlands Genomics Initiative/Netherlands Organization for Scientific Research (NWO), and by The Netherlands Ministry of Agriculture, Nature, and Food Safety (project KB-05-001-019). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Rewers M. Epidemiology of celiac disease: what are the prevalence, incidence, and progression of celiac disease? Gastroenterology. 2005;128:S47–S51. doi: 10.1053/j.gastro.2005.02.030. [DOI] [PubMed] [Google Scholar]
- 2.Koning F. Celiac Disease: caught between a rock and a hard place. Gastroenterology. 2005;129:1294–1301. doi: 10.1053/j.gastro.2005.07.030. [DOI] [PubMed] [Google Scholar]
- 3.Sollid LM. Coeliac disease: dissecting a complex inflammatory disorder. Nat Rev Immunol. 2002;2:647–55. doi: 10.1038/nri885. [DOI] [PubMed] [Google Scholar]
- 4.Stepniak D, Koning F. Celiac Disease: sandwiched between innate and adaptive immunity. Hum Immunol. 2006;67:460–468. doi: 10.1016/j.humimm.2006.03.011. [DOI] [PubMed] [Google Scholar]
- 5.Kagnoff MF. Celiac disease: pathogenesis of a model immunogenetic disease. J Clin Investigation. 2007;117:41–49. doi: 10.1172/JCI30253. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Meresse B, Chen Z, Ciszewski C, Tretiakova M, Bhagat G, et al. Coordinated induction by IL15 of a TCR-independent NKG2D signaling pathway converts CTL into lymphokine-activated killer cells in celiac disease. Immunity. 2004;21:357–366. doi: 10.1016/j.immuni.2004.06.020. [DOI] [PubMed] [Google Scholar]
- 7.Hüe S, Mention J-J, Monteiro RC, Zhang SL, Cellier C, et al. A direct role for NKG2D/MICA interaction in villous atrophy during celiac disease. Immunity. 2004;21:367–377. doi: 10.1016/j.immuni.2004.06.018. [DOI] [PubMed] [Google Scholar]
- 8.van de Wal Y, Kooy Y, van Veelen P, Pena S, Mearin L, et al. Small intestinal cells of celiac disease patients recognize a natural pepsin fragment of gliadin. Proc Natl Acad Sci USA. 1998;95:10050–10054. doi: 10.1073/pnas.95.17.10050. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Arentz-Hansen H, Körner R, Molberg Ø, Quarsten H, Vader W, et al. The intestinal T cell response to α-gliadin in adult celiac disease is focused on a single deamidated glutamine targeted by tissue transglutaminase. J Exp Med. 2000;191:603–612. doi: 10.1084/jem.191.4.603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Anderson RP, Degano P, Godkin AJ, Jewell DP, Hill AV. In vivo antigen challenge in celiac disease identifies a single transglutaminase-modified peptide as the dominant A-gliadin T-cell epitope. Nat Med. 2000;6:337–342. doi: 10.1038/73200. [DOI] [PubMed] [Google Scholar]
- 11.van de Wal Y, Kooy YMC, van Veelen P, Vader W, August SA, et al. Glutenin is involved in the gluten-driven mucosal T cell response. Eur J Immunol. 2000;29:3133–3139. doi: 10.1002/(SICI)1521-4141(199910)29:10<3133::AID-IMMU3133>3.0.CO;2-G. [DOI] [PubMed] [Google Scholar]
- 12.Vader W, Kooy Y, van Veelen P, de Ru A, Harris D, Benckhuijsen W, et al. The gluten response in children with celiac disease is directed toward multiple gliadin and glutenin peptides. Gastroenterology. 2002;122:1729–1737. doi: 10.1053/gast.2002.33606. [DOI] [PubMed] [Google Scholar]
- 13.Shan L, Molberg Ø, Parrot I, Hausch F, Filiz F, et al. Structural basis for gluten intolerance in celiac sprue. Science. 2002;297:2275–2279. doi: 10.1126/science.1074129. [DOI] [PubMed] [Google Scholar]
- 14.Qiao S-W, Bergseng E, Molberg Ø, Jung G, Fleckenstein B, et al. Refining the rules of gliadin T cell epitope binding to the disease-associated DQ2 molecule in celiac disease: importance of proline spacing and glutamine deamidation. J Immunol. 2005;175:254–261. doi: 10.4049/jimmunol.175.1.254. [DOI] [PubMed] [Google Scholar]
- 15.Okita TW, Cheesbrough V, Reeves CD. Evolution and heterogeneity of the α-/β-type and γ-type gliadin DNA sequences. J Biol Chem. 1985;260:8203–8213. [PubMed] [Google Scholar]
- 16.Anderson OD, Litts JC, Greene FC. The α-gliadin gene family. I. Characterization of ten new wheat α-gliadin genomic clones, evidence for limited sequence conservation of flanking DNA, and Southern analysis of the gene family. Theor Appl Genet. 1997;95:50–58. [Google Scholar]
- 17.van Herpen TWJM, Goryunova SV, van der Schoot J, Mitreva M, Salentijn EMJ, et al. Alpha-gliadin genes from the A, B, and D genomes of wheat contain different sets of celiac disease epitopes. BMC Genomics. 2006;7:1. doi: 10.1186/1471-2164-7-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Molberg Ø, Uhlen AK, Jensen T, Flaete NS, Fleckenstein B, et al. Mapping of gluten T-cell epitopes in the bread wheat ancestors: implications for celiac disease. Gastroenterology. 2005;128:393–401. doi: 10.1053/j.gastro.2004.11.003. [DOI] [PubMed] [Google Scholar]
- 19.Spaenij-Dekking L, Kooy-Winkelaar Y, van Veelen P, Drijfhout JW, Jonker H, et al. Natural variation in toxicity of wheat: potential for selection of nontoxic varieties for celiac disease patients. Gastroenterology. 2005;129:797–806. doi: 10.1053/j.gastro.2005.06.017. [DOI] [PubMed] [Google Scholar]
- 20.Vader LW, Stepniak DT, Bunnik EM, Kooy YMC, De Haan W, et al. Characterization of cereal toxicity for celiac disease patients based on protein homology in grains. Gastroenterology. 2003;125:1105–1113. doi: 10.1016/s0016-5085(03)01204-6. [DOI] [PubMed] [Google Scholar]
- 21.Vader LW, de Ru A, van der Wal Y, Kooy YMC, Benckhuijsen W, et al. Specificity of tissue transglutaminase explains cereal toxicity in celiac disease. J Exp Med. 2002;195:643–649. doi: 10.1084/jem.20012028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Van den Broeck HC, de Jong HC, Salentijn EMJ, Dekking L, Bosch D, et al. Presence of celiac disease epitopes in modern and old hexaploid wheat varieties. Wheat breeding may have contributed to increased prevalence of celiac disease. Theor Appl Genet. 2010;121:1527–1539. doi: 10.1007/s00122-010-1408-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Van den Broeck HC, Chen H, Lacaze X, Dusautoir J-C, Gilissen L, et al. In search of tetraploid wheat accessions reduced in celiac disease-related gluten epitopes. Mol BioSystems. 2010;6:2206–2213. doi: 10.1039/c0mb00046a. [DOI] [PubMed] [Google Scholar]
- 24.Stepniak D, Wiesner M, de Ru AH, Moustakas AK, Drijfhout JW, et al. Large-scale characterization of natural ligands explains the unique gluten-binding properties of HLA-DQ2. J Immunol. 2008;180:3268–3278. doi: 10.4049/jimmunol.180.5.3268. [DOI] [PubMed] [Google Scholar]
- 25.Tye-Din JA, Stewart JA, Dromey JA, Beissbarth T, van Heel DA, et al. Comprehensive, quantitative mapping of T cell epitopes in gluten in celiac disease. Sci Transl Med. 2010;2:41ra51. doi: 10.1126/scitranslmed.3001012. [DOI] [PubMed] [Google Scholar]
- 26.Jin Li J, Hsia A-P, Schnable PS. Recent advances in plant recombination. Curr Opin Plant Biol. 2007;10:131–135. doi: 10.1016/j.pbi.2007.01.007. [DOI] [PubMed] [Google Scholar]
- 27.Lloyd A, Plaisier CL, Carroll D, Drews GN. Targeted mutagenesis using zinc-finger nucleases in Arabidopsis. Proc Natl Acad Sci USA. 2005;102:2232–2237. doi: 10.1073/pnas.0409339102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.de Pater S, Neuteboom LW, Pinas JE, Hooykaas PJ, van der Zaal BJ. ZFN-induced mutagenesis and gene-targeting in Arabidopsis through Agrobacterium-mediated floral dip transformation. Plant Biotechnol J. 2009;7:821–835. doi: 10.1111/j.1467-7652.2009.00446.x. [DOI] [PubMed] [Google Scholar]
- 29.Doyle JJ, Doyle JL. A rapid DNA isolation procedure from small quantities of fresh leaf tissues. Phytochem Bull. 1987;19:11–15. [Google Scholar]
- 30.Tamura K, Dudley J, Nei M, Kumar S. MEGA4: Molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol. 2007;24:1596–1599. doi: 10.1093/molbev/msm092. [DOI] [PubMed] [Google Scholar]
- 31.Kawaura K, Mochida K, Ogihara Y. Expression profile of two storage-protein gene families in hexaploid wheat revealed by large-scale analysis of expressed sequence tags. Plant Physiol. 2005;139:1870–1880. doi: 10.1104/pp.105.070722. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.