Abstract
Previously, we have shown that statistical synergism between amino acid variants in thyroglobulin (Tg) and specific HLA-DR3 pocket sequence signatures conferred a high risk for autoimmune thyroid disease (AITD). Therefore, we hypothesized that this statistical synergism mirrors a biochemical interaction between Tg peptides and HLA-DR3, which is key to the pathoetiology of AITD. To test this hypothesis, we designed a recombinant HLA-DR3 expression system that was used to express HLA-DR molecules harboring either AITD susceptibility or resistance DR pocket sequences. Next, we biochemically generated the potential Tg peptidic repertoire available to HLA-DR3 by separately treating 20 purified human thyroglobulin samples with cathepsins B, D, or L, lysosomal proteases that are involved in antigen processing and thyroid biology. Sequences of the cathepsin-generated peptides were then determined by matrix-assisted laser desorption ionization time-of-flight-mass spectroscopy, and algorithmic means were employed to identify putative AITD-susceptible HLA-DR3 binders. From four predicted peptides, we identified two novel peptides that bound strongly and specifically to both recombinant AITD-susceptible HLA-DR3 protein and HLA-DR3 molecules expressed on stably transfected cells. Intriguingly, the HLA-DR3-binding peptides we identified had a marked preference for the AITD-susceptibility DR signatures and not to those signatures that were AITD-protective. Structural analyses demonstrated the profound influence that the pocket signatures have on the interaction of HLA-DR molecules with Tg peptides. Our study suggests that interactions between Tg and discrete HLA-DR pocket signatures contribute to the initiation of AITD.
Introduction
The major histocompatibility complex (MHC)3 II molecule is intimately associated with the initiation of autoimmunity. Studies in several MHC II-associated autoimmune conditions, most notably type 1 diabetes, have now convincingly demonstrated that susceptibility to disease is caused by certain structural features of the MHC II peptide binding cleft that influence the display of immunogenic peptides (1–5). Similarly, we have shown that the autoimmune thyroid diseases (AITD), Graves disease (GD) and Hashimoto thyroiditis (HT), are strongly associated with unique HLA-DR peptide binding cleft sequences (6, 30). Mechanistically, these pocket amino acid signatures could accommodate autoantigenic peptides derived from any of the thyroid-specific proteins, namely thyroglobulin (Tg), thyroid peroxidase, and thyrotropin receptor. Indeed, there exist data demonstrating that thyroglobulin, which constitutes 80% (8) of total thyroidal protein, is an important thyroidal target and participant in triggering AITD (reviewed in Ref. 9). Along a similar vein of thought, there is mounting evidence that suggests a role for polymorphisms in the thyrotropin receptor, the specific target of Graves disease, in the initiation of Graves hyperthyroidism (28).
For the case of thyroid autoimmunity, the connection between MHC II susceptibility signatures and how they influence autoantigenic peptide binding is gradually becoming unveiled. Work from our own group has shown that a single amino acid variation in the HLA-DR3 peptide binding cleft, resulting in an arginine at position 74 of the β chain (DRβ-Arg-74), was strongly associated with GD (6). Moreover, upon co-inheriting DRβ-Arg-74 and a disease-associated Tg SNP (R1999W), the risk of developing GD was elevated significantly beyond the additive effect of these two variants, resulting in an odds ratio of 15 (7) Thus, we hypothesized that this statistical interaction mirrors an in vivo biochemical interaction between thyroglobulin peptides and the disease-associated pocket signature of HLA-DR3.
Given the potential importance of Tg/HLA-DR3 interactions, factors that could influence the generation of Tg-derived peptides, and/or the binding of a given thyroglobulin peptide to HLA-DRβ-Arg-74, would be expected to be strongly linked to the etiology of AITD. For example, cathepsins B, D, and L, endosomal proteases involved in antigen processing (11), are also key enzymes in Tg degradation (12) and thyroid biology (13, 14), potentially making them major determinants of the Tg peptidic repertoire (15). In addition to extrinsic factors such as proteases, factors that are intrinsic to Tg, such as polymorphisms, could, by virtue of the induction of either a change in conformation or a change in the proteolytic cleavage site, render a given region more or less sensitive to digestion and thus alter the pool of bio-available peptides (reviewed in Ref, 16).
The goals of this study were to examine, at both a biochemical and structural level, the features of the HLA-DR susceptibility and resistance variants that influence Tg peptide binding. Toward achieving this end, we have designed an Escherichia coli-based system that enabled us to make functional versions of “empty” HLA-DR3. Furthermore, we have used these recombinant MHC II molecules to examine binding to selected peptides from a putative bioavailable Tg peptide pool, as well as to two Tg-derived peptides that were previously reported to bind to HLA-DR3 (17). Here, we present data that shed insights on how susceptibility and resistance signatures of the HLA-DR peptide binding pocket influence the selection of the Tg peptides that can be presented to autoreactive T-cells in the initiation of AITD.
EXPERIMENTAL PROCEDURES
Purification of Thyroglobulin from Human Thyroid Tissues
The protocol was approved by the University of Cincinnati Institutional Review Board for obtaining de-identified tissues. Twenty thyroid samples were obtained either from healthy portions of glands removed at surgery or from thyroids taken at autopsy (total death time ≤16 h) that were purchased from the NCBI Brain and Tissue Bank at the University of Maryland. Thyroid tissue samples were subsequently stored at −80 °C until use. Each thyroid tissue (typically 0.2–0.6 g) was thawed and resuspended in a 10-fold volume/tissue weight in 20 mm disodium hydrogen phosphate, 50 mm KCl, 10% glycerol, pH 7.0, in the presence of 1 mm phenylmethylsulfonyl fluoride, 1 mm EDTA, caproic acid (added 25 μl per 25 ml of buffer), and a mixture of protease inhibitors (catalogue number 04 693 116 001, Roche Applied Science; 1 tablet added per 25 ml of buffer) The sample was treated in a Dounce homogenizer, and protein was precipitated, via the addition of 0.4 g/ml ammonium sulfate. Precipitate was next resuspended in 10–20 ml of resuspension buffer. Dialysis followed, using a membrane with 15,000 molecular weight cutoff against 3 liters of 20 mm disodium hydrogen phosphate, 50 mm KCl, 1 mm EDTA, and 10% glycerol, pH 7.0. The sample was filtered, using 0.22-μm membranes, and was applied to a Sephacryl S-200 (GE Healthcare) size exclusion column. The thyroglobulin-containing portion, which came off in the void volume, was then applied to an anion exchange column (Anx-Sepharose 4 fast flow, catalogue number 17-1287-10, GE Healthcare) and was run at 2 ml/min, using a linear gradient from 50 to 600 mm KCl. Thyroglobulin eluted in a broad peak, spanning 200–600 mm KCl. Fractions were pooled and were subsequently concentrated to 1 ml, using a filter (Pierce) with a 20,000 molecular weight cutoff. The concentrate was diluted 1:20 in the final buffer (10 mm phosphate, 25 mm KCl, 10% glycerol, pH 7.0) and was then concentrated to 1–10 mg/ml protein, as determined using a protein assay dye reagent kit (Bio-Rad). Samples were snap-frozen in liquid nitrogen and were subsequently stored at −80 °C.
DNA Purification from Thyroid Tissue
DNA was extracted from a small section of each thyroid tissue, using the Puregene kit (Gentra Systems, Minneapolis, MN), according to the manufacturer's protocol.
SNP Genotyping
SNPs, identified within the Tg gene, were analyzed by a fluorescent-based restriction fragment length polymorphism method, as described previously (18).
Cathepsin Digestion of Individual Thyroglobulin Samples
Cathepsins B, D, and L were used separately to digest Tg. Digestions were performed as follows: to 40 μg of human Tg (in 33 μl of 10% glycerol, 10 mm phosphate, pH 7.0, 25 mm KCl), 1.875 μl of 1 m citrate, pH 3.5, 0.375 μl of 1 m MgCl2, 37.15 μl of distilled H2O, and 2.6 μl of cathepsin (resuspended in 500 mm citrate, pH 3.5, 50% glycerol)) were added. The final cathepsin activity was 0.085 units/μl for cathepsins B and D and 0.003 units/μl for cathepsin L. Cathepsins B (Sigma) and D (Sigma) were purified from human liver, whereas cathepsin L (Cal Biochem) was derived from a human cDNA, expressed recombinantly. Digestions were performed at 37 °C for 16–20 h. Separation of the digested Tg protein on an SDS-7.5% polyacrylamide gel versus undigested control verified that the digestion into constituent peptides was nearly complete.
Mass Spectroscopy
Peptides profiles of cathepsin-digested thyroglobulins were analyzed using matrix-assisted laser desorption ionization time-of-flight-mass spectroscopy. Desaltation on target was used to eliminate peptide loss during peptide preparation for matrix-assisted laser desorption ionization analysis. α-Cyano-4-hydroxycinnamic acid was used as matrix and prepared as a saturated solution in a 1:4:4 (v/v) mixture of formic acid/water/isopropyl alcohol. A small aliquot (1 μl) of mixture of peptide/matrix (1:1) was spotted onto the sample plate. As soon as crystals were formed, the excess liquid was removed by vacuum aspiration. The spot was then washed twice for a few seconds with 2 μl of 0.1% trifluoroacetic acid. All spectra were acquired on a Voyager-DE STR time-of-flight mass spectrometer (Applied Biosystems, Foster City, CA) in linear and reflector mode. The spectra were smoothed, calibrated, and analyzed using the program M-over-Z.
Peptide sequences were acquired on a nanoflow liquid chromatography/tandem mass spectrometry instrument. The nano-LC system included a Micro-Tech XtremeSimple nano-LC pump system (Vista, CA) and a Shimadzu SIL-20AC AutoSampler (Kyoto, Japan). Peptides were separated on a reversed-phase C-18 column (100 μm (inner diameter) × 15 cm, 3 μm, 300 Å) (Micro-Tech, CA) by linear gradient elution of 3–50% solvent B in 60 min, using solvent A containing 98% water, 2% acetonitrile, 0.1% formic acid; and solvent B containing 98% acetonitrile, 2% water, 0.1% formic acid at flow rate of 600 nl/min. LTQ-Orbitrap (Thermo-Fisher, MA) was used to collect mass data, and the instrument was operated in the data-dependent mode. Survey MS spectra (from m/z 300 to 1600) were acquired in the Orbitrap with resolution R = 30,000 at m/z = 400. The five most intense ions were isolated and fragmented in a linear ion trap. The resulting fragment ions were recorded in the Orbitrap with resolution R = 15,000 at m/z = 400 or in the linear ion trap. Peptides were identified via automated data base searching by Sorcerer (version 3.10.7) (Sage-N) against an in-house created human thyroglobulin protein sequence data base composed of all entries of human thyroglobulin protein from the NCBI Reference Sequence (RefSeq) data base and our own thyroglobulin protein sequence containing SNP information. No digestion was selected during the creation of the search data base from the Fasta files. Mass tolerances on MS peaks were 5 ppm and on tandem MS peaks were 0.5 Da. Scaffold 2.0 was used to open the search result files. Only peptides with an XCorr value from SEQUEST searching software above the following settings, Sequest XCorr >1.8, 1+; >2.0, 2+; >2.2, 3+; >2.5, 4+ and above, or an E value from X!Tandem searching software above 2 were accepted for identification.
Correlation of SNP Genotypes and Peptides Produced by Digestion
The peptide profiles, analyzed by MS, were correlated with the Tg SNP genotypes. Across the different mass spectra, a representative peak was taken as an internal standard. The heights of all other peaks in the spectrum were normalized with respect to the internal standard peak. Normalized peak heights were then examined for correlation with the Tg genotype for three Tg SNPs that were shown to be associated with AITD (E10SNP24 (S734A or rs180223), E12SNP (M1027V or rs853326), and E33 SNP (R1999W or rs11535853)) (10).
Peptide and Oligonucleotide Synthesis
Peptides (>95% purity) were ordered commercially either from AlphaDiagnostic International (San Antonio, TX) or GenScript (Piscataway, NJ). Peptides were synthesized with biotin, at their N termini, to facilitate subsequent detection. Oligos were purchased from either Sigma-Proligo (The Woodlands, TX) or Integrated DNA Technologies (Coralville, IA).
Western Blot Analyses
Frozen thyroid tissues (a total of six samples) were resuspended in lysis buffer (PBS with 1 mm EDTA and protease inhibitors) at 2.5× the volume of tissue sample. Samples were vortexed 3–4 times in 5-s bursts and were kept on ice for 30 min with vortexing every 10 min. Samples were subsequently spun down for 14,000 rpm for 10 min at 4 °C and boiled for 5 min. Protein concentration was determined using a protein assay dye reagent (Bio-Rad) against the standard curve constructed with increasing amounts of bovine serum albumin (Bio-Rad). 10 μg of each sample were loaded onto an SDS-10% polyacrylamide gel. Before loading, samples were heated at 95 °C for 2–3 min in sample buffer under reduced conditions. Size standards consisted of pre-stained protein markers from New England Biolabs (Beverly, MA). Contents of the gel were transferred to a polyvinylidene difluoride membrane. After transfer, membranes were blocked, for 30 min, in 5% milk TBST. Primary antibodies were used at a 1:1000 dilution, in 2.5% milk TBST, and incubated overnight. After washing, the membrane was exposed to a horseradish peroxidase-conjugated goat anti-mouse secondary antibody for 45 min. Signal was detected using the chemiluminescent system from Amersham Biosciences. Primary antibodies used were mouse monoclonals against cathepsin D (Cell Signaling), cathepsin B (Zymed Laboratories Inc.), cathepsin L (Zymed Laboratories Inc.), and β-actin (Sigma). The secondary antibody consisted of a horseradish peroxidase-conjugated goat anti-mouse IgG antibody (Pierce). Signal was visualized with the ECL system (GE Healthcare) and exposure to x-ray film.
HLA-DR3 cDNA
The cDNAs for the α (DRA*010202) and β chains (DRB1*030101) of HLA-DR3 were generously provided by Dr. N. Koch (University of Bonn) and by Dr. C. Katovich-Hurley (Georgetown University). Sequencing of the β chain confirmed the presence of arginine at position β74. Individual amino acids of the peptide binding cleft at the β chain were changed, as indicated under “Results,” using the QuickChange® II or QuickChange® multisite-directed mutagenesis kits (Stratagene, catalogue number 200523 and 200515), as per the manufacturer's instructions.
Anti-HLA-DR Antibodies
Hybridoma cell lines were purchased from the American Type Culture Collection (Manassas, VA). Clone HB-55 was used to make the monoclonal antibody L243, which specifically recognizes the DRα chain when it is correctly folded and complexed with its β chain complement (free α chains do not bind L243) (19). Hybridomas were grown in serum-free media (Thermo Scientific). Prior to use in experiments, IgG molecules were purified from the spent media, using protein G conjugated to agarose (Roche Applied Science). Concentration was determined by measuring the absorbance at 280 nm (1 A280 nm unit = 0.8 mg/ml immunoglobulin).
Cell Lines
Cell lines used in this study either expressed MHC II molecules endogenously or were stably transfected to express MHC II. VAVY cells (expressing HLA-DR3 B1*0301, DRA 0102, DQA0501, DQB10201, DPA10201, and DPB10101) were obtained from the European Collection of Cell Cultures. Rat2 cells, derived from rat embryonic kidney tissue, were purchased from the ATCC and were subsequently engineered to express HLA-DR3. Despite the absence of invariant chain and endogenous MHC II molecules, Rat2 fibroblast cells are fully capable of assembling functional MHC class II proteins (20).
Rat2 cells stably transfected with HLA-DR3 were made as follows. The cDNA of the α chain was subcloned into the BamHI/XhoI sites of pCDNA 3.1+ vector (Invitrogen), containing the hygromycin resistance gene, and the cDNA of the β chain was cloned into the BamHI/XhoI sites of pCDNA 3.1+ vector containing the neomycin resistance gene. Cells were co-transfected with a total of 0.5 μg of the DRα and DRβ chain cDNA constructs, using Lipofectamine® 2000 (Invitrogen), and grown in Dulbecco's modified Eagle's medium containing 10% fetal calf serum supplemented with high glucose, 10 mm l-glutamine, pyridoxine-HCl, 300 μg/ml neomycin, and 50 μg/ml hygromycin for selection of double-positive clones (DRα and DRβ). After several passages, cells were analyzed for DR expression using the fluorescence-activated cell sorter and using the HLA-DR-specific monoclonal antibody, L243. Positive cells were sorted and expanded.
Peptide Binding Analysis by Flow Cytometry
Biotinylated peptides were tested for binding to Rat2 cells expressing recombinant HLA-DR, based on methodology reported previously (21). Fluorescein isothiocyanate-conjugated avidin D (Pierce) was used to detect the N-terminal biotinylated peptide. Amplification of the signal was achieved by incubating with 5 μg/ml biotinylated avidin-D-specific antisera (Pierce). Control samples consisted of Rat2 cells not transfected with HLA-DR, as well as HLA-DR expressing Rat2 cells incubated without peptide. Phycoerythrin-conjugated L243 (BD Biosciences) was used to detect the presence of HLA-DR molecules on the cell surface. The isotype control consisted of phycoerythrin-conjugated mouse IgG2a, κ isotype control (BD Biosciences).
For all flow cytometry experiments, binding was measured on a Coulter Epics XL (Beckman Coulter, Miami, FL). Dead cells and debris were excluded from analysis by setting the appropriate threshold trigger on forward-angle light scatter. Log fluorescence was collected for fluorescein isothiocyanate using a 525 band pass filter and phycoerythrin using a 575 band pass filter. In all experiments, 10,000 gated events were collected. For data analysis, peptide binding was expressed as the number of cells that bound peptide/the number of HLA-DR3 expressing cells.
Isolation of MHC II Molecules from B-cell Lines
Fully processed and naturally occurring HLA-DR3 molecules were purified from the VAVY cell line based on the reported protocols (22, 47). Briefly, 109 cells were lysed for 60 min at 4 °C in 0.01 m Tris, 0.15 m NaCl, pH 7.4, containing 2% Triton X-100 and proteinase inhibitors. HLA-DR3 molecules were purified from clarified cell lysates by immunoaffinity chromatography using L243, which was covalently linked to protein A-agarose, using the protein A IgG orientation kit (Thermo Scientific). Protein content was evaluated by the Bio-Rad protein assay reagent, and purity was assessed by SDS-PAGE.
Construction of DRα and DRβ Fusion Protein (Fig. 1)
FIGURE 1.
Design of α and β chain constructs and their respective expression in E. coli. The α and β ectodomains of HLA-DR3 were fused to a number of different sequence motifs. TEV represents a cut site for TEV protease (ENLYFQG), whereas Fos and JunB are dimerization motifs. The His is a repeat of 6 histidine residues, and Hsv is a herpes simplex virus immunodetection tag (SQPELAPEDPED) introduced by the cloning vector pET25b.
We have used a method successfully employed for HLA-DR expression systems in SF9 cells. Briefly, to achieve heterodimer stability (23) in the absence of a transmembrane domain, the extracellular portion of the DRB1*0301 β chain was fused to the coiled-coiled region of the basic leucine zipper domain of JunB, whereas the extracellular portion of the α chain was fused to the coiled-coiled region of the basic leucine zipper omain of Fos. These dimerization motifs have been demonstrated to preferentially favor the creation of heterodimers versus homodimers (24). Empty HLA-DR3 molecules were constructed by expressing each DR chain separately in E. coli and then refolding them together. Expression and purification were based on previous reports (25, 26). “Sticky end” PCR was employed to fuse the respective α or β ectodomain with its respective dimerization motif. 100 nanograms of the XhoI-linearized pcDNA3.1 construct containing the α chain was used as a template with forward primer (5′-TGCCCTCGAGAAAAGAATCAAAGAAGAACATGTGATC-3′, XhoI site is in boldface) and reverse primer (5′-TCCTCCTCCGTCGACTCCCTGAAAGTACAGGTTCTCGTTCTCTGTAGTCTCTGGGAG-3′; SalI site is in boldface, and TEV protease cleavage site is underlined). The cDNA clone containing the FOS dimerization domain was purchased from the ATCC (clone number MGC-11074). PCR was performed, using the cDNA clone as a template, along with forward primer (5′-GGAGTCGACGGAGGAGGACTGACTGATACACTCCAAGCG-3′; SalI site is in boldface) and reverse primer (5′-GCGGTTAATTAATCATGCATAATCTGGAACATCATATGGATATCGGTGAGCTGCCAGGATGAA-3′; PacI site is in boldface; stop codon is underlined, and hemagglutinin epitope tag is italicized). The amplicons of the DRα ectodomain and the FOS dimerization domain were joined together by sticky end PCR, and the amplicon was then subcloned into the XhoI/PacI sites of pKLAC1 (New England Biolabs, Beverly, MA).
For construction of the β chain construct, 100 ng of XhoI-linearized pcDNA3.1 containing the DRβ construct were used as a template with forward primer (5′-CTGCCCTCGAGAAAAGAGGGGACACCAGACCACGTTTC-3′; XhoI site is in boldface) and reverse primer (5-TCCTCCTCCGTCGACTCCCTGAAAGTACAGGTTCATCTTGCTCTGTGCAGATTC-3′; SalI site is in boldface, and TEV protease cleavage site is underlined). The cDNA clone containing the junB dimerization domain was purchased from OriGene (catalogue number TC118763, Rockville, MD) and was used as a template with forward primer (5′-GGAGTCGACGGAGGAGGACGCATCGCGCGCCTGGAGGAC-3′; SalI site is in boldface) and 250 nm reverse primer (5′-GCGGTTAATTAATCAGTGGTGGTGGTGGTGGTGGTTCATGACTTTCTGCTT-3′; PacI site is in boldface; the stop codon is italicized, and the histidine tag is underlined). As with the case of the DRα construct, the amplicons of the DRβ ectodomain and the JunB dimerization domain were joined together by sticky end PCR. The amplicon was then subcloned into the XhoI/PacI sites of pKLAC1.
For expression in E. coli, each individual MHC II fusion chain was subcloned and expressed individually in the pET25b(+) vector (Novagen). In brief, the respective constructs that had been subcloned into the pKLAC1 plasmid were used as templates for PCR, using the following primers: forward (5′-CCTGCCCATATGATCAAAGAAGAACATGTGATC-3′; NdeI site is in boldface) and reverse (5′-GCCTTGCTCAGCTCAGTGGTGGTGGTGGTGGTGTCGGTGAGCTGCCAGGAT-3′; BlpI site is in boldface) and forward (5′-CCTGCCCATATGGGGGACACCAGACCACGTTTC-3′; NdeI site is in boldface) and reverse primer (5′-GGAGGCCTCGAGGTGGTGGTGGTGGTGGTGGTT-3′; XhoI site is in boldface), for the α and β fusions, respectively. The amplicons were then ligated into the NdeI/BlpI (α fusion) or NdeI/XhoI (β fusion) sites of pET25b(+), under the control of T7 polymerase. Constructs were used to transform the E. coli strain, BL21-CodonPlus® (DE3)-RIPL cells (Stratagene). Cells were grown in LB broth, under carbenicillin selection (100 μg/ml) and were induced for 3–5 h with 1 mm isopropyl 1-thio-β-d-galactopyranoside (Bio-Rad), upon achieving an A600 nm of 0.5–0.7. Cells were spun down and frozen at −20 °C, until needed.
Solubilization and Purification of the Bacterially Expressed HLA-DR3
As the protein was expressed mainly in inclusion bodies, a scheme based on a protocol reported previously (25) for the refolding of HLA-DR from E. coli was employed. Initially, the pellet containing either the α or β chain was thawed in PBS, with 1 mm EDTA, 1 mm phenylmethylsulfonyl fluoride (Sigma), and protease inhibitors (Roche Applied Science). After resuspension and incubation on ice for 5 min, the sample was centrifuged at 10,000 rpm for 20 min in a Sorvall® RC-5b centrifuge using a sm-24 rotor. The soluble fraction was discarded, and the insoluble pellet was washed twice with isopropyl alcohol and centrifuged, as in the previous step. The washed pellet was then solubilized, and the His-tagged protein was purified, as per manufacturer's instructions (Novagen, His-Bind® kit manual), under denaturing conditions, using 6 m guanidinium hydrochloride (Sigma).
Refolding of the Bacterially Expressed HLA-DR3
To form a complete heterodimeric MHC II molecule, 5.5 mg of the β chain was combined with 5.5 mg of the α chain in 25 ml of 6 m guanidinium HCl/elute buffer (elute buffer made as described in the His-Bind® kit). Immediately, the volume was brought up to 200 ml with dialysis buffer (20 mm Tris-HCl, pH 8.0 (Sigma), 1 mm EDTA (Sigma), 3 mm glutathione-reduced (Sigma), 0.3 mm glutathione-oxidized (Sigma), 50 mm KCl (Sigma), and 33% glycerol (catalogue number 49767, Sigma)). The recombined α and β chains were dialyzed against 3 liters of the dialysis buffer, with three changes being performed over the course of 72 h. All dialysis steps used a membrane of 15,000 molecular weight cutoff (Spectra/Por®). Refolded protein was isolated by loading the recombined protein on to an Anx-Sepharose column (GE Healthcare) and running a gradient, which started at 50 mm KCl and ended at 1 m KCl in 20 mm Tris, pH 8.0, 10% glycerol. A final dialysis was performed against 10 mm Tris, pH 8.0, 25 mm KCl, 10% glycerol. The concentration of the bacterially expressed MHC II was determined as described for thyroglobulin.
ELISA
Plates were coated overnight at 4 °C with 10 μg/ml L243 in bicarbonate buffer, pH 9.5, in a 100-μl volume. Solutions with limiting amounts (96 nm) of recombinant HLA-DR3 were challenged with 5 μm biotinylated peptide for 24–48 h, at 37 °C, in a buffer consisting of PBS, 0.05% Triton X-100, and 1% bovine serum albumin. Controls consisted of just peptide, incubated under identical conditions, with no MHC II, to account for any nonspecific binding of the peptide to the ELISA plate. Subsequent to blocking with 200 μl of 2.5% bovine serum albumin/PBS, at 37 °C for 1 h, and four rounds of washing with PBS, 100 μl of peptide/HLA-DR3 mixtures were bound in individual wells of 96-well plates. After incubation, for 1 h at 37 °C, wells were washed 4–5 times with PBS, 0.05% Triton X-100. Next, 100 μl of ImmunoPure® streptavidin-alkaline phosphatase conjugate (Pierce) was added to each well, in a 1:2500 dilution (0.25 units/ml), and incubated at 37 °C for 30 min. Wells were then washed six times with PBS. Bound streptavidin alkaline phosphatase was visualized through the addition of p-nitrophenyl phosphate (Sigma) and measuring A405 nm. To quantitate the femtomoles of peptide bound, a calibration curve was constructed with known amounts of streptavidin-horseradish peroxidase. As each mg of streptavidin-horseradish peroxidase can bind to 4 μg of biotin (according to the product information provided by the manufacturer), the amount of biotin and hence the amount of peptide binding could be quantified.
Competition Experiment
For competition experiments, ELISAs were done as above, with the following modifications. Plates were coated with 20 μg/ml L243, and 4 μg/ml streptavidin/europium (PerkinElmer Life Sciences) was used to detect HLA-DR-bound peptide. After washing the europium conjugate with PBS, 200 μl of inducer solution was added, and finally the fluorescence was read using a POLARstar Omega fluorescent plate reader (BMG LabTech). Biotinylated apopeptide (previously shown to be a peptide that bound with high affinity to HLA-DR3), bound at 5 μm concentration, was competed off with increasing levels of “cold” (nonbiotinylated) competitor peptides.
MD Simulations and Energetic Analysis
The complexes of DR3 with the three Tg peptides (Tg.1571, Tg.726, and Tg.1951) have been constructed with AMBER using the known structure of DR3 with the CLIP peptide (Protein Data Bank code 1A6A). The register of the peptides in the complex was determined by a multiple alignment of the peptides that bind to DR3. The registers are as follows: Tg.1571-Pro-Glu-Ser-Lys-Val1-Ile-Phe-Asp4-Ala-Asn6-Ala-Pro-Val9-Ala-Val, Tg.726-Pro-Thr-Pro-Cus-Gln1-Leu-Gln-Ala4-Glu-Gln6-Ala-Phe-Leu9-Arg-Thr, and Tg.1951-Arg-Lys-Lys-Val-Ile1-Leu-Glu-Asp4-Lys-Val6-Lys-Asn-Phe9. The corresponding residues in the CLIP peptide were substituted with the sequence of the respective peptide. A similar approach was used to construct the DR74Q mutant of DR3. Details of the simulations and MM-PBSA analysis are presented in the supplemental material.
RESULTS
Expression of Individual α and β Chains in E. coli
Individual HLA-DR3 chains were expressed as described under “Experimental Procedures” (Fig. 1). Fig. 2A shows the high expression levels of each chain, after induction in E. coli. Fig. 2B demonstrates that an intact α/β heterodimer is formed after refolding.
FIGURE 2.
Expression of α and β chains in E. coli and characterization of the refolded HLA-DR3. A, samples were boiled in a reducing buffer and were run on a 12% SDS gel. Lanes are as follows: lane 1, size standards; lane 2, β chain prior to induction; lane 3, β chain (arrow) after induction; lane 4, α chain prior to induction, and lane 5, α chain (arrow) after induction. Expression of the different chains was greatly enhanced by using an E. coli strain that contained extra copies of the Arg, Ile, Pro, and Leu tRNA genes (data not shown). B, recombinant HLA-DR3 molecules were loaded in nonreducing buffer, without boiling, and were run on a 12% SDS gel. Lanes, taken from the same gel, are as follows: lane 1, size standards; lane 2, recombinant molecule untreated; lane 3, recombinant HLA-DR3 after treatment with TEV protease. Even without the Fos/JunB dimerization interface, and in the presence of SDS, the individual α and β chains can form a heterodimer (α/β).
Cathepsin Expression in the Human Thyroid Gland
Western blot analyses established that cathepsins B, D, and L were well expressed in human thyroidal tissue (Fig. 3) at the protein level. Interestingly, most of the cathepsin D in the thyroid exists in the pro-form, suggesting that the activity of this protein is tightly regulated. Similar results have been reported in murine thyroid (29) as well. In contrast, cathepsins B and L were expressed mostly in their active forms.
FIGURE 3.
Protein levels of thyroidal cathepsins. Cathepsins B, D, and L expression was examined on separate gels, using Western blotting. Lanes 1 and 2 consist of HeLa cell extracts. Lanes 3–8 are thyroidal tissue (from healthy portions of surgically excised glands (lanes 3 and 4) and from cadavers (lanes 5–8)). Actin levels served to confirm that equal amounts of protein were added to each lane.
Cathepsin-mediated Digestion of Thyroglobulin
Purified thyroglobulin ran with a predominant band at 330 kDa, on an SDS gel, under reducing conditions (Fig. 4, lane 2). Incubation with cathepsin B, D, or L, under acidic conditions (pH 3.5) mimicking the lysosomal milieu, resulted in substantial degradation of Tg (Fig. 4, lane 3). Sequences (118 peptides for cathepsin B, 171 peptides for cathepsin D, and 48 peptides for cathepsin L) of individual peptides generated from cathepsin-digested Tg were determined by high pressure liquid chromatography-tandem MS analysis (supplemental Tables S1–S3) and were consistent across the 20 different thyroglobulin samples. The three cathepsins exhibited different cut site preferences, as leucine and phenylalanine were the two most common amino acids used by cathepsin B and D, whereas glycine and arginine were the cathepsin L-preferred amino acids (supplemental Fig. 1).
FIGURE 4.
Undigested and cathepsin-digested Tg run on 7.5% polyacrylamide gel. The gel was stained with colloidal Coomassie. Lane 1, broad range markers from New England Biolabs. Lane 2, 5 μg of purified undigested thyroglobulin. The sample, run under reducing conditions, shows a predominant species (arrow) that migrates at the expected rate for a 330-kDa monomer. Additional bands reflect the heterogeneous nature of thyroglobulin that arises due to varying degrees of post-translational modifications. Lane 3, thyroglobulin after treatment with cathepsin D.
Correlation between Tg SNPs and Cathepsin-generated Tg Peptide Profiles
Mechanistically, a Tg amino acid variant could affect the Tg peptidic repertoire if SNP either adds or removes a cathepsin cut site. Our analysis of the peptide profiles generated by cathepsin B, D, and L digestion of Tg obtained from individuals harboring the three disease-associated Tg amino acid variants (10) did not demonstrate a correlation between any Tg amino acid variant or haplotype and levels of cathepsin B-, D-, or L-generated peptides (data not shown). Hence, these susceptibility polymorphisms may contribute to thyroid autoimmunity through mechanisms independent of cathepsin B, D, or L action (e.g. they could modify Tg entry into endosomes or peptide binding to MHC class II).
Identifying Potential Peptides for Binding Studies
We took a reverse immunological approach using two peptide-MHC II binding prediction algorithms: ProPred and RankPep. From the entire list of cathepsin-generated peptides, four gave high probabilities of binding to Arg-74 + HLA-DR3 by both algorithms and were tested. All experimentally tested peptide sequences, along with those peptides previously reported to bind to HLA-DR3 (17, 31), are summarized in Table 1.
TABLE 1.
Thyroglobulin peptides tested and their respective sequences
| Peptide name | Sequence | Comments |
|---|---|---|
| Tg.726–742 | Biotin-CPTPCQLQAEQAFLRTV | Naturally processed peptide bound in vivo to HLA-DR and eluted from HLA-DR molecules isolated from Graves disease thyroids (17) |
| Tg.2098–2112 | Biotin-LSSVVVDPSIRHFDV | Same as for Tg.726–742 |
| Tg.2100–2111 | Biotin-SVVVDPSIRHFD | Cathepsin L-generated predicted to bind by algorithms |
| Tg.1571–1597 | Biotin-EKVPESKVIFDANAPVAVRSKVPDSEF | Cathepsin D-generated predicted to bind by algorithms |
| Tg.1951–1964 | Biotin-FRKKVILEDKVKNF | Cathepsin B- and D-generated predicted to bind by algorithms |
| Tg.1409–1424 | Biotin-QLHLDSKTFPAETIRF | Cathepsin B- and D-generated predicted to bind by algorithms |
| Tg.2673 | Biotin-FSRKVPTFATPWPDFVPRAGGEN | Predicted as a nonbinder by algorithms |
| Tg.2507 | Biotin-LIGSSQDDGLINRAKAVKQF | Predicted as a nonbinder by algorithms |
| Tg.5–22 | Biotin-DQVAALTWVQTHIRGFGGDPRR | Based on a C-terminal Tg peptide reported to elicit experimental autoimmune thyroiditis (44) |
Peptide Binding Studies and the Influence of the DR3β Pocket Signature
The recombinant MHC II protein was fully functional, as evinced by its ability to bind peptides even better than HLA-DR3, which was isolated from a DR3 homozygous cell line (VAVY from ATCC) (data not shown). Previously, our genetic studies have defined residues Gln-70, Lys-71, and Arg-74 of the DRβ1 chain as being highly associated with the development of AITD, whereas residues Arg-70, Arg-71, and Gln-74 were protective (30). Hence, three distinct signatures were examined as follows: the first recombinant molecule harbored the susceptible amino acids at all three positions Gln-70, Lys-71, and Arg-74; the second DR3 contained glutamine at position β74 (protective) together with Gln-70 and Lys-71 (susceptible, this construct enabled us to dissect the individual contribution of position 74 to peptide binding); the third molecule had a completely protective signature with Arg-70, Arg-71, and Gln-74. The amount of each peptide that could be bound by an equivalent amount of each signature MHC II molecule was determined.
The experimental peptides were compared with a previously identified peptide, apo (31), which has strong and specific affinity of binding to Arg-74 + HLA-DR3 (positive control), as well as to two thyroglobulin peptides Tg.2673 and Tg.2507, predicted to be poor binders (negative controls). In addition, we tested two Tg peptides previously reported to bind DR3, Tg.726, and Tg.2098 (17). Consistent with the output of the MHC II binding algorithms, Tg.2673 and Tg.2507 did not bind (data not shown), whereas two of the predicted binders (Tg.1571 and Tg.1951) showed strong and specific binding when tested (Table 2). Binding data are summarized in Table 2, which represents the femtomoles of each bound peptide to a constant amount of HLA-DR3. The order of binding affinity was as follows: apo > Tg.1571 > Tg.726 > Tg.1951 > Tg.2098. It is noteworthy that Tg.2100, despite its similarity to Tg.2098, did not bind, possibly because the two amino acid N-terminal truncation places the biotin group much close to the peptide-binding cleft and thus sterically prevents binding. Intriguingly, as the character of the peptide binding cleft shifts from the susceptible to the protective signature, there is a marked diminution in peptide binding. Moreover, ∼90% decreased binding was seen when replacing Arg-74 with Gln-74, whereas replacing the other two susceptible amino acids at positions 70 and 71 decreased the binding by an additional 10%. These data demonstrate that Arg-74 contributed the most significant contribution to the energetics of binding and are consistent with our data that Arg-74 is strongly associated with both Graves and Hashimoto diseases (6, 10).
TABLE 2.
Femtomoles of peptide bound (with 95% confidence intervals) as a function of the pocket signature (amino acid composition at β chain residues 70, 71, and 74)
ND signifies not detected. Binding was not detected for Tg.2673 and Tg.2507, as expected.
| Pocket amino acids | Apo | Tg.1571 | Tg.1951 | Tg.726 | Tg.2098 |
|---|---|---|---|---|---|
| Gln-70, Lys-71, and Arg-74 | 608 ± 53.5 | 293 ± 30.5 | 40.3 ± 12.3 | 93.7 ± 19.9 | 34.6 ± 5.1 |
| Gln-70, Lys-71, and Gln-74 | 62.7 ± 25.3 | 20.8 ± 10.0 | 4.2 ± 1.6 | 11.9 ± 2.2 | ND |
| Arg-70, Arg-71, and Gln-Q74 | 10.8 ± 1.7 | ND | ND | 3.7 ± 1.1 | ND |
Specificity of Binding/Inhibition Studies
To demonstrate that the observed peptide binding was specific to the peptide binding cleft of the MHC II molecule, we conducted competition studies (Fig. 5) in which a biotinylated apoprotein (known to bind to the MHC II cleft) was competed off with a cold peptide of interest. The apopeptide was chosen, as it is a well characterized peptide that was initially identified after being eluted from HLA-DR3 isolated from the surface of B-cell derived cell lines (31), and it is thus suited to be “benchmark” peptide in this study. Tg.2098, Tg.726, Tg.1571, and Tg.1951 all competed with apo for the binding to the MHC II peptide binding cleft (Fig. 5), demonstrating that the binding of these peptides was specific to the HLA-DR peptide binding cleft.
FIGURE 5.
Competition with cold peptide. Recombinant HLA-DR3 (Arg-74) was incubated with 5 μm labeled apo. Nonlabeled competitor peptides were added at 5- and 25-fold molar excess. Data are expressed as the decrease in fluorescent intensity, which was generated by the labeled peptide.
Cell Lines
As a complement to the recombinant HLA-DR3 system, an HLA-DR3-Arg-β74-expressing Rat2 cell line was constructed to examine the specific Tg peptide binding to MHC II without interference from any other endogenous MHC II molecules (which may occur if one were to use immortalized B-cell lines from DR3 homozygous individuals that also express other MHC II molecules such as DQ). The DR3 stably transfected line showed significant expression of the HLA-DR molecules (Fig. 6A). Moreover, all of the peptides that were observed to bind in the recombinant system were validated and showed strong binding to the HLA-DR3-expressing cell line while not binding to the non-MHCII-expressing parent Rat2 cell line (Fig. 6B).
FIGURE 6.
Peptide binding using HLA-DR3-expressing cell line. A, HLA-DR expression on Rat2 cell line stably transfected with HLA-DR3. Cells were stained with L243-fluorescein isothiocyanate to detect HLA-DR3 on the cell surface. The top diagram (shaded) represents the stably transfected cells. The bottom diagram (clear) represents the untransfected fibroblasts. High levels of expression are seen. B, biotinylated peptide was incubated with both HLA-DR3 expressing Rat2 fibroblasts and with wild type Rat2 cells that did not express HLA-DR3. Binding is expressed as the percentage of cells that capture a given peptide of all DR3-positive cells. Tg.2673 and Tg.2507 are Tg peptides predicted not to bind and are used as negative controls (these peptides showed no detectable binding in our ELISA system; data not shown). Tg5–22 (DQVAALTWVQTHIRGFGGDPRR), which bound nonspecifically, was based on a previously reported pathogenic Tg-derived peptide (44) and was used to demonstrate the sensitivity of our system to nonspecific binding.
Analysis of the Binding Energetics of the Tg Peptides
Focusing on the nine critical pocket residues (P1–P9) that are most important for peptide MHC II interactions, we undertook an amino acid by amino acid analysis to dissect the energetics of the thyroglobulin peptide binding. Table 3 lists the energies derived from each peptide residue side chain and predicts an optimal HLA-DR3 binding sequence derived from Tg.1571, Tg.1951, and Tg.726. Clearly, all the residues contribute to binding, albeit to a different extent. For example, in all the peptides, contributions from position 5 are the smallest (Table 3), although positions 1 and 9 make major contributions. Positions 1, 6, and 9 prefer a hydrophobic residue and position 4 a negatively charged one. Positively charged residues in the center of the sequence are clearly disfavored (e.g. Tg.1951). Further decomposition of the interaction energy into contributions from backbone and side chains reveals that interactions with the backbone contribute a substantial portion of the interaction energy. In Tg.1571, Tg.726, and Tg.1951, interactions with backbone contribute 37, 38, and 54%, respectively. These contributions are from hydrogen bonds to the peptide C
O and N–H groups, and thus they depend weakly on the specific position as indicated by their average and small variance compared with the side chains. In Tg.1571, average backbone interactions are −1.9 ± 0.7 kcal/mol, whereas for the side chain interactions are −2.6 ± 1.7 kcal/mol. Clearly, the selectivity of peptide binding resides in the interaction with the side chains. This is supported by the fact that the “optimal” sequence constructed from side chain contributions is the same as that from the total interaction energies (data not shown and see Table 3).
TABLE 3.
Binding energies as a function of the amino acid sequence
Final column suggests an optimal peptide for binding to HLA-DR3. (Int.E means interaction energy).
| Position | Tg.1571 | Int.E | Tg.726 | Int.E | Tg.1951 | Int.E | Optimal residue |
|---|---|---|---|---|---|---|---|
| kcal/mol | kcal/mol | kcal/mol | |||||
| 1 | Val | −6.6 | Gln | −3.5 | Ile | −9.5 | Ile/Val |
| 2 | Ile | −4.2 | Leu | −4.8 | Leu | −5.2 | Leu/Ile |
| 3 | Phe | −7.1 | Gln | −1.5 | Glu | −1.0 | Phe |
| 4 | Asp | −6.0 | Ala | −2.7 | Asp | −5.4 | Asp |
| 5 | Ala | −2.1 | Glu | −1.2 | Lys | −0.4 | Ala |
| 6 | Asn | −2.2 | Gln | −3.0 | Val | −4.5 | Val |
| 7 | Ala | −3.6 | Ala | −3.0 | Lys | −2.9 | Ala |
| 8 | Pro | −3.9 | Phe | −3.5 | Asn | −2.9 | Pro/Phe |
| 9 | Val | −7.2 | Leu | −6.8 | Phe | −4.3 | Val/Leu |
The interaction energies between the peptide and the HLA can be also decomposed into pairwise contributions (27). Such an analysis identifies the specific residues in the peptides and in the MHC protein that make significant contribution to the interaction energy. The analysis is shown in Fig. 7A. The specific values of pairwise interactions for different peptides depend on the sequence of the peptide, but the localization of the interactions is invariant (Fig. 7A). With the exception of positions 4 and 5 in the peptide, the binding pockets are made up of residues in the α and β subunits of the MHC protein. Interestingly, the pockets for residues 4 and 5 are entirely localized in the β subunit, and the strongest interactions are centered on Lys-β71 and Arg-β74. In Tg.1571, this is by far the most important pairwise interaction, providing a molecular mechanism for the observed changes in binding upon mutations of Arg-74 and Lys-71.
FIGURE 7.
Interaction energies and contacts of the Tg.1571 and Tg.726 peptides. A, horizontal axis lists all the residues in the HLA subunits α and β demarcated by the large blue brackets. The depth axis lists the residues of the Tg.1571 peptide with their sequence in canonical order. The vertical axis shows the negative of the pairwise interaction energy. Note that the larges interactions are between D(4) of the peptide and Lys-β71 and Arg-β74. B, molecular depiction of the interactions between Lys-β71 and Arg-β74 and the Tg.726 peptide. The α subunit is in ice-blue and the β subunit in gray. The backbone of the peptide is depicted as a red tube. Residues Lys-β71 and Arg-β74 as well as Gln-3, Ala-4, and Glu-5 of Tg.726 are shown in atom colored sticks. Left, Arg-β74 through its strong salt bridge interaction with Glu-5 induces a strained conformation of the side chain. The Lys-β71 interaction with the backbone carbonyl of Glu-5 is preserved in all the complexes studied in this work. Right, R74Qβ mutation weakens the interaction with Glu-5 and upon release of the strain the side chain turns away from Gln-β74. The interaction of Lys-β71 with the backbone carbonyl is maintained.
As shown pictorially for Tg.1571 in Fig. 7A, the most favorable interactions for Tg.1571 occur on the β chain and cluster around positions 71 and 74, which we have previously identified as strategic to the development of AITD (30). Our analysis determined that a negatively charged residue in position 4 or 5 of the peptide is responsible for the interaction with the positive residues in positions 71 and 74 of the HLA-DR cleft. Fig. 7B shows Tg.726 modeled within the binding cleft and highlights the loss of contacts when Arg-β74 is changed to Gln-β74.
Importance of Arg-β74
To explore the role of Arg-β74 in peptide binding, we conducted the same simulations with the three peptides interacting with DR3 in which Arg-β74 was mutated to Gln. The mutation R74Q reduces the total interaction energy with all the peptides with the following rank order: Tg.1571 > Tg.1951 > Tg.726. A more detailed analysis showed that the mutation affects primarily the interaction with the side chains as follows: 3.5 kcal/mol in Tg.1571, 3.0 kcal/mol in Tg.726, and 4.6 kcal/mol in Tg.1951. Consequently, the relative contribution of the interaction with the backbone increases and becomes a dominant contribution. A decomposition of the total interaction energy into residue-based contributions shows that the main source of the change lies in the negatively charged residues in the center of the peptide. Thus, the interaction with D4 in Tg.1571 is reduced by 3.0 kcal/mol, the interaction with E5 in Tg.726 is reduced by 1.0 kcal/mol, and in Tg.1951 the interaction with both E3 and D4 is reduced by 3.5 kcal/mol. These changes are consistent with the experimental findings and provide a molecular model for the observed changes in binding.
An illustration of the changes in the interactions is shown in Fig. 7B, which shows the molecular arrangement around Lys-71 and Arg-74/Gln-74 in the complex with Tg.726. It shows a strong interaction between the two positive residues and the carboxylate of D4. In addition, Lys-71 interacts with the backbone carbonyl of A5. In Gln-74, the configuration does not change substantially compared with Arg-74, but now the interaction with Gln-74 is substantially weaker as reflected in the interaction energy as well as in the binding results. These analyses clearly illustrate that the effect of substitution in the HLA-DR pocket has both direct and indirect effects on peptide binding and selectivity.
DISCUSSION
Any given MHC II molecule has the ability to bind between 650 and 2000 different peptides (the peptidic repertoire) (31–33); nevertheless, despite this promiscuity, the ability to bind individual peptides with great stability is retained (reviewed in Ref. 34). In addition to being designed to present exogenous peptides and thus protect the host, the MHC II, by virtue of its exquisite design, can also present numerous self-peptides and, if autoreactive T-cells escape tolerance, cause an autoimmune response.
Critical to the initiation of an autoimmune response is the sequence of the MHC II peptide binding cleft. Indeed, concerning thyroid autoimmunity, we have previously identified unique amino acid pocket signatures in HLA-DR, which are highly associated with both GD and HT (9, 30). Moreover, when co-inherited with disease-associated polymorphisms in the thyroglobulin gene, a statistical interaction arises, resulting in a significantly higher risk for disease than would be expected by a simple additive effect alone (7), suggesting the existence of a biochemical interaction between Tg peptides and the MHC II pocket signature (35). Consistent with this putative mechanism are data showing that thyroid follicular cells can express MHC II molecules in the context of autoimmunity, and can act as facultative APCs (36, 37).
Interestingly, the MHC II pocket sequences examined in this study dictated a dramatic effect on the binding of multiple Tg peptides, as well as apo to HLA-DR (Table 2). Three pocket signatures were tested using HLA-DR3 molecules from a recombinant E. coli system, which was validated by both specificity controls and by a stable MHC-II-expressing cell line. The pocket signatures were as follows: 1) Tyr-26β, Tyr-30β, Gln-70β, Lys-71β, and Arg-74β (all susceptible amino acids); 2) Tyr-26β, Tyr-30β, Gln-70β, Lys-71β, and Gln-74β (all susceptible amino acids except Gln-74β); and 3) (Tyr-26β, Tyr-30β, Arg-70β, Arg-71β, and Gln-74β (all protective amino acids, except Tyr-26β and Tyr-30β). These pocket signatures were selected based on our genetic studies that have shown that GD is associated with only one amino acid pocket variant DRβ-Arg-74, whereas HT requires a five-amino acid signature, Tyr-26β, Tyr-30β, Gln-70β, Lys-71β, and Arg-74β. Lys-71β showed by logistic regression analysis to be the most strongly associated amino acid (30). Because Arg-74 was common to both GD and HT susceptibility pockets, we hypothesized that it was critical for peptide binding. Therefore, to test whether Arg-74 has the greatest contribution to peptide binding, we created a hybrid pocket containing susceptible residues at all positions, except 74 which had Gln (resistant). Our results demonstrated that Arg-74 was the key position for peptide binding. As corroboration to the results of the binding studies, energetic analysis indicated that most of the favorable peptide MHC II interactions are promoted by susceptibility amino acids at positions 71 and 74 of the β chain. The energetic data showed that Arg-74 forms a strong salt bridge interaction with residue Glu-5 of the peptide and induced a strained conformation of the side chain. In contrast, when Gln occupies position 74 it weakens the interaction with Glu-5 and releases the strain of the side chain away from Gln-74. Thus, Arg-74 is critical to stabilizing the Tg peptide within the DR pocket.
The same positions, located in pocket P4, which were identified to be strategic to peptide binding in the recombinant system (β70, β71, and β74), have also been found to play an important role in other autoimmune diseases (38) and immunological processes. For example, positions β70 and β71 are critical to the DR4 alleles associated with rheumatoid arthritis (39). Additionally, βQ70R and βR74Q were associated with elevated CLIP levels in cells, as they serve to increase the retention of CLIP by DR3 (40), and βQ70R has been implicated in a TCR contact (41).
From the standpoint of the peptide, this study has demonstrated that at least four thyroglobulin-derived peptides have a biological relevance to thyroid autoimmunity. Two (Tg.726 and Tg.2098) have been characterized previously as coming from human thyroids and were shown to bind to HLA-DR3, using methodology from ProImmune Ltd. (17). The two novel Tg-derived peptides that have been identified, although not coming from isolated glands, are biologically relevant nevertheless, as they were derived from cathepsins, which are the main proteases generating peptides in endosomes and are key proteases in thyroid biology. Traditional methods used to find pathogenic self-peptides, such as those applied to the acetylcholine receptor in the studies of myasthenia gravis (42), which consist of sequentially generating random peptides, are unsatisfactory, as, in addition to being very labor intensive, they may not yield peptides that have biological relevance to the etiopathogenesis of autoimmunity. Sequentially generated peptides may not be produced under in vivo conditions. By constructing a model peptidic repertoire and scanning it with validated algorithms, we have shown that it is possible to identify, with a high probability (two out of four tested), MHC II binders, without synthesizing and screening a large number of peptides. In fact, to screen the entire Tg molecule with sequentially generated random peptides would require the screening of >2,500 peptides, which is not feasible. Moreover, by furnishing a detailed analysis that documents the model peptidic repertoire that will be available in the thyroid, as a result of cathepsin B, D, or L digestion, and understanding the cutting sites, it is possible to trace the origins of a pathogenic peptide. As proof of principle, four of the eight peptides isolated from the HLA-DR of thyroids from patients with GD (17) had a very closely corresponding peptide (sequence similarities are underlined in Table 4) in the model peptide repertoire.
TABLE 4.
Correlation of thyroglobulin peptides eluted from thyroidal HLA-DR molecules (17) and cathepsin-generated thyroglobulin catabolic profile
Amino acids in the cathepsin-generated peptides that correspond to the HLA-DR eluted peptides are underlined.
| Thyroid | Cathepsin B | Cathepsin D | Cathepsin L |
|---|---|---|---|
| 726CPTPCQLQAEQAFLRTVQ743 | |||
| 1080SRTSGLLSSWK1090 | |||
| 2098LSSVVVDPSIRHFDV2112 | 2099SSVVVDPSIR2108 | ||
| 2100SVVVDPSIRHFD2111 | |||
| 2224QVDQFLGVPYAAPPLAER2241 | 2225VDQFLGVPYAAPPLAERR2242 | 2229LGVPYAAPPLAERR2242 | |
| 2508IGSSQDDGLINR2519 | 2506LLIGSSQDDGLINRAKAVKQ2525 | 2508IGSSQDDGLINRAKAVKQF2526 | |
| 2532AKAVKQFEESQGR2544 | |||
| 2520AKAVKQFEESQGRTSSK2536 | |||
| 2756LSLQEPGSKT2765 | 2756LSLQEPGSKT2765 | 2765LSLQEPGSKT2765 |
It remains to be determined whether the novel Tg-derived HLA-DR3-binding peptides identified in this study, as well as the previously identified thyroglobulin peptides (17), are immunodominant, subdominant, cryptic, or simply self-peptides. Importantly, of all the evaluated Tg-derived HLA-DR3-binding peptides, Tg.2098 is especially interesting as it was detected using our recombinant system, as well as by two other groups; the first group used thyroid tissues from patients with Graves disease (17), and the second group conducted peptide immunizations in mice (45). Hence, these data suggest that Tg.2098 could possibly play a role in the initiation of AITD.
Finally, although the analyses in this study examined Tg digested by a single cathepsin at a time, all three proteases could work in concert, in an in vivo context. Moreover, any disease or physiological process that affects the level of a given cathepsin would help to shape the character of the thyroglobulin peptidic repertoire. For example, increased cathepsin B immunoreactivity on the apical membrane of thyroid follicular cells has been shown in Graves disease (13). Consistent with this finding, TSH has been reported to induce a 2-fold enhancement of cysteine protease activities (14). Finally, cathepsins B and L are up-regulated during conditions conducive to inflammation such as exposure to interferon-γ (43). Because there is a relationship between the catabolic profile of thyroglobulin and actual peptides that are ultimately processed and displayed on thyroidal cells, cathepsin or lysosomal protease inhibitors, directed to the thyroid, might be a novel approach in which to mitigate or prevent AITD. Indeed, a proof of principle of this approach to treating autoimmunity was provided by a knock-out mouse model lacking cathepsin L on a non-obese diabetic mouse background that was protected from developing autoimmune diabetes (46).
Supplementary Material
Acknowledgments
We thank Dr. Jack Strominger (Harvard University) for providing us with the cell lines used to establish the methodology used in this study. We also thank Dr. Norbert Koch (University of Bonn), Dr. Carolyn Katovich-Hurley (Georgetown University), and Dr. Elizabeth Mellins (Stanford University) for providing us with the cDNA for HLA-DR3. We deeply appreciate the advice of Mary Palascak (University of Cincinnati) in flow cytometry studies. Finally, we express our gratitude to Drs. Nira Ben-Jonathan and Eric Hugo (University of Cincinnati) for suggestions with this manuscript.
This work was supported, in whole or in part, by National Institutes of Health Grants DK61659, DK067555, and DK073681 from NIDDK (to Y. T.). This work was also supported by a grant from the American Thyroid Association (to E. M. J.).

The on-line version of this article (available at http://www.jbc.org) contains supplemental Tables S1–S3, a figure, Methods, and additional references.
- MHC
- major histocompatibility complex
- Tg
- thyroglobulin
- AITD
- autoimmune thyroid disease
- GD
- Graves disease
- HT
- Hashimoto thyroiditis
- PBS
- phosphate-buffered saline
- ELISA
- enzyme-linked immunosorbent assay
- MS
- mass spectrometry
- TEV
- tobacco etch virus
- SNP
- single nucleotide polymorphism.
REFERENCES
- 1.Wucherpfennig K. W. (2001) Curr. Opin. Immunol. 13, 650–656 [DOI] [PubMed] [Google Scholar]
- 2.Wucherpfennig K. W., Yu B., Bhol K., Monos D. S., Argyris E., Karr R. W., Ahmed A. R., Strominger J. L. (1995) Proc. Natl. Acad. Sci. U.S.A. 92, 11935–11939 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Gebe J. A., Swanson E., Kwok W. W. (2002) Tissue Antigens 59, 78–87 [DOI] [PubMed] [Google Scholar]
- 4.Wucherpfennig K. W. (2003) Ann. N.Y. Acad. Sci. 1005, 119–127 [DOI] [PubMed] [Google Scholar]
- 5.Todd J. A., Acha-Orbea H., Bell J. I., Chao N., Fronek Z., Jacob C. O., McDermott M., Sinha A. A., Timmerman L., Steinman L. (1988) Science 240, 1003–1009 [DOI] [PubMed] [Google Scholar]
- 6.Ban Y., Davies T. F., Greenberg D. A., Concepcion E. S., Osman R., Oashi T., Tomer Y. (2004) Genes Immun. 5, 203–208 [DOI] [PubMed] [Google Scholar]
- 7.Hodge S. E., Ban Y., Strug L. J., Greenberg D. A., Davies T. F., Concepcion E. S., Villanueva R., Tomer Y. (2006) Thyroid 16, 351–355 [DOI] [PubMed] [Google Scholar]
- 8.Shulman S. (1971) Adv. Immunol. 14, 85–185 [DOI] [PubMed] [Google Scholar]
- 9.Jacobson E. M., Tomer Y. (2007) Thyroid 17, 949–961 [DOI] [PubMed] [Google Scholar]
- 10.Ban Y., Greenberg D. A., Concepcion E., Skrabanek L., Villanueva R., Tomer Y. (2003) Proc. Natl. Acad. Sci. U.S.A. 100, 15119–15124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Rodriguez G. M., Diment S. (1992) J. Immunol. 149, 2894–2898 [PubMed] [Google Scholar]
- 12.Dunn A. D., Crutchfield H. E., Dunn J. T. (1991) J. Biol. Chem. 266, 20198–20204 [PubMed] [Google Scholar]
- 13.Shuja S., Cai J., Iacobuzio-Donahue C., Zacks J., Beazley R. M., Kasznica J. M., O'Hara C. J., Heimann R., Murnane M. J. (1999) Thyroid 9, 569–577 [DOI] [PubMed] [Google Scholar]
- 14.Dunn A. D. (1984) Endocrinology 114, 375–382 [DOI] [PubMed] [Google Scholar]
- 15.Chapman H. A. (2006) Curr. Opin. Immunol. 18, 78–84 [DOI] [PubMed] [Google Scholar]
- 16.Engelhard V. H., Altrich-Vanlith M., Ostankovitch M., Zarling A. L. (2006) Curr. Opin. Immunol. 18, 92–97 [DOI] [PubMed] [Google Scholar]
- 17.Muixí L., Carrascal M., Alvarez I., Daura X., Martí M., Armengol M. P., Pinilla C., Abian J., Pujol-Borrell R., Jaraquemada D. (2008) J. Immunol. 181, 795–807 [DOI] [PubMed] [Google Scholar]
- 18.Tomer Y., Greenberg D. A., Barbesino G., Concepcion E., Davies T. F. (2001) J. Clin. Endocrinol. Metab. 86, 1687–1693 [DOI] [PubMed] [Google Scholar]
- 19.Fu X. T., Karr R. W. (1994) Hum. Immunol. 39, 253–260 [DOI] [PubMed] [Google Scholar]
- 20.Hitzel C., van Endert P., Koch N. (1995) J. Immunol. 154, 1048–1056 [PubMed] [Google Scholar]
- 21.Posch P. E., Araujo H. A., Creswell K., Praud C., Johnson A. H., Hurley C. K. (1995) Hum. Immunol. 42, 61–71 [DOI] [PubMed] [Google Scholar]
- 22.Shackelford D. A., Lampson L. A., Strominger J. L. (1983) J. Immunol. 130, 289–296 [PubMed] [Google Scholar]
- 23.Gauthier L., Smith K. J., Pyrdol J., Kalandadze A., Strominger J. L., Wiley D. C., Wucherpfennig K. W. (1998) Proc. Natl. Acad. Sci. U.S.A. 95, 11828–11833 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Deng T., Karin M. (1993) Genes Dev. 7, 479–490 [DOI] [PubMed] [Google Scholar]
- 25.Frayser M., Sato A. K., Xu L., Stern L. J. (1999) Protein Expr. Purif. 15, 105–114 [DOI] [PubMed] [Google Scholar]
- 26.Stöckel J., Meinl E., Hahnel C., Malotka J., Seitz R., Drexler K., Wekerle H., Dornmair K. (1994) J. Biol. Chem. 269, 29571–29578 [PubMed] [Google Scholar]
- 27.Gohlke H., Case D. A. (2004) J. Comput. Chem. 25, 238–250 [DOI] [PubMed] [Google Scholar]
- 28.Brand O. J., Barrett J. C., Simmonds M. J., Newby P. R., McCabe C. J., Bruce C. K., Kysela B., Carr-Smith J. D., Brix T., Hunt P. J., Wiersinga W. M., Hegedüs L., Connell J., Wass J. A., Franklyn J. A., Weetman A. P., Heward J. M., Gough S. C. (2009) Hum. Mol. Genet. 18, 1704–1713 [DOI] [PubMed] [Google Scholar]
- 29.Friedrichs B., Tepel C., Reinheckel T., Deussing J., von Figura K., Herzog V., Peters C., Saftig P., Brix K. (2003) J. Clin. Invest. 111, 1733–1745 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Menconi F., Monti M. C., Greenberg D. A., Oashi T., Osman R., Davies T. F., Ban Y., Jacobson E. M., Concepcion E. S., Li C. W., Tomer Y. (2008) Proc. Natl. Acad. Sci. U.S.A. 105, 14034–14039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Chicz R. M., Urban R. G., Gorga J. C., Vignali D. A., Lane W. S., Strominger J. L. (1993) J. Exp. Med. 178, 27–47 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hunt D. F., Michel H., Dickinson T. A., Shabanowitz J., Cox A. L., Sakaguchi K., Appella E., Grey H. M., Sette A. (1992) Science 256, 1817–1820 [DOI] [PubMed] [Google Scholar]
- 33.Chicz R. M., Urban R. G., Lane W. S., Gorga J. C., Stern L. J., Vignali D. A., Strominger J. L. (1992) Nature 358, 764–768 [DOI] [PubMed] [Google Scholar]
- 34.Sant A. J., Chaves F. A., Jenks S. A., Richards K. A., Menges P., Weaver J. M., Lazarski C. A. (2005) Immunol. Rev. 207, 261–278 [DOI] [PubMed] [Google Scholar]
- 35.Jacobson E. M., Huber A., Tomer Y. (2008) J. Autoimmun. 30, 58–62 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Bottazzo G. F., Pujol-Borrell R., Hanafusa T., Feldmann M. (1983) Lancet 2, 1115–1119 [DOI] [PubMed] [Google Scholar]
- 37.Hanafusa T., Pujol-Borrell R., Chiovato L., Russell R. C., Doniach D., Bottazzo G. F. (1983) Lancet 2, 1111–1115 [DOI] [PubMed] [Google Scholar]
- 38.Jardetzky T. S., Brown J. H., Gorga J. C., Stern L. J., Urban R. G., Strominger J. L., Wiley D. C. (1996) Proc. Natl. Acad. Sci. U.S.A. 93, 734–738 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hammer J., Gallazzi F., Bono E., Karr R. W., Guenot J., Valsasnini P., Nagy Z. A., Sinigaglia F. (1995) J. Exp. Med. 181, 1847–1855 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Doebele R. C., Pashine A., Liu W., Zaller D. M., Belmares M., Busch R., Mellins E. D. (2003) J. Immunol. 170, 4683–4692 [DOI] [PubMed] [Google Scholar]
- 41.Lombardi G., Barber L., Sidhu S., Batchelor J. R., Lechler R. I. (1991) Int. Immunol. 3, 769–775 [DOI] [PubMed] [Google Scholar]
- 42.Raju R., Spack E. G., David C. S. (2001) J. Immunol. 167, 1118–1124 [DOI] [PubMed] [Google Scholar]
- 43.Lah T. T., Hawley M., Rock K. L., Goldberg A. L. (1995) FEBS Lett. 363, 85–89 [DOI] [PubMed] [Google Scholar]
- 44.Karras E., Yang H., Lymberi P., Christadoss P. (2005) J. Autoimmun. 24, 291–296 [DOI] [PubMed] [Google Scholar]
- 45.Flynn J. C., McCormick D. J., Brusic V., Wan Q., Panos J. C., Giraldo A. A., David C. S., Kong Y. C. (2004) Cell. Immunol. 229, 79–85 [DOI] [PubMed] [Google Scholar]
- 46.Maehr R., Mintern J. D., Herman A. E., Lennon-Duménil A. M., Mathis D., Benoist C., Ploegh H. L. (2005) J. Clin. Invest. 115, 2934–2943 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Gorga J. C., Horejsí V., Johnson D. R., Raghupathy R., Strominger J. L. (1987) J. Biol. Chem. 262, 16087–16094 [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.







