Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2011 Jun;53(6):1967–1976. doi: 10.1002/hep.24299

Electrostatic Modifications of the Human Leukocyte Antigen-DR P9 Peptide-Binding Pocket and Susceptibility to Primary Sclerosing Cholangitis

Johannes R Hov 1,2,3,4,*, Vasilis Kosmoliaptsis 5,6,*, James A Traherne 7, Marita Olsson 8, Kirsten M Boberg 1, Annika Bergquist 9, Erik Schrumpf 1,4, J Andrew Bradley 6, Craig J Taylor 5, Benedicte A Lie 3, John Trowsdale 7, Tom H Karlsen 1
PMCID: PMC3128712  PMID: 21413052

Abstract

The strongest genetic risk factors for primary sclerosing cholangitis (PSC) are found in the human leukocyte antigen (HLA) complex at chromosome 6p21. Genes in the HLA class II region encode molecules that present antigen to T lymphocytes. Polymorphisms in these genes are associated with most autoimmune diseases, most likely because they contribute to the specificity of immune responses. The aim of this study was to analyze the structure and electrostatic properties of the peptide-binding groove of HLA-DR in relation to PSC. Thus, four-digit resolution HLA-DRB1 genotyping was performed in 356 PSC patients and 366 healthy controls. Sequence information was used to assign which amino acids were encoded at all polymorphic positions. In stepwise logistic regressions, variations at residues 37 and 86 were independently associated with PSC (P = 1.2 × 10−32 and P = 1.8 × 10−22 in single-residue models, respectively). Three-dimensional modeling was performed to explore the effect of these key residues on the HLA-DR molecule. This analysis indicated that residue 37 was a major determinant of the electrostatic properties of pocket P9 of the peptide-binding groove. Asparagine at residue 37, which was associated with PSC, induced a positive charge in pocket P9. Tyrosine, which protected against PSC, induced a negative charge in this pocket. Consistent with the statistical observations, variation at residue 86 also indirectly influenced the electrostatic properties of this pocket. DRB1*13:01, which was PSC-associated, had a positive P9 pocket and DRB1*13:02, protective against PSC, had a negative P9 pocket. Conclusion: The results suggest that in patients with PSC, residues 37 and 86 of the HLA-DRβ chain critically influence the electrostatic properties of pocket P9 and thereby the range of peptides presented. (Hepatology 2011;53:1967-1976)


Genetic predisposition influences the development of primary sclerosing cholangitis (PSC).1 The co-occurrence of inflammatory bowel disease and classical autoimmune diseases in patients with PSC suggests that loss of immune tolerance contributes to the pathogenesis. The strongest genetic risk factors in PSC are found in the human leukocyte antigen (HLA) complex at chromosome 6p21.2 Many of the genes in this region are immune-related, and variants tend to be inherited together on extended haplotypes, i.e., they are in strong linkage disequilibrium (LD). Deciphering the contribution of the various genes in the region is a major challenge in disease genetics.

As for most HLA-associated diseases, a multitude of HLA class I and class II gene associations have been reported in PSC, most consistently for alleles that are components of the extended ancestral haplotypes AH8.1 (i.e., HLA-B*08-DRB1*03 [serological DR3]) and AH7.1 (i.e., HLA-B*07-DRB1*15 [serological DR2]), along with various less conserved HLA class II haplotypes, namely, DRB1*13:01, DRB1*04, and DRB1*07.3-6 In genome-wide association studies,2,7 strong associations near HLA-C, HLA-B, and MICA suggest a role for these loci in modifying PSC risk. The mechanism could involve an effect of alleles carried by the AH8.1 and AH7.1 haplotypes on the activation level of natural killer cells and T cells.8-11 However, associations detected for HLA class II haplotypes appear to have a significant influence on PSC, in addition to the effect of HLA class I.7

The class II genes encode heterodimers consisting of an α and a β chain (e.g., the HLA-DR molecule is encoded by HLA-DRA and HLA-DRB1) which present peptides to CD4-positive T cells. The sequences encoded by the second exon of class II genes determine the properties of the peptide-binding groove. In several autoimmune diseases HLA class II associations have been attributed to particular amino acids in the molecule that critically determine the binding of disease-specific antigen(s). One example is the protective effect in type 1 diabetes of HLA-DQβ1 chains with aspartic acid in residue 57,12 which induces distinct characteristics of the peptide-binding groove of the HLA-DQ molecule.13 Determination of the structural and electrostatic properties of the molecules associated with disease may help in identifying the disease mechanism. In primary biliary cirrhosis and autoimmune hepatitis, specific residues have been suggested to explain associations with HLA-DRB1 alleles.14,15 In PSC, an association with leucine in residue 38 of the HLA-DRβ chain was proposed by Farrant et al.,16 whereas a later study considered residues 55 and 87 of the HLA-DQβ chain as more likely candidates.3 A consistent peptide-binding motif for the class II molecules associated with PSC has not been defined, and no attempts have been made to model how specific amino acids affect the structure and the electrostatic properties of the peptide-binding groove.

The portal inflammation in PSC livers is dominated by T cells, which seem to exhibit a restricted T-cell receptor repertoire.17 It would be of importance to identify characteristics of the HLA molecules that determine the specificity of these T-cell responses. Strong LD in the HLA class II region makes it difficult to determine at the genetic level which loci are most relevant. However, several minor observations suggest that HLA-DRB1 could be the determinant of PSC risk; (1) The HLA-DQA1 and DQB1 alleles encoded on the AH8.1 haplotype are associated with PSC only on this haplotype and not when encoded on different haplotypes.4 (2) The protective DRB1*04 haplotypes may carry different DQB1 alleles.4 (3) A recent study in African-Americans confirms the association with DR13,18 which in Northern Europe forms the DRB1*13:01-DQB1*06:03 haplotype,16 whereas in African-Americans both DRB1*13:01-DQB1*06:03 and DRB1*13:01-DQB1*05:02 are common haplotypes.19 The HLA-DRB1 association is also more consistent than the association with the closely related (paralogous) HLA-DRB3 gene; e.g., PSC-associated HLA-DRB1*13:01 haplotypes may carry either the HLA-DRB3*01:01 or DRB3*02:02 alleles.4 Given this background we aimed to explore how HLA-DRB1 variation affects the molecular characteristics of HLA-DR and susceptibility to PSC.

Materials and Methods

Subjects

Scandinavian PSC patients (n = 356, Table 1) were recruited from Oslo University Hospital, Rikshospitalet, Oslo, Norway, and Karolinska University, Hospital Huddinge, Stockholm, Sweden. Diagnosis of PSC was based on accepted criteria with typical cholangiographic appearance. Ethnically and gender-matched healthy controls (n = 366) were randomly selected from the Norwegian Bone Marrow Registry. All participants gave informed consent. The study was approved by the Regional Committee for Research Ethics in South-Eastern Norway and the Ethics Committee of Karolinska Institutet.

Table 1.

Characteristics of Included Individuals

PSC Healthy Controls
N 356* 366
Male, n (%) 254 (71) 256 (70)
Age at diagnostic cholangiography, years, median (range) 36 (12-75)
Concomitant inflammatory bowel disease, n (%) 290 (82)
Ulcerative colitis/Crohn's disease/indeterminate, % 81/12/7
Cholangiocarcinoma, n (%) 50 (14)
Endpoint (tx or death), n (%) 210 (59)
Follow-up, years, median (range) 10 (0-34)
*

Norway n = 230, Sweden n = 126.

Missing information about the intestine in three patients.

HLA-DRB1 Data

Four-digit HLA-DRB1 genotypes were available from a previous study.20 Peptide sequences of all HLA-DRB1 alleles in IMGT/HLA database release 2.23 (October 2008) were aligned, and each individual was assigned two amino acids (one encoded by each chromosome) for each polymorphic residue.

Statistical Methods

Stepwise logistic regressions were performed in the statistical package R v2.10.0 (http://www.r-project.org/) assuming an “allele dosage” model, entering the count of all amino acids at a given residue as covariates. A model with all observed combinations of amino acids (“genotypes”) at a given residue entered as covariates was applied to control the validity of the model. Some combinations of amino acids were rare and after testing several criteria, combinations with a frequency of n < 2 in cases or controls at a given residue were grouped in order to avoid empty cells. In both models the reference was randomly chosen, thus no assumptions were made on which amino acid or pair of amino acids constituted high or low risk. Comparisons of allele and carrier frequencies were performed in Microsoft Excel (Redmond, WA) and PASW v. 18 (SPSS, Chicago, IL). P < 0.05 was considered statistically significant. P-values of novel HLA-DRB1 allele associations were Bonferroni corrected according to the number of alleles present in the dataset (n = 32).

3D Protein Structure Modeling of HLA-DR Molecules

The atomic coordinates of the most common HLA-DR molecules were determined using comparative protein structure modeling by satisfaction of spatial restraints as implemented in the MODELLER computer algorithm.21 HLA-DR proteins of known structure suitable as modeling templates were identified in the Protein Data Bank (PDB; http://www.rcsb.org/pdb/) and evaluated for structural quality. Accordingly, seven structures were selected as templates (PDB entries: 1KLU, 2G9H, 1D5Z, 1D5M, 2Q6W, 1PYW, and 2IPK). The amino acid sequences of the target HLA-DR molecules were obtained from the IMGT/HLA database. Multiple sequence alignments were performed with CLUSTAL_X v.1.8322 and manually corrected when indicated. The alignment files were then used as input to the MODELLER program. In brief, MODELLER generates the 3D atomic coordinates of the target sequences by satisfying spatial restraints, obtained from the templates, and by CHARMM23 energy terms enforcing proper stereochemistry. Optimization is then carried out by employing methods of conjugate gradients and molecular dynamics with simulated annealing.24 All calculations were performed in the absence of antigenic peptides to enable direct comparison of the structural and physiochemical characteristics of the peptide-binding groove among different molecules. The stereochemical quality of the modeled structures was verified using the PROCHECK25 and WHAT_CHECK26 algorithms and by assessment of Ramachandran plots. In addition, the structures were examined for protein folding quality using empirical energy potentials as implemented in the ProSA algorithm.27 Modeled coordinate sets are available upon request.

Electrostatic Potential Calculations

The electrostatic potential around the 3D structures was computed by numerically solving the Poisson Boltzmann equation using the finite difference method implemented in the DelPhi program within Discovery Studio 2.1 (Accelrys, San Diego, CA). Essential hydrogens were added to the structures. To determine the protonation state of titratable amino acid side chains the titration curves and residue pKa were calculated for each molecule (dielectric constant of 10 for the protein interior and 80 for the solvent) and titratable residues were protonated at a pH of 7.4. The protonated protein molecule was subsequently used to compute the electrostatic potential. The low dielectric protein interior (dielectric constant of 2) was embedded in a high dielectric continuum environment (water exterior, dielectric constant of 80). A solution with charged ions was simulated with an assigned ionic strength of 0.145, typical of the conditions at a pH of 7.4. The dielectric boundary between the protein and the solvent was defined by calculating the solvent-accessible surface generated by a rolling probe sphere of 1.4 Å radius. Atomic radii and partial atomic charges were taken from the CHARMM parameter set.23 An ion exclusion layer (Stern layer) for the solvent ions was defined around the solvent-accessible surface using an ionic radius of 2 Å. The layer has an ionic strength of 0.0 and determines the maximum distance that an ion can approach the solvent-accessible surface. The system was mapped into a 3D cubical grid and the electrostatic potential at each grid point was calculated iteratively starting from the Debye-Hückel boundary conditions. The accuracy of the calculations was improved by using a method of grid focusing; in the first run the coarse grid was allowed to be filled by 50% by solute and the calculated grid point potentials were used in the second run where the fine grid was filled by solute by 90%. The grid dimensions were set at 251 grid points per axis (spacing 0.3 Å / grid point). The solvent accessible surface was colored according to its calculated electrostatic potential and visualized using the Discovery Studio interface.

Results

Statistical Modeling: Identification of Position 37 and 86 in the DRβ1 Chain as PSC-Associated Residues

The amino acid sequence encoded by exon two of HLA-DRB1 was determined from the genotypes of each individual. Thirty residues were polymorphic, i.e., two or more different amino acids were observed at these positions. In the first step, a logistic regression was performed for each polymorphic residue. The counts (0, 1, 2) of the observed amino acids were included as covariates and the overall effect of the residue was tested with a likelihood ratio test. The strongest PSC associations were detected for residue 37 (P = 1.2 × 10−32, Table 2). In a second step, two-residue models were fitted containing the amino acid covariates of both the investigated residue and residue 37 and compared with the single-residue model of residue 37. The only residue that remained strongly associated with PSC in these two-residue models was 86 (Table 2). When performing a similar two-residue test for additional effects on top of 86, several residues (i.e., also residue 37) were found to contribute significantly (Table 2). No other residues showed significant disease association when included in three-residue models with residues 37 and 86 (Table 2).

Table 2.

Association Analyses Between Amino Acid Variation in the HLA-DRβ1 Chain and PSC*

Single-Residue LR Two-Residue LR Three-Residue LR



Basic Model: - Basic Model: Residue 37 Basic Model: Residue 86 Basic Model: Residues 37+86




Residue Observed Amino Acids P-value Rank P-value Rank P-value Rank P-value
9 Glu, Lys,Trp 0.051 27 0.59 22 0.0086 20 0.29
10 Glu,Gln,Tyr 2.4 × 10−11 14 0.54 21 0.0017 17 0.48
11 Asp,Gly,Leu,Pro,Ser,Val 1.2 × 10−18 8 0.23 15 1.7 × 10−8 3 0.43
12 Lys,Thr 2.7 × 10−12 12 0.64 24 0.00046 14 0.74
13 Phe,Gly,His,Arg,Ser,Tyr 9.6 × 10−25 2 0.22 14 2.1 × 10−11 2 0.25
14 Glu,Lys 0.0014 20 0.36 17 0.74 28 0.83
16 His,Tyr 3.2 × 10−5 17 0.88 29 0.00053 15 0.78
25 Gln,Arg 0.0014 21 0.36 18 0.74 29 0.83
26 Phe,Leu,Tyr 2.6 × 10−19 6 0.024 2 3.9 × 10−8 4 0.23
28 Asp,Glu,His 2.8 × 10−5 16 0.13 8 0.0053 19 0.10
30 Cys,Gly,His Leu,Arg,Tyr 0.00012 18 0.12 7 2.0 × 10−5 11 0.20
31 Phe,Ile,Val 0.31 29 0.078 5 0.0046 18 0.12
32 His,Tyr 1.2 × 10−21 4 0.38 19 6.6 × 10−8 6 0.64
33 His,Asn 1.4 × 10−15 11 0.40 20 5.0 × 10−8 5 0.31
37 Phe,Leu,Asn,Ser,Tyr 1.2 × 10−32 1 3.1 × 10−16 1
38 Ala,Leu,Val 0.029 26 0.17 11 0.0014 16 0.14
40 Phe,Tyr 0.77 30 0.17 12 0.52 27 0.14
47 Phe,Tyr 2.4 × 10−21 5 0.11 6 0.00024 13 0.29
57 Ala,Asp,Ser,Val 2.6 × 10−5 15 0.78 26 0.16 22 0.44
58 Ala,Glu 0.00055 19 0.85 28 0.035 21 0.84
60 His,Ser,Tyr 0.0046 24 0.59 23 0.28 24 0.29
67 Phe,Ile,Leu 0.0018 22 0.80 27 0.45 25 0.22
70 Asp,Gln,Arg 0.063 28 0.16 10 0.49 26 0.32
71 Ala,Glu,Lys,Arg 1.4 × 10−15 10 0.043 3 5.4 × 10−6 8 0.32
73 Ala,Gly 5.5 × 10−12 13 0.14 9 1.7 × 10−5 10 0.34
74 Ala,Glu,Leu,Gln,Arg 1.7 × 10−18 9 0.25 16 9.4 × 10−6 9 0.59
77 Asn,Thr 5.8 × 10−19 7 0.060 4 4.8 × 10−7 7 0.34
78 Val,Tyr 0.023 25 0.77 25 0.25 23 0.20
85 Ala,Val 0.0037 23 0.21 13 7.2 × 10−5 12 0.13
86 Gly,Val 1.8 × 10−22 3 2.0 × 10−5 1
*

Stepwise logistic regressions were performed assuming an “allele dosage” effect.

P-values of likelihood ratio tests of whether residue n improves the logistic regression model when added to a model with one other residue (37 or 86).

P-values of likelihood ratio tests of whether residue n improves the model when added to a model with both residues 37 and residue 86.

LR: likelihood ratio; Rank: residue rank according to P-value, lowest P-value is highlighted in bold.

In the logistic models used above the effect of a single amino acid was assumed to be additive on the log-scale: The log-odds ratio of having PSC given two copies of the amino acid is two times the log-odds ratio when having one copy. The advantage with this model is that it keeps the number of covariates to a minimum, leading to more powerful tests as long as the model assumptions are approximately true. In order to confirm the results obtained with this model, we also performed regressions where we allowed each observed combination (“genotype”) of amino acids to have a potential effect. In these “genotype” model analyses, residue 37 remained the most significantly PSC-associated residue (P = 6.9 × 10−32, Table 3), with an independent contribution from residue 86 still observed (P = 1.2 × 10−5). Several other residues contributed on top of residues 37 or 86 in two-residue models, as well as in three-residue models with both residues 37 and 86 included (Table 3). When inspecting the distribution of amino acid combinations (“genotypes”) in the dataset, it became apparent that the extra associated residues 26, 70, 71, 73, 74, and 77 (Table 3) reflected a large number of patients homozygous for HLA-DRB1*03:01 (n = 62 patients versus n = 3 healthy controls), meaning that it was not possible to determine the part of HLA-DRB1*03:01 that confers this additional risk.

3.

Summary of Association Analyses Between Amino Acid Variation and PSC Assuming a “Genotype” Model*

Single-Residue LR Two-Residue LR, Three-Residue LR



Basic Model: - Basic Model: Residue 26 Basic Model: Residue 37 Basic Model: Residues 77 Basic Model: Residues 86 Basic Model: Residues 37 + 86






Residue P-value P-value P-value P-value P-value P-value
26 6.2×10−20 4.2×10−7 0.19 1.3×10−8 4.1×10−5
37 6.9×10−32 3.3×10−19 1.5×10−19 4.8×10−17
70 0.013 0.23 0.023 0.083 0.055 0.036
71 1.6×10−17 3.8×10−14 0.00034 3.4×10−13 7.1×10−8 0.014
73 1.1×10−12 0.15 0.00075 0.17 8.1×10−6 0.0038
74 1.6×10−18 0.14 6.0×10−5 0.096 1.3×10−6 0.00093
77 7.3×10−21 0.80 8.5×10−8 7.8×10−9 3.4×10−6
86 2.0×10−21 1.3×10−9 1.2×10−5 2.2×10−9
*

Stepwise logistic regressions were performed assuming a “genotype” effect, entering all different pairs (combinations) of amino acids at a given residue. Only residues contributing significantly to the model when added to residue 37 and 86 in the three-residue regressions are shown.

P-values of likelihood ratio tests of whether residue n improves the logistic regression model when added to a model with one other residue (26, 37, 77, or 86).

P-values of likelihood ratio tests of whether residue n improves the model when added to a model with both residues 37 and residue 86.

LR: likelihood ratio.

In conclusion, residues 37 and 86 were consistent determinants of PSC susceptibility irrespective of statistical model, whereas it was difficult to exclude additional risk associated with other parts of the β chain encoded by HLA-DRB1*03:01.

Residue 37 Influences the Electrostatic Properties of Pocket P9 of HLA-DR

The amino acid frequencies at residues 37 (Table 4) showed that the highest and lowest risks of PSC were observed for carriers of asparagine (Asn37) (odds ratio [OR] = 5.7, 95% confidence interval [CI] 4.0-8.0) and tyrosine (Tyr37) (OR = 0.25, 95% CI 0.18-0.34), respectively.

Table 4.

Frequencies of Different Amino Acids at HLA-DRβ1 Residues 37 and 86

Allele Frequency, n (%) Carrier Frequency, n (%)


Residue Amino Acid PSC Healthy Controls PSC Healthy Controls
37 Asparagine (Asn) 398 (56) 199 (27) 292 (82) 163 (45)
Leucine (Leu) 4 (1) 16 (2) 4 (1) 15 (4)
Phenylalanine (Phe) 35 (5) 67 (9) 34 (10) 62 (17)
Serine (Ser) 192 (27) 208 (28) 162 (46) 175 (48)
Tyrosine (Tyr) 83 (12) 242 (33) 80 (22) 197 (54)
86 Glycine (Gly) 185 (26) 368 (50) 164 (46) 284 (78)
Valine (Val) 527 (74) 364 (50) 335 (94) 282 (77)

The specificity of the peptide-binding groove on an HLA class II molecule is governed by the properties of pockets in the groove that accommodate the amino acid side chains of the bound peptide, typically pockets for peptide residues 1 (pocket P1), 4, 6, and 9. Residue 37 of the HLA-DRβ1 chain is integral to pocket P9.28 Figure 1 shows the structural and electrostatic characteristics of pocket P9 on representative HLA-DR molecules. Significantly, HLA-DR carrying the risk residue Asn37 in the β chain (e.g., HLA-DRB1*03:01, *09:01, *13:01, *14:02; Fig. 1B) formed P9 pockets with similar structural architecture and consistently positive surface electrostatic potential (the only exception was HLA-DRB1*13:02, further discussed below). In contrast, HLA-DR molecules expressing the protective Tyr37 residue in the β chain (e.g., HLA-DRB1*04:01, *10:01, *11:01, *03:25; Fig. 1C) formed P9 pockets with consistently negative electrostatic potential. The distinct P9 pocket electrostatic patterns were conserved both among molecules that differed at several amino acid sequence positions and between structures where residue 37 constitutes the only disparity (e.g., HLA-DRB1*03:01 and -DRB1*03:25). Interestingly, a database search for peptides eluted from HLA-DR molecules showed that the presence of Asn37 restricted the amino acid preferences at position 9 (e.g., only tyrosine, leucine, and phenylalanine are defined as P9 anchors in HLA-DRB1*0301), whereas most amino acids may be P9 anchors in HLA-DRB1*0401 which carries Tyr37 (http://www.syfpeithi.de).29

Fig. 1.

Fig. 1

Structure and molecular surface electrostatic potential of pocket P9. (A) The structure and electrostatic potential of HLA-DRB1*03:01. The area within the frame is depicted in expanded form in (B,C). All structures were superimposed on HLA-DRB1*03:01 and therefore show the same view. HLA-DR carrying the risk residue Asn37 in the β chain had P9 pockets (arrows) with positive charge (B), whereas molecules expressing Tyr37 had P9 pockets (arrows) with consistently negative charge (C). Potentials less than −5 kT/e are colored red, those greater than 5 kT/e blue, and neutral potentials (0 kT/e) are colored white. Linear interpolation was used to produce the color for surface potentials between these values.

Residue 86 Defines Opposite Effects of HLA-DRB1*13:01 and *13:02 on PSC Risk

At the dimorphic residue 86, the highest risk was observed for carriers of valine (Val86) (OR = 4.8, 95% CI 2.9-7.9), whereas glycine (Gly86) appeared protective (OR = 0.25, 95% CI 0.18-0.34). Residue 86 of the HLA-DRβ1 chain is integral to pocket P1.28 In contrast to pocket P9, modeling of the P1 pocket of several HLA-DR molecules showed that the glycine/valine dimorphism at residue 86 had a minimal physiochemical effect. The majority of HLA-DR molecules examined had P1 pockets with an overall neutral charge (Fig. 2). Even though a steric effect (i.e., an effect on the volume of the pocket) imposed by the side chain of Val86 cannot be excluded, the results of the present analysis argue against a significant role of residue 86 on the choice of peptide residue accommodated by pocket P1.

Fig. 2.

Fig. 2

Structure and molecular surface electrostatic potential of pocket P1. (A) The structure and electrostatic potential of HLA-DRB1*03:01. The area within the frame is depicted in expanded form in (B). All structures were superimposed on HLA-DRB1*03:01 and therefore show the same view. Structural modeling and calculation of the electrostatic potential at the P1 pocket (arrows) of representative HLA-DR molecules showed that the Gly/Val dimorphism at position 86 had a minimal physiochemical effect (B). The majority of HLA-DR molecules examined had P1 pockets with an overall neutral charge. HLA-DRB1*03:01 and -DRB1*14:01 express Val86 whereas -DRB1*01:01 and -DRB1*04:01 express Gly86. Potentials less than −5 kT/e are colored red, those greater than 5 kT/e blue, and neutral potentials (0 kT/e) are colored white. Linear interpolation was used to produce the color for surface potentials between these values.

Further analysis, however, led to an interesting observation. As mentioned above, HLA-DR molecules expressing the risk residue Asn37 in their β chain possess electropositive P9 pockets, with the exception of HLA-DRB1*13:02 where an electronegative P9 pocket was observed (Fig. 3). Notably, when looking at the allele frequencies, HLA-DRB1*13:02, as opposed to other Asn37 encoding alleles (like the established PSC risk allele HLA-DRB1*13:01), was more frequent in healthy controls than in PSC patients (Pcorrected = 0.040, Table 5), suggesting that HLA-DRB1*13:02 may protect against PSC. This statistical observation is therefore in agreement with the protective effect associated with HLA-DR molecules expressing electronegative P9 pockets, as shown above for Tyr37 encoding alleles. Intriguingly, HLA-DRB1*13:02 and DRB1*13:01 have otherwise overall similar structural architecture and electrostatic properties (Fig. 3) with the main disparity observed at pocket P9. Because the only amino acid sequence difference between these alleles is at position 86 it may be suggested that the Gly86Val substitution may influence the choice of presented peptides through long-range electrostatic modification of pocket P9.

Fig. 3.

Fig. 3

Electrostatic modification of pocket P9 by the Gly86-to-Val86 substitution. The structure and electrostatic potential of the peptide-binding groove is shown for (structures were superimposed) HLA-DRB1*13:01 (left figures) and -DRB1*13:02 (right figures); these molecules have a single amino acid sequence difference at position 86 (Val86 and Gly86, respectively). HLA-DRB1*13:02 has an electronegative pocket P9 despite the presence of Asn at position 37, suggesting a long-range effect of the Val86-to-Gly86 substitution. The molecular surface is colored according to the calculated electrostatic potential, as for Figs. 1 and 2.

Table 5.

Frequency of HLA-DRB1Alleles Encoding Asn37

PSC (2n=712) Healthy Controls (2n=732)


Allele n (%) n (%) OR (95%CI)* P-value Residue 86
03:01 254 (36) 106 (14) 3.3 (2.5-4.2) 1.3 × 10−20 Val
09:01 12 (2) 6 (1) 2.0 (0.8-5.0) 0.14 Gly
13:01 117 (17) 47 (6) 2.8 (2.0-4.1) 2.0 × 10−9 Val
13:02 15 (2) 39 (5) 0.4 (0.2-0.7) 0.0013 Gly
14:02 0 (0) 1 (0) 0.3 (0.0-3.8) 1.0 Gly
*

Calculated with Woolf's formula with Haldane's correction.

Not corrected. Calculated with chi-square tests or Fisher's exact test where appropriate.

Pcorrected = 0.040.

OR = odds ratio. CI = confidence interval.

Discussion

By exploring variation in the amino acid sequence of the HLA-DRβ1 chain in PSC, we show that residues 37 and 86 distinguish disease susceptibility alleles and protective alleles. Investigations into the HLA-DR molecular structure revealed that the electrostatic properties of pocket P9 are determined by residue 37 and, indirectly by residue 86, suggesting that the P9 pocket is crucial for PSC risk.

In the HLA-DR molecule, residue 37 of the β chain appeared to be a key determinant of the electrostatic properties of pocket P9, which may be related to disease risk. The situation is reminiscent of type 1 diabetes, where amino acids at residue 57 of the HLA-DQ β chain associated with disease risk contribute to a larger volume of pocket P9 and a positive charge, allowing, e.g., glutamate residues from insulin peptides at position 9.30 In HLA-DR, Asn37 would restrict the range of amino acids at anchor position 9 of the peptide, and thereby which peptides may be presented. This is supported by data from peptide elusion experiments, where HLA-DR molecules with Asn37 and Tyr37 exhibit different ranges of amino acids at P9.29 Direct experimental observations focusing on pocket P9 variation and T-cell responses are scarce, but it has been shown that modification of only residue 37 (on DR4 molecules) is sufficient to alter recognition by the T-cell receptor, e.g., by neutralizing the T-cell-activating potential of the peptide-DR-complex.31,32 It should therefore be considered highly likely that characteristics of pocket P9 of the HLA-DR molecule facilitate particular immune responses.

Pocket P1 of HLA-DR was found to have an overall neutral electrostatic potential in the present study irrespective of whether glycine or valine was present at position 86. This fits with the observation that this pocket has a preference for hydrophobic amino acid side chains, and that the range of amino acids in position 1 of presented peptides is largely overlapping.33,34 However, pocket P1 with Gly86 in the β chain (e.g., as encoded by HLA-DRB1*13:02) has a tendency to accept larger (aromatic) side chains than when Val86 is present (e.g., encoded by DRB1*13:01); this has been attributed to the lack of a side chain on glycine allowing for a larger pocket volume.33,34 A more remarkable difference between HLA-DRB1*13:01 and DRB1*13:02 encoded HLA-DR molecules was that the amino acid substitution at residue 86 affected the electrostatic properties of pocket P9, in another part of the molecule. HLA-DRB1*13:02 was the only allele which contributed to a HLA-DR molecule with a negative pocket P9 with asparagine at position 37 of the β chain. Intriguingly, this allele exhibited a significantly reduced frequency in PSC patients. Taken together, our findings suggest that the association of residues 37 and 86 with PSC primarily reflects the properties of pocket P9.

Although HLA-DRB1*13:01 is a well-established PSC risk allele,4 this study is the first identifying HLA-DRB1*13:02 as a protective allele. This observation was significant even when correcting for multiple comparisons. Interestingly, similar contrasting effects have been observed in autoimmune hepatitis in Latin America, where risk is associated with HLA-DRB1*13:01 and protection with DRB1*13:02.15 HLA-DRB1*13:01 has also been associated with a protracted course of hepatitis A virus infection, which has been postulated to be a trigger of autoimmune hepatitis.35 To what extent these parallel observations are relevant for the specificity of the immune response in PSC can currently only be speculated.

Given the complexity of the HLA associations in diseases such as PSC, it is not unlikely that other alleles besides the most strongly associated ones modify the disease risk. Two previous studies of HLA-DR in PSC evaluated selected residues encoded by haplotypes associated with disease,3,16 and suggested that the presence of leucine at position 38 (Leu38) of the β chain may confer risk. Leu38 is rarely present in DRβ1 (most often encoded by DRB1*12 alleles). An explanation for the conflicting results is that the previous studies included alleles at both the HLA-DRB1 locus as well as those at other, paralogous, HLA-DRB loci. Several HLA haplotypes carry a second HLA-DRB gene besides HLA-DRB1, e.g., HLA-DRB1*03:01 and *13:01 haplotypes typically also carry an allele encoded by HLA-DRB3; DRB1*04 and *07:01 carry an allele encoded by HLA-DRB4, and the DRB1*15:01 haplotype carries an allele encoded by HLA-DRB5. These β chains couple with DRα and also have a role in antigen presentation.36 They are generally observed at several-fold lower expression levels than DRβ1.37,38 However, in diseases where the second DRB gene has been shown to be of actual relevance, the association seems to be specific to the gene in question and not due to shared sequence motifs with DRB1.3941 These facts, along with the more consistent PSC associations with HLA-DRB1 rather than HLA-DRB3,4 make it likely that the present focus on DRB1 is valid, even though an effect of other DRB loci cannot be formally ruled out at this stage.

Given the LD in the HLA complex, we cannot exclude the possibility that causal variants at other loci may be associated with the distribution of amino acids observed at given positions in HLA-DRβ1. The strong LD is particularly important in relation to the neighboring HLA-DQ genes and HLA-DRB paralogs, but it is also difficult to formally exclude an association with the nearby BTNL2 gene, which has been associated with inflammatory bowel disease,42 or even genetic variants further away. When applying a “genotype” model, in addition to residue 37 and 86 we could not exclude a residual association that could be attributed to being homozygous for HLA-DRB1*03:01. This may be speculated to relate to effects of a recessive variant outside HLA-DRB1,4 potentially related to the AH8.1 haplotype which is associated with multiple autoimmune diseases and probably contains several genetic variants in strong LD contributing to disease.43

In conclusion, this study shows that variation in PSC associated residues encoded by HLA-DRB1 impose distinct structural and physiochemical characteristics on the HLA-DR peptide-binding groove, suggesting that PSC risk molecules likely present a restricted peptide repertoire. The findings are highly relevant for and important to evaluate in future experimental studies of antigen presentation in PSC. The amino acid sequence and structural observations did not apply uniformly to all PSC patients, suggesting multiple pathogenetic mechanisms, as might be expected for a disease with the clinical heterogeneity observed in PSC.

Acknowledgments

We thank Bente Woldseth and Hege Dahlen Sollid for expert technical assistance and the Norwegian Bone Marrow Donor Registry (NORDONOR) at Oslo University Hospital Rikshospitalet for contributing healthy controls.

Abbreviations

AH

ancestral haplotype

HLA

human leukocyte antigen

LD

linkage disequilibrium

PSC

primary sclerosing cholangitis

References

  • 1.Bergquist A, Montgomery SM, Bahmanyar S, Olsson R, Danielsson A, Lindgren S, et al. Increased risk of primary sclerosing cholangitis and ulcerative colitis in first-degree relatives of patients with primary sclerosing cholangitis. Clin Gastroenterol Hepatol. 2008;6:939–943. doi: 10.1016/j.cgh.2008.03.016. [DOI] [PubMed] [Google Scholar]
  • 2.Karlsen TH, Franke A, Melum E, Kaser A, Hov JR, Balschun T, et al. Genome-wide association analysis in primary sclerosing cholangitis. Gastroenterology. 2010;138:1102–1111. doi: 10.1053/j.gastro.2009.11.046. [DOI] [PubMed] [Google Scholar]
  • 3.Donaldson PT, Norris S. Evaluation of the role of MHC class II alleles, haplotypes and selected amino acid sequences in primary sclerosing cholangitis. Autoimmunity. 2002;35:555–564. doi: 10.1080/0891693021000054093. [DOI] [PubMed] [Google Scholar]
  • 4.Spurkland A, Saarinen S, Boberg KM, Mitchell S, Broome U, Caballeria L, et al. HLA class II haplotypes in primary sclerosing cholangitis patients from five European populations. Tissue Antigens. 1999;53:459–469. doi: 10.1034/j.1399-0039.1999.530502.x. [DOI] [PubMed] [Google Scholar]
  • 5.Schrumpf E, Fausa O, Forre O, Dobloug JH, Ritland S, Thorsby E. HLA antigens and immunoregulatory T cells in ulcerative colitis associated with hepatobiliary disease. Scand J Gastroenterol. 1982;17:187–191. doi: 10.3109/00365528209182038. [DOI] [PubMed] [Google Scholar]
  • 6.Karlsen TH, Schrumpf E, Boberg KM. Genetic epidemiology of primary sclerosing cholangitis. World J Gastroenterol. 2007;13:5421–5431. doi: 10.3748/wjg.v13.i41.5421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Melum E, Franke A, Schramm C, Weismuller TJ, Gotthardt DN, Offner FA, et al. Genome-wide association analysis in primary sclerosing cholangitis identifies two non-HLA susceptibility loci. Nat Genet. 2011;43:17–19. doi: 10.1038/ng.728. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Karlsen TH, Boberg KM, Olsson M, Sun JY, Senitzer D, Bergquist A, et al. Particular genetic variants of ligands for natural killer cell receptors may contribute to the HLA associated risk of primary sclerosing cholangitis. J Hepatol. 2007;46:899–906. doi: 10.1016/j.jhep.2007.01.032. [DOI] [PubMed] [Google Scholar]
  • 9.Hov JR, Lleo A, Selmi C, Woldseth B, Fabris L, Strazzabosco M, et al. Genetic associations in Italian primary sclerosing cholangitis: heterogeneity across Europe defines a critical role for HLA-C. J Hepatol. 2010;52:712–717. doi: 10.1016/j.jhep.2009.11.029. [DOI] [PubMed] [Google Scholar]
  • 10.Norris S, Kondeatis E, Collins R, Satsangi J, Clare M, Chapman R, et al. Mapping MHC-encoded susceptibility and resistance in primary sclerosing cholangitis: the role of MICA polymorphism. Gastroenterology. 2001;120:1475–1482. doi: 10.1053/gast.2001.24041. [DOI] [PubMed] [Google Scholar]
  • 11.Wiencke K, Spurkland A, Schrumpf E, Boberg KM. Primary sclerosing cholangitis is associated to an extended B8-DR3 haplotype including particular MICA and MICB alleles. Hepatology. 2001;34:625–630. doi: 10.1053/jhep.2001.27543. [DOI] [PubMed] [Google Scholar]
  • 12.Todd JA, Bell JI, McDevitt HO. HLA-DQ beta gene contributes to susceptibility and resistance to insulin-dependent diabetes mellitus. Nature. 1987;329:599–604. doi: 10.1038/329599a0. [DOI] [PubMed] [Google Scholar]
  • 13.Lee KH, Wucherpfennig KW, Wiley DC. Structure of a human insulin peptide-HLA-DQ8 complex and susceptibility to type 1 diabetes. Nat Immunol. 2001;2:501–507. doi: 10.1038/88694. [DOI] [PubMed] [Google Scholar]
  • 14.Donaldson PT, Baragiotta A, Heneghan MA, Floreani A, Venturi C, Underhill JA, et al. HLA class II alleles, genotypes, haplotypes, and amino acids in primary biliary cirrhosis: a large-scale study. Hepatology. 2006;44:667–674. doi: 10.1002/hep.21316. [DOI] [PubMed] [Google Scholar]
  • 15.Duarte-Rey C, Pardo AL, Rodriguez-Velosa Y, Mantilla RD, Anaya JM, Rojas-Villarraga A. HLA class II association with autoimmune hepatitis in Latin America: a meta-analysis. Autoimmun Rev. 2009;8:325–331. doi: 10.1016/j.autrev.2008.11.005. [DOI] [PubMed] [Google Scholar]
  • 16.Farrant JM, Doherty DG, Donaldson PT, Vaughan RW, Hayllar KM, Welsh KI, et al. Amino acid substitutions at position 38 of the DR beta polypeptide confer susceptibility to and protection from primary sclerosing cholangitis. Hepatology. 1992;16:390–395. doi: 10.1002/hep.1840160217. [DOI] [PubMed] [Google Scholar]
  • 17.Broome U, Grunewald J, Scheynius A, Olerup O, Hultcrantz R. Preferential V beta3 usage by hepatic T lymphocytes in patients with primary sclerosing cholangitis. J Hepatol. 1997;26:527–534. doi: 10.1016/s0168-8278(97)80417-5. [DOI] [PubMed] [Google Scholar]
  • 18.Bowlus CL, Li CS, Karlsen TH, Lie BA, Selmi C. Primary sclerosing cholangitis in genetically diverse populations listed for liver transplantation: unique clinical and human leukocyte antigen associations. Liver Transpl. 2010;16:1324–1330. doi: 10.1002/lt.22161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Zachary AA, Bias WB, Johnson A, Rose SM, Leffell MS. Antigen, allele, and haplotype frequencies report of the ASHI minority antigens workshops: part 1, African-Americans. Hum Immunol. 2001;62:1127–1136. doi: 10.1016/s0198-8859(01)00305-6. [DOI] [PubMed] [Google Scholar]
  • 20.Karlsen TH, Boberg KM, Vatn M, Bergquist A, Hampe J, Schrumpf E, et al. Different HLA class II associations in ulcerative colitis patients with and without primary sclerosing cholangitis. Genes Immun. 2007;8:275–278. doi: 10.1038/sj.gene.6364377. [DOI] [PubMed] [Google Scholar]
  • 21.Eswar N, Webb B, Marti-Renom MA, Madhusudhan MS, Eramian D, Shen MY, et al. Comparative protein structure modeling using Modeller. Curr Protoc Bioinformatics. 2006 doi: 10.1002/0471250953.bi0506s15. Chapter 5:Unit 5.6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997;25:4876–4882. doi: 10.1093/nar/25.24.4876. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Brooks BR, Brooks CL, 3rd, Mackerell AD, Jr, Nilsson L, Petrella RJ, Roux B, et al. CHARMM: the biomolecular simulation program. J Comput Chem. 2009;30:1545–1614. doi: 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Clore GM, Brunger AT, Karplus M, Gronenborn AM. Application of molecular dynamics with interproton distance restraints to three-dimensional protein structure determination. A model study of crambin. J Mol Biol. 1986;191:523–551. doi: 10.1016/0022-2836(86)90146-4. [DOI] [PubMed] [Google Scholar]
  • 25.Laskowski RA, MacArthur MW, Moss DS, Thornton JM. PROCHECK: a program to check the stereochemical quality of protein structures. J Appl Crystallogr. 1993;26:283–291. [Google Scholar]
  • 26.Hooft RW, Vriend G, Sander C, Abola EE. Errors in protein structures. Nature. 1996;381:272. doi: 10.1038/381272a0. [DOI] [PubMed] [Google Scholar]
  • 27.Wiederstein M, Sippl MJ. ProSA-web: interactive web service for the recognition of errors in three-dimensional structures of proteins. Nucleic Acids Res. 2007;35:W407–W410. doi: 10.1093/nar/gkm290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Brown JH, Jardetzky TS, Gorga JC, Stern LJ, Urban RG, Strominger JL, et al. Three-dimensional structure of the human class II histocompatibility antigen HLA-DR1. Nature. 1993;364:33–39. doi: 10.1038/364033a0. [DOI] [PubMed] [Google Scholar]
  • 29.Rammensee H, Bachmann J, Emmerich NP, Bachor OA, Stevanovic S. SYFPEITHI: database for MHC ligands and peptide motifs. Immunogenetics. 1999;50:213–219. doi: 10.1007/s002510050595. [DOI] [PubMed] [Google Scholar]
  • 30.Jones EY, Fugger L, Strominger JL, Siebold C. MHC class II proteins and disease: a structural perspective. Nat Rev Immunol. 2006;6:271–282. doi: 10.1038/nri1805. [DOI] [PubMed] [Google Scholar]
  • 31.Chen YZ, Matsushita S, Nishimura Y. A single residue polymorphism at DR beta 37 affects recognition of peptides by T cells. Hum Immunol. 1997;54:30–39. doi: 10.1016/s0198-8859(97)00013-x. [DOI] [PubMed] [Google Scholar]
  • 32.Kaneko T, Obata F. Allogeneic recognition of HLA-DRB1*0406 by T cells with HLA-DRB1*0403: role of amino acid residue 37 on the beta sheet in T cell recognition. Immunobiology. 1996;195:261–270. doi: 10.1016/s0171-2985(96)80044-1. [DOI] [PubMed] [Google Scholar]
  • 33.Davenport MP, Quinn CL, Chicz RM, Green BN, Willis AC, Lane WS, et al. Naturally processed peptides from two disease-resistance-associated HLA-DR13 alleles show related sequence motifs and the effects of the dimorphism at position 86 of the HLA-DR beta chain. Proc Natl Acad Sci U S A. 1995;92:6567–6571. doi: 10.1073/pnas.92.14.6567. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Verreck FA, van de Poel A, Drijfhout JW, Amons R, Coligan JE, Konig F. Natural peptides isolated from Gly86/Val86-containing variants of HLA-DR1, -DR11, -DR13, and -DR52. Immunogenetics. 1996;43:392–397. doi: 10.1007/BF02199809. [DOI] [PubMed] [Google Scholar]
  • 35.Fainboim L, Canero Velasco MC, Marcos CY, Ciocca M, Roy A, Theiler G, et al. Protracted, but not acute, hepatitis A virus infection is strongly associated with HLA-DRB*1301, a marker for pediatric autoimmune hepatitis. Hepatology. 2001;33:1512–1517. doi: 10.1053/jhep.2001.24562. [DOI] [PubMed] [Google Scholar]
  • 36.Faner R, James E, Huston L, Pujol-Borrel R, Kwok WW, Juan M. Reassessing the role of HLA-DRB3 T-cell responses: evidence for significant expression and complementary antigen presentation. Eur J Immunol. 2010;40:91–102. doi: 10.1002/eji.200939225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Berdoz J, Gorski J, Termijtelen AM, Dayer JM, Irle C, Schendel D, et al. Constitutive and induced expression of the individual HLA-DR beta and alpha chain loci in different cell types. J Immunol. 1987;139:1336–1341. [PubMed] [Google Scholar]
  • 38.Cotner T, Charbonneau H, Mellins E, Pious D. mRNA abundance, rather than differences in subunit assembly, determine differential expression of HLA-DR beta 1 and -DR beta 3 molecules. J Biol Chem. 1989;264:11107–11111. [PubMed] [Google Scholar]
  • 39.Caillier SJ, Briggs F, Cree BA, Baranzini SE, Fernandez-Vina M, Ramsay PP, et al. Uncoupling the roles of HLA-DRB1 and HLA-DRB5 genes in multiple sclerosis. J Immunol. 2008;181:5473–5480. doi: 10.4049/jimmunol.181.8.5473. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Ahlen MT, Husebekk A, Killie MK, Skogen B, Stuge TB. T-cell responses associated with neonatal alloimmune thrombocytopenia: isolation of HPA-1a-specific, HLA-DRB3*0101-restricted CD4+ T cells. Blood. 2009;113:3838–3844. doi: 10.1182/blood-2008-09-178475. [DOI] [PubMed] [Google Scholar]
  • 41.Engelmann R, Eggert M, Neeck G, Mueller-Hilke B. The impact of HLA-DRB alleles on the subclass titres of antibodies against citrullinated peptides. Rheumatology (Oxford) 2010;49:1862–1866. doi: 10.1093/rheumatology/keq179. [DOI] [PubMed] [Google Scholar]
  • 42.Silverberg MS, Cho JH, Rioux JD, McGovern DP, Wu J, Annese V, et al. Ulcerative colitis-risk loci on chromosomes 1p36 and 12q15 found by genome-wide association study. Nat Genet. 2009;41:216–220. doi: 10.1038/ng.275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Candore G, Lio D, Colonna Romano G, Caruso C. Pathogenesis of autoimmune diseases associated with 8.1 ancestral haplotype: effect of multiple gene interactions. Autoimmun Rev. 2002;1:29–35. doi: 10.1016/s1568-9972(01)00004-0. [DOI] [PubMed] [Google Scholar]

Articles from Hepatology (Baltimore, Md.) are provided here courtesy of Wiley

RESOURCES