Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2012 Nov 11.
Published in final edited form as: Tissue Antigens. 2008 Oct 24;73(1):17–32. doi: 10.1111/j.1399-0039.2008.01151.x

HLA-A, -B, -C, -DRB1 Allele and Haplotype Frequencies Distinguish Eastern European Americans from the General European American Population

Steven J Mack 1, Bin Tu 2, Ana Lazaro 2, Ruyan Yang 2, Alex K Lancaster 3,4, Kai Cao 2, Jennifer Ng 2, Carolyn Katovich Hurley 2
PMCID: PMC3495166  NIHMSID: NIHMS411144  PMID: 19000140

Abstract

Sequence based typing was used to identify HLA-A,B,C,DRB1 alleles from 558 consecutively recruited U.S. volunteers with Eastern European ancestry for an unrelated hematopoietic stem cell registry. Four of the 31 HLA-A alleles, 29 -C alleles, 59 -B alleles, and 42 -DRB1 alleles identified (A*0325, B*440204, Cw*0332, and *0732N) are novel. The HLA-A*02010101g allele was observed at a frequency of 0.28. Two-, three- and four-locus haplotypes were estimated using the expectation maximization algorithm. The highest-frequency extended haplotypes (A*010101g-Cw*070101g-B*0801g-DRB1*0301 and A*03010101g-Cw*0702-B*0702-DRB1*1501) were observed at frequencies of 0.04 and 0.03, respectively. Linkage disequilibrium values (D’ij) of the constituent 2-locus haplotypes were highly significant for both extended haplotypes (p-values were less than 8 × 10−10), but were consistently higher for the more frequent haplotype. Balancing selection was inferred to be acting on all four loci, with the strongest evidence of balancing selection observed for the HLA-C locus. Comparisons of the A-C-B haplotype and DRB1 frequencies in this population to those for African, European and western Asian populations revealed high degrees of identity with Czech, Polish, and Slovenian populations and significant differences from the general European American population.

Keywords: HLA, population study

Introduction

The United Nations Statistics Division (UNSD) defines Eastern Europe as including Belarus, Bulgaria, the Czech Republic, Hungary, Poland, the Republic of Moldova, Romania, the Russian Federation, Slovakia, and Ukraine. However, common understandings of the region also include Albania, Bosnia and Herzegovina, Croatia, the Republic of Macedonia, Montenegro, Serbia, and Slovenia (these last six nations having once constituted Yugoslavia), although the UNSD defines these nations as belonging to Southern Europe. In the 2000 US census, 61% of Americans (170 million individuals) identified themselves as being of European ancestry, and of these, 17.4 million Americans identified themselves as being of Czech, Czechoslovakian, Hungarian, Polish, Romanian, Russian, Slovakian, or Ukranian ancestry, and 1.13 million Americans identified themselves as being of Albanian, Croatian, Serbian, Slovenian, or Yugoslavian ancestry. In total, Americans of Eastern European ancestry constitute 6.6% of the US population, and approximately 11% of the European American population (1).

The HLA loci are the most polymorphic loci in the human genome, with several hundreds of allelic products encoded at multiple loci (e.g., more than 1029 alleles encode over 870 different HLA B molecules). The HLA genes encode cell surface proteins that present foreign and self-derived peptides to T-lymphocyte antigen receptors for recognition. Sequence polymorphisms in the antigen binding domains of the HLA molecules determine the repertoire of peptides that can be presented and in turn influence an individual’s immune response (2). HLA alleles and multi-locus haplotypes are differentially distributed in populations around the globe, and studies of allele and haplotype frequency diversity have been used to make inferences about population relationships and history, and the selective forces operating to maintain high levels of HLA diversity.

Here, we present HLA-A, B, C, and DRB1 alleles identified via DNA sequencing in a population of Eastern European Americans, and use these data to estimate multi-locus haplotypes and the degree of linkage disequilibrium between loci, and to examine selective forces influencing this population and the degree of differentiation between this population and other U.S. populations originating from other regions of Europe, as well as European populations.

Materials and Methods

Sample population

The study population included 558 individuals from the United States indicating Eastern European ancestry who were consecutively recruited as volunteer donors for a bone marrow donor registry from January 2003 through February 2005. Because of the recruitment setting, individuals are unlikely to be related and are likely to originate from different areas of the United States. All are self identified as being of Eastern European ancestry. However, because these individuals are part of the larger US population, genetic contributions from non-European populations cannot be excluded simply on the basis of self-identification.

Identification of known HLA alleles

Genomic DNA was prepared using the QIAamp 96 DNA blood kit (Qiagen Valencia, CA). Each individual was initially typed at intermediate resolution for HLA-A,-B,-C,-DRB1 by sequence specific probe based hybridization using the One Lambda LABType® SSO Kit (One Lambda, Canoga Park, California) following manufacturer’s protocols. To identify the HLA-A,-B,-C alleles carried by each individual, PCR primers were used to amplify each locus as previously described (3). Applied Biosystems Big Dye terminator chemistry and sequencing primers were used to obtain the sequences of both strands of exons 2 and 3. DRB1 alleles were amplified and sequenced using the AlleleSEQR class II kit (Abbott Molecular Inc, Des Plaines, IL). Allele group specific amplification primers from the kit were used to isolate DRB1 alleles for sequencing. Additional in-house PCR and sequencing primers were added when needed to obtain resolution. Reactions products were identified with Applied Biosystems 3730xl DNA analyzers (PE Applied Biosystems, Foster City, CA) and sequence interpretation used Assign software (Conexio Genomics, Applecross, Western Australia) and was based on ImMunoGeneTics (IMGT)/HLA database release 2.7 (4).

Alleles identical in exons 2 and 3 (class I) or exon 2 (DRB1) were not resolved; expressed alleles in this category share the amino acid sequence of their antigen binding grooves. Unresolved alleles of this type that differ in four digit names ie, encode allelic products that vary in amino acid sequence outside of the antigen binding site, are indicated by the use of a “g” following the name of the lowest numbered allele in the group. For example, A*02010101g includes alleles A*02010101, A*0209, A*0243N, A*0266 as well as synonymous alleles A*02010102L and A*020108. A listing of these unresolved alleles can be found at http://www.ebi.ac.uk/imgt/hla/ambig.html (4) under database release 2.7. For those class I samples yielding alternative allele combinations, either allele specific sequencing primers or allele specific PCR amplification were used to link polymorphisms and to identify the specific allele combination (4). [In-house primer sequences used for all loci are available at www.dodmarrow.org.]

Characterization of new HLA alleles

Potentially new alleles were isolated and characterized as previously described (3,5). DNA sequencing of PCR products included primers annealing to both DNA strands for at least two independent PCR reactions with an ABI 3730 Automated DNA Sequencer (Applied Biosystems, Foster City, CA). Allele designations were assigned by the WHO Nomenclature Committee for Factors of the HLA System (6).

Statistical analysis

PyPop (Python for Population genetics, version 0.6.0 http://www.pypop.org) was used to carry out Hardy-Weinberg testing, Ewens-Watterson homozygosity analyses, and haplotype and linkage disequilibrium (LD) estimates (7,8). Allele frequencies were obtained by direct counting. Allele frequencies at each HLA locus were evaluated for deviations from Hardy-Weinberg equilibrium proportions using the exact test of Guo and Thompson (9), and by chi-square testing when expected values were ≥5. Chi-square tests were investigated for overall common genotypes (those expected to be seen in at least 5 instances “lumped” genotypes (the set of all genotypes individually expected to be seen in fewer than 5 instances each), all heterozygotes, all homozygotes, as well as for individual common and heterozygote genotypes. These Hardy-Weinberg tests measure the degree to which observed genotype frequencies differ from those expected based on the allele frequencies for that population, assuming that the population is suitably large and experiences random mating (10).

The Ewens-Watterson test of homozygosity was applied to each locus (11) (12), using Slatkin’s Monte-Carlo implementation of the exact test (13,14). In this test, the observed homozygosity (F, the sum of the squares of the allele frequencies) is compared with the mean value of F expected for a population of the same size with the same number of alleles, undergoing neutral evolution. The normalized deviate of F (Fnd, the difference between the observed and expected values of F divided by the square root of the variance of the expected F was also calculated for each locus (15).

Two-, three-, and four-locus haplotype frequencies were estimated using the iterative expectation-maximization (EM) algorithm (16,17). Linkage disequilibrium (LD) between alleles at each pair of loci, and two overall (locus-pair-level) measures of linkage disequilibrium, normalized to values between zero and one, were calculated. The normalized allele-pair-level LD measure, D'ij, is the disequilibrium coefficient (D) divided by the upper and lower bounds of D for the particular alleles at each locus (as described in (18,19,20), and ranges from +1 to -1. A D'ij value of zero indicates linkage equilibrium, while a value of +1 indicates the complete association of a given pair of alleles in a single haplotype, and for the data reported here, a value of -1 indicates the complete absence of a haplotype comprised by those alleles. [Note: The complete absence of a particular haplotype can only be inferred from a D’ij value of -1 when none of the reported alleles has a frequency greater than 0.5.] The first of the locus-pair-level measures, D' (18), uses the products of the allele frequencies at each locus to weight the LD contribution of specific allele pairs; while the second, Wn (21), calculates a normalization of the chi-square statistic for deviations between observed and expected haplotype frequencies. The significance of the overall LD between any two loci was tested using the permutation distribution of the likelihood ratio test (17).

Comparison of populations was limited by the availability of allele-level data for HLA A:C:B haplotypes. Arlequin v3.11 (22) was used to compare the HLA-A:C:B haplotypes and DRB1 genotypes in this population to those for Sub-Saharan African populations from Kenya (10, 23), Mali (23), Rwanda (10), Senegal (10), South Africa (10), Uganda (23), Zambia (23), and Zimbabwe (10); North African populations from Morocco (24,25,26,27) and Algiers (24); European populations from Bulgaria (10, 28), Croatia (29,30), the Czech Republic (10,31), Finland (10), Georgia (10,32), Germany (33), Italy (34), Macedonia (35,36), Northern Ireland (10), Norway (37,38), Poland (39,40), Portugal (41), Russia (42,43), Slovenia(10,44), Spain (4548), and Sweden (49); Asian populations from India (50), Israel (51), and Turkey (52); two African American populations (53,3), and four European American populations (5356), by calculating pairwise Fst values (and associated p-values) for this entire set of populations. Because the frequencies of the available A:C:B haplotypes for one of the Polish populations (39) summed to 0.78, a "blank" A:C:B haplotype with a frequency of 0.22 was considered for this population, and pairwise Fst values for this population were interpreted differently than for the rest (see below). In accounting for small differences in population sample sizes, the Fst calculation (57) may result in small negative Fst values; these were treated as being equivalent to zero and are reported as such. Pairwise standardized Fst values (F'st values) were generated using Hedrick's method of dividing each value by the maximum Fst value (58). Because all populations had not been genotyped at the same loci or for the same level of resolution, three comparisons were performed and the analysis focused on the amino acid sequences encoding the polymorphic antigen binding groove. A given pair of population datasets was determined to differ significantly if the appropriate p-value associated was less than 0.05.

Results

Allele and genotype frequencies

Allele frequencies for the HLA-A, C, B and DRB1 loci are shown in Table 1. Five-hundred fifty unique HLA-A, C, B, DRB1 phenotypes were observed among the 558 individuals examined. No overall deviations from expected Hardy-Weinberg equilibrium proportions (HWEP) were observed at the HLA-A, -C, or -B loci (p-values 0.4881, 0.2028, and 0.5792, respectively). However a minor but significant deviation from expected HWEP was observed at the DRB1 locus (p-value = 0.0187). This deviation primarily resulted from an undercount of DRB1*0701:*1601 genotypes (0 observed, 5.23 expected, p-value = 0.0222). Given the lack of similar deviations at the class I loci, it seems unlikely that the unexpected genotypic ratios at the DRB1 locus stem from sampling error.

Table 1.

HLA allelea frequencies in 558 random Americans with Eastern European ancestryb

HLA-A
Allele
Frequency n HLA-C allele Frequency n HLA-B Allele Frequency n DRB1
Alleles
Frequency n
010101g 0.14247 159 0102 0.03674 41 0702 0.10394 116 0101 0.09319 104
02010101g 0.27509 307 0202/0210 0.05824 65d 0705g 0.00538 6 0102 0.01703 19
0205 0.01613 18 0302 0.00358 4 0714 0.00090 1 0103 0.00896 10
0206 0.00358 4 0303 0.05466 61 0801g 0.09050 101 0301 0.10215 114
0217 0.00090 1 030401/030403 0.06093 68 1301 0.00090 1 0302 0.00090 1
03010101g 0.12903 144 030402 0.00090 1 1302 0.03047 34 0401 0.06810 76
0302 0.00627 7 0332 (V) 0.00090 1 1401 0.00358 4 0402 0.01344 15
0325 (V)c 0.00090 1 04010101g 0.12993 145 1402 0.02957 33 0403 0.00896 10
1101 0.06362 71 0403 0.00269 3 15010101g 0.05556 62 0404 0.04032 45
2301g 0.01792 20 0501g 0.05556 62 1503 0.00358 4 0405 0.00627 7
24020101g 0.10215 114 0602 0.09767 109 1506 0.00090 1 0406 0.00090 1
240301g 0.00090 1 070101g 0.161239 180 1510 0.00179 2 0407 0.00986 11
2501 0.02688 30 0702 0.11559 129 1517 0.00717 8 0408 0.00179 2
2601 0.04122 46 070401g 0.01971 22 1518 0.00627 7 0410 0.00090 1
29010101g 0.00448 5 0713 0.00090 1 1534 0.00090 1 0426 0.00090 1
2902 0.02151 24 0732N (V) 0.00090 1 180101g 0.05466 61 0701 0.12455 139
3001 0.01075 12 0801 0.00179 2 1803 0.00179 2 0801 0.02688 30
3002 0.00538 6 0802 0.03136 35 2702 0.00986 11 0802 0.00090 1
3004 0.00269 3 0803 0.00090 1 270502g 0.03853 43 0803 0.00269 3
3101 0.02419 27 1202 0.00806 9 2707 0.00090 1 0804 0.00090 1
3201 0.02867 32 1203 0.06989 78 350101g 0.06541 73 0901 0.01165 13
3208 0.00090 1 1402 0.01613 18 3502 0.02599 29 1001 0.01075 12
3301 0.01344 15 1502 0.01882 21 3503 0.02330 26 1101 0.05376 60
3303 0.00358 4 1505 0.00538 6 3508 0.00806 9 1102 0.00269 3
3402 0.00090 1 1601 0.02778 31 3512 0.00090 1 1103 0.00448 5
3601 0.00090 1 1602 0.00448 5 3701 0.01523 17 1104 0.05824 65
6601 0.00627 7 1604 0.00538 6 3801 0.02599 29 120101g 0.01344 15
680101 0.01254 14 1701g 0.00986 11 3901 0.00986 11 1202 0.00179 2
680102g 0.02867 32 3906 0.00538 6 1301 0.06631 74
6802 0.00717 8 Total 1.00000 1116 400101g 0.04391 49 1302 0.04480 50
7401g 0.00090 1 400201g 0.01971 22 1303 0.01882 21
4006 0.00179 2 1305 0.00448 5
Total 1.00000 1116 4010 0.00090 1 1401 0.01434 16
4101 0.00538 6 1403 0.00090 1
4102 0.00448 5 1404 0.00179 2
4201 0.00090 1 1406 0.00179 2
44020101g 0.06362 71 1501 0.11022 123
4403 0.04122 46 1502 0.00896 10
4405 0.00627 7 1503 0.00179 2
440204 (V) 0.00090 1 1601 0.03763 42
4501g 0.00538 6 1602 0.00179 2
4601 0.00090 1
4701 0.00448 5 Total 1.00000 1116
4801g 0.00269 3
4901 0.02957 33
5001 0.01254 14
5002 0.00090 1
510101g 0.05287 59
5108 0.00179 2
5201 0.00806 9
5301 0.00269 3
5501 0.01613 18
5601 0.00717 8
5701 0.02778 31
5703 0.00090 1
5801 0.00806 9
5802 0.00090 1
7801 0.00090 1
Total 1.00000 1116
a

Alleles which were identical in exons 2 and 3 were not distinguished. These alleles are indicated by the addition of the a “g” to the name of the lowest numbered allele in that group.

b

Alleles were assumed to be homozygous if the typing was consistent with a single allele in both intermediate and high resolution testing.

c

(V) indicates a novel allele defined in this study.

d

Includes one cell with a Cw*0210 allele.

Allelic diversity was high at each locus; thirty-one HLA-A alleles, 29 HLA-C alleles, 59 HLA-B alleles, and 42 DRB1 alleles were observed in this population (with an allele defined as either a unique class I exon 2-3 sequence or a unique DRB1 exon 2 sequence). Novel exon 2-3 sequences were detected at each class I locus (see below). Of the detected four digit alleles that had been previously identified, all but two have been identified by Cano et al (59) as being “common or well-documented” (CWD) alleles. The two non-CWD alleles (A*3208 and Cw*0713) were each observed in single individuals.

Four HLA-A alleles (A*02010101g, *010101g, *03010101g, and *24020101g) were observed at frequencies greater than 0.1, and represented 65% of the allelic diversity observed at this locus. Most notably, the A*02010101g allele was observed at a very high frequency of 0.275. Three HLA-C alleles (Cw*070101g, *04010101g, and *0702) were observed with frequencies above 0.1, and represented 41% of the HLA-C diversity in this population. HLA-B*0702 is the only HLA-B allele observed with a frequency greater than 0.1 in this population (allele frequency = 0.104). Three DRB1 alleles (DRB1*0701, *1501, and *0301) with frequencies greater than 0.1 represented 34% of the allelic diversity observed at this locus.

Ewens-Watterson Homozygosity Test

The normalized deviate of the Ewens-Watterson homozygosity statstic (Fnd) was calculated based on the observed allele frequencies at each locus, and was used to infer the action of balancing or directional selection at each locus. The Ewens-Watterson model calculates a mean homozygosity value for a population of a given size, in which a given number of alleles has been detected, and that is experiencing neutral evolution. The observed homozygosity value for populations evolving under neutral conditions will be similar to the expected homozygosity value, and the resulting Fnd value will be close to 0. Fnd values significantly greater than 0 (resulting from frequency distributions that are significantly “skewed” in favor of one or a few high-frequency alleles) are consistent with either the action of directional selection on the locus in question or an extreme demographic effect (e.g., a population bottleneck), while Fnd values significantly lower than 0 are consistent with the action of balancing selection, resulting from allele frequency distributions that are significantly more “even”.

The results of the Ewens-Watterson Homozygosity test are shown in Table 2. Negative Fnd values were observed for all four loci, and a significantly low value (Fnd = −1.1779, p-value = 0.0308) was observed for the HLA-C locus. In addition, the application of a sign test to these Fnd values indicates an overall significant trend away from the null hypothesis of neutral evolution (p-value = 0.0455), suggesting that the allele frequency distributions at all four loci have been shaped by balancing selection.

Table 2.

Ewens-Watterson homozygosity test of neutrality

Locus Observed F Expected F Normalized deviate of F (Fnd) p
HLA-A 0.1335 0.1449 −0.2169 0.5090
HLA-B 0.0484 0.0704 −1.0892 0.0644
HLA-C 0.0879 0.1551 −1.1779 0.0308*
HLA-DRB1 0.0698 0.1045 −1.0000 0.0843
*

Significant at the 5% level.

New alleles

Three individuals carried novel alleles at the HLA-A, HLA-B, or HLA-C loci (Table 3), a fourth carried a newly reported non-expressed HLA-C allele. Novel allele, A*0325, has already been reported by us (60). It differs from A*03010101 by a single novel nucleotide change at polymorphic codon 36 altering TTC (phe) to TGC (cys). B*440204 differs from B*44020101 at polymorphic codon 138 altering ACC to ACT, a synonymous substitution which has not been previously observed. Cw*0332 differs from Cw*030401 at codon 90, changing GCC (ala) to GAC (asp), a common alternative in HLA-C alleles. Cw*0732N was previously reported in a Spanish individual (Dr. Jose L. Vicario, unpublished, IMGT/HLA database). The allele has an insertion (CGCAGATACCT) between codons 163.2 and 165.1 in exon 3 and a loss of 4 nucleotides previously found in that region. Based on shared alleles of the Spanish cell (CTM8689384) with cell NT00597, the non-expressed allele is most likely associated with A*03010101g, B*0702, DRB1*1501.

Table 3.

Cells and their novel HLA alleles

Cell HLA-A HLA-B HLA-C HLA-DR GenBank Accession No.a
NT00682 A*03010101, *0325b,c B*0702, *4701 Cw*0602, 0702 DRB1*0701, *1301 DQ987874
NT00681 A*02010101g, *24020101g B*440204, *130201 Cw*050101g, *0602 DRB1*0101, *0701 DQ987875
NT00678 A*02010101g, 03010101g B*15010101g, *510101g Cw*0332, 150201 DRB1*0401, *1302 DQ984199
NT00597 A*03010101g, *1101 B*070201, *400201g Cw*0732N, 020202 DRB1*1301, *1501 DQ372911-DQ372913
a

Accession number of novel allele

b

Allele previously reported (33).

c

Novel allele is in bold type. The name have been officially assigned by the WHO Nomenclature Committee.

Haplotypes

Two-locus (A:B, C:B, and B:DRB1), three-locus (A:B:DRB1) and four-locus haplotypes were estimated for this population. Two-locus haplotypes are presented in Table 4; three- and four-locus haplotypes in Table 5. Because the outcome of the EM algorithm is unreliable for rare haplotypes (n = 1 or 2), only those haplotypes with at least three copies are presented in the tables. In general, the five most frequent haplotypes were also common in an earlier study of European Americans (61).

Table 4.

HLA two-locus haplotype frequencies and linkage disequilibrium values

Locus Pair Haplotype Frequency Observed (n) D'ij p-value
A-B 010101g:0801g 0.06179 69.0 0.63008 1.16582E-59
03010101g:0702 0.05082 56.7 0.4132 2.60221E-34
02010101g:15010101g 0.03365 37.5 0.45598 2.00543E-09
02010101g:44020101g 0.03301 36.8 0.34645 1.17214E-06
02010101g:400101g 0.02857 31.9 0.51828 1.71095E-09
02010101g:180101g 0.02048 22.9 0.13735 n.s.
02010101g:0702 0.02033 22.7 -0.28894 0.042847471
03010101g:350101g 0.02005 22.4 0.20372 2.89971E-06
02010101g:1302 0.01667 18.6 0.37525 0.000309397
02010101g:510101g 0.01662 18.5 0.05423 n.s.
02010101g:270502g 0.01616 18.0 0.19896 0.030776795
2902:4403 0.01612 18.0 0.73862 1.16216E-69
1101:350101g 0.01550 17.3 0.19075 3.42285E-10
2501:180101g 0.01462 16.3 0.51749 6.55176E-33
02010101g:5701 0.01455 16.2 0.34321 0.001654855
02010101g:0801g 0.01439 16.1 -0.42218 0.00613189
3301:1402 0.01163 13.0 0.86095 2.00292E-82
24020101g:0702 0.01138 12.7 0.00831 n.s.
010101g:5701 0.01065 11.9 0.28102 9.90095E-05
02010101g:350101g 0.01027 11.5 -0.42929 0.01942872
24020101g:3502 0.01010 11.3 0.31917 2.42787E-07
010101g:3502 0.00989 11.0 0.27757 0.000202656
2601:3801 0.00927 10.3 0.32898 4.81405E-18
2301g:4403 0.00896 10.0 0.4785 2.12777E-25
680102g:44020101g 0.00856 9.6 0.25154 2.39008E-08
24020101g:350101g 0.00823 9.2 0.02633 n.s.
24020101g:180101g 0.00807 9.0 0.05064 n.s.
02010101g:4901 0.00751 8.4 -0.07727 n.s.
24020101g:15010101g 0.00719 8.0 0.03037 n.s.
010101g:3701 0.00715 8.0 0.38117 0.000102148
02010101g:3503 0.00712 7.9 0.04206 n.s.
010101g:4901 0.00708 7.9 0.11296 n.s.
03010101g:510101g 0.00695 7.8 0.00278 n.s.
3101:400101g 0.00675 7.5 0.24586 1.58941E-09
03010101g:3503 0.00663 7.4 0.1787 0.016599135
03010101g:1402 0.00625 7.0 0.09452 n.s.
24020101g:510101g 0.00623 7.0 0.01745 n.s.
1101:510101g 0.00620 6.9 0.05732 n.s.
2601:510101g 0.00567 6.3 0.08942 0.0087538
24020101g:4403 0.00538 6.0 0.0315 n.s.
6802:1402 0.00538 6.0 0.74238 1.47839E-33
2301g:4901 0.00538 6.0 0.27867 5.83564E-13
03010101g:3901 0.00538 6.0 0.47811 3.46862E-05
0205:5001 0.00538 6.0 0.4192 6.39536E-35
010101g:4403 0.00535 6.0 -0.08977 n.s.
1101:44020101g 0.00531 5.9 0.02253 n.s.
24020101g:3801 0.00488 5.4 0.09519 n.s.
1101:0702 0.00487 5.4 -0.26364 n.s.
3201:0702 0.00481 5.4 0.07119 n.s.
02010101g:2702 0.00471 5.3 0.28001 n.s.
1101:3503 0.00458 5.1 0.14182 0.00499784
010101g:1302 0.00456 5.1 0.00835 n.s.
1101:15010101g 0.00453 5.1 0.01914 n.s.
02010101g:5001 0.00428 4.8 0.09137 n.s.
24020101g:400201g 0.00419 4.7 0.12305 n.s.
3201:400201g 0.00419 4.7 0.18926 1.80609E-07
02010101g:5201 0.00406 4.5 0.31511 n.s.
02010101g:3801 0.00376 4.2 -0.47459 n.s.
680102g:510101g 0.00373 4.2 0.08167 0.047248712
24020101g:270502g 0.00372 4.2 -0.0545 n.s.
2501:44020101g 0.00369 4.1 0.07973 n.s.
03010101g:15010101g 0.00365 4.1 -0.49044 n.s.
0205:4901 0.00358 4.0 0.19852 1.14771E-06
3001:1302 0.00358 4.0 0.31238 8.37536E-10
010101g:1517 0.00358 4.0 0.41693 0.00368987
24020101g:3508 0.00358 4.0 0.38124 0.000662789
2601:4901 0.00358 4.0 0.08343 0.018950042
03010101g:3701 0.00358 4.0 0.122 n.s.
010101g:0702 0.00356 4.0 -0.75992 0.000424585
03010101g:3502 0.00348 3.9 0.0056 n.s.
680102g:270502g 0.00345 3.8 0.08504 0.014760515
3101:510101g 0.00345 3.8 0.0947 0.034990683
24020101g:4405 0.00343 3.8 0.49569 9.60466E-05
1101:5501 0.00342 3.8 0.15822 0.009420807
2601:44020101g 0.00334 3.7 0.01949 n.s.
02010101g:5501 0.00333 3.7 -0.25014 n.s.
1101:400101g 0.00330 3.7 0.01241 n.s.
24020101g:3503 0.00317 3.5 0.03779 n.s.
1101:270502g 0.00308 3.4 0.0175 n.s.
680102g:350101g 0.00308 3.4 0.04484 n.s.
010101g:44020101g 0.00307 3.4 -0.65677 0.020694838
24020101g:44020101g 0.00306 3.4 -0.52313 n.s.
6801:350101g 0.00298 3.3 0.18454 0.008625701
2601:270502g 0.00297 3.3 0.03742 n.s.
03010101g:3801 0.00294 3.3 -0.12207 n.s.
02010101g:400201g 0.00293 3.3 -0.45991 n.s.
03010101g:0801g 0.00287 3.2 -0.7543 0.002217044
3201:15010101g 0.00280 3.1 0.04475 n.s.
010101g:270502g 0.00274 3.1 -0.50177 n.s.
0302:4403 0.00269 3.0 0.40401 2.32238E-07
2601:5601 0.00269 3.0 0.34813 1.87831E-06
3002:180101g 0.00269 3.0 0.47109 1.49544E-06
02010101g:0705g 0.00269 3.0 0.31026 n.s.
6601:4102 0.00269 3.0 0.59748 9.84576E-64
010101g:5801 0.00267 3.0 0.21953 n.s.
010101g:180101g 0.00267 3.0 -0.65757 0.031313371
Total 0.79356 885.6
C-B
0702:0702 0.10215 114 0.98051 4.3092E-209
070101g:0801g 0.08602 96 0.94104 5.7888E-114
04010101g:350101g 0.05732 64 0.85786 1.04758E-85
0501g:44020101g 0.04301 48 0.75908 6.1171E-125
0304:400101g 0.04211 47 0.95632 4.1105E-159
0602:1302 0.03047 34 1 1.96738E-72
0303:15010101g 0.03043 34 0.53069 3.72594E-69
0802:1402 0.02957 33 1 2.1101E-230
070101g:4901 0.02867 32 0.96379 7.67956E-38
1203:3801 0.02599 29 1 3.66438E-88
1203:180101g 0.02599 29 0.43599 2.2296E-37
0602:5701 0.02509 28 0.89263 5.58313E-53
070101g:180101g 0.0233 26 0.31662 5.91871E-09
1601:4403 0.02324 25.9 0.82971 4.7159E-113
0102:270502g 0.0215 24 0.56869 1.10709E-76
04010101g:3502 0.02055 22.9 0.75977 7.50689E-27
0202:400201g 0.01699 19 0.8535 1.94262E-59
0202:270502g 0.01613 18 0.38265 7.82735E-25
04010101g:4403 0.01592 17.8 0.29452 1.29805E-07
0303:5501 0.01523 17 0.94109 6.68502E-63
1402:510101g 0.01521 17 0.94013 5.8022E-65
04010101g:3503 0.01521 17 0.60085 1.03816E-15
0602:3701 0.01434 16 0.93481 3.65869E-32
070401g:44020101g 0.01434 16 0.70893 1.53908E-38
0304:15010101g 0.01432 16 0.20965 2.58447E-11
1502:510101g 0.01341 15 0.69646 2.39111E-42
0602:5001 0.01075 12 0.84168 5.81097E-22
0202:510101g 0.00899 10 0.11876 0.000163772
0202:2702 0.00806 9 0.80694 2.91884E-27
1202:5201 0.00806 9 1 1.1E-244
070101g:1517 0.00717 8 1 8.327E-11
04010101g:15010101g 0.00632 7.1 -0.12461 n.s.
04010101g:3508 0.00627 7 0.74459 6.47678E-09
0202:4405 0.00627 7 1 1.37115E-26
0102:5601 0.00627 7 0.87023 1.13364E-36
1203:3503 0.00626 7 0.21361 5.80702E-05
1203:3901 0.00538 6 0.5113 5.07126E-10
1505:0705g 0.00538 6 1 1.1E-244
0501g:180101g 0.00538 6 0.04532 n.s.
070401g:1518 0.00448 5 0.70854 3.89536E-40
0602:3502 0.00448 5 0.08283 n.s.
0602:4701 0.00448 5 1 9.63885E-12
1701g:4102 0.00448 5 1 9.8096E-112
070101g:5801 0.00448 5 0.47065 0.00118046
0102:510101g 0.00448 5 0.07294 0.043991193
0702:3901 0.00448 5 0.38326 0.000410173
0602:4501g 0.00448 5 0.81529 1.15469E-09
0303:350101g 0.00361 4 0.00063 n.s.
0702:3906 0.00358 4 0.6231 2.30398E-05
1701g:4101 0.00358 4 0.66335 6.08874E-60
1604:44020101g 0.00358 4 0.64436 9.49095E-10
0302:5801 0.00358 4 1 2.1555E-109
070101g:5701 0.00269 3 -0.39597 n.s.
0102:15010101g 0.00269 3 0.01865 n.s.
0501g:510101g 0.00269 3 -0.08475 n.s.
Total 0.91891 1025.7
B-DRB1
0801g:0301 0.0669 74.7 0.70951 7.2272E-109
0702:1501 0.05501 61.4 0.4709 2.47214E-52
4403:0701 0.03101 34.6 0.71718 1.30617E-39
350101g:0101 0.02833 31.6 0.37478 5.02831E-25
3502:1104 0.0222 24.8 0.84511 9.34717E-77
1302:0701 0.02208 24.6 0.68553 5.15892E-27
15010101g:0401 0.01759 19.6 0.26668 1.31806E-15
44020101g:0401 0.01604 17.9 0.2013 1.23338E-10
5701:0701 0.01515 16.9 0.48088 6.06376E-13
180101g:1501 0.01413 15.8 0.16675 0.000141231
270502g:0101 0.01379 15.4 0.29182 1.14487E-09
1402:0102 0.01344 15 0.78306 1.38818E-86
180101g:1104 0.01329 14.8 0.19628 2.29935E-10
400101g:0404 0.01253 14 0.27919 4.60633E-19
510101g:1301 0.01215 13.6 0.1752 2.11494E-07
15010101g:1301 0.01042 11.6 0.12992 7.81136E-05
510101g:0101 0.0096 10.7 0.09754 0.016327566
400101g:1302 0.00887 9.9 0.16452 5.38013E-08
44020101g:0101 0.00813 9.1 0.04015 n.s.
5001:0701 0.00806 9 0.59205 3.41744E-09
44020101g:1601 0.0079 8.8 0.15695 6.12482E-05
0702:1301 0.00722 8.1 0.00552 n.s.
44020101g:1104 0.00722 8.1 0.06528 0.036039816
3801:0402 0.00717 8 0.52088 1.69081E-35
5201:1502 0.00717 8 0.88788 4.6651E-174
400201g:1101 0.00681 7.6 0.30846 8.76433E-10
0702:0401 0.00673 7.5 -0.04866 n.s.
0702:0701 0.00668 7.5 -0.48427 0.037683092
44020101g:0701 0.00648 7.2 -0.17063 n.s.
350101g:1501 0.0063 7 -0.12668 n.s.
0702:0301 0.00628 7 -0.40878 n.s.
2702:1601 0.00627 7 0.62214 1.00044E-25
4901:1302 0.00627 7 0.17517 2.39947E-06
270502g:0404 0.00575 6.4 0.11345 0.000214345
0801g:1501 0.00563 6.3 -0.43566 n.s.
1517:1302 0.00538 6 0.73827 3.79482E-22
400101g:0801 0.00536 6 0.1625 2.55113E-05
0801g:0401 0.00528 5.9 -0.14319 n.s.
510101g:1101 0.00519 5.8 0.04691 n.s.
0702:0101 0.00508 5.7 -0.4758 n.s.
4901:1501 0.00495 5.5 0.06438 n.s.
510101g:0404 0.00487 5.4 0.07161 0.037946254
400101g:0101 0.00475 5.3 0.01641 n.s.
15010101g:0101 0.00472 5.3 -0.08872 n.s.
5501:1301 0.00469 5.2 0.2407 0.00011185
1402:0701 0.00448 5 0.0308 n.s.
350101g:0103 0.00448 5 0.465 2.35967E-08
4901:1101 0.00445 5 0.10238 0.012260941
44020101g:120101g 0.00444 5 0.31092 6.02564E-06
1402:1303 0.00438 4.9 0.20954 2.80067E-08
180101g:1101 0.00438 4.9 0.02832 n.s.
270502g:0801 0.0041 4.6 0.11837 0.001027175
3503:1101 0.00408 4.6 0.12841 0.005444055
15010101g:1501 0.004 4.5 -0.34666 n.s.
3801:1301 0.00395 4.4 0.09178 n.s.
400201g:1301 0.00394 4.4 0.14299 0.011026634
270502g:0301 0.00388 4.3 -0.01371 n.s.
4403:1101 0.00385 4.3 0.04179 n.s.
44020101g:1301 0.00375 4.2 -0.09787 n.s.
1302:1501 0.0037 4.1 0.01278 n.s.
180101g:0701 0.00364 4.1 -0.46466 n.s.
0702:0801 0.00362 4 0.03432 n.s.
15010101g:0701 0.00362 4 -0.47724 n.s.
15010101g:1101 0.00358 4 0.01176 n.s.
5701:1301 0.00358 4 0.06718 n.s.
5601:0101 0.00358 4 0.44862 7.11435E-05
4102:1303 0.00358 4 0.79616 5.51865E-38
3701:1101 0.00358 4 0.19159 0.000839126
44020101g:1302 0.00358 4 0.0183 n.s.
3906:0801 0.00357 4 0.65528 3.54525E-22
0801g:0701 0.00357 4 -0.68328 0.006609304
3901:1601 0.00357 4 0.33706 1.3379E-08
350101g:1301 0.00356 4 -0.17882 n.s.
1402:1302 0.00354 4 0.07846 0.034630944
3701:1001 0.00352 3.9 0.3173 6.36729E-19
180101g:1601 0.00352 3.9 0.04117 n.s.
3503:1401 0.00349 3.9 0.22521 4.22484E-09
180101g:0401 0.00347 3.9 -0.06794 n.s.
180101g:0301 0.00337 3.8 -0.39588 n.s.
0702:0404 0.00328 3.7 -0.21706 n.s.
3801:0101 0.00317 3.5 0.03157 n.s.
400101g:0401 0.00281 3.1 -0.06139 n.s.
5701:1501 0.00277 3.1 -0.0961 n.s.
350101g:0407 0.00275 3.1 0.22884 0.003938248
510101g:1104 0.00269 3 -0.12699 n.s.
0702:1401 0.00269 3 0.09325 n.s.
510101g:0403 0.00269 3 0.26093 0.000451137
350101g:0403 0.00269 3 0.25101 0.002579612
5701:0801 0.00269 3 0.07429 0.014681015
4405:1601 0.00269 3 0.40623 4.97935E-08
0705g:1001 0.00269 3 0.49457 2.26745E-31
5801:1302 0.00269 3 0.30206 2.65521E-05
5001:0301 0.00269 3 0.12489 n.s.
4501g:0401 0.00269 3 0.46346 2.54411E-05
350101g:0402 0.00268 3 0.14346 0.034477034
3801:1101 0.00265 3 0.0511 n.s.
180101g:1302 0.00265 3 0.00481 n.s.
Total 0.72793 812.8

Table 5.

HLA three- and four-locus haplotype frequencies

Loci Haplotype Frequency Observed (n)
A-B-DRB1 010101g:0801g:0301 0.04544 50.7
03010101g:0702:1501 0.02985 33.3
02010101g:1302:0701 0.01510 16.9
02010101g:0702:1501 0.01440 16.1
02010101g:15010101g:0401 0.01425 15.9
2902:4403:0701 0.01344 15.0
03010101g:350101g:0101 0.01325 14.8
02010101g:0801g:0301 0.01211 13.5
02010101g:5701:0701 0.00986 11.0
24020101g:3502:1104 0.00976 10.9
02010101g:44020101g:0401 0.00896 10.0
03010101g:0702:1301 0.00828 9.2
010101g:3502:1104 0.00824 9.2
02010101g:180101g:1104 0.00813 9.1
3301:1402:0102 0.00806 9.0
2501:180101g:1501 0.00803 9.0
1101:350101g:0101 0.00759 8.5
2301g:4403:0701 0.00717 8.0
3101:400101g:0404 0.00704 7.9
24020101g:0702:1501 0.00701 7.8
02010101g:270502g:0101 0.00627 7.0
2601:3801:0402 0.00627 7.0
02010101g:44020101g:1601 0.00627 7.0
02010101g:44020101g:0101 0.00627 7.0
02010101g:2702:1601 0.00538 6.0
0205:5001:0701 0.00538 6.0
02010101g:400101g:1302 0.00530 5.9
02010101g:400101g:0801 0.00501 5.6
010101g:5701:0701 0.00473 5.3
010101g:0801g:1501 0.00451 5.0
02010101g:400101g:0404 0.00448 5.0
010101g:4403:0701 0.00448 5.0
1101:44020101g:0401 0.00448 5.0
02010101g:5201:1502 0.00448 5.0
02010101g:510101g:1301 0.00448 5.0
03010101g:510101g:0101 0.00445 5.0
02010101g:44020101g:0701 0.00442 4.9
02010101g:0702:0801 0.00437 4.9
1101:0702:1501 0.00411 4.6
02010101g:510101g:0404 0.00407 4.5
03010101g:0801g:0301 0.00390 4.4
02010101g:350101g:0101 0.00378 4.2
010101g:1302:0701 0.00362 4.0
02010101g:4901:1101 0.00358 4.0
02010101g:4901:1302 0.00358 4.0
03010101g:0702:0101 0.00358 4.0
02010101g:44020101g:120101g 0.00358 4.0
03010101g:0702:0401 0.00358 4.0
02010101g:15010101g:1101 0.00358 4.0
6802:1402:1303 0.00358 4.0
24020101g:4403:1101 0.00358 4.0
2601:510101g:1301 0.00358 4.0
3101:510101g:1301 0.00358 4.0
02010101g:5701:1301 0.00358 4.0
03010101g:350101g:1501 0.00356 4.0
1101:15010101g:1301 0.00344 3.8
03010101g:0702:0701 0.00327 3.7
1101:350101g:0103 0.00312 3.5
24020101g:350101g:0101 0.00312 3.5
02010101g:15010101g:1501 0.00295 3.3
03010101g:0702:0301 0.00289 3.2
02010101g:1302:1501 0.00282 3.1
010101g:0801g:0701 0.00278 3.1
010101g:180101g:1104 0.00276 3.1
02010101g:44020101g:1104 0.00275 3.1
02010101g:400101g:0101 0.00269 3.0
02010101g:0801g:0401 0.00269 3.0
010101g:270502g:0301 0.00269 3.0
680102g:44020101g:1302 0.00269 3.0
2601:270502g:0101 0.00269 3.0
680102g:350101g:0407 0.00269 3.0
24020101g:3801:1301 0.00269 3.0
03010101g:1402:0102 0.00269 3.0
02010101g:15010101g:0901 0.00269 3.0
02010101g:350101g:1501 0.00269 3.0
02010101g:180101g:1101 0.00269 3.0
24020101g:350101g:0404 0.00269 3.0
03010101g:3503:0801 0.00269 3.0
6601:4102:1303 0.00269 3.0
24020101g:180101g:0301 0.00269 3.0
02010101g:510101g:0901 0.00269 3.0
680102g:270502g:1301 0.00265 3.0
24020101g:1302:0701 0.00265 3.0
Total 0.50999 569.3
A-C-B-DRB1 010101g:070101g:0801g:0301 0.04352 48.6
03010101g:0702:0702:1501 0.02949 32.9
02010101g:0602:1302:0701 0.01511 16.9
02010101g:070101g:0801g:0301 0.01410 15.7
02010101g:0702:0702:1501 0.01373 15.3
2902:1601:4403:0701 0.01286 14.4
03010101g:04010101g:350101g:0101 0.01251 14.0
02010101g:070101g:180101g:1104 0.00986 11.0
02010101g:0501g:44020101g:0401 0.00941 10.5
24020101g:04010101g:3502:1104 0.00896 10.0
02010101g:0304:15010101g:0401 0.00894 10.0
1101:04010101g:350101g:0101 0.00886 9.9
03010101g:0702:0702:1301 0.00829 9.3
3301:0802:1402:0102 0.00806 9.0
02010101g:0602:5701:0701 0.00805 9.0
2501:1203:180101g:1501 0.00802 8.9
3101:0304:400101g:0404 0.00717 8.0
2301g:04010101g:4403:0701 0.00716 8.0
24020101g:0702:0702:1501 0.00699 7.8
2601:1203:3801:0402 0.00627 7.0
010101g:04010101g:3502:1104 0.00550 6.1
02010101g:0304:400101g:0801 0.00538 6.0
02010101g:0202:510101g:1301 0.00538 6.0
0205:0602:5001:0701 0.00538 6.0
02010101g:0304:400101g:1302 0.00528 5.9
02010101g:0501g:44020101g:0701 0.00506 5.6
010101g:070101g:0801g:1501 0.00459 5.1
02010101g:0202:2702:1601 0.00448 5.0
02010101g:0304:400101g:0404 0.00448 5.0
02010101g:0702:0702:0801 0.00448 5.0
03010101g:0702:0702:0101 0.00448 5.0
02010101g:1202:5201:1502 0.00448 5.0
02010101g:0501g:44020101g:0101 0.00448 5.0
02010101g:070101g:4901:1101 0.00448 5.0
02010101g:04010101g:350101g:1501 0.00448 5.0
1101:0702:0702:1501 0.00448 5.0
02010101g:0304:400101g:0101 0.00403 4.5
1101:0303:15010101g:1301 0.00358 4.0
010101g:0602:1302:0701 0.00358 4.0
02010101g:0202:270502g:0101 0.00358 4.0
010101g:0602:3502:1104 0.00358 4.0
6802:0802:1402:1303 0.00358 4.0
010101g:0602:5701:0701 0.00354 4.0
02010101g:0303:15010101g:1301 0.00343 3.8
03010101g:0702:0702:0701 0.00315 3.5
1101:0501g:44020101g:0401 0.00314 3.5
02010101g:0602:1302:1501 0.00282 3.1
010101g:070101g:0801g:0701 0.00278 3.1
02010101g:070101g:0801g:0401 0.00269 3.0
010101g:0102:270502g:0301 0.00269 3.0
680102g:070401g:44020101g:1302 0.00269 3.0
010101g:1601:4403:0701 0.00269 3.0
03010101g:04010101g:3503:0701 0.00269 3.0
24020101g:0602:1302:0701 0.00269 3.0
03010101g:0802:1402:0102 0.00269 3.0
24020101g:0202:4405:0101 0.00269 3.0
03010101g:0303:15010101g:0401 0.00269 3.0
02010101g:0303:15010101g:1101 0.00269 3.0
3002:0501g:180101g:0301 0.00269 3.0
03010101g:0702:0702:0401 0.00269 3.0
24020101g:070101g:180101g:1101 0.00269 3.0
2601:070401g:44020101g:1601 0.00269 3.0
24020101g:0501g:180101g:0301 0.00269 3.0
680102g:0102:270502g:1301 0.00269 3.0
6601:1701g:4102:1303 0.00269 3.0
Total 0.43091 480.8

Ninety six haplotypes were identified at least three times and accounted for almost 80% (h.f. =0.7936) of the A:B haplotypes (Table 4). The five most frequent A:B haplotypes were A*010101g:B*0801g (haplotype frequency [h.f.] = 0.0618), A*03010101g:B*0702 (h.f. = 0.0508), A*02010101g:B*15010101g (h.f. = 0.0337), A*02010101g:B*44020101g (h.f. = 0.0330), and A*02010101g:B*400101g (h.f. = 0.0286). Four of these haplotypes are also the four most frequent haplotypes in European Americans; the last haplotype listed above ranks sixth. Of the 55 C:B haplotypes accounting for 92% (h.f. =0.9189) of the total, the five most frequent haplotypes were identical to a previous study (61): C*0702:B*0702 (h.f. =0.1022), Cw*070101g:B*0801g (h.f. =0.0860), Cw*04010101g:B*350101g (h.f. =0.0573), Cw*0501g:B*44020101 (h.f. =0.0430), and Cw*0304: B*400101g (h.f. =0.04211). Ninety seven B:DRB1 haplotypes were observed in three or more individuals totaling to a haplotype frequency of 73% (h.f. =0.7279). The five most frequent B:DRB1 haplotypes were : B*0801g:DRB1*0301 (h.f. =0.0669), B*0702:DRB1*1501 (h.f. =0.05501), B*4403:DRB1*0701 (h.f. =0.0310), B*350101g:DRB1*0101 (h.f. =0.02833), and B*3502:DRB1*1104 (h.f. =0.0222). The last haplotype was ranked at 19 in the previous study of European Americans but the other four were also the four most frequent in that study.

Among the three locus haplotypes (Table 5), the five most common were: A*010101g:B*0801g:DRB1*0301 (h.f. =0.0454), A*03010101g:B*0702:DRB1*1501 (h.f. =0.0299), A*02010101g:B*1302:DRB1*0701 (h.f. =0.0151), A*02010101g:B*0702:DRB1*1501 (h.f. =0.0144), and A*02010101g:B*15010101g:DRB1*0401 (h.f. =0.0143). This included three of the top ranked haplotypes in the earlier study; the haplotype with DRB1*0701 ranked 19th and the haplotype with DRB1*0401 ranked sixth. Eighty three three locus haplotypes included 51% (h.f. =0.5100) of the total. Of the 65 haplotypes covering 43% of the total (h.f. =0.4309), the four locus haplotypes included three of the top ranked haplotypes in the earlier study: A*010101g:Cw*070101g:B*0801g:DRB1*0301 (h.f. =0.0435), A*03010101g:Cw*0702:B*0702:DRB1*1501 (h.f. =0.0295), and A*02010101g:Cw*0702:B*0702:DRB1*1501 (h.f. =0.0137). Haplotypes A*02010101g:Cw*0602:B*1302:DRB1*0701 (h.f. =0.0151) and A*02010101g:Cw*070101g:B*0801:DRB1*0301 (h.f. =0.0141) ranked 17th and 10th respectively in the earlier study. In summary, for the very common haplotypes, this Eastern European American population is similar to a European American population (61).

Global linkage disequilibrium (LD) estimates of the associations between HLA loci are shown in Table 6. All show significant LD. As already observed (62), C:B has the strongest association (D’=0.91160; Wn=0.72619). These genes lie adjacent to one another in the major histocompatibility complex (MHC) separated by a map distance of approximately 0.1 Mb (63). Separated by approximately 2.7 Mb, the A:DRB1 LD estimates have the lowest values (D’=0.41025; Wn=0.31802). This ranking of LD is the same as was observed observed for populations in the 13th International Histocompatibility Workshop Anthropology/Human Genetic Diversity Project, with LD generally decreasing in proportion to increasing map distance between loci (62). However, higher LD values were observed for A:B (D’=0.57324; Wn=0.40388) than A:C (D’=0.47558; Wn=0.033870). This was also observed in the 13th Workshop Anthropology project (62), and may be result from the low levels of polymorphism seen at HLA-C, relative to the HLA-A and -B loci; there are approximately half as many HLA-C alleles as HLA-A alleles, and one third as many HLA-C alleles as HLA-B alleles.

Table 6.

Pairwise global linkage disequilibrium estimatesa

Locus Pair D' Wn p-value
A:C 0.47558 0.33870 <0.0001
A:B 0.57324 0.40388 <0.0001
A:DRB1 0.41025 0.31802 <0.0001
C:B 0.91160 0.72619 <0.0001
C:DRB1 0.52904 0.36512 <0.0001
B:DRB1 0.63355 0.46386 <0.0001
a

For D’, values above 0 show a positive association between loci; values above 0.5 indicate a very strong association. Wn is an alternative measurement of linkage disequilibrium which is interpreted in a similar fashion to D’.

Admixture

To estimate admixture from non-Eastern European and non-European populations, we compared pairwise F'st values for this Eastern European American population sample to those calculated for A:C:B haplotypes from Sub-Saharan African, European, and western Asian populations, and to F'st values calculated for DRB1 allele frequencies in Sub-Saharan African, European, North African, and western Asian populations. Preliminary analyses (data not shown) indicated that the Eastern European American population was uniformly closer to European and European American populations (and significantly different from all non-European populations). Therefore, only results of comparisons between the Eastern European American population and other European and European American populations are presented in Table 7.

Table 7.

Pairwise F’st values for HLA loci comparing Eastern European Americans to other European and European American populations

Population
(reference)
A:C:B
Haplotypes
p-valuea DRB1 Alleles p-valuea
Poland (39) 0.56140b 0.00000* 0.00722 0.18359
Poland (40) 0.03986 0.00586*
Czech Republic (10) 0.011018 0.17117 0.00000 0.61230
Czech Republic (31) 0.00000 0.82324
Slovenia (10) 0.00000 0.66895
Slovenia (44) 0.00000 0.59473
Russia (43) 0.00159 0.37402
Russia (42) 0.08583 0.00391*
Bulgaria (28) 0.18488 0.00098*
Croatia (30) 0.16865 0.01855*
Croatia (29) 0.14136 0.03906*
Georgia (10) 0.125765 0.00000*
Georgia (32) 0.10337 0.00293*
Macedonia (35) 0.15017 0.00000*
Macedonia (36) 0.14089 0.00000*
Finland (10) 0.089037 0.00000* 0.13423 0.00000*
Norway (37) 0.04690 0.00391*
Norway (38) 0.02035 0.166113
Sweden (49) 0.04840 0.00586*
Germany (33) 0.00000 0.94922
N. Ireland (10) 0.086255 0.00000* 0.07945 0.00000*
Spain (45)c 0.38327 0.00000*
Spain (46) 0.00000 0.44824
Spain (45)d 0.31151 0.00000*
Spain (47) 0.04005 0.01855*
Spain (48) 0.03452 0.00000*
Portugal (41) 0.00694 0.26172
Italy (34) 0.35147 0.00000*
USA (54) 0.01595 0.16699
USA (55) 0.02054 0.00195*
USA (56)e 0.00000 0.71875
USA (53) 0.012799 0.01802*
*

p<0.05, Fst values are significantly different

a

p values for unstandardized Fst values

b

Because the A:C:B haplotypes for this population included a "blank" haplotype with a frequency of 0.22, the F'st value for this population is interpreted differently than others. See the Results section for details.

c

This is the Cabuernigo population described by Sanchez-Velasco et al. (45).

d

This is the Pasiego population described by Sanchez-Velasco et al. (45).

e

This is the White population described by Rossman et al. (56).

At the DRB1 locus, the Eastern European American population did not differ significantly from eastern European populations from Poland (1 of 2 studies), the Czech Republic (2 of 2 studies), Slovenia (2 of 2 studies), and Russia (1 of 2 studies). However, the Eastern European American population was also not significantly different from European populations from Spain (1 of 5 studies), Norway (1 of 2 studies), Germany, and Portugal, or from general European American populations (2 of 3 studies).

Considering A:C:B haplotypes, the Eastern European American population was most similar to the Czech population and a general European American population (F'st = 0.011 and 0.0013, respectively). The Eastern European American population did not differ significantly from the Czech population (p-value = 0.17117 +/− 0.0286) but differed significantly from the European American population (p-value = 0.018+/−0.012). A Polish population (39) with a "blank" A:C:B haplotype frequency of 0.22 was included in this haplotype analysis. The F'st values involving this Polish population and other European or European American populations were uniformly higher than those between all other European and European American populations (means = 0.6316 and 0.1015, respectively) and this Polish population differed significantly from all other populations included in the analysis. However, the lowest pairwise F'st value involving this Polish population (0.5614) is observed for the comparison with the Eastern European American population.

Overall, the Eastern European American population was most similar to the Czech and European American populations when comparing A:C:B haplotypes, and was most similar (F'st equivalent to zero) to the Czech and Slovenian populations, as well as to German, European American and Spanish populations when considering DRB1 allele frequencies, suggesting that this group represents a distinct sub-sample of the European American population, with a considerable (but not exclusively) eastern European contribution.

Discussion

DNA sequencing was used to identify HLA alleles from a population of individuals from the United States who self identified as having Eastern European ancestry. The 2000 US census suggest that half of the Americans of Eastern European ancestry are at least partially of Polish descent, and a Polish contribution to the Eastern European American population is reflected our analyses, wherein comparison of the Eastern European American population to a variety of world populations demonstrated a strong similarity to Czech, Polish, Russian and Slovenian populations. Pairwise F'st values also showed similarities to European American populations, which might be expected since the latter would include individuals with eastern European ancestry. Similarities between the Eastern European American population and other European populations are indicated when considering DRB1 alleles, but not when considering A:C:B haplotypes; this reflects the subdivision of alleles that are common across Europe into multi-locus haplotypes that display regional restriction. Overall, our analyses suggest that self-identified ethnicity, in spite of the variation in how race and ethnicity is perceived among individuals (64), is still a fairly accurate estimation of ancestry.

First settled by Europeans in the late 1500s, the United States is a nation of immigrants from predominantly Europe but with an increasing contribution from Latin America, Africa and Asia. In the U.S. census of 2000, almost 60% of individuals specified a single ancestry and 30% specified multiple ancestry (1). Understanding this complex and evolving population structure to better define subgroups of individuals who share a common genetic heritage will lead to a better understanding of disease risk and the outcomes of medical intervention (64,65). Studies employing genome-wide genetic markers including microsatellites, Alu markers, and single nucleotide polymorphisms have mapped human genetic diversity in world-wide populations for single markers and haplotypes (66,67). Within U.S. populations, the focus has been on broad population groups: European Americans, African Americans, Hispanic Americans, and Asian Americans. Based on our findings distinguishing Americans with Eastern European heritage from a more general European American population and to a certain extent from populations in other regions of Europe, these genome-wide studies should be extended to better define subpopulations of European Americans.

In the 2000 U.S. census, 7% of individuals reported their ancestry as American. It is likely that this “new” population arising from the admixture of many groups will continue to increase over time. Loss of ethnic identity will pose a significant challenge for registries supporting hematopoietic stem cell transplantation with an unrelated donor. Today more than one third of donors are identified through an international search and reside in a country different from that of the donor (68). Knowledge of the ancestry of an individual is critical in predicting the HLA alleles carried by a potential donor with only low resolution HLA typing results. This information is used by histocompatibility experts and by algorithms to predict the likelihood of HLA allele matching (69). Registries should begin to develop strategies that will allow them to predict population group membership to guide searches. Such strategies may require testing of other polymorphic markers such as microsatellites for comparison with population databases (64,65).

Acknowledgment

This research is supported by funding from the Office of Naval Research N00014-04-1-0795 (C.K.H., J.N.) and NIH grant GM35326 (G.T., A.L.) . The views expressed in this article are those of the authors and do not reflect the official policy or position of the Department of the Navy, the Department of Defense, or the U.S. government. We would like to thank Olga Malinowski for technical assistance.

References

  • 1.Brittingham A, de la Cruz GC. Ancestry: 2000. 2000 Brief. US Census Bureau. 2004 [Google Scholar]
  • 2.Parham P, Ohta T. Population biology of antigen presentation by MHC class I molecules. Science. 1996;272:67–74. doi: 10.1126/science.272.5258.67. [DOI] [PubMed] [Google Scholar]
  • 3.Tu B, Mack SJ, Lazaro A, Lancaster A, Thomson G, Cao K, Chen M, Ling G, Hartzman R, Ng J, Hurley CK. HLA-A, -B, -C, -DRB1 allele and haplotype frequencies in an African American population. Tissue Antigens. 2007;69:73–85. doi: 10.1111/j.1399-0039.2006.00728.x. [DOI] [PubMed] [Google Scholar]
  • 4.Robinson J, Waller MJ, Parham P, de Groot N, Bontrop R, Kennedy LJ, Stoehr P, Marsh SG. IMGT/HLA and IMGT/MHC: sequence databases for the study of the major histocompatibility complex. Nucleic Acids Res. 2003;31:311–314. doi: 10.1093/nar/gkg070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lazaro A, Cao K, Masaberg C, Steiner N, Xiao Y, Tu B, Turner V, Nickerson P, Stoll S, Schall C, Valdez R, Ng J, Hartzman R, Hurley CK. Twenty-three novel HLA-B alleles identified during intermediate-resolution testing. Tissue Antigens. 2006;68:245–248. doi: 10.1111/j.1399-0039.2006.00661.x. [DOI] [PubMed] [Google Scholar]
  • 6.Marsh SG, Albert ED, Bodmer WF, Bontrop RE, Dupont B, Erlich HA, Geraghty DE, Hansen JA, Hurley CK, Mach B, Mayr WR, Parham P, Petersdorf EW, Sasazuki T, Schreuder GM, Strominger JL, Svejgaard A, Terasaki PI, Trowsdale J. Nomenclature for factors of the HLA system, 2004. Tissue Antigens. 2005;65:301–369. doi: 10.1111/j.1399-0039.2005.00379.x. [DOI] [PubMed] [Google Scholar]
  • 7.Lancaster AK, Nelson MP, Single RM, Meyer D, Thomson G. PyPop: a software framework for population genomics: analyzing large-scale multi-locus genotype data. In: Altman RB, Dunker K, Hunter L, Jung T, Klein T, editors. Pacific Symposium on Biocomputing 8. Singapore: World Scientific; 2003. pp. 514–525. [PMC free article] [PubMed] [Google Scholar]
  • 8.Lancaster AK, Single RM, Solberg OD, Nelson MP, Thomson G. PyPop update - a software pipeline for large-scale multilocus population genomics. Tissue Antigens. 2007;69:192–197. doi: 10.1111/j.1399-0039.2006.00769.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Guo SW, Thompson EA. Performing the exact test of Hardy-Weinberg proportion for multiple alleles. Biometrics. 1992;48:361–372. [PubMed] [Google Scholar]
  • 10.Meyer D, Single R, Mack SJ, Lancaster A, Nelson MP, Erlich HA, Fernandez-Vina M, Thomson G. 13th IHWS anthropology/human genetic diversity joint report. Chapter 4. Single locus polymorphism of classical HLA genes. In: Hansen JA, editor. Immunobiology of the human MHC. Seattle: International Histocompatibility Working Group Press; 2006. pp. 653–704. [Google Scholar]
  • 11.Ewens W. The sampling theory of selectively neutral alleles. Theor Pop Biol. 1972;3:87–112. doi: 10.1016/0040-5809(72)90035-4. [DOI] [PubMed] [Google Scholar]
  • 12.Watterson G. The homozygosity test of neutrality. Genetics. 1978;88:405–417. doi: 10.1093/genetics/88.2.405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Slatkin M. An exact test for neutrality based on the Ewens sampling distribution. Genetical Research. 1994;64:71–74. doi: 10.1017/s0016672300032560. [DOI] [PubMed] [Google Scholar]
  • 14.Slatkin M. A correction to the exact test based on the Ewens sampling distribution. Genetical Research. 1996;68:259–260. doi: 10.1017/s0016672300034236. [DOI] [PubMed] [Google Scholar]
  • 15.Salamon H, Klitz W, Easteal S, Gao X, Erlich HA, Fernandez-Vina M, Trachtenberg EA, McWeeney SK, Nelson MP, Thomson G. Evolution of HLA class II molecules: Allelic and amino acid site variability across populations. Genetics. 1999;152:393–400. doi: 10.1093/genetics/152.1.393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Dempster A, Laird N, Rubin D. Maximum likelihood estimation from incomplete data using the EM algorithm. J Royal Stat Soc. 1977;39:1–38. [Google Scholar]
  • 17.Excoffier L, Slatkin M. Maximum-likelihood estimation of molecular haplotype frequencies in a diploid population. Mol Biol Evol. 1995;12:921–927. doi: 10.1093/oxfordjournals.molbev.a040269. [DOI] [PubMed] [Google Scholar]
  • 18.Hedrick PW. Gametic disequilibrium measures: proceed with caution. Genetics. 1987;117:331–334. doi: 10.1093/genetics/117.2.331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Lewontin RC. The interaction of selection and linkage. II. Optimum models. Genetics. 1964;50:757–782. doi: 10.1093/genetics/50.4.757. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Lewontin RC. On measures of gametic disequilibrium. Genetics. 1988;120:849–852. doi: 10.1093/genetics/120.3.849. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Cramer H. Mathematical methods of statistics. Princeton, NJ: University Press; 1946. [Google Scholar]
  • 22.Excoffier L, Laval G, Schneider S. Arlequin (version 3.0): an integrated software package for population genetics data analysis. Bioinformatics Online. 2005;1:47–50. [PMC free article] [PubMed] [Google Scholar]
  • 23.Cao K, Moormann AM, Lyke KE, Masaberg C, Sumba OP, Doumbo OK, Koech D, Lancaster A, Nelson M, Meyer D, Single R, Hartzman RJ, Plowe CV, Kazura J, Mann DL, Sztein MB, Thomson G, Fernandez-Vina MA. Differentiation between African populations is evidenced by the diversity of alleles and haplotypes of HLA class I loci. Tissue Antigens. 2004;63:293–325. doi: 10.1111/j.0001-2815.2004.00192.x. [DOI] [PubMed] [Google Scholar]
  • 24.Izaabel H, Djoulah SGHJ, Caillat-Zucman S, Beaurain G, Akhayat O, Bach J-F, Khalil I, Benhamamouch S, Degos L, Deschamps I, Hors J, Sanchez-Mazas A. Populations: Moroccan from Souss, Morocco and Algerian from Oran, Algeria. In: Hansen JA, editor. Immunobiology of the Human MHC. Seattle: International Histocompatibility Working Group Press; 2006. pp. 591–592. [Google Scholar]
  • 25.Canossi A, Piancatelli D, Oumhani K, Aureli A, Di Rocco M, Del Beato T, Liberatore G, Casciani CU, El Aouad R, Adorno D. Population: Chaouya from Morocco. In: Hansen JA, editor. Immunobiology of the Human MHC. Seattle: International Histocompatibility Working Group Press; 2006. pp. 592–594. [Google Scholar]
  • 26.Piancatelli D, Canossi A, Oumhani K, Aureli A, Di Rocco M, Del Beato T, Liberatore G, Casciani CU, El Aouad R, Adorno D. Population: Metalsa from Morocco. In: Hansen JA, editor. Immunobiology of the Human MHC. Seattle: International Histocompatibility Working Group Press; 2006. pp. 594–595. [Google Scholar]
  • 27.Arnaiz-Villena A, Moscoso J, Martinez-Laso J, Zamora J, Lowy E, del Moral P, Gomez-Casado E. Moroccan from El Jadida, Morocco. In: Hansen JA, editor. Immunobiology of the Human MHC. Seattle: International Histocompatibility Working Group Press; 2006. pp. 595–596. [Google Scholar]
  • 28.Ivanova M, Rozemuller E, Tyufekchiev N, Michailova A, Tilanus M, Naumova E. HLA polymorphism in Bulgarians defined by high-resolution typing methods in comparison with other populations. Tissue Antigens. 2002;60:496–504. doi: 10.1034/j.1399-0039.2002.600605.x. [DOI] [PubMed] [Google Scholar]
  • 29.Martinović I, Bakran M, Chaventré A, Janićijević B, Jovanović V, Smolej-Narancić N, Kastelan A, Grubić Z, Zunec R, Roberts DF, Rudan P. Application of HLA class II polymorphism analysis to the study of the population structure of the Island of Krk, Croatia. Hum Biol. 1997;69:819–829. [PubMed] [Google Scholar]
  • 30.Martinović I, Bakran M, Chaventré A, Janićijević B, Jovanović V, Smolej-Narancić N, Kastelan A, Grubić Z, Zunec R, Roberts DF, Rudan P. Application of HLA class II polymorphism analysis to the study of the population structure of the Island of Krk, Croatia. Hum Biol. 1997;69:819–829. [PubMed] [Google Scholar]
  • 31.Cerná M, Fernandez-Viña M, Ivásková E, Stastny P. Comparison of HLA class II alleles in Gypsy and Czech populations by DNA typing with oligonucleotide probes. Tissue Antigens. 1992;39:111–116. doi: 10.1111/j.1399-0039.1992.tb01919.x. [DOI] [PubMed] [Google Scholar]
  • 32.Sánchez-Velasco P, Leyva-Cobián F. The HLA class I and class II allele frequencies studied at the DNA level in the Svanetian population (Upper Caucasus) and their relationships to Western European populations. Tissue Antigens. 2001;58:223–233. doi: 10.1034/j.1399-0039.2001.580402.x. [DOI] [PubMed] [Google Scholar]
  • 33.Reil A, Bein G, Machulla HK, Sternberg B, Seyfarth M. High-resolution DNA typing in immunoglobulin A deficiency confirms a positive association with DRB1*0301, DQB1*02 haplotypes. Tissue Antigens. 1997;50:501–506. doi: 10.1111/j.1399-0039.1997.tb02906.x. [DOI] [PubMed] [Google Scholar]
  • 34.Lampis R, Morell L, Congia M, Doloretta Macis M, Mulargia A, Loddo M, De Virgiliis S, Marrosu MG, Todd JA, Cucca F. The inter-regional distribution of HLA class II haplotypes indicates the suitability of the Sardinian population for case-control association studies in complex diseases Human. Molecular Genetics. 2000;9:2959–2965. doi: 10.1093/hmg/9.20.2959. [DOI] [PubMed] [Google Scholar]
  • 35.Arnaiz-Villena A, Dimitroski K, Pacho A, Moscoso J, Gómez-Casado E, Silvera-Redondo C, Varela P, Blagoevska M, Zdravkovska V, Martínez-Laso J. HLA genes in Macedonians and the sub-Saharan origin of the Greeks. Tissue Antigens. 2001;57:118–127. doi: 10.1034/j.1399-0039.2001.057002118.x. [DOI] [PubMed] [Google Scholar]
  • 36.Petlichkovski A, Efinska-Mladenovska O, Trajkov D, Arsov T, Strezova A, Spiroski M. High-resolution typing of HLA-DRB1 locus in the Macedonian population. Tissue Antigens. 2004;64:486–491. doi: 10.1111/j.1399-0039.2004.00273.x. [DOI] [PubMed] [Google Scholar]
  • 37.Rønningen KS, Spurkland A, Markussen G, Iwe T, Vartdal F, Thorsby E. Distribution of HLA class II alleles among Norwegian Caucasians. Hum Immunol. 1990;29:275–281. doi: 10.1016/0198-8859(90)90041-m. [DOI] [PubMed] [Google Scholar]
  • 38.Lin L, Jin L, Kimura A, Carrington M, Mignot E. DQ microsatellite association studies in three ethnic groups. Tissue Antigens. 1997;50:507–520. doi: 10.1111/j.1399-0039.1997.tb02907.x. [DOI] [PubMed] [Google Scholar]
  • 39.Nowak J, Mika-Witkowska R, Polak M, Zajko M, Rogatko-Koro? M, Graczyk-Pol E, Lange A. Allele and extended haplotype polymorphism of HLA-A, -C, -B, -DRB1 and -DQB1 loci in Polish population and genetic affinities to other populations. Tissue Antigens. 2008;71:193–205. doi: 10.1111/j.1399-0039.2007.00991.x. [DOI] [PubMed] [Google Scholar]
  • 40.Bodmer J, Cambon-Thomsen A, Hors J, Piazza A, Sanchez-Mazas A. Report of the Anthropology Component. In: Charron D, editor. HLA: Proceedings of the Twelfth International Histocompatibility Workshop and Conference; 1997. pp. 269–274. [Google Scholar]
  • 41.Spínola H, Middleton D, Brehm A. HLA genes in Portugal inferred from sequence-based typing: in the crossroad between Europe and Africa. Tissue Antigens. 2005;66:26–36. doi: 10.1111/j.1399-0039.2005.00430.x. [DOI] [PubMed] [Google Scholar]
  • 42.Arnaiz-Villena A, Martinez-Laso J, Moscoso J, Livshits G, Zamora J, Gomez-Casado E, Silvera-Redondo C, Melvin K, Crawford MH. HLA genes in the Chuvashian population from European Russia: admixture of Central European and Mediterranean populations. Hum Biol. 2003;75:375–392. doi: 10.1353/hub.2003.0040. [DOI] [PubMed] [Google Scholar]
  • 43.Kapustin S, Lyshchov A, Alexandrova J, Imyanitov E, Blinov M. HLA class II molecular polymorphisms in healthy Slavic individuals from North-Western Russia. Tissue Antigens. 1999;54:517–520. doi: 10.1034/j.1399-0039.1999.540509.x. [DOI] [PubMed] [Google Scholar]
  • 44.Petrone A, Battelino T, Krzisnik C, Bugawan TL, Erlich HA, Di Mario A, Pozzilli P, Buzzetti R. Similar incidence of type 1 diabetes in two ethnically different populations (Italy and Slovenia) is sustained by similar HLA susceptible/protective haplotype frequencies. Tissue Antigens. 2002;60:244–253. doi: 10.1034/j.1399-0039.2002.600306.x. [DOI] [PubMed] [Google Scholar]
  • 45.Sánchez-Velasco P, Gomez-Casado E, Martinez-Laso J, Moscoso J, Zamora J, Lowy E, Silvera C, Cemborain A, Leyva-Cobián F, Arnaiz-Villena A. HLA alleles in isolated populations from North Spain: Origin of the Basques and the ancient Iberians. Tissue Antigens. 2003;61 doi: 10.1034/j.1399-0039.2003.00041.x. 3804-392. [DOI] [PubMed] [Google Scholar]
  • 46.Sánchez-Velasco P, Escribano de Diego J, Paz-Miguel JE, Ocejo-Vinyals G, Leyva-Cobián F. HLA-DR, DQ nucleotide sequence polymorphisms in the Pasiegos (Pas valleys, Northern Spain) and comparison of the allelic and haplotypic frequencies with those of other European populations. Tissue Antigens. 1999;53:65–73. doi: 10.1034/j.1399-0039.1999.530107.x. [DOI] [PubMed] [Google Scholar]
  • 47.Muroa M, Marína L, Toríoa A, Moya-Quilesa MR, Minguelaa A, Rosique-Romana J, Sanchisa MJ, Garcia-Calatayuda MC, García-Alonsoa AM, álvarez-López MR. HLA polymorphism in the murcia population (Spain): in the cradle of the archaeologic Iberians. Human Immunology. 2001;62:910–921. doi: 10.1016/s0198-8859(01)00290-7. [DOI] [PubMed] [Google Scholar]
  • 48.Vidal S, Morante MP, Moga E, Mosquera AM, Querol S, Garcia J, Rodriguez-Sanchez JI. Molecular analysis of HLA-DRB1 polymorphism in north-east Spain. Eur J Immunogenet. 2002;29:75–77. doi: 10.1046/j.0960-7420.2001.00293.x. [DOI] [PubMed] [Google Scholar]
  • 49.Allen M, Sandberg-Wollheim M, Sjogren K, Erlich HA, Petterson U, Gyllensten U. Association of susceptibility to multiple sclerosis in Sweden with HLA class II DRB1 and DQB1 alleles. Hum Immunol. 1994;39:41–48. doi: 10.1016/0198-8859(94)90099-x. [DOI] [PubMed] [Google Scholar]
  • 50.Mack SJ, Crawford MH, Saha N, Jani AJ, Geyer LN, Reddy M, Erlich H. Populations: Golla from Andhra Pradesh, and north Indians from New Delhi, India. In: Hansen JA, editor. Immunobiology of the Human MHC. Seattle: International Histocompatibility Working Group Press; 2006. pp. 605–607. [Google Scholar]
  • 51.Gazit E, Kalt R, Roewenthal R. Population: Ashkenazi and non-Ashkenazi Jews from Israel. In: Hansen JA, editor. Immunobiology of the Human MHC. Seattle: International Histocompatibility Working Group Press; 2006. pp. 607–608. [Google Scholar]
  • 52.Saruhan-Direskeneli G, Uyar FA. Population: Turkish from Marmara, Turkey. In: Hansen JA, editor. Immunobiology of the Human MHC. Seattle: International Histocompatibility Working Group Press; 2006. pp. 609–610. [Google Scholar]
  • 53.Cao K, Hollenbach J, Shi X, Shi W, Chopek M, Fernandez-Vina MA. Analysis of the frequencies of HLA-A, B, and C alleles and haplotypes in the five major ethnic groups of the United States reveals high levels of diversity in these loci and contrasting distribution patterns in these populations. Hum Immunol. 2001;62:1009–1030. doi: 10.1016/s0198-8859(01)00298-1. [DOI] [PubMed] [Google Scholar]
  • 54.Bugawan TL, Klitz W, Blair A, Erlich HA. High-resolution HLA class I typing in the CEPH families: analysis of linkage disequilibrium among HLA loci. Tissue Antigens. 2000;56:392–404. doi: 10.1034/j.1399-0039.2000.560502.x. [DOI] [PubMed] [Google Scholar]
  • 55.Klitz W, Maiers M, Spellman S, Baxter-Lowe LA, Schmeckpeper B, Williams TM, et al. New HLA haplotype frequency reference standards: high-resolution and large sample typing of HLA DR-DQ haplotypes in a sample of European Americans. Tissue Antigens. 2003;62:296–307. doi: 10.1034/j.1399-0039.2003.00103.x. [DOI] [PubMed] [Google Scholar]
  • 56.Rossman MD, Thompson B, Frederick M, Maliarik M, Iannuzzi MC, Rybicki BA, Pandey JP, Newman LS, Magira E, Beznik-Cizman B, Monos D. ACCESS Group. HLA-DRB1*1101: a significant risk factor for sarcoidosis in blacks and whites. Am J Hum Genet. 2003;73:720–735. doi: 10.1086/378097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Weir BS, Cockerham CC. 1984 Estimating F-statistics for the analysis of population structure. Evolution. 38:1358–1370. doi: 10.1111/j.1558-5646.1984.tb05657.x. [DOI] [PubMed] [Google Scholar]
  • 58.Hedrick PW. A Standardized Genetic Differentiation Measure. Evolution. 2005;59:1633–1638. [PubMed] [Google Scholar]
  • 59.Cano P, Klitz W, Mack SJ, Maiers M, Marsh SG, Noreen H, Reed EF, Senitzer D, Setterholm M, Smith A, Fernandez-Vina M. Common and well-documented HLA alleles: report of the Ad-Hoc committee of the american society for histocompatiblity and immunogenetics. Hum Immunol. 2007;68:392–417. doi: 10.1016/j.humimm.2007.01.014. [DOI] [PubMed] [Google Scholar]
  • 60.Lazaro AM, Xiao Y, Cao K, Masaberg C, Nichol L, Ng J, Hurley CK, Posch PE. Thirty-two novel HLA-A alleles identified during intermediate resolution testing. Tissue Antigens. 2008;71:165–168. doi: 10.1111/j.1399-0039.2007.00977.x. [DOI] [PubMed] [Google Scholar]
  • 61.Maiers M, Gragert L, Klitz W. High-resolution HLA alleles and haplotypes in the United States population. Hum Immunol. 2007;68:779–788. doi: 10.1016/j.humimm.2007.04.005. [DOI] [PubMed] [Google Scholar]
  • 62.Single R, Meyer D, Mack SJ, Lancaster A, Nelson MP, Erlich H, Fernandez-Viña MA, Thomson G. 13th International Histocompatibility Workshop anthropology / Human genetic diversity joint report. Haplotype frequencies and linkage disequilibrium among classical HLA genes. In: Hansen JA, editor. Immunobiology of the Human MHC. Seattle: International Histocompatibility Working Group Press; 2006. pp. 705–746. [Google Scholar]
  • 63.Horton R, Gibson R, Coggill P, Miretti M, Allcock RJ, Almeida J, Forbes S, Gilbert JG, Halls K, Harrow JL, Hart E, Howe K, Jackson DK, Palmer S, Roberts AN, Sims S, Stewart CA, Traherne JA, Trevanion S, Wilming L, Rogers J, de Jong PJ, Elliott JF, Sawcer S, Todd JA, Trowsdale J, Beck S. Variation analysis and gene annotation of eight MHC haplotypes: The MHC Haplotype Project. Immunogenetics. 2008;60:1–18. doi: 10.1007/s00251-007-0262-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Bamshad M, Wooding S, Salisbury BA, Stephens JC. Deconstructing the relationship between genetics and race. Nature Reviews Genetics. 2004;5:598–609. doi: 10.1038/nrg1401. [DOI] [PubMed] [Google Scholar]
  • 65.Rosenberg NA, Li LM, Ward R, Pritchard JK. Informativeness of genetic markers for inference of ancestry. Am J Hum Genet. 2003;73:1402–1422. doi: 10.1086/380416. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Conrad DF, Jakobsson M, Coop G, Wen X, Wall JD, Rosenberg NA, Pritchard JK. A worldwide survey of haplotype variation and linkage disequilibrium in the human genome. Nat Genet. 2006;38:1251–1260. doi: 10.1038/ng1911. [DOI] [PubMed] [Google Scholar]
  • 67.Guthery SL, Salisbury BA, Pungliya MS, Stephens JC, Bamshad M. The structure of common genetic variation in United States populations. Am J Hum Genet. 2007;81:1221–1231. doi: 10.1086/522239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Hurley CK, Fernandez VM, Setterholm M. Maximizing optimal hematopoietic stem cell donor selection from registries of unrelated adult volunteers. Tissue Antigens. 2003;61:415–424. doi: 10.1034/j.1399-0039.2003.00096.x. [DOI] [PubMed] [Google Scholar]
  • 69.Hurley CK, Wagner JE, Setterholm MI, Confer DL. Advances in HLA: Practical implications for selecting adult donors and cord blood units. Biol Blood Marrow Transplant. 2006;12:28–33. doi: 10.1016/j.bbmt.2005.10.005. [DOI] [PubMed] [Google Scholar]

RESOURCES