Skip to main content
Communications Biology logoLink to Communications Biology
. 2024 Jan 5;7:41. doi: 10.1038/s42003-023-05701-5

Human leukocyte antigen-DQA1*04:01 and rs2040406 variants are associated with elevated risk of childhood Burkitt lymphoma

Zhiwei Liu 1,#, Yang Luo 2,3,4,5,6,7,#, Samuel Kirimunda 8, Murielle Verboom 9, Olusegun O Onabajo 1, Mateus H Gouveia 10, Martin D Ogwang 11,12, Patrick Kerchan 12,13, Steven J Reynolds 14, Constance N Tenge 12,15, Pamela A Were 12,16, Robert T Kuremu 12,15, Walter N Wekesa 12,15, Nestory Masalu 17, Esther Kawira 12,18, Tobias Kinyera 11,12, Isaac Otim 11,12, Ismail D Legason 12,13, Hadijah Nabalende 11,12, Herry Dhudha 12,18, Leona W Ayers 19, Kishor Bhatia 1, James J Goedert 1, Nathan Cole 20, Wen Luo 20, Jia Liu 20, Michelle Manning 20, Belynda Hicks 20, Ludmila Prokunina-Olsson 1, George Chagaluka 21, W Thomas Johnston 22, Nora Mutalima 22,23, Eric Borgstein 21, George N Liomba 21, Steve Kamiza 21, Nyengo Mkandawire 21, Collins Mitambo 24, Elizabeth M Molyneux 21, Robert Newton 22, Ann W Hsing 25, James E Mensah 26, Anthony A Adjei 26, Amy Hutchinson 20, Mary Carrington 27,28, Meredith Yeager 20, Rainer Blasczyk 9, Stephen J Chanock 1, Soumya Raychaudhuri 3,4,5,6,7, Sam M Mbulaiteye 1,
PMCID: PMC10770398  PMID: 38182727

Abstract

Burkitt lymphoma (BL) is responsible for many childhood cancers in sub-Saharan Africa, where it is linked to recurrent or chronic infection by Epstein-Barr virus or Plasmodium falciparum. However, whether human leukocyte antigen (HLA) polymorphisms, which regulate immune response, are associated with BL has not been well investigated, which limits our understanding of BL etiology. Here we investigate this association among 4,645 children aged 0-15 years, 800 with BL, enrolled in Uganda, Tanzania, Kenya, and Malawi. HLA alleles are imputed with accuracy >90% for HLA class I and 85-89% for class II alleles. BL risk is elevated with HLA-DQA1*04:01 (adjusted odds ratio [OR] = 1.61, 95% confidence interval [CI] = 1.32-1.97, P = 3.71 × 10−6), with rs2040406(G) in HLA-DQA1 region (OR = 1.43, 95% CI = 1.26-1.63, P = 4.62 × 10−8), and with amino acid Gln at position 53 versus other variants in HLA-DQA1 (OR = 1.36, P = 2.06 × 10−6). The associations with HLA-DQA1*04:01 (OR = 1.29, P = 0.03) and rs2040406(G) (OR = 1.68, P = 0.019) persist in mutually adjusted models. The higher risk rs2040406(G) variant for BL is associated with decreased HLA-DQB1 expression in eQTLs in EBV transformed lymphocytes. Our results support the role of HLA variation in the etiology of BL and suggest that a promising area of research might be understanding the link between HLA variation and EBV control.

Subject terms: B-cell lymphoma, Genome-wide association studies, Immunogenetics


A study of 4,645 children in four countries in East Africa, 800 with Burkitt lymphoma (BL), identifies an association between HLA-DQA1*04:01 and variant rs2040406(G) and elevated risk of BL in Africa.

Introduction

Burkitt lymphoma (BL) is an aggressive B cell non-Hodgkin lymphoma1 first reported in African children in 1958 by Denis Burkitt2. BL incidence varies 3- to 5-fold within and across continents, but the highest incidence is recorded in children in sub-Saharan Africa (SSA, 2-4 per 100,000 person-years3,4), where it is approximately 10-fold higher than the incidence in the United States or Europe1. Epstein-Barr virus (EBV)5 and Plasmodium (P.) falciparum6 are established risk factors for BL in SSA. The association of BL with these infections strongly suggests that variation within the human leukocyte antigen (HLA) region, which plays an important role in the regulation of immune response7, could be related to the etiology of BL8. This hypothesis is consistent with studies that showed association of HLA-B*53 with protection against severe malaria in Gambian children9. Mechanistic studies suggested that this association was mediated by enhanced cytotoxicity against liver-stage malaria parasite forms among carriers of HLA-B*5310. However, a large genome-wide association study (GWAS) of 17,000 individuals from 11 populations, including people from Gambia, did not replicate this association11. HLA variation has also been associated with EBV control, including HLA-A*02:0112 and HLA-DQB1*0213 which are associated with elevated anti-EBV antibodies. HLA-A*02:01 is part of the HLA Class I system and may mediate direct killing of EBV-infected cells by modulating expression of CD8+ Th1-type immune response14. Conversely, HLA-DQB1*02 is part of the HLA Class II system and may mediate EBV control by facilitating anti-EBV antibody secretion through the modulation of peptide presentation to CD4+ T cells and their Th-2 signaling to B cells, thereby promoting an antibody response15,16. Therefore, the reported HLA associations are compatible with HLA modulating immune response to EBV or malaria as potential mechanisms for influencing risk of BL.

Only five studies have been conducted to directly test the association between HLA variation and risk of BL, yielding null results8,1720. Those studies were small (<100 cases), lacked suitable controls, and did not properly adjust for ancestry of participants. Moreover, most typed HLA alleles using serological methods, which can be less accurate21 and are limited to 1-field resolution. Only one study, which was conducted by our group previously, has used sequence-based typing (SBT)22 to obtain accurate high-resolution data (≥2 fields) in 600 participants (including 200 with BL) in the Epidemiology of Burkitt Lymphoma in East African Children and Minors (EMBLEM) study in Uganda23. Potentially significant associations were found between BL and HLA-A*02, HLA-B*41, and HLA-B*5823, underscoring the need to conduct HLA research using high-resolution typing and in a larger sample. We investigated associations of classical HLA alleles, SNPs, and amino acids within the HLA region with BL among 4645 children, 800 with BL, enrolled in Uganda, Tanzania, Kenya, and Malawi in who HLA variation was inferred from genome-wide genotype data obtained from an ongoing BL GWAS24. Here we report sucessful HLA imputation, using an updated multi-ancestry reference panel25, obtaining accurate HLA imputation (>90% for class I and 85–89% for class II alleles) when compared with SBT-results in a subset of participants (n = 600) with paired imputed and SBT data25. We observe significant associations between BL risk and HLA-DQA1*04:01 and rs2040406(G) in SSA. The higher risk rs2040406(G) variant for BL is associated with decreased HLA-DQB1 expression in eQTLs in EBV transformed lymphocytes, potentially suggesting that the HLA role is mediated by EBV control.

Results

Table 1 shows the participants’ characteristics (see Methods and Fig. 1). Most participants were from the EMBLEM study in Uganda, Tanzania, and Kenya (71.9% of the cases and 94.5% of controls). As expected and by study design1, compared to controls, BL cases were predominantly male (62.9% vs. 52.1%) and were aged 3–11 years old (77.9% of the cases vs. 74.5% of the controls). P. falciparum infection was detected in 282 (35.3%) of the BL cases vs. 1857 (48.3%) of the controls24.

Table 1.

Demographic characteristics of 4645 participants in EMBLEM and Malawi.

Characteristics Cases, n (%) (N = 800) Controls, n (%) (N = 3845)
Country where enrolled
EMBLEM study
 Uganda 249 (31.1%) 2004 (52.1%)
 Tanzania 97 (12.1%) 753 (19.6%)
 Kenya 229 (28.6%) 878 (22.8%)
 Sub-total 575 (71.9%) 3,635 (94.5%)
 Malawi 225 (28.1%) 210 (5.5%)
Sexa
 Female 297 (37.1%) 1840 (47.9%)
 Male 503 (62.9%) 2005 (52.1%)
Age, years
 0–2 67 (8.4%) 342 (8.9%)
 3–5 219 (27.4%) 914 (23.8%)
 6–8 239 (29.9%) 1130 (29.4%)
 9–11 165 (20.6%) 820 (21.3%)
 12–15 110 (13.8%) 639 (16.6%)
Plasmodium falciparum positivityb
 Negative 499 (62.4%) 1943 (50.5%)
 Positive 282 (35.3%) 1857 (48.3%)
 Missing 19 (2.4%) 45 (1.2%)
Seasonc
 Dry 115 (14.4%) 1766 (45.9%)
 Wet 383 (47.9%) 1869 (48.6%)
 Missing/unknown 302 (37.8%) 210 (5.5%)
Aread
 Urban 216 (27.0%) 1346 (35.0%)
 Rural 283 (35.4%) 2289 (59.5%)
 Missing/unknown 301 (37.6%) 210 (5.5%)

aIf the sex reported on the clinical forms did not match the genetically determined sex, the participant’s sex was determined based on their genetic sex.

bPlasmodium falciparum positivity was based on parasitemia measured on thick blood film microscopy, antigenemia on rapid diagnostic tests, or on polymerase chain reaction tests (see methods).

cData for season available only in the EMBLEM study. Calendar months of April–June and September–December months were wet season months; calendar months of January–March and July–August were dry season months, based on reports produced by the National Bureau of Statistics of each country.

dRural/urban status was determined only for the EMBLEM study. Rural areas were defined as “low-population density” villages if the population count of children aged 0–15 years in the village was less than the average population count for census enumeration areas in the regions studied for each country; otherwise as “urban/high-population density”.

Fig. 1. Map of study area and flow chard of study procedures.

Fig. 1

a Map showing geographical areas shaded green where Burkitt lymphoma cases and controls were enrolled. Participating hospitals are marked with a red cross, while the capital cities where major tertiary care centers are located are marked with white star on a green background. The map was drawn using ESRI ArcGIS Pro software. No portions of this figure were imported as image components from a database. b The Study workflow components.

HLA imputation accuracy at 2-field resolution for alleles with allelic fraction (AF) > 1% was >90% for Class I HLA-A, -B, and Class II HLA-DQA1, 89.5% for HLA-DQB1, 88.3% for HLA-DPB1, and 85.0% for HLA-DRB1 (Supplementary Fig. 1A). We observed a high correlation (r2 ≥ 0.8) between HLA allelic dosage (genotype imputation quality info) and actual typed HLA alleles with AF ≥ 1% (Supplementary Fig. 1B). Analysis of ancestry using only GWAS SNPs in the HLA region replicated the ancestry patterns observed using genome-wide data26 (Supplementary Fig. 2).

The number of imputed HLA variants after quality control is shown in Supplementary Table 1.

Of 12 HLA alleles with prior associations with severe malaria, EBV, or BL (Supplementary Table 2), nine with AF ≥ 1% were evaluated. HLA-A*23:01, which previously was associated with elevated anti-EBV VCA-IgG in a GWAS in Uganda27, was associated with decreased BL in our study (OR = 0.77; P = 0.027; Supplementary Table 3). The other alleles, including HLA-B*53, were not associated with BL. In Uganda only, decreased BL risk was observed with HLA-A*02 (OR = 0.61; P = 0.001) and -B*41 (OR = 0.45; P = 0.037; Supplementary Table 4), which in line with the findings we previously reported in a smaller sample size in Uganda.

After Bonferroni correction (P < 2.7 × 10-4), HLA-DQA1*04:01 was associated with elevated risk of BL (OR = 1.61, 95% CI = 1.32–1.97; P = 3.71 × 10−6, Table 2, Fig. 2A). Significant associations were observed with DQA1*04 (OR = 1.60, 95% CI = 1.31–1.96; P = 5.02 × 10−6), DQB1*04 (OR = 1.60, 95% CI = 1.30–1.98; P = 1.29 × 10−5), and DRB1*03:02 (OR = 1.47, 95% CI = 1.20–1.81; P = 2.30 × 10−4) (Supplementary Table 5). These associations were not significant in analysis conditioned on HLA-DQA1*04:01 (Fig. 2B). The observed associations were similar in country-specific analyses (Supplementary Table 6) and in the different sensitivity models (Supplementary Fig. 3 for model 1 and Supplementary Table 7 for models 2-5).

Table 2.

Top HLA alleles and Haplotypes, locus, and amino acid residue hits from analysis of 4,645 Participants (800 with BL) in EMBLEM and Malawi.

Category Variant Infoa Eff. Allele/Ref. Alleleb Combined Frequencyc Meta-analysisd Meta-analysise
Controls Cases OR (95% CI) Pmeta Phet OR (95% CI) Pcondition Phet
HLA allele DQA1*04:01 0.99 0.08 0.12 1.61 (1.32, 1.97) 3.71 × 10−6 0.92 1.29 (1.03, 1.63) 0.030 0.94
Class II haplotype DRB1*03:02-DQA1*04:01-DQB1*04:02 0.06 0.10 1.58 (1.25, 1.99) 1.08 × 10−4 0.83
SNP rs2040406 0.98 G/A 0.28 0.36 1.43 (1.26, 1.63) 4.62 × 10−8 0.84 1.68 (1.09, 2.59) 0.019 0.54
Imputed residue Gln 53 in HLA-DQA1 chainf 0.98 0.32 0.38 1.36 (1.20, 1.55) 2.06 × 10−6 0.60 0.79 (0.52, 1.21) 0.28 0.36

AA amino acid residue, HLA human leukocyte antigen, OR odds ratio, CI confidence interval. All statistical tests are 2-sided.

aImputation information (R2) was obtained from SNP2HLA.

bEffect allele/reference allele.

cFor locus rs2040406, the numbers indicate frequencies for the effect allele.

dEstimates were obtained from generalized linear mixed models (GLMMs), adjusting for sex, age (continuous), Plasmodium falciparum positivity, population substructure using the top 3 principal components (PCs) and genetic relatedness using a genetic relationship matrix. Association statistics were conducted by a fixed-effect meta-analysis across four countries. P values were corrected for multiple comparisons using Bonferroni adjustment (P < 2.7 × 10−4 for 187 classical HLA alleles and P < 1.1 × 10−6 for other 46,350 variants in the HLA region). Cochran’s heterogeneity p value was obtained.

eThe three HLA variants (DQA1*04:01, rs2040406, and Gln-53) were included in one model to determine if effects for one variant were independent of the other two.

fPosition 53 is polymorphic with three common residues: Q (Glutamine), K (Lysine), and R (Arginine). L (Leucine) is rare in our population. The analysis compared glutamine versus other variants at position 53.

Fig. 2. Region association plots of variants in the human leukocyte antigen region and Burkitt lymphoma based on 800 BL cases and 3,845 controls.

Fig. 2

a Shows results from unconditional analyses; b Shows analyses conditional on DQA1*04:01; c Shows results conditional on rs2040406. The color of the dots indicates different polymorphisms: grey dots indicate imputed SNPs; magenta dots indicate genotyped SNPs; turquoise dots indicate amino acids (AA); light blue dots indicate indels; and orange dots indicate classical HLA alleles at 1- or 2-field resolution.

We identified 24 haplotypes formed by HLA-DRB1, -DQA1 and -DQB1 genes with AF ≥ 0.01. The HLA haplotype substructure was different in our participants in East Africa compared to those in Ghana, consistent with our findings of significant population substructure between these regions26 (Supplementary Fig. 4). For example, HLA haplotype DRB1*08:04-DQB1*03:01-DQA1*04:01 was more frequent in Ghana (5%; 45/923) compared to East Africa (1%; 32/4645). This haplotype was not associated with BL risk (OR = 1.44, 95% CI = 0.82–2.51; p = 0.200), whereas DRB1*03:02- DQB1*04:02-DQA1*04:01, which was also more frequent in Ghana (10.0%), was associated with elevated BL risk (OR = 1.58, 95% CI = 1.25–1.99; P = 1.08 × 10−4) (Table 2).

After Bonferroni correction (P < 1.1 × 10-6), GWAS variant rs2040406(G) in the HLA-DQA1 region was associated with elevated BL risk (OR = 1.43, 95% CI = 1.26–1.63; P = 4.62 × 10−8, Table 2, Fig. 2A, and Supplementary Fig. 5). This association persisted in single variant conditional models adjusting for HLA-DQA1*04:01 (Table 2). Associations with elevated BL risk were observed with rs1064994(C) (OR = 1.38, P = 8.75 × 10−7), rs1065049(A) (OR = 1.39, P = 6.06 × 10−7), rs9272982(A) (OR = 1.38, P = 8.23 × 10−7), and rs1130399(A) (OR = 1.48, P = 1.21 × 10−7), but they were not significant after conditioning on rs2040406, which may reflect LD patterns that were strong (r2 > 0.8) for three of the four SNPs and weak or moderate for rs1130399 (r2 = 0.3) (Fig. 2C and Supplementary Table 8). Finally, we examined the combined effect of the presence of the two risk alleles. Although homozygosity for both HLA-DQA1*04:01 and SNP rs2040406 (with LD R2 = 0.225) was low (1.3% in the cases and 0.68% in the controls), children who were homozygous for both had a higher risk for BL (OR = 2.66, 95% CI = 1.19–5.95; P = 0.017) versus those not carrying either allele.

Among the imputed single amino acid residues in HLA-DQA1 chain, significant associations were observed with variants at position 53, with two allelic variants (Lys/Gln) in the current dataset. Compared to Lys, which is the most common residue at position 53, those having Gln residue at this position had an elevated risk for BL (OR = 1.36; 95% CI = 1.20–1.55; P = 2.06 × 10−6; Table 2). The 3D structure of the HLA-DQA1 chain suggests that Gln 53 is located within the peptide binding groove of HLA-DQ molecule, and may have functional impact on specificity of peptide binding and/or T cell receptor (TCR) contacts (Supplementary Fig. 6).

Among the controls, neither HLA-DQA1*04:01 nor rs2040406(G) alleles were associated with P. falciparum parasite density (Supplementary Fig. 7). In exploratory analyses, we observed no significant associations between HLA alleles and P. falciparum infection detection among the controls (Supplementary Fig. 8).

Regarding a potential EBV connection with HLA in our data, we note suggestive associations of elevated BL risk (below Bonferroni threshold) were observed with three GWAS SNPs previously associated with higher anti-EBV EBNA1 IgG antibodies in southwest Uganda27 (rs1064991(G), P = 2.11 × 10−6; rs3129867(G), P = 8.24 × 10−3; rs6927022(G), P = 1.99 × 10−2; Supplementary Table 9). We found no association between BL with three other GWAS SNPs (all p values > 0.05) that were previously associated with higher anti-EBV EBNA1 IgG antibodies in Europeans (rs2516049)28 and Mexican Americans (rs477515 and rs285427529) or with GWAS SNP rs28394498(T) associated with anti-EBV VCA IgG antibodies in southwest Uganda27 (P = 0.256). We note that the GWAS SNP with the strongest association with BL - rs2040406(G)- has been reported to be an eQTL for HLA-DQB1 (P = 1.0 × 10−10) and C4A (P = 1.1 × 10−6) in EBV-transformed lymphocytes in the Genotype-Tissue Expression (GTEx) v8 database (Supplementary Fig. 9 and Supplementary Table 10).

Contrary to our hypotheses about global associations of BL with HLA variation, our study did not find significant associations of BL with any of HLA alleles categorized as rare our dataset or with HLA zygosity in individuals. The possible exceptions were inverse associations between BL and carriage of common HLA-A alleles (OR = 0.86, P = 0.016, Supplementary Table 11) and homozygosity at HLA-C (OR = 0.62, P = 0.004, Supplementary Table 12). However, independent studies are needed to confirm these findings.

Discussion

Our study using imputed high-resolution HLA data from four countries in Africa supports the hypothesis that BL risk is related to HLA variation. This hypothesis was proposed five decades ago2, but has been difficult to confirm or refute because of lack of accurate HLA typing data and well-designed studies. We also report associations between BL risk and carriage of HLA-DQA1*04:01 allele and GWAS SNP rs2040406(G) in the HLA-DQA1 region. These HLA associations may be related to effects of HLA on EBV control because the higher risk rs2040406(G) variant for BL is in high LD (r2 > 0.8) with several GWAS SNPs (e.g., rs1064991) known to have pleiotropic effects against EBV humoral immune response26,27. Additionally, both rs2040406(G) and HLA-DQA1*04:01 have pleiotropic effects against multiple sclerosis30,31, which is EBV-linked32.

Because malaria is the strongest geographical risk factor for BL and the risk of BL increases by 39% per 100 cumulative infections in a child33, we hypothesized that HLA associations with BL may be related to control of malaria. However, neither HLA-DQA1*04:01 nor rs2040406(G) alleles were not associated with P. falciparum infection detection or parasite density. Our finding of a null association between BL and HLA-B*53 confirms a previous null result from a smaller study34, and is consistent with the null association between severe malaria and HLA-B*53 reported in a malaria GWAS of 17,000 individuals from 11 countries11. These results cast doubt on the frequently cited result of HLA-B*53 as a marker of malaria resistance9,35.

Several of our findings may suggest that there are other mechanisms that could link HLA variation to risk of BL. For example, HLA-DQA1*04:01 has been linked to autoimmune-related conditions, such as Henoch-Schönlein Purpura36. Is it possible that the association with HLA-DQA1*04:01 is pointing to a role of autoimmunity and risk of BL. This inference was suggested by reports of an inverse association between a history of allergy or asthma and BL risk among young adults in the International Lymphoma Consortium study37,38. It is also supported by findings that BL tumors display skewed usage of immunoglobulin heavy-chain variable gene segment 4 (IGH-V4), which is implicated in autoreactivity39.

The results also suggest immuno-genetic mechanisms in the etiology of BL that can be investigated by experimental methods, particularly with regard to the control of EBV infection as a critical factor affecting risk of BL. These include functional studies of expression of cytokines associated with Th1 (IFN-γ), Th2 (IL-10), or Treg (IL-17) phenotypes40 or the assessment of CD4+ and CD8+ T cell effector memory subsets targeting selected EBV proteins according to HLA type (HLA-DQA1*04:01 versus not). While previous studies have primarily centered on EBNA141, future research could be expanded to employ broader panels of EBV proteins, including the 33 EBV proteins, such as the viral capsid antigens (VCAp18, -p23, -p40, -p160), which showed to be differentially reactive in BL cases compared to controls. The studies could also use a smaller set of peptides, such as the four-marker immune panel, including BHRF1 (Bcl-2 homolog), BMRF1 (EAp47), BBLF1 (tegument protein), and BZLF1 (ZTA), whose reactivity most accurately classified BL status in patients in Ghana42.

Our study was possible because imputation of HLA alleles from GWAS data in sub-Saharan Africa, which we show to be a feasible and a practical way to obtain high-resolution HLA data to investigate the relationship between HLA variation and BL risk. Our study adds value to the ongoing GWAS and expands the experience with HLA imputation in African subjects25, who are currently underrepresented in genetic studies in Africa23.

We note several strengths of our study, including having the largest sample size to date, enrollment from four countries in Africa, availability of SBT data for evaluating the imputation accuracy, and detailed information on covariates to adjust for confounding. However, we acknowledge weaknesses, including lack of anti-EBV antibodies results, which limits our ability to discuss the link between HLA and EBV control. Although our study sample size is the largest to date, it is still relatively small for genome-wide studies and only included people from East Africa. Thus, our results may not be generalizable to other SSA populations. We did not use a GWAS significance threshold (e.g., 5 × 10−8), which is conservative for some of our exploratory hypotheses. Larger studies with more broad sampling across different countries in SSA are needed to confirm or refute our findings. Such studies may improve the capacity to investigate HLA in SSA, increase clarity of associations, and identify generalizable results.

To conclude, our findings confirm the high accuracy of HLA imputation from GWAS data in populations with admixed Nilotic ancestry. We report significant association of BL risk with HLA-DQA1*04:01 and rs2040406(G) in East African children. We hypothesize that the observed associations could be mediated by HLA effects on control of EBV infection and/or autoimmunity, and suggest a promising area of research might be understanding the link between HLA variation and EBV control in SSA populations.

Methods

Study population

The methods of the EMBLEM24 and the Infections and Childhood Cancer studies have previously been published43. The EMBLEM study is a population-based study that enrolled cases and controls aged 0-15 years in two neighboring regions in Uganda and in four neighboring regions in Kenya and Tanzania (Fig. 1). BL diagnosis was based on local histological/cytological diagnosis (74% of included cases) or clinical, imaging, and laboratory diagnosis. There were no apparent demographic or clinical differences between these groups. The controls were apparently healthy children enrolled from random villages (100 in Uganda, 100 in Kenya, and 95 in Tanzania)24. HIV-positive participants in EMBLEM were not excluded because HIV infection was rare (24 BL cases and 15 controls)24. In Malawi, participants were children 0–15 years enrolled with cancer at Queen Elizabeth Hospital in Blantyre. BL diagnosis was based on local cytology or histology and compatible clinical investigations; children with non-lymphoid solid cancers were used as controls. Participants with HIV or Kaposi sarcoma were excluded to protect their privacy. Demographic and risk factor information was obtained via a structured interviewer-administered questionnaire24. Venous blood was collected in EDTA tubes at enrollment. In EMBLEM, P. falciparum infection in blood was determined using microscopy of thick-film blood smears, or antigen capture rapid diagnostic tests (RDTs)24. In Malawi, P. falciparum infection was based on P. falciparum-specific polymerase chain reaction (PCR)43. Because we have previously shown distinct ancestry patterns in East versus West African populations26, we conducted comparative studies of HLA patterns in 968 Ghanaian adult men who were previously studied for ancestry to compare with our East African populations.

Ethical approvals

We confirm that all relevant ethical regulations were followed. Specifically, approval for the EMBLEM study was granted by ethics committees at the Uganda Virus Research Institute (GC/127), Uganda National Council for Science and Technology (H816), Tanzania National Institute for Medical Research (NIMR/HQ/R.8c/Vol. IX/1023), Moi University/Moi Teaching and Referral Hospital (000536), and National Cancer Institute (10-C-N133). Ethical approval for the original Infections and Childhood Cancer Study was granted by ethics committees at the Malawi College of Medicine (P.03/04/277R) and Oxford University. Because the original Malawi Infections and Childhood Cancer study did not request participants to consent to genetic testing, special ethical approval to conduct genetic testing was obtained from the Malawi National Health Sciences Research Committee (Approval #2405). Written informed consent was obtained from participants’ guardians in EMBLEM and Malawi studies, and written informed assent was obtained from children aged ≥7 years old in the EMBLEM study. The ethical approval for genetic studies favored research that would enable possible or suitable interventions in the communities where participants were enrolled or increasing knowledge considered relevant to the local communities, such as research on HLA variation and malaria resistance to investigate the association of BL with malaria.

DNA extraction and genotyping procedures

DNA extraction and genotyping were performed at the Cancer Genomics Research Laboratory, NCI, USA. Genotypes of approximately 4.6 million variants were determined using the Infinium Omni5Exome-4 v1.3 BeadChip (Illumina, San Diego, CA, USA) following standard Illumina data analysis workflow26. Genotype data were phased and then imputed using the African Genome Resources panel on the Sanger imputation server (https://imputation.sanger.ac.uk/). This panel was preferred because it currently has the largest number of genomes from African participants, including from the Great Lakes region in Africa. Only variants imputed with high confidence (genotype imputation quality info ≥0.9) with a minor allele frequency (MAF) ≥ 0.01 were retained in the analysis dataset. All variants analyzed passed Hardy–Weinberg equilibrium (HWE) test in the controls using a threshold of P < 1.0 × 10−6. Ancestry was evaluated using principal components (PC) analysis of 787,731 genotyped uncorrelated (r2 < 0.3) SNPs outside the HLA region. The top three country-specific PCs were used to control for ancestry. Additionally, because of a high degree of relatedness in Ugandan controls26, we constructed a genetic relationship matrix (GRM) for all individuals in the dataset, based on the probability that two individuals i and j share 0, 1, or 2 alleles identical by descent (IBD)44, and used it to control for relatedness.

We performed HLA imputation using Minimac445 on the Michigan Imputation Server (MIS) with a multi-ancestry reference panel which contains data from 21,546 unrelated individuals25. We applied the default quality control procedures of the MIS pipeline. We imputed eight classical HLA genes HLA‐A, B, C and HLA‐DRB1, DQA1, DQB1, DPA1, and DPB1, and amino acids and intergenic variants using genotypes extracted from chr6:25Mb-35Mb (hg19/GRCh37; n = 49,159 SNPs) for the 4645 participants. In total, we inferred 1439 classical I and II alleles at 1- and 2-field resolution, 164 INDELs, and 4,511 amino acid polymorphisms (Supplementary Table 1). After quality control (Supplementary Fig. 10), we retained 187 classical HLA alleles, 126 INDELs, 2652 amino acid polymorphisms, and 43,572 GWAS SNPs (11,702 genotyped and 31,870 imputed) with imputation R2 > 0.8 and MAF > 1% for association analysis. The HLA allele frequencies of East Africa, when compared with Ghana population34, are presented in Supplementary Fig. 11.

Statistics and reproducibility

We assessed the accuracy of our HLA imputation by examining the concordance and correlation between the imputed classical HLA alleles and SBT-HLA genotypes of 600 participants in the same dataset with paired data22. We assessed BL and HLA allele, SNP, and single amino acid residue associations by fitting generalized linear mixed models (GLMM) with the logit link in country-specific datasets with covariates (sex, age, P. falciparum infection status, ancestry as fixed effects, and GRM as a random effect) as the main model. P. falciparum is the strongest known co-factor for the geographic patterns of BL24,33,46,47. Thus, it was considered a-priori, as a risk factor and a confounder and included in our main models. When interpreted as a confounder, any associations with HLA alleles that remain significant indicate that additional contribution of P. falciparum was not responsible for the observed associations with HLA alleles.

The main results are presented as summary odds ratios (ORs) and significance assessed by 95% confidence intervals (95% CIs), computed using a standard meta-analytical approach of country-specific ORs in PLINK v1.9. HLA variants were coded as 0, 1, or 2 corresponding to the number of variant alleles carried by an individual. Formal analysis of statistical heterogeneity of associations with BL across the countries was assessed using Cochran’s Q test. P values were calculated using Wald tests and corrected for multiple comparisons using Bonferroni adjustment (P < 2.7 × 10−4 for 187 classical HLA alleles and P < 1.1 × 10−6 for another 46,350 variants in the HLA region). While a more conservative P value of GWAS significance (e.g., 5 × 10−8) may be considered more rigorous, it was not preferred for the current analysis because we wanted to test hypotheses based on prior epidemiological and biological data about HLA associations with malaria10 or EBV27, or with BL23. We performed conditional single-variant association tests, where in addition to the confounders included in the models, we performed conditional adjustment for HLA alleles or SNPs identified in the main models to be significantly associated with BL. Associations of BL with 12 variants with a priori

associations with BL, severe malaria or EBV were assessed without correcting for multiple comparisons (Supplementary Table 2). Alleles that are polymorphic at the 2-field resolution are reported as such, while those with limited polymorphism are reported at 1-field resolution.

We examined the robustness of our results by conducting several sensitivity analyses. We excluded P. falciparum infection from the main model in sensitivity model 1. In the subsequent sensitivity models 2-5, we controlled for urban vs. rural residence of participants (model 2), wet vs. dry season of enrollment (model 3), carriage of HBB-rs334(A) and ABO-rs8176703(T) that are also protective against BL48 (model 4), and excluded 14 individuals with outlier principal components (model 5).

Finally, we defined HLA allele groups as common, when the allele frequency [AF] >5%, otherwise as not common when the allele frequency was 1%-5%. Alleles with <1% of the participants were not included in this analysis. This variable was used to investigate global HLA associations of BL with common versus not common HLA alleles, which provides a way to assess potential detrimental associations with rare HLA alleles, based on the assumption that natural selection influences allelic distribution and favorable alleles will be differentially represented in cases and controls49. Second, we investigated associations between BL and HLA zygosity at six HLA loci (HLA‐A, B, C and HLA‐DRB1, DQB1, and DPB1). This approach was selected to reduce potential selection bias, as it could arise from excluding individuals carrying rare HLA alleles, particularly given their higher likelihood of being heterozygotes. HLA zygosity in an individual was defined as the number of alleles at the 2-field resolution that were homozygous across the six HLA loci. We hypothesized that low HLA zygosity (i.e., with more loci being heterozygous, is correlated with a broader, and therefore, more effective immune response repertoires, while high zygosity would be correlated with a decreased repertoire of immune responses50. If so, then we reasoned that low zygosity might be associated with decreased BL risk, perhaps, mediated by more effective immune responses against relevant infections, e.g., EBV or malaria.

Because our understanding of associations between classical HLA alleles and asymptomatic P. falciparum infection, particularly in older children, is less clear, we assessed associations between HLA variants with P. falciparum infection among controls. We used similar models as those used for analysis of BL, except with infection status as the outcome in the controls. To gain insights about whether HLA alleles that were significantly associated with BL have effects on P. falciparum infection, we examined the relationship between those alleles with log-transformed P. falciparum density in the controls. These analyses were performed separately for EMBLEM and Malawi participants because in Malawi parasite density was measured using PCR, which is more sensitive, while in EMBLEM parasite density was measured using blood smears, which is less sensitive.

Statistical analyses were performed in R (version 4.1.0) utilizing computational resources of the NIH HPC Biowulf cluster.

Haplotype inference, visualization, and comparisons across countries

We constructed HLA haplotypes separately for each country from phased genotypes using Haplo.stats package v.1.7.7. HLA linkage disequilibrium (LD) and/or haplotype configurations across multiallelic genetic markers were visualized using Disentangler plots (http://kumasakanatsuhiko.jp/projects/disentangler). We tested associations between BL with each haplotype, controlling for covariates used in the main model. We also descriptively compare HLA patterns in the four countries in East Africa versus in Ghana (N = 968), which we previously showed to be ancestrally different from the populations in East Africa26.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Supplementary information

Peer Review File (3.6MB, pdf)
42003_2023_5701_MOESM3_ESM.pdf (80.5KB, pdf)

Description of Additional Supplementary Files

Supplementary Data 1 (4.2MB, zip)
Reporting Summary (3.7MB, pdf)

Acknowledgements

We thank the study population and communities for their participation. We thank Ms. Janet Lawler-Heavner at Westat Inc, (Rockville, MD, USA) and Mr. Erisa Sunday at the African Field Epidemiology Network (Kampala, Uganda) for managing the study. We are grateful to the leadership of the collaborating countries and institutions for hosting local field offices and laboratories and supporting the fieldwork. We thank Ms. Laurie Buck, Dr. Carol Giffen, and Mr. Greg Rydzak at Information Management Services Inc. (Calverton, MD, USA) for coordinating data, and preparing data analysis files. The study was funded by the Intramural Research Program of the Division of Cancer Epidemiology and Genetics, National Cancer Institute (NCI) (Contracts HHSN261201100063C and HHSN261201100007I), and the Intramural Research Program, National Institute of Allergy and Infectious Diseases (SJR), National Human Genome Research Institute (MHG), National Institutes of Health (NIH), Department of Health and Human Services. The authors acknowledge the research contributions of the Cancer Genomics Research Laboratory for their expertise, execution, and support of this research in the areas of project planning, wet laboratory processing of specimens, and bioinformatics analysis of generated data from the NCI, NIH, under NCI Contract No. 75N910D00024. Y.L. is supported by a Kennedy Trust KTRR Senior Research Fellowship (KENN202109). The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the US Government. The content of this manuscript is the sole responsibility of the authors. The funders had no role in study design, data collection, analysis or interpretation, or writing and submitting the report for publication.

Author contributions

Z.L. and S.M.M. wrote the manuscript. S.M.M., M.D.O., S.J.R., P.K. conceived the idea, designed the EMBLEM study and supervised fieldwork. S.M.M., O.O.O., S.R. and Y.L. conceived the idea of conducting HLA imputation. S.M.M., S.K., R.B. conceived the idea of SBT HLA typing. M.D.O., P.K., G.C., W.T.J., E.B., S.J.R., C.N.T., P.A.W., R.T.K., W.N.W., N.M., E.K., T.K., I.O., I.D.L., H.N., H.D., L.W.A., K.B., J.J.G., G.N.L., S.K., N.M., E.M.M., R.N., A.W.H., J.E.M., A.A.A. and C.M. conducted, monitored or advised on field work. B.H., N.C., J.L., M.M., W.L., M.Y., A.H., R.B., M.V., S.K. conducted genetic testing and processed genetic data. M.H.G., Z.L., Y.L., S.R. analyzed genetic data and developed primary and sensitivity models; L.P.O. and M.C. advised on HLA genetic analyses; Y.L. and S.R. performed HLA imputation; S.J.C. provided expertise on genetic analysis, and interpretation of data. All authors read and approved the final manuscript.

Peer review

Peer review information

Communications Biology thanks Jin-Xin Bei and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Primary Handling Editors: Chiea Chuen Khor and George Inglis. A peer review file is available.

Data availability

The genome-wide and phenotypic data from the Ghana Prostate Healthy Study and the EMBLEM and Malawi studies are publicly available. The previously published data from the Ghana Prostate Healthy Study are available under restricted access in the Genomic Data Commons; access to these studies is available through dbGaP: Ghana Prostate Study (phs000838.v1.p1). The genetic data of participants in the EMBLEM and Malawi studies are under controlled access (requires IRB approval and are limited to not-for-profit research). Readers can access the data by applying via dbGaP under accession link phs001705.v1.p1. The source data behind the graphs in Fig. 2 are available in Supplementary Data 1.

Code availability

All code was written in R, or Bash. Scripts, required software packages, and instructions are available on the Github repository (https://github.com/smbulaiteye/HLA-EMBLEM.git)51.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Zhiwei Liu, Yang Luo.

These authors jointly supervised this work: Soumya Raychaudhuri, Sam M. Mbulaiteye.

Supplementary information

The online version contains supplementary material available at 10.1038/s42003-023-05701-5.

References

  • 1.López C, et al. Burkitt lymphoma. Nat. Rev. Dis. Primers. 2022;8:78. doi: 10.1038/s41572-022-00404-3. [DOI] [PubMed] [Google Scholar]
  • 2.Burkitt D. A sarcoma involving the jaws in African children. Br. J. Surg. 1958;46:218–223. doi: 10.1002/bjs.18004619704. [DOI] [PubMed] [Google Scholar]
  • 3.Aka P, et al. Incidence and trends in Burkitt lymphoma in northern Tanzania from 2000 to 2009. Pediatr Blood Cancer. 2012;59:1234–1238. doi: 10.1002/pbc.24194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Ogwang MD, Bhatia K, Biggar RJ, Mbulaiteye SM. Incidence and geographic distribution of endemic Burkitt lymphoma in northern Uganda revisited. Int. J. Cancer. 2008;123:2658–2663. doi: 10.1002/ijc.23800. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Epstein MA, Achong BG, Barr YM. Virus particles in cultured lymphoblasts from Burkitt’s lymphoma. Lancet. 1964;1:702–703. doi: 10.1016/S0140-6736(64)91524-7. [DOI] [PubMed] [Google Scholar]
  • 6.Burkitt DP. Etiology of Burkitt’s lymphoma-an alternative hypothesis to a vectored virus. J. Natl Cancer Inst. 1969;42:19–28. [PubMed] [Google Scholar]
  • 7.Dendrou CA, Petersen J, Rossjohn J, Fugger L. HLA variation and disease. Nat. Rev. Immunol. 2018;18:325–339. doi: 10.1038/nri.2017.143. [DOI] [PubMed] [Google Scholar]
  • 8.Bodmer JG, Bodmer WF, Ziegler J, Magrath IT. HL-A and Burkitt’s tumour-a study in Uganda. Tissue Antigens. 1975;5:59–62. doi: 10.1111/j.1399-0039.1975.tb00528.x. [DOI] [PubMed] [Google Scholar]
  • 9.Hill AV, et al. Common west African HLA antigens are associated with protection from severe malaria. Nature. 1991;352:595–600. doi: 10.1038/352595a0. [DOI] [PubMed] [Google Scholar]
  • 10.Hill AV, et al. Molecular analysis of the association of HLA-B53 and resistance to severe malaria. Nature. 1992;360:434–439. doi: 10.1038/360434a0. [DOI] [PubMed] [Google Scholar]
  • 11.Malaria Genomic Epidemiology N. Insights into malaria susceptibility using genome-wide data on 17,000 individuals from Africa, Asia and Oceania. Nat. Commun. 2019;10:5732. doi: 10.1038/s41467-019-13480-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Murray PG, Constandinou CM, Crocker J, Young LS, Ambinder RF. Analysis of major histocompatibility complex class I, TAP expression, and LMP2 epitope sequence in Epstein-Barr virus-positive Hodgkin’s disease. Blood. 1998;92:2477–2483. doi: 10.1182/blood.V92.7.2477. [DOI] [PubMed] [Google Scholar]
  • 13.Li Q, et al. HLA-DQ beta1 alleles associated with Epstein-Barr virus (EBV) infectivity and EBV gp42 binding to cells. JCI Insight. 2017;2:e85687. doi: 10.1172/jci.insight.85687. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kidd P. Th1/Th2 balance: the hypothesis, its limitations, and implications for health and disease. Altern. Med. Rev. 2003;8:223–246. [PubMed] [Google Scholar]
  • 15.Maurmann S, et al. Molecular parameters for precise diagnosis of asymptomatic Epstein-Barr virus reactivation in healthy carriers. J. Clin. Microbiol. 2003;41:5419–5428. doi: 10.1128/JCM.41.12.5419-5428.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Besson C, et al. Positive correlation between Epstein-Barr virus viral load and anti-viral capsid immunoglobulin G titers determined for Hodgkin’s lymphoma patients and their relatives. J. Clin. Microbiol. 2006;44:47–50. doi: 10.1128/JCM.44.1.47-50.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Bodmer JG, et al. Combined analysis of three studies of patients with Burkitt’s lymphoma. Tissue Antigens. 1975;5:63–68. doi: 10.1111/j.1399-0039.1975.tb00529.x. [DOI] [PubMed] [Google Scholar]
  • 18.Jones EH, Biggar RJ, Nkrumah FK, Lawler SD. Study of the HLA system in Burkitt’s lymphoma. Hum. Immunol. 1980;1:207–210. doi: 10.1016/0198-8859(80)90015-4. [DOI] [PubMed] [Google Scholar]
  • 19.Hall PJ, et al. HLA antigens in East African Black patients with Burkitt’s lymphoma or nasopharyngeal carcinoma and in controls: a pilot study. Hum. Immunol. 1982;5:91–105. doi: 10.1016/0198-8859(82)90055-6. [DOI] [PubMed] [Google Scholar]
  • 20.Jones EH, Biggar RJ, Nkrumah FK, Lawler SD. HLA-DR7 association with African Burkitt’s lymphoma. Hum. Immunol. 1985;13:211–217. doi: 10.1016/0198-8859(85)90013-8. [DOI] [PubMed] [Google Scholar]
  • 21.Woszczek G, Borowiec M, Mis M, Gorska M, Kowalski ML. Comparison of serological and molecular (PCR-SSP) techniques of HLA-DR typing in clinical laboratory routine. Ann. Transplant. 1997;2:39–42. [PubMed] [Google Scholar]
  • 22.Voorter CE, Palusci F, Tilanus MG. Sequence-based typing of HLA: an improved group-specific full-length gene sequencing approach. Methods Mol. Biol. 2014;1109:101–114. doi: 10.1007/978-1-4614-9437-9_7. [DOI] [PubMed] [Google Scholar]
  • 23.Kirimunda S, et al. Variation in the Human Leukocyte Antigen system and risk for endemic Burkitt lymphoma in northern Uganda. Br. J. Haematol. 2020;189:489–499. doi: 10.1111/bjh.16398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Peprah S, et al. Risk factors for Burkitt lymphoma in East African children and minors: a case-control study in malaria-endemic regions in Uganda, Tanzania and Kenya. Int. J. Cancer. 2020;146:953–969. doi: 10.1002/ijc.32390. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Luo Y, et al. A high-resolution HLA reference panel capturing global population diversity enables multi-ancestry fine-mapping in HIV host response. Nat. Genet. 2021;53:1504–1516. doi: 10.1038/s41588-021-00935-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Gouveia MH, et al. Genetic signatures of gene flow and malaria-driven natural selection in sub-Saharan populations of the “endemic Burkitt Lymphoma belt”. PLoS Genet. 2019;15:e1008027. doi: 10.1371/journal.pgen.1008027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Sallah N, et al. Distinct genetic architectures and environmental factors associate with host response to the gamma2-herpesvirus infections. Nat. Commun. 2020;11:3849. doi: 10.1038/s41467-020-17696-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Zhou Y, et al. Genetic loci for Epstein-Barr virus nuclear antigen-1 are associated with risk of multiple sclerosis. Mult. Scler. 2016;22:1655–1664. doi: 10.1177/1352458515626598. [DOI] [PubMed] [Google Scholar]
  • 29.Rubicz R, et al. A genome-wide integrative genomic study localizes genetic factors influencing antibodies against Epstein-Barr virus nuclear antigen 1 (EBNA-1) PLoS Genet. 2013;9:e1003147. doi: 10.1371/journal.pgen.1003147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Akel O, Zhao LP, Geraghty DE, Lind A. High-resolution HLA class II sequencing of Swedish multiple sclerosis patients. Int. J. Immunogenet. 2022;49:333–339. doi: 10.1111/iji.12594. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Sanna S, et al. Variants within the immunoregulatory CBLB gene are associated with multiple sclerosis. Nat. Genet. 2010;42:495–497. doi: 10.1038/ng.584. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Bjornevik K, et al. Longitudinal analysis reveals high prevalence of Epstein-Barr virus associated with multiple sclerosis. Science. 2022;375:296–301. doi: 10.1126/science.abj8222. [DOI] [PubMed] [Google Scholar]
  • 33.Broen K, et al. Burkitt lymphoma risk shows geographic and temporal associations with Plasmodium falciparum infections in Uganda, Tanzania, and Kenya. Proc. Natl Acad. Sci. USA. 2023;120:e2211055120. doi: 10.1073/pnas.2211055120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Derkach A, et al. Associations between IgG reactivity to Plasmodium falciparum erythrocyte membrane protein 1 (PfEMP1) antigens and Burkitt lymphoma in Ghana and Uganda case-control studies. EBioMedicine. 2019;39:358–368. doi: 10.1016/j.ebiom.2018.12.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Digitale, J. C. et al. HLA alleles B(*)53:01 and C(*)06:02 are associated with higher risk of P. falciparum Parasitemia in a cohort in Uganda. Front. Immunol.12, 650028 (2021). [DOI] [PMC free article] [PubMed]
  • 36.Lopez-Mejias R, et al. Association of HLA-B*41:02 with Henoch-Schonlein Purpura (IgA Vasculitis) in Spanish individuals irrespective of the HLA-DRB1 status. Arthritis. Res. Ther. 2015;17:102. doi: 10.1186/s13075-015-0622-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Mbulaiteye SM, et al. Medical history, lifestyle, family history, and occupational risk factors for sporadic Burkitt lymphoma/leukemia: the Interlymph Non-Hodgkin Lymphoma Subtypes Project. J. Natl Cancer Inst. Monogr. 2014;2014:106–114. doi: 10.1093/jncimonographs/lgu003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Torgbor C, et al. A multifactorial role for P. falciparum malaria in endemic Burkitt’s lymphoma pathogenesis. PLoS Pathogens. 2014;10:e1004170. doi: 10.1371/journal.ppat.1004170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Baptista MJ, et al. Analysis of the IGHV region in Burkitt’s lymphomas supports a germinal center origin and a role for superantigens in lymphomagenesis. Leuk. Res. 2014;38:509–515. doi: 10.1016/j.leukres.2014.01.001. [DOI] [PubMed] [Google Scholar]
  • 40.Szabo SJ, Sullivan BM, Peng SL, Glimcher LH. Molecular mechanisms regulating Th1 immune responses. Annu. Rev. Immunol. 2003;21:713–758. doi: 10.1146/annurev.immunol.21.120601.140942. [DOI] [PubMed] [Google Scholar]
  • 41.Coghill AE, Hildesheim A. Epstein-Barr virus antibodies and the risk of associated malignancies: review of the literature. Am. J. Epidemiol. 2014;180:687–695. doi: 10.1093/aje/kwu176. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Coghill AE, et al. The association between the comprehensive Epstein-Barr virus serologic profile and endemic Burkitt lymphoma. Cancer Epidemiol. Biomarkers Prev. 2020;29:57–62. doi: 10.1158/1055-9965.EPI-19-0551. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Arisue, N. et al. Assessment of mixed plasmodium falciparum sera5 infection in endemic Burkitt lymphoma: a case-control study in Malawi. Cancers1310.3390/cancers13071692 (2021). [DOI] [PMC free article] [PubMed]
  • 44.Leal TP, et al. NAToRA, a relatedness-pruning method to minimize the loss of dataset size in genetic and omics analyses. Comput. Struct. Biotechnol. J. 2022;20:1821–1828. doi: 10.1016/j.csbj.2022.04.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Das S, et al. Next-generation genotype imputation service and methods. Nat. Genet. 2016;48:1284–1287. doi: 10.1038/ng.3656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Peprah S, et al. Mean platelet counts are relatively decreased with malaria but relatively increased with endemic Burkitt Lymphoma in Uganda, Tanzania, and Kenya. Br. J. Haematol. 2020;190:772–782. doi: 10.1111/bjh.16700. [DOI] [PubMed] [Google Scholar]
  • 47.Redmond LS, et al. Endemic Burkitt lymphoma: a complication of asymptomatic malaria in sub-Saharan Africa based on published literature and primary data from Uganda, Tanzania, and Kenya. Malar. J. 2020;19:239. doi: 10.1186/s12936-020-03312-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Hong, H. G. et al. HBB rs334, ABO Rs8176703 and plasmodium falciparum positivity at enrollment are independently associated with lower risk for endemic Burkitt lymphoma in Uganda, Tanzania, Kenya, and Malawi. Blood140, 9314–9316 (2022).
  • 49.Trachtenberg E, et al. Advantage of rare HLA supertype in HIV disease progression. Nat. Med. 2003;9:928–935. doi: 10.1038/nm893. [DOI] [PubMed] [Google Scholar]
  • 50.Thursz MR, Thomas HC, Greenwood BM, Hill AV. Heterozygote advantage for HLA class-II type in hepatitis B virus infection. Nat. Genet. 1997;17:11–12. doi: 10.1038/ng0997-11. [DOI] [PubMed] [Google Scholar]
  • 51.Human Leukocyte Antigen-DQA1*04:01 and rs2040406 Variants Are Associated with Elevated Risk of Childhood Burkitt Lymphoma - Supplementary Information. https://zenodo.org/records/10255672. [DOI] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Peer Review File (3.6MB, pdf)
42003_2023_5701_MOESM3_ESM.pdf (80.5KB, pdf)

Description of Additional Supplementary Files

Supplementary Data 1 (4.2MB, zip)
Reporting Summary (3.7MB, pdf)

Data Availability Statement

The genome-wide and phenotypic data from the Ghana Prostate Healthy Study and the EMBLEM and Malawi studies are publicly available. The previously published data from the Ghana Prostate Healthy Study are available under restricted access in the Genomic Data Commons; access to these studies is available through dbGaP: Ghana Prostate Study (phs000838.v1.p1). The genetic data of participants in the EMBLEM and Malawi studies are under controlled access (requires IRB approval and are limited to not-for-profit research). Readers can access the data by applying via dbGaP under accession link phs001705.v1.p1. The source data behind the graphs in Fig. 2 are available in Supplementary Data 1.

All code was written in R, or Bash. Scripts, required software packages, and instructions are available on the Github repository (https://github.com/smbulaiteye/HLA-EMBLEM.git)51.


Articles from Communications Biology are provided here courtesy of Nature Publishing Group

RESOURCES