Abstract
Epstein-Barr virus (EBV) is linked to several cancers, including endemic Burkitt lymphoma (eBL), but causal variants are unknown. We recently reported novel sequence variants in the LMP-1 gene and promoter in EBV genomes sequenced from 13 of 14 BL biopsies. Alignments of the novel sequence variants for 114 published EBV genomes, including 27 from BL cases, revealed four LMP-1 variant patterns, designated A to D. Pattern A variant was found in 48% of BL EBV genomes. Here, we used PCR-Sanger sequencing to evaluate 50 additional BL biopsies from Ghana, Brazil, and Argentina, and peripheral blood samples from 113 eBL cases and 115 controls in Uganda. Pattern A was found in 60.9% of 64 BL biopsies evaluated. Compared to PCR-negative subjects in Uganda, detection of Pattern A in peripheral blood was associated with eBL case status (odds ratio [OR] 31.7, 95% confidence interval: 6.8–149), controlling for relevant confounders. Variant Pattern A and Pattern D were associated with eBL case status, but with lower ORs (9.7 and 13.6, respectively). Our results support the hypothesis that EBV LMP-1 Pattern A may be associated with eBL, but it is not the sole associated variant. Further research is needed to replicate and elucidate our findings.
Keywords: Epstein-Barr virus, Burkitt lymphoma, Africa, Latin America, next-generation sequencing, LMP-1
1. Introduction
Epstein-Barr virus (EBV) is a gammaherpesvirus that is causally associated with multiple cancers [1,2,3,4,5,6], including endemic Burkitt lymphoma (eBL), an aggressive pediatric cancer that occurs with high incidence in African countries with high malaria endemicity [7]. Although EBV infection establishes asymptomatic lifelong infection in >90% of the world’s population [8], EBV-related cancers develop in a small proportion of EBV infected people; this proportion resulted in an estimated burden of about 120,000 cancers in 2012 or about 5.5% of all infection-associated cancers [9]. The small proportion of EBV-infected people who progress to cancer could be due to the effects of high-risk variants with greater propensity for carcinogenicity [10]. The strong regional distributions of eBL in sub-Saharan Africa, and nasopharyngeal carcinoma (NPC) in the Far East China and parts of North African, stand in marked contrast to the ubiquitous distribution of EBV [11]. These regional distributions may be interpreted as epidemiological clues about the regional distribution of high-risk EBV tumor-specific variants that play a role in the pathogenesis of eBL and NPC [10,12].
The nature of high-risk EBV variants is currently unknown. EBV sequence diversity in EBNA-3A, EBNA-3B, and EBNA-3C genes [13] enables EBV to be grouped into two major types (1 and 2, also known as A or B), but it is unrelated to carcinogenicity [10]. EBV type 2 appears to be over-represented in samples collected in Africa [14], but recent studies conducted in the United Kingdom have reported type 2 EBV in healthy people native to that region [15]. Studies of EBV diversity in the EBV nuclear antigen 1 (EBNA-1), which is expressed in EBV-positive eBL, or transcription factor Zta (BZLF-1), and latent membrane protein 1 (LMP-1), which have transforming properties [16], have not successfully demonstrated an association of diversity in those genes with risk of EBV related cancers [10]. The development of whole-genome high throughput sequencing (HTS) methods and their application to EBV provides new opportunities for the discovery of EBV genetic variants that may be associated with the risk of EBV-related cancers [17].
We recently reported novel sequence variants in the LMP-1 gene and promoter in EBV genomes sequenced by HTS from 13 of 14 BL biopsies from Ghana, Brazil, and Argentina [18]. Alignment of the novel sequences for 114 published EBV genomes, including 27 from BL cases, revealed four LMP-1 patterns, which we designated as Patterns A to D (as shown in Table 1) [18]. Pattern A was characterized by 11 SNVs located in the LMP-1 promoter and the flanking LMP-2B noncoding region, and 12 SNVs located in exons 1 and 3 of the LMP-1 gene that were associated with nine amino acid changes (Table 1). Pattern B was characterized by 28 SNVs located in the promoter, and one SNV located in exon 1 and associated with an amino acid change. Pattern C was characterized by two SNVs located in the promoter, and one SNV located in exon 1 at position 13 (that is also found in Pattern A) that is associated with an amino acid change. Pattern D was the unaltered sequence in the EBV reference genome NC_007605 (wild-type EBV). Interestingly, Pattern A was more enriched in the EBV genomes sequenced from BL tumors than EBV genomes sequenced from non-BL samples (51.8% versus 7%) in the 114 EBV genomes, leading us to hypothesize that Pattern A is associated with BL [18]. Confirmation of this hypothesis could present promising new strategies for EBV vaccines or targets for high-risk EBV variants [19].
Table 1.
Characteristic | LMP-2B Exon1 | LMP-1 Promoter | LMP-1 Exon1 | LMP-1 Exon3 | |||||||||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
nt position | 1. −426 | 2. −412 | 3. −410 | 4. −367 | −372 | −356 | 5. −354 | −329 | −328 | −315 | −286 | −284 | −240 | −238 | −234 | −233 | 6. −227 | −207 | −199 | 7. −184 | 8. −172 | −163 | −136 | −70 | 9. −50 | −44 | −43 | −37 | −34 | 10. −39 | −17 | 11. −12 | −6 | −4 | −3 | +11 | +12 | 12. +18 | +26 | +32 | +41 | 13. E2D | 14.15. H3L | R17L | 16. V43I | 17. S57A | 18.19. I63V | 20. I124V | 21. I152L | 22. H213N | 23. E214Q |
nt change | G>A | T>G | C>A | G>A | G>A | C>A | A>G | C>T | G>A | C>T | A>G | G>T | G>A | A>G | G>T | G>A | G>A | C>T | C>T | A>T | T>C | C>T | A>T | A>C | T>A | G>T | A>T | C>G | C>G | A>C | A>T | G>A | C>T | C>A | C>G | A>G | C>T | T>G | A>C | G>A | G>C | GAA>GAC | CAC>CTG | CGA>CTA | GTT>ATT | TCC>GCC | ATA>GTG | ATC>GTC | ATC>CTC | CAT>AAT | GAA>CAA |
Pattern A
(23 variants) |
A | G | A | A | G | C | G | C | G | C | A | G | G | A | G | C | A | C | C | T | C | C | A | A | A | C | A | C | C | C | A | A | C | C | C | A | C | G | A | G | C | C | TG | G | A | G | GG | G | C | A | C |
Pattern B
(29 variants) |
G | T | C | G | A | A | A | T | A | T | G | T | A | G | T | A | G | T | T | A | T | T | T | C | T | C | T | G | G | A | T | G | T | A | G | G | T | T | C | A | G | A | AC | T | G | T | AA | A | A | C | G |
Pattern C
(3 variants) |
G | T | C | G | G | C | A | C | G | C | A | G | G | A | G | G | G | C | C | A | T | C | A | A | T | T | A | C | C | A | A | G | C | C | C | A | C | T | A | G | C | C | AC | G | G | T | AA | A | A | C | G |
Pattern D
(wild-type) |
G | T | C | G | G | C | A | C | G | C | A | G | G | A | G | G | G | C | C | A | T | C | A | A | T | G | A | C | C | A | A | G | C | C | C | A | C | T | A | G | G | A | AC | G | G | T | AA | A | A | C | G |
Abbreviations: nt nucleotide; Pattern A: 23 single nucleotide variants, including 11 variants in the promoter and 9 amino acid changes in the coding sequence of LMP-1. The single nucleotide variations used to define Pattern A are numbered for ease of reference to guide the definition of Pattern A in other datasets. Pattern B: 29 single nucleotide variants, including 28 variants in the promoter and 1 amino acid change in the coding sequence of LMP-1. Pattern C: 3 single nucleotide variants, including 2 variants in promoter and sharing amino acid change E2D with Pattern A. Pattern D: The LMP-1 wild type sequence of the EBV reference genome NC_007605. The colors indicate the specific nucleotides that vary (dark green for Pattern A, lime green for Pattern B, and blue for Pattern C); only nt changes in Pattern A are numbered.
This study was conducted: (a) to use EBV genome-wide data [15,18] to classify LMP-1 genetic diversity according the Lei LMP-1 Patterns and according to six other published LMP-1 variant classifications (as shown in Figure 1 and Table 2), and determine whether these classifications are independent; (b) to increase the sample size of BL biopsies studied to investigate the enrichment of Pattern A LMP-1 variant in BL biopsies by evaluating 50 additional samples from Ghana, Brazil, and Argentina using PCR-Sanger sequencing; and (c) to assess the distribution of the Pattern A LMP-1 variant in peripheral blood samples of 113 eBL cases and 115 controls enrolled from the same geographical area in the Ugandan component of the Epidemiology of Burkitt’s Lymphoma in East-African Children and Minors (EMBLEM) study [20,21].
Table 2.
EBV Genome | Lei et al. [18] Patterns | Source | Origin | EBV Type | Hu et al. [23] Loss of Xho I Site | Miller et al. [24] 30 bp Deletion | 33 bp Repeats (# of RP Unit) | † Edwards et al. [25] C-Terminal Variants | # Sandvej et al. [27] Group |
---|---|---|---|---|---|---|---|---|---|
KP968263**(H058015C) | A | BL biopsy | Ghana | Type 1 | No | Yes | 4 | China 1 | ND |
KP968262**(H018436D) | A | BL biopsy | Ghana | Type 1 | No | Yes | 4 | China 1 | ND |
KP968264**(H002213) | A | BL biopsy | Ghana | Type 1 | No | Yes | 4 | China 1 | ND |
KP968261**(HU11393) | A | BL biopsy | Ghana | Type 1 | No | Yes | 4 | ND | ND |
KR063342**(H03753A) | A | BL biopsy | Ghana | Type 1 | No | No | 4 | China 1 | ND |
KR063345**(FNR) | A | BL biopsy | Brazil | Type 1 | No | Yes | 4 | China 1 | ND |
KR063344**(RPF) | A | BL biopsy | Brazil | Type 1 | No | Yes | 4 | China 1 | ND |
KP968258**(MP) | A | BL biopsy | Brazil | Type 1 | No | Yes | 4 | China 1 | ND |
KP968257**(CCH) | A | BL biopsy | Brazil | Type 1 | No | Yes | 4 | China 1 | ND |
KR063343**(CV-ARG) | A | BL biopsy | Argentina | Type 1 | No | Yes | 4 | China 1 | ND |
KT001102 (VA) | A | BL biopsy | Argentina | N/A | Gap | Gap | 4 | China 1 | ND |
KT001103 (SG) | A | BL biopsy | Argentina | Type 1 | No | Gap | 4 | China 1 | ND |
KP968259**(SCL) | A | BL biopsy | Brazil | Type 1 | No | No | 4 | B95–8 | ND |
LN827554(LCL-AFB1) | A | LCL | Unknown | Type 2 | No | Yes | 5 | China 1 | ND |
LN824206(pLCL-TRL1-post) | A | sLCL. PTLD (post) | USA | Type 1 | No | Yes | 5 | China 1 | ND |
LN824207(pLCL-TRL1-pre) | A | sLCL. PTLD (pre) | USA | Type 1 | No | Yes | 5 | China 1 | ND |
LN827591(sLCL-2.15) | A | sLCL | Kenya | Type 2 | No | Yes | 5 | China 1 | ND |
LN827594(sLCL-IS1.07) | A | sLCL. PTLD | Australia | Type 1 | No | Yes | 5 | China 1 | ND |
LN827559(pLCL-TRL595) | A | sLCL. PTLD | USA | Type 1 | No | Yes | 5 | China 1 | ND |
LN827563(sLCL-1.18) | A | sLCL | Kenya | Type 1 | No | Yes | 5 | China 1 | ND |
KF717093**(Raji) | A | BL | Nigeria | Type 1 | No | No | 4 | B95-8 | ND |
KP968260**(VGO) | B | BL biopsy | Brazil | Type 1 | Yes | No | 5 | ND | ND |
KC207813**(Akata) | B | BL | Japan | Type 1 | Yes | Yes | 4 | China 1 | ND |
LN824208**(Akata) | B | BL | Japan | Type 1 | Yes | Yes | 4 | China 1 | ND |
KC617875(C666-1) | B | NPC | Asia | Type 1 | Yes | Yes | 4 | China 1 | ND |
KJ411974(C666-1) | B | NPC | Asia | Type 1 | Yes | Yes | 1 | China 1 | ND |
KC617875(C666-1) | B | NPC | Asia | Type 1 | Yes | Yes | 4 | China 1 | ND |
AY961628(GD1) | B | NPC | China | Type 1 | Yes | Yes | 4 | China 1 | ND |
HQ020558(GD2) | B | NPC | China | Type 1 | Yes | Yes | 3 | China 1 | ND |
KF373730(M81) | B | NPC | Asia | Type 1 | Yes | Yes | 4 | China 1 | ND |
LN824142(Saliva) | B | Healthy saliva | UK | Type 1 | Yes | Yes | 2 | China 1 | ND |
LN827562(sLCL-1.19) | B | sLCL | Kenya | Type 1 | Yes | No | 4 | ND | ND |
LN827561(YCCEL1) | B | GC cell line | South Korea | Type 1 | Yes | Yes | 5 | China 1 | ND |
LN824209(HKN14) | B | NPC | Hong Kong | Type 1 | Yes | Yes | 5 | China 1 | ND |
LN827547(HKN15) | B | NPC | Hong Kong | Type 1 | Yes | Yes | 4 | China 1 | ND |
LN824224(HKN19) | B | NPC | Hong Kong | Type 1 | Yes | Yes | 4 | China 1 | ND |
LN827549(D3201.2) | B | NPC | China | Type 1 | Yes | Yes | 6 | China 1 | ND |
JQ009376(HKNPC1) | B | NPC | Hong Kong | Type 1 | Yes | Yes | 5 | China 1 | ND |
KF992564(HKNPC2) | B | NPC | Hong Kong | Type 1 | Yes | Yes | 1 | China 1 | ND |
KF992565(HKNPC3) | B | NPC | Hong Kong | Type 1 | Yes | Yes | 4 | China 1 | ND |
KF992566(HKNPC4) | B | NPC | Hong Kong | Type 1 | Gap | Gap | Gap | Gap | ND |
KF992567(HKNPC5) | B | NPC | Hong Kong | Type 1 | Yes | Yes | 1 | China 1 | ND |
KF992568(HKNPC6) | B | NPC | Hong Kong | Type 1 | Yes | Yes | 1 | China 1 | ND |
KF992569(HKNPC7) | B | NPC | Hong Kong | Type 1 | Yes | Yes | 1 | China 1 | ND |
KF992570(HKNPC8) | B | NPC | Hong Kong | Type 1 | Yes | Yes | 2 | China 1 | ND |
KF992571(HKNPC9) | B | NPC | Hong Kong | Type 1 | Yes | Yes | 1 | China 1 | ND |
LN827523(L591) | B | HL cell line | Germany | Type 1 | Yes | No | 4 | NC | ND |
LN827799(sLCL-IM1.16) | B | sLCL. IM. | Australia | Type 1 | Yes | No | 4 | NC | ND |
LN827578(sLCL-IS1.13) | B | sLCL. PTLD | Australia | Type 1 | Yes | No | 4 | NC | ND |
LN827586(sLCL-IS1.15) | B | sLCL. PTLD | Australia | Type 1 | Yes | No | 4 | NC | ND |
LN827800**(Jijoye) | C | BL | Nigeria | Type 2 | No | No | 4 | B95-8 | ND |
LN827548**(P3HR1_c16) | C | BL | Nigeria | Type 2 | No | No | 4 | B95-8 | ND |
LN827557**(BL36) | C | BL | N. Africa | Type 1 | No | No | 4 | B95-8 | ND |
LN827545**(Daudi) | C | BL | Kenya | Type 1 | No | No | 4 | Med | ND |
LN827551**(Makau) | C | BL | Kenya | Type 1 | No | No | 5 | Med | ND |
LN824205(sLCL-1.12) | C | sLCL | Kenya | Type 1 | No | No | 5 | Med | ND |
LN824203**(Mak1) | C | BL | Kenya | Type 1 | No | No | 5 | Med | ND |
LN827544**(Wewak1) | D | BL | PNG | Type 2 | No | No | 4 | ND | ND |
LN827556**(Cheptages) | D | BL | Kenya | Type 2 | No | No | 4 | B95-8 | ND |
LN827526**(BL37) | D | BL | Africa | Type 1 | No | Yes | 6 | Med | ND |
KC207814**(Mutu) | D | BL | Kenya | Type 1 | No | No | 4 | Med | ND |
NC_009334**(AG876) | D | BL | Ghana | Type 2 | No | Yes | 4 | China 1 | ND |
NC_007605 (WT-EBV) | D | B98-8 | USA | type 1 | No | No | 4 | B95-8 | ND |
AJ507799(WT-EBV) | D | B98-8 | USA | Type 1 | No | No | 4 | B95-8 | ND |
V01555(WT-EBV) | D | B95-8 | USA | Type 1 | No | No | 4 | B95-8 | ND |
KC440851(K4123-Mi) | D | healthy donor | USA | Type 1 | No | No | 4 | Med | ND |
KC440852(K4413-Mi) | D | healthy donor | USA | Type 1 | No | No | 4 | B95-8 | A |
NA19114 | D | healthy donor | Yoruba | Type 1 | No | No | 4 | B95-8 | ND |
NA19315 | D | healthy donor | Kenya | Type 1 | No | No | 4 | B95-8 | ND |
NA19384 | D | healthy donor | Kenya | Type 1 | Gap | No | 4 | China 1 | ND |
LN827739(LCL_B958) | D | LCL, B95-8 | USA | Type 1 | No | No | 4 | B95-8 | ND |
LN827597(sLCL-IS1.04) | D | sLCL. PTLD | Australia | Type 1 | No | No | 5 | B95-8 | ND |
LN827596(sLCL-IM1.02) | D | sLCL. IM. | Australia | Type 1 | No | Yes | 6 | China 1 | ND |
LN827595(sLCL-IS1.03) | D | sLCL. PTLD | Australia | Type 1 | No | No | 5 | Med | ND |
LN827593(sLCL-IS1.12) | D | sLCL. PTLD | Australia | Type 1 | No | No | 4 | B95-8 | A |
LN827592(sLCL-IS1.10) | D | sLCL. PTLD | Australia | Type 1 | No | No | 5 | Med | ND |
LN827590(sLCL-IM1.05) | D | sLCL. IM. | Australia | Type 1 | No | Yes | 6 | China 1 | ND |
LN827589(sLCL-IS2.01) | D | sLCL. PTLD | Australia | Type 2 | No | Yes | 4 | China 1 | ND |
LN827588(sLCL-IS1.19) | D | sLCL. PTLD | Australia | Type 1 | No | No | 5 | B95-8 | A |
LN827587(sLCL-2.21) | D | sLCL | Kenya | Type 2 | No | Yes | 4 | China 1 | ND |
LN827585(sLCL-1.04) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | ND |
LN827584(sLCL-IS1.06) | D | sLCL. PTLD | Australia | Type 1 | No | No | 4 | Med | ND |
LN827583(sLCL-IM1.17) | D | sLCL. IM. | Australia | Type 1 | No | No | 4 | B95-8 | ND |
LN827582(sLCL-BL1.03) | D | sLCL | Kenya | Type 1 | No | No | 4 | B95-8 | ND |
LN827581(sLCL-1.05) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | ND |
LN827580(sLCL-2.16) | D | sLCL | Kenya | Type 2 | No | No | 5 | B95-8 | ND |
LN827579(sLCL-1.13) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | ND |
LN827577(sLCL-1.17) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | A |
LN827576(sLCL-IS1.20) | D | sLCL. PTLD | Australia | Type 1 | No | Yes | 7 | China 1 | ND |
LN827575(sLCL-IS1.14) | D | sLCL. PTLD | Australia | Type 1 | No | No | 4 | B95-8 | A |
LN827574(sLCL-1.09) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | A |
LN827573(sLCL-1.10) | D | sLCL | Kenya | Type 1 | No | No | 3 | Med | ND |
LN827572(sLCL-IS1.18) | D | sLCL. PTLD | Australia | Type 1 | No | No | 4 | B95-8 | ND |
LN827571(sLCL-BL1.20) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | ND |
LN827570(sLCL-IS1.01) | D | sLCL. PTLD | Australia | Type 1 | No | Yes | 4 | China 1 | ND |
LN827569(sLCL-IS1.11) | D | sLCL. PTLD | Australia | Type 1 | No | No | 5 | Med | ND |
LN827568(sLCL-1.24) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | ND |
LN827567(sLCL-IM1.09) | D | sLCL. IM. | Australia | Type 1 | No | No | 4 | B95-8 | ND |
LN827566(sLCL-1.06) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | ND |
LN827565(sLCL-1.07) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | ND |
LN827564(HL04) | D | HL | UK | Type 1 | No | Yes | 3 | China 1 | ND |
LN827560(sLCL-2.14) | D | sLCL | Kenya | Type 2 | No | Yes | 4 | China 1 | ND |
LN827558(sLCL-1.02) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | ND |
LN827555(X50-7) | D | LCL | USA | Type 1 | No | No | 4 | B95-8 | ND |
LN827553(sLCL-IS1.08) | D | sLCL. PTLD | Australia | Type 1 | No | No | 5 | Med | ND |
LN827552(sLCL-1.08) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | ND |
LN827550(sLCL-1.11) | D | sLCL | Kenya | Type 1 | No | No | 4 | Med | ND |
LN827546(HL02) | D | HL | UK | Type 1 | No | Yes | 3 | China 1 | ND |
LN827527(M-ABA) | D | LCL, NPC virus | N. Africa | Type 1 | No | No | 4 | B95-8 | A |
LN827524(HL11) | D | HL | UK | Type 1 | No | No | 5 | Med | ND |
LN827522(HL09) | D | HL | UK | Type 1 | No | Yes | 6 | China 1 | ND |
LN824226(HL01) | D | HL | UK | Type 1 | No | Yes | 3 | China 1 | ND |
LN824225(HL08) | D | HL | UK | Type 1 | No | Yes | 5 | China 1 | ND |
LN824204(HL05) | D | HL | UK | Type 1 | No | Yes | 5 | China 1 | ND |
Notes: List sorted by LMP-1 Patterns according to Lei et al. [18] and then by phenotype (BL, LCL, NPC, and healthy donor). No: means the variant is absent, Yes: means the variant is present, Gap: means the sequence was ambiguous or gapped, therefore, the variant cannot be called. † Some of the C-terminal variants used for classification in the study of Edwards et al. (1999) were found in the 114 EBV genomes [25]. The samples were assigned following the classifications (B95-8, China 1, North Carolina (NC), Mediterranean (Med)) used in the study of Edwards et al. (1999) [25] # Sandvej [27] classification made the LMP-1 variants into 4 groups, Group A: 6 SNV on the coding sequence (CDS), containing neither loss of Xho I site nor 30 bp deletion, Group B: 33 bp repeats different from which of the B95-8 plus 19 SNVs on CDS, Group C: 30 bp deletion plus 44 SNVs on LMP-1 CDS, Group D: loss of Xho I sit plus 66 SNVs in the LMP-1 CDS and 33 SNVs in the promoter. Because the number of the repeat units is not accurate when the sequence is coming from shotgun sequencing or HTS, the estimated number is not good for classification; thus, many samples cannot be classified. These samples are designated not determined (ND) to indicate that the classification is not applicable.
2. Results
2.1. Comparison of EBV LMP-1 Genetic Diversity in 114 Published Genomes Classified by Seven Techniques
Figure 1 shows the regions of genetic variation in the LMP-1 gene and promoter described by Lei et al. [18] and 6 other investigators [22,23,24,25,26,27]. Briefly, genetic variation was reported in the coding region of the gene but not the promoter by five of seven reports Hu et al. [23], Miller et al. [24], Edwards et al. [25], Walling et al. [26], and Kanai et al. [22]). Genetic variation was reported in both the LMP-1 gene and promoter by two studies (Sandvej et al. [27], and Lei et al. [18]). Furthermore, genetic variations were identified in the LMP-1 exon 1 in three reports (Sandvej et al. [27], Lei et al. [18], and Hu et al. [23]) and in the LMP-1 exon 3 for all reports except that of Hu et al. [23]. As shown in Figure 1, the variations reported by Lei et al. [18] are physically independent from variations reported by the six other reports. Genetic variations involving the 30 bp deletion and 33 bp repeats in LMP-1 exon 3 were used to classify LMP-1 genetic diversity by three reports (Miller et al. [24], Wallings et al. [26], and Sandvej et al. [27]).
Table 2 summarizes the LMP-1 genetic classifications based on Lei et al. [18], Hu et al. [23], Miller et al. [24], Walling et al. [26], and Sandvej et al. [27] for the 114 EBV genomes previously studied in Lei et al. [18]. The results are sorted by Lei patterns A to D [18] and disease phenotype BL, NPC, Hodgkin lymphoma (HL), post-transplant lymphoproliferative disease (PTLD), gastric cancer (GC), and healthy samples (lymphoblastoid cell line [LCL], Spontaneous transformed lymphoblastoid cell line [sLCL], and Saliva). We note that 100% of the HTS data could be used to define LMP-1 genetic diversity according to Lei et al. [18], and that it was possible for 99.1% for classification by Miller et al. [24], for 97.4% for Hu et al. [23], and 96.5% in Edwards et al. [25]. However, HTS data could be used to classify only 6.1% of the samples according to Sandvej et al. [27] because Sandvej uses the number of 30 bp and 33 bp deletions to classify LMP-1 genetic variants, but the number of deletions cannot be accurately determined from HTS data.
Figure 2 shows a pairwise classification of LMP-1 genetic diversity according to Edwards et al. [25] and Lei et al. [18] for 110 EBV genomes satisfactorily analyzed for both. The Edward’s system results in seven LMP-1 strains based on the nonsynonymous nucleotide changes in the C-terminus of LMP-1, and four EBV strains in the 110 analyzed EBV genomes: China 1, Mediterranean, North Carolina, and B95-8. The China 2, China 3, and Alaskan strains were not found in this set. As shown in Figure 2, the four Edward’s strains are not correlated with the Lei patterns, as indicated by the observation that different strains defined by one method can be identified in another strain using the other method, and vice versa. For example, China 1 Edward strains were found in Lei Patterns A, B, and D strains; the B95-8 Edward strain was found in Lei Patterns A, C, and D (Figure 2). A possible exception was the North Carolina (NC) Edwards strain, which only co-occurred with Lei Pattern B, which has been seen predominantly in the samples from the Far East. However, whether the North Carolina strain is correlated with Far East ancestry, as seems to be the case for Pattern B, is unclear, because some of the samples which were jointly classified as North Carolina and Pattern B were collected in Australia and Germany, and their ancestry is unknown to us. Because we have previously shown that the Lei Patterns A to D identify unique genetic variants that cluster together in phylogenetic trees based on the entire EBV genome sequence, on the sequence from EBNA1, or LMP-1 genes (Figure 1 in [18]). We conclude that LMP-1 genetic diversity according to Edward et al. [25] is independent of that described by Lei et al. [18], and may be less discriminatory for disease or geographical patterns.
Figure 3A–D shows the distribution of Lei Patterns A–D, Hu variants based on Xho I loss of restriction site, Miller variants based on 30 bp deletion in C-terminus, and Edward’s (defined in Table 2) in 28 BL EBV genomes and 30 lymphoblastoid cell lines and saliva EBV genomes from healthy subjects that were classified using HTS data. Enrichment of Pattern A was observed in BL EBV genomes, but not in EBV genomes from healthy samples (50.0% versus 10.0%, Figure 3A). Pattern D predominated in EBV genomes from healthy samples, and was observed relatively infrequently in BL EBV genomes (80.0% versus 17.6%, Figure 3A). The distribution of loss of the Xho I restriction site was rare both in EBV in BL samples and healthy subjects (10.7% versus 6.7%, Figure 3B). The 30 bp deletion in the C-terminus in exon 3 [24] was enriched in BL EBV genomes, and was relatively infrequent in EBV genomes from healthy subjects (46.4% versus 20.0%, Figure 3C). The frequency of China 1 strains was enriched in BL EBV genomes compared to EBV from healthy samples (50.0% versus 23.3%, Figure 3D), while the opposite pattern was observed for the Mediterranean strain; no differences were observed for B95-8 Edward’s strains.
Given our findings above showing a preponderance of China 1 and 30 bp deletions in BL cases, the BL cases were jointly classified by Lei LMP-1 patterns and Edward’s China 1 strain or Lei patterns by Miller’s 30 bp deletion (Figure 4). We noted that the higher frequency of the 30 bp deletion (Figure 3C) or China 1 (Figure 3D) in BL EBV genomes was mostly explained by correlation with Lei LMP-1 Pattern A (Figure 4A, B). The North Carolina strain was not seen in the analyzed BL EBV genomes. This is not surprising, given our finding that North Carolina Edward’s strains are correlated with Pattern B (Figure 2), which is rarely seen in BL samples. The one BL sample that was classified as Pattern B in the Lei system was indeterminate on Edward’s classification (Table 2).
To confirm the enrichment of LMP-1 Pattern A in BL samples in a separate dataset, we aligned LMP-1 sequences of EBV genomes from 23 BL samples and 59 EBV genomes from healthy individuals published by Correia et al. [15], that did not overlap with samples analyzed in Lei et al. [18]. LMP-1 Pattern A variants were observed in four (17.4%) BL EBV genomes but in none of the 59 EBV genomes from healthy subjects (Table S5). Although this set is also somewhat limited by the lack of BL-free controls from the same cities and towns as the BL cases, the results are in the same direction as those comprising the original analysis [18]. The paucity of samples from healthy subjects, collected from the same cities and towns from which the BL samples were obtained, is a major limitation for any analysis based on the rapidly accumulating EBV HTS data. For example, while 20 BL samples in the data by Correia et al. [15] are from Africa, only four samples from healthy people are from Africa, and not from the same cities of towns at that (Table S6). Future collections are needed to address this disparity in sample inclusion, and more efforts must be made to enroll healthy people from the same cities and towns where the BL samples were collected, to increase the value of the generated data for investigations of associations between EBV genetic diversity and BL.
2.2. Discovery of Novel Pattern A-Like Variations in Samples Studied by PCR-Sanger Sequencing
Sequence analysis of LMP-1 sequences from the 50 additional BL samples revealed single nucleotide variations, suggesting a shift from Pattern D to Pattern A, but not involving all the 23 Pattern A-defining variations (see Table 1). We designated these A-like variations “variant Pattern A”, and grouped them into four sub-patterns (Table 3). One variant A sub-pattern was characterized by 11 or 12 SNVs also observed in Pattern A, including one or two SNVs at positions 11 or 12 (Table 1 for position numbers) in the LMP-1 promoter, and 10 SNVs located at positions 13–23 in exon 1 and 3 of the N-terminus of LMP-1 gene (representative sample BLS000207, Table 3). This variant is designated the “Mid-length A-like Pattern (ML-A)”, because the relative location of SNVs are in the mid-region of Pattern A variations. The second and third variant A sub-patterns were characterized by 4 or 3 nt variations. The 3 nt variant Pattern A was characterized by 3 SNVs at positions 13–15 in exon 1 in the N-terminus of LMP-1 gene (see Table 1), while the 4 nt variant A has an SNV in the LMP-1 promoter at position 10 (representative samples BLS000292 and BLS000020, Table 3). A novel variation S24A was discovered in the samples with 3 or 4 nt variant A sub-patterns; this variation was not previously observed in Lei et al. [18] and is not part of the Lei classification system. The fourth variant A sub-pattern was characterized by 16 SNVs: seven were at positions 6–12 in the LMP-1 promoter, two were at positions 14 and 15 in exon 1, one was at position 23 in exon 3, and five were at positions 1–5 in LMP-2B exon 1 (representative sample BLS001943, Table 3). This sub-pattern was designated as “Partial A variant sub-pattern”, because it has the most SNVs observed in Pattern A. Our finding of novel variant Pattern A sub-patterns suggests that the EBV genetic diversity in the LMP-1 gene is more extensive than is currently known.
Table 3.
Gene Region | LMP-2B Exon1 | LMP-1 Promoter | LMP-1 Exon1 | LMP-1 Exon3 | |||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Primer pair | Lei-F3/Lei-R3 | Lei-F1/Lei-R1 | Lei-F2/Lei-R2 | ||||||||||||||||||||||
Amplicon size | 396 bp | 435 bp | 428 bp | ||||||||||||||||||||||
FT and a.a. affected | AML1 | LBF2 | LBF4 | CREB | E2D | H3L | S24A (new) | V43I | S57A | I63V | I124V | I152L | H213N | E214Q | |||||||||||
Variation | 1. G-426A |
2. T-412G |
3. C-410A |
4. G-376A |
5. A-354G |
6. G-227A |
7. A-184T |
8. T-172C |
9. T-50A |
10. A-39C |
11. G-12A |
12. T+18G |
13. GAA>GAC |
14. CAC>CTG |
15. CAC>CTG |
TCC>GCC | 16. GTT>ATT |
17. TCC>GCC |
18. ATA>GTG |
19. ATA>GTG |
20. ATC>GTC |
21. ATC>CTC |
22. CAT>AAT |
23. GAA>CAA |
|
Pattern | Example | ||||||||||||||||||||||||
A | BLS000073 | A | G | A | A | G | A | T | C | A | C | A | G | C | T | G | T | A | G | G | G | G | C | A | C |
Mid-Length A | BLS000207 | G | T | C | G | A | G | A | T | T | A | A | G | C | T | G | T | A | G | G | G | G | C | A | C |
4nt sub-pattern A | BLS000292 | G | T | C | G | A | G | A | T | T | C | G | T | C | T | G | G | G | T | A | A | A | A | C | G |
3nt sub-pattern A | BLS000020 | G | T | C | G | A | G | A | T | T | A | G | T | C | T | G | G | G | T | A | A | A | A | C | G |
Partial A sub-pattern | BLS001943 | A | G | A | A | G | A | T | C | A | C | A | G | C | T | G | T | G | T | A | A | A | A | C | C |
Abbreviations: nt nucleotide; Notes: The colors indicate the specific nucleotides that vary (light green show nucleotide changes in Mid-length A; blue shows 4 nt sub-pattern A; grey show 3 nt sub-pattern A; light orange shows partial A sub-pattern A).
2.3. Frequency of Pattern A in 64 BL Tumors from Ghana, Argentina, and Brazil
Table 4 shows the results of 71 samples analyzed by PCR and Sanger sequencing from Argentina, Brazil, and Ghana, including 14 that were previously analyzed by HTS and validated by PCR and Sanger sequencing (the genotype of each sample is shown in Tables S2 and S3) [18]. Seven BL biopsies were EBV negative and excluded from frequency analysis. The distribution of Lei LMP-1 Patterns of the BL biopsies was A in 39 (60.9%) and D in 14 (21.9%) BL biopsies. Pattern B and C were observed in 1 (1.6%) biopsy each. Variant Pattern A of the Mid-Length type was observed in 7 (10.9%) BL biopsies. Analysis of variation by country shows Pattern A varying from 55% in Ghana to 80% in Brazil.
Table 4.
Geographical Region | PCR Positive * | PCR Negative * | Pattern A | Mid Length-A | Pattern D | Pattern B | Pattern C | Pattern A/D |
---|---|---|---|---|---|---|---|---|
Argentina | 15 | 3 | 8 (66.7%) | 2 (16.7%) | 2 (16.7%) | 0 (0%) | 0 (0%) | 0 (0%) |
Brazil | 11 | 1 | 8 (80.0%) | 0 (0%) | 1 (10.0%) | 1 (10.0%) | 0 (0%) | 0 (0%) |
Ghana | 45 | 3 | 23 (54.8%) | 5 (11.9%) | 11 (26.2%) | 0 (0%) | 1 (2.4%) | 2 (4.8%) |
Total (N) | 71 | 7 | 39 (60.9%) | 7 (10.9%) | 14 (21.9%) | 1 (1.6%) | 1 (1.6%) | 2 (3.1%) |
* PCR performed using Lei Primers as described in Lei et al. [18]. Only PCR positive samples were subject to Sanger sequencing.
Two (3.1%) BL biopsies, both from Ghana, had genotypes consistent with mixed infection with A and D patterns (HU11020 and HU13281, Table S1). Massive parallel sequencing of the entire LMP-1 gene and promoter of these two biopsies revealed general heterozygosity of 25% in one sample and 55% in the other sample, but the heterogeneity at specific positions varied at individual positions—ranging from 6.5% to 99% (Table S1). Although it is theoretically possible that the apparently mixed infection may be due to ongoing mutational activity in the tumor leading to diversification EBV variants from D to A patterns [28], it is unlikely that such extensive genetic heterozygosity observed across the entire LMP-1 gene and promoter would be due to random mutations. A more likely explanation is that two independent viruses infected a single B cell that progressed to BL. We note that it is also possible that two independent cells with different infections progressed to BL and merged into one clinical tumor by colonizing the same tumor microenvironment. If so, examination of somatic mutations would reveal heterozygosity.
Our finding that LMP-1 Pattern A variants are enriched in EBV genomes sequenced from BL samples, and that variant Pattern A sub-variants are prevalent in BL, may be a clue that EBV in BL might be under selection pressure, and that the variations may be functionally relevant for tumor survival.
2.4. Frequency of Pattern A in Peripheral Blood of 113 Children with- and 115 Children without-BL in Uganda
Table 5 shows the distribution of EBV LMP-1 patterns in peripheral blood of 113 children with eBL and 115 BL-free children enrolled from the same geographical areas as the cases in the EMBLEM study in Uganda. The samples were tested in two batches, starting with a proof-of-concept set of 13 children with BL and 15 BL-free children previously confirmed to be EBV positive using EBNA1 PCR [29]. These samples were tested by PCR using Lei LMP-1 primers [18], and the products sequenced by the Sanger method. Pattern A was observed in 15.4% (n = 2) of children with BL, but in 0% (based on n = 15) of the BL-free children.
Table 5.
Sample Batch | Group | Number of PBMCs | PCR Positive * | PCR Negative | Pattern A | Mid-Length A sub-Pattern |
Partial A sub-Pattern | 4nt sub-Pattern A | 3nt sub-Pattern A | Pattern D | N.D. # |
---|---|---|---|---|---|---|---|---|---|---|---|
Batch 1 | BL cases | 13 | 13 (100%) | 0 (0%) | 2 (15.4%) | 0 (0%) | 0 (0%) | 1 (7.7%) | 1 (7.7%) | 9 (69.2%) | 0 (0%) |
Controls | 15 | 15 (100%) | 0 (0%) | 0 (0%) | 4 (26.7%) | 0 (0%) | 4 (26.7%) | 0 (0%) | 7 (46.7%) | 0 (0%) | |
Batch 2 | BL cases | 100 | 86 (86%) | 14 (14%) | 10 (10.0%) | 7 (7.0%) | 3 (3.5%) | 8 (9.3%) | 3 (3.5%) | 55 (55.0%) | 0 (0%) |
Controls | 100 | 29 (86%) | 71 (71%) | 3 (3.0%) | 4(4.0%) | 0 (0%) | 0 (0%) | 0 (0%) | 19 (19.0%) | 3 (3.0%) | |
Batch 1 and 2 | BL cases | 113 | 99 (87.6%) | 14 (12.4%) | 12 (10.6%) | 7 (6.2%) | 3 (2.6%) | 9 (8.0%) | 4 (3.5%) | 64 (56.4%) | 0 (0%) |
Controls | 115 | 41 (35.6%) | 71 (64.4%) | 3 (2.6%) | 8 (7.0%) | 0 (0%) | 4 (3.5%) | 0 (0%) | 26 (22.6%) | 3 (2.6%) |
Abbreviations: PBMCs peripheral blood mononuclear cells. Percent are calculated considering all the samples, which are shown in the column headed “Number of PBMCs”. For association analyses (Figure 5), Mid-Length A, Partial A, 4nt- and 3nt variants are combined into one group called “Variant Pattern A”. The association of LMP-1 variants are compared to PCR negative as the reference group, similar to the practice in the HPV field (see Munoz N et al. [33]). * The PCR positivity of both batches was tested using Lei-1 primers (see methods). # N.D.: Undetermined samples were PCR-positive, but the product was insufficient for Sanger sequencing.
Following the successful proof-of-concept experiment, the sample size was expanded to include peripheral blood from an additional set of 100 children with eBL and 100 BL-free children from the same geographical area. The combined results from the two sets are shown in Table 5. In this analysis, children who were PCR negative using Lei primers (see Methods Section 4.3) were used as the reference group, because negativity was interpreted as indicative of controlled EBV infection or absence of infection with the specific genotype. Pattern A variants were detected more frequently in peripheral blood of eBL cases than controls (10.6% versus 2.6%; Table 5). Pattern A variants were associated with a 20.3-fold (95% confidence interval [CI] 5.1–81.3) increase in the likelihood of eBL compared to PCR-negative subjects (Figure 5). We observed variant Pattern A sub-pattern variants in the peripheral blood of 20.3% eBL cases, compared to 10.4% of the controls in Uganda (Table 5). Pattern A sub-pattern variants were associated 9.7-fold (95% CI 3.9–24.0) odds for eBL risk compared to PCR-negative subjects. Pattern D was detected more frequently in eBL cases than in controls (56.6% versus 22.6%), and was associated with 12.5-times (95% CI 6.0–26.0) higher odds for eBL risk. Pattern B and Pattern C were not observed in these Ugandan samples.
This interpretation is reasonable because EBV infection in African children occurs during infancy [30], and EBV load is highest during the period of primary infection, and decreases significantly in chronically infected children [29,31]. Continued positivity and expression of virus at high levels is therefore an indication of poorly controlled viral infection and greater likelihood of EBV-mediated carcinogenesis [32]. This reasoning is similar to that commonly used for other virus-mediated cancers, notably cervical cancer, for which persistent infection with high-risk variants is considered the risk factor, and PCR negativity for high-risk types is used as the reference category [33].
Because detection of EBV may be related to shedding due to reactivation following infection with malaria [34], the associations with LMP-1 patterns were adjusted for malaria status (positive or negative), a history of malaria treatment as an inpatient or outpatient, rainfall season, and residence in a rural or urban village. The association between Pattern A with BL risk became stronger (OR 31.7), albeit with substantial uncertainty, whereas the ratio likelihood of associations with variant Pattern A sub-patterns or Pattern D were minimally altered, which suggests that those variants are not confounded by malaria-related variables. We note that the CIs for ORs for Pattern A, variant Pattern A, and Pattern D overlap, but this is not unexpected, and has been observed for associations between different HPV types and cervical cancer risk [33].
3. Discussion
3.1. LMP-1 Classification Using the Lei et al. Technique Applicable to HTS Data and Has Parsimony of Variants
We showed that LMP-1 genetic variants according to Lei et al. [18], Miller et al. [24], Hu et al. [23], and Edwards et al. [25] can be determined from HTS EBV data for the majority of cases. However, LMP-1 genetic diversity according to patterns reported by Sandvej et al. [27] can be determined from HTS data in less than 10% of cases. We observed that these different systems for classifying LMP-1 genetic diversity are independent of each other. This independence between classifications was illustrated by detailed comparisons of LMP-1 genetic diversity by Lei et al. [18] and by Edwards et al. [25], which showed that each Lei Pattern included 2–3 different Edwards strains, and vice versa. Thus, providing information using one system provides no information about the other system. The only exception was the Edward’s North Carolina strain [25], which was jointly classified only with Lei Pattern B [18]. Because our analysis suggests that Pattern B is predominant in people from the Far East, it is possible that the North Carolina strain also tracks with EBV from the Far East, although the finding that three Pattern B-North Carolina EBV samples were collected from individuals in Australia and an individual from Germany (all with unknown ancestry) suggests that this strain circulates elsewhere. The North Carolina strain was recently reported in about 12.2% of healthy donors from Qatar [35], but it was not found in a study of LMP-1 variants in the saliva and PBMCs in children in Argentina, who included people of Asian descent [36], suggesting that the North Carolina strain may not be specific for the EBV found in Far East populations. Taken together, our results provide a justification for conducting a secondary analysis of HTS data to evaluate LMP-1 variant-disease associations using these classifications.
3.2. Pattern A Variations Frequently Detected in BL Tumors or Peripheral Blood of eBL Cases
Our study was conducted to replicate our findings of enrichment of Pattern A in BL biopsies for a larger set of samples. We quadrupled the number of BL biopsies (from 14 to 64) that have been evaluated for Pattern A variations, and confirmed the high frequency (~50%) of Pattern A originally observed in BL biopsies from Ghana, Argentina, and Brazil [18]. In contrast, Pattern A is infrequently observed in healthy subjects, which supports the hypothesis that Pattern A may be a genetic marker of an EBV variant causal for eBL. Our lack of comparative data from BL-free patients from the same cities and towns as the BL patients from whom the biopsies were obtained is a limitation of our results, because the results in the BL cases may reflect geographical patterns of EBV diversity. In particular, EBV genetic patterns in Ghana and Brazil could be similar, because historical population movements between the two countries during the trans-Atlantic slave trade [37] may have led to similar EBV genetic variants circulating in the populations in the two countries. However, those reasons would not explain the similar frequency of the EBV Pattern A in BL cases in Argentina, which did not receive a large influx of Africans as Brazil. Our finding that Pattern A was infrequent in published EBV genomes in healthy people, as previously analyzed in our study [18], or those (n = 110 EBV genomes) published by Correia et al. [15], which we analyzed in the current study, suggests that Pattern A is indeed rare in healthy subjects, and is apparently enriched in BL EBV genomes. Further research is needed to investigate the frequency of LMP-1 Pattern A variants in healthy subjects in countries with a high eBL incidence.
3.3. Novel Pattern A-Like Variations Suggest a Much Broader Undiscovered LMP-1 Genetic Diversity
Our finding of novel variant Pattern A sub-variants, discovered during our replication study, suggests that the genetic diversity of the LMP-1 gene and promoter is not yet fully characterized. These results of sub-variants are not due to unconsciously modifying the definition of variants through post-hoc analysis, as we collected data from more BL samples or the peripheral blood samples in Uganda because we used the original definition of Pattern A [18]. These results from BL biopsies reflect the in vivo status of tumors’ individually sequenced DNA molecules, and do not reflect an ongoing mutational process that promotes variation within Pattern A during cell line propagation. Understanding the mechanisms that promote genomic diversity of EBV in tumors may help to elucidate the mechanism of nucleotide substitutions and their role in carcinogenesis. Our results encourage the study of EBV genetic diversity using HTS technology to accelerate the discovery of LMP-1 genetic diversity that may be associated with EBV-associated tumors. Further research using HTS data will provide a more comprehensive picture of EBV genetic diversity, and facilitate the discovery of variant-phenotype associations.
3.4. Pattern A and D Heterozygosity in Tumors Suggests Multiple EBV Infections in Single Pre-Cancer B Cells
Our finding that EBV in two BL biopsies from two subjects in Ghana had both Pattern A and D EBV strains is intriguing. Because tumors are presumed to arise from a single transformed cell, these data may be interpreted as suggesting that some tumors arise from a B cell infected with two different EBV strains, which has not been reported before [35,38,39]. This finding raises an interesting question about which viral sequence would be relevant for facilitating malignant transformation on one hand, and whether having both infections is a marker of specific individual vulnerability to EBV infection, on the other. We considered that the possibility that heterozygosity was due to contamination by a non-tumor virus was unlikely. The high proportion of heterozygosity of all positions in the LMP-1 gene was consistent with two viral strains being present in the tumor cells rather than isolated cells with a different virus contaminating the tumor. We considered the possibility that the EBV heterozygosity may be due to two independently infected B cells undergoing initiation and progression, and then fusing into one tumor. This hypothesis might be tested by investigating somatic changes in the tumor, which would be expected to reveal heterozygosity.
We note that accumulation of multiple substitutions in the tumor EBV leading to the generation of variant species of LMP-1 in vivo could contribute to EBV heterozygosity in tumors. Accumulation of multiple substitutions in the tumor EBV was reported in nasal T cell lymphomas from Peru [40], based on single strand conformational polymorphism (SSCP) analysis of EBNA-1 fragments from the tumor EBV, which was known to be monoclonal from analysis of EBV terminal repeat sequences. The multiple EBNA-1 variants were confirmed by direct sequencing of independently subcloned EBNA-1 PCR products in nasal T cell lymphomas [40]. If the accumulation of multiple substitutions contributes to EBV heterozygosity in BL, it will provide evidence for hypermutation as a process that generates quasi-species in EBV, with implications for study of tumor-specific variants and EBV vaccine research.
3.5. Implications of Detecting a High Frequency of Pattern A in BL Tumors or Peripheral Blood of eBL Cases
Our study focuses on LMP-1 genetic diversity as a field which is potentially useful for classifying EBV variants associated with elevated risk for BL. Our results from a larger series of BL biopsies confirmed the impression that LMP-1 Pattern A is enriched in EBV sequenced from BL biopsies. Our results from Uganda are the first study to demonstrate an association between LMP-1 Pattern A with eBL in cases compared to BL-free controls enrolled from the geographical area. The results showed a significantly elevated risk of eBL (OR 31.7) associated with detection of LMP-1 Pattern A in peripheral blood of the cases compared with the controls. However, an elevated risk of eBL with the detection of variant Pattern A sub-patterns and Pattern D was observed, albeit less strong than for Pattern A, suggesting that LMP-1 Pattern A is not the sole cause, although it is the strongest.
The observation of variant Pattern A sub-patterns suggests that the genetic diversity of LMP-1 gene and promoter is yet to be fully characterized. A more comprehensive genetic description of this gene using full sequence data will clarify of the distribution of patterns, and facilitate the identification of specific SNVs that may be related to causal variants. Moreover, the detailed studies may provide insights about ongoing mutational processes and the mechanism for such genetic variation in tumor versus normal samples.
Given the exciting associations noted above, our results are grounds for optimism; however, they are based upon the relatively small number of samples that have been studied to date. Thus, there is a need to increase the number of samples from healthy subjects from geographic areas where BL cases have been obtained to facilitate variant-phenotype studies. Secondly, there is a need to develop more sensitive assays to study variants, because the current PCR and Sanger sequencing assay targets large amplicons (435 bp, 428 bp, and 396 bp), and it is negative in up to 70% of peripheral blood samples in the controls. Failure to detect all infections could lead to detection bias in future studies. Because the relationship between LMP-1 patterns with EBV load is unknown, it is difficult to predict the direction of such bias, but it could attenuate findings to the null or bias findings away from the null. Thus, the development of more sensitive assays, coupled with an improvement in the number and geographical coverage of samples, would make it possible to conduct research to understand the relationship between LMP-1 patterns and viral load, to guide interpretation in future studies. Ideally, newer assays should seek to improve the assay detection limit of LMP-1 Patterns A in peripheral blood of healthy subjects, in order to enable precise evaluation of epidemiological associations between detection of LMP-1 Pattern A in body fluids and eBL risk.
We note that the current paradigm that LMP-1 is not expressed in EBV-positive BL tumors [41] has discouraged investigators from looking at genetic diversity in the LMP-1 gene in order to detect EBV high-risk variants. Even so, our results should encourage the study of LMP-1 biology in BL. LMP-1 is a CD 40 homolog [42,43] and may modulate a number of antigen-mediated B cell activation pathways [44]. The LMP-1 C-terminal activator region (CTAR) 1 and 2 modulate the expression of multiple anti-apoptotic proteins, and could influence lymphoma progression by suppressing cell death [45,46,47,48]. Also, LMP-1 may influence lymphomagenesis by modulating the production of cytokines, such as IL-6, IL-8, IL-10 [49], which regulate the expression of several B cell surface markers. However, our results should not be interpreted as proposing that LMP-1 is biologically involved in BL developed, but rather as a marker of the EBV variants associated with elevated risk for BL.
Our findings also highlight the need for cellular and animal studies to provide biological insights about the LMP-1 patterns reported here, and transform them from descriptive patterns to findings with biological significance for viral fitness, replication, transmission, and persistence. However, responding to the priorities of EBV research requires concerted cooperation between multi-disciplinary teams of scientists, in order to design studies that can be both epidemiologically rigorous and biologically informative.
4. Materials and Methods
4.1. Subjects and IRB
The fresh-frozen biopsies from patients with BL obtained mostly from the abdomen of affected children were collected from 71 subjects in South America (15 from Argentina, 11 from Brazil) and 45 from Ghana, and labeled with unique numbers in studies conducted by investigators at the National Cancer Institute between 1965 and 1994 [18]. Peripheral blood samples were collected from 113 children with, and 115 without, BL, all of whom were enrolled in the Epidemiology of Burkitt’s Lymphoma in East-African Children and Minors (EMBLEM) in Uganda between 2010 and 2016 (protocol 10-C-N133) [20]. Participation in the current study was approved by the Office of Human Research Protection at the National Institutes of Health. No identifying information was used in the current study.
4.2. DNA Extraction
The DNA extraction of the samples from South America and Ghana was described previously [50,51,52]. The DNA extraction of peripheral blood mononuclear cells (PBMCs) was processed at the NCI Cancer Genomics Research (CGR) Laboratory, Rockville, MD, USA, using the Qiagen QIAsymphony automated instrument [20].
4.3. PCR Amplification of the Targeted Hypervariable Region of LMP-1
The corresponding positions of the variants formed Pattern A, the PCR primer sets used for target amplification, and variant typing are shown in Figure S1 and Table S4. Three pairs of primers (Supplementary Figure S1) were used to cover the LMP-1 hypervariable region, containing 23 nucleotides (nt) variants. One hundred nanograms of DNA from BL tumor biopsy, and 300 ng DNA from peripheral blood mononuclear cells (PBMCs) were used as templates in a polymerase chain reaction (PCR). Each PCR mixture contained 2 µL 10× PCR buffer, 1.6 µL dNTPs (2.5 μM), 1 µL MgCl2 (10 mM), 0.5 µL primers (10 μM each), Platinum Taq DNA polymerase, and DNA template. Nuclease-free water was added to adjust the final volume. Thermocycle was carried out in Mastercycler Pro S (Eppendorf) (Eppendorf North America, Hauppauge, NY, USA) using initial denature at 94 °C for 5 min, thermocycle at 94 °C for 30 s, 55 °C or 60 °C for 30 s, and 72 °C for 30 s, total 45 cycles, following the final extension at 72 °C for 7 min. PCR products were separated by 2% agarose gel electrophoresis in 1×TAE buffer. The gel was stained with GelGreen (Biotium, Fremont, CA, USA) and the amplicons were visualized under blue light at wave length 460–520 µm (Amersham Imager 600, GE Healthcare, Marlborough, MA, USA).
We note that BL samples that were EBV negative were not subjected to Sanger sequencing. These included seven biopsies (3 from Argentina, one from Brazil and three from Ghana) and 85 peripheral blood samples from Uganda children (14 BL cases and 71 controls).
4.4. Sanger Sequencing and Data Analysis
The PCR generated amplicons matching the desired length were retrieved and eluted from the agarose gel using GeneClean III kit (MP Biomedicals, Santa Ana, CA, USA) and stored in RNase-free water. The purified amplicons from 64 BL biopsy samples and 143 peripheral blood samples were used as templates in the Sanger sequencing reactions, and the Sanger sequencing service was provided by Macrogen Inc. (Macrogen, Rockville, MD, USA). Three of the peripheral blood samples were unable to obtain a readable chromatogram. The exported chromatograms and sequence files were visualized and assembled against the type 1 EBV reference genome (NC_007605) using CLC Genomics Workbench Version 9.0.1. The sequence alignments were performed using BioEdit v7.0 with the ClustalW algorithm, and the sequences were classified manually. The bar charts were generated by the software package GraphPad Prism v7.0.
5. Conclusions
In conclusion, our analysis shows that HTS EBV data can be used to classify samples into classes of LMP-1 genetic diversity, as described by Lei et al. [18], Miller et al. [24], Hu et al. [23], and Edwards et al. [25], but not according Sandvej et al. [27]. Of these, LMP-1 Pattern A, based on Lei et al. [18], was frequently (55–80%) detected in BL biopsies from Ghana, Argentina, and Brazil, but less frequently in EBV genomes from healthy people (~10%). In a case-control analysis of peripheral blood samples from Uganda, compared to PCR-negative subjects, detection of LMP-1 Pattern A in peripheral blood was associated with a significantly elevated risk (OR = 31.7) of eBL. eBL risk was elevated, but lower than for variant Pattern A sub-variants (OR 9.7) or Pattern D (OR 13.6). These results support the hypothesis that Pattern A could be a marker for elevated risk for EBV variants associated with BL risk, although it is not the sole variant. Further research is needed to replicate and elucidate our findings in other settings.
Acknowledgments
We thank Janet Lawler-Heavner at Westat Inc, (Rockville, MD, USA) and Erisa Sunday at the African Field Epidemiology Network (Kampala, Uganda) for managing the EMBLEM study. We are grateful to the EMBLEM staff for conducting fieldwork and participants for their cooperation. We thank Laurie Buck, Carol Giffen, and Greg Rydzak at Information Management Services Inc. (Calverton, MD, USA) for coordinating data, and preparing EMBLEM data analysis files.
Abbreviations
The following abbreviations are used in this manuscript:
EBV | Epstein-Barr virus |
LMP-1 | Latent membrane protein-1 |
EBNA-1 | Epstein–Barr nuclear antigens-1 |
BZLF-1 | Trans-activator protein BZLF1 |
HTS | High throughput sequencing |
SNV | Single nucleotide polymorphism |
PCR | Polymerase chain reaction |
BL | Burkitt lymphoma |
eBL | endemic Burkitt lymphoma |
NPC | Nasopharyngeal carcinoma |
PTLD | Post-transplant lymphoproliferative disease |
HL | Hodgkin Lymphoma |
GC | EBV-positive gastric carcinoma |
IM | Infectious mononucleosis |
EMBLEM | Epidemiology of Burkitt’s Lymphoma in East-African Children and Minors |
sLCL | Spontaneous transformed lymphoblastoid cell line |
CTAR-1, 2, 3 | Carboxyl-terminal activator regions 1, -2, -3 |
IL-6, -8, -10 | Interleukin-6, -8, -10 |
SSCP | Single strand conformational polymorphism |
Supplementary Materials
The following are available online at http://www.mdpi.com/2072-6694/10/6/177/s1, Figure S1: The illustration of the corresponding positions of the 23 single nucleotide variants that form the pattern A on LMP-1 gene and the primers used for the variant typing via target-specific PCR and Sanger sequencing. Table S1: Heterogeneous variations in the LMP-1 gene suggestive of co-infection with Pattern A and Pattern D EBV in 2 BL tumors from Ghana. Table S2: EBV LMP-1 Patterns in 22 EBV-positive BL tumors from South America typed using PCR amplicon sequencing via Sanger method. Table S3: EBV LMP-1 Patterns in 42 EBV positive BL tumors from Ghana typed using PCR amplicon sequencing via Sanger method. Table S4: The primers were used for PCR amplifying the hypervariable region of the LMP-1 and Sanger Sequencing. Table S5: The summary of the Lei et al. pattern classification applied on the samples from BL cases and healthy donors that had been used for LMP-1 phylogenic analysis in the study of Correia et al. (2017). Table S6: The geographic distribution and the Pattern A-D classification of the BL cases and healthy donors that were used in the study of Correia et al. (2017).
Author Contributions
S.-C.L. and S.M.M. conceived and designed the experiments; H. Liu and B.L. performed the experiments; H.-M.L., H. Lei and P.-J.C. analyzed the data; S.T. contributed materials and analysis tools; K.B., M.G., S.E., J.N., F.N., R.J.B., M.D.O., and S.J.R. performed fieldwork; All authors reviewed and approved the final manuscript.
Funding
The fieldwork was funded by the Intramural Research Program of the Division of Cancer Epidemiology and Genetics, National Cancer Institute (NCI) (Contracts HHSN261201100063C and HHSN261201100007I) and, in part, by the Intramural Research Program, National Institute of Allergy and Infectious Diseases (SJR), National Institutes of Health, Department of Health and Human Services, and the laboratory work by an interagency agreement between the National Cancer Institute and FDA (PUR4763663).
Conflicts of Interest
The authors declare no conflict of interest. The funding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, and in the decision to publish the results.
References
- 1.Zur Hausen H., Schulte-Holthausen H., Klein G., Henle W., Henle G., Clifford P., Santesson L. EBV DNA in biopsies of Burkitt tumours and anaplastic carcinomas of the nasopharynx. Nature. 1970;228:1056–1058. doi: 10.1038/2281056a0. [DOI] [PubMed] [Google Scholar]
- 2.De Thé G., Ablashi D.V., Liabeuf A., Mourali N. Nasopharyngeal carcinoma (NPC). VI. Presence of an EBV nuclear antigen in fresh tumour biopsies. Preliminary results. Biomedicine. 1973;19:349–352. [PubMed] [Google Scholar]
- 3.Proceedings of the IARC Working Group on the Evaluation of Carcinogenic Risks to Humans. Epstein-Barr Virus and Kaposi’s Sarcoma Herpesvirus/Human Herpesvirus 8. Lyon, France, 17–24 June 1997. IARC Monogr. Eval. Carcinog. Risks Hum. 1997;70:1–492. [Google Scholar]
- 4.Harabuchi Y., Imai S., Wakashima J., Hirao M., Kataura A., Osato T., Kon S. Nasal T-cell lymphoma causally associated with Epstein-Barr virus: Clinicopathologic, phenotypic, and genotypic studies. Cancer. 1996;77:2137–2149. doi: 10.1002/(SICI)1097-0142(19960515)77:10<2137::AID-CNCR27>3.0.CO;2-V. [DOI] [PubMed] [Google Scholar]
- 5.Pittaluga S., Loke S.L., So K.C., Cheung K.N., Ma L. Clonal Epstein-Barr virus in lymphoepithelioma-like carcinoma of the stomach: Demonstration of viral genome by in situ hybridization and southern blot analysis. Mod. Pathol. 1992;5:661–664. [PubMed] [Google Scholar]
- 6.Shibata D., Hawes D., Stemmermann G.N., Weiss L.M. Epstein-Barr virus-associated gastric adenocarcinoma among Japanese Americans in Hawaii. Cancer Epidemiol. Biomarkers Prev. 1993;2:213–217. [PubMed] [Google Scholar]
- 7.Parkin D.M., Sitas F., Chirenje M., Stein L., Abratt R., Wabinga H. Part I: Cancer in Indigenous Africans—burden, distribution, and trends. Lancet Oncol. 2008;9:683–692. doi: 10.1016/S1470-2045(08)70175-X. [DOI] [PubMed] [Google Scholar]
- 8.Thorley-Lawson D.A. Epstein-Barr virus: Exploiting the immune system. Nat. Rev. Immunol. 2001;1:75–82. doi: 10.1038/35095584. [DOI] [PubMed] [Google Scholar]
- 9.Plummer M., de Martel C., Vignat J., Ferlay J., Bray F., Franceschi S. Global burden of cancers attributable to infections in 2012: A synthetic analysis. Lancet Glob. Health. 2016;4:e609–e616. doi: 10.1016/S2214-109X(16)30143-7. [DOI] [PubMed] [Google Scholar]
- 10.Chang C.M., Yu K.J., Mbulaiteye S.M., Hildesheim A., Bhatia K. The extent of genetic diversity of Epstein-Barr virus and its geographic and disease patterns: A need for reappraisal. Virus Res. 2009;143:209–221. doi: 10.1016/j.virusres.2009.07.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ziegler J.L. Burkitt’s lymphoma. N. Engl. J. Med. 1981;305:735–745. doi: 10.1056/NEJM198109243051305. [DOI] [PubMed] [Google Scholar]
- 12.Bhatia K., Raj A., Guitierrez M.I., Judde J.G., Spangler G., Venkatesh H., Magrath I.T. Variation in the sequence of Epstein Barr virus nuclear antigen 1 in normal peripheral blood lymphocytes and in Burkitt’s lymphomas. Oncogene. 1996;13:177–181. [PubMed] [Google Scholar]
- 13.Sample J., Young L., Martin B., Chatman T., Kieff E., Rickinson A., Kieff E. Epstein-Barr virus types 1 and 2 differ in their EBNA-3A, EBNA-3B, and EBNA-3C genes. J. Virol. 1990;64:4084–4092. doi: 10.1128/jvi.64.9.4084-4092.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Tzellos S., Farrell P.J. Epstein-Barr virus sequence variation-biology and disease. Pathogens. 2012;1:156–174. doi: 10.3390/pathogens1020156. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Correia S., Palser A., Elgueta Karstegl C., Middeldorp J.M., Ramayanti O., Cohen J.I., Hildesheim A., Fellner M.D., Wiels J., White R.E., et al. Natural variation of Epstein-Barr virus genes, proteins, and primary microrna. J. Virol. 2017;91:e00375-17. doi: 10.1128/JVI.00375-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Bornkamm G.W. Epstein-Barr virus and the pathogenesis of Burkitt’s lymphoma: More questions than answers. Int. J. Cancer. 2009;124:1745–1755. doi: 10.1002/ijc.24223. [DOI] [PubMed] [Google Scholar]
- 17.Kwok H., Chiang A.K. From conventional to next generation sequencing of Epstein-Barr virus genomes. Viruses. 2016;8:60. doi: 10.3390/v8030060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Lei H., Li T., Li B., Tsai S., Biggar R.J., Nkrumah F., Neequaye J., Gutierrez M., Epelman S., Mbulaiteye S.M., et al. Epstein-Barr virus from Burkitt lymphoma biopsies from Africa and South America share novel LMP-1 promoter and gene variations. Sci. Rep. 2015;5:16706. doi: 10.1038/srep16706. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Cohen J.I., Fauci A.S., Varmus H., Nabel G.J. Epstein-Barr virus: An important vaccine target for cancer prevention. Sci. Transl. Med. 2011;3:107fs7. doi: 10.1126/scitranslmed.3002878. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Legason I.D., Pfeiffer R.M., Udquim K.I., Bergen A.W., Gouveia M.H., Kirimunda S., Otim I., Karlins E., Kerchan P., Nabalende H., et al. Evaluating the causal link between malaria infection and endemic Burkitt lymphoma in Northern Uganda: A Mendelian Randomization study. EBio Medicine. 2017;25:58–65. doi: 10.1016/j.ebiom.2017.09.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Maziarz M., Kinyera T., Otim I., Kagwa P., Nabalende H., Legason I.D., Ogwang M.D., Kirimunda S., Emmanuel B., Reynolds S.J., et al. Age and geographic patterns of plasmodium falciparum malaria infection in a representative sample of children living in Burkitt lymphoma-endemic areas of Northern Uganda. Malar. J. 2017;16:124. doi: 10.1186/s12936-017-1778-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kanai K., Satoh Y., Yamanaka H., Kawaguchi A., Horie K., Sugata K., Hoshikawa Y., Sata T., Sairenji T. The vIL-10 gene of the Epstein-Barr virus (EBV) is conserved in a stable manner except for a few point mutations in various EBV isolates. Virus Genes. 2007;35:563–569. doi: 10.1007/s11262-007-0153-5. [DOI] [PubMed] [Google Scholar]
- 23.Hu L.F., Zabarovsky E.R., Chen F., Cao S.L., Ernberg I., Klein G., Winberg G. Isolation and sequencing of the Epstein-Barr virus BNLF-1 gene (LMP1) from a Chinese nasopharyngeal carcinoma. Pt 10J. Gen. Virol. 1991;72:2399–2409. doi: 10.1099/0022-1317-72-10-2399. [DOI] [PubMed] [Google Scholar]
- 24.Miller W.E., Edwards R.H., Walling D.M., Raab-Traub N. Sequence variation in the Epstein-Barr virus latent membrane protein 1. Pt 10J. Gen. Virol. 1994;75:2729–2740. doi: 10.1099/0022-1317-75-10-2729. [DOI] [PubMed] [Google Scholar]
- 25.Edwards R.H., Seillier-Moiseiwitsch F., Raab-Traub N. Signature amino acid changes in latent membrane protein 1 distinguish Epstein-Barr virus strains. Virology. 1999;261:79–95. doi: 10.1006/viro.1999.9855. [DOI] [PubMed] [Google Scholar]
- 26.Walling D.M., Shebib N., Weaver S.C., Nichols C.M., Flaitz C.M., Webster-Cyriaque J. The molecular epidemiology and evolution of Epstein-Barr virus: Sequence variation and genetic recombination in the latent membrane protein-1 gene. J. Infect. Dis. 1999;179:763–774. doi: 10.1086/314672. [DOI] [PubMed] [Google Scholar]
- 27.Sandvej K., Andresen B.S., Zhou X.G., Gregersen N., Hamilton-Dutoit S. Analysis of the Epstein-Barr virus (EBV) latent membrane protein 1 (LMP-1) gene and promoter in hodgkin’s disease isolates: Selection against EBV variants with mutations in the LMP-1 promoter ATF-1/CREB-1 binding site. Mol. Pathol. 2000;53:280–288. doi: 10.1136/mp.53.5.280. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Gutierrez M.I., Bhatia K., Cherney B., Capello D., Gaidano G., Magrath I. Intraclonal molecular heterogeneity suggests a hierarchy of pathogenetic events in Burkitt’s lymphoma. Ann. Oncol. 1997;8:987–994. doi: 10.1023/A:1008265304712. [DOI] [PubMed] [Google Scholar]
- 29.Mbulaiteye S.M., Walters M., Engels E.A., Bakaki P.M., Ndugwa C.M., Owor A.M., Goedert J.J., Whitby D., Biggar R.J. High levels of Epstein-Barr virus DNA in saliva and peripheral blood from Ugandan mother-child pairs. J. Infect. Dis. 2006;193:422–426. doi: 10.1086/499277. [DOI] [PubMed] [Google Scholar]
- 30.Biggar R.J., Henle W., Fleisher G., Bocker J., Lennette E.T., Henle G. Primary Epstein-Barr virus infections in African infants. I. Decline of maternal antibodies and time of infection. Int. J. Cancer. 1978;22:239–243. doi: 10.1002/ijc.2910220304. [DOI] [PubMed] [Google Scholar]
- 31.Matrajt L., Gantt S., Mayer B.T., Krantz E.M., Orem J., Wald A., Corey L., Schiffer J.T., Casper C. Virus and host-specific differences in oral human herpesvirus shedding kinetics among Ugandan women and children. Sci. Rep. 2017;7:13105. doi: 10.1038/s41598-017-12994-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Evans A.S., Mueller N.E. Viruses and cancer. Causal associations. Ann. Epidemiol. 1990;1:71–92. doi: 10.1016/1047-2797(90)90020-S. [DOI] [PubMed] [Google Scholar]
- 33.Munoz N., Bosch F.X., de Sanjose S., Herrero R., Castellsague X., Shah K.V., Snijders P.J., Meijer C.J., International Agency for Research on Cancer Multicenter Cervical Cancer Study Group Epidemiologic classification of human papillomavirus types associated with cervical cancer. N. Engl. J. Med. 2003;348:518–527. doi: 10.1056/NEJMoa021641. [DOI] [PubMed] [Google Scholar]
- 34.Lam K.M., Syed N., Whittle H., Crawford D.H. Circulating Epstein-Barr virus-carrying B cells in acute malaria. Lancet. 1991;337:876–878. doi: 10.1016/0140-6736(91)90203-2. [DOI] [PubMed] [Google Scholar]
- 35.Smatti M.K., Yassine H.M., AbuOdeh R., AlMarawani A., Taleb S.A., Althani A.A., Nasrallah G.K. Prevalence and molecular profiling of Epstein Barr virus (EBV) among healthy blood donors from different nationalities in Qatar. PLoS ONE. 2017;12:e0189033. doi: 10.1371/journal.pone.0189033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Gantuz M., Lorenzetti M.A., Chabay P.A., Preciado M.V. A novel recombinant variant of latent membrane protein 1 from Epstein Barr virus in Argentina denotes phylogeographical association. PLoS ONE. 2017;12:e0174221. doi: 10.1371/journal.pone.0174221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Kehdy F.S., Gouveia M.H., Machado M., Magalhaes W.C., Horimoto A.R., Horta B.L., Moreira R.G., Leal T.P., Scliar M.O., Soares-Souza G.B., et al. Origin and dynamics of admixture in Brazilians and its effect on the pattern of deleterious mutations. Proc. Natl. Acad. Sci. USA. 2015;112:8696–8701. doi: 10.1073/pnas.1504447112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Yao Q.Y., Tierney R.J., Croom-Carter D., Dukers D., Cooper G.M., Ellis C.J., Rowe M., Rickinson A.B. Frequency of multiple Epstein-Barr virus infections in T-cell-immunocompromised individuals. J. Virol. 1996;70:4884–4894. doi: 10.1128/jvi.70.8.4884-4894.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Walling D.M., Brown A.L., Etienne W., Keitel W.A., Ling P.D. Multiple Epstein-Barr virus infections in healthy individuals. J. Virol. 2003;77:6546–6550. doi: 10.1128/JVI.77.11.6546-6550.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Gutierrez M.I., Spangler G., Kingma D., Raffeld M., Guerrero I., Misad O., Jaffe E.S., Magrath I.T., Bhatia K. Epstein-Barr virus in nasal lymphomas contains multiple ongoing mutations in the EBNA-1 gene. Blood. 1998;92:600–606. [PubMed] [Google Scholar]
- 41.Thorley-Lawson D.A., Gross A. Persistence of the Epstein-Barr virus and the origins of associated lymphomas. N. Engl. J. Med. 2004;350:1328–1337. doi: 10.1056/NEJMra032015. [DOI] [PubMed] [Google Scholar]
- 42.Rastelli J., Homig-Holzel C., Seagal J., Muller W., Hermann A.C., Rajewsky K., Zimber-Strobl U. LMP1 signaling can replace CD40 signaling in B cells in vivo and has unique features of inducing class-switch recombination to IgG1. Blood. 2008;111:1448–1455. doi: 10.1182/blood-2007-10-117655. [DOI] [PubMed] [Google Scholar]
- 43.Uchida J., Yasui T., Takaoka-Shichijo Y., Muraoka M., Kulwichit W., Raab-Traub N., Kikutani H. Mimicry of CD40 signals by Epstein-Barr virus LMP1 in B lymphocyte responses. Science. 1999;286:300–303. doi: 10.1126/science.286.5438.300. [DOI] [PubMed] [Google Scholar]
- 44.Kaye K.M., Izumi K.M., Kieff E. Epstein-Barr virus latent membrane protein 1 is essential for B-lymphocyte growth transformation. Proc. Natl. Acad. Sci. USA. 1993;90:9150–9154. doi: 10.1073/pnas.90.19.9150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Izumi K.M., Kaye K.M., Kieff E.D. The Epstein-Barr virus lmp1 amino acid sequence that engages tumor necrosis factor receptor associated factors is critical for primary B lymphocyte growth transformation. Proc. Natl. Acad. Sci. USA. 1997;94:1447–1452. doi: 10.1073/pnas.94.4.1447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Izumi K.M., Kieff E.D. The Epstein-Barr virus oncogene product latent membrane protein 1 engages the tumor necrosis factor receptor-associated death domain protein to mediate B lymphocyte growth transformation and activate NF-kappaB. Proc. Natl. Acad. Sci. USA. 1997;94:12592–12597. doi: 10.1073/pnas.94.23.12592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Tsai S.C., Lin S.J., Lin C.J., Chou Y.C., Lin J.H., Yeh T.H., Chen M.R., Huang L.M., Lu M.Y., Huang Y.C., et al. Autocrine CCL3 and CCL4 induced by the oncoprotein LMP1 promote Epstein-Barr virus-triggered B cell proliferation. J. Virol. 2013;87:9041–9052. doi: 10.1128/JVI.00541-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Pratt Z.L., Zhang J., Sugden B. The latent membrane protein 1 (LMP1) oncogene of Epstein-Barr virus can simultaneously induce and inhibit apoptosis in B cells. J. Virol. 2012;86:4380–4393. doi: 10.1128/JVI.06966-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Middeldorp J.M., Pegtel D.M. Multiple roles of LMP1 in Epstein-Barr virus induced immune escape. Semin. Cancer Biol. 2008;18:388–396. doi: 10.1016/j.semcancer.2008.10.004. [DOI] [PubMed] [Google Scholar]
- 50.Gutierrez M.I., Bhatia K., Barriga F., Diez B., Muriel F.S., de Andreas M.L., Epelman S., Risueno C., Magrath I.T. Molecular epidemiology of Burkitt’s lymphoma from South America: Differences in breakpoint location and Epstein-Barr virus association from tumors in other world regions. Blood. 1992;79:3261–3266. [PubMed] [Google Scholar]
- 51.Nkrumah F.K. Changes in the presentation of Burkitt’s lymphoma in Ghana over a 15-year period (1969–1982) IARC Sci. Publ. 1984:665–674. [PubMed] [Google Scholar]
- 52.Nkrumah F., Henle W., Henle G., Herberman R., Perkins V., Depue R. Burkitt’s lymphoma: Its clinical course in relation to immunologic reactivities to Epstein-Barr virus and tumor-related antigens. J. Natl. Cancer Inst. 1976;57:1051–1056. doi: 10.1093/jnci/57.5.1051. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.