Abstract
The Major Histocompatibility Complex class I (MHC-I) system plays a vital role in immune responses by presenting antigens to T cells. Allele specific technologies, including recombinant MHC-I technologies, have been extensively used in T cell analyses for COVID-19 patients and are currently used in the development of immunotherapies for cancer. However, the immense diversity of MHC-I alleles presents challenges. The genetic diversity serves as the foundation of personalized medicine, yet it also poses a potential risk of exacerbating healthcare disparities based on MHC-I alleles. To assess potential biases, we analysed (pre)clinical publications focusing on COVID-19 studies and T cell receptor (TCR)-based clinical trials. Our findings reveal an underrepresentation of MHC-I alleles associated with Asian, Australian, and African descent. Ensuring diverse representation is vital for advancing personalized medicine and global healthcare equity, transcending genetic diversity. Addressing this disparity is essential to unlock the full potential of T cells for enhancing diagnosis and treatment across all individuals.
Subject terms: Cancer, Immunology, Medical research
Introduction
The MHC-I system is a family of proteins expressed on the surface of cells and is involved in the recognition and presentation of peptides to the immune system. It consists of a polymorphic heavy chain, a constant light chain called Beta-2 Microglobulin (β2M) and an 8–13 amino-acid peptide ligand1,2. The peptide binding groove of MHC-I heavy chain accommodates these peptides, and the properties of the pockets within the groove are important for peptide presentation. MHC-I comprises three major Human Leukocyte Antigen (HLA) families: HLA-A, HLA-B, and HLA-C, each consisting of numerous alleles. Among these, HLA-A and HLA-B exhibit the greatest diversity, while HLA-C shows less variation3. HLA-C is associated with multiple additional receptors, such as Killer Immunoglobulin-like Receptors (KIRs) that can be expressed on T cells, adding complexity to the system and complicating diagnostics of HLA-C restricted T cells4. Consequently, these complexities contribute to the tendency to overlook HLA-C in research studies. The diversity in the MHC-I heavy chain directly influences peptide presentation by altering the properties of the pockets in the peptide binding groove. This genetic polymorphism results in changes in the size, shape, and electrostatic properties of the pockets, which in turn affect the binding affinity and specificity of the MHC-I molecule for different peptides, thereby directly influencing the peptide repertoire. Although peptides may have overlap between similar MHC-I heavy chains, each allele has an unique repertoire of peptides5. Hence, understanding that while the overall structure of MHC-I remains largely consistent, the specific composition of alleles and variations in anchor residues give rise to unique peptide-binding specificities within each population, resulting in exceptionally high sequence diversity. This leads to an extensive array of alleles, totalling over 37,000 variants, some of which are exclusive to particular ancestral populations6–9. This diversity in MHC-I alleles enhances the population's ability to mount effective immune responses against a given pathogen by increasing an individual’s chance of eliciting a suitable immune defence. Thus, MHC-I diversity helps to protect against pandemics. For example, the HLA-B*15:01 allele is more prevalent in Southeast Asian populations compared to European populations, highlighting the geographic variability in HLA alleles8. However, this diversity also represents a challenge in the biomedical domain due to its potential to reinforce existing disparities, potentially leading to unequal healthcare based on an individual’s MHC-I alleles.
The relevance of MHC-I alleles in diagnostics and immunotherapy has surged in recent years, offering potential applications in disease diagnosis and treatment10–16. The groundwork for this technology was laid in 1996 with the introduction of recombinant MHC-I technology, enabling the visualization of antigen-specific cells17. This methodology involves the synthesis of MHC-I molecules via synthetic DNA sequences in a laboratory setting. These recombinant soluble MHC-I monomers, complexed with specific peptides, are then multimerized and fluorescently labelled, commonly referred to as tetramers or multimers. These multimerized peptide MHC-I complexes play a crucial role in immune response monitoring by facilitating the specific binding of antigen-specific T cells, allowing the visualization, and tracking of T cell responses over time. They have applications in diagnostics, aiding in the identification of antigen-specific T cells and differentiating between vaccination and natural infection based on pathogen protein coverage. Recent studies have extensively investigated T cells in COVID-19 patients18–20. Furthermore, recombinant MHC-I technology has significantly impacted cancer immunotherapy. This technology leverages the fundamental link between TCR and peptide-HLA complexes, enabling the precise targeting of cancer cells. Customized recombinant peptide MHC-I complexes help identify specific cancer antigens, paving the way for personalized treatment strategies. Additionally, they play a pivotal role in TCR-based therapies by facilitating the precise targeting of cancer cells by engineered T cells. Clinical trials utilizing HLA-restricted TCRs are currently underway, either by introducing TCRs into patient T cells or employing recombinant TCR fusion proteins fused to anti-CD3, resulting in bispecific T cell engagers10–14. These therapies are designed to target specific immune responses mediated by predetermined HLA alleles. Moreover, recombinant MHC-I technology serves as a peptide-specific platform for inducing T cell proliferation in an antigen-specific manner21,22. In summary, the versatility and effectiveness of recombinant MHC-I technology position it as a cornerstone in the ongoing battle against cancer through immunotherapy.
Given the extensive polymorphism among HLA genes and their connections to population genetics, we set out to investigate whether there are biases in the HLA alleles studied in medical research. Our analysis focused on the utilization of MHC-I alleles in both clinical and preclinical publications, particularly those related to COVID-19 and clinical trials focused on TCR-based immunotherapies. Our findings reveal a notable underrepresentation of alleles found in people from Asian, Australian, and African descent, suggesting a widespread allele bias in medical research and clinical therapeutic development.
Results
This study conducted a comprehensive search for articles published between August 2020 and April 2023, focusing on T-cell research related to Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) and Human Leukocyte Antigen Class-I (HLA-I), representing the human MHC-I, aiming to assess the breadth of HLA alleles studied in medical research. SARS-CoV-2 was chosen as a model infection due to the extensive utilization of MHC-I technology in research on the immune response to the virus and its global impact, affecting all continents and countries. Out of 615 articles identified, 74 were included, focused on allele-specific analyses, in specific MHC-I technology. In the 74 studies we considered, individual epitopes, the specific portion of the peptide sequence that is recognized and bound by the MHC-I molecule, against COVID-19 were determined by using mono-allelic MHC-I multimer qualitative binding (Table S1, Fig. S1). A total of 22 unique MHC-I alleles were used with epitopes of SARS-CoV-2. MHC-I allele frequencies have a widespread variation across human populations, however the geography on the distribution of MHC-I alleles resembles geographical location of populations23. For this reason, we analysed the frequencies of the MHC-I allele usage in literature in relation to the frequency of occurrence in a specific continental or intra-continental group. The following continental and intra-continental groups were included in our analysis: South Asia, North-East Asia, South-East Asia, Sub-Saharan Africa, Oceania, North Africa, Western Asia, South and Central America, North America, and Europe.
The HLA-A*02:01 allele was included in the majority of the studies, i.e. 55 of the 74 studies included for analysis (74.3%), followed by HLA-A*24:02 (N = 31, 41.9%), HLA-A*01:01 (N = 24, 32.4%), HLA-B*07:02 (N = 24, 32.4%), HLA-A*03:01 (N = 21, 28.4%, Fig. 1a). For studied alleles, there was a difference in frequency across geographical populations (Fig. 1b). Indeed, there was a strong positive correlation between the frequency of alleles used across studies and the allele frequency in Europe (r = 0.70, P = 2.7·10−4), North America (r = 0.59, P = 3.9·10–3) and South and Central America (r = 0.62, P = 2.0·10−3) (Fig. 1c). We observed a weak correlation or absent correlation for North Africa (r = 0.38, P = 0.08), Western Asia (r = 0.31, P = 0.16), North-East Asia (r = 0.42, P = 0.05), Australia (r = 0.39, P = 0.07), Sub-Saharan Africa (r = 0.02, P = 0.92), South-East Asia (r = 0.31, P = 0.16), South Asia (r = 0.20, P = 0.38) and Oceania (r = 0.34, P = 0.12). Overall, the HLA-A alleles were the most studied alleles (Fig. 2a) and showed the best coverage for Europe (69.8%), while the lowest coverage for Sub-Sharan Africa (36.1%). For the HLA-B alleles, a similar pattern was observed with a high frequency in Europe but low in Sub-Sharan Africa. The B-alleles particularly showed a low coverage in North America (Fig. 2b), but this was mainly driven by the low frequency in Mexico (5.1%) versus USA-based studies (41.2%, Fig. 2b). HLA-C alleles were not extensively researched, accounting for only 4% of the studies, despite their relative high population coverage across continents (Fig. 2c).
Our analysis was repeated based on an independent dataset obtained from a systematic review of T-cell epitopes defined from the proteome of SARS-CoV-2, describing 1349 MHC-I epitopes20. This validation in a second COVID-19 dataset obtained from this systematic review of T-cell epitopes is essential to ensure the robustness and generalizability of the initial analysis and to minimize the risk of bias. The acquired alleles were not restricted to MHC-I technology, and this systematic review provides a description and explanation of the diverse range of technologies utilized. This analysis gave a similar distribution of investigated alleles was observed, with most studies including HLA-A*02:01 (N = 34, 79.1%), followed by HLA-A*24:02 (N = 16, 37.2%) and HLA-A*01:01 (N = 14, 32.6%). Again, the highest correlations were observed for Europe followed by the Americas (Fig. S3).
Next, we analysed the clinical trials that make use of TCR-based immunotherapy. TCRs can only recognize and bind to a specific peptide presented by a particular MHC-I allele. Therefore, TCR-based immunotherapy treatments are designed to target specific peptides presented by one predetermined MHC allele24,25. For this reason, TCR-based immunotherapies are HLA-restricted therapies. By examining these clinical trials, we can assess for which specific HLA alleles therapies are designed. This analysis can help to guide future research and clinical development efforts towards more personalized and effective treatments for patients with specific genetic backgrounds. Using the clinicaltrials.gov website we found 126 studies in which TCR transfer was clinically used (N = 118) or recombinant TCR fusion proteins were used (N = 8). The latter are all HLA-A2 restricted. The allele coverage of these TCRs showed clear over-representation of HLA-A2 in these clinical trials (Fig. 3). Seven studies mention that personalized TCRs will be developed, however the coverage of alleles was not mentioned and thus it was difficult to determine how large the allele diversity will be in these clinical studies. The focus on HLA-A*2, and in specific A*02:01, means that a large population is excluded in these clinical trials. For example, within the American population, people with an African American or Asian genetic descent will have an almost 50% lower chance to enrol in these TCR-based immunotherapy trials26,27.
Discussion
These results demonstrate that the preclinical and clinical analyses of antigen-specific T cell diagnostics and the clinical development of HLA-I restricted therapies, such as TCR-based immunotherapies, show an underrepresentation of people with an Asian, African, Australian, and Oceanian descent. We provide data for both the COVID-19 outbreak and TCR-based therapies. However, we believe that the underrepresentation of specific HLA alleles is not confined to only these two fields; rather, they serve as examples representing the broader scope of the field. Within the clinical setting there is a strong bias towards the use of HLA-A*2. This raises concerns about the inclusivity and generalizability of findings within the preclinical and clinical analyses of antigen-specific T cell research and diagnostics that rely on MHC-I technologies. Conversely, populations with European and American (North, South, and Central) descent exhibit robust representation in these T cell-focused investigations and clinical trials.
This lack of MHC-I allele diversity within T cell research and clinical setting is multifaceted, rooted in historical, methodological, and systematic factors. Historically, since the HLA-A02:01 allele is present in 50% of the Caucasian population, the initial analyses of T cell responses have been focused on the HLA-A02:01 allele17,28. Early reagents, such as HLA-I restricted T cell clones and later recombinant HLA-A*02:01 were focused on the European population resulting in a skewed research field. Additionally, peptide affinity predictions for specific alleles rely on data obtained through wet lab experiments, such as mass spectrometry based ligandome29. The accuracy of these predictions improves with the availability of more data. Less characterized alleles have a poorer performance for the in-silico peptide predictions30. As a result, researchers gravitate toward well-characterized alleles, reinforcing a feedback loop that perpetuates the imbalance. Increased availability of diverse recombinant MHC-I alleles in combination with high-quality peptide databases needed for high-accuracy in silico predictions would enhance the diversity in biomedical research needed for T cell analyses in a diverse population. Additionally novel technologies allowing HLA-unbiased TCR are also developed and very important in ensuring diverse HLA representation31,32.
The ramification of this underrepresentation involves critical aspects. Firstly, the COVID-19 vaccine and clinical trial landscapes exhibited overrepresentation of white non-Hispanic participants, mirroring trends in cancer immunotherapy research33,34. Given the influence of MHC-I allele diversity on disease outcome of pathogens such as SARS-CoV-2, vaccine responses, and effectiveness, the bias hampers generalizability and obstructs insights into diverse population responses to interventions35–38. In the context of COVID-19, variations in HLA genes influence an individual's susceptibility to the virus, severity of the disease, and response to treatments or vaccines35,39–41. Therefore, investigating the impact of HLA on patients with COVID-19 is essential for both clinical management and public health strategies.
Secondly, while pivotal scientific breakthroughs hold importance, the next step entails integrating human genetic diversity into research paradigms. Within the field of genome editing, genetic data from people with a diverse ancestry is essential to determine the CRISPR off-targets, and thereby assess safety and efficacy42. Originally the human genome project consisted of 70% of only one person with a blended ancestry. The remaining 30% came from 19 individuals of European ancestry43, resulting in a very limited genomic diversity. Although currently the vast majority of genomics studies have been conducted in individuals of European descent, the human genome field is now taking the lead by rapidly increasing the number of reference genomes from individuals with diverse ancestry44,45. This inclusion of genomic diversity is not only essential for genome editing but ensures that the benefits of genomic medicine are accessible to all. Just as the inclusion of diverse genetic ancestry is pivotal in genome editing, genomic medicine's broad applicability hinges on embracing genetic diversity. Incorporating genetic variability needs to become a cornerstone in the progress of immune diagnostics and immunotherapy, mirroring the trajectory of genomic medicine.
The significance of MHC-I alleles in COVID-19 outcomes underscores the necessity of comprehensive representation in COVID-19 T-cell studies, compelling the inclusion of individuals with underrepresented genetic makeup. Similarly, the overrepresentation of HLA-A*2 in TCR-based immunotherapies should intensify the urgency for equitable representation. The over-representation of the HLA-A*2 allele impedes the versatility of TCR application and reveals a skewed MHC-I allele representation in therapeutic contexts. Bridging this gap requires increased awareness and strategic funding. An example of such endeavour is the Cancer Grand Challenge of 2023 (https://cancergrandchallenges.org/challenges), which addresses disparities in cancer research across diverse populations.
Beyond the scientific field, the implications of this underrepresentation extend to inequities in therapy development and healthcare availability, demonstrating the necessity to engage diverse communities in biomedical science. The ethical goal to include MHC-I genetic diversity aligns with the scientific goal, as the biological associations of specific MHC-I alleles underscore the complexity that demands comprehensive understanding. The absence of equal representation poses formidable barriers to advancing T cell-based therapies in the era of personalized medicine.
Material and methods
Article inclusion
Articles published in peer-reviewed journals before April 2023 were included. Articles were identified using the following search term:
(“SARS-CoV-2” OR “COVID-19”) AND “T-cell” AND (“tetramer” OR “multimer”) AND “HLA-A”.
We used specifically Google Scholar given that the use of tetramers is often not described in the abstract. Indeed, the search term above yielded only 6 results in PubMed versus 615 articles on Scholar. Out of the 615 articles, 74 were suitable for inclusion, given that 7 were non-English, 323 did not report on COVID-19 but only mentioned it in the text, 177 articles were different types of articles including reviews and opinions and 34 were on a COVID-19-related topic (Fig. S1, Table S1). Regarding the latter, we focused on allele-specific analyses, and we did not include experiments, in which there was no active discrimination between the 6 different MHC-I alleles expressed in one person. Clinical trials were obtained from the website https://clinicaltrials.gov/. Trials were identified using the following search term: “TCR-T cell” AND “TCR therapy” AND “TCR-CD3 therapy” AND “TCR”. We selected only TCR transfer trials and trials that used a recombinant TCR fusion protein, such as Tebentafusp. Studies were included also when they only reported the antigen, for example A*02.
Allele frequencies in different populations
Allele frequencies in different geographic locations were obtained from the Allele Frequency Net Database (http://www.allelefrequencies.net/, access date July 20238). Country of each study was assigned to regions as defined by the Allele Frequency Net Database (https://www.allelefrequencies.net/datasets.asp).
Replication set
The analysis was repeated in an independent dataset obtained from a systemic review of T-cell epitopes defined from the proteome of SARS-CoV-2, describing 1349 MHC-I epitopes20. We only included epitopes that were predicted for one specific allele and excluded epitopes for potentially two or more different MHC-I alleles. These epitopes were functionality tested using the following assays: ELISA, HTMA, multimer staining, cytotoxicity, AIM, ICS, ELISPOT and, or proliferation. Alleles investigated in each study included in the systematic review were extracted and frequencies of alleles across studies determined.
Statistical analysis
Total coverage of a geographical region was calculated as described previously46. For each study on The Allele Frequency Net Database, we calculated the sum of all identified MHC-I class allele frequencies. When the sum of the allele frequency exceeded one, observed allele frequencies were scaled based on the sum value. When below one, it was assumed that there was an additional unmeasured allele. Next, alleles that were included in articles were summed to get a measure of the coverage of a population by the current studies on COVID-19. Median allele frequencies were calculated for each region, by taking the median allele frequency of an allele across studies. Median allele frequencies for each country are given in Table S2. The median allele frequency per region was plotted against the fraction of studies that studied a specific allele. Correlations between study frequency and allele frequency were determined based on Pearson correlation and a P-value below 0.05 was considered significant. Figures were produced using R4.3.0 in combination with ggplot2 (v3.4.3) and patchwork (v1.1.3). Geographical maps were produced with ggplot2 using the map data function; https://ggplot2.tidyverse.org/reference/map_data.html
Supplementary Information
Acknowledgements
We would like to thank A. van Duijn and N.F.C.C. de Miranda, S.B. Coffelt and L.T. Morton for helpful discussion and critical input.
Author contributions
R.C.S and F.A.S. collected and analysed the data. R.C.S performed the statistical analyses. All authors were involved in the critical input and review of the manuscript. All authors read and approved the manuscript.
Data availability
All data generated or analysed during this study are included in this published article and its supplementary information files.
Code availability
All R code used in the current study is available from GitHub: https://github.com/roderickslieker/HLA_Disparity
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-024-58777-2.
References
- 1.Abelin JG, et al. Mass spectrometry profiling of HLA-associated peptidomes in mono-allelic cells enables more accurate epitope prediction. Immunity. 2017;46:315–326. doi: 10.1016/j.immuni.2017.02.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Bouvier M, Wiley DC. Importance of peptide amino and carboxyl termini to the stability of MHC class I molecules. Science. 1994;265:398–402. doi: 10.1126/science.8023162. [DOI] [PubMed] [Google Scholar]
- 3.Robinson J, et al. The IPD and IMGT/HLA database: Allele variant databases. Nucl. Acids Res. 2015;43:D423–431. doi: 10.1093/nar/gku1161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Parham P. MHC class I molecules and KIRs in human history, health and survival. Nat. Rev. Immunol. 2005;5:201–214. doi: 10.1038/nri1570. [DOI] [PubMed] [Google Scholar]
- 5.Pearson H, et al. MHC class I-associated peptides derive from selective regions of the human genome. J. Clin. Investig. 2016;126:4690–4701. doi: 10.1172/JCI88590. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Gourraud PA, et al. HLA diversity in the 1000 genomes dataset. Plos One. 2014;9:e97282. doi: 10.1371/journal.pone.0097282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Robinson J, et al. IPD-IMGT/HLA database. Nucl. Acids Res. 2020;48:D948–D955. doi: 10.1093/nar/gkz950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Gonzalez-Galarza FF, et al. Allele frequency net database (AFND) 2020 update: Gold-standard data classification, open access genotype data and new query tools. Nucl. Acids Res. 2020;48:D783–D788. doi: 10.1093/nar/gkz1029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Horton R, et al. Gene map of the extended human MHC. Nat. Rev. Genet. 2004;5:889–899. doi: 10.1038/nrg1489. [DOI] [PubMed] [Google Scholar]
- 10.Nathan P, et al. Overall survival benefit with Tebentafusp in metastatic uveal melanoma. N. Engl. J. Med. 2021;385:1196–1206. doi: 10.1056/NEJMoa2103485. [DOI] [PubMed] [Google Scholar]
- 11.Tran E, et al. T-cell transfer therapy targeting mutant KRAS in cancer. N. Engl. J. Med. 2016;375:2255–2262. doi: 10.1056/NEJMoa1609279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Jahn L, et al. TCR-based therapy for multiple myeloma and other B-cell malignancies targeting intracellular transcription factor BOB1. Blood. 2017;129:1284–1295. doi: 10.1182/blood-2016-09-737536. [DOI] [PubMed] [Google Scholar]
- 13.Rapoport AP, et al. NY-ESO-1-specific TCR-engineered T cells mediate sustained antigen-specific antitumor effects in myeloma. Nat. Med. 2015;21:914–921. doi: 10.1038/nm.3910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hong DS, et al. Autologous T cell therapy for MAGE-A4(+) solid cancers in HLA-A*02(+) patients: A phase 1 trial. Nat. Med. 2023;29:104–114. doi: 10.1038/s41591-022-02128-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hadrup SR, et al. Parallel detection of antigen-specific T-cell responses by multidimensional encoding of MHC multimers. Nat. Methods. 2009;6:520–526. doi: 10.1038/nmeth.1345. [DOI] [PubMed] [Google Scholar]
- 16.Gangaev A, et al. Identification and characterization of a SARS-CoV-2 specific CD8(+) T cell response with immunodominant features. Nat. Commun. 2021;12:2593. doi: 10.1038/s41467-021-22811-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Altman JD, et al. Phenotypic analysis of antigen-specific T lymphocytes. Science. 1996;274:94–96. doi: 10.1126/science.274.5284.94. [DOI] [PubMed] [Google Scholar]
- 18.Quadeer AA, Ahmed SF, McKay MR. Landscape of epitopes targeted by T cells in 852 individuals recovered from COVID-19: Meta-analysis, immunoprevalence, and web platform. Cell Rep. Med. 2021;2:100312. doi: 10.1016/j.xcrm.2021.100312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Saini SK, et al. SARS-CoV-2 genome-wide T cell epitope mapping reveals immunodominance and substantial CD8(+) T cell activation in COVID-19 patients. Sci. Immunol. 2021 doi: 10.1126/sciimmunol.abf7550. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Jin X, Liu X, Shen C. A systemic review of T-cell epitopes defined from the proteome of SARS-CoV-2. Virus Res. 2023;324:199024. doi: 10.1016/j.virusres.2022.199024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Tvingsholm SA, et al. TCR-engaging scaffolds selectively expand antigen-specific T-cells with a favorable phenotype for adoptive cell therapy. J. Immunother. Cancer. 2023;11:e006847. doi: 10.1136/jitc-2023-006847. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Weiss L, et al. Direct in vivo activation of T cells with nanosized immunofilaments inhibits tumor growth and metastasis. ACS Nano. 2023;17:12101–12117. doi: 10.1021/acsnano.2c11884. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Arrieta-Bolanos E, Hernandez-Zaragoza DI, Barquera R. An HLA map of the world: A comparison of HLA frequencies in 200 worldwide populations reveals diverse patterns for class I and class II. Front. Genet. 2023;14:866407. doi: 10.3389/fgene.2023.866407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Morgan RA, et al. Cancer regression in patients after transfer of genetically engineered lymphocytes. Science. 2006;314:126–129. doi: 10.1126/science.1129003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Oliveira G, Wu CJ. Dynamics and specificities of T cells in cancer immunotherapy. Nat. Rev. Cancer. 2023;23:295–316. doi: 10.1038/s41568-023-00560-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ellis JM, et al. Frequencies of HLA-A2 alleles in five U.S. population groups. Predominance of A*02011 and identification of HLA-A*0231. Hum. Immunol. 2000;61:334–340. doi: 10.1016/S0198-8859(99)00155-X. [DOI] [PubMed] [Google Scholar]
- 27.Cao K, et al. Analysis of the frequencies of HLA-A, B, and C alleles and haplotypes in the five major ethnic groups of the United States reveals high levels of diversity in these loci and contrasting distribution patterns in these populations. Hum. Immunol. 2001;62:1009–1030. doi: 10.1016/S0198-8859(01)00298-1. [DOI] [PubMed] [Google Scholar]
- 28.Spits H, Breuning M, Ivanyi P, Russo C, de Vries JE. In vitro-isolated human cytotoxic T-lymphocyte clones detect variations in serologically defined HLA antigens. Immunogenetics. 1982;16:503–512. doi: 10.1007/BF00372020. [DOI] [PubMed] [Google Scholar]
- 29.Sarkizova S, et al. A large peptidome dataset improves HLA class I epitope prediction across most of the human population. Nat. Biotechnol. 2020;38:199–209. doi: 10.1038/s41587-019-0322-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Luo Y, et al. A high-resolution HLA reference panel capturing global population diversity enables multi-ancestry fine-mapping in HIV host response. Nat. Genet. 2021;53:1504–1516. doi: 10.1038/s41588-021-00935-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Cattaneo CM, et al. Identification of patient-specific CD4(+) and CD8(+) T cell neoantigens through HLA-unbiased genetic screens. Nat. Biotechnol. 2023;41:783–787. doi: 10.1038/s41587-022-01547-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.O'Brien H, et al. Breaking the performance ceiling for neoantigen immunogenicity prediction. Nat. Cancer. 2023;4:1618–1621. doi: 10.1038/s43018-023-00675-z. [DOI] [PubMed] [Google Scholar]
- 33.Hamel LM, et al. Barriers to clinical trial enrollment in racial and ethnic minority patients with cancer. Cancer Control. 2016;23:327–337. doi: 10.1177/107327481602300404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Khalil L, et al. Racial and ethnic diversity in SARS-CoV-2 vaccine clinical trials conducted in the United States. Vaccines (Basel) 2022;10:290. doi: 10.3390/vaccines10020290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Augusto DG, et al. A common allele of HLA is associated with asymptomatic SARS-CoV-2 infection. Nature. 2023;620:128–136. doi: 10.1038/s41586-023-06331-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hovhannisyan A, et al. HLA-C*04:01 affects HLA class I heterozygosity and predicted affinity to SARS-CoV-2 peptides, and in combination with age and sex of armenian patients contributes to COVID-19 severity. Front. Immunol. 2022;13:769900. doi: 10.3389/fimmu.2022.769900. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Langton DJ, et al. The influence of HLA genotype on the severity of COVID-19 infection. HLA. 2021;98:14–22. doi: 10.1111/tan.14284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Genetic Analysis of Psoriasis C et al. A genome-wide association study identifies new psoriasis susceptibility loci and an interaction between HLA-C and ERAP1. Nat. Genet. 2010;42:985–990. doi: 10.1038/ng.694. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Chen YM, et al. Epidemiological and genetic correlates of severe acute respiratory syndrome coronavirus infection in the hospital with the highest nosocomial infection rate in Taiwan in 2003. J. Clin. Microbiol. 2006;44:359–365. doi: 10.1128/JCM.44.2.359-365.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Ng MH, et al. Association of human-leukocyte-antigen class I (B*0703) and class II (DRB1*0301) genotypes with susceptibility and resistance to the development of severe acute respiratory syndrome. J. Infect. Dis. 2004;190:515–518. doi: 10.1086/421523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Correale P, et al. HLA-B*44 and C*01 prevalence correlates with Covid19 spreading across Italy. Int. J. Mol. Sci. 2020;21:5205. doi: 10.3390/ijms21155205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Cancellieri S, et al. Human genetic diversity alters off-target outcomes of therapeutic gene editing. Nat. Genet. 2023;55:34–43. doi: 10.1038/s41588-022-01257-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Lander ES, et al. Initial sequencing and analysis of the human genome. Nature. 2001;409:860–921. doi: 10.1038/35057062. [DOI] [PubMed] [Google Scholar]
- 44.Liao WW, et al. A draft human pangenome reference. Nature. 2023;617:312–324. doi: 10.1038/s41586-023-05896-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Fatumo S, et al. A roadmap to increase diversity in genomic studies. Nat. Med. 2022;28:243. doi: 10.1038/s41591-021-01672-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Bui H-H, et al. Predicting population coverage of T-cell epitope-based diagnostics and vaccines. BMC bioinformatics. 2006;7:153. doi: 10.1186/1471-2105-7-153. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data generated or analysed during this study are included in this published article and its supplementary information files.
All R code used in the current study is available from GitHub: https://github.com/roderickslieker/HLA_Disparity