Skip to main content
Springer logoLink to Springer
. 2020 May 14;139(6):865–875. doi: 10.1007/s00439-020-02177-9

Minding the gap in HIV host genetics: opportunities and challenges

Shanelle N Gingras 1,2,, David Tang 1,2, Jeffrey Tuff 1,2, Paul J McLaren 1,2
PMCID: PMC7272494  PMID: 32409920

Abstract

Genome-wide association studies (GWAS) have been successful in identifying and confirming novel genetic variants that are associated with diverse HIV phenotypes. However, these studies have predominantly focused on European cohorts. HLA molecules have been consistently associated with HIV outcomes, some of which have been found to be population specific, underscoring the need for diversity in GWAS. Recently, there has been a concerted effort to address this gap that leads to health care (disease prevention, diagnosis, treatment) disparities with marginal improvement. As precision medicine becomes more utilized, non-European individuals will be more and more disadvantaged, as the genetic variants identified in genomic research based on European populations may not accurately reflect that of non-European individuals. Leveraging pre-existing, large, multiethnic cohorts, such as the UK Biobank, 23andMe, and the National Institute of Health’s All of Us Research Program, can contribute in raising genomic research in non-European populations and ultimately lead to better health outcomes.

Genomic studies in HIV: progress to date

Since the completion of the human genome project, genome-wide association studies (GWAS) have identified thousands of reproducible genetic associations with hundreds of complex human traits (Collins et al. 2004; MacArthur et al. 2018). Although chronic diseases, such as type 2 diabetes, have amassed tremendously large sample sizes resulting in > 100 associated loci (Xue et al. 2018), infectious diseases have only begun to be explored to the same level despite being a major cause of morbidity and mortality globally (Wang et al. 2016). The first GWAS in infectious diseases, performed in people living with HIV (PLWH), identified two independent genetic variants located in human leukocyte antigen (HLA)-B and -C that associated with set point viral load (spVL). This phenotype, defined as mean copies of the HIV RNA genome detected in untreated individuals during the chronic phase of infection (Fellay et al. 2007), correlates both with rate of disease progression to acquired immunodeficiency disease (AIDS) (Mellors et al. 1995) and with transmission potential (Quinn et al. 2000). The observed genetic variants explained 9.6% and 6.5% of the variation seen, respectively, and held promise that natural models of HIV control could be informative for design of novel HIV intervention strategies (Fellay 2009; Fellay et al. 2007). Since that first study, technological advances in genome-wide platforms have improved the ability to detect and confirm genetic variants identified in GWAS, however, the majority of these studies have been conducted in populations of primarily European descent (Table 1) (MacArthur et al. 2018). The major phenotypes explored in HIV GWAS are: susceptibility to infection, spVL, and disease progression, measured as rate of CD4 T-cell decline (Le Clerc et al. 2009; Herbeck et al. 2010; van Manen et al. 2011). However, studies of these critical phenotypes are much more commonly carried out in European cohorts (Table 1). Furthermore, the majority of studies that do focus on a multiethnic cohort have a greater proportion of European individuals included. With a predominant focus on European individuals, genetic variants driving HIV phenotypes may go undetected.

Table 1.

Genome-wide association studies on human immunodeficiency virus-1 infection

First author Publication date (yyyy-mm-dd) Journal Reported trait Sample number (ancestry) Cohort used References
European Ancestry Cohort
 Fellay 2007-07-19 Science HIV viral setpoint 486 European EuroCHAVI Fellay et al. (2007)
 Limou 2000-01-01 J Infect Dis AIDS progression 1627 European, 86 NR GRIV Limou et al. (2009)
 Le Clerc 2009-09-15 J Infect Dis AIDS 2134 European GRIV Le Clerc et al. (2009)
 Fellay 2009-12-24 PLoS Genet HIV-1 control 2362 European EuroCHAVI Fellay (2009)
 Ferreira 2009-12-31 AM J Hum Genet

CD56 positive NK-lymphocyte cell count

CD19-positive B-lymphocyte count

CD8-positive T-lymphocyte count

CD4-positive T-lymphocyte count

2538 European Siblings from 1089 Australian families Ferreira et al. (2010)
 Shrestha 2010-01-01 AIDS Carotid atherosclerosis in HIV infection 177 European FRAM Shrestha et al. (2010)
 Herbeck 2010-02-15 J Infect Dis HIV-1 progression 156 European MACS Herbeck et al. (2010)
 Bol 2011-02-25 PLoS One HIV-1 replication 191 European Healthy blood donors Bol et al. (2011)
 Troyer 2011-05-01 J Infect Dis AIDS progression 755 European GRIV Troyer et al. (2011)
 van Manen 2011-07-21 PLoS One HIV-1 progression 404 European ACS van Manen et al. (2011)
 Irvin 2011-12-01 Pharmacogenet Genomics Fat distribution (HIV) 192 European FRAM Irvin et al. (2011)
 Lane 2013-02-10 Hum Mol Genet HIV-1 susceptibility 1195 European CHAVI014, MHCS Lane et al. (2013)
 McLaren 2013-07-25 PLoS Pathog HIV-1 susceptibility 13,581 European ACTG; ALIVE; ACS; ANRS CO18; ANRS PRIMO Cohort; CCR; CHAVI; Danish HIV Cohort Study; GISHEAL; GRIV Cohort; HGDS; Hospital Clinic-IDIBAPS Acute/Recent HIV-1 Infection; Icona Foundation Study; International HIV Controllers Study, The IrsiCaixa Foundation Acute/Recent HIV-1 Infection Cohort; Modena Cohort; MACS; MHCS; Pumwani Sex Workers Cohort; SFCCC; The GPC Cohort/ Sanger Institute in Oxford, UK, and in Uganda; SHCS; University Medical Center Utrecht Dutch Controls Cohort; NHS; WTCCC3; Western Australian HIV Cohort Study McLaren et al. (2013)
 McLaren 2015-11-09 Proc Natl Acad Sci U S A Setpoint viral load in HIV-1 infection 6315 European EuroCHAVI, ALIVE, DCG, HGDS, MHCS, SFCC, GRIV, PRIMO, MACS, ACTG, IHCS, ACS, UHS McLaren et al. (2015)
 Ulveling 2016-07-29 Hepatology Liver fibrosis severity in HIV/hepatitis C co-infection 289 European French National Agency for Research on AIDS and Viral Hepatitis CO13 HEPAVIH Ulveling et al. (2016)
African Ancestry Cohorts
 Pelak 2010-04-15 J Infect Dis HIV-1 viral setpoint 515 African American or Afro-Caribbean NHS; MACS Pelak et al. (2010)
 Petrovski 2011-02-20 AIDS HIV-1 susceptibility 1379 Sub-Saharan African Recruited from the Center for HIV/AIDS Vaccine Immunology Clinical Core in Malawi Petrovski et al. (2011)
 Lingappa 2011-12-12 PLoS One HIV-1 susceptibility 798 Sub-Saharan African, African unspecified The Partners in Prevention HSV/HIV Transmission Study; The Couples Observational Study Lingappa et al. (2011)
 Carr 2017-01-05 J Antimicrob Chemother Nevirapine-induced hypersensitivity in HIV (drug-induced liver injury) 203 Sub-Saharan African

Recruited from the Queen Elizabeth Central

Hospital, Malawi

Carr et al. (2017)
Nevirapine-induced hypersensitivity in HIV 333 Sub-Saharan African
Nevirapine-induced hypersensitivity in HIV (rash) 398 Sub-Saharan African
Nevirapine-induced hypersensitivity in HIV (hypersensitivity syndrome) 205 Sub-Saharan African
Nevirapine-induced hypersensitivity in HIV (Stevens-Johnson syndrome or toxic epidermal necrosis) 233 Sub-Saharan African
Asian Ancestry Cohorts
 Uttayamakul 2015-06-15 AIDS Res Hum Retroviruses Response to stavudine in HIV (lipoatrophy) 141 Southeast Asian Bamrasnaradura Infectious Diseases Institute, Nonthaburi, Thailand Uttayamakul et al. (2015)
 Wei 2015-06-17 Sci Rep HIV-1 viral setpoint 538 East Asian Chinese Medical University: HAN, YUN, XIN Wei et al. (2015)
Multiethnic cohorts
 Pereyra 2010-11-04 Science HIV-1 control

1712 European;

1233 African American or Afro-Caribbean;

677 Hispanic or Latin American

International HIV Controllers Study, ACTG Pereyra et al. (2010)
 Moore 2015-01-09 Open Forum Infect Dis Neutrophil count in HIV infection; HDL cholesterol in HI -infection; total bilirubin levels in HIV-1 infection 2547 European, Hispanic or Latin American, African unspecified ACTG Moore et al. (2015)
 Lehmann 2015-02-01 Pharmacogenet Genomics Response to abacavir-containing treatment in HIV-1 infection (virologic failure)

346 European; 265 African unspecified;

175 Hispanic or Latin American

ACTG Lehmann et al. (2015)
 Lehmann 2015-02-01 Pharmacogenet Genomics Response to efavirenz-containing treatment in HIV-1 infection (virologic failure)

737 European; 545 African unspecified;

314 Hispanic or Latin American

ACTG Lehmann et al. (2015)
 Johnson 2015-03-18 PLoS One HIV-1 susceptibility 2004 African American or Afro-Caribbean; 1132 European UHS Johnson et al. (2015)
 Wanga 2015-09-01 Pharmacogenet Genomics Bilirubin measurement, response to tenofovir, HIV infection, creatinine clearance 243 European, 149 African American, 100 Hispanic American, 9 UA ACTG Wanga et al. (2015)
 Jia 2017-04-26 Am J Med Genet B Neuropsychiatr Genet Neurocognitive impairment in HIV-1 infection (continuous)

490 African American or Afro-Caribbean;

443 European;

99 Hispanic or Latin American;

17 NR

CHARTER Jia et al. (2017)
 Jia 2017-04-26 Am J Med Genet B Neuropsychiatr Genet Neurocognitive impairment in HIV-1 infection (dichotomous)

490 African American or Afro-Caribbean;

443 European; 99 Hispanic or Latin American; 17 NR

CHARTER Jia et al. (2017)
 Ekenberg 2019-06-13 J Infect Dis Pre-treatment viral load in HIV-1 infection

572 African unspecified; 1398 European; 418 Hispanic or Latin American;

13 Asian unspecified;

39 NR

START Ekenberg et al. (2019)

GWAS were compiled through the NHGRI-EBI GWAS Catalog using the search terms “HIV-1” and “AIDS”. Cohorts used have been abbreviated. All abbreviations are listed below in alphabetical order. ‘NR’ is not reported ancestry, ‘UA’ is unknown ancestry (MacArthur et al. 2018)

ACS Amsterdam Cohort Studies on HIV Infection and AIDS, ACTG AIDS Clinical Trials Group, ALIVE AIDS Linked to the IntroVenous Experience, CCR Center for Cancer Research, National Cancer Institute, CHARTER CNS HIV Antiretroviral Therapy Effects Research, CHAVI Center for HIV/AIDS Vaccine Immunology, EuroCHAVI Consortium Swiss HIV Cohort Study; Icona Foundation Study; IrsiCaixa Foundation; San Raffaele del Monte Tabor Foundation; Danish Cohort; Royal Perth Hospital: Guy's, King's, and St. Thomas' Hospitals; Modena Cohort; Hospital Clinic-IDIBAPS cohort; Multicenter AIDS Cohort Study, FRAM Fat Redistribution and Metabolic Change in HIV Infection, GISHEAL The Genetic and Immunological Studies of European and African HIV-1 Long Term Non-Progressors, HGDS Hemophilia Growth and Development, MACS Multicenter AIDS Cohort Study, MHCS Multicenter Hemophilia Cohort Study, NHS The US Military HIV Natural History Study, SFCCC San Francisco City Clinic Cohort, SHCS Swiss HIV Cohort Study, START Strategic Timing of AntiRetroviral Treatment, UHS Urban Health Study, WTCCC3 The Wellcome Trust Case Control Consortium study of the genetics of host control of HIV-1 infection in the Gambia

The largest GWAS done in HIV has been conducted by the International Collaboration for the Genomics of HIV, which has assessed the host genetic contribution to HIV acquisition and spVL (McLaren et al. 2013, 2015). For HIV acquisition, although no genome-wide significant variants were identified by comparing > 6000 HIV-infected individuals to > 7000 general population controls, a subsequent analysis did identify significant heritability due to common genetic variants, suggesting larger more diverse samples may uncover novel associations (McLaren et al. 2013; Power et al. 2017). Aggregating data across studies can increase the power to detect associations by addressing the issue of sample size, however, there are many factors to consider for aggregation and subsequent meta-analysis. Data access and exchange can be a difficult balance between protecting the study participants and providing enough information to be useful to the wider scientific community without data breaches, reidentification, or misuse (Zook et al. 2017). Ultimately sharing data positively outweighs the challenges, however, with the stigmatizing nature of an HIV diagnosis, it is imperative that researchers take the proper precautions to protect individuals’ privacy. To address these concerns, studies now offer very broad consent forms to enable the future use and exchange of information and there are protocols that must be followed for data exchange and storage to limit the possibility of data breaches (Zook et al. 2017). Meta-analysis historically has required a homogenous population of study, which is problematic for individuals of African ethnicity as diversity is not only higher than the European population, but they are also less represented in genomic research, leading to smaller potential for data aggregation of a homogenous population (Popejoy and Fullerton 2016; Russo 2007). Statistical methods have been developed that not only account for the differences observed between populations (allelic heterogeneity, linkage disequilibrium (LD) patterns, effect sizes), but also takes into account different genotyping platforms (Wang et al. 2012). It is also important to consider overlap of individuals between studies and cohorts when aggregating data together. As we can see in Table 1, GWAS for HIV reuse multiple cohorts and subsets of cohorts for various studies. Aggregation of these data would require accounting for each individual overlap to ensure no bias is introduced. Aggregating the large amount of data available in HIV genomics would be no easy feat, but looking to current databases that have freely available genome and exome data (Genome Aggregation Database (gnomAD), The 1000 Genomes Project Consortium) can serve as an example for data aggregation (Auton et al. 2015; Karczewski et al. 2019). Exome and genome sequencing projects should be undertaken to address the role of rare variation that is not well captured by GWAS, and with a growing number of sequencing projects, it would serve the scientific community well to aggregate these data for future large-scale use.

For spVL, in addition to reaffirming the protective role of HLA-B*57:01, B*27:05, B*14:02 among other alleles, analysis mapping individual amino acid residues in the HLA-A and B binding grooves showed that strong effects on spVL were mediated by variable residues (McLaren et al. 2015). As with this largest study, the majority of GWAS of HIV spVL and disease progression have underscored the importance of HLA class I genes in controlling HIV replication and limiting disease progression. HLA class I molecules are responsible for peptide binding specificity and variation within this region has important implications in disease pathogenesis (Pelak et al. 2010; Peprah et al. 2015; Pereyra et al. 2010; Ramsuran et al. 2018). Multiple variants within class I HLA molecules have been found to reduce spVL and be protective for disease progression in HIV, including the population-specific HLA-B*57:01 for Europeans and HLA-B*57:03 for Africans (Fellay 2009; Fellay et al. 2007; McLaren et al. 2013; Pelak et al. 2010; Pereyra et al. 2010). The mechanism of protection attributed to HLA-B alleles has been proposed to be through a more diverse recognition of a broader epitope repertoire and through recognition of more conserved HIV peptides (Arora et al. 2019; Gaiha et al. 2019). However, it has also been proposed that class I HLA alleles may exert non-peptide dependent protective effects. Recently, Ramsuran et al. investigated the impact of HLA-A on control of HIV and its interaction with other HLA molecules (HLA-E and -B) in an ethnically diverse cohort (2018). Similar to HLA-C, HLA-A is differentially expressed based on allotype specificity (Apps et al. 2013; Ramsuran et al. 2018). However, in contrast to HLA-C, the surface expression of HLA-A is 13- to 18-fold higher and is twofold more polymorphic (Ramsuran et al. 2018). The authors observed that HLA-A expression levels positively associated with a higher VL across populations in spite of varying allelic frequencies, an association that was confirmed in a cohort of individuals with known seroconversion dates (Ramsuran et al. 2018). Furthermore, increased HLA-A expression level was strongly associated with elevated mean VL, increased odds of being an HIV non-controller versus controller, and decreased CD4 + T-cell counts (Ramsuran et al. 2018).

The mechanism underlying the impact of HLA-A on HIV progression was further investigated in the context of its relationship to HLA-E. HLA-E is a ligand for the CD94/NKG2A receptor found on both natural killer (NK) cells and T cells. When stably bound to a signal peptide produced by the leader sequences of HLA-A, -B, and -C, HLA-E strongly inhibits NK cells activity (Lee et al. 1998; Ramsuran et al. 2018). Interestingly, highly expressed HLA-A alleles significantly correlated with higher expression of HLA-E on the cell surface, independent of the known protective HLA-B alleles, and reduced the control of HIV via NKG2A-mediated NK inhibition by interfering with the clearance of HIV-infected cells (Ramsuran et al. 2018). HIV favors the increased expression of HLA-E to decrease the risk of NK cell recognition, yet this immune evasion strategy heightens the risk of restricted CD8 + T-cell recognition (Hansen et al. 2016). Recently, researchers have been able to exploit this mechanism by eliciting unconventional CD8 + T-cell responses to MHC-E peptide presentation in rhesus macaques (RM) using a rhesus cytomegalovirus (RhCMV) SIVgag vector-based vaccine (Hansen et al. 2013, 2016; Walters et al. 2018). This vaccine broadly enables CD8 + T cells to recognize MHC-E-restricted presentation of highly variable peptides and can be targeted for a pathogen’s specific vulnerability through deletions in the RhCMV genome, in the case of HIV, the upregulation of MHC-E on the cell surface (Früh and Picker 2017; Hansen et al. 2013, 2016; Walters et al. 2018). With more than 50% of the vaccinated RM demonstrating protection to a highly pathogenic SIV strain and infected RM being indiscernible from vaccinated unchallenged RM leading up to 18 months post-infection, targeting of HLA-E upregulation during HIV infection is an attractive strategy moving forward (Früh and Picker 2017; Hansen et al. 2013, 2016; Walters et al. 2018). These data demonstrate the myriad of ways in which HLA may contribute to control and clearance of infection.

Multiple independent genetic associations have also been reported in the CCR5 gene region. The strongest of which, CCR5∆32 confers protection against infection in the homozygous state and slows disease progression in heterozygotes (Dean et al. 1996). Recently, Kulkarni et al. demonstrated that an additional polymorphism in the same region, rs1015164 which also impacts spVL (McLaren et al. 2015), associated with expression of an antisense long non-coding RNA that overlaps CCR5, dubbed CCR5-AS (2019). Expression level of CCR5-AS positively associated with expression of CCR5 and, when depleted, led to a reduction in HIV infection in vitro highlighting an alternative mechanism for CCRR5-related HIV control (Kulkarni et al. 2019).

Host control of HIV has been shown to differ across ethnicities likely due in part to population-specific variants associated with susceptibility to infection and disease progression (Peprah et al. 2015; Pereyra et al. 2010). Frequency and effect size of risk alleles can vary substantially across populations, making it critical to identify associated genetic variation in non-European populations, particularly in Africa where HIV is found in the highest proportion worldwide (Peprah et al. 2015; UNAIDS 2019). Closing this gap is not only important to improve health disparities, but can also improve genetic discovery by uncovering risk alleles not found in European populations. Recently, efforts have been made to improve ethnic diversity in HIV genomic studies. Ekenberg et al. assessed genetic regulators of VL on a subset of the Strategic Timing of Antiretroviral Treatment (START) cohort, that enrolled ART-naïve, ethnically diverse (35 countries from 5 continents), HIV-positive individuals (Ekenberg et al. 2019). Five SNPs met genome-wide significance, all of which were found in the MHC class I region (Ekenberg et al. 2019). The most significant SNP, located upstream of HLA-B, was identified to tag HLA-B*5701 (Ekenberg et al. 2019). Importantly, the observed variant exhibited high positive predictive values (PPV) in White, Asian, and Hispanic populations, however, in the Black participants PPV was low (Ekenberg et al. 2019). Two more population-specific variants were also confirmed in this study, HLA-B*5701 and HLA-B*5703, with higher frequencies found in European and African populations, respectively (Ekenberg et al. 2019). Two SNPS that have previously been reported to control HIV infection, one located in the HLA complex P5 gene and another in close proximity to the MICA gene, were found to be in moderate LD with the top associated SNP (Ekenberg et al. 2019). The SNP found close to the MICA gene was also found to be statistically significant across all four ethnicities (Ekenberg et al. 2019). These data demonstrate that although there are variants that consistently associate with phenotypes across populations, there are variants that are specific to a population and unique SNPs can tag HLA alleles distinct to a population due to underlying variations in LD structure, reflecting out of Africa migration patterns and the shortening of African LD blocks through increased generational decay (Charles et al. 2014; Ekenberg et al. 2019).

Opportunities for novel discovery using large, multiethnic data sets

Currently, there are large multiethnic genetic studies that are ongoing to address the disparity in genomic medicine across ancestry groups. One such study, The UK Biobank, consists of 500,000 UK participants enrolled at ages 40–69 from diverse ethnic backgrounds (UK Biobank 2017). All 500,000 participants have been genotyped and electronic health records (general practice, hospital episodes, cancer, death) have been linked to the participants allowing for the joint analysis of genetic, clinical and demographic data on relevant clinical samples (blood, urine, saliva) taken at the time of enrollment for all participants, and at follow-up appointments for a subset of 20,000 individuals (UK Biobank 2017).

Mentzer et al. used the UK Biobank resource to study the host–pathogen–disease relationship using a Multiplex Serology platform optimized for 20 different pathogens, with three disease endpoints of known (human papillomavirus (HPV)-16 and cervical cancer), potential (Epstein–Barr virus (EBV) and cytomegalovirus (CMV) and multiple sclerosis), and unlikely (coeliac disease) infectious disease association (Mentzer et al. 2019). They observed significant seroprevalence differences among various demographics, including sex, age, lifetime sexual partners, and self-reported ethnicity (hepatitis B virus, CMV) for varying infections, and were able to confirm multiple previously reported associations with infectious diseases (Mentzer et al. 2019). Interestingly, the number of lifetime sexual partners was not associated with infectious diseases with well-established sexual routes of transmission (HIV, HBV, HCV, HTLV-1), which appears to be due to the rare occurrences of these particular diseases in this study population (Mentzer et al. 2019). However, when considering HIV and HBV in men who have sex with men, a statistically significant association was identified (Mentzer et al. 2019). The use of large, pre-established, ethnically diverse cohorts is an intriguing option for GWAS moving forward. Samples such as the UK Biobank not only have the power to detect novel variants, but can also help to address the disproportionately low number of non-European cohorts currently seen in genomics, which continues to be exacerbated through the resampling of European cohorts (Popejoy and Fullerton 2016).

Similarly, direct to consumer genomic companies, such as 23andMe also provide a potential resource for enhanced genetic discovery in multiple populations. 23andMe have currently sold more than 10 million genotyping kits, of which 80% of customers have consented to participate in ongoing research (23andMe 2019). However, their client base currently mirrors the overall ethnic diversity seen in genomic research, with the majority of their data set being Caucasian and Asian individuals (23andMe 2019; Popejoy and Fullerton 2016). To address this discrepancy, 23andMe has created multiple initiatives to increase their genetic diversity through the Global Genetics Project, Roots Into the Future, African-American Sequencing Project, and African Genetics Project (23andMe 2019). These initiatives have helped to reach a milestone of 45,000 African Americans genotyped on their platform as of May 2016 (23andMe 2019). Leveraging these data would help to increase the diversity seen in genomic research and ultimately could help to improve precision medicine options in non-Europeans. Currently, 23andMe has studied the impact of genetic variants for common infections (chickenpox, cold sores, mononucleosis, yeast infections, etc.) in a cohort of 200,000 European individuals (Tian et al. 2017). They found 13 genome-wide significant SNPs located within the HLA region, many of which were located in the peptide binding cleft, along with more than 40 SNPs located elsewhere (Tian et al. 2017). Inclusion of multiple ethnic groups in the same type of study will serve to further reduce the ancestry gap in genomic medicine.

Precision medicine in HIV

With the expansion of available cohorts in terms of sample size, ethnic diversity, demographics, and clinical samples, precision medicine is becoming an increasingly more attainable approach. Precision medicine uses individual differences in genetics, socioeconomic factors, environment, and lifestyle to tailor health care options (disease prevention, diagnosis, treatment) to the individual, instead of the population at large which may not accurately reflect the individual. Genetic precision medicine was implemented in HIV care early on in resource-rich countries, when it was observed that individuals carrying the HLA-B*57:01 allele were at high risk for a hypersensitivity reaction to abacavir, a commonly used antiretroviral drug (Hetherington et al. 2002; Mallal et al. 2002). Pre-screening for this allele was shown to reduce this reaction in predominantly white populations (Mallal et al. 2008).

Adverse events have been observed for other commonly used antiretrovirals, including efavirenz (EFV) which was among the first drug to be co-formulated into single pill regimens for mass rollout globally. Several genetic variants in the drug-metabolizing enzyme CYP2B6 have been associated with high EFV plasma concentration and increased risked of adverse neuropsychiatric effects (Haas et al. 2004; Nyakutira et al. 2008; Rotger et al. 2005). Homozygosity, for one such variant, rs3745274, increases the risk of adverse reaction to EFV by fivefold and, given the population differences in frequency, this risk genotype is much more common in Africans (13.7%) than Europeans (5.6%) (Auton et al. 2015). Consistent with this, after the mass rollout of the new three-in-one single pill regimen containing EFV in Zimbabwe, an alarming number of patients stopping the drug due to central nervous system side effects was observed, underscoring the need to understand the distribution of such genotypes across treatment contexts (Nordling 2017). This will only become more necessary as global rollout of ART seeks to put > 80% of the 37.8 million people living with HIV on ART (Unaids 2020). Genomic pre-screening prior to ART initiation may help to reduce adverse events and could also include profiles for risk of developing long-term metabolic and cardiovascular complications known to be increased in people on ART (Rotger et al. 2010, 2013). The benefit of genomic testing prior to ART initiation will have to be weighed against concerns surrounding patient privacy and data security (Mclaren et al. 2016), however, if the diversity gap in genomics goes unaddressed, precision medicine will primarily benefit European populations. This is further underscored by a recent analysis of the performance of polygenic risk scores in predicting disease in non-European populations which showed an average 50% reduction in predictive accuracy of these scores in populations of African ancestry (Duncan et al. 2019).

One of the most ambitious initiatives in advancing precision medicine is the National Institute of Health’s All of Us Research Program. By 2024, at least 1 million diverse participants living in the United States will have contributed biospecimens, physical measurements, health and lifestyle surveys, socioeconomic data, health records, family medical history, and or wearable device data (The All of Us Research Program Investigators 2019). One of the core values of the All of Us Research Program is participant diversity and inclusiveness (The All of Us Research Program Investigators 2019). As of July 2019, more than 175,000 participants contributed biospecimens; of those participants, 51% are non-white and 80% meet the All of Us Research Program’s definition of being underrepresented in biomedical research (The All of Us Research Program Investigators 2019). Overall, this will be a historic effort in identifying new risk factors (biology, environment and social determinants) to improve diagnosis, treatment and care for diverse populations (The All of Us Research Program Investigators 2019). In addition, valuable population reference panels, such as the African Genome Project will improve ability to interpret results of genome-wide studies (Gurdasani et al. 2015). This approach should be expanded to additional valuable resources such as GTEx to better understand how genetic variability impacts gene expression in multiple cell types in diverse ethnic groups (Lonsdale et al. 2013).

Conclusion

GWAS have been failing to accurately reflect the genetic diversity seen worldwide due to the extensive research in predominantly European populations. In recent years, there has been a concerted effort to address the gap in genomics between European and non-European cohorts, however, the discrepancy is still pronounced. Underrepresented populations have a greater risk for poorer health outcomes due to subpar prevention, diagnosis, and treatment options for their population. Through inclusivity, genomics has the opportunity to address these health disparities and improve health globally.

Compliance with ethical standards

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Shanelle N. Gingras, Email: gingrass@myumanitoba.ca, Email: shanelle.gingras@canada.ca

Paul J. McLaren, Email: Paul.Mclaren@canada.ca

References

  1. 23andMe (2019) 23andMe for Scientists | Accelerating Genomics Research. https://research.23andme.com/. Accessed 10 Dec 2019
  2. Apps R, et al. Influence of HLA-C expression level on HIV control. Science. 2013;340(6128):87–91. doi: 10.1126/science.1232685. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Arora J, et al. HIV Peptidome-wide association study reveals patient-specific epitope repertoires associated with HIV control. Proc Natl Acad Sci USA. 2019;116(3):944–949. doi: 10.1073/pnas.1812548116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Auton A, et al. A global reference for human genetic variation. Nature. 2015;526(7571):68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bol SM, et al. Genome-wide association study identifies single nucleotide polymorphism in DYRK1A associated with replication of HIV-1 in monocyte-derived macrophages. PLoS ONE. 2011;6(2):e17190. doi: 10.1371/journal.pone.0017190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Carr DF, et al. Genome-wide association study of nevirapine hypersensitivity in a sub-saharan african HIV-infected population. J Antimicrob Chemother. 2017;72:1152–1162. doi: 10.1093/jac/dkw545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Charles BA, Daniel S, Charles N. Accounting for linkage disequilibrium in association analysis of diverse populations. Gen Epidemiol. 2014;38(3):265–273. doi: 10.1002/gepi.21788. [DOI] [PubMed] [Google Scholar]
  8. Collins FS, Lander ES, Rogers J, Waterson RH. Finishing the euchromatic sequence of the human genome. Nature. 2004;431(7011):931–945. doi: 10.1038/nature03001. [DOI] [PubMed] [Google Scholar]
  9. Dean M, et al. Genetic restriction of HIV-1 infection and progression to AIDS by a deletion allele of the CKR5 Structural Gene. Hemophilia Growth and Development Study, Multicenter AIDS Cohort Study, Multicenter Hemophilia Cohort Study, San Francisco City Cohort, ALIVE Study. Science. 1996;273(5283):1856–1862. doi: 10.1126/science.273.5283.1856. [DOI] [PubMed] [Google Scholar]
  10. Duncan L, et al. Analysis of polygenic risk score usage and performance in diverse human populations. Nat Commun. 2019;10(1):1–9. doi: 10.1038/s41467-019-11112-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Ekenberg C, et al. Association between single-nucleotide polymorphisms in HLA alleles and human immunodeficiency virus type 1 viral load in demographically diverse, antiretroviral therapy-naive participants from the strategic timing of antiretroviral treatment trial. J Infect Dis. 2019;220(8):1325–1334. doi: 10.1093/infdis/jiz294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Fellay J. Common genetic variation and the control of HIV-1 in humans. PLoS Genet. 2009;5(12):e1000791. doi: 10.1371/journal.pgen.1000791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Fellay J, et al. A whole-genome association study of major determinants for host control of HIV-1. Science. 2007;317(5840):944–947. doi: 10.1126/science.1143767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Ferreira MAR, et al. Quantitative trait loci for CD4:CD8 lymphocyte ratio are associated with risk of type 1 diabetes and HIV-1 immune control. Am J Hum Genet. 2010;86(1):88–92. doi: 10.1016/j.ajhg.2009.12.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Früh K, Louis P. CD8+ T cell programming by cytomegalovirus vectors: applications in prophylactic and therapeutic vaccination. Curr Opin Immunol. 2017;47:52–56. doi: 10.1016/j.coi.2017.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Gaiha GD, et al. Structural topology defines protective CD8+ T cell epitopes in the HIV proteome. Science. 2019;364(6439):480–484. doi: 10.1126/science.aav5095. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Gurdasani D, et al. The African genome variation project shapes medical genetics in Africa. Nature. 2015;517(7534):327–332. doi: 10.1038/nature13997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Haas DW, et al. Pharmacogenetics of efavirenz and central nervous system side effects: an adult AIDS Clinical Trials Group Study. AIDS. 2004;18(18):2391–2400. [PubMed] [Google Scholar]
  19. Hansen SG, et al. Cytomegalovirus vectors violate CD8+ T cell epitope recognition paradigms. Science. 2013;340(6135):1237874. doi: 10.1126/science.1237874. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Hansen SG, et al. Broadly targeted CD8+ T cell responses restricted by major histocompatibility complex-E HHS public access. Science. 2016;351(6274):714–720. doi: 10.1126/science.aac9475. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Herbeck JT, et al. Multistage genomewide association study identifies a locus at 1q41 associated with rate of HIV-1 disease progression to clinical AIDS. J Infect Dis. 2010;201(4):618–626. doi: 10.1086/649842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hetherington S, et al. Genetic variations in HLA-B region and hypersensitivity reactions to abacavir. Lancet. 2002;359(9312):1121–1122. doi: 10.1016/S0140-6736(02)08158-8. [DOI] [PubMed] [Google Scholar]
  23. Irvin MR, et al. Genes linked to energy metabolism and immunoregulatory mechanisms are associated with subcutaneous adipose tissue distribution in HIV-infected men. Pharmacogen Genom. 2011;21(12):798–807. doi: 10.1097/FPC.0b013e32834b68f9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Jia P, et al. Genome-Wide Association Study of HIV-associated neurocognitive disorder (HAND): a CHARTER Group Study. Am J Med Genet. 2017;174(4):413–426. doi: 10.1002/ajmg.b.32530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Johnson EO, et al. Novel genetic locus implicated for HIV-1 acquisition with putative regulatory links to HIV replication and infectivity: a Genome-Wide Association Study. PLoS ONE. 2015;10(3):e0118149. doi: 10.1371/journal.pone.0118149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Karczewski KJ, et al. Variation across 141,456 human exomes and genomes reveals the spectrum of loss-of-function intolerance across human protein-coding genes. bioRxiv. 2019 doi: 10.1101/531210v3. [DOI] [Google Scholar]
  27. Kulkarni S, et al. CCR5AS LncRNA variation differentially regulates CCR5, influencing HIV disease outcome. Nat Immunol. 2019;20(7):824–834. doi: 10.1038/s41590-019-0406-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Lane J, et al. A genome-wide association study of resistance to HIV infection in highly exposed uninfected individuals with hemophilia A. Hum Mol Genet. 2013;22(9):1903–1910. doi: 10.1093/hmg/ddt033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Le Clerc S, et al. Genomewide Association Study of a rapid progression cohort identifies new susceptibility alleles for AIDS (ANRS Genomewide Association Study 03) J Infect Dis. 2009;200(8):1194–1201. doi: 10.1086/605892. [DOI] [PubMed] [Google Scholar]
  30. Lee Ni, et al. HLA-E is a major ligand for the natural killer inhibitory receptor CD94/NKG2A. Proc Natl Acad Sci USA. 1998;95(9):5199–5204. doi: 10.1073/pnas.95.9.5199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Lehmann DS, et al. Genome-wide association study of virologic response with Efavirenz-containing or Abacavir-containing regimens in AIDS clinical trials group protocols. Pharmacogenet Genomics. 2015;25(2):51–59. doi: 10.1097/FPC.0000000000000106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Limou S, et al. Genomewide Association Study of an AIDS-nonprogression cohort emphasizes the role played by HLA genes (ANRS Genomewide Association Study 02) J Infect Dis. 2009;199(3):419–426. doi: 10.1086/596067. [DOI] [PubMed] [Google Scholar]
  33. Lingappa JR, et al. Genomewide Association Study for determinants of HIV-1 acquisition and viral set point in HIV-1 serodiscordant couples with quantified virus exposure ed Roberto F Speck. PLoS ONE. 2011;6(12):e28632. doi: 10.1371/journal.pone.0028632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Lonsdale J, et al. The genotype-tissue expression (GTEx) project. Nat Genet. 2013;45(6):580–585. doi: 10.1038/ng.2653. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. MacArthur JAL, et al. NHGRI-EBI GWAS catalog. Nucleic Acids Res. 2018;47:1005–1012. [Google Scholar]
  36. Mallal S, et al. Association between presence of HLA-B*5701, HLA-DR7, and HLA-DQ3 and hypersensitivity to HIV-1 reverse-transcriptase inhibitor abacavir. Lancet. 2002;359(9308):727–732. doi: 10.1016/s0140-6736(02)07873-x. [DOI] [PubMed] [Google Scholar]
  37. Mallal S, et al. HLA-B*5701 screening for hypersensitivity to abacavir. N Engl J Med. 2008;358(6):568–579. doi: 10.1056/NEJMoa0706135. [DOI] [PubMed] [Google Scholar]
  38. McLaren PJ. Polymorphisms of large effect explain the majority of the host genetic contribution to variation of HIV-1 virus load. Proc Natl Acad Sci USA. 2015;112(47):14658–14663. doi: 10.1073/pnas.1514867112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. McLaren PJ, et al. Association study of common genetic variants and HIV-1 acquisition in 6,300 infected cases and 7,200 controls. PLoS Pathog. 2013;9(7):e1003515. doi: 10.1371/journal.ppat.1003515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Mclaren PJ, et al. Privacy-preserving genomic testing in the clinic: a model using HIV treatment. Genet Med. 2016;18(8):814–822. doi: 10.1038/gim.2015.167. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Mellors JW, et al. Quantitation of HIV-1 RNA in plasma predicts outcome after seroconversion. Ann Intern Med. 1995;122(8):573–579. doi: 10.7326/0003-4819-122-8-199504150-00003. [DOI] [PubMed] [Google Scholar]
  42. Mentzer AJ, et al. Identification of host-pathogen-disease relationships using a scalable multiplex serology platform in UK biobank. medrxiv. 2019 doi: 10.1101/19004960. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Moore CB, et al. Phenome-wide association study relating pretreatment laboratory parameters with human genetic variants in AIDS Clinical Trials Group Protocols. Open Forum Infect Dis. 2015;2(1):113. doi: 10.1093/ofid/ofu113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Nordling L. How the genomics revolution could finally help Africa. Nature. 2017;544(7648):20–22. doi: 10.1038/544020a. [DOI] [PubMed] [Google Scholar]
  45. Nyakutira C, et al. High prevalence of the CYP2B6 516G→T(*6) variant and effect on the population pharmacokinetics of Efavirenz in HIV/AIDS outpatients in Zimbabwe. Eur J Clin Pharmacol. 2008;64(4):357–365. doi: 10.1007/s00228-007-0412-3. [DOI] [PubMed] [Google Scholar]
  46. Pelak K, et al. Host determinants of HIV-1 control in African Americans. J Infect Dis. 2010;201(8):1141–1149. doi: 10.1086/651382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Peprah E, et al. Genome-wide association studies in Africans and African Americans: expanding the framework of the genomics of human traits and disease HHS public access. Public Health Genom. 2015;18(1):40–51. doi: 10.1159/000367962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Pereyra F, et al. The major genetic determinants of HIV-1 control affect HLA class I peptide presentation. Science. 2010;330(6010):1551–1557. doi: 10.1126/science.1195271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Petrovski S, et al. Common human genetic variants and HIV-1 susceptibility: a genome-wide survey in a homogeneous African population. AIDS. 2011;25(4):513–518. doi: 10.1097/QAD.0b013e328343817b. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Popejoy AB, Fullerton MS. Genomics is failing on diversity. Nature. 2016;538(7624):161–164. doi: 10.1038/538161a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Power RA, et al. A genome-wide polygenic approach to HIV uncovers link to inflammatory bowel disease and identifies potential novel genetic variants. bioRxiv. 2017 doi: 10.1101/145383v1. [DOI] [Google Scholar]
  52. Quinn TC, et al. Viral load and heterosexual transmission of human immunodeficiency virus type 1. Rakai Project Study Group. N Engl J Med. 2000;342(13):921–929. doi: 10.1056/NEJM200003303421303. [DOI] [PubMed] [Google Scholar]
  53. Ramsuran V, et al. Elevated HLA-A expression impairs HIV control through inhibition of NKG2A-expressing cells. Science. 2018;359(6371):86–90. doi: 10.1126/science.aam8825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Rotger M, et al. Influence OfCYP2B6 polymorphism on plasma and intracellular concentrations and toxicity of Efavirenz and Nevirapine in HIV-infected patients. Pharmacogenet Genom. 2005;15(1):1–5. doi: 10.1097/01213011-200501000-00001. [DOI] [PubMed] [Google Scholar]
  55. Rotger M, et al. Impact of single nucleotide polymorphisms and of clinical risk factors on new-onset diabetes mellitus in HIV-infected individuals. Clin Infect Dis. 2010;51(9):1090–1098. doi: 10.1086/656630. [DOI] [PubMed] [Google Scholar]
  56. Rotger M, et al. Contribution of genetic background, traditional risk factors, and HIV-related factors to coronary artery disease events in HIV-positive persons. Clin Infect Dis. 2013;57(1):112–121. doi: 10.1093/cid/cit196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Russo MW. How to review a meta-analysis. Gastroenterol Hepatol. 2007;3(8):637–642. [PMC free article] [PubMed] [Google Scholar]
  58. Shrestha S, et al. A genome-wide association study of carotid atherosclerosis in HIV-infected men. AIDS. 2010;24(4):583–592. doi: 10.1097/QAD.0b013e3283353c9e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. The All of Us Research Program Investigators The ‘All of Us’ research program. N Engl J Med. 2019;381(7):668–676. doi: 10.1056/NEJMsr1809937. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Tian C, et al. Genome-wide association and HLA region fine-mapping studies identify susceptibility loci for multiple common infections. Nature Communications. 2017;8(1):599. doi: 10.1038/s41467-017-00257-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Troyer JL, et al. Genome-wide association study implicates PARD3B-based AIDS restriction. J Infect Dis. 2011;203(10):1491–1502. doi: 10.1093/infdis/jir046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. UK Biobank. 2017. UK Biobank Showcase User Guide UK Biobank showcase user guide: getting started. https://www.ukbiobank.ac.uk. Accessed 15 Dec 2019).
  63. Ulveling D, et al. A new 3p25 locus is associated with liver fibrosis progression in human immunodeficiency virus/Hepatitis C virus-coinfected patients. Hepatology. 2016;64(5):1462–1472. doi: 10.1002/hep.28695. [DOI] [PubMed] [Google Scholar]
  64. UNAIDS (2019) Global HIV & AIDS Statistics—2019 Fact Sheet | UNAIDS. https://www.unaids.org/en/resources/fact-sheet. Accessed 9 Sept 2019
  65. Unaids (2020) 90–90–90: an ambitious treatment target to help end the aids epidemic
  66. Uttayamakul S, et al. Genome-Wide Association Study of HIV-related lipoatrophy in Thai patients: Association of a DLGAP1 polymorphism with fat loss. AIDS Res Hum Retroviruses. 2015;31(8):792–796. doi: 10.1089/aid.2014.0266. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. van Manen D, et al. Genome-wide association scan in HIV-1-infected individuals identifying variants influencing disease course. PLoS ONE. 2011;6(7):e22208. doi: 10.1371/journal.pone.0022208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Walters LC, et al. Pathogen-derived HLA-E bound epitopes reveal broad primary anchor pocket tolerability and conformationally malleable peptide binding. Nat Commun. 2018;9(1):1–13. doi: 10.1038/s41467-018-05459-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Wang X, et al. A statistical method for region-based meta-analysis of genome-wide association studies in genetically diverse populations. Eur J Hum Genet. 2012;20(4):469–475. doi: 10.1038/ejhg.2011.219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Wang H, et al. Global, regional, and national life expectancy, all-cause mortality, and cause-specific mortality for 249 causes of death, 1980–2015: a systematic analysis for the global burden of disease study 2015. The Lancet. 2016;388(10053):1459–1544. doi: 10.1016/S0140-6736(16)31012-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Wanga V, et al. Genomewide Association Study of Tenofovir pharmacokinetics and creatinine clearance in AIDS clinical trials group protocol A5202. Pharmacogenet Genom. 2015;25(9):450–461. doi: 10.1097/FPC.0000000000000156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Wei Z, et al. Genome-wide association studies of HIV-1 host control in ethnically diverse Chinese populations. Sci Rep. 2015;5:10879. doi: 10.1038/srep10879. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Xue A, et al. Genome-wide association analyses identify 143 risk variants and putative regulatory mechanisms for type 2 diabetes. Nat Commun. 2018;9(1):1–14. doi: 10.1038/s41467-018-04951-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Zook M, et al. Ten simple rules for responsible big data research ed Fran Lewitter. PLoS Comput Biol. 2017;13(3):e1005399. doi: 10.1371/journal.pcbi.1005399. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Human Genetics are provided here courtesy of Springer

RESOURCES