Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2020 Dec 4;22(3):137–153. doi: 10.1038/s41576-020-00297-6

Host genetics and infectious disease: new tools, insights and translational opportunities

Andrew J Kwok 1, Alex Mentzer 1,2, Julian C Knight 1,
PMCID: PMC7716795  PMID: 33277640

Abstract

Understanding how human genetics influence infectious disease susceptibility offers the opportunity for new insights into pathogenesis, potential drug targets, risk stratification, response to therapy and vaccination. As new infectious diseases continue to emerge, together with growing levels of antimicrobial resistance and an increasing awareness of substantial differences between populations in genetic associations, the need for such work is expanding. In this Review, we illustrate how our understanding of the host–pathogen relationship is advancing through holistic approaches, describing current strategies to investigate the role of host genetic variation in established and emerging infections, including COVID-19, the need for wider application to diverse global populations mirroring the burden of disease, the impact of pathogen and vector genetic diversity and a broad array of immune and inflammation phenotypes that can be mapped as traits in health and disease. Insights from study of inborn errors of immunity and multi-omics profiling together with developments in analytical methods are further advancing our knowledge of this important area.

Subject terms: Immunogenetics, Pathogens, Disease genetics, Evolutionary genetics, Genome-wide association studies


Infectious diseases are an ever-present global threat. In this Review, Kwok, Mentzer and Knight discuss our latest understanding of how human genetics influence susceptibility to disease. Furthermore, they discuss emerging progress in the interplay between host and pathogen genetics, molecular responses to infection and vaccination, and opportunities to bring these aspects together for rapid responses to emerging diseases such as COVID-19.

Introduction

Disease syndromes caused by infectious agents have occurred throughout the history of modern humans1. As a result of our continued interactions with pathogens, our genomes have been shaped through processes of co-evolution, with pathogen-imposed selection pressures leading to selection signatures in ancient and modern human genomes2. As one of several illustrative examples, genetic diversity involving human red blood cell structure and function is being impacted by an evolutionary arms race with malaria that is reciprocally seen in the parasite genome3. In the era of modern medicine with antimicrobial drugs and powerful organ support systems, we are curbing the selection pressure exerted through the traditional high mortality associated with many of these infectious agents in resource-rich settings, but the worldwide health-care burden of infectious diseases remains substantial4. This is exemplified by the emergence of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)5,6 and the rapid evolution into a global COVID-19 pandemic requiring coordinated global responses. Additionally, the ongoing and dynamic challenges of managing malaria, tuberculosis (TB) and human immunodeficiency virus (HIV) infection alongside the constant threat of global influenza and other outbreaks such as Ebola virus disease, all particularly afflicting underserved populations, remind us of the importance of advancing infectious disease medicine.

A seminal study of adoptees in the 1980s reported increased risk of death from infectious disease in children whose biological parents succumbed to an infectious disease7, highlighting the significance of human genetic background in susceptibility to infections. As genetic technology has developed, mirroring other disease syndromes, a raft of genetic loci influencing susceptibility to infectious diseases have been discovered through genome-wide association studies (GWAS), including for HIV infection, TB, hepatitis and malaria8 (Table 1). Such studies aim to identify common variants on the basis of a polygenic model of complex multifactorial traits. However, it has become clear that, although human genetics play a role in disease susceptibility, in contrast to other syndromes, GWAS of case–control design may not be the most powerful method to tease out complex host–pathogen relationships. Alongside developments in genotyping technologies enabling GWAS, the in-depth understanding of single-gene disorders facilitated by next-generation sequencing technologies has enabled us to better understand the functional mechanisms of host defence to infection through experiments of nature9 resulting in loss of function (LOF) or, in increasingly recognized instances, gain of function (GOF) of discrete genes (Fig. 1). Here, rare mutations of variable penetrance result in single-gene inborn errors of immunity and lead to life-threatening infectious disease (Table 2). More recently, the development and availability of multi-omics experimental techniques promise to deliver similar mechanistic understanding of the functional importance of genome-level variation at genomic loci implicated in disease pathogenesis.

Table 1.

Overview of genome-wide association studies involving major infectious diseases

Reported trait/phenotype Population Sample size Replication cohort? SNP Purported gene/implicated gene P Odds ratio Refs
HIV infection
Set point viral load European 2,362 Yes rs9264942 HLA-C 5.9 × 10−32 NA 164,165
European 2,362 Yes rs2395029 HLA-B*5701, HCP5 4.5 × 10−35 NA 164,165
Set point viral load (controllers versus progressors) European 1,712 Yes rs9264942 HLA-C 2.8 × 10−35 (univariate regression model) 2.9 166
European 1,712 Yes rs4418214 MICA 1.4 × 10−34 (univariate regression model) 4.4 166
African 1,233 No rs2523608 HLA-B 8.9 × 10−20 (univariate regression model) 2.6 166
African and Hispanics 1,233,677 No rs2523590 HLA-B 1.7 × 10−13, 8.3 × 10−8 2.4 166
Susceptibility African Americans and European Americans 3,136 Yes rs4878712 FRMPD1 4 × 10−7 NA 167
Viral load Chinese 538 No rs2442719 HLA-B 7.85 × 10−7 NA 168
HIV-1 acquisition European 13,581 No NA CCR5Δ32 5 × 10−9 0.2 169
Viral load European 6,315 No rs59440261 HCP5, MICA 2.0 × 10−83 NA 170
European 6,315 No rs1015164 CCRL2 1.5 × 10−19 NA 170
Black, Hispanic, Asian, white 2,440 No rs57989216 AL671883.3 2.6 × 10−13 NA 171
Tuberculosis
Resistance to tuberculosis in HIV-positive individuals (negative tuberculin skin test reactivity) African 469 No rs877356 IL9 1.22 × 10−8 0.2671 172
Susceptibility European 11,137 Yes rs4733781 ASAP1 4 × 10−12 NA 34
Early progression to active tuberculosis Peruvian (mixed ancestry) 4,002 No rs73226617 ATP1B3 3.93 × 10−8 1.18 173
Susceptibility East Asian 2,053 Yes rs12437118 ESRRB 1.72 × 10−11 1.277 33
East Asian 2,053 Yes rs6114027 TGM6  2.37 × 10−11 1.339 33
East Asian (Chinese) 2,509 Yes rs4240897 MFN2 1.41 × 10−11 0.79 174
Resistance African (Ghanaian) 3,176 Yes rs2057178 WT1 2.63 × 10−9 0.77 175
Early-onset tuberculosis (<25 years of age) Middle Eastern (Moroccan) 558 Yes rs916943 AGMO 2 × 10−6 2.73 176
Malaria
Severe Plasmodium falciparum malaria African 7,908 Yes rs10900585 ATP2B4 6.1 × 10−9 0.65 177
African 7,908 Yes rs2334880 MARVELD3 3.9 × 10−8 1.24 177
African 7,908 Yes rs8176719 ABO 4.0 × 10−21 1.73 177
African 7,908 Yes rs372091 HBB 1.1 × 10−14 0.44 177
African 25,904 Yes rs184895969 Glycophorin genes 9.5 × 10−11 0.67 44
Leprosy
Susceptibility East Asian (Chinese) 24,330 Yes rs2221593 BATF3 3.09 × 10−8 1.15 178
East Asian (Chinese) 24,330 Yes rs77061563 CIITA 6.23 × 10−15 0.84 178
East Asian (Chinese) 24,330 Yes rs663743 CCDC88B 8.84 × 10−14 1.24 178
East Asian (Chinese) 23,766 Yes rs6807915 SYN2 1.94 × 10−8 0.89 36
East Asian (Chinese) 23,766 Yes rs4720118 BBS9 3.85 × 10−10 1.16 36
East Asian (Chinese) 23,766 Yes rs55894533 CTSB 5.07 × 10−11 1.15 36
East Asian (Chinese) 23,766 Yes rs10100465 MED30 2.85 × 10−11 0.85 36
Hepatitis B
Chronic HBV infection East Asian 1,888 Yes rs7453920 HLA-DQB2 4.93 × 10−37 0.53 179
East Asian 1,888 Yes rs3130542 HLA-C 9.49 × 10−14 1.33 179
East Asian 1,888 Yes rs4821116 UBE2L3 1.71 × 10−12 0.82 179
Persistent HBV infection East Asian 2,308 Yes rs7000921 INTS10 3.2 × 10−12 0.78 180
Hepatitis C
Chronic HCV infection East Asian (Japanese) 36,112 Yes rs9275572 HLA-DQB1 3.59 × 10−16 0.79 181
Mixed 2,401 Yes rs12979860 IL-28B 2.17 × 10−30 0.45 50
Mixed 2,401 Yes rs4273729 HLA-DQB1 1.71 × 10−16 0.59 50
Spontaneous clearance of HCV Mixed 4,423 Yes rs74597329 IFNL4IFNL3 5.99 × 10−50 2.14 51
Mixed 4,423 Yes rs2647006 HLA-DQB1, HLA-DQA2 1.15 × 10–21 1.71 51
Mixed 4,423 Yes rs1754257 GPR158 1.8 × 10−7 1.06 51
Dengue fever
Dengue shock syndrome Vietnamese 8,697 Yes rs3132468 MICB 4.4 × 10−11 1.34 182
Vietnamese 8,697 Yes rs3765524 PLCE1 3.1 × 10−10 0.8 182
Non-typhoidal Salmonella infection
Non-typhoidal Salmonella bacteraemia African 3,591 Yes rs13390936 STAT4 8.62 × 10–10 7.61 125
Typhoid
Enteric fever Asian (Vietnamese and Nepalese) 4,243 Yes rs7765379 HLA-DRB1HLA-DQB1 2.29 × 10−13 0.22 183
Rheumatic heart disease (after Streptococcus pyogenes infection)
Susceptibility Oceanian (Polynesians, South Asians and mixed) 2,852 No rs11846409 IGHV4-61*02 3.6 × 10−9 1.43 67

Data sourced from the US National Human Genome Research Institute–European Bioinformatics Institute GWAS Catalog. The table includes most recent studies (2015 onwards), as well as previous definitive studies in the field of infectious disease. HBV, hepatitis B virus; HCV, hepatits C, virus; NA, not available; SNP, single-nucleotide polymorphism.

Fig. 1. Signalling pathways crucial to the immune response and consequences of inborn errors of immunity for infectious disease.

Fig. 1

Examples of specific proteins are shown (highlighted in colour), which when present as mutants give rise to monogenic inborn errors of immunity, with the main infectious disease phenotypes noted. a | Pattern recognition receptors (PRRs) responsible for detecting pathogen-associated molecular patterns (PAMPs) and damage-associated molecular patterns (DAMPs) with examples of PRR pathways illustrated for retinoic acid-inducible gene I protein (RIG-I)-like receptor (RLR) and Toll-like receptor (TLR). b | Receptor-interacting protein kinase (RIPK) signalling. RIPK1 and RIPK3 regulate inflammation and cell death (necroptosis). c | Interferon pathways. Type I interferons (for example, interferon-α (IFNα) and IFNβ) and type II interferons (for example, IFNγ) regulate the immune response to viral and bacterial infections. d | Antigen presentation pathways. Major histocompatibility (MHC) class I molecules present antigens (derived from intracellular proteins such as viruses and some bacteria) to cytotoxic CD8+ T cells via the endogenous pathway (left). MHC class II molecules present antigens from bacteria, parasites and other extracellular pathogens endocytosed into antigen-presenting cells to CD4+ T cells via the exogenous pathway (right). CARD, caspase activation and recruitment domain; CLIP, class II-associated invariant chain peptide; CMV, cytomegalovirus; CTD, carboxy-terminal domain; ERK, extracellular signal-regulated kinase; GAS, gamma-activated sequence; HSV, herpes simplex virus; IFNAR, interferon-α/β receptor; IKK, IκB kinase; IRAK, interleukin-1 receptor-associated kinase; IRF, interferon response factor; ISGs, interferon-stimulated genes; ISRE, interferon-stimulated response element; JNK, JUN amino-terminal kinase; MAPK, mitogen-activated protein kinase; MAPKKK, mitogen-activated protein kinase kinase kinase; NF-κB, nuclear factor-κB; TAB, transforming growth factor-β-activated kinase 1 (MAP3K7)-binding protein; TAK, transforming growth factor-β-activated kinase; TAP, transporter associated with antigen processing; TCR, T cell receptor; TIRAP, Toll–interleukin-1 receptor domain-containing adaptor protein; TRAFs, tumour necrosis factor receptor-associated factors; TRAM, Toll–interleukin-1 receptor domain-containing adapter-inducing interferon-β (TRIF)-related adaptor molecule; TRIF, Toll–interleukin-1 receptor domain-containing adapter-inducing interferon-β.

Table 2.

Inborn errors of immunity including primary immunodeficiency disorders and Mendelian/monogenic infection susceptibilities

Component of immune system affected Example syndromes Example genes involved Inheritance Infectious disease phenotype Refs
Examples of primary immunodeficiency disorders
Combined humoral and cellular immunity deficiencies Severe combined immunodeficiency RAG1, RAG2, ADA, JAK3, PTPRC AR, X-linked Oral candidiasis, Pneumocystis jirovecii pneumonia, diarrhoea before 6 months 184
Ataxia telangiectasia ATM AR Recurrent sinopulmonary infections 185
Cellular immunity deficiencies ZAP70 deficiency ZAP70 AR Common and opportunistic infections (including live-virus vaccine strains), Pneumocystis jirovecii infections 186
Humoral immunity deficiencies Common variable immunodeficiency TACI, ICOS, BAFFR, CD19, CD20 AD, AR Recurrent sinopulmonary infections, bronchiectasis 187
X-linked agammaglobulinaemia BTK X-linked

Recurrent sinopulmonary and skin infections during infancy

Persistent central nervous system infections resulting from live attenuated oral polio vaccine, echoviruses or coxsackieviruses

Increased risk of infectious arthritis, bronchiectasis

188
Hyper-IgM syndrome AID, UNG, CD40, CD40L AR, X-linked Similar to X-linked agammaglobulinaemia (recurrent pyogenic bacterial sinopulmonary infections) but greater frequency of Pneumocystis jirovecii pneumonia, cryptosporidiosis 189
Phagocytic cell defects Chronic granulomatous disease CYBA, NCF1, NCF2, NCF4 X-linked, AR

Granulomatous lesions, recurrent abscesses

Infections by catalase-producing organisms (e.g. Staphylococcus aureus, Escherichia coli, Serratia, Klebsiella and Pseudomonas spp.)

Aspergillus infections (leading cause of death)

190
Leukocyte adhesion deficiency ITGB2, SLC35C1, FERMT3 AR Soft-tissue infections, periodontitis, no formation of pus 190
Complement deficiencies C3 deficiency C3 AR Recurrent pyogenic infections with encapsulated bacteria beginning from birth 191
C5–C9 deficiency C5, C6, C7, C8, C9 AR Recurrent and disseminated Neisseria infections 191
Examples of monogenic inborn errors of immunity with high/complete penetrancea
Viral restriction (keratinocyte) Epidermodysplasia verruciformis TMC6, TMC8, C1B1 AR Human papillomaviruses infecting keratinocytes causing warts and non-melanoma skin cancer 9
IFNγ immunity (innate and adaptive lymphocytes) Mendelian susceptibility to mycobacterial disease IFNGR1, ISG15, TYK2, IRF8, SPPL2A AR, AD Severe bacillus Calmette–Guérin (BCG; a live attenuated strain of Mycobacterium bovis) or environmental non-tuberculous mycobacterial infection 91
Lymphocytes X-lymphoproliferative disease SH2D1A X-linked Haemophagocytosis during the course of Epstein–Barr virus infection, or hypogammaglobulinaemia or B cell lymphoma 192
Lymphocytes Chronic mucocutaneous candidiasis CARD9, IL17RA, STAT1, STAT3, IL17RC, IL17F, ACT1, JNK1, RORC, ZNF341 AD, AR Cutaneous and mucosal lesions due to Candida albicans 193
Phagocytes Invasive dermatophytic disease CARD9 AR Dermatophytes causing cutaneous and rarely invasive disease 194
Examples of monogenic inborn errors of immunity with low penetranceb
TLR3-mediated interferon-dependent immunity Herpes simplex encephalitis TLR3 AD Encephalitis due to herpes simplex virus 74
TLR pathway Invasive pneumococcal disease NEMO, IRAK4, MYD88 X-linked Invasive disease due to Streptococcus pneumoniae 7578
Interferon-dependent immunity Severe influenza pneumonitis IRF7 X-linked Severe lung infection due to influenza virus 87

This table lists inborn errors of immunity with genes and associated infectious disease phenotypes indicated. The classification for inborn errors of immunity underlying severe infectious disease follows that proposed by Casanova and Abel9. Primary immunodeficiencies (PIDs) comprise of a group of usually rare disorders involving partial or full dysfunction of one or more components of the immune system. The wide spectrum of PIDs leads to differing susceptibilities to infectious diseases depending on the part of the immune system affected. PIDs are associated with overt immunological abnormalities and have high/complete penetrance; they are often associated with other clinical phenotypes, such as autoimmunity, autoinflammation, cancer and allergy. Examples of other monogenic inborn errors of immunity are shown, which typically disrupt host defence against one or a few infectious agents, differentiated by penetrance. AD, autosomal dominant; AR, autosomal recessive; IFNγ, interferon-γ; TLR, Toll-like receptor. aHigh-penetrance/complete-penetrance inborn errors of immunity (‘Mendelian infections’) are usually familial. bLow-penetrance inborn errors of immunity are typically sporadic and rare.

This Review aims to summarize the insights gained from the wide range of -omics approaches being used to understand genetic susceptibility to infectious disease. We begin by discussing recent findings and limitations of GWAS applied to infectious diseases, in particular the need for application in diverse global populations. We focus on lessons learnt from different infections and on a region of the genome consistently linked to infectious disease susceptibility: the major histocompatibility (MHC) locus encoding the vital immune human leukocyte antigen (HLA) genes (Box 1). We then proceed to discuss novel approaches, many of which involve large-scale sequencing, and highlight the importance of monogenic architecture leading to infectious disease susceptibility, although only selected examples will be covered as this topic was reviewed comprehensively recently9. This is followed by exploration of the potential of future integrated analyses of human–pathogen variation and application of multi-omics platforms. We postulate how these findings may inform our understanding of the genetic determinants of infectious disease susceptibility, how this knowledge may in turn improve our understanding of individual immune and vaccine responses, and how this might be directed more universally for the benefit of human health. Finally, we discuss strategic approaches to dissecting host genetic factors in COVID-19 as a case study in how such work is being implemented as part of global efforts to address this emerging public health threat.

Box 1 The MHC and HLA genes.

The major histocompatibility complex (MHC) is a locus on chromosome band 6p21, which in humans includes the classical human leukocyte antigen (HLA) genes that encode a group of proteins responsible for presenting antigens to T cells, the MHC molecules. Split into three regions (classes I, II and III), the locus includes genes for well-characterized proteins involved in antigen processing and presentation as well as a diverse range of other immune functions. The classical and highly polymorphic HLA-A, HLA-B and HLA-C genes and non-classical and less polymorphic HLA-E, HLA-F and HLA-G genes are located in the class I region. The class II region includes HLA-DPA1, HLA-DPB1, HLA-DQA1, HLA-DQA2, HLA-DQB1, HLA-DQB2, HLA-DRA, HLA-DRB1, HLA-DRB2, HLA-DRB3, HLA-DRB4 and HLA-DRB5 genes. Genes involved in other aspects of immunity are found in the class III region, such as those for complement components. Although the primary function of both class I and class II molecules is to bind and present peptides to T cells, class I molecules also interact with killer immunoglobulin-like receptors (KIRs) and leukocyte immunoglobulin-like receptor (LILRs) on natural killer cells and monocytes, respectively195.

This intimate relationship with immune function coincides with the fact that the MHC is the genomic locus with the most associations with disease14. However, the high polymorphism, homology and gene density and the strong linkage disequilibrium in the locus make the contributions and causal mechanisms of individual loci difficult to determine196,197. The concept of ‘polymorphic frozen blocks’ arose from the substantial linkage disequilibrium and may be explained by genetic mechanisms — for example, insufficient time for recombination owing to isolated populations in recent history/lower meiotic pairing of highly variable genomic regions — or it may result from preservation of coordinated sets of alleles given the likely epistasis of the genes.

The extensive diversity of class I and class II alleles is likely to reflect selection pressures from infectious diseases, with both heterozygote advantage and frequency-dependent selection having been proposed to lead to population HLA diversity198,199. However, one interesting aspect of HLA function that provides an alternative explanation is the interaction between maternal KIRs and fetal HLA-C molecules, which influence reproductive fitness, which may drive HLA diversity195,200.

Common-variant associations: HLA and beyond

For major infectious diseases, including HIV-1 infection, TB, malaria, leprosy and viral hepatitis, genetic evidence has been accrued primarily through GWAS, aiming to identify disease-associated common variants in populations on the basis of a polygenic model of complex multifactorial traits. Recent work involving these traits has highlighted the progress and challenges that continue to be associated with mapping and functionally interrogating genetic associations through GWAS.

Genetic factors are hypothesized to contribute to the observed heterogeneity in response to infectious disease, and twin studies are an important source of information to estimate heritability while recognizing the risks of overestimation10. It is also important to note that while ongoing work investigating rare and structural genomic variants may help further address the ‘missing heritability’ conundrum — the disparity between overall trait heritability and observed effects through GWAS — this has been reconciled mostly by the recognition that genetic predisposition in most common traits is spread and shared across large numbers of variants (in the thousands), each with a typically modest effect11. Understanding of the genetic architecture of disease predisposition continues to develop, with an omnigenic model for complex traits proposing genetic contribution from both core genes with strong genetic evidence and very many networked peripheral genes of very low effect size involving regulatory circuits12,13.

Of particular interest in infectious disease GWAS has been the role of genetic variation at the locus encoding HLA genes in determining the individual response to infection, given the role of encoded MHC molecules in binding and presenting antigens and thus determining which antigens are ‘seen’ by the immune system (Box 1). However, the extreme polymorphism of this region, high gene density, complex linkage disequilibrium and regions of homology make fine mapping genetic associations involving HLA very challenging14.

Across the genome, a substantial majority of genetic variants identified by GWAS are located in non-coding regions of the genome, requiring complementary functional genomic annotations to link such variants to modulation of individual genes or pathways. This is key to understanding the functional basis of such associations, in particular the specific modulated genes or pathways that may represent potential drug targets. Knowledge of network connectivity substantially enhances the ability to then identify nodal genes that may be the best therapeutic targets15. Prior genetic evidence for targets substantially increases the success rate in late-stage clinical trials16. As well as supporting the development of new drugs, it provides important opportunities to support repurposing existing drugs where a genetic association with a target is shared across traits, indicating new indications for such approved drugs17.

Genetics can also enable analytical approaches such as Mendelian randomization that leverage genetic variation as instrumental variables to establish causal relationships between non-genetic (modifiable) risk factors and disease18. A further example of translational relevance is the opportunity for genetic evidence to potentially stratify risk; for example, to identify high-risk individuals for whom early or pre-emptive intervention may be appropriate. Aggregated genetic risk through polygenic risk scores and consideration of large-effect rare variants are gaining traction as useful clinical tools19, with reports, for example, of application to dengue fever20.

HIV-1 infection

The two genomic regions most consistently associated with HIV infection outcomes are the MHC21 and the CCR5 locus (CCR5 is a coreceptor for the virus)2224. Arguably, HIV-1 infection is the disease with the greatest insights into HLA–disease association pathogenesis, including seminal early advances supporting the hypothesis of ‘heterozygous advantage’, with individuals with greater heterozygosity of class I alleles (HLA-A, HLA-B and HLA-C) having delayed progression to AIDS25. A more recent discovery has demonstrated that differential expression of HLA-A contributes to disease progression through HLA-E-mediated effects, implicating the interaction between HLA-E and the NKG2A natural killer cell inhibitory receptor as a therapeutic target26. Both HLA-B and HLA-C have been consistently implicated in HIV control, but strong coinheritance or linkage disequilibrium in the MHC region has made disentangling the contributions of each difficult.

The repeated identification of HLA associations in HIV-1 GWAS has contributed, in part, to a catalogue of work focusing on the killer immunoglobulin-like receptor (KIR) genes that encode a family of highly polymorphic activating and inhibitory receptors expressed mainly by natural killer cells that interact with HLA. There has been a surprising lack of KIR gene associations in GWAS to date, which may be due to a multitude of reasons, including substantial diversification of KIR gene haplotypes, incomplete knowledge of the nature and diversity of such haplotypes across populations, and poor coverage of current genotyping arrays for this region, making it difficult to pinpoint specific genetic variants across populations. Thus, this region could represent an archetypal complex region that we have yet to accurately tag in even the most comprehensive of modern GWAS. Moreover, the effects of KIR gene variants may be detected only in the presence of their specific HLA ligand. As evidence for this effect, KIR3DL1 allotypes expressed highly in combination with specific HLA-B alleles (Bw4) were found to slow disease progression when compared with a low-expression allotype or lack of the Bw4 allele27. By contrast, homozygosity for the CCR5Δ32 mutation has been the most robustly confirmed genetic background for protection against HIV28, and antisense long non-coding RNA CCR5AS, which affects CCR5 mRNA stability and cell surface expression, has recently been shown to influence HIV infection susceptibility29. Furthermore, knowledge of the CCR5 pathway has led to the successful development of a class of antiviral agents targeting the entry of HIV-1 into target host cells.

Tuberculosis

Despite some of the earliest evidence for TB susceptibility being driven by genetic heterogeneity30,31, a consistent genetic signal of association in any one locus has yet to be consistently demonstrated. The largest study to date was undertaken in a combined sequencing and genotyping effort including 8,162 cases of combined presentations of TB in the Icelandic population32. This study identified variants in the MHC class II region as being associated with TB susceptibility, perhaps emphasizing the need to undertake GWAS in highly homogeneous populations when tackling ancient pathogens such as Mycobacterium tuberculosis. By contrast, other GWAS applied to TB of varied organ involvement in diverse populations have demonstrated association of pulmonary TB with variants in ESRRB and TGM6 in a Chinese population33 and ASAP1 in a Russian population34, although the biological basis by which these genes mediate an effect in TB is not always clear. For example, TB mouse models with CRISPR–Cas9 deletion of TGM6 lead to increased cytokine transcripts in the lungs, greater tissue damage and higher bacterial burden, but whether the cytokine levels are directly due to altered TGM6 levels is yet to be established.

The lack of consistent signals in the TB GWAS highlight the enormous challenges in undertaking multinational studies, where the observations may be attributed to differences in populations because of population structure either at the host level or the pathogen level or may be owing to technical differences, such as the genotyping array used. A meta-analysis of these studies that might help explain the observed incompatibilities and may inform future research in the area is eagerly awaited.

Leprosy

By contrast, many genes have been identified through GWAS for susceptibility to leprosy due to the related species Mycobacterium leprae35 that unsurprisingly implicate the immune system, including, most notably, the HLA-DR region36. Whereas a GWAS in an Indian cohort found HLA-DR, HLA-DQ and TLR1 (encoding Toll-like receptor 1) to be associated with susceptibility to leprosy37, a further GWAS of Chinese patients failed to replicate the TLR1 association despite adequate power, implicating population-specific effects38. This is again in line with the hypothesis of the genetic variation responsible for variability in the infectious disease trait (human genetic architecture) being subjected to selection pressure from pathogens, as different populations would have been exposed to various infective agents. The large size of the case–control cohorts has enabled the identification and downstream functional characterization of some of the non-HLA loci; for example, for NOD2, where the single-nucleotide polymorphism (SNP) rs1981760 is associated with differential NOD2 gene expression by being an expression quantitative trait locus (eQTL) in neutrophils39. The SNP acts by affecting binding of the transcription factor STAT3 and influencing neutrophil interferon-β (IFNβ) responses39, an effect that has been also been reported in TB40.

Malaria

This parasitic infection is perhaps the most famous example of host genetics influencing infectious disease susceptibility, with haemoglobinopathies — such as sickle haemoglobin (HbS), thalassaemia and glucose 6-phosphate dehydrogenase (G6PD) deficiency — long being associated with protection against malaria. As one example of an underlying biological mechanism, haemoglobin oxidation products in HbS affect the ability of the Plasmodium falciparum parasite to hijack red blood cell cytoskeletal remodelling41. Balancing selection, illustrated by the heterozygote advantage conferred by HbS, provides an important mechanism for maintaining advantageous genetic diversity in human populations42.

Malaria is an example where modern GWAS have confirmed several loci previously identified largely by candidate gene or protein studies, albeit still not most of them; most notably, despite many early observations of associations with HLA, GWAS projects have not found equivalent evidence. The Malaria Genomic Epidemiology Network (MalariaGEN)43 has successfully identified or confirmed genetic loci associated with severe Plasmodium falciparum malaria across Africa, Asia and Australasia, including HBB (β-globin gene), ABO (gene for glycosyltransferase enzyme that determines ABO blood groups), ATP2B4 (gene for a calcium transporter on the erythrocyte membrane), G6PD (activity levels of the enzyme affect oxidative stress in red blood cells) and CD40LG (a key gene in B cell immune responses). A notable finding was that among both different phenotypes and different populations, the heterogeneity of effect of risk alleles was substantial, possibly explaining the inability to replicate some of the previous genetic findings, together with pathogen diversity. The consortium further identified a locus neighbouring glycophorin genes (encoding erythrocyte cell surface receptors that allow Plasmodium falciparum invasion)44 to be implicated in malaria susceptibility. This finding highlights the possibility that loci conferring resistance to malaria involve polymorphisms with features of ancient balancing selection based on haplotype sharing between humans and chimpanzees45, in keeping with the hypothesis of a protracted evolutionary arms race between host and pathogen leading to the host and pathogen genetic diversity that we see today. The link between glycophorin genes and malaria resistance was further elucidated in a study using next-generation sequencing data and reference haplotypes that identified a complex structural rearrangement involving GYPA and GYPB gene copy number that reduced the risk of severe malaria by 40% and that explained the previous association signal46.

Viral hepatitis

Whereas three-quarters of patients with acute hepatitis C virus (HCV) infections eventually reach chronicity, the remaining quarter clear the virus spontaneously. As a key example of a strong GWAS signal not related to HLA with major translational impact, polymorphisms in the gene encoding IFNλ3 (IFNL3; formerly known as IL28B) have been associated with resolution of HCV infection47 and with differences in response to HCV drug treatment48 (viral clearance following treatment with pegylated interferon and ribavirin) in individuals of both European and African ancestry, enabling development of a precision medicine approach to therapy49 (Fig. 2). HLA class II associations with spontaneous HCV clearance have been reported by GWAS50. In a separate GWAS recently of African, European and Hispanic populations, HLA class II signals were replicated, while loci around the IFNλ genes and GPR158 were also implicated in spontaneous viral clearance51.

Fig. 2. Precision medicine approaches in infectious disease informed by human genetics.

Fig. 2

Examples of how understanding of human genetic information can be or has begun to be leveraged for improved patient care. Strategies include identifying specific molecular targets based on genetic understanding, stratifying patients to decide on use of certain drugs and using genetic knowledge to predict severe adverse reactions to medications. GWAS, genome-wide association studies; HCV, hepatitis C virus; HIV, human immunodeficiency virus; JAK, Janus kinase.

Novel approaches and technologies

Human population diversity

Lessons learnt in infectious disease susceptibility GWAS to date highlight several principles for increasing the chances of success. A key emerging area is the extent of population-specific associations genome-wide (Table 1). This in turn emphasizes the relative paucity of GWAS undertaken in diverse worldwide populations, many of which have the highest burden of infectious disease. Greater efforts will be needed to focus GWAS and other genetic studies on such populations, especially where ethnic background may be associated with differential risk52. It is thus imperative that genotyping arrays are designed for worldwide populations and that the availability of large imputation reference panels from diverse populations enables effective statistical inference of additional non-genotyped SNPs (and hence substantially increasing the informativeness of GWAS at no additional cost)5355. New technologies such as those based on low-coverage whole-genome sequencing may provide an alternative cost-effective approach for genotyping common and rarer genetic variants56.

As is apparent from the studies reported so far herein, infections can afflict populations limited by geography and local ecology, and it is therefore crucial that genetic associations are tested for in non-European cohorts. However, such analyses come with technical challenges owing to the need to adjust the data for potential confounding by differences in population structure or relatedness that may cause an excess of type I errors. These aspects have been appreciated for some time, and many methods have been developed to facilitate such adjustments. One method is principal component analysis (a mathematical method to summarize the main sources of variance in data) to model ancestry differences and account for differing allele frequencies in different populations57. A further development is in statistical modelling using linear mixed model algorithms that can account for population structure and cryptic relatedness simultaneously58.

The Population Architecture Using Genomics and Epidemiology (PAGE) Consortium has demonstrated both the power and the importance in accounting for multi-ethnic and diverse populations in various non-infectious disease and physical trait settings; a GWAS conducted by the consortium demonstrated that in different populations, the effect size of implicated loci differed, exemplifying the need to account for population structure59. Any initiatives planning to develop personalized medicine in infectious diseases based on GWAS will therefore have to depend on the availability of genetic studies that are commensurately precise in a population context and that include an appreciation of the genomic complexity of regions such as the MHC, KIR and immunoglobulin heavy chain (IGH) regions.

Leveraging self-report

The maturation of methods for undertaking GWAS has been paralleled by improvements in methods for recruiting and testing associations. A key study from the personal genetic provider 23andMe used self-report to assess the influence of host genetic background on susceptibility to a range of common infectious diseases of low mortality (for example, common cold, rubella and cold sores)60. Including more than 200,000 individuals of European descent, this study reported many genetic loci related to the immune system and embryonic development, including genes both within and outside the MHC. Despite the large number of associations identified, the use of surveys requiring retrospective recall to identify cases will require follow-up replication and validation. A similar method was used for whooping cough, where self-report of the characteristic cough in childhood was associated with variation across the MHC61.

Phenome-wide association studies

Another development in recent years has been to establish the phenome-wide association study (PheWAS) approach. This has been facilitated by the use of electronic health records for clinical phenotyping. Instead of taking a single disease phenotype and scanning genome-wide as in GWAS, this approach takes a number of genetic loci (usually based on a priori biological knowledge) and looks for associations with any of a large number of phenotypes. Although revealing for many immune-related diseases, it has had limited use thus far in the field of infectious diseases. A PheWAS on HLA predictably highlighted autoimmune diseases, but found only a relatively small number of associations with infectious diseases62. Given the known key role of HLA genes in infectious diseases, it is likely that this was due to a lack of statistical power or challenges in defining cases from controls using hospital data.

Nevertheless, one example of a PheWAS in infectious disease genetics identified an association between the rs3211783 SNP in the factor X gene, encoding an enzyme of the coagulation cascade, with infection-related phenotypes63. Recent evidence for the role of factor X in antibacterial immunity has emerged64, and more studies will be needed to confirm this role, but the study provides an example of what may come in the field of infectious disease genetics through future PheWAS.

Genome sequencing and large-scale bioresources

Standard GWAS and imputation methods are unable to test the effects of rare, large-effect variants, as these variants are not covered by genotyping arrays and are not common enough in haplotype references. To overcome this challenge, direct whole-genome sequencing (WGS) or whole-exome sequencing (WES) approaches have been used. In one example, WES was first performed to identify rare alleles that are relevant in West Nile neuroinvasive disease, enabling subsequent imputation of the rare alleles into a primary cohort. The findings were then extended to larger cohorts for genome-wide association analysis. Loci revealed as being important included HERC5, an interferon-stimulated gene, and an intergenic region between CD83 and JARID2, which included a site for STAT5A transcription factor binding. With very few other such examples in the literature for any trait, it is becoming increasingly clear that the successful application of these methods will require hundreds of thousands to millions of individuals65.

To this end, several population-level bioresources and personal genetic services such as the UK Biobank, the China Kadoorie Biobank and the 100,000 Genomes Project — which are large repositories of samples and data — are enabling the use of novel approaches and enhancing the quality of data analysis using existing methods. In light of early discoveries afforded by the availability of genetic data from these studies66, it will be exciting to see how they may provide further insights into infectious disease as they mature.

Long-read sequencing technologies and highly polymorphic loci

A GWAS of rheumatic heart disease in Pacific Island populations recently identified a signal in the IGH locus67. This disease, which occurs following Streptococcus pyogenes infection, used to have a more widespread worldwide prevalence and is now rather limited to underserved populations across the Pacific. This intriguing finding in the IGH region has long been suspected for many diseases but has been infrequently observed. Studying the IGH region has been challenging owing to its high level of complexity (including substantial copy number variation68) and the sparse coverage of this locus on genotyping arrays. However, with the advent of new long-read sequencing and other technologies, it is likely that we will better capture the extent of IGH diversity and possible disease associations, similarly to what has been achieved for regions such as the MHC and KIR regions69.

New insights from inborn errors of immunity

Another key route to understanding infectious disease susceptibility is through the study of inborn errors of immunity that disrupt host defence against infection. In contrast to more common variants with small to moderate effect sizes identified by GWAS, rare mutations (of variable penetrance) typically result in a large phenotypic effect with substantial infections for the individual that may involve multiple pathogens. Such analysis has been highly informative in revealing mechanisms of infectious disease pathogenesis (Fig. 1; Table 2). Intersection with GWAS is seen: for example, single-gene defects in complement components result in increased susceptibility to meningococcal disease70,71, while GWAS identified SNPs within complement factor H (CFH) and CFH-related protein 3, consolidating evidence for the role of complement in host defence against meningococcal infection72. Application of WGS and WES has now identified more than 400 genetic causes73 of inborn errors of immunity, many of which cause primary immunodeficiencies (PIDs), including genes of known and novel mediators of innate and adaptive immunity (Table 2) mutated in various manners leading to the full range of mechanisms for Mendelian inheritance patterns. One notable advancement in this area has been the recognition of disease-causing genes that do not necessarily lead to overt immunological abnormalities, with differences in penetrance and specificities in clinical infection phenotype among other features9 (Table 2).

The link between certain immune pathways with specific clinical infection phenotypes has been well documented. For example, the TLR1/2/6–Toll–interleukin-1 receptor domain-containing adaptor protein (TIRAP)–nuclear factor-κB (NF-κB) pathway influences invasive pneumococcal disease (Fig. 1a), whereas the TLR3–interferon pathway affects susceptibility to herpes simplex virus encephalitis8,7478 (Fig. 1b,c). However, other studies are highlighting non-canonical immune pathways.

As an example of a PID, recent studies include demonstration of a critical role for RIPK1 in susceptibility to infections. RIPK1 is a key regulator of cell death and survival and, in the case of cell death, mediation of either apoptosis or necroptosis. In a study of a small group of patients with recurrent infections, early-onset inflammatory bowel disease and progressive polyarthritis, WES identified homozygous LOF mutations79, while biallelic mutations in RIPK1 were found in a further series of patients with very early onset inflammatory bowel disease — the patients all had recurrent bacterial and viral infections and in some instances sepsis80.

Haploinsufficient mechanisms have also been identified, for example, involving BACH2 (ref.81), which is key in controlling B and T cell maturation and functional specification. In a case of a PID with recurrent upper respiratory tract infections and early-onset colitis, a heterozygous non-synonymous mutation in BACH2 resulted in reduced ability of the protein to dimerize or localize to the nucleus. On the other hand, GOF mutations leading to infectious disease susceptibility highlight opportunities for precision medicine. In patients with heterozygous GOF mutations in STAT1 — who have fungal (persistent oral candidiasis) and viral (CMV) infection and in one case TB (Fig. 1c) — the resultant excessive CD4+ T cell STAT1 phosphorylation after IFNβ stimulation highlighted the opportunity for therapeutic suppression by the Janus kinase (JAK) inhibitor ruxolitinib82. A separate study of patients affected by STAT1 GOF mutations and chronic mucocutaneous candidiasis showed successful treatment with ruxolitinib83 (Fig. 2).

Chronic granulomatous disease arises owing to the inability of phagocytes to mount a reactive oxygen species burst in response to pathogens and is characterized by severe recurrent bacterial and fungal infections84. This PID is often caused by mutations in the genes encoding the components of the NADPH oxidase complex. A homozygous LOF mutation in CYBC1 was identified as a further cause85, revealing a novel function for CYBC1 as a chaperone for one of the components of the NADPH oxidase complex.

Studies of patients with extremes of response to viral infections have provided mechanistic insights. In a case of recurrent viral infections requiring intensive care admission86, WES revealed a homozygous mutation in IFIH1 (encoding a retinoic acid-inducible gene I protein (RIG-I)-like helicase receptor responsible for sensing viral double-stranded RNA in the cytosol), whereas, in cases of life-threatening influenza, WES identified compound heterozygous mutations in IRF7, which reduces both type I and type III interferon production87. Influenza virus pneumonia may also arise owing to rare inborn errors of immunity involving IRF9 and TLR3 deficiencies88,89 whereas homozygous CCR5-null and FUT2-null alleles protect against CCR5-tropic HIV and norovirus, respectively2224,90.

Mendelian susceptibility to mycobacterial disease can arise owing to mutations in genes encoding proteins involved in the production of, or response to, the cytokine IFNγ. Such mutations have important treatment implications; for example, the deficiency of IFNγ signalling can be ameliorated by therapeutic supplementation with IFNγ or haematopoietic stem cell transplantation, dependent on the degree of functional pathway responsiveness91. WES has revealed a novel cause of Mendelian susceptibility to mycobacterial disease in a case of homozygous splice-site mutations in SPPL2A (required to cleave CD74, the invariant chain of HLA class II); the mutation is proposed to disrupt functional polarization of T cells owing to altered priming by dendritic cells after mycobacterial antigen presentation92 (Fig. 1d).

One hurdle in monogenic disorders is incomplete penetrance. Polymorphism of TIRAP is known to influence susceptibility to invasive pneumococcal disease, TB, malaria and other bacterial infections9396. In a study of a family with TIRAP deficiency, seven of eight individuals with the genetic defect did not have the severe staphylococcal infection that the index case did97. A possible explanation was the lack of antibodies to lipoteichoic acid found in the index patient compared with other family members that may allow monocytes in the unaffected family members to respond to lipoteichoic acid stimulation of TLR2 despite deficiency in TIRAP.

Integrating host and pathogen genetic variation

Although the impact of host genetic variation on susceptibility to infection is often viewed separately from that of pathogen genetics, it is highly likely that a major limitation in identifying novel genetic signals in GWAS applied to common infections will be attributable to heterogeneity at the level of the infectious agent. Approaches attempting to simultaneously interrogate host and pathogen transcriptomic responses (dual RNA sequencing) have been developed and applied to infectious diseases98100; this joint characterization has also occurred at the genetic level to account for variability in both the host genome and the pathogen genome. In a Dutch cohort, a GWAS of both the host and the pathogen in pneumococcal meningitis allowed delineation of the contributions of host genetic background to the susceptibility and severity of disease versus the effects of pneumococcal genetic variation on invasive potential but not disease severity101. The study revealed an intronic SNP in a host ubiquitin-converting enzyme gene, UBE2U, to be significantly associated, with possible interaction with other host genes, including PGM1 (encoding phosphoglucomutase 1) and ROR1 (encoding a tyrosine-protein kinase transmembrane receptor). Similarly, a joint genetic study of HCV infection with human genome-wide genotyping and genome sequencing of HCV genotype 3 infections led to more biological insights, including understanding of reciprocal effects (for example, of human genetic variation in driving viral genome polymorphism)102. Rather than association tests using disease status, this study examined the associations between the host genetic background and variable sites of the viral proteome. Host HLA alleles were found to be associated with HCV amino acids leaving multiple ‘footprints’ in the HCV genome; and, in addition, a specific interaction between the HCV NS5A protein and human IFNL4 (encoding IFNλ4) genotype was found to affect viral load.

Simultaneous evaluation of host and pathogen genetics was also used in HIV datasets to reveal the influence of HLA genes. By considering how a virus mutates throughout the course of its infection, the relevance of the host genetic background can be considered. A recent study analysed a large number of existing HIV datasets, using host HLA class I genotypes and pathogen reverse transcriptase and protease sequences and modelling the processes of within-host evolution of the virus103. Known allele–epitope combinations (for example, escape at codons 173/195 of reverse transcriptase when paired with the HLA-B*51 allele) were replicated, but many epitopes highlighted to be HLA associated were not previously reported. Deeper analysis of viral sequence subtypes showed that differential selection pressures between the same HLA allele (HLA-B*58) and the different viral subtypes occurred at a reverse transcriptase codon that differed between the two subtypes. HLA-B*58 selection pressure may have consolidated the divergence in viral sequence subtypes.

The selective pressure of other pathogens on human populations has been delineated by genome-wide analyses of natural selection. For example, for cholera, targets of positive selection were identified in a population from the Ganges River Delta, supporting strong selective pressure by the pathogen on innate immune pathways, including NF-κB and inflammasome signalling104.

Studies of dual-pronged interrogation of host and infectious agent genetics in malaria have yet to be conducted at a large scale, but there have been projects to characterize heterogeneity in both Plasmodium falciparum, Plasmodium knowlesi and Plasmodium malariae and the malarial vector Anopheles gambiae at genetic and gene expression levels105107. These studies will be informative for drug resistance and vector control; for example, one study highlighted the need to collect vector genomic data to establish which strategies lead to insecticide resistance107.

Several major challenges remain for studies considering both host and pathogen genetic variation. Most notably, the multiple testing burden means that substantially larger sample sizes will be needed in future studies, although an important analytical strategy is dimensionality reduction, which takes advantage of genome-wide correlation between variants101. Additionally, understanding the impact of complex spatio-temporal and ecological scales requires careful acquisition of comprehensive metadata that can be used with genomic data108.

Genetics of immune function phenotypes

In the evaluation of any disease, accounting for phenotype heterogeneity is of paramount importance. The use of various -omics technologies is a strategy to address this (Fig. 3). For example, transcriptomic analysis in sepsis has been able to stratify patients into different disease endotypes regardless of the source of infection109112. Crucially, these endotypes have been shown to have interaction effects with steroid treatment, with evidence that patients of a relatively immunocompetent endotype have increased mortality on being given steroids (Fig. 2). This finding highlights the need for accurate phenotyping in future clinical trials where the effect of treatment may be diluted by heterogeneity in patient cohorts, and such improved phenotyping will also enable more successful application of GWAS.

Fig. 3. Omics and intermediate phenotypes as part of the toolkit for investigating the basis of infectious disease susceptibility.

Fig. 3

a | Traditional case–control genome-wide association study (GWAS) approaches compare allele frequencies of genetic variants in cases versus controls. b | Mendelian disease mapping with pedigree analysis (including case–parent trio analyses) and use of whole-exome or whole-genome sequencing. c | Multi-omics approaches, which enable intermediate phenotypes to be quantified by various -omics technologies. d | Leveraging genetic information to interrogate or leverage intermediate phenotypes. Differences in intermediate phenotypes such as gene expression can be mapped to genetic variation by quantitative trait locus (QTL) mapping. Mendelian randomization methods can use intermediate phenotypes that are risk factors for disease, with genetic variants that affect the intermediate phenotype allocated randomly to allow confounders to also be randomly distributed.

Thus, aside from clinical disease phenotypes, innovative approaches to examining infectious disease genetics have investigated intermediate phenotypes that may be relevant to the disease (Fig. 3). For example differential blood cell types are under substantial genetic influence113. Furthermore, levels of immunoglobulin are also genetically influenced not only by genetic variants in IGH genes and related immunoglobulin genes and the MHC but also by RUNX3 variants altering isoform shifting and FCGR2B variants impacting IgG binding to its receptor114. These approaches are now also being translated in the context of antigen specificity, including the role of genetic variation in agent-specific antibody responses, with a role for MHC identified in various responses, including anti-Epstein–Barr virus and anti-rubella virus IgG levels115,116. It will be interesting to determine what findings these markers of previous exposure may contribute in the context of acute infection risk in the future.

In a different attempt to examine the role of genetic variants in controlling the human immune system, Roederer et al. performed detailed immunophenotyping in 669 female twins117. The top 151 heritable immune traits from extensive flow cytometry were analysed. For example, FCGR2A variants were associated with multiple immune phenotypes such as expression levels of various T cell markers such as CD27 or CD161, and FCGR2A has been associated with HIV infection progression.

The use of microbial stimuli to produce intermediate phenotypes has also led to interesting insights. One study used 528 lymphoblastoid cell lines and subjected them to challenges with eight different kinds of pathogens, selected carefully for their high health-care burden and the wide range of host responses elicited against them118. Genome-wide association tests revealed 17 significant SNPs; for example, a SNP in ZBTB20 (which functions as a transcriptional repressor) was associated with both increased salmonella-induced pyroptosis and chlamydia replication. A PheWAS approach was further used with clinical phenotypes from the eMERGE dataset119, which highlighted the same SNP, this time relevant in viral hepatitis.

Ultimately, the link from gene to biological function requires characterization of candidate loci for mechanistic understanding of genetic contributions. The vast majority of loci found in GWAS reside in non-coding regions, implicating gene regulation in disease pathogenesis. Much effort in recent years has therefore been focused on understanding the genetic basis of various forms of molecular traits (molecular quantitative trait locus (QTL) mapping) (Fig. 3). Such associations are often highly context specific, dependent, for example, on cell type120 and activation state121123, with effects specific to exposure to particular pathogens124, and dependent on the disease state; for example, in patients with sepsis due to community acquired pneumonia109. This characterization of molecular traits can inform GWAS interpretation; for example, in a GWAS of non-typhoidal Salmonella bacteraemia, a SNP acting as an eQTL for STAT4 was found, with its effect reported to be a reduction in IFNγ production by natural killer cells125. Genetic modulators of gene expression may also operate at the level of alternative splicing126, including after stimulation by influenza A virus127, with evidence that genetic regulation of isoform use was mostly distinct from that of gene expression. Immune cell-specific protein QTLs have also been demonstrated128, including effects on expression levels of HLA-DRB1 in specific cell types that may be relevant in diseases such as sepsis, where reduced HLA-DR expression is known to correlate with poor outcome129. A further study mapped cytokine production to genetic variants (cQTL mapping) in response to microbial stimuli, identifying QTLs involving pathways containing pattern recognition receptors, cytokine and complement inhibitors, and the kallikrein system130.

The maturation of technologies has seen multi-omics approaches applied to increasing numbers of diseases to identify and link genomic alterations to biological mechanisms. The infectious disease field has seen recent examples emerge: the combination of the plasma proteome, metabolome and lipidome and peripheral blood mononuclear cell transcriptome in analysing Ebola virus disease pathogenesis is one such example131. Macrophages and neutrophils were found to be particularly relevant cell types. Although this study itself did not tie the findings to the host genetic background, it is nevertheless an important example of the use of multi-omics functional characterization, and it is likely that we will soon be seeing the linking of multi-omics data back to genetic background.

Genetics of responses to vaccinations

As vast amounts of data continue to accrue on the relationship between the human host, its immune responses and infectious agents, the question as to how these data are best used is timely. Arguably, vaccination has offered one of the most effective public health efforts of modern medicine and is the most cost-effective opportunity to substantially reduce the incidence of many important infections, such as malaria, HIV infection and TB. Thus, given the large amount of evidence of human genetics influencing both non-specific and specific immune responses and susceptibility to disease, it is curious that there is still a relative lack of investigations looking at the effect of host genetics on vaccine response132, especially as the study of responses against conserved epitopes offers a means to control for pathogen diversity. Here, we review the state of the field in terms of the genetics of response to vaccination and its potential utility.

The vaccine targeting hepatitis B virus (HBV) remains the most studied, with many genetic associations identified in the MHC region, and there is controversy over whether it is HLA-DR, HLA-DP or class III MHC (involving the C4A gene) that plays independent or linked roles133136. It is clear, however, that while HLA-DRB1*03 and HLA-DRB1*07 are linked to lower antibody responses, the opposite is true of HLA-DRB1*01, HLA-DRB1*13 and HLA-DRB1*15 (refs134,137,138), and the associations between HLA-DP and vaccine response against HBV are robust, as they are for other clinical phenotypes associated with the infection, particularly in Asian populations.

Rubella vaccination has seen similar HLA associations, although evidence is again conflicting. The HLA-DPB1 locus has been associated with rubella vaccination antibody responses139,140, but SNPs in the region have been associated in opposite directions, highlighting the need for further replication and functional studies to identify plausible biological mechanisms. Increased antibody responses to measles vaccination has been associated with the HLA-DQA1*0201 allele141.

Although a twin study has shown that the contribution of HLA genes is relatively small compared with non-HLA genes in antibody-inducing vaccines142, there has been relatively little robust and replicated evidence of what the non-HLA genes may be. A recent two-stage GWAS looking at data from more than 3,600 children revealed both HLA and non-HLA associations on studying a combination of capsular group C meningococcal, Haemophilus influenzae type b and tetanus toxoid vaccines143. This study found SNPs in the locus coding for signal-regulatory proteins SIRPA, SIRPB and SIRPG were associated with greater antibody titres in group C meningococcal vaccines. Notably, the association was present only for serum bactericidal antibodies (functional antibodies as assessed by rabbit/human complement assays, and not with total meningitis C-specific IgG), emphasizing the need for caution about the agent-specific readout used. The study also identified four HLA class II alleles associated with tetanus toxoid vaccine IgG concentrations, with the lead SNP being an eQTL for HLA-DRB1 and HLA-DRB5 (ref.144).

Inborn errors of immunity have been reported to occasionally result in life-threatening disease following administration of live attenuated vaccines, for example, live poliovirus vaccine (vaccine-associated paralytic polio in patients with agammaglobulinaemia145), whereas defects in interferon immunity may result in severe illness following administration of the yellow fever or MMR vaccine (measles strain) (IFNAR1 (ref.146) and IFNAR2 (ref.147), and STAT1 (ref.148) and STAT2 (ref.149) deficiencies, respectively) (Fig. 1c).

Overall, current work highlights that new cohorts and mechanistic studies to further explore the effect of host genetics on vaccine response are needed to address this important area.

Genetic strategies for a new disease: COVID-19

Emerging infections such as SARS-CoV-2 pose a major threat to human health, and the early observation of striking heterogeneity in clinical presentations and outcomes following infection has highlighted the ongoing need to apply genetic and genomic approaches to understand drivers of individual susceptibility, severity and outcomes to this disease. Only a small minority of those infected with SARS-CoV-2 develop severe disease, with recognized risk factors including older age, male sex and co-morbidities, including obesity150. Genetic factors are hypothesized to contribute to the observed heterogeneity in response, and early reports such as from TwinsUK indicate relatively high heritability estimates for self-reported COVID-19 symptoms such as anosmia151. Genetic approaches provide the opportunity to gain novel insights into disease pathogenesis through the identification of specific disease associations involving particular genes or pathways, to define novel drug targets or to develop personalized medicine approaches with early intervention or therapy tailored to the individual. The opportunity to identify repurposing opportunities for approved medications based on shared genetic associations across traits is a particularly important consideration in a pandemic situation such as COVID-19, where, while not based on genetic evidence, the RECOVERY trial has demonstrated the therapeutic utility of existing drugs such as dexamethasone for treatment of severe disease152.

The international response to answering the question of whether human genetics alters susceptibility to or outcome from SARS-CoV-2 infection is an exemplar of modern genomic collaboration. The establishment of large appropriately phenotyped cohorts for GWAS in a pandemic situation has been enabled by prospective collection through existing or hibernating studies, rapid deployment of new studies and leveraging existing population biobank studies with large numbers of already genotyped individuals. Monthly data releases from the UK Biobank of COVID-19 results together with outcome and phenotypic information early in the pandemic highlight the opportunity provided by such population biobanks, while application of meta-analysis methods across cohorts will maximize power for genetic discovery. International collaborative efforts are key to facilitate genetic studies in infectious diseases such as COVID-19. For example, the COVID-19 Host Genetics Initiative153 was rapidly established to support sharing of relevant tools and data with an emphasis on GWAS. The COVID Human Genetic Effort154, by contrast, is focused on identifying monogenic cases with rare, highly penetrant mutations through analysis of young patients (less than 50 years of age) who were previously well and developed life-threatening disease, as well as those naturally resistant despite repeated exposure. Integrative analysis, considering both host and viral genetic variation, will likely be highly informative, although early reports suggest that viral genetic variation did not significantly affect outcomes in COVID-19 (ref.155).

An early exemplar of GWAS during the COVID-19 pandemic is a study by the Severe Covid-19 GWAS Group, who conducted a GWAS of 1,980 patients with severe disease (hospitalized with respiratory failure requiring oxygenation or mechanical ventilation) in Italian and Spanish epicentres of the pandemic versus population controls. The European meta-analysis revealed genome-wide significant associations at 3p21.31 and 9q34.2 (ref.156). The limitations of self-reported data notwithstanding, a large GWAS of more than 1.05 million individuals from the 23andMe research platform replicated the associations at these two loci for disease severity157, while the GenOMICC investigators recruited patients with COVID-19 from intensive care units only and robustly reproduced the 3p21.31 signal in addition to finding associations at other loci, including ones with genes of well-known antiviral function such as 12q24.13 (OAS1, OAS2 and OAS3) and 21q22.1 (IFNAR2)158.

Despite the important achievements, these results exemplify current challenges in GWAS biology. First, the 3p21.31 association, which was also supported by preliminary results from the COVID-19 Host Genetics Initiative, spans multiple genes, including SLC6A20, LZTFL1, CCR9, FYCO1, CXCR6 and XCR1, but will require careful functional genomic and other annotations to establish the causal genes underlying the genomic signal, as non-coding variants may modulate gene expression and other regulatory mechanisms at a distance159. Intriguingly, there is evidence that the risk haplotype, spanning ~50 kb at 3p21.31, was inherited from Neanderthals and shows substantial variation in frequency between populations, being carried by ~50% of people in South Asia, while being almost absent in East Asia, suggesting it may have been affected by selection in the past160. Second, the association at 9q34.2 coincides with the ABO blood group locus, and increased risk with blood group A was shown, but the mechanism for this association remains unclear and will require further study especially as this locus is frequently a site of type I error resulting from population stratification. Despite the apparent lack of significant signal of association at the locus encoding HLA genes in these initial GWAS of COVID-19, there is considerable interest in the importance of such variation in the response to infection, with evidence, for example, of differing capacity between HLA alleles in presentation of highly conserved SARS-CoV-2 peptides to immune cells161.

There is also early evidence from the COVID Human Genetic Effort to support the role of rare variants in the risk of severe disease on the basis of a study of 659 patients (0.1–99 years of age) hospitalized with critical disease due to COVID-19 and requiring mechanical ventilation or organ support in intensive care units162. Following genome and exome sequencing, the study authors tested the hypothesis that inborn errors of TLR3- and IRF7-dependent type I interferon immunity contribute to critical disease risk, analysing 13 genomic regions comprising loci previously shown to be mutated in critical influenza pneumonia and connected loci mutated in patients with other viral illnesses. Significant enrichment of rare variants predicted to result in LOF was found at these loci relative to people with asymptomatic or benign infection, with experimental validation including demonstration of inborn errors at eight loci in up to 23 patients (3.5%) of different ages (17–77 years) and population ancestries. The importance of type I interferons in protective immunity against SARS-CoV-2 was underlined by an accompanying article showing evidence for a high level of neutralizing autoantibodies to type I interferons in other patients with critical COVID-19, together highlighting the opportunities for therapeutic intervention based on screening at-risk individuals and developing targeted interventions163. Indeed, in the GenOMICC study described above, beyond uncovering key potential genetic contributions to COVID-19 severity, the group further used Mendelian randomization to demonstrate a link between low IFNAR2 expression and high TYK2 expression and life-threatening disease, the former further evidencing the importance of type I interferons in COVID-19 pathogenesis158. Altogether, the emerging insights from this fast-moving field of research exemplify the complex nature of the genetic architecture of susceptibility to infectious disease that may have relevance not only for the agent inflicting the largest pandemic of our generation but also potentially for a range of other infections afflicting our public health worldwide.

Conclusions and perspectives

A huge wealth of data is becoming available that confirms the influence of our host genetic variation on how we become susceptible to and respond to diseases caused by infectious agents. Although traditional case–control GWAS have yielded many insights, a large amount of information is also coming from other methods, such as intermediate phenotype mapping and multi-omics approaches. Issues of heterogeneity in host and pathogen genetic diversity and difficulty in arriving at consensus for disease case definition both hamper power in GWAS approaches. However, as recent studies emphasize, these problems are not insurmountable with appropriate power and will be aided by greater precision in defining disease phenotypes and application in a range of populations.

The challenge now comes with bringing all of these findings and approaches together to translate them into a clinical benefit. Moving forwards, it will be critical to consider context specificity. The cellular studies and vaccine responses emphasize the importance of considering pathogen-specific immune responses in the context of both time and appropriate sample type. The extent and nature of pathogen diversity is a key context, with large gaps remaining in our understanding. Our ability to leverage genetic information will also improve as we further appreciate the overlaps between monogenic syndromes with a high infection risk and the more common infectious diseases with similar pathogens. All of these opportunities are likely to come as data from cohorts such as the UK Biobank and the China Kadoorie Biobank mature.

The clinical benefit of the large body of genetic work has already been demonstrated through associations translated to clinical implementation, such as abacavir hypersensitivity eliminated by screening for HLA-B*5701 (Fig. 2), and HCV treatment influenced by IL28B. We should be inspired by these lessons learnt as we move forwards into an era where large-scale genomics may help predict our risks of disease or facilitate future vaccine development and deployment through an understanding of genetic diversity at both the individual level and the population level to enable more tailored application in a precision medicine approach that maximizes effectiveness for a given person or population group.

Acknowledgements

Funding for this work was received from the UK National Institute for Health Research Oxford Biomedical Research Centre (to J.C.K. and A.M.), a Wellcome Trust Investigator Award (204969/Z/16/Z to J.C.K.), the Chinese Academy of Medical Sciences Innovation Fund for Medical Science (grant number 2018-I2M-2-002 to J.C.K.), the Croucher Foundation (to A.J.K.) and a Wellcome Trust Grant (090532/Z/09/Z) to core facilities at the Wellcome Centre for Human Genetics.

Glossary

Penetrance

The proportion of individuals with a particular genotype that also has an associated phenotype.

Linkage disequilibrium

The non-random association of alleles at different loci in a population, often owing to physical proximity of the loci on a chromosome.

Heterozygous advantage

Where heterozygotes have fitness advantages over homozygotes.

Haplotypes

Sets of genetic variants that tend to be inherited together.

Allotypes

A group of alleles that encode a unique protein sequence.

Population structure

Also referred to as population stratification. The presence of differences in allele frequencies between subpopulations in a population.

Expression quantitative trait locus

(eQTL). A genomic region that is associated with a proportion of variation in mRNA expression levels.

Balancing selection

A selection process that maintains genetic diversity.

Imputation

The process of replacing missing information on the basis of known information. In genetics, this refers to the statistical inference of unobserved genotypes based on known genotypes.

Type I errors

The rejection of true null hypotheses in hypothesis testing (that is, false positives).

Principal component analysis

A mathematical technique that summarizes data into components that explain the variance in the data. It is used in genome-wide association studies to infer cryptic population structure and/or outlier individuals.

Linear mixed model algorithms

A statistical model that contains both fixed and random effects. This is in contrast to linear models, which contain only fixed effects.

Complement

A system of plasma proteins in the innate immune system that react with each other to opsonize pathogens and induce inflammatory responses.

Haploinsufficient

A model of dominant gene action where a single functional copy of the gene is inadequate for normal cellular function.

Pyroptosis

An inflammatory form of programmed cell death occurring most commonly following infection with intracellular pathogens.

Author contributions

The authors contributed to all aspects of the article.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Related links

COVID-19 Host Genetics Initiative: https://www.covid19hg.org/

COVID Human Genetic Effort: https://www.covidhge.com/

NHGRI–EBI GWAS Catalog: https://www.ebi.ac.uk/gwas/

UK Biobank of COVID-19: https://www.ukbiobank.ac.uk/learn-more-about-uk-biobank/covid-19-hub

References

  • 1.Mühlemann B, et al. Ancient hepatitis B viruses from the Bronze Age to the medieval period. Nature. 2018;557:418–423. doi: 10.1038/s41586-018-0097-z. [DOI] [PubMed] [Google Scholar]
  • 2.Quintana-Murci L. Human immunology through the lens of evolutionary genetics. Cell. 2019;177:184–199. doi: 10.1016/j.cell.2019.02.033. [DOI] [PubMed] [Google Scholar]
  • 3.Karlsson EK, Kwiatkowski DP, Sabeti PC. Natural selection and infectious disease in human populations. Nat. Rev. Genet. 2014;15:379–393. doi: 10.1038/nrg3734. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Stanaway JD, et al. Global, regional, and national comparative risk assessment of 84 behavioural, environmental and occupational, and metabolic risks or clusters of risks for 195 countries and territories, 1990-2017: A systematic analysis for the Global Burden of Disease Study 2017. Lancet. 2018;392:1923–1994. doi: 10.1016/S0140-6736(18)32225-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Wu F, et al. A new coronavirus associated with human respiratory disease in China. Nature. 2020;579:265–269. doi: 10.1038/s41586-020-2008-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Zhu N, et al. A novel coronavirus from patients with pneumonia in China, 2019. N. Engl. J. Med. 2020;382:727–733. doi: 10.1056/NEJMoa2001017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Sørensen TIA, Nielsen GG, Andersen PK, Teasdale TW. Genetic and Environmental influences on premature death in adult adoptees. N. Engl. J. Med. 1988;318:727–732. doi: 10.1056/NEJM198803243181202. [DOI] [PubMed] [Google Scholar]
  • 8.Chapman SJ, Hill AVS. Human genetic susceptibility to infectious disease. Nat. Rev. Genet. 2012;13:175–188. doi: 10.1038/nrg3114. [DOI] [PubMed] [Google Scholar]
  • 9.Casanova J-L, Abel L. Lethal infectious diseases as inborn errors of immunity: toward a synthesis of the germ and genetic theories. Annu. Rev. Pathol. Mech. Dis. 2020 doi: 10.1146/annurev-pathol-031920-101429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Young AI. Solving the missing heritability problem. PLoS Genet. 2019;15:e1008222. doi: 10.1371/journal.pgen.1008222. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Claussnitzer M, et al. A brief history of human disease genetics. Nature. 2020;577:179–189. doi: 10.1038/s41586-019-1879-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Boyle EA, Li YI, Pritchard JK. An expanded view of complex traits: from polygenic to omnigenic. Cell. 2017;169:1177–1186. doi: 10.1016/j.cell.2017.05.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Liu X, Li YI, Pritchard JK. Trans effects on gene expression can drive omnigenic inheritance. Cell. 2019;177:1022–1034.e6. doi: 10.1016/j.cell.2019.04.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Dendrou CA, Petersen J, Rossjohn J, Fugger L. HLA variation and disease. Nat. Rev. Immunol. 2018;18:325–339. doi: 10.1038/nri.2017.143. [DOI] [PubMed] [Google Scholar]
  • 15.Fang H, et al. A genetics-led approach defines the drug target landscape of 30 immune-related traits. Nat. Genet. 2019;51:1082–1091. doi: 10.1038/s41588-019-0456-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Nelson MR, et al. The support of human genetic evidence for approved drug indications. Nat. Genet. 2015;47:856–860. doi: 10.1038/ng.3314. [DOI] [PubMed] [Google Scholar]
  • 17.Nabirotchkin S, et al. Next-generation drug repurposing using human genetics and network biology. Curr. Opin. Pharmacol. 2020 doi: 10.1016/j.coph.2019.12.004. [DOI] [PubMed] [Google Scholar]
  • 18.Emdin CA, Khera AV, Kathiresan S. Mendelian randomization. JAMA. 2017;318:1925–1926. doi: 10.1001/jama.2017.17219. [DOI] [PubMed] [Google Scholar]
  • 19.Torkamani A, Wineinger NE, Topol EJ. The personal and clinical utility of polygenic risk scores. Nat. Rev. Genet. 2018;19:581–590. doi: 10.1038/s41576-018-0018-x. [DOI] [PubMed] [Google Scholar]
  • 20.Pare G, et al. Genetic risk for dengue hemorrhagic fever and dengue fever in multiple ancestries. EBioMedicine. 2020;51:102584. doi: 10.1016/j.ebiom.2019.11.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kaslow RA, et al. Influence of combinations of human major histocompatibility complex genes on the course of HIV-1 infection. Nat. Med. 1996;2:405–411. doi: 10.1038/nm0496-405. [DOI] [PubMed] [Google Scholar]
  • 22.Deng HK, et al. Identification of a major co-receptor for primary isolates of HIV-1. Nature. 1996;381:661–666. doi: 10.1038/381661a0. [DOI] [PubMed] [Google Scholar]
  • 23.Alkhatib G, et al. CC CKR5: a RANTES, MIP-1α, MIP-1β receptor as a fusion cofactor for macrophage-tropic HIV-1. Science. 1996;272:1955–1958. doi: 10.1126/science.272.5270.1955. [DOI] [PubMed] [Google Scholar]
  • 24.Dragic T, et al. HIV-1 entry into CD4+ cells is mediated by the chemokine receptor CC-CKR-5. Nature. 1996;381:667–673. doi: 10.1038/381667a0. [DOI] [PubMed] [Google Scholar]
  • 25.Carrington M, et al. HLA and HIV-1: heterozygote advantage and B*35-Cw*04 disadvantage. Science. 1999;283:1748–1752. doi: 10.1126/science.283.5408.1748. [DOI] [PubMed] [Google Scholar]
  • 26.Ramsuran V, et al. Elevated HLA-A expression impairs HIV control through inhibition of NKG2A-expressing cells. Science. 2018;359:86–90. doi: 10.1126/science.aam8825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Jiang Y, et al. KIR3DS1/L1 and HLA-Bw4-80I are associated with HIV disease progression among HIV typical progressors and long-term nonprogressors. BMC Infect. Dis. 2013;13:405. doi: 10.1186/1471-2334-13-405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.McLaren PJ, Carrington M. The impact of host genetic variation on infection with HIV-1. Nat. Immunol. 2015;16:577–583. doi: 10.1038/ni.3147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Kulkarni S, et al. CCR5AS lncRNA variation differentially regulates CCR5, influencing HIV disease outcome. Nat. Immunol. 2019;20:824–834. doi: 10.1038/s41590-019-0406-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Comstock GW. Tuberculosis in twins: a re-analysis of the Prophit survey. Am. Rev. Respir. Dis. 1978;117:621–624. doi: 10.1164/arrd.1978.117.4.621. [DOI] [PubMed] [Google Scholar]
  • 31.van der Eijk EA, van de Vosse E, Vandenbroucke JP, van Dissel JT. Heredity versus environment in tuberculosis in twins. Am. J. Respir. Crit. Care Med. 2007;176:1281–1288. doi: 10.1164/rccm.200703-435OC. [DOI] [PubMed] [Google Scholar]
  • 32.Sveinbjornsson G, et al. HLA class II sequence variants influence tuberculosis risk in populations of European ancestry. Nat. Genet. 2016;48:318–322. doi: 10.1038/ng.3498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Zheng R, et al. Genome-wide association study identifies two risk loci for tuberculosis in Han Chinese. Nat. Commun. 2018;9:4072. doi: 10.1038/s41467-018-06539-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Curtis J, et al. Susceptibility to tuberculosis is associated with variants in the ASAP1 gene encoding a regulator of dendritic cell migration. Nat. Genet. 2015;47:523–527. doi: 10.1038/ng.3248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Fava VM, Dallmann-Sauer M, Schurr E. Genetics of leprosy: today and beyond. Hum. Genet. 2020;139:835–846. doi: 10.1007/s00439-019-02087-5. [DOI] [PubMed] [Google Scholar]
  • 36.Wang Z, et al. A large-scale genome-wide association and meta-analysis identified four novel susceptibility loci for leprosy. Nat. Commun. 2016;7:13760. doi: 10.1038/ncomms13760. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wong SH, et al. Leprosy and the adaptation of human toll-like receptor 1. PLoS Pathog. 2010;6:e1000979. doi: 10.1371/journal.ppat.1000979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Zhang F, et al. Identification of two new loci at IL23R and RAB32 that influence susceptibility to leprosy. Nat. Genet. 2011;43:1247–1251. doi: 10.1038/ng.973. [DOI] [PubMed] [Google Scholar]
  • 39.Naranbhai V, et al. Genomic modulators of gene expression in human neutrophils. Nat. Commun. 2015;6:7545. doi: 10.1038/ncomms8545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Mayer-Barber KD, et al. Host-directed therapy of tuberculosis based on interleukin-1 and type I interferon crosstalk. Nature. 2014;511:99–103. doi: 10.1038/nature13489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Cyrklaff M, et al. Hemoglobins S and C interfere with actin remodeling in Plasmodium falciparum-infected erythrocytes. Science. 2011;334:1283–1286. doi: 10.1126/science.1213775. [DOI] [PubMed] [Google Scholar]
  • 42.Key FM, Teixeira JC, de Filippo C, Andrés AM. Advantageous diversity maintained by balancing selection in humans. Curr. Opin. Genet. Dev. 2014;29:45–51. doi: 10.1016/j.gde.2014.08.001. [DOI] [PubMed] [Google Scholar]
  • 43.Malaria Genomic Epidemiology Network Reappraisal of known malaria resistance loci in a large multicenter study. Nat. Genet. 2014;46:1197–1204. doi: 10.1038/ng.3107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Malaria Genomic Epidemiology Network A novel locus of resistance to severe malaria in a region of ancient balancing selection. Nature. 2015;526:253–257. doi: 10.1038/nature15390. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Leffler EM, et al. Multiple instances of ancient balancing selection shared between humans and chimpanzees. Science. 2013;340:1578–1582. doi: 10.1126/science.1234070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Leffler EM, et al. Resistance to malaria through structural variation of red blood cell invasion receptors. Science. 2017;356:1140–1152. doi: 10.1126/science.aam6393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Thomas DL, et al. Genetic variation in IL28B and spontaneous clearance of hepatitis C virus. Nature. 2009;461:798–801. doi: 10.1038/nature08463. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Ge D, et al. Genetic variation in IL28B predicts hepatitis C treatment-induced viral clearance. Nature. 2009;461:399–401. doi: 10.1038/nature08309. [DOI] [PubMed] [Google Scholar]
  • 49.Jones, C. R., Flower, B. F., Barber, E., Simmons, B. & Cooke, G. S. Treatment optimisation for hepatitis C in the era of combination direct-acting antiviral therapy: a systematic review and meta-analysis. Wellcome Open Res. 10.12688/wellcomeopenres.15411.1 (2019). [DOI] [PMC free article] [PubMed]
  • 50.Duggal P, et al. Genome-wide association study of spontaneous resolution of hepatitis C virus infection: data from multiple cohorts. Ann. Intern. Med. 2013;158:235–245. doi: 10.7326/0003-4819-158-4-201302190-00003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Vergara C, et al. Multi-ancestry genome-wide association study of spontaneous clearance of hepatitis C virus. Gastroenterology. 2019;156:1496–1507.e7. doi: 10.1053/j.gastro.2018.12.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Harrison, E. M. et al. Ethnicity and outcomes from COVID-19: the ISARIC CCP-UK prospective observational cohort study of hospitalised patients. SSRN Electron. J. 10.2139/ssrn.3618215 (2020).
  • 53.Kowalski MH, et al. Use of > 100,000 NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium whole genome sequences improves imputation quality and detection of rare variant associations in admixed African and Hispanic/Latino populations. PLoS Genet. 2019;15:e1008500. doi: 10.1371/journal.pgen.1008500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Loh PR, et al. Reference-based phasing using the haplotype reference consortium panel. Nat. Genet. 2016;48:1443–1448. doi: 10.1038/ng.3679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Gurdasani D, et al. The African Genome Variation Project shapes medical genetics in Africa. Nature. 2015;517:327–332. doi: 10.1038/nature13997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Gilly A, et al. Very low-depth whole-genome sequencing in complex trait association studies. Bioinformatics. 2019;35:2555–2561. doi: 10.1093/bioinformatics/bty1032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Price AL, et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 2006;38:904–909. doi: 10.1038/ng1847. [DOI] [PubMed] [Google Scholar]
  • 58.Yang J, Zaitlen NA, Goddard ME, Visscher PM, Price AL. Advantages and pitfalls in the application of mixed-model association methods. Nat. Genet. 2014;46:100–106. doi: 10.1038/ng.2876. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Wojcik GL, et al. Genetic analyses of diverse populations improves discovery for complex traits. Nature. 2019;570:514–518. doi: 10.1038/s41586-019-1310-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Tian C, et al. Genome-wide association and HLA region fine-mapping studies identify susceptibility loci for multiple common infections. Nat. Commun. 2017;8:599. doi: 10.1038/s41467-017-00257-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.McMahon G, Ring SM, Davey-Smith G, Timpson NJ. Genome-wide association study identifies SNPs in the MHC class II loci that are associated with self-reported history of whooping cough. Hum. Mol. Genet. 2015;24:5930–5939. doi: 10.1093/hmg/ddv293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Karnes JH, et al. Phenome-wide scanning identifies multiple diseases and disease severity phenotypes associated with HLA variants. Sci. Transl Med. 2017;9:eaai8708. doi: 10.1126/scitranslmed.aai8708. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Choby JE, et al. A phenome-wide association study uncovers a pathological role of coagulation factor X during Acinetobacter baumannii infection. Infect. Immun. 2019;87:e00031–19. doi: 10.1128/IAI.00031-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Chen J, et al. Coagulation factors VII, IX and X are effective antibacterial proteins against drug-resistant Gram-negative bacteria. Cell Res. 2019;29:711–724. doi: 10.1038/s41422-019-0202-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Locke AE, et al. Exome sequencing of Finnish isolates enhances rare-variant association power. Nature. 2019;572:323–328. doi: 10.1038/s41586-019-1457-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Bycroft C, et al. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–209. doi: 10.1038/s41586-018-0579-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Parks T, et al. Association between a common immunoglobulin heavy chain allele and rheumatic heart disease risk in Oceania. Nat. Commun. 2017;8:14946. doi: 10.1038/ncomms14946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Bashford-Rogers RJM, et al. Analysis of the B cell receptor repertoire in six immune-mediated diseases. Nature. 2019;574:122–126. doi: 10.1038/s41586-019-1595-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Jain M, et al. Nanopore sequencing and assembly of a human genome with ultra-long reads. Nat. Biotechnol. 2018;36:338–345. doi: 10.1038/nbt.4060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Lewis LA, Ram S. Meningococcal disease and the complement system. Virulence. 2014;5:98–126. doi: 10.4161/viru.26515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Schröder-Braunstein J, Kirschfink M. Complement deficiencies and dysregulation: pathophysiological consequences, modern analysis, and clinical management. Mol. Immunol. 2019;114:299–311. doi: 10.1016/j.molimm.2019.08.002. [DOI] [PubMed] [Google Scholar]
  • 72.Davila S, et al. Genome-wide association study identifies variants in the CFH region associated with host susceptibility to meningococcal disease. Nat. Genet. 2010;42:772–776. doi: 10.1038/ng.640. [DOI] [PubMed] [Google Scholar]
  • 73.Bousfiha A, et al. Human inborn errors of immunity: 2019 update of the IUIS phenotypical classification. J. Clin. Immunol. 2020;40:66–81. doi: 10.1007/s10875-020-00758-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Zhang SY, et al. TLR3 deficiency in patients with herpes simplex encephalitis. Science. 2007;317:1522–1527. doi: 10.1126/science.1139522. [DOI] [PubMed] [Google Scholar]
  • 75.Jain A, et al. Specific missense mutations in NEMO result in hyper-IgM syndrome with hypohydrotic ectodermal dysplasia. Nat. Immunol. 2001;2:223–228. doi: 10.1038/85277. [DOI] [PubMed] [Google Scholar]
  • 76.Picard C, et al. Pyogenic bacterial infections in humans with IRAK-4 deficiency. Science. 2003;299:2076–2079. doi: 10.1126/science.1081902. [DOI] [PubMed] [Google Scholar]
  • 77.Von Bernuth H, et al. Pyogenic bacterial infections in humans with MyD88 deficiency. Science. 2008;321:691–696. doi: 10.1126/science.1158298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Döffinger R, et al. X-linked anhidrotic ectodermal dysplasia with immunodeficiency is caused by impaired NF-κB signaling. Nat. Genet. 2001;27:277–285. doi: 10.1038/85837. [DOI] [PubMed] [Google Scholar]
  • 79.Cuchet-Lourenço D, et al. Biallelic RIPK1 mutations in humans cause severe immunodeficiency, arthritis, and intestinal inflammation. Science. 2018;361:810–813. doi: 10.1126/science.aar2641. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Li Y, et al. Human RIPK1 deficiency causes combined immunodeficiency and inflammatory bowel diseases. Proc. Natl Acad. Sci. USA. 2019;116:970–975. doi: 10.1073/pnas.1813582116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Afzali B, et al. BACH2 immunodeficiency illustrates an association between super-enhancers and haploinsufficiency. Nat. Immunol. 2017;18:813–823. doi: 10.1038/ni.3753. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Baris S, et al. Severe early-onset combined immunodeficiency due to heterozygous gain-of-function mutations in STAT1. J. Clin. Immunol. 2016;36:641–648. doi: 10.1007/s10875-016-0312-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Higgins E, et al. Use of ruxolitinib to successfully treat chronic mucocutaneous candidiasis caused by gain-of-function signal transducer and activator of transcription 1 (STAT1) mutation. J. Allergy Clin. Immunol. 2015;135:551–553.e3. doi: 10.1016/j.jaci.2014.12.1867. [DOI] [PubMed] [Google Scholar]
  • 84.Arnold DE, Heimall JR. A review of chronic granulomatous disease. Adv. Ther. 2017;34:2543–2557. doi: 10.1007/s12325-017-0636-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Arnadottir GA, et al. A homozygous loss-of-function mutation leading to CYBC1 deficiency causes chronic granulomatous disease. Nat. Commun. 2018;9:4447. doi: 10.1038/s41467-018-06964-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Lamborn IT, et al. Recurrent rhinovirus infections in a child with inherited MDA5 deficiency. J. Exp. Med. 2017;214:1949–1972. doi: 10.1084/jem.20161759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Ciancanelli MJ, et al. Infectious disease. Life-threatening influenza and impaired interferon amplification in human IRF7 deficiency. Science. 2015;348:448–453. doi: 10.1126/science.aaa1578. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Hernandez N, et al. Life-threatening influenza pneumonitis in a child with inherited IRF9 deficiency. J. Exp. Med. 2018;215:2567–2585. doi: 10.1084/jem.20180628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Lim HK, et al. Severe influenza pneumonitis in children with inherited TLR3 deficiency. J. Exp. Med. 2019;216:2038–2056. doi: 10.1084/jem.20181621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Lindesmith L, et al. Human susceptibility and resistance to Norwalk virus infection. Nat. Med. 2003;9:548–553. doi: 10.1038/nm860. [DOI] [PubMed] [Google Scholar]
  • 91.Rosain J, et al. Mendelian susceptibility to mycobacterial disease: 2014–2018 update. Immunol. Cell Biol. 2019;97:360–367. doi: 10.1111/imcb.12210. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Kong X-F, et al. Disruption of an antimycobacterial circuit between dendritic and helper T cells in human SPPL2a deficiency. Nat. Immunol. 2018;19:973–985. doi: 10.1038/s41590-018-0178-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Kumpf O, et al. Influence of genetic variations in TLR4 and TIRAP/Mal on the course of sepsis and pneumonia and cytokine release: an observational study in three cohorts. Crit. Care. 2010;14:R103. doi: 10.1186/cc9047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Khor CC, et al. A Mal functional variant is associated with protection against invasive pneumococcal disease, bacteremia, malaria and tuberculosis. Nat. Genet. 2007;39:523–528. doi: 10.1038/ng1976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Hawn TR, et al. A polymorphism in toll-interleukin 1 receptor domain containing adaptor protein is associated with susceptibility to meningeal tuberculosis. J. Infect. Dis. 2006;194:1127–1134. doi: 10.1086/507907. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Ladhani SN, et al. Association between single-nucleotide polymorphisms in Mal/TIRAP and Interleukin-10 genes and susceptibility to invasive Haemophilus influenzae serotype b infection in immunized children. Clin. Infect. Dis. 2010;51:761–767. doi: 10.1086/656236. [DOI] [PubMed] [Google Scholar]
  • 97.Israel L, et al. Human adaptive immunity rescues an inborn error of innate immunity. Cell. 2017;168:789–800.e10. doi: 10.1016/j.cell.2017.01.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Westermann AJ, et al. Dual RNA-seq unveils noncoding RNA functions in host-pathogen interactions. Nature. 2016;529:496–501. doi: 10.1038/nature16547. [DOI] [PubMed] [Google Scholar]
  • 99.Nuss AM, et al. Tissue dual RNA-seq allows fast discovery of infection-specific functions and riboregulators shaping host-pathogen transcriptomes. Proc. Natl Acad. Sci. USA. 2017;114:E791–E800. doi: 10.1073/pnas.1613405114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Montoya DJ, et al. Dual RNA-Seq of human leprosy lesions identifies bacterial determinants linked to host immune response. Cell Rep. 2019;26:3574–3585.e3. doi: 10.1016/j.celrep.2019.02.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Lees JA, et al. Joint sequencing of human and pathogen genomes reveals the genetics of pneumococcal meningitis. Nat. Commun. 2019;10:2176. doi: 10.1038/s41467-019-09976-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Ansari MA, et al. Genome-to-genome analysis highlights the effect of the human innate and adaptive immune systems on the hepatitis C virus. Nat. Genet. 2017;49:666–673. doi: 10.1038/ng.3835. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Palmer DS, et al. Mapping the drivers of within-host pathogen evolution using massive data sets. Nat. Commun. 2019;10:3017. doi: 10.1038/s41467-019-10724-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Karlsson EK, et al. Natural selection in a Bangladeshi population from the cholera-endemic Ganges River Delta. Sci. Transl Med. 2013;5:192ra86. doi: 10.1126/scitranslmed.3006338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Amambua-Ngwa A, et al. Major subpopulations of Plasmodium falciparum in sub-Saharan Africa. Science. 2019;365:813–816. doi: 10.1126/science.aav5427. [DOI] [PubMed] [Google Scholar]
  • 106.Howick VM, et al. The Malaria Cell Atlas: single parasite transcriptomes across the complete plasmodium life cycle. Science. 2019;365:eaaw2619. doi: 10.1126/science.aaw2619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Miles A, et al. Genetic diversity of the African malaria vector anopheles gambiae. Nature. 2017;552:96–100. doi: 10.1038/nature24995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Näpflin K, et al. Genomics of host-pathogen interactions: challenges and opportunities across ecological and spatiotemporal scales. PeerJ. 2019;7:e8013. doi: 10.7717/peerj.8013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Davenport EE, et al. Genomic landscape of the individual host response and outcomes in sepsis: a prospective cohort study. Lancet Respir. Med. 2016;4:259–271. doi: 10.1016/S2213-2600(16)00046-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Burnham KL, et al. Shared and distinct aspects of the sepsis transcriptomic response to fecal peritonitis and pneumonia. Am. J. Respir. Crit. Care Med. 2017;196:328–339. doi: 10.1164/rccm.201608-1685OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Antcliffe DB, et al. Transcriptomic signatures in sepsis and a differential response to steroids from the VaNISH randomized trial. Am. J. Respir. Crit. Care Med. 2019;199:980–986. doi: 10.1164/rccm.201807-1419OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Scicluna BP, et al. Classification of patients with sepsis according to blood genomic endotype: a prospective cohort study. Lancet Respir. Med. 2017;5:816–826. doi: 10.1016/S2213-2600(17)30294-1. [DOI] [PubMed] [Google Scholar]
  • 113.Astle WJ, et al. The allelic landscape of human blood cell trait variation and links to common complex disease. Cell. 2016;167:1415–1429.e19. doi: 10.1016/j.cell.2016.10.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Jonsson S, et al. Identification of sequence variants influencing immunoglobulin levels. Nat. Genet. 2017;49:1182–1191. doi: 10.1038/ng.3897. [DOI] [PubMed] [Google Scholar]
  • 115.Scepanovic P, et al. Human genetic variants and age are the strongest predictors of humoral immune responses to common pathogens and vaccines. Genome Med. 2018;10:59. doi: 10.1186/s13073-018-0568-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Hammer C, et al. Amino acid variation in HLA class II proteins is a major determinant of humoral response to common viruses. Am. J. Hum. Genet. 2015;97:738–743. doi: 10.1016/j.ajhg.2015.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Roederer M, et al. The genetic architecture of the human immune system: a bioresource for autoimmunity and disease pathogenesis. Cell. 2015;161:387–403. doi: 10.1016/j.cell.2015.02.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Wang L, et al. An atlas of genetic variation linking pathogen-induced cellular traits to human disease. Cell Host Microbe. 2018;24:308–323.e6. doi: 10.1016/j.chom.2018.07.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Denny JC, et al. Systematic comparison of phenome-wide association study of electronic medical record data and genome-wide association study data. Nat. Biotechnol. 2013;31:1102–1111. doi: 10.1038/nbt.2749. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Fairfax BP, et al. Genetics of gene expression in primary immune cells identifies cell type–specific master regulators and roles of HLA alleles. Nat. Genet. 2012;44:502–510. doi: 10.1038/ng.2205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Quach H, et al. Genetic adaptation and neandertal admixture shaped the immune system of human populations. Cell. 2016;167:643–656.e17. doi: 10.1016/j.cell.2016.09.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Fairfax BP, et al. Innate immune activity conditions the effect of regulatory variants upon monocyte gene expression. Science. 2014;343:1246949. doi: 10.1126/science.1246949. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Lee MN, et al. Common genetic variants modulate pathogen-sensing responses in human dendritic cells. Science. 2014;343:1246980. doi: 10.1126/science.1246980. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Nédélec Y, et al. Genetic ancestry and natural selection drive population differences in immune responses to pathogens. Cell. 2016;167:657–669.e21. doi: 10.1016/j.cell.2016.09.025. [DOI] [PubMed] [Google Scholar]
  • 125.Gilchrist JJ, et al. Risk of nontyphoidal Salmonella bacteraemia in African children is modified by STAT4. Nat. Commun. 2018;9:1014. doi: 10.1038/s41467-017-02398-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Kornblihtt AR, et al. Alternative splicing: a pivotal step between eukaryotic transcription and translation. Nat. Rev. Mol. Cell Biol. 2013;14:153–165. doi: 10.1038/nrm3525. [DOI] [PubMed] [Google Scholar]
  • 127.Rotival M, Quach H, Quintana-Murci L. Defining the genetic and evolutionary architecture of alternative splicing in response to infection. Nat. Commun. 2019;10:1671. doi: 10.1038/s41467-019-09689-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Patin E, et al. Natural variation in the parameters of innate immune cells is preferentially driven by genetic factors resource. Nat. Immunol. 2018;19:302–314. doi: 10.1038/s41590-018-0049-7. [DOI] [PubMed] [Google Scholar]
  • 129.Venet F, Lukaszewicz AC, Payen D, Hotchkiss R, Monneret G. Monitoring the immune response in sepsis: a rational approach to administration of immunoadjuvant therapies. Curr. Opin. Immunol. 2013;25:477–483. doi: 10.1016/j.coi.2013.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 130.Li Y, et al. A functional genomics approach to understand variation in cytokine production in humans. Cell. 2016;167:1099–1110.e14. doi: 10.1016/j.cell.2016.10.017. [DOI] [PubMed] [Google Scholar]
  • 131.Eisfeld AJ, et al. Multi-platform ’omics analysis of human Ebola virus disease pathogenesis. Cell Host Microbe. 2017;22:817–829.e8. doi: 10.1016/j.chom.2017.10.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Mentzer AJ, O’Connor D, Pollard AJ, Hill AVS. Searching for the human genetic factors standing in the way of universally effective vaccines. Philos. Trans. R. Soc. B Biol. Sci. 2015;370:20140341. doi: 10.1098/rstb.2014.0341. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 133.Png E, et al. A genome-wide association study of hepatitis B vaccine response in an Indonesian population reveals multiple independent risk variants in the HLA region. Hum. Mol. Genet. 2011;20:3893–3898. doi: 10.1093/hmg/ddr302. [DOI] [PubMed] [Google Scholar]
  • 134.Desombere I, Willems A, Leroux-Roels G. Response to hepatitis B vaccine: multiple HLA genes are involved. Tissue Antigens. 1998;51:593–604. doi: 10.1111/j.1399-0039.1998.tb03001.x. [DOI] [PubMed] [Google Scholar]
  • 135.Kruger A, et al. Hepatitis B surface antigen presentation and HLA-DRB1*- lessons from twins and peptide binding studies. Clin. Exp. Immunol. 2005;140:325–332. doi: 10.1111/j.1365-2249.2005.02765.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 136.Salazar M, et al. Normal HBsAg presentation and T-cell defect in the immune response of nonresponders. Immunogenetics. 1995;41:366–374. doi: 10.1007/BF00163994. [DOI] [PubMed] [Google Scholar]
  • 137.McDermott AB, Zuckerman JN, Sabin CA, Marsh SGE, Madrigal JA. Contribution of human leukocyte antigens to the antibody response to hepatitis B vaccination. Tissue Antigens. 1997;50:8–14. doi: 10.1111/j.1399-0039.1997.tb02827.x. [DOI] [PubMed] [Google Scholar]
  • 138.Höhler T, et al. The influence of major histocompatibility complex class II genes and T-cell Vbeta repertoire on response to immunization with HBsAg. Hum. Immunol. 1998;59:212–218. doi: 10.1016/S0198-8859(98)00014-7. [DOI] [PubMed] [Google Scholar]
  • 139.Lambert ND, et al. Polymorphisms in HLA-DPB1 Are associated with differences in rubella virus-specific humoral immunity after vaccination. J. Infect. Dis. 2015;211:898–905. doi: 10.1093/infdis/jiu553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 140.Ovsyannikova IG, Pankratz VS, Larrabee BR, Jacobson RM, Poland GA. HLA genotypes and rubella vaccine immune response: additional evidence. Vaccine. 2014;32:4206–4213. doi: 10.1016/j.vaccine.2014.04.091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 141.Jacobson RM, Ovsyannikova IG, Vierkant RA, Shane Pankratz V, Poland GA. Human leukocyte antigen associations with humoral and cellular immunity following a second dose of measles-containing vaccine: persistence, dampening, and extinction of associations found after a first dose. Vaccine. 2011;29:7982–7991. doi: 10.1016/j.vaccine.2011.08.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 142.Newport MJ, et al. Genetic regulation of immune responses to vaccines in early life. Genes. Immun. 2004;5:122–129. doi: 10.1038/sj.gene.6364051. [DOI] [PubMed] [Google Scholar]
  • 143.O’Connor D, et al. Common genetic variations associated with the persistence of immunity following childhood immunization. Cell Rep. 2019;27:3241–3253.e4. doi: 10.1016/j.celrep.2019.05.053. [DOI] [PubMed] [Google Scholar]
  • 144.Zeller T, et al. Genetics and beyond – the transcriptome of human monocytes and disease susceptibility. PLoS ONE. 2010;5:e10693. doi: 10.1371/journal.pone.0010693. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 145.Shaghaghi M, et al. Combined immunodeficiency presenting with vaccine-associated paralytic poliomyelitis: a case report and narrative review of literature. Immunol. Invest. 2014;43:292–298. doi: 10.3109/08820139.2013.859156. [DOI] [PubMed] [Google Scholar]
  • 146.Hernandez N, et al. Inherited IFNAR1 deficiency in otherwise healthy patients with adverse reaction to measles and yellow fever live vaccines. J. Exp. Med. 2019;216:2057–2070. doi: 10.1084/jem.20182295. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 147.Duncan CJA, et al. Human IFNAR2 deficiency: lessons for antiviral immunity. Sci. Transl Med. 2015;7:307ra154. doi: 10.1126/scitranslmed.aac4227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 148.Burns C, et al. A novel presentation of homozygous loss-of-function STAT-1 mutation in an infant with hyperinflammation — a case report and review of the literature. J. Allergy Clin. Immunol. Pract. 2016;4:777–779. doi: 10.1016/j.jaip.2016.02.015. [DOI] [PubMed] [Google Scholar]
  • 149.Moens L, et al. A novel kindred with inherited STAT2 deficiency and severe viral illness. J. Allergy Clin. Immunol. 2017;139:1995–1997.e9. doi: 10.1016/j.jaci.2016.10.033. [DOI] [PubMed] [Google Scholar]
  • 150.Docherty AB, et al. Features of 20 133 UK patients in hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: prospective observational cohort study. BMJ. 2020;369:m1985. doi: 10.1136/bmj.m1985. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 151.Williams, F. M. et al. Self-reported symptoms of covid-19 including symptoms most predictive of SARS-CoV-2 infection, are heritable. Preprint at medRxiv10.1101/2020.04.22.20072124 (2020). [DOI] [PubMed]
  • 152.RECOVERY Collaborative Group Dexamethasone in hospitalized patients with covid-19 — preliminary report. N. Engl. J. Med. 2020 doi: 10.1056/NEJMoa2021436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 153.COVID-19 Host Genetics Initiative. The COVID-19 Host Genetics Initiative, a global initiative to elucidate the role of host genetic factors in susceptibility and severity of the SARS-CoV-2 virus pandemic. Eur. J. Hum. Genet. 2020;28:715–718. doi: 10.1038/s41431-020-0636-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 154.Casanova JL, et al. A global effort to define the human genetics of protective immunity to SARS-CoV-2 infection. Cell. 2020;181:1194–1199. doi: 10.1016/j.cell.2020.05.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 155.Zhang X, et al. Viral and host factors related to the clinical outcome of COVID-19. Nature. 2020 doi: 10.1038/s41586-020-2355-0. [DOI] [PubMed] [Google Scholar]
  • 156.Severe Covid-19 GWAS Group. Genomewide association study of severe covid-19 with respiratory failure. N. Engl. J. Med.10.1056/NEJMoa2020283 (2020). This study illustrates the early successful application of GWAS in the COVID-19 pandemic, identifying the 3p21.31 locus and a potential role for the ABO blood-group system. [DOI] [PMC free article] [PubMed]
  • 157.Shelton, J. F. et al. Trans-ethnic analysis reveals genetic and non-genetic associations with COVID-19 susceptibility and severity. Preprint at medRxiv10.1101/2020.09.04.20188318 (2020).
  • 158.Pairo-Castineira, E. et al. Genetic mechanisms of critical illness in Covid-19. Preprint at medRxiv10.1101/2020.09.24.20200048 (2020).
  • 159.Smemo S, et al. Obesity-associated variants within FTO form long-range functional connections with IRX3. Nature. 2014;507:371–375. doi: 10.1038/nature13138. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 160.Zeberg H, Pääbo S. The major genetic risk factor for severe COVID-19 is inherited from Neanderthals. Nature. 2020 doi: 10.1038/s41586-020-2818-3. [DOI] [PubMed] [Google Scholar]
  • 161.Nguyen A, et al. Human leukocyte antigen susceptibility map for SARS-CoV-2. J. Virol. 2020 doi: 10.1128/JVI.00510-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 162.Zhang Q, et al. Inborn errors of type I IFN immunity in patients with life-threatening COVID-19. Science. 2020;370:eabd4570. doi: 10.1126/science.abd4570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 163.Bastard P, et al. Auto-antibodies against type I IFNs in patients with life-threatening COVID-19. Science. 2020;65:eabd4585. doi: 10.1126/science.abd4585. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 164.Fellay J, et al. A whole-genome association study of major determinants for host control of HIV-1. Science. 2007;317:944–947. doi: 10.1126/science.1143767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 165.Fellay J, et al. Common genetic variation and the control of HIV-1 in humans. PLoS Genet. 2009;5:e1000791. doi: 10.1371/journal.pgen.1000791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 166.Pereyra F, et al. The major genetic determinants of HIV-1 control affect HLA class I peptide presentation. Science. 2010;330:1551–1557. doi: 10.1126/science.1195271. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 167.Johnson EO, et al. Novel genetic locus implicated for HIV-1 acquisition with putative regulatory links to HIV replication and infectivity: a genome-wide association study. PLoS ONE. 2015;10:e0118149. doi: 10.1371/journal.pone.0118149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 168.Wei Z, et al. Genome-wide association studies of HIV-1 host control in ethnically diverse Chinese populations. Sci. Rep. 2015;5:10879. doi: 10.1038/srep10879. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 169.McLaren PJ, et al. Association study of common genetic variants and HIV-1 acquisition in 6,300 infected cases and 7,200 controls. PLoS Pathog. 2013;9:e1003515. doi: 10.1371/journal.ppat.1003515. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 170.McLaren PJ, et al. Polymorphisms of large effect explain the majority of the host genetic contribution to variation of HIV-1 virus load. Proc. Natl Acad. Sci. USA. 2015;112:14658–14663. doi: 10.1073/pnas.1514867112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 171.Ekenberg C, et al. Association between single-nucleotide polymorphisms in HLA alleles and human immunodeficiency virus type 1 viral load in demographically diverse, antiretroviral therapy-naive participants from the strategic timing of antiretroviral treatment trial. J. Infect. Dis. 2019;220:1325–1334. doi: 10.1093/infdis/jiz294. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 172.Sobota RS, et al. A chromosome 5q31.1 locus associates with tuberculin skin test reactivity in HIV-positive individuals from tuberculosis hyper-endemic regions in east Africa. PLoS Genet. 2017;13:e1006710. doi: 10.1371/journal.pgen.1006710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 173.Luo Y, et al. Early progression to active tuberculosis is a highly heritable trait driven by 3q23 in Peruvians. Nat. Commun. 2019;10:3765. doi: 10.1038/s41467-019-11664-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 174.Qi H, et al. Discovery of susceptibility loci associated with tuberculosis in Han Chinese. Hum. Mol. Genet. 2017;26:4752–4763. doi: 10.1093/hmg/ddx365. [DOI] [PubMed] [Google Scholar]
  • 175.Thye T, et al. Common variants at 11p13 are associated with susceptibility to tuberculosis. Nat. Genet. 2012;44:257–259. doi: 10.1038/ng.1080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 176.Grant AV, et al. A genome-wide association study of pulmonary tuberculosis in Morocco. Hum. Genet. 2016;135:299–307. doi: 10.1007/s00439-016-1633-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 177.Timmann C, et al. Genome-wide association study indicates two novel resistance loci for severe malaria. Nature. 2012;489:443–446. doi: 10.1038/nature11334. [DOI] [PubMed] [Google Scholar]
  • 178.Liu H, et al. Discovery of six new susceptibility loci and analysis of pleiotropic effects in leprosy. Nat. Genet. 2015;47:267–271. doi: 10.1038/ng.3212. [DOI] [PubMed] [Google Scholar]
  • 179.Hu Z, et al. New loci associated with chronic hepatitis B virus infection in Han Chinese. Nat. Genet. 2013;45:1499–1503. doi: 10.1038/ng.2809. [DOI] [PubMed] [Google Scholar]
  • 180.Li Y, et al. Genome-wide association study identifies 8p21.3 associated with persistent hepatitis B virus infection among Chinese. Nat. Commun. 2016;7:11664. doi: 10.1038/ncomms11664. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 181.Miki D, et al. HLA-DQB1*03 confers susceptibility to chronic hepatitis C in Japanese: a genome-wide association study. PLoS ONE. 2013;8:e84226. doi: 10.1371/journal.pone.0084226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 182.Khor CC, et al. Genome-wide association study identifies susceptibility loci for dengue shock syndrome at MICB and PLCE1. Nat. Genet. 2011;43:1139–1141. doi: 10.1038/ng.960. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 183.Dunstan SJ, et al. Variation at HLA-DRB1 is associated with resistance to enteric fever. Nat. Genet. 2014;46:1333–1336. doi: 10.1038/ng.3143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 184.Rivers L, Gaspar HB. Severe combined immunodeficiency: recent developments and guidance on clinical management. Arch. Dis. Child. 2015;100:667–672. doi: 10.1136/archdischild-2014-306425. [DOI] [PubMed] [Google Scholar]
  • 185.Amirifar P, Ranjouri MR, Yazdani R, Abolhassani H, Aghamohammadi A. Ataxia-telangiectasia: a review of clinical features and molecular pathology. Pediatr. Allergy Immunol. 2019;30:277–288. doi: 10.1111/pai.13020. [DOI] [PubMed] [Google Scholar]
  • 186.Chan AC, et al. ZAP-70 deficiency in an autosomal recessive form of severe combined immunodeficiency. Science. 1994;264:1599–1601. doi: 10.1126/science.8202713. [DOI] [PubMed] [Google Scholar]
  • 187.Bonilla FA, et al. International consensus document (ICON): common variable immunodeficiency disorders. J. Allergy Clin. Immunol. Pract. 2016;4:38–59. doi: 10.1016/j.jaip.2015.07.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 188.Suri D, Rawat A, Singh S. X-linked agammaglobulinemia. Indian J. Pediatr. 2016;83:331–337. doi: 10.1007/s12098-015-2024-8. [DOI] [PubMed] [Google Scholar]
  • 189.Yazdani R, et al. The hyper IgM syndromes: epidemiology, pathogenesis, clinical manifestations, diagnosis and management. Clin. Immunol. 2019;198:19–30. doi: 10.1016/j.clim.2018.11.007. [DOI] [PubMed] [Google Scholar]
  • 190.Dinauer MC. Disorders of neutrophil function: an overview. Methods Mol. Biol. 2014;1124:501–515. doi: 10.1007/978-1-62703-845-4_30. [DOI] [PubMed] [Google Scholar]
  • 191.Ram S, Lewis LA, Rice PA. Infections of people with complement deficiencies and patients who have undergone splenectomy. Clin. Microbiol. Rev. 2010;23:740–780. doi: 10.1128/CMR.00048-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 192.Tangye T, XLP SG. Clinical features and molecular etiology due to mutations in SH2D1A encoding SAP. J. Clin. Immunol. 2014;34:772–779. doi: 10.1007/s10875-014-0083-7. [DOI] [PubMed] [Google Scholar]
  • 193.Nahum A. Chronic mucocutaneous candidiasis: a spectrum of genetic disorders. LymphoSign J. 2017 doi: 10.14785/lymphosign-2017-0011. [DOI] [Google Scholar]
  • 194.Lanternier F, et al. Deep dermatophytosis and inherited CARD9 deficiency. N. Engl. J. Med. 2013;369:1704–1714. doi: 10.1056/NEJMoa1208487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 195.Parham P, Moffett A. Variable NK cell receptors and their MHC class i ligands in immunity, reproduction and human evolution. Nat. Rev. Immunol. 2013;13:133–144. doi: 10.1038/nri3370. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 196.Dawkins R, et al. Genomics of the major histocompatibility complex: haplotypes, duplication, retroviruses and disease. Immunol. Rev. 1999;167:275–304. doi: 10.1111/j.1600-065X.1999.tb01399.x. [DOI] [PubMed] [Google Scholar]
  • 197.Trowsdale J, Knight JC. Major histocompatibility complex genomics and human disease. Annu. Rev. Genomics. Hum. Genet. 2013;14:301–323. doi: 10.1146/annurev-genom-091212-153455. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 198.Parham P, Ohta T. Population biology of antigen presentation by MHC class I molecules. Science. 1996;272:67–74. doi: 10.1126/science.272.5258.67. [DOI] [PubMed] [Google Scholar]
  • 199.Hedrick PW. Pathogen resistance and genetic variation at MHC loci. Evolution. 2002;56:1902–1908. doi: 10.1111/j.0014-3820.2002.tb00116.x. [DOI] [PubMed] [Google Scholar]
  • 200.Radwan J, Babik W, Kaufman J, Lenz TL, Winternitz J. Advances in the evolutionary understanding of MHC polymorphism. Trends Genet. 2020;36:298–311. doi: 10.1016/j.tig.2020.01.008. [DOI] [PubMed] [Google Scholar]

Articles from Nature Reviews. Genetics are provided here courtesy of Nature Publishing Group

RESOURCES