Skip to main content
F1000Research logoLink to F1000Research
. 2017 Nov 28;6:2056. [Version 1] doi: 10.12688/f1000research.12365.1

Whole-exome sequencing for detecting inborn errors of immunity: overview and perspectives

Barbara Bosch 1,2, Yuval Itan 3,4, Isabelle Meyts 1,5,a
PMCID: PMC5710381  PMID: 29225788

Abstract

The study of inborn errors of immunity is based on a comprehensive clinical description of the patient’s phenotype and the elucidation of the underlying molecular mechanisms and their genetic etiology. Deciphering the pathogenesis is key to genetic counseling and the development of targeted therapy. This review shows the power of whole-exome sequencing in detecting inborn errors of immunity along five central steps taken in whole-exome sequencing analysis. In parallel, we highlight the challenges for the clinical and scientific use of the method and how these hurdles are currently being addressed. We end by ruminating on major areas in the field open to future research.

Keywords: primary immune deficiencies, inborn errors of immunity, whole-exome sequencing

Introduction

Inborn errors of immunity (IEIs) or primary immune deficiencies (PIDs) are inherited defects leading to errors in one or more components of the immune system. The presentation of IEIs is variable, and phenotypes are as diverse as increased susceptibility to infection, auto-inflammation, autoimmunity, allergy, and malignancy. Ever since the description of the first IEI, Bruton agammaglobulinemia (1952), their study has been built upon (1) a thorough clinical description, (2) the elucidation of the crippled cellular pathway and molecular mechanisms, and (3) the genetic etiology. Bruton noticed complete absence of gammaglobulin in a child with recurrent pneumococcal sepsis and postulated a defect in the antibody response 1. Race and Sanger mapped the agammaglobulinemia locus to the X-chromosome 2, and ultimately the role of Bruton tyrosine kinase (BTK) in B-cell development was recognized 3. This three-step approach, translated in the contemporary detection of IEI by clinical phenotyping, testing a genetic hypothesis, and functionally validating a candidate variant, remains fundamental to date in both the clinical and research setting 4.

The tools available for identifying IEIs have evolved over time. The introduction of Sanger sequencing for detecting disease-causing mutations revolutionized medicine. It allows the generation of a high-quality sequence of up to 900 nucleotides and has a low error rate (0.001%) 5. With this technique, mutations in 223 genes, including BTK, have been shown to underlie IEI. However, Sanger sequencing is intrinsically low-throughput, time-consuming, and hypothesis-driven. Therefore, the prior identification of a region or a set of genes of interest (for instance, via linkage analysis, positional cloning, genetic homology analysis, or identification of the affected pathway) is essential for the identification of a novel disease-causing gene using Sanger sequencing. The introduction of next-generation sequencing (NGS) in 2010 allowed the genetic study of IEI by sequencing the whole-exome (WES) or the whole-genome (WGS) or the transcriptome (RNA-seq) or a combination of these. NGS techniques use massive parallel sequencing, allowing the generation of gigabases of genome-wide information in a single run and thus a fast and unbiased approach to identifying the etiology of a disease. This resulted in the identification of 47 novel IEIs in the last three years (2014–2016). Yet the length of reads is shorter and sequencing error rates are higher (up to 2%) 5. As these techniques became widely accessible and more affordable and started being used in scientific but also clinical settings, it became apparent that identifying the disease-causing variant in the pile of information generated by WES and WGS, respectively, requires a methodology much like the three-step approach elaborated above.

This review presents the recent advances in detecting the genetic origin of IEIs using NGS. We will discuss (1) the technical data acquisition, (2) the generation of a genetic hypothesis, (3) the variant-level interpretation, (4) the gene-level interpretation, and (5) the functional validation. We focus on WES, which allows the analysis of the exome: the DNA sequence encompassing all exons of protein coding genes, microRNA, small nucleolar RNA, and large intergenic noncoding RNA in the genome (around 2% of the patient’s genome) 6. WES is currently the most widely used NGS technique for the detection of IEIs. Extensive overviews of this approach have recently been published 4, 79; therefore, we will elaborate only on the core components here along a structured flowchart. We will highlight the challenges for each of the five components described.

Advances in testing a genetic hypothesis using whole-exome sequencing

Whole-exome sequencing – generation and bioinformatic pipeline

The generation of a WES dataset starts with the polymerase chain reaction (PCR) amplification of the patient’s genomic DNA. The library of primers used for this PCR reaction is designed to span the entire human exome (capture step). The nucleotide sequences of the short PCR amplified pieces of DNA (amplicons) are made available for further analysis in a FASTQ output file. A FASTQ is a file that contains base call and quality information for all sequence reads that pass filtering. This is the common raw electronic data provided after sequencing 10. Subsequently, data are mapped to a genomic position (alignment) and stored in a Binary Alignment/Map (BAM) or CRAM file. BAM is a binary file that contains data about the reads’ alignment to the reference genome. BAM files are used for downstream annotations and reads visualizations 11. CRAM files are an alternative format to BAM, used in the 1000 Genomes Project 12.

Only high-quality reads are kept and compared with a human reference sequence. Nucleotides that differ between both sequences are “called” into a variant call format (VCF) file. The VCF also contains information on coverage or sequencing depth: the number of times a specific nucleotide is read. If around 50% or nearly 100% of the reads show a nucleotide differing from the reference genome, the position is called a heterozygous or homozygous variant, respectively. A sufficiently high coverage (typically, more than 20) is essential to discriminate a sequencing error from a true variant. Analysis of the sequencing depth can also be applied to detect somatic mosaicism where less than 50% of reads contain an alternative nucleotide; however, in the latter case, a greater read depth of at least 50 is recommended 13. WES typically generates over 100,000 high-quality variants per patient. At most, two of these underlie monogenetic disease in a patient. Subsequently, filtering of variants on the basis of a carefully considered genetic hypothesis is needed to maximize removal of false positives (removing maximum noise to work on a small manageable number of variants) while minimizing false negatives (not removing the true mutation by a too stringent filtration) and finally for properly prioritizing the remaining variants ( Figure 1).

Figure 1. Approach to the use of whole-exome sequencing for the detection of inborn errors of immunity.

Figure 1.

A schematic overview of the different steps taken during whole-exome sequencing analysis (black boxes) with the challenges identified in recent research that need to be accounted for in future research (blue dashed boxes). AD, autosomal dominant; AR, autosomal recessive; CADD, Combined Annotation Dependent Depletion; DN, dominant negative; GDI, gene damaging index; GOF, gain of function; IEI, inborn errors of immunity; LOF, loss of function; MAF, minor allele frequency; MQ, Mapping Quality; MSC, mutation significance cutoff; RD, read depth; XL, X-linked.

Genetic hypothesis

A rigorous clinical description of the IEI is paramount in the generation of a genetic hypothesis as it will influence all steps in the WES analysis: it aids grouping patients with homogenous phenotypes and hypothesizing on the potential mode of inheritance, expected allele frequency (AF), and cellular pathways that may underlie the disease.

Assuming high penetrance, analysis of a multiplex kindred (multiple patients in the same kindred) allows for the reduction of candidate variants to those shared by the diseased but not by the healthy relatives. High penetrance implies that a large proportion of individuals carrying a particular genotype also present the associated clinical trait. The same genetic homogeneity can be expected in at least a subset of unrelated patients with the same disease. As an example, 8 out of 18 families with isolated congenital asplenia (ICA) had deleterious mutations in RPSA whereas only one loss-of-function (LOF) mutation (p.Trp176*) has been detected in 121,156 control exomes sequenced by the Exome Aggregation Consortium (ExAC) 14. This enrichment of RPSA mutations in the ICA cohort made RPSA the candidate gene for ICA by genetic and phenotypic homogeneity (#1 candidate in a gene burden test) even though the mechanism of disease remains enigmatic 15.

Pedigree analysis also contributes to generating a hypothesis on the mode of inheritance. Consanguineous families are more likely to suffer from an autosomal-recessive (AR) inherited disease 16; diseases affecting only males suggest X-linked (XL) inheritance. This model can then be applied to the WES dataset: for AR disease, homozygous or compound heterozygous variants are selected; for autosomal dominant (AD) disease, heterozygous variants are selected; and for XL disease, variants called to the X-chromosome are prioritized.

Variant-level interpretation

In IEI research, typically variants predicted to cause LOF of the studied gene are prioritized, as a defective immune response is hypothesized. LOFs are indel, frameshift, start-loss, stop-gained, missense, and essential splicing mutations. It must be kept in mind that mutations can also lead to disease if they result in a dominant negative effect (for example, interferon-gamma receptor 1 ( IFN-γR1) 17) or in a gain of function (GOF) (for example, signal transducer and activator of transcription 3 ( STAT3) 18).

Next, it is important to consider the frequency of the disease studied. Individual IEIs are rare diseases; thus, it would be highly unlikely that a disease-causing variant has an AF of more than 1% in the general population—that is, in public databases such as the Single Nucleotide Polymorphism Database (dbSNP), 1000 Genomes, ExAC, and the Genome Aggregation Database (gnomAD). The last of these also includes control WGS data 14.

An additional parameter to select a candidate variant is its predicted deleteriousness. Various predictions tools have been developed to calculate the effect of a nucleotide change on a gene: the sorting intolerant from tolerant (SIFT) score 19, the polymorphism phenotyping v2 (PolyPhen2) score 20, and the Combined Annotation Dependent Depletion (CADD) score 21, which adds information on evolutionary conservation, gene regulation, and transcription to the SIFT and PolyPhen2 calculations. Typically, disease-causing variants that are predicted deleterious by SIFT and PolyPhen or that have a CADD score above 15 (or both) are prioritized in the WES analysis. Yet false-negative rates for all three methods’ fixed genome-wide cutoffs are high 22. Also, deleteriousness does not imply pathogenicity: it is indeed possible that a highly deleterious variant does not explain the patient’s phenotype, for example, as there are other pathways compensating for the defect 23.

Gene-level interpretation

Some genes, such as titin ( TTN) and mucin 16 ( MUC16), harbor many rare (low AF) LOF mutations that are predicted to be deleterious in the general population. They are typically not prioritized as a candidate gene for a life-threatening condition as are many IEIs 24, although variants in these genes per se could be promising candidates. A measure for this principle is the gene damage index (GDI) 25. GDI is calculated for each human gene by summing up the CADD scores of each of its 1000 Genomes Project phase 3 alternative alleles—minor allele frequency (MAF) of less than 0.5, missense/nonsense/frameshift/in-frame indels/splice variants—normalized by the corresponding expected CADD scores for alleles with the same MAF and multiplied by the AF. The lower the GDI of a particular gene, the lower the accumulated damage prediction score of the alleles in this gene are reported in the general population. The mutation significance cutoff (MSC) uses the CADD/PolyPhen2/SIFT scores of all known disease-causing mutations for each protein-coding human gene from the Human Gene Mutation Database (HGMD) and ClinVar database 26, 27 and generates the lowest predicted clinically/biologically relevant CADD/PolyPhen2/SIFT cutoff value for a specific gene to enable safe removal of benign variants from NGS data. A variant with a CADD score under the MSC is likely benign (even if the CADD score is greater than the popular cutoffs of 15 or 20) and can be safely removed as the true positive rate is 98% 22. For example, a gene with a CADD-based 99% confidence interval MSC cutoff of 25 and a variant in this gene having a CADD score of 23 can be safely filtered out. Conversely, when a gene has a CADD-based 99% confidence interval MSC cutoff of 5 and a variant in this gene has a CADD score of 7, the variant should not be filtered out on the basis of CADD score. Lastly, the information already available on the pathogenicity of a gene must be taken into account. If a gene is known to underlie an IEI and the patient’s phenotype corresponds to the phenotype previously described, the variant can be disease-causing even if the damage prediction is low 28. Also, as has been shown for the Toll-like receptor 3 (TLR3) pathway, mutations in genes that are biological interaction partners can lead to the same disease, in this case herpes simplex encephalitis (HSE). The Human Gene Connectome has been developed to calculate the biological distance and route from a core gene (typically a known IEI gene) to other genes: 95% of all new IEIs (discovered in 2014) are within the top 1% of at least one known IEI gene 29.

Functional validation

After confirmation of the candidate disease-causing variant by Sanger sequencing, the final but crucial step is its functional validation. This should include at least three steps:

First, the impact of the patient’s alleles on transcription and protein expression of the gene of interest needs to be tested. If an allele leads to loss of expression of the protein studied, this is a strong argument in favor of the deleteriousness/pathogenicity of the variant. Expression can be predicted using publicly available databases, such as bioGPS (mRNA) 30 and the Human Protein Atlas (mRNA and protein expression) 31.

Second, the cellular phenotype of the patient should be investigated in a relevant experimental system, preferably the patient’s cells and healthy control cells. The function of the pathway of interest in patient-derived primary cells or cell lines can be compared with that of controls given that the protein is endogenously expressed in the chosen experimental system. Ideally, a positive (known responder) and negative (known LOF) control is included in the experimental set-up. If a variant is LOF, the cellular defect can be quantified and rescued by the reintroduction of the wild-type protein.

Third, the genotype and cellular phenotype of the patient should be linked to his or her clinical presentation. This is often the most challenging step as even in an appropriate ex vivo system, physiological mechanisms of the organism as a whole cannot be fully mimicked.

Remaining challenges in detecting inborn errors of immunity by whole-exome sequencing

Whole-exome sequencing – generation and bioinformatic pipeline

WES provides information on the exonic part of the genetic code only (compare with definition above). Yet, in recent years, it has become apparent that intronic regions can harbor IEI-causing mutations too. Using WGS, Starokadomskyy et al. showed that a deep-intronic mutation that damages POLA1, encoding the catalytic subunit of DNA polymerase-alpha, underlies XL reticulate pigmentary disorder (XLPDR) 32. Additionally, we learned that WES does not cover all parts of the exome equally well. This is due both to the PCR capture step inducing (for example, polymerase errors) and to hybridization difficulties (for instance, of sequences with high guanine-cytosine content). Also, IEIs caused by copy number variations of a gene, such as deletion or insertion spanning a sequence longer than an amplicon, will be missed, as only the second allele will be amplified. Therefore, the biggest pitfall of WES is that the gene or mutation of interest lies outside the exome covered by the WES kit.

Second, certain genes have duplicated in evolution and resulted in pseudogenes that have a sequence that parallels the “mother” active gene yet have reduced or lost function 33. During the annotation step, variants in a gene can incorrectly be mapped to the pseudogene, preventing the detection of mutations in the active gene. Inhibitor of nuclear factor kappa B kinase subunit gamma ( IKBKG) is an example of this difficulty known to the IEI field: the gene and its pseudogene copy ( IKBKGP) have a complex, partially overlapping genetic sequence that is not correctly mapped by WES. Therefore, functional tests and alternative sequencing approaches to capture mutations in IKBKG are necessary ( Figure 1).

Genetic hypothesis

For various IEIs, individuals with a proven pathogenic genotype have been found to be asymptomatic. This phenomenon of incomplete clinical penetrance is an important pitfall and the genetic hypothesis; especially in the AD/AR/XL hypothesis, this possibility should never be neglected. It occurs when an individual has not yet encountered the key pathogen such as Epstein-Barr virus in X-linked lymphoproliferative disease. However, it can also occur in people who have been exposed. HSE, for example, is incompletely penetrant in individuals with TIR-domain-containing adapter-inducing IFN-β (TRIF) deficiency: the pathogenic TICAM1 mutations identified in patients with HSE have been found in their “healthy” relatives, some of whom had a serologically proven history of herpes simplex virus 1 (HSV-1) infection 34. Likewise, the presence of diseased female carriers in an XL disease can evoke an AD model. This has been shown for chronic granulomatous disease 35 and Wiskott-Aldrich syndrome 36 and can be explained by skewed X-inactivation of the wild-type allele resulting in haploinsufficiency: a single functional copy of the IEI gene leads to disease.

Lastly, non-Mendelian and non-monogenic forms of inheritance must be considered also when studying WES data of patients with IEI. Seemingly unaffected parents can harbor a somatic mosaicism as has been described in severe congenital neutropenia 37. On the other hand, the possibility of revertant mosaicism explaining the milder phenotype of some patients with severe combined immune deficiency (SCID), Wiskott-Aldrich syndrome, and XL ectordermal dysplasia and immunodeficiency must be recognized 38. WES has already proven useful in detecting somatic mosaicism and has great potential for the future detection of revertant mosaicism 39. Non-monogenic etiologies of IEIs have hardly been studied by WES to date. Yet digenic inheritance has already been identified in severe congenital neutropenia 40, and Timberlake et al. have elegantly shown the potential of WES in elucidating two-locus inheritance 41. Non-monogenic inheritance should be included in future genetic hypotheses, especially in patients with complex PID phenotypes. As the detection of non-monogenic etiologies of IEIs is more complex, grouping patients with similar disease to look for genetic homogeneity will become even more important.

Variant-level interpretation

The importance of exonic variants that are not predicted to be LOF, especially synonymous variants, is increasingly being recognized. Synonymous mutations, typically with a lower CADD score than LOF variants, can affect splicing or alter the timing of cotranslational folding 42. A recent example is the identification of a synonymous pathogenic mutation that resulted in an alternative splicing of interleukin-7R (IL-7R) in a patient with SCID 43. When a variant’s AF is considered, it is important to realize that the AF varies among different ethnicities 44 and that some ethnic groups such as Native Americans are underrepresented in public exome/genome databases. Belkadi et al. therefore developed a model based on principal component analysis to identify the ethnicity of a person from his or her exome data 45. The ethnicity-specific AF should be used in the choice for a candidate variant. Also, in several IEIs, such as Nijmegen breakage syndrome, IFNγ-R1 deficiency, and FOXN1-deficient SCID, a founder mutation has been identified 4648: the same mutation on the same allelic background is found in all patients. Typically, the frequency of carriers in the founder population is much higher than that in publically available databases.

For the interpretation of the deleteriousness prediction scores, one should realize that some genes have altered reading frames or are alternatively spliced, resulting in different isoforms. WES data are “called” to the canonical transcript and the CADD score is calculated on the basis of the predicted effect of the variant on the canonical isoform. However, it has been shown that patients with mutations in HCLS1-associated protein X-1 ( HAX1), underlying severe congenital neutropenia, have a different phenotype depending on the isoform that is mutated 49. Thus, when a genetic hypothesis is made on the basis of the phenotype, it is important to consider that the impact of a variant might be different from the effect predicted computationally on the basis of the canonical transcript.

Gene-level interpretation

First, a constraint of the currently available deleteriousness prediction models, and also of the MSC and GDI, is that predictions are based on data on the full gene while the tolerance to functional genetic variations can vary per exon or region of a gene. Gussow et al. therefore introduced the sub-region Residual Variation Intolerance Score (subRVIS) 50. Further research into predicting sub-region and domain-specific deleteriousness is urgently needed.

Second, the MSC is dependent on published data. With the report of pathogenic mutations in a specific gene, the MSC may change and the gene is predicted to be less or more tolerant to mutations.

Third, genes are expressed differently throughout tissues. Therefore, it is possible that a protein encoded by the same gene has alternative, organ-specific interaction partners. This information was recently made available in the Genome-scale Integrated Analysis of gene Networks in Tissues (GIANT), which allows functional networks to be built around a core gene capturing its tissue-specific functional interactions 51. Lastly, for known IEI genes, the originally described phenotype can expand or change through WES research. After the publication of IFN-stimulated gene 15 (ISG15) deficiency as a novel genetic etiology of Mendelian susceptibility to mycobacterial disease, the phenotype was expanded to include cerebral calcifications and auto-inflammation by WES identification of additional ISG15-mutated individuals 52, 53. Thus, Mutations in known IEI genes with an (at first sight) unrelated phenotype should not per se be excluded.

Functional validation

When human disease is studied in vitro or in vivo, be it in (patient-derived) cell lines or animal models, the limitations of the system used should be considered. Fibroblasts have proven useful to study viral susceptibility. The alleles of the first reported patients with TLR3 deficiency have been studied in depth in SV40 and primary fibroblasts 54. Yet, recently, it has been shown that the impaired intrinsic immunity to HSV-1 in TLR3 deficiency can be more precisely studied in TLR3-deficient central nervous system (CNS) cells derived from human-induced pluripotent stem cells (iPSCs) 55. Hence, the field has shifted to the use of iPSC-derived CNS cells for the study of novel genetic etiologies of HSE. Also, other organ-specific IEIs with susceptibility to infection—for example, fulminant myocarditis (myocytes 56) and severe influenza (lung epithelial cells 57)—are increasingly researched in patient-specific iPSCs.

Lastly, we progressively grasp the complexity of the human immune response itself and the non-genetically encoded factors that contribute to the clinical phenotype of the patient. As an example, Israel et al. demonstrated that the only patient who developed staphylococcal disease out of eight patients with Toll/interleukin-1 receptor domain-containing adapter protein (TIRAP) deficiency was the one who lacked (staphylococcal) lipoteichoic acid–specific antibody (anti-LTA Ab) and for whom the adaptive immune response thus could not rescue the innate immune defect 58.

Future perspectives and conclusion

We are only beginning to appreciate the full potential of WES. In addition to the detection of the monogenic etiology of a specific IEI and modifier genes or variants in mutations that show incomplete penetrance, WES could be applied for detecting mutations in genes belonging to the same pathway in patients with the same disease. An example is the TLR3 pathway. Currently, all genes of the pathway have been identified individually. It is not unthinkable that, with the use of the human gene connectome and by combining a large set of exome data from patients with HSE, novel genes in the TLR3 pathway or patients with bigenic TLR3 pathway defects can be identified. This approach can also be envisioned for other “pathway” diseases such as Mendelian susceptibility to mycobacterial disease (IL-12/23R - IFNγ – JAK/STAT), chronic mucocutaneous candidiasis (CMC) (Th17 circuit), and the detection of novel circuits that are currently not associated with a phenotype.

A second route that is largely unexplored is the genetics of resistance against infection 59. Although it will be harder to prove exposure to a specific pathogen, WES can also be used to identify variants that render the host immune to a specific pathogen as has been shown for the CCR5-delta32 mutation in HIV resistance 60 and the sickle cell allele in malaria 61.

Third, the genetic understanding and detection of IEI open the way to specific therapy. A first success has been booked by the introduction of the JAK 1/2 inhibitor ruxolitinib in the treatment of CMC and autoimmunity in patients with STAT1 GOF mutations 62. Another mechanism of disease-specific interventions might be gene therapy such as that for patients with SCID due to adenosine deaminase–deficiency. In the far future, one can even foresee the introduction of CRISPR-Cas9 (clustered regularly interspaced short palindromic repeats–associated protein-9 nuclease) modification of blood cell DNA into clinic. IEIs would be the ideal candidates for CRISPR-Cas9 application as they are rare, monogenic defects that often can be importantly improved by a partial reconstitution, as is the case naturally in revertant mosaicism.

In conclusion, this review shows the unique potential of WES to detect known and novel IEI-causing mutations. We also demonstrate the challenges for the clinical and scientific use of the method, how these hurdles are currently being addressed, and major areas open to future research.

Abbreviations

AD, autosomal dominant; AF, allele frequency; AR, autosomal recessive; BAM, Binary Alignment/Map; BTK, Bruton tyrosine kinase; CADD, Combined Annotation Dependent Depletion; CMC, chronic mucocutaneous candidiasis; CNS, central nervous system; CRISPR-Cas9, clustered regularly interspaced short palindromic repeats–associated protein-9 nuclease; ExAC, Exome Aggregation Consortium; GDI, gene damaging index; GOF, gain of function; HSE, herpes simplex encephalitis; HSV-1, herpes simpex virus-1; ICA, isolated congenital asplenia; IEI, inborn errors of immunity; IFN-γR1, interferon-gamma receptor 1; IKBKG, inhibitor of nuclear factor kappa B kinase subunit gamma; iPSC, induced pluripotent stem cell; ISG15, interferon-stimulated gene 15; JAK 1/2, Janus kinase 1/2; LOF, loss of function; MAF, minor allele frequency; MSC, mutation significance cutoff; NGS, next-generation sequencing; PCR, polymerase chain reaction; PID, primary immune deficiency; PolyPhen2 score, polymorphism phenotyping v2 score; SCID, severe combined immune deficiency; SIFT score, sorting intolerant from tolerant score; TLR3, Toll-like receptor 3; VCF, variant call format; WES, whole-exome sequencing; WGS, whole-genome sequencing; XL, X-linked.

Editorial Note on the Review Process

F1000 Faculty Reviews are commissioned from members of the prestigious F1000 Faculty and are edited as a service to readers. In order to make these reviews as comprehensive and accessible as possible, the referees provide input before publication and only the final, revised version is published. The referees who approved the final version are listed with their names and affiliations but without their reports on earlier versions (any comments will already have been addressed in the published version).

The referees who approved this article are:

  • Janet Chou, Boston Children’s Hospital, Boston, MA, USA

  • Claudio Pignata, Department of Translational Medical Sciences, "Federico II" University, Naples, Italy

Funding Statement

This work was supported by the Research foundation Flanders (FWO) (G0C8517N and 11V5316N), the Jeffrey Modell foundation, Boehringer Ingelheim Fonds, National Institute of Allergy and Infectious Diseases (R01AI088364 and R37AI095983) and March of Dimes (1-FY12-440).

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 1; referees: 2 approved]

References

  • 1. Bruton OC: Agammaglobulinemia. Pediatrics. 1952;9(6):722–8. [PubMed] [Google Scholar]
  • 2. Race RR, Sanger R: Blood Groups in Man.Vol. 6th ed., Oxford: Blackwell.1975. Reference Source [Google Scholar]
  • 3. Rawlings DJ, Witte ON: Bruton's tyrosine kinase is a key regulator in B-cell development. Immunol Rev. 1994;138(1):105–19. 10.1111/j.1600-065X.1994.tb00849.x [DOI] [PubMed] [Google Scholar]
  • 4. Meyts I, Bosch B, Bolze A, et al. : Exome and genome sequencing for inborn errors of immunity. J Allergy Clin Immunol. 2016;138(4):957–69. 10.1016/j.jaci.2016.08.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Liu L, Li Y, Li S, et al. : Comparison of next-generation sequencing systems. J Biomed Biotechnol. 2012;2012:251364. 10.1155/2012/251364 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Warr A, Robert C, Hume D, et al. : Exome Sequencing: Current and Future Perspectives. G3 (Bethesda). 2015;5(8):1543–50. 10.1534/g3.115.018564 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 7. Conley ME, Casanova JL: Discovery of single-gene inborn errors of immunity by next generation sequencing. Curr Opin Immunol. 2014;30:17–23. 10.1016/j.coi.2014.05.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Gallo V, Dotta L, Giardino G, et al. : Diagnostics of Primary Immunodeficiencies through Next-Generation Sequencing. Front Immunol. 2016;7:466. 10.3389/fimmu.2016.00466 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 9. Ghosh A, Schlecht H, Heptinstall LE, et al. : Diagnosing childhood-onset inborn errors of metabolism by next-generation sequencing. Arch Dis Child. 2017;102(11):1019–1029. 10.1136/archdischild-2017-312738 [DOI] [PubMed] [Google Scholar]; F1000 Recommendation
  • 10. Cock PJ, Fields CJ, Goto N, et al. : The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Res. 2010;38(6):1767–71. 10.1093/nar/gkp1137 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Li H, Handsaker B, Wysoker A, et al. : The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25(16):2078–9. 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. 1000 Genomes Project Consortium, Auton A, Brooks LD, et al. : A global reference for human genetic variation. Nature. 2015;526(7571):68–74. 10.1038/nature15393 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Krøigård AB, Thomassen M, Lænkholm A, et al. : Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data. PLoS One. 2016;11(3):e0151664. 10.1371/journal.pone.0151664 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 14. Lek M, Karczewski KJ, Minikel EV, et al. : Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536(7616):285–91. 10.1038/nature19057 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 15. Bolze A, Mahlaoui N, Byun M, et al. : Ribosomal protein SA haploinsufficiency in humans with isolated congenital asplenia. Science. 2013;340(6135):976–8. 10.1126/science.1234864 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 16. Bittles A: Consanguinity and its relevance to clinical genetics. Clin Genet. 2001;60(2):89–98. 10.1034/j.1399-0004.2001.600201.x [DOI] [PubMed] [Google Scholar]
  • 17. Jouanguy E, Lamhamedi-Cherradi S, Lammas D, et al. : A human IFNGR1 small deletion hotspot associated with dominant susceptibility to mycobacterial infection. Nat Genet. 1999;21(4):370–8. 10.1038/7701 [DOI] [PubMed] [Google Scholar]
  • 18. Flanagan SE, Haapaniemi E, Russell MA, et al. : Activating germline mutations in STAT3 cause early-onset multi-organ autoimmune disease. Nat Genet. 2014;46(8):812–4. 10.1038/ng.3040 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 19. Kumar P, Henikoff S, Ng PC: Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc. 2009;4(7):1073–81. 10.1038/nprot.2009.86 [DOI] [PubMed] [Google Scholar]
  • 20. Adzhubei IA, Schmidt S, Peshkin L, et al. : A method and server for predicting damaging missense mutations. Nat Methods. 2010;7(4):248–9. 10.1038/nmeth0410-248 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Kircher M, Witten DM, Jain P, et al. : A general framework for estimating the relative pathogenicity of human genetic variants. Nat Genet. 2014;46(3):310–5. 10.1038/ng.2892 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 22. Itan Y, Shang L, Boisson B, et al. : The mutation significance cutoff: gene-level thresholds for variant predictions. Nat Methods. 2016;13(2):109–10. 10.1038/nmeth.3739 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Sulem P, Helgason H, Oddson A, et al. : Identification of a large set of rare complete human knockouts. Nat Genet. 2015;47(5):448–52. 10.1038/ng.3243 [DOI] [PubMed] [Google Scholar]; F1000 Recommendation
  • 24. Shyr C, Tarailo-Graovac M, Gottlieb M, et al. : FLAGS, frequently mutated genes in public exomes. BMC Med Genomics. 2014;7:64. 10.1186/s12920-014-0064-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Itan Y, Shang L, Boisson B, et al. : The human gene damage index as a gene-level approach to prioritizing exome variants. Proc Natl Acad Sci U S A. 2015;112(44):13615–20. 10.1073/pnas.1518646112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Stenson PD, Mort M, Ball EV, et al. : The Human Gene Mutation Database: towards a comprehensive repository of inherited mutation data for medical research, genetic diagnosis and next-generation sequencing studies. Hum Genet. 2017;136(6):665–77. 10.1007/s00439-017-1779-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Landrum MJ, Lee JM, Benson M, et al. : ClinVar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res. 2016;44(D1):D862–8. 10.1093/nar/gkv1222 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 28. Moens L, Schaballie H, Bosch B, et al. : AD Hyper-IgE Syndrome Due to a Novel Loss-of-Function Mutation in STAT3: a Diagnostic Pursuit Won by Clinical Acuity. J Clin Immunol. 2017;37(1):12–7. 10.1007/s10875-016-0351-9 [DOI] [PubMed] [Google Scholar]
  • 29. Itan Y, Zhang SY, Vogt G, et al. : The human gene connectome as a map of short cuts for morbid allele discovery. Proc Natl Acad Sci U S A. 2013;110(14):5558–63. 10.1073/pnas.1218167110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Wu C, Orozco C, Boyer J, et al. : BioGPS: an extensible and customizable portal for querying and organizing gene annotation resources. Genome Biol. 2009;10(11):R130. 10.1186/gb-2009-10-11-r130 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Uhlén M, Fagerberg L, Hallström BM, et al. : Proteomics. Tissue-based map of the human proteome. Science. 2015;347(6220):1260419. 10.1126/science.1260419 [DOI] [PubMed] [Google Scholar]; F1000 Recommendation
  • 32. Starokadomskyy P, Gemelli T, Rios JJ, et al. : DNA polymerase-α regulates the activation of type I interferons through cytosolic RNA:DNA synthesis. Nat Immunol. 2016;17(5):495–504. 10.1038/ni.3409 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 33. Tutar Y: Pseudogenes. Comp Funct Genomics. 2012;2012:424526. 10.1155/2012/424526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Sancho-Shimizu V, Pérez de Diego R, Lorenzo L, et al. : Herpes simplex encephalitis in children with autosomal recessive and dominant TRIF deficiency. J Clin Invest. 2011;121(12):4889–902. 10.1172/JCI59259 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Romera Modamio G, Martín Mateos MA, González Enseñat MA, et al. : Chronic granulomatous disease: a case study of a symptomatic carrier. J Investig Allergol Clin Immunol. 1997;7(1):57–61. [PubMed] [Google Scholar]
  • 36. Lutskiy MI, Sasahara Y, Kenney DM, et al. : Wiskott-Aldrich syndrome in a female. Blood. 2002;100(8):2763–8. 10.1182/blood-2002-02-0388 [DOI] [PubMed] [Google Scholar]
  • 37. Benson KF, Horwitz M: Possibility of somatic mosaicism of ELA2 mutation overlooked in an asymptomatic father transmitting severe congenital neutropenia to two offspring. Br J Haematol. 2002;118(3):923; author reply 923–4. 10.1046/j.0007-1048.2002.03648.x [DOI] [PubMed] [Google Scholar]
  • 38. Wada T, Candotti F: Somatic mosaicism in primary immune deficiencies. Curr Opin Allergy Clin Immunol. 2008;8(6):510–4. 10.1097/ACI.0b013e328314b651 [DOI] [PubMed] [Google Scholar]
  • 39. Omoyinmi E, Melo Gomes S, Standing A, et al. : Brief Report: whole-exome sequencing revealing somatic NLRP3 mosaicism in a patient with chronic infantile neurologic, cutaneous, articular syndrome. Arthritis Rheumatol. 2014;66(1):197–202. 10.1002/art.38217 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Germeshausen M, Zeidler C, Stuhrmann M, et al. : Digenic mutations in severe congenital neutropenia. Haematologica. 2010;95(7):1207–10. 10.3324/haematol.2009.017665 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Timberlake AT, Choi J, Zaidi S, et al. : Two locus inheritance of non-syndromic midline craniosynostosis via rare SMAD6 and common BMP2 alleles. eLife. 2016;5: pii: e20125. 10.7554/eLife.20125 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 42. Kimchi-Sarfaty C, Oh JM, Kim IW, et al. : A "silent" polymorphism in the MDR1 gene changes substrate specificity. Science. 2007;315(5811):525–8. 10.1126/science.1135308 [DOI] [PubMed] [Google Scholar]; F1000 Recommendation
  • 43. Gallego-Bustos F, Gotea V, Ramos-Amador JT, et al. : A Case of IL-7R Deficiency Caused by a Novel Synonymous Mutation and Implications for Mutation Screening in SCID Diagnosis. Front Immunol. 2016;7:443. 10.3389/fimmu.2016.00443 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 44. Shearer AE, Eppsteiner RW, Booth KT, et al. : Utilizing ethnic-specific differences in minor allele frequency to recategorize reported pathogenic deafness variants. Am J Hum Genet. 2014;95(4):445–53. 10.1016/j.ajhg.2014.09.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Belkadi A, Pedergnana V, Cobat A, et al. : Whole-exome sequencing to analyze population structure, parental inbreeding, and familial linkage. Proc Natl Acad Sci U S A. 2016;113(24):6713–8. 10.1073/pnas.1606460113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Varon R, Seemanova E, Chrzanowska K, et al. : Clinical ascertainment of Nijmegen breakage syndrome (NBS) and prevalence of the major mutation, 657del5, in three Slav populations. Eur J Hum Genet. 2000;8(11):900–2. 10.1038/sj.ejhg.5200554 [DOI] [PubMed] [Google Scholar]
  • 47. Sologuren I, Boisson-Dupuis S, Pestano J, et al. : Partial recessive IFN-γR1 deficiency: genetic, immunological and clinical features of 14 patients from 11 kindreds. Hum Mol Genet. 2011;20(8):1509–23. 10.1093/hmg/ddr029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Adriani M, Martinez-Mir A, Fusco F, et al. : Ancestral founder mutation of the nude ( FOXN1) gene in congenital severe combined immunodeficiency associated with alopecia in southern Italy population. Ann Hum Genet. 2004;68(Pt 3):265–8. 10.1046/j.1529-8817.2004.00091.x [DOI] [PubMed] [Google Scholar]
  • 49. Germeshausen M, Grudzien M, Zeidler C, et al. : Novel HAX1 mutations in patients with severe congenital neutropenia reveal isoform-dependent genotype-phenotype associations. Blood. 2008;111(10):4954–7. 10.1182/blood-2007-11-120667 [DOI] [PubMed] [Google Scholar]
  • 50. Gussow AB, Petrovski S, Wang Q, et al. : The intolerance to functional genetic variation of protein domains predicts the localization of pathogenic mutations within genes. Genome Biol. 2016;17:9. 10.1186/s13059-016-0869-4 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 51. Greene CS, Krishnan A, Wong AK, et al. : Understanding multicellular function and disease with human tissue-specific networks. Nat Genet. 2015;47(6):569–76. 10.1038/ng.3259 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 52. Bogunovic D, Byun M, Durfee LA, et al. : Mycobacterial disease and impaired IFN-γ immunity in humans with inherited ISG15 deficiency. Science. 2012;337(6102):1684–8. 10.1126/science.1224026 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 53. Zhang X, Bogunovic D, Payelle-Brogard B, et al. : Human intracellular ISG15 prevents interferon-α/β over-amplification and auto-inflammation. Nature. 2015;517(7532):89–93. 10.1038/nature13801 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 54. Zhang SY, Jouanguy E, Ugolini S, et al. : TLR3 deficiency in patients with herpes simplex encephalitis. Science. 2007;317(5844):1522–7. 10.1126/science.1139522 [DOI] [PubMed] [Google Scholar]; F1000 Recommendation
  • 55. Lafaille FG, Pessach IM, Zhang SY, et al. : Impaired intrinsic immunity to HSV-1 in human iPSC-derived TLR3-deficient CNS cells. Nature. 2012;491(7426):769–73. 10.1038/nature11583 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 56. Belkaya S, Kontorovich AR, Byun M, et al. : Autosomal Recessive Cardiomyopathy Presenting as Acute Myocarditis. J Am Coll Cardiol. 2017;69(13):1653–65. 10.1016/j.jacc.2017.01.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57. Ciancanelli MJ, Huang SX, Luthra P, et al. : Infectious disease. Life-threatening influenza and impaired interferon amplification in human IRF7 deficiency. Science. 2015;348(6233):448–53. 10.1126/science.aaa1578 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58. Israel L, Wang Y, Bulek K, et al. : Human Adaptive Immunity Rescues an Inborn Error of Innate Immunity. Cell. 2017;168(5):789–800.e10. 10.1016/j.cell.2017.01.039 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 59. Withrock IC, Anderson SJ, Jefferson MA, et al. : Genetic diseases conferring resistance to infectious diseases. Genes & Diseases. 2015;2(3):247–54. 10.1016/j.gendis.2015.02.008 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 60. Novembre J, Galvani AP, Slatkin M: The geographic spread of the CCR5 Delta32 HIV-resistance allele. PLoS Biol. 2005;3(11):e339. 10.1371/journal.pbio.0030339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61. Ferreira A, Marguti I, Bechmann I, et al. : Sickle hemoglobin confers tolerance to Plasmodium infection. Cell. 2011;145(3):398–409. 10.1016/j.cell.2011.03.049 [DOI] [PubMed] [Google Scholar]; F1000 Recommendation
  • 62. Higgins E, Al Shehri T, McAleer MA, et al. : Use of ruxolitinib to successfully treat chronic mucocutaneous candidiasis caused by gain-of-function signal transducer and activator of transcription 1 (STAT1) mutation. J Allergy Clin Immunol. 2015;135(2):551–3. 10.1016/j.jaci.2014.12.1867 [DOI] [PubMed] [Google Scholar]; F1000 Recommendation

Articles from F1000Research are provided here courtesy of F1000 Research Ltd

RESOURCES