The Role of Exome Sequencing in Newborn Screening for Inborn Errors of Metabolism

Aashish N Adhikari; Renata C Gallagher; Yaqiong Wang; Robert J Currier; George Amatuni; Laia Bassaganyas; Flavia Chen; Kunal Kundu; Mark Kvale; Sean D Mooney; Robert L Nussbaum; Savanna S Randi; Jeremy Sanford; Joseph T Shieh; Rajgopal Srinivasan; Uma Sunderam; Hao Tang; Dedeepya Vaka; Yangyun Zou; Barbara A Koenig; Pui-Yan Kwok; Neil Risch; Jennifer M Puck; Steven E Brenner

doi:10.1038/s41591-020-0966-5

. Author manuscript; available in PMC: 2022 Jan 29.

Published in final edited form as: Nat Med. 2020 Aug 10;26(9):1392–1397. doi: 10.1038/s41591-020-0966-5

The Role of Exome Sequencing in Newborn Screening for Inborn Errors of Metabolism

Aashish N Adhikari ^1,^2,^*, Renata C Gallagher ^2,³, Yaqiong Wang ¹, Robert J Currier ³, George Amatuni ³, Laia Bassaganyas ², Flavia Chen ^2,⁴, Kunal Kundu ^1,⁵, Mark Kvale ², Sean D Mooney ⁶, Robert L Nussbaum ^2,⁷, Savanna S Randi ⁸, Jeremy Sanford ⁸, Joseph T Shieh ^2,³, Rajgopal Srinivasan ⁵, Uma Sunderam ⁵, Hao Tang ⁹, Dedeepya Vaka ², Yangyun Zou ¹, Barbara A Koenig ^2,⁴, Pui-Yan Kwok ^2,^10,¹¹, Neil Risch ^2,¹², Jennifer M Puck ^2,^3,^10,^13,^16,^*, Steven E Brenner ^1,^2,^14,^15,^16,^*

¹Department of Plant and Microbial Biology, University of California, Berkeley, California

²Institute for Human Genetics, University of California, San Francisco, California

³Department of Pediatrics, University of California, San Francisco, California

⁴Program in Bioethics, University of California, San Francisco, California

⁵Innovation Labs, Tata Consultancy Services, Hyderabad, India

⁶Department of Biomedical Informatics and Medical Education, University of Washington, Seattle, Washington

⁷Invitae, San Francisco, California

⁸Department of Molecular, Cellular and Developmental Biology, Center for the Molecular Biology of RNA, UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California

⁹Genetic Disease Screening Program, California Department of Public Health, Richmond, California

¹⁰Cardiovascular Research Institute, University of California, San Francisco, California

¹¹Department of Dermatology, University of California, San Francisco, California

¹²Department of Epidemiology and Biostatistics, University of California San Francisco, California

¹³UCSF Benioff Children’s Hospital, Division of Allergy, Immunology and Blood and Marrow Transplantation, San Francisco, California

¹⁴Center for Computational Biology, University of California, Berkeley, California

¹⁵Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, California

¹⁶These authors contributed equally and jointly supervised the work

Author contributions

R.J.C., R.L.N, B.A.K., P-Y.K., N.R., J.M.P, S.E.B. conceived and designed the study. A.N.A., R.J.C., G.A., L.B., F.C., M.K., S.S.R., J.S., U.S., H.T., D.V., P-Y.K., J.M.P. acquired data. A.N.A., R.C.G., Y.W., R.J.C., G.A., K.K., M.K., S.R., R.S., U.S., Y.Z., N.R., J.M.P., S.E.B. analyzed data. A.N.A., R.C.G., Y.W., R.J.C., M.K., S.S.R., J.T.S, R.S., H.T., N.R., J.M.P., S.E.B interpreted data. A.N.A., Y.W., U.S. created software. A.N.A. wrote the first draft of the manuscript. R.C.G., Y.W., R.J.C., F.C., M.K., S.D.M., R.L.N., J.T.S., R.S., H.T., B.A.K., P-Y.K., N.R., J.M.P, S.E.B. provided critical revisions. All authors approved the final version of the manuscript.

jennifer.puck@ucsf.edu, brenner@compbio.berkeley.edu, anadhikari@berkeley.edu

PMCID: PMC8800147 NIHMSID: NIHMS1766172 PMID: 32778825

Abstract

Public health newborn screening (NBS) programs provide population-scale ascertainment of rare, treatable conditions that require urgent intervention. Tandem mass spectrometry (MS/MS) is currently used to screen newborns for a panel of rare inborn errors of metabolism (IEMs).^1–4 The NBSeq project evaluated whole exome sequencing (WES) as an innovative methodology for NBS. We obtained archived residual dried blood spots (DBS) and data for nearly all IEM cases from the 4.5 million infants born in California between mid-2005 and 2013, and from some infants who screened positive by MS/MS, but were unaffected upon follow-up testing. WES had an overall sensitivity of 88% and specificity of 98.4%, compared to 99.0% and 99.8%, respectively for MS/MS, although effectiveness varied among individual IEMs. Thus, WES alone was insufficiently sensitive or specific to be a primary screen for most NBS IEMs. However, as a secondary test for infants with abnormal MS/MS screens, WES could reduce false positive results, facilitate timely case resolution, and in some instances even suggest a more appropriate or specific diagnosis than that initially obtained. This study represents the largest-to-date sequencing effort of an entire population of IEM-affected cases, allowing unbiased assessment of current capabilities of WES as a tool for population screening.

Effective population-level NBS must rapidly identify the few individuals at risk of disease with extraordinary sensitivity, high specificity and limited manual review. In California, NBS for 48 different IEMs performed with MS/MS^1–4 achieved 99.0% sensitivity and >99.8% specificity.⁵ For some disorders, MS/MS nonetheless has a low positive predictive value, and results may be non-specific (Fig. 1).

Fig. 1. — Among 1,254 cases with positive GA-I MS/MS screen (California, July, 2005 through December, 2013), only 130 were ultimately diagnosed with any IEM. Of these 130, only 43 actually had a diagnosis of GA-I, while the rest had other IEMs, including medium chain acyl-CoA dehydrogenase deficiency (MCADD), long-chain 3-hydroxy acyl-CoA dehydrogenase deficiency (LCHADD), methylmalonic acidemia (MMA), carnitine palmitoyl transferase deficiency type II (CPT2D), medium/short-chain 3-hydroxyacyl-CoA dehydrogenase deficiency (M/SCHADD), multiple acyl-CoA dehydrogenase deficiency (MADD), short chain acyl-CoA dehydrogenase deficiency (SCADD), carnitine-acylcarnitine translocase deficiency (CACTD), other fatty acid oxidation disorder (FAOD). The GA-I MS/MS screen is based on elevations of glutaryl carnitine (C5DC), along with informative ratios. During the early part of the study, a derivatized method was used, in which hydroxydecanoyl carnitine (C10OH) had the same mass to charge ratio as C5DC. After the methodology was switched to use underivatized metabolites, it became hydroxyhexanoyl carnitine (C6OH) that was coincident with C5DC.

Genomic sequencing, now commonly used for diagnosis of rare disorders,^6–10 has been recommended for nearly all seriously ill children in intensive care units,^8–10 proposed for all newborns to personalize their medical care,¹¹ and marketed for screening newborns.¹² Yet, population-scale studies to establish performance characteristics of sequencing for NBS have not been reported. NBS IEMs provide an ideal model for evaluating the role of sequencing in population screening because most are Mendelian disorders affecting well-understood biochemical pathways, and many have been studied extensively. Moreover, sensitivity and specificity of sequence-based detection of IEMs can be directly compared to those of current MS/MS screening. Studying WES to identify IEMs already included in NBS can also suggest its potential utility for further treatable disorders not amenable to detection by MS/MS.¹³

We quantified the performance of WES, were it to have been the primary NBS for IEMs in California. Between July, 2005, and December, 2013, the Genetic Disease Screening Program (GDSP) of the California Department of Public Health (CDPH) screened DBS from nearly 4.5 million neonates for 48 IEMs using a multiplex MS/MS platform. We obtained a comprehensive set of 1,728 residual, de-identified, archived DBS representing all cases with IEMs, as well as a select set that screened positive, but later were determined to be unaffected. We performed WES for 1,416 DBS¹⁴ and determined that DBS exomes passing quality controls (Methods) were comparable to exomes from fresh blood (Extended Data Fig. 1–4). We analyzed 1,190 high-quality exomes from 805 IEM-affected cases and 385 MS/MS false positives (Table 1); exomes were divided into validation and test sets, 178 and 1,012 cases, respectively (Table 1).

Table 1:

Distribution of analyzed cases by study set, disorder and race/ethnicity.

	Total cases (N=1,190)	Affected (N=805)	Unaffected false positives by MS/MS (N=385)
Study set
Validation set	178	131	47
Test set	1,012	674	338
Categories of disorder
Fatty acid oxidation disorders		334
Organic acid disorders		237
Amino acid disorders		234
Race/ethnicity ^†
Hispanic	454	330	124
Non-Hispanic White	371	226	145
East Asian	79	57	22
African American	63	42	21
Other	223	150	73

Open in a new tab

^†

Race/ethnicity was based on nursery-reported categories from the newborn dried blood spot forms.

We analyzed variants within an “exome slice”¹⁵ of 78 genes associated with the 48 IEMs ascertained by NBS in California (Methods, Supplementary Table 1). We systematically explored pipeline parameters on the validation set to derive a customized, robust, sensitive pipeline for reporting potential disease-causing variants in IEM genes in a screening context (Fig. 2a; Methods). The automated pipeline had three arms. The curation arm reported all variants curated as pathogenic in existing disease databases (HGMD¹⁶ or ClinVar¹⁷) with population minor allele frequency (MAF) <0.1%, and additional potentially pathogenic less rare variants that we curated (see Methods and Supplementary Table 2). The second arm reported all rare (MAF <0.5%) missense and nonsense variants, and other rare variants predicted to be protein-altering or damaging. The third arm reported inferred copy number variants (CNVs). Usually, phase could not be determined, so cases with two reportable variants were designated “exome positive” for their associated IEMs. Exome results were compared to the clinical data collected by GDSP.¹⁸

Fig. 2. — a) Diagram of pipeline for analysis. For each exome, the pipeline considered only variants with genotype quality (GQ) >15 that impacted principal transcripts (per annotation of principal and alternative splice isoforms, APPRIS³⁵) for 78 genes associated with currently screened inborn errors of metabolism (IEMs). Variants were identified through any of three arms. Left, predicted impact, included variants with population MAF <0.5% in both 1000 Genomes²⁰ and ExAC²¹ and i) predicted protein alteration (stop gain or loss, frameshift insertion or deletion, alteration of canonical splice motif, nonsynonymous missense, in-frame insertion or deletion, start gain or loss); ii) Combined Annotation Dependent Depletion (CADD) score³⁶ >23; and iii) predicted splicing variants (determined by database of splicing consensus single nucleotide variant, dbscSNV, labeled RF)³⁷ with meta prediction score >0.5. Center, curation, included variants with MAF <0.1% annotated as disease mutation (DM) or questionable disease mutation (DM?) in HGMD¹⁶ or as pathogenic/likely pathogenic by ClinVar¹⁷ with at least 1 expert review. Among HGMD DM/DM? or ClinVar pathogenic/likely pathogenic variants with MAF ≥0.1% (n=60), 19 were considered reportable and 41 were excluded (Supplementary Table 2). Right, predicted CNV (XHMM³⁸) applied to IEM genes except three with common intragenic deletions (*ETFA, HCFC1, PRODH*). For variants in X chromosome genes(*), the MAF threshold was adjusted to 0.02%. For genes with ≥1 heterozygous variant, local phasing was performed when reads overlapped the multiple variant positions. Finally, variants annotated as benign in ClinVar with at least 2 review stars were excluded. From the list of resulting variants, the corresponding genes with ≥1 homozygous or ≥2 heterozygous variants were reported. Exceptions(**) were X-linked ornithine transcarbamylase, for which ≥1 flagged, heterozygous *OTC* gene variant was reported, since heterozygous females can display a clinical phenotype;³⁰ and methionine adenosyltransferase-1A, for which heterozygous variant *MAT1A* NP_000420.1:p.Arg264His causes autosomal dominant disease.³⁹ For each case, a single gene was chosen as the likely disease causing gene by a score incorporating IEM prevalence and variant severity (Methods).

b) Contributions from components of the pipeline in identifying variants in the test set of exomes from 674 IEM-affected infants. Of the 571 cases correctly identified by the pipeline, 360 had only rare HGMD/ClinVar curated variants, while the rest required additional curation or predictions. Of the 103 cases missed by exomes, the pipeline reported a single autosomal heterozygous variant in 53 in a gene consistent with the disorder, and no variants in 50.

c) Sensitivity of exome pipeline by disorder for the 674 IEM-affected cases from the test set. For each IEM, the numbers of individuals correctly identified by exomes are shown by green bars and the number missed by exomes shown by brown bars, with sensitivity for each IEM disorder shown in parentheses. Left, core IEMs screened by California; right, secondary/add-on IEMs. IEMs sharing a common causative gene were not distinguished by the exome predictions alone. These included TFP and LCHAD (blue shading), PKU and hyperphenylalaninemia (pink shading), and the various MMA subtypes (yellow shading).

As a primary screen, the pipeline correctly identified 571 of 674 IEM-affected cases in our test set as having a potentially pathogenic IEM genotype (Fig. 2b, Supplementary Table 3). Overall sensitivity, calculated as the ratio of exome-positives whose gene matched the reviewed diagnosis to IEM-affected cases, weighted by the prevalence of each IEM in California,¹⁹ was 88%. For the clinically confident subset of cases in the test set (Methods), the pipeline achieved 93.7% overall sensitivity. Our near-complete ascertainment of IEM-positive cases established this as a population-scale benchmark for WES sensitivity. Of the 571 cases, 360 (63%) had only pathogenic rare variants from databases,^20,21 and 17 (3%) had at least one less-rare, curated variant. The remaining 194 (34%) carried at least one previously unannotated, rare, potentially pathogenic variant: 128 nonsynonymous missense; 51 nonsense, indel, or canonical splice site; 12 splice-altering; and 3 CNVs. While 50 of the 103 affected cases not identified by the pipeline had a single reportable heterozygous variant, half had no reportable variants found in any gene associated with the clinical diagnosis.

Eleven cases were exome-positive for genes unrelated to their IEMs (Methods), thus, producing an overall specificity for WES of 98.4%. This would extrapolate to ~8,000 false positives among the half million annual births in California, far more than the actual 1,362 MS/MS false-positive cases in 2015. An alternative pipeline that reported only rare curated variants as pathogenic (Extended Data Fig. 5h) had high specificity, 99.4%, but unacceptably low sensitivity, 55%.

Sensitivity of our screening pipeline varied by disorder, generally performing better for less rare disorders. Although sensitivity was 100% for nearly a third of the NBS IEMs, statistical confidence, particularly for very rare IEMs, would require more data. (Fig. 2c; Supplementary Table 4).

Across all IEMs currently screened for by MS/MS, WES had insufficient sensitivity and specificity for sole, primary use as a replacement for MS/MS. Sensitivity could not be improved by trade-offs against specificity, as the missed cases lacked any pair of rare missense or predicted damaging variants in the relevant genes.

Our study also illuminated the genetic landscape of IEMs in California. From the 571 exome true positive cases in the test set, the pipeline reported a total of 1,157 variants, comprising 507 distinct variants: 343 missense (68%), 47 frameshifting indels (9%), 41 nonsense (8%), 32 splice site (6%), 19 predicted-damaging intronic nucleotide substitutions near a splice site (4%), and 17 non-frameshifting indels (3%). The remaining 8 (2%) were large deletions and curated variants (Extended Data Fig. 6a). Of these 507 variants, 195 (38%) were absent from ExAC,²¹ and 162 (32%) were absent from both HGMD¹⁶ and ClinVar¹⁷ (Supplementary Table 5); 384 of the 507 (76%) occurred in only one case.

We further investigated selected exome false negatives by performing WGS for 8 cases lacking 2 reportable WES variants (Methods). Large CNVs cannot be identified reliably from WES; indeed, WGS revealed one individual with isovaleric acidemia (IVA) having a deletion of the first 3 exons of the IVD gene, and another with a complex pattern of reduced IVD gene coverage (Extended Data Fig. 7). However, our targeted manual inspection of the read alignment and coverage of relevant genes in the remaining 6 failed to suggest a widespread role for CNVs in other IEMs (Methods), though complex structural rearrangements may remain undetected by WGS.

Additional WES false negatives could be due to limitations in variant interpretation. One individual with medium-chain acyl-CoA dehydrogenase deficiency (MCADD) had only one reportable variant in the ACADM gene (NP_000007.1:p.Tyr67His). A second variant 14 bases 5’ from the splice site (NM_000016.4:c.388–14A>G) had insufficient evidence for pathogenicity (Extended Data Fig. 8a). However, an experimental splicing reporter assay revealed that this variant switched splicing to a new 3’ site, extending exon 6 by 13 nucleotides, creating a premature termination codon (Extended Data Fig. 8b–e, Methods).

We next considered WES as a reflex follow-up test for MS/MS positive cases before further biochemical/clinical studies, to reduce referral of unaffected cases. For 6 IEMs with challenges in MS/MS screening, we analyzed all IEM affected and unaffected cases in validation and test sets with positive MS/MS, but with no reportable WES variants (Table 2, Extended Data Fig. 9). Among all MS/MS positives, the negative predictive value (NPV), or proportion of unaffected cases among those with no reportable WES variant, ranged from 33%, for multiple acyl-CoA dehydrogenase deficiency/glutaric acidemia type II (MADD/GA-II) to 100% for LCHADD (Table 2). For very long-chain acyl-CoA dehydrogenase deficiency (VLCADD) and maple syrup urine disease (MSUD), one affected case of each would have been wrongly excluded due to no reportable WES variant, while 48 and 16 false positives, respectively, would have avoided further follow-up, provided WES could be performed rapidly.

Table 2:

Performance of WES as a follow-up test after positive MS/MS for 6 selected IEMs, assuming a case would not be referred for additional evaluation without at least one reportable variant identified for that IEM

Abnormal MS/MS screen result reported for:^*	Number of MS/MS false positives	Number of exome false negatives (missed cases)	Number of exome true negatives	Specificity % (95% CI) (reduction in false positives)	NPV % (95% CI)
VLCADD	108	1	48	44.4 (34.9–54.3)	98.0 (89.1–100)
LCHADD	72	0	68	94.4 (86.4–98.5)	100.0 (94.7–100.9)
PKU	27	2	6	22.2 (8.6–42.3)	75.0 (34.9–96.8)
IVA	16	4	15	93.8 (69.8–99.8)	78.9 (54.3–93.9)
MSUD	16	1	16	100.0 (79.4–100)	94.1 (71.3–99.9)
GA-II	1	2	1	100.0 (2.5–100)	33.3 (0.8–90.6)
All of above	240	10	154	64.2 (57.7–70.2)	93.9 (80.0–97.0)

Open in a new tab

Abbreviations: GA-II, glutaric acidemia, type II; IVA, isovaleric acidemia; LCHADD: long chain 3-hydroxyacyl-CoA dehydrogenase deficiency; MSUD: maple syrup urine disease; PKU, phenylketonuria; VLCADD: very long chain acyl-CoA dehydrogenase deficiency. The two-sided Clopper Pearson CIs were calculated using the "exactci" function from R package PropCIs (https://github.com/shearer/PropCIs).

In 12 cases, the initial clinical diagnosis in the GDSP database was not consistent with the gene reported by WES, but the final disorder assignment from our subsequent clinical review was concordant with WES. Often, the disorders in question had overlapping MS/MS analyte profiles (Supplementary Table 6). For example, an individual clinically reported to have IVA, had a rare, homozygous, missense variant in the ACADSB gene encoding short/branched-chain acyl-CoA dehydrogenase (NM_001609.3:c.1165A>G; NP_001600.1:p.Met389Val[precursor protein], MAF 0.002%), suggesting 2-methylbutyryl-CoA dehydrogenase (2MBCD) deficiency. This variant causes skipping of ACADSB exon 10,²² leading to elevated 2-methylbutyrylcarnitine, which has the same MS/MS signature as isovalerylcarnitine, an analyte elevated in IVA.¹³

Another case diagnosed with MADD/GA-II had a homozygous missense variant in ETHE1, encoding a mitochondrial sulfur dioxygenase (NM_014297.4:c.488G>A; NP_055112.2:p.Arg163Gln, MAF 0.003%). This variant caused ethylmalonic encephalopathy (EE),²³ which may be difficult to distinguish biochemically from MADD/GA-II.²⁴

A third individual, MS/MS positive for both VLCADD and MCADD, had been diagnosed as unaffected upon follow-up, with notes suggesting VLCADD carrier status. WES revealed 2 missense variants in the electron transfer flavoprotein dehydrogenase gene ETFDH (NM_004453.3:c.250G>A; NP_004444.2:p.Ala84Thr, MAF 0.02%, and NM_004453.3:c.524G>A; p.Arg175His, not observed in ExAC). These variants, previously observed as compound heterozygous in MADD/GA-II patients with onset of symptoms as teenagers or adults,²⁵ reduce protein activity; sibling studies indicated that NP_004444.2:p.Ala84Thr is penetrant (Zhi-Ying Wu, personal communication). Notably, all pertinent MS/MS analyte values for VLCADD carriers were consistent with MADD/GA-II (Supplementary Fig. 1).

The WES false positives in our study could have been due to lack of predictive accuracy for uncharacterized, rare, but benign variants, as well as co-localization of two reported variants on the same, rather than opposite, chromosomes. Long-read sequencing technologies could mitigate phasing limitations in the future.

Of greater concern for screening, however, were the 12% exome false negatives. Upon manual inspection of all genes relevant to the IEM diagnosis in each case in the validation set, observed variants were either common or not protein-altering, with low likelihood of pathogenicity. Inadequate exome coverage may have limited our ability to detect deletions, but might be improved by using optimized gene panels.^26,27 Areas of poor coverage included exon 1 of MCCC2 (Extended Data Fig. 4), a locus of known pathogenic variants. Identifying pathogenic variants that are not clearly protein-altering remains challenging.

Cases could also be missed owing to incomplete knowledge of genetic disease. We analyzed 78 proven IEM-associated genes, but other relevant genes may be yet undiscovered; 12 of 19 cases of methylmalonic acidemia (MMA) had no WES variants detected (Fig. 2c), similar to a prior report.²⁸ Moreover, genetic mechanism may be dominant, epistatic, epigenetic, or oligogenic with synergistic heterozygosity.²⁹ Finally, inherent biological factors limit disease prediction from sequence alone; for example, heterozygous OTC variants in females manifest variable penetrance depending on X chromosome inactivation patterns in hepatocytes.³⁰

Nonetheless, following identification of infants with abnormal analytes on MS/MS by NBS, the metabolic centers to which these infants are referred may find WES invaluable for suggesting a definitive diagnosis by identifying pathogenic variants in IEM-associated genes. Our examples of discordance of recorded diagnoses and WES findings indicate that deep sequencing could facilitate rapid and precise clinical resolution for newborns with positive MS/MS on NBS, as previously suggested.³¹

WES turnaround time and cost, critical concerns for NBS programs, were not addressed in our research study, in which archived batches of samples were processed together. Future WGS could be faster; diagnostic WGS for critically ill infants have ranged from 2–3 weeks⁹ to <24 hours.^32,33 The modest caseload of positive NBS screens for IEMs (0.3% of births) suggests that deep sequencing could become sufficiently economical and rapid to provide effective information as a secondary test after a positive MS/MS result. Clinical considerations for individual IEMs would dictate whether urgent referral after a positive MS/MS screen is required, or whether referral could await sequencing results.

The high specificity of WES using curated variants prompts its consideration for well-characterized, treatable Mendelian disorders not amenable to MS/MS, but for which screening could be justified if a suitable DNA-based test were available. However, suitability of WES or WGS must be evaluated for each disorder. Though sensitivity of WES alone may be too low to meet standard criteria for NBS,³⁴ sequencing could potentially identify many treatable conditions that presently go unrecognized until too late for optimal intervention due to lack of an alternative current NBS test. As a form of screening, sequencing would require weighing of benefits versus costs and societal implications, and might not be limited to the neonatal period. Our study also underscores the value of the California Biobank Program, which has provided unrivaled, diverse population representation for research to inform approaches to advance public health.

Methods

NBSeq samples and study set

From the roughly 4.4 million children born in California July 7, 2005, to December 31, 2013, The NBSeq project obtained two de-identified 3.2 mm punches from dried blood spots (DBS) obtained for newborn screening (NBS) from each of 1,728 newborns. The samples included those from all 1,325 infants screened by tandem mass spectroscopy (MS/MS) who were subsequently confirmed to have an inborn error of metabolism (IEM), as well samples from 9 cases not identified by MS/MS screening, but diagnosed clinically. Also included were 394 MS/MS false positive cases selected from 10,011 individuals, for whom MS/MS screening was positive, but who were ultimately diagnosed as unaffected. All MS/MS false positives from well-baby nurseries for the long-chain fatty acid oxidation disorders (VLCAD, LCHAD, MADD), and three other IEMs (PKU, MSUD, IVA) were requested to investigate the hypothesis that genetic variants could underlie the abnormal analytes leading to false positive MS/MS results. False positives in neonatal intensive care units (NICUs), where concomitant illness (e.g., prematurity, liver abnormalities) and clinical interventions (e.g. total parenteral nutrition) produce frequent alterations of MS/MS analytes, were excluded from our request for DBS.

DNA was prepared from the DBS, and whole exome sequencing (WES) was performed as described.¹⁴ We sequenced 1,416 exomes, which included all the DBS specimens from the CDPH Biobank, with the exception of those from a random subset of 312 non-Hispanic white and Hispanic cases for SCADD, 3MCC, and elevated phenylalanine, which were excluded due to budget limitations. This led to exclusion of specimens from 56 SCADD, 54 3MCC and 201 initial hyperphenylalaninemia by MS/MS (including cases of 61 classic phenylketonuria, 45 variant hyperphenylalaninemia, 38 benign hyperphenylalaninemia and 58 false positive cases). From 1,190 exomes out of the 1,416 that passed quality control metrics and were phenotypically relevant, we developed and evaluated a pipeline to identify individuals for potential review based on variants reported in an IEM gene, as described later.

To obtain an unbiased estimate of the predictive accuracy of exomes, we first divided the 1,190 samples into a validation set (178 samples) in which we were unblinded to final diagnosis so as to explore the robustness of parameter choices in our exome analysis pipeline (data not shown), and a test set (1,012 samples), which was subjected to the final optimized pipeline only once and whose results are reported here. Of the 178 samples in the validation set, 129 were affected with an IEM diagnosed following positive NBS by MS/MS; two were affected, but had not been identified by MS/MS NBS but were diagnosed clinically; and 47 were unaffected MS/MS false positives (Table 1, main paper). Of the 1,012 samples in the test set 674 were from IEM-affected individuals, 667 diagnosed following MS/MS NBS and seven identified clinically with disease; there were also 338 MS/MS false positives.

Variant calling and quality control

Variant calling and annotation

DNA extraction, library construction and exome sequencing were performed as previously described.¹⁴ The 1,416 samples were sequenced in three batches (Batch 1: 188 samples, Batch 2: 411 samples, Batch 3: 817 samples) and the QC metrics were grouped accordingly. Batches 1 and 2 had paired-end read lengths of 101 bp whereas batch 3 had paired-end read length of 151 bp. Raw sequences were mapped to the reference genome (v37), using BWA mem algorithm (v0.7.10).⁴⁰ Resulting SAM files were converted to binary format, sorted and lane merged using Picard tools (v1.81). Duplicates were marked in the alignment files with Picard tools v1.81 (http://broadinstitute.github.io/picard/). Next, realignment around known indels and base quality score recalibration were performed using GATK toolkit (v3.3).⁴¹ Variants were called using the GATK Haplotype Caller function and the variant scores were recalibrated with GATK VQSR function.⁴² Combined calling was used on all samples. Variants were annotated using Varant (http://compbio.berkeley.edu/proj/varant/Home.html), a custom tool, as described,⁴³ with the following public datasets: Gencode (v19),⁴⁴ APPRIS (v24),³⁵ 1000 Genomes (phase 3),²⁰ ESP Project (ESP6500SI-V2-SSA137),⁴⁵ Exome Aggregation Consortium (ExAC v0.3.1), ²¹ Combined Annotation Dependent Depletion (CADD) (v1.3),³⁶ MetaSVM and MetaLR from dbNSFP v3.1a,⁴⁶ and dbscSNVv1.³⁷ Variants previously associated with disease from HGMD (v2014.1),¹⁶ and those with star ratings for known deleterious variants from ClinVar¹⁷ (ftp://ftp.ncbi.nlm.nih.gov/pub/clinvar/tab_delimited/variant_summary.txt.gz, accessed September 2017) were added to the call set. Copy number variant (CNV) calls on all 1,190 NBSeq samples were made using XHMM.³⁸

Quality control metrics

We developed a battery of quality assessment metrics to ensure that exomes from dried blood spot samples were suitable for variant interpretation.⁴⁷ The sequences were assessed for quality of read mapping, assessment of DNA damage and quality of the final called variants.

Mapping related metrics

The percent of unmapped reads was <1.7% in all the samples (median = 0.4%, 1st quartile = 0.3%, 3rd quartile = 0.5%; Extended Data Fig. 1a). The non-duplicate, properly oriented paired reads with mapping quality (MQ) ≥20 were classified as high quality. The percentage of high quality read pairs was quantified (median = 61%, Extended Data Fig. 1b). Batch 3 contained a larger percentage of duplicates than the other batches (Extended Data Fig. 1c), but numbers of high quality read pairs were comparable in all batches (Extended Data Fig. 1d, 1e). All batches had sufficiently large insert sizes for analysis, although Batch 2 had slightly lower insert sizes for the same read length (Extended Data Fig. 1f). The proportion of read pairs with each end mapped to a different chromosome was small (median = 0.004).

Median coverage depth across the capture region was 59x (1st quartile = 52x, 3rd quartile = 68x). Comparison between the three batches showed that for median capture coverage, Batch 1 and Batch 2 were similar, while Batch 3 had the highest median depth of coverage, likely because of the longer reads (Batch 1: median = 53; Batch 2: median = 56; Batch 3: median = 63) (Extended Data Fig. 1g). Batch 2 had the lowest fraction of capture covered for all depths, and Batch 3 the highest (Extended Data Fig. 1h). This could indicate lower uniformity of coverage in Batch 2 compared to the other two batches. The coverage over the 78 genes transcripts associated with IEMs was examined for the same set as metrics as above, and similar trends were observed. (Extended Data Fig. 1i–j).

DNA damage related metrics

The number of mismatches with respect to the reference genome was computed for all of the high quality reads as an estimate of DNA damage caused by nucleotide mis-incorporations. The fraction of reads with 0 mismatches with the reference was slightly lower (median = 0.74) than typically observed in other datasets which tend to be >0.8. A larger number of samples in Batch 3 had fewer perfectly matched reads compared to the others (Batch 1: median = 0.88, Batch 2: median = 0.79, Batch 3: median = 0.70) (Extended Data Fig. 2a). The higher mismatch rate in Batch 3 was due to their longer read length, with an increased rate towards the ends; when only the first 100 bases of each read were analyzed in Batch 3 mismatch data were comparable to that for the other batches (Extended Data Fig. 2b).

The distribution of nucleotide mis-incorporations by base change was investigated by calculating the allele distribution at each position, and from these distributions the frequency of a base change from the reference to every other base. The fraction for a change, say A>C, was computed as the ratio of the number of times an A>C change was seen to the total number of As seen. Positions with variants reported after calling the alignments were excluded, as these are presumed to be real variations. The C>A and G>T distributions were higher in Batch 2 for several of the samples compared to Batches 1 and 3, with C>T and G>A also slightly higher (Extended Data Fig. 2c). However, when the nucleotide change fractions were analyzed using high quality SNV calls (‘PASS’ by the GATK VQSR algorithm and with GQ ≥30) this bias was absent, with all nucleotide change fractions in similar ranges for the 3 batches (Extended Data Fig. 2d).

Variant related metrics

The fraction of sites with GQ ≥30, both reference and variant, was calculated from the total number of sites both for the entire exome capture as well as transcripts from the 78 IEM-associated genes. The fraction of sites with marked ‘PASS’ and with GQ ≥30 was within adequate range for the 3 batches (Extended Data Fig. 3a–b). Next, the high quality variants were classified based on frequency in 1000 Genomes (defined as common, ≥0.1%; rare <0.1%). SNVs and indels were analyzed separately. Common and rare SNVs and indels occurred at similar frequencies in the 3 batches (Extended Data Fig. 3). Transition/Transversion ratios were in the same range for the 3 batches for both common and rare SNVs and comparable to the range of 1000 Genomes samples (Extended Data Fig. 3).

QC criteria for sample exclusion

The samples that passed QC were required to satisfy the following quality criteria:

Median coverage across Nimblegen capture ≥20x
Fraction of high quality sites (GQ ≥30) ≥0.9 across the capture
Estimated contamination by VerifyBamID⁴⁸ (freemix) <3%

200 samples out of the 1,416 failed at least one of the above criteria. Some areas of systematic decreased coverage were noted, as with the relatively poor coverage of exon 1 of the MCCC2 gene, compared to ACADM, well covered throughout (Extended Data Fig. 4). Of 1,216 exomes with that passed QC, 26 with an IEM not within the core panel of 48 disorders were further excluded. The remaining 1,190 samples were analyzed in our study.

Exome analysis pipeline

Variant interpretation was guided by the principle that in current NBS, specificity is subordinate to maximal sensitivity because missed cases can have catastrophic consequences, while false positives can be resolved as unaffected following referral. Thus, like some NBS programs reporting DNA sequence information, our pipeline deliberately reported variants of uncertain significance (VUS, American College of Medical Genetics and Genomics guidelines).⁴⁹

Pipeline development

Our goal was to evaluate exome sequencing as a NBS tool. In contrast to sequencing for diagnosis of individual patients already exhibiting clinical abnormal phenotypes, sequencing in a screening setting presents different challenges and requirements, some of which include:

NBS has no a priori phenotypic information and is applied on a population-level to individuals, who are nearly all asymptomatic at the time of the screen.
While it is accepted that diagnostic exomes will resolve only 25–60% of undiagnosed cases (depending on the types of disorders), much higher sensitivity is required for NBS. For example, MS/MS NBS for IEMs is typically >99% sensitive.
Specificity also has to be high in NBS to avoid excessive cost, referrals to specialists and family anxiety.
In diagnostic settings, DNA sequence analysis may take into account family history, sequence of family members in addition to the patient, and a customized, in-depth curation of variants; but NBS must be performed on single individuals in a high-volume, largely automated mode with limited manual review.

Given these differences, it was not obvious whether a diagnostic exome pipeline was appropriate for screening or how it should be modified. We therefore developed exome analysis pipelines taking NBS requirements in consideration (Extended Data Fig. 5). It was important to understand how altering some commonly used parameters in pipelines would influence the sensitivities and specificities of disease predictions from exome data. However, we lacked sufficient data to train a sophisticated model to learn these parameters automatically. Therefore, with the goal of identifying robust parameter choices, we devised a framework to iteratively perturb and tune the pipeline parameters.

Systematic exploration of parameter perturbations formed the basis of the final exome analysis pipeline. Briefly, using the 178 sample NBSeq validation set and starting from two initial reference pipelines that used parameter combinations favoring either high sensitivity or high specificity, we systematically explored the consequences of perturbing the following parameters on both sensitivity and specificity: size of IEM gene list, variant callers, choices of transcript models, choice of population databases, minor allele frequency (MAF) thresholds in population databases, databases of predicted and curated variants, and choice of inheritance models. We studied the impact of each parameter on overall performance by altering a single or a few parameters at a time. As an example, prediction results were surprisingly sensitive to the choice of population database to determine MAF. We studied the impact of MAF thresholds of 0.1%, 0.2%, 0.5%, 1%, 2%, or 5% in three different databases: 1000 Genomes, ExAC, and the NHLBI Exome Sequencing Project (ESP). MAF thresholds using 1000 Genomes or ExAC yielded better sensitivities than in the ESP database alone, possibly due to technical artifacts from differing sequencing technologies and study designs used in the different databases. Informed by these observations, our final pipeline implemented population MAF thresholds that considered both the ExAC and 1000 Genomes databases. The impact of individual parameter choices on the 178 sample NBSeq validation set, evaluated in an iterative fashion, guided the design and parameter choices for the final pipeline (Fig. 2a, main text) that was applied to the 1,012 sample (674 IEM-affect, 338 MS/MS false positives) NBSeq test set.

Architecture of the final exome analysis pipeline

Once finalized based on the NBSeq validation set, the automated exome analysis pipeline (Fig. 2a, main paper; code available at https://github.com/nbseq1200/NBSeq1200paper) was run once on each sample in the NBSeq test set while blinded to any phenotypic data, reflecting a typical application setting for a primary newborn screen. The analysis was restricted to the variants in the exon slice of the 78 known IEM-associated genes (Supplementary Table 1). The pipeline considered only variants with genotype quality (GQ)>15, and the impact of the variants was annotated with respect to the APPRIS principal transcripts. The pipeline reported variants through one of three primary arms:

Predicted impact arm: All variants with MAF <0.5% (in both ExAC and 1000 Genomes) that satisfied any of the following criteria:
- annotated as protein altering by Varant (StopGain, StopLoss, FrameShiftInsert, FrameShiftDelete, SpliceDonor, SpliceAcceptor, NonSynonymous Missense, InFrameDelete, InFrameInsert, StartGain, StartLoss)
- CADD³⁶ score >23
- computational splicing-effect meta-prediction tool (dbscSNV)³⁷ score >0.5
Curation arm: Variants with population autosomal MAF <0.1% (in both ExAC and 1000 Genomes databases) annotated as “DM” or “DM?” in HGMD or as “pathogenic/likely pathogenic” by ClinVar with at least 1 review star. Our NBSeq team manually curated 60 variants with MAF ≥0.1% from these sources, finding 19 to be reportable as potentially pathogenic and excluding 41 (Supplementary Table 2). The NBSeq variant curation is described in detail below.
Copy number variant arm: XHMM,³⁸ a tool to call CNVs from exomes, was run with default parameters on the full sample BAM files. XHMM calls from three genes (ETFA, HCFC1, PRODH) were excluded since XHMM called CNVs with a frequency higher than 1% in these genes in 600 exomes extracted from the 1000 Genomes Project. Since XHMM does not output zygosity, we inspected the BAM files in the predicted deletion regions to make zygosity calls, the only manual step in the pipeline.

The variants reported through any of the above arms were considered further by the pipeline. For variants in X chromosome genes, the MAF threshold was adjusted (see below). To avoid spurious multiple, close-by variant calls when encountering a single deletion or insertion event, the pipeline reported only a single variant if multiple heterozygous variants appeared within 15 bases of each other. For heterozygous variants within 500 bases of each other, a local phasing procedure queried the reads overlapping both variant positions. If both variants appeared in the same read, only one was reported. Finally, any variants that were annotated as benign in ClinVar with at least two review stars were excluded.

From the remaining variants the pipeline identified the corresponding genes with ≥1 homozygous or ≥2 heterozygous variants, as the majority of these disorders are autosomal recessive. For the X-linked OTC gene, the pipeline reported ≥1 variant in either hemizygous males or heterozygous females, since heterozygous females can display a clinical phenotype.^30,50 Another exception was MAT1A, for which the pipeline reported a particular heterozygous variant NP_000420.1:p.Arg264His known to cause autosomal dominant disease.³⁹ Finally, for each sample, the pipeline reported at most a single prominent genotype based on the highest score (see below) combining disease prevalence and variant severity.

Minor allele frequency threshold adjustment for X-linked genes

In the exome analysis pipeline the MAF for genes encoded on the X chromosome was adjusted as follows. Given the MAF threshold chosen in the pipeline for the autosomal chromosomes, f_A, the X chromosome MAF is adjusted to f_X, as given by the following relationship:

\frac{1}{2} {f_{X}}^{2} + \frac{1}{2} f_{X} = {f_{A}}^{2}

When f_X≪1, we can approximate

f_{X} ≅ {2 f}_{A}^{2}

NBSeq variant curation

The exome analysis pipeline excluded non-rare variants unless existing evidence suggested possible pathogenicity. In 31 IEM-associated genes (ACAD8, ACADM, ACADS ACADSB, ACADVL, ASL, ASS1, BCKDHA, CBS, CD320, CPT1A, CPT2, DBT, FAH, GCH1, HADHA, HPD, MCCC2, MCEE, MLYCD, MMAB, MTRR, MUT, OTC, PAH, PCCA, PCCB, PRODH, SLC22A5, SLC25A13, TAZ), 60 variants with MAF >0.1% in 1000 Genomes and ExAC were characterized as DM or DM? in HGMD and/or pathogenic or likely pathogenic in ClinVar. All evidence supporting the classifications was reviewed by the NBSeq team, which determined that missense variants should be considered potentially pathogenic, consistent with the requirements of a screening test, including that specificity was generally subordinate to sensitivity. Variants were therefore considered reportable if there was any evidence that they could be disease causing, even if at low penetrance. After NBSeq curation, 19 of the 60 variants reviewed were deemed “reportable” and 41 “not-reportable” (Supplementary Table 2). The list of reportable variants were incorporated into the primary exome analysis pipeline, as the NBSeq inclusion list.

As an example, the NM_000098.3:p.Phe352Cys variant in CPT2 is polymorphic, with MAF 4.7% in 1000 Genomes and 2.2% in ExAC, but was indicated “DM?” in HGMD, while ClinVar status in the most recent version was likely benign, but with no stars. Literature review supported the assertions that this was a polymorphism, but not a totally benign one; published data suggested it was a risk factor for acute encephalopathy in the setting of serious viral illness in infancy.⁵¹ In contrast, NM_000137.2:p.Arg341Trp in FAH was on the border of being polymorphic, with MAF 0.8% in 1000 Genomes and 1.7% in ExAC. This variant was classified as “DM” in HGMD on the basis of literature⁵² that, when carefully reviewed, did not support the assertion, while functional expression in yeast and cultured cells⁵³ showed the protein carrying the missense variant had activity equal to that of wild-type protein. This variant, therefore, was designated “not reportable.”

Selection of the most prominent genotype in a sample

A scoring scheme was designed to select at most one gene as the predicted cause of disease for each sample. For each reported variant allele a in a particular gene g for the sample, a variant score S(a,g) was obtained as following:

S (a, g) = {\begin{array}{l} \frac{1}{3} * (M (a) + I (a) + D (a)) + P (g), & a i s n o t a C N V \\ 1 + P (g), & a i s a C N V \end{array}

where P(g) = Unit normalized prevalence of the disorder corresponding to gene g (Supplementary Table 1)

M (a) = {\begin{array}{l} 1, & M A F (a) < 0.05 % \\ 05, & 0.05 % \leq M A F (a) < 0.1 % \\ 0 & o t h e r w i s e \end{array}

D (a) = {\begin{array}{l} 1, & C u r a t i o n (a) \in {H G M D (D M), C l i n v a r 4 (⟣), C l i n v a r 5 (⟣)} \\ 05, & C u r a t i o n (a) \in {H G M D (D M ?)} \\ 0, & o t h e r w i s e \end{array}

I (a) = {\begin{array}{l} 1, & C o n s e q u e n c e (a) \in S e v e r e S e t \\ 05, & C o n s e q u e n c e (a) \in M i l d S e t \\ 0, & o t h e r w i s e \end{array}

SevereSet = {StopGain, StopLoss, FrameShiftInsert, FrameShiftDelete, SpliceDonor, SpliceAcceptor}

MildSet = {Missense, InFrameDelete, InFrameInsert, StartGain, StartLoss}.

MAF(a) was the maximum MAF of the allele a in 1000 Genomes and ExAC databases. Curation(a) determined the presence/absence of the allele a in HGMD and ClinVar databases. Consequence(a) determined the impact of the variant allele. The P(g) values for the 78 genes were derived from the birth prevalence of the corresponding disorders in California.¹⁹

For each gene that had either ≥1 homozygous or ≥2 heterozygous autosomal variants reported by the pipeline in a sample, the sum of the two highest scoring alleles in that gene was computed. Homozygous variants contributed two alleles, and only one variant was required for X-linked genes. The gene with the highest sum for the sample was reported as the predicted disease gene for that sample.

Assessment of predictions from the exome analysis pipeline

The exome analysis pipeline reported 507 variants in genes associated with corresponding disorders in IEM-affected cases in the NBSeq test set (Supplementary Table 5). The majority of these variants were nonsynonymous (Extended Data Fig.6a). To ensure an objective and unbiased interpretation of the results, the exome analysis pipeline developers (A.N.A., Y.W., S.E.B.) were blinded to the diagnoses of the individual cases in the NBSeq test set. The exome analysis pipeline was run on the NBSeq test set exomes and the resulting predictions from the exome analysis pipeline were submitted to the NBSeq clinical team (R.J.C., R.C.G., H.T., M.K.), who then assessed those predictions independent of the pipeline development team.

The data used for assessment are shown in Supplementary Table 3, listing for each case the reviewed diagnosis key (deemed the gold standard) and the results reported by the primary exome analysis pipeline, as well as alternative pipelines. Numerical aspects of assessment were computed using software available at https://github.com/nbseq1200/NBSeq1200paper.

Incorporation of clinical evidence to generate a “reviewed” diagnosis key

As part of the follow-up of positive NBS results, the GDSP database has recorded summaries of diagnostic testing and annual patient summaries for infants with IEMs. To assure that the exome predictions would be measured against the best current clinical disease assessments, for those cases for which the exome pipeline was discordant with the GDSP diagnosis, the clinical team generated a “reviewed diagnosis” key for all NBSeq study cases based on the strength of evidence for the clinical diagnoses recorded in the GDSP database and recent literature. This reviewed diagnosis key was used to compare case diagnoses with exome predictions.

There were three main types of discrepancies between exome predictions and clinical reports: The first group had a mismatch between the predicted gene by the pipeline and the gene implicated in the clinical diagnosis of an affected case (n=22). The second group had gene predictions in MS/MS false-positive cases that had been initially reported as unaffected (n=45). The third group had a failure to predict a gene in a initially diagnosed case (n=188). Members of NBSeq with access to the GDSP database (R.J.C, H.T.) reviewed recorded clinical notes with a metabolic geneticist (R.C.G.) to assess explanations for the discrepancies.

Twelve of 22 cases in the first group (different exome identification vs. clinical record) were resolved with establishment of a “reviewed diagnosis.” In six, there may have been an error in selecting the disorder from the drop-down list for diagnosis in the online case report. In the remaining six, the exome identified the gene for an alternative in the differential diagnosis of the screening result, and the follow-up data were most consistent with that alternative. For evaluation of exome predictions, these 12 cases were reclassified as correctly identified by exome.

In the second group (exome-positive, IEM-negative), 26 of the 45 cases had been resolved as false positive for PKU, but had two rare variants in the PAH gene. After levels of amino acids from follow-up testing were reviewed, 10 of the 26 were reclassified as hyperphenylalaninemia. Almost all of the remaining cases in the exome-positive group were either clinically identified as VLCAD(het), or MS/MS false-positive for VLCAD, with one remaining unresolved due to death of the infant. In most of these cases, the exome revealed two variants in the ACADVL gene; in three, the exome identified variants in genes for MADD.

In a third group (exome-negative, IEM-positive) 78 cases were reclassified as unaffected. This included 40 cases clinically identified as SCAD. The NBSeq curation team had reviewed and excluded from the exome pipeline the common variant in the ACADS gene (rs57443665, Supplementary Table 2), but cases with this variant often had been given an initial clinical diagnosis of SCAD.^54–56 These were re-classified as not affected. There were 23 cases of carnitine uptake defect (CUD) and 15 cases of VLCAD in this third group. Long-term follow-up data suggested that the majority of them were unaffected, and the classification was updated to reflect this. The remainder of this group was comprised of small numbers of each of the other metabolic disorders included in the study. For the majority of them, the follow-up data provided ample evidence of the correctness of the diagnosis.

As a final step in the clinical review, the cases for which the documentation was sufficient to be confident of the clinical diagnosis were designated as a “clinically confident” subset. All cases of SCAD, whether predicted by the exome or not, were excluded from the clinically confident subset due to the published data that the most commonly encountered variants may lead to biochemical abnormality on MS/MS screening, but are not associated with clinical disease.⁵⁶

Assessment of exome sequencing as a primary screen

Overall sensitivity and specificity

For evaluation of exomes as a screen for all NBS IEMs combined, we matched all test set predictions from the NBSeq exome analysis pipeline to their associated clinical diagnoses in the reviewed diagnosis key, and calculated two main metrics: the overall sensitivity and overall specificity.

IEM-affected cases for which the gene predicted by the exome analysis pipeline matched the IEM in the reviewed diagnosis key were considered correct. IEM-affected cases where no gene was predicted by the pipeline were considered incorrect, as were cases in which the pipeline predicted a gene inconsistent with the clinical diagnosis in the reviewed diagnosis key.

For each IEM, sensitivity was calculated as the number of cases in the test set correctly identified by the exome analysis pipeline divided by the total number of IEM-affected individuals in the test set. Since some individuals with particular IEMs had been excluded from sequencing, the IEM-specific sensitivities were weighted by their prevalence in California¹⁹ to obtain the overall sensitivity of the final exome analysis pipeline across all IEMs (values in Supplementary Table 1, those <0.04 per 10,000 were treated as 0.04). Among the 674 affected individuals in the test set, 571 were correctly identified by the exome pipeline (Extended Data Fig. 6b). The overall sensitivity of the final exome analysis pipeline when weighted by the disorder prevalence was 88%.

To calculate overall specificity, we considered situations where the pipeline predicted genes that did not match the IEM diagnosis according to the reviewed diagnosis key. For each IEM d, we calculated the disorder-level specificity as one minus the proportion of individuals not affected with d where the exome analysis pipeline reported a gene associated with d. (Of note none of the false positive cases were reported for a disorder on the same CDPH screen as the actual disease.) Using this strategy, overall specificity was 98.4%.

CDPH determines the sensitivity of MS/MS screening by evaluating whether a case that screens positive is reported as diagnosed with any IEM in their screening panel (without regard to whether the diagnosed IEM is in the primary differential diagnosis of the screen). Measuring sensitivity using this approach for comparison with MS/MS performance, the primary exome analysis pipeline has a sensitivity of 90%.

Performance on data subsets

Since the NBSeq study set included a large group of heterogeneous IEMs, we performed further ancillary assessments on various relevant slices of the data. The sensitivity and specificity of exome predictions across the different IEMs were heterogeneous (Fig. 2c, main paper, and Supplementary Table 4). The distribution of zygosity of the reportable variants varied across the different IEMs as well (Extended Data Fig. 10).

Some disorders were too rare to have sufficient samples to make a statistically robust estimation of sensitivity, reflected in their large 95% Clopper Pearson CI ranges (Supplementary Table 4). Two-sided Clopper Pearson CIs were calculated using the “exactci” function from R package PropCIs (https://github.com/shearer/PropCIs). When grouping by conditions in the national Recommended Uniform Screening Panel, RUSP, conditions (core conditions) vs. secondary conditions, the overall sensitivities were comparable (89% for core vs 88% for secondary conditions). We further classified the IEMs into 4 tranches based on their population prevalence in California.¹⁹ The weighted sensitivity of the most common IEMs (>2.5 per 10,000 cases) was 91% while that of the rarest (<0.04 per 10,000 case) was 78%, indicating that the more common IEMs could be identified better from sequence than rarer ones. Finally, in the ‘clinically confident’ subset that included only cases with sufficient clinical follow-up data for a highly confident clinical diagnosis, the overall sensitivity of the exome analysis predictions improved to 93.7%, but was still insufficient for use as a general primary screen across all IEMs.

Alternative specificity calculation

We also estimated the overall specificity of exome predictions using two additional, orthogonal approaches. The first approach used all 1,216 sequenced cases that passed QC filters and compared exome predictions to MS/MS results to estimate specificity. Briefly, across all 1216 cases, from their raw MS/MS data, we used the CLIR analysis tools¹ to infer likelihood of each IEM. For each MS/MS screen, the CLIR tool produced a 4-point score for likelihood of a particular IEM: 1, Negative; 2, Possible condition; 3, Probable condition; and 4, Very probable condition. We therefore estimated exome false positive calls by counting the number of cases where the primary exome analysis pipeline predicted a gene, but the associated CLIR likelihood score was “1” (Supplementary Table 7). The overall specificity of exome pipeline using this strategy was 97.45%.

A second approach involved running the exome analysis pipeline on 2,504 individuals from the 1000 genomes project Phase 3.²⁰ These were not directly comparable to our NBSeq data, primarily due to differences in sequencing technology (genomes vs. exomes) and underlying cohort ancestry distribution. Nonetheless, the 1000 Genomes dataset represent a large cohort of individuals for whom the likelihood of newborn IEMs should be low. The whole genome genotypes were obtained from: ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/ (accessed Jan 2018) and annotated using Varant.⁴³ The specificity of the primary pipeline on the 1000 Genomes cases was 95%, slightly lower than that compared to the value calculated based on NBSeq cases. Estimates of overall specificity of exome predictions from these two approaches supported the conclusion that our primary exome pipeline had insufficient specificity to be used alone as a primary screen.

Alternative pipelines and findings

To investigate the contributions of different components of our final exome analysis pipeline (Fig. 2a, main paper), we evaluated the overall sensitivity and overall specificity of various alternate pipelines that either truncated or modified the pipeline in different ways (Extended Data Fig. 5b–i). Truncating the CNV arm altogether removed only three true positive cases, and the overall sensitivity was 88% (Extended Data Fig. 5c). Truncation of the curation arm reduced the sensitivity to 87% (Extended Data Fig. 5d). More drastically, the truncation of the prediction arm, which considered all rare protein altering variants, resulted in an overall sensitivity of 59% (Extended Data Fig. 5e). An alternative pipeline that used the predicted impact arm alone had a sensitivity of 86% (Extended Data Fig. 5f). A stringent pipeline that only reported only rare ClinGen or HGMD high-confidence curated variants in the IEM genes had a far lower sensitivity of 55%, but had an extremely high specificity of 99.4%, the latter comparable to MS/MS (Extended Data Fig. 5h). Finally, even though the primary exome analysis pipeline allowed only a single gene call per sample, we evaluated an alternate pipeline that removed this restriction, allowing for multiple gene calls per sample if more than one gene contained two reportable variants. This pipeline had an overall sensitivity of 88.6% and reduced specificity of 94.7% (Extended Data Fig. 5i).

Assessment of exome sequencing as a follow-up test

In addition, using the final pipeline results, we asked whether the exome slice could be used as a follow up test to MS/MS particularly to reduce false positives. That is, we assessed the ability of exome data to identify IEM negative cases among MS/MS positive cases (i.e. MS/MS false positives) without eliminating IEM positives. Hence, we determined the Specificity (proportion of exome slice negatives among IEM negatives), which indicated how many true negatives would be removed from further consideration by eliminating cases with no gene variants on exome analysis after a positive MS/MS test. We also determined the Negative Predictive Value (NPV), the proportion of exome negatives among the MS/MS positives that were IEM negative (unaffected). The NPV must be very close to 1 to assure that true IEM cases not be excluded, while the Specificity should be high but not necessarily close to 1, as it simply indicates the effectiveness of the exome as a secondary screen. We defined “exome negative” as cases where no reportable variants were reported in genes for the disorder(s) corresponding to the MS/MS positive result.

For six MS/MS screened disorders for which we had originally requested all the MS/MS true positives and MS/MS false positives from the study period (VLCAD, PKU, LCHAD/TFP, IVA, MSUD, and GA-II), we first quantified the number of alleles characterized by our pipeline as pathogenic for both IEM-affected and MS/MS false positive individuals (Extended Data Fig. 9). Using the above exome negative definition, we then calculated the specificity and NPV for each of the six MS/MS screens (Table 2, main paper).

Follow-up exploratory studies of selected exome false negative cases

In half of the cases for which exomes failed to predict the corresponding IEMs (exome false negative), the screening pipeline reported no variants, nor were any found upon manual review. An example was an individual affected with GA-I, for whom all variants in the associated GCDH gene were either common or in a deep intronic region. Exploration of the genetic data in a diagnostic or research context with experimental functional studies could be used to systematically explore why exome analysis failed to predict IEMs in 12% of the IEM-affected cases. In a selected few such exome false negative cases, we performed follow-up studies to investigate potential causes for exome false negatives.

Whole genome pilot study of eight exome false negatives

Whole genome sequencing (WGS) was performed of eight cases from the NBSeq validation set where the pipeline had failed to predict the associated genes. The eight WGS samples were sequenced and jointly variant-called using the same protocol as described above for the NBSeq exomes. Since WGS typically provides more uniform coverage compared to exomes, we reanalyzed the exome slice of these eight cases from their WGS data using the same primary analysis pipeline described above. The predictions from the exome analysis pipeline remained unchanged for these eight cases.

Using the IGV browser⁵⁷, we further manually inspected the read alignment, mapping quality and overall coverage in the associated genes in sample BAM files in every exome false negative case in the validation set. No anomalies in coverages were identified except in two of the eight cases, where we noted poor coverage of an IEM gene, IVD. This led us to suspect large deletions, confirmed upon investigation of the genomic region around IVD in the WGS alignment files in these two cases (Extended Data Fig. 7). The first case had almost no coverage in the region spanning the first three exons of IVD. The second case had almost no coverage of exon 12 of IVD along with low coverage across the whole gene. Both cases strongly indicated presence of large deletions in the gene. To support this further, we searched for split reads (reported by the BWA mem algorithm) from the sample BAM files that also overlapped the IVD gene region. The first case had 11 such split reads spanning the deleted region confirming the deletion event. We did not find such split spanning reads in the second case.

Functional assay of an intronic variant in an exome false negative

IEM-affected cases where the pipeline reported only a heterozygous variant could suggest a second variant in the relevant gene that the pipeline may have failed to interpret correctly. In an illustrative case of an MCADD-affected individual, the final analysis pipeline reported a heterozygous pathogenic variant in the ACADM gene (Y67H). A second intronic variant 14 bases from the splice site (NM_000016.4:c.388–14A>G) was observed but not reported by the pipeline (Extended Data Fig. 8a). (Note that this position −14 is just outside the range considered by the pipeline’s splice impact predictor) Given the proximity of the variant to the splice site, it was hypothesized that it could impact splicing. One possibility considered was that it could be a branchpoint A mutation; however the branchpoint prediction tool SVM-BPfinder,⁵⁸ which did predict six other A’s in the 50 bp region 5’ to the exon, did not identify this site. Given the confidence of the clinical diagnosis and the proximity of the second variant to the splice site, we performed an experiment to determine the variant’s impact on splicing of the ACADM gene.

To determine if NM_000016.4:c388–14A>G influenced splicing of the ACADM pre-mRNA we generated a heterologous splicing reporter using the human HBB gene as a model as described.⁵⁹ Sequence corresponding to ACADM exon 6 and the flanking introns were cloned into HBB intron 1. A mutant construct containing the A>G mutation at position −14 relative to the 3’ss of intron 5 was also generated (Extended Data Fig. 8b). The following primer sequences were used:

ACADM Exon 6 Forward: 5’ ccaatagaaactgggcatatgattagtgcatttcactatagtaga 3’

ACADM Exon 6 Reverse: 5’ tgtctccacatgcccagatctattatgatactttcttggca 3’

ACADM Mutant Reverse: 5’ cgaagaaaactgacattaaa 3’

ACADM Mutant Forward: 5’ tttaatgtcagttttcttcg 3’

HBB Exon 1 Forward: 5’ gcaacctcaaacagacacca 3’

HBB Exon 2 Reverse: 5’ agcttgtcacagtgcagctc 3’

Plasmid DNA containing the wild type or mutant reporter was transfected into HEK293 cells, and spliced reporter transcripts were analyzed by RT-PCR (Extended Data Fig. 8c). As expected, the wild type reporter construct exhibited constitutive inclusion of ACADM exon 6 in the HBB transcript. By contrast, the −14 A>G mutation induced a novel isoform with reduced electrophoretic mobility, suggesting activation of a cryptic 3’ splice site, confirmed upon sequencing the amplicons derived from the wild type or mutant reporter transcripts (Extended Data Fig. 8d). Assuming that the −14 A>G mutation also activated a cryptic 3’ splice site in the endogenous ACADM pre-mRNA, the 13 nucleotide extension of exon 6 in the ACADM mRNA would induce a premature termination codon and result in nonsense mediated decay of the aberrant message (Extended Data Fig. 8e). Taken together, our data indicate that the −14 A>G mutation is sufficient to induce aberrant splicing of ACADM intron 5 and result in a defective mRNA isoform.

Data availability

The de-identified residual DBS from the California Biobank for this project (SIS request number 496) were obtained with a waiver of consent from the Committee for the Protection of Human Subjects of the State of California, under project number 14–07-1650, and in compliance with CDPH Biospecimen/Data Use and Confidentiality Agreement. California blood specimens and any data derived from the newborn screening program are confidential and subject to strict administrative, physical and technical protections. California law precludes any researcher from sharing blood specimens or uploading individual data derived from these blood specimens into any genomic data repository. Researchers desiring access to these data would need to make a separate application to the California Department of Public Health. Data in figures 2b and 2c and Extended Data Figs 6,9,10 can be found in Supplemental Table 3.

Code availability

Variant calling and annotation for the exome sequences were performed using previously published methods as described above. The code used for the screening analysis of exome data and subsequent assessments are deposited in GitHub (https://github.com/nbseq1200/NBSeq1200paper)

Extended Data

Extended Data Fig. 3 — a, Confident sites across capture (from the GVCF file) b, Confident sites across 78 genes (from the GVCF file) c, Common high quality SNVs d, Rare high quality SNVs e, Common high quality indels f, Rare high quality indels g, Transition/Transversion ratios for high quality common SNVs h, Transition/Transition ratios for high quality rare SNVs. High quality variants are those marked as PASS by GATK VQSR and have GQ ≥ 30. Common variants have a frequency greater than 0.001 in 1000 Genomes Project phase 3 database and rare variants have a frequency less than 0.001 in the database. Individual sample values are plotted and adjacent box plots display the median (red) and interquartile ranges for the dataset, whiskers extend to the last data point within 1.5 times the interquartile range. Violin plots superimposed on the box plots show the data density and mean value (blue). The sample sizes for the boxplots were batch1 (n = 180), batch2 (n = 292), batch3 (n = 744).

Extended Data Fig. 4 — *MCCC2,* top, has poor coverage in the first exon across all samples. In contrast, *ACADM,* bottom, has good coverage across the gene. The blue vertical lines indicate positions with known pathogenic variants in HGMD and ClinVar. Plot of log₁₀ of the median, 20^th percentile and minimum coverage for each coding exon across all samples for a given sample set. Dark grey: Median coverage, medium grey: 20^th percentile coverage, light grey: minimum coverage for each position. Coverage quality of each exon is indicated by colored blocks beneath the exon. Coverage quality of each exon is indicated by colored blocks beneath the coverage plot. Red: Greater than 15% of exon has less than 10x median coverage; green: 95% of the exon has minimum 20x coverage. UTRs that are part of the coding exons have a smaller indicator thickness. Regions of the exon that overlap with the capture array are indicated in blue just below the coverage plot. Exon scale in bases is shown in each plot.

Extended Data Fig. 5 — We created several alternate pipelines, altering or truncating different parts of the final exome analysis pipeline to probe contributions to overall sensitivity and specificity from various components of the pipeline. For each pipeline, the overall sensitivity and specificity on the NBSeq test set are shown. a, Final exome analysis pipeline **b-i)** Alternatives: b) Altering final pipeline by considering every CNV call homozygous **c-e**) Truncating the CNV arm, curation arm and predicted impact arm, respectively. **f-g**, Retaining the predicted impact arm or curation arm only, respectively h) Retaining only the rare pathogenic HGMD & ClinVar databases i) Allowing multiple gene calls for each sample if more than one gene predicted.

Extended Data Fig. 6 — a, Number of different variant types reported by the pipeline in IEM-affected individuals in genes associated with their IEMs the NBSeq test set (n = 674 individuals). b, Distribution of the types of variants responsible for the predictions of disease status in the 571 affected individuals correctly identified by the exome analysis pipeline.

Extended Data Fig. 7 — In two cases where we performed WGS upon follow up of an exome false negative, we identified large deletions in the associated *IVD* gene. The WGS read alignments in the genomic region spanning the *IVD* is shown on the right for the two cases. The first case had almost no coverage in the region spanning the first three exons of *IVD.* The second case had almost no coverage of exon 12 of *IVD* along with low coverage across the whole gene. The first case had 11 split reads spanning the deleted region confirming the deletion event of the first three exons.

Extended Data Fig. 8 — a, In an individual affected with MCADD, the exome analysis pipeline reported only a single rare nonsynonymous variant. A second rare intronic variant 14 bases from the splice site (NM_000016.4:c.388–14A>G) was a suspected pathogenic modification of the branchpoint A nucleotide. b, Diagram of the heterologous HBB splicing reporter construct containing the wild type ACADM sequence or the c.388–14A>G variant. c, RT-PCR analysis of reporter transcripts from wild type or mutant (lanes 1 and 2, respectively) reporter plasmids expressed in HEK293T cells (amplicons resolved by 12% PAGE and stained with SYBR Gold). The two spliced products are shown to the right of the gel image. The experiments were performed three times independently with similar results. d, Chromatograms corresponding to the sequence spliced junctions between HBB exon 1 and the wild type or mutant ACADM exon 6 constructs (left and right panel, respectively). e, Open reading frame of aberrant ACADM mRNA containing a 13 nt extension of exon 6 (red), resulting in a premature termination codon (PTC, *). Top, DNA sense strand; middle, predicted polypeptide; bottom, DNA reverse complement.

Extended Data Fig. 9 — For six MS/MS screens (VLCADD, PKU, LCHADD/TFP, IVA, MSUD, and GA-II), IEM-affected and MS/MS false positive cases in the NBSeq test set are stratified by the number of alleles reported by the exome analysis pipeline in the genes associated with those screens.

Extended Data Fig. 10 — For each IEM, bars show the zygosity distribution of the variants in relevant genes reported by the exome pipeline for the 674 IEM-affected cases from the test set. The numbers of cases correctly identified by the pipeline are broken down into those that had homozygous variants in relevant gene(s) (dark blue) and those that had two heterozygous variants in relevant genes(s) (orange). The number of cases that failed to be identified by the pipeline are broken down into those that had one heterozygous variant in relevant gene(s) (light blue) and those that had no reported variants in the relevant gene(s) (dark red). Left, core IEMs screened by California; right, secondary/add-on IEMs. IEMs sharing a common causative gene were not distinguished by the exome predictions alone. These included TFP and LCHADD (blue shading), PKU and hyperphenylalaninemia (pink shading), and the various MMA subtypes (yellow shading).

Supplementary Material

SUPPLEMENTAL FIGURES

NIHMS1766172-supplement-SUPPLEMENTAL_FIGURES.pdf^{(58.8KB, pdf)}

SUPPLEMENTAL TABLES

NIHMS1766172-supplement-SUPPLEMENTAL_TABLES.xlsx^{(163.2KB, xlsx)}

Acknowledgements

The authors are grateful for expert technical and computational assistance from many diligent contributors, including Wendy Chan, John-Marc Chandonia, Ajithavalli Chellappan, Navya Dabbiru, Brad Dispensa, Andrew Neumann, Alan Nguyen, Aditya Rao, Sadhna Rana and Zhi-Ying Wu. The work was funded by the National Institute of Health grant U19HD077627 as part of the NSIGHT Project, a joint program between the National Human Genome Research Institute and the Eunice Kennedy Shriver National Institute of Child Health and Human Development, NIH. This work was also supported by a research agreement with Tata Consultancy Services. The biospecimens and/or data used in this study were obtained from the California Biobank Program, (SIS request number 496). The California Department of Public Health is not responsible for the results or conclusions drawn by the authors of this publication.

Competing interests

Aashish Adhikari is currently an employee of Illumina, Inc. Kunal Kundu was an employee of Tata Consultancy Services (TCS); Uma Sunderam and Rajgopal Srinivasan are employees of TCS. Yangyun Zou is currently an employee of Yikon Genomics Co., Ltd. Robert Nussbaum is an employee of Invitae. Jennifer Puck is the spouse of Robert Nussbaum, an employee of Invitae. Steven E. Brenner receives support at the University of California Berkeley from a research agreement from TCS.

References

1.Hall PL, et al. Postanalytical tools improve performance of newborn screening by tandem mass spectrometry. Genet Med 16, 889–895 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Mak CM, Lee HC, Chan AY & Lam CW Inborn errors of metabolism and expanded newborn screening: review and update. Crit Rev Clin Lab Sci 50, 142–162 (2013). [DOI] [PubMed] [Google Scholar]
3.McHugh D, et al. Clinical validation of cutoff target ranges in newborn screening of metabolic disorders by tandem mass spectrometry: a worldwide collaborative project. Genet Med 13, 230–254 (2011). [DOI] [PubMed] [Google Scholar]
4.Wilcken B, Wiley V, Hammond J & Carpenter K Screening newborns for inborn errors of metabolism by tandem mass spectrometry. N Engl J Med 348, 2304–2312 (2003). [DOI] [PubMed] [Google Scholar]
5.Tang H, et al. Damaged goods?: an empirical cohort study of blood specimens collected 12 to 23 hours after birth in newborn screening in California. Genet Med 18, 259–264 (2016). [DOI] [PubMed] [Google Scholar]
6.Adams DR & Eng CM Next-Generation Sequencing to Diagnose Suspected Genetic Disorders. N Engl J Med 379, 1353–1362 (2018). [DOI] [PubMed] [Google Scholar]
7.Biesecker LG & Green RC Diagnostic clinical genome and exome sequencing. N Engl J Med 371, 1170 (2014). [DOI] [PubMed] [Google Scholar]
8.Farnaes L, et al. Rapid whole-genome sequencing decreases infant morbidity and cost of hospitalization. NPJ Genom Med 3, 10 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
9.French CE, et al. Whole genome sequencing reveals that genetic conditions are frequent in intensively ill children. Intensive Care Medicine (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Friedman JM, et al. Genome-wide sequencing in acutely ill infants: genomic medicine’s critical application? Genet Med (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Berg JS, et al. Newborn Sequencing in Genomic Medicine and Public Health. Pediatrics 139(2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Regaldo A. Baby Genome Sequencing for Sale in China. in Technology Review (15 June 2017). [Google Scholar]
13.Hoffmann GF Inherited Metabolic Diseases in the Context of Rare/Orphan Diseases. in Inherited Metabolic Diseases: A Clinical Approach (eds. Hoffmann GF, Zschocke J & Nyhan WL) 31–32 (Springer Berlin Heidelberg, Berlin, Heidelberg, 2017). [Google Scholar]
14.Bassaganyas L, et al. Whole exome and whole genome sequencing with dried blood spot DNA without whole genome amplification. Hum Mutat 39, 167–171 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Biesecker LG Secondary findings in exome slices, virtual panels, and anticipatory sequencing. Genet Med 21, 41–43 (2019). [DOI] [PubMed] [Google Scholar]
16.Stenson PD, et al. Human Gene Mutation Database (HGMD): 2003 update. Hum Mutat 21, 577–581 (2003). [DOI] [PubMed] [Google Scholar]
17.Landrum MJ, et al. ClinVar: improving access to variant interpretations and supporting evidence. Nucleic Acids Res 46, D1062–D1067 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Feuchtbaum L, Yang J & Currier R Follow-up status during the first 5 years of life for metabolic disorders on the federal Recommended Uniform Screening Panel. Genet Med 20, 831–839 (2018). [DOI] [PubMed] [Google Scholar]
19.Feuchtbaum L, Carter J, Dowray S, Currier RJ & Lorey F Birth prevalence of disorders detectable through newborn screening by race/ethnicity. Genet Med 14, 937–945 (2012). [DOI] [PubMed] [Google Scholar]
20.Consortium GP, et al. A global reference for human genetic variation. Nature 526, 68–74 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Lek M, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285–291 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Matern D, et al. Prospective diagnosis of 2-methylbutyryl-CoA dehydrogenase deficiency in the Hmong population by newborn screening using tandem mass spectrometry. Pediatrics 112, 74–78 (2003). [DOI] [PubMed] [Google Scholar]
23.Tiranti V, et al. Ethylmalonic encephalopathy is caused by mutations in ETHE1, a gene encoding a mitochondrial matrix protein. Am J Hum Genet 74, 239–252 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Henriques BJ, et al. Ethylmalonic encephalopathy ETHE1 R163W/R163Q mutations alter protein stability and redox properties of the iron centre. PLoS One 9, e107157 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Wang ZQ, Chen XJ, Murong SX, Wang N & Wu ZY Molecular analysis of 51 unrelated pedigrees with late-onset multiple acyl-CoA dehydrogenation deficiency (MADD) in southern China confirmed the most common ETFDH mutation and high carrier frequency of c.250G>A. J Mol Med (Berl) 89, 569–576 (2011). [DOI] [PubMed] [Google Scholar]
26.Goldfeder RL, et al. Medical implications of technical accuracy in genome sequencing. Genome Med 8, 24 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Sulonen AM, et al. Comparison of solution-based exome capture methods for next generation sequencing. Genome Biol 12, R94 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Peng G, et al. Combining newborn metabolic and DNA analysis for second-tier testing of methylmalonic acidemia. Genet Med 21, 896–903 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Vockley J, Rinaldo P, Bennett MJ, Matern D & Vladutiu GD Synergistic heterozygosity: disease resulting from multiple partial defects in one or more metabolic pathways. Mol Genet Metab 71, 10–18 (2000). [DOI] [PubMed] [Google Scholar]
30.Batshaw ML, Msall M, Beaudet AL & Trojak J Risk of serious illness in heterozygotes for ornithine transcarbamylase deficiency. J Pediatr 108, 236–241 (1986). [DOI] [PubMed] [Google Scholar]
31.Bodian DL, et al. Utility of whole-genome sequencing for detection of newborn screening disorders in a population cohort of 1,696 neonates. Genet Med 18, 221–230 (2016). [DOI] [PubMed] [Google Scholar]
32.Clark MM, et al. Diagnosis of genetic diseases in seriously ill children by rapid whole-genome sequencing and automated phenotyping and interpretation. Sci Transl Med 11(2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Kingsmore SF, et al. A Randomized, Controlled Trial of the Analytic and Diagnostic Performance of Singleton and Trio, Rapid Genome and Exome Sequencing in Ill Infants. Am J Hum Genet 105, 719–733 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Calonge N, et al. Committee report: Method for evaluating conditions nominated for population-based screening of newborns and children. Genet Med 12, 153–159 (2010). [DOI] [PubMed] [Google Scholar]

References (Methods only)

35.Rodriguez JM, et al. APPRIS: annotation of principal and alternative splice isoforms. Nucleic Acids Res 41, D110–117 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Kircher M, et al. A general framework for estimating the relative pathogenicity of human genetic variants. Nat Genet 46, 310–315 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Jian X, Boerwinkle E & Liu X In silico prediction of splice-altering single nucleotide variants in the human genome. Nucleic Acids Res 42, 13534–13544 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Fromer M, et al. Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth. Am J Hum Genet 91, 597–607 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Chamberlin ME, Ubagai T, Mudd SH, Levy HL & Chou JY Dominant inheritance of isolated hypermethioninemia is associated with a mutation in the human methionine adenosyltransferase 1A gene. Am J Hum Genet 60, 540–546 (1997). [PMC free article] [PubMed] [Google Scholar]
40.Li H & Durbin R Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Van der Auwera GA, et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr Protoc Bioinformatics 43, 11 10 11–33 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
42.DePristo MA, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 43, 491–498 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Punwani D, et al. Multisystem Anomalies in Severe Combined Immunodeficiency with Mutant BCL11B. N Engl J Med 375, 2165–2176 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Harrow J, et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res 22, 1760–1774 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Tabor HK, et al. Pathogenic variants for Mendelian and complex traits in exomes of 6,517 European and African Americans: implications for the return of incidental results. Am J Hum Genet 95, 183–193 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Jian X & Liu X In Silico Prediction of Deleteriousness for Nonsynonymous and Splice-Altering Single Nucleotide Variants in the Human Genome. Methods Mol Biol 1498, 191–197 (2017). [DOI] [PubMed] [Google Scholar]
47.Sunderam U, et al. DNA from dried blood spots yields high quality sequences for exome analysis. bioRxiv, 2020.2005.2019.105304 (2020). [Google Scholar]
48.Jun G, et al. Detecting and estimating contamination of human DNA samples in sequencing and array-based genotype data. Am J Hum Genet 91, 839–848 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Richards S, et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet Med 17, 405–424 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Yorifuji T, et al. X-inactivation pattern in the liver of a manifesting female with ornithine transcarbamylase (OTC) deficiency. Clin Genet 54, 349–353 (1998). [DOI] [PubMed] [Google Scholar]
51.Hu J, et al. Association of CPT II gene with risk of acute encephalitis in Chinese children. Pediatr Infect Dis J 33, 1077–1082 (2014). [DOI] [PubMed] [Google Scholar]
52.Bell CJ, et al. Carrier testing for severe childhood recessive diseases by next-generation sequencing. Sci Transl Med 3, 65ra64 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Bergeron A, D’Astous M, Timm DE & Tanguay RM Structural and functional analysis of missense mutations in fumarylacetoacetate hydrolase, the gene deficient in hereditary tyrosinemia type 1. J Biol Chem 276, 15225–15231 (2001). [DOI] [PubMed] [Google Scholar]
54.Gallant NM, et al. Biochemical, molecular, and clinical characteristics of children with short chain acyl-CoA dehydrogenase deficiency detected by newborn screening in California. Mol Genet Metab 106, 55–61 (2012). [DOI] [PubMed] [Google Scholar]
55.Jethva R, Bennett MJ & Vockley J Short-chain acyl-coenzyme A dehydrogenase deficiency. Mol Genet Metab 95, 195–200 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Wolfe L, Jethva R, Oglesbee D & Vockley J Wolfe L, Jethva R, Oglesbee D, et al. Short-Chain Acyl-CoA Dehydrogenase Deficiency. In: Adam MP, Ardinger HH, Pagon RA, et al., editors. GeneReviews® [Internet]. Seattle (WA): University of Washington, Seattle; 1993–2019. Available from: https://www.ncbi.nlm.nih.gov/books/NBK63582/, cited June 15, 2019. in GeneReviews((R)) (eds. Adam, M.P., et al.) (Seattle (WA)). [Google Scholar]
57.Robinson JT, et al. Integrative genomics viewer. Nat Biotechnol 29, 24–26 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
58.Corvelo A, Hallegger M, Smith CW & Eyras E Genome-wide association between branch point properties and alternative splicing. PLoS Comput Biol 6, e1001016 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Sterne-Weiler T, Howard J, Mort M, Cooper DN & Sanford JR Loss of exon identity is a common mechanism of human inherited disease. Genome Res 21, 1563–1571 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

SUPPLEMENTAL FIGURES

NIHMS1766172-supplement-SUPPLEMENTAL_FIGURES.pdf^{(58.8KB, pdf)}

SUPPLEMENTAL TABLES

NIHMS1766172-supplement-SUPPLEMENTAL_TABLES.xlsx^{(163.2KB, xlsx)}

Data Availability Statement

[R1] 1.Hall PL, et al. Postanalytical tools improve performance of newborn screening by tandem mass spectrometry. Genet Med 16, 889–895 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R2] 2.Mak CM, Lee HC, Chan AY & Lam CW Inborn errors of metabolism and expanded newborn screening: review and update. Crit Rev Clin Lab Sci 50, 142–162 (2013). [DOI] [PubMed] [Google Scholar]

[R3] 3.McHugh D, et al. Clinical validation of cutoff target ranges in newborn screening of metabolic disorders by tandem mass spectrometry: a worldwide collaborative project. Genet Med 13, 230–254 (2011). [DOI] [PubMed] [Google Scholar]

[R4] 4.Wilcken B, Wiley V, Hammond J & Carpenter K Screening newborns for inborn errors of metabolism by tandem mass spectrometry. N Engl J Med 348, 2304–2312 (2003). [DOI] [PubMed] [Google Scholar]

[R5] 5.Tang H, et al. Damaged goods?: an empirical cohort study of blood specimens collected 12 to 23 hours after birth in newborn screening in California. Genet Med 18, 259–264 (2016). [DOI] [PubMed] [Google Scholar]

[R6] 6.Adams DR & Eng CM Next-Generation Sequencing to Diagnose Suspected Genetic Disorders. N Engl J Med 379, 1353–1362 (2018). [DOI] [PubMed] [Google Scholar]

[R7] 7.Biesecker LG & Green RC Diagnostic clinical genome and exome sequencing. N Engl J Med 371, 1170 (2014). [DOI] [PubMed] [Google Scholar]

[R8] 8.Farnaes L, et al. Rapid whole-genome sequencing decreases infant morbidity and cost of hospitalization. NPJ Genom Med 3, 10 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.French CE, et al. Whole genome sequencing reveals that genetic conditions are frequent in intensively ill children. Intensive Care Medicine (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R10] 10.Friedman JM, et al. Genome-wide sequencing in acutely ill infants: genomic medicine’s critical application? Genet Med (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R11] 11.Berg JS, et al. Newborn Sequencing in Genomic Medicine and Public Health. Pediatrics 139(2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Regaldo A. Baby Genome Sequencing for Sale in China. in Technology Review (15 June 2017). [Google Scholar]

[R13] 13.Hoffmann GF Inherited Metabolic Diseases in the Context of Rare/Orphan Diseases. in Inherited Metabolic Diseases: A Clinical Approach (eds. Hoffmann GF, Zschocke J & Nyhan WL) 31–32 (Springer Berlin Heidelberg, Berlin, Heidelberg, 2017). [Google Scholar]

[R14] 14.Bassaganyas L, et al. Whole exome and whole genome sequencing with dried blood spot DNA without whole genome amplification. Hum Mutat 39, 167–171 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R15] 15.Biesecker LG Secondary findings in exome slices, virtual panels, and anticipatory sequencing. Genet Med 21, 41–43 (2019). [DOI] [PubMed] [Google Scholar]

[R16] 16.Stenson PD, et al. Human Gene Mutation Database (HGMD): 2003 update. Hum Mutat 21, 577–581 (2003). [DOI] [PubMed] [Google Scholar]

[R17] 17.Landrum MJ, et al. ClinVar: improving access to variant interpretations and supporting evidence. Nucleic Acids Res 46, D1062–D1067 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] 18.Feuchtbaum L, Yang J & Currier R Follow-up status during the first 5 years of life for metabolic disorders on the federal Recommended Uniform Screening Panel. Genet Med 20, 831–839 (2018). [DOI] [PubMed] [Google Scholar]

[R19] 19.Feuchtbaum L, Carter J, Dowray S, Currier RJ & Lorey F Birth prevalence of disorders detectable through newborn screening by race/ethnicity. Genet Med 14, 937–945 (2012). [DOI] [PubMed] [Google Scholar]

[R20] 20.Consortium GP, et al. A global reference for human genetic variation. Nature 526, 68–74 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Lek M, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285–291 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] 22.Matern D, et al. Prospective diagnosis of 2-methylbutyryl-CoA dehydrogenase deficiency in the Hmong population by newborn screening using tandem mass spectrometry. Pediatrics 112, 74–78 (2003). [DOI] [PubMed] [Google Scholar]

[R23] 23.Tiranti V, et al. Ethylmalonic encephalopathy is caused by mutations in ETHE1, a gene encoding a mitochondrial matrix protein. Am J Hum Genet 74, 239–252 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] 24.Henriques BJ, et al. Ethylmalonic encephalopathy ETHE1 R163W/R163Q mutations alter protein stability and redox properties of the iron centre. PLoS One 9, e107157 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R25] 25.Wang ZQ, Chen XJ, Murong SX, Wang N & Wu ZY Molecular analysis of 51 unrelated pedigrees with late-onset multiple acyl-CoA dehydrogenation deficiency (MADD) in southern China confirmed the most common ETFDH mutation and high carrier frequency of c.250G>A. J Mol Med (Berl) 89, 569–576 (2011). [DOI] [PubMed] [Google Scholar]

[R26] 26.Goldfeder RL, et al. Medical implications of technical accuracy in genome sequencing. Genome Med 8, 24 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] 27.Sulonen AM, et al. Comparison of solution-based exome capture methods for next generation sequencing. Genome Biol 12, R94 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Peng G, et al. Combining newborn metabolic and DNA analysis for second-tier testing of methylmalonic acidemia. Genet Med 21, 896–903 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] 29.Vockley J, Rinaldo P, Bennett MJ, Matern D & Vladutiu GD Synergistic heterozygosity: disease resulting from multiple partial defects in one or more metabolic pathways. Mol Genet Metab 71, 10–18 (2000). [DOI] [PubMed] [Google Scholar]

[R30] 30.Batshaw ML, Msall M, Beaudet AL & Trojak J Risk of serious illness in heterozygotes for ornithine transcarbamylase deficiency. J Pediatr 108, 236–241 (1986). [DOI] [PubMed] [Google Scholar]

[R31] 31.Bodian DL, et al. Utility of whole-genome sequencing for detection of newborn screening disorders in a population cohort of 1,696 neonates. Genet Med 18, 221–230 (2016). [DOI] [PubMed] [Google Scholar]

[R32] 32.Clark MM, et al. Diagnosis of genetic diseases in seriously ill children by rapid whole-genome sequencing and automated phenotyping and interpretation. Sci Transl Med 11(2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] 33.Kingsmore SF, et al. A Randomized, Controlled Trial of the Analytic and Diagnostic Performance of Singleton and Trio, Rapid Genome and Exome Sequencing in Ill Infants. Am J Hum Genet 105, 719–733 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]

[R34] 34.Calonge N, et al. Committee report: Method for evaluating conditions nominated for population-based screening of newborns and children. Genet Med 12, 153–159 (2010). [DOI] [PubMed] [Google Scholar]

PERMALINK

The Role of Exome Sequencing in Newborn Screening for Inborn Errors of Metabolism

Aashish N Adhikari

Renata C Gallagher

Yaqiong Wang

Robert J Currier

George Amatuni

Laia Bassaganyas

Flavia Chen

Kunal Kundu

Mark Kvale

Sean D Mooney

Robert L Nussbaum

Savanna S Randi

Jeremy Sanford

Joseph T Shieh

Rajgopal Srinivasan

Uma Sunderam

Hao Tang

Dedeepya Vaka

Yangyun Zou

Barbara A Koenig

Pui-Yan Kwok

Neil Risch

Jennifer M Puck

Steven E Brenner

Abstract

Fig. 1. Low positive predictive value and complex differential diagnoses of MS/MS newborn screening for glutaric acidemia (GA-I).

Table 1:

Fig. 2. Whole exome pipeline design and analysis.

Table 2:

Methods

NBSeq samples and study set

Variant calling and quality control

Variant calling and annotation

Quality control metrics

Mapping related metrics

DNA damage related metrics

Variant related metrics

QC criteria for sample exclusion

Exome analysis pipeline

Pipeline development

Architecture of the final exome analysis pipeline

Minor allele frequency threshold adjustment for X-linked genes

NBSeq variant curation

Selection of the most prominent genotype in a sample

Assessment of predictions from the exome analysis pipeline

Incorporation of clinical evidence to generate a “reviewed” diagnosis key

Assessment of exome sequencing as a primary screen

Overall sensitivity and specificity

Performance on data subsets

Alternative specificity calculation

Alternative pipelines and findings

Assessment of exome sequencing as a follow-up test

Follow-up exploratory studies of selected exome false negative cases

Whole genome pilot study of eight exome false negatives

Functional assay of an intronic variant in an exome false negative

Data availability

Code availability

Extended Data

Extended Data Fig. 1. Metrics for WES reads and coverage.

Extended Data Fig. 2. DNA damage related metrics for the three sequencing batches.

Extended Data Fig. 3. Variant related quality metrics for 1,216 samples plotted batch wise.

Extended Data Fig. 4. Example showing variability of gene coverage in two IEM genes in the study across 1,216 samples.

Extended Data Fig. 5. Alternative pipelines derived from the final exome analysis pipeline to explore sensitivity-specificity tradeoffs.

Extended Data Fig. 6. Distribution of variants reported by the exome analysis pipeline in the NBSeq test set.

Extended Data Fig. 7. Whole genome sequencing confirms potential IVDdeletions in two individuals diagnosed with isovaleric acidemia initially missed in exome.

Extended Data Fig. 8. Experimental splicing assay of a potentially pathogenic intronic variant in an exome false negative case.

Extended Data Fig. 9. Stratification of IEM-affected and MS/MS false positives by alleles reported by the exome analysis pipeline for NPV estimation of NPV of exome as a follow-up test after a positive MS/MS screen.

Extended Data Fig. 10. Zygosity distribution of variants reported by the pipeline in relevant gene(s).

Supplementary Material

Acknowledgements

References

References (Methods only)

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES