Homozygous and hemizygous CNV detection from exome sequencing data in a Mendelian disease cohort

  1. James R. Lupski 1 , 3 , 7 , 8 , *
  1. 1Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
  2. 2Institute of Computer Science, Warsaw University of Technology, Warsaw, 00-665 Warsaw, Poland
  3. 3Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
  4. 4Norwegian National Unit for Newborn Screening, Division for Pediatric and Adolescent Medicine, Oslo University Hospital, N-0424 Oslo, Norway
  5. 5Graduate Program in Diagnostic Genetics, School of Health Professions, University of Texas MD Anderson Cancer Center, Houston, TX 77030, USA
  6. 6Human Genetics Center, University of Texas Health Science Center at Houston, Houston, TX 77030, USA
  7. 7Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
  8. 8Texas Children's Hospital, Houston, TX 77030, USA
  1. *To whom correspondence should be addressed. Tel: +1 713 798 6530; Fax: +1 713 798 5073; Email: jlupski{at}bcm.edu
  2. These authors contributed equally to this work as the first authors.
  • Received July 16, 2015.
  • Revision received November 22, 2016.
  • Accepted November 29, 2016.

Abstract

We developed an algorithm, HMZDelFinder, that uses whole exome sequencing (WES) data to identify rare and intragenic homozygous and hemizygous (HMZ) deletions that may represent complete loss-of-function of the indicated gene. HMZDelFinder was applied to 4866 samples in the Baylor–Hopkins Center for Mendelian Genomics (BHCMG) cohort and detected 773 HMZ deletion calls (567 homozygous or 206 hemizygous) with an estimated sensitivity of 86.5% (82% for single-exonic and 88% for multi-exonic calls) and precision of 78% (53% single-exonic and 96% for multi-exonic calls). Out of 773 HMZDelFinder-detected deletion calls, 82 were subjected to array comparative genomic hybridization (aCGH) and/or breakpoint PCR and 64 were confirmed. These include 18 single-exon deletions out of which 8 were exclusively detected by HMZDelFinder and not by any of seven other CNV detection tools examined. Further investigation of the 64 validated deletion calls revealed at least 15 pathogenic HMZ deletions. Of those, 7 accounted for 17–50% of pathogenic CNVs in different disease cohorts where 7.1–11% of the molecular diagnosis solved rate was attributed to CNVs. In summary, we present an algorithm to detect rare, intragenic, single-exon deletion CNVs using WES data; this tool can be useful for disease gene discovery efforts and clinical WES analyses.

This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions{at}oup.com