Abstract
Approximately 10–15% of intellectual disability (ID) cases are caused by genetic aberrations affecting chromosome X, a condition termed X-linked ID (XLID). Examination by whole-exome sequencing of two male siblings with microcephaly and suspected XLID with an unknown genetic basis revealed that they were both hemizygous for a predicted pathogenic variant (p.Ala897Val) causing a non-synonymous substitution of an evolutionary conserved amino acid within the host cell factor C1 (HCFC1) gene. Subsequent analysis determined that this was a rare variant not identified in 100 control individuals or in online databases of control individuals. Recent studies have reported mutations affecting HCFC1 in patients with ID and dysmorphic features that are associated with defective cobalamin metabolism. Biochemical investigations did not find evidence of an association between the variant identified in the present study and cobalamin metabolic disorder. This study offers further support for mutations of HCFC1 being implicated in XLID and microcephaly, but that these are not necessarily associated with cobalamin disorder.
Keywords: exome, HCFC1, intellectual disability, microcephaly
Introduction
Intellectual disability (ID), defined as the significant impairment of cognitive and adaptive behavior, affects 1.5–2% of the human population (1), and therefore receives considerable attention from scientists working in the fields of neuroscience and genetics. Studies have shown that 10–15% of inherited severe ID is the result of genetic aberrations on chromosome X, a condition termed X-Linked ID (XLID) (2). XLID is subdivided into two major categories, syndromic and non-syndromic, based on the presence or absence of specific accompanying clinical, morphological, neurological or metabolic abnormalities. The borders between non-syndromic and syndromic are not always clearly defined, particularly in small families where few individuals are affected. Small families or individual cases are therefore usually categorized as non-syndromic, contributing to the heterogeneity of this category.
Through the intensive efforts of various research groups and the utilization of powerful new genomic technologies, >100 XLID genes have been reported so far, although it is estimated that a substantial fraction of XLID genes remain to be identified (3,4). The recently developed whole-exome sequencing (WES) technology allows the rapid detection of variants located within protein-coding regions of the genome that could not be detected by technologies such as karyotyping and array-comparative genomic hybridization (array-CGH). In the present study, WES was used to screen two brothers with suspected XLID. The analysis identified a novel variant in these patients within HCFC1, a gene recently implicated in XLID.
Materials and methods
Clinical evaluation of patients
Patient 1 was the first of two male siblings born to healthy non-consanguineous Greek-Cypriot parents and was initially referred to the Clinical Genetics service (The Cyprus Institute of Neurology and Genetics, Nicosia, Cyprus) at the age of 5 years and 2 months for evaluation regarding a history of developmental and speech delay. The patient was born at 39 weeks gestation following a normal vaginal delivery with a birth weight on the 9th percentile, a birth length on the 25–50th percentile, and a birth occipitofrontal circumference (ofc) on the 0.4th percentile. There were no antenatal concerns, no history of a maternal intercurrent infection or known exposure to teratogens. Developmentally the patient sat without support at the age of 6 months and walked independently at the age of 13 months. The patient presented with moderate developmental motor and speech delay, as well as with severe learning difficulties. Other characteristic findings at the age of clinical evaluation were microcephaly (ofc <3rd percentile) and hyperactivity. The weight and height of the patient were between the 25–50th percentile. The patient had a beaked nose with relatively large ears and synophrys. The hands and feet were normal, including the palmar and plantar creases.
Patient 2 (the male sibling of patient 1) was referred to the Department of Clinical Genetics at the age of 14 months for evaluation regarding developmental delay. The patient was born prematurely at 35 weeks gestation following an emergency C-section due to fetal distress and prolonged rupture of membranes with a birth weight on the 25th percentile. At the age of 14 months, the patient was reported to be microcephalic and could only walk with support. On examination at the age of 5 years, the patient had severe developmental and speech delay. The patient was microcephalic (ofc <3th percentile) with myopathic facies, arched eyebrows, an open-mouthed appearance with a Cupid's bow configuration of the upper lip and prognathism. The eyes of the patient were prominent. The patient had a small and narrow alae nasi, triangular nails, bilateral 2–3 toe syndactyly and bilateral pes planus.
The endocrinology and ophthalmology evaluations yielded normal results. Extensive metabolic investigations did not reveal any abnormalities. In both siblings, karyotype, fragile X syndrome, and high-resolution array-CGH analyses showed normal results.
WES and variant analysis
DNA was extracted from the examined individuals using the Qiagen DNA extraction kit (Qiagen, Hilden, Germany) according to the manufacturer's protocol. WES was performed on DNA samples from the affected male siblings as a service by Oxford Gene Technology (Kidlington, UK). Exome enrichment was performed using the SureSelect exome enrichment kit v1.2 (Agilent Technologies, Thermo Fisher Scientific, Inc., Waltham, MA, USA) and the concentration of each library was determined using the Agilent qPCR NGS Library Quantification kit. Prior to sequencing, the samples were pooled to a concentration of 10 nM per sample. Sequencing of the enriched exome was performed on the Illumina HiSeq 2000 using TruSeq v3 chemistry (Illumina, Inc., San Diego, CA, USA). For the first patient there were 25, 241 and 930 paired reads, 67.48% of bases were on target, and 92.69% of target bases were covered at >20×. For the second patient there were 24, 926 and 649 paired reads, 67.39% of bases were on target, and 92.60% of bases were covered at >20×.
For the bionformatics analysis of the WES data the reads were mapped to genome assembly hg19 using the Burrows-Wheeler Aligner package, version 0.6.2 (5). Local realignment of the mapped reads around potential insertion/deletion (indel) sites was carried out with the Genome Analysis Tool kit (GATK) version 1.6 (6). Duplicate reads were marked using Picard version 1.83 (http://picard.sourceforge.net/). Base quality (Phred scale) scores were recalibrated using the covariance recalibration of GATK. SNP and indel variants were named using the GATK Unified Genotyper for each sample. SNP novelty was determined against dbSNP release 135 in February 2013 (7). Further screening of variants was performed using the exome variant server (EVS) database in May 2014 (http://evs.gs.washington.edu/EVS/). Pathogenicity prediction for identified variants was performed using the Polyphen (http://genetics.bwh.harvard.edu/pph/data/), Sift (http://sift.jcvi.org/), and Condel (http://bg.upf.edu/fannsdb/) tools.
Sanger sequencing
Purification of polymerase chain reaction products was performed using Exo-SapIT (Affymetrix, Santa Clara, CA, USA). Sequencing reactions were performed using Applied Biosystems BigDye terminator v1.1 and products were analyzed on the ABI 3130 sequencer. The Primer sequences are available on request.
Results
Variations
A total of 46,134 variations were observed for patient 1 and 45,711 variations for patient 2. Of these, there were 45,784 overlapped genes for patient 1 and 45,369 for patient 2. For variants not present in dbSNP 135 there were 3,462 for patient 1 and 3,483 for patient 2. Of these, 675 and 682, respectively, were predicted to have serious consequences (missense, frameshift, indel, essential splice-site, nonsense and nonstop) respectively. Based on the suspected XLID inheritance of the disease, we prioritized variants detected in the WES data that were: i) Located on the X chromosome, ii) found in both affected patients, iii) rare variants not present in the dsSNP database, and iv) predicted to have serious phenotypic consequences by in silico tools (Sift, Polyphen or Condel).
Only two variants fulfilled the criteria outlined above: A non-synonymous variant in the HCFC1 gene (ChrX:153,221,808 GRCh37; p.Ala897Val) and a non-synonymous variant in the zinc finger protein, X-linked (ZFX) gene (ChrX:24,197,706). Further screening of these two variants in the EVS database found that the HCFC1 variant was not present in any of the screened individuals, while the ZFX variant was present in 8 out of 10,555 chromosomes. Additionally, mutations in ZFX had not been previously associated with ID. Consequently, the ZFX variant was less likely to be the cause of the ID in these patients.
Sanger sequencing
The identified HCFC1 variant was present in the two male siblings as confirmed by Sanger sequencing using custom primers and the Big Dye terminator v1.1, while their mother was a heterozygous carrier (Fig. 1). Additionally, Sanger sequencing of the HCFC1 variant showed that this was not present in 100 neurotypical individuals from the Cypriot population. This rare variant affects an evolutionary conserved amino acid (Fig. 2), supporting its functional importance.
Discussion
HCFC1 encodes a large protein of >2,000 amino acids made up of kelch repeats, HCF-proteolysis repeats, protein interacting domains, basic domain, acidic region, and fibronectin domains (Fig. 3). In recent years studies have reported an association of mutations affecting HCFC1 with ID, metabolic disorders and dysmorphias. Mutations affecting conserved amino acids within the HCFC1 kelch domains have been associated with defective cobalamin metabolism and neurological symptoms by affecting transcription of MMAHC (8). Gérard et al (9) recently reported a p.Tyr103His mutation in two male siblings with ID and cobalamin disorder. These male siblings also exhibited dysmorphic features and microcephaly. Experimental evidence from the examination of patient-derived cell lines and in vitro assays suggest that mutations within kelch domains result in almost complete loss of HCFC1 function and result in cobalamin disorder, while HCFC1 mutations in other regions result in much milder effects on protein function and cobalamin metabolism (8,10). Loss-of-function of the zebrafish homolog of HCFC1 has also been associated with craniofacial abnormalities through dysregulation of MMAHC (11). It has been suggested that mutations in non-kelch domains of HCFC1, which result in partial loss-of-function, cause neurological disorders in the absence of cobalamin deficiency due to HCFC1 having other gene targets that are important for neurological development (8,10).
Examination of patient 1 revealed normal levels of cobalamin, homocysteine and folate in the blood [436 pg/ml cobalamin (normal range, 208–964 pg/ml), 9.1 mmol/l homocysteine (normal range, 5–15 mmol/l) and 5.4 ng/ml folate (normal range, 3–15 ng/ml)]. Consequently, no supporting evidence that the pathogenic phenotype observed in this case is associated with defective cobalamin metabolism was identified.
The variant identified in the present study (p.Ala897Val) is located within the GA-binding protein (GABP) interaction domain. Interaction of GABP with HCFC1 with transcription factors, including GABP, is believed to be essential for the ability of HCFC1 to regulate the transcription of a large number of genes (12). Within the same region, two variants have been identified in an autistic patient (p.Gly876Ser) (13) and in a patient with ID (p.Ala864Thr) (3). The findings of the present study are consistent with mutations within this region of HCFC1 being associated with ID or ASD and with dysmorphic features, without clinical manifestation of cobalamin deficiency.
References
- 1.Leonard H, Wen X. The epidemiology of mental retardation: Challenges and opportunities in the new millennium. Ment Retard Dev Disabil Res Rev. 2002;8:117–134. doi: 10.1002/mrdd.10031. [DOI] [PubMed] [Google Scholar]
- 2.De Brouwer AP, Yntema HG, Kleefstra T, Lugtenberg D, Oudakker AR, de Vries BB, van Bokhoven H, Van Esch H, Frints SG, Froyen G, et al. Mutation frequencies of X-linked mental retardation genes in families from the EuroMRX consortium. Hum Mutat. 2007;28:207–208. doi: 10.1002/humu.9482. [DOI] [PubMed] [Google Scholar]
- 3.Tarpey PS, Smith R, Pleasance E, Whibley A, Edkins S, Hardy C, O'Meara S, Latimer C, Dicks E, Menzies A, et al. A systematic, large-scale resequencing screen of X-chromosome coding exons in mental retardation. Nat Genet. 2009;41:535–543. doi: 10.1038/ng.367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bassani S, Zapata J, Gerosa L, Moretto E, Murru L, Passafaro M. The neurobiology of X-linked intellectual disability. Neuroscientist. 2013;19:541–552. doi: 10.1177/1073858413493972. [DOI] [PubMed] [Google Scholar]
- 5.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler Transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, DePristo MA. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wheeler DL, Barrett T, Benson DA, et al. Database resources of the National Center for Biotechnology Information. Nucleic Acids Res. 2007;35(Database issue):D5–D12. doi: 10.1093/nar/gkl1031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Yu HC, Sloan JL, Scharer G, Brebner A, Quintana AM, Achilly NP, Manoli I, Coughlin CR, II, Geiger EA, Schneck U, et al. An X-linked cobalamin disorder caused by mutations in transcriptional coregulator HCFC1. Am J Hum Genet. 2013;93:506–514. doi: 10.1016/j.ajhg.2013.07.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Gérard M, Morin G, Bourillon A, Colson C, Mathieu S, Rabier D, de Villemeur Billette T, de Baulny Ogier H, Benoist JF. Multiple congenital anomalies in two boys with mutation in HCFC1 and cobalamin disorder. Eur J Med Genet. 2015;58:148–153. doi: 10.1016/j.ejmg.2014.12.015. [DOI] [PubMed] [Google Scholar]
- 10.Jolly LA, Nguyen LS, Domingo D, Sun Y, Barry S, Hancarova M, Plevova P, Vlckova M, Havlovicova M, Kalscheuer VM, et al. HCFC1 loss-of-function mutations disrupt neuronal and neural progenitor cells of the developing brain. Hum Mol Genet. 2015;24:3335–3347. doi: 10.1093/hmg/ddv083. [DOI] [PubMed] [Google Scholar]
- 11.Quintana AM, Geiger EA, Achilly N, Rosenblatt DS, Maclean KN, Stabler SP, Artinger KB, Appel B, Shaikh TH. Hcfc1b, a zebrafish ortholog of HCFC1, regulates craniofacial development by modulating mmachc expression. Dev Biol. 2014;396:94–106. doi: 10.1016/j.ydbio.2014.09.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Michaud J, Praz V, Faresse James N, Jnbaptiste CK, Tyagi S, Schütz F, Herr W. HCFC1 is a common component of active human CpG-island promoters and coincides with ZNF143, THAP11, YY1 and GABP transcription factor occupancy. Genome Res. 2013;23:907–916. doi: 10.1101/gr.150078.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Piton A, Gauthier J, Hamdan FF, Lafrenière RG, Yang Y, Henrion E, Laurent S, Noreau A, Thibodeau P, Karemera L, et al. Systematic resequencing of X-chromosome synaptic genes in autism spectrum disorder and schizophrenia. Mol Psychiatry. 2011;16:867–880. doi: 10.1038/mp.2010.54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Huang L, Jolly LA, Willis-Owen S, Gardner A, Kumar R, Douglas E, Shoubridge C, Wieczorek D, Tzschach A, Cohen M, Hackett A, Field M, Froyen G, Hu H, Haas SA, Ropers HH, Kalscheuer VM, Corbett MA, Gecz J. A noncoding, regulatory mutation implicates HCFC1 in nonsyndromic intellectual disability. Am J Hum Genet. 2012;91:694–702. doi: 10.1016/j.ajhg.2012.08.011. [DOI] [PMC free article] [PubMed] [Google Scholar]