Skip to main content
PLOS Genetics logoLink to PLOS Genetics
. 2021 Aug 6;17(8):e1009698. doi: 10.1371/journal.pgen.1009698

Size matters: Large copy number losses in Hirschsprung disease patients reveal genes involved in enteric nervous system development

Laura E Kuil 1, Katherine C MacKenzie 1, Clara S Tang 2,3, Jonathan D Windster 1,4, Thuy Linh Le 5, Anwarul Karim 2, Bianca M de Graaf 1, Robert van der Helm 1, Yolande van Bever 1, Cornelius E J Sloots 4, Conny Meeussen 4, Dick Tibboel 4, Annelies de Klein 1, René M H Wijnen 4, Jeanne Amiel 5, Stanislas Lyonnet 5, Maria-Mercè Garcia-Barcelo 2, Paul K H Tam 2,3, Maria M Alves 1, Alice S Brooks 1, Robert M W Hofstra 1,6, Erwin Brosens 1,*
Editor: Gregory S Barsh7
PMCID: PMC8372947  PMID: 34358225

Abstract

Hirschsprung disease (HSCR) is a complex genetic disease characterized by absence of ganglia in the intestine. HSCR etiology can be explained by a unique combination of genetic alterations: rare coding variants, predisposing haplotypes and Copy Number Variation (CNV). Approximately 18% of patients have additional anatomical malformations or neurological symptoms (HSCR-AAM). Pinpointing the responsible culprits within a CNV is challenging as often many genes are affected. Therefore, we selected candidate genes based on gene enrichment strategies using mouse enteric nervous system transcriptomes and constraint metrics. Next, we used a zebrafish model to investigate whether loss of these genes affects enteric neuron development in vivo.

This study included three groups of patients, two groups without coding variants in disease associated genes: HSCR-AAM and HSCR patients without associated anomalies (HSCR-isolated). The third group consisted of all HSCR patients in which a confirmed pathogenic rare coding variant was identified. We compared these patient groups to unaffected controls. Predisposing haplotypes were determined, confirming that every HSCR subgroup had increased contributions of predisposing haplotypes, but their contribution was highest in isolated HSCR patients without RET coding variants. CNV profiling proved that specifically HSCR-AAM patients had larger Copy Number (CN) losses. Gene enrichment strategies using mouse enteric nervous system transcriptomes and constraint metrics were used to determine plausible candidate genes located within CN losses. Validation in zebrafish using CRISPR/Cas9 targeting confirmed the contribution of UFD1L, TBX2, SLC8A1, and MAPK8 to ENS development. In addition, we revealed epistasis between reduced Ret and Gnl1 expression and between reduced Ret and Tubb5 expression in vivo. Rare large CN losses—often de novo—contribute to HSCR in HSCR-AAM patients. We proved the involvement of six genes in enteric nervous system development and Hirschsprung disease.

Author summary

Hirschsprung disease is a congenital disorder characterized by the absence of intestinal neurons in the distal part of the intestine. It is a complex genetic disorder in which multiple variations in our genome combined, result in disease. One of these variations are Copy Number Variations (CNVs): large segments of our genome that are duplicated or deleted. Patients often have Hirschsprung disease without other symptoms. However, a proportion of patients has additional associated anatomical malformations and neurological symptoms. We found that CNVs, present in patients with associated anomalies, are more often larger compared to unaffected controls or Hirschsprung patients without other symptoms. Furthermore, Copy Number (CN) losses are enriched for constrained coding regions (CCR; genes usually not impacted by genomic alterations in unaffected controls) of which the expression is higher in the developing intestinal neurons compared to the intestine. We modelled loss of these candidate genes in zebrafish by disrupting the zebrafish orthologues by genome editing. For several genes this resulted in changes in intestinal neuron development, reminiscent of HSCR observed in patients. The results presented here highlight the importance of Copy Number profiling, zebrafish validation and evaluating all CCR expressed in developing intestinal neurons during diagnostic evaluation.

Introduction

The enteric neurons and glia in the intestine form the enteric nervous system (ENS), which is derived from the neural crest. Enteric neural crest cells (ENCCs) invade the developing gut tube in the proximal foregut around week four of human development and migrate caudally to colonize the distal hindgut in week 7 [1]. Failure in either migration, proliferation, differentiation or survival of these ENCCs [2] is known to result in Hirschsprung disease (HSCR). This congenital enteric neuropathy is characterized by an absence of the ENS, or also called enteric ganglia, in the distal colon (called aganglionosis). The length of the affected region is most often restricted to the recto-sigmoid region, however the length of aganglionosis can extend to the whole intestine. HSCR can segregate through families [36] and, depending on the length of the affected region and gender of the proband, siblings can have a recurrence risk of up to 33% [7]. Most patients have HSCR as an isolated anatomical malformation [8] and in these patients this disease is a textbook example of a complex genetic disorder where the phenotype stems from a combination of genetic alterations: rare coding variants, predisposing common haplotypes and losses or gains of (large) parts of the genome (Copy Number Variation (CNV)[9]. The common haplotypes influence HSCR penetrance and include haplotypes near RET. They are known to increase disease risk substantially, especially if homozygous, in specific combinations (rs2506030, rs7069590, rs2435357) [10,11], together with other risk loci near the semaphorin gene cluster (rs80227144) [11,12], or the neuregulin 1 gene (NRG1; rs7005606) [1315].

Approximately 18% of HSCR patients have additional anatomical malformations or neurological symptoms (HSCR-AAM) [7]. The combination of disease entities found in HSCR-AAM patients could be the result of one or more point mutation(s) or inherited variants. However, CNVs have been found to contribute to the etiology of many traits, common diseases and congenital anomalies [16]. In addition, chromosomal deletions have proven instrumental in the identification of disease-causing genes. For example, deletions of 10q11 [17,18] led to the identification of the REarranged during Transfection gene (RET), the major responsible gene for familial and sporadic isolated HSCR [19,20]. Also for the identification of causal genes for several syndromes with HSCR as a key feature. For example, deletions of 13q [9, 2125] resulted in the identification of one of the genes responsible for Waardenburg-Shah syndrome type 4 (EDNRB) [26]. Deletions of 2q [2730] and 4p [31] contributed to the discovery of genes responsible for Mowatt-Wilson syndrome (ZEB2, formerly ZFHX1B) [32] and the gene responsible for Congenital Central Hypoventilation Syndrome (PHOX2B) [33]. Therefore, we believe that rare CNVs could significantly contribute to patients with HSCR, in particular to HSCR-AAM patients.

Large CNVs harbour many genes making it difficult to determine genes responsible for the observed trait or disease. One of the strategies to identify the genes involved in disease development is to disrupt these genes in animal models. As an in vivo model to study genetic alterations that affect the development of the enteric nervous system we use zebrafish (Danio rerio), which are often used in development biology as they are easily accessible due to their rapid and ex-utero development. By injecting CRISPR/Cas9 complexes in the one cell stage of fertilized oocytes one can relatively easily generate various insertions and deletions that will disrupt gene function. This method has such high efficiency that it enables us to study the effects of gene disruption in the injected F0 generation zebrafish larvae [3436]. In addition, the development of the zebrafish enteric nervous system is highly similar to that in mouse and human [37,38]. Therefore, the zebrafish makes an excellent model to use for functional genetic testing of HSCR candidate genes in vivo.

In this manuscript, we test our hypothesis that rare CNVs could significantly contribute to patients with HSCR, in particular to HSCR-AAM patients. Hence, we compared the CNVs of HSCR patients with and patients without associated anomalies, which lack rare pathogenic coding changes, and HSCR patients with a known pathogenic coding mutation or variant. Our data pointed to the presence of larger copy number (CN) losses in HSCR-AAM patients. Subsequently, we aimed to identify the gene(s) in these CNV(s) contributing to ENS development by disruption of their orthologues in zebrafish in order to determine whether their loss affects ENS development.

Results

We selected 58 out of 197 HSCR patients for which DNA and informed consent were available, and in whom at least the RET gene was screened. HSCR patients were subdivided in three groups for the CNV detection study: patients with HSCR and additional anatomical malformations or neurological defects (HSCR-AAM), but without a RET pathogenic variant, or other causal genetic defect (group 1, n = 23, Table 1 and Fig 1A), patients with HSCR and a known variant in RET or another causal gene (group 2, n = 15, Table 2 and Fig 1A), and patients with only HSCR, without a deleterious RET coding variant or other causal genetic defect (group 3, n = 20, Fig 1A). Additionally, we included unaffected control individuals (group 4, n = 326, Fig 1A).

Table 1. Hirschsprung patients without a RET mutation and additional phenotypical features.

Patient HSCR type Other phenotypical characteristics
P_000482 Short Hydrocephalus, macrocephaly, autism
P_000540 Short Facial dysmorphisms
P_000494 Short Cardiac defects (VSD, ASD PDA, tricuspid atresia), dysplastic ears, renal malrotation
P_000512 Short Epilepsy, intellectual disability
P_000553 Short Cardiac defects (VSD, dextrocardia, PDA, double outlet right ventricle), intestinal malrotation
P_000559 Total colonic Dysmorphic features, tracheomalacia, cardiac defects (dilated left ventricle, absence of AV conduction)
P_000561 Short Facial dysmorphisms, small fontanelle, gastro-esophageal reflux, laryngeal web
P_000555 Short Hypoplastic thumb, hearing loss, developmental delay, facial dysmorphisms
P_002459 Short Hypospadias, mild autism
P_000567 Short Facial dysmorphisms, hearing loss, microcephaly, immunological hypersensitivity, nevus flammeus
P_000536 Abnormal Telecanthus upslant; short segment HSCR although a longer segment is abnormal ganglionated.
P_000562 Short Cafe au lait spots, cardiac defect (VSD)
P_000572 TIA Retrognathia, skin abnormality, facial dysmorphisms, cardiac defect (pulmonary valve stenosis)
P_000568 Short Dysmorphic features, hydrocele testis, hemangioma
P_000478 Short Hypertelorism, facial dysmorphisms
P_000520 Short Mild facial dysmorphisms, sandal-gap of toe
P_002455 Short Hypermobility of fingers; mild developmental delay, downslant of eyes
P_001763 Short White hair lock, mild developmental day
P_000537 Short Gross motor delay, spastic hemiplegia, bronchopulmonary dysplasia, cardiac defect (PDA)
P_000528 Total colonic Intellectual disability
P_000573 Short Epicanthal folds, small ears, broad eyebrows with mild synophrys
P_002450 Long Developmental delay
P_002343 Short Hypertelorism, long deeply grooved philtrum

Depicted are the patients with HSCR and additional anatomical malformations or neurological defects (HSCR-AAM), but without a RET pathogenic variant, or other causal genetic defect (group 1, n = 23). Listed are the patient identifiers, HSCR classification, and phenotypical description regarding the associated anomalies.

Fig 1. Schematic overview of our overall study design and methods used.

Fig 1

(A) For this study we included 326 controls (group 4) and 58 HSCR patients. We determined the RET and / or known disease gene coding mutations of HSCR patients (n = 58). The HSCR patients with coding mutations were included in group 2 (n = 15). We determined the presence of associated anomalies in the rest of the HSCR patients, including them in either the group containing associated anomalies (HSCR-AAM; group 1, n = 23) or in the group without associated anomalies (group 3, n = 30). (B) For all subgroups of HSCR patients and the controls Copy Number profiles were determined. To select candidate genes, we ranked CNVs according to their frequency in the unaffected controls from the Deciphering Developmental Disorders project (for illustrative purposes a screenshot of their UCSC genome browser track; https://genome-euro.ucsc.edu/ is depicted). We determined if genes included in the rare CNVs were “ENS genes”: genes with increased expression in isolated ENS (with and without the addition of GDNF) cells compared to the whole intestine (for illustrative purposes a screenshot of a heatmap of differential gene expression of known disease genes derived from brb-array tools (https://brb.nci.nih.gov/BRB-ArrayTools/) is depicted. Additionally, we determined if a gene was known as a Constrained Coding Region (CCR)(For illustrative purposes a screenshot of gnomAD browser website where such scores can be found; https://gnomad.broadinstitute.org/). Next, we determined whether disruption of the main candidate genes resulted in a reduction of enteric neurons in zebrafish. (C) In parallel, we evaluated the contribution of predisposing haplotypes across groups. Combined, we suggest that the RSnc (predisposing haplotypes), RSrcv (deleterious rare variant burden) and RScnv (deleterious Copy Number Variation) result in the genetic risk for Hirschsprung disease. Abbreviations: CNV = Copy Number Variation, ENS = Enteric Nervous System, RSnc = Risk Score non-coding variants, RSrcv = Risk Score rare coding variants, RScnv = Risk Score CNV, CCR = Constrained Coding Region.

Table 2. HSCR patients with a deleterious variant.

Patient HSCR type Other phenotypical characteristics Genetic defect
P_000302 Short No NM_002181.3 (IHH):c.151C>A, NM_000168.5 (GLI3):c.2119C>T, NM_001190468.1 (GDNF):c.676_681delGGATGT
P_000526 Short No NM_020630 (RET):c.1196C>T
P_000479 Long No NM_020630 (RET):c.656-21C>T
P_000502 Short No NM_020630 (RET):c.1880_1892del
P_002442 Long No NM_020630 (RET):c.2599C>T
P_000566 Short No NM_020630 (RET):c.3173A>G
P_000544 Long No NM_020630 (RET):c.2690G>A
P_000534 Short Hypospadias, anorectal malformation type perineal fistula NM_020630 (RET):c.2906G>A
P_000480 Short Short stature NM_013956 (NRG1):c.811A>T
P_004502 Short Hypertelorism, triangular face, pointy chin, straight eye brows, deepset eyes, small dysmorphic ears, agenesis of corpus callosum, hypospadia, dysmorphic nose NM_014795 (ZEB2): c.1570del
P_000557 Total colonic Postaxial polydactyly NM_020630 (RET):c.C229C>T
P_000518 Short Facial dysmorphisms, microcephaly, bilateral generalized polymicogyria, developmental delay, short stature, hypotonia, eye anomaly NM_015634 (KIFBP):c.268C>T
P_000486 Total colonic No abnormal phenotypedescribed. Normal psychomotor development NM_001122659 (EDNRB):c.534_535insGGTGCCT
P_000570 Short Congenital central hypoventilation syndrome NM_003924 (PHOX2B):c.738_761dup
P_000576 Short Microcephaly, epicantus folds, upslant eyes, broad nose, synophrys naevus sacralis hyperpigmentosis back and shoulders, abnormal palmar creases NM_020630 (RET):c.1321A>C,: NM_020630 (RET):c.C1941C>T

Depicted are the patients with HSCR and a known variant in RET or another causal gene (group 2, n = 15) Listed are the patient identifiers, HSCR classification, a short phenotypical description regarding associated anomalies if applicable, and the genetic defect.

Rare large CNVs are enriched in HSCR-AAM patients

Patient and control CNVs were determined and classified as a “rare CNV” when absent from large control cohorts (n = 19,584). We also included known pathogenic or modifier CNVs [39,40]. The size, type and gene content characteristics of rare CNVs were compared to those of 326 unaffected controls (Group 4, n = 326). With this approach, 56 rare CNVs were detected in 34 HSCR patients (S1 Table). In group 1, 10 CN losses, 8 CN gains and 1 maternally inherited hemizygous loss on chromosome X in a male patient, were detected. In group 2, 5 CN losses and 8 CN gains were identified, and in group 3, 7 CN losses, 12 CN gains, 4 homozygous losses and 1 hemizygous loss on chromosome X in a male patient, were found. None of the CNVs present in any of our patients affected known HSCR genes (S1 and S3 Tables).

We determined segregation of rare CNVs in nine patients, five of these were de novo: three CN gains (22q11.21—q11.22, 22q11.21 and 7q36.1) and two losses (17q23.1—q23.2 and 6p22.1—p21.33. Two CNVs were inherited maternally (10q11.22—q11.23 loss and Xq28 loss). Two of the homozygous losses in patient P_000490 (group 3), disrupt the Cystic fibrosis transmembrane conductance regulator (CFTR) locus. However, this patient does not show signs of cystic fibrosis. The inheritance pattern of other rare CNVs could not be determined due to unavailability of parental DNA.

The number of rare CNVs per individual did not differ between subgroups or controls (p = 0.385, Fig 2A). However, CNVs found in HSCR-AAM patients (Group 1) were larger compared to controls (Group 4, p = 7.297E-6, Fig 2B and S2 Table). This difference is attributed to outliers, large size of losses in specific patients of group 1 (p = 3.15E-08)(Fig 2C and S2 Table and S1 Fig), strongly suggesting a role of these specific CNVs in these HSCR-AAM patients.

Fig 2. Copy Number Variation analysis of HSCR patients and controls.

Fig 2

(A) Graph showing the number of rare CNVs found in the four groups. No statistical differences were found. (B) Graph showing the size in base pairs (bp) of the rare CNVs found in the four groups. Rare CNV size in group 1 (HSCR-AAM) was significantly larger than those found in group 3 (HSCR without a causal variant identified) and group 4 (controls). (C) Graph showing the size (bp) of the rare CN losses found in the four groups. Rare CN loss size in group 1 (HSCR-AAM) was significantly larger than those found in group 4 (controls). (D) Graph showing the number of “ENS genes” included in the CNVs found in the four groups. The number of “ENS genes” included in CNVs of group 1 (HSCR-AAM) was significantly higher than those found in group 4 (controls). (E) Graph showing the number of “ENS genes” included in the CN losses found in the four groups. The number of “ENS genes” included in CN losses of group 1 (HSCR-AAM) was significantly higher than those found in group 4 (controls). (F) Graph showing the number of CCR genes included in the CNVs found in the four groups. The number of CCR genes included in CNVs of group 1 (HSCR-AAM) was significantly higher than those found in group 4 (controls). (G) Graph showing the number of CCR genes included in the CN losses found in the four groups. The number of CCR genes included in CN losses of group 1 (HSCR-AAM) was significantly higher than those found in group 4 (controls). (H) Graph showing the number of “ENS genes” that are also a CCR, included in the CN losses found in the four groups. These genes were considered as the HSCR candidate genes. The number of CCR “ENS genes” included in CNVs of group 1 (HSCR-AAM) was significantly higher than those found in group 4 (controls). Each dot represents one patient. Error bars represent standard deviation. Statistical analysis used: one-way ANOVA followed by students t-test. ** p < 0.01 *** p < 0.001; Exact p-values and statistics can be found in S2 Table.

Candidate genes within a CN loss are constrained coding regions (CCR) and expressed in the ENS

HSCR disease genes (S3 Table) are often constrained coding regions (CCR; genes usually not impacted by genomic alterations in unaffected controls) and expressed in the affected tissue: the ENS. We hypothesize that genes within a CNV region contributing to HSCR should share these characteristics (Fig 1B and S3 Table). Therefore, we used a previously published microarray dataset and considered genes as “ENS genes” if the gene expression is higher in isolated “ENS cells” (with and without the addition of GDNF) compared to the expression in other intestinal cells or total intestines in mice [41,42]. Rare CNV impacted a gene or transcript (e.g. microRNA, long noncoding RNA) 1502 times (1213 unique ID’s) in groups 1–4 (S12 Table). We were not able to evaluate the expression of 519 genes or transcripts. Of these, 499 did not have a known mouse orthologue. Often, these genes or transcripts lacking a mouse orthologue were microRNA, long noncoding RNA or were genes of which the symbol is not available. Twenty-two genes or transcripts did not have probes in the microarray datasets used. In total the rare CNVs included 281 “ENS genes” (n = 91 in group 1–3, n = 190 in group 4) [41,42]. All rare CNVs containing “ENS genes” are depicted in S4 and S12 Tables. Compared to controls, the rare CNVs in group 1 were significantly enriched for “ENS genes” (p = 4,93E-08, Fig 2D and S2 Table), in particular in the CN losses (p = 3,2604E-10, Fig 2E and S2 Table). In addition, CCR were enriched in rare CNVs of group 1 compared to controls (p = 4,63E-05, Fig 2F and S2 Table), again in particular in rare CN losses (p = 8,13E-06, Fig 2G and S2 Table). Moreover, losses in HSCR-AAM patients (Group 1) contained more “ENS genes” that are a CCR [43,44], compared to losses found in controls (Group 4, p = 2,20E-04, Fig 2H). Hence, “ENS genes” that are also a CCR within CN losses were prioritized as candidate genes for HSCR and included in zebrafish evaluation studies: AKT3, GNL1, GABBR1, SLC8A1, MAPK8, UFD1L, TBX2, USP32 and TUBB (Table 3).

Table 3. “ENS genes” that are a CCR in rare CN losses and that have a zebrafish orthologue.

ExAC/ GnomAD intolerance scores
Patient RSNC Gene Missense Z pLI Deletion single Deletion CNV CNV region
P_000479 4,16 SLC8A1 2.23 1.00 -0.02 0.31 chr2:40,624,267–40,646,501
P_000512 8,31 TUBB # 5.71 0.98 -2.85 -1.72 chr6:28,005,012–31,683,185
GNL1 2.52 1.00 1.03 0.70
GABBR1 4.98 1.00 1.36 1.23
P_000537 6,92 MAPK8 2.92 1.00 0.84 -2.25 chr10: 49,033,586–52,431,193
P_000561 6,08 UFD1L 2.77 1.00 1.06 -2.53 chr22:18,861,209–21,630,630
P_000567 7,98 TBX2 1.75 0.99 -0.01 0.53 chr17:58,076,721–60,362,868
USP32 3.55 1.00 -2.85 -0.93
P_002431 9,37 AKT3 4.03 1.00 -2.61 -1.20 chr1:243,963,527–244,016,804

Genes marked with an # also have a loss of function variant in an independent HSCR cohort (see Table 4). Depicted are the risk score non coding (RSnc)(see S7 Table) and the CNV and variant intolerance scores (missense Z/pLI) derived from http://gnomad.broadinstitute.org/ and http://exac.broadinstitute.org/about.

HSCR disease coding genetics is heterogeneous

Stronger evidence for the involvement of these CNVs or genes included in CNVs could be provided by their involvement in additional Hirschsprung disease patients. First, we evaluated if there was CNV overlap within this cohort of patients. Within our small cohort only two CNV regions overlapped: (1) a gain in patients P_000544, P_000566 (group 2) and P_001636 (group 3), covering the blood group type gene RHCE (1p36.11), and (2) a loss in patients P_000557 (group 2) and P_000515 (group 3) of which only the ETFDH gene is affected in both patients (4q32.1). However, RHCE and ETFDH are no “ENS genes” [41,42].

Next, we evaluated if there was CNV overlap between our cohort of patients and those of previously published HSCR patients. The 22q11 deletion described in one of the patients included in DECIPHER (ID 249397) had overlap with the 22q11 deletion found in patient P_000561 (chr22:21032298–21630630). Although several genes are located in this region, only LZTR1 is an “ENS gene” [41,42]. However, this is not a CCR. One of the losses described in another DECIPHER patient (ID 249405) overlaps with the 1q44 loss present in patient P_002431, with AKT3 as the only gene affected by both CNVs. This is an CCR and an “ENS gene” [41,42] and was thus included as a candidate gene in this study. Comparing our data to that of a previously described HSCR cohort [45], only five genes in a CN loss in our cohort were also affected in that dataset. However, none of these five genes were CCR.

In an independent whole exome sequencing cohort of patients with HSCR, we determined the presence of putative deleterious alterations in all genes affected by a rare CNV in our cohort. Nonsense or splice-site variants were detected in eight genes (Table 4). Only one variant, detected in DNA derived from blood and isolated ENS cells, affected an “ENS gene” that is also CCR and present in a CN loss: a frameshift variant (TUBB; NM_001293212.1:c.1330_1331delCAinsA, p.Gln444Serfs*35) in a patient with isolated HSCR (Table 3). However, there was no significant enrichment in HSCR patients for nonsense, splice-site or missense variants in genes impacted by rare CN gains nor CN losses (S5 Table) when comparing 443 short segment HSCR patients and 493 controls [46], likely underlining the genetic heterogeneity between HSCR patients.

Table 4. WES: Nonsense and splice site variants in rare CNV genes in HSCR patients.

Sample Chr Start Stop Ref Alt Exon Gene Type location Effect HGVS cDNA-level CADD gnomADv2.1 Exomes gnomADV2.1 Genomes
SE14-0527 17 59946466 59946466 T 23 INTS2 insertion exonic frameshift NM_001330417.1:c.3172dupA 0 0
HK19-0006 17 60072727 60072727 C T 10 MED13 snv intronic splicing NM_005121:c.1968-1G>A 22.4 0 0
SE16-3114 17 59469360 59469360 C T 26 BCAS3 snv exonic stopgain NM_001320470.1:c.2773C>T 18.4 0.0002 0.0007
SE14-0656 6 31604286 31604286 G T 27 PRRC2A snv splicing splicing NM_004638.3:c.5836-1G>T 23.6 0.0001 3.24E-05
SE16-3123 6 30692109 30692110 CA A 4 TUBB substitution exonic frameshift NM_001293212.1:c.1330_1331delCAinsA . 0 0
SE17-3220 7 151962296 151962296 T C 8 KMT2C snv splicing splicing NM_170606.2:c.1013-2A>G 22 8.21E-06 0
HK19-0002 9 28476025 28476025 T G LINGO2 snv intronic splicing NM_001258282:c.-395-2A>C 23.2 0 0
HK19-0003 10 52103343 52103344 TA T 7 SGMS1 deletion exonic frameshift NM_147156:c.T529delT-:p.F177del 33 0 0
HK19-0004 10 52104106 52104108 CTG C SGMS1 deletion intronic splicing NM_147156:c.-313-2CAG>—G 24.5 0 0
HK19-0005 10 52349911 52349911 A G SGMS1 snv intronic splicing NM_147156:c.-683+2T>C 23 0 0
SE16-3109 22 21065731 21065731 G A 51 PI4KA snv exonic stopgain NM_058004.3:c.5821C>T 51 0.0002 0.0002
HK19-0001 3 14485130 14485130 A G SLC6A6 snv intronic splicing NM_001134367:c.297-6A>G 15.92 0 0

Rare putative deleterious nonsense variants and variants predicted to affect splicing in a whole exome sequencing cohort of HSCR (n = 76, 149 controls) [95] and whole genome sequencing cohort of 443 short segment HSCR patients and 493 unaffected controls [46]. Variants in genes intolerant to variation that were also impacted by the de novo 17q23.1—q23.2 loss (INTS2, MED13), the de novo 6p22.1—p21.33 loss (PRRC2A, TUBB), the 9p21 loss (LINGO2), the maternal inherited 10q11.22—q11.23 loss (SGMS1), de novo 7q36.1 gain (KMT2C) and the 3q24 gain (SLC6A6). TUBB is highlighted in gray as this gene is included in the candidate list.

Thus, for two of our HSCR candidate genes we identified an additional HSCR patient: one containing a CN loss including AKT3, and a patient carrying a frameshift variant in TUBB.

Zebrafish validation confirms the impact of candidate gene disruption on ENS development

To validate the effect of losing one copy of these candidate genes on ENS development, we disrupted their orthologues in zebrafish (akt3a, akt3b, gnl1, gabbr1a, gabbr1b, slc8a1a, slc8a1b, mapk8a, mapk8b, ufd1l, tbx2a, tbx2b, usp32 and tubb5) by CRISPR/Cas9 targeting (Fig 3A). Instances in which there are two variants (a and b) of the gene in zebrafish, both variants were targeted simultaneously using two guide RNAs (gRNAs). To determine whether the larvae present an ENS phenotype five categories were made (Fig 3B) [38]. Larvae were categorized as category I when full colonization was observed with a regular distribution of the ENS along the length and width of the intestine by evaluating the pattern of GFP+ cells in the tg(phox2bb:GFP) transgenic reporter line [47]. This reporter line labels crest-derived neurons and progenitors in the intestine. Larvae containing an overall obvious reduction in the number of ENS cells were classified as category II. In case only the most distal end of the intestine lacks their ENS this was classified as category III (Fig 3B, indicated with the arrowhead). In case the migration of the ENS halted somewhere in the mid-gut of the zebrafish, as depicted by the green arrowheads in Fig 3B, fish were classified as category IV. Most severe cases, where total absence of an ENS in the mid-gut and distal-gut was observed were categorized as category V (Fig 3B). For statistical analysis the percentages of fish presenting with a category I, or in other words an unaffected ENS, was used (white bars in the graph of Fig 3C). Premature termination of ENS migration or reduced presence of the ENS was observed in a subgroup of larvae injected with gRNAs targeting ufd1l (p = 0,0166) and slc8a1a/b (p = 0,0073). Several of the fish in which we disrupted tbx2a/b (p = 0,0593), or mapk8a/b (p = 0,0541, Fig 3A, 3B and 3C) presented with phenotypes in category II, III and IV or category II and III respectively, but did not reach significance. Quantification of the number of ENS cells normalized to the intestinal length showed that also in overall numbers disruption of ufd1l (p = 0,0235) and slc8a1a/b (p = 0,0013) had a significant effect (Fig 3D). In addition, the effect of mapk8a/b disruption was also significant (p = 0,0254, Fig 3D). To confirm our findings, we repeated the experiments disrupting tbx2a/b (p = 0,0373), ufd1l (p = 0,0208; p<0,0001) or mapk8a/b (p = 0,0022) confirming their effect on ENS development (Fig 4A–C). Disruption of slc8a1a/b induced major phenotypical abnormalities including severe pericardial edema (S3 Fig), which has been reported previously for the tremblor (tre) mutant zebrafish (NCX1h, former gene name for slc8a1a) [48]. Larvae were ufd1l was disrupted failed to develop a swim bladder (S4 Fig), which was also reported previously for this mutant [49].

Fig 3. Disruption of candidate genes in zebrafish caused defects in ENS development.

Fig 3

(A) Visual representation of the CRISPR/Cas9 complex injections using the Alt-R CRISPR-Cas9 System from Integrated DNA Technologies (IDT) [34]. (B) Legend showing the various phenotypes observed in zebrafish. Larvae are characterized as category I (e.g. ENS not affected) when there is a full colonization of the intestine by phox2bb-GFP+ cells which are dispersed at a density comparable to that observed in untreated animals. Larvae are characterized as category II when there is an overall reduction observed in the presence of phox2bb-GFP+ cells along the total length of the intestine. Larvae are characterized as category III when an absence of phox2bb-GFP+ cells is observed in the most distal part of the gut, depicted by the white arrowhead and magenta line. Category IV classification is used when the phox2bb-GFP+ have migrated into the intestine but only reach in the mid-gut as in the example depicted by the white arrowhead. The green arrowheads as well as the magenta line indicate the range in which fish are categorized as category IV. The most severe alteration in the ENS is categorized as V, where the mid-gut and distal-gut are deprived of neurons, as depicted by the magenta line and the ENS has migrated no further than the white arrowhead. (C) Disruption of four genes induced ENS phenotypes in the zebrafish larvae, of which two were statistically significant (accumulated data from multiple experiments). (D) In line with the observed ENS phenotypes, quantification of the number of neurons per 100μm showed significantly reduced numbers of neurons upon disruption of three genes (accumulated data from multiple experiments). (E) Schematic representation of sequential injection with CRISPR/Cas9 complexes and ATG-blocking ret morpholino. Graph showing accumulated data of the percentage of fish with ENS phenotypes upon injection of a morpholino targeting Ret translation at a concentration that induced ENS phenotypes in approximately 50% of the fish. Ret morpholino injections in combination with disruption of mapk8a/b and tubb5 shows epistasis. Disruption of gnl1 showed a trend towards higher penetrance of ENS phenotypes. Statistical tests used: students t-test (D), one-way ANOVA followed by Dunnett’s multiple comparisons test (D), and "N-1" Chi-squared test (C and E); * p < 0.05 ** p < 0.01 *** p < 0.001; Exact p-values, number of fish per group and statistics can be found in S11 Table.

Fig 4. Validation of tbx2, ufd1l and mapk8 gene disruption shows effects on the ENS in zebrafish.

Fig 4

(A) Figures and graphs showing that disruption of tbx2a/b in zebrafish does not result in overall significantly reduced numbers of phox2bb-GFP+ cells, however it does induce ENS phenotypes in category II and III in a significant number of zebrafish larvae. (B) Figures and graphs showing that disruption of ufd1l in zebrafish results in significantly reduced numbers of phox2bb-GFP+ cells and increase in the percentage of fish presenting with an ENS phenotype. (C) Figures and graphs showing that disruption of mapk8a/b in zebrafish does not result in overall significantly reduced numbers of phox2bb-GFP+ cells, however it does induce ENS phenotypes in a significant number of zebrafish larvae. Statistical tests used: students t-test and "N-1" Chi-squared test; * p < 0.05 ** p < 0.01 *** p < 0.001; Exact p-values and statistics can be found in S11 Table.

Copy number loss alone is likely not sufficient to result in HSCR

Considering that aganglionosis has a low prevalence in patients with for example 22q11 [39, 40, 50] and 17q23 [51,52] deletions, CNVs alone are probably not sufficient to cause HSCR. HSCR penetrance is influenced by predisposing risk haplotypes in RET, semaphorin and NRG1 [1015]. Five non-coding risk haplotypes (Risk Score non-coding, RSnc; S6 Table) were tested in our three patient groups and compared to an in-house population dataset (group 5) (n = 727, RSnc = 2.54). As expected, HSCR groups 1, 2 and 3 all had a higher RSnc compared to group 5 (p = 4.59825E-14, p = 0.000960603, p = 1.21875E-23, respectively) (S2 and S6 Tables), indicating a contribution of the risk haplotypes in all HSCR subgroups. Associated common risk haplotypes have epistatic interactions, not only with each other, but also with known HSCR genes -such as RET and NRG1 [12,14,53]- and modify HSCR penetrance in monogenetic syndromes and Down syndrome [5458]. The effect of epistatic interactions is smaller in high penetrant HSCR monogenetic disorders and larger in disorders in which HSCR penetrance is lower [54,55]. In line with this, contribution of these risk haplotypes was the lowest in group 2 (RSnc = 3.70), followed by group 1 (RSnc = 4.71), and the highest in group 3 (RSnc = 5.66) (S2 and S6 Tables). Thus, risk haplotypes have indeed a higher impact in HSCR patients without known coding variants, compared to those that contain known coding variants (S2 Fig and S6 and S7 Tables).

To model the reduced RET expression observed in patients carrying risk haplotypes, we injected zebrafish with an antisense ATG blocking morpholino (MO) specific for the zebrafish ret gene resulting in reduced Ret protein translation. We chose to inject 1ng of morpholino, as our previously published concentration curve showed that approximately half of the fish would present with an ENS phenotype (category II to V) [38]. Since disruption of slc8a1a/b in zebrafish resulted in severe heart epicardial edema and other malformations in ±75% of the surviving larvae and the absence of a swim bladder in all injected larvae (S3 Fig), we excluded this gene for the epistasis screen. Disruption of the candidate genes mapk8a/b (p = 0,0107) or tubb5 (p = 0,003) in combination with the ret morpholino resulted in a significant increase of the percentage larvae that showed an ENS phenotype (Fig 3E). Considering that disruption of tubb5 alone (Fig 3C and D) did not induce an ENS phenotype, these results suggest that loss of tubb5 shows epistasis with ret. Disruption of gnl1 also seemed to have an influence on the penetrance of ENS phenotypes in zebrafish (n.s.), therefore we repeated the injections using a lower dose of ret ATG blocking MO. Using a lower dose of MO results in a less severe phenotype, providing us with a more sensitive assay to detect epistasis between Ret and gnl1. Using this assay disruption of gnl1 indeed showed a significant increase in penetrance of an ENS phenotype (p = 0,0405)(Fig 5A). Considering that disruption of gnl1 alone did not induce an ENS phenotype, these results suggest that only in a sensitized background loss of gnl1 can increase penetrance of ENS phenotypes.

Fig 5. Complex HSCR genetics: Epistasis between ret and gnl1 and genetic predispositions of HSCR patient groups.

Fig 5

(A) A second injection round targeting gnl1, using a lower dose of ret morpholino, confirmed gnl1 epistasis with Ret. Statistical tests used: "N-1" Chi-squared test; * p < 0.05 (B) Visual representation of the distribution of genetic predispositions over HSCR patient groups. In total 197 patients born between 1973 and 2018 were evaluated by a clinical geneticist in the department of Clinical Genetics, Erasmus Medical Center, Rotterdam. Of these, 114 did not have associated anomalies nor a known syndrome. 29 patients had a known HSCR related genetic syndrome, including Down syndrome (n = 18). 153 out of 197 patients were genetically evaluated for RET gene involvement and 21 had a pathogenic RET variant. (C) Pie charts showing the incidence of rare CNVs containing “ENS genes”, CCR “ENS genes” and coding variants in HSCR patients. (D) Graphical representation of a hypothetical model explaining the relative contributions of the risk scores in our 3 patient groups. Error bars represent standard deviation.

Discussion

We describe that HSCR-AAM patients contained more often large CNVs. Concordant with a previous study, specifically rare large CN losses were enriched in HSCR-AAM patients [45]. We find that these rare CN losses contained more “ENS genes” that were a CCR. These characteristics make them suitable as candidate genes for HSCR. Therefore, we selected “ENS genes” that are CCR and were included in CN losses of HSCR patients. This resulted in nine candidate genes, which we disrupted in zebrafish larvae to screen for ENS phenotypes. CRISPR/CAS9 targeted knockdown of the candidate genes SLC8A1, TUBB, GNL1, MAPK8, UFD1L and TBX2 found in five patients connects these genes to ENS development. In addition, for two genes we identified a second HSCR patient in which the gene was affected.

Pinpointing the likely culprit in a CNV: Lessons from animal models

Mitogen-Activated Protein Kinase 8 (MAPK8) was impacted by the 10q11.22—q11.23 loss inherited from an unaffected mother. MAPK8 is part of the Mitogen activated protein (MAP) kinase pathway and required for ENCCs to migrate properly in response to GDNF in mouse intestinal explants [59]. The RSnc in this patient is 4.57, indicating that—in addition to the predisposing common haplotypes—more (genetic) factors are needed to result in HSCR etiology of this patient. Zebrafish morphants for mapk8b (jnk1) have a ventralized phenotype, but their ENS was not investigated in this previous report [60]. Based on our zebrafish data presented here, an increased presence of larvae with an ENS phenotype was observed upon disruption of mapk8a/b, alone, and in epistasis with Ret. This suggests that loss of MAPK8 influences ENS development, and that this loss alone is sufficient to induce alterations in ENS development in zebrafish. However, more larvae are affected upon ret knockdown supporting the role of a combined effect of the common haplotypes and the deletion of MAPK8 seen in this patient.

Patient P_000561 has the typical 22q11 deletion seen in 22q11 deletion syndrome. This deletion has a high penetrance with a variable phenotype [39,40,50]. HSCR has been previously described in patients with this deletion [61], as well as disturbances in migration of neural crest cells [62]. One of the main candidate genes for HSCR on 22q11 is Ubiquitin Recognition Factor In ER Associated Degradation 1 (UFD1L). It encodes for a downstream target of HAND2 [63]. Hand2-/- mice have decreased numbers of enteric neurons, neuronal differentiation defects and a disorganized ENS [64,65]. Mice with a targeted deletion encompassing Ufd1l as well as other genes of the 22q11 deletion syndrome present in humans, have shown to model the 22q11 syndrome [66,67]. However, the ENS was not evaluated in these latter models. Our zebrafish work shows that loss of ufd1l reduces the number of enteric neurons in a subset of zebrafish larvae. As zebrafish are “outbred” animals, it is likely that their genetic background in combination with loss of ufd1l, defines whether an ENS defect develops. This is in line with findings in patients, as HSCR is a rare phenomenon in 22q11 deletion syndrome [68]. Hence, it is likely that other factors contribute to HSCR development. Indeed, this patient (P_000561) has a RSnc of 4.90 suggesting a contribution of the predisposing haplotypes. It is likely that the relatively high impact of the common risk haplotypes, in combination with the de novo CNV impacting UFD1L, results in a shift from normal to defective ENS development.

Patient P_000567 has a de novo deletion of 17q23.1–23.2, a RSnc of 5.63 and has facial dysmorphism, hearing loss, microcephaly, immunological hypersensitivity, and nervus flammeus. This phenotype overlaps with other patients with 17q22-q23 deletions [51,52]. One of the other patients is constipated, but HSCR disease has not been previously described [69,70]. Four “ENS genes” are included in the CNV of patient P_000567: BCAS3, HEATR6, TBX2, USP32, of which the latter two are a CCR. Transient disruption of usp32 in zebrafish did not induce changes in their ENS and yielded no indication that USP32 is involved in HSCR. T-Box 2 (TBX2) codes for a transcription factor already known to be involved in the regulation of neural crest derived melanocytes [71], a process that is also hampered in specific HSCR patients carrying pathogenic variants in EDN3, SOX10 and EDNRB [72,73]. Tbx2 heterozygous mice develop normally, but homozygous mice have severe cardiovascular defects [74]. Zebrafish tbx2a/b morphants also present with a cardiac phenotype as they show an expanded atrium and a smaller ventricle [75]. HSCR or gastrointestinal anomalies were not described, but likely also not evaluated, in these mice or fish. Disruption of tbx2a/b in our study resulted in increased incidence of ENS phenotypes, but the larvae did not present with overt cardiac phenotypes (S4 Fig), although we should note that we did not study this in detail. Taken together, it is likely that reduced TBX2 expression has a role in HSCR development, yet only develops in patients with the 17q22-q23 deletion, if significant contributions of predisposing noncoding haplotypes are present, since this patient has a relatively high RSnc.

Patient P_000479 lost one copy of SLC8A1 and has no associated anomalies. Additionally, a known pathogenic RET mutation: RET: NM_020630: c.656-21C>T is present. As expected, the RSnc in this patient is low (RSnc = 2.98), which implies a strong influence of other genetic factor(s), in this case a pathogenic RET mutation. Whereas heterozygous Slc8a1 knockout mice develop and mature normally, without evident phenotypical abnormalities [76]. Slc8a1 homozygous knockout mice die in utero and have underdeveloped hearts [76]. In line with this, we show that zebrafish larvae where slc8a1a/b was disrupted, severe heart edema was present, as well as other morphological abnormalities (S3 Fig), as also reported in the tremblor (tre) mutant zebrafish [48]. In addition, a subgroup of zebrafish larvae showed aberrant ENS development, suggesting that this gene also affects ENS development. Since particularly, the slc8a1a isoform was efficiently targeted (84% efficiency, R2 = 9), is likely that slc8a1a is the functional gene in zebrafish, in line with the tre mutant [48]. However, HSCR patients with a CN loss lose only one copy of SLC8A1, and the remaining gene expression could explain their milder phenotype. Together, this suggests that variation in the levels of SLC8A1 can have a major impact on various developmental processes and could very well play a role in the development of both HSCR and associated anomalies.

The 1q44 loss present in patient P_002431 overlaps with one of the losses described in DECIPHER patient (ID 249405). The only overlapping gene is AKT serine/threonine-protein kinase 3 (AKT3). Unfortunately, we were not able to confirm the effect of AKT3 deletion on disturbed ENS development.

Patient P_000512 with short segment HSCR, epilepsy and intellectual disability, has a large de novo deletion (6p22.1—p21.33) which affects the ENS genes GABBR1, GNL1 and TUBB. The gamma-aminobutyric acid B receptor 1 (GABBR1) is the receptor for the inhibitory neurotransmitter GABA, and is expressed in the submucosal and myenteric plexus of the ileum and colon [77]. Receptor agonist studies indicate there is a GABA subtype and intestinal region-specific impact on gastrointestinal motility [7882]. However, our zebrafish findings did not suggest a role for gabbr1 in ENS development. Little is known about the Guanine nucleotide binding protein like 1 (GNL1) function. There are no mouse models described in the mouse genome informatics database [83](http://www.informatics.jax.org/). However, in vitro experiments indicate that GNL1 induces hyperphosphorylation of RB20, and promotes cell cycle progression and cell proliferation. GNL1 mRNA and protein expression in HIPED [84](https://www.genecards.org/) and GTEX [85](https://gtexportal.org/home/) respectively, highly correlates to that of kinesin family member 1-binding protein (KIF1BP). Recessive KIF1BP (formerly KIAA1279) pathogenic variants are involved in Goldberg-Shprintzen syndrome, often characterized by HSCR as well as intellectual disability [86,87]. Moreover, both genes (GNL1 and KIF1BP) are “ENS genes” [41,42] and are expressed in human embryonic gut [88]. Disruption of gnl1 in zebrafish showed epistasis with ret. The third candidate gene included in this patient its CNV is TUBB. Patients with mutations in TUBB have circumferential skin creases Kunze type (CSC-KT) which includes circumferential skin creases, microcephaly, cleft palate, intellectual disability and other associated anomalies but also structural brain anomalies are described [89]. Symptoms in our patient are less severe as seen in patients with dominant negative missense mutations, most likely due to the fact that TUBB heterodimers can still form, albeit at reduced levels. Breuss and colleagues described heterozygous Tubb5 mice as having decreased brain volumes due to increased apoptosis [90]. Another study in mice showed that perturbations to TUBB5 have a deleterious effect on multiple aspects of neuronal differentiation (e.g. dendritic spine maturation, axon outgrowth) [91]. We did not find an effect on ENS development in zebrafish upon loss of tubb5 alone, however, in combination with reduced Ret expression tubb5 disruption significantly increased the incidence of an ENS phenotype. This is in line with the high RSnc of 5,95 observed in this patient. In addition, we found a mutation in TUBB in a patient from an independent HSCR cohort. Therefore, the absence of enteric neurons in the distal colon and other neurological symptoms seen in Patient P_000512, can be likely related to the deletion of TUBB and GNL1 in combination with its high RSnc.

Limitations, discrepancies and recommendations

Although we find differences in the size of the rare CNVs, we did not find differences in the overall presence of rare CNVs. However, a previous study described that HSCR patients with associated anomalies have more CNVs compared to isolated HSCR patients and unaffected controls [45]. An explanation for this discrepancy might be that we used a CNV frequency cut-off of 0.026%, to HSCR prevalence (0.02–0.03%). Therefore, these CNVs are too rare to find a significant association with our study cohort size.

We were able to pinpoint and associate several genes within a CN loss to the HSCR phenotype seen in these patients. However, this is most likely an underestimation of the true contribution of CNV to HSCR, since SNP-array has relative limited resolution to detect small CNVs and is not able to detect balanced structural variations as inversions and inversion-duplications. In addition, approximately 50% of the rare CNVs we found were CN gains. As only CN losses–not CN gains–proved to be significantly enriched in this HSCR-AAM cohort, we did not include CN gains in our screen. In future studies, we would recommend the use of whole genome sequencing (WGS) approaches, as these are able to confidently detect much more structural variant types and thus provide more patients with a definite genetic diagnosis. The benefits of WGS approaches would include the detection of inversion-deletions (and disrupted genes within a CN gain), more precise breakpoint mapping, the detection of balanced rearrangements (and disruption of TAD borders and/or regulatory elements by CN losses or gains), or a second mutation in a gene impacted by a CNV on the other allele (unmasking a recessive mutation or variant). Screening at improved resolution and ability to detect a wider range of genetic aberrations would likely increase the number of associated genes in the remaining patients without an explained genetic diagnosis.

HSCR is a complex and heterogeneous genetic disorder

We determined segregation of the rare CNVs in nine patients, five of these were de novo. Although we did not determine the segregation of all rare CNVs, this already suggests the high frequency of de novo CNV in this cohort. In line with this, the DECIPHER (https://decipher.sanger.ac.uk/) contains 18 HSCR patients with associated anomalies with one or more CNVs. Of these, more than half of the CNVs are de novo (10 out of 18). Together, this shows that CNVs found in HSCR-AAM patients are often de novo. The results presented here highlight the importance of Copy Number profiling, leading us to propose a HSCR risk model that consists of risk scores for: rare coding variants (RSrcv), non-coding variants (RSnc), rare CNVs (RScnv) and currently unknown risk factors (e.g. epigenetic modifications and perhaps environmental factors). Considering HSCR-AAM (group 1), 36% of patients contained a CNV harbouring “ENS genes” in CCRs (Fig 5B). Our risk model (Fig 5D) indicates the major contributing factors in each group: CNVs in group 1, coding variants in group 2 and noncoding variants in group 3 (Fig 5B and D). The predisposing RET haplotypes reduce RET expression [92] and in combination with the other risk haplotypes increase HSCR risk substantially (RSnc). Translating this population risk to a threshold for HSCR development in individual patients is challenging as it is not precisely known how many of (and if) these known predisposing haplotypes are sufficient to result in HSCR. If we set a threshold at a relatively high level (RSnc > 6), this would explain HSCR in 30% of patients in group 3 (Fig 5B and S7 Table). Most HSCR-AAM patients have a lower RSnc, which makes sense, as this group contains more high impact CNVs and whilst they lack RET coding variants, they potentially have variants in other genes (Fig 5B and S7 Table). Translating our findings back to our full cohort including all patients seen by a clinical geneticist and screened for RET mutations, HSCR coding variants explain 21% of HSCR cases (Fig 5C). If we consider CNVs containing “ENS genes” in a CCR, we could explain an additional 10% (Fig 5C). Importantly, this is an underestimation, since for 55% of patients without a RET coding variant no copy number profiling is performed (Fig 5C) and we focussed exclusively on rare CNVs. Since more common CNVs can also modify HSCR penetrance [45,93], our results emphasize the need for a more widespread genomic analyses in all subgroups. In addition, RSnc could be one of the determinants for a more elaborate genetic evaluation. A high RSnc in a HSCR patient without associated anomalies is less likely to have a deleterious RET coding variant, patients with a low RSnc are more likely to benefit from exome sequencing. Similarly, patients with large de novo losses often have a more complex phenotype and more intensive clinical investigations might be indicated.

Conclusions

To summarize, HSCR genetics is complex with contributions of predisposing haplotypes in all HSCR subgroups. Rare large CNVs—often de novo (S1 Fig)—contribute substantially to the disease in HSCR-AAM patients. These CNVs are enriched for CN losses and for “ENS genes” that are also a CCR. Disruption of these genes in zebrafish confirmed that reduced expression of some of these genes increased the occurrence of ENS alterations, alone, or in epistasis with Ret. These genes have functional overlap with known HSCR disease genes: e.g. UFD1L is involved in signalling receptor binding and MAPK8 in axon guidance. Our “ENS gene” based approach led to the identification of new HSCR candidate genes (Figs 1B and 3 and Table 3): UFD1L, TBX2, SLC8A1, GNL1, TUBB and MAPK8.

Materials and methods

Ethics statement

This project was approved by the Medical ethics committee of the Erasmus Medical Centre (Hirschsprung disease: no 2012–582, addendum No. 1 and no.193.948/2000/159, addendum No. 1 and 2, MEC-20122387). Written (parental) formal consent was obtained.

All authors had access to the study data and reviewed and approved the final manuscript.

Determination of copy number variation

CNV profiles were determined with either the HumanCytoSNP-12 v2.1 or the Infinium Global Screening Array-24 v1.0 (Illumina Inc., San Diego, CA, USA), using methods, thresholds and analysis settings previously described [94]. CNV profiles were inspected visually in Biodiscovery Nexus CN8.0 (Biodiscovery Inc., Hawthorne, CA, USA). CNVs with an overlap of at least 75% with similar state CN changes, were either classified as rare, when absent from large control cohorts (n = 19,584), or as a known modifier [39,40]. CNV number, size, type and gene content of rare CNVs were determined in HSCR patients (n = 58) and unaffected controls (n = 326) and compared between the control groups and the described HSCR subgroups (Fig 1A). All rare CNVs were uploaded to the ClinVar database (https://www.ncbi.nlm.nih.gov/clinvar/) and are depicted in S1 Table.

Exclusion of the involvement of known disease genes

The presence of RET coding mutations and those in intron-exon boundaries, in all patients in this study are determined. Furthermore, if a specific monogenetic syndrome (S3 Table) was suspected, based on the phenotypic spectrum observed, the suspected gene(s) were evaluated using a targeted NGS panel or whole exome sequencing. In four HSCR patients with associated anomalies (Group 1) and nine HSCR patients without associated anomalies (Group 3), the involvement of other known disease genes was excluded [8,46,95,96] using whole exome sequencing (WES) with previously described pipelines [97,98] and variant prioritization methods [99].

Evaluation of candidate gene expression

We prioritized candidate genes based on gene characteristics: genes that are intolerant to variation and/or dosage sensitive (CCR; see Variant prioritization in the next paragraph), and higher expressed in the developing mouse ENS compared to whole intestine, between embryonic day E11.5 and E15.5, referred to as “ENS gene” [41,42]. Genes affected by a rare CNV were extracted from the CNV regions and converted to mouse orthologues to make a gene panel for downstream analysis (S12 Table). Gene expression data used to determine the “ENS genes” was downloaded from the gene expression omnibus (GSE34208 and GSE100130). We calculated probe set summaries from the raw Affymetrix data using BRB-ArrayTools version 4.5.0—Beta_2 (http://linus.nci.nih.gov/BRB-ArrayTools.html) and used the “Just GCRMA” algorithm to adjusts for background intensities. We normalized each array using quantile normalization that includes variance stabilization and log2 transformation. Replicate spots within an array were averaged. Genes showing minimal variation across the set of arrays were excluded from the analysis (if less than 20% of expression data had at least a 1.5 fold change in either direction from a gene its median value, or at least 50% of arrays had missing data for that gene). The minimum fold change for the class comparisons was set at 1.5 and differential expression was determined using a two-sample T-test with random variance model. The permutation p-values for significant genes were computed based on 1,000 random permutations and the nominal significance level of each univariate test was set at 0.05. Subsequently, we defined “ENS genes” as genes higher expressed in isolated “ENS cells” (with and without the addition of GDNF) compared to the expression in the other intestinal cells or total intestines. These genes were considered as “ENS genes” and included in downstream analysis. As only micro-array-based expression data of the mouse ENS was available, we cannot reliably distinguish between not expressed and not differentially expressed genes between the ENS and the intestine. Unfortunately, there was no data available from human intestines during ENCC migration. However, RNA sequencing data from human colon was available for embryonic week 12, 14 and 16 (GSE111307) [88], we annotated the gene expression during these time points for reference to S12 Table. We used the settings and methods described previously, as well as the previously generated and publicly available datafile “GSE111307_ENS_development_lenient_mapping.xlsx” at https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE111307.

Variant prioritization in CCR

When determining if a gene affected by a rare putative deleterious CNV was a Constrained Coding Region (CCR) [43,44], we allowed some tolerance to account for reduced penetrance (filter settings and table see S12 Table). Variants from NGS data previously generated were prioritized according to settings described in S5 Table. These data included (1) a WES cohort of sporadic HSCR patients (n = 76, 149 controls) and (2) a Whole Genome Sequencing (WGS) cohort of 443 short segment HSCR patients and 493 unaffected controls [46]. Using RVTESTS [100], a variant burden test was done comparing the variant burden in 443 short segment HSCR patients and 493 controls (S5 Table). All rare putative deleterious loss of function variants unique to the HSCR cohort in CCR were described in Table 3.

Genotyping of HSCR associated SNPs

Sanger sequencing was used to genotype all patients for SNPs known to be associated to HSCR [1012,93]. Primer sequences can be found in S8 Table. We used (proxy) SNPs present on the GSAMD-v1 chip to determine the Rotterdam population background for these risk haplotypes (n = 388 females, 339 males), as well as the three HSCR subgroups from the SNP-array platform (S6 Table and S2 Fig). The relative weighted risk score of published common and relatively rare risk alleles near RET (rs2506030, rs7069590, rs2435357), NRG1 (rs7005606) and SEMA3C/D (rs11766001, rs80227144), was calculated using the formula below [1012,53].

LnORriskallele1*allelecount+(LnORriskallele2*allelecount)+etc.

We also determined the risk scores of the CNV control samples (n = 326). However, we could only do this for the RET and NRG1, as no suitable proxy SNPs were present on the HCS 850k platform for the SEMA3C/D haplotypes (S2 Fig and S6 Table). Therefore, we used (proxy) SNPs present on the GSAMD-v1 chip to determine the Rotterdam population background for all risk haplotypes (group 5, n = 727, S2 Fig).

Experimental animals

Zebrafish were kept on a 14h/10h light–dark cycle at 28°C, during development and adulthood. Tg(phox2bb:GFP) animals were used for all experiments [47]. Larvae were kept in HEPES-buffered E3 medium and 0.003% 1-phenyl 2-thiourea (PTU) was added 24 hours post fertilization (hpf), to prevent pigmentation. All animal experiments were approved by the Animal Experimental Committee of the Erasmus MC Rotterdam.

CRISPR/Cas9 gene disruption

Gene targeting using CRISPR/Cas9 was performed as described previously [35]. The Alt-R CRISPR-Cas9 System from Integrated DNA Technologies (IDT) was used (Fig 3A) [34]. crRNAs were designed using the IDT tool and selected on high target efficiency and low off target effect (S9 Table). crRNA(s) and tracrRNA were diluted to final concentration of 100uM in the provided duplex buffer. crRNA/tracrRNA complexes were generated by adding them in a 1:1 proportion, heating to 95 degrees Celsius for 5 minutes and let it cool down to room temperature. 1μl of crRNA/tracrRNA complex solution was used in an injection mixture with a total volume of 6 μl (in 300mM KCl) containing 4ng cas9 protein (Generated from addgene plasmid #62731) and 0.4 μl of 0.5% Phenol Red (Sigma-Aldrich). 1nl was injected into the one-cell stage. gRNA sequences are listed in S9 Table, primers used for Sanger sequencing are listed in supplementary S10 Table. Efficiency of indel generation was determined as described previously (S11 Table) [35].

Morpholino injections

Fertilized zebrafish eggs were injected with the CRISPR/Cas9 complex. Subsequently, 0,5 or 1 ng of translation blocking morpholino against ret (5′- ACACGATTCCCCGCGTACTTCCCAT -3′), and 1:10 0,5% Phenol Red (Sigma-Aldrich), was injected between the 1- and 4-cell stage (see Fig 3E) [101]. To minimize variability, the same needle was used to inject the controls, that were not injected with CRISPR/Cas9 complex and droplet size (injection volume) was checked regularly during injections.

Imaging

Images of 5 dpf tg(phox2bb:GFP) larvae were taken using a Leica M165 fluorescent dissection microscope with the Leica LASX software. Larvae were anesthetized with 0.016% MS-222 in HEPES buffered E3 medium and positioned on their lateral side on a 1.5–2% agarose coated petridish, to enable visualization of the enteric neurons.

Quantification of phox2bb:GFP+ enteric neurons

The number of phox2bb:GFP+ enteric neurons was determined using FIJI [102](ImageJ2, http://developer.imagej.net/about) by manual polygon selection of the intestine, duplication of the selected region, selection of the GFP channel and the find maxima function. The noise tolerance was adjusted manually to detect all neurons present but exclude noise. If needed, the cell numbers were manually corrected for cells that were missed by the tool. The length of the gut region included in the neuron count was measured using the straight-line tool. The number of neurons was normalized to the gut length and expressed in number of neurons per 100 μm.

Statistical analysis

The number and size of rare CNVs, the number of rare losses and gains, the number of genes intolerant to variation (single nucleotide variants (SNVs) and CNVs), the number of “ENS genes” per rare CNV, and the relative weighted risk score, were determined and compared for the different groups with a single ANOVA test. If group differences existed (p < 0.05), we determined which subgroups were significantly different, using a two-tailed T-test. For statistical analysis of the zebrafish experiments the online tool MedCalc (MedCalc software ltd., Ostend, Belgium), was used together with the "N-1" Chi-squared test for the categorization. For the number of neurons we used a student’s t-test (two groups per experiment), or one-way ANOVA followed by Dunnett’s multiple comparisons test if the comparison included more than two groups per experiment, number of larvae used and p-values can be found in S11 Table. * p < 0.05 ** p < 0.01 *** p < 0.001.

Supporting information

S1 Fig. The Z-score distributions of total CNV size, CN loss and CN gain of controls and HSCR patients.

Depicted are the Z-score distributions of all rare CNVs (black lines), rare CN losses (red lines) and CN gains (blue lines). Controls in continuous lines, all HSCR patients in dotted lines. Limited negative z-scores due to the size limit cut-off of 20kb. The z-score distributions of CNV sizes were comparable between groups strongly suggesting that the outliers are responsible for the statistical significant groupwise-differences.

(DOCX)

S2 Fig. Risk score comparison between Sanger sequenced and SNP-array genotyped samples.

Depicted are the calculated noncoding risk scores (RSnc) of patients using the genotypes derived from the 850K array platform. However, we could only do this for the RET and NRG1, as no suitable proxy SNPs were present on the HCS 850k platform for the SEMA3C/D haplotypes (S2 Fig). Therefore, we used (proxy) SNPs present on the GSAMD-v1 chip to determine the Rotterdam population background for all risk haplotypes (group 5, n = 727, Panel B in S2 Fig).

(DOCX)

S3 Fig. Brightfield images of zebrafish larvae injected with slc8a1 gRNAs (slc8a1a/slc8a1b).

Brightfield images of zebrafish larvae injected with gRNAs targeting slc8a1a/b showing severe phenotypes including heart edema, absence of swim bladder and small eyes. Scale bar = 500μm.

(DOCX)

S4 Fig. Brightfield images of zebrafish larvae injected with gRNAs.

Brightfield images of zebrafish larvae injected with gRNAs targeting gnl1, tubb5, ufd1l, usp32, akt3a/b, tbx2a/b, gabbr1a/b and mapk8a/b showing mild phenotypes mainly consisting of the absence of a swim bladder and a down bend tail in case of ufd1l disruption. Scale bar = 500μm.

(DOCX)

S1 Table. Rare CNVs detected in this cohort.

(DOCX)

S2 Table. Statistical comparisons.

The number and size of rare CNVs, the number of rare losses and gains, the number of genes considered a CCR, the number of ENS genes per rare CNV, and the relative weighted risk score, were determined and compared for the different groups with a single ANOVA test. If group differences existed (p<0.05), we determined which subgroups were significantly different, using a two-tailed T-test. Abbreviations: CCR; Constrained Coding Region; SNV; single nucleotide variant, CNV; Copy Number Variation, ENS; Enteric Nervous System, RSnc; Risk Score non coding, ND; not determined. Higher values in green, lower values in red. Two-tailed p-values.

(DOCX)

S3 Table. HSCR disease gene characteristics.

Most known HSCR genes are intolerant to genetic variation and are rarely impacted by CNVs in unaffected individuals [39, 40]. These genes have been described to be impacted by CNVs in HSCR patients [7, 27], although this does not seem to be a frequent phenomenon, as in our cohort we did not detect any CNV impacting a known HSCR gene. Abbreviations: mis_z: Missense variation Z-score, syn_z: Synonymous variation z-sore, PLI: probability of being loss-of-function intolerant, PRec: probability of being tolerant to heterozygous loss of function variation but intolerant to homozygous loss of function variation, PNull: probability of being tolerant to loss of function variation (heterozygous or homozygous), del: deletion, dup: duplication, EW: embryonic week, DDD: deciphering developmental disorders project. Data derived from (https://gnomad.broadinstitute.org/) and the DDD control track at: https://genome-euro.ucsc.edu/.

(DOCX)

S4 Table. Rare CNV with “ENS genes”.

(DOCX)

S5 Table. Variant prioritization and WGS variant burden test results in genes impacted by rare CNV.

Abbreviations: NVAR: number of variants, NCASEHET: number of heterozygous variants in cases, NCTRLHET: number of heterozygous variants in controls.

(DOCX)

S6 Table. Risk haplotype markers and odds ratio’s used to calculate noncoding risk scores (RSnc).

Highlighted SNPS ingrey/ boldare the risk alleles and Odds ratio’s used in the polygenic risk score calculation. #We did not account for the increased risk of having the two main risk haplotype combinations [1]. @ this risk haplotype was not present in this patient cohort. Sanger sequencing was used to genotype all patients for SNPs known to be associated to HSCR [9, 10, 12, 13]. Patient and controls genotypes were determined using SNPs from the GSAMD-v1 platform. D’ and R2 derived from (https://ldlink.nci.nih.gov/) in European population. Primer sequences can be found in S8 Table.

(DOCX)

S7 Table. Description of individual patient genetic risk profiles.

Depicted are the individual patient’s characteristics, number of predisposing haplotypes (as represented by the risk SNPs) and the noncoding risk score (RSnc). Additionally, the RET risk haplotypes as described inS6 Tableare depicted. In blue the number of “ENS genes” in a CN gain, in red the number of “ENS genes” in a CN loss. Depicted in blue: (1) functional evidence from zebrafish studies and (2) the number of additional patients containing putative deleterious variants in genes within the rare CNVs. $No phasing was performed to discern the most likely haplotype.

(DOCX)

S8 Table. Primer sequences common predisposing SNPs.

(DOCX)

S9 Table. Details of the gRNA sequences used.

(DOCX)

S10 Table. Primers used for Sanger sequencing.

(DOCX)

S11 Table. Statistics, group sizes and gRNA efficiencies of zebrafish experiments.

(DOCX)

S12 Table. Gene content of rare CNV.

This list contains all genes in rare CNVs found in the four groups. We determined if a gene was a conserved coding region using the following criteria: sequence variants: PLI≥0.85, z-score for missense variants and/or synonymous variants ≥3. Deletions or duplications: del score ≥ 1, dup score ≥ 1 and cnv score ≥ 1. Additionally, genes impacted more than once by a deletion, or a duplication were not considered a CCR. If these criteria are met, this gene is marked YES in the CCR column. We determined whether the gene is a “ENS gene” based on if the gene expression is higher in isolated “ENS cells” (with and without the addition of GDNF) compared to the expression in other intestinal cells or total intestines in mice [41,42]. These genes are marked YES in the “Mouse ENS E11-E15.5 gene” column. Filtering on CCR, “ENS gene” and CN loss results in our candidate gene list. Note that in addition to the nine genes we disrupted in zebrafish DPCR1 and ZSCAN31 are included. These genes were excluded as they do not contain a zebrafish orthologue as depicted in the second sheet “zebrafish orthologues”. Sheet 3 “count tables” contains the count values used for Fig 2.

(XLSX)

Acknowledgments

We thank Mike Broeders for supplying us with the Cas9 protein.

Data Availability

All relevant data are within the manuscript and its Supporting information files. CNV regions are submitted to ClinVar, ID's are included in the manuscript (Table 1). Our ethics committee does not allow sharing of individual patient or control genotype information in the public domain. This study makes use of data generated by the DECIPHER community. A full list of centers who contributed to the generation of the data is available from http://decipher.sanger.ac.uk and via email from decipher@sanger.ac.uk.

Funding Statement

PT is funded by the Theme-based Research scheme [grant number T12C-714/14-R] and CT by the Health Medical Research Fund [HMRF 06171636]. RH was funded by a grant from the Sophia Foundation [grant nr. S14-33]. The salary of KM was paid by S14-33. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Nagy N, Goldstein AM. Enteric nervous system development: A crest cell’s journey from neural tube to colon. Seminars in cell & developmental biology. 2017;66:94–106. Epub 01/10. doi: 10.1016/j.semcdb.2017.01.006 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Lake JI, Heuckeroth RO. Enteric nervous system development: migration, differentiation, and disease. American journal of physiology Gastrointestinal and liver physiology. 2013;305(1):G1–G24. Epub 05/02. doi: 10.1152/ajpgi.00452.2012 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Moore SW, Zaahl MG. A review of genetic mutation in familial Hirschsprung’s disease in South Africa: towards genetic counseling. Journal of pediatric surgery. 2008;43(2):325–9. Epub 2008/02/19. doi: 10.1016/j.jpedsurg.2007.10.021 . [DOI] [PubMed] [Google Scholar]
  • 4.Moore SW, Zaahl M. Clinical and genetic correlations of familial Hirschsprung’s disease. Journal of pediatric surgery. 2015;50(2):285–8. Epub 2015/02/02. doi: 10.1016/j.jpedsurg.2014.11.016 . [DOI] [PubMed] [Google Scholar]
  • 5.Engum SA, Petrites M, Rescorla FJ, Grosfeld JL, Morrison AM, Engles D. Familial Hirschsprung’s disease: 20 cases in 12 kindreds. Journal of pediatric surgery. 1993;28(10):1286–90. Epub 1993/10/01. doi: 10.1016/s0022-3468(05)80314-8 . [DOI] [PubMed] [Google Scholar]
  • 6.Russell MB, Russell CA, Fenger K, Niebuhr E. Familial occurrence of Hirschsprung’s disease. Clinical genetics. 1994;45(5):231–5. Epub 1994/05/01. doi: 10.1111/j.1399-0004.1994.tb04147.x . [DOI] [PubMed] [Google Scholar]
  • 7.Amiel J, Sproat-Emison E, Garcia-Barcelo M, Lantieri F, Burzynski G, Borrego S, et al. Hirschsprung disease, associated syndromes and genetics: a review. J Med Genet. 2008;45(1):1–14. doi: 10.1136/jmg.2007.053959 . [DOI] [PubMed] [Google Scholar]
  • 8.Alves MM, Sribudiani Y, Brouwer RW, Amiel J, Antinolo G, Borrego S, et al. Contribution of rare and common variants determine complex diseases-Hirschsprung disease as a model. Dev Biol. 2013;382(1):320–9. Epub 2013/05/28. doi: 10.1016/j.ydbio.2013.05.019 . [DOI] [PubMed] [Google Scholar]
  • 9.Tilghman JM, Ling AY, Turner TN, Sosa MX, Krumm N, Chatterjee S, et al. Molecular Genetic Anatomy and Risk Profile of Hirschsprung’s Disease. N Engl J Med. 2019;380(15):1421–32. Epub 2019/04/11. doi: 10.1056/NEJMoa1706594 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Chatterjee S, Kapoor A, Akiyama JA, Auer DR, Lee D, Gabriel S, et al. Enhancer Variants Synergistically Drive Dysfunction of a Gene Regulatory Network In Hirschsprung Disease. Cell. 2016;167(2):355–68.e10. Epub 2016/10/04. doi: 10.1016/j.cell.2016.09.005 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kapoor A, Jiang Q, Chatterjee S, Chakraborty P, Sosa MX, Berrios C, et al. Population variation in total genetic risk of Hirschsprung disease from common RET, SEMA3 and NRG1 susceptibility polymorphisms. Human molecular genetics. 2015;24(10):2997–3003. Epub 2015/02/11. doi: 10.1093/hmg/ddv051 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Jiang Q, Arnold S, Heanue T, Kilambi KP, Doan B, Kapoor A, et al. Functional loss of semaphorin 3C and/or semaphorin 3D and their epistatic interaction with ret are critical to Hirschsprung disease liability. Am J Hum Genet. 2015;96(4):581–96. Epub 2015/04/04. doi: 10.1016/j.ajhg.2015.02.014 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Garcia-Barcelo MM, Tang CS, Ngan ES, Lui VC, Chen Y, So MT, et al. Genome-wide association study identifies NRG1 as a susceptibility locus for Hirschsprung’s disease. Proceedings of the National Academy of Sciences of the United States of America. 2009;106(8):2694–9. Epub 2009/02/07. doi: 10.1073/pnas.0809630105 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Gui H, Tang WK, So MT, Proitsi P, Sham PC, Tam PK, et al. RET and NRG1 interplay in Hirschsprung disease. Human genetics. 2013;132(5):591–600. Epub 2013/02/13. doi: 10.1007/s00439-013-1272-9 . [DOI] [PubMed] [Google Scholar]
  • 15.Tang CS, Tang WK, So MT, Miao XP, Leung BM, Yip BH, et al. Fine mapping of the NRG1 Hirschsprung’s disease locus. PloS one. 2011;6(1):e16181. Epub 2011/02/02. doi: 10.1371/journal.pone.0016181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Stankiewicz P, Lupski JR. Structural variation in the human genome and its role in disease. Annu Rev Med. 2010;61:437–55. doi: 10.1146/annurev-med-100708-204735 . [DOI] [PubMed] [Google Scholar]
  • 17.Fewtrell MS, Tam PK, Thomson AH, Fitchett M, Currie J, Huson SM, et al. Hirschsprung’s disease associated with a deletion of chromosome 10 (q11.2q21.2): a further link with the neurocristopathies? J Med Genet. 1994;31(4):325–7. doi: 10.1136/jmg.31.4.325 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Martucciello G, Bicocchi MP, Dodero P, Lerone M, Silengo Cirillo M, Puliti A, et al. Total colonic aganglionosis associated with interstitial deletion of the long arm of chromosome 10. Pediatric Surgery International. 1992;7(4):308–10. doi: 10.1007/bf00183991 [DOI] [Google Scholar]
  • 19.Edery P, Lyonnet S, Mulligan LM, Pelet A, Dow E, Abel L, et al. Mutations of the RET proto-oncogene in Hirschsprung’s disease. Nature. 1994;367(6461):378–80. doi: 10.1038/367378a0 . [DOI] [PubMed] [Google Scholar]
  • 20.Romeo G, Ronchetto P, Luo Y, Barone V, Seri M, Ceccherini I, et al. Point mutations affecting the tyrosine kinase domain of the RET proto-oncogene in Hirschsprung’s disease. Nature. 1994;367(6461):377–8. doi: 10.1038/367377a0 . [DOI] [PubMed] [Google Scholar]
  • 21.Bottani A, Xie YG, Binkert F, Schinzel A. A case of Hirschsprung disease with a chromosome 13 microdeletion, del(13)(q32.3q33.2): potential mapping of one disease locus. Human genetics. 1991;87(6):748–50. doi: 10.1007/BF00201741 . [DOI] [PubMed] [Google Scholar]
  • 22.Kiss P, Osztovics M. Association of 13q deletion and Hirschsprung’s disease. J Med Genet. 1989;26(12):793–4. doi: 10.1136/jmg.26.12.793 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Lamont MA, Fitchett M, Dennis NR. Interstitial deletion of distal 13q associated with Hirschsprung’s disease. J Med Genet. 1989;26(2):100–4. doi: 10.1136/jmg.26.2.100 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Shanske A, Ferreira JC, Leonard JC, Fuller P, Marion RW. Hirschsprung disease in an infant with a contiguous gene syndrome of chromosome 13. Am J Med Genet. 2001;102(3):231–6. doi: 10.1002/ajmg.1451 . [DOI] [PubMed] [Google Scholar]
  • 25.Sparkes RS, Sparkes MC, Kalina RE, Pagon RA, Salk DJ, Disteche CM. Separation of retinoblastoma and esterase D loci in a patient with sporadic retinoblastoma and del(13)(q14.1q22.3). Human genetics. 1984;68(3):258–9. doi: 10.1007/BF00418397 . [DOI] [PubMed] [Google Scholar]
  • 26.Puffenberger EG, Hosoda K, Washington SS, Nakao K, deWit D, Yanagisawa M, et al. A missense mutation of the endothelin-B receptor gene in multigenic Hirschsprung’s disease. Cell. 1994;79(7):1257–66. Epub 1994/12/30. doi: 10.1016/0092-8674(94)90016-7 . [DOI] [PubMed] [Google Scholar]
  • 27.Amiel J, Espinosa-Parrilla Y, Steffann J, Gosset P, Pelet A, Prieur M, et al. Large-scale deletions and SMADIP1 truncating mutations in syndromic Hirschsprung disease with involvement of midline structures. Am J Hum Genet. 2001;69(6):1370–7. doi: 10.1086/324342 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Lurie IW, Supovitz KR, Rosenblum-Vos LS, Wulfsberg EA. Phenotypic variability of del(2) (q22-q23): report of a case with a review of the literature. Genet Couns. 1994;5(1):11–4. . [PubMed] [Google Scholar]
  • 29.McMilin KD, Reiss JA, Brown MG, Black MH, Buckmaster DA, Durum CT, et al. Clinical outcomes of four patients with microdeletion in the long arm of chromosome 2. Am J Med Genet. 1998;78(1):36–43. . [PubMed] [Google Scholar]
  • 30.Mowat DR, Croaker GD, Cass DT, Kerr BA, Chaitow J, Ades LC, et al. Hirschsprung disease, microcephaly, mental retardation, and characteristic facial features: delineation of a new syndrome and identification of a locus at chromosome 2q22-q23. J Med Genet. 1998;35(8):617–23. doi: 10.1136/jmg.35.8.617 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Benailly HK, Lapierre JM, Laudier B, Amiel J, Attie T, De Blois MC, et al. PMX2B, a new candidate gene for Hirschsprung’s disease. Clinical genetics. 2003;64(3):204–9. doi: 10.1034/j.1399-0004.2003.00105.x . [DOI] [PubMed] [Google Scholar]
  • 32.Wakamatsu N, Yamada Y, Yamada K, Ono T, Nomura N, Taniguchi H, et al. Mutations in SIP1, encoding Smad interacting protein-1, cause a form of Hirschsprung disease. Nature genetics. 2001;27(4):369–70. Epub 2001/03/30. doi: 10.1038/86860 . [DOI] [PubMed] [Google Scholar]
  • 33.Amiel J, Laudier B, Attie-Bitach T, Trang H, de Pontual L, Gener B, et al. Polyalanine expansion and frameshift mutations of the paired-like homeobox gene PHOX2B in congenital central hypoventilation syndrome. Nature genetics. 2003;33(4):459–61. Epub 2003/03/18. doi: 10.1038/ng1130 . [DOI] [PubMed] [Google Scholar]
  • 34.Hoshijima K, Jurynec MJ, Klatt Shaw D, Jacobi AM, Behlke MA, Grunwald DJ. Highly Efficient CRISPR-Cas9-Based Methods for Generating Deletion Mutations and F0 Embryos that Lack Gene Function in Zebrafish. Dev Cell. 2019;51(5):645–57 e4. Epub 2019/11/12. doi: 10.1016/j.devcel.2019.10.004 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Kuil LE, Oosterhof N, Geurts SN, van der Linde HC, Meijering E, van Ham TJ. Reverse genetic screen reveals that Il34 facilitates yolk sac macrophage distribution and seeding of the brain. Dis Model Mech. 2019;12(3). Epub 2019/02/16. doi: 10.1242/dmm.037762. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kroll F, Powell GT, Ghosh M, Gestri G, Antinucci P, Hearn TJ, et al. A simple and effective F0 knockout method for rapid screening of behaviour and other complex phenotypes. eLife. 2021;10:e59683. doi: 10.7554/eLife.59683 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Ganz J. Gut feelings: Studying enteric nervous system development, function, and disease in the zebrafish model system. Dev Dyn. 2018;247(2):268–78. doi: 10.1002/dvdy.24597 . [DOI] [PubMed] [Google Scholar]
  • 38.Kuil LE, Chauhan RK, Cheng WW, Hofstra RMW, Alves MM. Zebrafish: A Model Organism for Studying Enteric Nervous System Development and Disease. Frontiers in Cell and Developmental Biology. 2021;8(1874). doi: 10.3389/fcell.2020.629073 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Coe BP, Witherspoon K, Rosenfeld JA, van Bon BW, Vulto-van Silfhout AT, Bosco P, et al. Refining analyses of copy number variation identifies specific genes associated with developmental delay. Nature genetics. 2014;46(10):1063–71. doi: 10.1038/ng.3092 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Cooper GM, Coe BP, Girirajan S, Rosenfeld JA, Vu TH, Baker C, et al. A copy number variation morbidity map of developmental delay. Nature genetics. 2011;43(9):838–46. doi: 10.1038/ng.909 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Schriemer D, Sribudiani Y, IJ A, Natarajan D, MacKenzie KC, Metzger M, et al. Regulators of gene expression in Enteric Neural Crest Cells are putative Hirschsprung disease genes. Dev Biol. 2016;416(1):255–65. doi: 10.1016/j.ydbio.2016.06.004 . [DOI] [PubMed] [Google Scholar]
  • 42.Memic F, Knoflach V, Morarach K, Sadler R, Laranjeira C, Hjerling-Leffler J, et al. Transcription and Signaling Regulators in Developing Neuronal Subtypes of Mouse and Human Enteric Nervous System. Gastroenterology. 2018;154(3):624–36. doi: 10.1053/j.gastro.2017.10.005 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536(7616):285–91. doi: 10.1038/nature19057 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Ruderfer DM, Hamamsy T, Lek M, Karczewski KJ, Kavanagh D, Samocha KE, et al. Patterns of genic intolerance of rare copy number variation in 59,898 human exomes. Nature genetics. 2016;48(10):1107–11. Epub 2016/08/18. doi: 10.1038/ng.3638 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Tang CS, Cheng G, So MT, Yip BH, Miao XP, Wong EH, et al. Genome-wide copy number analysis uncovers a new HSCR gene: NRG3. PLoS genetics. 2012;8(5):e1002687. Epub 2012/05/17. doi: 10.1371/journal.pgen.1002687. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Tang CS, Li P, Lai FP, Fu AX, Lau ST, So MT, et al. Identification of Genes Associated With Hirschsprung Disease, Based on Whole-Genome Sequence Analysis, and Potential Effects on Enteric Nervous System Development. Gastroenterology. 2018;155(6):1908–22.e5. Epub 2018/09/16. doi: 10.1053/j.gastro.2018.09.012 . [DOI] [PubMed] [Google Scholar]
  • 47.Nechiporuk A, Linbo T, Poss KD, Raible DW. Specification of epibranchial placodes in zebrafish. Development (Cambridge, England). 2007;134(3):611–23. doi: 10.1242/dev.02749 . [DOI] [PubMed] [Google Scholar]
  • 48.Ebert AM, Hume GL, Warren KS, Cook NP, Burns CG, Mohideen MA, et al. Calcium extrusion is critical for cardiac morphogenesis and rhythm in embryonic zebrafish hearts. Proceedings of the National Academy of Sciences of the United States of America. 2005;102(49):17705–10. Epub 2005/11/30. doi: 10.1073/pnas.0502683102 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Huiting LN, Samaha Y, Zhang GL, Roderick JE, Li B, Anderson NM, et al. UFD1 contributes to MYC-mediated leukemia aggressiveness through suppression of the proapoptotic unfolded protein response. Leukemia. 2018;32(11):2339–51. doi: 10.1038/s41375-018-0141-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Rosenfeld JA, Coe BP, Eichler EE, Cuckle H, Shaffer LG. Estimates of penetrance for recurrent pathogenic copy-number variations. Genetics in medicine: official journal of the American College of Medical Genetics. 2013;15(6):478–81. Epub 2012/12/22. doi: 10.1038/gim.2012.164 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Ballif BC, Theisen A, Rosenfeld JA, Traylor RN, Gastier-Foster J, Thrush DL, et al. Identification of a recurrent microdeletion at 17q23.1q23.2 flanked by segmental duplications associated with heart defects and limb abnormalities. Am J Hum Genet. 2010;86(3):454–61. doi: 10.1016/j.ajhg.2010.01.038 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Laurell T, Lundin J, Anderlid BM, Gorski JL, Grigelioniene G, Knight SJ, et al. Molecular and clinical delineation of the 17q22 microdeletion phenotype. Eur J Hum Genet. 2013;21(10):1085–92. doi: 10.1038/ejhg.2012.306 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Tang CS, Gui H, Kapoor A, Kim JH, Luzon-Toro B, Pelet A, et al. Trans-ethnic meta-analysis of genome-wide association studies for Hirschsprung disease. Human molecular genetics. 2016;25(23):5265–75. Epub 2016/10/06. doi: 10.1093/hmg/ddw333 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.de Pontual L, Pelet A, Trochet D, Jaubert F, Espinosa-Parrilla Y, Munnich A, et al. Mutations of the RET gene in isolated and syndromic Hirschsprung’s disease in human disclose major and modifier alleles at a single locus. J Med Genet. 2006;43(5):419–23. Epub 2006/01/31. doi: 10.1136/jmg.2005.040113 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.de Pontual L, Pelet A, Clement-Ziza M, Trochet D, Antonarakis SE, Attie-Bitach T, et al. Epistatic interactions with a common hypomorphic RET allele in syndromic Hirschsprung disease. Human mutation. 2007;28(8):790–6. Epub 2007/04/03. doi: 10.1002/humu.20517 . [DOI] [PubMed] [Google Scholar]
  • 56.de Pontual L, Zaghloul NA, Thomas S, Davis EE, McGaughey DM, Dollfus H, et al. Epistasis between RET and BBS mutations modulates enteric innervation and causes syndromic Hirschsprung disease. Proceedings of the National Academy of Sciences of the United States of America. 2009;106(33):13921–6. Epub 2009/08/12. doi: 10.1073/pnas.0901219106 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Arnold S, Pelet A, Amiel J, Borrego S, Hofstra R, Tam P, et al. Interaction between a chromosome 10 RET enhancer and chromosome 21 in the Down syndrome-Hirschsprung disease association. Human mutation. 2009;30(5):771–5. Epub 2009/03/24. doi: 10.1002/humu.20944 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Jannot AS, Pelet A, Henrion-Caude A, Chaoui A, Masse-Morel M, Arnold S, et al. Chromosome 21 scan in Down syndrome reveals DSCAM as a predisposing locus in Hirschsprung disease. PloS one. 2013;8(5):e62519. Epub 2013/05/15. doi: 10.1371/journal.pone.0062519. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Natarajan D, Marcos-Gutierrez C, Pachnis V, de Graaff E. Requirement of signalling by receptor tyrosine kinase RET for the directed migration of enteric nervous system progenitor cells during mammalian embryogenesis. Development (Cambridge, England). 2002;129(22):5151–60. Epub 2002/10/26. . [DOI] [PubMed] [Google Scholar]
  • 60.Rui Y, Xu Z, Xiong B, Cao Y, Lin S, Zhang M, et al. A beta-catenin-independent dorsalization pathway activated by Axin/JNK signaling and antagonized by aida. Dev Cell. 2007;13(2):268–82. Epub 2007/08/08. doi: 10.1016/j.devcel.2007.07.006 . [DOI] [PubMed] [Google Scholar]
  • 61.Kerstjens-Frederikse WS, Hofstra RM, van Essen AJ, Meijers JH, Buys CH. A Hirschsprung disease locus at 22q11? J Med Genet. 1999;36(3):221–4. . [PMC free article] [PubMed] [Google Scholar]
  • 62.Scambler PJ. The 22q11 deletion syndromes. Human molecular genetics. 2000;9(16):2421–6. doi: 10.1093/hmg/9.16.2421 . [DOI] [PubMed] [Google Scholar]
  • 63.Yamagishi H, Garg V, Matsuoka R, Thomas T, Srivastava D. A molecular pathway revealing a genetic basis for human cardiac and craniofacial defects. Science. 1999;283(5405):1158–61. doi: 10.1126/science.283.5405.1158 . [DOI] [PubMed] [Google Scholar]
  • 64.Hendershot TJ, Liu H, Sarkar AA, Giovannucci DR, Clouthier DE, Abe M, et al. Expression of Hand2 is sufficient for neurogenesis and cell type-specific gene expression in the enteric nervous system. Dev Dyn. 2007;236(1):93–105. doi: 10.1002/dvdy.20989 . [DOI] [PubMed] [Google Scholar]
  • 65.D’Autreaux F, Margolis KG, Roberts J, Stevanovic K, Mawe G, Li Z, et al. Expression level of Hand2 affects specification of enteric neurons and gastrointestinal function in mice. Gastroenterology. 2011;141(2):576–87, 87 e1–6. doi: 10.1053/j.gastro.2011.04.059 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Lindsay EA, Botta A, Jurecic V, Carattini-Rivera S, Cheah Y-C, Rosenblatt HM, et al. Congenital heart disease in mice deficient for the DiGeorge syndrome region. Nature. 1999;401(6751):379. doi: 10.1038/43900 [DOI] [PubMed] [Google Scholar]
  • 67.Vitelli F, Lindsay E, Baldini A, editors. Genetic dissection of the DiGeorge syndrome phenotype. Cold Spring Harbor symposia on quantitative biology; 2002: Cold Spring Harbor Laboratory Press. [DOI] [PubMed] [Google Scholar]
  • 68.McDonald-McGinn DM, Sullivan KE, Marino B, Philip N, Swillen A, Vorstman JA, et al. 22q11.2 deletion syndrome. Nature reviews Disease primers. 2015;1:15071. Epub 2015/01/01. doi: 10.1038/nrdp.2015.71. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Schonewolf-Greulich B, Ronan A, Ravn K, Baekgaard P, Lodahl M, Nielsen K, et al. Two new cases with microdeletion of 17q23.2 suggest presence of a candidate gene for sensorineural hearing loss within this region. Am J Med Genet A. 2011;155A(12):2964–9. doi: 10.1002/ajmg.a.34302 . [DOI] [PubMed] [Google Scholar]
  • 70.Khattab M, Xu F, Li P, Bhandari V. A de novo 3.54 Mb deletion of 17q22-q23.1 associated with hydrocephalus: a case report and review of literature. Am J Med Genet A. 2011;155A(12):3082–6. doi: 10.1002/ajmg.a.34307 . [DOI] [PubMed] [Google Scholar]
  • 71.Pan L, Ma X, Wen B, Su Z, Zheng X, Liu Y, et al. Microphthalmia-associated transcription factor/T-box factor-2 axis acts through Cyclin D1 to regulate melanocyte proliferation. Cell proliferation. 2015;48(6):631–42. Epub 2015/10/22. doi: 10.1111/cpr.12227 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Stanchina L, Baral V, Robert F, Pingault V, Lemort N, Pachnis V, et al. Interactions between Sox10, Edn3 and Ednrb during enteric nervous system and melanocyte development. Dev Biol. 2006;295(1):232–49. Epub 2006/05/03. doi: 10.1016/j.ydbio.2006.03.031 . [DOI] [PubMed] [Google Scholar]
  • 73.Watanabe Y, Stanchina L, Lecerf L, Gacem N, Conidi A, Baral V, et al. Differentiation of Mouse Enteric Nervous System Progenitor Cells Is Controlled by Endothelin 3 and Requires Regulation of Ednrb by SOX10 and ZEB2. Gastroenterology. 2017;152(5):1139–50.e4. Epub 2017/01/09. doi: 10.1053/j.gastro.2016.12.034 . [DOI] [PubMed] [Google Scholar]
  • 74.Harrelson Z, Kelly RG, Goldin SN, Gibson-Brown JJ, Bollag RJ, Silver LM, et al. Tbx2 is essential for patterning the atrioventricular canal and for morphogenesis of the outflow tract during heart development. Development (Cambridge, England). 2004;131(20):5041–52. Epub 2004/10/02. doi: 10.1242/dev.01378 . [DOI] [PubMed] [Google Scholar]
  • 75.Sedletcaia A, Evans T. Heart chamber size in zebrafish is regulated redundantly by duplicated tbx2 genes. Developmental Dynamics. 2011;240(6):1548–57. doi: 10.1002/dvdy.22622 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Cho CH, Kim SS, Jeong MJ, Lee CO, Shin HS. The Na+ -Ca2+ exchanger is essential for embryonic heart development in mice. Mol Cells. 2000;10(6):712–22. doi: 10.1007/s10059-000-0712-2 . [DOI] [PubMed] [Google Scholar]
  • 77.Casanova E, Guetg N, Vigot R, Seddik R, Julio-Pieper M, Hyland NP, et al. A mouse model for visualization of GABA(B) receptors. Genesis. 2009;47(9):595–602. doi: 10.1002/dvg.20535 . [DOI] [PubMed] [Google Scholar]
  • 78.Ong J, Kerr DI. GABAA- and GABAB-receptor-mediated modification of intestinal motility. Eur J Pharmacol. 1982;86(1):9–17. doi: 10.1016/0014-2999(82)90390-9 . [DOI] [PubMed] [Google Scholar]
  • 79.Krantis A, Harding RK. GABA-related actions in isolated in vitro preparations of the rat small intestine. Eur J Pharmacol. 1987;141(2):291–8. doi: 10.1016/0014-2999(87)90274-3 . [DOI] [PubMed] [Google Scholar]
  • 80.Giotti A, Luzzi S, Spagnesi S, Zilletti L. GABAA and GABAB receptor-mediated effects in guinea-pig ileum. Br J Pharmacol. 1983;78(3):469–78. doi: 10.1111/j.1476-5381.1983.tb08807.x . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Pencheva N, Itzev D, Milanov P. Comparison of gamma-aminobutyric acid effects in different parts of the cat ileum. Eur J Pharmacol. 1999;368(1):49–56. doi: 10.1016/s0014-2999(99)00017-5 . [DOI] [PubMed] [Google Scholar]
  • 82.Auteri M, Zizzo MG, Mastropaolo M, Serio R. Opposite role played by GABAA and GABAB receptors in the modulation of peristaltic activity in mouse distal colon. Eur J Pharmacol. 2014;731:93–9. doi: 10.1016/j.ejphar.2014.03.003 . [DOI] [PubMed] [Google Scholar]
  • 83.Blake JA, Eppig JT, Kadin JA, Richardson JE, Smith CL, Bult CJ, et al. Mouse Genome Database (MGD)-2017: community knowledge resource for the laboratory mouse. Nucleic Acids Res. 2017;45(D1):D723–D9. doi: 10.1093/nar/gkw1040 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Fishilevich S, Zimmerman S, Kohn A, Iny Stein T, Olender T, Kolker E, et al. Genic insights from integrated human proteomics in GeneCards. Database: the journal of biological databases and curation. 2016;2016:baw030. doi: 10.1093/database/baw030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Consortium GT, Laboratory DA, Coordinating Center -Analysis Working G, Statistical Methods groups-Analysis Working G, Enhancing Gg, Fund NIHC, et al. Genetic effects on gene expression across human tissues. Nature. 2017;550(7675):204–13. doi: 10.1038/nature24277 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Brooks AS, Bertoli-Avella AM, Burzynski GM, Breedveld GJ, Osinga J, Boven LG, et al. Homozygous nonsense mutations in KIAA1279 are associated with malformations of the central and enteric nervous systems. Am J Hum Genet. 2005;77(1):120–6. doi: 10.1086/431244 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.MacKenzie KC, de Graaf BM, Syrimis A, Zhao Y, Brosens E, Mancini GMS, et al. Goldberg-Shprintzen syndrome is determined by the absence, or reduced expression levels, of KIFBP. Hum Mutat. 2020;41(11):1906–17. Epub 2020/09/18. doi: 10.1002/humu.24097 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.McCann CJ, Alves MM, Brosens E, Natarajan D, Perin S, Chapman C, et al. Neuronal Development and Onset of Electrical Activity in the Human Enteric Nervous System. Gastroenterology. 2019. Epub 2019/01/06. doi: 10.1053/j.gastro.2018.12.020. [DOI] [PubMed] [Google Scholar]
  • 89.Isrie M, Breuss M, Tian G, Hansen AH, Cristofoli F, Morandell J, et al. Mutations in Either TUBB or MAPRE2 Cause Circumferential Skin Creases Kunze Type. Am J Hum Genet. 2015;97(6):790–800. Epub 2015/12/08. doi: 10.1016/j.ajhg.2015.10.014 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Breuss M, Fritz T, Gstrein T, Chan K, Ushakova L, Yu N, et al. Mutations in the murine homologue of TUBB5 cause microcephaly by perturbing cell cycle progression and inducing p53-associated apoptosis. Development (Cambridge, England). 2016;143(7):1126–33. Epub 2016/02/24. doi: 10.1242/dev.131516 . [DOI] [PubMed] [Google Scholar]
  • 91.Ngo L, Haas M, Qu Z, Li SS, Zenker J, Teng KS, et al. TUBB5 and its disease-associated mutations influence the terminal differentiation and dendritic spine densities of cerebral cortical neurons. Hum Mol Genet. 2014;23(19):5147–58. Epub 2014/05/17. doi: 10.1093/hmg/ddu238 . [DOI] [PubMed] [Google Scholar]
  • 92.Miao X, Leon TY, Ngan ES, So MT, Yuan ZW, Lui VC, et al. Reduced RET expression in gut tissue of individuals carrying risk alleles of Hirschsprung’s disease. Human molecular genetics. 2010;19(8):1461–7. Epub 2010/01/22. doi: 10.1093/hmg/ddq020 . [DOI] [PubMed] [Google Scholar]
  • 93.Jiang Q, Ho YY, Hao L, Nichols Berrios C, Chakravarti A. Copy number variants in candidate genes are genetic modifiers of Hirschsprung disease. PloS one. 2011;6(6):e21219. Epub 2011/06/30. doi: 10.1371/journal.pone.0021219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Brosens E, Marsch F, de Jong EM, Zaveri HP, Hilger AC, Choinitzki VG, et al. Copy number variations in 375 patients with oesophageal atresia and/or tracheoesophageal fistula. Eur J Hum Genet. 2016;24(12):1715–23. doi: 10.1038/ejhg.2016.86 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Gui H, Schriemer D, Cheng WW, Chauhan RK, Antinolo G, Berrios C, et al. Whole exome sequencing coupled with unbiased functional analysis reveals new Hirschsprung disease genes. Genome Biol. 2017;18(1):48. Epub 2017/03/10. doi: 10.1186/s13059-017-1174-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Lai FP, Lau ST, Wong JK, Gui H, Wang RX, Zhou T, et al. Correction of Hirschsprung-Associated Mutations in Human Induced Pluripotent Stem Cells Via Clustered Regularly Interspaced Short Palindromic Repeats/Cas9, Restores Neural Crest Cell Function. Gastroenterology. 2017;153(1):139–53 e8. doi: 10.1053/j.gastro.2017.03.014 . [DOI] [PubMed] [Google Scholar]
  • 97.Brouwer RWW, van den Hout M, Kockx CEM, Brosens E, Eussen B, de Klein A, et al. Nimbus: A design-driven analyses suite for amplicon based NGS data. Bioinformatics (Oxford, England). 2018. Epub 2018/03/15. doi: 10.1093/bioinformatics/bty145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Brouwer RW, van den Hout MC, Grosveld FG, van Ijcken WF. NARWHAL, a primary analysis pipeline for NGS data. Bioinformatics (Oxford, England). 2012;28(2):284–5. Epub 2011/11/11. doi: 10.1093/bioinformatics/btr613 . [DOI] [PubMed] [Google Scholar]
  • 99.Halim D, Brosens E, Muller F, Wangler MF, Beaudet AL, Lupski JR, et al. Loss-of-Function Variants in MYLK Cause Recessive Megacystis Microcolon Intestinal Hypoperistalsis Syndrome. Am J Hum Genet. 2017;101(1):123–9. Epub 2017/06/13. doi: 10.1016/j.ajhg.2017.05.011 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Zhan X, Hu Y, Li B, Abecasis GR, Liu DJ. RVTESTS: an efficient and comprehensive tool for rare variant association analysis using sequence data. Bioinformatics (Oxford, England). 2016;32(9):1423–6. doi: 10.1093/bioinformatics/btw079 . [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Heanue TA, Pachnis V. Ret isoform function and marker gene expression in the enteric nervous system is conserved across diverse vertebrate species. Mechanisms of development. 2008;125(8):687–99. Epub 2008/06/21. doi: 10.1016/j.mod.2008.04.006 . [DOI] [PubMed] [Google Scholar]
  • 102.Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, et al. Fiji: an open-source platform for biological-image analysis. Nature methods. 2012;9(7):676–82. Epub 2012/06/30. doi: 10.1038/nmeth.2019 . [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

Gregory S Barsh

5 Dec 2020

Dear Dr Brosens,

Thank you very much for submitting your Research Article entitled 'Size matters: large copy number losses reveal novel Hirschsprung disease genes' to PLOS Genetics.

The manuscript was fully evaluated at the editorial level and by two independent peer reviewers. As you will see, the reviewers appreciated the attention to an important problem, but raised substantial and serious concerns about presentation, organization, and annotation. Although there are no fundamental flaws with the approach or design, the editors agree with the reviewers that comprehensive reorganization and rewriting would be necessary for a revised manuscript to move forward. 

 Based on the reviews, we will not be able to accept this version of the manuscript, but we would be willing to review a much-revised version. We caution that the extent of effort necessary to satisfy the reviewers' concerns is considerable, and that for a revised manuscript to succeed, the reviewers' concerns will need to be satisfied fully.

Should you decide to revise the manuscript for further consideration here, your revisions should address the specific points made by each reviewer. We will also require a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript.

If you decide to revise the manuscript for further consideration at PLOS Genetics, please aim to resubmit within the next 60 days, unless it will take extra time to address the concerns of the reviewers, in which case we would appreciate an expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments are included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions see our guidelines.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool.  PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, use the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

We are sorry that we cannot be more positive about your manuscript at this stage. Please do not hesitate to contact us if you have any concerns or questions.

Yours sincerely,

Gregory S. Barsh

Editor-in-Chief

PLOS Genetics

Gregory Copenhaver

Editor-in-Chief

PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: Because of the complexity of Hirschsprung Disease (HSCR) genetics, identifying the genetic alterations responsible for the condition is challenging. To explore whether rare CNVs contribute to aganglionosis in HSCR-AAM patents (HSCR with additional anatomical malformations) and to uncover genes underlying HSCR-AAM, the authors have combined analysis of patient CNVs, evaluation of CN loss regions for genes expressed in the embryonic mammalian enteric nervous system (ENS) to identify candidate genes influencing disease, and then testing the function of these candidate genes in ENS development in zebrafish. The identification of candidate HSCR disease associated genes is of interest to the fields of human genetics and developmental biology, however the current manuscript suffers from some significant organizational/readability issues as well as issues with the description, analysis and interpretation of the gene functional studies.

Major comments

1. The information needed to understand the experimental design and results is not where it needs to be and instead is buried within the supplemental material or in the methods, including some of the main findings. The main figures and supplementary figures lack appropriate legends. And the supplementary material is not well organized and much of it is not explicitly referred to in the paper. Some examples:

a) The rare CNV analysis is not properly introduced. Key to this analysis is the classification of subgroups of HSCR-AAM. Figure 1A/B/3, depict HSCR patients classification into 3 groups, however the results and figure legends never describe these groups. This information is found only in the “Patient Inclusion” section of the methods on page 8, and needing to search for this important information affects readability.

b) The evidence for the first finding, “The number of rare CNVs per individual did not differ between subgroups or controls (S4).”, appears to not be represented in the figure, but only in the S4 legend text. This is one of the key findings cited in the abstract, and should be included in the main text and/or be clearly represented by a figure data panel (i.e. S4n), instead of needing to hunt for the information. Likewise, data for another main finding cited on p4 “In Group 1, rare CNVs are enriched for ENS genes (p=4.565E-6, S7).” appears not to be within S7, but in S4, but again only in the figure legend. And it isn’t clear what data this point refers to. Again, these main findings should be clearly presented.

c) Further CNV analysis is found over the 1 table and 10 plots of S4. Each piece of data should have designators (i.e. S4n) and the finding from each plot indicated in the results section. Many of these results are currently only discussed within this supplementary figure legend. If these results are not presented in the paper, then they should not be included. The same is true for all data figures and supplemental data figures. Careful consideration must be made of what to include and to properly describe any included data.

d) Acronyms such as RS (and RSnc, RScnv and RSrcv) are not defined in the legends. Nor do they appear in the text on first appearance (i.e. not for Table 1, S4). RSnc eventually described in reference to Fig2C and S11 (see also point 1, above). The other acronyms are never described or referred to other than appearing in Fig1b. In Tables 1 and 2, the acronyms in the column headers are not defined. A few are also mentioned in S9, but still not defined.

e) The term “ENS genes” is used in the results on page 4 in reference to S7 (and in S4) but without explanation. After searching, a partial definition of this term is found within the legend of S4 and also obliquely described in the methods page 9. This is a key piece of information that should be clearly described in the relevant part of the results section.

2. Several issues relate to the ENS phenotypes described following CRISPR/Cas9 targeting of zebrafish genes:

a) Phenotypes are described variously as “hypoganglionosis”, “ultra-short segment HSCR”, “short segment HSCR” and “total colonic HSCR”. This is not a valid description of these zebrafish phenotypes. There is no justification for the use of these terms and they are highly misleading. The whole zebrafish intestine is not equivalent to the mammalian colon. Instead, phenotypes affecting the distribution of ENS neurons along the gut length should be presented in neutral terms/values that actually describe the phenotypes. While the term “hypoganglionosis” has been used previously to describe zebrafish ENS phenotypes with a reduction in ENS neuron number, this term is not accurate. The zebrafish ENS has no ganglia. A proper descriptive term should be used.

b) It is not stated how a designation of “hypoganglionosis” is given. Are neurons counted? What percentage reduction constitutes a phenotype?

c) Are phenotypes observed outside of the ENS? Are any growth abnormalities observed in any gene targeting experiment? This is an important point since general developmental delays would affect the number and distribution of neurons observed at 5dpf.

d) P-values are given for observed phenotypes in S16. It is unclear whether all types of phenotypes have been lumped together for these statistics.

e) Results and p-values are described for "tbx2" in Figure 2D and S16, but it is unclear if this refers to CRISPR/Cas9 targeting of tbx2a or tbx2b or a combination. The same question arises for gabbr1a/1b, slc8a1a/1b and mapk8a/8b.

f) Ns are low for all CRISPR/Cas9 experiments, ranging from 15 to 40 embryos per gene/gene pair targeted. In the case of slc8a1, a p-value of 0.0073 is given for an experiment involving 15 embryos. Power calculations should be shown to demonstrate the appropriate number of animals needed for statistically relevant results, and that the numbers used in these experiments are sufficient.

g) The description of the results in Figures 2E and 2F is very sparse and requires further elaboration and discussion.

h) The phenotypes observed in zebrafish should never be described as “HSCR” (i.e p7 of the Conclusions). See point 1a, above.

i) Candidate HSCR disease genes should be described as “candidates” rather than “HSCR disease genes” (see p. 8, for example)

j) A p-value is given for ret+tbx2, but no results are shown in Fig2e and no data is shown for slc8a1

3. Other issues:

a) Figure 1 is a graphical display of the experimental design. The images chosen are mostly far too small to see. Key information is missing (i.e. what are the subgroups, see point 1a, above), and some images are possibly not appropriate (‘ENS’ panel suggests sequencing gels were run? Wasn't this panel referring to microarray data?). This entire figure requires significant improvements to be useful.

b) Why isn’t ChAT chosen for analysis (Table 1)? From the logic presented, it’s not clear why this gene is not selected, for it features in S7 and is an ENS expressed gene.

c) The statistics designations (i.e. meaning of ** vs ***) are never explained in the methods. The actual statistics should be found in the figure or in the figure legend, in addition to the main text. This is especially important because some results are not described in the text (i.e. most of S4 plots where * designations are not given, though p-values are indicated in the legend. But it’s unclear which p-value is referring to which panel - see point 1c, above).

d) The term “overexpressed” is used many times, for example in “a constraint gene overexpressed in the mouse ENS” (p.9) , but these genes are NOT overexpressed in the ENS. This term suggests some alteration or manipulation. Instead, these genes are expressed in the ENS. They are also expressed in the ENS more abundantly than other reference tissues (i.e. other gut tissues). This term needs to be changed throughout the text/figures/legends/data tables, etc..

e) On p5 the statement: “However, there was no significant enrichment for nonsense, splice site or missense variants in genes impacted by rare gains nor losses (S9)" seems at odds with the data in S9, which shows p-values of <0.05 for 5 genes.

f) S15 appears to refer to figures incorrectly.

g) Attention to writing is needed throughout, and especially in the author summary.

Reviewer #2: This work presented in this manuscript involves screening of a population of HSCR patients with and without additional presenting phenotypic features for the presence of rare CNVs as compared to control individuals. CNVs were then filtered for predicted deleterious impact based on the ENS expression for genes located within the CNV and a gene’s intolerance to variation within coding regions. Candidate genes were subsequently functionally evaluated for a role in ENS development utilizing a zebrafish CRIPR/cas9 knockdown, ret morpholino sensitized model.

This article would immensely benefit form a comprehensive revision and organization of the material presented. Paper is not well organized and critical information is buried in the supplemental files without proper descriptions and annotation links from within the main paper. The paper would benefit from editing for verb tense and typographical errors. The manuscript as currently presented detracts from the data presentation.

From the start of the results section, the authors should clearly define the patient grouping characteristics upfront. This identification needs to be clearly stated in the main text and in the appropriate figure legends, not merely placed at the end, in the methods, patient inclusion section.

With respect to the assessment of gene function in zebrafish, full images representing gene knockdown embryos should be included in supplement as this data would be of interest to a broader audience. I would note that under the supplement section for patient P_000479, the authors do state “In line with this, we show that zebrafish larvae where slc8a1a and slc8a1b were disrupted, severe heart edema was present, as well as swollen heads”, however images of this data was not presented.

Authors should clearly note which of the two alleles for each gene are being displayed in the Figure 2f gRNAs plots. For each of these genes the authors may find helpful the ZFIN resource ( https://zfin.org/ ). This resource will allow for identification of published results of previously characterized Zebrafish gene expression and allele specific phenotypic results for comparison. These phenotypes should be included when discussing the gene expression or phenotypes alongside those of mouse models as the authors are utilizing zebrafish as their research model for phenotypic evaluation.

In supplemental section 8, authors utilize this section to not only present patient phenotypic information, but also expand on data presented and draw conclusions based on this data. Much of the information included here would benefit from a rewrite consolidation and reworking into the discussion of the main paper.

Authors should address in their discussion the large number of individuals remaining without an explained genetic diagnosis following this analysis including the extent to which complex rearrangements will remain undetected by SNP array-based analysis (eg. complex rearrangements without copy number alterations such as inversions and the need for better resolution of breakpoint junctions with respect to genes in the cases of duplications). The number of patients without definitive diagnosis is not unique to HSCR patients and limitations of the approach taken should at least be acknowledged. While authors have focused on rare CVNs with associated with region loss, roughly 50% of the rare CNV’s detected in table S3 represent duplications. Are these duplications included in the data for figure 3b and if so, do the authors have evidence of gene disruption at breakpoints or that upregulation of critical genes contained within the intervals contribute to HSCR?

Minor points.

Authors use the term “constrain” and “constraint” inappropriately throughout the text. These instances should either use the terms “genes under constraint” or “constrained genes”.

All figures should be numbered to reflect the order that they first appear in the text. For example, supplements S1 and S2 are not mentioned in the main text.

Figure 2. All asterisks in the figures should be appropriated defined in the respective figure legends.

Figure (2a). Label y- axis clearly, is it bp or Mb?

Figure (2d and e). The total number of animals used should be included in this figure/figure legend. At a minimum, authors should indicate within the figure 2 legend that the total number of animals evaluated is presented in S16. In addition, authors should clarify if the P-value presented in figure 2e is reflective of the asterisk to the right or does it indicate a P-value for the bars below.

Figure (2f). In the top panel, the fish are labeled “normal”, however this is then presented with a number 15/23. I assume the authors wish to convey that these are the “control” individuals with a normal distribution pattern. This should be better described in the figure legend.

The term CN should be defined in the text at first usage.

S4) A subset of graphs presented here appear to be the same as those already included in Figures 2a, 2b and 2c.

S6) Please define the headers for this table.

S16) Consistent capitalization/non-capitalization should be utilized (“p”-value and “chi” in S16 , and supplement index).

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Decision Letter 1

Gregory S Barsh

16 Mar 2021

Dear Dr Brosens,

Thank you very much for submitting your Research Article entitled 'Size matters: large copy number losses in Hirschsprung disease patients reveal genes involved in enteric nervous system development.' to PLOS Genetics.

The revised manuscript was seen by the two reviewers of the original submission. As you will see, both reviewers appreciate the changes that were made; however, there are some serious remaining concerns that will need to be addressed for the manuscript to move forward. Most of these concerns have to do with presentation and analysis, and are straightforward though not trivial to address.

We therefore ask you to modify the manuscript according to the review recommendations. Your revisions should address the specific points made by each reviewer. We hope to receive your revised manuscript within the next 60 days. If you anticipate any delay in its return, we would ask you to let us know the expected resubmission date by email to plosgenetics@plos.org.

If present, accompanying reviewer attachments should be included with this email; please notify the journal office if any appear to be missing. They will also be available for download from the link below. You can use this link to log into the system when you are ready to submit a revised version, having first consulted our Submission Checklist.

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Please be aware that our data availability policy requires that all numerical data underlying graphs or summary statistics are included with the submission, and you will need to provide this upon resubmission if not already present. In addition, we do not permit the inclusion of phrases such as "data not shown" or "unpublished results" in manuscripts. All points should be backed up by data provided with the submission.

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.

PLOS has incorporated Similarity Check, powered by iThenticate, into its journal-wide submission system in order to screen submitted content for originality before publication. Each PLOS journal undertakes screening on a proportion of submitted articles. You will be contacted if needed following the screening process.

To resubmit, you will need to go to the link below and 'Revise Submission' in the 'Submissions Needing Revision' folder.

[LINK]

Please let us know if you have any questions while making these revisions.

Yours sincerely,

Gregory S. Barsh

Editor-in-Chief

PLOS Genetics

Gregory Copenhaver

Editor-in-Chief

PLOS Genetics

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The revisions to the text and figure organization have improved this version of the manuscript. However, several points remain outstanding. These impact significantly on logic and readability and/or on data findings:

1) The terms “hypoganglionosis”, “ultra-short segment HSCR”, “short segment HSCR” and “total colonic HSCR” do not belong in the Results or the Figures. The fact that these authors have published their proposed classification strategy in a Review article (Kuil 2020) is immaterial to the point made here. This classification is not accurate or justified (the whole zebrafish intestine is not equivalent to the mammalian colon), is misleading and does not belong in a research paper Results section. Neutral, descriptive terms should be used in the Results to describe the findings. If the authors wish to raise in the Discussion the possibility that zebrafish phenotype severity may relate to HSCR phenotype classifications, and allude to their review article, this would be acceptable. The use of these terms is not only provocative and inaccurate, but also entirely unnecessary for this study, since the effect of gene disruptions are only described as having “significant effect” or “effect” or “significant increase in percentage of larvae that showed a phenotype”. The use of inappropriate terms offers no additional information and only clouds a simple analysis. Further to this general point, the terms “HSCR penetrance in zebrafish” and “HSCR penetrance” should not be used in the Results. The Discussion is the appropriate place to raise these speculative comparisons.

2) The term “overexpressed” is not accurate to describe genes more abundantly expressed in the ENS relative to a reference tissue, and should not be used. These genes are “expressed” in the ENS. The changes made to the manuscript do not address the issue raised.

3) In Figure 1, the schematic representation of the study set-up, the image used in Fig 1B to represent microarrays appears to be of a sequencing gel, though the legend states that this is intended to represent a heat map. This image is unrecognizable as a heat map. It is likely that the authors did not amend the figure to include the BRB-ArrayTools image, as was indicated in their rebuttal letter. Importantly, were heat maps used to identify differentially expressed genes? If so, this analysis should be shown. The methods suggest that straightforward differential expression analysis was used, presumably with some (unspecified) fold-change cut-offs, in which case heat maps should be shown to depict those genes that met this fold change cut-off, with an accompanying data table. Or perhaps scatterplots with fold change cut-offs shown and an accompanying data table. In the schematic, another more appropriate image should be found to represent microarray analysis, and best would be to select an image that reflects the analysis that was actually done. And any relevant analysis should be shown as a figure and/or data table. And the methods should be properly described in the Methods section. See also points 5 and 6 below.

4) The term “ENS gene” is still used before being defined (p. 8, lines: 170, 172, 177, 181), meaning that the logic and readability is still an issue in this version. Is this term even intended to be the same as “ENS gene” as defined on p9? Or is there some other criteria for the term used on p8? If different, this term should also be explained. If the terms are the same, then the definition must come earlier in the text. Moreover, the term as currently defined on p9 is not clear. Presumably an “ENS gene” is one which has higher expression in the developing mouse or human ENS compared with the whole intestine. As written, the definition refers to a sentence also describing CNVs, which is unnecessarily confusing: “Whereas a total of 230 genes affected by a rare CNV, had higher mRNA expression in the developing mouse ENS compared to the whole intestine (n=91 in group 1-3, n=139 in group 4) [43, 44]. We considered these genes to be important for ENS development and will refer to those genes as “ENS genes” throughout this manuscript.” This definition should be simply and clearly stated.

5) The section “Candidate genes with a loss are under constraint and expressed during ENS development” is very confusingly written. The first indication that expression of genes is being examined comes from: “Of these, 514 did not have a known mouse orthologue or probes in the microarray datasets used”. But nowhere is the use of microarray datasets explained in the Results. Certainly the schematic representation of the study set-up in Figure 1 does not adequately present or describe this part of the study (see also point 3, above) and nor does the cursory figure legend text: “The genes included in the CNVs were ranked based on their expression in the developing mouse ENS relative to that of the total intestine (for illustrative purposes a screenshot of a heatmap of differential gene expression derived from brb-array tools (https://brb.nci.nih.gov/BRB-ArrayTools/ )”. Information to understand this study belongs in Results section (see point 3, above). There is no reason not to describe this work simply and clearly.

6) There is no data table corresponding to the “Evaluation of candidate gene expression” to show the complete list of genes this analysis identified as expressed in the mouse ENS or the human embryonic colon. See point 3, above. This is required to show that the analysis was performed correctly and, importantly, to show qualitative expression information for the ENS expressed genes followed up in this study.

7) There is no experimental basis for speculating a role for TUBB in HSCR, since no effect was seen in Crispr/CAS9 experiments targeting tubb1. Therefore, this cannot be stated as is written (p16): “Although we did not find an (e)affect on ENS development in zebrafish, the absence of enteric neurons in the distal colon and other neurological symptoms seen in Patient P_000512, can be likely related to the deletion of TUBB and subsequently to an increased neuronal apoptosis or perturbed neuronal migration or differentiation in combination with the effects of GNL1.” Importantly, in fact, the zebrafish orthologue of TUBB is tubb5, although tubb1 (the TUBB1 orthologue) was tested in these experiments. So it appears that the right experiment has not been done in any case. If so, this would suggest that no conclusion can be made on the role of TUBB in HSCR and the Results and Discussion should be modified accordingly. Finally, the Discussion (in general) is very long, and especially so in this section discussing the lack of evidence for roles of GABBR1 and TUBB.

8) A table should be provided to show the orthologous Mouse and Zebrafish genes for the 1216 Human genes or transcripts present in CNVs of group 1-3 and group 4, to include the supporting source reference for these orthologue assignments (i.e. HGNC, MGI, ZFIN, Ensembl). This is important for point 7, above. Moreover, together with the table indicated in point 6, this information is necessary to provide support for statements such as “Of these, 514 did not have a known mouse orthologue or probes in the microarray datasets used” (p.9, line 201). Given the fact that 99% of human genes have mouse orthologues, it is unlikely that ~half of the 1216 human genes or transcripts lack mouse orthologues, and more likely that the many of the orthologous mouse genes were not present on the arrays. The table should provide information to allow these two possibilities to be clearly distinguished. This should also be discussed in the “Limitations and discrepancies” section.

9) The authors have addressed a comment from Reviewer 2 regarding “constraint”, but in doing so have now made further syntax/grammatical errors: i.e. “genes under constrained” throughout (i.e. Author Summary, Results, Discussion). In general the Author Summary continues to require attention to writing. Punctuation and other grammatical/syntax errors are an issue throughout the entire text and affects readability. Note also “Danio rerio”. The Discussion would benefit from shortening.

Reviewer #2: The authors resubmitted manuscript has undergone a significant rewrite of the manuscript including the approach they undertook, figures, and interpretation of results. This substantial rewrite now provides a clear presentation of the CNV’s identified, associated clinical presentations and analysis of genes located within these regions with respect to enteric nervous system expression and functional validation in zebrafish.

I would suggest the following minor comments be addressed:

The definition if the term “ENS genes” should be made at the first usage of the term online 170, rather than where it is later defined on line 207.

S1. Correct spelling of undetermined

S8. Column 1 header is missing the “t” in patient

Text in table description should include apostrophe to be patient’s

S9. Capitalize “Primer”

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: No: As outlined in comments to the Authors:

1) There is no data table corresponding to the “Evaluation of candidate gene expression” to show the complete list of genes this analysis identified as expressed in the mouse ENS or the human embryonic colon.

2) A table should be provided to show the orthologous Mouse and Zebrafish genes for the 1216 Human genes or transcripts present in CNVs of group 1-3 and group 4, to include the supporting source reference for these orthologue assignments (i.e. HGNC, MGI, ZFIN, Ensembl). The table should also indicate if mouse or zebrafish orthologues of a human gene do not exist. Finally the table should indicate if mouse orthologues of a human gene are present or absent from the microarray datasets analyzed in this study.

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Decision Letter 2

Gregory S Barsh

6 Jul 2021

Dear Dr Brosens,

We are pleased to inform you that your manuscript entitled "Size matters: large copy number losses in Hirschsprung disease patients reveal genes involved in enteric nervous system development." has been editorially accepted for publication in PLOS Genetics. Congratulations!

The revised manuscript was seen by reviewer #1 of the earlier submission; as you will see, they are positive and enthusiastic about the work.

Before your submission can be formally accepted and sent to production you will need to complete our formatting changes, which you will receive in a follow up email. Please be aware that it may take several days for you to receive this email; during this time no action is required by you. Please note: the accept date on your published article will reflect the date of this provisional acceptance, but your manuscript will not be scheduled for publication until the required changes have been made.

Once your paper is formally accepted, an uncorrected proof of your manuscript will be published online ahead of the final version, unless you’ve already opted out via the online submission form. If, for any reason, you do not want an earlier version of your manuscript published online or are unsure if you have already indicated as such, please let the journal staff know immediately at plosgenetics@plos.org.

In the meantime, please log into Editorial Manager at https://www.editorialmanager.com/pgenetics/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production and billing process. Note that PLOS requires an ORCID iD for all corresponding authors. Therefore, please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field.  This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager.

If you have a press-related query, or would like to know about making your underlying data available (as you will be aware, this is required for publication), please see the end of this email. If your institution or institutions have a press office, please notify them about your upcoming article at this point, to enable them to help maximise its impact. Inform journal staff as soon as possible if you are preparing a press release for your article and need a publication date.

Thank you again for supporting open-access publishing; we are looking forward to publishing your work in PLOS Genetics!

Yours sincerely,

Gregory S. Barsh

Editor-in-Chief

PLOS Genetics

Gregory Copenhaver

Editor-in-Chief

PLOS Genetics

www.plosgenetics.org

Twitter: @PLOSGenetics

----------------------------------------------------

Comments from the reviewers (if applicable):

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The changes made, including new experiments and substantial text changes, have significantly improved the logic and readability of this manuscript. This body of work nicely demonstrates how new technologies and cross-species strategies can be applied to human genetics studies.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Genetics data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

----------------------------------------------------

Data Deposition

If you have submitted a Research Article or Front Matter that has associated data that are not suitable for deposition in a subject-specific public repository (such as GenBank or ArrayExpress), one way to make that data available is to deposit it in the Dryad Digital Repository. As you may recall, we ask all authors to agree to make data available; this is one way to achieve that. A full list of recommended repositories can be found on our website.

The following link will take you to the Dryad record for your article, so you won't have to re‐enter its bibliographic information, and can upload your files directly: 

http://datadryad.org/submit?journalID=pgenetics&manu=PGENETICS-D-20-01399R2

More information about depositing data in Dryad is available at http://www.datadryad.org/depositing. If you experience any difficulties in submitting your data, please contact help@datadryad.org for support.

Additionally, please be aware that our data availability policy requires that all numerical data underlying display items are included with the submission, and you will need to provide this before we can formally accept your manuscript, if not already present.

----------------------------------------------------

Press Queries

If you or your institution will be preparing press materials for this manuscript, or if you need to know your paper's publication date for media purposes, please inform the journal staff as soon as possible so that your submission can be scheduled accordingly. Your manuscript will remain under a strict press embargo until the publication date and time. This means an early version of your manuscript will not be published ahead of your final version. PLOS Genetics may also choose to issue a press release for your article. If there's anything the journal should know or you'd like more information, please get in touch via plosgenetics@plos.org.

Acceptance letter

Gregory S Barsh

4 Aug 2021

PGENETICS-D-20-01399R2

Size matters: large copy number losses in Hirschsprung disease patients reveal genes involved in enteric nervous system development.

Dear Dr Brosens,

We are pleased to inform you that your manuscript entitled "Size matters: large copy number losses in Hirschsprung disease patients reveal genes involved in enteric nervous system development." has been formally accepted for publication in PLOS Genetics! Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out or your manuscript is a front-matter piece, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Genetics and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Andrea Szabo

PLOS Genetics

On behalf of:

The PLOS Genetics Team

Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom

plosgenetics@plos.org | +44 (0) 1223-442823

plosgenetics.org | Twitter: @PLOSGenetics

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. The Z-score distributions of total CNV size, CN loss and CN gain of controls and HSCR patients.

    Depicted are the Z-score distributions of all rare CNVs (black lines), rare CN losses (red lines) and CN gains (blue lines). Controls in continuous lines, all HSCR patients in dotted lines. Limited negative z-scores due to the size limit cut-off of 20kb. The z-score distributions of CNV sizes were comparable between groups strongly suggesting that the outliers are responsible for the statistical significant groupwise-differences.

    (DOCX)

    S2 Fig. Risk score comparison between Sanger sequenced and SNP-array genotyped samples.

    Depicted are the calculated noncoding risk scores (RSnc) of patients using the genotypes derived from the 850K array platform. However, we could only do this for the RET and NRG1, as no suitable proxy SNPs were present on the HCS 850k platform for the SEMA3C/D haplotypes (S2 Fig). Therefore, we used (proxy) SNPs present on the GSAMD-v1 chip to determine the Rotterdam population background for all risk haplotypes (group 5, n = 727, Panel B in S2 Fig).

    (DOCX)

    S3 Fig. Brightfield images of zebrafish larvae injected with slc8a1 gRNAs (slc8a1a/slc8a1b).

    Brightfield images of zebrafish larvae injected with gRNAs targeting slc8a1a/b showing severe phenotypes including heart edema, absence of swim bladder and small eyes. Scale bar = 500μm.

    (DOCX)

    S4 Fig. Brightfield images of zebrafish larvae injected with gRNAs.

    Brightfield images of zebrafish larvae injected with gRNAs targeting gnl1, tubb5, ufd1l, usp32, akt3a/b, tbx2a/b, gabbr1a/b and mapk8a/b showing mild phenotypes mainly consisting of the absence of a swim bladder and a down bend tail in case of ufd1l disruption. Scale bar = 500μm.

    (DOCX)

    S1 Table. Rare CNVs detected in this cohort.

    (DOCX)

    S2 Table. Statistical comparisons.

    The number and size of rare CNVs, the number of rare losses and gains, the number of genes considered a CCR, the number of ENS genes per rare CNV, and the relative weighted risk score, were determined and compared for the different groups with a single ANOVA test. If group differences existed (p<0.05), we determined which subgroups were significantly different, using a two-tailed T-test. Abbreviations: CCR; Constrained Coding Region; SNV; single nucleotide variant, CNV; Copy Number Variation, ENS; Enteric Nervous System, RSnc; Risk Score non coding, ND; not determined. Higher values in green, lower values in red. Two-tailed p-values.

    (DOCX)

    S3 Table. HSCR disease gene characteristics.

    Most known HSCR genes are intolerant to genetic variation and are rarely impacted by CNVs in unaffected individuals [39, 40]. These genes have been described to be impacted by CNVs in HSCR patients [7, 27], although this does not seem to be a frequent phenomenon, as in our cohort we did not detect any CNV impacting a known HSCR gene. Abbreviations: mis_z: Missense variation Z-score, syn_z: Synonymous variation z-sore, PLI: probability of being loss-of-function intolerant, PRec: probability of being tolerant to heterozygous loss of function variation but intolerant to homozygous loss of function variation, PNull: probability of being tolerant to loss of function variation (heterozygous or homozygous), del: deletion, dup: duplication, EW: embryonic week, DDD: deciphering developmental disorders project. Data derived from (https://gnomad.broadinstitute.org/) and the DDD control track at: https://genome-euro.ucsc.edu/.

    (DOCX)

    S4 Table. Rare CNV with “ENS genes”.

    (DOCX)

    S5 Table. Variant prioritization and WGS variant burden test results in genes impacted by rare CNV.

    Abbreviations: NVAR: number of variants, NCASEHET: number of heterozygous variants in cases, NCTRLHET: number of heterozygous variants in controls.

    (DOCX)

    S6 Table. Risk haplotype markers and odds ratio’s used to calculate noncoding risk scores (RSnc).

    Highlighted SNPS ingrey/ boldare the risk alleles and Odds ratio’s used in the polygenic risk score calculation. #We did not account for the increased risk of having the two main risk haplotype combinations [1]. @ this risk haplotype was not present in this patient cohort. Sanger sequencing was used to genotype all patients for SNPs known to be associated to HSCR [9, 10, 12, 13]. Patient and controls genotypes were determined using SNPs from the GSAMD-v1 platform. D’ and R2 derived from (https://ldlink.nci.nih.gov/) in European population. Primer sequences can be found in S8 Table.

    (DOCX)

    S7 Table. Description of individual patient genetic risk profiles.

    Depicted are the individual patient’s characteristics, number of predisposing haplotypes (as represented by the risk SNPs) and the noncoding risk score (RSnc). Additionally, the RET risk haplotypes as described inS6 Tableare depicted. In blue the number of “ENS genes” in a CN gain, in red the number of “ENS genes” in a CN loss. Depicted in blue: (1) functional evidence from zebrafish studies and (2) the number of additional patients containing putative deleterious variants in genes within the rare CNVs. $No phasing was performed to discern the most likely haplotype.

    (DOCX)

    S8 Table. Primer sequences common predisposing SNPs.

    (DOCX)

    S9 Table. Details of the gRNA sequences used.

    (DOCX)

    S10 Table. Primers used for Sanger sequencing.

    (DOCX)

    S11 Table. Statistics, group sizes and gRNA efficiencies of zebrafish experiments.

    (DOCX)

    S12 Table. Gene content of rare CNV.

    This list contains all genes in rare CNVs found in the four groups. We determined if a gene was a conserved coding region using the following criteria: sequence variants: PLI≥0.85, z-score for missense variants and/or synonymous variants ≥3. Deletions or duplications: del score ≥ 1, dup score ≥ 1 and cnv score ≥ 1. Additionally, genes impacted more than once by a deletion, or a duplication were not considered a CCR. If these criteria are met, this gene is marked YES in the CCR column. We determined whether the gene is a “ENS gene” based on if the gene expression is higher in isolated “ENS cells” (with and without the addition of GDNF) compared to the expression in other intestinal cells or total intestines in mice [41,42]. These genes are marked YES in the “Mouse ENS E11-E15.5 gene” column. Filtering on CCR, “ENS gene” and CN loss results in our candidate gene list. Note that in addition to the nine genes we disrupted in zebrafish DPCR1 and ZSCAN31 are included. These genes were excluded as they do not contain a zebrafish orthologue as depicted in the second sheet “zebrafish orthologues”. Sheet 3 “count tables” contains the count values used for Fig 2.

    (XLSX)

    Attachment

    Submitted filename: Reviewers_comments_CNV.docx

    Attachment

    Submitted filename: LK_EBR_REBUTTAL_22052021.docx

    Data Availability Statement

    All relevant data are within the manuscript and its Supporting information files. CNV regions are submitted to ClinVar, ID's are included in the manuscript (Table 1). Our ethics committee does not allow sharing of individual patient or control genotype information in the public domain. This study makes use of data generated by the DECIPHER community. A full list of centers who contributed to the generation of the data is available from http://decipher.sanger.ac.uk and via email from decipher@sanger.ac.uk.


    Articles from PLoS Genetics are provided here courtesy of PLOS

    RESOURCES