Abstract
Genetic variants in chromatin regulators are frequently found in neurodevelopmental disorders, but their effect in disease etiology is rarely determined. Here, we uncover and functionally define pathogenic variants in the chromatin modifier EZH1 as the cause of dominant and recessive neurodevelopmental disorders in 19 individuals. EZH1 encodes one of the two alternative histone H3 lysine 27 methyltransferases of the PRC2 complex. Unlike the other PRC2 subunits, which are involved in cancers and developmental syndromes, the implication of EZH1 in human development and disease is largely unknown. Using cellular and biochemical studies, we demonstrate that recessive variants impair EZH1 expression causing loss of function effects, while dominant variants are missense mutations that affect evolutionarily conserved aminoacids, likely impacting EZH1 structure or function. Accordingly, we found increased methyltransferase activity leading to gain of function of two EZH1 missense variants. Furthermore, we show that EZH1 is necessary and sufficient for differentiation of neural progenitor cells in the developing chick embryo neural tube. Finally, using human pluripotent stem cell-derived neural cultures and forebrain organoids, we demonstrate that EZH1 variants perturb cortical neuron differentiation. Overall, our work reveals a critical role of EZH1 in neurogenesis regulation and provides molecular diagnosis for previously undefined neurodevelopmental disorders.
Subject terms: Autism spectrum disorders, Autism spectrum disorders
An apparent redundant role with EZH2 has rendered EZH1 as a secondary player in PRC2-mediated homeostasis regulation. Here, the authors report that gain- and loss-of-function variants in EZH1 cause neurodevelopmental disorders, highlighting its functional relevance.
Introduction
Neurodevelopmental disorders (NDDs) are a group of conditions that arise from impaired development of the central nervous system and include developmental delay and intellectual disability among the most severe manifestations. To date, hundreds of genes, predominantly affected by de novo mutations, have been implicated in the etiology of NDDs1–3. Approximately half of these mutations are nonsense, frameshift or splice site variants that cause loss of function, while the remainder are missense mutations with often unpredictable molecular consequences that are rarely interrogated at a functional level3,4. Thus, despite recent advances in next generation sequencing and molecular diagnosis, many variants remain pathogenically undefined, contributing to the lack of treatment options and diagnostic challenges for patients and their families5.
A group of NDD genes particularly susceptible to pharmacological intervention are chromatin regulators, especially those encoding subunits of Polycomb Repressive Complex 2 (PRC2)6–10. The core PRC2 complex is formed by EED, SUZ12, and one of the two catalytic subunits, EZH1 or EZH2, that sequentially mono-, di- and tri-methylate the lysine at position 27 of the histone H3 (H3K27me1, me2 and me3 respectively). Through the establishment and propagation of H3K27me3, PRC2 maintains transcriptional repression of genes that regulate cellular identities during development and tissue homeostasis in the adult11. The involvement of PRC2 subunits and H3K27me3 demethylases, KDM6A and KDM6B, in the etiology and prognosis of cancers, has motivated an upsurge of pharmacological inhibitors with therapeutic potentials12,13. In addition, dominant de novo pathogenic variants in EZH2, EED and SUZ12 are a common cause of Weaver (OMIM #277590), Cohen-Gibson (OMIM #617561) and Imagawa-Matsumoto Syndromes (OMIM #618786) which are characterized by overgrowth and intellectual disability14–16, and de novo variants in KDM6B and KDM6A are implicated in a syndromic NDD and Kabuki Syndrome (OMIM #147920 and #300867) respectively17–19. Notably, the current availability of H3K27 methyltransferase and demethylase inhibitors renders these H3K27me3-associated developmental disorders susceptible for therapeutical targeting.
Despite evidence involving PRC2 in human development and diseases, the implication of EZH1 is less recognized. EZH1 has traditionally been considered the minor catalytic subunit of PRC2, in part because its histone lysine methyltransferase activity (HMT) is weaker than that of EZH220. During development, EZH2 catalyzes most of the H3K27me3 that represses differentiation genes. Thus, upon deletion of Ezh2, Ezh1 can only partially maintain H3K27me3 and transcriptional repression21–23. Consequently, Ezh2 deletion is fatal for embryonic progression in mice24. In contrast, Ezh1 deletion barely affects H3K27me3 maintenance at early developmental stages and is compatible with life, at least in mouse and zebrafish25,26. Nevertheless, a growing body of evidence suggests that EZH1 is important during differentiation and for postmitotic cells. For instance, EZH1 is required for myogenic gene activation during skeletal muscle differentiation, and for atrophy induced response of myotubes and neonatal heart regeneration in mice27–30. In addition, EZH1 represses multipotency of hematopoietic stem cells in favor of more lineage restricted embryonic progenitors31 and a switch from EZH2 to EZH1 expression co-occurs with the onset of blood cell differentiation32. Indeed, the reduction of EZH2 and maintenance or increase of EZH1 expression during differentiation is common to most cell types, but its functional relevance remains poorly understood.
Similarly, little is known about the contribution of EZH1 to brain development and function. Unlike EZH2, which deletion in neural progenitors accelerates neurogenesis and promotes neuronal differentiation33, EZH1 deletion does not cause overt cellular and molecular brain defects, at least in mice34. Nonetheless, evidence indicate that EZH1 is required for synaptic development of cultured mouse hippocampal neurons35 and to protect neurons from degeneration upon EZH2 deletion34.
Here, we show that EZH1 is necessary for human brain development and function. Using next generation sequencing methods, we identified biallelic truncating and heterozygous missense EZH1 variants in 19 individuals with varying degrees of developmental, language and motor delays, mild to severe intellectual disability and dysmorphic facial features. Notably, the lack of overgrowth as a hallmark distinguishes these patients from those with PRC2-EZH2 associated developmental syndromes. In biochemical studies and cellular models, we demonstrate that patients’ EZH1 variants cause recessive loss of function (LOF) or dominant gain of function (GOF) effects. Furthermore, through functional studies in the chick embryo neural tube development model and in genetically edited human pluripotent stem cells (hPSC) derived cortical neuron and forebrain organoids, we establish that EZH1 LOF and GOF disrupt cortical neuron differentiation. Together, our work shows that a precise regulation of EZH1 activity is required to coordinate neurogenesis and uncovers EZH1 LOF and GOF variants as the genetic basis of previously undefined NDDs with overlapping clinical features.
Results
Identification of EZH1 variants in individuals with undefined neurodevelopmental delay
As part of our ongoing efforts to provide genetic diagnosis for patients with undefined neurodevelopmental disorders, we performed whole exome sequencing on a 15-month-old child showing cognitive, speech and motor development delay. The co-occurrence of the neurodevelopmental symptoms with an atypical facial dysmorphism suggested a genetic cause for the disease. Results pointed to a heterozygous missense variant in EZH1 (NM_001991.5: c.2203 C > G; p.L735F) as the strongest candidate. Through collaborations facilitated by Genematcher36, Deciphering Developmental Disorders project37, and 100,000 Genomes Project38 we expanded our cohort to 19 individuals from 14 unrelated families, all sharing a clinical phenotype of neurodevelopmental delay and carrying distinct variants in EZH1 (Fig. 1, Supplementary Fig 1 and Supplementary Data 1), which supported the implication of EZH1 in the disease pathogenesis.
Further supporting the pathogenicity of these variants, EZH1 is a gene with low missense variant rate in the general population (Z = 4.2)39 and constraints for loss of function variants (o/e = 0.26)39 or gene duplication (pTriplo = 0.98)40. In addition, our patients’ variants are absent in over 140,000 sequences of a sample population reported in GnomAD39 and our internal databases, with exception of three variants (Supplementary Table 1). Among the variants identified, five are biallelic truncating variants present in 10 individuals (Fig. 1a). Two of the individuals are siblings of a consanguineous family carrying a homozygous nonsense variant (c.772C>T; p.R258X) inherited from their parents (Fig. 2a). Another homozygous nonsense variant (c.1453C>T; p.E485X) is present in four siblings and a relative of a large consanguineous family from Saudi Arabia. Interestingly, this family shares haplotype in chr17:40519338–41245021 with an additional affected child from an unrelated consanguineous Saudi Arabian family that carries the same homozygous variant, suggesting a common founder event in the region. Sanger sequencing in available family members confirmed segregation of EZH1 variants according to a strict recessive mode of inheritance with full penetrance in these families (Fig. 2b, c). Furthermore, qPCR and Western blot (WB) analyses in three hPSC clones carrying p.E485X in homozygosity, showed that the variant leads to loss of EZH1 expression (Fig. 2d and Supplementary Fig. 2a). The ninth child carries the variant c.1137C>T; p.Q413X in homozygosity identified by solo exome sequencing (Supplementary Table 1). Finally, the only non-consanguineous case is a child that carries two variants in compound heterozygosity: a large deletion covering exon 8–12 of EZH1 inherited from the mother (c.[664 + 1_665-1]_[1401 + 1_1402-1]del; p.?) and a de novo splice variant affecting the exon 10 splice acceptor site (c.932-1G>A; p.?) (Supplementary Fig. 2b, c). RNA extraction from patient’s cells and analysis by RT-PCR confirmed exclusion of exon 10 (Fig. 2e), which predicts a frameshift that introduces a premature stop codon in EZH1, likely undergoing nonsense mediated decay. Accordingly, RT-qPCR of patient cells showed ~80% reduction of total EZH1 transcripts (Supplementary Fig. 2d) and Western blot (WB) of cell lysates confirmed loss of full length EZH1 protein, which did not affect EZH2 expression levels (Fig. 2f). These data indicates that biallelic EZH1 variants cause loss of function.
The remaining 9 patients carry heterozygous missense variants that affect evolutionarily conserved amino acids with low missense tolerance (Fig. 1a, Supplementary Fig. 3a). p.L735F and p.R728G are each present in two patients from unrelated families and the other five variants are unique to the affected individuals of five unrelated families. Peripheral blood DNA was obtained from parents of three families and absence of the pathogenic variants in the parents of two (for p.A678G and p.R728G) confirmed de novo germline origin. In the third family, the variant (p.K612M) was detected at low frequency in the father suggesting de novo somatic mosaicism with germline transmission to the affected child (Supplementary Fig. 2e).
To estimate the effect of the missense variants in EZH1 function we analyzed them through molecular modeling. Five variants fall within or near the catalytic SET domain, which conserves 94% similarity with EZH2-SET domain11. Although the 3D structure of EZH1 has been recently resolved41, three of these variants fall within a gap in that structure. Thus, we used the 3D structure of EZH2-SET (PDB:5hyn42) as a template to model EZH1-SET domain and predict the effects of the new variants. With this strategy, we were unable to predict a pathogenic mechanism for p.Q731E given that the localization of the residue and the free energy calculations suggest a minor impact in the structure. However, results for p.K612M, p.A678G, p.R728G and p.L735F, show structural changes that may affect the kinetics of the methyltransferase reaction through different mechanisms (Supplementary Fig. 3b). For example, the aromatic ring of the phenylalanine that substitutes L735 residue can generate a stacking interaction with the adenine group of the methyl donor molecule (S-adenosyl methionine (SAM)) that could affect the turnover rate of SAM, and consequently, the efficiency of the H3K27me0 to me3 transition. Likewise, R728 is localized in the vicinity of the SAM binding site and close to the residues forming the binding pocket. We expect that its replacement by a glycine, a residue conferring more conformational flexibility, may induce a local structure destabilization and impact on SAM binding. On the other hand, the substitution of A678 residue for a glycine enlarges the H3K27 interaction pocket of EZH1 providing more flexibility to the structure and likely changing the stability of interactions with H3K27me0-me3. Visual analysis of the p.K612M variants shows that the substitution to a methionine could disrupt the intra-monomeric interactions of the native lysine, thus locally destabilizing the structure of EZH1. Alternatively, the location of 612 residue in the surface of EZH1 suggests that the methionine could also interfere with the interaction of EZH1 with other proteins. Since K612 and A678 residues are present in the recently resolved EZH1 structure41, we also used this structure to predict the effects of their substitutions. Results confirmed our previous predictions of p.K612M and p.A678G in EZH1 structure (Supplementary Fig. 3b).
The remaining two variants fall near the MCSS-SANT2L loop of EZH1. This region contributes to the nucleosome binding of PRC2:EZH1 and differs in sequence similarity, structure and function from the equivalent region in EZH241. The recently resolved 3D structures of PRC2:EZH1 bound and unbound to the nucleosome [PDB:7KSR and 7KSO41) allowed us to model the p.E438D variant, but not p.R406H due to a gap in the 3D structure. Visual analysis of EZH1 with E438 or D438 shows a change in the orientation of the side chain of the aspartic acid compared to the glutamic acid, particularly in the nucleosome bound PRC2:EZH1 structure (Supplementary Fig. 3b). Furthermore, the free energy calculations suggest that the aspartic acid may destabilize the PRC2:EZH1 complex or its interaction with the nucleosome. Together, these data suggest that missense variants likely impact EZH1 catalytic activity through different mechanisms that affect the interactions with H3K27me0-me3, SAM or the nucleosome.
Patients show neurodevelopmental delay with variable clinical presentations regardless of variant type and zygosity
The 19 patients in our cohort share a neurodevelopmental disorder manifested early in life as global motor, speech and cognitive delay leading to intellectual disability, usually non-progressive and co-occurring with dysmorphic facial features (Fig. 1b, c, Supplementary Fig. 1a and Supplementary Data 1). Eleven patients (P3-6, P8-9, P13, P15, P17-19) had magnetic resonance imaging (MRI) and all show mild or unremarkable findings (Supplementary Fig. 1b and Supplementary Data 1). Other clinical findings are variable between patients, regardless of the zygosity or type of mutation. For example, P19 is diagnosed with autism and P9 has a history of severe autism that was first suspected at 1–2 months of age due to a poor eye contact and tracking. Autism spectrum disorder (ASD) is being investigated in P7 as well, but there are no concerns for ASD in other patients. Aggressive behavior is reported in P5-6, who carry the same heterozygous variant (p.R728G), in P10-11 siblings harboring the homozygous p.R258X variant and in P19 with p.Q413X. In addition, P8 started showing aggressive and obsessive-compulsive behaviors at 7 years of age. P5 and P9 are the only patients with macrocephaly (>97%ile), while P1, P3, P10-11 and P13 have microcephaly. P1 and P3, who carry heterozygous missense variants, and P12 with biallelic LOF variants have short stature, but patients P5, P9 and P19 are taller than the average for their age. Together, these clinical presentations suggest that biallelic LOF and heterozygous missense EZH1 variants cause similar NDDs with otherwise variable clinical features that are clearly distinguishable from overgrowth with intellectual disability syndromes associated with PRC2-EZH2 associated variants.
Missense EZH1 variants promote hypermethylation of H3K27
Clinical similarities between patients led us to predict that the effect of the heterozygous EZH1 missense variants could be similar to that of LOF variants, possibly through a dominant negative effect on EZH1 function. Thus, considering that most missense variants cluster within or near the catalytic SET domain of EZH1, we chose three of these variants (p.A678G, p.Q731E and p.L735F) to stably express in a human neural stem cell line (ReNCells) and to monitor their effects on EZH1 expression and H3K27me3 levels. After sorting transduced ReNCells for EGFP co-expression we analyzed protein and histone lysates by WB. Results showed similar EZH1 protein levels for cells transduced with wild type (WT) EZH1 or one of the three variants (Fig. 3a and Supplementary Fig. 4a). However, unexpectedly, p.A678G and p.Q731E displayed increased overall H3K27me3 levels compared to EZH1 WT expressing ReNCells (Fig. 3b). To validate this result, we performed chromatin immunoprecipitation and sequencing (ChIPseq) using H3K27me3 antibodies, which showed a genome wide increase in the proportion of H3K27me3 levels within peaks of p.A678G expressing cells compared to EZH1 WT (Fig. 3c and Supplementary Fig. 4b).
We next analyzed if increased H3K27me3 levels were also detectable by p.A678G and p.Q731E in heterozygous conditions (like in patients). Therefore, we obtained P4 patient fibroblasts (with heterozygous EZH1 p.A678G variant) and generated hPSCs carrying p.A678G or p.Q731E in heterozygosity (EZH1+/A678G and EZH1+/Q731E) (Supplementary Fig. 4c). WB analysis of P4 fibroblasts showed a statistically non-significant trend of increased H3K27me3 levels in comparison to control fibroblasts (Supplementary Fig. 4d). However, in hPSCs, H3K27me3 levels were comparable between mutants and their isogenic controls (Supplementary Fig. 4e). Given that EZH2 predominates over EZH1 in hPSCs22, which may mask the effect of EZH1 on H3K27me3 levels, we next decided to analyze H3K27me3 under conditions where the effect of EZH2 is lower. For instance, it is well established that EZH2 levels decline with cellular differentiation27–32,34,35. For consistency with disease relevance, we chose neurons as the differentiation model where to test H3K27me3 levels. We first analyzed EZH1 and EZH2 levels during standard hPSC to neuron differentiation and confirmed that EZH2 expression declines by the second week in differentiation, while EZH1 levels remain constant (Supplementary Fig. 4f, g). We next differentiated EZH1+/A678G and EZH1+/Q731E to neurons for four weeks, and after validating that EZH1 and EZH2 levels were comparable to controls (Fig. 3d and Supplementary Fig. 4g), we measured H3K27me3 levels. WB analyses revealed a modest non-significant increase of H3K27me3 in EZH1+/A678G and a statistically significant ~50% increase of H3K27me3 levels in EZH1+/Q731E compared to control neurons (Fig. 3e).
To further diminish the contribution of EZH2 and dissect the effect of EZH1 variants in H3K27me3 molecularly, we performed in vitro histone methyltransferase (HMT) assays with reconstituted PRC2-EZH1 complexes and assembled nucleosomes harboring predefined methylation states at H3K27. We first expressed the PRC2 subunits EED, SUZ12, RBAP48, AEBP2, together with EZH1 WT or one of the three variants (p.A678G, p.Q731E or p.L735F) in a baculovirus expression system and intact complexes were successfully purified through tandem affinity purification for all (Fig. 3f, g, Coomassie staining). Versions of PRC2 carrying EZH1 WT or mutants were next incubated with tritiated methyl donor (SAM[3H]) and unmodified nucleosomes (H3K27me0) (Fig. 3f) or nucleosomes harboring di-methylated H3K27 (H3K27me2) (Fig. 3g) as substrate. Samples were run in SDS-PAGE and stained with Coomassie to control for protein levels in each reaction and subsequently transferred to a PDVF membrane to assess H3K27 methylation levels by autoradiography. Results confirmed that mutant PRC2-EZH1 complexes have enhanced HMT activity relative to the PRC2-EZH1 WT complex (Fig. 3f, g). Interestingly, each variant showed slightly different effects and catalytic mechanisms on HMT activity. While EZH1 p.L735F showed only a modest variable increase methylating H3K27me0, EZH1 p.Q731E was more efficient than EZH1 wt methylating both H3K27me0 and H3K27me2 substrates (Fig. 3f, g). In contrast, EZH1 p.A678G was hyperactive on H3K27me2 but hypoactive on H3K27me0 (Fig. 3f, g). This finding implies that EZH1 p.A678G leads to increased H3K27me3 only on nucleosomes harboring H3K27me2, that may be deposited by an EZH1 WT copy or by EZH2 in vivo. In line with these results, many cancer-associated EZH2 mutations exhibit similar kinetic changes that result in their GOF activity. In addition we found that EZH1 p.A678G is equivalent to the cancer-associated EZH2-p.A677G variant, which modifies the substrate preference of EZH2 leading to a change in H3K27me0 to me3 kinetics that results in increased H3K27me3 levels43,44. Thus, collectively, these data demonstrate that, at least, some missense EZH1 mutations create EZH1 GOF variants with increased methyltransferase activity. Combined with EZH1 LOF variants found in the recessive NDD cohort, our observations suggest that precise EZH1 activity regulation is critical for neural development and homeostasis.
EZH1 is necessary and sufficient for neuronal differentiation in the chick embryo neural tube
Having established the molecular consequences of EZH1 variants, we sought to investigate how loss or gain of EZH1 impacts neural development. For this purpose, we first took advantage of the chick embryo neural tube, an in vivo model that has been used traditionally to identify fundamental processes of vertebrate nervous system development and human disease45–47. At its formation (~33 h post fertilization, (hpf)), the chick embryo neural tube is exclusively comprised by neural stem cells that divide symmetrically to expand the progenitor pool within the ventricular zone (VZ). At ~60–70 hpf (~HH18 stage48), a switch from proliferative to neurogenic divisions of neural progenitor cells generates the first differentiating neurons that delaminate from the VZ and migrate laterally to form the mantle zone (MZ)49. This temporospatial segregation provides a convenient way to identify neural progenitor cells and differentiating neurons based on their position in the VZ and MZ of the developing neural tube respectively. Accordingly, by analyzing the expression pattern of EZH2 in transverse sections of the neural tube, we previously determined that EZH2 is predominantly expressed in neural progenitor cells of the VZ where it exerts important functions ensuring neural progenitor pool expansion, polarity and fate specification50,51. Here, we analyzed the expression pattern of EZH1 across the early neural tube development by immunofluorescence staining of transverse sections of HH12, HH23 and HH30 embryos. Results showed that EZH1 is undetectable in HH12 embryo neural tubes, which are exclusively comprised by neural progenitor cells. Similarly, EZH1 was barely detected in the VZ of HH23 and HH30 embryos. However, at these neurogenic stages, EZH1 expression was marked in the newly formed MZ of HH23 neural tubes and expanding MZ of HH30 neural tubes (Fig. 4a).
To determine if the predominance of EZH1 in the MZ is functionally relevant for differentiating neurons, we interfered with EZH1 expression and monitored its effect in neurogenesis. For this purpose, we electroporated plasmids co-encoding EGFP and either a control shRNA (shScrb) or EZH1 targeting shRNAs (shEZH1) into neural progenitor cells of HH14 neural tubes (before the onset of neurogenesis) and analyzed the effect in neuronal differentiation within a period of 48 h (Fig. 4b). As expected, the electroporation of two independent shEZH1s, successfully reduced EZH1 expression (Supplementary Fig. 5a), likely by interfering with the upregulation of EZH1 in the electroporated (EGFP+) neural progenitors cells undergoing differentiation. Moreover, we noticed that the MZ area was smaller in shEZH1 electroporated sides compared to shScrb electroporated or non-electroporated sides of the neural tubes. To further validate this observation, we labeled the MZ with anti-HuC/D differentiating neuron marker and confirmed a 50% reduction of the MZ (Fig. 4c) in neural tubes electroporated with shEZH1. This result suggests that interfering with EZH1 expression during neurogenesis inhibits differentiation of neural progenitors to neurons. Accordingly, the proportion of shEZH1 electroporated (EGFP+) cells localized in the MZ was half of the electroporated cells within the SOX9+ neural progenitor cell pool of the VZ, in strike contrast to the shScrb electroporated EGFP+ cells that were evenly distributed between the VZ and the MZ (Fig. 4c). These data, combined with lack of phenotypes in apoptotic markers and mitotic cell numbers (labeled with active Caspase 3 and pH3 respectively) (Supplementary Fig. 5b, c) indicate that EZH1 upregulation is necessary for neural progenitor cells to differentiate and migrate to the MZ in the chick embryo neural tube.
Next, we overexpressed EZH1 in neural progenitor cells of HH14 neural tubes and assessed the effect on neuronal differentiation compared to the electroporation of a control plasmid (both co-expressing EGFP). 48 h after the electroporation, the control electroporated cells (EGFP+) were evenly distributed between the SOX2+ neural progenitor cell pool in the VZ and HuC/D+ neurons in the MZ, as expected by the switch from proliferative to neurogenic divisions of neural progenitor cells that occurs at ~HH1849, shortly after the electroporation (~16 h after electroporation). In contrast, the overexpression of EZH1 resulted in a significant increase of the electroporated (EGFP+) cells localized within the MZ relative to the VZ (Fig. 4d). Notably, the EGFP+ cells localized in the MZ of EZH1 electroporated neural tubes were also HuC/D+, suggesting that EZH1 overexpression induces neural progenitor cell differentiation and migration to the MZ. This phenotype is often caused by a premature switch of neural progenitor cells from proliferative to neurogenic divisions, which may cause a reduction of the neural progenitor cell pool and consequently, less differentiated neurons over time45,46,49. To test this possibility and also discard a contribution of neural progenitor cell death to the observed phenotype we next quantified pH3+ mitotic cell number, HuC/D+ mantle zone area and cleaved caspase3+ apoptotic cell number (Fig. 4d and Supplementary 5d, e). While none of these quantifications were significantly different between EZH1 and control electroporated neural tubes, more detailed analysis, including extended time windows which are limited by the transient nature of the electroporation, would be required to determine the consequences and the exact mechanisms by which EZH1 affects neuronal differentiation.
EZH1 is expressed in developing and adult cerebral cortex
Genes associated with neurodevelopmental disorders are expressed in the human cerebral cortex, particularly during the critical developmental periods of neurogenesis and synaptogenesis10,52. To ascertain if EZH1 expression fits with this expression pattern, we examined developing and adult human brain transcriptomic databases. RNA sequencing data from the Genotype-Tissue Expression (GTEx) portal showed that EZH1 is similarly expressed across different regions of the adult human brain, including the cerebral cortex (Supplementary Fig. 6a). We next explored the BrainSpan gene expression database53, which includes RNA sequencing data from the human cerebral cortex at several pre- and postnatal stages. Results revealed a homogeneous and constant expression of EZH1 across all the pre and postnatal stages. In contrast, EZH2 showed a sharp drop during the developmental window that overlaps with neurogenesis, remaining low thereafter (Fig. 5a). These data suggest a possible central role for EZH1 as the predominant H3K27 methyltransferase beginning at cortical neurogenesis stages, and therefore, a concomitant vulnerability of the developing cerebral cortex to EZH1 loss or gain of function.
EZH1 LOF and EZH1 GOF variants alter cortical neuron differentiation
To test whether EZH1 LOF and GOF variants affect cortical neurogenesis, we took advantage of hPSC derived neurodevelopmental models, which provide unique and versatile resources to study mechanisms of neurodevelopmental disorders54–56. To achieve this goal, we generated isogenic hPSCs carrying an EZH1 LOF mutation (EZH1−/−) and one of the patients’ GOF variants (EZH1+/A678G) using CRIPSR/Cas9 genome engineering technology and subsequently validated editing, pluripotency and genome integrity (Supplementary Fig. 6b–e). WB results confirmed loss of EZH1 in EZH1−/− and intact EZH1 levels in EZH1+/A678G (Supplementary Fig. 6f).
To investigate the effects of EZH1−/− and EZH1+/A678G in cortical neuron differentiation, we first generated cortical neural progenitor cells (NPC) from hPSCs using standard protocols57. As expected, all NPC cultures expressed PAX6 and NESTIN (Supplementary Fig. 6e). We also noted that EZH1−/− NPCs continued expanding even after neuronal differentiation induction, while EZH1+/+ and EZH1+/A678G NPCs gradually stopped proliferating and acquired morphological features of differentiating neurons. To validate this observation, we first assessed proliferation rates by EdU incorporation and labeling at 0, 2 and 5 days after induction to neuronal differentiation. Consistent with our observation, the rate of EdU+ cells gradually decreased upon differentiation of EZH1+/+ and EZH1+/A678G NPC cultures. However, EZH1−/− cultures showed similar proliferation rates across the 5 days in differentiation media (Fig. 5b), suggesting failure of EZH1−/− NPCs to exit cell cycle and differentiate. To rule out the possibility that this result was due to inter-culture cell type heterogeneity, we co-labeled cells with the NPC specific marker SOX2, the proliferation marker Ki67 and the neuronal cell marker HuC/D at day 0 and 5 of differentiation and analyzed the percentages of proliferating NPCs (Ki67+ SOX2+) and differentiating neuronal populations (HuC/D+) by flow cytometry. The analysis confirmed that, regardless of the genotype, most cells are Ki67+ SOX2+ proliferating NPCs and HuC/D− at day 0 (Fig. 5c, d, Supplementary Fig. 7a). At day 5 of differentiation, there was a marked decrease of the proliferating NPC population (Ki67+ SOX2+) in EZH1+/+ and EZH1+/A678G cultures, while in EZH1−/− the NPC population was still large (Fig. 5c). Consistently, the increase of differentiating neuron population (HuC/D+) from day 0 to 5 was significantly lower in EZH1−/− cells as compared to EZH1+/+ (Fig. 5d). Furthermore, when we assessed morphological features of differentiating neurons at day 2, EZH1−/− cells exhibited short neurites resembling a less mature differentiation stage than EZH1+/+ and EZH1+/A678G differentiating neurons (Fig. 5e). Interestingly, this analysis also revealed the opposite effect in EZH1+/A678G cells, which showed longer neurites than EZH1+/+ cells (Fig. 5e) possibly representing a more mature stage in the differentiation process. Altogether this data indicates that EZH1 is necessary for the induction of neuronal differentiation from human NPCs.
Given the known function of PRC2 proteins coordinating the maintenance of gene repression programs during different stages of neural development33,58,59, we anticipated that the effects on neuronal differentiation we observed with EZH1 LOF and GOF could be caused by dysregulation of genetic programs involved in diverse neuronal differentiation pathways. With the goal to identify these genes, we performed a transcriptomic analysis of 2-month-old neurons differentiated from EZH1+/+, EZH1−/− and EZH1+/A678G hPSCs. At this differentiation stage, EZH2 expression is lower than in earlier differentiation stages and, consequently, EZH1 effects to the transcriptome could be exposed further. After performing principal component analysis (PCA) to confirm consistency between replicates and test similarities between samples (Supplementary Fig. 7b), we identified 852 down- and 546 up-regulated genes in EZH1−/− and 1360 down- and 644 up-regulated genes in EZH1+/A678G neurons compared to controls (Fig. 5f, Supplementary Data 2–3). Gene ontology (GO) enrichment analysis on downregulated genes showed only terms unrelated with neuronal differentiation and highlighted few significantly enriched terms among the upregulated genes. Interestingly, genes upregulated in EZH1−/− were significantly enriched for GO terms associated with mitosis while positive regulation of differentiation was among the top significantly enriched term in EZH1+/A678G neurons (Supplementary Fig. 7c).
Following this observation, we focused our enrichment analysis on genes specifically expressed along cortical neural development using Gene Set Enrichment Analysis (GSEA) and curated gene sets that included cortical neural stem cell markers, early-born neuron markers and late-born neuron markers (see methods section). Consistent with our previous observations, neural stem cell marker gene set was significantly enriched with EZH1−/− upregulated genes, while differentiated neuron gene sets were downregulated (Fig. 5g). In contrast, neural stem cell gene set was enriched with genes downregulated in EZH1+/A678G vs EZH1+/+, while differentiated neuron markers, and specifically the late-born neuron gene set, was enriched with upregulated genes (Fig. 5g). Interestingly, SATB2 and CUX1, which are two genes expressed in late born cortical neurons, were among the genes upregulated in EZH1+/A678G vs EZH1+/+ (Supplementary Fig. 7d and Supplementary Data 3). This data supports that EZH1 variation alters neuronal differentiation, with EZH1 LOF enriching progenitor stages and EZH1 GOF favoring features of differentiated neurons.
EZH1 LOF and GOF variants alter neurogenesis in forebrain organoids
We next sought to determine if defects in neuronal differentiation we observed with EZH1 LOF and GOF variants were correlated with changes in cortical neurogenesis. Although monolayer cortical neuron cultures generate large amounts of grossly homogeneous cells ideal for robust mechanistic studies, each analyzed timepoint captures the result of rather variable longitudinal differentiation process, which hinders our ability to determine if the phenotypes observed in each condition are caused by defects in neurogenesis or by other factors. In contrast, forebrain organoids are formed by self-organized neuroepithelial structures that hold neural progenitor cells in the ventricular zone (VZ) at the apical membrane and early and late born neurons progressively located near the basal membrane60. This spatiotemporal segregation of neurons and their progenitor cells makes it possible to assess neurogenesis rate at selected timepoints and compare them between conditions, by normalizing the number of differentiated neurons by the number of neural progenitors in each neuroepithelial structure. Thus, to test if EZH1 LOF and GOF mutations cause defects in cortical neurogenesis, we generated EZH1+/+, EZH1−/− and EZH1+/A678G forebrain organoid following a previously described protocol60. After a month in culture, all organoids showed neuroepithelial structures comprised by SOX2+ NPC-containing VZs. While the thickness of the VZs was similar between EZH1+/A678G and EZH1+/+ organoids, EZH1−/− exhibited thicker VZs (Fig. 6a). These data is consistent with defects in neuronal differentiation we observed in EZH1−/− NPCs monolayer cultures and suggests longer proliferation rates or deficits in neuronal differentiation.
We next analyzed 60-day old forebrain organoids. At this age, wild type forebrain organoids were comprised by SOX2+ NPCs in the VZ, and CTIP2+ early-born neurons around them. The first few SATB2+ late-born neurons were also detected in EZH1+/+ organoids at this stage. Interestingly, the analysis of 60-day old EZH1−/− and EZH1+/A678G organoids revealed marked differences compared to wild type organoids. Specifically, EZH1−/− organoids showed less CTIP2+ and TBR1+ early-born neurons over the number of underlaying SOX2+ NPCs (Fig. 6b and Supplementary Fig. 8a). In contrast, the relative amount of CITP2+ neurons in EZH1+/A678G organoids was comparable to EZH1+/+ organoids, but the amount of SATB2+ late-born neurons was significantly higher than in EZH1+/+ organoids (Fig. 6b). Although several neurobiological processes can lead to these phenotypes, lack of significant differences in cleaved Caspase3+ cells between genotypes (Supplementary Fig. 8b) and lack of ectopically localized neurons or NPCs in neither EZH1−/− or EZH1+/A678G organoids (Fig. 6 and Supplementary Fig. 8a) discard defects in cell death or migration as the source of the neurogenesis defects and suggest that EZH1 GOF and LOF variants affect the tempo of cortical projection neuron differentiation.
In summary, these results indicate that EZH1 variants alter neurogenesis to specific cortical neuron populations which may ultimately result in defective development of neuronal networks causing the overlapping neurodevelopmental disorders associated with EZH1 LOF and GOF variants.
Discussion
In this study, we uncover recessive and dominant EZH1 variants as the cause of overlapping neurodevelopmental disorders and demonstrate an evolutionarily conserved function of EZH1 regulating neuronal differentiation. The recessive variants include three homozygous nonsense mutations predicted to impair EZH1 expression in 9 children from 4 independent consanguineous families. We also identified a sporadic case with biallelic splice and deletion variants. The experimental confirmation of loss of EZH1 expression in cells carrying either one of the homozygous nonsense mutations or the biallelic splice and deletion variants, indicates that recessives variants in EZH1 cause LOF effects. These findings are consistent with a homozygous frameshift EZH1 variant (c.237delT; p.L80Sfs*6) recently identified in two siblings through an independent genetic study61. Like patients in our cohort, the two siblings in this study were diagnosed with intellectual disability and showed dysmorphic faces. Combined, these data support that EZH1 LOF variants are a likely cause of recessive NDDs.
The remaining 9 patients in our cohort all carry heterozygous missense EZH1 variants that likely occurred de novo. To our knowledge, missense EZH1 variants have never been associated with a human disease before, except for somatic EZH1 p.Q571R and p.Y642F mutations that are recurrent in minimally invasive thyroid and parathyroid tumors62–64. Although it is unclear how these EZH1 mutations contribute to tumor formation or progression, the molecular characterization of EZH1 p.Q571R variant suggests a GOF effect63 reminiscent of the missense mutations in our NDD cohort. GOF mutations in EZH2 are also common in human cancers that motivated the development of EZH inhibitors as therapeutic agents44. Notably, one such mutation (i.e. EZH2 p.A677G) is equivalent to EZH1 p.A678G variant found in our patient P443,65,66. Although none of our patients show malignancies, the molecular coincidence corroborates the GOF nature of the EZH1 p.A678G mutations in our NDD cohort. Nonetheless, it is possible that each missense variant affects EZH1 function in different ways. In line with this idea, our in vitro HMT assays revealed distinct effects on H3K27 methylation in the three EZH1 missense variants altering conserved SET domain residues that we tested (Fig. 3). Furthermore, two missense variants (p.R406H and p.E438D) fall far from the catalytic SET domain and near the EZH1 MCSS-SANT2L loop. Remarkably, the MCSS-SANT2L loop has recently been associated with an EZH1 specific ability to bind and condense chromatin in a methyltransferase activity independent manner41. This finding suggests a potential effect of variants p.R406H and p.E438D on EZH1 mediated chromatin condensation that warrants further investigation. Therefore, although our functional experiments show GOF effect for two missense EZH1 variants, each missense variant will need to be carefully analyzed for effects on EZH1 activity.
Likewise, further work is required to better understand how EZH1 LOF and GOF variants converge on overlapping neurodevelopmental phenotypes. Our work shows apparent opposing effects of EZH1 LOF and GOF in neural development. While EZH1 LOF leads to less differentiated neurons in the chick embryo neural tube and in hPSC derived forebrain cortical organoids, EZH1 GOF promotes neuronal differentiation. Interestingly, these opposing defects are consistent with an asynchronous neurogenesis that is emerging as a shared neurodevelopmental defect causing genetically heterogeneous NDDs55. Like in KMT5B, CHD8 and ARID1B haploinsufficient brain organoids that were recently reported55, EZH1 LOF impairs cortical projection neuron neurogenesis. Although the effect of EZH1 GOF in neurogenesis appears the opposite, premature neurogenesis of specific neurons could lead to asynchronous development of different neuronal classes ultimately converging with EZH1 LOF on abnormal development of neuronal circuits and consequent overlapping neurodevelopmental phenotypes. Regardless, our work demonstrates that cortical neurogenesis is vulnerable to EZH1 dosage and suggests that EZH1 LOF and GOF may alter the epigenetic transcriptional regulation of neurodevelopmental programs in a temporal and lineage specific manner.
Although other core PRC2 subunits have been genetically implicated in developmental syndromes67, there are marked differences between these syndromes and the EZH1 associated NDDs identified in our study. Specifically, the hallmarks of patients with pathogenic variants in EZH2, EED and SUZ12 are pre- and postnatal overgrowth with dysmorphic facial features and intellectual disability67, while most patients in our EZH1 cohort have short or normotypic statures. Additionally, despite the overlap in the manifestation of intellectual disability, the neurobiological mechanisms causing neurodevelopmental defects are likely different. Pathogenic mechanisms that cause intellectual disability in PRC2 associated overgrowth syndromes remain unclear, however it is well established that PRC2-EZH2 maintains proliferative neural progenitor pools and regulates their fate transitions during development33,50,59. In contrast, our work shows that EZH1 is necessary for neuronal differentiation and neurogenesis of cortical projection neurons. Although additional work is needed to determine the exact neurobiological pathways involved in EZH1 mediated neurogenesis regulation, our data is consistent with a previously reported function of EZH1 in synaptic spine maturation and with the switch from EZH2 to EZH1 predominance that occurs as neural progenitors differentiate35 (Fig. 5a). Together, these data support that EZH1 and EZH2 have non-redundant functions in neural development. However, future studies will need to elucidate whether the transition from proliferation of neural progenitors to neurogenesis is triggered by the decline of EZH2 expression or the predominance of EZH1. Furthermore, recent evidence indicates that PRC2 complexes can dimerize and colocalize in chromatin41,68, thus, introducing a third possibility that includes the ratio between EZH1 and EZH2, or the relative levels of PRC2-EZH1 and PRC2-EZH2 homo and heterodimers, as determinants of cell fate transitions during neurogenesis.
NDDs caused by chromatin dysregulation are attractive candidates for treatment with epigenetic drugs. Recent years have witnessed an exponential increase in the development of epigenetic drugs, mostly purposed for the treatment of human cancers. Although these drugs are not recognized as treatment options for NDDs, the similarity of EZH1 GOF variants with those found in EZH2 associated cancers suggest a potential route for epigenetic recovery of EZH1 GOF using EZH1/2 inhibitors. Furthermore, H3K27me3 demethylase inhibitors may have the potential to counterbalance EZH1 or PRC2 LOF variants in NDDs. Treating NDDs presents several challenges, including the prenatal onset of the disease and the requirement for brain penetrant compounds. However, there are successful examples of postnatal reversal of neurodevelopmental phenotypes in animal models of Rett, Kabuki and Angelman syndrome69–71, which hold promise for therapeutic targeting of PRC2 NDDs. Overall, our work uncovers a critical role of EZH1 in human neural development, provides a molecular diagnosis for patients with previously undefined NDDs and offers evidence to classify EZH1 variants as recessive LOF or dominant missense with a likely GOF effect.
Methods
Ethical regulations and study participants
This study was approved by the Institutional Review Boards of the Children’s Hospital of Philadelphia, Boston Children’s Hospital, University College London, Guy’s Hospital, Kennedy Krieger Institute, King Faisal Specialist Hospital & Research Center and University of Alabama in Birmingham. Ethical approval and informed consent were obtained for participation, phenotyping, sample collection and generation/derivation of patient and control cell lines including fibroblasts, lymphoblastoid cell lines, peripheral blood mononuclear cells and their reprograming. The authors also confirm that human research participants provided written informed consent for publication of the images in Fig. 1 and Supplementary Fig. 1. Since this study is focused on gene identification, all patients with confirmed diagnosis are included in the study, regardless of their gender and origin. Gender of each patient is indicated in Supplementary Data 1. Patients included in the study were identified based on genetic diagnostic through national and international collaborations and belong to diverse origins, including Asia, Europe and USA.
Identification of pathogenic EZH1 variants
Pathogenic variants in EZH1 were identified by exome or genome sequencing performed on blood genomic DNA extracted from the probands identified through diagnostic clinical practice or connections through GeneMatcher30 and Deciphering Developmental Disorders Research Study31 repositories. When available, blood DNA form parents and siblings was also sequenced. Data analysis was performed to assess quality of sequence reads and variants filtered with several pipelines72–75 according to mode of inheritance, conservation, predicted pathogenicity and frequency in population. For P8 specifically exome sequencing was performed using the Agilent SureSelect Exome bait design V5 (Agilent, Santa Clara, CA, USA) and Illumina HiSeq2500 sequencer (Illumina Inc, San Diego, CA, USA) at the Center for Applied Genomics at the Children’s Hospital of Philadelphia. The data was quality controlled and analyzed with BWA v0.7.1276, Picard v1.97 (http://broadinstitute.github.io/picard), and GATK v2.6.577. This resulted in a mean target coverage depth of 68.5x. ANNOVAR78 and SnpEff79 were used to functionally annotate the variants and collect minor allele frequency (MAF) data from 1,000 Genomes Project80, ESP6500SI (http://evs.gs.washington.edu/EVS/), ExAC81, gnomAD39, and Kaviar82. Relatively common variants were excluded based on MAF threshold of 0.1% in either population dataset, and functional annotation such as synonymous, non-exonic, and non-splicing-altering were also included.
Variants were reported according to standardized nomenclature recommended by the Human Genome Variation Society83 on EZH1 transcript GenBank: NM_001991.5. Genomic sequencing data in gnomAD39 and our internal databases were used to determine the frequency of identified variants in the population and residue specific missense tolerance score retrieved from Metadome84. Conservation was determined by alignment of EZH1 and EZH2 protein sequences from different species using ClustalW285.
Structural characterization of missense variants
For each variant we built a three-dimensional model using the package MODELLER86. The structure of EZH1 (PDB: 7KSO and 7KSR) was used as a template for the E438D variant. For variants R406H, R728G, Q731E, L735F, because 7KSO did not cover their locations, we followed a simple, two-step protocol. First, we created a model of the native EZH1, using the experimental structure of EZH2 (PDB: 5HYN) as a template. Then, we utilized the resulting model to build the structure of the desired variants. Variant A678G was also analyzed using this model, because it includes an H3K27me3 mimetic peptide which allows us to assess the effect of A678G on the catalytic pocket. In the case of L735F, to illustrate the potential stacking interaction between the aromatic side chain of the phenylalanine amino acid and the SAH molecule, we added local information from the structure of the SMYD3-SAM complex (PDB: 3MEK).
RNA extraction, cDNA generation and PCR
Total RNA was extracted from patient and control LCLs using TRIzol Reagent (Invitrogen, # 15596026) and purified using RNeasy Plus Mini Kit (Qiagen). For splice site analysis, directed reverse transcription was performed using 0.5 μg of total RNA and 0.2 µM of reverse EZH1 or GAPDH primers (Supplementary Table 2) with SuperScript™ III First-Strand Synthesis System (Invitrogen, # 18080051). Standard PCR protocol was performed using 0.5 µM forward and reverse primers (Supplementary Table 2). Amplicons were ran in a 2% agarose gel and imaged using BioRad Gel Doc EZ Imager. For quantification of transcripts by RT-qPCRs, 0.5–2 μg of total RNA was retrotranscribed with oligodTs and SuperScript™ III First-Strand Synthesis System following standard protocol. Next, qPCR was performed in a 10 µL reaction volume containing 4.6 µL of cDNA sample, 0.5 µM of forward and revers primers (Supplementary Table 2) and 1X Power SYBR Green PCR Master Mix in a Biorad CFX384 Touch Real-Time PCR. Primers were designed using Benchling’s Molecular Suite Primer3 program or NCBI Blast.
In vitro histone methyltransferase assay
Standard HMT assays were performed in a total volume of 15 mL containing HMT buffer (50 mM Tris-HCl, pH 8.5, 5 mM MgCl2, and 4 mM DTT) with 10 mM of 3H-labeled S-Adenosylmethionine (1:20 of 3H-labeled:unlabeled), 15 mM of H3K27me3, indicated amount of recombinant human PRC2-EZH1 complexes and 120 nM of either unmodified or H3K27me2 mononucleosomes. The reaction mixture was incubated for 60 min at 30 °C and stopped by the addition of 4 mL SDS buffer (0.2 M Tris-HCl, pH 6.8, 20% glycerol, 10% SDS, 10 mM β-mercaptoethanol, and 0.05% Bromophenol blue). After HMT reactions, samples were incubated for 5 min at 95 °C and separated on SDS-PAGE gels. The gels were then subjected to Coomassie blue staining for protein visualization or wet transfer of proteins to 0.45 mm PVDF membranes (Millipore). The radioactive signals were detected by exposure on autoradiography films (Denville Scientific).
ReNcell VM culture
ReNcell VM human neural precursor cells (Sigma Aldrich, SCC008) were plated onto Matrigel (Corning, 354277) coated T75 cell culture flasks and cultured in ReNCells media containing 1:1 ratio of N2 and B27 media supplemented with 20 ng/mL Fibroblast Growth Factor (bFGF) (Thermo Fisher, #PHG0261) and 20 ng/mL Epidermal Growth Factor (bEGF) (Thermo Fisher, # PHG0261). N2 media contains DMEM/F12 (Gibco, #11330032), 1X N2 neural supplement (Gibco, #17502048), 5 μg/mL Insulin (MilliPore Sigma, #I9278), 1X GlutaMAX™ (Gibco, # 35050061), 100 μM MEM Non-Essential Amino Acids Solution (Gibco, #11140050), 100 μM β-Mercaptoethanol (Gibco, #21985023), 1 mM Sodium Piruvate (Gibco, #11360070) and 1% Penicillin-Streptomycin (Gibco, #15140122). B27 media contains Gibco Neurobasa medium (Gibco, #21103049) supplemented with 1X B-27 neural supplement (Gibco, #17504044), 1X GlutaMAX™ and and 1% Penicillin-Streptomycin. Cell culture media was changed every 2 days and cells passaged using Accutase (Gibco, #A1110501) when reached 90% confluency.
Lentiviral production and transduction of ReNCells
Full-length cDNAs of human wildtype or variant EZH1 were cloned between the EF1A promoter and T2A-EGFP sequence in the pLV plasmid (VectorBuilder). For lentiviral production, HEK293-T (ATCC CRL-3216) cells were plated at 1 × 106 cell/well density in a well of a 6-well plate with 2 ml of media (1X DMEM (Gibco, #11995073) supplemented with 10% FBS (Tissue Culture Biologicals, #101), and 1% Penn/Strep (Gibco, #15140122)). The following day, cells were transfected using polyethyleneimine (PEI) with 1.06 μg MDLg/RRE (gag/pol expression plasmid), 0.57 μg MD2.G (VSV-G envelope expressing plasmid), 0.4 μg pRSV-Rev (rev expression plasmid), and 1.6 μg of transfer vector (pLVs) per well. 48 h later 2 mL of media with lentiviruses were collected and filtered through 0.45 μm pore filters. 100–200 μL of filtered lentiviral media was added to ReNcells seeded the day before at 800,000 cells/well density in a well of a 6 well plate. ReNcells were incubated with lentiviruses for 48 h after which time media was replaced with fresh ReNcell media and cells cultured for 5 days before sorting with BD FACS Aria Fusion flow cytometer.
Chick embryo neural tube electroporation
Eggs from white Leghorn chickens were eincubated at 38.5 °C and 70% humidity. Embryos were staged according to the method of Hamburguer and Hamilton (1951). In ovo electroporations were performed at stage HH14 (~54 h of incubation with 22 somites) with purified plasmid DNA at 3 μg/ml in H2O and 50 ng/ml Fast Green. For EZH1 overexpression, EZH1 cDNA was cloned between the EF1A promoter and T2A-EGFP in the pLV plasmid (VectorBuilder). EZH1 shRNAs (shEZH1(1):GAAGTGTTCCAAGAAACGG, shEZH1(2): CAGTGTACAGTTGAGAGCA) were cloned into the pSHIN plasmid87. Electroporations were performed delivering five 50 msec pulses of 20–25 V. Embryos were collected 48 hrs post electroporation. Sex is not identified at these stages. According to the EU (where these experiments were performed) animal care guidelines no IACUC approval was necessary to perform the chick embryo experiments, since the embryos used in this study were all in early stages of embryonic development (between embryonic day 2 and 7).
Generation of patient derived cell lines
P4 patient and unrelated control fibroblasts were established from skin biopsies cultured in MEM media (Gibco, 11095098) supplemented with 20% FBS (Tissue Culture Biologicals #101) and 1% Penicillin-Streptomycin (P/S) (Gibco, #15140122). P12 patient and unrelated control lymphoblastoid cell lines (LCLs) were established from peripheral blood lymphocytes and cultured and expanded in RPMI 1640 Medium (Gibco, #11875093) supplemented with 15% FBS (Tissue Culture Biologicals #101), 1X GlutaMAX (Gibco, # 35050061) and 1% P/S (Gibco, #15140122) in T75 flasks incubated at 37C in an upright position. P12 patient peripheral blood mononuclear cells (PBMCs) were isolated from fresh blood and reprogrammed with Sendai viruses expressing human Oct3/4, Sox2, Klf4 and c-Myc (Thermo Fisher Scientific, A16517).
Pluripotent stem cell culturing
H9 human pluripotent stem cells (hPSCs) (WiCell WA09), KOLF2.1J (Jackson Laboratories JIPSC1000) and P12 iPSCs were cultured on Matrigel (Corning, #354277) coated plates in mTeSR1 media (STEMCELL Technologies, #85850). 10 μM Rho-associated protein kinase (ROCK) inhibitor (Y-27632, Tocris, #1254) was only added into the culture media overnight when thawing the ESC/IPSCs. The medium was changed every day and cells were passaged when they reached 70% confluence, approximately every 5–7 days, using Versene (Gibco, #15040066) at a 1:15 or ReleSR (STEMCELL Technologies, #100-0484) at a 1:30 ratio.
Generation of genetically modified hPSCs by CRISPR-CAS9
The 20 bp sgRNA sequences were cloned into the PX458 vector (Addgene, #48138). Single stranded DNA oligonucleotides (ssODNs) were synthesized by Integrated DNA Technologies (IDT). ROCK inhibitor was added to the culture media 24 h prior to electroporation. On the day of electroporation, Human Stem Cell Nucleofector Kit 1 (LONZA, #VPH-5012) was prepared based on the manufacturer’s protocol. hPSCs were singularized with Accutase (Gibco, A1110501) and counted with Countess II Automated Cell Counter. To generate EZH1+/A678G clones, 3.5 μg PX458 plasmid (Addgene, #48138, from Feng Zhang) containing target sgRNA sequence and a mix of 1 μg ssODN containing the missense variant and two synonymous variants that disrupt the target sequence plus 1 μg of an ssODN containing only the two synonymous variants were transfect in 1.2 × 106 singularized hPSCs. To generate KO clones, 3.5 μg PX458 plasmid was transfect in 1.2 × 106 singularized hPSCs. To generate EZH1E485X/E485X clones, 3.5 μg of PX458 plasmid containing the target sgRNA sequence and 2 μg ssODN containing the nonsense mutation and a synonymous variant that disrupt the PAM sequence were transfected in in 1.2 × 106 singularized hPSCs. The Amaxa Human Stem Cell Nucleofector Kit I and B-016 nucleofector program was selected on Amaxa Nucleofector for electroporation. After electroporation, the cells were transferred to a Matrigel coated well with 10 μM ROCK inhibitor in the media. 24 h after electroporation, cells were fed with fresh mTeSR1 and 1% Penicillin-Streptomycin (P/S) (Gibco, #15140122). 48 h after electroporation, the cells were singularized with Accutase, resuspended in mTeSR1 with 1%P/S, and filtered through 35 μm nylon mesh cell strainer cap tube (Falcon, #352235) and 1,000–2,000 GFP positive cells were sorted through the BD FACSAria™ Fusion Flow Cytometer, collected in mTeSR1 with CloneR supplement (STEMCELL Technologies, #05888), and seeded on a 10 cm dish coated with Matrigel. Single colonies were manually picked on day 8 to 10 and half of the colony was transferred to Matrigel coated 96-well plates and the other half processed for genomic DNA extraction followed by PCR and Sanger sequencing. Successfully edited clones were identified and two of them selected for experiments. Heterozygous clones were further validated by two rounds of serial single cell isolation, clone picking for sanger sequencing. EZH1+/Q731E hPSCs were generated by Synthego. The primers, sgRNAs, and ssODNs used for genome editing are listed in Supplementary Table 3.
Cortical neuron differentiation
Cortical neurons were generated as previously described with few modifications57. Briefly, hPSC colonies were disassociated with Accutase, and seeded at ~50,000 cells/cm2 density on Matrigel coated wells in mTeSR1 + 5 µM ROCK inhibitor. Next day media was replaced by neural induction media (50% DMEM/F12 and 50% Neurobasal, supplemented with N2 (Gibco, #17502048, 1:100), B27 (Gibco, A3582801, 1:50), GlutaMAX (Gibco, #35050061, 1:100), and 50 µM BME (Gibco, #21985023) 100 nM LDN193189 (STEMCELL Technologies, #72147) and 10 µM SB431542 (Stemgent, #04-0010-10)). 5 days later cells were singularized with Accutase and plated at a ~250,000 cells/cm2 density in Matrigel coated wells, in neural induction media for additional 6 days. Next day cells were singularized again and plated at a ~250,000 cells/cm2 on matrigel coated wells in neural progenitor expansion media (50% DMEM/F12 and 50% Neurobasal, supplemented with 1X N2, 1X B27, 1X GlutaMAX, 50 µM BME and 20 ng/ml bFGF). Media was replaced everyday. On Day 25, NPCs were singularized and plated on 100 µg/ml Poly-l-ornithine (PLO) (Sigma-Aldrich, P3655) and Matrigel coated wells in neuronal differentiation media (50% DMEM/F12 and 50% Neurobasal supplemented with 1X B27, 1X GlutaMAX, 50 µM BME, 10 ng/ml BDNF (Biothchne, #248-BD/CF), 10 ng/ml GDNF (Biothchne, #212-GD/CF), and 1 µg/ml laminin (Gibco, #23017015). Thereafter, ½ media was replaced with fresh media every other day.
Cortical organoids differentiation
Forebrain organoids were generated as previously described60. On Day 0, iPSC colonies were detached by ReLeSR and aggregated to form EBs by Aggrewell (STEMCELL Technologies, #34811). The following day, EBs were resuspended and transferred to 6-well plate (Corning Costar) rotating at 110 rpm, containing DMEM/F12, 20% KnockOut Serum Replacement, 1X Non-essential Amino Acids, 1X GlutaMax, 1 μM LDN193189, 5 μM SB-431542 and 2 μg/mL heparin (STEMCELL Technologies). On Day 6, half of the medium was replaced with induction medium consisting of DMEM/F12, 1X N2 Supplement, 1X Non-essential Amino Acids, 1X GlutaMax, 1 μM CHIR99021 and 1 μM SB-431542. On Day 7, organoids were embedded in Matrigel and cultured stationarily in Low-attachment plate (Corning Costar) in the induction medium. On Day 14, organoids were mechanically dissociated from Matrigel by manual pipetting in a 10 mL pipette tip. Organoids were transferred to 6-well plate rotating at 110 rpm, containing differentiation medium consisting of DMEM:F12/Neurobasal, 1X N2 and 1X B27 Supplements, 1 × 2-Mercaptoenthanol, 1X Non-essential Amino Acids, and 2.5 mg/ml human Insulin. From Day 35 to Day 60, 1% Matrigel was supplemented in differentiation medium.
Fluorescent activating cell sorting (FACS)
Cells were disassociated with Accutase™ (Gibco, #A1110501) and pelleted at 200 g for 5 min. Pellet was resuspended with 1 mL of 1X PBS with 10% FBS and filtered through a 35 µm nylon mesh cell strainer cap into a 5 mL Falcon® Round-Bottom Tubes (Corning, #352235). Cells were loaded into the BD FACS Aria Fusion flow cytometer coupled with a BD FACSDiva Software (version 9.0.1). After excluding doublets or non-viable cells, GFP+ events were collected into a tube containing 0.5 ml of cell specific media.
Flow cytometry analysis
Neural progenitor cells were plated at 1 × 105 cells/cm2 density and let recover for two days. The third day NPCs were either harvested (for Day 0 analysis) or cultured in terminal differentiation media for five days (Day 5). For the immunostaining and flow analysis, cells were retrieved in Accutase, diluted with 1X DPBS, and centrifuged at 3000g, followed by resuspension and fixation in 1.6% paraformaldehyde for 30 min at 37 °C with agitation. Cells were subsequently washed once with 1X DPBS, resuspended into FACS Buffer (1X DPBS containing 0.5% BSA (Jackson ImmunoResearch) and 0.05% sodium azide) and stored at 4 °C until ready for analysis. Cells were then permeabilized with saponin buffer (diluted to 1X in water; Biolegend, # 421022) and incubated for 1 h in the following antibodies diluted in saponin buffer: anti-Ki67 (1:400; Cell Signaling, #9449), anti-HuC/D (1:200; Invitrogen, A-21271), and anti-SOX2 (1:300; Cell Signaling, #3579). Cells were subsequently washed with saponin buffer and incubated for 1 h with secondary antibodies goat anti-mouse IgG1-conjugated Alexa 647, goat anti-mouse IgG2b-conjugated Alexa 488, and goat anti-rabbit conjugated brilliant violet 421 (each at 1:500; Jackson ImmunoResearch 115-605-205, 115-545-207, 111-675-144, respectively). Cells were subsequently washed and resuspended into FACS buffer. Cells were analyzed using a CytoFLEX flow cytometer (Beckman Coulter) and the FlowJo software (version 10.8.2). The starting cell populations was selected using the area of the following coordinates (FSC-A, SSC-A): (146, 192k), (146k, 370k), (291k,642k), (551k, 932k), (906k, 1.1 M), (970k, 614k), (391k,112k), (192k,89k). These criteria selected 86 + −3% [SEM] of events of Day 0 samples, and 72 + −2% [SEM] of events of Day 5 samples, such that the area selected excluded debris and dead/dying cells. Positive and negative cell staining was defined as a direct comparison of the Day 0 vs Day 5 timepoints, such that the earlier timepoint (Day 0) determined the gating, knowing that the cells were proliferative (Ki67 + , SOX2 + ) and not yet terminally differentiated (HuCD-). The same gates, without modification, were applied to the Day 5 timepoint. Population groups and clustering were taken into consideration to be inclusive of the indicated group to provide appropriate gating as to not split a population.
RNA-seq sample preparation and analysis
Total RNA was extracted using TRIzol reagent. RNA quality and concentration were determined with Bioanalyzer. Library preparation and sequencing was performed by Novogene on Illumina platform (NovaSeq 6000). Raw reads of FASTQ format were processed with fastp88. In this step, clean reads were obtained by removing reads containing adapter and >10% poly-N sequences, in addition to reads with low quality. At the same time, Q20, Q30 and GC content of the clean data were calculated. All the downstream analyses were based on the clean data with high quality. Reads were mapped to the reference genome GRCh37 (hg19) using STAR (v2.7.1a)89. The number of reads mapping to each gene were counted using featureCounts (subread-1.6.190). Principal component analysis (PCA) of filtered and normalized counts were generated using ggplot291. Differential gene expression was performed with DESeq2 (v1.28.1)92 with count matrix. Volcano plots were generated with ggplot291. GSEA were performed and plotted with ClusterProfiler (v.4.0.5)93. The input lists included 5705 DEGs from KO vs WT or 2517 DEGs from HET vs WT (padj<0.05, regardless of foldchange). Ranked cell-type specific gene lists composed of the top 500 significantly enriched genes in aRG, CFuPN, and CPN scRNA clusters were extracted from Uzquiano and Kedaigle et al94 to generate the NSC, early-born neuron and late-born neuron gene sets (respectively) for GSEA analysis. Core enrichment lists were generated with GSEA. Heatmaps were generated with baseR. Gene ontology analyses were performed in Enrichr95.
Chromatin Immunoprecipitation, sequencing, and analysis
Cells were fixed with 1% formaldehyde for 10 min at room temperature. Fixation was quenched with glycine (0.125 M). Briefly, cross-linked chromatin was fragmented by sonication carried out using a Bioruptor to generate an average fragment size of 200–500 bp. Chromatin was purified by centrifugation for 30 min, at 20,000 g and 4 °C. Purified chromatin was further diluted in immunoprecipitation (IP) buffer (0.01% SDS, 1.1% Triton X-100; 1.2 mM EDTA pH 8.0; 16.7 mM Tris–HCl pH 8.1; 167 mM NaCl) and incubated overnight with 1 μg of anti-H3K27me3 antibody (Cell Signaling (D5A7)). Protein A-bound beads were added and immunocomplexes were washed once with buffers TSE I (0.1% SDS; 1% Triton-X100; 2 mM EDTA pH 8.0; 20 mM Tris–HCl pH 8.1; 150 mM NaCl), TSE II (0.1% SDS; 1% Triton-X100; 2 mM EDTA pH 8.0; 20 mM Tris–HCl pH 8.1; 500 mM NaCl) and TSE III (0.25 M LiCl; 1% NP-40; 1% sodium deoxycholate; 1 mM EDTA pH 8.0; 10 mM Tris–HCl pH 8.1) and twice with TE buffer (10 mM Tris–HCl pH 8.1 and 1 mM EDTA). De-crosslinking was carried out for 4 h at 65 °C in elution buffer (1% SDS, 0.1 M NaHCO3). DNA fragments were purified using the QIAquick PCR Purification procedure to remove fragments >10 kb. DNA libraries were constructed with the TAKARA ThruPLEX® DNA-Seq Kit and the Illumina-compatible TAKARA DNA Single Index Kit. DNA libraries were quantified using a high sensitivity Chip on Bioanalyzer (Agilent) and sequenced at 40 million 150 bp pair-end reads per replicate in the NovaSeq 6000 (Illumina).
ChIP-seq reads were aligned to the GRCh37 hg19 reference genome using bowtie2 (v2.1.0) with parameters: -q --local --no-mixed --no-unal –dovetail96. Uniquely aligned reads (mapping quality score > = 20) and concordant alignments were kept for downstream analysis using command: samtools (v0.1.19) -q 20 -f 0 × 2. H3K27me3 peaks were called using MACS2 (v2.2.7.1) with parameters: --broad --keep-dup all -p 1e-5 --broad-cutoff 1e-597. ChIP-seq signal was normalized to sequencing depth using deepTools (v3.5.0): bamCoverage --normalizeUsing CPM98. ChIP-seq signal around peaks was computed and visualized using deepTools computeMatrix and plotHeatmap functions, respectively. For the statistical test, H3K27me3 signal (RPKM) in peaks was calculated for both WT and A678G and paired t-test was performed to define the statistical significance of the H3K27me3 signal difference between WT and A678G ReNcells using ggplot.
Western blotting
For histone-specific Western blotting, histones were enriched by lysing cells in an acid lysis buffer (10 nM HEPES, 1.5 mM MgCl2, 10 mM KCl). For all other Western blots protein lysates were generated using RIPA Buffer (Cell Signaling, #9806). Protein lysates and histone extractions were separated by SDS-PAGE, transferred to a PVDF membrane, and detected by Western blotting (WB). Membranes were blocked with 5% BSA in 1X TBS with 0.01% Tween20 (TBST) for 1 h at room temperature and incubated with primary antibodies in 5% BSA in TBST overnight at 4 C (1:1,000 anti-EZH2 (Cell Signaling, D2C9), 1:1,000 anti-EZH1 (Proteintech, #20852-1AP); 1:1,000 anti-EZH1 (Reinberg lab21), 1:1,000 anti-H3K27me3 (EMD Millipore, 07-449), 1:5,000; anti-β-actin (GenScript, A00702), 1:10,000 anti-H4 (Abcam, ab10158)). The next day the membrane was washed three times with TBST, incubated for 1 h with the corresponding secondary anti-Rabbit HRP (Invitrogen, #31458) and anti-Mouse HRP (Invitrogen, #SA1-100), and washed three times with TBST. The membranes were developed using Pierce™ ECL Western Blotting Substrate kit (Thermo Scientific, #32106) and exposed to autoradiography films following development in AFP Mini-Med 90 X-Ray Fil Processor. ImageJ software was used for densitometry quantification of WB bands.
Immunofluorescence staining
For chick embryo neural tube analyses, embryos were fixed for 2 h at 4 °C in 4% paraformaldehyde (PFA) and immunostaining was performed in 50 µm free floating sections cut with a vibratome. Sections were permeabilized with 1X PBS-0.1% Triton X-100, blocked with 5% BSA solution and immunostained with the following primary antibodies: 1:500 mouse anti-HuC/D (Millipore, A-21271) or 1:500 mouse anti-NeuN (Chemicon, MAB377) for MZ staining, 1:500 rabbit anti-SOX2 (Abcam, AB97959) or 1:500 rabbit anti-SOX9 (kindly provided by James Briscoe) to label the VZ, rabbit 1:500 anti-a-Caspase 3 (BD Pharmingen, C92-605) and 1:500 rabbit anti-Phospho-Histone3 (Millipore, #06-570) over night at 4 C. After washing three 5 min washes with 1X PBS-0.1% Triton X-100 sections were incubated with 1:1000 Anti-Rabbit Alexa Fluor 555 (Invitrogen, A21434), and/or anti-mouse Cy5 (Jackson ImmunoResearch, #115175146) secondary antibodies for 1 h at room temperature following by three 5 min washes. Sections were then stained with 1 µg/ul DAPI and mounted in Mowiol (Sigma-Aldrich, # 81381). Images were acquired with the Zeiss Lsm780 confocal system. The effects of EZH1 shRNA electroporation were assessed by measuring the area occupied, respectively, by the VZ (formed by SOX2+ or SOX9+ progenitors) or the MZ (formed by HuC/D+ or NeuN neurons) in chick neural tube transversal sections. The analysis was adapted from60. Briefly, VZ or MZ channels were split and measured using the ImageJ software. After producing z-stack maximal projection images, a threshold was applied to define the areas in each side of the neural tube. VZ or MZ areas were measured in both the control and the electroporated side of the neural tubes by a particle analysis using a pixel^2 size ranging from 1000 to infinity. The data are presented as ratios of the area ± SD obtained by standardizing the values of the electroporated (EP) side with the corresponding values of the non-EP side of the neural tube. Similarly, the ratio of cells in the VZ and MZ was produced by manually counting the number of EGFP+ cells in the VZ and MZ in each neural tube.
hPSCs and derived neural cells were plated onto Matrigel coated glass coverslips (Electron Microscopy Sciences 72290-04). Cells were fixed with 4% PFA for 10 min at room temperature, permeabilized and blocked in blocking buffer (1X PBS-0.1% Triton X-100-5% goat serum (Sigma-Aldrich, G9023)) for 1 h at room temperature, and immunostained overnight at 4 °C with the following antibodies: 1:500 mouse anti-SSEA (Abcam, ab16287), 1:500 rabbit anti-OCT4 (Abcam ab19857), 1:300 rabbit anti-PAX6 (Biolegend, #901302), 1:2000 rabbit anti-Nestin (Millipore Sigma, MAB5326), 1:1000 rabbit anti-TUJ1 (Abcam, ab18207). Next day, cells were incubated for 1 h at room temperature with 1:500 anti-rabbit or anti-mouse Alexa Fluor 488, 555 or 633—conjugated secondary antibodies (Invitrogen, A11008, A21428, A21070, A11001, A21422, A21050) and 0.1 μg/ml Hoechst 33342 and mounted with Mowiol mounting medium (Sigma, #81381). For EdU (5-ethynyl-2’-deoxyuridine, Invitrogen, E10187) labeling, cells were treated with 10 μM EdU for 30 min. After fixation for 10 min at RT with in 4% PFA, EdU labeling was performed using Click-it plus EdU cell proliferation kit following the manufacturer’s instructions (Invitrogen, C10638). Antibody co-staining was performed after EdU staining as indicated above. Images were acquired with Leica SP8 confocal microscope and analyzed in FIJI ImageJ. TUJ1 trace images were generated by FIJI ImageJ Process, Binary, skeletonize.
Whole organoids were fixed in 4% formaldehyde for 30 min at room temperature. Organoids were washed with PBS and then immersed in 30% sucrose solution overnight. Organoids were embedded in tissue freezing medium and sectioned with a cryostat (Leica) at 30 μm thickness. For immunostaining, cryosectioned slides were washed with 1X PBS. Tissues were permeabilized with 1X PBS-0.5% Triton-X in for 1 h and blocked with blocking medium consisting of 10% donkey serum in 1X PBS with 0.05% Triton-X (PBST) for 30 min. Primary antibodies 1:300 rabbit anti-SOX2 (R&D Systerms, AF2018), 1:300 rat anti-CTIP2 (Abcam, ab18465), 1:1000 mouse anti-SATB2 (Abcam, ab51502)) rabbit 1:1000 anti-TBR1 (Abcam, ab31940) and 1:400 rabbit anti-aCasp3 (Cell Signaling, #9661) diluted in blocking solution were applied to the sections overnight at 4 °C. After washing with PBST for 3 times, secondary antibodies (AlexaFluor 488, 546, or 647 -conjugated donkey antibodies (Invitrogen A21206, A31570, A78947)) diluted in blocking solution were applied to the sections for 1.5 h at room temperature. DAPI (Invitrogen) was added for 5 min (Invitrogen) at the end of incubation. Finally, sections were washed with PBST for 3 times before mounting. Images were captured by a confocal microscope (Zeiss LSM 800). Sample images were prepared in ImageJ (NIH). Total CTIP2+ or SATB2+ cells were counted in each image and normalized to SOX2+ cell numbers in the adjacent VZ. Analyses of VZ thickness was performed as previously described60. Briefly, to account for the variability, at least three organoids were sectioned for analysis. For each organoid, at least three sections distributed evenly from top to bottom of each organoid were sectioned and imaged. During confocal imaging, only rosettes at the edge of the organoids were captured to avoid the variability caused by the limited nutrient access in the center. Extremely small or big rosettes were avoided. Rosettes without clear lumen were also avoided as they may locate in very top/bottom region of the organoid. The distance from the apical to the basal lamina in a randomly selected position of each rosette in the section was measured for at least 3 sections per organoid.
Statistics and reproducibility
The data were analyzed using GraphPad Prism software (Prism Version 9.1.0) or Microsoft Excell (Version 16.72). All experiments were performed at least twice and results reported as the mean ± SEMs or mean ± SDs with statistical tests described in figure legends when three or more biological replicates were analyzed. The investigators were not blinded to sample identity during experiments, but all measurements were taken objectively and no data were excluded. No statistical method was used to predetermine sample size.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
The authors thank the Center for Cellular and Molecular Therapeutics Stem cell core and Center for Applied Genomics sequencing cores at Children’s Hospital of Philadelphia (CHOP) for their support with stem cell methods and sequencing technologies. We also thank Jasmine Akoto and Mara Davis, who helped with revisions during their PhD rotations in the Akizulab. The authors extend their appreciation to the invaluable contribution of Hessa Alsaif, other clinical coordinators, patients and their families to this study. This work was supported by the CHOP/UPENN IDDRC-New Program Development award, CHOP-Junior Faculty Pilot Program award, Margaret Q Landerbergen Foundation Award and NIH/NINDS R01NS119699, NIH/NICHD R21HD107592 (to N.A.), NIH/NINDS R35NS116843 (to H.S.), NIH/NINDS R35NS097370 and NIH/NIMH RF1MH123979 (to G.-l.M.)., BFU2015-69248-P and PGC2018-096082-B-I00 from the Spanish Ministry of Economy (to M.A.M.B.), Alabama Genomic Health Initiative F170303004 through University of Alabama at Birmingham (to A.C.H., M.T.), PID2019-111217RB-I00 Spanish Ministerio de Ciencia e Innovación (to X.D.L.C.), ID2019-110157RA-I00 (to M.S.), FPU Spanish Ministry of Education and Science predoctoral fellowship (to R.F.) and 2021 FISDU 00400 (to P.E-B.), NIHR Manchester Biomedical Research Centre NIHR203308 and Solve-RD which is supported by the European Union’s Horizon 2020 research and innovation program grant agreement No 779257 (to S.B.). This research was made possible through access to the data and findings generated by the 100KGP and the DDD. The 100KGP is managed by Genomics England Limited (a wholly owned company of the Department of Health and Social Care). The 100KGP is funded by the National Institute for Health Research and NHS England. The Wellcome Trust, Cancer Research UK and the Medical Research Council have also funded research infrastructure. The 100KGP uses data provided by patients and collected by the National Health Service as part of their care and support. The DDD study presents independent research commissioned by the Health Innovation Challenge Fund [grant number HICF-1009-003], a parallel funding partnership between Wellcome and the Department of Health, and the Wellcome Sanger Institute [grant number WT098051]. The views expressed in this publication are those of the author(s) and not necessarily those of Wellcome or the Department of Health. The study has UK Research Ethics Committee approval (10/H0305/83, granted by the Cambridge South REC, and GEN/284/12 granted by the Republic of Ireland REC). The research team acknowledges the support of the National Institute for Health Research, through the Comprehensive Clinical Research Network.
Source data
Author contributions
The first two authors, C.G.D. and Y.Z., contributed equally to this work. Study conceptualization and design: C.G.D., Y.Z., and N.A. Patient recruitment, clinical assessment and genetic diagnosis: R.M., D.L., S.E.S., P.M., M.H.H., H.Hakonarson, L.R., A.J., P.V., C.P., S.Mohammed, S.Maddirevula, H.A., E.A.F., S.E., V.T., F.R., S.Maqbool., V.S., S.H.I., G.d.R., H.Houlden., M.N.A., N.A.A., P.B., G.Z., A.C.E.H., M.L.T., A.C., C.L.S.-H. A.M.H., H.Z.E., M.H., C.R., D.B., S.B., G.E.R.C., F.S.A., P.B.A., and E.J.B. EZH1 in silico molecular modeling: X.dlC. and N.P. hPSC editing, culture maintenance, and differentiation: C.G.D., G.O., S.L., Y.Z., D.L.F., and N.A. Western blotting, RNA extraction, RT-PCR, RT-qPCR experimental design, execution, and data analysis: C.G.D. hPSC-derived neuronal culture immunofluorescence, image analysis and quantification: Y.Z. Histone methyltransferase assay: C.-H.L. and D.R. ChIP-seq experimental design and execution: C.G.D. ChIP-seq computational analysis: S.Z. and E.A.H. RNA-seq and computational analysis: C.G.D., Y.Z., and Z.S. Flow cytometry experiment conceptualization, execution and analysis: S.L., E.A.W., D.B., and N.A. Chick embryo experimental design, execution and data analysis: P.E.B., M.S., C.E., R.F., M.A.M.B., and N.A. Brain organoid culture, experiments and data analysis: Q.Y., H.S., G.-L.M., and N.A. Project coordination, supervision and administration: N.A. Manuscript preparation: C.G.D., Y.Z., and N.A. Manuscript edit and review: all authors.
Peer review
Peer review information
Nature Communications thanks Fabio Papes, Achira Roy and Anthony Wynshaw-Boris for their contribution to the peer review of this work. A peer review file is available.
Data availability
All data supporting the findings of this study are available within the article and its supplementary information files. Source data are provided with this paper. The exome/genome sequencing will be made available upon request provided that privacy and consent criteria are preserved. The ChIPseq and RNAseq data generated in this study have been deposited in GEO under accession code GSE210465 and GSE227014. Other raw data will be made available within two weeks upon request to corresponding author. The human brain gene expression data used in this study are available in BrainSpan (https://www.brainspan.org/rnaseq/searches?exact_match=false&search_term=EZH1&search_type=gene and https://www.brainspan.org/rnaseq/searches?exact_match=false&search_term=EZH2&search_type=gene&page_num=0) and GTEX (https://gtexportal.org/home/gene/EZH1 and https://gtexportal.org/home/gene/EZH2). Variants p.R406H and p.A678G are accessible in ClinVar under the accession codes VCV000828189.1 and VCV000977755.2 respectively. The other variants characterized in this study have been submitted to ClinVar. Source data are provided with this paper.
Competing interests
Houda Zghal Elloumi is an employee of GeneDX, LLC. Peter Bauer and Giovanni Zifarelli are employees of Centogene. The remaining authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Qian Yang, Reza Maroofian, Paula Espana-Bonilla.
A list of authors and their affiliations appears at the end of the paper
Contributor Information
Naiara Akizu, Email: aquizun@chop.edu.
Genomic England Research Consortium:
J. C. Ambrose, P. Arumugam, R. Bevers, M. Bleda, F. Boardman-Pretty, C. R. Boustred, H. Brittain, M. A. Brown, M. J. Caulfield, G. C. Chan, A. Giess, J. N. Griffin, A. Hamblin, S. Henderson, T. J. P. Hubbard, R. Jackson, L. J. Jones, D. Kasperaviciute, M. Kayikci, A. Kousathanas, L. Lahnstein, A. Lakey, S. E. A. Leigh, I. U. S. Leong, F. J. Lopez, F. Maleady-Crowe, M. McEntagart, F. Minneci, J. Mitchell, L. Moutsianas, M. Mueller, N. Murugaesu, A. C. Need, P. O’Donovan, C. A. Odhams, C. Patch, D. Perez-Gil, M. B. Pereira, J. Pullinger, T. Rahim, A. Rendon, T. Rogers, K. Savage, K. Sawant, R. H. Scott, A. Siddiq, A. Sieghart, S. C. Smith, A. Sosinsky, A. Stuckey, M. Tanguy, A. L. Taylor Tavares, E. R. A. Thomas, S. R. Thompson, A. Tucci, M. J. Welland, E. Williams, K. Witkowska, S. M. Wood, and M. Zarowiecki
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-39645-5.
References
- 1.Wright CF, et al. Genetic diagnosis of developmental disorders in the DDD study: a scalable analysis of genome-wide research data. Lancet. 2015;385:1305–1314. doi: 10.1016/S0140-6736(14)61705-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Wilfert AB, Sulovari A, Turner TN, Coe BP, Eichler EE. Recurrent de novo mutations in neurodevelopmental disorders: properties and clinical implications. Genome Med. 2017;9:101. doi: 10.1186/s13073-017-0498-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wang T, et al. Large-scale targeted sequencing identifies risk genes for neurodevelopmental disorders. Nat. Commun. 2020;11:4932. doi: 10.1038/s41467-020-18723-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Deciphering Developmental Disorders S. Prevalence and architecture of de novo mutations in developmental disorders. Nature. 2017;542:433–438. doi: 10.1038/nature21062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kaufmann P, Pariser AR, Austin C. From scientific discovery to treatments for rare diseases—the view from the National Center for Advancing Translational Sciences—Office of Rare Diseases Research. Orphanet. J. Rare Dis. 2018;13:196. doi: 10.1186/s13023-018-0936-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ciptasari U, van Bokhoven H. The phenomenal epigenome in neurodevelopmental disorders. Hum. Mol. Genet. 2020;29:R42–R50. doi: 10.1093/hmg/ddaa175. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Deevy, O. & Bracken, A. P. PRC2 functions in development and congenital disorders. Development146, (2019). [DOI] [PMC free article] [PubMed]
- 8.Tatton-Brown K, et al. Mutations in epigenetic regulation genes are a major cause of overgrowth with intellectual disability. Am. J. Hum. Genet. 2017;100:725–736. doi: 10.1016/j.ajhg.2017.03.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.De Rubeis S, et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature. 2014;515:209–215. doi: 10.1038/nature13772. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Satterstrom FK, et al. Large-scale exome sequencing study implicates both developmental and functional changes in the neurobiology of autism. Cell. 2020;180:568–584.e523. doi: 10.1016/j.cell.2019.12.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Yu JR, Lee CH, Oksuz O, Stafford JM, Reinberg D. PRC2 is high maintenance. Genes Dev. 2019;33:903–935. doi: 10.1101/gad.325050.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Xiao ZG, et al. The roles of histone demethylase UTX and JMJD3 (KDM6B) in cancers: current progress and future perspectives. Curr. Med. Chem. 2016;23:3687–3696. doi: 10.2174/0929867323666160725093522. [DOI] [PubMed] [Google Scholar]
- 13.Eich ML, Athar M, Ferguson JE, 3rd, Varambally S. EZH2-targeted therapies in cancer: hype or a reality. Cancer Res. 2020;80:5449–5458. doi: 10.1158/0008-5472.CAN-20-2147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Gibson WT, et al. Mutations in EZH2 cause Weaver syndrome. Am. J. Hum. Genet. 2012;90:110–118. doi: 10.1016/j.ajhg.2011.11.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Cohen AS, Gibson WT. EED-associated overgrowth in a second male patient. J. Hum. Genet. 2016;61:831–834. doi: 10.1038/jhg.2016.51. [DOI] [PubMed] [Google Scholar]
- 16.Imagawa E, et al. Novel SUZ12 mutations in Weaver-like syndrome. Clin. Genet. 2018;94:461–466. doi: 10.1111/cge.13415. [DOI] [PubMed] [Google Scholar]
- 17.Arnaud M, et al. Kabuki syndrome: update and review. Arch. Pediatr. 2015;22:653–660. doi: 10.1016/j.arcped.2015.03.020. [DOI] [PubMed] [Google Scholar]
- 18.Miyake N, et al. MLL2 and KDM6A mutations in patients with Kabuki syndrome. Am. J. Med. Genet. A. 2013;161A:2234–2243. doi: 10.1002/ajmg.a.36072. [DOI] [PubMed] [Google Scholar]
- 19.Stolerman ES, et al. Genetic variants in the KDM6B gene are associated with neurodevelopmental delays and dysmorphic features. Am. J. Med. Genet. A. 2019;179:1276–1286. doi: 10.1002/ajmg.a.61173. [DOI] [PubMed] [Google Scholar]
- 20.Lee CH, et al. Allosteric activation dictates PRC2 activity independent of its recruitment to chromatin. Mol. Cell. 2018;70:422–434.e426. doi: 10.1016/j.molcel.2018.03.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Margueron R, et al. Ezh1 and Ezh2 maintain repressive chromatin through different mechanisms. Mol. Cell. 2008;32:503–518. doi: 10.1016/j.molcel.2008.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Shen X, et al. EZH1 mediates methylation on histone H3 lysine 27 and complements EZH2 in maintaining stem cell identity and executing pluripotency. Mol. Cell. 2008;32:491–502. doi: 10.1016/j.molcel.2008.10.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Lee CH, et al. Distinct stimulatory mechanisms regulate the catalytic activity of polycomb repressive complex 2. Mol. Cell. 2018;70:435–448.e435. doi: 10.1016/j.molcel.2018.03.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.O’Carroll D, et al. The polycomb-group gene Ezh2 is required for early mouse development. Mol. Cell Biol. 2001;21:4330–4336. doi: 10.1128/MCB.21.13.4330-4336.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Ezhkova E, et al. EZH1 and EZH2 cogovern histone H3K27 trimethylation and are essential for hair follicle homeostasis and wound repair. Genes Dev. 2011;25:485–498. doi: 10.1101/gad.2019811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Volkel P, et al. Ezh1 arises from Ezh2 gene duplication but its function is not required for zebrafish development. Sci. Rep. 2019;9:4319. doi: 10.1038/s41598-019-40738-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Stojic L, et al. Chromatin regulated interchange between polycomb repressive complex 2 (PRC2)-Ezh2 and PRC2-Ezh1 complexes controls myogenin activation in skeletal muscle cells. Epigenetics Chromatin. 2011;4:16. doi: 10.1186/1756-8935-4-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Mousavi K, Zare H, Wang AH, Sartorelli V. Polycomb protein Ezh1 promotes RNA polymerase II elongation. Mol. Cell. 2012;45:255–262. doi: 10.1016/j.molcel.2011.11.019. [DOI] [PubMed] [Google Scholar]
- 29.Ai S, et al. Divergent requirements for EZH1 in heart development versus regeneration. Circ. Res. 2017;121:106–112. doi: 10.1161/CIRCRESAHA.117.311212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bodega B, et al. A cytosolic Ezh1 isoform modulates a PRC2-Ezh1 epigenetic adaptive response in postmitotic cells. Nat. Struct. Mol. Biol. 2017;24:444–452. doi: 10.1038/nsmb.3392. [DOI] [PubMed] [Google Scholar]
- 31.Vo LT, et al. Regulation of embryonic haematopoietic multipotency by EZH1. Nature. 2018;553:506–510. doi: 10.1038/nature25435. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Xu J, et al. Developmental control of polycomb subunit composition by GATA factors mediates a switch to non-canonical functions. Mol. Cell. 2015;57:304–316. doi: 10.1016/j.molcel.2014.12.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Pereira JD, et al. Ezh2, the histone methyltransferase of PRC2, regulates the balance between self-renewal and differentiation in the cerebral cortex. Proc. Natl Acad. Sci. USA. 2010;107:15957–15962. doi: 10.1073/pnas.1002530107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.von Schimmelmann M, et al. Polycomb repressive complex 2 (PRC2) silences genes responsible for neurodegeneration. Nat. Neurosci. 2016;19:1321–1330. doi: 10.1038/nn.4360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Henriquez B, et al. Ezh1 and Ezh2 differentially regulate PSD-95 gene transcription in developing hippocampal neurons. Mol. Cell Neurosci. 2013;57:130–143. doi: 10.1016/j.mcn.2013.07.012. [DOI] [PubMed] [Google Scholar]
- 36.Sobreira N, Schiettecatte F, Valle D, Hamosh A. GeneMatcher: a matching tool for connecting investigators with an interest in the same gene. Hum. Mutat. 2015;36:928–930. doi: 10.1002/humu.22844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Deciphering Developmental Disorders S. Large-scale discovery of novel genetic causes of developmental disorders. Nature. 2015;519:223–228. doi: 10.1038/nature14135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Investigators GPP, et al. 100,000 genomes pilot on rare-disease diagnosis in health care—preliminary report. N. Engl. J. Med. 2021;385:1868–1880. doi: 10.1056/NEJMoa2035790. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Karczewski KJ, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434–443. doi: 10.1038/s41586-020-2308-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Collins RL, et al. A cross-disorder dosage sensitivity map of the human genome. Cell. 2022;185:3041–3055.e3025. doi: 10.1016/j.cell.2022.06.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Grau D, et al. Structures of monomeric and dimeric PRC2:EZH1 reveal flexible modules involved in chromatin compaction. Nat. Commun. 2021;12:714. doi: 10.1038/s41467-020-20775-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Justin N, et al. Structural basis of oncogenic histone H3K27M inhibition of human polycomb repressive complex 2. Nat. Commun. 2016;7:11316. doi: 10.1038/ncomms11316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.McCabe MT, et al. Mutation of A677 in histone methyltransferase EZH2 in human B-cell lymphoma promotes hypertrimethylation of histone H3 on lysine 27 (H3K27) Proc. Natl Acad. Sci. USA. 2012;109:2989–2994. doi: 10.1073/pnas.1116418109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.McCabe MT, et al. EZH2 inhibition as a therapeutic strategy for lymphoma with EZH2-activating mutations. Nature. 2012;492:108–112. doi: 10.1038/nature11606. [DOI] [PubMed] [Google Scholar]
- 45.Saade M, et al. Multimerization of Zika Virus-NS5 causes ciliopathy and forces premature neurogenesis. Cell Stem Cell. 2020;27:920–936.e928. doi: 10.1016/j.stem.2020.10.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Le Dreau G, Saade M, Gutierrez-Vallejo I, Marti E. The strength of SMAD1/5 activity determines the mode of stem cell division in the developing spinal cord. J. Cell Biol. 2014;204:591–605. doi: 10.1083/jcb.201307031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Kicheva A, et al. Coordination of progenitor specification and growth in mouse and chick spinal cord. Science. 2014;345:1254927. doi: 10.1126/science.1254927. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Hamburger V, Hamilton HL. A series of normal stages in the development of the chick embryo. J. Morphol. 1951;88:49–92. doi: 10.1002/jmor.1050880104. [DOI] [PubMed] [Google Scholar]
- 49.Saade M, et al. Sonic hedgehog signaling switches the mode of division in the developing nervous system. Cell Rep. 2013;4:492–503. doi: 10.1016/j.celrep.2013.06.038. [DOI] [PubMed] [Google Scholar]
- 50.Akizu N, Estaras C, Guerrero L, Marti E, Martinez-Balbas MA. H3K27me3 regulates BMP activity in developing spinal cord. Development. 2010;137:2915–2925. doi: 10.1242/dev.049395. [DOI] [PubMed] [Google Scholar]
- 51.Akizu N, et al. EZH2 regulates neuroepithelium structure and neuroblast proliferation by repressing p21. Open Biol. 2016;6:150227. doi: 10.1098/rsob.150227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Stessman HA, et al. Targeted sequencing identifies 91 neurodevelopmental-disorder risk genes with autism and developmental-disability biases. Nat. Genet. 2017;49:515–526. doi: 10.1038/ng.3792. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Li M, et al. Integrative functional genomic analysis of human brain development and neuropsychiatric risks. Science. 2018;362:eaat7615. doi: 10.1126/science.aat7615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Chailangkarn T, et al. A human neurodevelopmental model for Williams syndrome. Nature. 2016;536:338–343. doi: 10.1038/nature19067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Paulsen B, et al. Autism genes converge on asynchronous development of shared neuron classes. Nature. 2022;602:268–273. doi: 10.1038/s41586-021-04358-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Birey F, et al. Dissecting the molecular basis of human interneuron migration in forebrain assembloids from Timothy syndrome. Cell Stem Cell. 2022;29:248–264.e247. doi: 10.1016/j.stem.2021.11.011. [DOI] [PubMed] [Google Scholar]
- 57.Chambers SM, et al. Highly efficient neural conversion of human ES and iPS cells by dual inhibition of SMAD signaling. Nat. Biotechnol. 2009;27:275–280. doi: 10.1038/nbt.1529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Mohn F, et al. Lineage-specific polycomb targets and de novo DNA methylation define restriction and potential of neuronal progenitors. Mol. Cell. 2008;30:755–766. doi: 10.1016/j.molcel.2008.05.007. [DOI] [PubMed] [Google Scholar]
- 59.Hirabayashi Y, et al. Polycomb limits the neurogenic competence of neural precursor cells to promote astrogenic fate transition. Neuron. 2009;63:600–613. doi: 10.1016/j.neuron.2009.08.021. [DOI] [PubMed] [Google Scholar]
- 60.Qian X, et al. Sliced human cortical organoids for modeling distinct cortical layer formation. Cell Stem Cell. 2020;26:766–781.e769. doi: 10.1016/j.stem.2020.02.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Hu H, et al. Genetics of intellectual disability in consanguineous families. Mol. Psychiatry. 2019;24:1027–1039. doi: 10.1038/s41380-017-0012-2. [DOI] [PubMed] [Google Scholar]
- 62.Jung CK, et al. Clinical utility of EZH1 mutations in the diagnosis of follicular-patterned thyroid tumors. Hum. Pathol. 2018;81:9–17. doi: 10.1016/j.humpath.2018.04.018. [DOI] [PubMed] [Google Scholar]
- 63.Calebiro D, et al. Recurrent EZH1 mutations are a second hit in autonomous thyroid adenomas. J. Clin. Invest. 2016;126:3383–3388. doi: 10.1172/JCI84894. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Hu Y, et al. Integrated whole-exome and transcriptome sequencing of sporadic parathyroid adenoma. Front. Endocrinol. (Lausanne) 2021;12:631680. doi: 10.3389/fendo.2021.631680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Morin RD, et al. Somatic mutations altering EZH2 (Tyr641) in follicular and diffuse large B-cell lymphomas of germinal-center origin. Nat. Genet. 2010;42:181–185. doi: 10.1038/ng.518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Hodis E, et al. A landscape of driver mutations in melanoma. Cell. 2012;150:251–263. doi: 10.1016/j.cell.2012.06.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Cyrus S, Burkardt D, Weaver DD, Gibson WT. PRC2-complex related dysfunction in overgrowth syndromes: A review of EZH2, EED, and SUZ12 and their syndromic phenotypes. Am. J. Med. Genet. C: Semin. Med. Genet. 2019;181:519–531. doi: 10.1002/ajmg.c.31754. [DOI] [PubMed] [Google Scholar]
- 68.Son J, Shen SS, Margueron R, Reinberg D. Nucleosome-binding activities within JARID2 and EZH1 regulate the function of PRC2 on chromatin. Genes Dev. 2013;27:2663–2677. doi: 10.1101/gad.225888.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Guy J, Gan J, Selfridge J, Cobb S, Bird A. Reversal of neurological defects in a mouse model of Rett syndrome. Science. 2007;315:1143–1147. doi: 10.1126/science.1138389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Benjamin JS, et al. A ketogenic diet rescues hippocampal memory defects in a mouse model of Kabuki syndrome. Proc. Natl Acad. Sci. USA. 2017;114:125–130. doi: 10.1073/pnas.1611431114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Meng L, et al. Towards a therapy for Angelman syndrome by targeting a long non-coding RNA. Nature. 2015;518:409–412. doi: 10.1038/nature13975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Mencacci NE, et al. De Novo mutations in PDE10A cause childhood-onset chorea with bilateral striatal lesions. Am. J. Hum. Genet. 2016;98:763–771. doi: 10.1016/j.ajhg.2016.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Anazi S, et al. Clinical genomics expands the morbid genome of intellectual disability and offers a high diagnostic yield. Mol. Psychiatry. 2017;22:615–624. doi: 10.1038/mp.2016.113. [DOI] [PubMed] [Google Scholar]
- 74.Li D, et al. Pathogenic variants in SMARCA5, a chromatin remodeler, cause a range of syndromic neurodevelopmental features. Sci. Adv. 2021;7:eabf2066. doi: 10.1126/sciadv.abf2066. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Guillen Sacoto MJ, et al. De Novo variants in the ATPase module of MORC2 cause a neurodevelopmental disorder with growth retardation and variable craniofacial dysmorphism. Am. J. Hum. Genet. 2020;107:352–363. doi: 10.1016/j.ajhg.2020.06.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.McKenna A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:e164. doi: 10.1093/nar/gkq603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Cingolani P, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. (Austin) 2012;6:80–92. doi: 10.4161/fly.19695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Genomes Project C, et al. A global reference for human genetic variation. Nature. 2015;526:68–74. doi: 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Karczewski KJ, et al. The ExAC browser: displaying reference data information from over 60 000 exomes. Nucleic Acids Res. 2017;45:D840–D845. doi: 10.1093/nar/gkw971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Glusman G, Caballero J, Mauldin DE, Hood L, Roach JC. Kaviar: an accessible system for testing SNV novelty. Bioinformatics. 2011;27:3216–3217. doi: 10.1093/bioinformatics/btr540. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.den Dunnen JT, et al. HGVS recommendations for the description of sequence variants: 2016 update. Hum. Mutat. 2016;37:564–569. doi: 10.1002/humu.22981. [DOI] [PubMed] [Google Scholar]
- 84.Wiel L, et al. MetaDome: pathogenicity analysis of genetic variants through aggregation of homologous human protein domains. Hum. Mutat. 2019;40:1030–1038. doi: 10.1002/humu.23798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Larkin MA, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23:2947–2948. doi: 10.1093/bioinformatics/btm404. [DOI] [PubMed] [Google Scholar]
- 86.Eswar, N. et al. Comparative protein structure modeling using Modeller. Curr Protoc BioinformaticsChapter 5, Unit-5 6 (2006). [DOI] [PMC free article] [PubMed]
- 87.Kojima S, Vignjevic D, Borisy GG. Improved silencing vector co-expressing GFP and small hairpin RNA. Biotechniques. 2004;36:74–79. doi: 10.2144/04361ST02. [DOI] [PubMed] [Google Scholar]
- 88.Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34:i884–i890. doi: 10.1093/bioinformatics/bty560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Dobin A, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Liao Y, Smyth GK, Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30:923–930. doi: 10.1093/bioinformatics/btt656. [DOI] [PubMed] [Google Scholar]
- 91.Wickham, H., Chang, W. & Wickham, M. H. Package ‘ggplot2’ create elegant data visualisations using the grammar of graphics. version 2, 1–189 (2016).
- 92.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Uzquiano A, et al. Proper acquisition of cell class identity in organoids allows definition of fate specification programs of the human cerebral cortex. Cell. 2022;185:3770–3788.e3727. doi: 10.1016/j.cell.2022.09.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Xie Z, et al. Gene set knowledge discovery with Enrichr. Curr. Protoc. 2021;1:e90. doi: 10.1002/cpz1.90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Zhang Y, et al. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Ramirez F, et al. deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic Acids Res. 2016;44:W160–W165. doi: 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data supporting the findings of this study are available within the article and its supplementary information files. Source data are provided with this paper. The exome/genome sequencing will be made available upon request provided that privacy and consent criteria are preserved. The ChIPseq and RNAseq data generated in this study have been deposited in GEO under accession code GSE210465 and GSE227014. Other raw data will be made available within two weeks upon request to corresponding author. The human brain gene expression data used in this study are available in BrainSpan (https://www.brainspan.org/rnaseq/searches?exact_match=false&search_term=EZH1&search_type=gene and https://www.brainspan.org/rnaseq/searches?exact_match=false&search_term=EZH2&search_type=gene&page_num=0) and GTEX (https://gtexportal.org/home/gene/EZH1 and https://gtexportal.org/home/gene/EZH2). Variants p.R406H and p.A678G are accessible in ClinVar under the accession codes VCV000828189.1 and VCV000977755.2 respectively. The other variants characterized in this study have been submitted to ClinVar. Source data are provided with this paper.