Summary
GATA zinc finger domain containing 2A (GATAD2A) is a subunit of the nucleosome remodeling and deacetylase (NuRD) complex. NuRD is known to regulate gene expression during neural development and other processes. The NuRD complex modulates chromatin status through histone deacetylation and ATP-dependent chromatin remodeling activities. Several neurodevelopmental disorders (NDDs) have been previously linked to variants in other components of NuRD’s chromatin remodeling subcomplex (NuRDopathies). We identified five individuals with features of an NDD that possessed de novo autosomal dominant variants in GATAD2A. Core features in affected individuals include global developmental delay, structural brain defects, and craniofacial dysmorphology. These GATAD2A variants are predicted to affect protein dosage and/or interactions with other NuRD chromatin remodeling subunits. We provide evidence that a GATAD2A missense variant disrupts interactions of GATAD2A with CHD3, CHD4, and CHD5. Our findings expand the list of NuRDopathies and provide evidence that GATAD2A variants are the genetic basis of a previously uncharacterized developmental disorder.
Keywords: GATAD2A, NuRD complex, neurodevelopmental disorder, NuRDopathies
Genetic variants in several members of the nucleosome remodeling and deacetylase (NuRD) complex are associated with distinct neurodevelopmental disorders. Our findings implicate the NuRD member, GATAD2A, in a neurodevelopmental disorder and demonstrate the need to identify additional genetic variants to define the clinical spectrum of NuRD-related disorders.
Introduction
Chromatin modifiers and remodelers have been recently implicated in a variety of neurodevelopmental disorders (NDDs).1,2,3,4,5,6,7,8 The nucleosome remodeling and deacetylase (NuRD) complex has been linked to four NDDs with overlapping phenotypes as a result of dominant variants in several paralogous subunits of the complex (NuRDopathies).5,6,7,8,9 NuRD regulates a variety of cellular processes including cell-cycle progression, genome integrity, and cellular differentiation.10,11,12 The NuRD complex consists of several different subunits, each with a set of paralogous proteins.9 The holoenzyme complex can be divided into two subcomplexes: the chromatin remodeling subcomplex (CRS) and a histone deacetylation (HDAC) subcomplex, or HDAC core (Figure 1A). The HDAC core consists of three subunits in multiple copies comprised of different paralogs that include four retinoblastoma-binding protein (RBBP4/7) subunits, two metastasis-associated protein (MTA1/2/3) subunits, and two histone deacetylase (HDAC1/2) subunits.10,12 By contrast, the CRS is composed of three monomeric paralogous subunits in series: a methyl-binding domain protein (MBD2/3), a GATAD2 protein (GATAD2A/B, previously known as p66α/β), and a chromodomain helicase DNA-binding protein (CHD3/4/5). A CDK2AP1 protein serves as the final member of the CRS and interacts with GATAD2 and CHD paralogs (not shown in diagram).10,12
Figure 1.
Individuals with monoallelic variants in GATAD2A
(A) Schematic of different NuRD subtypes and associated developmental disorders. NuRD subtypes can be defined by the presence of one paralog (CHD3-NuRD) or multiple paralogs (MBD3-GATAD2A-CHD4-NuRD). The disorders would be due to a deficiency of any particular NuRD paralog and all of its associated subtypes (e.g., CHD3-NuRD deficiency = CHD3RS).
(B) Pedigrees of five families with GATAD2A variants segregating with developmental disorders and the genotypes of sequenced individuals. Squares represent males; circles represent females; triangles represent pregnancies not carried to term; clear shapes indicate unaffected status; solid shapes indicate individuals with GARND.
(C) Facial photos across development showing subtle overlapping craniofacial dysmorphology in P1 and P2, including prominent or broad forehead, deep-set eyes, and broad nasal root. Macrocephaly is also seen in P2.
(D) Sagittal and coronal MRI of P1 at 4 days old. Yellow arrows highlighting thin corpus callosum (top), slightly enlarged ventricles (bottom left), and parieto-occipital subdural hematoma (bottom right).
Notably, the various paralogs enable a wide variety of NuRD subtypes, each with the potential to provide unique functions. For example, CHD3-, CHD4-, and CHD5-possessing NuRD subtypes (CHD3-NuRD, CHD4-NuRD, etc.) are differentially expressed during corticogenesis. CHD4-NuRD subtypes activate expression of a specific set of genes in neural progenitor cells, which are subsequently repressed in cortical neurons by CHD3-NuRD.13,14 Interestingly, CHD3, CHD4, and CHD5 are all associated with dominant NDDs with overlapping phenotypes (CHD3-related syndrome [CHD3RS/Snijders-Blok-Campeau syndrome; MIM: 618205], CHD4-related syndrome [CHD4RS/Sifrim-Hitz-Weiss syndrome; MIM: 617159], and CHD5-related syndrome [CHD5RS/Parenti-Mignot neurodevelopmental syndrome, MIM: 619873]) (Figure 1A).6,7,8,15 Of note, dominant variants in GATAD2B, which tethers the CHD paralogs to the rest of the complex in GATAD2B-NuRD (2B-NuRD) subtypes, have been associated with GATAD2B-associated neurodevelopmental disorder (GAND) (MIM: 615074), a NuRDopathy whose phenotypes encompass nearly all features of CHD3RS, CHD4RS, and CHD5RS combined (Figure 1A).16 To date, GATAD2A has not been associated with a NuRD-related NDD; however, GATAD2A deficiency has been linked to increased expression of fetal hemoglobin with a nonsense GATAD2A variant (c.19C>T, p.R7∗),17,18 and regulatory variants in GATAD2A are significantly associated with schizophrenia and bipolar disorder.19,20
The degree to which GATAD2A-NuRD (2A-NuRD) and 2B-NuRD subtypes functionally overlap or diverge is unclear, although some research has suggested non-redundant functions in certain cell types.10,17,21,22 GATAD2A possesses proline-rich PPPLφ motifs (absent in GATAD2B) that allow for interaction with MYND domains in proteins like ZMYND8.21 GATAD2A also seems to have a non-redundant role in early stem cell differentiation and its ablation enhances pluripotent reprogramming.22 Interestingly, GATAD2A is highly expressed during early neural development,23 which is consistent with early embryonic lethality and variable developmental defects in Gatad2a knockout mice.24
We report the identification of five novel de novo heterozygous variants in GATAD2A in five unrelated individuals with NDD phenotypes. Despite variable expressivity of phenotypes, shared clinical features in affected individuals include global developmental delay (GDD), structural brain defects, and craniofacial anomalies. Observed clinical phenotypes overlap with other NuRDopathies, suggesting that NuRD paralog deficiencies may converge on similar mechanisms during development. We also demonstrate that one missense variant (c.1259 G>A, p.C420Y) disrupts known interactions between GATAD2A and CHD paralogs. We hypothesize that these GATAD2A variants likely act through a loss-of-function (LoF) haploinsufficiency mechanism. Together, we provide evidence for a GATAD2A-related neurodevelopmental disorder that we have termed GARND (Figure 1A).
Materials and methods
Research subjects
All subjects and parents or guardians provided informed consent and were enrolled in institutional review board (IRB)-approved research studies. Consenting was performed in accordance with the ethical standards of the respective IRB committees on human research subjects and in keeping with international standards. Probands (P) 2–5 were identified through multiple nodes in the MatchMaker Exchange, including GeneMatcher and MyGene2.25,26 Participants were recruited at the following institutions: the University of Michigan Pediatrics Genetics Clinic at C.S. Mott Children’s Hospital (P1), Department of Medical Genetics at the Alberta Children’s Hospital, University of Calgary (P2), St. Louis Children’s Hospital in St. Louis, Missouri (P3), Saint Peter’s University Hospital (P4), and the Ludwig Maximilian University of Munich Dr. von Hauner Children’s Hospital (P5).
Exome sequencing and analysis
Genomic DNA for each participant were extracted using the DNA Genotek Prep-IT L2P (no. PT-L2P) or QIAGEN DNeasy Blood & Tissue Kit (no. 69504), quantified with the Life Technologies Quant-iT PicoGreen dsDNA Assay Kit (no. P7589), and 3 μg were submitted for exome sequencing on an Illumina HiSeq instrument at the following institutions: the University of Washington Center for Mendelian Genomics, the Department of Medical Genetics at the University of Calgary, GeneDx, and the Institute of Human Genetics at the Technical University of Munich School of Medicine. Reads were aligned to the hg38 reference genome (GRCh38.p13) using Burrows-Wheeler Aligner. Read and alignment quality was assessed for each sample using PLINK (v.1.90b2m), and kinship was confirmed with KING v.1.4.0.
Variant calling and annotation
Variant calling of single-nucleotide variants (SNVs) and copy number variants was performed using GATK and CONIFER, respectively. The data were filtered and annotated using GEMINI v.0.19.1 Variant Effect Predictor (VEP). Variants were also filtered against public databases including the 1000 Genomes Project phase 311, Genome Aggregate Database (gnomAD), and NHLBI Exome Sequencing Project (ESP) 6500. Those with a minor allele frequency >0.005 were excluded. In addition, variants flagged as low impact, low quality, or putative false positives (Phred quality score <20) were excluded from the analysis. Variants in genes known to be associated with NDD were selected and prioritized based on predicted pathogenicity.
The de novo status of GATAD2A variants in P1, P3, and P5 was reported based on trio exome sequencing results, and by Sanger sequencing confirmation in participants P2 and P3 and respective parents (Figure 1B). Pathogenicity of variants was assessed according to American College of Medical Genetics (ACMG) guidelines and using the Franklin Genoox online classification tool. All variants were submitted to Database: ClinVar (accession no. VCV001705818.2, VCV001705819.2, VCV001705820.2, VCV001705821.2, and VCV001705822.2).
Protein conservation, structure, and in silico analyses
NCBI HomoloGene tool was used to obtain aligned amino acid sequences of GATAD2A across species at affected residues and flanking regions. Protein alignment was performed on GATAD2A and GATAD2B sequences using Geneious Prime v.2022.1.1 global alignment with free end gaps and a Blosum62 cost matrix. PDB files for GATAD2A (AF-Q86YP4) were downloaded and extracted from the AlphaFold Protein Structure Database’s reference Homo sapiens proteome file no. UP000005640. The structure was edited in PyMOL v.2.5.2. In silico prediction of the functional impact of GATAD2A variants was performed using Polymorphism Phenotyping (PolyPhen-2) v.2.2.3r406 using the HumDiv model, Sorting Intolerant From Tolerant (SIFT), varSEAK, and MutationTaster2021.27,28,29 Combined annotation-dependent depletion (CADD) Phred scores were obtained for each variant using CADD v.1.6 against GRCh3830 MetaDome analysis was performed on NM_017660.3 transcript using the online tool.31
Plasmids
Full-length cDNA sequence for human GATAD2A (GenBank: NM_017660.5) as well as partial cDNA sequences for the C-terminal domain (CTD) of CHD3 (encoding residues 1246–1944 [GenBank: NM_001005273.3]), CHD4 (residues 1230–1912 [GenBank: NM_001273.5]), and CHD5 (residues 1218–1954 [GenBank: NM_015557.3]) were each cloned into the pcDNA3.1 expression vector to generate the following plasmids: HA-tagged GATAD2A (HA-GATAD2AWT), FLAG-tagged CHD3-CTD (FLAG-CHD3_CTD), FLAG-tagged CHD4-CTD (FLAG-CHD4_CTD), and FLAG-tagged CHD5-CTD (FLAG-CHD5_CTD). Site-directed mutagenesis was performed on HA-GATAD2AWT to introduce the p.C420Y missense variant (HA-GATAD2AC420Y).
Western blotting and co-immunoprecipitation
Immunoprecipitation assays were performed as previously described.16 In brief, HA-GATAD2AWT and HA-GATAD2AC420Y proteins were independently co-expressed with each of the three FLAG-CHD-CTD fusion proteins in rabbit reticulocyte lysates for in vitro translation (IVT). Expressed FLAG-CHD-CTD proteins together with HA-GATAD2A proteins were immobilized on anti-FLAG resin, washed, and eluted with 3X-FLAG peptide. In parallel, IVT lysates expressing only an HA-GATAD2A or FLAG-CHD-CTD protein, as well as lysates with no expressed protein, were run as negative controls. Immunoprecipitation inputs and eluates were loaded and run with sodium dodecyl sulfate-polyacrylamide gel electrophoresis followed by transfer to a polyvinylidene fluoride membrane. Immunoblots were probed with anti-HA-HRP (1:20,000, Cell Signaling Technology, no.2999S) followed by probing with anti-FLAG-HRP (1:80,000, Sigma Aldrich, no. A8592).
Results
Individuals with heterozygous GATAD2A variants exhibit features of developmental disorders
Through research exome-based sequencing, we identified a de novo variant of uncertain significance in the GATAD2A gene (NM_017660.5) in a child (P1; c.673C>T, p.R225∗) who presented with multiple congenital anomalies at age 1 month (Figures 1B–1D). He was diagnosed with imperforate anus, moderate membranous ventricular septal defect, patent ductus arteriosus, right optic nerve coloboma, microphthalmia, and bilateral hydronephrosis. Craniofacial dysmorphisms include choanal atresia, prominent or broad forehead, deep-set eyes, and broad nasal root (Figure 1C). He had GDD with a brain MRI at 4 days old that showed pyriform aperture stenosis, small optic nerves, bilateral cerebellar hematomas, parietooccipital hematoma, enlarged ventricles, and a thin corpus callosum (Figure 1D).
Normal testing results from chromosomal microarray and a CHARGE syndrome sequencing panel necessitated exome sequencing. A de novo nonsense GATAD2A variant (c.673C>T, p.R225∗) was identified by trio exome sequencing that is absent in control populations (gnomAD v.2.1.1), with in silico analysis supporting a deleterious effect (Table 1). After identifying individual P1, we subsequently identified four additional unrelated individuals with novel de novo variants in GATAD2A through international variant-sharing efforts and have summarized their phenotypes below as well as in Table 1 (Figure 1B).
Table 1.
Clinical summary of individuals with monoallelic variants in GATAD2A in this study
| Individual | P1 | P2 | P3 | P4 | P5 |
|---|---|---|---|---|---|
| GATAD2A variant information | |||||
| cDNA (GenBank: NM_017660.5) | c.673C>T | c.1259G>A | c.1877delT | c.1205G>T | c.626C>T |
| Protein consequence | p.R225∗ | p.C420Y | p.I627Tfs | p.G402V | p.T209I |
| Genotype | heterozygous | heterozygous | heterozygous | heterozygous | heterozygous |
| Inheritance | de novo | de novo | de novo | N/A | de novo |
| Sequencing method | research-based trio exome | research-based trio exome, Sanger confirmed | research-based trio exome, Sanger confirmed | clinical exome | research-based trio exome |
| gnomAD v.2.1.1 frequency | 0 | 0 | 0 | 0 | 0 |
| ACMG pathogenicity classification | VUS | VUS | likely pathogenic | VUS | VUS |
| CADD Phred score (GRCh38-v1.6) | 40 | 29.9 | 33 | 36 | 16.67 |
| SIFT (GRCh38-v1.6) | deleterious (0.00) | deleterious (0.00) | tolerated (0.28) | ||
| PolyPhen-2 (GRCh38-v1.6) | probably damaging (1.000) | possibly damaging (0.930) | benign (0.001) | ||
| Other genetic findings | hemizygous variant in PLXNA3 (NM_017514: c.1093C>G, p.L365V) | hemizygous variant in SLC9A7 (NM_001257291.1: c.1363del, p.L455Wfs∗29) | |||
| Clinical features | |||||
| Developmental/growth defects | + | + | N/A | + | + |
| Structural brain anomalies | + | + | N/A | N/A | − |
| Vision/hearing defects | + | + | N/A | N/A | – |
| Craniofacial dysmorphisms | + | + | + | – | – |
| Cardiovascular anomalies | + | + | N/A | N/A | N/A |
| Gastrointestinal/renal defects | + | – | + | N/A | N/A |
| Musculoskeletal anomalies | + | – | + | + | – |
VUS, variant of uncertain significance; CADD, combined annotation dependent depletion; +, present; −, absent; N/A, not available.
Individual P2 was a 7-year-old male who presented with mild short stature, chronic otitis media and associated hearing loss, hypotonia, and borderline macrocephaly. He had feeding difficulties, mild GDD, and speech delay although he continued to make developmental progress. Echocardiogram identified mild atrial enlargement. Head ultrasound revealed mildly asymmetric ventricles. Craniofacial features include prominent forehead, deep-set eyes, midface hypoplasia, and a broad nasal root (Figure 1C). Exome sequencing identified a de novo heterozygous missense GATAD2A variant (c.1259G>A, p.C420Y) in P2, with inheritance confirmed by subsequent trio Sanger sequencing.
Individual P3 was an 8-year-old female who presented with mild hemihyperplasia, horseshoe kidney, and bilateral Wilms tumor. She had normal development, and neuroimaging was not performed. Craniofacial dysmorphisms included hypertelorism, prognathism, and broad nasal tip. Trio exome sequencing identified a de novo heterozygous frameshift GATAD2A variant (c.1877delT, p.I627Tfs) in P3, which was confirmed by Sanger sequencing.
Individual P4 was a 4-year-old female who presented with right-sided hemihyperplasia. She had GDD, speech delay, and autistic features. Neuroimaging was not performed. No craniofacial dysmorphisms were noted. Exome sequencing identified a heterozygous GATAD2A missense variant (c.1205G>T, p.G402V) in P4 of unknown inheritance.
Individual P5 was a 4-year-old male who presented with GDD. No structural brain anomalies were observed by brain MRI and no craniofacial dysmorphisms were noted. Trio exome sequencing identified a de novo heterozygous GATAD2A missense variant (c.626C>T, p.T209I) in P5.
In summary, the shared clinical features with variable expressivity include GDD, hemihyperplasia, craniofacial dysmorphism, and structural brain defects (Table 1). Nearly all (4/5) individuals in our cohort presented with developmental and growth defects. Three of the five individuals exhibited craniofacial dysmorphology. Musculoskeletal anomalies were also observed in three of the five individuals. Although unlikely, variants of uncertain significance (VUSs) were identified in other genes in P1 and P2 that may also be contributing to the observed clinical phenotypes (Table 1).32,33
GATAD2A heterozygous variants predicted to disrupt NuRD interactions correlate with neurodevelopmental features
Both GATAD2 proteins possess two highly conserved domains: conserved region 1 (CR1) and conserved region 2 (CR2).34 CR1 is more N-terminal and encodes a coiled-coil domain that interacts with a similar domain in MBD-proteins for coiled-coil binding; by contrast, CR2 is downstream and possesses GATA-type zinc finger domains shown to interact with CHD paralogs (Figure 2A).5,16 CR1 is thought to tether the MBD-HDAC core unit to GATAD2 proteins, whereas CR2 tethers GATAD2 proteins to the CHD paralogs. CDK2AP1 has also been shown to interact with CR2.35 Pairwise alignment indicates that GATAD2A has protein sequence homology (40.841%) with its paralog GATAD2B, predominantly around the CR1 and CR2 domains. Of note, in GATAD2B, LoF variants were identified across most of the coding sequence, while missense variants only localized to CR1 and CR2 domains.5,16,36
Figure 2.
Features of GATAD2A variants
(A) Linear GATAD2A protein map with missense (blue triangle), nonsense (red square), and frameshift (orange circle) variants identified in present study. CR1, conserved region domain 1; CR2, conserved region domain 2; CC, coiled-coil domain; GATA ZF, GATA-type zinc finger domain.
(B) Predicted tolerance landscape of GATAD2A by MetaDome analysis.
(C) GATAD2A protein alignment showing cross-species conservation at affected residues, and alignment with human GATAD2B (bottom) showing conserved regions at affected residues within the GATA ZF domain and the previously unreported conserved C-terminal motif. The homologous human GATAD2B variants are shown below. The PPPLφ motif near R225∗ is also shown. ZB, zinc-binding motif; GCTM, GATAD2 C-terminal motif.
Among the identified GATAD2A variants in our cohort, three were missense (c.1205G>T, G402V; c.1259 G>A, p.C420Y; c.626C>T, p.T209I), one was nonsense (c.673C>T, p.R225∗), and one was frameshift (c.1877delT, p.I627Tfs) at the extreme C terminus (Figure 2A). Following ACMG standards and guidelines, one of the five GATAD2A variants was classified as likely pathogenic (p.I627Tfs), whereas the other four were VUSs (p.T209I, p.R225∗, p.G402V, p.C420Y) (Table 1).37 All discovered variants were absent in public databases (gnomAD v.2.1.1), where GATAD2A demonstrated both high probability of LoF intolerance (pLI) (observed/expected SNVs [o/e] = 0.06; pLI = 1) and high intolerance to missense variation (o/e = 0.83; Z = 1.27). In light of the difficulty of interpreting variants of uncertain significance, we used the MetaDome web server, which pools data from gnomAD and the Human Gene Mutation Database to provide intolerance profiles for missense variants at amino acid-level resolution.31 From this analysis, there is uneven intolerance across GATAD2A, with notable and predictable hotspots of intolerance concentrated around the CR1 and CR2 regions (Figure 2B). Importantly, missense variants identified in our cohort lie at predicted intolerant residues in MetaDome. All GATAD2A variants affect conserved residues localized throughout GATAD2A (Figure 2C). Except p.T209I, all variants have CADD scores above 20 (Table 1). Missense variants p.G402V and p.C420Y are predicted to be possibly or probably damaging by PolyPhen-2, and deleterious by SIFT (Table 1). These missense variants affect homologous residues that are conserved in GATAD2B (Figure 2C). Notably, the p.G402V missense variant affects a homologous residue to the GATAD2B p.G406 residue, where a pathogenic variant was previously identified in GAND,16 as well as in two previously unreported individuals with GAND (variants p.G406S and p.G406C, data not shown) (Figure 2C). These findings in several individuals with GAND indicate that the GATAD2B p.G406 residue has functional importance and the homologous residue p.G402 in GATAD2A is likely to have a similar effect. Previous studies have shown that GATAD2B p.G406R does not disrupt GATAD2B interactions with CHD proteins, suggesting that its pathogenicity may be associated with disruption of other protein interaction(s). Alternatively, both the GATAD2A (c.1205G>T, p.G402V) and GATAD2B (c.1216G>C, p.G406R) changes lie at exon-intron boundaries, with the GATAD2A variant predicted to have a LoF effect on the 3′ splice site of intron 8, which may result in use of a cryptic splice three nucleotides upstream (varSEAK, class 4). GATAD2B variants also provide additional evidence for the pathogenicity of the GATAD2A p.C420Y variant. Missense variants in GATAD2B affecting homologous zinc-binding cysteines were present in multiple individuals with GAND and are expected to have the same effect on GATAD2A.16 Finally, our GATAD2 paralog protein alignment revealed that the p.I627Tfs variant lies within a C-terminal motif that is highly conserved across species as well as with GATAD2B, with the residues flanking the motif being widely divergent between the two paralogs (Figure 2C). Conversely, the p.R225∗ nonsense and the p.T209I missense variants lie near PPPLφ motifs that are absent in GATAD2B (Figure 2C). Together, these computational and population genetics analyses provide additional evidence for the negative functional consequence of these GATAD2A heterozygous variants.
Of note, variants that might alter protein dosage (p.R225∗ and p.G402V) and/or structure of the CR2 domain (p.G402V and p.C420Y), and result in a functional haploinsufficiency, are present among individuals with neurodevelopmental defects. Among these, P1 and P2 (p.R225∗ and p.C420Y, respectively), present with structural brain defects including enlarged or asymmetric ventricles, thin corpus callosum, and macrocephaly. This evidence suggests that disruption and/or decreased dosage of the GATAD2A-CHD paralog interactions may be important for neural development.
GATAD2A interaction with NuRD components is disrupted by missense variant p.C420Y
Previous work has shown that the CTD of NuRD complex members CHD3, CHD4, or CHD5 is sufficient for GATAD2 protein interactions (Figure 3A).5,16 The GATAD2 proteins possess GATA-type zinc finger domains within CR2 that are required for this interaction.16 Given that the p.C420 of GATAD2A residue is one of four cysteines that coordinate the zinc ion, we hypothesized that the p.C420Y missense variant may disrupt CR2 zinc finger folding and its subsequent interaction with CHD paralogs. This was previously seen for GATAD2B CR2 zinc-binding cysteine residue variants (p.C420R and p.C420S), which resulted in a GAND diagnosis (Figure 3B).5,16 To investigate the impact of the p.C420Y missense variant on interactions with CHD3, CHD4, and CHD5, we performed IVT in rabbit reticulocyte lysates to co-express the GATAD2AC420Y protein with each CHD-CTD FLAG-tagged fusion protein. Immunoprecipitation was performed using the FLAG-tagged CHD-CTD fusion proteins as bait followed by pull-down with an anti-FLAG antibody. Compared with GATAD2AWT protein, there was a marked reduction in GATAD2AC420Y binding to all three CHD paralogs (Figure 3C). These findings provide evidence that the p.C420Y missense variant perturbs interactions within the CRS of the NuRD complex.
Figure 3.
Modeling GATAD2A missense variant effect on NuRD complex interactions
(A) Domain architecture of human CHD4. CTD indicates the region required for GATAD2 binding.38
(B) Structural prediction of the GATAD2A zinc finger region (AlphaFold2). The four highlighted cysteines in yellow and red (C420) are essential for zinc coordination.
(C) Western blots of co-immunoprecipitation of FLAG-CHD3-CTD, FLAG-CHD4-CTD, or FLAG-CHD5-CTD co-expressed with either HA-GATAD2AWT or HA-GATAD2AC420Y in IVT lysates, along with negative controls. Inputs are on top and immunoprecipitated eluates are on the bottom.
Discussion
We report de novo heterozygous dominant variants in GATAD2A as a genetic basis of a developmental disorder that we abbreviate as GATAD2A-related neurodevelopmental disorder (GARND). Five distinct GATAD2A variants were identified in five unrelated individuals whose prior genetic testing did not reveal other pathogenic or structural variants in genes that could explain the full array of clinical presentations. Many of the individuals in our cohort presented with overlapping developmental defects. In silico and functional analyses, along with evidence from homologous variants in GATAD2B, support a deleterious effect of the identified GATAD2A nonsense, frameshift, and missense variants. Our immunoprecipitation studies also show a disruption of the interaction between the GATAD2AC420Y and the CHD paralogs within the CRS. Further investigation will be required to determine if this results in a LoF haploinsufficiency mechanism of disease or a dominant-negative disorder due to sequestration of HDAC-MBD-GATAD2AC420Y partial complex from the CHD paralogs.
Despite some shared clinical features, our clinical findings indicate a range of phenotypic findings in GARND including craniofacial dysmorphisms, musculoskeletal anomalies, cerebral malformations, cardiovascular anomalies, and ophthalmological abnormalities. Little information has been known about GATAD2A variants in disease. One previous report of a nonsense variant in GATAD2A (c.19C>T, p.R7∗) linked it to elevated levels of fetal hemoglobin, but made no mention of neurodevelopmental status.18 The report also identified several predicted benign GATAD2A missense variants with only one present within CR2 (p.N382S). None of these were associated with changes in fetal hemoglobin levels and no neurodevelopmental data was reported. Whether individuals in our cohort have elevated fetal hemoglobin is unknown.
The array of clinical phenotypes in our GARND cohort show moderate overlap with other dominant NuRDopathies (CHD3RS, CHD4RS, CHD5RS, and GAND).5,6,7,8,16 In all five disorders, GDD, hypotonia, and dysmorphic craniofacial features (broad forehead, hypertelorism, wide nasal bridge) have been noted. For all disorders except CHD5RS, macrocephaly and ventriculomegaly have been observed, although it is more common in GAND than in CHD3RS and CHD4RS.5 The phenotypes of GARND, GAND, CHD3RS, and CHD5RS all include speech deficits, whereas CHD4RS, GARND, and GAND phenotypes include congenital cardiac defects. GARND, GAND, and CHD3RS also share neonatal feeding difficulties. Unlike GAND’s relatively consistent phenotype across affected individuals, the GARND phenotypes reported here were more variable across individuals. Whether this was the result of the cohort’s unique variant makeup (and lack of redundancy) or GARND itself needs to be determined. As more cases of GARND are defined, it will be important to evaluate the frequency of divergent phenotypic NuRDopathy features such as macrocephaly, kidney disease, and hearing impairment, and the pattern of shared features between GARND and other NuRDopathies.
Our findings provide evidence in support of a hypothesis wherein pathogenic heterozygous variations in GATAD2A act through a LoF haploinsufficiency mechanism in affected individuals. The variability in clinical phenotypes in our cohort, coupled with the variable predicted intolerance across GATAD2A, may reflect the importance of 2A-NuRD during development and/or a polygenic effect based on genetic background. Furthermore, pleiotropic functions of GATAD2A in tissues may reflect cell-type variation in GATAD2 and NuRD paralog redundancy. Of note, we observed neurodevelopmental features in individuals with GATAD2A variants that could cause haploinsufficiency of CR2 function, which are predicted to disrupt interactions with the CTD region of CHD paralogs. These findings suggest that neural development is particularly susceptible to impairment of the chromatin remodeling activity of 2A-NuRD and 2B-NuRD complexes. Our co-immunoprecipitation findings confirm diminished GATAD2A-CHD interaction in the presence of the CR2-localized p.C420Y missense variant (similar to a number of patient variants in GAND).5,16 We showed that the GATAD2A p.C420Y affects a cysteine within a zinc-binding motif, which is also disrupted by previously reported GATAD2B p.C420 variants in individuals with GAND (Figure 2C).5,16 We also found that the affected glycine of GATAD2A p.G402V is homologous to that of the previously identified GATAD2B p.G406R.5,16 Together, these findings provide evidence for the pathogenicity of GATAD2A p.G402V and p.C420Y. We hypothesize different mechanisms for dysfunction for the p.T209I and p.I627Tfs variants, which do not localize to or disrupt CR1 or CR2 interaction domains. For instance, the p.T209I variant lies within a region containing three PPPLφ motifs, which are important for GATAD2A interactions with ZMYND8 and the subsequent recruitment of 2A-NuRD to sites of DNA damage for assisting in repair.21 It remains unclear if there are other specific functions of the three PPPLφ motifs in neural development. The C-terminal p.I627Tfs variant, which lies within a previously unreported GATAD2 motif, may disrupt an important interaction with other as yet unknown proteins involved in 2A-NuRD (and likely 2B-NuRD) function. Alternatively, although less likely given its extreme C-terminal location, the frameshift variant may trigger mRNA or protein degradation, resulting in haploinsufficiency. Of course, this variant could also represent a benign change in an individual with a different developmental disorder. Additional cases will help to confirm GATAD2A-related pathogenicity in development and refine the GARND clinical spectrum.
Our human genetics findings suggest distinct but overlapping pathogenic mechanisms of variants in GATAD2 paralogs. It remains unclear if GATAD2A and GATAD2B provide full, partial, or no redundancy in NuRD function during development, and whether their functions are cell-type specific with minimal overlap of expression. In prior studies, GATAD2B overexpression failed to rescue a GATAD2A-related phenotype in GATAD2A-depleted induced pluripotent stem cells, potentially indicating non-redundancy with GATAD2B.22 GATAD2B non-redundancy is also demonstrated by the unique GATAD2A interaction with ZMYND8 through PPPLφ domains, which are absent in GATAD2B.21 To date, there are no specific assays of GATAD2B function and therefore there are no data to determine if GATAD2A could compensate for its deficiency. Additional work is necessary to assess the degree to which, if any, GATAD2B and GATAD2A compensatory activity mediates variable expressivity in or between GARND and/or GAND.
In summary, we report five unrelated individuals with heterozygous variants in GATAD2A and a neurodevelopmental disorder characterized by GDD, structural brain defects, and craniofacial dysmorphism. Discovery of additional affected individuals will provide further insight into the breadth of pathogenic genetic variation and constellation of clinical features associated with GARND.
Acknowledgments
The authors thank all individuals with GATAD2A variants and their families for their participation in this work. A.M.I. and A.V.T. wish to thank Julia Tagoe for clinical support, and acknowledge work done under the Care4Rare Canada Consortium funded by Genome Canada and the Ontario Genomics Institute, the Canadian Institutes of Health Research, Ontario Research Foundation, Genome Alberta, Genome British Columbia, Génome Québec, Children's Hospital of Eastern Ontario Foundation, and the Alberta Children's Hospital Foundation. We would like to thank Dr. Steven Leber for interpretation of brain MRI findings. We would like to thank the contributors to GeneMatcher, MyGene2, and The University of Washington Center for Mendelian Genomics and GeneDx for use of data. Sequencing provided by the University of Washington Center for Mendelian Genomics (UW-CMG) was funded by NHGRI and NHLBI grants UM1 HG006493 and U24 HG008956. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. This research was supported by NIH R01 DC018404 and the Ravitz Foundation Professorship granted to D.M.M., as well as the Fashion Industries Guild Endowed Fellowship for the Undiagnosed Diseases Program, the Cedars-Sinai Diana and Steve Marienhoff Fashion Industries Guild Endowed Fellowship in Pediatric Neuromuscular Diseases, and the Cedars-Sinai institutional funding program awarded to T.M.P.
Declaration of interests
A.M.I., M.J.B., and J.X.C. serve in a voluntary capacity as members of the Human Genetics and Genomics Advances (HGG-A) Editorial Board.
Web resources
Genome Aggregation Database, https://gnomad.broadinstitute.org/
NHLBI Exome Sequencing Project, https://evs.gs.washington.edu/EVS/
Online Mendelian Inheritance in Man, http://www.omim.org
Franklin Genoox, https://franklin.genoox.com/clinical-db/home
ClinVar, https://www.ncbi.nlm.nih.gov/clinvar/
NCBI HomoloGene tool, https://www.ncbi.nlm.nih.gov/homologene
UniProt, https://www.uniprot.org/
AlphaFold Protein Structure Database, https://alphafold.ebi.ac.uk/
Combined Annotation Dependent Depletion Tool, https://cadd.gs.washington.edu/
Polymorphism Phenotyping v.2, http://genetics.bwh.harvard.edu/pph2/
Sorting Intolerant From Tolerant, https://sift.bii.a-star.edu.sg/
varSEAK, varseak.bio
MetaDome, stuart.radboudumc.nl/metadome
Data and code availability
All data relevant to the study are included in the report. Identified variants have been submitted to ClinVar (accession nos. VCV001705818.2, VCV001705819.2, VCV001705820.2, VCV001705821.2, and VCV001705822.2).
References
- 1.Mossink B., Negwer M., Schubert D., Nadif Kasri N. The emerging role of chromatin remodelers in neurodevelopmental disorders: a developmental perspective. Cell. Mol. Life Sci. 2021;78:2517–2563. doi: 10.1007/s00018-020-03714-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Lin G.N., Song W., Wang W., Wang P., Yu H., Cai W., Jiang X., Huang W., Qian W., Chen Y., et al. De novo mutations identified by whole-genome sequencing implicate chromatin modifications in obsessive-compulsive disorder. Sci. Adv. 2022;8:eabi6180. doi: 10.1126/sciadv.abi6180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Satterstrom F.K., Kosmicki J.A., Wang J., Breen M.S., De Rubeis S., An J.Y., Peng M., Collins R., Grove J., Klei L., et al. Large-scale exome sequencing study implicates both developmental and functional changes in the neurobiology of autism. Cell. 2020;180:568–584.e23. doi: 10.1016/j.cell.2019.12.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Parenti I., Kaiser F.J. Cornelia de Lange syndrome as paradigm of chromatinopathies. Front. Neurosci. 2021;15:774950. doi: 10.3389/fnins.2021.774950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Pierson T.M., Otero M.G., Grand K., Choi A., Graham J.M., Young J.I., Mackay J.P. The NuRD complex and macrocephaly associated neurodevelopmental disorders. Am. J. Med. Genet. C Semin. Med. Genet. 2019;181:548–556. doi: 10.1002/ajmg.c.31752. [DOI] [PubMed] [Google Scholar]
- 6.Snijders Blok L., Rousseau J., Twist J., Ehresmann S., Takaku M., Venselaar H., Rodan L.H., Nowak C.B., Douglas J., Swoboda K.J., et al. CHD3 helicase domain mutations cause a neurodevelopmental syndrome with macrocephaly and impaired speech and language. Nat. Commun. 2018;9:4619. doi: 10.1038/s41467-018-06014-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Weiss K., Lazar H.P., Kurolap A., Martinez A.F., Paperna T., Cohen L., Smeland M.F., Whalen S., Heide S., Keren B., et al. The CHD4-related syndrome: a comprehensive investigation of the clinical spectrum, genotype-phenotype correlations, and molecular basis. Genet. Med. 2020;22:389–397. doi: 10.1038/s41436-019-0612-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Parenti I., Lehalle D., Nava C., Torti E., Leitão E., Person R., Mizuguchi T., Matsumoto N., Kato M., Nakamura K., et al. Missense and truncating variants in CHD5 in a dominant neurodevelopmental disorder with intellectual disability, behavioral disturbances, and epilepsy. Hum. Genet. 2021;140:1109–1120. doi: 10.1007/s00439-021-02283-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hoffmann A., Spengler D. Chromatin remodeling complex NuRD in neurodevelopment and neurodevelopmental disorders. Front. Genet. 2019;10:682. doi: 10.3389/fgene.2019.00682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Torchy M.P., Hamiche A., Klaholz B.P. Structure and function insights into the NuRD chromatin remodeling complex. Cell. Mol. Life Sci. 2015;72:2491–2507. doi: 10.1007/s00018-015-1880-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lai A.Y., Wade P.A. Cancer biology and NuRD: a multifaceted chromatin remodelling complex. Nat. Rev. Cancer. 2011;11:588–596. doi: 10.1038/nrc3091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Basta J., Rauchman M. The nucleosome remodeling and deacetylase complex in development and disease. Transl. Res. 2015;165:36–47. doi: 10.1016/j.trsl.2014.05.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Nitarska J., Smith J.G., Sherlock W.T., Hillege M.M.G., Nott A., Barshop W.D., Vashisht A.A., Wohlschlegel J.A., Mitter R., Riccio A. A functional switch of NuRD chromatin remodeling complex subunits regulates mouse cortical development. Cell Rep. 2016;17:1683–1698. doi: 10.1016/j.celrep.2016.10.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hoffmeister H., Fuchs A., Erdel F., Pinz S., Gröbner-Ferreira R., Bruckmann A., Deutzmann R., Schwartz U., Maldonado R., Huber C., et al. CHD3 and CHD4 form distinct NuRD complexes with different yet overlapping functionality. Nucleic Acids Res. 2017;45:10534–10554. doi: 10.1093/nar/gkx711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Weiss K., Terhal P.A., Cohen L., Bruccoleri M., Irving M., Martinez A.F., Rosenfeld J.A., Machol K., Yang Y., Liu P., et al. De novo mutations in CHD4, an ATP-dependent chromatin remodeler gene, cause an intellectual disability syndrome with distinctive dysmorphisms. Am. J. Hum. Genet. 2016;99:934–941. doi: 10.1016/j.ajhg.2016.08.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Shieh C., Jones N., Vanle B., Au M., Huang A.Y., Silva A.P.G., Lee H., Douine E.D., Otero M.G., Choi A., et al. GATAD2B-associated neurodevelopmental disorder (GAND): clinical and molecular insights into a NuRD-related disorder. Genet. Med. 2020;22:878–888. doi: 10.1038/s41436-019-0747-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Sher F., Hossain M., Seruggia D., Schoonenberg V.A.C., Yao Q., Cifani P., Dassama L.M.K., Cole M.A., Ren C., Vinjamur D.S., et al. Rational targeting of a NuRD subcomplex guided by comprehensive in situ mutagenesis. Nat. Genet. 2019;51:1149–1159. doi: 10.1038/s41588-019-0453-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Liang Y., Zhang X., Liu Y., Wang L., Ye Y., Tan X., Pu J., Zhang Q., Bao X., Wei X., et al. GATA zinc finger domain-containing protein 2A (GATAD2A) deficiency reactivates fetal haemoglobin in patients with β-thalassaemia through impaired formation of methyl-binding domain protein 2 (MBD2)-containing nucleosome remodelling and deacetylation (NuRD) complex. Br. J. Haematol. 2021;193:1220–1227. doi: 10.1111/bjh.17511. [DOI] [PubMed] [Google Scholar]
- 19.Ma C., Gu C., Huo Y., Li X., Luo X.J. The integrated landscape of causal genes and pathways in schizophrenia. Transl. Psychiatry. 2018;8:67. doi: 10.1038/s41398-018-0114-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hauberg M.E., Zhang W., Giambartolomei C., Franzén O., Morris D.L., Vyse T.J., Ruusalepp A., CommonMind Consortium. Sklar P., Björkegren J.L.M., et al. Large-scale identification of common trait and disease variants affecting gene expression. Am. J. Hum. Genet. 2017;101:157. doi: 10.1016/j.ajhg.2017.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Spruijt C.G., Luijsterburg M.S., Menafra R., Lindeboom R.G.H., Jansen P.W., Edupuganti R.R., Baltissen M.P., Wiegant W.W., Voelker-Albert M.C., Matarese F., et al. ZMYND8 Co-localizes with NuRD on target genes and regulates poly(ADP-ribose)-dependent recruitment of GATAD2A/NuRD to sites of DNA damage. Cell Rep. 2016;17:783–798. doi: 10.1016/j.celrep.2016.09.037. [DOI] [PubMed] [Google Scholar]
- 22.Mor N., Rais Y., Sheban D., Peles S., Aguilera-Castrejon A., Zviran A., Elinger D., Viukov S., Geula S., Krupalnik V., et al. Neutralizing Gatad2a-chd4-Mbd3/NuRD complex facilitates deterministic induction of naive pluripotency. Cell Stem Cell. 2018;23:412–425.e10. doi: 10.1016/j.stem.2018.07.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Li J., Ma Z., Shi M., Malty R.H., Aoki H., Minic Z., Phanse S., Jin K., Wall D.P., Zhang Z., et al. Identification of human neuronal protein complexes reveals biochemical activities and convergent mechanisms of action in autism spectrum disorders. Cell Syst. 2015;1:361–374. doi: 10.1016/j.cels.2015.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Marino S., Nusse R. Mutants in the mouse NuRD/Mi2 component P66α are embryonic lethal. PLoS One. 2007;2:e519. doi: 10.1371/journal.pone.0000519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Sobreira N., Schiettecatte F., Valle D., Hamosh A. GeneMatcher: a matching tool for connecting investigators with an interest in the same gene. Hum. Mutat. 2015;36:928–930. doi: 10.1002/humu.22844. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Philippakis A.A., Azzariti D.R., Beltran S., Brookes A.J., Brownstein C.A., Brudno M., Brunner H.G., Buske O.J., Carey K., Doll C., et al. The Matchmaker Exchange: a platform for rare disease gene discovery. Hum. Mutat. 2015;36:915–921. doi: 10.1002/humu.22858. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Adzhubei I., Jordan D.M., Sunyaev S.R. Predicting functional effect of human missense mutations using PolyPhen-2. Curr. Protoc. Hum. Genet. 2013;Chapter 7:Unit7.20. doi: 10.1002/0471142905.hg0720s76. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Steinhaus R., Proft S., Schuelke M., Cooper D.N., Schwarz J.M., Seelow D. MutationTaster2021. Nucleic Acids Res. 2021;49:W446–W451. doi: 10.1093/nar/gkab266. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Sim N.L., Kumar P., Hu J., Henikoff S., Schneider G., Ng P.C. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. 2012;40:W452–W457. doi: 10.1093/nar/gks539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Rentzsch P., Witten D., Cooper G.M., Shendure J., Kircher M. CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 2019;47:D886–D894. doi: 10.1093/nar/gky1016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wiel L., Baakman C., Gilissen D., Veltman J.A., Vriend G., Gilissen C. MetaDome: pathogenicity analysis of genetic variants through aggregation of homologous human protein domains. Hum. Mutat. 2019;40:1030–1038. doi: 10.1002/humu.23798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Steele J.L., Morrow M.M., Sarnat H.B., Alkhunaizi E., Brandt T., Chitayat D.A., DeFilippo C.P., Douglas G.V., Dubbs H.A., Elloumi H.Z., et al. Semaphorin-plexin signaling: from axonal guidance to a new X-linked intellectual disability syndrome. Pediatr. Neurol. 2022;126:65–73. doi: 10.1016/j.pediatrneurol.2021.10.008. [DOI] [PubMed] [Google Scholar]
- 33.Khayat W., Hackett A., Shaw M., Ilie A., Dudding-Byth T., Kalscheuer V.M., Christie L., Corbett M.A., Juusola J., Friend K.L., et al. A recurrent missense variant in SLC9A7 causes nonsyndromic X-linked intellectual disability with alteration of Golgi acidification and aberrant glycosylation. Hum. Mol. Genet. 2019;28:598–614. doi: 10.1093/hmg/ddy371. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Brackertz M., Boeke J., Zhang R., Renkawitz R. Two highly related p66 proteins comprise a new family of potent transcriptional repressors interacting with MBD2 and MBD3. J. Biol. Chem. 2002;277:40958–40966. doi: 10.1074/jbc.M207467200. [DOI] [PubMed] [Google Scholar]
- 35.Spruijt C.G., Gräwe C., Kleinendorst S.C., Baltissen M.P.A., Vermeulen M. Cross-linking mass spectrometry reveals the structural topology of peripheral NuRD subunits relative to the core complex. FEBS J. 2021;288:3231–3245. doi: 10.1111/febs.15650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Brackertz M., Gong Z., Leers J., Renkawitz R. p66α and p66β of the Mi-2/NuRD complex mediate MBD2 and histone interaction. Nucleic Acids Res. 2006;34:397–406. doi: 10.1093/nar/gkj437. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Richards S., Aziz N., Bale S., Bick D., Das S., Gastier-Foster J., Grody W.W., Hegde M., Lyon E., Spector E., et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of medical genetics and Genomics and the association for molecular pathology. Genet. Med. 2015;17:405–424. doi: 10.1038/gim.2015.30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Torrado M., Low J.K.K., Silva A.P.G., Schmidberger J.W., Sana M., Sharifi Tabar M., Isilak M.E., Winning C.S., Kwong C., Bedward M.J., et al. Refinement of the subunit interaction network within the nucleosome remodelling and deacetylase (NuRD) complex. FEBS J. 2017;284:4216–4232. doi: 10.1111/febs.14301. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
All data relevant to the study are included in the report. Identified variants have been submitted to ClinVar (accession nos. VCV001705818.2, VCV001705819.2, VCV001705820.2, VCV001705821.2, and VCV001705822.2).



