Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

bioRxiv logoLink to bioRxiv
[Preprint]. 2024 Aug 24:2024.08.23.609190. [Version 1] doi: 10.1101/2024.08.23.609190

Dynamic convergence of autism disorder risk genes across neurodevelopment

Meilin Fernandez Garcia 1,*, Kayla Retallick-Townsley 1,4,*, April Pruitt 2,**, Elizabeth Davidson 3,**, Yi Dai 3,**, Sarah E Fitzpatrick 2,**, Annabel Sen 1,***, Sophie Cohen 1,***, Olivia Livoti 1,***, Suha Khan 3,***, Grace Dossou 3, Jen Cheung 1, PJ Michael Deans 1, Zuoheng Wang 3, Laura Huckins 1,2,4, Ellen Hoffman 2,3, Kristen Brennand 1,2,4
PMCID: PMC11370590  PMID: 39229156

Abstract

Over a hundred risk genes underlie risk for autism spectrum disorder (ASD) but the extent to which they converge on shared downstream targets to increase ASD risk is unknown. To test the hypothesis that cellular context impacts the nature of convergence, here we apply a pooled CRISPR approach to target 29 ASD loss-of-function genes in human induced pluripotent stem cell (hiPSC)-derived neural progenitor cells, glutamatergic neurons, and GABAergic neurons. Two distinct approaches (gene-level and network-level analyses) demonstrate that convergence is greatest in mature glutamatergic neurons. Convergent effects are dynamic, varying in strength, composition, and biological role between cell types, increasing with functional similarity of the ASD genes examined, and driven by cell-type-specific gene co-expression patterns. Stratification of ASD genes yield targeted drug predictions capable of reversing gene-specific convergent signatures in human cells and ASD-related behaviors in zebrafish. Altogether, convergent networks downstream of ASD risk genes represent novel points of individualized therapeutic intervention.

Keywords: Human induced pluripotent stem cells, CRISPR screen, neural progenitor cells, glutamatergic neurons, GABAergic neurons, autism spectrum disorder, psychiatric genomics, convergence, precision medicine

INTRODUCTION

The diverse common1 and rare2 genetic variants linked to autism spectrum disorder (ASD) show coordinated3 and convergent46 effects. In fact, overlapping downstream impacts7 are widely hypothesized to explain how similar clinical phenotypes result from heterogeneous gene mutations4. The 102 highly penetrant risk genes now associated with ASD2,813 are typically expressed at high levels in the human cortex and early in development14,15, enriched in excitatory and inhibitory neuronal lineages2,16, co-expressed1719 in the post-mortem brain, and result in highly interconnected protein-protein interactomes2022. Knockdown of subsets of ASD genes in human neural progenitor cells (NPCs)2325, cerebral organoids2628, and developing mouse29, tadpole30 and zebrafish31 brains reveal overlapping impacts on neurogenesis. Given that excitatory-inhibitory (E:I) imbalance is widely believed to underlie ASD3234, the extent to which ASD genes subsequently converge in mature neurons to cause functional deficits is largely unknown.

ASD genes are enriched for roles in gene expression regulation (e.g., chromatin regulators and transcription factors) and neuronal communication (e.g., synaptic function)2,813. The first chromatin modifier unequivocally associated with ASD was the chromodomain helicase DNA binding protein 8 (CHD8)35, with temporally specific roles in regulating chromatin compaction, histone modification, gene expression, and splicing3638, concomitant with effects on neurodevelopmental excitatory and inhibitory trajectories26,3941. Given the large number of other chromatin remodeling genes now linked to ASD5, and extending upon our recent finding that psychiatric risk genes with shared biological function and brain co-expression result in stronger convergent networks42,43, here we focus on a subset of ASD genes with overlapping function (i.e., chromatin regulators) and test how differences across developmental time points and neural cell types alter the nature of convergence resolved.

Precision medicine seeks to tailor treatments to individual patients44; for example, cancer45 and monogenic disease46 patients with defined molecular subtypes receive targeted treatments. Patient stratification in ASD has proven challenging. Gene classification (e.g., regulatory versus neuronal communication genes) does not predict clinical outcomes such as differential47 or co-occuring2 diagnosis of neurodevelopmental delay or disparities in IQ2. Although subgroups of ASD genes demonstrate opposing effects on telencephalon size in vivo30 or abnormal WNT/βcatenin responses in vitro23, this has not resulted in advances in patient stratification or treatment. The extent to which convergent downstream targets of ASD genes can be manipulated to prevent or ameliorate disorder signatures and phenotypes remains unclear.

Many ASD genes have distinct phenotypes across cell types, regulating proliferation and patterning in neural progenitor cells (SYNGAP148, NRXN149,50, CHD826,39, ARID1B51,52), excitatory transmission by glutamatergic neurons (SYNGAP153, NRXN154, CHD837,55, SHANK356), and inhibitory transmission by GABAergic neurons (NRXN157, CHD855, ARID1B58, SHANK359). Here we test the hypothesis that the cellular context in which ASD genes are evaluated will impact the nature of convergence. We report a pooled CRISPR-knockout (KO) strategy targeting 29 ASD genes (ANK3, ARID1B, ASH1L, ASXL3, BCL11A, CHD2, CHD8, CREBBP, DPYSL2, FOXP2, KMT5B (SUV420H1), KDM5B, KDM6B, KMT2C, MBD5, MED13L, NRXN1, PHF12, PHF21A, POGZ, PPP2R5D, SCN2A, SETD5, SHANK3, SIN3A, SKI, SLC6A1, SMARCC2, WAC) for loss-of-function (LoF) mutations across human neurodevelopment, resolving shared and distinct effects in induced NPCs, glutamatergic neurons, and GABAergic neurons. Two distinct methods (gene-level and network-level) demonstrated the greatest convergence in mature glutamatergic neurons. Overall, the strength and composition of the convergent networks reflected the co-expression patterns of the specific ASD genes considered, varied across cell types, and informed unique aspects of disease biology. Stratification of ASD genes yielded targeted drug predictions capable of reversing gene-specific convergent signatures in human cells and ASD-related behaviors in zebrafish. Altogether, cell-type-specific convergence of ASD genes informed disease mechanisms and prioritized novel therapies that predicted pharmacological responses in vivo. We tested the extent to which convergence informs genetic stratification of cases, asking which subsets of diverse ASD risk genes converge on a smaller number of target genes.

RESULTS

We49,6062 and others54,56,6368 demonstrated that iGLUTs (induced via transient overexpression of NGN2) are >95% glutamatergic neurons, robustly express excitatory genes, and show spontaneous excitatory synaptic activity by three-to-four weeks in vitro. Likewise, we57,69,70 and others63,71 reported that iGABA neurons (generated via transient overexpression of ASCL1 and DLX2) are >95% GABAergic neurons, robustly express inhibitory genes, and show spontaneous inhibitory synaptic activity by five-to-six weeks. Neural progenitor cells are rapidly generated by 48-hour induction with NGN272,73. iGLUTs, iGABA and iNPCs express most ASD genes, including all genes prioritized herein60.

A systematic comparison of ASD gene effects across neuronal cell types

From the 102 highly penetrant loss-of-function (LoF) gene mutations associated with ASD (58 gene expression regulation, 24 neuronal communication genes, cytoskeletal, and 10 misc.)2, we used gene ontology and primary literature to identify 21 epigenetic modifiers specifically involved in chromatin organization, rearrangement, and modification (ASH1L, ARID1B, ASXL3, BCL11A, CHD2, CHD8, CREBBP, PPP2R5D, KDM5B, KDM6B, KMT2C, KMT5B (SUV420H1), MBD5, MED13L, PHF12, PHF21A, SETD5, SIN3A, SKI, SMARCC2, WAC), as well as two transcription factors with putative roles as chromatin regulators (FOXP2, POGZ). Gene regulatory transcription factors, general transcription factors, and DNA replication genes were excluded, while three extensively studied synaptic genes (NRXN1, SCN2A, SHANK3) and three under-explored neuronal communication genes (ANK3, DPYSL2, SLC6A1) strongly associated with ASD were added (SI Fig. 1). Of these 29 ASD genes, many also have LoF gene mutations associated with neurodevelopmental delay, schizophrenia, bipolar disorder, and epilepsy (Fig. 1A, bold as most significant with ASD2) and/or general associations with GWAS for many neuropsychiatric disorders (MAGMA74) (Fig. 1B), indicating a pleotropic effect and consistent with the shared genetic liability across neuropsychiatric disorders75.

Figure 1. ASD gene knockout (KO) effects have stronger within cell-type correlations than within gene correlations.

Figure 1.

(A) List of targeted risk genes associated with autism spectrum disorder (ASD) and neurodevelopmental disorders (NDD). The boldest of the name indicates the strength of ASD correlation and genes are annotated if they are also rare variant targets of SCZ, BIP, and epilepsy (EPI) (B) MAGMA enrichments of targeted ASD genes across psychiatric GWAS. (C) Illustration of hiPSC derived cell-type specific scCRISPR-KO screen. Representative immunofluorescence for cell type markers for NPCs (DAPI/Nestin), mature iGLUTs (DAPI/MAP2/vGLUT), and mature iGABAs (DAPI/MAP2/GABA). (D) Transcriptomic impact of ASD/NDD gene KO across cell-type specific screens represented as the number of nominally significant (p<0.01) differentially expressed genes (DEGs). The degree of KO as indicated by gene expression (logFC) is not correlated with the proportion of significant DEGs (SI Fig. 8F,iii). (E) PCA of transcriptomic similarity between scramble controls across cell-types shows clustering based on cellular maturity, with mature iGLUTs and iGABAs and immature iGLUTs and NPCs clustering together respectively. (F) Pearson’s correlation matrix of log2FC across all KOs and cell-types demonstrated greater transcriptomic similarity within cell-types across distinct KOs compared to between cell-types and the same KO. (G) Cross cell-type correlation network diagram across KO perturbations demonstrates greater clustering between NPCs, immature iGLUTs, and surprisingly mature iGABAs, with KO signatures in mature GLUTs the most distinct and most strongly correlated. (H) GWAS disorder enrichments of KO DEGs across cell-types reveal unique KO-by-cell type common variant associations. (#nominal p-value<0.05, *FDR<0.05, **FDR<0.01, *FDR<0.001).

Although regulatory ASD genes showed a bias for prenatal expression, whereas neuronal communicational genes had highest expression in infancy, there were no significant differences in cell type specific expression patterns between either2. Towards resolving whether regulatory genes confer a continuous or distinct period of ASD susceptibility across neurodevelopment, we endeavored to knockout (KO) regulatory ASD genes in donor-matched neural progenitor cells, immature glutamatergic neurons, and mature glutamatergic neurons, contrasting effects across the glutamatergic lineage to effects in mature GABAergic neurons. A pooled CRISPR-KO approach (ECCITE-seq76) combined direct detection of sgRNAs and single-cell RNA sequencing to compare loss-of-function effects across 29 ASD genes. The CRISPR-KO library was generated from pre-validated gRNAs (three to four gRNAs per gene). Sequencing of the gRNA library confirmed the presence of gRNAs targeting 24 genes (ANK3, ARID1B, ASH1L, ASXL3, BCL11A, CHD2, CHD8, DPYSL2, FOXP2, KMT5B (SUV420H1), KDM5B, KDM6B, KMT2C, MBD5, MED13L, NRXN1, PHF12, PHF21A, SCN2A, SETD5, SIN3A, SKI, SMARCC2, WAC), but three were present at lower frequency and less likely to be detected (DPYSL2, FOXP2, SCN2A) (SI Fig. 2). Control hiPSCs were induced to iNPCs (here SNaPs)72, iGLUTs65, and iGABAs71, transduced with lentiviral-Cas9v2 (Addgene #98291) and subsequently with the pooled lentiviral gRNA library three days before harvest at day 7 (iNPC and immature iGLUT), day 21 (iGLUT), and day 36 (iGABA) (Fig. 1C). The gene expression patterns of non-perturbed iNPCs and iNeurons (>30% of all pooled cells) were significantly correlated with fetal brain cells and cortical adult neurons (SI Fig. 9).

Successful ASD gene perturbations were identified for 23 ASD genes (Fig. 1A,B,D): 16 in iNPCs, 15 in immature iGLUT neurons, and 21 in mature iGLUT and iGABA neurons. Nine ASD genes were perturbed in all four cell types (SI Fig. 8F,iii, Fig. 1D). We identified the sgRNA and resolved the transcriptome of 36,016 cells (12,107 iNPC, 3,310 d7 iGLUT, 11,802 d21 iGLUT, and 8,797 d36 iGABA). An average of 474 cells per sgRNA were successfully perturbated by a single gRNA (712 iNPC, 220 d7 iGLUT, 536 d21 iGLUT, 399 d36 iGABA), for a total of 33,389 perturbed cells and 2,627 controls (SI Fig. 3). Differentially expressed genes (DEGs, pFDR<0.05) across multiple individual KOs shared significant gene ontology enrichments in extracellular matrix structural constituents conferring tensile strength in NPCs (downregulated, FDR<0.05), NADH dehydrogenase activity in iGABAs (upregulated, FDR<0.01), and translation initiation factor activity, ATP hydrolysis activity, and microtubule plus-end binding, and others in mature iGLUTs (upregulated, FDR<0.05) – notably, mature iGLUT and iGABA neurons had the greatest number of shared enrichments across individual KOs, suggesting that diverse ASD genes indeed impacted similar neural processes and pathways, but that these impacts were cell-type and developmentally specific (SI Fig. 10, SI Data 1).

Across individual ASD genes, KO typically resulted in the greatest overall transcriptomic impact in mature iGLUTs, which typically also showed the largest number of DEGs (p<0.05), an effect that was not driven by differences in the extent of perturbation of the ASD gene between cell types (Fig. 1D, SI Fig. 8Fiii). Moreover, the transcriptomic effects of individual ASD gene KOs cluster by cell type, with the strongest correlations between distinct ASD KOs in mature iGLUTs and the weakest between ASD KOs in iNPCs (Fig. 1E). ASD KO transcriptomic signatures in mature iGLUTs were more distinct, least correlated with other cell types, and most highly correlated with each other (Fig. 1G). The transcriptomic effects of ASD genes within each cell type revealed unique disorder enrichments. Whereas transcriptomic effects of ASD KO in iNPCs and immature iGLUTs resulted in limited overlap with psychiatric GWAS, many ASD KOs in mature iGLUTs (13 of 21) resulted in significant GWAS associations, overwhelmingly to schizophrenia and bipolar disorder (Fig. 1H). Such clear associations between rare and highly penetrant gene mutations with more common and small effect GWAS targets are widely speculated to exist75,77,78, but supported by only limited evidence to date.

ASD gene knockouts result in cell-type-specific convergent genes and networks.

“Convergent genes” are the common DEGs with significant and shared direction of effect across all ASD gene perturbations (FDR adjusted pmeta<0.05, Cochran’s heterogeneity Q-test pHet > 0.05), as we previously described for the perturbation of schizophrenia GWAS target genes42,43 (Fig. 2A). Individual ASD KO effects rarely converged between iNPCs, iGLUTs, and iGABAs. In fact, frequently the only significant gene (FDR p<0.05) with concordant dysregulation between cell types was the targeted ASD gene itself (SI Fig. 11). The cell type specific nature of KO effects has rarely been tested systematically but given that the ASD epigenetic modifiers tested were expressed in all cell types, this lack of overlap might be considered unexpected. Moreover, the unique cross-cell-type convergent gene list generated by each ASD KO showed distinct enrichments for psychiatric disorder GWAS risk genes (SI Fig. 11).

Figure 2. Cell-type-specific gene-level convergence resulting from ASD LoF genes is greatest in mature glutamatergic neurons.

Figure 2.

(A) Schematic explaining cross-KO cell-type specific convergence at the individual gene level using differential gene expression meta-analysis. (B) Convergence across 9 ASD gene KO perturbations is largely unique to a given neuronal cell type. Rank-rank hypergeometric (RRHO) test exploring correlation of convergent genes shared across 9 KO perturbations (RRHO score = −log10*direction of effect) between cell-types. The top right quadrant represents up-regulated genes (meta-analysis z-score >0) for the y-axis and x-axis cell-type. The bottom left quadrant represents down-regulated convergent genes (meta-analysis z-score <0) for the y-axis and x-axis cell-type. Significance is represented by color with red regions representing significantly convergent gene expression. (C) Venn diagram representing the absolute overlap (regardless of direction of dysregulation) of cell-type specific convergent genes shared across 9 KOs. (D) (i) Cell-type specific convergence across these 9 KOs is uniquely enriched for neuropsychiatric GWAS-risk associated genes by, ASD common variants only enriched in immature iGLUT convergence (direction of triangle represents beta direction and significance annotation FDRs from two-sided enrichment testing with MAGMA). (ii) ORA of cross-KO cell-type specific convergence and rare variant target genes related to SCZ, ASD, and ID. (#unadjusted p-value=<0.05, *FDR<=0.05, **FDR<0.01, ***FDR<0.001) (E) (i) Across all combinations of 2–5 genes across 9 KOs, the average magnitude of convergence was highest in iGLUTs. (ii) The magnitude of convergence between the same KOs tested in different cells was highly correlated – with the strongest relationship between NPCs, immature, and mature iGLUTs. (iii) Despite strong significant correlation between convergence degree, the top-most convergent KO combinations were unique to each cell-type. (iv) Gene-level convergence based on significant DEGs often corresponds with the cell-type specific strength of correlation between KO effects on the transcriptomic – with the strongest correlations in mature iGLUTs. (F) Expanding the convergence analysis across random subsets of 2–5 genes across 16,14, 21, and 21 KOs for NPCs, immature iGLUTs, mature iGLUTs, and mature iGABAs respectively, the average magnitude of convergence was still highest in iGLUTs.

There were much greater shared effects observed between distinct ASD genes within the same cell type, than for the same ASD gene between cell types, with substantial convergent gene expression observed between different ASD gene KOs within the same cell type. Convergent genes downstream of the nine ASD genes with KO effects resolved across all four cell types were most similar between iNPCs and immature iGLUTs, particularly amongst up-regulated convergent genes, and least similar between iNPCs and mature iGLUTs or iGABAs (weighted correlations, Fig. 2B). Specifically, for these nine ASD KOs, we report 670, 1820, 11473, and 1983 significantly convergent genes in iNPCs, immature iGLUTs, mature iGLUTs, and mature iGABAs, respectively (FDR meta p-value<=0.05), with only four convergent genes overlapping across all cell types (Fig. 2C), and unique pathway enrichments by cell type (SI Fig. 1213, SI Data 1). In terms of absolute number of convergent genes, the extent of convergence varied by cell type, with the largest number of convergent genes evident in the mature iGLUTs and iGABAs (Fig. 2C). Whereas convergent targets of immature iGLUTs were significantly (FDR <0.05) enriched for the gene targets of ASD GWAS loci (MAGMA74), the convergent targets of mature iGLUTs (FDR <0.01) and iGABAs (FDR <0.05) were most enriched for schizophrenia and bipolar GWAS (Fig. 2Di). The convergent targets of iGLUTs were most enriched for rare variant risk genes associated with ASD and schizophrenia (FDR <0.05, FDR<0.08) (Fig. 2D,ii). Across all possible combinations of these nine ASD genes (combinations of 2–5 genes across all nine KOs equaling 152 unique combinations), the average strength of convergence (measured at the ratio of convergent genes to the average number of DEGs in a set, Fig. 2A,4) was highest in iGLUTs (Fig. 2E,i), but highly correlated between neural cell types, particularly between iNPCs, immature, and mature iGLUTs (Fig. 2E,ii). Top convergent sets (Fig. 2E,iii) frequently included ARID1B, and were influenced, in part and as expected, by the transcriptomic similarity between ASD KOs unique to each cell-type (Fig. 2E,ivv). Expanding the analyses across 2–5 gene combinations using all 23 perturbed ASD genes (2,724 to 7,906 random combinations) likewise confirmed that the average magnitude of convergence was highest in iGLUTs (Fig. 2F).

Figure 4. Functional similarity and brain co-expression between ASD/NDD genes predict gene-level and network-level convergence, with unique influences by cell-type.

Figure 4.

(A) Training random forest models for gene and network-level convergence - the proportion of cell-types was confirmed as balanced across the training sets and the testing sets and is representative the full data (SI Fig. 15). (B) Predictors included in the model include (i) semantic similarity scores based on gene ontology cellular component (C.C), biological process (B.P.) and molecular function (M.F) geneset membership, (ii) the scaled brain expression correlation of ASD genes in the adult dorsolateral prefrontal cortex (DLPFC), (iii) the cell-type, and (iv) the number of KOs represented in a convergent set, (C) (i) Distribution of gene-level convergence scores show cell-type specific distributions as described in Fig. 2 (n=20,240) – (ii) convergence is significantly correlated with functional similarity and brain co-expressions scores (FDR<0.001). (iii) Functional similarity, brain co-expression, cell-type, and the number of KOs assayed strongly predicted gene-level convergence (nTrees=500, 98% variance explained; mean of squared residuals<0.01) with cell-type as the most important predictor base on percent increase in mean error (%IncMSE) and the increase in node purity (IncNodePurity). (iv) Evaluation of the model in the testing set (n=6072) showed that predicted gene-level convergence by the model strongly correlated with the actual convergence (Pearson’s rho=0.99; Bonferroni p<0.001; root mean squared error (RMSE) =0.12); distribution of the number of nodes and error per tree in the random forest models reported in SI Fig. 15. (D) (i) Distribution of network-level convergence scores show cell-type specific distributions as described in Fig. 3 (n=3,204) – (ii) convergence is significantly correlated with CC and MF functional similarity (Holm’s adjP<0.001) nominally correlated with brain co-expressions scores (Holm’s adjP <0.08) (iii) Functional similarity, brain co-expression, cell-type, and the number of KOs assayed moderately predicted gene-level convergence (nTrees=500, 54% variance explained; mean of squared residuals=0.752) with the number of KOs/set and the C.C scores as the most important predictors. (iv) Evaluation of the model in the testing set (n=962) showed that predicted gene-level convergence by the model strongly correlated with the actual convergence (Pearson’s rho=0.714; Bonferroni p<0.001; RMSE=0.857). (E) Validation of predictor models in an independent scCRISPRa screen of 10 multifunctional SCZ common variant genes (eGenes) perturbed in iGLUTs demonstrated that network-level convergence model had greater predictive power. (i,ii) gene-level and network-level convergence distribution across 1,013 and 826 sets of 2–10 SCZ eGenes – convergence was significantly correlated with B.P (Holm’s adjP<0.05). M.F (Holm’s adjP<0.08), and BEC scores (Holm’s adjP <0.01) at the gene level and the number of KOs/set, B.EC and BP scores (Holm’s adjP<0.001) at the network level. While the gene-level convergence model has more predictive power within our data, it performed more poorly in external validation (RMSE=1.312, rho=0.07, Bonferroni P<0.02,) compared to the network-level model (RMSE=0.87, rho=0.364, Bonferroni P<0.001). (B.P score = semantic similarity of gene ontology: Biological Process membership between KO genes; C.C. score = semantic similarity of gene ontology: Cellular Component membership between KO genes; M.F score = semantic similarity of gene ontology: Molecular functions membership between KO genes; B.E.C = dorsolateral prefrontal cortex expression correlations between KO genes; nKOs = number of KO genes tested for convergence) (#nominal p-value<0.05, *adjP <0.05, **adjP <0.01, ***adjP <0.001).

“Convergent networks” are co-expressed genes that share similar expression patterns across ASD gene perturbations42,43 (Fig. 3A). The network connectivity score (“network convergence”) informs the strength and composition of convergent networks across cell types (i.e., networks with more interconnectedness and containing genes with greater functional similarity have increased convergence). Convergent networks generated from either just the 9 ASD genes perturbed in all cell types (502 unique ASD network sets per cell type) (Fig. 3B,i) or across a subset of random combinations of all 23 ASD genes (1,404 to 2,306 sets per cell type) (Fig. 3B,ii) revealed that the strength of the ASD convergent networks was greatest in iGLUTs. By considering just the nine ASD KOs, we observed that while network convergence was generally stronger in iGLUTs, this was not true for all possible networks; although many individual networks increased in strength over the course of differentiation from iNPCs to mature iGLUTs; there was a subset of networks that weakened from iNPCs to immature iGLUTs. Altogether, the number of convergent networks (Fig. 3C,i) and unique (Fig. 3C,ii) network nodes resolved was greatest in iGLUTs, and largely distinct across cell types. Unlike gene-level convergence, network-level convergence showed greater cell-type specificity and magnitude, indicated by the lack of overlapping node genes between cell types (Fig. 3C,ii) as well as the weak correlations of convergence strength between immature and mature cell-types (Fig. 3D; Tables 2,3). The node genes of the convergent networks generated in immature and mature iGLUTs were significantly over-represented for rare variants linked to schizophrenia and ASD (Fig. 3E) and many nodes were annotated common GWAS genes associated with schizophrenia (Fig. 3F).

Figure 3. Network-level convergence resulting from ASD LoF genes is cell-type-specific, greatest in glutamatergic neurons, largely reflects unique hub genes in each cell type, and is enriched for rare and common psychiatric risk in glutamatergic neurons.

Figure 3.

(A) Schematic explaining cross-KO cell-type specific convergence at the network level using Bayesian bi-clustering and undirected network reconstruction. (B) Testing the strength of network convergence across all possible combinations of 9 ASD/NDD KO perturbations, (i) the mean strength of network convergence is significantly different by cell-type, with the highest convergence present in immature iGLUTs. However, the same KO combinations test in one cell type may not resolve convergence in another cell type – indicating cell-type specificity. Each point represents the calculated convergence strength of one resolved network. Dots that represent the same combinations of KO perturbations, but tested in each cell type, are connected by a line. (ii) Testing the strength of network convergence across a subset of random combinations of 2–10 genes from the total 23 ASD/NDD KO perturbations, The mean strength of network convergence is significantly different by cell-type, with the highest convergence present in immature and mature iGLUTs, confirming the pattern seen with gene-level convergence (B) Venn diagrams of (i) the total number of networks resolved of the 502 combinations tested across 9 KOs further demonstrate that some KO combinations are only convergent in one cell-type – for example 6 are unique to NPCs. (ii) While overlap of the total number of unique node genes within those networks by cell-type demonstrate that even if the same combination of KOs resolve convergence, the node genes within their networks will be cell-type specific.(E) Enrichment of cell-type specific convergent node genes for rare variant targets revealed significant associations of immature iGLUTs with SCZ non-synonymous (NS) targets and nominal enrichments of mature iGLUTs and ASD LoF targets. (F) Although not significantly enriched, many node genes overlapped with GWAS common variant targets; heatmap representing the percent overlap of unique node genes for each cell-type that are targets of. Number in the center of each tile represent the absolute number of overlapping genes.

Table 2.

Disorder and behavioral associations of top nodes by cell-type.

Celltype Top Protein-coding Node Rare Disorders GWAS and behavioral associations
NPCs KIAA2012 ADHD/conduct disorder (rs1521882), educational attainment (rs12623702, rs4675248, rs2160317, rs2177083, rs34189321,rs58100125), migraine/T2D (rs6748072), amygdala volume (rs72936662)
Im. iGLUTs MYH15 Deafness (AR); de novo SCZ CNV Social interaction measurement (rs13082569), cognitive function (rs3860537), unipolar depression (rs1531188), MDD (rs113689582), insomnia (rs62266174, rs6768511, rs6786515, rs6795280), BIP (rs1531188) , ANX (rs4855559), educational attainment (rs115910830, rs2290601, rs3860537, rs60785803)
Ma. iGLUTs GALNTL5 ASD, Spastic paraplegia 48 (AR)
Ma. iGABAs EREG Epilepsy, Immune Deficiency Disease ADHD (rs1350666), wellbeing measurement (rs112444088), learning & memory (pathway)

Autosomal recessive (AR), Anxiety disorder (ANX) Bipolar Disorder (BIP), Type-2 diabetes (T2D)

Table 3.

Convergent nodes that overlap with CNV and rare variant target genes for each cell-type.

Celltype Rare variant gene targets
NPCs AATK, DLC1, PAK6
Int. iGLUTs ACMSD, HCST, MYH15, PAH, RSP01, SFTPC, SH3RF2, SLC28A2, SNAI2, AC0T6, CSPG4, PYCARD, SLC5A7, SULT1B1, TBXA2R, TEKT5, TEX15, ARG1, ASB14, CACNA1D, OPLAH, GRM4, KCNT1, FKBP6, NPAP1, OCA2, ATP10A
Mat. iGLUTs S100G, TRIM50, FOLR1, COX7B2, KIF23
Mat. iGABAs CHRND, RETN, PAX6, RIMBP3, SPDYE5, TSKS

We asked if the functional similarity and brain co-expression between ASD genes predicted convergence. To test this, we trained a prediction model (random forest linear regression)79 using 70% of our data, evaluated it using 30% of our data, and validated in an external dataset43 (Fig. 4A). Functional similarity based on gene ontology membership (Fig. 4B,i), brain co-expression in the dorsolateral prefrontal cortex (DLPFC) (Fig. 4B,ii), cell-type (Fig. 4B,iii), and the number of KOs assayed strongly predicted gene-level convergence (98% variance explained; mean of squared residuals<0.02) (Fig. 4C) and moderately predicted network-level convergence (54% variance explained; mean of squared residuals=0.75) (Fig. 4D). Cell type was the most important predictor of gene-level convergence (Fig. 4C,iii), while cell-type and functional similarity of the individual ASD genes perturbed were of generally equal importance in predicting network-level convergence (Fig. 4C,iii). Our trained model accurately predicted gene-level (Pearson’s rho=0.99, Holm’s adjP<0.001, RMSE=0.12) (Fig. 4C,iv) and network-level convergence in our testing set (Pearson’s rho=0.714, Bonferroni <0.001, RMSE=0.86) (Fig. 4D,iv), and performed moderately well in predicting network-level convergence (RMSE=0.87, rho=0.364, Bonferroni P<0.001) (Fig. 4E,ii) but not gene-level convergence (RMSE=1.3, rho=0.072, Holm’s adjP<0.02 (Fig. 4E,ii) in the external dataset. This inability to predict gene-level convergence may reflect fundamental differences in the training and validation data – specifically that the prediction model was based on CRISPR-KO of large-effect rare variant ASD target genes while the external dataset resulted from CRISPR-activation of small-effect common variant schizophrenia target genes. Altogether, although cell-type, functional similarity and co-expression were sufficient to predict the extent of convergence at the network and gene-level, the magnitude and direction of the perturbation and/or whether the perturbation genes were rare or common variant targets may also be important contributors.

Functional mapping of gene annotations (FUMA80) for the nine ASD genes perturbed in each cell-type allowed for finer resolution of shared function. With the majority involved in chromatin biology and genetic regulation, few were targets of specific TFs and miRNAs, yet convergence based on these functionally informed subsets of ASD KOs was cell-type specific (Fig. 5A). For example, ASD targets of miRNA129–5P resolved networks across all cell-types, but with the strongest convergence in mature iGABAs (SI Fig. 14) while ASD genes targeted by FOXO1 (PHF21A, SIN3A, CHD2), convergent networks were only resolved in immature and mature iGLUTs (Fig 5A,iii), distinct between these two cell types (Fig. 5B,C), and comprised of non-overlapping node genes that contained many targets of common and rare variants associated with psychiatric diagnoses. The FOX01-target convergent network in immature iGLUTs (Fig. 5B,i) was significantly enriched for immune networks (FDR<0.001), whereas in mature iGLUTs it was significantly enriched for serotonin (5-HT) receptor mediated signaling (FDR<0.05) (Fig. 5C,ii). Moreover, PPI networks between node genes were enriched for stress response (p <0.01) in immature iGLUTs and cell communication (FDR=0.066) and positive regulation of cellular biosynthetic processes in mature iGLUTs (FDR <=0.05) (Fig. 5C,iii). Although convergent networks generated between cell types were largely unique and non-overlapping, they shared common biological and disorder enrichments, hinting at novel processes that were not predicted by ASD risk genes alone.

Figure 5. Distinct cell-type-specific networks converge on FOXO1 targets in immature and mature glutamatergic neurons.

Figure 5.

(A) Functional gene annotation (FUMA) identified sets of KOs with more specific shared function, beyond chromatin organization, and shared regulation by transcription factor and miRNAs. Network convergence across KOs based on functional annotation showed cell-type specificity and resolved networks with variable convergence and gene membership (i-ii). Convergent networks between targets of FOXO1 (PHF21A, SIN3A, and CHD2) were resolved in iGLUTs, but not NPCs and iGABAs. FOXO1 target convergent networks were unique in immature (B,i) and mature (C,i) iGLUTs. Central nodes in immature and mature iGLUTs included non-overlapping gene targets of psychiatric common and rare variants. The convergent network in immature iGLUTs was significantly enriched for immune networks related to IgA production (FDR<0.001) (B,ii) – network expansion using FunMap identified protein-protein connection between node genes that were nominally enriched for the regulation of stress response in immature iGLUTs (B,iii) (nom. P<0.01, adj. P=0.32). The convergent network in mature iGLUTs was significantly enriched 5-HT receptor mediated signaling (FDR<0.05) (C,ii) – network expansion identified a PPI network nominally enriched for cell communication (adj. P=0.066) and significantly enriched for positive regulation of cellular biosynthetic processes in mature iGLUTs (C,iii) (adj. P <=0.05).

Stratification of ASD genes by zebrafish behavior, brain structure, and circuit function.

A comprehensive in vivo high-throughput, automated behavioral analysis in zebrafish31 recently revealed clear stratification of ASD genes based on basic arousal and sensory processing behaviors in the developing vertebrate brain. Leveraging zebrafish behavioral data for fifteen ASD genes for which we have iGLUT and iGABA analyses, we asked to what extent molecular convergence underlies this behavioral stratification. Notably, the cell-type deconvolution of the zebrafish brain using human single-cell reference data highlighted neurons as the highest represented cell type; zebrafish brain gene expression was robustly correlated with in vitro human-derived mature neurons (Fig. 6A). Correlation of zebrafish KOs based on behavioral effects yielded four clear groups of genes: set 1 (MBD5, NRXN1, KDM5B), set2 (CHD2, SMARCC2, PHF12, SKI), set 3 (KMT5B, KMT2C, KDM6B), and set 4 (ARID1B, ASH1L, CHD8, PHF21A, WAC) (Fig. 6B; Table 4). Gene-level convergence between genes in these sets was distinct, largely non-overlapping between cell-types, and stronger in mature iGLUTs than mature iGABAs (Fig. 6CD). Likewise, neuropsychiatric GWAS (Fig. 6C) and rare LoF genes (Fig. 6D), showed unique risk-associations by set, and stronger risk associations in the iGLUTs.

Figure 6. ASD/NDD genes clustered based on behavioral phenotypes in zebrafish resolve unique gene-level convergent signatures in mature human glutamatergic and GABAergic neurons that are uniquely enriched for psychiatric risk and predicted drug reversers.

Figure 6.

(A) Gene expression in human mature iGLUTs and iGABAs correlate with expression in the zebrafish brain. Cellular deconvolution of wild-type (wt) zebrafish brain expression based on an adult single-cell brain reference identified neurons as the largest proportion of cells in the fish brain. Gene expression in wt zebrafish brain significantly positively correlate with gene expression of human-induced mature iGLUTs (rho=0.39, Holm’s adj.P<0.001) and iGABAs (rho=0.39, Holm’s adj.P <0.001) (SI Fig. 9). (B) ASD/NDD risk genes uniquely cluster based on wake/startle behavioral responses in zebrafish KOs. Based on correlations between 24 phenotypic measurements related to sleep, wake, and startle response in zebrafish KOs, ASD/NDD risk genes clustered into four behavioral sets. Set 1: MBD5, NRXN1, KDM5B; Set 2: CHD2, SMARCC2, PHF12, SKI; Set 3: KMT5B, KMT2C, KDM6B; Set 4: ARID1B, ASH1L, CHD8, PHF21A, WAC. (C) Gene-level convergence within these sets is largely non-overlapping between mature iGLUTs and iGABAs and (D) show unique enrichments for common psychiatric risk gene targets. (E) Within each cell-type, behavioral sets have both shared and unique convergent genes (i-ii). (F) In both iGABAs and iGLUTs, all four behavioral sets were enriched for FMRP targets. Gene targets of neurodevelopmental rare variants were only significantly enriched for convergent signature in mature iGLUTs – with behavioral set 4 uniquely significantly enriched for secondary targets of ASD loss-of-function variant and set 3 uniquely enriched for primary targets of SCZ non-synonymous variants (iii). (F). Prediction of drugs “reversers” from the LiNCs database targeting these convergent signals in mature neurons identify shared and unique drugs enriched across the sets (i.e., antipsychotic perphenazine in iGLUTs) and some specific to unique convergence (i.e., valsartan only in in Set 3 iGABAs, and sirolimus in Set 3 iGLUTs). (G) Top predicted drugs show opposing effects on zebrafish behavior compared to individual ASD KOs – these signatures do not always correspond with their predicted direction of effect on transcriptomic convergence. (#nominal p-value<0.05, *FDR<0.05, **FDR<0.01, *FDR<0.001)

Table 4.

Disorder and behavioral associations of top nodes by cell-type and behavioral set.

Cell type Top Node Gene name Meta P Z-score Rare Disorders GWAS
Mature H3LUT 1 CNBP Sterol-mediated transcriptional repression 2.9e-16 8.18 Myotonic Dystrophy, Neuromyotonia & axonal neuropathy (AR), Creutzfeldt-Jakob Disease, Deafness (AR), FragMe-X tremor/ataxia, ALS
ANKRD36 1.95e-18 8.8
2 FOXJ3 1.7e-17 8.5
SOX12 5.3e-44 −13.9 ASD, Epilepsy, NDO
3 FOXJ3 Gene expression regulation/chromatin 1.9e-19 9.02 EA (rs35011283)
S0X12 Implicated in cell fate decisions during development 1.6e-17 −8.52 Microphtalmia Syndromic 12 (neurological) Memory performance (rs78532272)
4 FOXJ3 7.9e-22 9.6
ANKRD36 Ankyrin Repeat Domain 36 6.6-e26 −10.5 Neuropathy, Giant Axonal Nonsyndromic Deafness (AR) SCZ (rs9631085), neuroimaging (rs11692435, rs167684, +6)
Mature iGABA 1 UBE2D4 Ubiquitin-Conjugating Enzyme E2 D4 1.2e-09 6.1
PRKAG2 Protein Kinase AMP-Actrvated Non-Catalytic Subunit 6.7e-7 −4.97 Wolff-Parkinson-White Syndrome, Neuromuscular Disease, Specific Language Impairment, Microencephaly, Epilepsy EA (rs1860735, rs2538046, rs4726070), ASD/SCZ (rs115136442), impulse control (rs2302532), BIP (rs7795096), memory (rs2536058)
2 MYL3 BBB & immune cell trasnmigration 4e-12 6.9 Wolff-Parkinson-White Syndrome, Microencephaly, Epiliepsy
TNFSF11 3.6e-10 −6.3 ID, NDD, developmental & epileptic encephalopathy, IBS,
3 KIF1A Axonal Transporter Of Synaptic Vesicies 8.8e-12 6.8 ID, ASD, neuropathy, Epilepsy, Spastic Ataxia, Cortical Dysplasta, Alacrima, Achala sia, and Impaired ID Syndrome Insomnia (rs10196604, rs4455151)
UNC5C Netrin Receptor 3.98e-8 −5.5 Alzheimer Disease. Familial 1 Insomnia (rs371745379), wellbeing (rs116984513), OCO (rs17384439), cognition (rs3846455), unipolar depression (rs6822806)
4 UBE2D4 8.5e-13 7.2
UBC 1.7e-10 −6.4 Parkinson’s disease, epilepsy, muscular dystrophy Sleep duration (rs10773112), AD (rs78184510),

Restricting to (cMAP)81 drugs with human clinical and experimental zebrafish data31, the drugs predicted to reverse convergent signatures in iGLUTs and iGABAs included targets of glucocorticoid, androgen, and serotonin receptors, as well as drugs regulating inflammatory signaling (statins, immunomodulatory chemo-therapy drug, and non-steroidal anti-inflammatory drugs (NSAIDs)) (Fig. 6F). While some drugs were predicted to have broad genotype-independent effects reversing ASD signatures (e.g., antipsychotic perphenazine in iGLUTs), others were predicted to yield more precise clinical responses (e.g., sirolimus only in Set 3 iGLUTs. Notably, mirroring the lack of significant disorder enrichments, few of these drugs significantly reversed convergent signatures in iGABAs. By considering existing pharmacological effects of the top drugs on zebrafish behavior31, some of the predicted drug reversers were shown to oppose effects on ASD related phenotypes in zebrafish. Yet, the direction of effect predicted based on transcriptomic convergence in human neurons did not always align with anti-correlating behavioral effects in zebrafish. For example, perphenazine was significantly indicated as a reverser of transcriptomic convergence in iGLUTs sets 2,3 and 4, but only reversed the behavioral phenotypes associated with set 2 genes (CHD2, SMARCC2, PHF12, SKI). The mechanisms resulting in transcriptomic convergence in vitro only partially explain behavioral convergence in vivo.

DISCUSSION

Towards empirically resolving the extent that ASD risk gene effects converge upon a smaller number of common pathways7, we applied a pooled CRISPR-KO strategy targeting 29 highly penetrant ASD genes across neural cell types. Convergence was dynamic, driven by co-expression patterns, and yielded networks of varied strength and composition between cell types and across neurodevelopment, overall greatest in mature glutamatergic neurons. Both molecular and phenotypic stratification of ASD genes resulted in unique cell-type-specific patterns of convergence that regulate different biological processes. Distinct cell-type-specific convergent patterns predict drugs capable of reversing both gene expression and phenotypic effects.

Our findings add to a wealth of recent studies examining the convergent impact of LoF ASD genes during neurogenesis2331. Transcriptomic and epigenomic analyses of post-mortem brain from ASD cases likewise indicate convergent molecular signatures82 and subtypes of ASD83. But how to connect convergent causes during neurogenesis with differences in social communication, restricted interests, and repetitive behavior? One possibility is that temporal dysregulation of neurodevelopmental gene networks84 results in asynchronous neurodevelopment of cortical glutamatergic and GABAergic neurons26 leading to excitation-inhibitory imbalance32. Our data suggests additional mechanisms, demonstrating that, across ASD genes, studying loss of expression specifically in postmitotic neurons also resolves convergent networks, but of distinct size and composition from those identified during neurogenesis. Put simply, the same ASD genes resulted in different convergent effects across cell types. This observation may inform the etiology of other complex genetic disorders with diverse risk genes and pathophysiological evidence indicating effects in multiple cell types.

Although rare LoF ASD gene mutations tend to confer large effects in the individuals who carry them, the small effects of common variants account for much of the genetic risk for ASD at the population level1,85. The differences in expressivity and incomplete penetrance of high effect-size rare variants is frequently attributed to diversity across polygenic backgrounds86; in vitro, ASD gene effects are indeed influenced by the individual genomic context26. In psychiatry, common genetic variants are more associated with cross-disorder behavioral dimensions87 and rare variants with co-occurring intellectual disability88. Common risk variants interact with rare mutations to determine individual-level liability in ASD3,89,90, schizophrenia91,92, epilepsy93, Huntington’s disease94 and more95. Our results, highlighting that convergence downstream of ASD gene effects are enriched for cross-disorder GWAS variants and rare LoF genes, inform pleiotropy of genetic risk for psychiatric disorders. Moving forward, we argue that it is critical that empirical functional genomic studies systematically consider the impact of common and rare variants together, including screening the impact of LoF genes in hiPSC lines derived from donors with high and low polygenic risk scores96. Intriguingly, even susceptibility to environmental risk factors for ASD (e.g., valproic acid97) seems to be mediated by genetic background98. Deeper phenotypic characterization of ASD effects across donors will be critical in determining how complex genetic (or environmental) interactions shape cellular phenotypes, circuit function, and human behavior in the clinic.

In the post-mortem brain, ASD gene signatures are not just associated with downregulation of co-expression modules involving synaptic signalling99, but also upregulation of microglial and astrocyte gene modules83,99105. The extent to which increased neuroimmune activity in ASD is a response to cellular or environmental sources of inflammation, or indicative of a non-cell autonomous role for glia cells in risk is unclear; evidence supports both possibilities. Consistent with a model of maternal immune activation during neurodevelopment106, glucocorticoids and inflammatory cytokines perturb the expression of psychiatric risk genes107,108, altering the regulatory activity of psychiatric risk loci109, and interfering with neuronal maturation in brain organoids110. Yet, in vivo analysis of ASD genes in zebrafish revealed global increases in microglia31 and in vitro screening in human microglia uncovered roles in endocytosis and uptake of synaptic material111. Indeed, given the reciprocal relationships between neuronal activity and glial function, epigenetic state, and gene expression112115, it seems probable that both cell-autonomous and non-cell-autonomous effects underlie and/or exacerbate ASD gene effects.

In summary, we demonstrate that convergent effects of ASD risk genes are dynamic across cell-type-specific contexts. These signatures can be leveraged for translational insights into precision medicine, with top predicted pharmacological reversers of convergent networks including drugs with known anti-psychotic, anti-anxiety, and anti-depressive effects, as well as novel points of individualized therapeutic intervention. If the convergence of diverse risk genes on common molecular pathways indeed explains how genetically heterogeneous variants result in similar clinical features, then achieving precision medicine will ultimately entail genetic stratification of cases, whereby clinical decisions are informed based on shared genetic risk factors between individuals.

MATERIALS AND METHODS

Generation of neural cells:

Informed consent was obtained at the National Institute of Mental Health, under the review of the Internal Review Board of the NIMH. hiPSC work was reviewed by the Internal Review Board of the Icahn School of Medicine at Mount Sinai as well as by the Embryonic Stem Cell Research Oversight Committee at the Icahn School of Medicine at Mount Sinai and Yale University. Fibroblasts were genotyped by IlluminaOmni 2.5 bead chip genotyping116,117, PsychChip118, and exome sequencing118; hiPSCs118 were validated by G-banded karyotyping (Wicell Cytogenetics) and genome stability monitored by Infinium Global Screening Array v3.0 (lllumina). SNP genotype was inferred from all RNAseq data using the Sequenom SURESelect Clinical Research Exome (CRE) and Sure Select V5 SNP lists to confirm that neuron identity matched donor. Control hiPSCs were cultured in StemFlex media (Gibco, #A3349401) supplemented with Antibiotic-Antimycotic (Gibco, #15240062) on Geltrex-coated plates (Gibco, #A1413302). Cells were passaged at 80–90% confluence with 5mM EDTA (Life Technologies #15575–020) for 3 min at room temperature (RT). EDTA was aspirated and cells dissociated in fresh StemFlex media. Media was replaced every 48–72 hours for 4–7 days until the next passage.

Transient transcription factor overexpression from stable clonal hiPSCs was used to induce control hiPSCs to iNPCs (here SNaPs)72, iGLUTs65, and iGABAs71. We transduced hiPSCs from two control donors (553–3, male; 3182–3, female) with lentiviral pUBIQ-rtTA (Addgene #20342) and tetO-NGN2-eGFP-NeoR (Addgene #99378) for iNPCs and iGLUTs, or pUBIQ-rtTA (Addgene #20342), tetO-ASCL1-PuroR (Addgene #97329), and tetO-DLX2-HygroR (Addgene #97330) for iGABAs. Following transduction by spinfection at 1000g for 1 hour at 37°C, hiPSCs were subjected to 48-hour antibiotic selection (1mg/mL neomycin G418 (Thermo #10131027), 0.5μg/mL puromycin (Thermo #A1113803), and/or 250μg/mL hygromycin (Thermo, #10687010) and then clonalized by expansion from single colonies. Ultimately, clonal and inducible iNPC/iGLUT 3182–3-clone5 and iGABA 553–3-clone34 hiPSCs were validated lentiviral genome integration by PCR, doxycycline induced transcription factor expression by qPCR, and robust and consistent neuronal induction confirmed by RNA-seq and immunocytochemistry for relevant cell type markers. Analyses throughout reflect data from iGLUT 3182–3-clone5 (iNPC, d7 iGLUT and d21 iGLUT) and iGABA 553–3-clone34 (d36 iGABA) (SI Fig 2AB).

iNPCs:

At DIV0, 3182–3-clone5 hiPSCs were dissociated and plated at 1.5 × 106 cells per well onto Geltrex-coated 6-well plates (1:250 dilution coating) in SNaP Induction Media (DIV0): DMEM/F12 with Glutamax (ThermoFisher, 11320082), Glucose (0.3% v/v), N2 Supplement (1:100, ThermoFisher, 17502048), Doxycycline (2 μg/mL; Sigma-Aldrich, D9891), LDN-193189 (200 nM; Stemgent, 04–0074), SB431542 (10 μM; Tocris, 1614), and XAV939 (2 μM; Stemgent, 04–00046) supplemented with 25 ng/mL Chroma I ROCK2 Inhibitor. After 24 hours, DIV2, cells were fed with Selection Media: DMEM/F12 with Glutamax, Glucose (0.3% v/v), N2 Supplement (1:100), Doxycycline (2 μg/mL), Geneticin (0.5 mg/mL; Thermofisher, 10131035), LDN-193189 (100 nM), SB431542 (5 μM), and XAV939 (1 μM). After 48 hours post induction (DIV2), SNaPs were dissociated with Accutase for 10 minutes at 37°C, quenched in DMEM, pelleted at 800g for 5 minutes, and replated at 1.5×106 cells per well onto Geltrex-coated 6-well plates in SNaP Selection Media supplemented with Geneticin (0.5 mg/mL). After 16–18 hr (DIV3), medium was switched to SNaP maintenance Medium: DMEM/F12 with Glutamax, Penn/Strep (1:100), MEM-NEAA (1:100; Life Technologies, 10370088), B27 minus Vitamin A (1:50; Life Technologies, 12587010), N2 Supplement (1:100; Life Technologies, 17502048), recombinant human EGF (10 ng/mL; R&D Systems, 236-EG-200), recombinant human basic FGF (10 ng/mL; Life Technologies, 13256029), Geneticin (0.5 mg/mL), and Chroman I (25 ng/mL). Cells were fed every 48 hours with SNaP maintenance medium lacking Chroman I and Geneticin. Cells were dissociated and seeded weekly at a density of 1.25–1.5×106 cells per well onto Geltrex-coated 6-well plates until NPC morphology was observed and persistent. Cells were expanded and cryofrozen.

DIV7 iGLUTs:

3182–3-clone5 iNPCs were thawed and seeded at 1× 106 cells per well onto Geltrex-coated 12-well plates. NGN2 expression was induced with Doxycycline (2 μg/mL) for 24 hrs (DIV0) with antibiotic selection for 48 hrs (DIV1–3) in SNaP maintenance medium. At DIV 4 SNaPs were dissociated with Accutase, switched into Neuronal Medium: Brainphys (Stemcell, 05790), Glutamax (1:100), Sodium Pyruvate (1 mM), Anti-Anti (1:100), N2 (1:100), B27 without vitamin A (1:50), BDNF (20 ng/mL; R&D, 248-BD-025), GDNF (20 ng/mL; R&D, 212), dibutyryl cAMP (500 μg/mL; Sigma, D0627), L-ascorbic acid (200 μM; Sigma, A4403), Natural Mouse Laminin (1.2 μg/m; Thermofisher, 23017015) and seeded in Geltrex-coated (1:120 dilution coating) 12-well plates. Medium was changed every 24 hrs until DIV7 harvest.

D21 iGLUTs:

hiPSCs were harvested in Accutase (Innovative Cell Technologies, AT-104) for 5 minutes 37°C, dissociated into a single-cell suspension, quenched in DMEM, pelleted via centrifugation for five minutes at 1000 rcf and resuspended in StemFlex containing 25 ng/mL Chroma I ROCK2 Inhibitor and 2.0 μg/mL doxycycline (DIV0), seeded 1 × 106 cells per well onto Geltrex-coated 6-well plates (1:250 dilution coating), and incubated overnight at 37°C. The next day, DIV1, hiPSCs were subjected to 48-hour antibiotic selection by medium replacement with Induction Media: DMEM/F12 (Thermofisher, 10565018), Glutamax (1:100; Thermofisher, 10565018), N-2 (1:100; Thermofisher, 17502048), B27 without vitamin A (1:50; Thermofisher, 12587010), Antibiotic-Antimycotic (1:100) with 1.0μg/mL doxycycline and 0.5mg/ml Geneticin. At DIV3, cells were treated with 4.0μM cytosineβ-D-arabinofuranoside hydrochloride (Ara-C) and 1.0μg/mL doxycycline to arrest proliferation and eliminate non-neuronal cells in the culture. At DIV4 immature neurons were dissociated with Accutase and 5 units/mL DNAse I at 37°C for 7–10 min, quenched in DMEM, centrifuged for five minutes at 1,500 rpm and resuspended in 25 ng/mL Chroma I ROCK2 Inhibitor, 1.0 μg/mL doxycycline and 4.0μM Ara-C and switched to Neuron Medium: Brainphys (Stemcell, 05790), Glutamax (1:100), Sodium Pyruvate (1 mM), Anti-Anti (1:100), N2 (1:100), B27 without vitamin A (1:50), BDNF (20 ng/mL; R&D, 248-BD-025), GDNF (20 ng/mL; R&D, 212), dibutyryl cAMP (500 μg/mL; Sigma, D0627), L-ascorbic acid (200 μM; Sigma, A4403), Natural Mouse Laminin (1.2 μg/mL; Thermofisher, 23017015) and seeded 7 × 105 cells per well onto Geltrex-coated (1:60 dilution coating) 12-well plates and incubated overnight at 37°C. The next day, DIV 6, Chroman I was removed from culture and Ara-C lowered to 2.0 μM with a full Neuronal medium change. At DIV 7 a full Neuronal Medium change was performed to remove doxycycline and Ara-C from culture, to allow for antibiotic resistant genes silencing. From DIV7 onwards, half neuronal medium changes were performed every 72 – 96 hrs until mature DIV 21 for harvest.

DIV36 iGABAs:

hiPSCs were harvested in Accutase (Innovative Cell Technologies, AT-104) for 5 minutes 37°C, dissociated into a single-cell suspension, quenched in DMEM, pelleted via centrifugation for five minutes at 1000 rcf and resuspended in StemFlex containing 25 ng/mL Chroma I ROCK2 Inhibitor and 2.0 μg/mL doxycycline (DIV0), seeded 1.5–2× 106 cells per well onto Geltrex-coated 6-well plates (1:250 dilution coating), and incubated overnight at 37°C. The next day, DIV1, hiPSCs were subjected to 48-hour antibiotic selection by medium replacement with Induction Media: DMEM/F12 (Thermofisher, 10565018), Glutamax (1:100; Thermofisher, 10565018), N-2 (1:100; Thermofisher, 17502048), B27 without vitamin A (1:50; Thermofisher, 12587010), Antibiotic-Antimycotic (1:100) with 1.0μg/mL doxycycline, 1.0 μg/mL puromycin (Sigma, P7255) and 250 μg/mL hygromycin (Sigma, 10687010). At DIV3, cells were treated with 4.0μM cytosineβ-D-arabinofuranoside hydrochloride (Ara-C) and 1.0μg/mL doxycycline to arrest proliferation and eliminate non-neuronal cells in the culture. At DIV5 immature neurons were dissociated with Accutase and 5 units/mL DNAse I at 37°C for 7–10 min, quenched in DMEM, centrifuged for five minutes at 1,500 rpm and resuspended in 25 ng/mL Chroma I ROCK2 Inhibitor, 1.0 μg/mL doxycycline and 4.0μM Ara-C and switched to Neuron Medium: Brainphys (Stemcell, 05790), Glutamax (1:100), Sodium Pyruvate (1 mM), Anti-Anti (1:100), N2 (1:100), B27 without vitamin A (1:50), BDNF (20 ng/mL; R&D, 248-BD-025), GDNF (20 ng/mL; R&D, 212), dibutyryl cAMP (500 μg/mL; Sigma, D0627), L-ascorbic acid (200 μM; Sigma, A4403), Natural Mouse Laminin (1.2 μg/mL; Thermofisher, 23017015) and seeded 7 × 105 cells per well onto Geltrex-coated (1:60 dilution coating) 12-well plates and incubated overnight at 37°C. The next day, DIV 6, Chroman I was removed from culture and Ara-C lowered to 2.0 μM with a full Neuronal medium change. At DIV 7 a full Neuronal Medium change was performed to remove doxycycline and Ara-C from culture, to allow for antibiotic resistant genes silencing. From DIV7 onwards, half neuronal medium changes were performed every 72–96 hrs until mature DIV 36 for harvest.

CRISPR knockout gRNA library design (Thermofisher) and validation

From the 102 highly penetrant loss-of-function (LoF) gene mutations associated with ASD (58 gene expression regulation, 24 neuronal communication genes, 9 cytoskeletal genes, and 11 multifunction genes)2, gene ontology and primary literature research identified 26 epigenetic modifiers specifically involved in chromatin organization, rearrangement, and modification. ASD gene expression (RNA-seq RPKM in iGLUTs) was plotted against significance of ASD association (TADA FDR Values), to ensure selection of top ASD genes with the highest expression and highest ASD association. ASD gene expression was confirmed across development in the brain (BrainSpan119), and in bulk and scRNA-seq (SI Fig. 1BC). 21 epigenetic modifiers (ASH1L, ASXL3, ARID1B, CHD2, CHD8, CREBBP, KDM5B, KDM6B, KMT2C, KMT5B, MBD5, MED13L, PHF12, PHF21A, POGZ, PPP2R5D, SETD5, SIN3A, SKI, SMARCC2, WAC,) as well as two transcription factors with putative roles as chromatin regulators (FOXP2, BCL11A) were selected (SI Fig. 1A). Gene regulatory transcription factors, general transcription factors, and DNA replication genes were excluded. Three extensively studied synaptic genes (NRXN1, SCN2A, SHANK3) with roles in ASD were included as positive controls and three under-explored genes for ASD role in neuronal communication genes (ANK3, DPYSL2, SLC6A1) were also included in the library.

Individual DNA from glycerol stocks of Invitrogen LentiArray Human CRISPR Library gRNAs-PuroR (ThermoFisher, A31949) (3–4 individual gRNAs per gene, see SI Table 1) were prepared using GeneJET Plasmid Miniprep Kit (K0503) and pooled at an equimolar ratio and a 5-fold ratio of scramble control gRNA plasmid. Library quality was confirmed by restriction enzyme digest (10x Cutsart NEB), agarose gel purification using QIAquick Gel Extraction Kit (#28706) to check library purity, followed by Mi-seq for gRNA count distribution (SI Fig. 2D). Based on the abundance of gRNAs from Mis-seq, 4 ASD gene targets were highly unlikely to be resolved in the final experiments – POGZ, PP2R5D, SHANK3, SLC6A1 – and 3 with low abundance and less likely to be resolved (SCNA2, FOXP2, DYPSL2) (SI Fig. 2E).

Lentiviral Cas9v2-HygroR (Addgene, 98291) and pooled LentiArray-gRNA-PuroR CRISPR-KO library were packaged as high-titer lentiviruses (Boston Children’s Hospital Viral Core) and experimentally titrated in each cell type. Highest viable MOI was used for Cas9v2 and MOI < 0.5 for lentivirus gRNAs pool library.

CRISPR and gRNA delivery:

Lentiviral Cas9v2-HygroR (Addgene #98291) transduction of iNPCs, day 4 (iGLUTs), or day 5 (iGABAs) occurred via spinfection (one hour at 1,000 g) and followed by 72 hr hygromycin (250 μg /mL) (except for iGABAs, which express inducible hygromycin resistance at this stage). Pooled Invitrogen LentiArray Human gRNA-PuroR CRISPR-KO Library gRNAs (ThermoFisher #A31949) (MOI 0.3–0.5) were transduced via spinfection three days prior to harvest (e.g., d4 for D7 iGLUTs, d18 for D21 iGLUTs, d33 for d36 iGABA), with fresh medium containing puromycin (1 μg/mL) added 16–24 hours post transduction of gRNAs. For mature iGLUTs and iGABAs, as doxycycline was removed from medium at DIV7, and by DIV18 neurons had lost transcription factor linked antibiotic resistance, at 24 hours post-transduction (DIV19 or DIV34) puromycin (1 μg/mL) and hygromycin (250 μg /mL) were added to media for 48-hr antibiotic selection prior to harvest (SI Fig. 3A,i).

Dissociation to single cells:

Cells were dissociated 72 hrs post gRNA library delivery for single cell sequencing, as iNPCs, DIV7 and DIV21 iGLUTs, or DIV36 iGABAs as follows:

iNPCs and DIV7 iGLUTs were dissociated in accutase for 5min @37°C, washed with DMEM/10%FBS, centrifuged at 1,000xg for 5 min, gently resuspended, and counted.

DIV21 iGLUTs and DIV36 iGABAs were dissociated with papain. Papain was pre-warmed (39°C) for 30 minutes in HBSS (ThermoFisher, 14025076), HEPES (10 mM, pH 7.5) EDTA (0.5 mM), Papain (0.84 mg/mL; Worthington-Biochem, LS003127). The cells were washed with PBS-EDTA (0.5 mM) and 300 uL of papain solution and 5 units of DNAse I was added per well of 12-well plate and incubated at 37°C for 10–15 minutes, 125 rpm. Dissociation was quenched with DMEM-10%FBS. Detached neurons were broken by gentle manual pipetting, pelleted at 600 g for 5 minutes, resuspended in DMEM-10%FBS, filtered through a cell strainer and counted and submitted for 10X sequencing.

Cells were loaded into 10X in 4 lanes per cell type, targeting 20,000 cells per lane for a total of ~80,000 targeted cells per cell type. scRNA-seq was performed at Yale Genomics Core with the 10X single cell 5’ v2 HT with CRISPR barcode kit (SI Fig. 3A,ii).

Analysis of single-cell CRISPRko screens in NPCs, DIV 7, DIV 21 iGLUTs and DIV 36 iGABAs.

mRNA sequencing reads were mapped to the GRCh38 reference genome using the Cellranger Software. To generate count matrices for GDO (gRNA) libraries, the kallisto indexing and tag extraction (kite) workflow were used. Count matrices were used as input into the R/Seurat package120 to perform downstream analyses, including QC, normalization, cell clustering, GDO demultiplexing, and covariate regression76,121.

Normalization and downstream analysis of RNA data were performed using the Seurat R package (v.2.3.0), which enables the integrated processing of multimodal single-cell datasets. CRISPR-screen experiments in each cell-type were processed independently. Within each cell-type, ~100–80,000 cells were sequenced across 4 lanes. gRNA and RNA UMI feature counts were filtered removing the top and bottom decile of cells based on distribution of counts in each cell-type. The percentage of all the counts belonging to the mitochondrial, ribosomal, and hemoglobin genes calculated using Seurat::PercentageFeatureSet were filtered with cell-type specific thresholds, given the relatively high proportion of mitochondrial genes expressed in neurons. Mitochondrial, ribosomal, and hemoglobin genes as well as MALAT1 were removed (^RP[SL][[:digit:]]|^RPLP[[:digit:]]|^RPSA|ĤB[AEGQ][[:digit:]]|ĤB[ABDMQ]|^MT-|^MALAT1$). Lowly expressed genes, those that had at fewer than 2 read counts in 90% of samples were also removed. Hashtag and guide-tag raw counts were normalized using centered log ratio transformation, where counts were divided by the geometric mean of the corresponding tag across cells and log-transformed. gRNA demultiplexing was performed using the Seurat::MULTIseqDemux function for each lane individually and then counts were merged across lanes (SI Fig. 3B). In NPCs, 94,363 cells were retained after filtering and removal of negatively assigned cells with 62,7% classified as doublets and 37.3% classified as singlets. In DIV7 and DIV21 iGLUTs, 57,685 and 31,473 cell were retained with 34% and 9.8% doublets and 66% and 90.2% singlets respectively. In DIV35 iGABAs, 64,462 cells were retained with 48.3% doublets and 51.7% singlets. For all downstream analysis only cells with “singlet” gRNA classification were used (26,549–38,097 cells per experiment) (SI Fig. 3CE).

Cell-type specific population heterogeneity correction.

Gene-expression based clustering was largely driven by cellular heterogeneity, cell quality, and sequencing lane effects. We found that gRNA identity was not correlated with these covariates and thus sought to remove transcriptomic variability to analyze the data as a more homogenous cell population (SI Fig. 47). To adjust for cellular heterogeneity driving clustering within a cell population we developed maturity and cellular subtype scores across both perturbed and non-perturbed cells. First, variation related to cell-cycle phase of individual cells was accounted for by assigning cell cycle scores using Seurat::CellCycleScoring which uses a list of cell cycle markers122 to segregate by markers of G2/M phase and markers of S phase. Second, to address variance due to cellular heterogeneity within a single experiment, we adapted the method applied by Seurat::CellCycleScoring to calculate a “Maturity. Score” and “Subtype.Score” for each cell based on cellular subtype (more variable in mature GABAergic neurons) and developmental time-point specific markers (mora variable in NPCs and immature iGLUTs) (SI Table 2). Cells with outlier maturity scores and subtype scores were removed from downstream analyses. RNA UMI count data were then normalized, log-transformed and the percent mitochondrial, hemoglobulin, and ribosomal genes (markers of cell quality), lane, cell cycle scores (Phase), and maturity scores regressed out using Seurat::SCTransform. The scaled residuals of this model represent a ‘corrected’ expression matrix, that was used for all downstream analyses.

Although demultiplexing assigned the correct guide identity to each cell, we next sought to remove “false positives” whereby gRNAs were assigned but gene expression was unperturbed by evaluating the transcriptomes of gRNA clusters relative to scramble gRNAs. While successful CRISPR-KOs may not result in significant down-regulation of the targeted gene 76 gene, their overall transcriptomic profile should be distinct from scramble populations. To ensure that cells assigned to a guide-tag identity class demonstrated successful perturbation of the targeted ASD gene, we performed ‘weighted-nearest neighbor’ (WNN) analysis123 to assign clusters based on both guide-tag identity class and gene expression. To test that the transcriptome of these gRNA clusters were distinct form Scramble-gRNA control clusters we performed differential gene expression analysis (Wilcoxon Rank Sum) comparing each cluster to all other clusters. We then filtered non-targeting clusters and KO gRNA clusters by setting a quantile base average expression threshold of target genes based on the distribution of target gene average expression across all other clusters. Clusters were the collapsed by gRNA identity and gRNAs with less than 75 cells removed from analysis (SI Fig. 8A). These cells were then used for downstream differential gene-expression analyses124. For each cell-type individually, single-cell gene expression matrices were PseudoBulked using scuttle::aggregateAcrossCells function across lanes (4 pseudo-bulk samples per perturbation), lowly-expressed genes were removed (leaving 18–22,000 genes) followed by edgeR/limma differential gene expression analysis. Concordance between Wilcox-rank sum differential gene expression analysis using single-cell data and limma:voom using PseudoBulked data was assessed for each gene (SI Fig. 8DE).

Meta-analysis of gene expression across perturbations43.

Across ASD KOs, DEGs were meta-analyzed (METAL125), and “convergent” genes were defined as those with significant and shared direction of effect across all ASD gene perturbations and with non-significant heterogeneity (FDR adjusted pmeta<0.05, Cochran’s heterogeneity Q-test pHet > 0.05). To test convergence between ASD-KOs, meta-analyses were performed across all possible combinations of 2–5 KO perturbations with and without sub-setting for those shared across cell types (>40,000 combinations across cell-types) (SI Fig 15C, SI Data 1).

Bayesian Bi-clustering to identify Convergent Networks43.

Across ASD KOs, convergent networks were generated by Bayesian bi-clustering126 and undirected gene coexpression network reconstruction from the ASD KOs. Not constrained by statistical cut-offs, and able to capture the effect of more lowly expressed genes, convergent networks may be a more sensitive measure of convergence. Networks were built based on bi-clustering (BicMix)127 using log2CPM expression data from all the replicates across each of the ASD gene sets and Scramble gRNA jointly. We performed 40 runs of BicMix on these data and the output from iteration 400 of the variational Expectation-Maximization algorithm was used. Target Specific Network reconstruction128 was performed to identify convergent networks across all possible combinations of the 9 ASD gene KO perturbations shared across cell-types (n=502 combinations/cell-type) and randomly sampled combinations of 2–21 KO perturbations without sub-setting for those shared across cell types (n=1400–2300 combinations) (SI Fig 15C).

Influence of Functional Similarity on Convergence Degree.

To test the influence of functional similarity and brain co-expression between KOs on convergence and compare the degree of convergence between the same KOs in different cell-types we established two methods for defining and measuring convergence. First, gene-level convergence using meta-analysis as described above, with the strength of convergence for each set defined as ratio of convergent genes to the average number of DEGs.

genelevelconvergence=nConvergentGenesmean(1NnDEGs)

Second, network-level convergence based on undirected network reconstruction from Bayesian bi-clustering as described above. Bi-clustering identifies co-expressed genes shared across the downstream transcriptomic impacts of any given set of KO perturbations, thus, the resolved networks are the transcriptomic similarities between distinct perturbations (convergence). We calculated the “degree of convergence” for each network based on previously described metric43. Briefly, convergence scores are based on (1) network connectivity as defined by the sum of the clustering coefficient (Cp) and the difference in average length path (Lp) from the maximum average length path resolved across all possible sets [(max)Lp-Lp] and (2) similarity of network genes based on biological pathway membership scored by taking the sum of the mean semantic similarity scores 129 between all genes in the network and (3) minimum percent duplication rate across 40 runs. Duplication thresholds are network-dependent and a metric of confidence in the connections.

networklevelconvergence=Cp+[(Lp)Lp]+mean1NMFsemsim+BPsemsim+CCsemsim+nDuplicationruns/nTotalRuns

Functionally similarity scores across the ASD KO genes represented in each set was calculated using (1) Gene Ontology Semantic Similarity Scores: the average semantic similarity score based on Gene Ontology pathway membership within Biological Pathway (BP), Cellular Component (CC), and Molecular Function (MF) (SI Fig. 1D) between ASD genes in a set129 and (2) brain expression correlation (BEC) score: based on the strength of the correlation in ASD gene expression in the CMC (n=991 after QC) post-mortem dorsa-lateral pre-frontal cortex (DLPFC) gene expression data,.

We performed Pearson’s correlation analysis (Holm’s adjusted P) on similarity scores and the degree of network convergence to determine the influence of the similarity of the initial KO genes on downstream convergence. We compared the average strength of convergence across cell-types using a parametric Welch’s F-test and pairwise Games-Howell test.

Enrichment analysis of convergence for risk loci using MAGMA.

We intersected cross cell-type perturbation specific and cross perturbation cell-type-specific gene-level convergence with genetic risk of eleven psychiatric and neurological disorders/traits [attention-deficit/hyperactivity disorder (ADHD)130, anorexia nervosa (AN)131, autism spectrum disorder (ASD)1, alcohol dependence (AUD)132, bipolar disorder (BIP)133, cannabis use disorder (CUD)134, major depressive disorder (MDD)135, obsessive-compulsive disorder (OCD)136, post-traumatic stress disorder (PTSD)137, and schizophrenia (SCZ)138, Cross Disorder (CxD)139, Alzheimer disease (AD)140, Parkinson disease (PD)141, amyotrophic lateral sclerosis (ALS)142, Tourette’s143, migraine144, chronic pain145, and neurotic personality traits146 GWAS summary statistics] using multi-marker analysis of genomic annotation (MAGMA)74. SNPs were mapped to genes based on the corresponding build files for each GWAS summary dataset using the default method, snp-wise = mean (a test of the mean SNP association). A competitive gene set analysis was then used to test enrichment in genetic risk for a disorder across gene sets with an FDR<0.05.

Over-representation analysis, functional enrichment annotation, and biological theme comparison.

To identify pathway enrichments unique to individual KOs, we performed biological theme comparison and GSEA using ClusterProfiler129. Using FUMAGWAS: GENE2FUNC, the 102 ASD genes were functionally annotated and overrepresentation gene-set analysis for each convergent gene set was performed80. Using WebGestalt (WEB-based Gene SeT AnaLysis Toolkit)147, over-representation analysis (ORA) was performed on all convergent gene sets against publicly available genset lists GeneOntology, KEGG, DisGenNet and a curated gene list of rare-variant targets associated with ASD,SCZ, and ID60.

Random forest prediction model of convergence strength.

To determine how well functional similarity between KOs can predict gene-level and network-level convergence we trained a random forest model79 (randomForest package in R) for each type of convergence, evaluated the model in an independent internal dataset, and validated the model in an external CRISPRa activation screen43. Data from randomly tested gene combinations (2–5 KO sets at the gene level and 2–10 KO sets at the network level) tested across cell-types were randomly down-sampled into a training set (70%) and testing set (30%) – all with comparable proportions of data by cell-type (SI Fig 15). The random forest model was trained with bootstrap aggregation using C.C, M.F, B.P semantic similarity scores, brain expression correlation, number of genes, and cell-type as predictors. The Random Forest linear regression model was evaluated in the testing data by comparing actual values to predicted values, estimating the root mean squared error and performing Pearson’s correlations. Predictor models were validated using an external dataset of 10 CRISPR-activation perturbations of SCZ common variant target genes with multifunctional annotations broadly grouped as signaling/cell communication (CALN1, NAGA, FES, CLCN3, PLCL1) and epigenetic/regulatory (SF3B1, TMEM219, UBE2Q2L, ZNF804A, ZNF823)43, and assessed based the root mean squared error and Pearson’s correlation between actual and predicted convergence strength.

Behavioral phenotyping of ASD knockouts in zebrafish

We performed automated, high-throughput, quantitative behavioral profiling of larval zebrafish to measure arousal and sensorimotor processing as a readout of circuit-level deficits resulting from gene perturbation31. We quantified 24 parameters across sleep-wake activity and visual-startle responses in 15 stable homozygous or F0 CRISPant KO lines. Stable zebrafish line (ARID1B, ASH1L, CHD8, KDM5B, KMT2C, KMT5B, NRXN1) s were previously generated31,148; new CRISPant mutants (CHD2, KDM6B, MBD5, PHF12, PHF21A, SKI, SMARCC2, WAC) were generated according to149. We identified unique behavioral fingerprints for each ASD gene mutant, revealing convergent and divergent phenotypes across mutants. To classify convergent behavioral subgroups that may share circuit-level functions, we performed correlation analyses across mutants. We identified four distinct subgroups of ASD genes with highly correlated behavioral features (Fig. 6B).

Larvae were placed into individual wells of a 96 well plate containing 650 mL of standard embryo water per well within a Zebrabox. Locomotion was quantified with automated video-tracking system. The visual-startle assay was conducted at 5 days post fertilization (dpf). To assess larval responses to lights-off stimuli (VSR-OFF), larvae were acclimated to white light for 1 hour, and baseline activity was tracked for 30 minutes followed by five 1-second dark flashes with intermittent white light for 29 seconds. To evaluate larval responsivity to lights-on stimuli (VSR-ON), the assay was revered, where larvae were acclimated to darkness for 1 hour, and baseline activity was tracked for 30 minutes followed by five 1-second white light flashes with intermittent darkness for 29 seconds. For VSR-OFF and VSR-ON, six behavioral parameters were quantified using custom MATLAB code31: (i) average intensity of all startle responses, (ii) average post-stimulus activity (iii) average activity after first stimulus, (iv) stimulus versus post-stimulus activity, (v) intensity of responses to the first stimulus, (vi) intensity of responses to the final stimulus. The sleep-wake paradigm was executed between 5–7 dpf, following VSR-OFF and VSR-ON assays. During a 14h:10h white light:darkness cycle, larvae activity and sleep patterns were tracked within the Zebrabox and analyzed with custom MATLAB code31. Six behavioral parameters were quantified for daytime and nighttime: (i) total activity, (ii) total sleep, (iii) waking activity (iv) rest bouts (v) sleep length (vi) sleep latency. Across VSR-OFF, VSR-ON, and sleep-wake assays, we analyzed 24 parameters.

Linear mixed models (LMM) were used to compare phenotypes of each behavioral parameter between homozygous mutant versus wild-type or CRISPant versus scramble-injected fish for each gene of interest. Variations of behavioral phenotypes across experiments were accounted for by including the date of the experiment as a random effect in LMM. Hierarchical clustering analysis was performed to cluster mutants and behavioral parameters based on signed −log10-transformed p-values from LMM, where sign indicates direction of the difference in behavioral phenotype when comparing stable mutant to wild-type or CRISPant to scrambled-injected. Pearson correlation analysis was used to assess correlations between mutants based on the difference in the 24 parameters. Difference was evaluated using signed −log10-transformed p-values.

Drug screen for behavioral phenotyping in zebrafish

>500 drugs and small molecules were screened for behavioral effects in zebrafish; behavioral phenotyping of 24 parameters across sleep-wake activity and visual-startle responses were performed and analyzed as above. Drug and ASD KO-related behavioral phenotypes were compared using Pearson’s correlation analysis to identify to correlating and anti-correlating drugs (i.e. drugs that reversed behavioral signatures of ASD KOs) (SI Data 2).

Drug prioritization based on perturbation signature reversal in LiNCs Neuronal Cell Lines.

To identify drugs that could reverse cell-type specific convergence across different KOs, we used the Query tool from The Broad Institute’s Connectivity Map (Cmap) Server76. Briefly, the tool computes weighted enrichment scores (WTCS) between the query set and each signature in the Cmap LiNCs gene expression data (dose, time, drug, cell-line), normalizes the WRCS by dividing by the signed mean within each perturbation (NCS), and computes FDR as fraction of “null signatures” (DMSO) where the absolute NCS exceeds reference signature127. We prioritized drugs that reversed signatures specifically in neuronal cells (either neurons (NEU) or neural progenitor cells (NPCs) with NCS <= −1.00, FDR<=0.05) and filtered for drugs that had clinical data in humans and paired behavioral phenotyping in zebrafish (SI Data 2).

Supplementary Material

Supplement 1
media-1.xlsx (31.8KB, xlsx)
Supplement 2
media-2.xlsx (1.2MB, xlsx)
Supplement 3
media-3.xlsx (2.4MB, xlsx)
Supplement 4
media-4.pdf (15.1MB, pdf)

Table 1.

Disorder and behavioral associations of top convergent up and down-regulated genes by cell-type.

Coll-type Top Meta Gene Mota P Z-score Rare Disorders GWAS and behavioral associations
NPCs AGAP4 1.2e-14 7.71 Mitochondrial Complex 1 Deficiency
CYTL1 7.8e-9 −5.77 Muscular Trophy (AD) T1D, UC, CD, T2D, psoriasis, celiac, autoimmune disease, thyroid disease, ankylosing spondylitis (rs7672495)
Int. iGLUTs MBD2 1.2e-15 8.01 Cerebellar Ataxia, Deafness, Narcolepsy (AD) Breast Cancer Allergic disease, psoriasis, neuroticism, hoarding disorder, executive funcion measurement, memory function
SHOX 2.9e-l3 −7.30 Turner Syndrome, Dysplasia
Mat. iGLUTs MAP3K14 6.2e-36 12.52 Immunodeficlency (AR; X-linked), Ectodermal Dysplasia. Noonan Syndrome PD, neuroticism, neuroimaging, unipolar depression, mood disorder, anxiety, cognitive function, MS, asthma, allergic disease.
JMY 2.4e 44 −13.97 Galloway Mowat Syndrome (AR; X-linked) T2D
Mat. iGABAs GPR83 1.36–17 −8.54 testosterone measurement, free androgen index, age at menarche, anxiety like-behaviors
UBE2D4 3.9e-2l 9.44 Brachydactyly, Type E2 (BDE2) (AD)

Autosomal dominate (AD), Parkinson’s disease (PD), multiple sclerosis (MS), ulcerative colitis (UC), type-1 diabetes (T1D), type=2 diabetes (T2D), Crohn’s disease (CD)

FUNDING SOURCES

This work was supported by F31MH130122 (K.R.T), HHMI Gilliams Fellowship (A.P.), Autism Science Foundation (A.P.), MH014276 (M.F.G.) R01MH123155 (K.J.B.), RM1MH132648 (K.J.B. and E.J.H.), R01MH121074 (K.J.B.), R01MH116002 (E.J.H.) R21MH133245 (E.J.H.), and R01ES033630 (L.H., K.J.B.), R01MH124839 (LMH), R01MH118278 (LMH), Simons Foundation (#1012863KB, #573508EH and #345993EH), Spector Fund, (E.J.H. and Swebilius Foundation (E.J.H.); Kavli Foundation (E.J.H.); and Interdepartmental Neuroscience Program at Yale (A.P).

Special thanks to Michael Talkowski and Douglas Ruderfer for countless discussions on convergence and to Summer Thyme for sharing zebrafish mutants.

Footnotes

CONFLICT OF INTEREST STATEMENT

The authors declare no conflict of interest.

STATEMENT OF ETHICS

Yale University Institutional Review Board waived ethical approval for this work. Ethical approval was not required because the hiPSC lines, lacking association with any identifying information and widely accessible from a public repository, are thus not considered to be human subject research. Post-mortem brain data are similarly lacking identifiable information and are not considered human subject research.

DATA AND CODE AVAILABILITY

All source donor hiPSCs have been deposited at the Rutgers University Cell and DNA Repository (study 160; http://www.nimhstemcells.org/).

sc-RNA sequencing data reported in this paper will be uploaded to Gene expression omnibus (GEO) prior to publication. Processed data and accompanying code can be accessed through will be accessible through Synapse prior to publication.

REFERENCES

  • 1.Grove J. et al. Identification of common genetic risk variants for autism spectrum disorder. Nat Genet 51, 431–444, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Satterstrom F. K. et al. Large-Scale Exome Sequencing Study Implicates Both Developmental and Functional Changes in the Neurobiology of Autism. Cell 180, 568–584 e523, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Weiner D. J. et al. Statistical and functional convergence of common and rare genetic influences on autism at chromosome 16p. Nat Genet 54, 1630–1639, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Willsey H. R. et al. Genomics, convergent neuroscience and progress in understanding autism spectrum disorder. Nat Rev Neurosci 23, 323–341, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bicks L. K. et al. Functional neurogenomics in autism spectrum disorders: A decade of progress. Curr Opin Neurobiol 86, 102858, (2024). [DOI] [PubMed] [Google Scholar]
  • 6.Quesnel-Vallieres M. et al. Autism spectrum disorder: insights into convergent mechanisms from transcriptomics. Nat Rev Genet 20, 51–63, (2019). [DOI] [PubMed] [Google Scholar]
  • 7.Geschwind D. H. Autism: many genes, common pathways? Cell 135, 391–395, (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.De Rubeis S. et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature 515, 209–215, (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.O’Roak B. J. et al. Recurrent de novo mutations implicate novel genes underlying simplex autism risk. Nat Commun 5, 5595, (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Talkowski M. E. et al. Sequencing chromosomal abnormalities reveals neurodevelopmental loci that confer risk across diagnostic boundaries. Cell 149, 525–537, (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Neale B. M. et al. Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature 485, 242–245, (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.O’Roak B. J. et al. Multiplex targeted sequencing identifies recurrently mutated genes in autism spectrum disorders. Science 338, 1619–1622, (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Sanders S. J. et al. De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature 485, 237–241, (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Willsey A. J. et al. Coexpression networks implicate human midfetal deep cortical projection neurons in the pathogenesis of autism. Cell 155, 997–1007, (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Xu X. et al. Cell type-specific expression analysis to identify putative cellular mechanisms for neurogenetic disorders. J Neurosci 34, 1420–1431, (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Polioudakis D. et al. A Single-Cell Transcriptomic Atlas of Human Neocortical Development during Mid-gestation. Neuron 103, 785–801 e788, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Parikshak N. N. et al. Integrative functional genomic analyses implicate specific molecular pathways and circuits in autism. Cell 155, 1008–1021, (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Voineagu I. et al. Transcriptomic analysis of autistic brain reveals convergent molecular pathology. Nature 474, 380–384, (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Liao C. et al. Convergent coexpression of autism-associated genes suggests some novel risk genes may not be detectable in large-scale genetic studies. Cell Genom 3, 100277, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Pintacuda G. et al. Protein interaction studies in human induced neurons indicate convergent biology underlying autism spectrum disorders. Cell Genom 3, 100250, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Wang B. et al. A foundational atlas of autism protein interactions reveals molecular convergence. bioRxiv, (2023). [Google Scholar]
  • 22.Murtaza N. et al. Neuron-specific protein network mapping of autism risk genes identifies shared biological mechanisms and disease-relevant pathologies. Cell reports 41, 111678, (2022). [DOI] [PubMed] [Google Scholar]
  • 23.Cederquist G. Y. et al. A Multiplex Human Pluripotent Stem Cell Platform Defines Molecular and Functional Subclasses of Autism-Related Genes. Cell stem cell 27, 35–49 e36, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lalli M. A. et al. High-throughput single-cell functional elucidation of neurodevelopmental disease-associated genes reveals convergent mechanisms altering neuronal differentiation. Genome Res 30, 1317–1331, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Sun N. et al. Autism genes converge on microtubule biology and RNA-binding proteins during excitatory neurogenesis. bioRxiv, 2023.2012.2022.573108, (2024). [Google Scholar]
  • 26.Paulsen B. et al. Autism genes converge on asynchronous development of shared neuron classes. Nature 602, 268–273, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Meng X. et al. Assembloid CRISPR screens reveal impact of disease genes in human neurodevelopment. Nature 622, 359–366, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Li C. et al. Single-cell brain organoid screening identifies developmental defects in autism. Nature 621, 373–380, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Jin X. et al. In vivo Perturb-Seq reveals neuronal and glial abnormalities associated with autism risk genes. Science 370, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Willsey H. R. et al. Parallel in vivo analysis of large-effect autism genes implicates cortical neurogenesis and estrogen in risk and resilience. Neuron 109, 1409, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Weinschutz Mendes H. et al. High-throughput functional analysis of autism genes in zebrafish identifies convergence in dopaminergic and neuroimmune pathways. Cell reports 42, 112243, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Rubenstein J. L. et al. Model of autism: increased ratio of excitation/inhibition in key neural systems. Genes, brain, and behavior 2, 255–267, (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Antoine M. W. et al. Increased Excitation-Inhibition Ratio Stabilizes Synapse and Circuit Excitability in Four Autism Mouse Models. Neuron 101, 648–661 e644, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Nelson S. B. et al. Excitatory/Inhibitory Balance and Circuit Homeostasis in Autism Spectrum Disorders. Neuron 87, 684–698, (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Bernier R. et al. Disruptive CHD8 mutations define a subtype of autism early in development. Cell 158, 263–276, (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kerschbamer E. et al. CHD8 suppression impacts on histone H3 lysine 36 trimethylation and alters RNA alternative splicing. Nucleic Acids Res 50, 12809–12828, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Shi X. et al. Heterozygous deletion of the autism-associated gene CHD8 impairs synaptic function through widespread changes in gene expression and chromatin compaction. Am J Hum Genet 110, 1750–1768, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Kawamura A. et al. Deletion of the autism-related gene Chd8 alters activity-dependent transcriptional responses in mouse postmitotic neurons. Commun Biol 6, 593, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Villa C. E. et al. CHD8 haploinsufficiency links autism to transient alterations in excitatory and inhibitory trajectories. Cell reports 39, 110615, (2022). [DOI] [PubMed] [Google Scholar]
  • 40.Dong C. et al. Conserved and Distinct Functions of the Autism-Related Chromatin Remodeler CHD8 in Embryonic and Adult Forebrain Neurogenesis. J Neurosci 42, 8373–8392, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Astorkia M. et al. Molecular and network disruptions in neurodevelopment uncovered by single cell transcriptomics analysis of CHD8 heterozygous cerebral organoids. bioRxiv, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Deans P. M. et al. Non-additive effects of schizophrenia risk genes reflect convergent downstream function. medRxiv, 2023.2003.2020.23287497, (2023). [Google Scholar]
  • 43.Townsley K. G. et al. Convergent impact of schizophrenia risk genes. bioRxiv, 2022.2003.2029.486286, (2023). [Google Scholar]
  • 44.Bell J. Stratified medicines: towards better treatment for disease. Lancet 383 Suppl 1, S3–5, (2014). [DOI] [PubMed] [Google Scholar]
  • 45.Tsimberidou A. M. et al. Molecular tumour boards - current and future considerations for precision oncology. Nature reviews. Clinical oncology 20, 843–863, (2023). [DOI] [PubMed] [Google Scholar]
  • 46.Zhang H. et al. Monogenic diabetes: a gateway to precision medicine in diabetes. J Clin Invest 131, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Fu J. M. et al. Rare coding variation provides insight into the genetic architecture and phenotypic context of autism. Nat Genet 54, 1320–1331, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Birtele M. et al. Non-synaptic function of the autism spectrum disorder-associated gene SYNGAP1 in cortical neurogenesis. Nat Neurosci 26, 2090–2103, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Flaherty E. et al. Neuronal impact of patient-specific aberrant NRXN1alpha splicing. Nat Genet 51, 1679–1690, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Sebastian R. et al. Schizophrenia-associated NRXN1 deletions induce developmental-timing- and cell-type-specific vulnerabilities in human brain organoids. Nature Communications 14, 3770, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Li C. et al. Single-cell brain organoid screening identifies developmental defects in autism. bioRxiv, 2022.2009.2015.508118, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Martins-Costa C. et al. ARID1B controls transcriptional programs of axon projection in an organoid model of the human corpus callosum. Cell stem cell 31, 866–885 e814, (2024). [DOI] [PubMed] [Google Scholar]
  • 53.Vermaercke B. et al. SYNGAP1 deficiency disrupts synaptic neoteny in xenotransplanted human cortical neurons in vivo. Neuron, (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Pak C. et al. Cross-platform validation of neurotransmitter release impairments in schizophrenia patient-derived NRXN1-mutant neurons. Proc Natl Acad Sci U S A 118, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Ellingford R. A. et al. Cell-type-specific synaptic imbalance and disrupted homeostatic plasticity in cortical circuits of ASD-associated Chd8 haploinsufficient mice. Mol Psychiatry 26, 3614–3624, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Yi F. et al. Autism-associated SHANK3 haploinsufficiency causes Ih channelopathy in human neurons. Science 352, aaf2669, (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Fernando M. B. et al. Precise Therapeutic Targeting of Distinct NRXN1(+/−) Mutations. bioRxiv, (2023). [Google Scholar]
  • 58.Jung E. M. et al. Arid1b haploinsufficiency disrupts cortical interneuron development and mouse behavior. Nat Neurosci 20, 1694–1707, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Chen Q. et al. Dysfunction of cortical GABAergic neurons leads to sensory hyper-reactivity in a Shank3 mouse model of ASD. Nat Neurosci 23, 520–532, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Schrode N. et al. Synergistic effects of common schizophrenia risk variants. Nat Genet 51, 1475–1485, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Wang M. et al. Transformative Network Modeling of Multi-omics Data Reveals Detailed Circuits, Key Regulators, and Potential Therapeutics for Alzheimer’s Disease. Neuron 109, 257–272 e214, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Ho S. M. et al. Rapid Ngn2-induction of excitatory neurons from hiPSC-derived neural progenitor cells. Methods 101, 113–124, (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Marro S. G. et al. Neuroligin-4 Regulates Excitatory Synaptic Transmission in Human Neurons. Neuron 103, 617–626 e616, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Zhang Z. et al. The fragile X mutation impairs homeostatic plasticity in human neurons by blocking synaptic retinoic acid signaling. Science translational medicine 10, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Zhang Y. et al. Rapid single-step induction of functional neurons from human pluripotent stem cells. Neuron 78, 785–798, (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Meijer M. et al. A Single-Cell Model for Synaptic Transmission and Plasticity in Human iPSC-Derived Neurons. Cell reports 27, 2199–2211 e2196, (2019). [DOI] [PubMed] [Google Scholar]
  • 67.Zhang S. et al. Allele-specific open chromatin in human iPSC neurons elucidates functional disease variants. Science 369, 561–565, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Sun Y. et al. A deleterious Nav1.1 mutation selectively impairs telencephalic inhibitory neurons derived from Dravet Syndrome patients. Elife 5, (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Barretto N. et al. ASCL1- and DLX2-induced GABAergic neurons from hiPSC-derived NPCs. J Neurosci Methods 334, 108548, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Powell S. K. et al. Induction of dopaminergic neurons for neuronal subtype-specific modeling of psychiatric disease risk. Mol Psychiatry, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Yang N. et al. Generation of pure GABAergic neurons by transcription factor programming. Nat Methods, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Wells M. F. et al. Natural variation in gene expression and viral susceptibility revealed by neural progenitor cell villages. Cell stem cell 30, 312–332 e313, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Guss E. J. et al. Protocol for neurogenin-2-mediated induction of human stem cell-derived neural progenitor cells. Star Protoc 5, 102878, (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.de Leeuw C. A. et al. MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput Biol 11, e1004219, (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Sullivan P. F. et al. Defining the Genetic, Genomic, Cellular, and Diagnostic Architectures of Psychiatric Disorders. Cell 177, 162–183, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Mimitou E. P. et al. Multiplexed detection of proteins, transcriptomes, clonotypes and CRISPR perturbations in single cells. Nat Methods 16, 409–412, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Wray N. R. et al. Common Disease Is More Complex Than Implied by the Core Gene Omnigenic Model. Cell 173, 1573–1580, (2018). [DOI] [PubMed] [Google Scholar]
  • 78.Boyle E. A. et al. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell 169, 1177–1186, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Breiman L. Random Forests. Machine Learning 45, 5–32, (2001). [Google Scholar]
  • 80.Watanabe K. et al. Functional mapping and annotation of genetic associations with FUMA. Nat Commun 8, 1826, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Subramanian A. et al. A Next Generation Connectivity Map: L1000 Platform and the First 1,000,000 Profiles. Cell 171, 1437–1452 e1417, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Wong C. C. Y. et al. Genome-wide DNA methylation profiling identifies convergent molecular signatures associated with idiopathic and syndromic autism in post-mortem human brain tissue. Hum Mol Genet 28, 2201–2211, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Ramaswami G. et al. Integrative genomics identifies a convergent molecular subtype that links epigenomic with transcriptomic differences in autism. Nat Commun 11, 4873, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Schafer S. T. et al. Pathological priming causes developmental gene network heterochronicity in autistic subject-derived neurons. Nat Neurosci 22, 243–255, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Gaugler T. et al. Most genetic risk for autism resides with common variation. Nat Genet 46, 881–885, (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Schaaf C. P. et al. A framework for an evidence-based gene list relevant to autism spectrum disorder. Nat Rev Genet 21, 367–376, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Shi Y. et al. Multi-polygenic scores in psychiatry: From disorder specific to transdiagnostic perspectives. Am J Med Genet B Neuropsychiatr Genet 195, e32951, (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Andreassen O. A. et al. New insights from the last decade of research in psychiatric genetics: discoveries, challenges and clinical implications. World Psychiatry 22, 4–24, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Weiner D. J. et al. Polygenic transmission disequilibrium confirms that common and rare variation act additively to create risk for autism spectrum disorders. Nat Genet 49, 978–985, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Klei L. et al. How rare and common risk variation jointly affect liability for autism spectrum disorder. Mol Autism 12, 66, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Bergen S. E. et al. Joint Contributions of Rare Copy Number Variants and Common SNPs to Risk for Schizophrenia. Am J Psychiatry 176, 29–35, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Akingbuwa W. A. et al. Ultra-rare and common genetic variant analysis converge to implicate negative selection and neuronal processes in the aetiology of schizophrenia. Mol Psychiatry 27, 3699–3707, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Oliver K. L. et al. Common risk variants for epilepsy are enriched in families previously targeted for rare monogenic variant discovery. EBioMedicine 81, 104079, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Genetic Modifiers of Huntington’s Disease, C. Identification of Genetic Factors that Modify Clinical Onset of Huntington’s Disease. Cell 162, 516–526, (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Kingdom R. et al. Genetic modifiers of rare variants in monogenic developmental disorder loci. Nat Genet 56, 861–868, (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Dobrindt K. et al. Publicly Available hiPSC Lines with Extreme Polygenic Risk Scores for Modeling Schizophrenia. Complex Psychiatry 6, 68–82, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Bjork M. H. et al. Association of Prenatal Exposure to Antiseizure Medication With Risk of Autism and Intellectual Disability. JAMA Neurol 79, 672–681, (2022). [DOI] [PubMed] [Google Scholar]
  • 98.Anton-Bolanos N. et al. Brain Chimeroids reveal individual susceptibility to neurotoxic triggers. Nature, (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Gandal M. J. et al. Broad transcriptomic dysregulation occurs across the cerebral cortex in ASD. Nature 611, 532–539, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Wamsley B. et al. Molecular cascades and cell type-specific signatures in ASD revealed by single-cell genomics. Science 384, eadh2602, (2024). [DOI] [PubMed] [Google Scholar]
  • 101.Yap C. X. et al. Brain cell-type shifts in Alzheimer’s disease, autism, and schizophrenia interrogated using methylomics and genetics. Sci Adv 10, eadn7655, (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Zhang P. et al. Neuron-specific transcriptomic signatures indicate neuroinflammation and altered neuronal activity in ASD temporal cortex. Proc Natl Acad Sci U S A 120, e2206758120, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Bhattacharya A. et al. Isoform-level transcriptome-wide association uncovers genetic risk mechanisms for neuropsychiatric disorders in the human brain. Nature genetics 55, 2117–2128, (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Gandal M. J. et al. Transcriptome-wide isoform-level dysregulation in ASD, schizophrenia, and bipolar disorder. Science 362, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Gandal M. J. et al. Shared molecular neuropathology across major psychiatric disorders parallels polygenic overlap. Science 359, 693–697, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Han V. X. et al. Maternal immune activation and neuroinflammation in human neurodevelopmental disorders. Nat Rev Neurol 17, 564–579, (2021). [DOI] [PubMed] [Google Scholar]
  • 107.Seah C. et al. Modeling gene x environment interactions in PTSD using human neurons reveals diagnosis-specific glucocorticoid-induced gene expression. Nat Neurosci 25, 1434–1445, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Seah C. et al. Common genetic variation impacts stress response in the brain. bioRxiv, 2023.2012.2027.573459, (2023). [Google Scholar]
  • 109.Retallick-Townsley K. G. et al. Dynamic stress- and inflammatory-based regulation of psychiatric risk loci in human neurons. bioRxiv, 2024.2007.2009.602755, (2024). [Google Scholar]
  • 110.Cruceanu C. et al. Cell-Type-Specific Impact of Glucocorticoid Receptor Activation on the Developing Brain: A Cerebral Organoid Study. Am J Psychiatry, appiajp202121010095, (2021). [DOI] [PubMed] [Google Scholar]
  • 111.Teter O. M. et al. CRISPRi-based screen of Autism Spectrum Disorder risk genes in microglia uncovers roles of <em>ADNP</em> in microglia endocytosis and uptake of synaptic material. bioRxiv, 2024.2006.2001.596962, (2024). [Google Scholar]
  • 112.Ma Y. et al. Activity-Dependent Transcriptional Program in NGN2+ Neurons Enriched for Genetic Risk for Brain-Related Disorders. Biol Psychiatry 95, 187–198, (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Roussos P. et al. Activity-Dependent Changes in Gene Expression in Schizophrenia Human-Induced Pluripotent Stem Cell Neurons. JAMA Psychiatry 73, 1180–1188, (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Sanchez-Priego C. et al. Mapping cis-regulatory elements in human neurons links psychiatric disease heritability and activity-regulated transcriptional programs. Cell reports 39, 110877, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Boulting G. L. et al. Activity-dependent regulome of human GABAergic neurons reveals new patterns of gene regulation and neurological disease heritability. Nat Neurosci 24, 437–448, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Ahn K. et al. High rate of disease-related copy number variations in childhood onset schizophrenia. Mol Psychiatry 19, 568–572, (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Ahn K. et al. Common polygenic variation and risk for childhood-onset schizophrenia. Mol Psychiatry, (2014). [DOI] [PubMed] [Google Scholar]
  • 118.Hoffman G. E. et al. Transcriptional signatures of schizophrenia in hiPSC-derived NPCs and neurons are concordant with post-mortem adult brains. Nat Commun 8, 2225, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Miller J. A. et al. Transcriptional landscape of the prenatal human brain. Nature 508, 199–206, (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Butler A. et al. Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol 36, 411–420, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Papalexi E. et al. Characterizing the molecular regulation of inhibitory immune checkpoints with multimodal single-cell screens. Nat Genet 53, 322–331, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Tirosh I. et al. Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq. Science 352, 189–196, (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Hao Y. et al. Integrated analysis of multimodal single-cell data. Cell 184, 3573–3587 e3529, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Tian R. et al. CRISPR Interference-Based Platform for Multimodal Genetic Screens in Human iPSC-Derived Neurons. Neuron, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Willer C. J. et al. METAL: fast and efficient meta-analysis of genomewide association scans. Bioinformatics 26, 2190–2191, (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Saha A. et al. Co-expression networks reveal the tissue-specific regulation of transcription and splicing. Genome Res 27, 1843–1858, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Gao C. et al. Context Specific and Differential Gene Co-expression Networks via Bayesian Biclustering. PLoS Comput Biol 12, e1004791, (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Gao C B. C., Engelhardt BE A latent factor model with a mixture of sparse and dense factors to model gene expression data with confounding effects. arXiv, (2013). [Google Scholar]
  • 129.Yu G. Gene Ontology Semantic Similarity Analysis Using GOSemSim. Methods in molecular biology 2117, 207–215, (2020). [DOI] [PubMed] [Google Scholar]
  • 130.Demontis D. et al. Discovery of the first genome-wide significant risk loci for attention deficit/hyperactivity disorder. Nat Genet 51, 63–75, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 131.Duncan L. et al. Significant Locus and Metabolic Genetic Correlations Revealed in Genome-Wide Association Study of Anorexia Nervosa. Am J Psychiatry 174, 850–858, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Walters R. K. et al. Transancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. Nat Neurosci 21, 1656–1669, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 133.Mullins N. et al. Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nat Genet 53, 817–829, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Johnson E. C. et al. A large-scale genome-wide association study meta-analysis of cannabis use disorder. Lancet Psychiatry 7, 1032–1045, (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.Howard D. M. et al. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nat Neurosci 22, 343–352, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 136.International Obsessive Compulsive Disorder Foundation Genetics, C. et al. Revealing the complex genetic architecture of obsessive-compulsive disorder using meta-analysis. Mol Psychiatry 23, 1181–1188, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 137.Nievergelt C. M. et al. International meta-analysis of PTSD genome-wide association studies identifies sex- and ancestry-specific genetic risk loci. Nat Commun 10, 4558, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 138.Trubetskoy V. et al. Mapping genomic loci implicates genes and synaptic biology in schizophrenia. Nature 604, 502–508, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Cross-Disorder Group of the Psychiatric Genomics Consortium. Genomic Relationships, Novel Loci, and Pleiotropic Mechanisms across Eight Psychiatric Disorders. Cell 179, 1469–1482 e1411, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 140.Marioni R. E. et al. GWAS on family history of Alzheimer’s disease. Translational psychiatry 8, 99, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 141.Nalls M. A. et al. Identification of novel risk loci, causal insights, and heritable risk for Parkinson’s disease: a meta-analysis of genome-wide association studies. The Lancet. Neurology 18, 1091–1102, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 142.Nicolas A. et al. Genome-wide Analyses Identify KIF5A as a Novel ALS Gene. Neuron 97, 1268–1283 e1266, (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 143.Yu D. et al. Interrogating the Genetic Determinants of Tourette’s Syndrome and Other Tic Disorders Through Genome-Wide Association Studies. Am J Psychiatry 176, 217–227, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 144.Hautakangas H. et al. Genome-wide analysis of 102,084 migraine cases identifies 123 risk loci and subtype-specific risk alleles. Nat Genet 54, 152–160, (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 145.Johnston K. J. A. et al. Genome-wide association study of multisite chronic pain in UK Biobank. PLoS genetics 15, e1008164, (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 146.Lo M. T. et al. Genome-wide analyses for personality traits identify six genomic loci and show correlations with psychiatric disorders. Nat Genet 49, 152–156, (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 147.Wang J. et al. WebGestaltR: Gene Set Analysis Toolkit WebGestaltR. R package version 0.4.3., <https://CRAN.R-project.org/package=WebGestaltR> (2020). [Google Scholar]
  • 148.Capps M. E. S. et al. Diencephalic and Neuropeptidergic Dysfunction in Zebrafish with Autism Risk Mutations. bioRxiv, 2024.2001.2018.576309, (2024). [Google Scholar]
  • 149.Kroll F. et al. A simple and effective F0 knockout method for rapid screening of behaviour and other complex phenotypes. Elife 10, (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1
media-1.xlsx (31.8KB, xlsx)
Supplement 2
media-2.xlsx (1.2MB, xlsx)
Supplement 3
media-3.xlsx (2.4MB, xlsx)
Supplement 4
media-4.pdf (15.1MB, pdf)

Data Availability Statement

All source donor hiPSCs have been deposited at the Rutgers University Cell and DNA Repository (study 160; http://www.nimhstemcells.org/).

sc-RNA sequencing data reported in this paper will be uploaded to Gene expression omnibus (GEO) prior to publication. Processed data and accompanying code can be accessed through will be accessible through Synapse prior to publication.


Articles from bioRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES