Abstract
Cellular heterogeneity in the human brain obscures the identification of robust cellular regulatory networks, which is necessary to understand the function of non-coding elements and the impact of non-coding genetic variation. Here we integrate genome-wide chromosome conformation data from purified neurons and glia with transcriptomic and enhancer profiles, to characterize the gene regulatory landscape of two major cell classes in the human brain. We then leverage cell-type-specific regulatory landscapes to gain insight into the cellular etiology of several brain disorders. We find that Alzheimer’s disease (AD)-associated epigenetic dysregulation is linked to neurons and oligodendrocytes, whereas genetic risk factors for AD highlighted microglia, suggesting that different cell types may contribute to disease risk, via different mechanisms. Moreover, integration of glutamatergic and GABAergic regulatory maps with genetic risk factors for schizophrenia (SCZ) and bipolar disorder (BD) identifies shared (parvalbumin-expressing interneurons) and distinct cellular etiologies (upper layer neurons for BD, and deeper layer projection neurons for SCZ). Collectively, these findings shed new light on cell-type-specific gene regulatory networks in brain disorders.
Subject terms: Epigenetics, Epigenomics, Chromatin, Epigenetics in the nervous system, Genetics of the nervous system
The cellular heterogeneity in brain obscures the identification of robust cellular regulatory networks. Here the authors integrate genome-wide chromosome conformation data from sorted neurons and glia, with transcriptomic and enhancer profiles, to characterize cell-type-specific gene regulatory landscapes in the human brain, and provide insights into cell-type-specific gene regulatory networks in brain disorders.
Introduction
The majority of human genetic variants imparting risk for brain diseases are located within non-coding elements1. Allelic variation in these elements is thought to have an influence on complex human traits via impacting gene regulation2, necessitating the understanding of gene regulatory architecture in the human brain. We and others have identified gene regulatory relationships in the developing and adult human brain by integrating multi-dimensional datasets that include transcriptomic, epigenomic, and higher-order chromatin interaction landscapes3–7. However, cellular heterogeneity poses significant challenges for unraveling the complexity in the gene regulatory architecture of the human brain, as it is comprised of heterogeneous cell populations, mostly different types of neurons and glia, each of which display distinct gene expression3,8–10 and chromatin accessibility profiles11–14. To address this issue, several groups have employed Hi–C and its derivatives (e.g. promoter-capture Hi–C) to build higher-order chromatin interaction maps in induced pluripotent stem cells (iPSC)-derived neurons and astrocytes15. However, these published studies rely on in vitro cultured cells that mark early brain development. Recently, promoter interaction profiles were inferred from three types of brain cells (neurons, oligodendrocytes, and microglia) obtained from the postnatal human cerebral cortex16.
Here, we analyze genome-wide chromosome conformation at cellular resolution to capture how chromatin structure affects cellular expression profiles. We use fluorescence-activated nuclear sorting (FANS, “Methods”)17 to sort neurons (NeuN+ cells) and glia (NeuN− cells), two major cell classes in the brain, and generated genome-wide chromosome conformation using Hi–C (Supplementary Figs. 1–2). We call multiple architectural units: A/B compartments, which are mutually exclusive large-scale chromatin domains, where compartment A is associated with actively transcribed regions and compartment B is enriched with repressive chromatin regions; Topologically Associating Domains (TADs), which are enriched with self-contacts and thus delineate the genomic regions within which most cis-regulatory interactions occur; Frequently Interacting REgions (FIREs), which are local interaction hotspots that are enriched with active regulatory elements; and gene loops within NeuN+ and NeuN− cells which link promotors to presumed regulatory elements (Supplementary Fig. 1). Furthermore, we integrate acetylated histone H3 lysine 27 (H3K27ac) peaks from glutamatergic (Glu) and medial ganglionic eminence (MGE)-derived GABAergic (GABA) neurons18 with NeuN+ chromatin interactions to obtain finer-scale gene regulatory relationship within these two major neuronal subtypes. We then leveraged cell-type-specific gene regulatory relationships to help decipher the genetic mechanisms contributing to Alzheimer’s disease (AD), schizophrenia (SCZ), and bipolar disorder (BD). Our results illustrate deciphering the epigenetic landscape in a cell-type-specific fashion offers substantial advantages for inferring the functional impact of genetic risk factors associated with brain disorders.
Results
Differential FIREs and super-FIREs are associated with cell-type-specific gene regulation
Hi–C libraries were generated from NeuN+ and NeuN− cells sorted from four dorsolateral prefrontal cortex (DLPFC) samples (Supplementary Table 1 and Supplementary Fig. 2). We first measured the reproducibility across Hi–C libraries using a stratum-adjusted correlation coefficient (SCC) that systematically quantifies similarities between two Hi–C contact maps19. Notably, we found that samples were clustered by cell types, not individuals, and showed high correlation between the same cell types from different individuals (NeuN+ = 0.95–0.97; NeuN− = 0.86-0.91; Supplementary Fig. 3a), demonstrating the reproducibility across Hi–C libraries.
We also compared the chromatin contacts of NeuN+ and NeuN− cells with other existing Hi–C datasets from brain-relevant tissues (Supplementary Fig. 3b, “Methods”). We found that NeuN− cells showed higher structural similarity with the adult brain than the fetal brain, indicative of gliogenesis in the postnatal brain20. Intriguingly, NeuN− cells did not show high structural similarity with iPSC-derived astrocytes, consistent with the previous report that the majority of NeuN− cells are oligodendrocytes21. NeuN+ cells showed high similarity with adult brains, fetal brains, and iPSC-derived neurons. The fact that NeuN+ cells show high similarity with fetal brain may reflect extensive neurogenesis at midgestation, the developmental stage at which the fetal brain was obtained22.
We next interrogated the cell-type-specific nature of 3D chromatin structures such as compartments23 and FIREs6,24. We detected extensive compartment switching between NeuN+ and NeuN− cells: 4,333 regions (in 100kb resolution) switched from compartment A to B between NeuN− to NeuN+ cells, while 2098 regions switched from compartment B to A in NeuN+ to NeuN− cells. Importantly, genes located in compartments that switch from A to B between NeuN− and NeuN+ were highly expressed in oligodendrocytes and astrocytes, while those that switch from B to A in NeuN− to NeuN+ were highly expressed in neurons, suggesting that the difference in chromosome conformation between NeuN+ and NeuN− cells is associated with cell-type-specific gene regulation (Supplementary Fig. 3c).
FIREs represent regions that act as interaction hubs, containing active enhancers that show high tissue specificity6. They are also enriched for genome-wide association study (GWAS) variants and exhibit variability across individuals25,26. We therefore compared FIREs in NeuN+ and NeuN− cells to identify how local chromatin architecture differs among these major brain cell types. We detected 3966 and 3967 FIREs in NeuN+ and NeuN− cells, respectively (Fig. 1a and Supplementary Data 1)24. Among them, 1248 FIREs were shared between NeuN+ and NeuN− cells based on the coordinate-level overlap and similarities in FIRE scores (hereafter referred to as common FIREs, “Methods”, Supplementary Data 1). Genes located in common FIREs were involved in housekeeping functions (e.g., microtubule polymerization, cell junction assembly) as well as pathways that involve both neurons and oligodendrocytes (e.g., main axon, Supplementary Data 1).
To further investigate how FIREs are associated with cell-type-specific gene expression, we used a stringent statistical cutoff to define differential FIREs on the basis of FIRE scores (“Methods”), detecting 287 differential FIREs between NeuN+ (145) and NeuN− (142) cells (hereby referred to as NeuN+ and NeuN− FIREs, respectively, Fig. 1b, Supplementary Data 2). Since previous reports have suggested that FIREs are closely linked to cell-type-specific enhancers6, we intersected differential FIREs with differential H3K27ac peaks between NeuN+ and NeuN− cells27. We found that the majority of NeuN+ and NeuN− FIREs overlapped with NeuN+ and NeuN− differential H3K27ac peaks, respectively, displaying remarkable cell-type-specificity (Fig. 1c, d). We next examined whether differential FIREs overlap with cell-type-specific marker genes. Indeed, NeuN+ FIREs overlapped with neuronal genes that were enriched for synaptic function (Fig. 1g), while NeuN− FIREs overlapped with genes involved in myelination, glial differentiation, and oligodendrocyte differentiation (Fig. 1h). Salient examples include GRIN2B, which overlaps with a NeuN+ FIRE, and OLIG1 and OLIG2, which overlap with a NeuN− FIRE (Fig. 1e, f, Supplementary Fig. 4). We further checked cellular expression profiles of genes assigned to differential FIREs using single-cell (sc)RNA-seq data10. As expected, NeuN+ and NeuN− FIRE-associated genes were mainly enriched in neurons and glia, respectively (Fig. 1i, j). In particular, NeuN− FIRE-associated genes were most highly expressed in oligodendrocytes, confirming that NeuN− fraction is enriched for oligodendrocytes21.
Super-FIREs represent a small proportion of FIRE clusters with the most significant local interaction frequency6. Super-FIREs are thought to have a strong gene regulatory potential and often overlap with super enhancers6. We therefore identified super-FIREs in NeuN+ and NeuN− cells (“Methods”) and found that they also displayed substantial cellular specificities. Only 9 super-FIREs were detected in both NeuN+ and NeuN− cells, leaving 253 and 157 cell-type-specific super-FIREs in NeuN+ and NeuN− cells, respectively (Supplementary Fig. 5a, Supplementary Data 2). Over 95% of super-FIREs overlapped with differential H3K27ac peaks and all super-FIREs overlapped with promoters, indicating that they have a particularly strong cell-type-specific regulatory impact (Supplementary Fig. 5b, c). In line with these findings, super-FIREs were tightly coupled with cell-type-specific gene expression. NeuN+ super-FIREs overlapped with genes functioning in synapses and ion gated channels that were highly expressed in neurons, while NeuN− super-FIREs overlapped with genes involved in cell adhesion, myelination, and ensheathment of neurons, with high expression in glia (Supplementary Fig. 5d-f, 6). Taken together, these analyses show that differential FIREs and super-FIREs are strongly associated with cell-type-specific gene regulation in the nervous system.
Characteristics of chromatin interactions in NeuN+ and NeuN− cells
We next identified promoter-anchored chromatin loops in NeuN+ and NeuN− cells to examine how chromatin interactions are associated with intricate regulation of cell-type-specific expression profiles. We detected 187,674 and 167,551 promoter-based interactions3,4 from NeuN+ and NeuN− cells, respectively (Fig. 2a). Over 75% of promoter-based interactions were detected within TADs (Supplementary Fig. 7a). Roughly 37% of interactions occurred between enhancers and promoters and ~23% of interactions occurred between two different promoters (Supplementary Fig. 7b). We infer that the promoter–promoter interactions are likely attributed to transcription factors (TFs) of coregulated genes. A substantial fraction of chromatin interactions were distal, as ~50% of chromatin interactions brought two genomic regions linearly separated by 320 kb or more into close proximity (Supplementary Fig. 7c) and captured complex enhancer–promoter interactions. For example, we found that the majority of promoters interact with more than one enhancer (Supplementary Fig. 7d), consistent with the previous findings that multiple enhancers can interact with one promoter3,7. Notably, the number of enhancers that interact with promoters was strikingly correlated with gene regulation, as gene expression linearly increased with the number of physically interacting enhancers up to a total of 10 or more (Supplementary Fig. 7e)3,28.
We next leveraged chromatin states predicted by ChromHMM29 to delineate the epigenetic properties of the genomic regions that interact with promoters (Supplementary Fig. 7f). As expected, promoters often interact with active chromatin features such as other transcription start sites (TSS, 1_TssA and 2_TssAFlnk) and enhancers (6_EnhG and 7_Enh). However, it is of note that a significant proportion of promoters also interact with bivalent marks (10_TssBiv, 11_BivFlnk and 12_EnhBiv), suggesting that the associated regions are poised to be activated upon stimulation. Together, our results confirmed that gene expression was coordinately controlled by physical interactions of promoters with enhancers, indicative of transcriptional regulators that underlie cell-type-specific gene regulation. So, we implemented GimmeMotifs30 to evaluate differential TF motif enrichment at cell-type-specific enhancers that interact with promoters in NeuN+ and NeuN− cells. TFs involved in neuronal fate commitment, including ZBTB18, SMARCC1, TBR1, and NEUROD2, were enriched in NeuN+ cells (Supplementary Fig. 7g). Conversely, TF motifs for SOX2, SOX3, SOX4, SOX6, and SOX9 were enriched in NeuN− cells (Supplementary Fig. 7g). NeuN− distal enhancers were also enriched for motifs for the IRF family, such as IRF4, IRF7, IRF8, and IRF9 (Supplementary Fig. 7g), which contains key regulators of neural immune pathways expressed in glial cells31.
To further identify how enhancer–promoter interactions regulate cellular expression profiles, we next overlapped chromatin interactions with cell-type-specific enhancers (“Methods”). We were able to assign 10,167 and 11,242 cell-type-specific H3K27ac peaks to 7828 and 8851 genes via chromatin interactions in NeuN+ and NeuN− cells, respectively (Fig. 2a, Supplementary Data 3). Linking cell-type-specific enhancers to cell-type-specific loops revealed specific aspects of chromatin architecture regulating cell-type-specific gene expression. For example, a gene that encodes a synaptic scaffolding protein, HOMER132, was engaged in NeuN+ specific peaks and loops and its expression was significantly higher in NeuN+ cells compared with NeuN− cells (Fig. 2b). In contrast, a glial gene, SOX1033, was engaged in NeuN− specific H3K27ac peaks and Hi–C loops, and had higher expression in NeuN− cells than in NeuN+ cells (Fig. 2c). In line with these findings, NeuN+ enhancer–promoter interactions were enriched for synaptic and axonal genes, whereas NeuN− enhancer–promoter interactions were associated with actin-based motility (Supplementary Data 3). Moreover, genes assigned to NeuN+ specific peaks were enriched in the synaptic co-expression modules during neurodevelopment34 (Fig. 2d). Within a synapse, they were involved in specialized functions including exocytosis, intracellular signal transduction, and synaptic plasticity35 (Fig. 2e). Lastly, genes assigned to NeuN+ specific peaks were more highly expressed in neurons, while those assigned to NeuN− specific peaks were highly expressed in oligodendrocytes and astrocytes (Fig. 2f), demonstrating the tight relationship between cell-type-specific chromatin architecture and expression signature.
Enhancer–promoter interactions in glutamatergic and GABAergic neurons
Single-cell expression profiles have demonstrated a remarkable transcriptional diversity within neuronal subtypes10. We therefore used H3K27ac peaks defined in purified Glu and MGE-derived GABA (hereafter referred to as GABA) neurons18 to deconvolute NeuN+ chromatin interactions into two major neuronal subtypes. We obtained 45,911 Glu-specific and 32,169 GABA-specific H3K27ac peaks and assigned them to 6234 and 4342 genes, respectively (Supplementary Fig. 8a, Supplementary Data 4). These genes showed a remarked level of neuronal specificity. Genes assigned to Glu peaks were highly expressed in excitatory neurons, such as layers (L)2/3 pyramidal neurons (Ex1) and L5/6 corticothalamic projection neurons (Ex7), while genes assigned to GABA peaks were highly expressed in inhibitory neurons, such as parvalbumin-expressing GABA interneurons (In6, Supplementary Fig. 8b, the definition of Ex1–8 and In1–8 neuronal subtypes is available in Supplementary Table 2). GRIK4, a gene that encodes an ionotropic class of glutamate receptor36, displayed complex chromatin interactions with multiple Glu peaks. In contrast, GAD1, a gene that encodes a well-known cellular marker for inhibitory neurons37, was engaged in GABA peaks via chromatin interactions (Supplementary Fig. 8c). Collectively, we established neuronal subtype-specific gene regulatory relationships by integrating Glu- and GABA-specific peaks with the NeuN+ (neuronal) chromatin interaction profiles defined here.
Cell-type-specific nature of AD-associated epigenetic dysregulation
We next used the cell-type-specific gene regulatory relationship to refine cell-type-specific aspects of disease vulnerability. Studies in AD have revealed changes in gene regulation manifested by differences in H3K27ac in bulk tissue38, but how this relates to cell-type-specific vulnerability is not known. To this end, we attempted to deconvolve the cell-type-specificity of epigenetic dysregulation detected in the AD brain tissue38 by overlapping AD-associated hyperacetylated and hypoacetylated H3K27ac peaks with NeuN+ and NeuN− peaks (Fig. 3a). We detected that AD-associated hyperacetylated peaks were largely active in NeuN− cells, while AD-associated hypoacetylated peaks were largely active in NeuN+ cells in neurotypical controls (Fig. 3b). To decipher the biological impact of AD-associated epigenetic dysregulation, we annotated these AD-associated hyperacetylated and hypoacetylated H3K27ac peaks using chromatin interaction profiles. We were able to link hypoacetylated peaks in AD to 460 genes using NeuN+ Hi–C data and hyperacetylated peaks in AD to 676 genes using NeuN− Hi–C data (hereby referred to as NeuN+ hypo- and NeuN− hyperacetylated genes, respectively, Fig. 3c, Supplementary Data 5). NeuN+ hypoacetylated genes include CACNG3, whose promoter formed a loop with a peak, which was preferentially active in NeuN+ cells and was hypoacetylated in postmortem AD brain. CACNG3 encodes a voltage-gated calcium channel39 and had significantly higher expression in neurotypical NeuN+ cells compared to NeuN− cells (Fig. 3d). NeuN− hyperacetylated genes include EHD1 (Fig. 3e), a gene involved in endocytic recycling with another AD-associated gene BIN140. EHD1 was both hyperacetylated and highly expressed in neurotypical NeuN− cells compared to NeuN+ cells. Importantly, the promoter of EHD1 formed a loop with the AD hyperacetylated peak that was preferentially active in NeuN− cells. GO analysis demonstrated that NeuN+ hypoacetylated genes included synaptic genes while NeuN− hyperacetylated genes were involved in catalytic activity and glycoprotein binding (Supplementary Data 5). Cellular expression profiles also confirmed these findings, as NeuN+ hypoacetylated genes were highly expressed in neurons, while NeuN− hyperacetylated genes were highly expressed in oligodendrocytes, astrocytes, and endothelial cells (Fig. 3f). These results collectively suggest that in AD brains, many neuronal genes are downregulated due to their hypoacetylation in neurons, whereas many glial genes are upregulated due to their hyperacetylation in glia. To validate this prediction, we tested whether this epigenetic dysregulation in AD is associated with altered gene expression in AD by comparing NeuN− hyperacetylated and NeuN+ hypoacetylated genes with mRNA co-expression modules constructed from AD brains41 (Fig. 3g). Among 11 co-expression modules differently regulated in AD (see Supplementary Table 3 for module definition), NeuN− hyperacetylated genes were enriched in co-expression modules that were upregulated in AD (T-M8 and T-M14)41. These modules were involved in transcriptional regulation and cellular proliferation and were annotated as astrocyte-specific41. In contrast, NeuN+ hypoacetylated genes were enriched in co-expression modules that were downregulated in AD (T-M1 and T-M16)41. Both modules were associated with synaptic transmission. This result demonstrates that cell-type-specific epigenetic dysregulation in AD is coupled with gene expression changes.
Genetic risk factors associated with AD converge onto microglial function
Changes in gene expression or histone modifications may be a consequence or compensatory in disease and not necessarily causal. To link changes in chromatin and gene regulation to causal genetic factors, we next tested for enrichment of common genetic risk factors associated with AD by GWAS42. We reasoned that cell-type-specific annotation of the regulatory impact of genetic risk factors would provide support or refine our knowledge of causal mechanisms underlying AD, which to date heavily implicate neural immune/microglial mechanisms43,44. Therefore, we first performed linkage disequilibrium score regression (LDSC)45 analysis to determine the enrichment of AD-associated common genetic variants in NeuN− and NeuN+ H3K27ac peaks. AD SNP-based heritability was highly enriched in NeuN− enhancers (NeuN+ p = 0.205, NeuN− p = 8.41 × 10−9), suggesting that NeuN− gene regulatory relationships would be the most appropriate for refining biological insight from AD GWAS (Fig. 4a).
We next ran H-MAGMA46 built upon the NeuN− interactome to convert SNP-level association statistics into gene-level association statistics, thereby connecting non-coding variants to their cognate genes. This analysis identified 181 AD risk genes (FDR < 0.05) based upon their potential regulation by associated non-coding variants (“Methods”, Supplementary Data 6), many of which included well-known AD risk genes47. For instance, we found that AD genome-wide significant (GWS) SNPs interact with the promoter of BIN1, whose transcript level is increased in AD brains48 (Fig. 4b). Notably, the BIN1 promoter was connected to multiple enhancers in NeuN−, but not in NeuN+ cells, demonstrating the highly cell-type-specific gene regulatory landscape within the locus. AD risk genes were enriched for amyloid−beta (Aβ) pathways, lipoprotein assembly, and immune processes (Fig. 4c, Supplementary Data 6). Consistent with previous studies16,42,49, these genes were highly expressed in the postnatal brain samples (Fig. 4d) and microglia (Fig. 4e). Notably, they were enriched for genes that are upregulated in microglia from postmortem AD brain50 (Fig. 4f). In line with this, we observed that AD risk genes that we identified here were enriched for a microglial co-expression module upregulated in AD postmortem brains (T-M3, Fig. 4g)41. This module is distinct from the module associated with epigenetic dysregulation in AD (Fig. 3g), in that the former represents microglia, while the latter is enriched in astrocytes. This is consistent with recent single-cell sequencing studies identifying genetic risk enrichment and gene expression changes within microglia in AD, accompanied by significant gene expression changes in other cells, including astrocytes50–52, which is where we observe epigenetic signature enrichment. Since genetic risk resides at the initiation of the causal chain in disease pathogenesis, this differential enrichment of genetic risk factors and epigenetic regulation underscores different expression signatures in AD with distinct cellular specificities and likely causal relationships.
Refined cellular etiology of SCZ and BD
In our previous study, we found that genes associated with psychiatric disorders display substantial molecular convergence within neurons, which was in stark contrast to neurodegenerative disorders like AD46. Indeed, LDSC (“Methods”) confirmed that SCZ and BD displayed strong SNP-based heritability enrichment in NeuN+ cells (Fig. 5a). We next hypothesized that enhancer-gene networks at a more refined cellular resolution (e.g. Glu and GABA neurons) would help elaborate molecular processes associated with these two major psychiatric disorders. Therefore, we assessed SNP-based heritability enrichment for SCZ and BD in Glu- and GABA-specific enhancers18. Notably, both disorders showed strong enrichment of SNP-based heritability in Glu- and GABA-specific enhancers, suggesting that both excitatory and inhibitory neurons may contribute to the genetic etiology of SCZ and BD (Fig. 5b). Since Hi–C data from neuronal subtypes is yet unavailable, we constructed Glu and GABA H-MAGMA maps by integrating Glu- and GABA-enhancers with NeuN+ Hi–C data, respectively (“Methods”), obtaining 753 SCZ risk genes from Glu (hereby referred to as Glu-SCZ genes) and 624 risk genes from GABA-specific regulatory interactions integrated via H-MAGMA (GABA-SCZ genes) (Supplementary Data 7). Glu-SCZ genes were implicated in synaptic organization, ion channel activity, and neuron projection, while GABA-SCZ genes were associated with dendrite, axon, and hormonal response (Supplementary Data 7). We also identified 143 and 101 BD candidate risk genes from Glu (Glu-BD genes) and GABA (GABA-BD genes) H-MAGMA, respectively (Supplementary Data 7). Glu-BD genes were enriched for neurogenesis, cell adhesion molecules, and synapses, while GABA-BD genes were enriched for transcriptional regulation and NMDA receptor activity (Supplementary Data 7). Neuronal subtype expression profiles of SCZ and BD risk genes identified by Glu and GABA H-MAGMA were evaluated using scRNA data obtained from excitatory (Ex1 to Ex8) and inhibitory (In1 to In8) neuronal subtypes8. Neuronal subtype enrichment showed a clear distinction between risk genes identified by Glu and GABA H-MAGMA, uncovering cellular and disease specificities that have not been described before (Fig. 5c). Glu-SCZ genes displayed widespread expression among many Glu neuronal subclasses, with relatively higher expression signatures in L3/4 neurons (Ex2) and L5/6 corticothalamic projection neurons (Ex7). Glu-BD genes showed a much stronger cell-type-specificity, with the highest expression signature in L2/3 cortical projection neurons (Ex1) and L5/6 corticothalamic projection neurons (Ex7-8). In interneurons, GABA-SCZ genes and GABA-BD genes showed similar enrichment for parvalbumin-expressing cells (In6). Given that genetic correlation between SCZ and BD is remarkably high (rg = 0.67)46,53,54, these findings suggest cellular substrates for molecular convergence (Ex7 and In6 are shared between two disorders) and divergence (Ex1 is BD-specific, while Ex2 is SCZ-specific) among two highly genetically correlated disorders.
Discussion
Here we provide a high-resolution map of chromosome conformation from two major brain cell classes, neurons (NeuN+ cells) and glia (NeuN− cells, Supplementary Fig. 1). The genome-wide analysis of chromosome conformation in these two cell classes captures the major known elements of 3D architecture such as compartments, FIREs, and loops, addressing the roles and impact of these different hierarchical units in cell-type-specific gene regulation. We further refined maps of neuronal chromatin architecture by integrating Hi–C data from sorted neurons with H3K27ac peaks identified in two major neuronal subtypes, Glu and GABA neurons, which led to a successful delineation of neuronal subtype-specific regulatory relationships. We found that multiple architectural units that include compartments, FIREs, and chromatin loops display a remarked level of cellular specificity that is tightly coupled with gene regulation. We reasoned that these cell-type-specific gene regulatory networks would provide a window through which to understand the cellular etiology of brain diseases.
We first leveraged these networks to deconvolve epigenetic dysregulation in AD postmortem brain samples into the corresponding cell types. In their original work, Marzi et al.38 reported AD-associated H3K27ac peaks that are either hypo- or hyperacetylated in AD-affected individuals compared to age-matched low-pathology controls38 and linked these peaks to target genes on the basis of linear distance. We and others have shown that enhancers often regulate distal genes4,15,16, so this initial assignment is likely to be underpowered and inaccurate. Furthermore, these peaks were obtained from the brain homogenates, which would mask opposing interactions and obscure the cell-type-specific substrate of these changes. By developing neuronal and glial enhancer–promoter interaction maps, we not only accurately annotate these peaks with respect to their cognate genes, but also within their corresponding cell types. Notably, AD-associated hyperacetylation was enriched for glial enhancers that are associated with upregulation of astrocytic co-expression modules in AD, while AD-associated hypoacetylation was enriched for neuronal enhancers that potentially affect downregulation of a neuronal co-expression module in AD. These results highlight the importance of obtaining cell-type-specific epigenetic landscapes in diseases. This may reflect either the cellular composition changes (expansion of glia and neuronal death) or changes in regulatory landscape and cellular function (glial activation and decreased neuronal activity) in AD, which requires future investigation.
It is of note that while AD-associated hyperacetylation was linked to oligodendrocyte genes and astrocyte co-expression modules, genetic risk factors for AD were mapped onto a different glial cell type, microglia. These results are consistent with a newly emerging literature which indicates that different types of glia may contribute to the disease via different regulatory mechanisms55–57. In linking AD genetic risk factors to their cognate genes, we also recapitulated a previously found association between rs6733839 and BIN1. Nott et al. found that this SNP is causally implicated in the regulation of an AD risk gene, BIN1, in a microglia-specific manner16 as predicted by our analysis.
In contrast to AD in which multiple glial cells are implicated, previous work has indicated neurons as the central cell-type harboring genetic risk for the majority of psychiatric disorders3,22,46,58–61. However, given the strong genetic overlap between BD and SCZ53,62, we do not have much indication about specific biological pathways that are driving one disease versus the other. Here, we reasoned that neuronal subtype-specific gene regulatory networks would help refine the cellular etiology of disease and potentially identify discrete molecular neuropathology across psychiatric disorders. Indeed, when we applied Glu and GABA neuronal enhancer-gene networks to BD and SCZ, we were able to delineate discrete cellular substrates that may contribute to difference between BD (Ex1) and SCZ (Ex5), a distinction that has not been previously recognized. It is also of note that common genetic risk for SCZ and BD also indicated shared pathways coalescing onto parvalbumin-positive interneurons, paralleling what has been among the most robust pathologic findings in SCZ and BD63,64. However, we note that these findings need to be further verified by neuronal subtype-specific Hi–C data (e.g. Glu and GABA Hi–C). Moreover, CRISPR/Cas9-mediated genome engineering can be used to target risk genes’ enhancers to further investigate their roles in a neuronal subtype-specific manner. Our study provides a foundation for such future study by characterizing cell-type-specific 3D chromatin structure in the adult human brain, which we show can be used to improve our understanding of gene regulatory landscape in brain disorders.
Methods
Human tissue collection and nuclei isolation
Prefrontal cortex with no history of neurological disease was procured from tissue collections at the Department of Neuroscience at Yale University School of Medicine. Additional specimens were procured from the Brain and Tissue Bank at the University of Maryland. Tissue was collected after obtaining next of kin consent and with approval by the institutional review boards at the Yale University School of Medicine, the National Institutes of Health, and at each institution from which tissue specimens were obtained. Tissue specimens were handled in accordance with ethical guidelines and regulations for the research use of human brain tissue set forth by the NIH and the WMA Declaration of Helsinki. All available non-identifying information was recorded for each specimen (metadata available in Supplementary Table 1).
Procedures in preparation for flow cytometry (nuclei extraction, NeuN neuronal marker immunotagging using DAPI staining) for frozen never-fixed brain tissue specimens were previously described in detail27,65. Briefly, we dissected and homogenized 300 mg of frozen DLPFC tissue in lysis buffer (0.32 M Sucrose; 5 mM CaCl2; 3 mM Mg(Ace)2; 0.1 mM EDTA, pH8; 10 mM Tris-HCl, pH8; 1 mM DTT, 0.1% Triton X-100). Brain homogenate was fixed in 1% formaldehyde for 10 min prior to NeuN immunotagging66. Nuclei were isolated via ultracentrifugation in sucrose gradient. Isolated nuclei were subjected to NeuN immunotagging using 1:1000 anti-NeuN-488 (Millipore, MAB377X) and 1:1000 DAPI (Thermo Fisher Scientific, 62248) diluted in DPBS with 0.2% BSA. For each Hi–C assay, 2–5 × 106 sorted NeuN+ or NeuN− nuclei were used. For each nuclear RNA-seq assay, ~5 × 104 sorted nuclei were used.
Hi–C library generation and data processing
Sorted cells were fixed in 1% formaldehyde for 10 min. Cross-linked DNA was then digested by HindIII (NEB, R0104). Digested chromatin ends were filled, marked with biotin-14-dCTP (Thermo Fisher, 19518-018), and ligated within the nucleus. DNA was sheared into 300–600-bp fragments (Covaris, M220), and biotin-tagged DNA was pulled down with streptavidin beads (Invitrogen, 65001) and ligated with Illumina paired-end adapters. The resulting Hi–C library was amplified by PCR (KAPA Biosystems HiFi HotStart PCR kit, KK2502), and sequenced by Illumina 50 bp paired-end sequencing.
The resulting Hi–C reads were mapped and filtered using hiclib (v.0.9)67. Filtered reads were binned at 10 kb, 40 kb, and 100 kb resolution to build a genome-wide contact matrix at a given bin size, which was subsequently normalized using Iterative Correction and Eigenvector decomposition. We then used 100 kb resolution matrices for compartment analysis, 40 kb for TAD analysis, and 10 kb for loop detection. Heatmaps of Hi–C contact matrices at 10 kb resolution were plotted using R package pheatmap (v.1.0.12) with scale=“column”.
Comparison across multiple brain-derived Hi–C data
We used HiCRep (v.1.10.0)19 to measure reproducibility across biological replicates of Hi–C libraries using normalized Hi–C contact matrices at 40 kb resolution. HiCRep generates a smoothed contact matrix, and stratifies the matrix by the distance between the interacting regions of chromatin. From this stratified matrix, a SCC is defined, which provides a measure of similarity of genome-wide chromatin contacts across cell types.
HiCRep was also used to compare similarities between brain-derived genome-wide normalized chromatin contacts at 40 kb resolution. We compared Hi–C datasets derived from homogenized adult brain tissue (Adult brain)3, mature non-neuronal cells (NeuN−), mature neuronal cells (NeuN+), postmitotic neuron-enriched cortical plate from the fetal brain (Fetal brain CP)4, progenitor-enriched ventricular zone from the fetal brain (Fetal brain VZ)4, iPSC-derived neurons (iPSC Neuron)15 and iPSC-derived astrocytes (iPSC Astrocytes)15.
Compartment calling
HiCExplorer (v.2.2.1.1)68 was used to call compartments from normalized genome-wide chromatin contact matrices at 100 kb resolution68. Principal component analysis (PCA) was performed using 4 eigenvectors. PC values were selected for each chromosome and correlated with gene density to determine compartments. The correlations between PC values from each Hi–C data were then used to compare similarity in compartments across cell types.
TAD
We conducted TAD-level analysis as described previously4. In brief, we quantified the directionality index by calculating the degree of upstream or downstream (2 Mb) interaction bias of a given bin (40 kb), which was processed by a hidden Markov model (HMM) to remove hidden directionality bias.
FIRE analysis
We used FIREcaller (v.1.10)24 to define FIREs and super-FIREs based on raw Hi–C contact matrices at 40 kb resolution. As FIREcaller defines FIREs based on a specific statistical threshold (p < 0.05 which corresponds to a FIRE score >1.56), NeuN+ or NeuN− specific FIREs defined by FIREcaller although displaying similar FIRE scores between two cell types, may be called significant only in one cell, as only one cell type exceeds the statistical cutoff (e.g. FIRE score = 1.6), while the other fails to reach the statistical cutoff (e.g. FIRE score = 1.5). Therefore, rather than using an all-or-none FIRE definition, we identified significantly differential FIREs based on the difference in FIRE scores between NeuN+ and NeuN−. NeuN+ differential FIREs were identified by obtaining the genomic regions that have NeuN+ FIRE scores greater than qnorm (0.975) and NeuN− FIRE scores lower than qnorm (0.9). NeuN− differential FIREs were defined in the opposite way; NeuN− FIRE scores<qnorm (0.9) & NeuN+ FIRE scores>qnorm (0.95). To link differential and super-FIREs to target genes, we intersected these differential FIREs and super-FIREs with gene promoters (defined as 2 kb upstream and 1kb downstream of TSS).
We also defined common FIREs, which are FIREs detected in both NeuN+ and NeuN− cells, and the difference of FIREs score between two cell types was less than 0.5. Common FIREs were then overlapped with promoters to identify common FIRE-associated genes. Since some of the common FIRE-associated genes also overlapped with NeuN+ and NeuN− FIREs, we removed those potential cell-type-specific genes to obtain common FIRE-associated genes.
Loop calling
Promoter-based interactions were identified as previously described3,4. Briefly, we constructed background interaction profiles from randomly selected length- and GC content-matched regions to promoters. Using these background interaction profiles, we fit interaction frequencies into Weibull distribution at each distance for each chromosome using the fitdistrplus69 package in R. Significance of interaction from each promoter was calculated as the probability of observing higher interaction frequencies under the fitted Weibull distribution, and interactions with FDR < 0.01 were selected as significant promoter-based interactions. We overlapped promoter-based interactions with genomic coordinates of TADs, and found that the majority (~75%) of promoter-based interactions were located within the same TADs. We also applied Fit-Hi-C2 (v.2.0.7)70 with -U 2000000 -L 20000 -r 10000 -p 2 to call loops.
RNA-seq library generation and data processing
Nuclei (50,000 in DPBS) were sorted into 300 µl of Trizol (Fisher; Cat#: 15596018) such that the volume ratio of nuclei:Trizol was 1:3. Nuclei were lysed by pipetting 20 times in the Trizol solution on ice. RNA extraction was performed using the Zymo Directzol Microprep RNA Kit (Zymo; Cat#: R2062). RNA quality was evaluated using an Agilent Bioanalyzer 2100 with an RNA 6000 Pico Kit (Agilent; Cat#5067-1513). The RNA was then converted into cDNA and prepared into RNA-seq libraries using the SmarterStranded kit (Takara; Cat#634843). The libraries were size selected for an average fragment size of 300 bp using SPRI beads (Beckman Coulter Life Sciences; Cat#B23317). Library quality was assessed with a Qubit and Agilent Bioanalyzer 2100 using the Agilent High Sensitivity DNA Kit (Cat#5067-4626).
Once RNA-seq libraries were sequenced, we used FastQC (v.0.11.8)71 to check the quality of RNA-seq reads, and removed adapter with Cutadapt program (v.1.18)72. Next, clean reads were mapped to the human reference genome (Release 19 (GRCh37.p13)) from GENCODE database with HISAT2 (v.2.1.0)73 using default parameters. And, we assembled and quantified transcripts using StringTie (v.1.3.5)74. Differential analysis was done with DESeq2 (v.1.22.2)75. RNA-seq data from Glu and GABA neurons were obtained from a previously published study18.
ChIP-seq data analysis
Differential H3K27ac peaks between NeuN+ and NeuN− cells were obtained from the previously published datasets27. We re-analyzed previously published H3K27ac ChIP-seq data from Glu and GABA neurons18 to define differential peaks between Glu and GABA neurons. We first used FastQC (v.0.11.8)71 to check the quality of ChIP-seq reads18. Next, we used Bowtie2 (v.2.3.4.3)76 with—very-sensitive to align reads to human reference genome build (Release 19 (GRCh37.p13)) from the GENCODE database. We removed duplicate reads using Picard (v.2.20.1, http://broadinstitute.github.io/picard/) MarkDuplicates function. We then called H3K27ac peaks using MACS2 (v.2.1.0.20150731)77 with—broad-cutoff 0.00001. Finally we used DiffBind (v.2.13.1)78 to analyze differentially binding regions between Glu and GABA ChIP-seq data.
Motif analysis
To identify TFs that are involved in cell-type-specific distal regulation, we first extracted differentially accessible chromatin peaks by combining differential H3K27ac27 and ATAC-seq peaks79 that are brought to the promoters via chromatin loops. We then performed differential motif analyses on these cell-type-specific distal regulatory peaks using GimmeMotifs (gimme maelstrom) (v0.14.1)30 with the default settings.
Gene Ontology Enrichment analysis
We used gProfileR2 (v.0.1.9)80 to identify GO terms that were overrepresented in particular gene sets such as genes assigned to common FIREs, differential FIREs, and super-FIREs. We set the arguments as following:
gost(input,organism = “hsapiens”,ordered_query = F,significant = T,evcodes = TRUE, user_threshold = 0.05, correction_method = “fdr”,sources = c(“GO”))$result
Top GO terms for super-FIRE-associated genes were highly redundant, which hindered the identification of biological pathways implicated for super-FIREs. We therefore applied REVIGO with default arguments (http://revigo.irb.hr/)81 to remove redundant GO terms.
To analyze functional enrichment of H-MAGMA genes, we used a previously reported rank-based approach46. This approach is a threshold-free method, because it does not require a specific FDR cutoff. Instead, biological pathways enriched in the top ranked genes are queried. Briefly, gProfileR (v.0.7.0)80 was used to perform GO analyses for ranked gene lists from H-MAGMA (v.1.08)82 for AD, SCZ, and BD GWAS:
gprofiler(input,organism = “hsapiens”, ordered_query = T, significant = T, max_p_value = 0.05, min_set_size = 15, max_set_size = 600, min_isect_size = 5, custom_bg = background gene set, correction_method = “fdr”, hier_filtering = “moderate”, include_graph = T, src_filter =“GO”)
custom_bg: H-MAGMA output genes overlapping with MCH regions were filtered (background gene set);
input: background gene set was ordered based on p value (Ranked gene list)
We used gProfileR to perform GO analysis for NeuN+ hypoacetylated and NeuN− hyperacetylated genes:
gprofiler(input,organism=“hsapiens”, ordered_query=F, significant=T, max_p_value=0.05, min_set_size=15, min_isect_size=5, correction_method=“fdr”, hier_filtering=“moderate”, include_graph=T, src_filter=“GO”).
Cellular expression profile analysis
To quantify the significance of cellular expression of the genes assigned to cell-type-specific chromatin architecture (e.g., differential FIREs and loops), we used EWCE (v.1.3.0)83. In addition, cellular expression profiles of the disease risk genes were interrogated by plotting centered expression values for each cell type using the scRNA-seq data as described before10,46. Each neuronal subtype definition is provided in Supplementary Table 2.
Module enrichment analysis
Developmental and synaptic modules were obtained from Parikshak et al.34 and Lips et al.35, respectively. We employed Fisher’s exact test to compare developmental and synaptic modules with genes engaged in NeuN+ and NeuN− enhancer–promoter interactions. AD-associated co-expression networks were obtained from Seyfried et al.41 (Supplementary Table 3).
AD co-expression modules were compared with (1) genes that were linked to hyper and hypoacetylated genes in AD and (2) AD-associated genes identified by H-MAGMA46 using Fisher’s exact test.
Linking AD-associated epigenetic dysregulation to cognate genes
We downloaded AD-associated hyper- and hypoacetylated H3K27ac peaks from Marzi et al.38. Because these peaks were obtained from the brain homogenate that lacks cellular resolution, we overlapped them with NeuN+ and NeuN− differential peaks. Since we found that hyperacetylated peaks in AD significantly overlapped with NeuN− differential peaks, we used NeuN− loops to assign them to the target genes. On the other hand, hypoacetylated peaks in AD overlapped with NeuN+ differential peaks, so they were annotated to the target genes by NeuN+ loops.
GWAS data
We downloaded the following GWAS summary datasets: BD (n = 20,352 cases; 31,538 controls)84, SCZ (n = 11,260 cases; 24,542 controls)1, and AD (n = 71,880 cases; 383,378 controls)49.
LD score regression analysis
We implemented the LDSC software45 (v.1.0.0) to estimate the enrichment of SNP-based heritability for brain disorder GWAS in differential H3K27ac peaks between (1) NeuN+ and NeuN− cells and (2) Glu and GABA neurons. Genetic variants were annotated to differential H3K27ac peaks, and SNP-based heritability statistics were calculated using the GWAS summary statistics mentioned above. Enrichment statistics was calculated as the proportion of SNP-based heritability divided by the proportion of SNPs annotated to differential H3K27ac peaks.
H-MAGMA input file generation
We generated H-MAGMA (v.1.08) input files that provide SNP-gene relationships based on chromatin interaction profiles from NeuN+ and NeuN− cells. Exonic and promoter SNPs were directly assigned to their target genes based on their genomic location. Intronic and intergenic SNPs were assigned to their cognate genes based on chromatin interactions with promoters and exons as previously described46. To provide SNP-gene relationships at a neuronal subtype resolution, we also generated Glu and GABA H-MAGMA input files. For example, we obtained SNPs that map onto Glu H3K27ac peaks, then SNPs active in Glu neurons were assigned to their genes by either genomic coordinates (when located in promoters and exons) or distal interactions (non-coding SNPs were mapped to their cognate genes via NeuN+ chromatin interactions). We used the same framework to obtain GABA SNP-gene relationships. Input files can be found in the github repository: https://github.com/thewonlab/NeuN.
H-MAGMA
Based on the SNP-based heritability enrichment result, we used NeuN− H-MAGMA to infer AD risk genes and Glu/GABA H-MAGMA to infer SCZ and BD risk genes.
We set the arguments as following: magma_v1.08/magma—bfile g1000_eur –pval <GWAS summary statistics> use = rsid,p ncol = N—gene-annot <MAGMA input annotation file>—out <output file>.
g1000_eur denotes the reference data file for European ancestry population.
Python and R environments
We used Python v.2.7 and R v.3.6.0.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We thank members of the Won lab for helpful discussions and comments about this paper. We also thank Sergio Espeso Gil for transferring RNA-seq datasets from NeuN+ and NeuN− cells. This research was supported by the National Institute of Health (NIMH: P50MH106438, R01MH094714, U01MH103339, R01MH110927, R01MH100027, D.H.G.; R00MH113823, DP2MH122403, R21DA051921, R01AG066871, H.W.; U01MH103392, S.A.; R21MH103877, U01MH122590, R01DA044940, S.D.), the Veterans Administration Biomedical Laboratory Research & Development (I01BX002876, S.D.), a NARSAD Young Investigator Award from the Brain and Behavior Research Foundation (H.W.), a SPARK grant from the Simons Foundation Autism Research Initiative (H.W.), and Helen Lyng White Fellowship (W.M.). We also acknowledge the support from the Data Science Core at the UNC IDDRC (P50HD103573). The DLPFC tissue used in this research was obtained from the Human Brain Collection Core, Intramural Research Program, NIMH (http://www.nimh.nih.gov/hbcc). We thank Drs. Stefano Marenco, Barbara Lipska, and Pavan Auluck for their assistance in tissue procurement.
Source data
Author contributions
The design of the NeuN+ and NeuN− Hi–C comparison was developed as part of a reference brain map for the PsychENCODE consortium by D.H.G., S.A., and N.S. Tissue dissection and dissemination has been conducted by S.P. and N.S. Cells were sorted from the adult DLPFC to neurons (NeuN+) and glia (NeuN−) by R.B.P., B.K., and S.A. The subsequent integrative analysis was designed by H.W. and D.H.G. B.H. performed overall data analyses. K.S. ran HiCRep analyses. S.A. contributed H3K27ac and RNA-seq data from NeuN+ and NeuN− cells. A.K. and S.D. contributed H3K27ac data from glutamatergic and GABAergic neurons. C.A.C. and Y.L. helped FIRE analyses. B.H. and W.M. generated and edited figures. B.H. and H.W. co-wrote the first draft of the manuscript, which was subsequently revised by D.H.G., S.A., and S.D.
Data availability
Hi–C data described in this manuscript is available through the PsychENCODE Knowledge Portal with the accession code of syn21760712. H3K27ac, ATAC-seq, and RNA-seq data from NeuN+ and NeuN− cells are available through syn4566010, GSE83345, and syn20545534, respectively. H3K27ac ChIP-seq data from Glu and GABA neurons are available through syn12034263. Human reference genome and gene definition was obtained from GENCODE and UCSC genome browser [http://genome.ucsc.edu]. All other relevant data supporting the key findings of this study are available within the article and its Supplementary Information files or from the corresponding authors upon reasonable request. Source data are provided with this paper. A reporting summary for this Article is available as a Supplementary Information file. Source data are provided with this paper.
Code availability
All custom code used in this work is available in the following github repository: https://github.com/thewonlab/NeuN85.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks the anonymous reviewer(s) for their contribution to the peer review of this work.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Benxia Hu, Hyejung Won.
A list of authors and their affiliations appears at the end of the paper.
Contributor Information
Hyejung Won, Email: hyejung_won@med.unc.edu.
Daniel H. Geschwind, Email: dhg@mednet.ucla.edu
The PsychENCODE Consortium:
Allison E. Ashley-Koch, Gregory E. Crawford, Melanie E. Garrett, Lingyun Song, Alexias Safi, Graham D. Johnson, Gregory A. Wray, Timothy E. Reddy, Fernando S. Goes, Peter Zandi, Julien Bryois, Andrew E. Jaffe, Amanda J. Price, Nikolay A. Ivanov, Leonardo Collado-Torres, Thomas M. Hyde, Emily E. Burke, Joel E. Kleiman, Ran Tao, Joo Heon Shin, Kiran Girdhar, Yan Jiang, Marija Kundakovic, Leanne Brown, Jennifer R. Wiseman, Elizabeth Zharovsky, Rivka Jacobov, Olivia Devillers, Elie Flatow, Gabriel E. Hoffman, Judson Belmont, Diane Del Valle, Nancy Francoeur, Evi Hadjimichael, Dalila Pinto, Harm van Bakel, Panos Roussos, John F. Fullard, Jaroslav Bendl, Mads E. Hauberg, Alexander W. Charney, Vahram Haroutunian, Barbara K. Lipska, David A. Lewis, Chang-Gyu Hahn, Lara M. Mangravite, Mette A. Peters, Yooree Chae, Junmin Peng, Mingming Niu, Xusheng Wang, Maree J. Webster, Thomas G. Beach, Chao Chen, Yi Jiang, Rujia Dai, Yongjun Wang, Yan Xia, Annie W. Shieh, Chunyu Liu, Kay S. Grennan, Ramu Vadukapuram, Gina Giase, Dominic Fitzgerald, Lijun Cheng, Miguel Brown, Mimi Brown, Tonya Brunetti, Thomas Goodman, Majd Alsayed, Kevin P. White, Mohana Ray, Damon Polioudakis, Brie Wamsley, Jiani Yin, Luis De La Torre Ubieta, Michael J. Gandal, Vivek Swarup, Stephan J. Sanders, Matthew W. State, Donna M. Werling, Joon-Yong An, Brooke Sheppard, A. Jeremy Willsey, Amira Kefi, Eugenio Mattei, Michael Purcaro, Zhiping Weng, Jill Moore, Henry Pratt, Jack Huey, Tyler Borrman, Patrick F. Sullivan, Paola Giusti-Rodriguez, Yunjung Kim, Jin Szatkiewicz, Suhn Kyong Rhie, Christoper Armoskus, Adrian Camarena, Peggy J. Farnham, Valeria N. Spitsyna, Heather Witt, Shannon Schreiner, Oleg V. Evgrafov, James A. Knowles, Mark Gerstein, Shuang Liu, Fabio C. P. Navarro, Jonathan Warrell, Declan Clarke, Prashant S. Emani, Mengting Gu, Xu Shi, Min Xu, Yucheng T. Yang, Robert R. Kitchen, Gamze Gürsoy, Jing Zhang, Becky C. Carlyle, Angus C. Nairn, Mingfeng Li, Mario Skarica, Zhen Li, Andre M. M. Sousa, Gabriel Santpere, Jinmyung Choi, Ying Zhu, Tianliuyun Gao, Daniel J. Miller, Adriana Cherskov, Mo Yang, Anahita Amiri, Gianfilippo Coppola, Jessica Mariani, Soraya Scuderi, Anna Szekely, Flora M. Vaccarino, Feinan Wu, Sherman Weissman, Daifeng Wang, Tanmoy Roychowdhury, and Alexej Abyzov
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-021-24243-0.
References
- 1.Pardiñas AF, et al. Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection. Nat. Genet. 2018;50:381–389. doi: 10.1038/s41588-018-0059-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Rada-Iglesias A. Genetic variation within transcriptional regulatory elements and its implications for human disease. Biol. Chem. 2014;395:1453–1460. doi: 10.1515/hsz-2014-0109. [DOI] [PubMed] [Google Scholar]
- 3.Wang, D. et al. Comprehensive functional genomic resource and integrative model for the human brain. Science362, eaat8464 (2018). [DOI] [PMC free article] [PubMed]
- 4.Won H, et al. Chromosome conformation elucidates regulatory relationships in developing human brain. Nature. 2016;538:523–527. doi: 10.1038/nature19847. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.de la Torre-Ubieta, L. et al. The dynamic landscape of open chromatin during human cortical neurogenesis. Cell172, 289–304 (2018). [DOI] [PMC free article] [PubMed]
- 6.Schmitt AD, et al. A compendium of chromatin contact maps reveals spatially active regions in the human genome. Cell Rep. 2016;17:2042–2059. doi: 10.1016/j.celrep.2016.10.061. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Jung I, et al. A compendium of promoter-centered long-range chromatin interactions in the human genome. Nat. Genet. 2019;51:1442–1449. doi: 10.1038/s41588-019-0494-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lake BB, et al. Neuronal subtypes and diversity revealed by single-nucleus RNA sequencing of the human brain. Science. 2016;352:1586–1590. doi: 10.1126/science.aaf1204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Habib N, et al. Massively parallel single-nucleus RNA-seq with DroNc-seq. Nat. Methods. 2017;14:955–958. doi: 10.1038/nmeth.4407. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Darmanis S, et al. A survey of human brain transcriptome diversity at the single cell level. Proc. Natl Acad. Sci. USA. 2015;112:7285–7290. doi: 10.1073/pnas.1507125112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lake BB, et al. Integrative single-cell analysis of transcriptional and epigenetic states in the human adult brain. Nat. Biotechnol. 2018;36:70–80. doi: 10.1038/nbt.4038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Cusanovich DA, et al. A single-cell atlas of in vivo mammalian chromatin accessibility. Cell. 2018;174:1309–1324.e18. doi: 10.1016/j.cell.2018.06.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ziffra, R. S. et al. Single cell epigenomic atlas of the developing human brain and organoids. Preprint at bioRxiv10.1101/2019.12.30.891549 (2020).
- 14.Trevino, A. E. et al. Chromatin accessibility dynamics in a model of human forebrain development. Science367, eaay1645 (2020). [DOI] [PMC free article] [PubMed]
- 15.Rajarajan, P. et al. Neuron-specific signatures in the chromosomal connectome associated with schizophrenia risk. Science362, eaat4311 (2018). [DOI] [PMC free article] [PubMed]
- 16.Nott A, et al. Brain cell type–specific enhancer–promoter interactome maps and disease-risk association. Science. 2019;366:1134–1139. doi: 10.1126/science.aay0793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Matevossian, A. & Akbarian, S. Neuronal nuclei isolation from human postmortem brain tissue. J. Vis. Exp. 10.3791/914 (2008). [DOI] [PMC free article] [PubMed]
- 18.Kozlenkov A, et al. A unique role for DNA (hydroxy) methylation in epigenetic regulation of human inhibitory neurons. Sci. Adv. 2018;4:eaau6190. doi: 10.1126/sciadv.aau6190. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yang T, et al. HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient. Genome Res. 2017;27:1939–1949. doi: 10.1101/gr.220640.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Li M, et al. Integrative functional genomic analysis of human brain development and neuropsychiatric risks. Science. 2018;362:eaat7615. doi: 10.1126/science.aat7615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Halene TB, et al. NeuN+ neuronal nuclei in non-human primate prefrontal cortex and subcortical white matter after clozapine exposure. Schizophr. Res. 2016;170:235–244. doi: 10.1016/j.schres.2015.12.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Polioudakis D, et al. A single-cell transcriptomic atlas of human neocortical development during mid-gestation. Neuron. 2019;103:785–801.e8. doi: 10.1016/j.neuron.2019.06.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Dixon JR, et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485:376–380. doi: 10.1038/nature11082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Crowley C, et al. FIREcaller: detecting frequently interacting regions from Hi-C data. Comput. Struct. Biotechnol. J. 2021;19:355–362. doi: 10.1016/j.csbj.2020.12.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Halvorsen M, et al. Increased burden of ultra-rare structural variants localizing to boundaries of topologically associated domains in schizophrenia. Nat. Commun. 2020;11:1–13. doi: 10.1038/s41467-020-15707-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Gorkin DU, et al. Common DNA sequence variation influences 3-dimensional conformation of the human genome. Genome Biol. 2019;20:255. doi: 10.1186/s13059-019-1855-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Girdhar K, et al. Cell-specific histone modification maps in the human frontal lobe link schizophrenia risk to the neuronal epigenome. Nat. Neurosci. 2018;21:1126–1136. doi: 10.1038/s41593-018-0187-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kozlenkov A, et al. Evolution of regulatory signatures in primate cortical neurons at cell-type resolution. Proc. Natl Acad. Sci. USA. 2020;117:28422–28432. doi: 10.1073/pnas.2011884117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Consortium, Roadmap Epigenomics. et al. Integrative analysis of 111 reference human epigenomes. Nature. 2015;518:317–330. doi: 10.1038/nature14248. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.van Heeringen SJ, Veenstra GJC. GimmeMotifs: a de novo motif prediction pipeline for ChIP-sequencing experiments. Bioinformatics. 2011;27:270–271. doi: 10.1093/bioinformatics/btq636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Tamura T, Yanai H, Savitsky D, Taniguchi T. The IRF family transcription factors in immunity and Oncogenesis. Annu. Rev. Immunol. 2008;26:535–584. doi: 10.1146/annurev.immunol.26.021607.090400. [DOI] [PubMed] [Google Scholar]
- 32.Tu JC, et al. Homer binds a novel proline-rich motif and links group 1 metabotropic glutamate receptors with IP3 receptors. Neuron. 1998;21:717–726. doi: 10.1016/s0896-6273(00)80589-9. [DOI] [PubMed] [Google Scholar]
- 33.Kuhlbrodt K, Herbarth B, Sock E, Hermans-Borgmeyer I, Wegner M. Sox10, a novel transcriptional modulator in glial cells. J. Neurosci. 1998;18:237–250. doi: 10.1523/JNEUROSCI.18-01-00237.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Parikshak NNN, et al. Integrative functional genomic analyses implicate specific molecular pathways and circuits in autism. Cell. 2013;155:1008–1021. doi: 10.1016/j.cell.2013.10.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Lips ES, et al. Functional gene group analysis identifies synaptic gene groups as risk factor for schizophrenia. Mol. Psychiatry. 2012;17:996–1006. doi: 10.1038/mp.2011.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Szpirer C, et al. The genes encoding the glutamate receptor subunits KA1 and KA2 (GRIK4 and GRIK5) are located on separate chromosomes in human, mouse, and rat. Proc. Natl Acad. Sci. USA. 1994;91:11849–11853. doi: 10.1073/pnas.91.25.11849. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Erlander MG, Tillakaratne NJ, Feldblum S, Patel N, Tobin AJ. Two genes encode distinct glutamate decarboxylases. Neuron. 1991;7:91–100. doi: 10.1016/0896-6273(91)90077-d. [DOI] [PubMed] [Google Scholar]
- 38.Marzi SJ, et al. A histone acetylome-wide association study of Alzheimer’s disease identifies disease-associated H3K27ac differences in the entorhinal cortex. Nat. Neurosci. 2018;21:1618–1627. doi: 10.1038/s41593-018-0253-7. [DOI] [PubMed] [Google Scholar]
- 39.Black JL, 3rd, Lennon VA. Identification and cloning of putative human neuronal voltage-gated calcium channel gamma-2 and gamma-3 subunits: neurologic implications. Mayo Clin. Proc. 1999;74:357–361. doi: 10.4065/74.4.357. [DOI] [PubMed] [Google Scholar]
- 40.Pant S, et al. AMPH-1/Amphiphysin/Bin1 functions with RME-1/Ehd1 in endocytic recycling. Nat. Cell Biol. 2009;11:1399–1410. doi: 10.1038/ncb1986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Seyfried NT, et al. A multi-network approach identifies protein-specific co-expression in asymptomatic and symptomatic Alzheimer’s disease. Cell Syst. 2017;4:60–72.e4. doi: 10.1016/j.cels.2016.11.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Lambert JC, et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nat. Genet. 2013;45:1452–1458. doi: 10.1038/ng.2802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Lee CYD, Landreth GE. The role of microglia in amyloid clearance from the AD brain. J. Neural Transm. 2010;117:949–960. doi: 10.1007/s00702-010-0433-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Novikova, G. et al. Integration of Alzheimer’s disease genetics and myeloid genomics identifies disease risk regulatory elements and genes. Nat. Commun.12, 1610 (2021). [DOI] [PMC free article] [PubMed]
- 45.Finucane HK, et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nat. Genet. 2015;47:1228–1235. doi: 10.1038/ng.3404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Sey, N. Y. A. et al. A computational tool (H-MAGMA) for improved prediction of brain-disorder risk genes by incorporating brain chromatin interaction profiles. Nat. Neurosci. 10.1038/s41593-020-0603-0 (2020). [DOI] [PMC free article] [PubMed]
- 47.Matoba, N., Quiroga, I. Y., Phanstiel, D. H. & Won, H. Mapping Alzheimer’s disease variants to their target genes using computational analysis of chromatin configuration. J. Vis. Exp. 10.3791/60428 (2020). [DOI] [PubMed]
- 48.Chapuis J, et al. Increased expression of BIN1 mediates Alzheimer genetic risk by modulating tau pathology. Mol. Psychiatry. 2013;18:1225–1234. doi: 10.1038/mp.2013.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Jansen IE, et al. Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nat. Genet. 2019;51:404–413. doi: 10.1038/s41588-018-0311-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Mathys H, et al. Single-cell transcriptomic analysis of Alzheimer’s disease. Nature. 2019;570:332–337. doi: 10.1038/s41586-019-1195-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Grubman A, et al. A single-cell atlas of entorhinal cortex from individuals with Alzheimer’s disease reveals cell-type-specific gene expression regulation. Nat. Neurosci. 2019;22:2087–2097. doi: 10.1038/s41593-019-0539-4. [DOI] [PubMed] [Google Scholar]
- 52.Sala Frigerio C, et al. The major risk factors for Alzheimer’s disease: age, sex, and genes modulate the microglia response to Aβ plaques. Cell Rep. 2019;27:1293–1306.e6. doi: 10.1016/j.celrep.2019.03.099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Brainstorm Consortium, The Brainstorm et al. Analysis of shared heritability in common disorders of the brain. Science360, eaap8757 (2018). [DOI] [PMC free article] [PubMed]
- 54.Ruderfer DM, et al. Polygenic dissection of diagnosis and clinical dimensions of bipolar disorder and schizophrenia. Mol. Psychiatry. 2014;19:1017–1024. doi: 10.1038/mp.2013.138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Garwood CJ, Pooler AM, Atherton J, Hanger DP, Noble W. Astrocytes are important mediators of Aβ-induced neurotoxicity and tau phosphorylation in primary culture. Cell Death Dis. 2011;2:e167. doi: 10.1038/cddis.2011.50. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Liddelow SA, et al. Neurotoxic reactive astrocytes are induced by activated microglia. Nature. 2017;541:481–487. doi: 10.1038/nature21029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Zuchero JB, Barres BA. Glia in mammalian development and disease. Development. 2015;142:3805–3809. doi: 10.1242/dev.129304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Satterstrom, F. K. et al. Large-scale exome sequencing study implicates both developmental and functional changes in the neurobiology of autism. Cell.10.1016/j.cell.2019.12.036 (2020). [DOI] [PMC free article] [PubMed]
- 59.Gandal MJ, et al. Transcriptome-wide isoform-level dysregulation in ASD, schizophrenia, and bipolar disorder. Science. 2018;362:eaat8127. doi: 10.1126/science.aat8127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Ruzzo EK, et al. Inherited and de novo genetic risk for autism impacts shared networks. Cell. 2019;178:850–866.e26. doi: 10.1016/j.cell.2019.07.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Skene NG, et al. Genetic identification of brain cell types underlying schizophrenia. Nat. Genet. 2018;50:825–833. doi: 10.1038/s41588-018-0129-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Cross-Disorder Group of the Psychiatric Genomics Consortium. Genomic relationships, novel loci, and pleiotropic mechanisms across eight psychiatric disorders. Cell. 2019;179:1469–1482.e11. doi: 10.1016/j.cell.2019.11.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Enwright Iii JF, et al. Transcriptome alterations of prefrontal cortical parvalbumin neurons in schizophrenia. Mol. Psychiatry. 2018;23:1606–1613. doi: 10.1038/mp.2017.216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Mellios N, et al. Molecular determinants of dysregulated GABAergic gene expression in the prefrontal cortex of subjects with schizophrenia. Biol. Psychiatry. 2009;65:1006–1014. doi: 10.1016/j.biopsych.2008.11.019. [DOI] [PubMed] [Google Scholar]
- 65.Kundakovic M, et al. Practical guidelines for high-resolution epigenomic profiling of nucleosomal histones in postmortem human brain tissue. Biol. Psychiatry. 2017;81:162–170. doi: 10.1016/j.biopsych.2016.03.1048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Espeso-Gil S, et al. A chromosomal connectome for psychiatric and metabolic risk variants in adult dopaminergic neurons. Genome Med. 2020;12:19. doi: 10.1186/s13073-020-0715-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Imakaev M, et al. Iterative correction of Hi-C data reveals hallmarks of chromosome organization. Nat. Methods. 2012;9:999–1003. doi: 10.1038/nmeth.2148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Ramírez, F. et al. High-resolution TADs reveal DNA sequences underlying genome organization in flies. Nat. Commun.9, 189 (2018). [DOI] [PMC free article] [PubMed]
- 69.Delignette-Muller ML, et al. fitdistrplus: an R package for fitting distributions. J. Stat. Softw. 2015;64:1–34. [Google Scholar]
- 70.Kaul A, Bhattacharyya S, Ay F. Identifying statistically significant chromatin contacts from Hi-C data with FitHiC2. Nat. Protoc. 2020;15:991–1012. doi: 10.1038/s41596-019-0273-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Andrews, S. et al. FastQC: a quality control tool for high throughput sequence data (2010).
- 72.Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 2011;17:10–12. [Google Scholar]
- 73.Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 2019;37:907–915. doi: 10.1038/s41587-019-0201-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Pertea M, et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 2015;33:290–295. doi: 10.1038/nbt.3122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat. Methods. 2012;9:357–359. doi: 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Zhang Y, et al. Model-based analysis of ChIP-Seq (MACS) Genome Biol. 2008;9:R137. doi: 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Stark R, et al. DiffBind: differential binding analysis of ChIP-Seq peak data. R. Package Version. 2011;100:4–3. [Google Scholar]
- 79.Fullard JF, et al. Open chromatin profiling of human postmortem brain infers functional roles for non-coding schizophrenia loci. Hum. Mol. Genet. 2017;26:1942–1951. doi: 10.1093/hmg/ddx103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Raudvere U, et al. g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update) Nucleic Acids Res. 2019;47:W191–W198. doi: 10.1093/nar/gkz369. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Supek F, Bošnjak M, Škunca N, Šmuc T. REVIGO summarizes and visualizes long lists of gene ontology terms. PLoS ONE. 2011;6:e21800. doi: 10.1371/journal.pone.0021800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.de Leeuw, C., Sey, N. Y. A., Posthuma, D. & Won, H. A response to Yurko et al: H-MAGMA, inheriting a shaky statistical foundation, yields excess false positives. 10.1101/2020.09.25.310722 (2020).
- 83.Skene NG, Grant SGN. Identification of vulnerable cell types in major brain disorders using single cell transcriptomes and expression weighted cell type enrichment. Front. Neurosci. 2016;10:16. doi: 10.3389/fnins.2016.00016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Stahl EA, et al. Genome-wide association study identifies 30 loci associated with bipolar disorder. Nat. Genet. 2019;51:793–803. doi: 10.1038/s41588-019-0397-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Hu, B. & hyejungwon. thewonlab/NeuN v1.0.0. 10.5281/zenodo.4552591 (2021).
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Hi–C data described in this manuscript is available through the PsychENCODE Knowledge Portal with the accession code of syn21760712. H3K27ac, ATAC-seq, and RNA-seq data from NeuN+ and NeuN− cells are available through syn4566010, GSE83345, and syn20545534, respectively. H3K27ac ChIP-seq data from Glu and GABA neurons are available through syn12034263. Human reference genome and gene definition was obtained from GENCODE and UCSC genome browser [http://genome.ucsc.edu]. All other relevant data supporting the key findings of this study are available within the article and its Supplementary Information files or from the corresponding authors upon reasonable request. Source data are provided with this paper. A reporting summary for this Article is available as a Supplementary Information file. Source data are provided with this paper.
All custom code used in this work is available in the following github repository: https://github.com/thewonlab/NeuN85.