Abstract
Human nervous system development is an intricate and protracted process that requires precise spatio-temporal transcriptional regulation. Here we generated tissue-level and single-cell transcriptomic data from up to sixteen brain regions covering prenatal and postnatal rhesus macaque development. Integrative analysis with complementary human data revealed that global intra-species (ontogenetic) and inter-species (phylogenetic) regional transcriptomic differences exhibit concerted cup-shaped patterns, with a late fetal-to-infancy (perinatal) convergence. Prenatal neocortical transcriptomic patterns revealed transient topographic gradients, whereas postnatal patterns largely reflected functional hierarchy. Genes exhibiting heterotopic and heterochronic divergence included those transiently enriched in the prenatal prefrontal cortex or linked to autism spectrum disorder and schizophrenia. Our findings shed light on transcriptomic programs underlying the evolution of human brain development and the pathogenesis of neuropsychiatric disorders.
One Sentence Summary:
Spatio-temporal human and macaque brain transcriptomes display concerted ontogenetic and phylogenetic cup-shaped divergence patterns.
Introduction
The development of the human nervous system is an intricate process that unfolds over a prolonged time course, ranging from years to decades, depending on the region (1–6). Precise spatial and temporal regulation of gene expression is crucial for all aspects of human nervous system development, evolution, and function (6–13). Consequently, alterations in this process have been linked to psychiatric and neurological disorders, some of which may exhibit primate or human-specific manifestations (11, 14–18). However, our ability to explain many aspects of human nervous system development and disorders at a mechanistic level has been limited by our evolutionary distance from genetically tractable model organisms, such as mouse (15, 16, 19–22), and by a lack of contextual and functional interpretations of polymorphisms and disease-associated variations in the human and non-human primate (NHP) genomes (11, 17, 21, 23). Moreover, neither the extent of molecular changes underlying human-specific differences nor the specific developmental programs affected by these changes have been thoroughly studied.
The rhesus macaque (Macaca mulatta) is the most widely studied NHP in neuroscience and medicine (24–26). The macaque nervous system parallels the human nervous system with its complex cellular architecture and extended development, providing a unique opportunity to study features of neurodevelopment that are shared and divergent between the two closely related primates. Furthermore, studies of postmortem NHP tissues provide a unique opportunity to validate results obtained using postmortem human tissue, especially those from critical developmental periods that can be confounded by ante-mortem and post-mortem factors and tissue quality. Finally, significant advances in transgenic and genome editing technologies now allow the possibility to create more precise genetic models for human disorders in macaque (24–26). This will facilitate the interrogation of the effects of specific gene mutations in a model that is closer to the human brain than any other experimental animal.
Comparative transcriptomic profiling offers unbiased insight into conserved and clade- or species-specific molecular programs underlying cellular and functional development of the human nervous system (27–31). However, a systematic characterization of the spatial and temporal transcriptomic landscapes of the macaque brain at the region-specific and single-cell levels, as well as the identification of shared and divergence features between human and macaque, are lacking. Such data and analyses presented here should provide both retrospective and prospective benefits to the fields of neuroscience, evolutionary biology, genomics, and medicine.
Study design and data generation
Here, we describe the generation and integrated analysis of RNA sequencing (RNA-Seq) data obtained using bulk tissue (366 samples from 26 prenatal and postnatal brains) or single cells/nuclei (113,274 cells/nuclei from 2 fetal and 3 adult brains) from post-mortem rhesus macaque specimens. Both tissue and single cell/nucleus datasets were subjected to multiple quality control measures (figs. S1–6; tables S1–2; Methods (32)). Tissue-level samples covered the entire span of both prenatal and postnatal neurodevelopment (Fig. 1A and B; table S1) and included 11 areas of the cerebral neocortex (NCX), hippocampus (HIP), amygdala (AMY), striatum (STR), mediodorsal nucleus of thalamus (MD), and cerebellar cortex (CBC). Subject ages ranged from 60 postconception days (PCD) to 11 postnatal years (PY) and were matched by age and brain region to 36 human brains from an accompanying study (33) and 5 adult chimpanzee brains from a previous study (34) (Fig. 1A). To investigate the contribution of different factors to the global transcriptome dynamics, we applied unsupervised clustering and principal component analysis, which revealed that age, species, and regions contribute more to the global transcriptomic differences than other tested variables (figs. S3 and S4).
Fig. 1. Conserved and divergent transcriptomic features of human and macaque neurodevelopmental processes.
(A) Plot depicting the real age (x axis) and the age predicted by TranscriptomeAge (y axis) of human (red), chimpanzee (blue), and macaque (green). Macaque (164 PCD) and human (266 PCD) births are labeled with green or red dashed line, respectively. (B) Schematic showing the human developmental periods, as described in Kang et al., 2011 (29) and the matched macaque developmental and chimpanzee adult data sets. Each line corresponds to one macaque or one chimpanzee specimen and the corresponding predicted age when compared to human neurodevelopment. PCD – post-conception day; PY – postnatal year. (C) The weight (W) of five transcriptomic signatures in the developing human and macaque NCX and the respective association with neurodevelopmental processes. In signature 1 (neurogenesis), the arrow indicates the point at which the signature reaches the minimum in human (red) and macaque (green). The asterisk indicates the extension of the early fetal period that is also observed in (B), in which early fetal macaques (E60) cluster with mid-fetal humans. In transcription signatures 2, 3, 4, and 5, arrows indicate the point at which the signatures reach the maximum in human (red) or macaque (green). Note that for transcriptomic signatures 2 and 3 (neuronal differentiation and astrogliogenesis) there is a synchrony between human and macaque, whereas for transcriptomic signature 4 and 5 (synaptogenesis and myelination), there is heterochrony between the species, with acceleration in human synaptogenesis and delay in human myelination. Prefrontal cortical areas are plotted in red, primary motor cortex in orange, parietal areas in green, temporal areas in blue, and primary visual cortex in gray. MFC – medial prefrontal cortex; OFC – orbital prefrontal cortex; DFC – dorsolateral prefrontal cortex; VFC – ventrolateral prefrontal cortex; M1C – primary motor cortex; S1C – primary somatosensory cortex; IPC – inferior posterior parietal cortex; A1C – primary auditory cortex; STC – superior temporal cortex; ITC – inferior temporal cortex; V1C – primary visual cortex. (D) Cell type-enrichment is shown for each signature. P values adjusted by Benjamini–Hochberg procedure were plotted (size of dots) and significance was labeled by color (True: red and False: gray). H – human; M – macaque; eNPC – embryonic neuroepithelial progenitor; eIPC – embryonic intermediate progenitor cell; eNasN – embryonic nascent neuron; ExN – excitatory neuron; InN – interneuron; Astro – astrocyte; OPC – oligodendrocyte progenitor cell; Oligo – oligodendrocyte; Endo – endothelial cell; VSMC – vascular smooth muscle cell.
To explore cell-type origins of tissue-level inter-species difference, we conducted single-cell RNA-Seq (scRNA-Seq) on 86,341 cells from six matching regions of two 110 PCD fetal macaque brains (i.e., the dorsolateral prefrontal neocortex [DFC, aka DLPFC], HIP, AMY, STR, MD, and CBC) and single-nucleus RNA-Seq (snRNA-Seq) of 26,933 nuclei from three adult macaque DFC (8, 11 and 11 PY; tables S2–3; (32)). These data were complemented by 17,093 snRNA-Seq samples from the adult human (see (33)) as well as two scRNA-Seq datasets from embryonic and fetal human neocortex (33, 35). In the six fetal macaque brain regions, we identified 129 transcriptomically distinct clusters of cell types (i.e., 19 in DFC, 20 in HIP, 25 in AMY, 22 in STR, 20 in MD, and 23 in CBC) (figs. S7–12; tables S3–4). In the adult human DFC (fig. S13) and adult macaque DFC (fig. S14), we identified 29 and 21 transcriptomically distinct cell types, respectively (table S3, S5, and S6). Alignment of our macaque fetal data with the adult single-nuclei data revealed hierarchical relationships and similarities between major cell classes, reflecting their ontogenetic origins and functional properties (fig. S15). Cell clusters were categorized by their gene expression patterns and assigned identities commensurate with their predicted cell type and, in the case of human adult neocortical excitatory neurons, their putative laminar identity. While the majority of cell clusters were composed of cells derived from all brains, we found a few clusters in subcortical regions (AMY: 2 out of 25 clusters; CBC: 1 out of 23 clusters; and STR: 1 out of 22 clusters) that included cells from a single donor brain. This might be due to variations in dissection, age (even though both fetal macaques were 110 PCD, a 3 to 4 days variation remains), individual differences, and other technical bias. We used the single-cell datasets in this and the accompanying study (33) to deconvolve tissue-level RNA-Seq data, identify temporal changes in cell type specific signatures, analyze differences in cell types and their transcriptomic profiles, and conduct cell type enrichment analyses.
Similarities and differences in the spatio-temporal dynamics of the human and macaque brain transcriptomes
Unsupervised hierarchical clustering and principal component analysis of bulk tissue revealed common principles of transcriptomic regional architecture across development in macaque and human (figs. S3 and S4). Among macaque regions, these analyses showed distinct and developmentally regulated clustering of NCX (combination of 11 areas), HIP, and AMY, with CBC exhibiting the most distinctive transcriptional profile, an observation shared with our complementary study in humans (27, 29, 30, 33, 36). A hierarchical clustering of both fetal and postnatal NCX areal samples revealed their grouping by topographical proximity and functional overlap, similar to those relationships we observed in human brain (fig. S3). Thus, these results show that the transcriptomic architecture of the macaque brain is regionally and temporally specified and reflects conserved global patterns of ontogenetic and functional differences that are also found in humans.
To explore species similarities and differences in the spatio-temporal dynamics of the brain transcriptome, we used the XSAnno computational framework (37) to minimize biases in comparative data analyses due to the disparate quality of gene annotation for the two species. We created common annotation sets of 27,932 and 26,514 orthologous protein coding and non-coding mRNA genes for human-macaque and human- chimpanzee-macaque comparisons, respectively (fig. S2 and (32)). Next, we developed TranscriptomeAge, an algorithm to unbiasedly predict the equivalent ages of human and macaque samples on the basis of temporal transcriptomic changes (32). We chose to optimize this model for age-matching the aforementioned eleven neocortical areas, which are highly similar in terms of their transcriptomes, cellular composition, and developmental trajectories when compared to other brain regions (see (33)). TranscriptomeAge confirmed transcriptomic similarities in both species coinciding with major prenatal and postnatal developmental phases, including fetal development, infancy, childhood, and adulthood (Figs. 1, A and B, S16–18). However, we identified two human developmental periods where alignment suggested that they are transcriptomically distinct from macaque and/or especially protracted. First, 60 PCD macaque specimens (which correspond to human early fetal period (29) according to Translating Time model (38)), most closely aligned with mid-fetal human samples (102 – 115 PCD or 14.5 – 16.5 PCW). This suggests that, transcriptomically, human brain development is protracted even at early fetal periods. Second, we found that 2, 3.5, 4, 5, and 7 PY macaque specimens, of which at least the youngest should chronologically match to human childhood (39), did not align with any of our human specimens from early or late childhood (1 −12 PY or periods 9 and 10 according to (29)), but aligned with adolescent and adult human (Fig. 1A and B), indicating that, consistent with previous morphophysiological and behavioral studies (5), macaques lack global transcriptomic signatures of late childhood and/or that humans have a prolonged childhood relative to macaque (Fig. 1A and B).
Species differences in the timing of concerted neurodevelopmental processes
We hypothesized that the observed developmental differences between humans and macaques might be grounded on transcriptomic changes in concerted biological processes in developmental timing (i.e., heterochrony). By decomposing the gene expression matrix of human neocortical samples, we identified five transcriptomic signatures underlying neocortical development (32). Using top cell type-specific genes derived from our prenatal single-cell and adult single-nucleus data, we analyzed cell-type enrichment of each of the five signatures, and ascribed them to neurogenesis, neuronal differentiation, astrogliogenesis, synaptogenesis, and oligodendrocyte differentiation and myelination (Figs. 1C and D, S19). To determine whether the transcriptomic signatures we identified were correctly assigned, we compared their developmental patterns to the timing of major human neurodevelopmental processes, expression trajectories of key genes previously implicated in those processes, and to the trajectories of cell-type proportions identified by the deconvolution of tissue level data (figs. S19–20). We found that the developmental trajectories of genes associated with neuronal differentiation, synaptogenesis, and myelination, as well as the cell-type proportions of fetal human or macaque excitatory neurons, astrocytes, and oligodendrocytes, matched those of the corresponding transcriptomic signatures (fig. S20). Moreover, comparison of transcriptomic signatures to independently generated non-transcriptomic data predicting the start and end of human neocortical neurogenesis (for neurogenesis) (40), and data measuring the number of DCX-immunopositive nascent neurons in the human hippocampus throughout development and adulthood (for neuronal migration and initial differentiation) (41), the developmental variation in synaptic density in the human cortex (for synaptogenesis) (42), and the myelinated fiber length density (for myelination) (43) also confirmed the identities we assigned to these transcriptomic signatures (fig. S19).
Next, we analyzed how the shape of the five transcriptomic trajectories was conserved across the eleven neocortical areas within each species and between species. Analysis of their trajectories within each species revealed that the shape of a given trajectory is similar across neocortical areas (Fig. 1C and S17). However, the transcriptomic trajectories associated with oligodendrocyte differentiation and myelination exhibited a prominent temporal shift (asynchrony) across neocortical areas in both species (fig. S17). Between species, myelination and, to a lesser extent, synaptogenesis exhibited species differences in the shapes of these trajectories; the myelination transcriptomic signature progressively increases in the human neocortex beginning from late fetal development through adulthood without reaching an obvious plateau until 40 PY, but in the macaque neocortex the myelination signature reaches a plateau around the first postnatal year (Fig. 1C). This corresponds to early childhood in human neurodevelopment (window 6 or period 10 according to (33) or (29), respectively) and is consistent with histological studies and reflective of previously reported hierarchical maturation of neocortical areas (43–47). Similarly, we corroborated synchronous or concurrent transcriptomic patterns of neocortical synaptogenesis by analyzing previously collected data on synaptic density in multiple areas of the macaque neocortex (48) (Fig. S19). However, we observed that the synaptogenesis transcriptomic trajectory peaked earlier in human than macaque, at the transition between late infancy and early childhood (Fig. 1C). In addition, expression trajectories of genes induced by neuronal activity, a process critical for synaptogenesis, also showed dramatic increases during late fetal development and infancy, and like the synaptogenesis trajectory displayed a concurrent or synchronous shape across neocortical areas (see (33)). Interestingly, the developmental transcriptomic profile of DCX, a marker of nascent, migrating neurons, showed that macaques maintain higher expression in the hippocampus throughout postnatal development and adulthood, suggesting that postnatal neurogenesis is more prominent in macaque hippocampus than in human, as recently shown (fig. S19) (49). Thus, both species exhibited distinct transcriptomic signatures of neoteny, such as prolonged myelination in humans and prolonged postnatal hippocampal neurogenesis in macaques. Together, these data suggest the temporal staging of major neurodevelopmental processes, in particular with myelination beginning in primary areas before association neocortical areas, is a conserved feature of primate development, but the temporal progression of certain processes is heterochronic.
Concerted ontogenetic and phylogenetic transcriptomic divergence
After matching the global transcriptome by age between the two species, we analyzed regional differences in gene expression (heterotopy) within each species. By adopting Gaussian process models to accommodate the spatio-temporal correlations of gene expression (32), we found that the developmental cup-shaped or hourglass-like pattern of transcriptomic inter-regional differences we observed in humans (33) is also present in macaque neocortices and other brain regions (Figs. 2, A and B, S21) with greater differential expression between regions observed during early and mid-fetal ages preceding this period and subsequent young adulthood. Notably, two brain regions – CBC and STR – exhibited greater differences, compared to other brain regions, beginning immediately after birth, rather than beginning during childhood/adolescence (fig. S21). This suggests that the development of the primate forebrain may be constrained by unique developmental or evolutionary influences, leading us to query the gene expression patterns, developmental processes, and cell types underlying this transcriptomic phenomenon.
Fig. 2. Ontogenetic inter-regional transcriptomic differences display a cup-shaped pattern in human and macaque.
(A-B) The inter-regional difference of a given module was measured as the average distance of each neocortical area to all other areas in the (A) human and (B) macaque neocortices across development. The upper quartile inter-regional difference among all genes was plotted and the magnitude was shown in colors. The gray planes represent the transition from prenatal to early postnatal development (late fetal transition) and from adolescence to adult. (C) The number of co-expression modules that display gradient-like expression (anterior-posterior, posterior-anterior, medial-lateral, temporal lobe-enriched), and enrichment in primary areas or enrichment in association areas in each developmental phase. Left panel corresponds to human modules, right panel to macaque modules. (D) Donut plots depicting the modules (from (C)) that exhibited species-distinct inter-regional differences. Red indicates high expression of the genes in the module; blue indicates low expression of the genes in the module. Prenatal modules show a human-distinct anterior-posterior expression gradient (left panel); macaque-distinct early postnatal modules show enrichment in primary or association areas (middle panel); and a macaque-distinct adult module is enriched in association areas, especially in MFC (right panel). The expression pattern of each species-distinct module is shown in human (top) and macaque (bottom). HS – Human (Homo sapiens) module; MM – macaque (Macaca mulatta) module.
In order to do that, we considered three phases of brain development mirroring major transitions in the cup-shaped pattern: prenatal development, early postnatal development, and adulthood. Between these three phases are two transitional periods: a steep late-fetal transition (33) and a more moderate transition between childhood/adolescence and adulthood. We performed weighted gene co-expression network analysis (WGCNA) independently for each phase and species, resulting in Homo sapiens (HS) and macaque (Macaca mulatta, MM) modules ((32); table S7), with analyses conducted on eleven neocortical areas; this allowed us to identify discrete spatio-temporal expression patterns that otherwise might be co-mingled due to the highly disparate nature of CBC and other non-neocortical regions. Within the prenatal phase, we found twelve modules consisting of genes exhibiting spatial expression gradients along the anterior/posterior (8 modules), and medial/lateral (1 module) axes of the neocortex and broadly reflecting prospective neocortical areal topography (Fig. 2C). For example, prenatal modules HS85 and HS87 exhibited prefrontal/frontal-enriched graded expression in the human brain tapering to lowest expression in the temporal and occipital lobes (Fig. 2D). Furthermore, prenatal modules, such as HS15 and MM57, had their highest expression restricted to the temporal lobe (fig. S22–23; table S8) during prenatal development.
In contrast to the prenatal phase, no modules were identified from early postnatal development (i.e., infancy, childhood, and adolescence) in either species exhibited similar anterior/posterior or medial/lateral gradients in the expression of their constituent genes. Rather, the greater regional synchrony characterizing gene expression in this phase yielded differences organized not around topography but between primary and association areas of the neocortex (Fig. 2C, S24–25; table S9). This suggests that the gradient-like transcriptomic patterns arising during prenatal development are superseded by myelination and neuronal activity-related processes postnatally, which may differentiate the separation between primary and association areas. Early postnatal modules such as MM42, MM24, and MM23, among others, exhibited greater expression in primary areas such as M1C, A1C, and V1C as compared to association areas such as DFC and VFC (Fig. 2D).
The transition to young adulthood was marked by another decrease in inter-regional differences, but this reduction was not as pronounced as in the late-fetal transition, nor were inter-regional patterns of gene expression markedly different in the adult. Thus, gene expression differences between primary and association areas continued to drive regional variation in both the adult human and macaque (Figs. 2, C and D, S26–27; table S10). Gene Ontology (GO) enrichment analysis using the top variant genes in each period, with all genes expressed in each period as background, indicated that there is differential enrichment of biological processes associated with different cell populations across areas and time. As observed in the accompanying human study (33) and commensurate with the developmental trajectories of the observed transcriptomic signatures, the functional terms enriched prenatally were generally related to neurogenesis and neuronal differentiation, whereas early postnatal and adult functional terms were enriched for processes related to synaptogenesis and myelination (fig. S28).
We next sought to determine whether the regional-specific expression patterns of co-expression modules detected in human correlated with their expression patterns in macaque, and vice versa (32). We found that genes in two human prenatal modules exhibiting a pronounced anterior/posterior gradient in the human neocortex, HS85 and HS87, did not contain genes with enriched expression in the macaque prefrontal cortex (Fig. 2D; table S8). Among genes in these modules were RGMA and SLIT3, two genes encoding axon guidance molecules (50), and BRINP2 and CXXC5, which encode proteins involved in retinoic acid signaling (51), potentially implicating this signaling pathway, critical for early brain development and neuronal differentiation (51), in the patterning of the human prefrontal cortex. We also observed several modules in macaque postnatal development (MM23, MM24, MM26, and MM42) that did not correlate well with human modules and were enriched for genes that are expressed in oligodendrocytes (Fig. 2D; fig. S24; table S9) and were up-regulated in all primary areas of macaque NCX relative to association areas. Conversely, genes in these modules were up-regulated in humans only in M1C and A1C, but not in S1C or V1C (fig. S24; table S9). Integration with our multi- regional database of the adult chimpanzee transcriptome (34) indicates that the macaque gene expression pattern, rather than the human gene expression pattern, may be unique among these species (fig. S29). Many of the species-specific patterns of diversification between primary and association areas that we observed during early postnatal development were preserved in adult modules of both species (fig. S26), with some notable exceptions. For example, the adult macaque module MM25 exhibited up- regulation in association areas in both species but prominent up-regulation in medial prefrontal cortex (MFC) and down-regulation in V1C only in macaque (Fig. 2D; fig. S26; table S10). These findings reaffirm a conserved framework in primate neocortical development and function (21), including a topographic basis for transcriptomic differences during prenatal development and functional relationships postnatally. Our analyses also suggest that inter-regional and inter-species differences in oligodendrocyte development and myelination, particularly during early postnatal development, mediate key aspects of transcriptomic variation both within and among species.
Heterotopic changes in human and macaque brain transcriptome
We next investigated the transcriptomic divergence between human and macaque for each brain region across development. We found that the developmental phases exhibiting high levels of inter-regional differences within each species (i.e. prenatal development and young adulthood) also displayed greater transcriptomic divergence between the two species, revealing a concerted phylogenetic (evolutionary) cup-shaped pattern (Fig. 3A). This phylogenetic cup-shaped pattern divided neurodevelopment into the same three phases as the regional ontogenetic (developmental) cup-shape (Fig. 3A). However, unlike the ontogenetic (developmental) cup-shaped pattern, where CBC, MD and STR disproportionally exhibited more intra-species differences than NCX, HIP, and AMY, all regions appeared to exhibit a relatively similar amount of inter-species differences (Fig. 3A). Interestingly, inter-species differences among neocortical areas were distinct enough to provide clear clustering of topographically and functionally related prefrontal areas (i.e., MFC, OFC, DFC, and VFC), particularly during prenatal development, or topographically distributed non-visual primary areas (i.e., M1C, S1C, and A1C) in adulthood. Prospective areas of the prefrontal cortex, which underlie some of the most distinctly human aspects of cognition, were more phylogenetically distinct than other neocortical areas during early prenatal development (Fig. 3A and fig. S30). Together, these findings suggest that the evolutionary and developmental constraints acting on the brain transcriptome, in particular NCX, may share some overlapping features.
Fig. 3. Transcriptomic divergence between human and macaque throughout neurodevelopment reveals a phylogenetic cup-shaped pattern.
(A) The inter-species divergence, measured as the absolute difference in gene expression, between human and macaque in each brain region throughout development (coded as in Fig 2A). The upper quartile divergence among all genes was plotted. The gray planes represent the transition from prenatal to early postnatal development (late fetal transition, left) and from adolescence to adult (right). (B) Venn diagram displaying the number of differentially expressed genes (DEX, top) or genes with differential exon usage (DEU, bottom) between human and macaque in at least one brain region during prenatal development, early postnatal development, and adulthood. (C) Bubble matrix with examples of genes showing global or regional inter-species differential expression. Brain regions displaying significant differential expression between human and macaque are shown with black circumference. Red circles show up-regulation in human; blue shows up-regulation in macaque. Circle size indicates absolute log2 fold change. (D) Percentage of overlap between genes showing the highest inter-species divergence in each region (driving the evolutionary cup-shaped pattern), and genes with the largest pairwise distance between brain regions in prenatal (red), early postnatal (green) and adult (green) human (solid lines) and macaque brains (dashed lines; driving the developmental cup-shaped pattern). The result was plotted using a variable number of the highest ranked genes based on inter-regional difference and the inter-species divergence. Mean and standard deviation (error bar) across regions were plotted.
To get insights into the transcriptomic programs driving phylogenetic divergence across neocortical areas, we conducted a functional annotation of the top 100 genes driving the observed variation along the first principal component (PC1). We found that inter-species divergence in the prenatal prefrontal cortex could be explained by an enrichment of genes related to cell proliferation (FDR < 10−5). This indicated that the observed inter-species divergence in prefrontal cortex was likely due to a different proportion of progenitor cells in the early fetal human prefrontal tissue samples (fig. S30). In contrast, during postnatal development, PC1 separated prefrontal areas and inferior temporal cortex (ITC) from the other neocortical areas. This pattern was mainly driven by genes associated with myelination-associated categories (FDR < 0.05; fig. S30) and genes associated with synaptic transmission (FDR < 0.05; fig. S29). While speculative, these observations potentially link the expansion of human prefrontal cortex, the wealth of human-specific connectivity made possible by that extension, and the altered patterns of myelination we observe between human and macaque. Confirming the observed regional diversification in each species, postnatal development displayed the lowest number of differentially expressed genes between species and most of these (89.3%) were also differentially expressed in adulthood, the phase where we observed the greatest number of inter-species differentially expressed genes (Fig. 3B; table S11). Genes differentially expressed between human and macaque exhibited distinct patterns of spatio-temporal divergence (Fig. 3C) and showed diverse functional enrichment (table S12). While 229 genes (2.6%) displayed up- or down-regulation in all the sampled brain regions throughout development and adulthood, others were specifically up- or down-regulated in a subset of brain regions and/or in a particular developmental phase.
To test whether genes with differential expression between human and macaque showed distinct conservation profiles, we compared dN/dS values for the whole set of genes differentially expressed in any of the 16 brain regions in at least one of the three developmental phases (32). We found that the differentially expressed genes between human and macaque also show significantly higher dN/dS values associated with higher evolutionary rates than the remaining protein coding genes (Wilcoxon-Mann-Whitney P = 2.2×10−8, n = 4,429). This result was also observed when we focused on the genes differentially expressed in prenatal development (P = 3.7×10−11, n = 2,380 genes), early postnatal development (P = 4.5×10−24, n = 1,765 genes) or adulthood (P = 1.0×10−6, n = 3,837 genes) separately. Moreover, these higher dN/dS values for differentially expressed genes remained highly significant in all the brain regions and developmental phases analyzed, highlighting the consistent association between inter-species transcriptional variation and gene evolution.
Integration with our complementary dataset generated on adult chimpanzee brains (34), revealed that 531 (10.6%), 507 (12.9%), and 1079 (13.9%) genes differentially expressed between species in prenatal development, early postnatal development and adult, respectively, show human-specific expression in the same brain region in the adult brain. Several genes among those exhibiting species- or human-specific patterns of gene expression were developmentally and regionally regulated. PKD2L1, a gene that encodes an ion channel (52), exhibited human-specific up-regulation only postnatally (Fig. 3C). Conversely, TWIST1, a gene encoding a transcriptional factor implicated in Saethre-Chotzen syndrome (53), showed human-specific down-regulation only postnatally (Fig. 3C). In contrast, MET, a gene linked to autism spectrum disorders (54), showed human-specific up-regulation in the prefrontal cortex and STR postnatally (Fig. 3C). PTH2R, a gene encoding the parathyroid hormone 2 receptor, exhibited macaque-distinct up- regulation in prenatal NCX, but human-distinct up-regulation in the adult NCX, and is enriched, as evidenced by immunohistochemistry, in excitatory neurons (fig. S31). These results show that at least some of the tissue level inter-species differences we observed are due to changes at the level of specific cell types. Furthermore, even though the ontogenetic and phylogenetic patterns have similar profiles, the overlap of genes driving these two patterns is not substantial (Fig. 3D), indicating the existence of different molecular mechanisms and constraints for regional specification and species divergence.
To gain a more complete understanding of the inter-species transcriptomic differences, we performed inter-species differential exon usage as a conservative way of exploring the impact of putative differential alternative splicing. We detected largely similar numbers of genes containing clear differentially used exons between species in all developmental phases ((32); table S13), with 1,924 genes showing inter-species differential exon usage in at least one brain region during the prenatal phase, 1,952 during the early postnatal phase, and 1,728 during adulthood (Figs. 3B and S32). In our set of differentially used exonic elements, non-protein coding regions were overrepresented (P< 2.2×10−16, Chi-squared independence test), with 4,705 out of the 5,372 differentially used exonic elements in non-coding regions. This enrichment was especially strong for non-UTR exonic elements belonging to non-protein coding transcripts from protein-coding genes and 5’ UTR regions (P < 2.2×10−16), but also significant for 3’ UTR regions (P = 1.81×10−11) and non-UTR exonic elements from non-protein coding genes (P = 0.02364), suggesting post-transcriptional regulation may contribute to species differences at the exon level.
Phylogenetic divergence in transcriptional heterotopic regulation
Because transcription factors can regulate the expression of multiple genes, the differential expression we observe between species in different brain regions might be mediated in part by differential expression of a relatively small number of transcription factors. To assess this possibility, we searched for transcription factor binding sites (TFBS) that were enriched in the annotated promoters of inter-species differentially expressed genes for each brain region and developmental stage in our analysis (32). We found that the binding sites for 86 transcription factors were enriched among inter-species differentially expressed genes, 7 of which were inter-species differentially expressed genes (table S14). RUNX2 was differentially expressed between human and macaque in the prenatal HIP, PAX7 in the early postnatal AMY, STAT6 in the prenatal NCX, STAT4 in the early postnatal and adult NCX, SNAI2 in the adult CBC, and EWSR1 and NEUROD1 in the adult NCX. Although these enriched motifs were found in only a relatively small proportion of the promoters of the inter-species differentially expressed genes (table S15), expression changes of almost 30% of the differentially expressed genes in NCX can be explained solely by STAT4, EWSR1, and NEUROD1, transcription factors that have been previously implicated in neuronal development (55) and brain disorders (56, 57). This suggests that species differences in the expression levels of influential transcription factors could be phenotypically relevant.
To substantiate the possibility that these transcription factors might regulate inter-species differences in gene expression, we next conducted an independent analysis utilizing epigenomic data of regulatory elements (including both active promoters and enhancers) showing macaque-human differential signal of H3K27ac binding in different regions of adult brains (58). Using region-matched (i.e., NCX, STR, MD, and CBC) aspects of this dataset, we performed TFBS enrichments for both the regions defined as up-regulated in human as well as those down-regulated in human compared to macaque ((32), table S16–18). As before, we then compared TFBS enriched among regulatory elements differentially detected in human and macaque with the transcription factors differentially expressed in a given area/region between species. We observed a higher number of differentially expressed transcription factors associated with binding sites selective for epigenetic loci down-regulated in human (17, 6, 6, and 1 for NCX, CBC, MD, and STR, respectively) than for loci up-regulated in human (3, 1, and 1, for NCX, CBC, and MD). Moreover, 86% of promoters associated with inter-species differentially expressed genes in the NCX contained TFBS for transcription factors that were differentially expressed between species in NCX. The same was true for 33% of all differentially expressed genes retrieved from CBC, 29% for the differentially expressed genes in MD, and 8.5% of the differentially expressed genes in STR.
We found that all enriched DEX-TFBS were also found in the TFBS enrichment using epigenomic data (58) in matched brain regions and developmental stages. The good agreement between both independent datasets supports the regulatory relevance of these DEX-TFBSs in driving the expression changes of other DEX genes.
Diversity and cell type specificity of species differences
To explore whether cell type-specific transcriptomic changes account for the inter-species divergence observed at the tissue level, we tested the enrichment of human up-regulated genes in human single cells and human down-regulated genes in macaque single cells. Furthermore, we used prenatal scRNA-Seq data for prenatal differentially expressed genes and adult snRNA-Seq data for the early postnatal and adulthood periods (Figs. 4, A and B; fig. S33). In all prenatal neocortical areas, human up-regulated genes were enriched in neural progenitors, indicating that human NCX may possess more neural progenitors at matched timepoints compared with macaque counterparts, although we cannot completely exclude the possibility that a lack of macaque samples matching human early fetal samples (Fig. 1, A and B) might contribute to this observation, despite the efforts we made to minimize the effects of sampling bias between species by fitting a Gaussian process model. In contrast, macaque up-regulated genes were enriched in multiple subtypes of excitatory and inhibitory neurons in all neocortical areas (Fig. 4A). Interestingly, a specific subtype of excitatory neurons (i.e., ExN2) was enriched for the macaque up-regulated genes only in prefrontal areas. In the postnatal and adult NCX, human up-regulated genes were enriched in a single population of likely upper-layer excitatory neurons (ExN2b), which was not described in a recent snRNA-Seq study of the adult human neocortex (59). Conversely, postnatally up-regulated macaque genes were enriched in multiple subtypes of excitatory neurons (Fig. 4B). Inter-species differentially expressed genes in non-neocortical brain regions of the prenatal brain were also enriched in specific cell types (fig. S33). For example, genes displaying inter-species differentially expressed genes in HIP, STR, and CBC were enriched in a population of oligodendrocyte progenitor cells (OPC), medium spiny neurons (MSN), and external granular layer transition to granule neuron cells (EGL-TransGraN), respectively. Furthermore, genes showing inter-species differential expression in hippocampus, amygdala, striatum, and cerebellum were enriched in a population of microglia (fig. S34).
Fig. 4. Cell-type specificity of species differences.
(A) Cell type enrichment for genes up- or down-regulated in human neocortical areas. Enrichment of genes up-regulated in human or macaque was tested using single cells from prenatal human neocortex (33) or macaque DFC, respectively. The plot shows -log10-P values adjusted by Benjamini–Hochberg procedure averaged across all neocortical areas (NCX), prefrontal areas (PFC) and non-prefrontal areas (nonPFC). Significance was labeled by color (True: red and False: gray). (B) Cell type enrichment for genes up- or down-regulated in human neocortical areas. Enrichment of genes up-regulated in human or macaque was tested using single nuclei from adult human neocortex (33) or macaque DFC, respectively. P values were adjusted by Benjamini–Hochberg procedure and the log-transformed P values averaged across all neocortical areas (NCX), prefrontal areas (PFC) and non-prefrontal areas (nonPFC) was plotted (size). Significance was labeled by color (True: red and False: gray). (C) Cell type-enrichment of selected genes showing human-distinct up- or down-regulation in adult brain regions or neocortical areas (34). Preferential Expression Measure (PEM) (size and color) was plotted to show the cell type specificity.eNPC – embryonic neuroepithelial progenitor; eIPC – embryonic intermediate progenitor cell; eNasN – embryonic nascent neuron; ExN – excitatory neuron; InN – interneuron; Astro – astrocyte; OPC – oligodendrocyte progenitor cell; Oligo – oligodendrocyte; Endo – endothelial cell; VSMC – vascular smooth muscle cell.
By integrating our single-cell data sets with a tissue-level transcriptomic dataset of adult human, chimpanzee, and macaque brains (34), we identified the cell-type enrichment of several genes showing human-specific up- or down-regulation in NCX or all brain regions compared with chimpanzee and macaque. For example, CD38 was found to be down-regulated in all human brain regions and enriched in astrocytes (Fig. 4C). This gene encodes a glycoprotein that is important in the regulation of intracellular calcium and its deletion leads to impaired development of astrocytes and oligodendrocytes in mice (60). CLUL1, a gene reported to be specifically expressed in cone photoreceptor cells (61), showed human-specific up-regulation in all brain regions and was enriched in oligodendrocytes and astrocytes. TWIST1 exhibited human-specific down-regulation in all neocortical areas postnatally, and is enriched in putative upper-layer excitatory neurons (Fig. 4C). Conversely, PKD2L1 is up-regulated in NCX postnatally and enriched in putative deep-layer excitatory neurons (Fig. 4C). MET exhibited human-specific up-regulation in the prefrontal cortex and STR postnatally, and is enriched in upper-layer excitatory neurons (Fig. 4C).
Shared and divergent transcriptomic features of homologous cell types
To test whether the observed differential expression between human and macaque was due to differences in cell type composition or due to transcriptomic differences between homologous cell types, we performed a comparative analysis between human and macaque cell types of prenatal and adult dorsolateral prefrontal cortices. The correlation between human and macaque cell types showed that all human cell types had a close homologue in macaque, and vice versa (Fig. 5A and B). Nevertheless, we identified genes showing inter-species differential expression in homologous cell types (Fig. 5C). To avoid biases inherent to high variation in sc- or sn-RNA-Seq, we filtered out genes that did not display differential expression between species at the tissue-level and only included genes that exhibited enrichment in cell types where they show inter-species differential expression (Preferential expression measure > 0.3; (32)). We identified 14 differentially expressed genes in prenatal development and 41 differentially expressed genes in adulthood (Fig. 5C). For example, TRIM54, which encodes a protein implicated in axonal growth (62), was down-regulated in human prenatal neocortical excitatory neurons (Fig. 5C). VW2CL, which encodes a protein associated with AMPA-type glutamate receptors (63), was down-regulated in prenatal human neocortical interneurons. SLC17A8 (aka VGLUT3), which encodes the vesicular glutamate transporter 3, is up- regulated in human postnatal somatostatin-positive interneurons (InN8). Overall, we found that human DFC cell types showed high correlation with macaque DFC cell types and that only a small set of genes displayed differential expression between these homologous cell types (Fig. 5C). Thus, the inter-species differences identified at the tissue level are likely to be the results of variations in cellular diversity, abundance, and, to a lesser extent, transcriptional divergence between cell types.
Fig. 5. Shared and divergent transcriptomic features of homologous cell types between human and macaque.
(A) Dendrogram and heatmap showing diversity and correlation of prenatal cell types within and between the two species. The human single cells were from (33). (B) Dendrogram and heatmap showing diversity and correlation of adult cell types within and between the two species. (C) Cell type specificity of inter- species differentially expressed genes based on the single-cell/nucleus information. Blue, human down-regulated genes; and red, human up-regulated genes.
Heterochronic changes in human and macaque brain transcriptome
The observed heterotopic differences may result, in part, from changes in the timing of gene expression, or heterochrony. To identify such heterochronic differences, we created a Gaussian-process based model (TempShift; (32)) and applied this model independently to human and macaque gene expression datasets. Due to the similar nature of the transcriptomic signature of the eleven neocortical areas relative to other brain regions, and to maintain consistency with earlier analyses, we focused our analysis on eleven neocortical areas (see (33)). We identified genes with inter-regional temporal differences within neocortical areas of each species and aggregated them into 36 regional clusters (RC; fig. S35; table S19). For both human and macaque brain, analysis of all heterochronic genes revealed greater inter-areal differences during prenatal periods than early postnatal or adult ages (fig. S36). In addition, although we observed differences in inter-areal heterochrony between the early postnatal phase and the adult phase in human, we did not observe these differences in macaque (fig. S36). This suggests that inter-regional synchrony in the macaque precedes that in age-matched human, possibly reflecting the protracted development of the human brain during childhood and the earlier plateauing of myelination-associated processes in macaque postnatal development (Figs. 1C and S19). Analysis of the regional clusters revealed further insights into shared and species-distinct aspects of neurodevelopment. For example, we identified five regional clusters (RC 4, 21, 26, 29, and 34) enriched for genes expressed selectively by neural progenitors that exhibited temporal differences between human neocortical areas (fig. S35). Each of these clusters exhibited a gradient whereby a decrease in expression in central regions of the prenatal neocortex preceded a decrease at the anterior and posterior poles, suggesting increased progenitor populations or a prolonged neurogenic period in prefrontal cortex as well as STC, ITC, and V1C. However, although we observed similar temporal gradients in macaque for RC 4, 26, and 29, neither RC21 nor RC34, the modules exhibiting the sharpest delay in posterior neocortex, exhibited a similar central- to-polar gradient in macaque (Fig. 6A). Conversely, RC10 and RC12 exhibited an inverse gradient in humans, with decreased expression in prefrontal neocortex and STC, ITC, and V1C preceding a decrease in central cortex. These modules, which are enriched in astrocytes, did not exhibit a similar gradient in macaque (Fig. 6A; fig. S35). This indicates that even though the transcriptomic signature associated with astrogliogenesis showed a global synchronicity between species (Figs. 1C and S19), a smaller group of genes enriched in astrocytes displayed heterochrony between species.
Fig. 6. Heterochronic expression of regional and inter-species gene clusters.
(A) Clusters of genes exhibiting species-distinct regional heterochronic expression patterns in human and macaque brains at various prenatal periods and adulthood. The timing of expression of genes in the cluster is represented by blue (earlier expression) to red (later expression). Prenatal heterochronic regional clusters RC21 and RC34 show earlier expression (blue) in human prenatal fronto-parietal perisylvian neocortical areas (M1C, S1C, and IPC) and enrichment in neural progenitors. RC10 is composed of genes with earlier expression in the human prenatal prefrontal cortex and enrichment in astrocytes. These observed regional expression patterns are not present in the macaque prenatal NCX. Adult heterochronic cluster RC25 shows earlier expression in primary areas of the macaque cortex and enrichment for genes associated with oligodendrocytes. (B) A network of 139 inter-species heterochronic genes (blue) is enriched for targets of putative upstream transcriptional regulators that include those encoded by eight genes of the same network (red), and TWIST1 (green), a transcription factor with inter-species heterotopic expression (fig. S34). Arrows indicate direction of regulation. (C) Top five canonical pathways enriched among inter-species heterochronic genes in at least one neocortical area. (D) Cluster EC14 shows inter-species heterochronic expression, exhibits a delayed expression specifically in the human prenatal prefrontal cortex, and is enriched for genes selectively expressed by intermediate progenitor cells (IPC).
Despite the global enrichment of heterochronic genes in prenatal development (fig. S36), we also identified clusters exhibiting higher inter-regional differences in postnatal development and adult. One example is RC 25, a cluster enriched for oligodendrocyte markers that exhibited a pattern of early expression in primary motor and somatosensory areas in macaque but not in human NCX (Fig. 6A). This corroborates myelination-related regional asynchrony, as primary areas myelinate earlier, and that there is inter-species heterochrony in oligodendrocyte maturation and myelination- associated processes. Reflective of the cup-shaped pattern of regional variation in global development, the regional clusters also suggest the asynchronous maturation of prenatal areas, a gradual synchronization during early postnatal development in both species, and additional postnatal and adult differences driven in part by myelination.
We next applied TempShift to identify genes exhibiting inter-species heterochronic divergence. Among eleven neocortical areas, we identified approximately 3.9% of coding and non-coding mRNA genes (1,100 out of 27,932 analyzed orthologous genes) exhibiting inter-species heterochronic expression in at least one neocortical area. We then used Ingenuity Pathway Analysis to assess upstream transcriptional regulation of heterochronic genes. We found that the differential expression of 139 inter-species heterochronic genes could be explained by as few as 8 co-regulated heterochronic transcriptional regulators (Fig. 6B; (32)), plus one transcription factor with heterotopic expression (down-regulated in the postnatal human neocortex) between species, TWIST1 (fig. S37). Interestingly, a majority (90 out of 139) of these putative target genes of the nine transcriptional regulators exhibited accelerated expression in the human neocortex. As aforementioned, humans exhibit an accelerated heterochronic pattern for the synaptogenesis transcriptomic signature and the presence of FOS, a neuronal activity-regulated gene, as one of the hubs of this transcriptional network, indicates that this accelerated synaptogenesis likely drives the accelerated expression of several genes in the human neocortex. Furthermore, an ontological analysis of the genes with heterochronic expression revealed an enrichment for functional categories such as “axonal guidance signaling”, “glutamate receptor signaling” and “CREB signaling in neurons” (Fig. 6C), suggesting that heterochronic processes include molecular pathways related to axon guidance and synaptic activity.
We next identified 15 evolutionary clusters (EC) on the basis of the 1,100 heterochronic genes displaying inter-species neocortical heterochronic expression patterns (table S20). Among the evolutionary clusters, EC 14 exhibited a delayed expression in the human dorsolateral prefrontal cortex and was enriched for intermediate progenitor cell (IPC) markers (Figs. 6D and S38) in agreement with the progenitor cell population differences we observed previously in prefrontal cortex, indicating that this neocortical prefrontal area likely has a protracted neurogenesis when compared to macaque. Similarly, the species-distinct maturation gradients of neural progenitors, astrocytes, and oligodendrocytes also support observations we made concerning inter- species heterotopy. These results were supported by selective validation of the expression profiles of heterochronic genes; using droplet digital PCR, we selected 5 genes with different developmental profiles across regions and species (figs. S39–43), confirming not just the expression profiles of these genes but also that our observations were not the result of biases introduced by TranscriptomeAge.
Species difference in spatio-temporal expression of disease genes
Next, we investigated whether genes associated with risk for neuropsychiatric disorders exhibited differences in their spatio-temporal expression between human and macaque. We focused our analysis on genes linked to autism spectrum disorders (ASD) and other neurodevelopmental disorders (NDD), attention deficit hyperactivity disorder (ADHD), schizophrenia (SCZ), bipolar disorder (BD), major depressive disorder (MDD), Alzheimer’s disease (AD), and Parkinson’s disease (PD) in previous genetic studies or through our integrative analysis from the accompanying study or (see (33) and table S21 for details). We next sought to determine whether the expression of genes associated with these neuropsychiatric disorders were enriched in any particular developmental phase. Consistent with previous studies associating the mid-fetal timeframe with specific high confidence ASD (hcASD) genes (see (64)), we found that a larger group of hcASD genes were more highly expressed in the prenatal than the early postnatal and adult brains in both species (fig. S44). In contrast, AD-related genes were more highly expressed in the early postnatal and adult than the prenatal brains in both species (fig. S44). Other groups of disease related genes did not show any obvious global difference across development. Potentially suggesting the involvement of species-specific aspects in the etiology of ASD, NDD and SCZ, we identified genes with heterochronic or heterotopic expression between the two species that are associated with ASD (6 and 0, respectively), non-hcASD NDD (56 and 14, respectively) and SCZ (45 and 14, respectively) (Figs. 7A and B). Unsupervised hierarchical clustering of SCZ- associated genes, but not NDD, with heterotopic expression yielded 5 obvious spatio-temporal clusters, three of which exhibited species differences exclusively during prenatal development (fig. S44). Of the prenatal clusters, cluster 1 showed enrichment in prefrontal cortex, cluster 3 in temporal cortex, and cluster 2 in both frontal and temporal cortices, in human. Cluster 4 displayed an enrichment in the postnatal and adult macaque frontal cortex and cluster 5 exhibited a similar enrichment in adult macaque prefrontal cortex (Fig. 7D).
Fig. 7. Heterotopic and/or heterochronic expression of disease-associated genes between human and macaque.
(A) Bar plot depicting the number of genes associated with autism spectrum disorder (ASD; high confidence [hc]), neurodevelopmental disorders (NDD), attention deficit hyperactivity disorder (ADHD), schizophrenia (SCZ), bipolar disorder (BD), major depressive disorder (MDD), Alzheimer’s disease (AD), and Parkinson’s disease (PD) that display heterochronic divergence between human and macaque. (B) Bubble matrix showing the heterochronic expression of ASD and SCZ- associated genes. Blue represents earlier expression in human; red represents earlier expression in macaque. (C) Bar plot depicting the number of genes associated with neuropsychiatric disorders that exhibit heterotopic divergence between human and macaque. The 14 SCZ-associated genes that displayed heterotopy were grouped into 5 clusters on the basis of their spatio-temporal expression profiles (fig. S41). (D) Donut plots exhibiting the centered expression of the 5 SCZ-associated heterotopic clusters in prenatal, early postnatal development, and adulthood. Cluster numbers are labeled with the same color as in panel C. Clusters that are not significantly divergent between species in each period are grey and do not have a black border. Red indicates high expression; blue indicates low expression.
Further analysis revealed that the ASD-associated genes SHANK2 and SHANK3, which encode synaptic scaffolding proteins at the postsynaptic density of excitatory glutamatergic synapses, exhibited earlier expression in the macaque NCX and other brain regions relative to human (Fig. 7B). Commensurate with a role for these proteins in neural circuit development, and in agreement with analyses suggesting the involvement of neocortical projection neurons in the etiology of ASD, these two genes also became progressively more expressed across prenatal ages in both human and macaque (fig. S45). SCZ-associated genes displaying inter-species heterochrony included GRIA1, a glutamate ionotropic receptor AMPA type subunit that has different expression trajectories in MFC and OFC compared with other neocortical areas, and that is expressed earlier in human VFC, M1C, S1C, IPC, and STC. (Fig. 7B and fig. S45). These evolutionary changes in the spatio-temporal expression of certain disease-associated genes might therefore imply transcriptional underpinnings for potential human-specific aspects of neuropsychiatric disorders. For example, the presence of human-distinct heterochrony in synapse-related proteins associated with ASD, coupled with the lack of obvious heterotopic expression in hcASD genes, may suggest conserved neurodevelopmental programs common to primate species are uniquely shifted temporally in some areas in the human brain, potentially implicating key developmental periods, places, and cell types involved in disease etiology. Similarly, the heterochronic and heterotopic changes we associated with SCZ,in particular those affecting the prenatal prefrontal and temporal cortices, may be involved in human-specific aspect of disease etiology.
Given the importance of untranslated regions and other non-coding regions in the regulation of gene expression as well as disease, we next explored inter-species differences in exon usage between species in genes associated with neuropsychiatric disorders. We observed that 413 genes with differentially expressed exonic elements were linked to the studied diseases. Moreover, we detected 35 disease genes showing differentially used exonic elements with predicted binding sites (65) for miRNAs independently associated with central nervous system diseases ((32, 66), table S22). Several of these genes (e.g., GRIN2B, BCL11B, and NKPD1) were potentially targeted by a large number of disease-associated miRNAs (fig. S46), and gene-miRNA interactions have already been experimentally validated for 11 of the 35 genes we identified, according to miRTarBase (67) (table S23). For example, we detected differential exon usage of BCL11B, a gene involved in the development of medium spiny neurons (68), between human and macaque in the adult STR (fig. S46). However, while BCL11B shows lower expression in human than in macaque STR, the exonic element containing the 3’UTR of BCL11B was itself not differentially expressed. This observation suggests that overexpression in macaque is associated with an isoform containing a shorter 3’UTR region lacking the capacity to be bound by various miRNAs, possibly including, based on predicted binding sites, members of the miR-219 family of brain-specific miRNAs, which have been experimentally shown to interact with BCL11B mRNA (69). Together, these findings indicate that certain genes associated with neuropsychiatric disorders exhibit changes in the timing of their expression, location and splicing pattern between human and NHP brains, and thus may lead to species differences in disease pathogenesis.
Discussion
In this study, we present a comprehensive spatio-temporal transcriptomic brain dataset of the macaque brain. Resource description and access are available at evolution.psychencode.org. In addition, we present computational methods such as TranscriptomeAge to conduct unbiased age matching across species, and TempShift, a Gaussian-process based model for identifying transcriptomic changes in the timing of biological processes.
Our integrative and comparative analysis involving complementary human and adult chimpanzee (33, 34), revealed similarities and differences in the spatio-temporal transcriptomic architecture of the brain and the progression of major neurodevelopmental processes between the two species. For example, we have identified shared and divergent transcriptomic feature among homologous brain regions and cell types. We found transcriptomic evidence suggesting that human childhood is especially protracted, as compared to macaques. It has long been recognized that the development of the human brain is prolonged compared to that of other NHPs and that this slower rate of maturation expands the period of neural plasticity and capacity for learning activities, memory, and complex sensory perception, all processes necessary for higher order cognition (1–4, 14, 28). We also found that the early periods of human fetal neurodevelopment are transcriptomically distinct and protracted as compared to macaque. Interestingly, a similar observation of early neurodevelopmental protraction was recently observed in vitro, in neural progenitors derived from pluripotent cells of human and NHPs (70). However, we also identified cases of neoteny in macaques, such as the protracted postnatal expression of DCX in the hippocampus, likely reflecting differences in neurogenesis between the two species as recently shown (49).
We found that global patterns of spatio-temporal transcriptomic dynamics were conserved between humans and macaque, and display a highly convergent cup-like shape. The most dramatic decrease in inter-regional differences occurs during late fetal ages and before birth, likely reorganizational processes at this developmental period rather than extrinsic influences due to birth and subsequent events (i.e., respiratory activity or other developmentally novel stimuli). Interestingly, after this transitional period, diversification of neocortical areas appears to be driven mainly by differences between primary and association areas. In addition to these largely conserved broad developmental patterns of inter-regional differences, we identified numerous genes and gene modules with human-distinct heterochronic or heterotopic expression. These patterns involved brain regions such as the developing prefrontal areas, which are central to the evolution of distinctly human aspects of cognition and behavior (19–21). Surprisingly, we also found that developmental phases exhibiting high levels of inter-regional differences (i.e., early to mid-fetal periods and young adulthood) were also less conserved between the two species. The coincident convergence of the ontogenetic and phylogenetic cups during the late fetal period and infancy is strikingly distinct from the previously reported phylogenetic transcriptomic hourglass-like pattern that occurs during the embryonic organogenetic period (71, 72).
Genes with divergent spatio-temporal expression patterns included those previously linked to ASD, SCZ and NDD. These species differences in the expression of disease-associated genes linked to synapse formation, neuronal development, and function, as well as regional and species differences in synaptogenesis and myelination, might have implications for the overall development of neural circuitry and consequently human cognition and behavior. These observations are possibly relevant for recent non-human primate models of neuropsychiatric disease, such as the SHANK3-deficient macaque model (73), which might therefore not be capable of fully capturing human-distinct aspects of SHANK3 regulation during neurodevelopment.
Together, our study reveals insights into the evolution of gene expression in the developing human brain. Future work on the development patterns and the functional validation of the genes we report to have heterotopic and/or expression patterns between human and macaque will likely shed some light on potentially human-specific underpinnings of certain neuropsychiatric disorders.
Materials and Methods
Sixteen regions of the macaque brain spanning from early prenatal to adulthood were dissected using the same standardized protocol used for human specimens and described in the accompanying study by Li et al ((33); see also (32)). The macaque brain regions and developmental timepoints matched human brain regions and timepoints analyzed in the study by Li et al ((33)). The sampled homologous brain regions were identified using anatomical landmarks provided in the macaque brain atlas (74). An overview of dissected brain regions is provided in fig. S1. Translating Time model (38) was used to identify equivalent timepoints between macaque and human prenatal development. The list of macaque brains used in this study and relevant metadata are provided in tables S1–2. Macaque studies were carried out in accordance with a protocol approved by Yale University’s Committee on Animal Research and NIH guidelines.
We performed tissue-level RNA extraction and sequencing of all 16 regions, single-cell RNA-Seq of dorsolateral prefrontal cortex [DFC], hippocampus [HIP], amygdala [AMY], striatum [STR], mediodorsal nucleus of the thalamus [MD], and cerebellar cortex [CBC] of mid-fetal macaque, and single-nucleus RNA-Seq of DFC of adult macaque. Single cell/nucleus sample processing was done with 10X Genomics and sequencing was done with Illumina platforms.
For tissue-level analysis, we generated annotation of human-macaque orthologs using the XSAnno pipeline, and matched the developmental age of human and macaque samples based on their respective transcriptome using our algorithm TranscriptomeAge. We also developed TempShift, a method based on Gaussian process model, to reveal the inter-regional differences, inter-species divergence, and genes with heterotopic and heterochronic expression. We also queried differentially expressed genes for enrichment in transcription factor binding sites using findMotifs.pl, and analyzed inter-species differential exon usage using the R package DEXSeq.
The single cell/nucleus data was first analysed by cellranger for decoding, alignment, quality filtering, and UMI counting. After that, data was further analyzed with Seurat according to its guidelines, and cell types were clustered for classification with SpecScore.R. In order to perform direct comparisons between human and macaque at the single-cell level, we focused on the homologous genes between these species and aligned monkey and human cells together to further analyze inter-species divergence of homologous cell types (fig. S47). We used MetageneBicorPlot function to examine the correlation of neuronal and glial cell subtypes, and we employed the correlation analysis to detect the correspondence of excitatory neuron and interneuron subtypes. Finally, we did functional enrichment of disease-associated genes in both tissue-level and single-cell datasets.
Supplementary Material
Acknowledgements:
We thank M. Horn, G. Sedmak, M. Pletikos, D. Singh, G. Terwilliger, and S. Wilson for assistance with tissue acquisition and processing. We also thank Alvaro Duque for using equipment from MacBrainResource (NIH/NIMH R01MH113257).
Funding: Data were generated as part of the PsychENCODE Consortium, supported by: U01MH103392, U01MH103365, U01MH103346, U01MH103340, U01MH103339, R21MH109956, R21MH105881, R21MH105853, R21MH103877, R21MH102791, R01MH111721, R01MH110928, R01MH110927, R01MH110926, R01MH110921, R01MH110920, R01MH110905, R01MH109715, R01MH109677, R01MH105898, R01MH105898, R01MH094714, P50MH106934 awarded to: Schahram Akbarian (Icahn School of Medicine at Mount Sinai), Gregory Crawford (Duke University), Stella Dracheva (Icahn School of Medicine at Mount Sinai), Peggy Farnham (University of Southern California), Mark Gerstein (Yale University), Daniel Geschwind (University of California, Los Angeles), Fernando Goes (Johns Hopkins University), Thomas M. Hyde (Lieber Institute for Brain Development), Andrew Jaffe (Lieber Institute for Brain Development), James A. Knowles (University of Southern California), Chunyu Liu (SUNY Upstate Medical University), Dalila Pinto (Icahn School of Medicine at Mount Sinai), Panos Roussos (Icahn School of Medicine at Mount Sinai), Stephan Sanders (University of California, San Francisco), Nenad Sestan (Yale University), Pamela Sklar (Icahn School of Medicine at Mount Sinai), Matthew State (University of California, San Francisco), Patrick Sullivan (University of North Carolina), Flora Vaccarino (Yale University), Daniel Weinberger (Lieber Institute for Brain Development), Sherman Weissman (Yale University), Kevin White (University of Chicago), Jeremy Willsey (University of California, San Francisco), and Peter Zandi (Johns Hopkins University). Tomas Marques-Bonet is supported by BFU2017–86471-P (MINECO/FEDER, UE), U01 MH106874 grant, Howard Hughes International Early Career, Obra Social “La Caixa” and Secretaria d’Universitats i Recerca and CERCA Programme del Departament d’Economia i Coneixement de la Generalitat de Catalunya (GRC 2017 SGR 880). Paula Esteller-Cucala is supported by a Formació de Personal Investigador fellowship from Generalitat de Catalunya (FI_B00122). Luis Ferrández- Peral is supported by La Caixa Foundation. David Juan is supported by a Juan de la Cierva fellowship (FJCI-2016–29558) from MICINN. Additional support was provided by the NIH grant MH109904 and MH106874, the Kavli Foundation, the James S. McDonnell Foundation.
Footnotes
Data and materials availability: Data is available at NCBI BioProjects (accession number: PRJNA448973) and via Synapse in www.psycoencode.org. All algorithms, packages, and scripts are available at evolution.psychencode.org. Supplement contains additional data.
Competing interests: Authors have no conflict of interest.
References
- 1.Montagu MFA, Time, Morphology, and Neoteny in the Evolution of Man. Am. Anthropol. 57, 13–27 (1955). [Google Scholar]
- 2.Dobbing J, Sands J, Quantitative growth and development of human brain. Arch. Dis. Child 48, 757–767 (1973). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Prechtl HF, New perspectives in early human development. European journal of obstetrics, gynecology, and reproductive biology 21, 347–355 (1986). [DOI] [PubMed] [Google Scholar]
- 4.Bogin B, Evolutionary perspective on human growth. Annu. Rev. Anthropol. 28, 109–153 (1999). [DOI] [PubMed] [Google Scholar]
- 5.Schultz AH, Age changes in primates and their modification in man. J. M. Tanner, Ed., Growth Human (Pergamon Press, Oxford, 2009). [Google Scholar]
- 6.Silbereis JC, Pochareddy S, Zhu Y, Li M, Sestan N, The Cellular and Molecular Landscapes of the Developing Human Central Nervous System. Neuron 89, 248–268 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hill RS, Walsh CA, Molecular insights into human brain evolution. Nature 437, 64–67 (2005). [DOI] [PubMed] [Google Scholar]
- 8.Fish JL, Dehay C, Kennedy H, Huttner WB, Making bigger brains-the evolution of neural-progenitor-cell division. J. Cell. Sci. 121, 2783–2793 (2008). [DOI] [PubMed] [Google Scholar]
- 9.Geschwind DH, Konopka G, Neuroscience in the era of functional genomics and systems biology. Nature 461, 908–915 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lui JH, Hansen DV, Kriegstein AR, Development and evolution of the human neocortex. Cell 146, 18–36 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.McCarroll SA, Hyman SE, Progress in the genetics of polygenic brain disorders: significant new challenges for neurobiology. Neuron 80, 578–587 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lein ES, Belgard TG, Hawrylycz M, Molnar Z, Transcriptomic Perspectives on Neocortical Structure, Development, Evolution, and Disease. Annu Rev Neurosci 40, 629–652 (2017). [DOI] [PubMed] [Google Scholar]
- 13.Hardingham GE, Pruunsild P, Greenberg ME, Bading H, Lineage divergence of activity-driven transcription and evolution of cognitive ability. Nat. Rev. Neurosci. 19, 9–15 (2018). [DOI] [PubMed] [Google Scholar]
- 14.Paus T, Keshavan M, Giedd JN, Why do many psychiatric disorders emerge during adolescence? Nat. Rev. Neurosci. 9, 947–957 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Liao BY, Zhang J, Null mutations in human and mouse orthologs frequently result in different phenotypes. Proc. Natl. Acad. Sci. U.S.A. 105, 6987–6992 (2008) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Barak T, et al. , Recessive LAMC3 mutations cause malformations of occipital cortical development. Nat. Genet. 43, 590–594 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.O’Bleness M, Searles VB, Varki A, Gagneux P, Sikela JM, Evolution of genetic and genomic features unique to the human lineage. Nature Reviews. Genetics 13, 853–866 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Mirnics K, Pevsner J, Progress in the use of microarray technology to study the neurobiology of disease. Nat. Neurosci. 7, 434–439 (2004). [DOI] [PubMed] [Google Scholar]
- 19.Preuss TM, Human brain evolution: From gene discovery to phenotype discovery. Proc. Natl. Acad. Sci. U.S.A. 109 Suppl 1, 10709–10716 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Sherwood CC, Bauernfeind AL, Bianchi S, Raghanti MA, Hof PR, Human brain evolution writ large and small. Prog. Brain Res. 195, 237–254 (2012). [DOI] [PubMed] [Google Scholar]
- 21.Sousa AMM, Meyer KA, Santpere G, Gulden FO, Sestan N, Evolution of the Human Nervous System Function, Structure, and Development. Cell 170, 226–247 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.van der Worp HB, Howells DW, Sena ES, Porritt MJ, Rewell S, O’Collins V, Macleod MR, Can animal models of disease reliably inform human studies? PLoS Med 7, e1000245 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Marques-Bonet T, Ryder OA, Eichler EE, Sequencing primate genomes: what have we learned? Annu Rev Genomics Hum Genet 10, 355–386 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Izpisua Belmonte JC, Callaway EM, Caddick SJ, Churchland P, Feng G, Homanics GE, et al. , Brains, genes, and primates. Neuron 86, 617–631 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Jennings CG, Landman R, Zhou Y, Sharma J, Hyman J, Movshon JA, et al. , Opportunities and challenges in modeling human brain disorders in transgenic primates. Nat Neurosci 19, 1123–1130 (2016);. [DOI] [PubMed] [Google Scholar]
- 26.Sato K, Sasaki E, Genetic engineering in nonhuman primates for human disease modeling. Journal of human genetics 63, 125–131 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Johnson MB, Kawasawa YI, Mason CE, Krsnik Z, Coppola G, Bogdanovic D, et al. , Functional and evolutionary insights into human brain development through global transcriptome analysis. Neuron 62, 494–509 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Somel M, Franz H, Yan Z, Lorenc A, Guo S, Giger T, et al. , Transcriptional neoteny in the human brain. Proc Natl Acad Sci U S A 106, 5743–5748 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kang HJ, Kawasawa YI, Cheng F, Zhu Y, Xu X, Li M, et al. , Spatio-temporal transcriptome of the human brain. Nature 478, 483–489 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Pletikos M, Sousa AM, Sedmak G, Meyer KA, Zhu Y, Cheng F, et al. , Temporal specification and bilaterality of human neocortical topographic gene expression. Neuron 81, 321–332 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Liu X, Somel M, Tang L, Yan Z, Jiang X, Guo S, et al. , Extension of cortical synaptic development distinguishes humans from chimpanzees and macaques. Genome Res. 22, 611–622 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Materials and Methods.
- 33.Li M, et al. , Integrative functional genomic analysis of human brain development and neuropsychiatric risk convergence. Science, submitted. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Sousa AMM, Zhu Y, Raghanti MA, Kitchen RR, Onorati M, Tebbenkamp ATN, et al. ,, Molecular and cellular reorganization of neural circuits in the human lineage. Science 358, 1027–1032 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Zhong S, Zhang S, Fan X, Wu Q, Yan L, Dong J, et al. , A single-cell RNA-seq survey of the developmental landscape of the human prefrontal cortex. Nature 555, 524–528 (2018). [DOI] [PubMed] [Google Scholar]
- 36.Miller JA, Ding S-L, Sunkin SM, Smith KA, Ng L, Szafer A, et al. , Transcriptional landscape of the prenatal human brain. Nature 508, 199–206 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Zhu Y, Li M, Sousa AM, Sestan N, XSAnno: a framework for building ortholog models in cross-species transcriptome comparisons. BMC Genomics 15, 343 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Workman AD, Charvet CJ, Clancy B, Darlington RB, Finlay BL, Modeling transformations of neurodevelopmental sequences across mammalian species. J Neurosci 33, 7368–7383 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Fooden J, Systematic review of the rhesus macaque, Macaca mulatta (Zimmermann, 1780). (Field Museum of Natural History, 2000), vol. Fieldiana Zoology New Series no.96. [Google Scholar]
- 40.Clancy B, Darlington RB, Finlay BL, Translating developmental time across mammalian species. Neuroscience 105, 7–17 (2001). [DOI] [PubMed] [Google Scholar]
- 41.Knoth R, Singec I, Ditter M, Pantazis G, Capetian P, Meyer RP, Horvat V, Volk B, Kempermann G, Murine features of neurogenesis in the human hippocampus across the lifespan from 0 to 100 years. PLoS One 5, 8809 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Huttenlocher PR, Dabholkar AS, Regional differences in synaptogenesis in human cerebral cortex. J Comp Neurol 387, 167–178 (1997). [DOI] [PubMed] [Google Scholar]
- 43.Miller DJ, Duka T, Stimpson CD, Schapiro SJ, Baze WB, McArthur MJ, et al. , Prolonged myelination in human neocortical evolution. Proc Natl Acad Sci U S A 109, 16480–16485 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Giedd JN, Blumenthal J, Jeffries NO, Castellanos FX, Liu H, Zijdenbos A, Paus T, Evans AC, Rapoport JL, Brain development during childhood and adolescence: a longitudinal MRI study. Nat Neurosci 2, 861–863 (1999). [DOI] [PubMed] [Google Scholar]
- 45.Rapoport JL, Gogtay N, Brain neuroplasticity in healthy, hyperactive and psychotic children: insights from neuroimaging. Neuropsychopharmacology 33, 181–197 (2008). [DOI] [PubMed] [Google Scholar]
- 46.Sowell ER, Peterson BS, Thompson PM, Welcome SE, Henkenius AL, Toga AW, Mapping cortical change across the human life span. Nat. Neurosci. 6, 309–315 (2003). [DOI] [PubMed] [Google Scholar]
- 47.Flechsig Of Leipsic P, Developmental (myelogenetic) localisation of the cerebral cortex in the human subject. The Lancet 158, 1027–1030 (1901). [Google Scholar]
- 48.Rakic P, Bourgeois JP, Eckenhoff MF, Zecevic N, Goldman-Rakic PS, Concurrent overproduction of synapses in diverse regions of the primate cerebral cortex. Science 232, 232–235 (1986). [DOI] [PubMed] [Google Scholar]
- 49.Sorrells SF, Paredes MF, Cebrian-Silla A, Sandoval K, Qi D, Kelley KW, et al. , Human hippocampal neurogenesis drops sharply in children to undetectable levels in adults. Nature 555, 377–381 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Chedotal A, Richards LJ, Wiring the brain: the biology of neuronal guidance. Cold Spring Harbor perspectives in biology 2, a001917 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Rhinn M, Dolle P, Retinoic acid signalling during development. Development 139, 843–858 (2012). [DOI] [PubMed] [Google Scholar]
- 52.Huang AL, Chen X, Hoon MA, Chandrashekar J, Guo W, Tränkner D, Ryba NJ, Zuker CS, The cells and logic for mammalian sour taste detection. Nature 442, 934–938 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Qin Q, Xu Y, He T, Qin C, Xu J, Normal and disease-related biological functions of Twist1 and underlying molecular mechanisms. Cell Res. 22, 90–106 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Campbell DB, Sutcliffe JS, Ebert PJ, Militerni R, Bravaccio C, Trillo S, et al. , A genetic variant that disrupts MET transcription is associated with autism. Proc. Natl. Acad. Sci. U.S.A. 103, 16834–16839 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Pataskar A, Jung J, Smialowski P, Noack F, Calegari F, Straub T, Tiwari VK, NeuroD1 reprograms chromatin and transcription factor landscapes to induce the neuronal program. EMBO J 35, 24–45 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Rodriguez M, Zoecklein L, Gamez JD, Pavelko KD, Papke LM, Nakane S, et al. , STAT4- and STAT6-signaling molecules in a murine model of multiple sclerosis. FASEB J 20, 343–345 (2006). [DOI] [PubMed] [Google Scholar]
- 57.Sturm D, Orr BA, Toprak UH, Hovestadt V, Jones DTW, Capper D, et al. , New brain tumor entities emerge from molecular classification of CNS-PNETs. Cell 164, 1060–1072 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Vermunt MW, Tan SC, Castelijns B, Geeven G, Reinink P, de Bruijn E, et al. , Epigenomic annotation of gene regulatory alterations during evolution of the primate brain. Nat. Neurosci. 19, 494–503 (2016). [DOI] [PubMed] [Google Scholar]
- 59.Lake BB, Ai R, Kaeser GE, Salathia NS, Yung YC, Liu R, et al. , Neuronal subtypes and diversity revealed by single-nucleus RNA sequencing of the human brain. Science 352, 1586–1590 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Rosenberg NL, Kotzin BL, Kemp MC, Burks JS, Santoro TJ, Coronavirus SD-induced immunoregulatory disturbances in a murine model of demyelination. Advances in experimental medicine and biology 218, 441–447 (1987). [DOI] [PubMed] [Google Scholar]
- 61.Sturgill GM, Pauer GJ, Bala E, Simpson E, Yaniglos SS, Crabb JW, et al. , Mutation screen of the cone-specific gene, CLUL1, in 376 patients with age-related macular degeneration. Ophthalmic genetics 27, 151–155 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Gomez-Ferreria MA, Rath U, Buster DW, Chanda SK, Caldwell JS, Rines DR, Sharp DJ, Human Cep192 is required for mitotic centrosome and spindle assembly. Curr Biol 17, 1960–1966 (2007). [DOI] [PubMed] [Google Scholar]
- 63.Schwenk J, Harmel N, Brechet A, Zolles G, Berkefeld H, Muller CS, Wet al., High-resolution proteomics unravel architecture and molecular diversity of native AMPA receptor complexes. Neuron 74, 621–633 (2012). [DOI] [PubMed] [Google Scholar]
- 64.Sanders SJ, He X, Willsey AJ, Ercan-Sencicek AG, Samocha KE, Cicek AE, et al. , Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci. Neuron 87, 1215–1233 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Agarwal V, Bell GW, Nam JW, Bartel DP, Predicting effective microRNA target sites in mammalian mRNAs. Elife 4, (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Jiang Q, Wang Y, Hao Y, Juan L, Teng M, Zhang X, et al. , miR2Disease: a manually curated database for microRNA deregulation in human disease. Nucleic Acids Res. 37, D98–104 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Chou CH, Shrestha S, Yang CD, Chang NW, Lin YL, Liao KW, et al. , miRTarBase update 2018: a resource for experimentally validated microRNA-target interactions. Nucleic Acids Res. 46, D296–D302 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Arlotta P, Molyneaux BJ, Jabaudon D, Yoshida Y, Macklis JD, Ctip2 controls the differentiation of medium spiny neurons and the establishment of the cellular architecture of the striatum. J. Neurosci. 28, 622–632 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Zhao C, Li X, Han B, You Z, Qu L, Liu C, Song J, Lian L, Yang N, Gga-miR-219b targeting BCL11B suppresses proliferation, migration and invasion of Marek’s disease tumor cell MSB1. Scientific reports 7, 4247 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Otani T, Marchetto MC, Gage FH, Simons BD, Livesey FJ, 2D and 3D Stem Cell Models of Primate Cortical Development Identify Species-Specific Differences in Progenitor Behavior Contributing to Brain Size. Cell Stem Cell 18, 467–480 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Domazet-Loso T, Tautz D, A phylogenetically based transcriptome age index mirrors ontogenetic divergence patterns. Nature 468, 815–818 (2010). [DOI] [PubMed] [Google Scholar]
- 72.Kalinka AT, Varga KM, Gerrard DT, Preibisch S, Corcoran DL, Jarrells J, Ohler U, Bergman CM, Tomancak P, Gene expression divergence recapitulates the developmental hourglass model. Nature 468, 811–814 (2010). [DOI] [PubMed] [Google Scholar]
- 73.Zhao H, Tu Z, Xu H, Yan S, Yan H, Zheng Y, et al. , Altered neurogenesis and disrupted expression of synaptic proteins in prefrontal cortex of SHANK3-deficient non-human primate. Cell Res. 27, 1293–1297 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Saleem KS, Logothetis N, A combined MRI and histology atlas of the rhesus monkey brain in stereotaxic coordinates. (Academic, London; Burlington, MA, 2007), pp. ix, 326 p. [Google Scholar]
- 75.Jiang L, Schlesinger F, Davis CA, Zhang Y, Li R, Salit M, Gingeras TR, Oliver B, Synthetic spike-in standards for RNA-seq experiments. Genome Res. 21, 1543–1551 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR, STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Harrow J, Denoeud F, Frankish A, Reymond A, Chen C-K, Chrast J, Lagarde J, et al. , GENCODE: producing a reference annotation for ENCODE. Genome Biol. 7 Suppl 1, 4 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D, The human genome browser at UCSC. Genome Res 12, 996–1006 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Liao Y, Smyth GK, Shi W, featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014);. [DOI] [PubMed] [Google Scholar]
- 80.Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B, Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Methods 5, 621–628 (2008). [DOI] [PubMed] [Google Scholar]
- 81.Butler A, Hoffman P, Smibert P, Papalexi E, Satija R, Integrating single-cell transcriptomic data across different conditions, technologies, and species. Nat Biotechnol 36, 411–420 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Efroni I, Ip PL, Nawy T, Mello A, Birnbaum KD, Quantification of cell identity from single-cell gene expression profiles. Genome Biol. 16, 9 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Friedman J, Hastie T, Tibshirani R, Regularization Paths for Generalized Linear Models via Coordinate Descent. Journal of statistical software 33, 1–22 (2010). [PMC free article] [PubMed] [Google Scholar]
- 84.He Z, Bammann H, Han D, Xie G, Khaitovich P, Conserved expression of lincRNA during human and macaque prefrontal cortex development and maturation. Rna 20, 1103–1111 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Gaujoux R, Seoighe C, A flexible R package for nonnegative matrix factorization. BMC Bioinformatics 11, 367 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Huminiecki L, Lloyd AT, Wolfe KH, Congruence of tissue expression profiles from Gene Expression Atlas, SAGEmap and TissueInfo databases. BMC Genomics 4, 31 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Benjamini Y, Drai D, Elmer G, Kafkafi N, Golani I, Controlling the false discovery rate in behavior genetics research. Behav Brain Res 125, 279–284 (2001). [DOI] [PubMed] [Google Scholar]
- 88.Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, et al. , Robust enumeration of cell subsets from tissue expression profiles. Nat Methods 12, 453–457 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Rasmussen CE, Williams CKI, Gaussian processes for machine learning Adaptive computation and machine learning (MIT Press, Cambridge, Mass., 2006), pp. xviii, 248 p. [Google Scholar]
- 90.Smedley D, Haider S, Durinck S, Pandini L, Provero P, Allen J, et al. , The BioMart community portal: an innovative alternative to large, centralized data repositories. Nucleic Acids Res. 43, W589–598 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Ezkurdia I, Juan D, Rodriguez JM, Frankish A, Diekhans M, Harrow J, Vazquez J, Valencia A, Tress ML, Multiple evidence strands suggest that there may be as few as 19,000 human protein-coding genes. Hum. Mol. Genet. 23, 5866–5878 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Langfelder P, Horvath S, WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics 9, 559 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Langfelder P, Zhang B, Horvath S, Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R. Bioinformatics 24, 719–720 (2008). [DOI] [PubMed] [Google Scholar]
- 94.Csardi G, Nepusz T, The igraph software package for complex newtwork research. Interjournal 1965, (2006). [Google Scholar]
- 95.Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, et al. , Simple combinations of lineage-determining transcription factors prime cis-regulatory elements required for macrophage and B cell identities. Mol. Cell 38, 576–589 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Anders S, Huber W, Differential expression analysis for sequence count data. Genome Biol. 11, 106 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Reyes A, Anders S, Weatheritt RJ, Gibson TJ, Steinmetz LM, Huber W, Drift and conservation of differential exon usage across tissues in primate species. Proc. Natl. Acad. Sci. U.S.A. 110, 15377–15382 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Love MI, Huber W, Anders S, Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.







