Abstract
Little is known about circular RNAs (circRNAs) in specific brain cells and human neuropsychiatric disease. Here, we systematically identify over 11,039 circRNAs expressed in vulnerable dopamine and pyramidal neurons laser-captured from 190 human brains and non-neuronal cells using ultra-deep, total RNA sequencing. 1526 and 3308 circRNAs are custom-tailored to the cell identity of dopamine and pyramidal neurons and enriched in synapse pathways. 29% of Parkinson’s and 12% of Alzheimer’s disease-associated genes produced validated circRNAs. circDNAJC6, which is transcribed from a juvenile-onset Parkinson’s gene, is already dysregulated during prodromal, onset stages of common Parkinson’s disease neuropathology. Globally, addiction-associated genes preferentially produce circRNAs in dopamine neurons, autism-associated genes in pyramidal neurons, and cancers in non-neuronal cells. This study shows that circular RNAs in the human brain are tailored to neuron identity and implicate circRNA-regulated synaptic specialization in neuropsychiatric diseases.
Subject terms: Neurodegenerative diseases, Transcriptomics, Genome informatics
Dopamine neurons control movements while pyramidal neurons regulate memory and language. Here the authors show that circular RNAs production in these neurons appears tailored to neuron identity and genetically linked to neuropsychiatric disease such as Parkinson’s and Alzheimer’s disease.
Introduction
Initially cast aside as junk, circular RNAs are reemerging as an expanding, largely unexplored class of RNAs1–3. circRNAs are prominently enriched in synapses and brain, and linked to neuronal development4,5 and aging6,7. Produced by the “back-splicing” of a downstream splice donor site to an upstream splice acceptor site, circRNAs have been linked to peripheral diseases, particularly cancer, but their role in brain health, is largely unexplored with initial clues pointing at a role in neurodegenerative diseases8–10 and psychiatric diseases11. circRNA lack a polyadenylated tail and are considerably less prone to exonuclease-mediated degradation than mRNAs12. The stable circular transcripts have half-lives (18.8 to 23.7 h) that are at least 2.5 times longer than the median half-life of their linear counterparts (4.0 to 7.4 h)13,14 and these properties may be important for their accumulation in terminally differentiated neurons.
High-performing neurons in the human brain are functionally and morphologically specialized to perform essential computation. Dopamine neurons in the midbrain control movements, mood, and motivation, while pyramidal neurons in the temporal cortex play important roles in memory and language. Intriguingly, dopamine neurons are preferentially lost in Parkinson’s disease15, while pyramidal neurons are preferentially affected in Alzheimer’s disease16. This raises a central question: How does the human genome program neuronal diversity? Could it be that brain-enriched circRNAs are tailored to neuron identity and contribute to their specialized synapses and disease vulnerability? Previous studies have examined circRNAs mostly in bulk tissues and lacked cellular resolution5,8, and standard single-cell sequencing methods have been limited to linear and in many cases, polyadenylated RNAs. Intriguingly, one recent study suggested that circRNAs may display cell-specific expression patterns in inhibitory and excitatory neurons17 by analyzing public full-length, single-cell RNAseq data comprising pilot human brain (e.g. glioblastoma) and organoid samples. Here we provide a comprehensive inventory of circRNAs in two prototypical types of brain cells—dopamine and pyramidal neurons— using laser-capture RNA sequencing that combines ultra-deep sequencing of polyadenylated and non-polyadenylated RNA with cell-type enrichment afforded by light microscopy18. We reveal that circRNA production is finely tuned to a cell’s identity and prominent from Parkinson’s, Alzheimer’s disease, and other neuropsychiatric disease loci.
Results
Identification of 11,039 circRNAs with high confidence in human brain neurons
To systematically profile the transcriptome of human brain neurons, we have developed laser-capture RNA sequencing (lcRNAseq) to deeply sequence ribo-depleted total RNAs from laser-captured neurons for human post-mortem brains in the BRAINcode project18. Compared to most brain RNAseq studies, our method provides ultra-deep sequencing depth and reveals low-abundance transcripts as documented by our previous identification of transcribed non-coding elements (TNE)-defined putative enhancer RNAs (eRNAs)18. lcRNAseq captures total RNAs without limitation to polyA+ mRNAs, allowing us to detect non-polyadenylated RNAs such as circRNAs. Cell type enrichment is achieved through laser-capture microdissection. In this study, we doubled the sample size (n = 212) by including not just more healthy controls, but also brains from patients with Parkinson’s disease (PD) and Alzheimer’s disease (AD), the two most common neurodegenerative disorders (Supplementary Fig. 1a). After stringent quality control (Supplementary Fig. 1), 197 samples were available for downstream analysis (Fig. 1a, b). In brief, dopaminergic neurons (DA) from the midbrain substantia nigra pars compacta of 104 high-quality human brains (DA; healthy contols (HC): n = 59; prodromal PD-associated Lewy body neuropathology (ILB): n = 27; PD: n = 18); pyramidal neurons from layers V/VI of the middle temporal gyrus of 83 brains (TCPY; HC: n = 40; AD: n = 43); and pyramidal neurons from the primary motor cortex of three brains (MCPY) were laser-captured (Fig. 1b). Human fibroblasts (FB) from four individuals and peripheral blood mononuclear white cells from three individuals (PBMC) were included as non-CNS cell types using the same pipeline. On average, we obtained 132 million RNAseq reads per sample, and intriguingly, ~20% of the RNAseq reads remained unmapped using a standard linear aligner (see Supplementary Data 1).
Recent development in bioinformatics algorithms allowed us to rescue many of the unmapped reads via mapping them chimerically in back-splicing order to discover novel circRNAs19,20. In this study, we first re-aligned the unmapped RNA-seq reads using Tophat-fusion21 and then called circular RNAs using circExplorer (v2.0)22 (see Methods). CircExplorer2 has been reviewed with the best overall performance balancing precision and sensitivity19,20. As in ref. 23, at least two unique back-spliced reads in overall samples were required for a circRNA to be considered for further evaluation. With this cutoff, we discovered an initial set of 111,419 circular RNAs expressed in neurons in overall 190 brain samples (Fig. 1a). circRNAs identified here were highly reproducible in replicate samples (Pearson’s r = 0.63; Supplementary Fig. 1e).
Independent validation of circRNAs with RNase R treated RNAseq experiments
To experimentally confirm circular RNAs and to reduce false positives, we next used RNase R treatment of the human brain followed by RNAseq (see Method and Supplementary Data 2). RNase R is a 3′ to 5′ exoribonuclease that digests all linear RNAs (including the tail of intron lariats), but not circular RNA structures12,24. Paired RNase R-treated and -untreated RNAseq experiments were performed in six independent samples (see Method and Supplementary Fig. 2a). We rigorously required at least 20 unique reads per circular RNA in RNase R treated samples with at least two-fold enrichment compared to untreated samples in at least one pair (Supplementary Fig. 2a). Overall, out of the 111,419 circRNAs discovered in brain neurons, 11,039 met our stringent validation criteria; 48.0% of exon-derived circRNAs (e.g., 10,845 out of 22,593) and 0.22% of circular intronic RNAs (ciRNAs; 194 out of 88,826). These circRNAs were conservatively used for downstream analyses (Fig. 1a, validated circular RNAs in brain neurons). 98.2% of the validated circular RNAs were exon-derived circRNAs, suggesting that our filtering strategy effectively removed false-positive or lowly expressed circular RNAs (e.g., intron lariats from splicing25; Supplementary Fig. 2b, c). In total, we detected 6699, 8381, and 1263 circRNAs in dopamine neurons, pyramidal neurons, and Betz cells, respectively (Fig. 1b).
Genomic characteristics of circRNAs in the brain
We observed that many genes transcribe multiple (up to 36) neuronal circRNAs (Fig. 1c). For example, the AD-associated gene PICALM produces 25 distinct circular RNA isoforms (Supplementary Fig. 2d). The number of circular RNAs per gene was linearly correlated with the number of exons of the host gene (Supplementary Fig. 2e). Circularized exons were significantly longer than the average exons in the human genome (Supplementary Fig. 2e), and had significantly less overlapping RNA binding protein (RBP) binding activity (using ENCODE eCLIP data26) than non-circularized exons (Fig. 1d). Similarly, the introns flanking circRNAs were longer; harbored significantly more repetitive elements; and had less overlapping RBP binding activity26 compared to the introns flanking non-circularized exons (Fig. 1e, f, Supplementary Fig. 2e, see Methods). The host genes generating circRNAs are relatively more conserved (Fig. 1g). These might be universal features of circRNA biogenesis as they are highly consistent with prior evidence from non-neuronal cells6,12,27–29.
Dopamine and pyramidal neurons express cell type-specific circRNAs in human brain
To delineate circRNAs specifically expressed in each cell type, we further calculated the cell-specificity score of each circRNA based on the Jensen-Shannon distance of its expression profile similar as in ref. 23. CircRNAs with a specificity score S ≥ 0.5 and mean expression > mean+s.d. of overall expression were defined as cell type-specific (see Methods). We focused our analysis of cell type-specificity on the control samples (i.e., 59 dopamine neuron samples, 43 pyramidal neuron samples, and 7 non-neuronal samples; excluding disease samples) to avoid confounding by disease state-driven changes. We identified 1526, 3308, and 4860 circRNAs preferential expressed in dopamine neurons, pyramidal neurons, and non-neuronal cells, respectively (left panel of Fig. 2a, Supplementary Data 3). Cell type-preferential expression of five dopamine neuron-specific, five pyramidal neuron-specific, and one non-neuronal cells-specific circRNA was confirmed in an independent set of substantia nigra and temporal cortex samples (e.g., the source regions for laser-captured dopamine and pyramidal neurons, respectively) using RNase R treatment and qPCR (Supplementary Fig. 3a). In total, 17 of 24 (70.8%) selected circRNAs showed consistent expression between preferential cell types and the source regions (Supplementary Data 4).
Circular RNAs rather than the linear RNAs expressed from 3532 loci defined cell diversity and identity
3532 genetic loci custom-tailored 1526 cell type-specific circRNAs to the cell identity of dopamine neurons, 3308 circRNAs to pyramidal neurons, and 4860 cell type-specific circRNAs to non-neuronal cells, respectively (i.e., cell specificity score S < 0.5 and mean expression > mean + s.d.). Figure 2a, b; Supplementary Data 3). Surprisingly, the expression profiles of linear RNAs (e.g., mRNAs and long non-coding RNAs) transcribed from these same loci did not meet criteria for cell-type specificity (Fig. 2c) (i.e., S ≥ 0.5 and mean expression > mean + s.d.) and cell-specificity scores of the linear transcripts were significantly lower than that of circRNAs (Fig. 2d). We observed the same trend even only considering the host gene with single circRNA (Supplementary Fig. 3c). Cell type-specificity of circRNAs was not explained by linear mRNA abundance from the corresponding loci. The abundance of total linear reads (i.e., spliced + unspliced reads) surrounding the back-splicing site only weakly correlated with the abundance of their associated circular reads (Spearman’s rho = 0.10, see Supplementary Fig. 2f for detail). Interestingly, the circular-to-linear ratios of circRNAs were significantly higher in neurons vs. non-neuronal cells (Mann–Whitney test, P < 2.2 × 10−16, Fig. 2e). More research is needed to clarify whether this is modulated by distinct mechanisms regulating biogenesis, stability, and turnover of the circular RNAs, of their linear cognates, or both27–29. Interestingly, our data (Fig. 2a) suggest two types of cell type-specific circRNAs. First, some genetic host loci focus production of cell type-specific circRNAs onto a singular specific cell type (one locus producing one or more cell type-specific circRNAs specific to the same one cell type): These loci turn on circRNA production in one cell type and turn it off in the others e.g., 345 loci produce 400 circRNAs exclusively in dopamine neurons, 730 loci expressed 1046 circRNAs exclusively in pyramidal neurons, and 1191 loci expressed 2222 circRNAs exclusively in non-neuronal cells. Second, some loci precisely tailor the production of cell type-specific circRNAs for multiple cell types (e.g., via alternative back-splicing or combinations of different sets of exons) to the requirements of multiple types of cells (one locus producing multiple cell type-specific circRNAs for multiple cell types). Indeed, 306 super-host loci (Fig. 2a) tailored a diverse circRNAs portfolio specifically to dopamine neurons, pyramidal neurons, and non-neuronal cells via alternative back-splicing. These 306 super host loci expressed 478 distinct dopamine neuron-specific, 751 pyramidal neuron-specific, and 988 non-neuronal circRNA back-spliced variants (see Fig. 2a). RNA Binding Protein activity superimposed from the ENCODE eCLIP track26 highlighted subsets of RBPs statistically associated with these types of cell type-specific circRNAs (Supplementary Fig. 4). These super-host loci included the LKS/RAB6-interacting/CAST family member 1 (ERC1) locus. ERC1 binds to the PD-linked RIM proteins30,31, to facilitate the docking of synaptic vesicles at the presynaptic active zone. The ERC1 locus expresses two distinct circRNAs, which we termed circERC1-1 and circERC1-2, where circERC1-1 is circularized between exon 17 and exon 19 and circERC1-2 is alternatively back-spliced between exon 16 and exon 19 (see Fig. 2f). circERC1-1 was preferentially expressed in dopamine neurons (Fig. 2f, g); circERC1-2 was preferentially expressed in pyramidal neurons; neither was expressed in non-neuronal cells. Cell type-specific divergence of circERC1-1 and circERC1-2 was supported by qPCR with RNase R (Fig. 2g). circERC1-1 abundance was higher in substantia nigra (the region containing dopamine neurons) compared to the temporal cortex (the region containing pyramidal neurons). Neither was meaningfully expressed in non-neuronal cells, e.g., fibroblasts and white blood cells.
Thus, circRNA diversity provides finely tuned, cell type-specific information that is not explained by the corresponding linear RNAs from the same loci.
circRNAs are predominantly expressed from synapse machinery (Fig. 3)
Unbiased pathway analysis of host loci revealed that cell type-specific neuronal circRNA production was clustered around synapse function and neuronal projection loci (Fig. 3), while in non-neuronal cells circRNA production clustered around cell cycle loci (Fig. 3). Thus, the host genes producing cell type-specific circRNAs in dopamine neurons vs. pyramidal neurons are actually representing similar synapse and neuronal projection-related pathways. While it is reasonable to infer clues for candidate pathway membership from host gene enrichment analyses, careful experimental evaluation of individual circRNAs per se will be needed to substantiate their mechanistic roles in synapse specialization and disease. See Supplementary Data 5 for the full list of enriched functional terms.
CircRNAs are linked to neuropsychiatric disease (Fig. 4)
We observed prominent expression of circRNAs from loci linked to neurodegenerative diseases (Fig. 4a). 80.2% of GWAS-derived AD candidate genes implicated by ref. 32 (174 of 217) produced one or more filtered circRNAs including SORL133, MARK4, PICALM, PSEN1, and APP (see Supplementary Fig. 5). 12.4% of AD GWAS genes (27 of 217) produced at least one rigorously validated circRNAs. Out of 109 genes implicated by the latest PD GWAS30, 96 (88.1%) expressed one or more filtered circRNA transcripts and 32 (29.4%) expressed at least one validated circRNAs (Fig. 4a). Moreover, 99% of the AD-associated GWAS SNPs32 (2334/2357) and 96% (6751/7057) of the PD-associated GWAS SNPs30 were located in the proximity (i.e., within 1 Mbp) of a circRNA. These included pleomorphic genes linked to both Mendelian and sporadic forms of PD, e.g., GBA, SNCA, RIMS1, RIMS231, and VPS13C (see Supplementary Fig. 5). For example, under the peak of AD GWAS SNP rs867611 (P = 2.19 × 10−18), we detected 16 circRNAs actively expressed in the pyramidal neuron samples from the PICALM gene (Fig. 4b). Interestingly, the most abundantly expressed circRNA in both dopamine neurons and pyramidal neurons was CDR1as with over 70,000 total back-spliced reads in the neuronal samples. Previous studies showed that human CDR1as may serve as “sink” for miR-7 which represses the PD gene SNCA34.
Unbiased enrichment analysis using the gene-disease annotations defined in DisGeNET35 (updated for PD and AD GWAS-implicated susceptibility genes from refs. 30,32) showed that 20 neuropsychiatric diseases were statistically significantly enriched in circRNA loci compared to only three non-CNS diseases (Fig. 4c, Supplementary Fig. 2g, Supplementary Data 6) out of a total of 4638 human diseases and traits. Enrichment analysis showed a diverse spectrum of links between cellular circRNA loci and disease. PD- and intellectual disability-associated loci were enriched in circRNAs expressed in all three cell types (Fig. 4c). Examples of such loci include circRNAs from KANSL1 gene (Fig. 4d), which was linked to PD by us18 and others36 as well as to neurodevelopmental disorders37,38. The enrichment of PD in circRNA-producing loci from all cell types is consistent with the increasingly appreciated roles of cortical neurons31 and immune cells39,40 in the pathobiology of PD, beyond classic dopaminergic neurodegeneration. Ten addiction traits related to drug and substance abuse were enriched in circRNAs active in dopamine neurons (see an example of circRNAs from addiction-related genes PDE4B41,42 and FTO41,43 in Fig. 4e and Supplementary Fig. 5). Autism and bipolar disease were enriched in circRNA loci actively expressed in pyramidal neurons. By contrast, the oncologic diseases leukemia and adenocarcinoma of the large intestine were enriched in circRNAs specific to non-neuronal cells (see an example of circRNAs from the cancer-associated gene ATM44 in Fig. 4f). Surprisingly, atrial fibrillation, a common cardiac arrhythmia, was enriched in circRNA loci active in pyramidal neurons pointing to a potential role of synaptic plasticity in cardiac innervation45.
We confirmed and evaluated circRNA expression from two of the implicated familial disease loci—DNAJC6 and PSEN1—using a second method, qPCR with RNase R treatment. circPSEN1-2 is one of several circRNAs back-spliced from 4 exons of the familial AD gene PSEN1 (Supplementary Figs. 3, 5). It was highly expressed in brain neurons with over 100 unique back-splice junction reads. qPCR confirmed that circPSEN1-2 is >20-fold enriched in both temporal cortex and substantia nigra after RNase R treatment (Supplementary Fig. 3).
Mutations in DNAJC6 cause juvenile-onset, dopamine responsive, autosomal recessive PD46–48. DNAJC6 plays a key role in uncoating the clathrin-coated synaptic vesicles49. circDNAJC6 produced from the locus was expressed highly in dopamine neurons, moderately in pyramidal neurons, but not in non-neuronal by lcRNAseq. Consistent with this, qPCR and RNase R treatment confirmed high expression in the substantia nigra, moderate expression in the temporal cortex, and minimal expression in the non-neuronal cells (Supplementary Fig. 3). Functionally, expression of circDNAJC6 and several other synapse-related circRNAs (labeled in orange in Fig. 4g) was altered at the earliest, prodromal Parkinson’s disease stages in brainstem-confined Lewy body Braak stages 1–3 (P = 0.014 by Wald test, with covariates of sex, age, PMI, and RIN adjusted, Fig. 4g, Supplementary Data 7). By contrast, expression of the linear mRNA DNAJC6 was not affected (bottom right panel of Fig. 4g).
We also explored whether circRNAs expression changes during with disease progression. In PD-vulnerable dopamine neurons from 95 brains (with available neuropathology stages), 26 circRNAs (including DNAJC6) showed suggestive associations with progressive alpha-synuclein-positive Lewy body burden—spanning brainstem-predominant, midbrain, and cortical stages of PD (e.g., based on the Unified Lewy Body Staging System50 of 0, I, IIa, III, and IV, see Supplementary Data 1). Furthermore, 51 circRNAs expressed in AD-vulnerable pyramidal neurons had suggestive associations with neurofibrillary tangle Braak stages with nominal P values ≤ 0.05 (Supplementary Fig. 6c).
Discussion
Human brain cells custom tailor thousands of circRNAs to fit their cell identity. For many loci, which lacked diversity in linear mRNA expression, the corresponding “companion” circRNAs were cell-type specific. Dopamine and pyramidal neurons prominently expressed circRNAs from synaptic genes and non-neuronal cells prominently produced circRNAs from loci involved in cell cycle regulation.
The sheer number and diversity of circRNAs add a new class of components to the growing inventory of non-coding RNAs actively expressed in the human brain disease18,51,52, including enhancer RNAs18, microRNAs33, and long-non-coding RNAs51. Indeed, there is reason to hypothesize that this expanding regulatory network of non-coding RNAs may be a major contributor to the exceptional diversity and performance of human brain cells that cannot be explained by the surprisingly small number of protein-coding genes in the human genome which are similar in humans and worms52. The fact that circRNAs are predominantly expressed from synapse loci in human dopamine and pyramidal neurons raises the possibility that they encode as yet unknow important functions in synaptic functions of the human neuronal networks controlling quintessential human experiences: fine motor movements, motivation, reward, and higher cortical functions. This expands on a postulated role for circRNAs in synaptosomes4,5 and synaptic plasticity4 in animals. What could that function be? It could be regulation of transcription consistent with the literature2,4,25,53,54. Alternatively, circRNAs—similar to some linear RNAs— might be targeted to the synapse as local regulatory switches that control the translation or assembly of highly specialized synaptic machinery for each type of neuron55. Their circular conformation may confer functional advantages (e.g., longer half-life) or suit transport along axons.
Importantly, 61% of all synaptic circRNAs were linked to brain disorders. Synaptic dysfunction—synaptopathy—may be one of the earliest defects in these neurodegenerative and neuropsychiatric diseases56–61. For example, in both toxic and genetic animal models of PD, synaptic plasticity is disrupted during the early phases of dopaminergic dysfunction, much earlier than nigral cell death and the clinical manifestation of motor features57. Disease-linked circRNAs expression showed, in part, evidence of cell type bias: Addiction-associated genes prominently express circRNAs in dopamine neurons, autism genes express circRNAs predominantly in pyramidal neurons, and interestingly, PD GWAS-associated loci express circRNAs highly in non-neuronal cells as well as in neurons. Based on these and prior data4,5,8,11,62, we hypothesize that circRNAs may serve as finely tuned, special-purpose RNA vehicles for the assembly of cell type-specific synapses and that their dysregulation may contribute to synaptopathies. The mechanisms controlling the biogenesis of cell type-specific circRNAs could involve subsets of RNA binding proteins (e.g. refs. 27–29 and Supplementary Fig. 4). Much more work will be required to fully elucidate the kinetics and relation of circular and cognate linear RNA biogenesis, the involved regulators, and to reveal how this complex RNA machinery specifies neuron identity and synapses.
More generally, this study provides a unique catalog of circRNAs in two major types of human brain neurons that will be generally useful for decoding genome function in neuropsychiatric disease and for advancing the burgeoning field of RNA medicines and diagnostics63–65.
Methods
Sample collection and processing
In our BRAINcode project, we collected 190 high-quality, frozen postmortem human brain samples identified from Banner Sun Health Research Institute, Brain Tissue Center at Massachusetts General Hospital, Harvard Brain Tissue Resource Center at McLean Hospital, University of Kentucky ADC Tissue Bank, the University of Maryland Brain and Tissue Bank, Pacific Northwest Dementia and Aging Neuropathology Group (PANDA) at University of Washington Medicine Center, and Neurological Foundation of New Zealand Human Brain Bank. A subset of 96 of these samples were previously analyzed in ref. 18. Detailed quality measures and demographic characteristics of these high-quality, frozen postmortem samples are shown in Supplementary Data 1. Median RNA integrity numbers (RIN) were 7.7, 7.4, and 7.3 for substantia nigra samples (used to laser-capture dopamine neurons), temporal cortex (used to laser-capture temporal cortex pyramidal neurons), and motor cortex samples (used to laser-capture Betz cells) indicating high RNA quality. Median post-mortem intervals were exceptionally short with 3 h for substantia nigra, 3 h for temporal cortex, and 13 h for motor cortex samples further consistent with the highest sample quality (Supplementary Data 1).
The 190 brain samples are composed of 102 samples from healthy control (“HC”) subjects, 27 from incidental Lewy body cases (“ILB”), 18 from Parkinson’s disease cases (“PD”), and 43 from Alzheimer’s disease cases (“AD”). Healthy control subjects were defined with the following stringent inclusion and exclusion criteria. Inclusion criteria: (1) absence of clinical or neuropathological diagnosis of a neurodegenerative disease e.g., Parkinson’s disease according to the UKPDBB criteria66, Alzheimer’s disease according to NIA-Reagan criteria67, dementia with Lewy bodies by revised consensus criteria68. (2) PMI ≤ 48 h; (3) RIN69 ≥ 6.0 by Agilent Bioanalyzer (good RNA integrity); (4) visible ribosomal peaks on the electropherogram. Exclusion criteria were: (1) a primary intracerebral event as the cause of death; (2) brain tumor (except incidental meningiomas); (3) systemic disorders likely to cause chronic brain damage. Incidental Lewy body cases are those not meeting clinical diagnostic criteria for PD or other neurodegenerative diseases but found with Lewy body inclusion at autopsy. ILB is widely considered a preclinical stage of PD70, providing a unique opportunity to investigate preclinical molecular changes of PD. We also included seven non-brain tissue samples as controls, including four samples of peripheral blood mononuclear cell (PBMC) and three fibroblasts (FB), provided by Harvard Biomarker Study and Coriell Institute.
Our research complies with all relevant ethical regulations. This study was approved by the Institutional Review Board of Brigham and Women’s Hospital.
Laser-capture RNA-seq and RNase-treated RNA-seq
We applied the laser-capture RNA-seq (lcRNAseq) method to profile the total RNAs of brain neurons as we previously reported18. In brief, laser-capture microdissection was performed on the brain samples to extract neurons from different brain regions. For each substantia nigra sample, 300–800 dopamine neurons, readily visualized in HistoGene-stained frozen sections based on hallmark neuromelanin granules were laser-captured using the Arcturus Veritas Microdissection System (Applied Biosystems). For each temporal cortex (middle gyrus) or motor cortex sample, about 300 pyramidal neurons were outlined in layers V/VI by their characteristic size, shape, and location in HistoGene-stained frozen sections and laser-captured using the Arcturus Veritas Microdissection System (Applied Biosystems). Total RNA was isolated and treated with DNase (Qiagen) using the Arcturus Picopure method (Applied Biosystems), yielding approximately 7–8 ng RNA per subject. Total RNA was linearly amplified into 5–10 µg of double-stranded cDNA using the validated, precise, isothermal RNA amplification method implemented in the Ovation RNA-seq System V2 (NuGen)71,72. Unlike PCR-based methods that exponentially replicate original transcript and copies, with this method only the original transcripts are linearly replicated71,72, and amplification is initiated at the 3’ end as well as randomly thus allowing for amplification of both mRNA and non-polyadenylated transcripts71,72. Sequencing libraries were generated from 500 ng of the double-stranded (ds) cDNA using the TruSeq RNA Library Prep Kit v2 (Illumina) according to the manufacturer’s protocol. The cDNA was fragmented, and end repair, A-tailing, adapter ligation was performed for library construction. Sequencing library quality and quantity control was performed using the Agilent DNA High Sensitivity Chip and qPCR quantification, respectively. Libraries were sequenced (50 or 75 cycles, paired-end) on an Illumina HiSeq 2000 and 2500 at the Harvard Partners Core.
To enrich the circRNAs, we also performed paired RNase R-treated and mock RNA-seq for 6 bulk tissues. To obtain sufficient amount of input RNAs, we mixed 1–6 samples from the same tissue type (see Supplementary Data 2). The total RNAs (10 μg) were treated with the Ribo-Zero Gold kit (Illumina) to remove rRNA before linear RNA digestion. The rRNA-depleted samples were purified using a modified RNeasy MinElute Cleanup kit (QIAGEN) method according to the Illumina’s recommendation. After purification, the rRNA-depleted RNAs were divided into two equal parts: to one was added 2 μl 10× RNase R Reaction Buffer and 0.1 μl RNase R (20 U/μl, RNase R, Epicentre), the other received mock treatment with 2 μl 10× RNase R Reaction Buffer and 0.1 μl RNase-free water. We incubated the tubes at 37 °C for 30 min. After the digestion, the RNase R treated and mock treated RNAs were purified using RNeasy MinElute Cleanup kit (QIAGEN). Before the RNA-seq library preparation, about 10 ng of each the RNase digested and mock RNA samples were linearly amplified to yield enough amount of double-stranded cDNA with Ovation RNA-Seq System V2 kit (NuGen) according to the manufacture’s instruction. After the linear amplification, ~600 ng double-stranded cDNA samples were sonicated and applied to prepare the sequencing library using TruSeq RNA sample Preparation v2 kit (Illumina) following the manufacturer’s instructions. Detailed characteristics of the samples used for RNase R experiment are shown in Supplementary Data 2.
RNA sequencing data analysis pipeline
RNA-seq raw files in FASTQ format were processed in a customized pipeline. For each sample, we first filtered out reads that failed vendor check or are too short (<15nt) after removing the low-quality ends or possible adaptor contamination by using fastq-mcf with options of “-t 0 –x 10 –l 15 –w 4 –q 10 –u”. We then checked the quality using FastQC and generated k-mer profile using kpal73 for the remaining reads. Reads were then mapped to the human genome (GRCh37/hg19) using Tophat74 (v2.0.8) by allowing up to 2 mismatches and 100 multiple hits. Reads mapped to ribosomal RNAs or to the mitochondrial genome were excluded from downstream analysis. Gene expression levels were quantified using FPKM (Fragments Per Kilobase of transcript per Million mapped reads). Only uniquely mapped reads were used to estimate FPKM. To calculate normalized FPKM, we first ran Cuffquant75 (v2.2.1) with default arguments for genes annotated in GENCODE (v19), and then ran Cuffnorm with parameters of “-total-hits-norm –library-norm-method quartile” on the CBX files generated from Cuffquant.
Sample QC based on RNA-seq data
We performed sample QC similar to ‘t Hoen PA et al.76. In brief, we ran k-mer profiling for filtered reads using kpal73 and calculated the median profile distance for each sample. Samples with distances clearly different from the rest samples were marked as outliers (Supplementary Fig. 1c). We also calculated pair-wise Spearman correlations of gene expression quantification across samples and measured the outlier via hierarchical clustering (Supplementary Fig. 1b). Moreover, we tested for concordance between reported clinical sex and sex indicated by the expression of the female-specific XIST gene and male-specific Y-chromosome gene RPS4Y1 (Supplementary Fig. 1d). Samples from the first batch with a relatively low sequencing depth were also excluded. In addition to these samples used for cell type-specific transcriptome analyses, various additional control samples were analyzed (e.g., amplification controls, tissue homogenate), and technical replicates (Supplementary Fig. 1e). In the end, 197 out of 221 samples passed QC and are used for downstream analysis (Supplementary Fig. 1a).
Calling circular RNAs
We first extracted the chimerically aligned RNAseq reads using Tophat-fusion21 and then called circRNAs using circExplorer (v2.0)22. CircExplorer has been reviewed with the best overall performance in balance of precision and sensitivity19,20. To identify circRNAs that are not back-spliced from a canonical exon border, we ran circExplorer2 with a customized gene annotation file and the “--low-confidence” option. As in ref. 23, circRNAs with at least two unique back-spliced reads in overall samples are categorized as “being expressed” and kept as circRNA candidates. CircRNA expression is quantified by normalized reads per million (RPM) at the back-splicing sites for each sample.
Cell specificity
The specificity score S is defined as , where JSD is the Jensen–Shannon distance, is the expression profile of a given circRNA c expressed as a density of , and is the unit vector of ‘perfect expression’ in a particular cell type i (e.g. [1, 0, 0, 0, 0] for i = 1). Like the ref. 23, a circRNA is defined as cell-type specific if its specificity score S ≥ 0.5 and mean expression in a cell type is larger than the mean + one standard deviation (s.d.) of the overall expression.
circRNA-producing gene function enrichment analysis
Functional enrichment analysis of circRNA host genes was performed using the C5 gene sets (GO terms) implemented in the MSigDB database (version v7.2) using Fisher’s exact test. Each gene set contains genes annotated to the same GO term. For each gene set, the Fisher’s exact test was performed for k, K, N−K, n; where k is the number of circRNA host genes that are part of a GO term gene set; K is the total number of genes annotated to the same GO term gene set; N is the total number of all annotated human genes in GENCODE (v19); and n is the number of genes in the query set. The top 10 GO terms in each GO category (BP, MF, and CC) enriched in these circRNA host genes are further slimmed using a similar algorithm as the clusterProfile package77. The slimmed results are shown in Supplementary Data 5 (all with an FDR q value < 0.05).
We also evaluated whether there is a specific enrichment among circRNAs in genes associated with brain disorders. We used diseases in MeSH C10 (Nervous System Diseases) or F03 (Mental Disorders) for brain disorders, and associated diseases to genes using GenDisNet database. The disease-gene association was extracted from DisGeNet35 (http://www.disgenet.org/) filtered with GDA score > 0.1. For all annotated protein-coding genes, we performed Fisher’s exact test based on whether a gene is associated with a brain disorder and a gene hosts a circRNA (Supplementary Data 6).
AD and PD risk genes
The GWAS-derived AD risk genes used in this study were extracted from Jansen et al.32, where they defined AD potential causal genes with four gene-mapping strategies (nearest, eQTL, chromatin interaction, and GWGAS). Gene symbols were extracted from their Tables S13, S18 and there are 217 uniquely mapped genes. The GWAS-derived PD risk genes used in this study were extracted from Nalls et al.30, where they defined PD potential causal genes based on nearest position and QTL. Gene symbols were extracted from their Table S2 and there are 109 uniquely mapped genes.
Differentially expressed circRNAs
Similar to Dube et al.8, we first aggregated all circular RNA reads count from one gene into a gene-based count matrix, and then used that count matrix as input to perform differential expression analysis using the DEseq278 framework. Covariates of sex, age, PMI (post-mortem interval), and RIN (RNA integrity number) were included in the negative binomial (NB) based generalized linear model, and P values from the Wald test with covariates adjusted were reported.
Confirming circRNA expression by qPCR
Quantitative PCR was performed using SYBR Green Master Mix (Thermo Fisher) on an ABI 7900HT instrument (Applied Biosystems). The divergent primer pairs flanking the back-splice sit were designed using Prime3 online primer design web tool (http://bioinfo.ut.ee/primer3-0.4.0/primer3/) and are shown in Supplementary Data 4. To confirm the expression of lcRNAseq-derived circRNAs in dopamine neurons and pyramidal neurons, relative abundances of target circRNAs were evaluated by qPCR in human substantia nigra or temporal cortex samples, as well as in human fibroblast and PBMC samples (shown in Supplementary Fig. 3). The human ubiquitin gene UBC was used as a reference to normalize RNA loading. Control samples lacking template and those lacking reverse transcriptase showed virtually no expression of these target circRNAs indicating that DNA contamination did not materially influence results. Expression values were analyzed using the comparative threshold cycle method63. All quantitative PCR reactions were conducted in triplicate. Equal amplification efficiencies for target and reference transcripts were confirmed using melting curve analysis.
Data collection, statistical analysis, and data presentation
Sample sizes were based on the total number of available high-quality brain samples that met inclusion and exclusion criteria. No statistical methods were used to pre-determine sample sizes. No randomization of data collection was performed in this study. Brains were selected based on pre-defined inclusion and exclusion criteria (see above). Sample outliers were rationally identified as described in the Section on Sample QC based on RNA-seq data. Data were not excluded based on arbitrary post-hoc considerations. Data collection and analysis were not performed blinded to the conditions of the experiments.
R (The R Foundation for Statistical Computing, Vienna, Austria) was used for other statistical tests. Box plots were used to present multi-groups comparison. In all box plots, center line represents the median value; box limits, first and third quartiles; whiskers, the most extreme data point which is no more than 1.5 times the interquartile range from the box.
Statistical tests used in each figure: Fig. 1d–f, two-sided Wilcox test; Fig. 2e, two-sided Mann–Whitney test; Figs. 3, 4c, two-sided Fisher’s exact test; Fig. 4g, two-sided Wald test; Supplementary Fig. 1e, f, Pearson’s correlation test; Supplementary Fig. 2e, two-sided Wilcox test; Supplementary Fig. 2f, Spearman’s rank correlation test; Supplementary Fig. 2g, two-sided Fisher’s exact test; Supplementary Fig. 2h, two-sided Student’s t test; Supplementary Fig. 4b, two-sided Wilcox test; Supplementary Fig. 4c–e, two-sided Fisher’s exact test; Supplementary Fig. 6b, d, Pearson’s correlation test; Supplementary Data 5, 6, two-sided Fisher’s exact test; Supplementary Data 7, two-sided Wald test.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Acknowledgements
This study was funded in part by NIH grant NIH R01AG057331, U01 NS082157, the U.S. Department of Defense (to C.R.S.), the American Parkinson Disease Association (APDA) Research Award (to X.D.). C.R.S.’s work is supported by NIH grants NINDS/NIA R01NS115144, U01NS095736, U01NS100603, and the American Parkinson Disease Association Center for Advanced Parkinson Research. X.D. received funding from the American Parkinson Disease Association (APDA). C.R.S and X.D.’s work was in part funded by the joint efforts of The Michael J. Fox Foundation for Parkinson’s Research (MJFF) and the Aligning Science Across Parkinson’s (ASAP) initiative. MJFF administers the grant [ASAP-000301] on behalf of ASAP and itself. This work was further enabled by NINDS U24 NS072026 National Brain and Tissue Resource for Parkinson’s Disease and Related Disorders (to T.G.B.) and the National Institute on Aging (P30 AG19610 and P30AG072980, Arizona Alzheimer’s Disease Center. We gratefully acknowledge the Banner Sun Health Research Institute, Massachusetts Alzheimer’s Disease Research Center at Massachusetts General Hospital, Harvard Brain Tissue Resource Center at McLean Hospital, University of Kentucky ADC Tissue Bank, University of Maryland Brain and Tissue Bank, Pacific Northwest Dementia and Aging Neuropathology Group at University of Washington Medicine Center, and Neurological Foundation of New Zealand for providing human brain tissue. For the purpose of open access, the author has applied a CC BY public copyright license to all Author Accepted Manuscripts arising from this submission.
Author contributions
C.R.S. and X.D. conceived, designed, analyzed, and interpreted the study. X.D. performed bioinformatics analyses with contributions from R.B.M., T.W., A.E., and Y.B. Z.L., D.G. were responsible for laser-capture and lcRNAseq data production. Y.B., X.L., and Z.L. performed and analyzed RNase R and qPCR confirmation experiments. C.R.S. and X.D. wrote the manuscript with input from all other authors. C.R.S. supervised data production and analysis and contributed funding. T.B and G.S. neuropathologically characterized autopsy samples. M.B.F. contributed neuropathological expertise and critically reviewed results and manuscript. All authors reviewed, edited, and approved the manuscript prior to submission.
Peer review
Peer review information
Nature Communications thanks Fangqing Zhao and the other, anonymous, reviewers for their contribution to the peer review of this work. A peer review file is available.
Data availability
The data supporting the findings of this study are available from the corresponding authors upon reasonable request. The RNA-seq raw FASTQ data and normalized expression matrix of all circRNAs in this study have been deposited in GEO under accession number GSE218203. The proceeded data for this study, including a browser track hub for all circRNAs from this study, can be queried at the BRAINcode project website at http://www.humanbraincode.org through a user-friendly interface.
Code availability
Custom code associated with this study is publicly available at https://github.com/sterding/circRNA.
Competing interests
C.R.S. has served as consultant, scientific collaborator or on scientific advisory boards for Sanofi, Berg Health, Pfizer, Biogen, and has received grants from NIH, U.S. Department of Defense, APDA, ASAP, and MJFF. X.D. has received funding from NIH, APDA, and ASAP. The other authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-40348-0.
References
- 1.Salzman J, Gawad C, Wang PL, Lacayo N, Brown PO. Circular RNAs are the predominant transcript isoform from hundreds of human genes in diverse cell types. PLoS ONE. 2012;7:e30733. doi: 10.1371/journal.pone.0030733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Hansen TB, et al. Natural RNA circles function as efficient microRNA sponges. Nature. 2013;495:384–388. doi: 10.1038/nature11993. [DOI] [PubMed] [Google Scholar]
- 3.Memczak S, et al. Circular RNAs are a large class of animal RNAs with regulatory potency. Nature. 2013;495:333–338. doi: 10.1038/nature11928. [DOI] [PubMed] [Google Scholar]
- 4.You X, et al. Neural circular RNAs are derived from synaptic genes and regulated by development and plasticity. Nat. Neurosci. 2015;18:603–610. doi: 10.1038/nn.3975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Rybak-Wolf A, et al. Circular RNAs in the mammalian brain are highly abundant, conserved, and dynamically expressed. Mol. Cell. 2014;58:870–885. doi: 10.1016/j.molcel.2015.03.027. [DOI] [PubMed] [Google Scholar]
- 6.Westholm JO, et al. Genome-wide analysis of drosophila circular RNAs reveals their structural and sequence properties and age-dependent neural accumulation. Cell Rep. 2014;9:1966–1980. doi: 10.1016/j.celrep.2014.10.062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gruner H, Cortés-López M, Cooper DA, Bauer M, Miura P. CircRNA accumulation in the aging mouse brain. Sci. Rep. 2016;6:38907. doi: 10.1038/srep38907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Dube U, et al. An atlas of cortical circular RNA expression in Alzheimer disease brains demonstrates clinical and pathological associations. Nat. Neurosci. 2019;22:1903–1912. doi: 10.1038/s41593-019-0501-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Chen HH, et al. Circular RNA detection identifies circPSEN1 alterations in brain specific to autosomal dominant Alzheimer’s disease. Acta Neuropathologica Commun. 2022;10:29. doi: 10.1186/s40478-022-01328-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Hanan M, et al. A Parkinson’s disease CircRNAs Resource reveals a link between circSLC8A1 and oxidative stress. EMBO Mol. Med. 2020;12:e13551. doi: 10.15252/emmm.202013551. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Zimmerman AJ, et al. A psychiatric disease-related circular RNA controls synaptic gene expression and cognition. Mol. Psychiatry. 2020;25:2712–2727. doi: 10.1038/s41380-020-0653-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Jeck WR, Sharpless NE. Detecting and characterizing circular RNAs. Nat. Biotechnol. 2014;32:453–461. doi: 10.1038/nbt.2890. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Jeck WR, et al. Circular RNAs are abundant, conserved, and associated with ALU repeats. RNA. 2013;19:141–157. doi: 10.1261/rna.035667.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Enuka Y, et al. Circular RNAs are long-lived and display only minimal early alterations in response to a growth factor. Nucleic Acids Res. 2016;44:1370–1383. doi: 10.1093/nar/gkv1367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Bloem BR, Okun MS, Klein C. Parkinson’s disease. Lancet. 2021;397:2284–2303. doi: 10.1016/S0140-6736(21)00218-X. [DOI] [PubMed] [Google Scholar]
- 16.Deture MA, Dickson DW. The neuropathological diagnosis of Alzheimer’s disease. Mol. Neurodegeneration 2019 14:1. 2019;14:1–18. doi: 10.1186/s13024-019-0333-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wu W, Zhang J, Cao X, Cai Z, Zhao F. Exploring the cellular landscape of circular RNAs using full-length single-cell RNA sequencing. Nat. Commun. 2022;13:3242. doi: 10.1038/s41467-022-30963-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Dong X, et al. Enhancers active in dopamine neurons are a primary link between genetic variation and neuropsychiatric disease. Nat. Neurosci. 2018;21:1482–1492. doi: 10.1038/s41593-018-0223-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hansen TB, Venø MT, Damgaard CK, Kjems J. Comparison of circular RNA prediction tools. Nucleic acids Res. 2016;44:e58. doi: 10.1093/nar/gkv1458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Zeng X, Lin W, Guo M, Zou Q. A comprehensive overview and evaluation of circular RNA detection tools. PLoS Comput. Biol. 2017;13:e1005420. doi: 10.1371/journal.pcbi.1005420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kim D, Salzberg SL. TopHat-Fusion: An algorithm for discovery of novel fusion transcripts. Genome Biol. 2011;12:1–15. doi: 10.1186/gb-2011-12-8-r72. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Zhang XO, et al. Diverse alternative back-splicing and alternative splicing landscape of circular RNAs. Genome Res. 2016;26:1277–1287. doi: 10.1101/gr.202895.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Zheng Q, et al. Circular RNA profiling reveals an abundant circHIPK3 that regulates cell growth by sponging multiple miRNAs. Nat. Commun. 2016;7:11215. doi: 10.1038/ncomms11215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Suzuki H. Characterization of RNase R-digested cellular RNA source that consists of lariat and circular RNAs from pre-mRNA splicing. Nucleic Acids Res. 2006;34:e63–e63. doi: 10.1093/nar/gkl151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Zhang Y, et al. Circular intronic long noncoding RNAs. Mol. Cell. 2013;51:792–806. doi: 10.1016/j.molcel.2013.08.017. [DOI] [PubMed] [Google Scholar]
- 26.Van Nostrand EL, et al. A large-scale binding and functional map of human RNA-binding proteins. Nature. 2020;583:711–719. doi: 10.1038/s41586-020-2077-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Jiang M-P, et al. The emerging role of the interactions between circular RNAs and RNA-binding proteins in common human cancers. J. Cancer. 2021;12:5206–5219. doi: 10.7150/jca.58182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Okholm TLH, et al. Transcriptome-wide profiles of circular RNA and RNA-binding protein interactions reveal effects on circular RNA biogenesis and cancer pathway expression. Genome Med. 2020;12:112. doi: 10.1186/s13073-020-00812-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Ji P, et al. Expanded expression landscape and prioritization of circular RNAs in mammals. Cell Rep. 2019;26:3444–3460.e5. doi: 10.1016/j.celrep.2019.02.078. [DOI] [PubMed] [Google Scholar]
- 30.Nalls MA, et al. Identification of novel risk loci, causal insights, and heritable risk for Parkinson’s disease: a meta-analysis of genome-wide association studies. Lancet Neurol. 2019;18:1091–1102. doi: 10.1016/S1474-4422(19)30320-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Liu G, et al. Genome-wide survival study identifies a novel synaptic locus and polygenic score for cognitive progression in Parkinson’s disease. Nat. Genet. 2021;53:787–793. doi: 10.1038/s41588-021-00847-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Jansen IE, et al. Genome-wide meta-analysis identifies new loci and functional pathways influencing Alzheimer’s disease risk. Nat. Genet. 2019;51:404–413. doi: 10.1038/s41588-018-0311-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Landgraf P, et al. A mammalian microRNA expression Atlas based on small RNA library sequencing. Cell. 2007;129:1401. doi: 10.1016/j.cell.2007.04.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Junn E, et al. Repression of α-synuclein expression and toxicity by microRNA-7. Proc. Natl Acad. Sci. 2009;106:13052–13057. doi: 10.1073/pnas.0906277106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Piñero J, et al. DisGeNET: a comprehensive platform integrating information on human disease-associated genes and variants. Nucleic Acids Res. 2017;45:D833–D839. doi: 10.1093/nar/gkw943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Soutar, M. P. M. et al. Regulation of mitophagy by the NSL complex underlies genetic risk for Parkinson’s disease at Chr16q11.2 and on the MAPT H1 allele. bioRxiv10.1101/2020.01.06.896241 (2021). [DOI] [PMC free article] [PubMed]
- 37.Zollino M, et al. Mutations in KANSL1 cause the 17q21.31 microdeletion syndrome phenotype. Nat. Genet. 2012;44:636–638. doi: 10.1038/ng.2257. [DOI] [PubMed] [Google Scholar]
- 38.Koolen DA, et al. Mutations in the chromatin modifier gene KANSL1 cause the 17q21.31 microdeletion syndrome. Nat. Genet. 2012;44:639–641. doi: 10.1038/ng.2262. [DOI] [PubMed] [Google Scholar]
- 39.Raj T, et al. Polarization of the effects of autoimmune and neurodegenerative risk alleles in leukocytes. Science. 2014;344:519–523. doi: 10.1126/science.1249547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Sulzer D, et al. T cells from patients with Parkinson’s disease recognize α-synuclein peptides. Nature. 2017;546:656–661. doi: 10.1038/nature22815. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Hatoum, A. S. et al. Multivariate genome-wide association meta-analysis of over 1 million subjects identifies loci underlying multiple substance use disorders. medRxiv10.1101/2022.01.06.22268753 (2022). [DOI] [PMC free article] [PubMed]
- 42.Olsen CM, Liu QS. Phosphodiesterase 4 inhibitors and drugs of abuse: current knowledge and therapeutic opportunities. Front. Biol. 2016;11:376. doi: 10.1007/s11515-016-1424-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Hubacek JA, et al. The association between the FTO gene variant and alcohol consumption and binge and problem drinking in different gene-environment background: The HAPIEE study. Gene. 2019;707:30–35. doi: 10.1016/j.gene.2019.05.002. [DOI] [PubMed] [Google Scholar]
- 44.Cremona CA, Behrens A. ATM signalling and cancer. Oncogene. 2014;33:3351–3360. doi: 10.1038/onc.2013.275. [DOI] [PubMed] [Google Scholar]
- 45.Ashton JL, Burton RAB, Bub G, Smaill BH, Montgomery JM. Synaptic plasticity in cardiac innervation and its potential role in atrial fibrillation. Front. Physiol. 2018;9:240. doi: 10.3389/fphys.2018.00240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Edvardson S, et al. A deleterious mutation in DNAJC6 encoding the neuronal-specific clathrin-uncoating co-chaperone auxilin, is associated with Juvenile Parkinsonism. PLoS ONE. 2012;7:e36458. doi: 10.1371/journal.pone.0036458. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Olgiati S, et al. DNAJC6 mutations associated with early-onset Parkinson’s disease. Ann. Neurol. 2016;79:244–256. doi: 10.1002/ana.24553. [DOI] [PubMed] [Google Scholar]
- 48.Elsayed LEO, et al. A novel nonsense mutation in DNAJC6 expands the phenotype of autosomal-recessive juvenile-onset parkinson’s disease. Ann. Neurol. 2016;79:335–337. doi: 10.1002/ana.24591. [DOI] [PubMed] [Google Scholar]
- 49.Ungewickell E, et al. Role of auxilin in uncoating clathrin-coated vesicles. Nature. 1995;378:632–635. doi: 10.1038/378632a0. [DOI] [PubMed] [Google Scholar]
- 50.Beach TG, et al. Unified staging system for Lewy body disorders: correlation with nigrostriatal degeneration, cognitive impairment and motor dysfunction. Acta Neuropathologica. 2009;117:613–634. doi: 10.1007/s00401-009-0538-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Qureshi IA, Mattick JS, Mehler MF. Long non-coding RNAs in nervous system function and disease. Brain Res. 2010;1338:20–35. doi: 10.1016/j.brainres.2010.03.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Mattick JS, Taft RJ, Faulkner GJ. A global view of genomic information—moving beyond the gene and the master regulator. Trends Genet. 2010;26:21–28. doi: 10.1016/j.tig.2009.11.002. [DOI] [PubMed] [Google Scholar]
- 53.Piwecka M, et al. Loss of a mammalian circular RNA locus causes miRNA deregulation and affects brain function. Science. 2017;357:eaam8526. doi: 10.1126/science.aam8526. [DOI] [PubMed] [Google Scholar]
- 54.Li Z, et al. Exon-intron circular RNAs regulate transcription in the nucleus. Nat. Struct. Mol. Biol. 2015;22:256–264. doi: 10.1038/nsmb.2959. [DOI] [PubMed] [Google Scholar]
- 55.Fernandopulle MS, Lippincott-Schwartz J, Ward ME. RNA transport and local translation in neurodevelopmental and neurodegenerative disease. Nat. Neurosci. 2021;24:622–632. doi: 10.1038/s41593-020-00785-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Bellucci A, Antonini A, Pizzi M, Spano PF. The end is the beginning: Parkinson’s disease in the light of brain imaging. Front. Aging Neurosci. 2017;9:1–5. doi: 10.3389/fnagi.2017.00330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Schirinzi T, et al. Early synaptic dysfunction in Parkinson’s disease: Insights from animal models. Mov. Disord. 2016;31:802–813. doi: 10.1002/mds.26620. [DOI] [PubMed] [Google Scholar]
- 58.Masliah E, et al. Altered expression of synaptic proteins occurs early during progression of Alzheimer’s disease. Neurology. 2001;56:127–129. doi: 10.1212/wnl.56.1.127. [DOI] [PubMed] [Google Scholar]
- 59.Scheff SW, Price DA, Schmitt FA, DeKosky ST, Mufson EJ. Synaptic alterations in CA1 in mild Alzheimer disease and mild cognitive impairment. Neurology. 2007;68:1501–1508. doi: 10.1212/01.wnl.0000260698.46517.8f. [DOI] [PubMed] [Google Scholar]
- 60.Scheff SW, Price DA, Schmitt FA, Mufson EJ. Hippocampal synaptic loss in early Alzheimer’s disease and mild cognitive impairment. Neurobiol. Aging. 2006;27:1372–1384. doi: 10.1016/j.neurobiolaging.2005.09.012. [DOI] [PubMed] [Google Scholar]
- 61.Selkoe DJ. Alzheimer’s disease is a synaptic failure. Science. 2002;298:789–791. doi: 10.1126/science.1074069. [DOI] [PubMed] [Google Scholar]
- 62.Kleaveland B, Shi CY, Stefano J, Bartel DP. A network of noncoding regulatory RNAs acts in the mammalian brain. Cell. 2018;174:350–362.e17. doi: 10.1016/j.cell.2018.05.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Anthony K. RNA-based therapeutics for neurological diseases. RNA Biol. 2022;19:176–190. doi: 10.1080/15476286.2021.2021650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Wu N, Qadir J, Yang BB. CircRNA perspective: new strategies for RNA therapy. Trends Mol. Med. 2022;28:343–344. doi: 10.1016/j.molmed.2022.02.002. [DOI] [PubMed] [Google Scholar]
- 65.He AT, Liu J, Li F, Yang BB. Targeting circular RNAs as a therapeutic approach: current strategies and challenges. Signal Transduct. Target. Ther. 2021;6:1–14. doi: 10.1038/s41392-021-00569-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Hughes AJ, Daniel SE, Kilford L, Lees AJ. Accuracy of clinical diagnosis of idiopathic Parkinson’s disease: a clinico-pathological study of 100 cases. J. Neurol. Neurosurg. Psychiatry. 1992;55:181–184. doi: 10.1136/jnnp.55.3.181. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Ball M, et al. Consensus recommendations for the postmortem diagnosis of Alzheimer’s disease. Neurobiol. Aging. 1997;18:S1–S2. [PubMed] [Google Scholar]
- 68.Bonanni L, Thomas A, Onofrj M, McKeith IG. Diagnosis and management of dementia with Lewy bodies: third report of the DLB Consortium. Neurology. 2006;66:1455–1455. doi: 10.1212/01.wnl.0000224698.67660.45. [DOI] [PubMed] [Google Scholar]
- 69.Schroeder A, et al. The RIN: an RNA integrity number for assigning integrity values to RNA measurements. BMC Mol. Biol. 2006;7:3. doi: 10.1186/1471-2199-7-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.DelleDonne A, et al. Incidental Lewy body disease and preclinical Parkinson disease. Arch. Neurol. 2008;65:1074–1080. doi: 10.1001/archneur.65.8.1074. [DOI] [PubMed] [Google Scholar]
- 71.Kurn N. Novel isothermal, linear nucleic acid amplification systems for highly multiplexed applications. Clin. Chem. 2005;51:1973–1981. doi: 10.1373/clinchem.2005.053694. [DOI] [PubMed] [Google Scholar]
- 72.Faherty SL, Campbell CR, Larsen PA, Yoder AD. Evaluating whole transcriptome amplification for gene profiling experiments using RNA-Seq. BMC Biotechnol. 2015;15:65. doi: 10.1186/s12896-015-0155-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Anvar S, et al. Determining the quality and complexity of next-generation sequencing data without a reference genome. Genome Biol. 2014;15:555. doi: 10.1186/s13059-014-0555-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Trapnell C, Pachter L, Salzberg SL. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009;25:1105–1111. doi: 10.1093/bioinformatics/btp120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Trapnell C, et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc. 2012;7:562–578. doi: 10.1038/nprot.2012.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.’t Hoen PAC, et al. Reproducibility of high-throughput mRNA and small RNA sequencing across laboratories. Nat. Biotechnol. 2013;31:1015–1022. doi: 10.1038/nbt.2702. [DOI] [PubMed] [Google Scholar]
- 77.Yu G, Wang L-G, Han Y, He Q-Y. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS: J. Integr. Biol. 2012;16:284–287. doi: 10.1089/omi.2011.0118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data supporting the findings of this study are available from the corresponding authors upon reasonable request. The RNA-seq raw FASTQ data and normalized expression matrix of all circRNAs in this study have been deposited in GEO under accession number GSE218203. The proceeded data for this study, including a browser track hub for all circRNAs from this study, can be queried at the BRAINcode project website at http://www.humanbraincode.org through a user-friendly interface.
Custom code associated with this study is publicly available at https://github.com/sterding/circRNA.