Dear Editor,
In the present study, we have demonstrated that extracellular vesicles (EVs) derived from cerebrospinal fluid (CSF) represent a promising source for the identification of a novel miRNA signatures in Parkinson's disease (PD). Using next‐generation small‐RNA sequencing, we present for the first time the complete and quantitative microRNAome of EVs isolated from human CSF of PD and age‐correlated controls (CTR). In parallel, we performed CSF proteomic profiling of overlapping patient cohorts, which revealed the deregulation of disease‐relevant pathways similar to the ones obtained with the parallel miRNA analyses, supporting the results for the identified signature.
Novel molecular signatures and disease biomarkers are urgently needed for PD, not only to improve diagnostic precision, but also to enable monitoring of treatment responses, as well as stratification of patients according to the molecular background, rather than solely on clinical phenotypes. 1 Circulating miRNAs are auspicious targets for biomarker studies because their expression reflects the functional state of cells and is directly influenced by pathological stimuli. 2 CSF is in direct contact with the brain parenchyma, and molecular alterations in its composition may reflect specific changes related to PD pathology in the brain.
MiRNA species circulating in CSF seem to overlap with miRNAs expressed in brain tissue. 3 Furthermore, miRNAs and other small‐RNAs are enriched in the vesicular fraction of human CSF. 3 , 4 In order to characterize the size and particle distribution in our CSF EV preparations, we used Nanoparticle Tracking Analyses and observed a similar enrichment as previously reported (Figure 1A). Small‐RNA sequencing ratified the miRNA abundance in CSF EVs—they represented, on average, 97.4% of all mapped small‐RNAs in the discovery cohort (Figure 1C). In total, we detected 688 miRNAs. A total of 208 of these had a base mean higher than 5 reads and were analyzed further. Differential expression analyses revealed differences in the levels of 22 miRNAs in the PD versus CTR comparison (Figure 1D). The majority of the differentially expressed miRNAs were upregulated in PD subjects, whereas downregulated miRNAs showed only subtle levels of deregulation (–0.60 ≤ log2FC ≤ –0.16). Among upregulated species figured brain‐enriched miRNAs miR‐9‐5p, let‐7b, miR‐181a‐5p, and miR‐181b‐5p (Figures 1D and 1E). To reduce bias by a potential erythrocyte contamination, we strictly selected CSF samples with a low number of red blood cells (<100/μl CSF). Furthermore, because miR‐451a is highly enriched in red blood cells, it was excluded from feature‐selection analyses.
To explore the overall miRNA expression differences in the cohorts, hierarchical clustering analyses were performed (Figure 1F). Grouping samples based on miRNA expression levels revealed differences in the overall miRNA abundance between PD and CTR samples. PD samples showed expression heterogeneity, as some of these subjects clustered close to/among CTRs. Repeating the analysis with PD samples only revealed five different subclusters (Figure 1G) that did not correlate with the distribution of clinical parameters (e.g., disease duration; age of death; Levodopa‐equivalent dose; scores for disease severity [PDNMS; MDS‐UPDRS; MoCA; mH&Y]). This suggests a molecular diversity in PD cases that is reflected by miRNA expression.
Using machine learning approaches (measure of relevance [MoR]; reliability analysis [RiA]; random forest) with the small‐RNA sequencing data, we found an iterative signature comprising miR‐126‐5p, miR‐99a‐5p, and miR‐501‐3p, which could differentiate PD and CTR samples in our discovery cohorts (42 PD; 43 CTR) (Figures 2A–2D). Sample numbers for the discovery cohort were similar to other studies in the field 4 and were shown to be adequate for algorithm training. The panel was able to classify PD/CTR samples in an independent validation cohort (nine PD; 11 CTR) with an area under the curve (AUC) value of 0.85 (Precision–recall AUC = 0.88; sensitivity = 0.78; specificity = 0.95). Mean decrease in Gini (Figure 2D) indicated miR‐126‐5p as the most discriminative variable, followed by miR‐99a‐5p and miR‐501‐3p. A third independent cohort (25 PD; 25 CTR) was used for validation purposes. Real‐Time Quantitative Reverse Transcription PCR (qRT‐PCR) experiments confirmed the differential expression of miR‐126‐5p and miR‐99a‐5p when comparing PD and CTR cohorts (Figure S2). The individual expression of each signature miRNA in PD subjects of the discovery cohort (Figure 2E) delineated a similar heterogeneity to the one observed in the global miRNA analysis (Figure 1G), confirming the molecular diversity within PD cases. Subclusters 1 and 3 present opposing expression for the signature miRNAs, whereas subclusters 2 and 5 present similar levels for these candidates. These findings suggest that the identified signature would be a useful tool for distinguishing disease subgroups based on miRNA expression. On the other hand, the inclusion of additional patients/cohorts with variate compositions might explain the lack of reproducibility of studies in the field, 5 as well as the discrepant results for some candidates during the additional validation studies we presented here. Using samples from patients with different molecular backgrounds, which cannot be distinguished by clinical phenotype alone, as well as the smaller size of the validation cohort might explain the differences in the results observed for miR‐501‐3p with RNA sequencing and qRT‐PCR experiments.
Regarding the biological role of the three signature miRNAs, functional annotation analyses with their predicted targets indicated that these candidates likely originate in neurons. Neuron‐related terms comprised the most frequent enriched categories for Gene Ontology‐Biological Processes (GO‐BP) results (8/35 enriched GO‐BP terms), indicating their neuronal origin. Terms including neuron death, vesicle‐mediated transport, and proteasomal‐protein catabolic process indicate the participation of these miRNAs in processes directly related to PD pathogenesis 6 (Figures 2H and 2J). These findings are corroborated by KEGG pathway (Kyoto Encyclopedia of Genes and Genomes) enrichment results: 19 out of 64 annotated KEGG pathways were neuron related (Figures 2G and 2I). Among the top 15 pathways figure retrograde endocannabinoid signaling and cholinergic‐dopaminergic synapse, categories with important involvement in PD pathology. 6 , 7 Furthermore, each candidate of our panel has been linked to neurodegenerative mechanisms previously: miR‐126 has been linked to insulin/IGF‐1/PI3K signaling and found in increased levels in PD substantia nigra 8 ; miR‐99a‐5p has been associated with neuroinflammation/neurodegeneration processes by regulating microglial functions 9 ; miR‐501‐3p is a regulator of dendritic spine remodeling, and was also found upregulated in Alzheimer's disease brains. 10
Aiming to identify differentially expressed proteins and to explore disease‐relevant pathways further, an overlapping cohort (64 PD; 61 CTR) was analyzed using mass spectrometry using total CSF (Figure 3A). In total, 67 proteins were found differentially expressed between conditions (45 downregulated in PD/22 upregulated in PD) (Figure 3B). Functional annotation showed an important enrichment for inflammatory/immune‐related terms, as well as neuronal‐related terms (e.g., axon regeneration; neuronal development; synapse organization for GO‐BP terms; complement/coagulation cascades for KEGG pathways) (Figure 3D). Remarkably, these results overlap with the pathways annotated for the signature miRNAs, especially for the regulation of neuron development/morphogenesis and synapse‐ and secretion‐related terms. PPI networks with deregulated proteins revealed important hub‐proteins (TGOLN2; SCG2; KNG1; APOA4) (Figure 3E). Proteins that have been previously postulated as PD biomarkers (VGF and EPHA4) were also identified in our studies (Table S4). Overall, although the parallel studies differed regarding the analyzed CSF compartments and the cohorts did not overlap completely, several disease‐relevant pathways were coincidental, further supporting the results of the miRNA study.
Limitations for the identification of molecular signatures in CSF EVs must be critically considered: the starting volume of CSF for isolation of sequencing‐quality RNA (∼4.5 mL) is relatively high, limiting the number of available samples/additional analyses that can be performed. More efficient EV/RNA isolation protocols will significantly improve further CSF multiomics studies. It is important to highlight that the identification of such a miRNA signature in PD CSF must be taken as a starting point, and both the individual expression of each miRNA candidate as well as the combinatorial diagnostic value of the proposed panel must be validated in subsequent multicentric studies. Furthermore, we aimed to strictly select PD patients with a clear clinical phenotype to evaluate miRNA changes in a more advanced stage of the disease. A subsequent study recruiting patients shortly after onset of motor symptoms would be an important follow‐up for this work to assess the value of the signature for the identification of early PD patients.
In summary, we identified a novel miRNA signature in PD CSF composed of miR‐126‐5p, miR‐99a‐5p, and miR‐501‐3p. This signature could potentially contribute to an improved PD diagnosis, as well as to delineate future druggable targets for the disease by revealing important pathophysiological mechanisms. The validity of this signature as a diagnostic biomarker panel should be subsequently validated in larger multicentric studies. Our small‐RNA data also indicate that profiling miRNA expression in CSF EVs might identify clinically inapparent subgroups of PD patients, which could be ultimately used for personalized diagnostic and therapeutic strategies for the disease.
CONFLICT OF INTEREST
The authors declare no conflict of interest.
DATA AVAILABILITY STATEMENT
RNA sequencing and proteomics datasets were deposited in public repositories — the European Genome‐phenome Archive (EGA) and the Proteomics Identifications Database (PRIDE), respectively — following the applicable guidelines. RNA sequencing data are stored under the accession number EGAD00001006629. Proteomics data are available via ProteomeXchange with identifier PXD022234.
Supporting information
ACKNOWLEDGMENTS
The authors appreciate the participation of patients in this study. We thank Woori Koh, Anna Fischbach, and Matthias Börger for their participation in patient recruitment. We also thank Barbara Müller for technical assistance and the Transcriptome Analysis Laboratory Göttingen for the performance of the small RNA sequencing.
Lucas Caldi Gomes and Anna‐Elisa Roser contributed equally to this work.
Funding information
This study was supported by the Cluster of Excellence and DFG Research Center Nanoscale Microscopy and Molecular Physiology of the Brain (CNMPB), Göttingen, Germany, by Bridging Funds from the Göttingen Graduate Center for Neurosciences, Biophysics, and Molecular Biosciences (GGNB) (LCG), by the German Network for Bioinformatics Infrastructure (de.NBI ‐ FKZ 031 A 534A) (Federal Ministery of Education and Research), P.U.R.E. (Protein Research Unit Ruhr within Europe), and Ruhr‐University Bochum, Bochum, Germany ProDi (Center for protein diagnostics), Ministry of Innovation, Science and Research of North‐Rhine Westphalia, Germany (KM).
REFERENCES
- 1. Schlossmacher MG, Mollenhauer B. Biomarker research in Parkinson's disease: objective measures needed for patient stratification in future cause‐directed trials. Biomarkers Med. 2010;4:647‐650. [DOI] [PubMed] [Google Scholar]
- 2. Saba R, Schratt GM. MicroRNAs in neuronal development, function and dysfunction. Brain Res. 2010;1338:3‐13. [DOI] [PubMed] [Google Scholar]
- 3. Yagi Y, Ohkubo T, Kawaji H, et al. Next‐generation sequencing‐based small RNA profiling of cerebrospinal fluid exosomes. Neurosci Lett. 2017;636:48‐57. [DOI] [PubMed] [Google Scholar]
- 4. Jain G, Stuendl A, Rao P, et al. A combined miRNA–piRNA signature to detect Alzheimer's disease. Transl Psychiatry. 2019;9:250. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Roser AE, Caldi Gomes L, Schünemann J, et al. Circulating miRNAs as diagnostic biomarkers for Parkinson's disease. Front Neurosci. 2018;12:625. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Dauer W, Przedborski S. Parkinson's disease: mechanisms and models. Neuron. 2003;39:889‐909. [DOI] [PubMed] [Google Scholar]
- 7. Pisani V, Madeo G, Tassone A, et al. Homeostatic changes of the endocannabinoid system in Parkinson's disease: ECB changes in PD. Mov Disord. 2011;26:216‐222. [DOI] [PubMed] [Google Scholar]
- 8. Kim W, Lee Y, McKenna ND, et al. miR‐126 contributes to Parkinson's disease by dysregulating the insulin‐like growth factor/phosphoinositide 3‐kinase signaling. Neurobiol Aging. 2014;35:1712‐1721. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Butovsky O, Jedrychowski MP, Moore CS, et al. Identification of a unique TGF‐β dependent molecular and functional signature in microglia. Nat Neurosci. 2014;17:131‐143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Hara N, Kikuchi M, Miyashita A, et al. Serum microRNA miR‐501‐3p as a potential biomarker related to the progression of Alzheimer's disease. Acta Neuropathol Commun. 2017;5:10. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
RNA sequencing and proteomics datasets were deposited in public repositories — the European Genome‐phenome Archive (EGA) and the Proteomics Identifications Database (PRIDE), respectively — following the applicable guidelines. RNA sequencing data are stored under the accession number EGAD00001006629. Proteomics data are available via ProteomeXchange with identifier PXD022234.