Abstract
Disruption of sleep and circadian rhythms are a comorbid feature of many pathologies, and can negatively influence many health conditions, including neurodegenerative disease, metabolic illness, cancer, and various neurological disorders. Genetic association studies linking sleep and circadian disturbances with disease susceptibility have mainly focused on changes in gene expression due to mutations, such as single-nucleotide polymorphisms. The interaction between sleep and/or circadian rhythms with the use of Alternative Polyadenylation (APA) has been largely undescribed, particularly in the context of other disorders. APA generates transcript isoforms by utilizing various polyadenylation sites (PASs) from the same gene affecting its mRNA translation, stability, localization, and subsequent function. Here we identified unique APAs expressed in rat brain over time-of-day, immediately following sleep deprivation, and the subsequent recovery period. From these data, we performed a secondary analysis of these sleep- or time-of-day associated PASs with recently described APA-linked human brain disorder susceptibility genes.
Subject terms: Diseases, Circadian rhythms and sleep
Introduction
Dysregulation of sleep and circadian rhythms can profoundly impact human health and compound disease1,2. Indeed, sleep disruption is associated with negative outcomes in cardiovascular, metabolic, immunologic, and cognitive health that can have substantial short- and long-term consequences3. Alterations in sleep and circadian rhythms are often observed with various brain disorders, including autism spectrum disorder, bipolar disorder, major depression, schizophrenia, Parkinson’s, and Alzheimer’s diseases4–7. Complicating the association between sleep and health is the fact that functional aspects of sleep remain largely undefined and inconclusive8,9; however, the use of evolutionarily distinct animal models to study sleep has historically offered keen insights10,11. For example, studies on circadian- and sleep-dependent gene-regulatory mechanisms in diverse species, including flies, rodents, and humans, have identified important phylogenetically conserved pathways with functional relevance12–15. Employing unbiased approaches, such as large-scale metabolomic, transcriptomic, and proteomic analyses, have also greatly aided in the generation of conceptual frameworks for characterizing sleep function in health14,16. Therefore, performing such discovery-based studies of sleep and circadian regulatory processes in model organisms will help define the fundamental biological mechanisms underlying sleep function and inform preclinical relevance for comorbidities of sleep dysfunction associated with poor health.
Alternative polyadenylation (APA) site usage is an important and often overlooked mechanism of gene regulation, that can affect mRNA stability, mRNA/protein targeting, translational competence, and generate alternative protein isoforms17,18. APA sites are common and occur most frequently in the 3′ untranslated regions (3′ UTR) of mRNAs across phylogeny, with more than half of human genes having multiple polyadenylation sites (PASs) that generate alternative isoforms19. These isoforms can have altered coding sequences or 3′ UTRs, resulting in the diversification of cis-regulatory elements (e.g., RNA-binding protein sites, microRNA binding sites) that influence transcript abundance, trafficking, stability, and/or translation efficiency20. Furthermore, there’s growing evidence of cell-type-specific APA preference21. The involvement of APA in the context of sleep and circadian rhythms has been largely unexplored, with the few studies available mostly focused on peripheral organs22,23 and cells24. Here, we have characterized how APA site usage oscillates based on the time-of-day as well as how it is altered following acute changes in sleep pressure, specifically in the adult mammalian brain. Multiple methodologies have been developed for transcriptome-wide profiling and mapping of APA sites25,26. To complete this study, we performed whole transcriptome termini sequencing (WTTS-seq)27,28 analysis to profile the variations in APA usage that occur due to sleep pressure and daily rhythms in the rat forebrain. Over 31,000 PASs were recovered in total, with 45% of the represented genes having multiple APA sites. Interestingly, many of the PASs sequenced were not previously annotated in the rat genome. Moreover, a total of 2011, (6%) of PASs cycled over the day, and 831 (3%) were homeostatically regulated following sleep loss following sleep loss or during recovery. Over half of all cycling or differentially expressed PASs were APAs, (i.e., in genes with ≥2 PASs). Given the importance of sleep4–7 and APA in health and disease25,29,30, we compared our sequencing results with results from a recent study that determined APA usage in human brain disorder susceptibility31. The genes found in both studies warrant further examination and could lead to new preclinical animal models to investigate these disorders.
To the best of our knowledge, the current study represents the first comprehensive, transcriptome-wide mapping of APA sites in adult mammalian brain tissue over the day-night cycle as well as following changes in sleep homeostasis. This global temporal dataset will be useful for future comparative studies that require the determination of baseline APA site usage profiles in the mammalian brain. Furthermore, our study underscores the importance of using alternative-omic approaches to characterize phylogenetically conserved genome-phenome information and reveals another expansive layer of complexity in sleep and circadian gene regulation that has not previously been documented.
Results
Identification of PASs in the rat forebrain
Given the rat transcriptome is not as extensively annotated as the human or mouse, we first identified all PASs, including novel candidate PASs prior to determining changes in PAS usage. Replicate diurnal (central forebrains) were taken from five rats every four hours starting at two hours after lights on (i.e., ZT2, ZT6, ZT10, ZT14, ZT18 and ZT22) (Fig. 1A, B). RNA was purified from these samples and used to generate WTTS-seq cDNA libraries that were subsequently sequenced. Poly(A)-directed sequence reads were then mapped to the rat genome, giving rise to 31,757 PAS clusters (see Supplementary Table S1). Among the 31,757 PAS clusters identified, a sizable portion mapped to novel unannotated PASs, leaving 26,635 PASs that mapped to named loci (i.e., genes). Many APAs occur at different points within the longest 3′ UTR (Fig. 1C, sites 4 and 5). Some are distal to the longest documented 3′ UTR (site 6), while some occur in internal exons (site 1) or introns (sites 2 and 3) (Fig. 1C). In our dataset of all PASs that mapped to genes, 45% mapped to genes with ≥2 APA sites, and 19% mapped to genes with ≥3 APA sites (Fig. 1D).
Identification of PASs that exhibit a daily cycle
Periodicity of PAS expression was assessed using meta2d32. Diurnal (24 h period) oscillations were demonstrated for 2011 PASs. Among these, 1173 were in genes with ≥2 total APA sites, including ones in known diurnal transcripts, such as Dbp (diurnal in 2 of 2 APA sites recovered), Nr1d2 (in 1 of 1), Per2 (in 2 of 2), and Ntrk2 (in 2 of 10)33 (Table 1 and Supplementary Table S2).
Table 1.
PAS_ID | Symbol | Chromosome | Strand | Peak | PASs/gene | Total | median p | median BH.Q | AVE phase | AVE Amp | AVE rAmp |
---|---|---|---|---|---|---|---|---|---|---|---|
497,844 | Parvb | 7 | + | 115,445,766 | 2 | 6707 | 1.30E-10 | 1.91E-06 | 1.89 | 373 | 0.29 |
251,968 | Dnajb1 | 19 | - | 24,522,731 | 2 | 4379 | 1.09E-09 | 6.93E-06 | 15.54 | 194 | 0.24 |
424,776 | Ccdc107 | 5 | + | 57,752,233 | 3 | 1833 | 1.13E-08 | 4.15E-05 | 18.14 | 124 | 0.37 |
282,515 | Rap1gds1 | 2 | - | 227,512,038 | 2 | 45 | 8.29E-09 | 3.62E-05 | 18.35 | 10 | 1.29 |
86,566 | Hint1 | 10 | + | 38,993,254 | 3 | 44,628 | 1.01E-08 | 3.73E-05 | 11.56 | 1871 | 0.22 |
42,414 | Dbp | 1 | + | 96,180,614 | 2 | 5424 | 2.19E-08 | 5.35E-05 | 11.88 | 408 | 0.40 |
496,957 | Cacna1i | 7 | + | 111,947,418 | 3 | 2361 | 3.52E-08 | 7.46E-05 | 0.81 | 112 | 0.24 |
301,149 | S100a10 | 2 | + | 179,229,660 | 2 | 863 | 4.75E-08 | 9.48E-05 | 21.50 | 88 | 0.59 |
249,570 | Ndrg4 | 19 | - | 9,351,408 | 2 | 57,912 | 5.93E-08 | 1.11E-04 | 0.75 | 2119 | 0.19 |
141,758 | Iars2 | 13 | - | 96,831,484 | 3 | 263 | 5.83E-08 | 9.74E-05 | 18.22 | 33 | 0.73 |
89,040 | Phf23 | 10 | + | 54,722,784 | 2 | 2014 | 7.00E-08 | 1.15E-04 | 13.48 | 119 | 0.32 |
455,360 | Wdr35 | 6 | + | 31,831,183 | 5 | 154 | 6.21E-08 | 9.86E-05 | 18.05 | 23 | 0.83 |
279,711 | Prpf38b | 2 | - | 196,553,983 | 3 | 397 | 1.05E-07 | 1.47E-04 | 19.09 | 34 | 0.48 |
363,773 | Cacna2d1 | 4 | - | 18,950,614 | 2 | 3218 | 1.05E-07 | 1.51E-04 | 5.00 | 248 | 0.39 |
309,907 | Cbs | 20 | - | 9,708,260 | 4 | 3085 | 1.65E-07 | 2.18E-04 | 21.42 | 165 | 0.30 |
214,595 | Faf2 | 17 | - | 9,947,225 | 4 | 2246 | 2.24E-07 | 2.29E-04 | 15.60 | 105 | 0.26 |
447,907 | Syndig1l | 6 | - | 104,323,418 | 2 | 253 | 2.17E-07 | 2.46E-04 | 8.92 | 36 | 0.77 |
470,887 | Btbd11 | 7 | - | 18,035,151 | 3 | 3401 | 3.07E-07 | 3.36E-04 | 23.59 | 184 | 0.29 |
449,908 | Dicer1 | 6 | - | 123,627,529 | 3 | 296 | 3.65E-07 | 4.14E-04 | 17.00 | 30 | 0.58 |
549,543 | Coq10b | 9 | + | 56,573,671 | 2 | 334 | 2.07E-07 | 2.43E-04 | 16.60 | 32 | 0.54 |
PAS_ID is a unique identifier for each PAS. Gene Symbol, Chromosome are listed, as well as Strand according to the convention of each chromosome. Peak is the mode or most common 3′ endpoint in the cluster. The number of PASs per gene is listed. Total refers to the sum of the reads for all samples.
Meta2d calculations shown are: median p median probability of cycling, median BH.Q median Benjamini/Hochberg adjusted probability, AVE phase average peak phase, AVE Amp average amplitude (from peak to trough) of reads, and AVE rAmp average relative amplitude adjusted by the mean read number.
To look for functions or cell components that are particularly affected by APA site usage in a time-of-day dependent manner, we performed pathway and gene ontology (GO) over-representation analyses using the online tool WebGestalt34. The set of 1173 gene symbols corresponding to diurnal PASs in genes with ≥2 APAs were input (Fig. 2 and Supplementary Table S3). Glutamatergic Synapse, Membrane Trafficking, and Circadian Entrainment are among the enriched terms.
We were interested whether rhythmic PASs might cluster predominantly into certain phases of peak expression, and whether APAs that share a common peak phase might also share some functional relationship. It was evident that some phases had very few APAs relative to other phases and the expression levels of many PASs peaked around ZT18-20 (Supplementary Fig. S1). When diurnal APAs from genes with ≥2 total APAs were grouped by phase, GO and pathway analysis on each group found that only phases 2, 10, and 18 had significantly over-represented terms. Phase 18 had the most, with the over-representation of multiple signaling pathways, including ‘neuron-to-neuron synapse’, ‘postsynaptic specialization’, and interestingly, genes in ‘mRNA processing’ pathways (Supplementary Table S4).
There is a growing appreciation that rhythms shorter than 24 h are biologically relevant35–39. Thus, we evaluated the PASs data for ultradian cycling using meta2d with the period set to 12 h. Overall, 1502 PASs that cycled with a 12 h period were identified (Supplementary Table S5). Of the 12 h cycling PASs, 1198 were in genes, and after adjusting for genes with multiple 12 h cycling APAs, there were 1149 unique genes in the set. In total, 827 of the 12 h cycling APA sites were in genes that had ≥2 APAs, representing 778 unique genes. Pathway analysis on this set of 778 unique genes (Supplementary Table S6) showed that CREB phosphorylation and circadian entrainment were highly enriched, while GO analysis of this dataset resulted in 16 GO terms related to the synapse.
PASs are differentially expressed after sleep deprivation and during recovery sleep
To investigate changes in APA site usage related to sleep pressure, rats were subjected to SD for 6 h from ZT0 to ZT6, and central forebrain tissue was collected immediately afterward (R0). Additional animals were allowed to recover for 2, 4, or 8 h after SD (R2, R4, and R8) before tissue was collected. WTTS-seq data from these samples were compared to time-matched controls that were allowed to sleep undisturbed (ZT6, ZT8, ZT10, and ZT14). All groups consisted of 5 biological replicates. Our sequencing data showed that the most significant differences in expression were seen when we compared R0 with its control (ZT6) and R4 with its control (ZT10) (Supplementary Table S7 and Fig. 3). Interestingly, a Homer1a APA isoform is the most abundant at R0, R4, and ZT6, whereas a full-length isoform is dominant at ZT10 (Supplementary Fig. S2a, b) Also, the expression of one APA isoform of Prmt1, was upregulated with high confidence after 6 h of sleep deprivation (Fig. 3). PRMT1 protein regulates multiple stress response pathways40,41, which have a roll in acute sleep loss.
The gene names of differentially expressed APA sites from genes with ≥2 APAs were used for GO and pathway over-representation analysis (Table 2). ZT6 vs R0 only had significant results for GO while ZT10 vs R4 had significant GO and pathway results.
Table 2.
GO description | size | overlap | expect | ratio | p-value | FDR | database |
---|---|---|---|---|---|---|---|
ZT6 vs R0 | |||||||
synaptic membrane | 396 | 5 | 0.37 | 13.40 | 1.88E-05 | 1.79E-02 | CC |
neuron-to-neuron synapse | 322 | 4 | 0.30 | 13.18 | 1.69E-04 | 4.52E-02 | CC |
postsynaptic specialization | 327 | 4 | 0.31 | 12.98 | 1.80E-04 | 4.52E-02 | CC |
localization within membrane | 130 | 3 | 0.12 | 24.48 | 2.14E-04 | 4.52E-02 | BP |
glutamatergic synapse | 368 | 4 | 0.35 | 11.53 | 2.83E-04 | 4.52E-02 | CC |
phosphatase binding | 143 | 3 | 0.13 | 22.26 | 2.83E-04 | 4.52E-02 | MF |
ZT10 vs R4 | |||||||
synaptic membrane | 396 | 31 | 8.09 | 3.83 | 1.22E-10 | 1.17E-07 | CC |
glutamatergic synapse | 368 | 26 | 7.51 | 3.46 | 3.52E-08 | 1.68E-05 | CC |
neuron-to-neuron synapse | 322 | 23 | 6.58 | 3.50 | 1.86E-07 | 5.85E-05 | CC |
postsynaptic specialization | 327 | 23 | 6.68 | 3.44 | 2.45E-07 | 5.85E-05 | CC |
regulation of trans-synaptic signaling | 375 | 24 | 7.66 | 3.13 | 7.43E-07 | 1.42E-04 | BP |
neuron spine | 182 | 16 | 3.72 | 4.30 | 9.83E-07 | 1.57E-04 | CC |
glutamate receptor signaling pathway | 78 | 10 | 1.59 | 6.28 | 3.94E-06 | 5.38E-04 | BP |
regulation of synapse structure or activity | 201 | 15 | 4.10 | 3.65 | 1.60E-05 | 1.91E-03 | BP |
synaptic transmission, glutamatergic | 83 | 9 | 1.69 | 5.31 | 4.73E-05 | 5.02E-03 | BP |
regulation of neuron projection development | 422 | 22 | 8.62 | 2.55 | 5.46E-05 | 5.22E-03 | BP |
localization within membrane | 130 | 11 | 2.65 | 4.14 | 7.10E-05 | 6.17E-03 | BP |
behavior | 464 | 22 | 9.48 | 2.32 | 2.15E-04 | 1.71E-02 | BP |
positive regulation of nervous system development | 447 | 21 | 9.13 | 2.30 | 3.39E-04 | 2.41E-02 | BP |
protein exit from endoplasmic reticulum | 31 | 5 | 0.63 | 7.90 | 3.76E-04 | 2.41E-02 | BP |
synapse organization | 296 | 16 | 6.04 | 2.65 | 3.86E-04 | 2.41E-02 | BP |
axon part | 389 | 19 | 7.94 | 2.39 | 4.03E-04 | 2.41E-02 | CC |
presynapse | 489 | 22 | 9.99 | 2.20 | 4.44E-04 | 2.50E-02 | CC |
cell part morphogenesis | 428 | 20 | 8.74 | 2.29 | 5.02E-04 | 2.52E-02 | BP |
dendrite development | 189 | 12 | 3.86 | 3.11 | 5.06E-04 | 2.52E-02 | BP |
regulation of G protein-coupled receptor signaling pathway | 91 | 8 | 1.86 | 4.30 | 5.35E-04 | 2.52E-02 | BP |
cell leading edge | 276 | 15 | 5.64 | 2.66 | 5.53E-04 | 2.52E-02 | CC |
endoplasmic reticulum to cytosol transport | 21 | 4 | 0.43 | 9.33 | 7.73E-04 | 3.23E-02 | BP |
excitatory synapse | 54 | 6 | 1.10 | 5.44 | 7.78E-04 | 3.23E-02 | CC |
endocytosis | 418 | 19 | 8.54 | 2.23 | 9.62E-04 | 3.83E-02 | BP |
forebrain development | 295 | 15 | 6.02 | 2.49 | 1.09E-03 | 4.18E-02 | BP |
receptor metabolic process | 127 | 9 | 2.59 | 3.47 | 1.17E-03 | 4.31E-02 | BP |
ZT10 vs R4 | |||||||
Pathway description | size | overlap | expect | ratio | p-value | FDR | database |
mRNA surveillance pathway | 95 | 11 | 1.81 | 6.07 | 1.83E-06 | 3.40E-03 | KEGG |
GO and pathway analyses were performed on lists of genes with ≥2 APA sites that exhibited differential expression of at least 1 APA site following sleep deprivation/recovery compared to controls using the over-representation analysis function of the online web tool WebGestalt. GO or pathway description is followed by the size (total number of genes that the term is comprised of), overlap (number of input genes matching the term), expect (number of input genes expected to match by chance), ratio (number of actual/expected matches), p-Value (probability), FDR (false discovery rate; probability adjusted for multiple sampling), database (For the GO analysis: BP Biological process, CC Cellular Component, MF Molecular Function. For the pathway analysis: KEGG is Kyoto Encyclopedia of Genes and Genomes).
Comparison of APA-linked brain disorder susceptibility genes with WTTS-seq identified diurnal APAs and APAs differentially expressed with sleep pressure
A recent survey by Cui et al.31 using APA transcriptome-wide association studies (TWAS) highlighted the importance of APA site usage in brain disorders. To establish the extent to which genes with APA-linked neurological phenotypes had diurnal or sleep-related changes in rats, our list of diurnal genes with ≥2 APA sites were compared to those reported in Cui et al. 31. There were 25 overlapping genes (representing 28 APAs in our data since three genes had 2 diurnal APA sites). Another 19 genes with WTTS-seq-identified APA sites that cycle on a 12 h period were identified in the TWAS dataset, as were nine genes (11 APA sites) that were differentially expressed with sleep pressure. Altogether, 54 APAs representing 46 genes were observed in common with genes having disease-associated APAs (Table 3).
Table 3.
Cycle | PAS_ID | Symbol | Disorders Cui et al. | PASs/gene | major PAS | p AVE | BH.Q AVE | phase AVE |
---|---|---|---|---|---|---|---|---|
24 h | 550,479 | Abi2 | AD(1), DEP(1) | 5 | 2.61E-03 | 1.14E-01 | 22 | |
24 h | 126,847 | Agfg2 | AD(1) | 2 | 1.04E-02 | 2.64E-01 | 21 | |
24 h | 61,185 | Arl3 | ANX(1), BIP(2), SZC(6) | 3 | 3.51E-02 | 5.26E-01 | 16 | |
24 h | 234,171 | Brd8 | SCZ(2) | 3 | 5.51E-03 | 1.81E-01 | 10 | |
24 h | 188,939 | Ccdc25 | BIP(1) | 6 | mPAS | 3.66E-02 | 5.37E-01 | 4 |
24 h | 82,922 | Cdip1 | SCZ(2) | 3 | 9.97E-05 | 1.51E-02 | 17 | |
24 h | 392,202 | Chmp3 | ANX(1) | 4 | 9.61E-03 | 2.47E-01 | 7 | |
24 h | 392,209 | Chmp3 | ANX(1) | 4 | 3.11E-02 | 4.94E-01 | 18 | |
24 h | 210,841 | Ddhd2 | SCZ(18) | 2 | mPAS | 2.57E-03 | 1.11E-01 | 18 |
24 h | 330,991 | Elp4 | ANX(2), DEP(1) | 2 | mPAS | 1.27E-03 | 7.33E-02 | 22 |
24 h | 331,015 | Elp4 | ANX(2), DEP(1) | 2 | 2.49E-02 | 4.38E-01 | 1 | |
24 h | 494,157 | Emc2 | ANX(11) | 4 | mPAS | 5.60E-04 | 4.36E-02 | 16 |
24 h | 141,301 | Enah | ANX(7), BIP(1) | 4 | 1.51E-02 | 3.25E-01 | 21 | |
24 h | 332,926 | Frmd5 | ANX(1), DEP(1) | 6 | 1.15E-02 | 2.78E-01 | 2 | |
24 h | 205,543 | Gatad2a | SCZ(1) | 3 | 9.60E-03 | 2.48E-01 | 19 | |
24 h | 204,624 | Grid1 | ANX(1) | 2 | 2.47E-04 | 2.66E-02 | 6 | |
24 h | 121,855 | Hip1r | SCZ(1), AD(1) | 2 | 3.83E-02 | 5.54E-01 | 17 | |
24 h | 61,332 | Ina | SCZ(1) | 3 | mPAS | 5.32E-03 | 1.76E-01 | 23 |
24 h | 351,590 | Map1a | SCZ(1) | 11 | 1.31E-05 | 4.05E-03 | 2 | |
24 h | 95,493 | Mapt | AD(1), ANX(2), ASD(1), DEP(2), MDD(2), PD(2), PTSD(2), SCZ(2) | 9 | 1.13E-02 | 2.71E-01 | 1 | |
24 h | 213,927 | Ntrk2 | ANX(1) | 10 | mPAS | 2.02E-02 | 3.86E-01 | 20 |
24 h | 213,950 | Ntrk2 | ANX(1) | 10 | 3.24E-02 | 5.05E-01 | 2 | |
24 h | 55,226 | Pnpla2 | ADHD(1) | 2 | 2.78E-03 | 1.18E-01 | 20 | |
24 h | 514,594 | Rbm6 | DEP(1) | 4 | mPAS | 2.06E-02 | 3.90E-01 | 18 |
24 h | 549,537 | Sf3b1 | SCZ(1) | 4 | 8.47E-04 | 5.47E-02 | 8 | |
24 h | 286,602 | Ssbp2 | PTSD(1) | 5 | 1.03E-03 | 6.43E-02 | 17 | |
24 h | 131,036 | Svop | Anx(2) | 4 | 8.34E-03 | 2.28E-01 | 15 | |
24 h | 17,074 | Wdr73 | SCZ(2) | 2 | 2.27E-02 | 4.09E-01 | 3 | |
Cycle | PAS_ID | Symbol | PASs/gene | major PAS | p AVE | BH.Q AVE | phase AVE | |
12 h | 550,473 | Abi2 | AD(1), DEP(1) | 5 | mPAS | 3.50E-02 | 6.33E-01 | 6 |
12 h | 61,185 | Arl3 | ANX(1), BIP(2), SZC(6) | 3 | 1.84E-02 | 4.39E-01 | 5 | |
12 h | 98,600 | Cadm2 | BIP(1) | 16 | 3.40E-06 | 2.14E-03 | 5 | |
12 h | 392,206 | Chmp3 | ANX(1) | 4 | mPAS | 2.61E-02 | 5.18E-01 | 4 |
12 h | 287,927 | Ercc8 | SCZ(1) | 2 | 8.29E-03 | 2.72E-01 | 6 | |
12 h | 200,908 | Fgfr1 | SCZ(1) | 5 | 2.82E-02 | 5.55E-01 | 8 | |
12 h | 538,044 | Hecw2 | SCZ(1) | 6 | 1.46E-04 | 2.16E-02 | 7 | |
12 h | 61,328 | Ina | SCZ(1) | 3 | 1.75E-05 | 5.35E-03 | 11 | |
12 h | 77,702 | Lsm12 | BIP(1) | 2 | mPAS | 8.17E-03 | 2.58E-01 | 5 |
12 h | 529,393 | Mon1a | DEP(1), MDD(1), PTSD(1) | 2 | mPAS | 9.21E-04 | 7.38E-02 | 4 |
12 h | 525,326 | Myo1e | SCZ(1) | 2 | mPAS | 1.16E-02 | 3.17E-01 | 5 |
12 h | 90,245 | Pitpna | AD(1) | 8 | 2.01E-02 | 4.61E-01 | 2 | |
12 h | 258,834 | Pskh1 | SCZ(1) | 2 | 3.98E-03 | 1.76E-01 | 7 | |
12 h | 145,571 | Rab29 | PD(1) | 3 | mPAS | 3.33E-04 | 4.04E-02 | 6 |
12 h | 89,414 | Rabep1 | SCZ(13) | 4 | 5.16E-06 | 2.72E-03 | 4 | |
12 h | 22,342 | Setd1a | PD(1) | 2 | 2.67E-02 | 5.45E-01 | 3 | |
12 h | 390,670 | Snca | AD(1),PD(7),SCZ(3) | 4 | 3.35E-03 | 1.57E-01 | 10 | |
12 h | 18,304 | Usp35 | ANX(2) | 2 | mPAS | 8.36E-04 | 6.91E-02 | 10 |
12 h | 218,787 | Wac | BIP(1) | 4 | 1.83E-05 | 5.61E-03 | 10 | |
DE | PAS_ID | symbol | PASs/gene | major PAS | log2FC | p-value | padj | |
ZT10 vs R4 | 467,701 | Mark3 | SCZ(1) | 5 | 3.18 | 5.81E-09 | 6.94E-06 | |
ZT10 vs R4 | 467,700 | Mark3 | SCZ(1) | 5 | 3.00 | 7.13E-08 | 4.95E-05 | |
ZT10 vs R4 | 191,750 | Ndfip2 | SCZ(1),BIP(1) | 5 | -1.15 | 2.55E-08 | 2.19E-05 | |
ZT10 vs R4 | 218,787 | Wac | BIP(1) | 4 | -1.51 | 1.63E-04 | 1.30E-02 | |
ZT10 vs R4 | 549,537 | Sf3b1 | SCZ(1) | 4 | -1.30 | 6.47E-04 | 3.15E-02 | |
ZT10 vs R4 | 35,839 | Mtrf1l | SCZ(1) | 4 | -1.60 | 9.64E-04 | 4.08E-02 | |
ZT10 vs R4 | 205,547 | Gatad2a | 3 | mPAS | 1.03 | 9.24E-05 | 9.16E-03 | |
ZT10 vs R4 | 205,543 | Gatad2a | SCZ(1) | 3 | 1.42 | 2.29E-03 | 7.24E-02 | |
ZT10 vs R4 | 261,774 | Spg7 | SCZ(2) | 3 | mPAS | 2.17 | 2.75E-04 | 1.89E-02 |
ZT10 vs R4 | 519,450 | Snx19 | ADHD(1), BIP(1), SCZ(3) | 3 | -1.18 | 3.65E-04 | 2.20E-02 | |
ZT10 vs R4 | 78,129 | Plekhm1 | ANX(1), DEP(1), PD(1), PTSD(1), SCZ(1) | 3 | mPAS | 1.96 | 2.59E-03 | 7.74E-02 |
The number of PASs per gene is listed. mPAS is indicated if the PAS maps to a known, major PAS. refers to the sum of the reads for all samples. Meta2d calculations shown are: median p median probability of cycling, median BH.Q median Benjamini/Hochberg adjusted probability, phase AVE average peak phase. Abbreviations for the brain disorders studied in Cui et al. 31 are: ALS amyotrophic lateral sclerosis, ADHD attention deficit hyperactivity disorder, ASD autism spectrum disorder, ANX anxiety, BIP bipolar disorder, DEP depression, MDD major depressive disorder, SCZ schizophrenia, PTSD post-traumatic stress disorder, PD Parkinson’s disease and AD Alzheimer’s disease (AD).
Discussion
APA site usage is an understudied aspect of gene regulation. Although APA sequencing can reveal changes in overall gene expression, it’s designed to focus on changes in APA usage and cannot reveal differences in splicing or transcription start sites (TSSs). On the other hand, bulk RNA-seq analysis often ignores APA, TSS and splice isoforms to simply assess reads per gene. Currently it would be very difficult to enumerate copies of all the mRNA isoforms for each gene. Yet appreciation is growing for the importance of APA sites in regulating mRNA stability17,42, mRNA/protein localization20,43,44, and human disease31,45.
Rhythmic APA site usage has been uncovered in the mouse liver22,23,46, and in temperature-entrained cultured cells, circadian APA usage occurs in many genes and can regulate expression of specific central clock genes24. Still, alternative poly(A) site usage hasn’t been given much attention in the sleep and circadian field. We therefore initiated this investigation into the conjunction of APA with sleep and diurnal expression. As far as we are aware, the current study is the first to examine APA sites related to circadian rhythms and sleep pressure in any mammalian brain. There are several, diverse ways in which data from this study can translate into biological relevance as described in the examples below.
Here, we observed that 6% of all PASs cycled with a 24 h period. One of the top pathways identified for the diurnal APA gene set was ‘circadian entrainment’ (Fig. 2). Since transcription-translation feedback loops are central to circadian regulation, this may not be surprising, but APA site usage suggests a more complex role24,46. For example, we find that one Sin3b APA follows a diurnal rhythm (Fig. 3A, B). Sin3b encodes short and long variants conserved in mammals. The short variant binds to CRY1 but cannot bind HDAC147. The long isoform is implicated in regulation of Per1/Per2 transcription48, along with many other genes49. In our data, long Sin3b APA reads constitute the predominant isoform at ZT6 and ZT22, while the short, diurnal isoform is the most abundant one at ZT10, ZT14 and perhaps ZT2 (Fig. 3B). Sin3b transcript levels in mouse hippocampus have previously been reported to be affected by sleep deprivation50, although this effect was not observed using TRAP-seq51, suggesting post-transcriptional processing can lead to changes in sleep-dependent differential expression. Together with our work, this example highlights the importance of utilizing various “-omic” approaches to properly decipher the complexity of molecular processing tied to changes in behavioral state in the brain.
Additional significant pathways emerged from the diurnal APAs, such as Oxytocin, Ephrin, and MAPK signaling that have demonstrated links to the circadian clock52–54. In the GO analysis of the diurnal genes with multiple PASs, we discovered that terms related to the synapse (12), protein localization (6), and vesicles (7) (Fig. 2 and Supplementary Table S3) were enriched suggesting APAs are poised to affect neural communication.
A large proportion of diurnal APAs had expression peaks around ZT20 (Supplementary Fig. S1). Considering that rats are nocturnal, this is similar to what has been seen for bulk transcripts in several human tissues, including brain55. Interestingly, among the identified diurnal APA sites, 3 were in genes for RNA-binding proteins (Celf2, Elavl3, and Rbfox1) whose expressions correlate with more distal APA usage47. Peak expression of these three genes is from ZT21 to ZT1, so it would be interesting to see if transcripts of predicted targets tend to be longer at these times.
In addition to the 24 h circadian rhythm, recent studies have also demonstrated the existence of cell-autonomous ultradian clocks that run independently of the circadian clock to regulate 12 h oscillations in gene expression and metabolism35–39. Here we found that 5% of all PASs cycle with a 12 h period. Further analysis of these genes showed enrichment of gene ontology terms and pathways such as “regulation of trans-synaptic signaling” and “protein-protein interactions at synapses” (Supplementary Table S6), indicating that APAs could function to regulate cyclic actions of cell signaling and communication.
Gene expression studies following changes in sleep homeostasis have largely ignored alternative polyadenylation. Of the 31,795 total PASs characterized in rat forebrain in our study, we determined that 2.5% were differentially expressed with sleep deprivation and recovery sleep. Few PASs were differentially expressed at ZT14 vs R8 which may indicate that transcription/PAS use has recuperated after 8 h (Supplementary Table S7), and none were significant at ZT8 vs R2. The reasons for this result remain unclear as the total number of PASs recovered for ZT8 and R2 approximate the average of the other groups. It could be that transcription/PAS use is transitioning through a temporal window of recovery that closely resembles ZT8. Ample differentially expressed PASs were found at the other timepoints, and we observed six GO terms significantly enriched following 6 h of sleep loss and 26 following 4 h of recovery sleep (Table 2).
Human APA isoforms have been linked to many neurological disorders31. Among the genes that we identified to have rhythmic expression of APA sites or had APA sites that were affected by sleep pressure, we found that 46 have also been correlated with brain disorder susceptibility (Table 3). For example, the human MAPT/TAU gene produces transcripts containing short or long 3′ UTRs (Fig. 4C), and a 3′ single-nucleotide polymorphism, (SNP) is associated with both 3′ UTR length and risks for 8 neurological disorders, including Alzheimer’s (AD) and Parkinson’s diseases (PD)31. Homozygosity of the more common SNP variant is associated with short MAPT 3′ UTRs, homozygosity of the less common SNP variant is associated with long 3′ UTRs, and heterozygosity is associated with 3′ UTRs of intermediate lengths. In our rat APA data, there were both short and long 3′ UTR forms (5 in total) of the Mapt gene that were identified (Fig. 4C, D). Only two are currently annotated in the rat genome and one of the newly discovered APAs was observed to cycle with time-of-day. In mouse, binding of the ALS-associated protein TDP-43 to two sites in the 3′ UTR of Mapt has been shown to destabilize the mRNA56. In Alzheimer’s disease, the expression level of TDP-43 protein is often low, and TAU is overexpressed and eventually forms neurofibrillary tangles. The two TDP-43 binding sites that were experimentally determined in mouse are conserved in sequence and position in the rat gene, implying that transcripts with shorter 3′ UTRs would not be affected by TDP-43, while longer ones could be destabilized56,57. The presence of at least one putative TDP-43 binding site in the human MAPT 3′ UTR suggests that this may be contributing to the neurological disorder risk. Another interesting candidate is the miRNA 150-5p, which has been implicated in AD58–60, PD61,62, and possibly depression63. This mRNA has four putative binding sites in human MAPT/TAU 3′ UTR, of which two align at corresponding positions in the rat and mouse genes. Again, APAs producing the short transcripts would lack these sites (Fig. 4C).
Ntrk2 is among the APA TWAS genes linked to anxiety31 and has been associated with autism in other studies64. We found strong time-of-day oscillations of the 2 most abundant APA sites of the short, tyrosine kinase-deficient (TK-) Ntrk2 isoform. The TK- isoform of Ntrk2 has several known functions, including a dominant negative effect on the full-length TK+ isoform during neuronal proliferation, differentiation, and survival. In addition, the TK- version promotes filopodia and neurite outgrowth; sequesters, translocates, and presents BNDF; and affects calcium signaling and cytoskeletal modifications in glia65. Our WTTS-seq data revealed short, medium, and long 3′ UTRs in the rat Ntrk2 TK- isoform (Fig. 4E). In mice, the longer Ntrk2 TK- transcripts are preferentially targeted to apical dendrites66. Since the sequence of the rat 3′ UTR is highly conserved with the mouse sequence, it is plausible that an analogous dendritic localization mechanism is also in use in the rat (Fig. 4E). Interestingly, ‘Ntrk signaling’ was one of the pathways over-represented in the diurnal APA genes (Supplementary Table S3). APA sites in Src, Frs2, Atf1, Nras, Sh3gl2, Ntrk3, Mapk1, Grb2, Pik3r1, and Mapk14 contributed to this enrichment.
We found 118 genes that had a diurnal APA and an APA that cycles on a 12 h period. The Sorl1 gene is among these, exhibiting four different APAs with significant changes in our analyses; two diurnal, one cycled with a 12 h period, and one was reduced during recovery from sleep deprivation (Fig. 5). In total, there were seven APAs in the Sorl1 3′ UTR, three short, one medium and two long. The longest and most abundant isoform cycles per 12 h, the second longest and medium ones are diurnal and the shortest isoform is differentially expressed after SD (Fig. 5). SORL1 encodes an endosomal recycling receptor67, and a deficiency of SORL1 as well as many polymorphisms are strong risk factors for AD68,69. The mouse and human 3′ UTRs share extensive similarities including 5 APAs in mouse and 3 in human based on the PolyA_DB v3 (https://exon.apps.wistar.org/polya_db/v2/) and UCSC database70. Four microRNA binding sites with high probability of preferential conservation are in good alignment (TargetScanHuman v8.0)71. The first motif can be bound by five miRNAs (miR-25-3p, miR-32-5p, miR-92-3p, miR-363-3p, and miR-367-3p), while the second contains overlapping 7mer and 8mer motifs bound by miR-128-3p and miR-27-3p, respectively. The final two more distal sites are recognized by miR-153-3p and mir-137 (Fig. 5A). Sequences matching the consensus binding site for CPEB are present in the 3′ UTRs of all three species, with 2 in very good alignment. Cytoplasmic polyadenylation element binding protein (CPEB) facilitates mRNA trafficking to synapses and local translation72,73, and we have previously shown that the core clock-controlled Fabp7 mRNA74,75 contains functional CPE sites in its 3′ UTR to regulate translation76. Since APOE4, an apolipoprotein E variant with increased risk of AD77, disrupts FABP7 interaction with sortilin, (an APOE receptor similar to Sorl1), to interfere with neuroprotective lipid signaling78, this suggests circadian variation in local translation of CPEB-mediated polyadenylation of target mRNAs may be a generalizable mechanism that modulates AD susceptibility through downstream lipid pathways. Any one or more of these conserved features could lead to conserved functional consequences dependent on APA choice.
One caveat to our approach is that WTTS-seq generates Ion Torrent PGM sequences which may retain more noise compared to Illumina platform reads and since only Illumina has the option of paired-end reads, there can be more uncertainty in mapping Ion Torrent reads. Our strategy was to capture the maximum number of PASs, including the discovery of novel PASs, and the rat genome is not as thoroughly annotated as some other vertebrate species, we therefore included potentially intergenic reads. In our analysis, we found 5122 PASs and 318 diurnal PASs that mapped outside of known genes, and many APAs within genes mapped to regions in which 3′ ends have yet to be annotated. Based on prior WTTS-seq data sets and other PAS mapping approaches, some portion of our PASs could be method-based artifacts27,79, (see Zhou et al. 27. Figs. 3, 4 and 5). In this, our initial PAS survey, we assayed a large portion of the brain. Therefore, future studies in restricted brain structures or cell types will be required to uncover APAs that cycle or are differentially expressed at a finer scale. Overall, the newly discovered PASs should add valuable insights into regulation of the rat transcriptome and for characterizing PAS usage in the mammalian brain.
Here we used an unbiased discovery-based approach for uncovering novel APA usage following time-of-day or changes in sleep pressure in mammalian brain. These data leverage a call to action for additional work to elucidate the core mechanisms of PAS usage in the brain and to examine the capacity of APA to affect the transcriptomes and proteomes that regulate central brain processes known to be altered by time-of-day and sleep/wake homeostasis. Moreover, it known that PAS usage varies across brain region and cell-type21 (i.e., substructure-, circuit-, laminar- or nucleus-specific)80. These hypothesis-generating data provide an impetus for continued research aimed at delineating how sleep and circadian rhythms impact mental health and neurodegenerative disease.
Methods
Subjects
All animal procedures were carried out in accordance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals and ARRIVE and OLAW guidelines and approved by the WSU Institutional Animal Care and Use Committee (IACUC; ASAF# 6804). Male Long Evans rats (7–9 weeks old) were housed in pairs at 22 ± 2 °C on a 12:12 h light-dark cycle. The rats were acclimated to this light cycle for at least 10 days prior to tissue collection, with water and chow ad libitum. Cages were cleaned weekly (between 8 and 11 AM) unless the rats were being euthanized within 24 h. Thirty rats were randomly assigned to one of six groups (n = 5/group) that were sampled every 4 h, beginning 2 h after light onset (zeitgeber time (ZT)) (i.e., ZT2, 6, 10, 14, 18, and ZT22). For the sleep deprivation (SD) study, twenty rats were randomly assigned to 6 h SD from ZT0–6, wherein rats were kept awake by an automated bedding stir bar (Pinnacle) at the bottom of a cylindrical cage. The bar was set to rotate for 4 s, randomly changing rotation direction, and stopped for a random interval ranging from 10 to 30 s81,82. Following SD, rats (n = 5/time point) were euthanized immediately (R0) by live decapitation or were returned under red light to their home cage for 2 h (R2), 4 h (R4), or 8 h (R8) without disruption before sampling. Five additional rats were euthanized at ZT8 as undisturbed, time-matched controls. The other time-matched controls with undisrupted sleep (i.e., ZT6, 10, and 14) were taken from the corresponding time-of-day matched samples described above. Ambient lighting was matched in all R vs ZT rat pairs. No animals or data points were excluded.
Tissue collection
Rats were decapitated by guillotine under normal room light (ZT2–10) or under dim red light (ZT14–22). Following decapitation, forebrains were resected (Fig. 1A), frozen in 2-methylbutane suspended in dry ice, and then stored at −80 °C until homogenization for RNA extraction. Following tissue collection, total RNA isolates were coded and sent for WTTS-Seq and remained blinded until analyses.
RNA isolation
Just before RNA isolation, forebrains were removed from −80 °C storage and placed on dry ice. Prior to use, a stainless-steel mortar and pestle were cleaned with RNase Zap (Thermo Fisher) and 70% ethanol. The mortar was then partially filled with liquid nitrogen before a forebrain was added, pulverized, and placed in a conical tube. Between each sample, the mortar and pestle were cleaned with 70% ethanol. A small aliquot of sample was removed for RNA isolation using Trizol Reagent (Invitrogen), according to the manufacturer’s instructions. Purified RNA was resuspended in water, and concentration and purity were measured with a Nanodrop spectrophotometer (Thermo Fisher). Samples were stored at −20 °C until further processing was performed.
Library preparation
WTTS-seq libraries were prepared as described by Zhou et al. 27. Briefly, total RNA (2.5 µg) was incubated at 70 °C with 10× Fragmentation buffer (Invitrogen) for 3 min. The fragmentation reaction was halted by the addition of Stop Solution and incubation on ice for at least 2 min. Next, poly(A) + RNA was purified from the fragmented total RNA with Dynabeads Oligo (dT)25 (Invitrogen), according to the manufacturer’s directions, and used for first-strand cDNA synthesis in a 20 µL reaction mixture. First, 1.0 µL of barcode primer (100 µM) and 1.0 µL of a common SMART primer (100 µM) were annealed to the poly(A) + RNA template by heating to 65 °C for 5 min and incubating on ice for at least 2 min. Next, 4.0 µL of 5x First-strand buffer (Invitrogen), 1.0 µL of SuperScript III reverse transcriptase (Invitrogen), 1.0 µL of 0.1 M dithiothreitol, 2.5 µL of 10 mM dNTP, and 1.0 µL of RNase OUT (Invitrogen) were added to the mixture. First-strand cDNA was synthesized by incubating the mixture at 40 °C for 90 min in the presence of library-specific adaptors. Synthesis was terminated by heating the mixture at 70 °C for 15 min. RNases I (100 U/µL; Invitrogen) and H (2 U/µL; Invitrogen) were subsequently added and incubated with the mixture at 37 °C for 30 min to hydrolyze the remaining single-stranded RNA molecules and ensure that only single-stranded cDNA remained. RNase activities were terminated by heating the samples at 70 °C for 20 min. Following purification with solid-phase reversible immobilization (SPRI) beads, second-strand cDNA was synthesized from first-strand cDNA by asymmetric PCR. In addition to the cDNA, the 50 µL PCR reaction contained 1.0 µL of Phusion Hi-Fidelity DNA polymerase, 10.0 µL of 5X HF buffer, 1.0 µL of 0.4 µM barcode primer, 1.0 µL of 0.8 µM common primer, 1.0 µL of 10 mM dNTP, and nuclease-free water. The PCR reaction was carried out by heating at 95 °C for 30 s, followed by 20 cycles of 98 °C for 10 s, 50 °C for 30 s, and 72 °C for 30 s, with a final elongation step at 72 °C for 10 min. SPRI beads were used to purify and select 200–500 bp fragments from the final library. After quality control analyses, the size-selected library was sequenced with an Ion PGM Sequencer at the WSU Genomics Core Laboratory.
Raw read processing
Raw data were obtained from 55 samples and stored in FASTQ format. We filtered raw reads with the FASTQ quality filter in the FASTX Toolkit (v0.0.13), allowing for a minimum score of ≥10 for ≥50% of bases (http://hannonlab.cshl.edu/fastx_toolkit/). We trimmed T nucleotides or T-rich sequences located at the 5’ ends of the reads using Perl scripts, as described previously27. Trimmed reads of at least 16 bp in length were kept for further analysis.
Read mapping and poly(A) site clustering
For each dataset, the processed reads were aligned to the Rattus norvegicus genome (mRatBN7.2/rn7) using the torrent mapping program (TMAP, v3.4.1; http://github.com/iontorrent/tmap) with the unique best hits parameter (-a 0). Raw PASs supported by the uniquely mapped reads were extracted from SAM files and merged into a polyadenylation tag (PAT) file with a script previously used for WTTS-seq (freely available by contacting Dr. Zhihua Jiang, Washington State University). The PAT files were merged to determine the final PASs for all samples. PASs within 25 nucleotides of one another were grouped into one polyadenylation site cluster (PAC) using GetPolyaSiteCluster83. PACs were filtered taking into account the library size. For libraries that had less than 1.7 M reads, PACs were required to have ≥1 set of 5 biological replicates had ≥3 samples with ≥3 reads. For libraries with more than 1.7 M reads, at least 3 samples with ≥4 reads were required.
Gene annotation and usage of poly(A) sites
We annotated all the final PACs for PAS_ID, gene symbol, functional region, and other factors, as indicated, using Cuffcompare (v2.2.1)84, Perl scripts, and annotation file (GCF_000001895.5_Rnor_6.0_genomic.gtf; https://ftp.ncbi.nlm.nih.gov/). Clusters that mapped to mitochondrial genes were removed, then the number of PAS-covered reads was normalized85 to the total number of covered reads within each library and rescaled by a factor of 107.
Diurnal/ultradian PAS discovery
Using normalized PAS read counts as input, rhythmic patterns were identified using the MetaCycle32 R package meta2d, which synthesizes the results of three cycle analysis algorithms (ARSER, JTK_Cycle, and Lomb-Scargle). The five replicates were arranged sequentially over five days since at least two complete cycles of data are recommended86, and the analysis was run five times with different replicates inserted into each of the appropriate ZT time slots, and the median p-value, median BH.Q, average phase, amplitude, and relative amplitude were calculated according to32. We found when 5 meta2d runs were combined, they out-performed randomized data much better than a single run. Following the meta2d analysis as referenced in86, only the highly corroborated PASs that were significant (p < 0.05) in all five trials were used for all analyses. Plots of read counts use normalized reads per 107 and show the SEM of five biological replicates.
Detailed mapping of APA sites
The data supporting all figures depicting APA sites was from rat genome build BN7.2 and the UCSC (http://genome.ucsc.edu) and RDG (https://rgd.mcw.edu/rgdweb/homepage/) genome browsers70,87.)
Gene ontology and pathway analysis
Gene over-representation analysis was performed with the web-based tool WebGestalt34. Input gene symbol sets representing genes with cycling APA sites (p < 0.05 in 5 of 5 trials and >1 PAS) or APA sites that were differentially expressed with sleep pressure (p < 0.01, log2FC > 0.5 and >1 PAS), were compared to relevantly annotated rat genes using an output threshold of FDR ≤ 0.05. For phase-specific analysis, a sliding window of 5 h centered on each sample collection time point was used. For example, for phase ZT6, all PASs with average phase calculations that ranged from 3.5 to 8.5 were grouped. This produced a sufficient number of PASs to do gene over-representation analysis. The 0.5 h overlap in adjacent phase clusters was to account for uncertainty in phase calls.
Differential expression analysis of sleep deprivation/recovery
To evaluate the expression of PASs in sleep homeostasis experiments, PAS counts from rats recovering from 6 h SD were contrasted with time-matched controls (R0 vs ZT6, R2 vs ZT8, R4 vs ZT10, R8 vs ZT14). We removed high variation from the first principal component systematically, resulting in improved variance estimates for low read counts. Prcomp (in R) was used to perform principal component analysis (PCA) and to find eigenvectors by way of singular value decomposition. DESeq-2 with “Apeglm” Shrinkage88 and the Wald Test were used to generate test statistics in R software. The FDRtool was used to determine the Local FDR.
Supplementary information
Acknowledgements
The authors would also like to thank Michelle A. Schmidt, Jennifer J. Michal, Kyle C. Schuppe, Vivian Wei, and Brooke Rogers for technical assistance, and Amy Sullivan for an expert and detailed critique of the manuscript. This work was supported by NIH grant R35GM133440 “Characterizing evolutionarily conserved mechanisms underlying sleep, clocks, and memory”, and initial seed funds provided by the Department of Translational Medicine and Physiology, WSU.
Author contributions
C.D. and J.G. conceived of and designed the experiments. C.D. ran the experiments and collected the samples. C.F. ran the data analysis, wrote the main manuscript text, and prepared figures and tables. H.W. processed the sequence data, (mapping, clustering, and APA annotation). N.P. and A.D. carried out differential expression analyses. J.G., C.D., C.F., Z.J., and J.D. revised the manuscript. C.D. and J.G. funded the study. All authors reviewed and approved the final version of the manuscript.
Data availability
The PAS sequence data discussed here have been deposited in NCBI’s Gene Expression Omnibus89 and are accessible through GEO Series accession number GSE250324.
Competing interests
J.G. is the founder of Blood Brain Biotechnology, LLC, and a member of its scientific advisory board. Jon Davis is a principal scientist at Integrated Physiology Research, Novo Nordisk. All other authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
The online version contains supplementary material available at 10.1038/s44323-024-00012-2.
References
- 1.Abbott, S. M., Malkani, R. G. & Zee, P. C. Circadian disruption and human health: a bidirectional relationship. Eur. J. Neurosci.51, 567–583 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Grandner, M. A. & Fernandez, F. X. The translational neuroscience of sleep: a contextual framework. Science374, 568–573 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Medic, G., Wille, M. & Hemels, M. E. Short- and long-term health consequences of sleep disruption. Nat. Sci. Sleep.9, 151–161 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Baglioni, C. et al. Sleep and mental disorders: a meta-analysis of polysomnographic research. Psychol. Bull.142, 969–990 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Alachkar, A. et al. The hidden link between circadian entropy and mental health disorders. Transl. Psychiatry12, 281 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Logan, R. W. & McClung, C. A. Rhythms of life: circadian disruption and brain disorders across the lifespan. Nat. Rev. Neurosci.20, 49–65 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Musiek, E. S. & Holtzman, D. M. Mechanisms linking circadian clocks, sleep, and neurodegeneration. Science354, 1004–1008 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Krueger, J. M., Frank, M. G., Wisor, J. P. & Roy, S. Sleep function: toward elucidating an enigma. Sleep. Med. Rev.28, 46–54 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Krueger, J. M. & Tononi, G. Local use-dependent sleep; synthesis of the new paradigm. Curr. Top. Med. Chem.11, 2490–2492 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Anafi, R. C., Kayser, M. S. & Raizen, D. M. Exploring phylogeny to find the function of sleep. Nat. Rev. Neurosci.20, 109–116 (2019). [DOI] [PubMed] [Google Scholar]
- 11.Siegel, J. M. Sleep function: an evolutionary perspective. Lancet Neurol.21, 937–946 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Panda, S., Hogenesch, J. B. & Kay, S. A. Circadian rhythms from flies to human. Nature417, 329–335 (2002). [DOI] [PubMed] [Google Scholar]
- 13.Cirelli, C. The genetic and molecular regulation of sleep: from fruit flies to humans. Nat. Rev. Neurosci.10, 549–560 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Lane, J. M. et al. Genetics of circadian rhythms and sleep in human health and disease. Nat. Rev. Genet.24, 4–20 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Gerstner, J. R. et al. Normal sleep requires the astrocyte brain-type fatty acid binding protein FABP7. Sci. Adv.3, e1602663 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Jagannath, A., Taylor, L., Wakaf, Z., Vasudevan, S. R. & Foster, R. G. The genetics of circadian rhythms, sleep and health. Hum. Mol. Genet.26, R128–R138 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Mitschka, S. & Mayr, C. Context-specific regulation and function of mRNA alternative polyadenylation. Nat. Rev. Mol. Cell Biol.23, 779–796 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Tian, B. & Manley, J. L. Alternative polyadenylation of mRNA precursors. Nat. Rev. Mol. Cell Biol.18, 18–30 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Mayr, C. Evolution and biological roles of alternative 3’UTRs. Trends Cell Biol.26, 227–237 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Arora, A. et al. The role of alternative polyadenylation in the regulation of subcellular RNA localization. Front. Genet.12, 818668 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kang, B. et al. Infernape uncovers cell type-specific and spatially resolved alternative polyadenylation in the brain. Genome Res.10.1101/gr.277864.123 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Greenwell, B. J. et al. Isoform-specific regulation of rhythmic gene expression by alternative polyadenylation. bioRxiv10.1101/2020.12.12.422514 (2020).
- 23.Gendreau, K. L., Unruh, B. A., Zhou, C. & Kojima, S. Identification and characterization of transcripts regulated by circadian alternative polyadenylation in mouse liver. G38, 3539–3548 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Liu, Y. et al. Cold-induced RNA-binding proteins regulate circadian gene expression by controlling alternative polyadenylation. Sci. Rep.3, 2054 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Gruber, A. J. & Zavolan, M. Alternative cleavage and polyadenylation in health and disease. Nat. Rev. Genet.20, 599–614 (2019). [DOI] [PubMed] [Google Scholar]
- 26.Chen, W. et al. Alternative polyadenylation: methods, findings, and impacts. Genom. Proteom. Bioinform.15, 287–300 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zhou, X. et al. Accurate profiling of gene expression and alternative polyadenylation with whole transcriptome termini site sequencing (WTTS-Seq). Genetics203, 683–697 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Brutman, J. N. et al. Vapor cannabis exposure promotes genetic plasticity in the rat hypothalamus. Sci. Rep.9, 16866, 10.1038/s41598-019-53516-4 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Mohanan, N. K., Shaji, F., Koshre, G. R. & Laishram, R. S. Alternative polyadenylation: An enigma of transcript length variation in health and disease. Wiley Interdiscip. Rev. RNA13, e1692 (2022). [DOI] [PubMed] [Google Scholar]
- 30.Mittleman, B. E. et al. Alternative polyadenylation mediates genetic regulation of gene expression. Elife10.7554/eLife.57492 (2020). [DOI] [PMC free article] [PubMed]
- 31.Cui, Y. et al. Alternative polyadenylation transcriptome-wide association study identifies APA-linked susceptibility genes in brain disorders. Nat. Commun.14, 583 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Wu, G., Anafi, R. C., Hughes, M. E., Kornacker, K. & Hogenesch, J. B. MetaCycle: an integrated R package to evaluate periodicity in large scale data. Bioinformatics32, 3351–3353 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Bova, R., Micheli, M. R., Qualadrucci, P. & Zucconi, G. G. BDNF and trkB mRNAs oscillate in rat brain during the light-dark cycle. Brain Res. Mol. Brain Res.57, 321–324 (1998). [DOI] [PubMed] [Google Scholar]
- 34.Liao, Y., Wang, J., Jaehnig, E. J., Shi, Z. & Zhang, B. WebGestalt 2019: gene set analysis toolkit with revamped UIs and APIs. Nucleic Acids Res.47, W199–W205 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Zhu, B. et al. A cell-autonomous mammalian 12 h clock coordinates metabolic and stress rhythms. Cell Metab.25, 1305–1319.e1309 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Pan, Y. et al. 12-h clock regulation of genetic information flow by XBP1s. PLoS Biol.18, e3000580 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Meng, H. et al. XBP1 links the 12-hour clock to NAFLD and regulation of membrane fluidity and lipid homeostasis. Nat. Commun.11, 6215 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Tang, M. et al. An extra-clock ultradian brain oscillator sustains circadian timekeeping. Sci. Adv.8, eabo5506 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Asher, G. & Zhu, B. Beyond circadian rhythms: emerging roles of ultradian rhythms in control of liver functions. Hepatology77, 1022–1035 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Yuniati, L. et al. Tumor suppressor BTG1 promotes PRMT1-mediated ATF4 function in response to cellular stress. Oncotarget7, 3128–3143 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Giambruno, R. & Bonaldi, T. Dual role of PRMT1-dependent arginine methylation in cellular responses to genotoxic stress. Mol. Cell Oncol.7, 1743808 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Stacey, S. N. et al. A germline variant in the TP53 polyadenylation signal confers cancer susceptibility. Nat. Genet.43, 1098–1103 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Cheng, L. C. et al. Alternative 3’ UTRs play a widespread role in translation-independent mRNA association with the endoplasmic reticulum. Cell Rep.36, 109407 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Farberov, L. et al. Multiple copies of microRNA binding sites in long 3’UTR variants regulate axonal translation. Cells10.3390/cells12020233 (2023). [DOI] [PMC free article] [PubMed]
- 45.Griesemer, D. et al. Genome-wide functional screen of 3’UTR variants uncovers causal variants for human disease and evolution. Cell184, 5247–5260.e5219 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Ptitsyna, N., Boughorbel, S., El Anbari, M. & Ptitsyn, A. The role of alternative Polyadenylation in regulation of rhythmic gene expression. BMC Genom.18, 576 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Latypova, X. et al. Haploinsufficiency of the Sin3/HDAC corepressor complex member SIN3B causes a syndromic intellectual disability/autism spectrum disorder. Am. J. Hum. Genet.108, 929–941 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Naruse, Y. et al. Circadian and light-induced transcription of clock gene Per1 depends on histone acetylation and deacetylation. Mol. Cell Biol.24, 6278–6287 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Naruse, Y., Aoki, T., Kojima, T. & Mori, N. Neural restrictive silencer factor recruits mSin3 and histone deacetylase complex to repress neuron-specific target genes. Proc. Natl Acad. Sci. USA96, 13691–13696 (1999). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Vecsey, C. G. et al. Genomic analysis of sleep deprivation reveals translational regulation in the hippocampus. Physiol. Genom.44, 981–991 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Lyons, L. C., Chatterjee, S., Vanrobaeys, Y., Gaine, M. E. & Abel, T. Translational changes induced by acute sleep deprivation uncovered by TRAP-Seq. Mol. Brain13, 165 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Caba, M. et al. Oxytocinergic cells of the posterior hypothalamic paraventricular nucleus participate in the food entrained clock. Sci. Rep.11, 19957 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Kumar, J. R. et al. Relaxin’ the brain: a case for targeting the nucleus incertus network and relaxin-3/RXFP3 system in neuropsychiatric disorders. Br. J. Pharm.174, 1061–1076 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Goldsmith, C. S. & Bell-Pedersen, D. Diverse roles for MAPK signaling in circadian clocks. Adv. Genet.84, 1–39 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Talamanca, L., Gobet, C. & Naef, F. Sex-dimorphic and age-dependent organization of 24-hour gene expression rhythms in humans. Science379, 478–483 (2023). [DOI] [PubMed] [Google Scholar]
- 56.Gu, J. et al. TDP-43 suppresses tau expression via promoting its mRNA instability. Nucleic Acids Res.45, 6177–6193 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Gu, J., Chu, D., Jin, N., Chen, F. & Liu, F. Cyclic AMP-dependent protein kinase phosphorylates TDP-43 and modulates its function in Tau mRNA processing. J. Alzheimers Dis.70, 1093–1102 (2019). [DOI] [PubMed] [Google Scholar]
- 58.Lugli, G. et al. Plasma exosomal miRNAs in persons with and without Alzheimer disease: altered expression and prospects for biomarkers. PLoS ONE10, e0139233 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Chia, S. Y. et al. Upregulated blood miR-150-5p in Alzheimer’s disease dementia is associated with cognition, cerebrospinal fluid amyloid-β, and cerebral atrophy. J. Alzheimers Dis.88, 1567–1584 (2022). [DOI] [PubMed] [Google Scholar]
- 60.Wen, Q. et al. Lead-exposure associated miRNAs in humans and Alzheimer’s disease as potential biomarkers of the disease and disease processes. Sci. Rep.12, 15966 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Chen, C., Zhang, S., Wei, Y. & Sun, X. LncRNA RMST regulates neuronal apoptosis and inflammatory response via sponging miR-150-5p in Parkinson’s disease. Neuroimmunomodulation29, 55–62 (2022). [DOI] [PubMed] [Google Scholar]
- 62.Zhao, J., Wan, X. N., Zhu, J. P., Liu, Q. C. & Gan, L. LncRNA NEAT1 promoted MPP+‑induced ferroptosis via regulating miR‑150‑5p/BAP1 pathway in SK‑N‑SH cells. Acta Neurobiol. Exp.82, 226–236 (2022). [DOI] [PubMed] [Google Scholar]
- 63.Foley, H. B. et al. Depression, perceived stress, and distress during pregnancy and EV-associated miRNA profiles in MADRES. J. Affect Disord.323, 799–808 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Abdolmaleky, H. M., Martin, M., Zhou, J. R. & Thiagalingam, S. Epigenetic alterations of brain non-neuronal cells in major mental diseases. Genes10.3390/genes14040896 (2023). [DOI] [PMC free article] [PubMed]
- 65.Fenner, B. M. Truncated TrkB: beyond a dominant negative receptor. Cytokine Growth Factor Rev.23, 15–24 (2012). [DOI] [PubMed] [Google Scholar]
- 66.Chen, S. et al. 3’UTRs regulate mouse Ntrk2 mRNA distribution in cortical neurons. J. Mol. Neurosci.70, 1858–1870 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Mishra, S. et al. The Alzheimer’s gene SORL1 is a regulator of endosomal traffic and recycling in human neurons. Cell Mol. Life Sci.79, 162 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Andersen, O. M. et al. A genetically modified minipig model for Alzheimer’s disease with SORL1 haploinsufficiency. Cell Rep. Med.3, 100740 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Lee, H. et al. Cell-type-specific regulation of APOE and CLU levels in human neurons by the Alzheimer’s disease risk gene SORL1. Cell Rep.42, 112994 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Nassar, L. R. et al. The UCSC Genome Browser database: 2023 update. Nucleic Acids Res.51, D1188–D1195 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.McGeary, S. E. et al. The biochemical basis of microRNA targeting efficacy. Science10.1126/science.aav1741 (2019). [DOI] [PMC free article] [PubMed]
- 72.Huang, Y. S., Carson, J. H., Barbarese, E. & Richter, J. D. Facilitation of dendritic mRNA transport by CPEB. Genes Dev.17, 638–653 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Martin, K. C. & Zukin, R. S. RNA trafficking and local protein synthesis in dendrites: an overview. J. Neurosci.26, 7131–7134 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Gerstner, J. R. & Paschos, G. K. Circadian expression of Fabp7 mRNA is disrupted in Bmal1 KO mice. Mol. Brain13, 26 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Vanderheyden, W. M., Fang, B., Flores, C. C., Jager, J. & Gerstner, J. R. The transcriptional repressor Rev-erbα regulates circadian expression of the astrocyte Fabp7 mRNA. Curr. Res. Neurobiol.10.1016/j.crneur.2021.100009 (2021). [DOI] [PMC free article] [PubMed]
- 76.Gerstner, J. R. et al. Time of day regulates subcellular trafficking, tripartite synaptic localization, and polyadenylation of the astrocytic Fabp7 mRNA. J. Neurosci.32, 1383–1394 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Kim, J., Basak, J. M. & Holtzman, D. M. The role of apolipoprotein E in Alzheimer’s disease. Neuron63, 287–303 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Asaro, A. et al. ApoE4 disrupts interaction of sortilin with fatty acid-binding protein 7 essential to promote lipid signaling. J. Cell Sci.10.1242/jcs.258894 (2021). [DOI] [PMC free article] [PubMed]
- 79.Balázs, Z. et al. Template-switching artifacts resemble alternative polyadenylation. BMC Genom.20, 824 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Vanrobaeys, Y. et al. Spatial transcriptomics reveals unique gene expression changes in different brain regions after sleep deprivation. Nat. Commun.14, 7095 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Strobel, B. K., Schmidt, M. A., Harvey, D. O. & Davis, C. J. Image discrimination reversal learning is impaired by sleep deprivation in rats: Cognitive rigidity or fatigue? Front. Syst. Neurosci.16, 1052441 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Ward, C. P., Wooden, J. I. & Kieltyka, R. Effects of sleep deprivation on spatial learning and memory in juvenile and young adult rats. Psychol. Neurosci.10, 109–116 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Dong, M., Ji, G., Li, Q. Q. & Liang, C. Extraction of poly(A) sites from large-scale RNA-Seq data. Methods Mol. Biol.1255, 25–37 (2015). [DOI] [PubMed] [Google Scholar]
- 84.Trapnell, C. et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat. Protoc.7, 562–578 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Wang, H. et al. Genome-wide analysis and functional characterization of the polyadenylation site in pigs using RNAseq data. Sci. Rep.6, 36388 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Hughes, M. E. et al. Guidelines for genome-scale analysis of biological rhythms. J. Biol. Rhythms32, 380–393 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Vedi, M. et al. 2022 updates to the Rat Genome Database: a Findable, Accessible, Interoperable, and Reusable (FAIR) resource. Genetics10.1093/genetics/iyad042 (2023). [DOI] [PMC free article] [PubMed]
- 88.Zhu, A., Ibrahim, J. G. & Love, M. I. Heavy-tailed prior distributions for sequence count data: removing the noise and preserving large differences. Bioinformatics35, 2084–2092 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Edgar, R., Domrachev, M. & Lash, A. E. Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res.30, 207–210 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The PAS sequence data discussed here have been deposited in NCBI’s Gene Expression Omnibus89 and are accessible through GEO Series accession number GSE250324.