Abstract
Brain-Machine Interfaces can allow neural control over assistive devices. They also provide an important platform to study neural plasticity. Recent studies indicate that optimal engagement of learning is essential for robust neuroprosthetic control. However, little is known about the neural processes that may consolidate a neuroprosthetic skill. Based on the growing body of evidence linking slow-wave activity (SWA) during sleep to consolidation, we examined if there is ‘offline’ processing after neuroprosthetic learning. Using a rodent model, here we show that after successful learning, task-related units specifically experienced increased locking and coherency to SWA during sleep. Moreover, spike-spike coherence among these units was significantly enhanced. These changes were not present with poor skill acquisition or after control awake periods, demonstrating specificity of our observations to learning. Interestingly, time spent in SWA predicted performance gains. Thus, SWA appears to play a role in offline processing after neuroprosthetic learning.
INTRODUCTION
Brain-machine interfaces (BMIs) have the potential to seamlessly merge the computational power of the motor system with that of artificial electronic systems. Research into the development of BMIs has flourished over the past decade, leading to impressive demonstrations of rodents1–3, non-human primates4–10, and humans controlling prosthetic devices11–13 in real-time through modulation of neural signals. This body of work has also identified that adaptive processes, both in the motor system and in the algorithms that transform neural activity into prosthetic control signals (i.e. ‘decoders’), are essential for achieving stable neuroprosthetic control3,6, 8, 9, 14,15. For example, practicing neuroprosthetic control over multiple sessions in the setting of a fixed decoder is known to result in persistent improvements in performance14. Moreover, long-term adaptation of decoders at a rate that is seemingly concordant with neural learning rates can be essential for long-term stable performance15. Together, these studies underscore the importance of optimal engagement of long-term neural learning mechanisms9, 14, 15. However, the specific neural process that stabilizes a newly acquired prosthetic skill remains incompletely understood.
Based on the growing body of literature indicating that cortical slow-wave activity (SWA) is associated with ‘offline’ processing16–19, we hypothesized that such processing facilitates direct cortical control of an artificial actuator. Studies in humans indicate that offline processing during overnight sleep20–23 as well as during daytime naps24 facilitates memory consolidation. Motor task performance is also known to improve after sleep versus an equivalent amount of wakefulness21–25. While SWA is also directly implicated in the consolidation of motor skills26–30, it remains poorly understood how neural ensembles in motor cortex are precisely modified during SWA after skill acquisition. While most theories suggest that reactivation of cortical neurons during SWA is important for consolidation31, to our knowledge there is little experimental support of this for procedural memory formation. In this context, BMIs offer a powerful tool to directly modulate single units, thereby allowing precise characterization of how both task-related (tr) and task-unrelated (tu) cortical units are differentially processed in motor cortex during SWA. In these experiments, rats were trained to control the angular velocity of a feeding tube via direct modulation of cortical units through operant conditioning. We hypothesized that after neuroprosthetic skill acquisition, tr units would be differentially processed from tu units during SWA.
RESULTS
After implantation of microelectrodes in M1, we trained six rats to exert direct neural control of the angular velocity of a mechanical actuator that could also deliver water (Fig. 1a). A linear decoder with randomized weights typically converted the firing rates of two randomly selected units (hereafter direct units) into the angular velocity of the actuator (see Supplementary Fig. 1 and 2 and Methods for discussion of unit sorting using microwire arrays versus silicon probes in a tetrode configuration). We also recorded multiple other units that were not causally linked to actuator movements (hereafter indirect units). The decoder weights were held constant during the session in order to exclusively rely on neural learning mechanisms. Each trial started with the simultaneous delivery of an auditory tone and the opening of a door to allow access to the tube (Fig. 1a,b). At the start of each trial the angular position of the tube was set to 0° (P1). If the angular position of the tube was held for > 300 ms at position P2 (90°), a defined amount of water was delivered (i.e. successful trial). A trial was stopped if this was not achieved within 15 seconds (i.e. unsuccessful trial). At the end of a trial, the door was closed and the actuator returned to position P1.
Over the course of a typical two-hour practice session, animals showed improvements in task performance, with a significant reduction in the time to successful trial completion and a decrease in the number of unsuccessful trials (Fig. 1c, d). Specifically, out of 19 such training sessions, there were 15 ‘robust learning’ sessions with significant reductions for both metrics (Fig. 1d, n=15 sessions; ‘early’ completion time: 13.49 ± 0.75 seconds; ‘late’ completion time: 4.83 ± 0.42 mean ± s.e.m., p < 0.0001, one tailed t-test). Moreover, the percentage of unsuccessful trials significantly decreased from 33.56 ± 4.04% to 6.00 ± 1.51%. (p < 0.0001, one tailed t-test). For the remaining four sessions these values did not improve with training (i.e. ‘poor learning’). In the robust learning sessions, we also found that the path of the actuator taken from position P1 to P2 became direct (Fig. 1e).
As also illustrated in Figure 1f, for direct units that were causally linked to actuator movements via the decoder (i.e. task-related direct or trd), nearly all experienced significant changes in task-related modulation at the end of session (96 % of 27 trd units with a significant increase in modulation depth). As outlined in the methods, we assigned weights in the decoder that ranged from +1 to −1. While we did not see suppression of activity relative to the baseline, there was a significantly greater modulation of the units assigned positive weights (i.e. trd+) than negative weights (i.e. trd−) (Supplementary Fig. 3). Direct units were also significantly more likely to experience a change in modulation in comparison to indirect units (Fig. 1f, n=15 sessions; direct: 96.7% ± 3.5%; indirect: 65.7 ± 3.7%, p <0.001, t-test). However, based on the growing notion that subsets of indirect units likely contribute to neuroprosthetic control32–34, we further sub-classified such units as either task-related (tri) or task-unrelated (tu) based on changes in modulation depth with learning. A unit was assigned as task-related if its post-learning change in firing rate was 2.5 standard deviation above the baseline firing rate (see Methods for details). Moreover, we did not find a preponderance of particular cell types in either category as determined by the width of the record spike (Supplementary Fig. 1b,c, and d). For subsequent analysis we respectively used 27 trd, 108 tri, and 39 tu units in 15 BMI training sessions (see Supplementary Table 1 for details). Figure 1g illustrates the distribution of changes in modulation depth for each of the three categories. Interestingly, after learning, the temporal profiles of activation (i.e. Fig. 1e for single examples) for trd and tri were not significantly different (time to peak firing, direct 3.1 ± 0.2 and tri 3.3± 0.1 seconds, p=0.24, t-test), suggesting that these two distinct groups of task-related units became functionally and temporally coupled with learning and successful task performance. We also saw evidence for changes in the spike-triggered local field potential (LFP) for task-related units that was not present for the task unrelated units (Supplementary Fig. 4).
Increased locking to SWA after learning
Because of the literature indicating a link between slow-wave sleep (SWS) and motor learning16–18, we first examined whether the spike-field relationship during SWS is altered after robust learning sessions. To look for signatures of offline processing, we first assessed whether individual units (trd, tri and tu) experienced changes when comparing the pre- and post-SWS around a successful training session (hereafter SWSpre and SWSpost; see Methods and Supplementary Fig. 5 for identification of SWS epochs). Specifically, we used spike-triggered averaging (STA) to quantify the relationship between spiking and SWA. The STA provides an intuitive estimate of how neural spiking is modulated by cortical oscillations. In order to match SWS durations, we used the first 10 minutes of SWSpre and SWSpost for this analysis (see Methods). Interestingly, we found that trd and tri units consistently experienced a significant increase in the peak-to-peak amplitude of the STA in comparison to tu units even in the absence of any changes in the LFP power (Fig. 2). This appeared to be the result of a greater likelihood of spikes occurring at a specific phase of the SWA (Fig. 2a). While trd and tri units experienced respective 48.87 ± 7.23 % and 54.37 ± 8.46 % increases in STA amplitude, tu units experienced a significant net −11.68 ± 4.89 % reduction (Fig. 2c, p < 0.0001, one-way ANOVA for overall, post hoc t-test for trd and tu; and tri and tu ). In contrast, we did not find any significant differences when we calculated the STA for spindle (8 – 20Hz) and ripple (100 – 300 Hz) frequency bands (p>0.05, one-way ANOVA, Supplementary Fig. 6). We next assessed whether changes in the power or frequency of SWS or firing properties of units could account for our observations. There were no significant changes in either the SWA power (average change of 3.11 ± 6.48 %, p=0.79 on paired t-test of absolute values, Fig. 2e and Supplementary Fig. 7), density of delta waves in SWS (11.82 ± 5.67 waves min−1 vs. 12.86 ± 6.09 delta waves min−1, p=0.16, Supplementary Fig. 8c), firing rate (6.42 ± 0.43 Hz vs. 7.01 ± 0.34 Hz, p = 0.270, Supplementary Fig. 1e,f and g), or changes in bursting properties (Supplementary Fig. 8a, b). We also did not find any evidence for a significant relationship between the changes in the firing rates of individual units and changes in STA amplitude (linear regression R2 = 0.01, p=0.29; Supplementary Fig. 1e also shows plots of the firing rate in SWSpre versus SWSpost) or changes in the LFP power relative to change in STA amplitude (linear regression R2= 0.033, p= 0.64). Moreover, we did not find evidence for spatiotemporal changes in the occurrence and rate of SWA (Supplementary Fig. 9).
To further establish a link between the task-dependent modulation that emerges after learning and the change in phase-locking to SWA we examined the relationship between the extent of task-dependent modulation and the percent changes in the STA amplitude. Interestingly, we found that the degree of task-related firing rate modulation significantly predicted the extent of increased STA amplitude during the subsequent slow-wave sleep (Fig. 2d, R = 0.85, P < 0.05, Spearman correlation). The non-linear nature of the curve is likely the result of the significant reduction in STA amplitude without a mirror symmetric reduction in depth of modulation for tu units. Also consistent with the notion of a link between the extent of task-dependent modulation and the change in STA amplitude was the finding that trd+ units experienced a significantly greater change in the STA amplitude than trd− units (Supplementary Fig. 3d). Together, these results suggest that after successful learning of direct cortical control there is greater locking of the task-related cortical spiking to SWA.
To further confirm that changes in STA amplitude reflect greater phase-locking of trd and tri units, we used spike-field coherence (SFC) to measure coherence in the 0.3–3Hz band between the spiking activity and the local field potential (LFP). The SFC measure complements the STA analysis in that it is not sensitive to changes in power and can readily assess all frequencies. The magnitude of SFC, which varies as a function of frequency and yields a value between 0 and 1, showed a similar trend of enhanced SFC for trd and tri units in the SWA frequency range (Fig. 3 and Supplementary Fig. 10 for the full spectrum and phase analysis). Consistent with the STA results in the spindle/ripple bands, there were no significant differences for other frequency bands (i.e multitaper method using jackknife-based confidence intervals). The three groups trd, tri and tu respectively experienced a change in SFC magnitude of 53.24 ± 15.02%, 60.95 ± 8.89% and −20.39 ± 11.42%. Thus, task-related units (i.e. both trd and tri) showed a significant increase in SFC magnitude compared to tu units (p < 0.0001, one-way ANOVA for overall, post hoc t-test for trd and tu; and tri and tu). There were no significant changes in SFC phases of the task-related units between SWSpre and SWSpost (0.18 ± 0.14 radians, mean ± s.e.m., net change in phase and p=0.44, Watson-Williams circular t-test for comparison of phases pre- and post-learning, Supplementary Fig. 10). Together with the findings from the STA analysis, these results confirm that tr units, in contrast to tu units, were significantly more phase-locked to SWA after successful learning.
Increased spike-spike coherence after learning
We subsequently assessed whether there are changes in the functional connectivity among the recorded M1 neural ensembles during SWSpost. We calculated the magnitude of spike-spike coherence (SSC) for both SWSpost and SWSpre for direct units (trd) relative to all other units (i.e. trd - trd, trd - tri, and trd - tu neuronal pairs). We used a ‘shuffling’ method to assess significant SSC increases in SWSpost in comparison to SWSpre. For example in Figure 4a, only the SWSpost SSC curves showed a significant increase in the 0.3–3 Hz band (i.e. SWA). In Figure 4b, however, both SSC curves were not significantly greater than chance. We found surprisingly robust increases in SSC (0.3–3 Hz range) for task-related pairs (i.e. trd - trd and trd - tri) after successful learning (Fig. 4c, 92.65 ± 15.55 % increase for 12 trd - trd pairs and 59.84 ± 12.04 % increase for 108 trd - tri pairs). In comparison, the change in SSC for the trd -tu pairs was −35.96 ± 15.55 % (Fig. 4c, p<0.001, one-way ANOVA for overall three groups; post-hoc t-test for trd - trd, and trd - tu pairs; and trd - tri, and trd - tu pairs). When we used cross-correlation analysis, we also found similar results (Fig. 4 a, b shows single examples and summary statistics of p<0.001 in Supplementary Fig. 11). This indicates that task-related units are significantly more likely to fire synchronously when compared with task-unrelated units during SWSpost.
Lack of changes after poor learning
We next determined whether these changes in STA, SFC and SSC are specific to successful learning. We thus analyzed four sessions in which animals performed the BMI task but did not demonstrate robust learning (Fig. 1d, Fig. 5a). During these sessions many of the trials ended because the 15 second timeout was reached. As shown in Figure 1d, the respective time to completion were 13.19 ± 1.10 seconds for early trials versus 12.76 ± 0.80 seconds for late trials (p = 0.72, one tailed t-test); and the percentage of unsuccessful trials remained unchanged (52.50 ± 3.29 % early versus 54.17 ± 6.36 %, for late; P = 0.68, one tailed t-test). In these sessions, tr units did not experience an increase in STA amplitude (Fig. 5a, b). The mean STA amplitude change for trd units was −12.24 ± 4.24 % (n=8 units). In addition, the SSC between trd-trd pairs did not increase in a manner similar to that during robust learning sessions (Fig. 5c, −12.67 ± 8.78 %, n=4 pairs). These were significantly different from the corresponding values in robust learning sessions (p < 0.001, t-test). Moreover, we also performed control experiments in which rodents were placed in the BMI chamber but did not perform any task (i.e. “control sleep” in Fig. 5b). Instead, they received an equivalent amount of water reward over a time period that matched a typical practice session. The STA before and after these control sessions did not show any increase. The overall change was significantly different from trd units from robust learning sessions (n=3 sessions with 32 units, −10.23 ± 3.76% STA change, p<0.01; t-test, see Fig. 5b). This suggests that factors such as sustained attention, reward and attempts at execution without evidence of learning are not sufficient to trigger increased coherent activation during the SWApost. Together, this further supports a conclusion that enhanced locking occurs only for task-related ensembles linked to learning.
Reactivation of emergent task-related ensembles
The analysis presented above examined changes in spike-field relationships and pairwise interactions among task related units after robust and poor learning. We next used a method previously developed to identify task-related neuronal cell assemblies and their coordinated reactivation35–37. This method uses principal components analysis (PCA, see Methods) to identify significant ‘signal components’, or ensembles of neurons that become temporally correlated during learning and task performance. The output of this analysis are principle components (PCs), consisting of an array of weights assigned to each unit in the identified ensemble, and the eigenvalue, a numerical value that represents the extent of total variance that is captured by a given PC; PCs with the largest eigenvalues capture the most variance (Fig. 6a,b). Whether a calculated PC represents a significant temporally correlated pattern of activity is determined by λmax, i.e. the highest eigenvalue that arises out of an equivalently sized random matrix based on the Marchenko-Pastur law 35–37 (Fig. 6b). Thus, PCs with eigenvalues greater than λmax were considered ‘signal components’ whereas those below λmax were considered as arising from chance interactions. We started by estimating the signal components (i.e. ensemble patterns of activity) linked to successful learning (Fig. 6a, b). Interestingly, we found evidence for significant signals only for robust learning sessions (Fig. 6b). In contrast, poor-learning sessions generated PCs that were never greater than λmax.
We then examined if there was a change in the ‘reactivation strength’ during SWSpost by calculating the instantaneous reactivation strength of signal components35–37. The reactivation strength is an instantaneous measure of how similar the ensemble activity during the SWS epoch is to that identified during the awake period using the PCA analysis. Importantly, by comparing the SWSpre and the SWSpost epochs we can identify changes that are specifically linked to learning control. After robust learning sessions, the observed strength of reactivation of the signal component was greatly enhanced in comparison to SWSpre (Fig. 6c,d illustrate the significant increase in the incidence of high reactivation strength events). Moreover, this was not evident during the awake spontaneous periods prior to the onset of SWS (n=15, logrank, Supplementary Fig. 12). Across multiple sessions, the reactivation strength of the task-related ensemble during SWSpost was significantly greater after robust learning in comparison to poor learning (Fig. 6d, logrank test, p <0.05). We also examined the specific relationship between instantaneous reactivation and SWA. We thus performed a delta-wave triggered averaging of reactivation strength. As shown in Figure 6 e,f, cell assembly reactivation events occurred in concert with maximum delta wave negativity. The ratio of peak reactivation at the time of maximum delta negativity to the baseline reactivation strength was significantly greater after robust learning than either after poor learning (Fig. 6e, f) or during spontaneous awake-periods after robust learning (p<0.0001, unpaired t-test, Supplementary Fig. 12c, d).
Improvements in task performance
We also conducted experiments in which the animals performed the task during two sessions (hereafter Block1 and Block2) using the same decoder but separated by a period of spontaneous activity and sleep (Fig. 7a). Interestingly, we found that task performance consistently improved at the start of Block2 (Fig. 7b, p<0.05 for each of the eight individual comparisons of the last 30 trials from Block1 and the first 30 trials from Block2, paired t-test). Such improvements were also evident if we compared the best performance at the end of Block1 with that of early Block2 (p<0.05, paired t-test, Supplementary Fig. 13). That increased motivation after rest does not account for this finding is supported by the lack of significant performance changes during Block2 itself (Fig. 7c, p > 0.05 for each of the eight comparisons). Moreover, even while the total period was the same between the two blocks, the time-spent in SWS after Block1 was positively correlated with the extent of improvement at the beginning of Block2 (R=0.67, p<0.05, Spearman Correlation). Interestingly, the extent of improvement was also positively correlated with the mean change in STA amplitude between Sleep1–2 for trd units (Fig 7d; R=0.85, P=0.01, Spearman correlation).
Changes with continued task performance
We first compared the STA amplitude and SFC magnitude changes for trd and tri units (n=8 experiments, 15 trd and 47 tri units for Sleep1–2 and Sleep2–3). With continued execution of the task there was no further increase in coherent spiking for task-related units (Fig. 7a, e-g). As shown in Figure 7, STA amplitude changes were −19.06 ± 7.55 % for trd units and −18.85 ± 4.62 for tri units, whereas SFC magnitude changes were −12.06 ± 8.42 % for trd units and −8.54 ± 6.09 for tri units, after the second training session (for Sleep2-3). We also examined SSC magnitude changes for task-related pairs (i.e. trd -trd and trd-tri). The changes in SSC magnitude were −21.92 ± 11.22 % for trd - trd pairs (n=7) and −16.36 ± 5.01 for trd - tri pairs (Fig. 7g, n=47 pairs). The small changes observed between Sleep2 and Sleep3 (i.e. Fig. 7 e–g, STAamp and SSCmag) were not significantly different from the absolute changes seen during poor learning sessions (p = 0.23 and 0.20 respectively for STAamp changes for comparisons of trd and tri units; and p=0.14 and 0.07 respectively for SSCmag changes for trd - trd pairs and trd - tri pairs). Together, this indicates that the continued execution of the task in the absence of new learning is not sufficient to trigger further increases in phase-locking and coherent spiking.
DISCUSSION
In summary, here we show that successful learning of direct neural control of an actuator is linked to enhanced phase-locking and coherent activation of emergent task-related activity, both direct and indirect, during the post-learning SWS. During the process of learning, these two sets of task-related units became increasingly activated with similar temporal profiles. Interestingly, we found strong evidence for coherent reactivation of these newly functionally coupled ensembles during SWA. In contrast, task-unrelated activity either remained unchanged or experienced a reduction in phase-locking and coherent activation. For control sleep sessions, poor-learning sessions, and second sessions without evidence of new performance gains, we did not find evidence of increased phase-locking to SWA or coherent activation, indicating specificity of our observed findings to new learning and skill acquisition. Interestingly, we found a positive correlation between both time spent in SWS and the extent of STA amplitude change and improvements in task performance upon awakening.
Processing of an emergent task-related ensemble
Learning was associated with the emergence of a novel task-related ensemble of both direct and indirect units. In our experiments, randomly chosen pairs of direct units were conditioned through feedback to be volitionally modulated in order to move an actuator. Consistent with a growing body of literature3, 33,34, modulation of direct units was accompanied by similar task-related firing rate changes in a subset of indirect units (i.e. tr). As shown in Figure 1 and outlined in the results section, nearly all direct and a subset of indirect units became increasingly modulated in a task-dependent manner with practice. The emergent functional coupling between these units appeared to be a consequence of learning and successful completion of the neuroprosthetic task. Because direct units are embedded in highly connected M1 cortical networks, it is possible that such modulation of adjacent indirect activity is important for precise cortical control.
During the post-learning SWS there was significantly higher coincident activation of the emergent task-related ensemble. We also found that the extent of firing rate modulation during task performance predicted the strength of the modifications evident during SWS (Fig 2d). In the absence of learning or new performance gains this was not present. This strongly suggests that the functional co-activation of units during successful learning is directly linked to the modified functional connectivity and coherent reactivation detected during SWS. We also noted a positive correlation between time spent in SWS and subsequent improvements in task performance. This observed performance gain is consistent with past studies suggesting that even brief periods of sleep can improve motor performance20–24. It remains unclear if our observed offline processing during SWA is sufficient and necessary to trigger performance gains upon awakening; there may be other processes recruited during SWS that could also contribute to task improvements17,38.
Broader link to sleep and memory
A growing body of literature has linked offline processing during sleep to memory consolidation17, 18. Memory formation in the hippocampal system is the most widely studied19,39–41. In general, after an initial encoding phase, hippocampus-based memories appear to undergo a process of consolidation in which its representation is stabilized to neocortical systems. While the precise mechanisms underlying such consolidation are incompletely understood, sleep-dependent interactions between hippocampal and cortical circuits have been linked to this process of consolidation. Spontaneous reactivation of neurons reflecting previous experiences have been found during both non-REM (i.e. including SWA) and REM sleep in both the hippocampus and cortex19. Recent studies have also suggested that both disruption of hippocampal circuits with electrical stimulation as well as sensory cue dependent facilitation of memory formations can respectively impede and facilitate memory formation, suggesting a direct link between the offline processing and the consolidation of episodic memories42,43.
Sleep, in general, and SWS, in particular, also appear to be important for the consolidation of procedural memories21–25. For example, after learning a new motor skill, offline processing during overnight sleep20–23 or brief daytime naps24 can facilitate consolidation. Local changes in SWA are also associated with motor learning and performance26–29. After new motor learning or sensory stimulation there appear to be local increases of SWA in specific cortical areas or hemispheres26,44. In contrast, disuse and inactivity can locally decrease SWA and also deteriorate task performance28. It remains incompletely understood, however, how neural ensembles in motor cortex are precisely modified during SWA after skill acquisition. In this context, BMIs offer a powerful tool to directly modulate neurons in motor cortex, thereby allowing precise characterization of how both task-related and task-unrelated ensembles are differentially processed in motor cortex during SWA. While the exact link between neuroprosthetic learning and general procedural learning remains unclear, our results suggest that the emergent task-related ensembles could be also processed in a similar manner after natural motor learning.
Implications for neuroprosthetic control
An important goal of the field of BMIs is to allow stable control of complex devices over long periods of time. While decoder adaptation can speed the overall rate of skill acquisition, adaptation of neural circuits and neural plasticity over longer periods of time may be essential for stable skill acquisition. This may be especially important for skilled control over complex devices that resemble our natural control of limbs. For example, a recent study highlighted that optimal recruitment of long-term neural plasticity is essential for achieving flexible control that resembles our natural abilities14. The study illustrated that long-term neural plasticity (i.e. across multiple sessions) is essential for achieving BMI control that can readily switch between two control schemes without interference. Specifically, only after long-term stabilization of a single control scheme, was simultaneous acquisition of a second control scheme possible. Based on the parallels between motor and neuroprosthetic control, a reasonable hypothesis is that consolidation of ‘prosthetic memory’ is essential for such apparent resistance to interference. Our observed phenomenon of offline processing during SWS may consolidate such a prosthetic memory.
Conclusion
We provide direct evidence that an emergent group of task-related units, which were increasingly functionally coupled with learning and task performance, experienced coherent reactivation during SWSpost. Knowledge about this phenomenon should help better account for mechanisms of neural plasticity and further the goal of achieving stable long-lasting neuroprosthetic control that resembles natural motor control.
FULL METHODS
Animals/Surgery
Experiments were approved by the Institutional Animal Care and Use Committee at the San Francisco VA Medical Center. We used six adult Long-Evans male rats. Animals were kept under controlled temperature and a 12-hour light: 12-hour dark cycle with lights on at 06:00 AM. Probes were implanted during a recovery surgery performed under isofluorane (1–3%) anesthesia following induction with ketamine (2mg/kg body weight or b.w.) and xylazine (1 mg/kg b.w.). Atropine sulfate was also administered prior to anesthesia (0.02 mg/kg b.w.) The post-operative recovery regimen included administration of buprenorphine at 0.02 mg/kg b.w and meloxicam at 0.2mg/kg b.w. Dexamethasone at 0.5 mg/kg b.w. and Trimethoprim sulfadiazine at 15 mg/kg b.w. were also administered post-operatively for five days. We used 32-channel microwire arrays (33 µm polyimide coated tungsten microwire arrays) in four rats. We also used 32-channel silicon probe arrays in a tetrode configuration in two rats. Arrays were lowered down to 1400–1800 µm in the primary motor cortex (M1) in the upper limb area (1–3 mm anterior to bregma and 2–4 mm lateral from midline). The reference wire was wrapped around a screw inserted in the midline over the cerebellum. Final localization of depth was based on quality of recordings across the array at the time of implantation. In one rat we implanted electromyogram (EMG) wires into neck muscles. Specifically, pairs of Teflon-coated stainless steel wires (AM Systems) were inserted in neck muscles and tunneled subcutaneously to a connector on the cap. All animals were allowed to recover for 1-week prior to start of experiments.
Electrophysiology
We recorded extracellular neural activity using tungsten microwire electrode arrays (MEAs, n=4 rats, Tucker-Davis Technologies or TDT, FL) or silicon probes in a tetrode configuration (n=2 rats, Neuronexus Technologies, MI). We recorded spike and LFP activity using a 128-channel TDT-RZ2 system (Tucker-Davies Technologies). Spike data was sampled at 24414 Hz and LFP data at 1018 Hz. ZIF-clip based analog headstages with a unity gain and high impedance (~1 GΩ) were used. Differential EMG was also recorded at 1018 Hz and high-pass filtered with low-frequency cut off at 100 Hz. Only clearly identifiable units with good waveforms and high signal-to-noise were used. The remaining neural data was recorded for offline analysis. Behavior related timestamps (i.e. trial onset, trial completion) were sent to the RZ2 analog input channel using an Arduino digital board and synchronized to neural data.
We have used the term ‘unit’ to refer to the sorted spike recordings from both the MEA and tetrode recordings. For both, we initially used an online sorting program (SpikePac, TDT) for neuroprosthetic control (i.e. either in single channel or tetrode mode). We then conducted offline sorting. We sorted the MEA recordings using standard offline cluster cutting methods in TDT’s OpenSorter software. We subsequently used waveform shape and the presence of an absolute/relative refractory period in the interspike interval (ISI) to judge quality of isolation (Supplementary Fig. 1)14. Offline tetrode sorting was performed using a method described previously (Supplementary Fig. 2)45,46. Specifically, a voltage-based threshold was set based on visual inspection for each channel that allowed for best separation between putative spikes and noise; typically this threshold was at least 4 standard deviation (SD) away from the mean. Events were time-stamped and waveforms for each event were peak aligned. K-means clustering was then performed across the entire data matrix of waveforms (30 samples/ch × 4 chs × # of waveforms). Automated sorting was performed by: (1) first over-clustering wave-forms using a K-means algorithm (i.e. split into many mini-clusters), (2) followed by a calculation of interface energy (a nonlinear similarity metric that allows for an automated decision of whether mini-clusters are actually part of the same cluster), and (3) followed by aggregation of similar clusters. Such aggregation allows for a reduction in the total numbers of clusters that need to be manually inspected. Automated sorting was followed by manual inspection and sorting of spikes (including both merging and splitting clusters further, and removing significant outliers based on Gaussian distribution), using feature space, autocorrelations, cross-correlations and linear discriminant analysis to determine which clusters represent single units and to prevent over-sorting. Supplementary Fig. 1 and 2 illustrates examples of sorting achieved with the MEA and silicon probe tetrode recordings.
Behavior
After recovery, animals were typically gentled for several days prior to the start of experimental sessions. Animals acclimated to a custom plexiglass behavioral box (Fig. 1a) during this period. The box was equipped with a slit (covered with a door at one end) that served as a drinking zone. Initially, water delivery from the actuator was not introduced to the rats and they were just acclimatized to the box. Towards the end of the acclimation period, the rats typically fell asleep while in the box. Animals were then water scheduled such that water (from the feeding tube illustrated in Fig. 1a) was available in a randomized fashion while in the behavioral box. We monitored body weights on a daily basis to ensure that the weight did not drop below 95% of the initial weight.
Behavioral sessions were typically conducted in the morning, with second sessions conducted in the afternoon. Two single learning sessions were conducted in the afternoon. We found the same results for these experiments. We recorded neural data from the rats for 2 hours prior to start of BMI training. The rats were then allowed to perform the task over a 2-hour session. Recorded neural data was entered in real-time to custom routines in Matlab. These then served as control signals for the angular velocity of the feeding tube. The rats typically performed ~180–200 trials for robust learning sessions (e.g. Fig. 1c, Fig. 6a). Following this, we recorded neural data from animals for a 2-hour period. For the two session experiments, the animals then continued with another 2-hour training session followed by another spontaneous activity recording. Sorted units at the beginning of the recording were checked for maintenance throughout the second training session. We also collected ‘control sleep’ data in a subset of animals. After an initial spontaneous activity period with sleep, we delivered random automatic water rewards over a period to match a typical time period for task performance. We then recorded spontaneous activity and sleep.
Neural control of the feeding tube
During the BMI training sessions, we typically selected two well-isolated units as ‘direct’ and allowed their neural activity to control the angular velocity of the feeding tube (Fig. 1c). In three of the 15 sessions, there was only one neuron selected as the direct unit. These units maintained their stability throughout the recording as evidenced by stability of waveform shape and interspike-interval histograms14. We binned the spiking activity into 100 ms bins. We then established a mean firing rate for each neuron over a 3–5 minute baseline period. The mean firing rate was then subtracted from its current firing rate at all times. The specific transform that we used was:
where Θv was the angular velocity of the feeding tube, r1(i) and r2(i) were firing rates of the direct units. G1 and G2 were randomized coefficients that ranged from +1 to −1 and were held constant after initialization. C was a fixed constant that scaled the firing rates to angular velocity. The animals were then allowed to control the feeding tube via modulation of neural activity. The tube started at the same position at the start of each trial (P1 in Fig. 1a). The calculated angular velocity was added to the previous angular position at each time step (100 ms). During each trial, the angular position could range from −45 to +180 degrees. If the tube stayed in the ‘target zone’ (P2 in Fig. 1a; spanned 10° area) for a period of 300 ms a water reward was delivered. In the beginning of a session, most rats were unsuccessful at bringing the feeding tube to position P2. Most rats steadily improved control and reduced the time to completion of the task during the first session. In some cases, animals did not improve control (i.e. poor learning shown in Figure 4). In subsequent sessions these animals did demonstrate that they could learn the task. As shown in Supplementary Table 1, multiple learning sessions were obtained from each animal. These sessions were typically 1 week apart in order to ensure that new units were recorded. Consistent with past studies, we also found that incorporation of new units into the control scheme required new learning8,14,15. In many sessions we videotaped the rat during the BMI training blocks. Consistent with multiple reports we did not observe movements that systematically predicted feeding tube movements2,15,32. Specifically, we analyzed whether limb movements measured using the video recording (i.e. markers manually assigned to the head, torso and each limb using image processing software) co-varied with movements of the feeding tube. Across multiple sessions we did not find evidence for significant co-variation (data not shown). This is likely due to the fact that non-movement related random weights were assigned to the direct units.
Data Analysis
Sessions and changes in performance
Analysis was performed in Matlab (Mathworks, Natick, MA) with custom-written routines. A total of 19 training sessions recorded from 6 rats were used for our initial analysis. Fifteen of these sessions had ‘robust learning’ (i.e. > 3 SD drop in time to completion in the last 1/3 of trials or ‘late’ trials in comparison to the first 1/3 of trials or ‘early’ trials). Eight of these 15 sessions also had a second follow-up training session (i.e. Fig. 7). Four additional sessions were classified as ‘poor learning’ (i.e. no significant improvement in time to task completion). Supplementary Table 1 summarizes the Session details. In addition we analyzed three separate ‘control sleep’ sessions. As outlined above, during these experiments we matched the time awake using a randomized reward schedule that did not require learning. For Figure 7b and c we compared changes in task performance between and across sessions. Specifically, we compared the performance change between Block1 and Block2 by calculating the mean and standard deviation of the time to completion during the last 30 trials in Block1 and the first 30 trials Block2 (Fig. 7b). We also compared the first and last 30 trials within Block2. Furthermore, we also compared the best performance in Block1 (30 trials with the shortest time to reward) and the first 30 trials Block2 (Supplementary Fig. 13). We used a paired t-test to assess statistical significance.
Task-related activity
The distinction between trd, tri and tu units was based on whether significant modulation of baseline firing activity after the ‘Go cue’ was present (i.e. peak of modulation at the time > 2.5 SD above the baseline period). For the correlation between task-related modulation and changes in amplitude of the STA, the percentage change in modulation was calculated for firing rates during the first four seconds after the ‘Go Cue’ relative to 3 seconds of the baseline segment (i.e. prior to the Go Cue).
Putative cell-type identification
This was performed by comparing the widths between the first trough in the negatively-detected spikes and the first peak after that on the mean waveforms (Supplementary Fig 1b–d).
Identification of NREM oscillations
Identification of pre and post-SWS epochs was performed by combined visual assessment of low-frequency, high amplitude slow-wave oscillations as well as a 3 SD threshold of the filtered data (0.3 – 3 Hz)47,48. If there was a sustained reduction > 1.5 seconds in the amplitude of the SWA below threshold during a continuous epoch we excluded these segments. For the analysis of STA’s and coherence we matched the SWS time periods used for each (first 600 seconds). For subsequent analysis (i.e. Spearman correlation below) we used the entire time spent in SWS per session. Delta waves were detected directly from a filtered, z-scored SWS in the ‘median LFP channel’ in each animal (i.e. median value of z-scored LFP from all channels). After identification, only waves that contained a negativity larger than 80 µV (i.e. in the raw LFP) were included48. These criteria resulted in reproducible and specific detection of high-amplitude delta waves in pre and post SWS epochs. We classified SWS using video analysis in 5 rats and EMG recordings/video analysis in 1 rat. We used the EMG recordings to confirm SWS (Supplementary Fig. 5). The video recordings were processed using MATLAB (Image Processing Toolbox) to detect any movements. We specifically subtracted a baseline image of the cage environment from each frame, converted the difference into a black and white image, filtered the resulting image for large objects, and then used segmentation analysis to identify the rodent’s body/head. We found that the previously described ‘40 second rule’ 49 was a reliable indicator of sleep in comparison to EMG recordings. There were no significant differences between the duration of SWS (p=0.98, paired t-test) in the SWSpre and SWSpost blocks (Supplementary Fig. 5). Moreover, we also examined if there are changes in the frequency and temporal lags between channels with a trd and those with ‘distant’ tu’s (i.e. greater than 2 electrodes apart). We did not find any differences in the temporal latencies (p=0.76, unpaired t-test, Supplementary Fig. 9b) and the OFF period density (p=0.77, unpaired t-test, Supplementary Fig. 9c).
Spike-triggered average
We first calculated the STA to measure how spikes locked to the SWA. For the SWS, we used both the filtered (0.3–3Hz) and the unfiltered LFP. Given the predominance of high amplitude low-frequencies these were not significantly different (Figure 2b). For the filtering we used a zero-phase distortion filter in MATLAB48. While we used 600 sec for comparing the changes shown in the figure, we found that effects were similar even if all recorded SWS durations were used (data not shown). However, for subsequent analysis in the spindle (8–20 Hz) and ripple (100–300 Hz range) bands we used filtered data to minimize contamination from other bands48. During the task, we found that the task related STA changed only in the ‘beta’ range (i.e. 12–40Hz band but not in other bands, p<0.0001, one-way ANOVA, Supplementary Fig. 4). We used the first 5 seconds after the start of the trial to calculate these STAs. We also used the LFP from a neighboring channel at a distance ranging from 150–250 µm (depending on if it was a MEA or silicon probe). For a subset of the data we also ensured that coherent spiking did not contribute to our results. Thus, we excised LFP data around the spike times (in the neighboring channel) and linearly interpolated the excised segment using neighboring data points. Our results were identical both with and without this procedure.
Coherency measures
We used the Chronux toolbox to calculate both the spike-spike coherence (SSC) and spike-field coherence (SFC)50 (available online at : http://chronux.org/). The magnitude of both is a function of frequency and takes values between 0 and 1. We segmented the pre and post-SWS into 20-second segments and then averaged the coherency measures across segments. For the multitaper analysis we used a time-bandwidth (TW) product of 10 with 19 tapers. To compare coherences across groups, a z-score was calculated using the programs available in the Chronux Toolkit. Coherence between activity in two regions was calculated and defined as:
where Rxx and Ryy are the power spectra and Rxy is the cross-spectrum. Spectral analysis were calculated in segmented SWS epochs and averaged across these epochs across animals. Mean coherence in the slow-wave band was calculated for the 0.3 – 3 Hz band. Significance testing on coherence estimates was performed on mean slow-wave band estimates across spike-field and spike-spike pairs using one-way ANOVA and independent samples t-tests for multiple comparisons. Circular statistics were used for comparing phases of the different cell types before and after learning. Specifically, Raleigh test for homogeneity was used to check the presence of a uniform distribution of the phases and Watson-Williams circular t-test was used to compare changes in phase. The change of magnitude for SFC and SSC were analyzed between the 0.3 and 3 Hz frequency band. For SSC analysis, task-related direct unit with greatest depth modulation was used to calculate SSC for every other unit. For tu - tu and tri - tri pairs, the least modulated unrelated unit and greatest modulated indirect unit were chosen respectively. The power-spectrum of the LFP channels used in the coherence calculation as well as for overall SWA power change in pre and post SWS was also determined using the multitaper method (Supplementary Fig. 7).
Spike cross-correlation
We computed the cross-correlation histogram (CCH) for the same trd – trd, trd -tri and tu - tu pairs that were evaluated during the SSC analyses. The spike counts were equalized in pre and post SWS and the pairwise coincident firing was quantified by taking the ratio of the peak (within a time period of ±5 ms) in the center with tail on side (time period of ±10–20ms; see Supplementary Fig 11). For a subset of data we also constructed pseudo random spike train CCH with identical spike counts, mean firing rate and duration to the experimental data. Such simulations were run 200 times (Monte Carlo Simulations). Significant CCH in these cases were the ones with higher central peaks than 99% of the simulated data. Only task related neuronal pairs were found to have significant CCHs.
Ensemble Reactivation Analyses
Reactivation analysis characterized the neural activity patterns in SWSpre and SWSpost using a template that was created during task execution in the awake period35–37. We computed a pairwise unit activity correlation matrix as described in Peyrache et al, 2010 for SWSpre, awake and SWSpost. The eigenvector with the largest eigenvalue in the awake correlation matrix served as the learning-related cell assembly. For all epochs, spike trains were binned (Δtbin = 50ms). The obtained spike counts for each cell [si (t), i = 1:n, t= 1:B, where n is the number of cells and B is the number of time bins in the epoch] were z transformed, obtaining the Q matrix
where < si > is the mean and is σsi the standard deviation.
The pairwise cell activity correlation matrix can be presented in following equation (1):
We thus obtained three spike count matrices, QSWSpre, Qawake and QSWSpost, and the three correlation matrices, CSWSpre, Cawake and CSWSpost. For quantification of the instantaneous match of activity during SWS to learning related activity during awake periods, we established a Ctemplate (i.e. awake) and Cmatch (i.e. SWSpre and SWSpost). As outlined in Peyrache et. al, we also calculated a measure of similarity, M, using the elements of Ctemplate and Cmatch:
To distinguish among groups, we calculated the eigenvectors of Ctemplate, namely p(l) (with p(1) associated with the largest eigenvalue λ1). We then decomposed Ctemplate into projectors, P(l), defined by the outer products of the eigenvectors with themselves.
We also generated a time series Rlmatch (t) that measured the instantaneous match of the l-th template, corresponding to the eigenvectors described below. Put another way, this value is the ‘reactivation strength’ or the instantaneous measure of how similar the ensemble activity during SWSpre and SWSpost is to that identified during the awake period.
with
Qijmatch is a vector of the multi-cell counts at time point t. Peyrache et. al further proposed the use of the Marcenko-Pastur distribution as the null hypothesis for the existence of cell assemblies. It was demonstrated that eigenvalues of the correlation matrix of a normal random matrix R with statistically independent rows follows a probability distribution described.
where
And σ2 is the variance of the elements of the random matrix R, which is 1 here (due to z scoring), and q = Rcolumns/Rrows ≥ 1. Under the null hypothesis of an uncorrelated matrix, the correlations between spike trains are determined only by random fluctuations and the eigenvalues of template awake matrix must lie between λmin and λmax. Eigenvalues greater than λmax are therefore a sign of nonrandom correlations in the matrix, and for this reason we refer to these principal components as signal. A reactivation time series (i.e. Figure 6c) measured the instantaneous match of this cell assembly to the ongoing activity35.
Statistical analysis
We performed one-way ANOVA with multiple comparisons wherever significance assessment was required (Fig. 2c,3c,4c,5b,c; Supp. Fig 1d,f,g, 4b 6a,b, and 11). We also used a paired t-test for comparisons between early and late trials and features of SWSpre and SWSpost and other relevant comparisons(e.g. time to reward and proportion of unsuccessful trials in and early and late trials, and time to reward comparison between block1 and block2 for second sleep dataset; delta wave density, PSD amplitude changes, duration changes, etc in SWSpre and SWSpost; change in firing rate of units, reactivation strengths in post awake and SWS epochs, etc). Unpaired t-test were also used for several comparisons namely, proportion of cells with significant MDchange in direct and indirect unit pools, STAamp change for units recorded with silicon probes versus multielectrode arrays, delta-triggered reactivation strength comparison in SWSpost from robust and poor learning sessions, etc. We also used linear regression to evaluate trends between firing rate change of individual units versus STA amplitude and PSD amplitude versus STA amplitude. Kolmogorov-Smirnov test was used for comparing burst distribution in ISI distributions from SWSpre and SWSpost. We used the spearman correlation between time spent in SWS and improvements in task performance because the distribution of SWS durations were not normal. For similar reasons, we used spearman correlation between the firing rate modulation and STA effect size. Logrank test was used to determine probability of higher strength reactivation strength in SWSpost after robust or poor learning and also in SWSpost versus post awake epochs after robust learning. If there was higher likelihood of greater reactivation strength in comparison to SWSpre, the difference of counts of reactivations strengths of higher magnitude would show right skewed histogram and logrank test can test if such right-skewed curves are different from each other.
Supplementary Material
References
- 1.Chapin JK, Moxon KA, Markowitz RS, Nicolelis MA. Real-time control of a robot arm using simultaneously recorded neurons in the motor cortex. Nat Neurosci. 1999;2:664–70. doi: 10.1038/10223. [DOI] [PubMed] [Google Scholar]
- 2.Koralek AC, Jin X, Long JD, 2nd, Costa RM, Carmena JM. Corticostriatal plasticity is necessary for learning intentional neuroprosthetic skills. Nature. 2012;483:331–5. doi: 10.1038/nature10845. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Arduin PJ, Fregnac Y, Shulz DE, Ego-Stengel V. “Master” neurons induced by operant conditioning in rat motor cortex during a brain-machine interface task. J Neurosci. 2013;33:8308–20. doi: 10.1523/JNEUROSCI.2744-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Carmena JM, et al. Learning to control a brain-machine interface for reaching and grasping by primates. PLoS Biol. 2003;1:E42. doi: 10.1371/journal.pbio.0000042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Serruya MD, Hatsopoulos NG, Paninski L, Fellows MR, Donoghue JP. Instant neural control of a movement signal. Nature. 2002;416:141–2. doi: 10.1038/416141a. [DOI] [PubMed] [Google Scholar]
- 6.Taylor DM, Tillery SI, Schwartz AB. Direct cortical control of 3D neuroprosthetic devices. Science. 2002;296:1829–32. doi: 10.1126/science.1070291. [DOI] [PubMed] [Google Scholar]
- 7.Santhanam G, Ryu SI, Yu BM, Afshar A, Shenoy KV. A high-performance brain-computer interface. Nature. 2006;442:195–8. doi: 10.1038/nature04968. [DOI] [PubMed] [Google Scholar]
- 8.Moritz CT, Perlmutter SI, Fetz EE. Direct control of paralysed muscles by cortical neurons. Nature. 2008;456:639–42. doi: 10.1038/nature07418. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Jarosiewicz B, et al. Functional network reorganization during learning in a brain-computer interface paradigm. Proc Natl Acad Sci USA. 2008;105:19486–91. doi: 10.1073/pnas.0808113105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Musallam S, Corneil BD, Greger B, Scherberger H, Andersen RA. Cognitive control signals for neural prosthetics. Science. 2004;305:258–62. doi: 10.1126/science.1097938. [DOI] [PubMed] [Google Scholar]
- 11.Hochberg LR, et al. Neuronal ensemble control of prosthetic devices by a human with tetraplegia. Nature. 2006;442:164–71. doi: 10.1038/nature04970. [DOI] [PubMed] [Google Scholar]
- 12.Hochberg LR, et al. Reach and grasp by people with tetraplegia using a neurally controlled robotic arm. Nature. 2012;485:372–5. doi: 10.1038/nature11076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Collinger JL, et al. High-performance neuroprosthetic control by an individual with tetraplegia. Lancet. 2013;381:557–64. doi: 10.1016/S0140-6736(12)61816-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ganguly K, Carmena JM. Emergence of a stable cortical map for neuroprosthetic control. PLoS Biol. 2009;7:e1000153. doi: 10.1371/journal.pbio.1000153. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Gilja V, et al. A high-performance neural prosthesis enabled by control algorithm design. Nat Neurosci. 2012;15:1752–7. doi: 10.1038/nn.3265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Stickgold R. Sleep-dependent memory consolidation. Nature. 2005;437:1272–8. doi: 10.1038/nature04286. [DOI] [PubMed] [Google Scholar]
- 17.Diekelmann S, Born J. The memory function of sleep. Nat Rev Neurosci. 2010;11:114–26. doi: 10.1038/nrn2762. [DOI] [PubMed] [Google Scholar]
- 18.Peigneux P, Laureys S, Delbeuck X, Maquet P. Sleeping brain learning brain. The role of sleep for memory systems. Neuroreport. 2001;12:A111–24. doi: 10.1097/00001756-200112210-00001. [DOI] [PubMed] [Google Scholar]
- 19.Lee AK, Wilson MA. Memory of sequential experience in the hippocampus during slow wave sleep. Neuron. 2002;36:1183–94. doi: 10.1016/s0896-6273(02)01096-6. [DOI] [PubMed] [Google Scholar]
- 20.Walker MP, Brakefield T, Morgan A, Hobson JA, Stickgold R. Practice with sleep makes perfect: sleep-dependent motor skill learning. Neuron. 2002;35:205–11. doi: 10.1016/s0896-6273(02)00746-8. [DOI] [PubMed] [Google Scholar]
- 21.Fischer S, Nitschke MF, Melchert UH, Erdmann C, Born J. Motor memory consolidation in sleep shapes more effective neuronal representations. J Neurosci. 2005;25:11248–55. doi: 10.1523/JNEUROSCI.1743-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cohen DA, Pascual-Leone A, Press DZ, Robertson EM. Off-line learning of motor skill memory: a double dissociation of goal and movement. Proc Natl Acad Sci USA. 2005;102:18237–41. doi: 10.1073/pnas.0506072102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Fischer S, Hallschmid M, Elsner AL, Born J. Sleep forms memory for finger skills. Proc Natl Acad Sci USA. 2002;99:11987–91. doi: 10.1073/pnas.182178199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Korman M, et al. Daytime sleep condenses the time course of motor memory consolidation. Nat Neurosci. 2007;10:1206–13. doi: 10.1038/nn1959. [DOI] [PubMed] [Google Scholar]
- 25.Walker MP, Brakefield T, Hobson JA, Stickgold R. Dissociable stages of human memory consolidation and reconsolidation. Nature. 2003;425:616–20. doi: 10.1038/nature01930. [DOI] [PubMed] [Google Scholar]
- 26.Huber R, Ghilardi MF, Massimini M, Tononi G. Local sleep and learning. Nature. 2004;430:78–81. doi: 10.1038/nature02663. [DOI] [PubMed] [Google Scholar]
- 27.Tamaki M, et al. Enhanced spontaneous oscillations in the supplementary motor area are associated with sleep-dependent offline learning of finger-tapping motor-sequence task. J Neurosci. 2013;33:13894–902. doi: 10.1523/JNEUROSCI.1198-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Huber R, et al. Arm immobilization causes cortical plastic changes and locally decreases sleep slow wave activity. Nat Neurosci. 2006;9:1169–76. doi: 10.1038/nn1758. [DOI] [PubMed] [Google Scholar]
- 29.Hanlon EC, Faraguna U, Vyazovskiy VV, Tononi G, Cirelli C. Effects of skilled training on sleep slow wave activity and cortical gene expression in the rat. Sleep. 2009;32:719–29. doi: 10.1093/sleep/32.6.719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Moroni F, et al. Procedural learning and sleep hippocampal low frequencies in humans. Neuroimage. 2008;42:911–8. doi: 10.1016/j.neuroimage.2008.05.027. [DOI] [PubMed] [Google Scholar]
- 31.Inostroza M, Born J. Sleep for preserving and transforming episodic memory. Annu Rev Neurosci. 2013;36:79–102. doi: 10.1146/annurev-neuro-062012-170429. [DOI] [PubMed] [Google Scholar]
- 32.Ganguly K, Dimitrov DF, Wallis JD, Carmena JM. Reversible large-scale modification of cortical networks during neuroprosthetic control. Nat Neurosci. 2011;14:662–7. doi: 10.1038/nn.2797. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Fetz EE. Volitional control of neural activity: implications for brain-computer interfaces. J Physiol. 2007;579:571–9. doi: 10.1113/jphysiol.2006.127142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Koralek AC, Costa RM, Carmena JM. Temporally Precise Cell-Specific Coherence Develops in Corticostriatal Networks during Learning. Neuron. 2013 doi: 10.1016/j.neuron.2013.06.047. [DOI] [PubMed] [Google Scholar]
- 35.Peyrache A, Khamassi M, Benchenane K, Wiener SI, Battaglia FP. Replay of rule-learning related neural patterns in the prefrontal cortex during sleep. Nat Neurosci. 2009;12:919–26. doi: 10.1038/nn.2337. [DOI] [PubMed] [Google Scholar]
- 36.Peyrache A, Benchenane K, Khamassi M, Wiener SI, Battaglia FP. Principal component analysis of ensemble recordings reveals cell assemblies at high temporal resolution. J Comput Neurosci. 2010;29:309–25. doi: 10.1007/s10827-009-0154-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Lopes-dos-Santos V, Ribeiro S, Tort AB. Detecting cell assemblies in large neuronal populations. J Neurosci Methods. 2013;220:149–66. doi: 10.1016/j.jneumeth.2013.04.010. [DOI] [PubMed] [Google Scholar]
- 38.Tononi G, Cirelli C. Sleep and the price of plasticity: from synaptic and cellular homeostasis to memory consolidation and integration. Neuron. 2014;81:12–34. doi: 10.1016/j.neuron.2013.12.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Marshall L, Born J. The contribution of sleep to hippocampus-dependent memory consolidation. Trends Cogn Sci. 2007;11:442–50. doi: 10.1016/j.tics.2007.09.001. [DOI] [PubMed] [Google Scholar]
- 40.Pavlides C, Winson J. Influences of hippocampal place cell firing in the awake state on the activity of these cells during subsequent sleep episodes. J Neurosci. 1989;9:2907–18. doi: 10.1523/JNEUROSCI.09-08-02907.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Wilson MA, McNaughton BL. Reactivation of hippocampal ensemble memories during sleep. Science. 1994;265:676–9. doi: 10.1126/science.8036517. [DOI] [PubMed] [Google Scholar]
- 42.Rasch B, Buchel C, Gais S, Born J. Odor cues during slow-wave sleep prompt declarative memory consolidation. Science. 2007;315:1426–9. doi: 10.1126/science.1138581. [DOI] [PubMed] [Google Scholar]
- 43.Jadhav SP, Kemere C, German PW, Frank LM. Awake hippocampal sharp-wave ripples support spatial memory. Science. 2012;336:1454–8. doi: 10.1126/science.1217230. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Vyazovskiy V, Borbely AA, Tobler I. Unilateral vibrissae stimulation during waking induces interhemispheric EEG asymmetry during subsequent sleep in the rat. J Sleep Res. 2000;9:367–71. doi: 10.1046/j.1365-2869.2000.00230.x. [DOI] [PubMed] [Google Scholar]
- 45.Hill DN, Mehta SB, Kleinfeld D. Quality metrics to accompany spike sorting of extracellular signals. J Neurosci. 2011;31:8699–705. doi: 10.1523/JNEUROSCI.0971-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Fee MS, Mitra PP, Kleinfeld D. Automatic sorting of multiple unit neuronal signals in the presence of anisotropic and non-Gaussian variability. J Neurosci Methods. 1996;69:175–88. doi: 10.1016/S0165-0270(96)00050-7. [DOI] [PubMed] [Google Scholar]
- 47.Bjorvatn B, Fagerland S, Ursin R. EEG power densities (0.5–20 Hz) in different sleep-wake stages in rats. Physiol Behav. 1998;63:413–7. doi: 10.1016/s0031-9384(97)00460-5. [DOI] [PubMed] [Google Scholar]
- 48.Phillips KG, et al. Decoupling of sleep-dependent cortical and hippocampal interactions in a neurodevelopmental model of schizophrenia. Neuron. 2012;76:526–33. doi: 10.1016/j.neuron.2012.09.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Pack AI, et al. Novel method for high-throughput phenotyping of sleep in mice. Physiol Genomics. 2007;28:232–8. doi: 10.1152/physiolgenomics.00139.2006. [DOI] [PubMed] [Google Scholar]
- 50.Mitra P, Bokil H. Observed brain dynamics. Oxford University Press, Oxford; New York: 2008. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.