Skip to main content
Brain Sciences logoLink to Brain Sciences
. 2026 Jan 28;16(2):141. doi: 10.3390/brainsci16020141

NeuroGator: A Low-Power Gating System for Asynchronous BCI Based on LFP Brain State Estimation

Benyuan He 1,2, Chunxiu Liu 1,2, Zhimei Qi 1,2, Ning Xue 3, Lei Yao 3,*
Editor: Ivan Volosyak
PMCID: PMC12937859  PMID: 41750145

Abstract

The continuous handling of the large amount of raw data generated by implantable brain–computer interface (BCI) devices requires a large amount of hardware resources and is becoming a bottleneck for implantable BCI systems, particularly for power-constrained wireless systems. To overcome this bottleneck, we present NeuroGator, an asynchronous gating system using Local Field Potential (LFP) for the implantable BCI system. Unlike a conventional continuous data decoding approach, NeuroGator uses hierarchical state classification to efficiently allocate hardware resources to reduce the data size before handling or transmission. The proposed NeuroGator operates in two stages: Firstly, a low-power hardware silence detector filters out background noise and non-active signals, effectively reducing the data size by approximately 69.4%. Secondly, a Dual-Resolution Gate Recurrent Unit (GRU) model controls the main data processing procedure on the edge side, using a first-level model to scan low-precision LFP data for potential activity and a second-level model to analyze high-precision LFP data for confirmation of an active state. The experiment shows that NeuroGator reduces overall data throughput by 82% while maintaining an F1-Score of 0.95. This architecture allows the Implantable BCI system to stay in an ultra-low-power state for over 85% of its entire operation period. The proposed NeuroGator has been implemented in an Application-Specific Integrated Circuit (ASIC) with a standard 180 nm Complementary Metal Oxide Semiconductor (CMOS) process, occupying a silicon area of 0.006mm2 and consuming 51 nW power. NeuroGator effectively resolves the resource efficiency dilemma for implantable BCI devices, offering a robust paradigm for next-generation asynchronous implantable BCI systems.

Keywords: implantable brain–computer interface, asynchronous control, low power, Local Field Potentials, Gated Recurrent Unit

1. Introduction

Implantable brain–computer interfaces (BCIs) have demonstrated immense clinical potential for restoring motor function and facilitating neural rehabilitation [1,2]. Recent clinical evidence suggests that fully implantable systems are moving toward long-term human applications [3], yet they remain hindered by a critical “power-thermal” bottleneck. To achieve high-fidelity control, modern implantable BCIs typically record broadband signals at high sampling rates (>20 kHz) to detect and decode action potentials (Spikes) [4,5]. However, this high-throughput processing generates significant heat dissipation. In confined intracranial environments, the power density of implanted electronics is strictly limited, because a local tissue temperature rise exceeding 1 °C can trigger irreversible inflammatory responses and neuronal damage [6]. Specifically, satisfying this 1 °C safety limit typically constrains the maximum power budget for wireless neural implants to approximately 7.7 mW [7]. Consequently, minimizing power consumption is not merely an energy-saving requirement but a fundamental safety prerequisite for thermal management.

Since motor intent is sparse in daily life, maintaining high-power decoders in an “always-on” state is highly inefficient. To resolve this, asynchronous BCI strategies—functioning as “brain switches” to wake the system on demand—have emerged as a pivotal solution [8,9]. The engineering challenge lies in selecting a signal modality that balances precision, power, and stability. Within the last three years, various approaches have been explored. Spike-based switches, such as the 384-channel sorting IC proposed by Chen et al. [10], offer high precision but require μW-level power per channel, posing thermal risks in high-density arrays. Similarly, multi-unit activity (MUA)-based systems [11,12,13] achieve robust state estimation but often necessitate computationally intensive learning phases to identify optimal channels. Furthermore, high-frequency spikes are prone to degradation in chronic environments due to glial scarring and micromotion, rendering wake-up systems dependent on them prone to failure [14,15].

Alternatively, low-frequency signals like Spiking Band Power (SBP) and Local Field Potentials (LFPs) offer superior longevity [1,16,17]. Recent work by Atzeni et al. [18] demonstrated low-power SBP recording, yet LFP remains the most robust modality due to its “signal persistence” [8,17]. As demonstrated by Valencia et al. [8], LFP-based switches can achieve performance comparable to spike-based systems with nW-level power [19,20], as they bypass the power-hungry sampling requirements of spikes. LFPs reflect the synchronized activity of neuronal populations, allowing them to remain decodable even after the loss of individual spike signals due to interface encapsulation [21,22]. However, such existing LFP or SBP schemes still face notable limitations in fully implantable applications. First, on-chip processing of full-precision signals for complex frequency band extraction (e.g., parallel bandpass filtering) imposes significant logic complexity and hardware area overhead, while continuous transmission of these features creates persistent pressure on wireless bandwidth and power [23,24]. Second, conventional “Active/Inactive” binary classification logic struggles to capture the intricate transition states between rest and execution; simple switches relying solely on amplitude or frequency thresholds are susceptible to physiological noise, introducing decision errors during critical transition periods [19,24].

Against this backdrop, we propose NeuroGator, a hierarchical low-power gating system for asynchronous BCIs built upon the stability of LFPs. To achieve this, we aim to resolve the trade-off between energy efficiency and longevity in fully implantable devices. Our design allocates data and computational resources on demand. Rather than relying on complex feature engineering, we propose a cascaded processing flow—a “Shallow State Decision Tree” that extends from the hardware substrate to the algorithmic layer. The architecture thus begins with an ultra-low-power on-chip silence detector acting as a gatekeeper, intercepting the vast majority of invalid background noise. Signals passing this screening are processed by a Dual-Resolution Gate Recurrent Unit Cascade (DualGRU). We demonstrate that a lightweight primary model driven by low-precision data (6-bit, 1 kHz) is sufficient to capture ambiguous “transition states”. Only when a final confirmation of the active state is required does the system trigger the high-precision model (12-bit, 2 kHz). Experimental validation demonstrates that NeuroGator reduces data throughput by 82% compared to full-precision systems, while maintaining an F1-Score of 0.95. This architecture allows the BCI to remain in an ultra-low-power state for over 85% of its operating time. In a 180 nm Complementary Metal Oxide Semiconductor (CMOS) process, the core circuit area is just 0.006mm2. Power consumption is only 51 nW. We believe that NeuroGator provides a robust paradigm for next-generation asynchronous interfaces.

The remainder of this paper is organized as follows: Section 2 details the rationale for using LFPs and the overall NeuroGator architecture. Section 3 presents the experimental setup, results, and discussion, including data preprocessing, feature extraction analysis, and hardware validation. Finally, Section 4 concludes the study and outlines future work.

2. Methods and Materials

2.1. System Overview

The NeuroGator system is partitioned into an implantable front-end and an edge decoding unit. Conceptually, its workflow maps to the logic of a “Shallow State Decision Tree.” Building on this, the design philosophy centers on a hierarchical wake-up mechanism to maximize energy efficiency; however, to ensure that this power-saving approach does not compromise the brain-switch’s reliability, we devised a DualGRU model specifically optimized for single-channel processing. As a result, this ensures effective state classification even under strict power constraints. The complete architectural flow is illustrated in Figure 1.

Figure 1.

Figure 1

System architecture of the NeuroGator asynchronous gating system.

2.2. Feasibility and Advantages of Using Local Field Potentials

Intracortical microelectrode arrays (MEAs) offer a unique window into neural activity across varying spatial scales. Figure 2 illustrates the relative spatial resolution of the alternative signal modalities available for intracortical recording, with the center representing the location of the electrode tip [15]. When an electrode tip sits within a tight 50 μm to 100 μm radius of a neuron, we can successfully isolate single-unit (SUA) or MUA. However, these high-frequency components (>500 Hz) are notoriously sensitive to the immediate recording environment. Local Field Potentials (LFPs) present a contrasting profile. Representing the superposition of transmembrane currents over a range of hundreds of micrometers, LFPs undergo low-pass filtering by the extracellular medium itself. This physically integrates synaptic inputs and sub-threshold oscillations from a local population, offering a much more robust spatial average than the fragile isolation of single spikes. For any system intended for long-term clinical service, signal stability—rather than just instantaneous decoding precision—becomes the defining metric. That emphasis on stability is crucial, as we know that maintaining stable SUA in chronic recordings is an uphill battle; between micromotion and the body’s inevitable tissue response, tracking the exact same neuron over days or weeks is rarely reliable [5,14]. Even more concerning is the irreversible decline in detectable single-neuron yields as implantation time drags on [25,26]. While some researchers turn to Threshold Crossing (TC)-based MUA as a more stable alternative to isolated units, this approach remains an imperfect solution. TC still fundamentally relies on high-frequency components and remains vulnerable to the same signal-to-noise ratio attenuation. In contrast with the ephemeral nature of spikes, LFPs demonstrate remarkable robustness in chronic settings. For example, Flint et al. have clearly shown that LFPs offer superior day-to-day stability compared to threshold crossings [27]. Most notably, the feature known as “signal persistence” is clinically significant: even when glial scarring completely chokes off spike signals—resulting in a total loss of Threshold Crossings—the LFP signal often survives, retaining effective neural decoding capabilities [21,28]. This unique capability to function even after electrode interface degradation positions LFPs as a highly viable candidate for constructing a truly “permanent” wake-up system. Furthermore, from an engineering perspective, shifting to LFPs significantly simplifies the design constraints. Specifically, since LFPs’ sampling and processing rates are a fraction of what is required for spike detection, we can bypass the power-hungry computational overhead of complex spike classification algorithms [10,11]. This, in turn, naturally extends the lifespan of the implanted device [29,30]. Additionally, by relying on LFPs, we might reduce our dependence on deep-penetrating electrodes, potentially mitigating tissue scarring in the first place [29,30].

Figure 2.

Figure 2

The different signal modalities of intracortical recording electrodes.

In summary, spikes provide high-resolution data on individual neuronal firing but are highly sensitive to signal attenuation from glial scarring and electrode micromotion. In contrast, low-frequency LFPs reflect a broader spatial scale and show greater resilience to biological responses after chronic implantation. Information requirements for asynchronous gating differ from those for high-degree-of-freedom kinematic reconstruction. While kinematic reconstruction requires precise spike timing or high-frequency resolution for decoding complex movements, temporal-state detection emphasizes macroscopic population dynamics. Consequently, this task imposes lower demands on fine-grained signal resolution, making LFPs a more suitable and reliable modality for detection in this application. Additionally, using LFPs reduces reliance on deep-penetrating electrodes and greatly lowers the requirements for sampling rates and data transmission bandwidth. Because of their broad spatial integration, robustness, and energy efficiency, LFPs are a pragmatic choice for achieving long-term, stable, low-power, asynchronous brain–computer interface systems.

2.3. Shallow State Tree

To balance energy efficiency with decoding accuracy, we discretized continuous neural temporal states into four categories using hierarchical logic. This forms a “Shallow State Tree” that directly maps the NeuroGator decision process. Action states reflected by LFPs are broadly divided into Resting and Active. Resting states are further refined into Distinguishable Resting (S0) and Indistinguishable Resting (S1). S0 marks periods when the subject is fully relaxed, and LFP amplitudes are close to background thermal noise. This state occupies most of the recording. S1 covers times when, with no motor intent, high-amplitude interference arises from EMG artifacts, blinking, or other neural fluctuations. These signals physically exceed hardware thresholds. However, they lack the temporal structure of motor preparation. While they may “false wake” the primary system, GRU-1 ultimately rejects them. Conversely, Active states include the Transition State (S2) and Fully Active State (S3). S2 is the dynamic process in a fixed window before the movement trigger point (Switch Release). It represents the climb from resting baseline to active burst. The next state, S3, is the stable phase of movement execution and contains complex information. Here, the system enters full-precision mode. GRU-2 uses high-frequency, high-bit-width signals to achieve clear brain-state discrimination.

2.4. Level 1: On-Chip Silence Detector

In practical BCI scenarios, neural signals exhibit high temporal sparsity. Valid command windows typically occupy less than 30% of the total operation time, leaving the system in a non-task—or resting—state for over 70% of the day [31]. During these intervals, LFP signals are effectively indistinguishable from low-amplitude background thermal noise. To achieve nanowatt-level standby power, we designed a minimalist detection logic based on the Mean Absolute Value (MAV) of the LFP signal as the system’s first gate. Unlike power calculations requiring energy-intensive multipliers, calculating absolute values requires only simple addition and logic operations, making it ideal for ultra-low-power on-chip implementation independent of external processors. We define the signal strength E within a short window Tw, as shown in Equation (1):

E[n]=k=nTwn|xvc[k]| (1)

To ensure high sensitivity for system wake-up, the hardware noise threshold, Vth, was determined by analyzing the probability density function of the signal strength during the Active state S3. Instead of defining Vth based on the noise floor of the Resting distribution, Vth was calculated as the 3% quantile of the Active signal distribution. As shown in Figure 3, this approach ensures that 97% of valid neural activity exceeds the threshold to trigger the system, emphasizing a high recall rate for motor intent over noise rejection.

Figure 3.

Figure 3

Signal amplitude distributions across four brain states (S0,S1,S2,S3) and the determination of the noise threshold (Vth).

Operating on this threshold, the system enforces a strict gating protocol. When the windowed MAV<Vth, the sample is classified as Distinct Resting (S0); since this interval is dominated by background noise, the system determines that no processing is required. Consequently, the back-end wireless transmitter (Tx) and neural codec are completely shut down, leaving only the silence detector active to minimize power draw. Conversely, when MAVVth, the sample is flagged as S1 or higher—indicating either potential neural activity or ambiguous noise—which immediately wakes the transmitter to offload the data to the next stage for processing.

2.5. Level 2: DualGRU Model

Once signals pass the silence detector, they are transmitted to the edge for precise brain state identification. Selecting the appropriate algorithm is critical for efficient LFP decoding. Although deep learning applications in BCIs have surged [32], and some studies suggest that LFP decoding relies on long-term temporal dynamics [33], practical implementation requires evaluating mainstream RNN variants [34] for LFP time-series processing. These include Long Short-Term Memory (LSTM) [35], Quasi-Recurrent Neural Networks (QRNNs) [36], and Gated Recurrent Units (GRUs) [8]. Standard RNNs, while simple, suffer from vanishing or exploding gradients in long sequences. LSTM addresses long-range dependencies with three gates, but increases parameter count due to its complexity. QRNNs replace direct state propagation with convolutions, allowing speed but sometimes missing complex temporal features. In contrast, GRUs reduce parameter count and simplify computation while preserving decoding accuracy. Based on this comparison, we selected a GRU as the core classifier.

Building on this, we propose a Dual-Resolution Architecture. In the first stage, GRU1 uses 6 bit, 1 kHz LFP signals as input, cutting wireless transmission load by 50% compared to 12 bit, 2 kHz signals. GRU1 classifies Rest vs. Non-Rest, with a focus on high recall for Active states. We prefer false alarms (classifying S1 as S2) to avoid missing Active states and maintain responsiveness. If GRU1 detects a “Potential Active” state (S2 or S3), the system switches to high-precision mode. The second stage, GRU2, processes 12 bit, 2 kHz LFP signals to retain fine temporal details and performs high-precision binary classification, filtering GRU1’s false positives to confirm true Active states. The gate is triggered only when GRU2 confirms an Active status, waking the high-power back-end decoder.

3. Results and Discussion

3.1. Dataset and Experimental Setup

We validated our algorithms using the publicly available rhesus macaque motor cortex dataset [37]. This dataset records neural signals during a spontaneous reach-to-grasp task. It is widely used for benchmarking LFP algorithms [8]. The primary objective of our experimental validation is to verify whether NeuroGator can accurately identify the LFP Active state and maintain ultra-low power consumption by filtering out invalid background noise. To support this, we designed a four-class label generation logic. This maps the experimental task phases to discrete brain states. The actual reach-to-grasp movement, from the movement trigger point until the end of the trial, is labeled as the Fully Active State (S3). The period when the monkey is ’waiting’ and ’preparing’ to execute the task is marked as the Transition State (S2). Specifically, this is defined as the four consecutive time windows tracing back from the movement preparation point. This division is sufficient to cover the dynamic transition signal from rest to active burst. All remaining non-task intervals are classified as Resting states. These are further divided into Easy Rest (S0) and Hard Rest (S1), according to the silence detector threshold logic.

The original signals were sampled at 30 kHz to preserve spike details. To reduce edge computing and transmission bandwidth requirements, we downsampled signals to fs=2kHz. This preserves the Delta, Theta, Alpha, Beta, and High-Gamma bands—rich in motor intent information—while drastically cutting data throughput. Furthermore, to enhance robustness in interference-prone environments, we selected N=4 spatially adjacent physical channels on-chip to compute their arithmetic mean, synthesizing a high-quality “Virtual Channel” xvc(t):

xvc(t)=1Ni=1Nxi(t) (2)

The reason for using this simple averaging rather than complex beamforming lies in the physical characteristics of microelectrode arrays. Since the electrode spacing is typically small, low-frequency LFP signals (<500 Hz) recorded by adjacent channels have high spatial correlation, and the phase difference is negligible, eliminating the need for high-power phase alignment algorithms. Therefore, simple arithmetic averaging is sufficient to enhance effective neural signal components while effectively canceling out uncorrelated thermal noise and random interference. To train the hierarchical wake-up model, we designed a sample construction process based on a sliding window. The model input is a historical field of view with a length of 512 ms, and the sliding step is set to 16 ms, meaning that the system outputs a decision result every 16 ms. Using the “Switch Release” event in the experimental records as the physical reference point trelease for movement occurrence, we trace back from trelease by Tpre to define the start time of movement preparation.

To comprehensively evaluate the system’s performance on imbalanced datasets, we use the F1-Score as the core metric, which comprehensively considers the model’s precision and recall [8]. All reported performance metrics are presented as mean ± standard deviation (SD) calculated across the five folds. The SD is calculated using the following formula:

SD=i=1n(xix¯)2n1 (3)

where n=5xi represents the performance metric of the i-th fold, and x¯ denotes the mean value across all folds. Specifically, in the GRU-1 low-power stage, we focus on the recall of the Transition state (S2) and Active state, as this metric directly determines the timeliness of the BCI system response and user experience; any missed detection of the Transition state will lead to system wake-up delay. The experiment uses five-fold cross-validation to ensure the reliability of the results. Model training is performed via the Adam optimizer, with the initial learning rate set to 3×104, training epochs set to 300, and coupled with an Early Stopping mechanism to prevent overfitting.

3.2. Comparison and Selection of On-Chip Feature Extraction Methods

To evaluate the efficacy of the proposed MAV method for LFP state gating, we compared it against three robust feature extraction techniques adopted in neural signal processing: Median Absolute Deviation (MAD) [38], Winsorization Average (WA) [39], and Non-linear Energy Operator (NEO) [40]. As shown in Figure 4, we uniformly constrained the Active Miss Rate to 3% to prioritize system responsiveness. Under this premise, the background noise interception capabilities exhibited distinct trade-offs. The MAD method achieved the highest intercept rate of 71.6%, followed by our MAV method at 69.4%  and WA at 69.1%. The NEO method showed a lower interception rate of 65.2%. While MAD provides the most robust statistical separation, its on-chip implementation necessitates sorting operations on large data arrays, which is computationally prohibitive for ultra-low-power implantable devices. Similarly, the WA method requires additional hardware resources, including comparators for clipping logic and multipliers for scaling coefficients. In contrast, the MAV method achieves comparable interception performance with only simple absolute-value and addition logic. This minimalist architecture enables the hardware layer to intercept 69.4% of invalid data, making it the most hardware-efficient choice for the NeuroGator gating system.

Figure 4.

Figure 4

Feature distribution and noise interception performance of different on-chip extraction methods with a fixed 3% active miss rate.

3.3. Impact of Channel Count on Model Performance

We evaluated the impact of spatial information on LFP active state recognition by comparing GRU model training and performance across input channels of 1, 2, 4, 8, 16, and 32. This allowed us to test whether full-channel input is necessary. The experimental results are shown in Figure 5. The data shows that the accuracy improvement brought by increasing the number of channels exhibits significant diminishing returns. Although the convergence speed of the one-channel input lagged behind that of high-channel configurations—possibly due to the difference in information throughput caused by the reduction in input data dimensions within a single epoch—the final F1-Score differed from the 32-channel full input result by only 0.37%. This result strongly implies that the effective feature information retained in single-channel data is highly consistent with full-channel data; LFP signals have high spatial redundancy, and a few key channels are sufficient to capture macroscopic motor intentions. This conclusion aligns with the findings of Bullard et al. regarding the high correlation between LFP band power and population firing rates [30], while avoiding complex channel selection algorithms [11].

Figure 5.

Figure 5

Impact of input channel count on model performance: F1-Score learning curves.

3.4. Performance–Efficiency Trade-Off Analysis

We investigated the trade-off between LFP active state recognition performance and power consumption for GRU1 in the DualGRU design. A grid search compared model performance across combinations of sampling rates and quantization bit widths. The experimental results are shown in Figure 6. The results reveal that model performance exhibits a clear parameter sensitivity distribution, with the sampling rate being the primary factor affecting accuracy. When the sampling rate is maintained at 1000 Hz and above, the F1-Score generally remains above 0.92, whereas when the sampling rate drops to 500 Hz, the highest F1-Score drops significantly to 0.875, indicating that 1000 Hz is the critical frequency for ensuring signal feature integrity. We analyzed how sampling rate and bit-depth affect the model’s performance. As shown in Figure 6, the model remains robust when the resolution is reduced from 12 bit to 6 bit. However, performance drops significantly when reduced to 4 bit. Based on this, we selected 1 kHz and 6 bit as the optimal configuration. Although this configuration has an accuracy loss of about 1.9% compared to the highest configuration, it achieves a data transmission reduction of up to 75%, and an F1-Score of 0.9225 is sufficient to meet the performance requirements of the GRU1 stage.

Figure 6.

Figure 6

F1-Score heatmap of the GRU model under varying sampling rates and bit-depth configurations.

Furthermore, we performed a Precision–Recall trade-off analysis for LFP active state recognition to validate the model’s performance and energy-efficiency advantages under the optimized configuration of 1 kHz and 6 bit (Figure 7). The P-R curve demonstrates that the model can maintain excellent precision in the high Active Recall region. Considering the zero-tolerance characteristic of the wake-up system for missed reports, we adjusted the operating point to a 97% Active target recall rate. Under this threshold, the model can still maintain an F1-Score of 0.91, only slightly lower than the default high-precision configuration. This result strongly supports the feasibility of using low-precision signals as wake-up inputs, meaning that 6 bit coarse-grained LFP waveforms are sufficient to carry macroscopic motor intentions.

Figure 7.

Figure 7

Precision–Recall curve of the GRU1 model using the optimized configuration (1000 Hz/6-bit).

The hardware silence detector first intercepts 69.4% of the total recording time, which represents the S0 state. Of the remaining 30.6%  of the data, the system spends 55.0% of its duration in the transition or interference states, namely S1 and S2, where the low-resolution 1 kHz/6-bit configuration is utilized. This leads to an ultra-low-power operating state occupancy defined by the sum of 69.4% and the product of 55% and 30.6%, totaling 86.2%, which is conservatively reported as over 85% of the total operation time. Consequently, the overall reduction in data transmission volume is estimated at 82%, calculated as the summation of the 69.4% hardware interception and the additional algorithmic optimization of 75%×55%×30.6%=12.62%. These efficiencies are fundamental for maintaining thermal safety and extending the battery life of fully implantable BCI devices.

3.5. Model Comparison Analysis

We demonstrate the advantages of the DualGRU architecture for LFP active state recognition and justify using a GRU as the core temporal processing unit by benchmarking our dual-resolution cascaded model against LSTM, QRNN, Convolutional Neural Network (CNN)-LSTM [41], Temporal Convolutional Networks (TCNs) [42], and a single-stage GRU under the same experimental constraints. The detailed comparative experimental results are shown in Figure 8. We compared the parameter count, F1-Score, Recall, Precision, and Area Under the Curve (AUC) metrics of each model. Experimental data show that the single-stage GRU model achieved the highest F1-Score of 0.9603 and AUC of 0.9979. Other architectures exhibited comparatively lower performance: the LSTM model reached an F1-Score of 0.9222, while the CNN-LSTM and QRNN models followed with scores of 0.8914 and 0.8506, respectively. The TCN model underperformed significantly, yielding the lowest F1-Score of 0.6347. This disparity is likely due to the TCN’s limited capacity to capture the specific long-term temporal dependencies inherent in LFP signals when compared to recurrent architectures. This significant performance gap proves that the GRU has extremely high data compression efficiency and feature extraction capability when processing LFP sequences, validating our choice of it as the basic building block.

Figure 8.

Figure 8

Comprehensive evaluation of model performance and training convergence analysis.

Although the DualGRU architecture uses low-precision data input in the primary detection stage to significantly reduce front-end wireless transmission power, sacrificing some data precision, thanks to the robust design of the dual-resolution cascaded architecture, its final decoding performance did not decline significantly compared to the full-precision single-stage GRU. Experimental data show that both the single-stage GRU (0.9603±0.012) and the DualGRU (0.9545±0.015) exhibit high statistical stability with minimal fluctuations across cross-validation folds. Compared to the single-stage model, the DualGRU incurs a slight F1-Score loss of less than 0.6%, but it remains superior to other models.

As illustrated in Figure 8b, the architecture demonstrates sufficiently robust generalization capabilities. The integration of the early stopping mechanism, combined with a sufficient number of training epochs, prevents the training process from stagnating near the transient performance dips caused by random noise data, thereby ensuring stable and accurate convergence during model training. Combined with Figure 8b, this architecture successfully demonstrates the feasibility of sacrificing a small amount of non-critical accuracy for significant system-level energy efficiency improvements, representing an optimal solution for low-power scenarios.

3.6. System Verification and Comparison

To evaluate the practical efficacy of the NeuroGator system, we implemented the hardware architecture on a Xilinx Zynq 7010 Field-Programmable Gate Array (FPGA) and conducted comprehensive system-level testing. The front-end silence detection core utilizes a hardware logic implementation based on Mean Absolute Value (MAV), comprising a four-channel multiplexer, an accumulator, and a comparator, as detailed in Figure 9. Real-time waveforms captured with a logic analyzer after deployment of the complete processing link are shown in Figure 10. The system demonstrated robust adaptive performance, effectively tracking signal fluctuations and suppressing data transmission during resting intervals. By employing a hierarchical wake-up mechanism—transitioning from coarse-grained detection to fine-grained decoding—the system achieved real-time gating of neural signals with a classification accuracy of 95.7% and a total response latency of less than 22 ms.

Figure 9.

Figure 9

Hardware schematic of the MAV-based silence detector core.

Figure 10.

Figure 10

FPGA hardware-in-the-loop validation results and waveform analysis. (a) Hardware logic overview on Zynq 7010. (b) Input signal and MAV-based gating effect showing 78.3% data reduction. (c) System output verification of hierarchical GRU triggering. (d) Key performance statistics.

To assess the feasibility of integrating this architecture into fully implantable systems, we designed an ASIC in a standard 180 nm CMOS process and evaluated its area and power characteristics. Following this, the physical implementation, including placement and routing, was performed using Cadence Innovus. For accurate power estimation, we adhered to standard methodologies from prior studies [8,11], conducting post-simulations on the post-routed netlist. By using a test subset of the dataset as the input stimulus, we captured realistic circuit-switching activity and dynamic power transitions. The design layout and the hardware schematic of the silence detector core are illustrated in Figure 11 and Figure 9, respectively.

Figure 11.

Figure 11

Layout of the proposed NeuroGator feature extraction ASIC.

The exceptional energy efficiency of NeuroGator is fundamentally rooted in the physical characteristics of the LFP modality. The inherent low temporal resolution of LFP signals relative to action potentials allows for lower sampling rates and narrower bandwidths. This characteristic directly impacts system efficiency: the power consumption of a Low-Noise Amplifier (LNA) is directly proportional to its target bandwidth [29], thus minimizing LNA requirements for LFPs. Similarly, power consumption in a conventional successive approximation register Analog-to-Digital Converter (ADC) scales linearly with the sampling rate [43], making the LFP modality inherently more energy-efficient. By integrating on-chip silence detection and hierarchical quantization, NeuroGator reduces the per-channel data rate from 24 kbps to below 4.8 kbps, significantly decreasing the energy budget for wireless data transmission.

Compared to prior research utilizing MUA features [11,44], which generally necessitate a minimum of eight channels and high sampling rates (∼10 kS/s), resulting in a power consumption of 7.4 μW per channel [45], NeuroGator achieves significant savings. While SBP-based methods [18,46] employ multi-frequency analysis, they still incur higher acquisition costs compared to our cascaded LFP architecture. By operating at 80 kHz, NeuroGator ensures a sufficient computational margin for 2 kHz LFP signals while further optimizing performance by avoiding the high dynamic power of megahertz-range clocks.

Thermal safety requires limiting temperature rise. To prevent brain tissue damage, the Food and Drug Administration (FDA) restricts local temperature increases to 1 °C. For wireless implants, this sets a maximum power budget of approximately 7.7 mW [7,47]. The implant-side of NeuroGator operates at a mere 51 nW, providing substantial power and thermal dissipation headroom for other implantable BCI decoding functions within this safety constraint. As summarized in Table 1, the core feature-extraction logic consumes only 51 nW per channel and occupies a silicon area of 0.006mm2.

Table 1.

Performance and architectural comparison of asynchronous BCI gating systems.

Work Ours [8] [11] [18] [46] [44]
Signal modality LFP LFP-BP MUA SBP SBP MUA
Process On Chip MAV Bandpower RMS Integration Peak Detect NEO + RMS
State Estimate DualGRU GRU TCN - SSKF GRU
Clock Freq (kHz) 80 - 400 500 80 800
Technology (nm) 180 180 180 180 180 180
Supply voltage (V) 1.8 1.8 1.8 1.55 0.625 1.8
Area per channel (mm2) 0.006 0.09 0.03 0.07 - 0.02
Power per channel (nW) 51 91.87 630 212 3660 4900

TCN: Temporal Convolutional Network; SSKF: steady-state Kalman filter.

4. Conclusions

This paper proposes and validates NeuroGator, a low-power gating system using LFP-based brain-state estimation. NeuroGator addresses power-consumption challenges in implantable brain–computer interfaces. The system replaces the high-power always-on operation typical of asynchronous BCIs. It uses a multi-level gating architecture with on-chip silence detection, transition recognition, and active state confirmation. The hardware intercepts 69.4% of invalid data. On the edge side, DualGRU collaboration further reduces the portion of data transmission, accounting for 12.6% of the total. The system cuts overall power consumption by 82%, while maintaining s brain-state estimation accuracy of above 95%. Implemented in a 180 nm CMOS process, the hardware core uses only 51 nW and occupies 0.006 mm2. This makes NeuroGator one of the most advanced asynchronous brain–computer switches. While the MAV-based detector offers extreme power efficiency, its reliance on amplitude thresholds makes it sensitive to certain low-frequency artifacts. For example, EMG interference during vigorous movements presents a challenge for purely threshold-based gating. In addition, we recognize that our current validation relies on a specific reach-to-grasp task. The LFP population dynamics in this dataset demonstrate robust gating performance. However, the system’s generalizability across diverse motor paradigms remains a limitation that needs further investigation. Hardware optimization also remains a key objective. Integrating the MAV logic directly into the AFE could potentially eliminate unnecessary ADC sampling power during resting states. This would provide a path toward even greater energy efficiency. Taken together, this work provides an important theoretical basis and technical paradigm for developing the next generation of implantable neural interfaces, characterized by long battery life, miniaturization, and high intelligence. Building on these advancements, future work will further explore the deep fusion design of ultra-low-power circuits and algorithms to enhance robustness against physiological noise and diverse task requirements.

Acknowledgments

We thank the members of the Lingang Laboratory for their support.

Author Contributions

B.H.: Investigation, Methodology, Software, Hardware, Writing—original draft. C.L.: Validation, Writing—review editing. Z.Q.: Validation, Writing—review editing. N.X.: Conceptualization, Funding acquisition, Supervision, Writing—review editing. L.Y.: Conceptualization, Investigation, Funding acquisition, Supervision, Writing—review editing. All authors have read and agreed to the published version of the manuscript.

Data Availability Statement

The benchmark dataset analyzed in this study is publicly available on G-Node at https://doi.org/10.12751/g-node.f83565 [37]. Other relevant data are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Funding Statement

This work was supported in part by Grant LG-GG-202402-05, “High-Performance Neural Signal Acquisition IC Chip Development”.

Footnotes

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

References

  • 1.Zhang H., Jiao L., Yang S., Li H., Jiang X., Feng J., Zou S., Xu Q., Gu J., Wang X., et al. Brain–computer interfaces: The innovative key to unlocking neurological conditions. Int. J. Surg. 2024;110:5745–5762. doi: 10.1097/JS9.0000000000002022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Patrick-Krueger K.M., Burkhart I., Contreras-Vidal J.L. The state of clinical trials of implantable brain–computer interfaces. Nat. Rev. Bioeng. 2025;3:50–67. doi: 10.1038/s44222-024-00239-5. [DOI] [Google Scholar]
  • 3.Schalk G., Brunner P., Allison B.Z., Soekadar S.R., Guan C., Denison T., Rickert J., Miller K.J. Translation of neurotechnologies. Nat. Rev. Bioeng. 2024;2:637–652. doi: 10.1038/s44222-024-00185-2. [DOI] [Google Scholar]
  • 4.Zhao Z.-P., Nie C., Jiang C.-T., Cao S.-H., Tian K.-X., Yu S., Gu J.-W. Modulating brain activity with invasive brain–computer interface: A narrative review. Brain Sci. 2023;13:134. doi: 10.3390/brainsci13010134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Wang R., Xu Y., Zhang Y., Hu X., Li Y., Zhang S. A fast and effective spike sorting method based on multi-frequency composite waveform shapes. Brain Sci. 2023;13:1156. doi: 10.3390/brainsci13081156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Otte E., Vlachos A., Asplund M. Engineering strategies towards overcoming bleeding and glial scar formation around neural probes. Cell Tissue Res. 2022;387:461–477. doi: 10.1007/s00441-021-03567-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Valencia D., Mercier P.P., Alimohammad A. Efficient in vivo neural signal compression using an autoencoder-based neural network. IEEE Trans. Biomed. Circuits Syst. 2024;18:691–701. doi: 10.1109/TBCAS.2024.3359994. [DOI] [PubMed] [Google Scholar]
  • 8.Valencia D., Mercier P.P., Alimohammad A. An efficient brain-switch for asynchronous brain-computer interfaces. IEEE Trans. Biomed. Circuits Syst. 2025;19:130–141. doi: 10.1109/TBCAS.2024.3396115. [DOI] [PubMed] [Google Scholar]
  • 9.Han C.-H., Müller K.-R., Hwang H.-J. Brain-switches for asynchronous brain-computer interfaces: A systematic review. Electronics. 2020;9:422. doi: 10.3390/electronics9030422. [DOI] [Google Scholar]
  • 10.Chen Y., Tacca B., Chen Y., Biswas D., Gielen G., Catthoor F., Verhelst M., Mora Lopez C. A 384-channel online-spike-sorting IC using unsupervised Geo-OSort clustering and achieving 0.0013 mm2/ch and 1.78 μW/ch; Proceedings of the 2023 IEEE International Solid-State Circuits Conference (ISSCC); San Francisco, CA, USA. 19–23 February 2023; pp. 486–488. [Google Scholar]
  • 11.Valencia D., Mercier P.P., Alimohammad A. Power-efficient in vivo brain-machine interfaces via brain-state estimation. J. Neural Eng. 2023;20:016032. doi: 10.1088/1741-2552/acb385. [DOI] [PubMed] [Google Scholar]
  • 12.Yang X., Xiong X., Li X., Lian Q., Zhu J., Zhang J., Qi Y., Wang Y. Reconstructing multi-stroke characters from brain signals toward generalizable handwriting brain-computer interfaces. IEEE Trans. Neural Syst. Rehabil. Eng. 2024;32:1481–1491. doi: 10.1109/TNSRE.2024.3492191. [DOI] [PubMed] [Google Scholar]
  • 13.Le L., Papale P., Seeliger K., Lozano A., Dado T., Wang F., Roelfsema P., van Gerven M.A.J., Güçlütürk Y., Güçlü U. MonkeySee: Space-time-resolved reconstructions of natural images from macaque multi-unit activity. Adv. Neural Inf. Process. Syst. 2024;37:93826–93848. [Google Scholar]
  • 14.Perge J.A., Homer M.L., Malik W.Q., Cash S., Eskandar E., Friehs G., Donoghue J.P., Hochberg L.R. Intra-day signal instabilities affect decoding performance in an intracortical neural interface system. J. Neural Eng. 2013;10:036004. doi: 10.1088/1741-2560/10/3/036004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Saha S., Mamun K.A., Ahmed K., Mostafa R., Naik G.R., Darvishi S., Khandoker A.H., Baumert M. Progress in brain computer interface: Challenges and opportunities. Front. Syst. Neurosci. 2021;15:578875. doi: 10.3389/fnsys.2021.578875. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Li T., Gao Y., Zhou J., Chen Y., Zhang S., Gong X., Liu Y. Advancements in the application of brain-computer interfaces based on different paradigms in amyotrophic lateral sclerosis. Front. Neurosci. 2025;19:1658315. doi: 10.3389/fnins.2025.1658315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Chen B., Lu J., Chen Z., Han X., Sun Y., Lin X., Tang Z., Jia S., Xie G., Huo F., et al. Long-term implantable flexible neural interfaces for electrophysiological monitoring. J. Mater. Chem. C. 2025;13:5951–5980. doi: 10.1039/D4TC04289A. [DOI] [Google Scholar]
  • 18.Atzeni G., Lim J., Liao J., Novello A., Lee J., Moon E., Barrow M., Letner J., Costello J., Nason S.R., et al. A 260 × 274 μm2 572 nW neural recording micromote using near-infrared power transfer and an RF data uplink; Proceedings of the 2022 IEEE Symposium on VLSI Technology and Circuits (VLSI Technology and Circuits); Honolulu, HI, USA. 12–17 June 2022; pp. 64–65. [Google Scholar]
  • 19.Pawlak W.A., Howard N. Neuromorphic algorithms for brain implants: A review. Front. Neurosci. 2025;19:1570104. doi: 10.3389/fnins.2025.1570104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Huang T. Research and design of low power amplifier for brain computer interface; Proceedings of the 2nd International Conference on Mechanics, Electronics Engineering and Automation (ICMEEA 2025); Paris, France. 16–18 May 2025; pp. 959–967. [Google Scholar]
  • 21.Milekovic T., Sarma A.A., Bacher D., Simeral J.D., Saab J., Pandarinath C., Sorice B.L., Blabe C., Oakley E.M., Tringale K.R., et al. Stable long-term BCI-enabled communication in ALS and locked-in syndrome using LFP signals. J. Neurophysiol. 2018;120:343–360. doi: 10.1152/jn.00493.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Xia R., Chen X., Engel T.A., Moore T. Common and distinct neural mechanisms of attention. Trends Cogn. Sci. 2024;28:554–567. doi: 10.1016/j.tics.2024.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Roy S., Azad A.W., Baidya S., Alam M.K., Khan F. Powering solutions for biomedical sensors and implants inside the human body: A comprehensive review on energy harvesting units, energy storage, and wireless power transfer techniques. IEEE Trans. Power Electron. 2022;37:12237–12263. doi: 10.1109/TPEL.2022.3164890. [DOI] [Google Scholar]
  • 24.Shaeri M., Afzal A., Shoaran M. Challenges and opportunities of edge AI for next-generation implantable BMIs; Proceedings of the 2022 IEEE 4th International Conference on Artificial Intelligence Circuits and Systems (AICAS); Incheon, Republic of Korea. 13–15 June 2022; pp. 190–193. [Google Scholar]
  • 25.Zhang S., Song Y., Lv S., Jing L., Wang M., Liu Y., Xu W., Jiao P., Zhang S., Wang M., et al. Electrode arrays for detecting and modulating deep brain neural information in primates: A review. Cyborg Bionic Syst. 2025;6:0249. doi: 10.34133/cbsystems.0249. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Tang X., Shen H., Zhao S., Li N., Liu J. Flexible brain–computer interfaces. Nat. Electron. 2023;6:109–118. doi: 10.1038/s41928-022-00913-9. [DOI] [Google Scholar]
  • 27.Downey J.E., Schwed N., Chase S.M., Schwartz A.B., Collinger J.L. Intracortical recording stability in human brain–computer interface users. J. Neural Eng. 2018;15:046016. doi: 10.1088/1741-2552/aab7a0. [DOI] [PubMed] [Google Scholar]
  • 28.Saeedi S., Chavarriaga R., Millán J.d.R. Long-term stable control of motor-imagery BCI by a locked-in user through adaptive assistance. IEEE Trans. Neural Syst. Rehabil. Eng. 2016;25:380–391. doi: 10.1109/TNSRE.2016.2645681. [DOI] [PubMed] [Google Scholar]
  • 29.Even-Chen N., Muratore D.G., Stavisky S.D., Hochberg L.R., Henderson J.M., Murmann B., Shenoy K.V. Power-saving design opportunities for wireless intracortical brain–computer interfaces. Nat. Biomed. Eng. 2020;4:984–996. doi: 10.1038/s41551-020-0595-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Nason S.R., Vaskov A.K., Willsey M.S., Welle E.J., An H., Vu P.P., Bullard A.J., Nu C.S., Kao J.C., Shenoy K.V., et al. A low-power band of neuronal spiking activity dominated by local single units improves the performance of brain–machine interfaces. Nat. Biomed. Eng. 2020;4:973–983. doi: 10.1038/s41551-020-0591-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Cerqueira E.M.G., de Medeiros R.E., Fiorin F.S., de Arújo e Silva M., Lima R.H., Dantas A.F.O.A., Rodrigues A.C., Delisle-Rodriguez D. Local field potential-based brain-machine interface to inhibit epileptic seizures by spinal cord electrical stimulation. Biomed. Phys. Eng. Express. 2024;11:015016. doi: 10.1088/2057-1976/ad9155. [DOI] [PubMed] [Google Scholar]
  • 32.Alzahab N.A., Apollonio L., Di Iorio A., Alshalak M., Iarlori S., Ferracuti F., Monteriu A., Porcaro C. Hybrid deep learning (hDL)-based brain-computer interface (BCI) systems: A systematic review. Brain Sci. 2021;11:75. doi: 10.3390/brainsci11010075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Gao S., Yang J., Shen T., Jiang W. A parallel feature fusion network combining GRU and CNN for motor imagery EEG decoding. Brain Sci. 2022;12:1233. doi: 10.3390/brainsci12091233. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Mao S., Sejdić E. A review of recurrent neural network-based methods in computational physiology. IEEE Trans. Neural Netw. Learn. Syst. 2022;34:6983–7003. doi: 10.1109/TNNLS.2022.3145365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Dong Y., Wang S., Huang Q., Berg R.W., Li G., He J. Neural decoding for intracortical brain–computer interfaces. Cyborg Bionic Syst. 2023;4:0044. doi: 10.34133/cbsystems.0044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Ahmadi N., Constandinou T.G., Bouganis C.-S. Robust and accurate decoding of hand kinematics from entire spiking activity using deep learning. J. Neural Eng. 2021;18:026011. doi: 10.1088/1741-2552/abde8a. [DOI] [PubMed] [Google Scholar]
  • 37.Brochier T., Zehl L., Hao Y., Duret M., Sprenger J., Denker M., Grün S., Riehle A. Massively parallel recordings in macaque motor cortex during an instructed delayed reach-to-grasp task. Sci. Data. 2018;5:180055. doi: 10.1038/sdata.2018.55. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Yang Y., Mason A.J. Hardware efficient automatic thresholding for NEO-based neural spike detection. IEEE Trans. Biomed. Eng. 2017;64:826–833. doi: 10.1109/TBME.2016.2580319. [DOI] [PubMed] [Google Scholar]
  • 39.Saggese G., Tambaro M., Vallicelli E.A., Strollo A.G.M., Vassanelli S., Baschirotto A., De Matteis M. Comparison of sneo-based neural spike detection algorithms for implantable multi-transistor array biosensors. Electronics. 2021;10:410. doi: 10.3390/electronics10040410. [DOI] [Google Scholar]
  • 40.Mukhopadhyay S., Ray G.C. A new interpretation of nonlinear energy operator and its efficacy in spike detection. IEEE Trans. Biomed. Eng. 1998;45:180–187. doi: 10.1109/10.661266. [DOI] [PubMed] [Google Scholar]
  • 41.Liu S., Wang J., Li S., Cai L. Multi-dimensional hybrid bilinear CNN-LSTM models for epileptic seizure detection and prediction using EEG signals. J. Neural Eng. 2024;21:066045. doi: 10.1088/1741-2552/ada0e5. [DOI] [PubMed] [Google Scholar]
  • 42.Fan J., Zhang K., Huang Y., Zhu Y., Chen B. Parallel spatio-temporal attention-based TCN for multivariate time series prediction. Neural Comput. Appl. 2023;35:13109–13118. doi: 10.1007/s00521-021-05958-z. [DOI] [Google Scholar]
  • 43.Wang H., Wang X., Barfidokht A., Park J., Wang J., Mercier P.P. A battery-powered wireless ion sensing system consuming 5.5 nW of average power. IEEE J. Solid-State Circuits. 2018;53:2043–2053. doi: 10.1109/JSSC.2018.2815657. [DOI] [Google Scholar]
  • 44.Valencia D., Mercier P.P., Alimohammad A. In vivo neural spike detection with adaptive noise estimation. J. Neural Eng. 2022;19:046018. doi: 10.1088/1741-2552/ac8077. [DOI] [PubMed] [Google Scholar]
  • 45.Li J., Liu X., Mao W., Chen T., Yu H. Microelectronic Implants for Central and Peripheral Nervous System: Overview of Circuit and System Technology. Frontiers; Lausanne, Switzerland: 2022. Advances in neural recording and stimulation integrated; pp. 718–731. [Google Scholar]
  • 46.An H., Nason-Tomaszewski S.R., Lim J., Kwon K., Willsey M.S., Patil P.G., Kim H.-S., Sylvester D., Chestek C.A., Blaauw D. A power-efficient brain-machine interface system with a sub-mW feature extraction and decoding ASIC demonstrated in nonhuman primates. IEEE Trans. Biomed. Circuits Syst. 2022;16:395–408. doi: 10.1109/TBCAS.2022.3175926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Zhao Y., Tang L., Rennaker R., Hutchens C., Ibrahim T.S. Studies in RF power communication, SAR, and temperature elevation in wireless implantable neural interfaces. PLoS ONE. 2013;8:e77759. doi: 10.1371/journal.pone.0077759. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The benchmark dataset analyzed in this study is publicly available on G-Node at https://doi.org/10.12751/g-node.f83565 [37]. Other relevant data are available from the corresponding author upon reasonable request.


Articles from Brain Sciences are provided here courtesy of Multidisciplinary Digital Publishing Institute (MDPI)

RESOURCES