Abstract
The living system follows the principles of physics, yet distinctive features, such as adaptability, differentiate it from conventional systems. The cognitive functions of decision-making (DM) and working memory (WM) are crucial for animal adaptation, but the underlying mechanisms are still unclear. To explore the mechanism underlying DM and WM functions, here we applied a general non-equilibrium landscape and flux approach to a biophysically based model that can perform decision-making and working memory functions. Our findings reveal that DM accuracy improved with stronger resting states in the circuit architecture with selective inhibition. However, the robustness of working memory against distractors was weakened. To address this, an additional non-selective input during the delay period of decision-making tasks was proposed as a mechanism to gate distractors with minimal increase in thermodynamic cost. This temporal gating mechanism, combined with the selective-inhibition circuit architecture, supports a dynamical modulation that emphasizes the robustness or flexibility to incoming stimuli in working memory tasks according to the cognitive task demands. Our approach offers a quantitative framework to uncover mechanisms underlying cognitive functions grounded in non-equilibrium physics.
Subject terms: Biophysics; Biological physics; Statistical physics, thermodynamics and nonlinear dynamics; Dynamical systems
Introduction
Living systems, composed of chemical substances and requiring energy, adhere to the principles of physics and chemistry. However, these systems possess unique attributes, such as self-organization, self-replication, and adaptability, that distinguish them from conventional physical systems. Adaptability refers to the ability of a living system to adjust and thrive in response to changing environmental conditions, maintaining critical functions for survival and fitness. The brain, a complex dynamical system, plays a crucial role in animal adaptation1–6.
The fundamental cognitive functions of decision-making (DM) and working memory (WM) are integral to animal adaptation. DM is the process of selecting among alternative courses of action based on available information and goals, enabling animals to balance risks and rewards, avoid threats, and obtain resources. WM allows for the temporary storage and manipulation of information, aiding in DM. From an evolutionary perspective, WM and DM are thought to have evolved as adaptations to improve the survival of organisms in their environments7,8. Natural selection has shaped the neural circuitry that underlies these cognitive functions over time. Despite significant advances in the field9–11, the mechanisms related to working memory and cognitive DM remain elusive.
DM is associated with the ramping activity of neurons in the parietal, prefrontal, or premotor cortex, reflecting the accumulation of evidence3,12–15. The WM function is associated with stimulus-selective persistent activity, a sustained neural response specific to a particular stimulus that persists beyond its presentation, commonly observed in the same cortical areas3,13,15–17. Neurophysiological studies have shown that neural circuits exhibiting persistent activity are likely capable of integrating stimuli and making categorical decisions3,15. Moreover, DM and WM may share similar circuit mechanisms.
Traditional models of DM, such as the drift-diffusion model (DDM), conceptualize the DM process as a linear accumulation of evidence towards predefined decision boundaries, characterized by parameters like drift rate, boundary separation, and starting point18,19. While effective, these models often abstract away the underlying neural dynamics, limiting their capacity to capture the complexities of neural interactions during DM and WM tasks. In contrast, recent theoretical studies show that the characteristic neural activity of DM and WM can be successfully described by an attractor network framework3,15,20,21, which offers a nonlinear, network-based approach to modeling DM processes. This framework uses the concept of attractor states, which are stable patterns of neural activity and represent different decision outcomes. By modeling neural populations and their dynamic interactions, the attractor framework provides a more biologically plausible and comprehensive depiction of the neural mechanisms underpinning DM and WM.
In this framework, attractor networks contain two or more populations with self-excitation and are selective to different stimuli. These excitatory populations interact with each other through a common pool of inhibitory neurons3,15,20,21. The neural circuits with such self-excitation and mutual inhibition architecture can generate the stimulus-selective persistent activity pattern for WM16,17,22, and also ramping, categorical, and winner-take-all dynamics for perceptual DM15,23. Inhibitory neurons are specialized cells that release neurotransmitters to decrease the activity of other neurons, serving as a critical component in the regulation of the dynamics of neural circuits. They are usually provided by a single pool of non-selective neurons lacking connection specificity3,15,20,21. However, recent studies have found that both excitatory and inhibitory neurons in decision circuits are selective, suggesting the existence of functional subnetworks within inhibitory populations, similar to excitatory populations5,24. These findings prompt inquiries into why, out of all potential circuit architectures capable of supporting DM, a distinct architecture characterized by these precise attributes and configurations, comprising functionally specialized subnetworks, emerges through learning. Furthermore, it raises questions about whether this architecture could improve the accuracy of DM and the resilience of working memory representations in the presence of distractions. If the selective-inhibition architecture cannot protect memory from interference during DM, are there additional mechanisms, apart from the static organization of the circuit, such as temporal gating inputs, that can further modulate cognitive function?
To address these questions, we investigated the mechanisms underlying the DM and WM functions using the attractor network framework. This approach can reproduce psychophysical and physiological results in perceptual DM and working memory tasks13,25 and provide a natural candidate mechanism for persistent neural activity and winner-take-all competition leading to a categorical choice. However, noise is ubiquitous in the brain, introducing stochastic transitions between different perceptual or physiological states. To comprehensively understand this system, it is imperative to quantify the underlying attractor landscapes, which fundamentally determine the behavior of the system. Unlike previous studies that have primarily relied on qualitative illustrations of attractor landscapes3,15,20, we emphasize the quantitative assessment of landscape features such as basin depths and barrier heights. By doing so, we can relate these quantified landscape characteristics to observable behavioral metrics, including decision time and transition rates in various cognitive tasks. This quantitative approach, inspired by thermodynamics and statistical mechanics, allows for more precise predictions and a deeper understanding of the neural mechanisms governing DM and WM.
Inspired by thermodynamics and statistical mechanics, physicists have introduced concepts such as “internal energy”, “free energy”, and “entropy” to describe the collective properties of complex systems like neural circuits and proteins1,26,27. Hopfield explored the computational properties of neural circuits, for example, memory storage and retrieval, by constructing an “energy function” (a representation of the stability and dynamics of system states)1,26. However, this approach only works for symmetrical neural circuits and fails for more realistic asymmetrical connections.
To meet this challenge, we have developed a potential landscape and flux framework to explore the dynamics of neural circuits involved in DM and WM21,28,29. Within our framework, the “potential” is an abstract concept that represents an effective potential landscape. This mathematical tool aids in visualizing the system’s dynamics, offering insights into the stability of different activity patterns and the transitions between them. It is important to underscore that this effective potential is distinct from the actual metabolic energy consumption of neurons.
Moreover, neural circuits, akin to other biological systems, require energy to perform various vital functions30,31. The biological energy usage is associated with metabolic costs incurred during the maintenance and transition of neuronal states, such as the ATP (adenosine triphosphate) consumed by ion pumps to re-establish ionic gradients after activity and the energy necessary for action potential generation. In our non-equilibrium potential and flux framework, we account for these biophysical energy considerations by examining the entropy production rate. This entropy production rate serves as a proxy for measuring the cost of biological energy that underlies biological functions32–34. By integrating this measure, we gain novel insights into the role of energy supply in the context of these cognitive processes.
Here, we use a reduced version of the spiking neuronal network models comprised of integrate-and-fire types through a mean-field approach3,15,35, which can reproduce most of the psychophysical and physiological results in delayed response DM tasks13,15,25. By quantifying the underlying attractor landscapes, which refer to the landscapes of stable states into which the system settles and determine the behavioral responses of the system, we found that a circuit architecture with selective inhibition results in stronger resting states, improving DM accuracy. However, the weaker decision states in this circuit structure lead to less stable working memories against distracting stimuli. Our results show that presenting a ramping non-selective input during the initial phase of the delay interval in DM tasks can serve as a temporal modulation mechanism that enhances WM robustness against distractors. Maintaining robustness requires stabilizing the decision attractor landscapes, which our thermodynamic analysis shows has an associated energetic cost. Compared to alternatives like a constant non-selective input, a ramping input achieves robustness using less total thermodynamic cost over time. This temporal gating mechanism complements the selective inhibition architecture to dynamically emphasize either robustness or flexibility of working memory depending on specific demands. Overall, our approach provides a new paradigm for exploring the underlying mechanisms of various cognitive functions.
Results
Influence of the circuit architecture on DM and WM functions
Both the posterior parietal cortex (PPC) and prefrontal cortex (PFC) are key nodes that can exhibit characteristic neural activity of cognitive functions such as DM and WM36–38. The perceptual decision-making behavior and working memory function in these cortical areas were well explored through the classic random-dot motion (RDM) discrimination tasks39–42. In such experimental paradigms, monkeys are trained to judge the direction of motion of random dots, and their choice responses are indicated by a rapid saccadic eye movement. In a delayed response version of the task13, monkeys are required to withhold the response for a delay period, which means that its choice must be actively maintained in working memory. To clarify the neural foundation of DM and WM, models have shown that DM can be defined by ramping dynamics, where neural activity gradually increases over time as evidence accumulates, and a “winner-take-all” process in which the neural population representing the chosen choice suppresses the activity of other competing populations3,20. Meanwhile, WM is linked to sustained activity within neural circuits that preserves stimulus information over a delay period3,23. These circuits are structured so that groups of stimulus-selective excitatory neurons compete, regulated by feedback inhibition from a common pool of non-selective inhibitory neurons3,15,20,21,23 (Fig. 1a).
Fig. 1. Selective subnetwork of inhibitory neurons enhances the accuracy in decision-making.
a The diagram of the circuit model with non-selective inhibitory neurons. b The circuit architecture with a selective subnetwork of inhibitory neurons. The control parameter w can indicate the degree of selectivity of the inhibitory neurons. w and w− modulate the strength of connections between the excitatory and inhibitory populations. w increases the strength of connections between two sub-populations with the same subscript relative to cross-connections. w− decreases with increasing w, modeling the homeostatic depression of connections between clusters with the different subscripts (1,2). Larger w implies the inhibitory neurons more selective (see more details in the Methods section). c, d Neural activities during DM for different circuit architectures with non-selective and selective inhibitory neurons, where w = 1, 1.06, respectively. The blue curves represent the activities of the excitatory neural population 1 for correct trials, while the red curves for the error trials. The black trajectories mark the firing rate of population 1 for the trial, with median decision time. e, f The average decision times and correct rates in DM with varied architectures indicated by the parameter w. For the generality of the results, we explored the DM tasks with different difficulties (stimulus motion coherence c = 0.1, 0.2, 0.5, respectively). Each data point in Fig. 1e, f are averaged over 30,000 trials. The circuit architecture with a selective subnetwork of inhibitory neurons (larger w) results in longer decision time, with variability across trials represented by error bars (SEM). In addition, the accuracy in DM increases with w. The underlying mechanism can be reflected from the attractor landscapes showing that the selective-inhibition architecture leads to a stronger resting state, which extends the time of integration of evidences ((g–i) with w = 1.01, 1.04, 1.06, respectively and the nonzero-motion coherence c = 0.1).
However, recent experimental and modeling evidence suggests that specific functional circuits exist within inhibitory populations, analogous to the stimulus-selective circuits within excitatory populations5,24. These inhibitory “subnetworks” can be thought of as distinct inhibitory circuits preferentially connected to different excitatory populations. Studies have found that during learning, the formation of selectivity in these inhibitory subnetworks takes place concurrently with the development of selectivity in excitatory populations (Fig. 1b)24. To study why such circuit architecture is chosen from many different alternatives, we explored a biologically realistic attractor model to test how different circuit architectures influence performance in DM and further WM tasks.
In traditional drift-diffusion models (DDM), the decision-making process is modeled as a linear accumulation of sensory evidence towards predefined decision boundaries, with parameters such as drift rate, boundary separation, and starting point determining the decision dynamics18,19. However, these models do not explicitly account for the underlying neural circuitry involved in DM and WM. In contrast, the attractor network framework conceptualizes DM as a nonlinear process driven by the dynamic interactions of neural populations. A DM task in this framework commences from a resting state, representing baseline neural activity before decision-making begins. Upon presentation of an external stimulus, the system’s trajectory through the high-dimensional neural state space is governed by network dynamics, including mutual excitation within selective neural pools and inhibition between them. Unlike the linear evidence accumulation in DDM, trajectories in the attractor framework can exhibit complex patterns, such as nonlinear pathways, which ultimately converge in one of the competing attractor states, stable neural activity patterns that correspond to specific decision outcomes15,20.
This approach not only captures the stability and flexibility of neural representations during DM but also integrates working memory processes through persistent neural activity within attractor states. By quantifying landscape features such as basin depths and barrier heights between attractors, we aim to relate these features to quantifiable aspects of the model’s behavior observable in simulations. Here, we use a reduced version of the spiking neuronal network models comprised of integrate-and-fire types through a mean-field approach3,15,35, which can reproduce most of the psychophysical and physiological results in delayed response DM tasks13,15,25. In this model, the two excitatory neural populations (selective for rightward and leftward motion directions) receive external inputs from the visual motion stimulus. These inputs are linear functions of the motion coherence c (the percentage of random dots moving coherently, see details in the Methods section), which sets the bias of the input for one population over the other, depending on whether the motion stimulus is in the preferred or non-preferred direction of the cell. For a zero-coherence stimulus (c = 0), the two excitatory populations receive equal input. Under these conditions, making a decision becomes more challenging, as the decision-making process is predominantly influenced by random noise. For a high coherence c, there is a larger difference in the external input currents to the two excitatory populations. This implies that it is easier to make a decision.
In contrast to the assumption of non-selective inhibition in previous theoretical models (Fig. 1a), recent studies suggested that the subnetworks of selective inhibitory neurons support DM24. As shown in Fig. 1b, we model this circuit architecture assuming selective inhibitory neurons by dividing the inhibitory neurons into two sub-populations (I1 and I2), each connected preferentially to one excitatory pool (E1 or E2). We use a control factor w to modulate the connection strengths between the excitatory and inhibitory sub-populations. The connection strengths within population 1 or 2 are larger by the factor w, while cross-population (between 1 and 2) connection strengths are smaller by a factor of w−(details in the Methods section). In the presence of external stimulus in DM, the increasing factor w(>1) resulted in a gradual increase in the distance between the average neural activities in the two inhibitory sub-populations (Fig.S1 in the supplementary materials), which are similar to the two competing choice states represented by the neural activities in the two selective excitatory populations. Therefore, larger w implies that the sub-populations are more selective, while w = 1 corresponds to the completely non-selective case. In this model, a nonzero-motion coherence stimulus suggests that the excitatory population E1 receives larger external input. A correct choice is reflected in the ramping neural activity of the population E1 to a high-activity state while the population E2 to a low-activity state. As shown in Fig. 1c, d, the circuit structure indicated by the factor w plays a crucial role in determining the time integration in perceptual decisions, here with the motion coherence c = 0.2. With the non-selective structure (w = 1), the neural activity of the population E1 ramps relatively faster. With a selective structure (w > 1), the timescale of integration is relatively slower.
The easier tasks indicated by the larger motion coherence c always correspond to shorter decision times and higher accuracy. However, for a specific motion coherence condition, there is a tradeoff between the decision time and the accuracy, which is known as the speed-accuracy tradeoff (SAT) in DM31,43. Unlike the drift-diffusion model, which utilizes explicit decision thresholds to trigger decisions, our attractor model embeds decision criteria implicitly within the system dynamics. In this framework, the decision criterion is represented by the system’s convergence to one of the stable attractors, with the basin of attraction determining the conditions required for convergence to a particular decision state. Therefore, the decision time here is defined as the time required for the system to transition from the resting state to a decision attractor in response to the external stimulus. Figure 1e depicts that the average decision time increases as the factor w is increased in different motion coherence conditions. Figure S2 shows the decision time distributions (kernel density estimates) for varying w. The longer integration times in decision processes are accompanied by higher accuracy (Fig. 1f). However, the overall performance depends not just on this speed-accuracy tradeoff, it also relies on the specific constraints of the decision task. For tasks with long required delays between trials, higher w allows better accuracy to accumulate given available time. But for tasks emphasizing speed on individual trials, lower w can enable faster decisions and superior performance.
To investigate the underlying mechanisms for the improved accuracy that results from the signal-selective architecture, we quantified the attractor landscapes that characterize the DM function with varying w (see details in the Methods section). As shown in Fig. 1g, there are two biased basins of attraction (decision attractors) corresponding to the two choices in the presence of the nonzero-coherence stimulus. The attractor corresponding to the correct choice is stronger, whereas the other is weaker. The system initiates at the resting state between the two decision attractors before the onset of the external stimuli. When an external stimulus is introduced, the resting state becomes unstable, compelling the system to evolve toward one of the two decision attractors. This process serves as the decision rule, where the system’s dynamics dictate the final decision outcome based on the shape of the potential landscape. Specifically, the decision criterion is achieved when the system’s state traverses the landscape barrier separating the resting state from the decision attractors, effectively reaching a stable attractor corresponding to a particular choice. This mechanism allows for quantitative predictions of decision-related measures. For instance, higher barriers result in longer decision times as the system requires more time to accumulate sufficient evidence to overcome the barrier.
As w is increased, inhibitory neurons become more selective, leading to a stronger resting state characterized by a local minimum in the potential landscape situated between the two competing decision attractors (Fig. 1g–i) with w = 1.01, 1.04, 1.06, respectively). Consequently, a higher w value results in a larger barrier that trajectories must overcome to reach a decision attractor. This landscape configuration directly predicts longer decision times in simulations, as increased evidence accumulation is necessary to traverse the higher barrier (Fig. 1e). Additionally, the slower evidence integration process afforded by higher barriers reduces decision errors induced by random noise, enhancing decision accuracy.
To provide a more comprehensive understanding of the parameter relationships, we show a three-dimensional graph (Fig. 2) illustrating how accuracy varies across different combinations of inhibitory selectivity w and motion coherence c. The surface plot reveals a clear threshold boundary where accuracy transitions to 100%, with the critical w value decreasing as motion coherence increases (the red curve). This relationship demonstrates how task difficulty and circuit architecture interact: for more challenging tasks (lower c), higher inhibitory selectivity is required to achieve perfect performance, while easier discriminations can reach maximal accuracy with less selective inhibition. At the threshold boundary, the system transitions from bistable dynamics supporting two competing attractors to a monostable regime where only the correct choice attractor remains. Figures S3, S4 show the phase-planes for different w with the motion coherence c = 0.4 and 0.5, respectively. When w exceeds this critical value for a given coherence level, the potential barrier maintaining the incorrect choice state disappears, making the system deterministically converge to the correct decision regardless of noise fluctuations. These findings highlight how neural circuits can adaptively tune selectivity parameters to optimize decision-making performance across varying task difficulties, suggesting a principled framework for understanding the evolution of inhibitory circuit architectures in cortical networks supporting perceptual decisions.
Fig. 2. 3D parameter relationships and the threshold curve.
Three-dimensional visualization of decision accuracy as a function of selective inhibition w and stimulus strength c. The surface highlights parameter regimes where selective inhibition w and stimulus strength c synergistically optimize accuracy. Critical w values (red curve) at which decision accuracy reaches 100% for varying coherence c. Higher c reduces the required w, demonstrating that stronger sensory evidence compensates for weaker selective inhibition.
Our results raise the possibility that the brain chooses the higher accuracy architecture among possible alternatives to support DM. This theoretical prediction is consistent with previous experimental findings that used machine learning techniques such as support vector machines (SVMs) and classifier analysis to decode choice-related signals from neural population data24. The study showed that choice representations emerged simultaneously in excitatory and inhibitory populations during task learning, supporting our model result that selective inhibition circuitry improves decision accuracy by strengthening choice coding in both populations.
In some cases, for example, in the delayed response version of the visual motion discrimination task13, the choice must be actively maintained in the working memory. Robust WM requires shielding the interference from both internal noise and external distraction. To assess how the signal-selective structure influences the WM function, a distractor stimulus is applied during the delay period. The strength of this distractor stimulus is equal to the original visual motion stimulus, while the motion coherence condition is inverse (the excitatory population E2 receives a larger external input). The distractor is introduced after the system has reached the decision attractor and is presented until a transition occurs. During this time, the network received an additional input intended to disrupt the sustained activity representing the initial stimulus. Figure 3a shows that it takes less time to make a different choice (switch to the other decision state) in the presence of the distractor stimulus as the factor w increases. The robustness of the decision attractor can be quantified through the barrier height between the decision attractors that is inferred from the landscape topography. As shown in Fig. 3b, the barriers separating the current decision attractor from the other one are reduced as the circuit architecture changes from the one with non-selective inhibition to the one with selective inhibition.
Fig. 3. The robustness of WM against distractors.
a The average transition time from the correct choice to the error one in the presence of distractors for different circuit architectures. The system has to go across a potential barrier to switch to another decision state. b The barrier heights (bh) inferred from the underlying landscape topography can measure the robustness of the decision state in WM. Both the average transition time and the corresponding barrier heights are reduced as the w is increased, which implies less robustness against distractors.
The transition rates between decision attractors in the presence of distractors are influenced by the barrier heights, with higher barriers reducing the likelihood of transitions and enhancing WM resilience against distractions. As w increases, the barriers separating the decision attractors are reduced (Fig. 3b). Concurrently, the intermediate “initial” state becomes more stable (Fig. 1h, i). This combination of effects leads to less robust memory states that can be more easily reverted to the stabilized initial state under distraction, enabling quicker flipping between decision representations (Fig. 3a). The increasingly stable intermediate attractor at higher w provides a gateway state that accelerates transitions between decision attractors when the system is perturbed from sustained choice representations. Our results suggest that the signal-selective structure is less robust against distractors. On the other hand, the weakened stability of the decision state due to the strengthened intermediate state implies that the initial errors are more likely to be corrected in the presence of the visual stimuli. This can further improve the accuracy of DM in addition to the increased accuracy of the initial choices. In summary, there is a tradeoff between the accuracy in DM and the robustness in WM.
Dynamical modulation of the tradeoff between decision-making and working memory functions
Our results suggest that the circuit architecture with selective inhibition enhances accuracy in decision-making while reducing the robustness of decisions held in working memory against distracting stimuli. This raises an interesting question: What is the underlying mechanism for protecting memory from interference in the circuit with a selective inhibition architecture? If the circuit architecture is specified, one possibility is that gating incoming stimuli might be achieved by upstream modulation signals to the circuit. To test this hypothesis, we applied an additional non-selective input (i.e., the same input to both selective neuronal populations) during the delay period of the DM tasks. We examined how this additional non-selective input influences the dynamical behavior and the underlying attractor landscape of the system.
In the delay period of the DM tasks, the presence of distracting stimuli can cause changes from the initial correct choice to the incorrect one. Figure 4a shows that in the presence of the distracting stimuli, the average transition time to the incorrect decision state from the initial correct one increases as the additional non-selective input (indicated by Δ>I0) is increased. This implies that the increasing non-selective input can enhance the robustness of WM against distractors. When the distracting stimulus is presented for a limited duration (for example, 1s), the proportion of switches to the wrong choices from the initial correct one reduces with the increased non-selective input (Fig. 4b). The mechanism of the enhanced robustness against distractors due to the additional non-selective input can be uncovered from the changes in the underlying landscape topography (Fig. 4c). Both the two decision attractors become stronger with increased non-selective input. The larger non-selective input increases not only the separation between two decision attractors, but also the depth of the attractor basins and the barrier between them (Fig. 4d–f). It is less likely to go across the barrier and switch to the other decision state in the presence of a larger non-selective input, thereby resulting in the gating of the distracting inputs.
Fig. 4. An increasing non-selective input can enhance the robustness of WM against distractors.
a The average transition time from the correct choice to the error one in the presence of distractors increases with increased non-selective input to both selective excitatory neural populations. The ΔI0 here represents the magnitude of the additional non-selective input. b The transition rate in a limited duration of the distracting stimulus (1 s) is reduced as the non-selective input is increased. The mechanism of improved robustness is reflected in the larger barrier height (bh) inferred from the underlying landscape topography (c). The detailed attractor landscapes with increased non-selective input are displayed in (d–f), where the non-selective input I0 = 0, 0.004, 0.008 nA, respectively, with the specific nonzero-motion coherence c = 0.2 and w = 1.
In the delay period, the presence of distracting stimuli can cause transitions from the initial correct choice to the incorrect one. Figure 4a shows that, in the presence of distracting stimuli, the average transition time from the correct decision state to the incorrect one increases as the magnitude of the additional non-selective input increases. This implies that increasing the non-selective input enhances the robustness of WM against distractors. When the distracting stimulus is presented for a limited duration (e.g., 1 s), the probability of switching to the wrong choice from the initial correct one decreases with the increased non-selective input (Fig. 4b). The mechanism behind the enhanced robustness against distractors due to the additional non-selective input can be understood from changes in the underlying attractor landscape topology (Fig. 4c). Both the two decision attractors become stronger with increased non-selective input. The larger non-selective input increases not only the depth of the attractor basins for the decision states, but also the barriers between them (Fig. 4d–f). With larger barriers, it becomes less likely for the system to cross over to the other decision state in the presence of distractors, thereby effectively gating distracting inputs.
However, applying a large additional non-selective input directly is accompanied by increased thermodynamic cost. Neural circuits, as open systems with constant exchange of material, information, and energy with the environment, are intrinsically out of equilibrium. The thermodynamic cost for maintaining persistent neural activities can be measured by the entropy production rate in non-equilibrium systems29,32–34,44,45 (more details are provided in the Methods section). Figure 5a shows that this thermodynamic cost increases with the magnitude of the additional non-selective input. Therefore, while a larger non-selective input enhances WM stability against distractors, it does so at the expense of higher thermodynamic.
Fig. 5. The disadvantages of strong non-selective input in DM and WM.
Although the increased non-selective input can improve the robustness of WM against distractors. Larger thermodynamic costs are needed to support the enhanced robustness against distractors during the delay period (a). The ΔI0 here represents the magnitude of the additional non-selective input. It leads to shorter decision time (b) while making less accurate choices (c), and the initial errors are less likely to be corrected (d). Presenting A ramping input during the delay period may serve as a cost-effective mechanism of temporal gating of distractors.
Moreover, we found that applying a large non-selective input throughout the entire task adversely affects the DM function. Specifically, when the additional non-selective input is applied from the onset of the visual stimuli during the evidence accumulation period of the DM tasks, it leads to faster decisions but with lower accuracy (Fig. 5b, c). The underlying landscapes that determine these behavioral responses are similar to the ones shown in Fig. 4f. The stronger decision attractors reduce the integration time of evidence, which can result in more errors. Additionally, the initial errors may be corrected during the delay period of the delay-response DM tasks31,46. Figure 5d shows that the probability of correcting initial errors during the delay period decreases as the additional non-selective input increases. These results suggest that if the additional non-selective input is presented together with the target stimuli during the evidence accumulation period of the DM tasks, there could be more errors. This seems to be in contradiction with the mechanism underlying the circuit architecture with selective inhibition, in which the accuracy of DM should be emphasized. Thus, to preserve DM accuracy, the additional non-selective input should only be applied during the delay period to enhance the WM function against distractors without reducing DM accuracy.
To address the increased thermodynamic cost associated with a large constant non-selective input during the delay period, we propose that a gradual increase in the strength of the non-selective input during the delay period can achieve enhanced robustness more efficiently. Due to uncertainties in the exact temporal dynamics of such modulatory signals in biological systems, we introduce a specific time-varying form of the additional input (stepped increases over time) in our simulations. This increasing input allows for progressive strengthening of the decision attractors over time. Initially, the system maintains flexibility, which is beneficial for integrating any new relevant information and correcting potential initial errors. As the delay period progresses and robustness becomes more critical, the increasing non-selective input enhances the system’s resistance to distractors, effectively gating out interfering stimuli. By allocating energy resources in this time-dependent manner, the system can achieve stability with a lower overall thermodynamic cost. As demonstrated in Fig. S5, the ramping non-selective input reduces the transition rate (improving robustness) compared to the no-input condition while incurring lower entropy production than a large constant input. This mechanism dynamically prioritizes flexibility early in the delay period (allowing error correction) and robustness later (shielding against distractions). This approach complements the selective-inhibition circuit architecture and provides a dynamic mechanism to balance DM and WM functions according to cognitive task demands.
Discussion
One of the distinguishing features of biological systems, as opposed to conventional physical systems, is their ability to adapt to the environment. The fundamental cognitive functions of decision-making (DM) and working memory (WM) play crucial roles in processing and acting upon the information that animals perceive from their surroundings. Within the attractor network framework proposed by previous researchers, the distinct neural activities associated with DM and WM observed in the prefrontal and posterior parietal cortex can be replicated in the circuit architecture through strong recurrent excitation within selective populations and lateral inhibition via a single pool of inhibitory neurons3,15,20. However, recent experimental and modeling evidence suggests the existence of specific functional subnetworks within inhibitory populations24. These subnetworks challenge the previously proposed circuit mechanisms that assumed a single non-selective pool of inhibitory neurons, and the mechanisms underlying DM and WM functions based on such a circuit architecture remain unclear.
The attractor landscape provides a comprehensive and quantitative description of the dynamics of neural circuit systems. Different landscape topographies indicate that the number and relative weights of attractors (functional states) vary in the underlying landscapes, determining the categorical choice in decision-making and the robustness of working memory. By quantifying landscape features such as basin depths and barrier heights, we can directly relate these to observable behavioral metrics. For example, higher barrier heights necessitate longer decision times, as the system must accumulate more evidence to transition from the resting state to a decision attractor. This is in contrast to the drift-diffusion model, where decision thresholds are explicitly defined parameters. In the attractor framework, the decision criterion is implicitly embedded within the system’s dynamics, determined by the basin of attraction around each attractor. This allows for a more nuanced and biologically plausible representation of decision-making processes47. Additionally, the transition rates between attractors, especially in the presence of distractors, are intrinsically governed by the potential barriers, enhancing our understanding of WM’s resilience against interference. These quantitative insights demonstrate the superiority of the attractor model in elucidating the underlying neural mechanisms of cognitive functions over more abstract models like the drift-diffusion model.
Although the concept of attractor landscapes has been extensively introduced to describe cognitive functions, they are often presented as illustrations3,15,20 or quantified only in limited, specific circuits26. Drawing inspiration from thermodynamics and statistical mechanics for physical systems, we have developed a non-equilibrium potential landscape and flux framework for general neural circuits21,28,29. By applying this approach to a biophysically based model that can perform DM and WM computations, we have explored the circuit mechanisms of DM and WM by focusing on the influence of varied circuit structures on the system’s behavioral responses and underlying attractor landscapes.
By relating quantifiable natures of the potential landscape to computational features emergent in the simulations, our approach provides a comprehensive framework for elucidating mechanisms that govern the model’s function. We found that the circuit architecture with selective inhibition supports three stable states, including one resting state and two decision states, in the presence of visual stimuli. In contrast, the circuit architecture with non-selective inhibition only supports two decision states, resulting in a shorter time for evidence integration. Consequently, the decision time is longer with higher accuracy in the circuit architecture with selective inhibition, which is consistent with previous results obtained using a machine learning method24.
We found that the circuit architecture with selective inhibition supports decision-making through competition between two choice attractors. Similar to models such as the leaky competing accumulator48, this competition is implemented via winner-take-all dynamics from the selective inhibition. Specifically, selective versus non-selective inhibition results in a stronger intermediate resting state in the underlying potential landscape. This increases the evidence required to switch from rest to a decision, lengthening decision times while filtering out noise to improve accuracy for difficult choices. In both cases, inhibition between competing choices balances the speed-accuracy tradeoff through modulating evidence accumulation over time.
Protecting the decision held in WM from interference is crucial for cognitive tasks, such as a delayed response version of the decision-making task. However, whether the stability of WM can also benefit from such circuit architecture is still unknown. Our findings suggest that memories are less robust against distractors due to reduced barriers between decision attractors on the underlying landscapes. Thus, there is a tradeoff between the DM and WM functions based on a specific circuit architecture. Our prediction suggests that additional dynamic modulation mechanisms beyond the specific circuit architecture may be necessary for balancing DM and WM functions according to cognitive task demands.
Based on the selective-inhibition circuit architecture, which emphasizes DM accuracy, we found that applying an additional non-selective input during the delay period of the task enhances working memory robustness against distractors in a cost-effective manner. A larger non-selective input induces stronger decision attractors and higher barriers between them, making it less likely for distractors to change the initial choice. Our potential landscape and flux approach provides a valuable framework for understanding the dynamics of neural circuits.
In reality, the functioning of neurons is powered by metabolic energy, with ion pumps consuming ATP to maintain ionic gradients and action potentials representing a significant energy expenditure. We employ the concept of the entropy production rate to provide a realistic measure of the dissipation costs associated with neural activity. This concept is grounded in the principles of non-equilibrium thermodynamics and offers a direct connection to the thermodynamic expenditures required to sustain the neural network’s functions32–34. We observed that maintaining higher barrier heights for enhanced robustness incurs a significant thermodynamic cost, as evidenced by the increased entropy production rates required to sustain these landscape configurations. This tradeoff underscores the balance between thermodynamic cost efficiency and cognitive performance, suggesting that the brain must optimize landscape characteristics to achieve the desired behavioral outcomes within metabolic constraints.
In terms of DM function, additional non-selective inputs can lead to shorter decision time but lower accuracy when presented together with target stimuli during the evidence accumulation period. Thus, the additional non-selective input should only be presented during the delay period against distractors. Valuable information may be transmitted to the DM circuit during the early delay period, which can help correct errors that may occur in the initial choice, and should not be gated. Combining these findings, the most efficient way to achieve higher accuracy in DM while guaranteeing a more robust choice with less cost is through the circuit architecture with selective inhibition and applying an additional non-selective input during the delay period in a manner that gradually increases in strength. These results are consistent with recent experimental work suggesting that a ramping input, rather than an input with constant strength, renders the system insensitive to distracting stimuli over time49.
Sometimes, in order to adapt to environmental conditions or demands from behavioral tasks, the working memory system should be highly flexible rather than robust to incoming stimuli22,50. Previous research has investigated the biophysical mechanisms responsible for this flexibility, including the ability to erase memory (return to the resting state) using a negative input51. A recent study suggests that an intermediate state with high activity for both selective neural populations, resulting from increased recurrent excitation within these neural pools, can enhance flexibility to new stimuli21. In this study, we found that transitions between different memory states can be promoted through circuit architecture with selective inhibition, where the resting state serves as the intermediate state for switching. This intermediate state increases the accuracy of initial choices and may help correct initial errors in the presence of visual stimuli from the perspective of the DM function. From the perspective of the WM ability, such an intermediate state makes the circuit more flexible to the most recent incoming stimulus, making it easier to load new inputs and discard old memories. This implies a new network mechanism, indicating that enhanced flexibility to new stimuli in WM does not require an additional negative input to reset the system, nor does it require the intermediate state resulting from increased recurrent excitation. Rather, enhanced flexibility can arise from circuit architecture with selective inhibition, resulting in a strengthened resting state as the intermediate state for transitions to another memory state. Future experiments monitoring neural activity in subjects trained to remember the latest stimulus will reveal whether enhanced flexibility originates from the selective inhibition circuit.
A recent modeling study has also highlighted the crucial role of inhibitory neurons in DM, with their specificity in connections that mediate the speed-accuracy trade-off52. Our current work examines not only how the selective inhibition circuit architecture affects DM function but also the tradeoff between DM and WM functions based on this specific circuit architecture. Additionally, we investigate potential dynamic modulation mechanisms beyond the specific circuit architecture that can balance DM and WM functions according to cognitive task demands. Importantly, our potential landscape and flux approach provides explicit elucidation of the biophysical mechanisms underlying specific cognitive functions, in contrast to previous studies relying only on statistics of individual trajectories23,24,53. Furthermore, our approach quantifies the nature of dissipative consumption underlying cognitive functions.
Recent advances have expanded landscape quantification approaches21,28,31 to encompass both spiking neural networks and large-scale working memory models54,55. These works demonstrate the utility of landscape frameworks in analyzing complex anatomical architectures and distributed neural systems. However, our simplified architecture intentionally isolates core circuit mechanisms, such as selective inhibition, to systematically examine how circuit parameters shape attractor landscapes, thermodynamic costs, and cognitive tradeoffs. This targeted approach enables us to establish direct causal relationships between inhibitory subnetworks and cognitive functions, which can be challenging to disentangle in models with extensive connectivity patterns or region-specific variations. Our work provides mechanistic insights into a foundational cortical processing module, complementing broader network analyses by revealing circuit-level principles that may underlie distributed cognitive computations56. Future research can integrate these findings with detailed connectomic data to investigate how selective inhibition architectures interact with long-range connectivity patterns to optimize cognitive performance across distributed neural systems.
Overall, our results indicate that fundamental evolutionary drivers for neural circuit architectures supporting DM and WM have likely emphasized the optimization of DM accuracy and the enhancement of WM flexibility. This evolutionary shaping allows for an effective response to environmental changes, with the circuit structure featuring selective inhibition fine-tuned to balance behavior and optimize DM based on the latest information. When the behavioral task demands prioritize memory robustness against distractors, applying an additional non-selective input during the delay period, in a manner that gradually increases in strength, can enhance the robustness of WM against distractors with a relatively lower thermodynamic cost compared to maintaining a constant high input throughout the delay period. This temporal gating mechanism complements specific functional subnetworks in the circuit architecture to achieve an optimal balance between the DM and WM functions.
Our study demonstrates that the non-equilibrium landscape and flux framework offers a comprehensive and quantitative approach to understanding the neural mechanisms of DM and WM. The ability to quantify landscape features provides a powerful tool for linking neural circuitry parameters to cognitive function metrics. For instance, adjusting the selectivity parameter offers a direct pathway to modulate decision accuracy and WM robustness, offering targets for future experimental investigations. By explicitly linking landscape features to decision time and transition rates, we provide a robust foundation for future explorations into the dynamic modulation of cognitive functions within biologically plausible neural architectures.
Methods
Non-equilibrium landscape and flux framework for general neural circuits
The metaphor of an attractor landscape is employed to conceptualize how neural activity patterns correspond to various cognitive states, with each state represented as a basin or valley within this landscape. For example, this attractor landscape metaphor is widely used to describe cognitive functions such as associative memory retrieval, classification1,26,57. However, this concept can only be mathematically delineated and computed, in terms of stability and transition rates, in models with specific symmetry assumptions, such as the symmetrical neural circuit in the original Hopfield model1,26. To address the issues of global stability of states associated with cognitive functions, we developed a non-equilibrium potential landscape and flux theory for a wide array of neural network models beyond those with specific symmetries, thereby encompassing more biologically realistic architectures28.
For realistic complex systems comprising numerous interacting components, tracing microscopic trajectories of every component via solving corresponding dynamical equations can be a daunting task. Nonetheless, macroscopic emergent behaviors of the system can be approximately described by the dynamical evolution of several macroscopic observables with deterministic driving forces and random forces that represent effects of microscopic degrees of freedom on macroscopic observables. One well-known example is the stochastic Langevin particle dynamics. We consider a system described by a set of Langevin equations: in which the vector, x, represents the state variables of the system. The corresponding driving force is represented by the vector, F(x), while ζ represents Gaussian white noise, which has an ensemble average (denoted by < ζ > = 0) of zero, signifying that the mean of the noise over many realizations is zero. The autocorrelations . Here, D is a scale factor representing the magnitude of the fluctuations, and D(x) is the diffusion tensor or matrix. The state variables x typically evolve according to nonlinear dynamical laws due to the complex interactions in the system. These laws can lead to diverse behaviors, and, when coupled with stochastic influences from random forces (noise), result in trajectories that are inherently unpredictable.
These stochastic differential equations describe the neural firing trajectories in time, the corresponding probabilistic evolution is given by the Fokker–Planck equation for the probability density28,58,59:
| 1 |
The corresponding Fokker–Planck equation can be written as:
| 2 |
where the probability flux is defined as:
| 3 |
The change in the local probability P(x, t) over time is determined by the net probability flux J flowing in or out of that region, ensuring local conservation of probability. The divergence of the flux is zero at long times when a steady state emerges:
| 4 |
JSS represents the steady-state flux. However, it is not necessary for the flux itself to be zero under this divergence-free condition. A nonzero flux signifies a net directional flow of probabilities that disrupts the detailed balance—a state where every microscopic transition between states is exactly counterbalanced by its reverse—highlighting that the system is operating out of thermodynamic equilibrium. Additionally, the divergence-free flux suggests that there are no sources or sinks for the probability to go to or come from, and hence, locally, the flux must be rotational.
When a system reaches steady-state, the corresponding steady-state probability distributions can be solved through the steady-state equation:
| 5 |
Further, the non-equilibrium potential function can be defined in analogy to the Boltzmann law in equilibrium statistical mechanics. The driving force can be decomposed as28,29,31,58,59:
| 6 |
In contrast to equilibrium systems, where the dynamics are governed solely by the gradient of the underlying energy function, resulting in a zero net flux, the dynamics of non-equilibrium systems are influenced by both the potential gradient and the curl flux (JSS). The non-equilibrium potential U serves as a useful tool for characterizing the global behavior of such systems.
In a steady state, non-equilibrium thermodynamic systems are associated with continuous entropy production28,29,31. While the entropy of a non-equilibrium system in steady state remains constant over time, there is a flow of entropy to the surroundings equal to the entropy generated spontaneously within the system. To calculate the entropy production, the entropy associated with the time-dependent probability distribution can be focused on. Using the Fokker–Planck equation and the definition of probability flux J(x), the time derivative of the system entropy dS/dt can be expressed as a sum of two terms:
| 7 |
Notice that the first term on the right-hand side of the equation is always larger or equal to zero due to the positive definite diffusion matrix D. Therefore, it can be identified as the total entropy production rate, which is the global thermodynamic dissipation or cost directly linked to the non-equilibrium flux from the dynamics29:
| 8 |
where EPR stands for entropy production rate. The second term on the right-hand side of the equation 6 can then be regarded as the entropy flux from the system to the environment:
| 9 |
The total entropy change is equal to the sum of the entropy change of the system and that of the environment, leading to the emergence of the generalized non-equilibrium first law of thermodynamics:
| 10 |
The entropy change of the non-equilibrium system () can either be increased or decreased due to the entropy flow to the environments, while the total entropy change of the system and the environments, represented by the entropy production rate, is always non-negative. This leads to the generalized non-equilibrium thermodynamic second law:
| 11 |
Quantification of neural circuit energy expenditure
Biological systems, such as neural circuits, consume energy to perform different vital functions30,31. While increased firing rates generally lead to increased energy demand60, relying solely on absolute firing rate to quantify neural circuit energy expenditure fails to capture substantial portions of the brain’s overall energy budget. As Marcus E. Raichle described with the concept of “dark energy,” a considerable fraction is devoted to baseline cellular processes independent of external stimuli, with task-related increments potentially accounting for only 0.5–1.0% of total consumption61. Additionally, nearly one-third finances “housekeeping” functions such as protein and lipid synthesis, proton leak across the mitochondrial membrane, and cytoskeletal rearrangements. Firing rates do not reflect these continuous, non-signaling processes comprising 25–50% of brain energy expenditure62. Hence, firing rate alone may not adequately measure the spectrum of neural circuit energy costs.
In our study, we employ the concept of the entropy production rate (EPR) to quantitatively describe the energy expenditure of biological systems—specifically neural circuits—by mapping their continuous energy dissipation. Our study is grounded in a robust theoretical framework, supported by a series of both theoretical and empirical studies29,33,34,44,63,64, that positions EPR as a pivotal thermodynamic quantity capable of quantifying the continuous energy dissipation characteristic of biological systems.
To operationalize this concept, we consider the biological system as an open system that is maintained far from equilibrium by the constant input and dissipation of energy, primarily through the hydrolysis of ATP and GTP molecules. The total thermodynamic cost is contributed both from the free energy input or known as the housekeeping heat QHK65 to maintain the non-equilibrium steady state and the generalized free energy relaxation : . The total thermodynamic dissipation or cost is quantified as the product of EPR and temperature, encompassing both steady-state maintenance and free energy changes29,44,63.
The hydrolysis of ATP, a main source of entropy production, provides the free energy to drive endergonic biochemical reactions, maintaining non-equilibrium conditions in living cells. The ATP/ADP ratio, indicative of these conditions, results in a chemical potential difference , analogous to the voltage in electric circuits, which generates a non-equilibrium flux, as the energy pump of the system32. As we have shown in Eq.8: EPR = ∫ dx(J ⋅ (DD)−1 ⋅ J)/P, the EPR is directly linked to the non-equilibrium flux. The previous work shows that the non-equilibrium flux J is directly linked to the chemical reaction flux32. The rate of ATP/GTP hydrolysis and the consumption of other metabolic fuels are thus directly related to the EPR, which quantifies the minimal rate of heat dissipation required to counteract fluctuation-driven disorder. Greater EPR denotes higher expenditure and metabolic demand to sustain non-equilibrium living system dynamics. Notably, this non-equilibrium entropy measure stems from stochastic thermodynamics considerations of dissipative work rather than information-theoretic entropy reflecting the state uncertainty and informational capacity more commonly applied in computational neuroscience contexts66.
Crucially, the relevance of EPR as a measure of energy cost is empirically supported by studies on the Escherichia coli chemosensory system33. In this pivotal study, researchers used the entropy production rate to quantify energy consumption, revealing a fundamental energy-speed-accuracy relationship. They showed that the non-equilibrium state of the system’s negative feedback mechanism necessitates energy dissipation for accurate sensory adaptation. The experimental validation of theoretical predictions in E. coli chemotaxis confirms the utility of EPR in quantifying energy costs in biological adaptation mechanisms.
Our application of EPR to neural circuits is informed by these findings, providing a robust framework for assessing continuous and non-equilibrium energy demands. This methodological advance allows us to quantify energy dissipation beyond transient neuronal firing, offering a more holistic view of the brain’s energy utilization.
The neural circuit model
In this study, the decision to utilize an attractor network framework over the drift-diffusion model stems from the primary objective of elucidating the underlying neural mechanisms of cognitive functions such as decision-making and working memory. While the DDM offers a robust mathematical framework for modeling behavioral data like reaction times and choice probabilities, it remains abstract in terms of neural implementation. The attractor model, on the other hand, provides a biologically plausible representation of neural dynamics by incorporating features like selective inhibition and mutual excitation among neural populations. By choosing the attractor model, our study aims to bridge the gap between cognitive theories and neural substrates, offering a more comprehensive understanding of how the brain processes and maintains information during decision-making and memory tasks.
This study focuses on the neural circuit, where the collective behavior of the network is of greater significance than the individual neurons. Therefore, the model developed for this study utilizes average firing rates instead of individual spikes, which is well-suited for representing populations or nuclei within a network. The corresponding mean-field approach, which reduces the spiking model to the firing rate model, is based on the seminal work of Wilson and Cowan67, which was later refined by Dayan and Abbot68 to address the computational and interpretational challenges posed by other models. The state variables that describe the macroscopic emergent behaviors of the neural circuit in the present model are the firing rates of different nuclei.
Here we explored a biophysics-based model that is able to perform WM and DM computations3,15,53,67. The model contains two selective, excitatory populations, labeled E1 and E2. These two populations have self-excitations from the strong recurrent excitatory connections within each excitatory population, and the overall effective connectivity between the two excitatory populations is inhibitory through the inhibitory population3. In contrast to the original models showing that the excitatory populations receive inhibition from a common pool of inhibitory interneurons3,15, recent studies suggest the existence of subnetworks of selective inhibitory neurons in the decision-making circuit24. We divide the inhibitory neurons into two sub-populations, labeled I1 and I2, as shown in Fig. 1b. To manipulate the degree of selectivity of inhibitory neurons, we set the connection strengths between two sub-populations with the same subscript to be stronger by multiplying a control factor w, while the connection strengths between sub-populations across different populations are made smaller by multiplying the factor , where f = 0.45. The factor w− is chosen based on previous work that describes how the synaptic weights between neurons belonging to different clusters were decreased69. The value of w determines the selectivity of the inhibitory neurons: w = 1 corresponds to non-selective inhibition, while a larger w implies more selective inhibition.
Previous modeling studies suggest that the decision-making circuit, which consists of thousands of spiking neurons, can be reduced into a two-variable model through several approximations, such as the linearization of the input-output relation of the inhibitory interneurons3,15. Based on the fact that the synaptic gating variable of NMDA receptors has a much longer decay time constant than other timescales in the circuit, the dynamics of the system can be described by the dynamical equations of the average NMDA synaptic gating variable Si:
| 12 |
where firing rate ri of neural population i is a function of total input current Ii,tot that can be written as15,53:
| 13 |
The corresponding parameters are a = 270(VnC)−1, b = 108 Hz, d = 0.154 s, γ = 0.641, and τS = 100 ms, the same as those shown in the previous work15. The total synaptic input currents of the two neural populations dominated by the NMDA receptors are:
| 14 |
here JEE is the effective coupling constant between excitatory sub-population and JIE is the coupling constant from the inhibitory sub-population to the excitatory sub-population for the non-selective case(w = 1). The factor w0 = 1.7 indicates the neurons belonging to the same excitatory sub-population with potentiated synaptic weight. ζ is the Gaussian noise term. With the linear approximation of the input-output transfer function of the inhibitory cell, the inhibitory input from an inhibitory sub-population to an excitatory sub-population depends on the total synaptic input current(II1 or II2) that this inhibitory sub-population receives, where
| 15 |
Substitute Equation (15) into Equation (14), and we can obtain the total synaptic input currents I1,tot and I2,tot that depend only on the average NMDA synaptic gating variables S1 and S2. It can be easily proven that the corresponding parameters in the input-output transfer function of the inhibitory cell can be absorbed into the coupling constants JIE, JEI, JII, and the background input IBG. Here, the particular values of the parameters are set as JEE = 0.48 nA, JII = 5 nA, JEI = 0.52 nA, JIE = 1 nA, and IBG = 0.31 nA based on the refs. 3,15,24.
To characterize the DM function in this circuit model, the external stimulus input to the selective neural population is introduced as Ii,ext = JA,ext ⋅ μ ⋅ (1 ± c), where i = 1, 2, JA,ext = 5.2 × 10−4 nA/Hz, μ = 30 Hz is the external average synaptic coupling with the AMPA receptors. The + or − sign refers to whether the stimulus is the preferred or non-preferred one of the corresponding selective neural population. As introduced in the main text, the motion coherence c sets the bias of the input for one population over the other. For a zero-coherence stimulus (c = 0), the two excitatory populations receive equal input.
To quantitatively analyze the neural network dynamics underlying DM and WM functions, we apply the non-equilibrium potential landscape and flux framework to this biophysically based neural network model. The stochastic differential equations above can be translated into a Fokker–Planck equation describing the evolution of the probability density function P(I1,tot, I2,tot, t). Then we can calculate the probability distributions at steady state: PSS and the corresponding potential function . Furthermore, we can obtain the non-equilibrium flux JSS and the corresponding entropy production rate EPR = ∫ dItot(J ⋅ (DD)−1 ⋅ J)/P.
In our attractor network model, tasks such as perceptual decision-making and working memory are simulated through the dynamics of neural populations. Decision rules are inherently defined by the system’s transition from a resting state to one of the stable attractor states in response to external stimuli. Specifically, when an external stimulus is introduced, the resting state becomes unstable, compelling the system to evolve towards one of the two decision attractors. This transition constitutes the decision-making process. Unlike the drift-diffusion model, which employs explicit decision boundaries or thresholds to determine when enough evidence has been accumulated to make a choice, our attractor model uses the potential landscape to implicitly govern decision criteria. The basin of attraction around each attractor determines the conditions under which the system will converge to that attractor, effectively setting the decision criterion without the need for predefined thresholds. Specifically, the decision process continues until the system reaches one of the decision states. This is detected using detection circles placed around the two decision minima. As soon as the evolving activities reach one of these circles, a choice is made, and the decision part of the trial is completed. The choice pertains to the associated decision minimum. The radius of this circle is one-tenth of the distance between two minima in the potential surface, which balances sensitivity to fluctuations in neural activity while preventing premature decisions.
The WM function is characterized by stimulus-selective persistent activities and their robustness against distractors. The distractor-related current is stimulated as an input with the same strength as the original external stimulus, while the coherence condition is inverse (the excitatory population that initially receives a larger external input now receives a smaller input). To investigate the robustness of WM against distractors, this distractor-related current was applied throughout the delay period until transitions occurred in our simulations. The delay period represents the phase where the working memory must maintain the choice in the absence of the initial stimulus.
We also examined the effect of an additional non-selective input on WM and DM functions. Due to uncertainties in the temporal dynamics of these modulatory signals in biological systems, we did not specify a time-varying form for the input in our simulations. For the WM function, a constant non-selective input was applied to both selective neuronal populations during the delay period until transitions from the original decision attractor occurred. By varying the input’s magnitude, we assessed its influence on the stability of working memory representations against interference. In the case of DM, the non-selective input was applied alongside the external stimulus throughout the decision-making process until a decision was reached.
The dynamical trajectories of the system are calculated using the Runge–Kutta method with an integration time step of 0.02 ms. In each trial, the random fluctuation is introduced by a noise term implemented as uncorrelated standard Gaussian noise with zero mean and a variance equal to 0.024 nA. For the computations of the average decision time/transition time and correct rate, each data point is obtained from more than 30,000 trials.
Supplementary information
Acknowledgements
We thank Kun Zhang and Li Xu for valuable discussions and suggestions related to this manuscript. H.Y. is grateful for support via National Natural Science Foundation of China Grants 12205306 and Natural Science Foundation of Jilin Province No. 20220101013JC.
Author contributions
H.Y. and J.W. designed research; H.Y. and J.W. performed research; H.Y. contributed new reagents/analytic tools; H.Y. and J.W. analyzed data; and H.Y. and J.W. wrote the paper.
Data availability
The data that support the findings of this study are openly available on the open science framework at https://osf.io/hw5rd/ (10.17605/OSF.IO/HW5RD).
Code availability
The code that support the findings of this study are openly available on the open science framework at https://osf.io/hw5rd/ (10.17605/OSF.IO/HW5RD).
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Change history
12/16/2025
A Correction to this paper has been published: 10.1038/s41540-025-00634-7
Supplementary information
The online version contains supplementary material available at 10.1038/s41540-025-00520-2.
References
- 1.Hopfield, J. J. Neural networks and physical systems with emergent collective computational abilities. Proc. Natl Acad. Sci. USA79, 2554–2558 (1982). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Deisboeck, T. & Kresh, J. Y. Complex Systems Science in Biomedicine (Springer Science & Business Media, 2007).
- 3.Murray, J. D., Jaramillo, J. & Wang, X.-J. Working memory and decision-making in a frontoparietal circuit model. J. Neurosci.37, 12167–12186 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Fadok, J. P. et al. A competitive inhibitory circuit for selection of active and passive fear responses. Nature542, 96–100 (2017). [DOI] [PubMed] [Google Scholar]
- 5.Yoshimura, Y. & Callaway, E. M. Fine-scale specificity of cortical networks depends on inhibitory cell type and connectivity. Nat. Neurosci.8, 1552–1559 (2005). [DOI] [PubMed] [Google Scholar]
- 6.Lin, Z., Nie, C., Zhang, Y., Chen, Y. & Yang, T. Evidence accumulation for value computation in the prefrontal cortex during decision making. Proc. Natl Acad. Sci. USA117, 30728–30737 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Carruthers, P. Evolution of working memory. Proc. Natl Acad. Sci. USA110, 10371–10378 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Morelli, M., Casagrande, M. & Forte, G. Decision making: a theoretical review. Integr. Psychol. Behav. Sci.56, 609–629 (2022). [DOI] [PubMed] [Google Scholar]
- 9.Xing, Y., Zan, C. & Liu, L. Recent advances in understanding neuronal diversity and neural circuit complexity across different brain regions using single-cell sequencing. Front. Neural Circuits17, 1007755 (2023). [DOI] [PMC free article] [PubMed]
- 10.Wong, H. H.-W., Chou, C. Y. C., Watt, A. J. & Sjöström, P. J. Comparing mouse and human brains. eLife12, e90017 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Luo, L. Architectures of neuronal circuits. Science373, eabg7285 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Schall, J. D. Neural basis of deciding, choosing and acting. Nat. Rev. Neurosci.2, 33–42 (2001). [DOI] [PubMed] [Google Scholar]
- 13.Shadlen, M. N. & Newsome, W. T. Neural basis of a perceptual decision in the parietal cortex (area LIP) of the rhesus monkey. J. Neurophysiol.86, 1916–1936 (2001). [DOI] [PubMed] [Google Scholar]
- 14.Gold, J. I. & Shadlen, M. N. The neural basis of decision making. Annu. Rev. Neurosci.30, 535–574 (2007). [DOI] [PubMed] [Google Scholar]
- 15.Wong, K.-F. & Wang, X.-J. A recurrent network mechanism of time integration in perceptual decisions. J. Neurosci.26, 1314–1328 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Amit, D. J. & Brunel, N. Model of global spontaneous activity and local structured activity during delay periods in the cerebral cortex. Cereb. Cortex7, 237–252 (1997). [DOI] [PubMed] [Google Scholar]
- 17.Wang, X.-J. Synaptic reverberation underlying mnemonic persistent activity. Trends Neurosci.24, 455–463 (2001). [DOI] [PubMed] [Google Scholar]
- 18.Ratcliff, R. A theory of memory retrieval. Psychol. Rev.85, 59–108 (1978). [Google Scholar]
- 19.Ratcliff, R. & Rouder, J. N. Modeling response times for two-choice decisions. Psychol. Sci.9, 347–356 (1998). [Google Scholar]
- 20.Wang, X. J. in Encyclopedia of Neuroscience (Elsevier Ltd., 2009).
- 21.Yan, H. & Wang, J. Non-equilibrium landscape and flux reveal the stability-flexibility-energy tradeoff in working memory. PLoS Comput. Biol.16, e1008209 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Machens, C. K., Romo, R. & Brody, C. D. Flexible control of mutual inhibition: a neural model of two-interval discrimination. Science307, 1121–1124 (2005). [DOI] [PubMed] [Google Scholar]
- 23.Wang, X.-J. Probabilistic decision making by slow reverberation in cortical circuits. Neuron36, 955–968 (2002). [DOI] [PubMed] [Google Scholar]
- 24.Najafi, F. et al. Excitatory and inhibitory subnetworks are equally selective during decision-making and emerge simultaneously during learning. Neuron105, 165–179.e8 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Roitman, J. D. & Shadlen, M. N. Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task. J. Neurosci.22, 9475–9489 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hopfield, J. J. & Tank, D. W. Computing with neural circuits: a model. Science233, 625–633 (1986). [DOI] [PubMed] [Google Scholar]
- 27.Wolynes, P. G., Onuchic, J. N. & Thirumalai, D. Navigating the folding routes. Science267, 1619–1620 (1995). [DOI] [PubMed] [Google Scholar]
- 28.Yan, H. et al. Nonequilibrium landscape theory of neural networks. Proc. Natl Acad. Sci. USA110, E4185–E4194 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Wang, J. Landscape and flux theory of non-equilibrium dynamical systems with application to biology. Adv. Phys.64, 1–137 (2015). [Google Scholar]
- 30.Eisenberg, E. & Hill, T. L. Muscle contraction and free energy transduction in biological systems. Science227, 999–1006 (1985). [DOI] [PubMed] [Google Scholar]
- 31.Yan, H., Zhang, K. & Wang, J. Physical mechanism of mind changes and tradeoffs among speed, accuracy, and energy cost in brain decision making: landscape, flux, and path perspectives*. Chin. Phys. B25, 078702 (2016). [Google Scholar]
- 32.Xu, L., Shi, H., Feng, H. & Wang, J. The energy pump and the origin of the non-equilibrium flux of the dynamical systems and the networks. J. Chem. Phys.136, 165102 (2012). [DOI] [PubMed] [Google Scholar]
- 33.Lan, G., Sartori, P., Neumann, S., Sourjik, V. & Tu, Y. The energy–speed–accuracy trade-off in sensory adaptation. Nat. Phys.8, 422–428 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Cao, Y., Wang, H., Ouyang, Q. & Tu, Y. The free-energy cost of accurate biochemical oscillations. Nat. Phys.11, 772–778 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Brunel, N. & Wang, X.-J. Effects of neuromodulation in a cortical network model of object working memory dominated by recurrent inhibition. J. Comput. Neurosci.11, 63–85 (2001). [DOI] [PubMed] [Google Scholar]
- 36.Duncan, J. The multiple-demand (MD) system of the primate brain: mental programs for intelligent behaviour. Trends Cogn. Sci.14, 172–179 (2010). [DOI] [PubMed] [Google Scholar]
- 37.Mitchell, D. J. et al. A putative multiple-demand system in the macaque brain. J. Neurosci.36, 8574–8585 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Domenech, P., Redouté, J., Koechlin, E. & Dreher, J.-C. The neuro-computational architecture of value-based selection in the human brain. Cereb. Cortex28, 585–601 (2018). [DOI] [PubMed] [Google Scholar]
- 39.Newsome, W. T., Britten, K. H. & Movshon, J. A. Neuronal correlates of a perceptual decision. Nature341, 52–54 (1989). [DOI] [PubMed] [Google Scholar]
- 40.Shadlen, M. N. & Newsome, W. T. Motion perception: seeing and deciding. Proc. Natl Acad. Sci. USA93, 628–633 (1996). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Resulaj, A., Kiani, R., Wolpert, D. M. & Shadlen, M. N. Changes of mind in decision-making. Nature461, 263–266 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Stine, G. M., Zylberberg, A., Ditterich, J. & Shadlen, M. N. Differentiating between integration and non-integration strategies in perceptual decision making. eLife9, e55365 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Lo, C.-C., Wang, C.-T. & Wang, X.-J. Speed-accuracy tradeoff by a control signal with balanced excitation and inhibition. J. Neurophysiol.114, 650–661 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Ge, H. & Qian, H. Dissipation, generalized free energy, and a self-consistent nonequilibrium thermodynamics of chemically driven open subsystems. Phys. Rev. E87, 062125 (2013). [DOI] [PubMed] [Google Scholar]
- 45.Yan, H., Li, B. & Wang, J. Non-equilibrium landscape and flux reveal how the central amygdala circuit gates passive and active defensive responses. J. R. Soc. Interface16, 20180756 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Durstewitz, D., Seamans, J. K. & Sejnowski, T. J. Neurocomputational models of working memory. Nat. Neurosci.3, 1184–1191 (2000). [DOI] [PubMed] [Google Scholar]
- 47.Richman, E. B., Ticea, N., Allen, W. E., Deisseroth, K. & Luo, L. Neural landscape diffusion resolves conflicts between needs across time. Nature623, 571–579 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Usher, M. & McClelland, J. L. The time course of perceptual choice: The leaky, competing accumulator model. Psychol. Rev.108, 550–592 (2001). [DOI] [PubMed] [Google Scholar]
- 49.Finkelstein, A. et al. Attractor dynamics gate cortical information flow during decision-making. Nat. Neurosci.24, 843–850 (2021). [DOI] [PubMed] [Google Scholar]
- 50.Cohen, J. D. et al. Temporal dynamics of brain activation during a working memory task. Nature386, 604–608 (1997). [DOI] [PubMed] [Google Scholar]
- 51.Pereira, J. & Wang, X.-J. A tradeoff between accuracy and flexibility in a working memory circuit endowed with slow feedback mechanisms. Cereb. Cortex25, 3586–3601 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Roach, J. P., Churchland, A. K. & Engel, T. A. Choice selective inhibition drives stability and competition in decision circuits. Nat. Commun.14, 147 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Deco, G. et al. Resting-state functional connectivity emerges from structurally and dynamically shaped slow linear fluctuations. J. Neurosci.33, 11239–11252 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Ye, L. & Li, C. Quantifying the landscape of decision making from spiking neural networks. Front. Comput. Neurosci.15, 740601 (2021). [DOI] [PMC free article] [PubMed]
- 55.Ye, L., Feng, J. & Li, C. Controlling brain dynamics: landscape and transition path for working memory. PLoS Comput. Biol.19, e1011446 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Mejías, J. F. & Wang, X.-J. Mechanisms of distributed working memory in a large-scale network of macaque neocortex. eLife11, e72136 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Amit, D. J. & Amit, D. J. Modeling Brain Function: The World of Attractor Neural Networks (Cambridge Univ. Press, 1989).
- 58.Wang, J., Xu, L. & Wang, E. Potential landscape and flux framework of nonequilibrium networks: Robustness, dissipation, and coherence of biochemical oscillations. Proc. Natl Acad. Sci. USA105, 12271–12276 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Wang, J., Zhang, K., Xu, L. & Wang, E. Quantifying the Waddington landscape and biological paths for development and differentiation. Proc. Natl Acad. Sci. USA108, 8257–8262 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Yang, D.-P., Zhou, H.-J. & Zhou, C. Co-emergence of multi-scale cortical activities of irregular firing, oscillations and avalanches achieves cost-efficient information capacity. PLoS Comput. Biol.13, e1005384 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Raichle, M. E. The brain’s dark energy. Science314, 1249–1250 (2006). [PubMed] [Google Scholar]
- 62.Engl, E. & Attwell, D. Non-signalling energy use in the brain. J. Physiol.593, 3417–3429 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Ge, H. & Qian, H. Physical origins of entropy production, free energy dissipation, and their mathematical representations. Phys. Rev. E81, 051133 (2010). [DOI] [PubMed] [Google Scholar]
- 64.Yang, X. et al. Physical bioenergetics: energy fluxes, budgets, and constraints in cells. Proc. Natl Acad. Sci. USA118, e2026786118 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Hatano, T. & Sasa, S.-i Steady-state thermodynamics of Langevin systems. Phys. Rev. Lett.86, 3463–3466 (2001). [DOI] [PubMed] [Google Scholar]
- 66.Stern, M., Sompolinsky, H. & Abbott, L. F. Dynamics of random neural networks with bistable units. Phys. Rev. E90, 062710 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Wilson, H. R. & Cowan, J. D. Excitatory and inhibitory interactions in localized populations of model neurons. Biophys. J.12, 1–24 (1972). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Dayan, P. & Abbott, L. F. Theoretical Neuroscience: Computational and Mathematical Modeling of Neural Systems (MIT Press, 2002).
- 69.Mazzucato, L., La Camera, G. & Fontanini, A. Expectation-induced modulation of metastable activity underlies faster coding of sensory stimuli. Nat. Neurosci.22, 787–796 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data that support the findings of this study are openly available on the open science framework at https://osf.io/hw5rd/ (10.17605/OSF.IO/HW5RD).
The code that support the findings of this study are openly available on the open science framework at https://osf.io/hw5rd/ (10.17605/OSF.IO/HW5RD).





