ATTRACTOR-BASED MODELS FOR SEQUENCES AND PATTERN GENERATION IN NEURAL CIRCUITS

Juliana Londono Alvarez

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2024 Oct 14:arXiv:2410.11012v1. [Version 1]

ATTRACTOR-BASED MODELS FOR SEQUENCES AND PATTERN GENERATION IN NEURAL CIRCUITS

Juliana Londono Alvarez ^1,^a

PMCID: PMC11527095 PMID: 39483348

Abstract

Neural circuits in the brain perform a variety of essential functions, including input classification, pattern completion, and the generation of rhythms and oscillations that support processes such as breathing and locomotion [51]. There is also substantial evidence that the brain encodes memories and processes information via sequences of neural activity. In this dissertation, we are focused on the general problem of how neural circuits encode rhythmic activity, as in central pattern generators (CPGs), as well as the encoding of sequences. Traditionally, rhythmic activity and CPGs have been modeled using coupled oscillators. Here we take a different approach, and present models for several different neural functions using threshold-linear networks. Our approach aims to unify attractor-based models (e.g., Hopfield networks) which encode static and dynamic patterns as attractors of the network.

In the first half of this dissertation, we present several attractor-based models. These include: a network that can count the number of external inputs it receives; two models for locomotion, one encoding five different quadruped gaits and another encoding the orientation system of a swimming mollusk; and, finally, a model that connects the fixed point sequences with locomotion attractors to obtain a network that steps through a sequence of dynamic attractors. In the second half of the thesis, we present new theoretical results, some of which have already been published in [59]. There, we established conditions on network architectures to produce sequential attractors. Here we also include several new theorems relating the fixed points of composite networks to those of their component subnetworks, as well as a new architecture for layering networks which produces “fusion” attractors by minimizing interference between the attractors of individual layers.

Chapter 1 | Introduction

Attractor neural networks play an important role in computational neuroscience by providing a rich framework for modeling the dynamic behavior of neural systems and giving insights into how the brain might process information and perform computations [3,43]. Originally devised as models of associative memory, these networks were designed to store static patterns representing discrete memories as attractors. Their ability to simultaneously encode multiple static patterns, represented as fixed points, made them ideal for this purpose. Perhaps the most well-known example is that of Hopfield networks, a foundational example of attractor neural networks [39]. As illustrated in Figure 1.1A, in the classical Hopfield paradigm, memories are stored in the network as several coexistent stable fixed points, each one accessible via distinct input pulses (represented as color-coded pulses in the figure). The state space is partitioned into basins of attraction, and each input will start the trajectory into one of these basins. Coexistence of attractors, even if it’s just multiple stable fixed points, requires nonlinear dynamics.

Figure 1.1. — (A) Classical Hopfield-like paradigm. Multiple stable fixed points (stable attractors) are encoded in the same network, each one accessible via distinct (colored) input pulses. (B) Multiple stable fixed points are encoded in the same network, each one accessible via identical pulses. Since the pulses are all identical, the network is *internally* encoding a sequence of fixed points. Each identical pulse causes one transition in the sequence, as indicated by the corresponding numbers in the pulses and arrows. (C) Multiple *dynamic* attractors encoded in the same network, each one accessible via distinct (colored) input pulses. (D) Internally encoded sequence of dynamic attractors, each step of the sequence accessible via identical inputs.

While Hopfield networks are well-suited for encoding multiple static patterns, some patterns of neural activity are better stored as dynamic attractors. Such is the case for the rhythms and oscillations produced by Central Pattern Generator circuits (CPGs). CPGs are neural circuits that generate rhythmic patterns that control movements like walking, swimming, chewing, and breathing [51]. Unlike static patterns (such as images), these rhythmic processes require attractors whose neurons fire sequentially, meaning that neurons take turns to fire. Furthermore, a single CPG network should potentially be able to encode multiple such patterns. Certainly, animals have several locomotive gaits, all of which activate the same limbs [4,34]. How can all these different overlapping dynamic patterns be produced by the same network? Modeling this using attractors requires multi-stability of dynamic attractors, which is known to be a difficult problem [24,63].

Traditional models of locomotion and other CPGs have tackled these challenges by using coupled oscillators [12,21,31,41,73,74]. In these models, the parameters are typically adjusted depending on the desired pattern, effectively altering the dynamical system in use. For various reasons, this approach presents some challenges. For instance, while there is evidence of pacemaker neurons, not all neurons are intrinsic oscillators [62]. Additionally, assuming that synaptic strengths change every time we transition between different locomotive gaits necessitates additional ingredients for the model, such as synaptic plasticity. Despite this, coupled oscillator models have remained popular for important reasons. One of the reasons they are so widely used is the availability of theoretical results coming from physics and mathematics [5]. Indeed, many CPG models stemmed from the physics and mathematics communities [12,31,73,74].

The availability of theoretical tools is indeed a very compelling argument for using coupled oscillators to model central pattern generators. Here we aim to take a different approach that also leverages recent theoretical advances, but within the world of attractor neural networks. The framework of attractor neural networks differs from traditional coupled oscillator models in two key ways. First, neurons do not intrinsically oscillate; instead, patterns of activity emerge as a result of connectivity. Second, these patterns are true attractors of the system, making them more robust and stable to noise and other perturbations. Additionally, providing models within this framework would unify our approach to CPGs with other classical models, like the aforementioned associative memory models, ring attractors, etc [3,43]. On the biological side, it has also been suggested that cortical circuits involved in associative memory encoding and retrieval have many features in common with CPGs [79].

To accomplish this, we propose the use of Threshold-Linear Networks (TLNs). TLNs are recurrent neural networks with simple non oscillating units and piecewise linear activation [38]. This makes them focus on the role of connectivity in emergent behaviors. TLNs have a rich history as neural models [7,35,46,67,70], and more importantly, they are supported by a wide array of theoretical results [13,14,37,53,77], many of them recent [15,16,54,59].

One such key finding is that threshold-linear networks with symmetric connectivity matrices can only have stable fixed point attractors (static patterns) [36,39], which is why here we use non-symmetric TLNs, introduced in Chapter 2. These are known to give rise to a rich variety of non-linear dynamics including multi-stability, limit cycles, chaos and quasi-periodicity. Therefore, we expect it to be possible to simultaneously encode multiple dynamic patterns of activity, whereas in classical Hopfield networks the stored patterns are all static. Within this unified framework, our aim is to provide attractor models for three broad neural functions, as summarized in Figure 1.1, and detailed below:

First, we propose a simple neural integrator model that is both robust to noise and can count inputs, using fixed point attractors. Neural integration refers to the process in which information from various sources is combined to create an output. For instance, counting the number of left and right cues to make a decision is quite literally integration (summation) in the mathematical sense. Here, we propose to model a discrete counter as a sequence of static attractors, as shown in Figure 1.1B. While this concept is akin to the classical Hopfield model (Figure 1.1A), our aim is to internally encode the sequence of fixed points, meaning that the input pulses are all identical and contain no information about which fixed point comes next in the sequence (i.e. they are just like pushing a single input button).

Second, we aim to devise a (small) network that has attractors corresponding to 4–5 distinct, but overlapping, quadruped gaits. The goal is for the attractors to coexist in the same network so that they can be accessed by different initial conditions, and without changing parameters. This requires the presence of several coexistent dynamic attractors, as pictured in Figure 1.1C, arising as distinct limit cycles in state space. Recall that the simultaneous encoding of multiple dynamic attractors (non fixed point attractors) is a network is challenging, especially when the attractors have overlapping units. While classic models circumvent this challenge by adjusting synaptic weights, we aim to obtain a single fixed network, which in turn means a simpler model with fewer control parameters.

In addition to quadruped gaits, we will also model “Clione’s hunting system ” [58], which is a different CPG example, using the same framework. Previous models for it have also used intrinsically oscillating units and fine-tuned parameters [71]. Here we intend to devise a more robust network for this using attractors to avoid the need for finely-tuned parameters.

Third, we combine the modeling approaches from panels B and C to devise a network that can step through a set of dynamic attractors sequentially, as in Figure 1.1D. Can different attractors be linked together so that they can be activated in sequence, where the sequence itself is stored within the network? This could be useful for modeling sequences of complex movements, such as a choreographed sequence of dance moves, for instance.

Sequences of sequential attractors.

Note that in Figure 1.1C, we are dealing with dynamic attractors, that are sequential themselves, meaning attractors whose nodes activate in an ordered sequence [59]. This definition does not completely exclude attractors in which there is some synchrony in the activations. So for example, we consider both attractors in Figure 1.2 sequential attractors, even though the attractor on the right has nodes 2 and 3 synchronized (we will later formalize the dynamic prescription of the nodes, for now note the sequential of activations of nodes).

Figure 1.2. — (A) Nodes 1,2,3 are activating in sequence, as seen by the curves to the left of the graph. (B) Node 1 activates, then 2 and 3 simultaneously, then node 4.

These are great to model rhythmic activations like those of CPGs. In contrast, Figure 1.1D deals with sequences of dynamic attractors, which means that different attractors, either static or dynamic, are activated in a specific ordered sequence (e.g., attractor A, then attractor B, then attractor C). With this distinction clear, our ultimate goal is to achieve an internally encoded sequence of sequential attractors.

Internally encoded sequences of sequential attractors are good models for complex sequences of movements, like choreographed dancing. These complex motor behaviors have also been modeled in the past using threshold-linear recurrent networks that choose and learn motor motifs, but whose choice mechanism requires plasticity, and where the sequence’s order is externally encoded [49]. It is not uncommon to think of broader cognitive sequential processes as a recombination of several pre-stored patterns, which offers an efficient alternative to re-encoding patterns with each occurrence. Studies in this vein include [65], where they utilize a combination of discrete metastable states, leveraging winnerless competition of oscillator neurons. Similar mechanisms have been explored using boolean and spiking networks [69,72].

Desired properties of models.

As it turns out, the versatility of threshold linear networks will prove ideal for modeling both sequential attractors and sequences of them.

To summarize the discussion above, we want our models to satisfy the following properties:

Neurons are not intrinsic oscillators.
Stored static and dynamic patterns should emerge as attractors of the network, rather than being fine-tuned trajectories. Static patterns should manifest as fixed points, while dynamic patterns should arise from non fixed point attractors, like limit cycles.
A network’s attractors should be accessible via different initial conditions, easily implemented via input pulses that target subsets of neurons.
A sequence of attractors should be accessible via a series of identical inputs pulses, with the sequence itself stored within the network (possibly in a separate layer, as observed in some biological brains).
The models should be mathematically tractable–that is, simple enough to be analyzed mathematically.

These properties will distinguish our models from previous coupled oscillator models and position them within the framework of attractor neural networks. We begin by adhering to the last point above, by choosing a framework that fits into the attractor neural network paradigm and also provides mathematically tractable models. With tractability also come great simplifications. Although TLNs are inspired by networks of biological neurons, real neurons and their interactions are of course far more complex than TLNs paint them to be. TLNs remain useful however because they capture two fundamental pieces of biological networks: connectivity and threshold-activation. This is why here we also focus on another simplification of TLNs, known as Combinatorial Threshold-Linear Networks (CTLNs) [15,53,54]. CTLNs are a special family of TLNs, whose connectivity matrix is defined by a simple directed graph (giving rise to binary connections), as in Figure 1.2. Their added simplicity can be used to gain further theoretical results.

Summary of models.

The table below lists all the models included in the dissertation. Each row defines a single model/network, and each is an example of the attractor behaviors described by Figure 1.1:

Model	Network function	Type of attractor	Chapter
Model 1a	counter	sequence of static attractors	3
Model 1b	signed counter	sequence of static attractors	3
Model 1c	dynamic attractor chain	sequence of identical dynamic attractors	3
Model 2a	quadruped gaits	coexisting distinct dynamic attractors	4
Model 3a	molluskan swimming	coexisting identical dynamic attractors	4
Model 2b	sequential control of quadruped gaits	sequence of distinct dynamic attractors	5
Model 3b	sequential control of molluskan swimming	sequence of identical dynamic attractors	5

Open in a new tab

In Chapter 3, we introduce three models for sequences of attractors. Models 1a and 1b are two counter networks that step through sequences of fixed points. Models 1a is shown in Figure 1.3A, where we can see that identical pulses move the network into the next stable fixed point, where it stays until it receives another pulse. This network serves as robust discrete neural integrators of inputs, as we show in their chapter by doing a thorough robustness analysis. Additionally, we extend our work to dynamic attractors by presenting an additional network, Model 1c, capable of encoding a sequence of dynamic attractors. These dynamic attractors are all qualitatively identical, and because of this, there are some symmetries in the basins of attractors, and thus all attractors easily accessible via distinct input pulses. However, to effectively model CPGs, we require different types of patterns to coexist simultaneously. How can we achieve this?

Figure 1.3. — (A) Model 1a: counter network from Chapter 3. Pulses are all identical (in black). (B) Model 2a: quadruped gaits network from Chapter 4. Pulses are attractor-specific (colored according to stimulated node).

That is the content of Chapter 4, where we model two different CPGs, Model 2a and 3a, which require sequential activation of neurons. Model 2a, developed in in Section 4.2, is pictured in Figure 1.3B. It consists of a network encoding five different quadruped gaits as coexistent limit cycles in a 24 unit network. There, we see that all gaits coexist and are accessible via gait-specific pulses. We do a thorough analysis of its dynamics via the set of fixed point supports, and the effect of parameters in modulating gait characteristics. Model 3a, in Section 4.3, consists of a network encoding swimming orientation of a marine mollusk (Clione). Since the attractors are all identical, we manage to prove symmetry of its basins of attraction in Theorem 12.

Finally, in Chapter 5, we merge the concepts introduced in Chapters 3 and 4 to achieve sequences of sequential attractors. From Chapter 3 we get the counter network that will encode the sequence transitions, and from Chapter 4 we get the coexistent sequential attractors. From this integration we obtain Models 2b and 3b: a network for the sequential control of quadruped locomotion and for the sequential control directing swimming movements in Clione. The latter is the one pictured in Figure 1.4, where we observe the attractors from the CPG network “fuse” with the attractors of the counter network, as both are simultaneously active, and look qualitatively like they did when isolated. Figure 1.4 shows the resulting network using Clione’s model, but it can also be done, analogously, with the five-gait network. The fact that we could use the exact same construction with two different networks, led us to believe this is an even more general phenomenon, arising from some structural constraints on these networks, ad they were indeed built with similar principles.

Code to reproduce the plots in Figures 1.3 and 1.4, and also all models listed in the table, can be found at https://github.com/juliana-londono/phd-thesis-basic-plots.

Note that in Figure 1.4, we see a “blend” of two different attractors: at the top of they greyscale we see the dynamic attractors coming from layer L3, and at the bottom we see fixed points coming from layer L1. This phenomenon was also observed in [61], where it was called fusion attractors. Fusion attractors offer a clean solution for managing sequences of static and dynamic attractors. Understanding the mechanisms behind this phenomenon motivates us to further explore the underlying structural constraints giving rise to it, from a theoretical standpoint. This is why we now transition from models to theory.

New network theory.

We want to note that all the models we have developed thus far were built within the TLN framework, for which there are plenty of well-established theoretical results. This theoretical foundation made the process of building these models a lot easier. However, our models have now surpassed the available theory and so now they serve as sources of inspiration for the development of new theoretical results. This is why in the second part of the dissertation, we take a reverse approach: while theory initially guided our modeling efforts, now the models are leading the development of new theoretical results.

Chapter 6 presents original theoretical contributions, including several results recently published in the paper I co-authored: "Sequential Attractors in Combinatorial Threshold-Linear Networks" [59]. This chapter is divided in three parts. First, in Section 6.1, we establish some necessary technical results, some of which are earlier version of results that we end up generalizing in this chapter. Then, in Section 6.2, we derive new structural theorems for CTLNs supporting sequential attractors. All of the results within this section are my contribution to [59], which contains several other architectures that support sequential attractors. All theorems I proved are in bold. Most of these results relate the fixed point supports of a network to the fixed point supports of component subnetworks, as follows:

Theorem 21 for “simply-embedded partitions”, constrains the possible fixed point supports of a network to unions of fixed points chosen from a particular menu of component subnetwork fixed point supports. This is generalizing results from [15]. In the same section, Theorem 23 and Corollary 24 give conditions on when a node can be removed from a network without changing the set of fixed points supports. We include here a new result on removable nodes, that has not been published: Theorem 25.
Theorem 28 for “simple linear chains”, showing that the set of fixed point supports of a simple linear chain network is closed under unions of “surviving” component fixed point supports.
Theorem 31 for “strongly simply-embedded partitions”, showing that the set of fixed point supports of a network can be fully determined from knowledge of the component fixed point supports together with knowledge of which of those component fixed points “survive” in the full network.

Finally, in Section 6.3, we extend some of these theorems to TLNs and provide theoretical explanations for the fusion attractors observed in Chapter 5, culminating with:

Theorem 40, which is an important technical result generalizing previous theorems on certain determinant factorizations that control the set of fixed point supports of a network. It relies on a new determinant factorization lemma, Lemma 38, which I have also proven. Theorem 40 is then used as a crucial ingredient in the proofs of: Theorem 42, generalizing Theorem 21 above. Theorem 44, explaining how the fixed points of some special networks are formed from fixed point of smaller component networks. That theorem generalizes both Theorem 17 (from two components to N components) and Theorem 31 (from several CTLN components to several TLN components). And finally, we present a similar result but for “nested” component fixed point supports in Theorem 45.

We also show that the networks in Chapter 5 satisfy these conditions, thus explaining the fusion attractors observed there.

In this dissertation’s final chapter, Chapter 7, we present partial and further theoretical results derived from projects in sections 4.2 and 6.3. In Section 7.1, Lemma 48, gives conditions under which the same attractor can arise from two different networks. This phenomenon is known as degeneracy. All code from this section is available online at https://github.com/juliana-londono/TLN-attractor-interpolation. In Section 7.2, Lemma 55, gives a new way to think about certain Cramer’s determinants, which are at the core of the dynamics of TLNs.

The rest of this dissertation is organized as follows: Chapter 2 introduces the framework, including firing rate models, attractor neural networks, TLNs, and CTLNs. Chapter 3 provides models for sequences of static and dynamic attractors, both internally and externally encoded. In Chapter 4, we provide two CPG models of locomotion, each consisting of several coexistent dynamic attractors, easily accessible via initial conditions or inputs. Chapter 6 explores new architectures and theoretical results, focusing on sequential dynamics complex and networks made up of simpler subnetworks. Finally, Chapter 7 discusses further theoretical results derived from the presented projects, suggesting avenues for further exploration. Appendix A contains the matrices and parameters used to construct all the models, along with some technical calculations of the fixed points of the five-gait quadruped network. That’s all. We hope you enjoy reading this dissertation. We encourage you to keep in mind Figure 1.1, which is the road map guiding us through the chapters on models.

Chapter 2 | Review of relevant background

This section offers a broad overview of the mathematical and historical background necessary for understanding and contextualizing the subject at hand. More detailed technical background specific to each chapter is provided later, as needed. None of the results in this chapter are original work of my own. Thus, the proofs of these results are not included here and can be found in their original publications, as cited.

2.1. Firing rate models and attractors

In this dissertation, we deal with the dynamics of recurrent neural networks. A recurrent neural network consists of a directed graph along with a prescription of the nodes’ dynamics. The nodes are thought of as neurons and edges represent synapses between them. One way to interpret the dynamics of the nodes is as firing rates, which indicate the average frequency at which the neuron generates action potentials (or “fires”) [19]. The dynamics of a firing rate network model can be described by a system of $n$ coupled differential equations:

τ_{i} \frac{d x_{i}}{d t} = - x_{i} + φ (\sum_{j = 1}^{n} W_{i j} x_{j} + b_{i}), i = 1, \dots, n,

(2.1)

where $x_{i} = x_{i} (t)$ represents the firing rate of neuron $i$ , $W = [W_{i j}]$ is a matrix prescribing the interaction strengths between neurons, and $b_{i} = b_{i} (t)$ is some external input to neuron $i$ (which might vary in time), for $n$ recurrently connected neurons. In Equation 2.1, $τ_{i}$ is referred to as the time constant, representing the rate of decay when there is no input to neuron $i$ , and $φ$ is an activation function (e.g. sigmoid, ReLU).

Also, Equation 2.1 indicates that we are viewing recurrent neural networks here as dynamical systems, contrasting with the typical machine learning perspective that treats neural networks as learning algorithms or black-box function approximators.

While firing rate models are clearly a simplification of biological neural dynamics, they allow us to focus on factors such as the interactions between neurons, activation functions, and external inputs, and help bridge the gap between detailed spiking neuron models and large-scale network behavior. This provides a valuable framework for understanding the functional roles of these factors in neural computations.

Indeed, firing rate models of recurrent neural networks are a popular tool for studying nonlinear dynamics in neuroscience [19,20,23,67], particularly within the context of attractor neural networks. Attractor neural networks have emerged as a framework for studying neural dynamics, covering various cognitive processes like memory recall, decision-making, and perception [3,43]. Understanding how these networks compute is useful for advancing our understanding of how biological networks compute.

Under this framework, attractors of the system are thought of as representations of some cognitive process or pattern encoded in the system. Hopfield networks are a classical example [39]. In these networks, fixed points of the network are interpreted as encoded memories. In Hopfield networks, units can only be in one of two states, and the dynamics are discrete. Importantly, if the connections between neurons are symmetric, then the network is guaranteed to converge to a stable fixed point, which happens to be the minimum of an energy function, pictured as an energy landscape in Figure 2.1A [39,40].

Figure 2.1. — (A) Minima of energy landscape for symmetric network. (B) Diversity of attractors of non-symmetric networks. Adapted from [17].

Such theoretical results are not only available for Hopfield networks, but also more broadly for certain continuous-time recurrent neural networks, which are the main subject of this dissertation: threshold-linear networks (TLNs). In the case of TLNs, $φ (y) = \max (0, y) = {[y]}_{+}$ in Equations 2.1 (a.k.a. ReLU activation function, Fig. 2.2). TLNs were introduced around the 50s [38], and have been widely used in computational and mathematical neuroscience [7,35,46,67,70] since.

Figure 2.2. — A recurrent neural network, and the ReLU activation function.

A crucial aspect of TLNs is their piecewise linear activation function, which greatly simplifies mathematical analysis. This mathematical tractability has given rise to numerous theoretical results. Notably, as with Hopfield networks, symmetric TLNs were shown to converge to stable fixed points under some extra restrictions on $W$ (copositivity) [36]. Furthermore, there exist constraints on which sets of neurons can be co-active at a stable steady state (forbidden and permitted sets in [36]). These results have been further explored in subsequent studies [13,16].

We are however interested in a broader set of attractors, not only stable fixed points. That is why we focus here in non-symmetric inhibition-dominated TLNs, for which W is non-symmetric and non-positive. This choice is motivated by the fact that inhibition fosters competition between neurons, and thus neurons tend to alternate in reaching peak activity levels, leading to sequential behaviors within limit cycles.

Indeed, these networks, even though still piecewise linear, nonetheless permit rich non-linear dynamics like multi-stability, limit cycles, chaos and quasi-periodicity to arise and, moreover, coexist [53,54,61], as in Figure 2.1B. This, along with extra simplifying assumptions (particularly on $W$ ), has given rise to a robust body of theoretical work characterizing the dynamics of TLNs [13–16,37,53,54,59,77].

In particular, a main focus of this dissertation is on a special type of non-symmetric TLNs, called combinatorial threshold-linear networks (CTLNs), introduced in [54]. CTLNs are TLNs where the connectivity matrix $W$ is prescribed by a simple directed graph, and often the input is assumed to be uniform across neurons.

Below, we formally introduce TLNs and CTLNs, and present some results that we will use throughout the dissertation. The presentation of these results is adapted from [15,59], with slight modifications. Since CTLNs are a special family of TLNs, they inherit many properties from TLNs. Hence, we first provide an overview of the necessary background shared with TLNs, and then state results specific to CTLNs.

2.2. Threshold-linear networks (TLNs)

A threshold-linear network (TLN) is a continuous-time recurrent neural network (Eqns. 2.1) where $φ = \max (0, y) = {[y]}_{+}$ . In addition, we assume for the time being that the input is constant in time $b_{i} (t) = b_{i}$ and the timescales are constant and uniform across neurons (without loss of generality, we assume $τ_{i} = 1$ ). More precisely:

Definition 1.

A threshold linear network on $n$ neurons is a system of ordinary differential equations

\frac{d x_{i}}{d t} = - x_{i} + {[\sum_{j = 1}^{n} W_{i j} x_{j} + b_{i}]}_{+}, i = 1, \dots, n,

(2.2)

where $W_{i j}, b_{i} \in ℝ$ for all $i$ , $j = 1, \dots, n$ . This can also be written in vector form as

\frac{d x}{d t} = - x + {[W x + b]}_{+},

where $W$ is an $n \times n$ real matrix and $b \in ℝ^{n}$ .

Under these circumstances, a given TLN is completely determined by the choice of the connectivity matrix $W$ and its vector of inputs $b$ , so we denote it by $(W, b)$ . Unless otherwise specified, $n$ is the total number of neurons in the network. As mentioned above, here we only consider inhibition-dominated TLNs (a.k.a. competitive). This means we assume $W_{i j} \leq 0$ for all $i$ , $j = 1, \dots, n$ . In addition, we pose some extra restrictions on degeneracies. The following definition requires some Cramer’s determinants of some sub-matrices to not vanish, we denote by $({(A_{σ})}_{i}; b_{σ})$ the matrix the matrix $A_{σ}$ where the column corresponding to the index $i \in σ$ has been replaced by $b_{σ}$ , where the subindex denotes restriction to the rows/columns given by $σ$ . In the definition below, and in all that follows, $[n]$ denotes the set of indices ${1, \dots, n}$ .

Definition 2 ([15, Definitions 1 and 2]).

We say that a TLN $(W, b)$ is competitive if $W_{i j} \leq 0$ , $W_{i i} = 0$ and $b_{i} \geq 0$ for all $i, j \in [n]$ . We say that it is non-degenerate if

$b_{i} > 0$ for at least one $i \in [n]$
$det (I - W_{σ}) \neq 0$ for each $σ \subseteq [n]$ , and
for each $σ \subseteq [n]$ such that $b_{i} > 0$ for all $i \in σ$ , the corresponding Cramer’s determinant is nonzero: $det ({(I - W_{σ})}_{i}; b_{σ}) \neq 0$ .

Unless otherwise noted, all TLNs here are competitive and non-degenerate.

A good point of entry to explore the dynamics of a competitive TLN are its fixed points: a fixed point $x^{*}$ of a TLN is a solution that satisfies $d x_{i} / {d t |}_{x = x^{*}} = 0$ for each $i \in [n]$ . Per equations 2.2, this translates into

x_{i}^{*} = {[\sum_{j = 1}^{n} W_{i j} x_{j}^{*} + b_{i}]}_{+} for all i = 1, \dots, n .

By the defintiion of the ${[\cdot]}_{+}$ function, when $y_{i} = \sum_{j = 1}^{n} W_{i j} x_{j}^{*} + b_{i} \leq 0$ , $y_{i}$ will evaluate to 0. Thus, these $y_{i}$ define hyperplanes that divide state space into chambers, and inside each of those chambers, Equations 2.2 define a linear system of ODEs. Under the non-degeneracy assumption, each of those can have exactly one fixed point, though its fixed point might not lie inside the correct chamber.

In Figure 2.3, for example, we have marked the four fixed points associated to the 4 linear systems with corresponding colors, but only the fixed point in chamber {2} (the pink one) is in its correct chamber. This makes it the only true fixed point of the TLN pictured there. Since there is at most one fixed point per chamber, we can label all the fixed points of a network by their support supp $x^{*} = {i ∣ x_{i}^{*} > 0}$ . We gather the supports of all fixed points into a set

FP (W, b) \overset{def}{=} {σ \subseteq [n] ∣ σ is a fixed point support of (W, b)} .

(2.3)

Figure 2.3. — 2-dimensional state space is divided in 2² chambers by the hyperplanes $y_{i} = \sum_{j = 1}^{n} W_{i j} x_{j}^{*} + b_{i} = 0$ , inside each the dynamics are truly linear. The fixed points of the linear systems are color-coded by chamber.

We have experimentally observed that often $FP (W, b) \neq [n]$ , although no precise number for that “often” are provided. Thus, this set is often non trivial and so it already contains quite a bit of information about the dynamics of $(W, b)$ , as we will see later. We refer to this set many times throughout this dissertation, so it is good to keep it in mind. Finally, we give an important result about belonging to $FP (W, b)$ .

Corollary 3 ([15]).

Let $(W, b)$ be a TLN on n neurons, and let $σ \subseteq [n]$ . The following are equivalent:

$σ \in FP (W, b)$
$σ \in FP ({W |}_{τ}, {b |}_{τ}) for
all σ \subseteq τ \subseteq [n]$
$σ \in F P ({W |}_{σ}, {b |}_{σ}) and σ \in FP ({W |}_{σ \cup k}, {b |}_{σ \cup k}) for
all k \notin σ$
$σ \in FP ({W |}_{σ \cup k}, {b |}_{σ \cup k}) for
all k \notin σ$

This concludes the necessary background on TLNs. Next, we formally introduce CTLNs and review some results particular to them.

2.3. Combinatorial threshold-linear networks (CTLNs)

A big part of this dissertation focuses on a special family of TLNs, called combinatorial threshold-linear networks (CTLNs). In this case, in addition to the network being a competitive non-degenerate TLN, the connectivity matrix $W$ in Equation 2.2 is prescribed by a simple directed graph G, as shown in Figure 2.4. We are thinking of the graph as retaining only the excitatory neurons and the connections between them, where the inhibitory neurons and their background inhibition are not represented in the graph, but are included in the equations defining the dynamics.

Figure 2.4. — Right: A neural network with excitatory neurons (black) and inhibitory neurons (gray), producing global inhibition. Left: Only the excitatory connections remain in our graph representation of the network. From [17].

More precisely, if we denote $j ↛ i$ in $G$ by $j \to i$ not in $G$ , we have the following definition

Definition 4.

A combinatorial threshold-linear network is a threshold-linear network $(W, b)$ whose connectivity matrix $W$ is prescribed by a simple directed graph $G$ as

W_{i j} = {\begin{array}{l} 0 & if i = j, \\ - 1 + ε & if j \to i in G, \\ - 1 - δ & if j ↛ i in G . \end{array}

(2.4)

where $ε, δ \in ℝ$ , and the input values $b_{i} = θ$ are kept constant across neurons. When $θ > 0$ , $δ > 0$ , and $0 < ε < \frac{δ}{δ + 1}$ , we say that the parameters are in the legal range.

Although this puts us further from reality, this simplification allows us to retain fundamental properties while being able to derive relationships between structure and function more easily. We are assuming input to be constant across neurons to further isolate the role of connectivity in the network.

When $θ > 0$ , $δ > 0$ , and $0 < ε < \frac{δ}{δ + 1}$ , we say that the parameters are in the legal range. These are chosen both to have all inhibitory connections, but also to satisfy some conditions in some proofs. Parameters in this dissertation are always chosen from the legal range, unless otherwise noted. Before we show an example, it is important to clarify that our convention for the adjacency matrix of a graph $G$ is:

A_{i j} = {\begin{matrix} 1 & if j \to i in G, \\ 0 & if j ↛ i in G . \end{matrix}

to match the $W$ connectivity matrix with the $W_{i j}$ conventions.

Figure 2.5 shows an example of a CTLN where the defining graph consists of a 3-cycle. From it, we get the transposed adjacency matrix, and from it we get the connectivity matrix $W$ as defined by Equation 2.4 with parameters $ε = 0.25$ and $ε = 0.25$ . To get the rate curves in the Figure 2.5, we simulate the system of equations 2.2 with $b = θ 𝟙$ . The network activity follows the arrows in the graph. Peak activity occurs sequentially in the cyclic order 123.

The dependence of the dynamics of a CTLN on its defining graph $G$ has given rise to extensive theory relating the dynamics and the combinatorial properties of $G$ through the set of its fixed point supports, which we now denote as $FP (G, ε, δ)$ ¹ or $FP (G)$ , if clear from the context:

FP (G) = FP (G, ε, δ) \overset{def}{=} {σ \subseteq [n] ∣ σ is a fixed point support of W (G, ε, δ)} .

This set contains information about all the fixed points of the network, giving insights into the dynamics of the network through a special subset, as it will be seen next. Indeed, we will see that the fixed points shape the dynamics of the network, whether or not they are stable. In the sections that follow, we show connections between this set and the dynamics of the network, and our main entry point are the the core motifs, introduced in the next section. There are special minimal fixed point supports which have been conjectured to correspond to attractors. Since these have proven to be insightful, we provide rules to find them in Subection 2.3.2. We conclude the chapter by showing how to build in certain core motifs into the network, hoping to see the desired attractors, using constructions like cyclic and clique unions [15].

2.3.1. Core motifs

In prior work [59], it has been conjectured that the dynamic attractors of a network $G$ correspond to supports $σ \in FP (G)$ that are core motifs:

Definition 5 ([17]).

Let $G$ be the graph of a CTLN on $n$ nodes. An induced subgraph ${G |}_{σ}$ is a core motif of the network if $FP ({G |}_{σ}) = {σ}$ .

We refer to both the support of the fixed point $σ$ and the induced subgraph ${G |}_{σ}$ as the core motif. Note that a necessary condition to be a core motif of the network is that $σ$ is minimal in $FP (G)$ by inclusion. Core motifs have been observed to be useful in predicting the dynamics of a network. For example, consider Figure 2.6. The CTLN defined by the graph in panel A has two core motifs in it, 123 and 4. 1234 is not a core motif, since it is not minimal in $FP (G)$ , by inclusion. Panel B shows the results of simulating the CTLN using $θ = 1$ , $ε = 0.25$ , and $δ = 0.5$ under two different initial conditions. In the top, the initial condition is a small perturbation of the fixed point supported on 123. The activity spirals out of the unstable fixed point and converges to a limit cycle where the high-firing neurons are the ones in the fixed point support. In the bottom, the initial condition is a small perturbation of the fixed point supported on 4, which is a stable fixed point. Thus, the activity converges to a static attractor where the high-firing neuron is the one in the fixed point support. In panel C, we see that the 1234 fixed point is fundamentally different: small perturbations of the fixed point supported on 1234 produce solutions that either converge to the limit cycle shown in panel B, or to the stable fixed point. This support therefore does not “correspond” to any attractor, but rather acts as a “tipping point” between two distinct attractors.

Figure 2.6. — (A) Graph of a CTLN and its $FP (G)$ set. (B) Solutions to the CTLN with the graph in panel A using two different initial conditions, which are perturbations of the fixed points supported in 123 (top) and 4 bottom. (C) Fixed points and example trajectories which are perturbations of the fixed point supported in 1234, depicted in a three-dimensional projection of the four-dimensional state space. From [61].

Less informally, we will say that a support $σ$ corresponds to an attractor if initial conditions that are small perturbations from the fixed point lead to solutions that converge to the attractor. Heuristically, the high-firing neurons in the attractor tend to match the support of the fixed point. Core motifs often correspond to attractors [59] and, consequently, understanding the core motifs of a network becomes useful when predicting the dynamics of a given CTLN. We make use of this heuristic often in the dissertation and so we give the set of core motifs of a given network $G$ its own notation:

{FP}_{core} (G) = {FP}_{core} (G, ε, δ) \overset{def}{=} {σ \in FP (G) ∣ σ is a core motif of W (G, ε, δ)} .

An example where the core-attractor correspondence is perfect is for cycle graphs, that is, a graph (or an induced subgraph) where each node has exactly one incoming and one outgoing edge, and they are all connected in a single directed cycle. First, it is a fact that all cycle graphs are core motifs:

Theorem 6.

If ${G |}_{σ}$ is a cycle, then ${G |}_{σ}$ is a core-motif.

Indeed, it was recently proven that a graph that is an 3-cycle has a corresponding attractor [6].

2.3.2. Graph rules

The connection between attractors and the fixed points has motivated an extensive research program where graph rules were developed [15, 18, 59, 61]. These refer to theoretical results directly connecting the structure of $G$ , to the set $FP (G)$ . Several of those results are independent of the choice of $ε, δ$ , and thus useful for engineering robust networks with prescribed attractors. We use of some these in the modeling section, and so we reproduce them below.

The first graph rule, central to our work here, concerns a special type of graph. We say that ${G |}_{σ}$ has uniform in-degree $d$ if every node $i \in σ$ has $d$ incoming edges from within ${G |}_{σ}$ . In that case, we have:

Theorem 7 (uniform in-degree, [15]).

Let $G$ be a graph on n nodes and $σ \subseteq [n]$ . Suppose ${G |}_{σ}$ has uniform in-degree $d$ . For each $k \notin σ$ , let $d_{k} \overset{def}{=} ∣ {i \in σ ∣ i \to k i n G} ∣$ be the number of edges $k$ receives from $σ$ . Then

σ \in FP (G) \Leftrightarrow d_{k} \leq d f o r a l l k \notin σ .

Furthermore, if $| σ | > 1$ and $d < | σ | / 2$ , then the fixed point is unstable. If $d = | σ | - 1$ , then the fixed point is stable.

From this theorem, we easily get the following Rules for families of uniform in-degree graphs that we use in our models:

Cliques are all-to-all connected graphs and therefore uniform in-degree with $d = n - 1$ , where $n$ is the total number of nodes in the clique.

Rule 1 (Cliques, Fig. 2.7A). If ${G |}_{σ}$ is a clique, then $σ$ supports a stable fixed point if and only if for all $k \notin σ, k$ receives at most $n - 1$ edges from ${G |}_{σ}$ . When no external node receives $n$ edges from ${G |}_{σ}$ , we say that ${G |}_{σ}$ is target-free.

Figure 2.7. — Three example families of uniform in-degree graphs, and corresponding Rules.

Cycles are graphs whose vertices are connected in a closed chain and therefore uniform in-degree with $d = 1$ .

Rule 2 (Cycles, Fig. 2.7B). If ${G |}_{σ}$ is a cycle, then $σ$ supports an unstable fixed point if and only if for all $k \notin σ, k$ receives at most one edge from ${G |}_{σ}$ .

A tournament is an orientation of a (undirected) complete graph. A tournament ${G |}_{σ}$ is called cyclic if the set of all automorphisms of ${G |}_{σ}$ contains $(1, 2, \dots, n) \in S_{n}$ . All cyclic tournaments must have an odd number of nodes $n = 2 d + 1$ , and have uniform in-degree $d$ [11].

Rule 3 (Cyclic tournaments, Fig. 2.7C). If ${G |}_{σ}$ is a cyclic tournament, then $σ$ supports an unstable fixed point if and only if for all $k \notin σ, k$ receives at most $d = \frac{n - 1}{2}$ edges from ${G |}_{σ}$ , where $n$ is the number of nodes in $G$ .

All of these graphs are core motifs [11,15] and they indeed have a corresponding attractor, as expected. In the case of cliques, the fixed point is stable. In the case of cycles and cyclic tournaments, their unique fixed point is unstable and the dynamics reveal a corresponding limit cycle attractor.

In addition to providing tools for engineering certain patterns into a network, graph rules can also be used to prove that a given support $σ$ belongs to $FP (G)$ in more general cases. Indeed, in Chapter 6, we use such rules to prove new theoretical results. A key one derives from the concept of graphical domination, introduced in [15] and summarized in Figure 2.8:

Figure 2.8. — Four types of graphical domination from Rule 4. Modified from [17]. Dashed arrows represent optional edges.

Definition 8.

We say that $k$ graphically dominates $j$ with respect to $σ$ if the following three conditions hold:

For each $i \in σ ∖ {k, j}$ , if $i \to j$ then $i \to k$ .
If $j \in σ$ , then $j \to k$ .
If $k \in σ$ , then $k ↛ j$ .

If there is graphical domination within a graph, a lot can be said about its fixed points:

Rule 4 (graphical domination, [17]). Let $G$ be a graph on $n$ nodes, and $σ \subseteq [n]$ . Suppose $k$ graphically dominates $j$ with respect to $σ$ . Then the following statements hold:

(inside-in) If $j, k \in σ$ , then $σ \notin FP ({G |}_{σ})$ , and thus $σ \notin FP (G)$ .
(outside-in) If $j \in σ$ and $k \notin σ$ , then $σ \notin FP ({G |}_{σ \cup {k}})$ , and thus $σ \notin FP (G)$ .
(inside-out) If $k \in σ$ and $j \notin σ$ , then $σ \in FP ({G |}_{σ})$ implies $σ \in FP ({G |}_{σ \cup {j}})$ .
(outside-out) If $j, k \notin σ$ , then $σ \in FP ({G |}_{σ \cup {k}})$ implies $σ \in FP ({G |}_{σ \cup {j}})$ .

A good example of the use of Rule 4 in proving more general results on the $FP (G)$ , is the rule of sources. A source is a node that has in-degree 0. The we have the following result:

Rule 5 (Sources, [15]). Let $G$ be a graph on $n$ nodes and $σ \subseteq [n]$ . If there exists an $ℓ \in [n]$ such that $σ$ contains a proper source in ${G |}_{σ \cup {ℓ}}$ , then $σ \notin FP (G)$ .

Proof. Suppose there exists an $ℓ \in [n]$ such that $k \in σ$ is a proper source in ${G |}_{σ \cup {ℓ}}$ with $k \to ℓ$ . Then $ℓ$ graphically dominates $k$ since $k$ has no inputs in ${G |}_{σ \cup {ℓ}}$ and $k \to ℓ$ . Hence, $σ \notin FP ({G |}_{σ \cup {ℓ}})$ by Theorem 4, and so $σ \notin FP (G)$ by Corollary 3. □

We conclude this section with a cool application of Rule 4, which we use in Subection 6.3.1 to present an alternative proof of one of the results in [59]. The proof of the result below can be found in [60].

Theorem 9.

Let $G$ be a graph on n nodes, and suppose there is $j, k \in [n]$ such that $k$ graphically dominates $j$ with respect to $[n]$ . Then $FP (G) = FP ({G |}_{[n] ∖ {j}})$ .

2.3.3. Cyclic unions and sequential attractors

These graph rules allow us to construct graphs with prescribed fixed point supports $σ$ , and facilitate the analysis of $FP (G)$ in terms of the structure of $G$ . This in turns allows for the derivation of more general structure theorems, supported on simpler building blocks. One of these structures, which does the heavy lifting of pattern generation in Chapter 4, is the cyclic union, pictured in Figure 2.9A:

Figure 2.9. — (A) A cyclic union of $N$ components, and Theorem 11. (B - C) Two examples of a cyclic union, and its sequential activation of the nodes. Nodes whose activations are synchronized appear in parenthesis.

Definition 10.

Given a set of component subgraphs ${G |}_{τ_{1}}, \dots, {G |}_{τ_{N}}$ , on subsets of nodes $τ_{1}, \dots, τ_{N}$ , the cyclic union of is constructed by connecting these subgraphs in a cyclic fashion so that there are edges forward from every node in $τ_{i}$ to every node in $τ_{i + 1}$ (cyclically identifying $τ_{N}$ with $τ_{0}$ ), and there are no other edges between components

Cyclic unions are great for pattern generation because, as it can be seen in Figure 2.9B-C, they give rise to sequential attractors whose order of activation coincides with the overall structure and direction of the cyclic union. Indeed, this is a general fact, that derives from the way in which the $FP (G)$ set is made up for cyclic unions:

Theorem 11 (cyclic unions, theorem 13 in [15]).

Let $G$ be a cyclic union of component subgraphs ${G |}_{τ_{1}}, \dots, {G |}_{τ_{N}}$ . For any $σ \subseteq [n]$ , we have

σ \in FP (G) \Leftrightarrow σ \cap τ_{i} \in FP ({G |}_{τ_{i}}) f o r a l l i \in [N] .

Moreover,

σ \in {FP}_{c o r e} (G) \Leftrightarrow σ \cap τ_{i} \in {FP}_{core} ({G |}_{τ_{i}}) f o r a l l i \in [N] .

This means that the global fixed point supports can be completely understood in terms of the component fixed point supports. Theorem 11 guarantees that every fixed point of $G$ hits every component. In simulations we have observed that this ensures that the every component is active in corresponding attractors. Moreover, neurons are activated in the order of the components in cyclic order.

Indeed, Figure 2.10 shows an example of this. There, $G$ is a cyclic union of four component subgraphs ${G |}_{τ_{i}}, i = 1, 2, 3, 4$ . Thick colored edges from a node to a component indicate that the node send edges out to all the nodes in the receiving component. $FP ({G |}_{τ_{i}})$ can be easily computed using graph rules, and it is shown below the graph in color-coded components. To simplify notation for $FP (G)$ , we denote a subset ${i_{1}, \dots, i_{k}}$ by $i_{1} \dots i_{k}$ . For example, 12 denotes the set {1,2}. $FP (G)$ follows the same color convention and consists of unions of component fixed point supports, exactly one per component. A solution for the corresponding CTLN is pictured. It shows that the attractor visits every component, in the cyclic union order.

There exists generalization of cyclic unions that also give rise to sequential attractors, see [59].

Theorem 11 is thus helpful when engineering networks that must follow a sequential activation, because the order of activation of neurons will match the direction of the cyclic union. This is exactly what we look for in central pattern generators models, which are the main topic of Chapter 4.

2.3.4. Fusion attractors

In the previous subsection, we saw by example that cyclic unions naturally gave rise to sequential attractors. Cyclic unions are one of the three block-structures first introduced in [15], that relate the parts to the whole. The two other structures are the clique union and the disjoint union of component networks.

We review the clique union here because it is special in that it gives rise to “fusion attractors”, introduced in [61]. Clique unions are built by partitioning the vertices of the graph into components $τ_{1}, \dots, τ_{N}$ , and putting all edges between all pairs of components. For example, the fusion 3-cycle from [61], in Figure 2.11, is the clique union of $τ_{1} = {123}$ and $τ_{2} = {4}$ . What we observe in the dynamics is the “fusion” of what seems to be two different attractors: the stable fixed point supported in 4, and the limit cycle supported in 123. That’s why it is called fusion attractor. However clique unions are a bit too restrictive. How can we relax the assumptions on the structure of the network, and still get fusion attractors? This is partly the topic of Chapter 6. Later, we will generalize this concept for layered TLNs to get attractors for sequential control of quadruped gaits and other CPGs.

Figure 2.11. — The fusion 3-cycle from [61].

Chapter 3 | Sequences of attractors

Recall that in Hopfield networks [39], memories are encoded as fixed points. Many memories can be encoded simultaneously, but accessing these fixed points usually requires attractor-specific inputs (equivalently, attractor-specific initial conditions), as illustrated in Figure 3.1A. Thus, we obtain a sequence of fixed points with whose transitions are controlled by external inputs, this means that, in a way, the order of the sequence is encoded in the external pulses, i.e. externally encoded. Would it be possible to have the sequence’s order internally encoded, as in Figure 3.1B, where the transitions happen in response to identical external pulses who do not know what comes next? This could be useful to model highly stereotyped sequences, like songbird songs [50] or as a counting mechanism, like in discrete neural integrators [30].

The first half of this chapter focuses on internally encoded sequences of static attractors, as depicted in Figure 3.1B. The second half will attempt to model the situation in Figures 3.1C,D, where several dynamic attractors coexist and are easily accessed via attractor-specific inputs. Our model will have a catch though, and it is that all attractors are identical to each other. Later, in Chapter 4, we will focus on modeling the situation of Figure 3.1C again, but this time all dynamic attractors are different. Finally, in Chapter 5, we join the models from this chapter and Chapter 4 to present a model/mechanism for the situation of Figure 3.1D, where we have an internally encoded sequence of diverse dynamic attractors.

This is, again, a good place to recall that we are using sequence/sequential in two different contexts. On one hand, we have sequential attractors (Figs. 3.1C,D), which refer to attractors whose nodes fire in sequence e.g. limit cycles. On the other hand, we have sequences of attractors, where the elements of the sequence are not the nodes, but the attractor themselves (Figs. 3.1B,D).

In this section, we leverage theoretical results from [15] to engineer networks with prescribed FP(G) sets. In doing so, we have a strong indication that the network will support the desired attractor. Throughout the chapter, we hope the important role of the theoretical results in facilitating the engineering work will become evident. Indeed, we begin the work by recalling the rules from Chapter 2 that followed from Theorem 7, and that we use in the sections that follow to design our attractors:

Rule 1. If ${G |}_{σ}$ is a clique, then $σ$ supports a stable fixed point if and only if for all $k \notin σ$ , $k$ receives at most $n - 1$ edges from ${G |}_{σ}$ . When no external node receives $n$ edges from ${G |}_{σ}$ , we say that ${G |}_{σ}$ is target-free.

Rule 2. If ${G |}_{σ}$ is a cycle, then $σ$ supports an unstable fixed point if and only if for all $k \notin σ$ , $k$ receives at most one edge from ${G |}_{σ}$ .

Rule 3. If ${G |}_{σ}$ is a cyclic tournament, then $σ$ supports an unstable fixed point if if and only if for all $k \notin σ$ , $k$ receives at most $d$ edges from ${G |}_{σ}$ .

3.1. Sequences of fixed point attractors

In this section we model a discrete counter as a sequence of static attractors, each accessible via identical inputs. Neural integration is an important correlate of processes such as oculomotor and head orientation control, short term memory keeping, decision making, and estimation of time intervals [9,10,30,52,68]. Understanding how neurons integrate information has been a longstanding problem in neuroscience. How exactly do brain circuits generate a persistent output when presented with transient synaptic inputs?

Many of models have been proposed to model neural integrators, but classic models are known to be very fine-tuned, requiring exact values for the parameters in order to achieve perfect integration. By contrast, some robuster models tend to be rather insensitive to weaker inputs, requiring strong inputs to switch between adjacent states [30,68]. The models we propose here represent a very simple alternative to classic discrete neural integrators. Our models are both robust and respond well to a wide variety of input strengths. Importantly, since they are encoding sequences of stable fixed points, they could be also useful to model highly stereotyped sequences, like songbird songs [50].

CTLN counter.

Our initial model is a simple and robust CTLN that can keep a count of the number of input pulses it has received via well separated discrete states, providing a straightforward readout mechanism of the encoded count. We henceforth refer to this network, informally, as a “counter network”. Counting the number of inputs can be achieved with an ordered sequence of stable fixed points, each representing one position in the count. Each transition between fixed points indicates an increase in the count, and so in order to be integrating the external inputs, we want them to cause the fixed point transitions.

First, to encode a set of stable fixed points in a network we can appeal to Rule 1, reproduced again in Figure 3.2A, which says that a clique will yield a stable fixed point if it is target free. Thus, our network must have as many cliques as desired stable fixed points, and we need to embed them in the network in such a way that they will survive as fixed points, that is $σ \in FP ({G |}_{σ})$ but also $σ \in FP (G)$ .

We also need some mechanism to be able to transition between these stable fixed points. Emphasis on stable, as this indicates we will need a strong enough perturbation to leave that steady state. We can potentially achieve this by adding some edges in the graph that will permit the activity to flow from clique to clique in response to input pulses. Graph-wise, these are all the ingredients we need to construct our network.

We get to work and assemble the network by chaining together several target-free 2-cliques, as shown in Figure 3.2C. Each of these cliques is embedded in a target-free way. More specifically, Rule 1 says that the 2-cliques will survive as fixed point supports as long as they do not have more than $d = 2 - 1 = 1$ outgoing edge to any other node in the graph, and this is in fact the case for all of them. This would not be the case if, for example, we were to add the edge 1 → 4 in the network of Figure 3.2C, because then node 4 would be a target of {1,2}.

Note that the top and bottom cycles ({1,3,5,7,9,11}, {2,4,6,8,10,12}) also each support a fixed point of the network by Rule 2 (Fig. 3.2B), since no node receives more than one edge from these cycles either. This cycles will provide the perturbation we are looking for to be able to exit the fixed points we just built-in. We made the cycles wrap around to ensure activity will not stale in the last clique of the chain, and it will continue cycling. Adding six 2-cliques was an arbitrary decision, we could have had more or less.

In this way, we have built a network whose $FP (G)$ , by design, should contain all the cliques ({1,2}, {3,4}, {5,6}, {7,8}, {9,10}, {11,12}) and the bottom and top cycles as core fixed point supports. Computationally, we confirm these inclusions and find, more precisely that $| FP (G) | = 141$ (yes, 141 fixed points, this is not that crazy because there are up to 2¹² linear systems that could potentially lead to a fixed point of the network) and that

{FP}_{core} (G) = {{1, 2}, {3, 4}, {5, 6}, {7, 8}, {9, 10}, {11, 12}, {1, 3, 5, 7, 9, 11}, {2, 4, 6, 8, 10, 12}} .

(3.1)

Based on the core motif and attractor heuristic correspondence, this is indicative that we will have a stable fixed point attractor per clique, and two unstable fixed points, most likely giving rise to dynamic attractors (but also maybe not, as the heuristic is a heuristic and not a theorem or perfect correspondence, only the simulation will tell).

We have designed this network to have 6 stable fixed points, which we computationally checked that indeed it has. Since the $FP (G)$ set does not depend on $θ$ , so far no mention of it has been necessary, and those stable fixed points will be there regardless of the value of $θ$ , as long as it is in the legal range. However, if we aim to transition between stable fixed points, we will need exercise the perturbation somehow. And that’s where hysteresis comes. The perturbation will come in the form of external pulses, or $θ$ -pulses, defined below.

External pulses are communicated the network by transiently changing the value of the external input $b_{i}$ in Equations 2.2, which we recall to be given by

\frac{d x_{i}}{d t} = - x_{i} + {[\sum_{j = 1}^{n} W_{i j} x_{j} + b_{i}]}_{+}, i = 1, \dots, n .

Because we are using CTLNs, $W$ is prescribed by the $G$ graph of Figure 3.3A, using the rule

W_{i j} = {\begin{array}{l} 0 & if i = j, \\ - 1 + ε & if j \to i in G, \\ - 1 - δ & if j ↛ i in G . \end{array}

Now, in the CTLN setup the external input $b_{i} (t)$ would typically constant, but because we want to use pulses, the external input can no longer be constant. Because now we will have a temporary identical perturbation for all the neurons, $b_{i} (t) = θ (t)$ above must be defined as a step function, pictured right at the very top of Figure 3.3B, that also does not depend on i, because we assume the pulses to be identical across neurons.

b_{i} (t) = θ (t) = {\begin{array}{l} θ_{0} for t outside of pulses \\ θ_{1} for pulse times t . \end{array}

We refer to $θ_{0}$ as the baseline and to $θ_{1}$ as the pulse. It is important to note that this varying external pulse will cause our network to no longer be a CTLN (which requires external input to be constant and uniform across neurons), but rather a TLN, or a piecewise CTLN, if you wish. But because we will only need two different $θ$ values, and we designed the network with CTLN principles, and because it really is a piecewise CTLN, we continue to call this network a CTLN.

Now, how does hysteresis arise from the $θ_{1}$ pulse? Figure 3.3 explores how this transient $θ$ change is affecting the dynamics of the network, at three time-stamps. First, note that the nullclines of the system are piecewise linear functions given by

N_{i} : x_{i} = {\begin{array}{l} \sum_{j = 1}^{n} W_{i j} x_{j} + θ_{i} & if \sum_{j = 1}^{n} W_{i j} x_{j} + θ_{i} > 0 \\ 0 & if \sum_{j = 1}^{n} W_{i j} x_{j} + θ_{i} \leq 0 \end{array}

That is, because TLNs are piecewise linear, the nullclines are piecewise linear as well. These are pictured in Figure 3.3C for neurons 1 to 4 of the network from panel A of the same figure, using corresponding colors.

By changing the value of $θ$ , we are actually causing the nullclines of the system to shift in state space. More specifically, looking at Figure 3.3C, we see that at $t_{0}$ we are at the baseline $θ_{0}$ , and the system is at rest in the stable fixed point supported on ${1, 2}$ (cf. panel B). When the pulse arrives, at time $t_{1}$ , the nullclines experience a translation in state space, causing the trajectory to move towards the new fixed point (always at the intersection of the nullclines). When the pulse ceases, at $t_{2}$ , it is too late for the trajectory to go back, it has fallen in the basin of attraction of a different fixed point, the one supported on ${3, 4}$ !

It is important to note that newer, more robust models of neural integration, also make use of hysteresis, but it typically arises from the use of bistable units [29,45,56]. In our case, hysteresis emerged as a property of connectivity, since CTLN units are not bistable by themselves (as they just die off), only when connected to each other, and so the multistability in our model is a property of the network, not of each unit.

What we explained above is exactly what we observe in simulations. We simulated the network using the parameters $ε = 0.25$ , $δ = 0.5$ , a baseline $θ_{0} = 1$ , and an input pulse $θ_{1} = 5$ . However, note that since Theorem 7 does not depend on the values of these parameters, any (C)TLN with dynamics prescribed by Equation 2.2, whose connectivity matrix $W$ is build from the graph of Figure 3.4A according to the binary synapse prescription of Equation 2.4, will have the supports of Eqn 3.1. This is true for any values of $ε$ and $δ$ , provided they are within the legal range.

Figure 3.4. — (A) Counter. Pulses shown in the middle plot are sent to all neurons in the network. Activity slides to the next clique (stable fixed point) after reception of the pulse. Pulse duration is 3 time units, with no refractory period. (B) Signed counter. Black pulses are sent to odd-numbered neurons and red pulses are sent to even-numbered neurons. Activity slides to the right for black pulses, and to the left for red pulses. Pulse duration is 2 time units, with 3 time units of refractory period where $θ = 0.95$ .

The ${FP}_{core} (G)$ predictions about the dynamics are partly confirmed by simulations. We do observe the stable fixed points arise, but not the limit cycle. In any case, the rate curves of Figure 3.4A confirm the counting mechanism works: activity remains in the stable fixed point that the network was initialized to, unless a uniform $θ$ -pulse is sent to all neurons in the network. When the network receives the pulse, activity slides down to the next stable state in the chain. Activity is maintained in this state indefinitely until future pulses are provided to the system. Indeed, the network is effectively counting the number of input pulses it has received via the position of the attractor in the linear chain of attractor states. This network is a very simple alternative to the neural integrator models often used to maintain a count in working memory of some number of input cues. The matrix and parameters needed to reproduce this Figure are available in Appendix A, Equation A.1.

Good! We have successfully encoded a sequence of fixed point attractors, all accessible via identical pulses. But can we adapt this construction to be able to decrease the count when presented with a negative input?

CTLN signed counter.

The architecture that made the transitions possible before, the cycles, can be so slightly modified to allow for the count to be decreased: making the bottom cycle travel in the opposite direction, as seen in Figure 3.4B, allows us to perturb the network with pulses so that the fixed points can also transition backwards in the chain, decreasing the count. This means that the pulses are now signed, and thus we informally refer to this modified network as “signed counter network”. The core motif analysis in this case is the same, as the cliques remain target-free and the two cycles again survive by Rule 2. Indeed, computationally, we found $| FP (G) | = 369$ and

F P_{core} (G) = {{1, 2}, {3, 4}, {5, 6}, {7, 8}, {9, 10}, {11, 12}, {1, 3, 5, 7, 9, 11}, {2, 4, 6, 8, 10, 12}} .

(3.2)

Again the predicted behavior of the network is confirmed by the rate curves of Figure 3.4B. The dynamics of the signed counter are analogous to those of the previous counter, except now pulses are signed, and the count can be decreased. More specifically, neurons are divided into two opposite populations (top and bottom cycles or, equivalently, odd and even nodes), and pulses are sent to either one of them. Note that pulses are now followed by a brief refractory period where $θ = 0.95$ to allow the system to reset. The color of the pulse in Figure 3.4B indicates which group of neurons received the pulse. Black pulses are sent to the top cycle (odd-numbered nodes), and red pulses are sent to the bottom cycle (even-numbered nodes). When the network receives a black pulse, the attractor slides to the right. If it receives a red pulse, the attractor slides to the left. That is, the activity travels in the direction of the cycle that received the pulse. This network can not only keep a count, but also store net displacement, position on a line, or relative number of left and right cues. The matrix and parameters needed to reproduce this Figure are available in Appendix A, Equation A.2.

Both counter models are simple in that they only require twice as many neurons as stable states, and pulses input to the system are given to all neurons on the network. Although both counter networks pictured in Figure 3.4 are able to keep a count modulo 6 (number of chained cliques), this can be generalized to an arbitrary (but finite) number of cliques, allowing to keep a memory of an arbitrary large count. This means that the number of possible states approaches infinity as the number of links in the chain approaches infinity as well, and so the count is effectively continuous. In addition, since the networks are able to keep a modular count of the number of pulses, it could be useful to estimate modular counts like time intervals or angles. Both counters thus constitute a simple alternative to traditional discrete neural integrators, but are they really that much robust?

Robustness of CTLN counters.

In general, CTLNs are expected to perform robustly, since a lot of their dynamic information is contained in ${FP}_{core} (G)$ and we know this set is preserved across the legal range.

To asses the performance of both counters across parameter space, we ran the simulations of Figure 3.4 using various pairs of $(ε, δ)$ parameters, and various combinations of pulses strengths and durations. We found three different possible outcomes, exemplified in Figure 3.5A. We classified these according to whether the function is preserved (keeps the correct count), corrupted (counts in multiples of two) or lost (does not keep a consistent count) as $ε, δ, θ$ vary.

Figure 3.5. — (A) Examples of what can go wrong in a single simulation. In the first plot the signed counters enter a “roulette” behavior (function is lost), in the second plot the pulses do not move the counter from the current fixed point (function is lost), and in the third plot the counter consistently slides two positions (function is corrupted). (B) Various counter behaviors for fixed values of baseline $θ = 1$ , with pulses $θ = 5$ for counter, $θ = 2.5$ and $θ = 0.95$ for signed counter. $ε$ , $δ$ vary according to the axes. Shaded in blue are parameter pairs outside the legal range $ε < \frac{δ}{δ + 1}$ . (C) Various counter behaviors for a fixed values of $ε = 0.30$ , $δ = 0.60$ and baseline $θ = 1$ , with pulse height and duration varying.

The results for all pairs $(ε, δ)$ of parameters and various combinations of pulses strengths and durations are recorded in the bottom row of Figure 3.5, where green indicates preserved, yellow indicates corrupted and red indicates lost performance. One dot corresponds to one simulation of 7 pulses, with a running time of $T = 428$ time units for the counter and $T = 435$ time units for the signed counter. We found that there is a good range of parameters where the counters behave as expected, successfully keeping the count of the number of input pulses received.

This robustness across parameter space was given to us by the theoretical results. But will the performance of the counters survive to added noise in the input and connections? To verify robustness to noise, we computed the proportion of failed transitions among $m = 100$ pulses in both fixed point counters. This was done by introducing varying percentages of noise into the $θ$ input, varied every $d t = 0.1$ of a time unit, and into the connectivity matrix $W$ , as specified in the axes of Figure 3.6. A failed transition is defined as any behavior that does not advance the counter a single clique forward/backwards, depending on the sign of the pulse. Figure 3.6 contains examples of all possible transition failures, as well as examples successful transitions in noisy conditions, as detailed below. All plots simulations used to quantify the failures were done with $ε = 0.25$ , $δ = 0.5$ .

Panel A of Figure 3.6 is an example of a very noisy counter that still preforms well. Each pulse advances the counter a single clique forward, as expected. Panel B of Figure 3.6 shows a noisy counter with two types of failures: first the counter advances too many steps, and then it gets stuck in one of the cliques (altogether these would count as four failures in out analysis, even though there is probably just two defective cliques). Panel D of Figure 3.6 is an example of a very noisy signed counter that performs perfectly well. Panel E of Figure 3.6 is an example of things that can go wrong in the signed counter. The first failure advances the counter one more than expected, the second failure makes the counter go into this kind of roulette behavior until it stops in a given clique. We have also observed the signed counter getting stuck at some point, as in Figure 3.6B.

The percentages of noise in Figure 3.6 were calculated as follows: The noise in the external input $θ$ is i.i.d random noise from the interval $(- 1, 1)$ , and it was introduced to vary every 0.1 fraction of a unit time. The noise in the connectivity matrix $W$ is obtained by perturbing the adjacency matrix $A$ with i.i.d random noise from the interval (0, 1). More precisely, if denote by $\tilde{A}$ , $\tilde{W}$ , $\tilde{θ}$ the noisy versions of $A$ , $W$ , $θ$ ; where $A$ is the transpose of the adjacency matrix of the graph $G$ defining the network, then the noisy versions are:

\tilde{A} = A + p R \tilde{W} = (- 1 + ε) \tilde{A} + (- 1 + δ) (1 - I - \tilde{A}) \tilde{θ} = θ + q S

where $p$ , $q$ are the percentages of noise introduced, and $S$ , $R$ are random matrices the same size as $W$ , $θ$ with entries between (0,1) and (−1, 1), respectively, and 1 is a matrix of 1’s the same size as $A$ . This is equivalent to perturbing $W$ by an amount proportional to length of the interval $[- 1 - δ, - 1 + ε]$ , that is:

\tilde{W} = (- 1 + ε) \tilde{A} + (- 1 + δ) (1 - I - \tilde{A}) = W + p [(- 1 + ε) - (- 1 - δ)] R

We choose to perturb the adjacency matrix instead for computational easy. For each percentage of noise pair $(p, s)$ , we ran 20 simulations, each one consisting of 5 (signed) pulses in the CTLN (signed) counter, as exemplified in Figure 3.6. The results of counting over all these pulses are summarized in panels C and F of Figure 3.6. Each pixel represents the fraction of failed transitions for each pair $(p, s)$ . We found that discrete counters perform perfectly well up to 5% noise, proving that perfect synapses are not necessary. When the noise limits are pushed beyond, counters lose stability and start to slide too many attractors, or to get stuck in one of the cycles. The (unsigned) counter, not surprisingly, proved to be a lot more robust, being barely affected by the amount of noise in $W$ . Although not as robust, the signed counter accurately kept the count 50% of times under 20% noise. Not too bad.

Having successfully encoded sequences of stable fixed points, it is natural to now ask: can we use the same ideas to construct a network that encodes a sequence of dynamic attractors instead? Maybe by chaining together other kinds of core motifs, instead of cliques?

3.2. Sequences of dynamic attractors of the same type

Here we use one of the core motifs giving rise to a dynamic attractor from Chapter 2 to construct a network that steps through a sequence of dynamic attractors. In the previous section, we chained together repeated core motifs that we knew gave rise to stable fixed points to obtained a network that steps through a sequence of stable fixed points. We use the same idea as before, but a cyclic tournament instead of a 2-clique.

To construct the network, we chain together overlapping 5-stars, which are cyclic tournaments. These were introduced in Chapter 2 and are the motifs composing the networks shown in Figures 3.7C, D (a single 5-star is, for instance, the subgraph induced by 1 to 5). Rule 3 says that each 5-star will yield an unstable fixed point support, since each 5-star is connected in such a way that it survives to the larger network. Thus, we should get:

{{1, 2, 3, 4, 5}, {4, 5, 6, 7, 8}, {7, 8, 9, 10, 11}, {10, 11, 12, 13, 14}, {13, 14, 15, 16, 17}, {1, 2, 16, 17, 18}} \subseteq {FP}_{core} (G)

Figure 3.7. — (A) Coexistent dynamic attractors. Each attractor is accessed via attractor-specific pulses. The sequential information is externally encoded. (B) Coexistent dynamic attractors. Each attractor is accessed via identical pulses sent to the network. The sequential information is internally encoded. (C) Patchwork of six 5-stars. Pulses are sent, one at the time, to neurons 6,9,12,15,18,3 (colored) in that order. Each pulse makes the attractor slide to the next limit cycle down the chain. Pulse duration is 1 time unit. (D) Same network as in panel A receives simultaneous pulses to neurons 3,6,9,12,15,18 (bold nodes). Pulse duration is 1 time unit. Two cycles go through the network in the direction opposite to attractor sliding. Despite their apparent symmetry, the green cycle is a core motif of the network, but the red cycle is not (it dies by Rule 2).

Indeed, computational work shows that ${FP}_{core} (G)$ contains precisely the six supports above, each for one 5-star. We also found $| FP (G) | = 361$ (which again, is not surprising since there are up to 2¹⁸ linear systems) and that

{FP}_{core} (G) = {{1, 2, 3, 4, 5}, {1, 2, 16, 17, 18}, {4, 5, 6, 7, 8}, {7, 8, 9, 10, 11}, {10, 11, 12, 13, 14}, {13, 14, 15, 16, 17}, {1, 4, 7, 10, 13, 16}}

(3.3)

Notice that we got an extra core motif, the last one in Equation 3.3. This is a cycle that is also uniform in-degree, and that survives by Rule 2 (colored in green in Figure 3.7D). In contrast, the colored red cycle of Figure 3.7D, at first glance symmetric to the green cycle, will not support a fixed point, by Rule 2 again (because, for instance, node 7 has two inputs from it). The attractor corresponding to the core motif ${1, 4, 7, 10, 13, 16}$ has not yet been found computationally.

We therefore expect to see at least six limit cycles, each one corresponding to a 5-star. Simulations confirm these predictions. In this case, both selective stimulation of specific nodes (Fig. 3.7C) and identical stimulation of all multiple-of-three nodes (Fig. 3.7D) can move the attractor to the next limit cycle in the chain. Indeed, in Figure 3.7C pulses are sent to specific nodes (color coded) which activate the attractor associated with the core motif that the node belongs to. For example, stimulating node 9 will activate the limit cycle associated with the core motif ${7, 8, 9, 10, 11}$ , to which 9 belongs. Note that the order in which pulses are sent matches the order in which the attractors are chained. Simulations show that is it not always possible to jump between non-adjacent 5-stars. The matrix and parameters needed to reproduce this Figure are available in Appendix A, Equation A.3.

By contrast, in Figure 3.7D, pulses are identically sent to the nodes ${3, 6, 9, 12, 15, 18}$ (yes, all the multiples of 3), each producing a transition to the next limit cycle down the chain, analogous to the mechanism observed in the previous two counters, where pulses contain no information about which attractor comes next, but the state of the network counts pulses.This difference is more clearly conceptualized in the cartoon above each type of stimulation (Fig. 3.7A vs Fig. 3.7B).

Robustness of dynamic attractor chain.

We performed similar robustness simulations for both kinds of stimulation types in the dynamic attractor chain as we did for the fixed point counters. That is, we ran simulations for several parameter pairs $(ε, δ)$ , and several percentages of noise $(p, q)$ in $W$ and $θ$ . The results of these noise simulations are summarized in Figure 3.8.

Figure 3.8. — (A) Example of a noisy dynamic attractor chain network that performs well for specific stimulation of all multiple-of-three nodes. (B) Example of a noisy dynamic attractor chain network with 3 failures, global stimulation. (C) Example of a noisy dynamic attractor chain network with 4 failures, global stimulation. (D) Percentage of failed transitions for specific and global stimulation in the dynamic attractor chain network in 120 trials for several percentages of noise in $W$ and $θ$ . (E) Different behaviors of the dynamic attractor chain for various pairs of $ε$ , $δ$ parameter values.

First, we analyze the performance under varying levels of noise. All the noise simulations were done with $ε = 0.51$ , $δ = 1.76$ . The robustness to noise of the dynamic attractor chain, in both types of stimulation, was calculated as in Section 3.1. Recall that this was done by computing the proportion of failed transitions among $m$ pulses in both fixed point counters. This was done by introducing varying percentages of noise into the $θ$ input, varied every $d t = 0.1$ of a time unit, and into the connectivity matrix $W$ , as specified in the axes of Figure 3.8D. This time however, $m = 120$ to accommodate the 6-pulse simulation, for 20 simulations per percentage of noise pair $(p, s)$ . As expected, our dynamic attractor chain is not nearly as robust as our CTLN counters of static attractors. Interestingly, global stimulation performed slightly better than specific stimulation in the presence of noise, but even this was barely robust at 5% noise. Nevertheless, although not very robust to noise, the dynamic attractor chain still performs well under a wide range of $ε$ , $δ$ , $θ$ parameters, as seen in Figure 3.8E, where the red dot indicates that the attractor was qualitatively identical to the ones originally observed in Figure 3.7C,D.

By chaining together identical graphs, we have successfully constructed a network that steps through a set of dynamic attractors that were all repetitions of motifs of the same type, namely the 5-stars. Can we patch together a more diverse set of dynamic attractors in a single network, all accessible via initial conditions and/or pulses? In the next section, we present a toy model that requires coexistence of diverse dynamic attractors, where each attractor is accessible via targeted pulses. Later, we will put these networks together with a counter to step through a set of disparate dynamic attractors.

Chapter 4 | Central pattern generators

In the last chapter, we leveraged network symmetry to provide an example of a network that can support several coexistent dynamic attractors, all easily accessible via attractor-specific inputs, and also accessible in sequence. A natural neuroscience application of coexistent dynamic attractors, and the one we seek to model in this section, is pattern generation. Central Pattern Generators (CPGs) are networks of neurons that can generate rhythmic output in the absence of rhythmic driving input [51]. However, unlike the dynamic attractor chain of the previous section, CPGs may need to support many different types of dynamic patterns simultaneously. So, in addition to coexistence of dynamic attractors, we now also want diversity of said attractors, as schematized in Figure 4.1C.

Figure 4.1. — Reproduction of Figure 1.1. The focus of this chapter is Panel C: Multiple dynamic attractors encoded in the same network, each one accessible via targeted (colored) inputs.

In this chapter we focus on CPGs that have been of interest in neuroscience for a long time, and recently in robotics: animal locomotion [21,22,51,57,71]. Quadruped locomotion is probably the most popular example of a CPG in neuroscience [21,32,41], and beyond [25,55]. Modeling the different modes of locomotion, called gaits, has been challenging because of the overlap in units controlling legs between different gaits. Most models overcome this difficulty by requiring changes in network parameters, such as synaptic weights [21,32], in order to transition between different gaits. In contrast, here we present a CTLN capable of reproducing five coexistent quadrupedal gaits (bound, pace, trot, walk and pronk). Gaits coexist as distinct limit cycle attractors in the network, without any change of parameters needed to access different gaits. Instead, different gaits are obtained from different initial conditions, or by $θ$ -stimulation of a specific neuron involved in the gait. Also, where most models for locomotion use oscillating units, the oscillatory behavior we obtain is a result of connectivity alone, since none of our units are intrinsically oscillating.

Although several types of architectures have been proposed to model distinct CPGs (recurrent neural network based, half-center oscillator based and abstract oscillator based CPGs) [22], coexistence of different attractors in the same circuit (as would be needed to model different modes of locomotion, controlled by the same set of neurons) has been challenging to many models. In this chapter we leverage cyclic unions (Thm. 11, Ch. 2), to present two different locomotion CPGs where patterns arise as attractors, and are thus very robust to noise and perturbations: a network that supports five different quadrupedal gaits and molluskan hunting mechanism.

4.1. Cyclic unions as pattern generators

Recall from Chapter 2 that for cyclic unions (reproduced in Fig. 4.2A), the neural activity flows through the components in cyclic order. Thus, cyclic unions are particularly well suited to model networks that must follow a sequential activation of the nodes, like CPGs, because attractors themselves will follow the direction of the cyclic union. This is true for other, less constricted architectures. These are the subject of [59], some of whose results are work of my own and thus presented in Chapter 6.

Figure 4.2. — (A) Cyclic unions, and Theorem 11. (B) Example of a cyclic union giving rise to sequential activation of the nodes.

Recall also from Theorem 11 in Chapter 2 that the core motifs of the cyclic unions are made up of core motifs coming from each component and thus cyclic union of core motifs is core. For instance, in Figure 4.2B, we have an example of a cyclic union of four core motifs: two 2-cliques ({1,2}, {4,5}) and two single nodes ({3}, {6}). The $FP (G)$ set of the whole graph is made up of pieces taken from each $FP (G_{τ_{i}})$ , colored-coded by component. Because all components were core motifs, the whole graph is now core motif, as seen by its $FP (G)$ in Figure 4.2B. The activity of the network, pictured below its $FP (G)$ set, follows the direction of the cyclic union. Note that neurons 1 and 2 are synchronized, neurons 5 and 4 are synchronized, and then there is a successively cycling through these pairs, half a period apart.

As mentioned earlier, not only cyclic unions yield sequential attractors, as seen in Figure 4.3: both the graphs at the top of the figure are examples of cyclic unions with three components, and the activity for these networks traverses the components in cyclic order. Compare these with the the bottom graphs of the figure, which do not have a perfect cyclic union structure (each graph has some added back edges or dropped forward edges highlighted in magenta), but have very similar dynamics to the ones above them. Despite the deviations from the cyclic union architecture, these graphs still produce sequential dynamics.

4.2. Quadruped gaits

In this section we present a CTLN capable of reproducing five coexistent quadrupedal gaits: bound, pace, trot, walk and pronk. Figure 4.4 shows how the gaits are characterized by their relative phases. For example, the bound gait is characterized by having both front legs synchronized, both back legs synchronized, and then successively cycling through these pairs, half a period apart. This is why back legs have a relative phase of 0, and front legs both have a relative phase of 0.5. The rest of the gaits in Figure 4.4 are read similarly. Yes, pronk is a little all-legs jump. Gazelles do it.

We begin by modeling each one of these on its own, using cyclic unions, as foreshadowed by the example of Figure 4.2B.

4.2.1. Construction of gaits

Single gaits

In the example of Figure 4.2B, we actually constructed the bound gait: two pairs of nodes are synchronized, half a period apart. There, we did it using only six nodes. In what follows, we aim to glue several gaits together, so it is not convenient to have such a small number of nodes per gait (very few nodes with a lot of edges will form cliques, which yield stable fixed points, which we do not want for gaits), since adding new gaits will results in more and more connections between leg nodes and this would make everything collapse into a giant clique, resulting in a stable fixed point. This is why we now propose a slightly different construction for bound, and other gaits, by adding a few extra auxiliary neurons to overcome this issue.

To re-model the bound gait, we replace the old 2-clique components by a square made up of four 2-cliques, like that of Figure 4.5C. Why? The ${FP}_{core} (G)$ of such a square is given by all the individual 2-cliques composing its sides, as seen in Figure 4.5C. And since cyclic unions will grab a piece from every component, we are sure that a given pair of legs will still be part of some core motif of the network. More precisely: since the bound gait is characterized by having both front legs synchronized, both back legs synchronized, and then successively cycling through these pairs, half a period apart, as in Figure 4.6A, it is natural to group front legs in a single component and back legs in a single component, and to connect them through two auxiliary 1-node components.

To achieve this, we split the nodes into four components, as shown Figure 4.6B. The resulting components are $τ_{1} = {2, 3, 6, 7}$ , $τ_{2} = {10}$ , $τ_{3} = {1, 4, 5, 8}$ , and $τ_{4} = {9}$ . Theorem 11 now says that the ${FP}_{core} (G)$ set of the bound gait of Figure 4.6B is obtained by selecting a support $σ_{i} \in {FP}_{core} ({G |}_{τ_{i}})$ from each component $τ_{i}$ and forming their union: $σ_{1} \cup {9} \cup σ_{2} \cup {10}$ , where $σ_{1} \in {{2, 3}, {2, 6}, {6, 7}, {3, 7}}$ and $σ_{2} \in {{1, 4}, {1, 5}, {5, 8}, {4, 8}}$ . This will include, among others, the support ${1, 2, 3, 4, 13, 14}$ corresponding to the bound gait attractor seen in the rate curves of Figure 4.6B. It is because we made sure to built-in the support ${1, 2, 3, 4, 13, 14}$ that we can easily see and access the attractor reproducing the bound gait.

A slightly different construction consists of the gait walk, where legs move cyclically with a quarter period difference, as seen in Figure 4.6C. In the same manner, we use Theorem 11 to group each leg in a component along with its auxiliary neurons and add four auxiliary neurons to ensure that the phase between legs is a quarter period apart (Figure 4.6D). The building block for the leg components is now the 3-clique of Figure 4.5B. Since the building block is a core motif, the ${FP}_{core} (G)$ of the walk gait consists of a single element, formed by taking a core motif from each component in Figure 4.6D, resulting in $FP (G) = {FP}_{core} (G) = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16}$ . This core motif promises to give rise to the attractor reproducing the walk gait. Indeed, the resulting dynamics of this network are depicted in Figure 4.6D, and the network once again behaves as expected, reproducing the walk gait. The matrix and parameters needed to reproduce these simulations, and other individual gaits, are available in Appendix A, Equations A.4 to A.8.

Other individual gaits can be similarly constructed, as shown in Figure 4.7A. In pace, left legs are synchronized, right legs are synchronized, half a period apart, and so we group neurons into appropriate components, in the same way we did bound. Similarly with trot, where diagonal legs are synchronized, half a period apart. The core motifs of pace and trot are similarly derived, using the “square” building block of Figure 4.5C, along with Theorem 11. The last gait our model will include is pronk, where all legs are synchronized, requiring now a single component to group all legs together along with a pair of auxiliary nodes that control the frequency of the pronking movement.

Note that all gaits have been constructed as a cyclic union of the three basic building blocks of Figure 4.5: a “square” made out of 2-cliques, a 3-clique and an isolated node. From these pieces, and by Theorem 11, the $FP (G)$ set for each gait $G$ can be readily obtained as we did with bound: by selecting a support $σ_{i} \in FP (G_{i})$ from each component $G_{i}$ and forming their union $σ = \cup_{i = 1}^{4} σ_{i} \in FP (G)$ . For a detailed analysis of the $FP (G)$ sets see Tables A.2 and A.3 in Appendix A.

Since each gait was constructed to have ${FP}_{core} (G)$ include the leg nodes, along with the appropriate auxiliary nodes, we expect the graphs of Figure 4.7A to give rise to a CLTN that reproduces these gaits. Indeed, in Figure 4.7B we have simulated the individually constructed gait networks. As expected, we see an attractor reproducing the respective gait. Initial conditions are plotted to the right of each greyscale, and were chosen to have all neurons off, except one colored auxiliary neuron on.

Matrices and parameters needed to reproduce the simulations of Figure 4.7 are available in Appendix A.

Now, can we make these attractors we are seeing here coexist into a single network, with leg nodes overlapping? Said differently, can we glue together (in topology sense) the networks in Figure 4.7A by leg nodes, and other shared nodes? As we will see below, the answer is yes.

4.2.2. Coexistent gaits

To construct the five-gait network, we glued together the networks in Figure 4.7A by identifying common neurons (1 through 6) and induced edges between them. Figure 4.7F shows the resulting glued network. More precisely, this gluing identifies all nodes from 1 to 6 coming from each isolated gait into a single one in the glued network. This operation does not exactly correspond to any graph operation, and it is more like a gluing in a topological way, where the gluing instructions are given by the vertex numbers and the edges between them.

As mentioned before, the inclusion of several auxiliary nodes was necessary to merge the five networks into a single one. Indeed, nodes 1 to 4 are shared by all gaits and represent the leg assignment (1. LB - left back, 2. LF - left front, 3. RF - right front, 4. RB - right back), but nodes 5 to 8 are shared by bound, pace and trot, while not representing any leg assignment. The rest of the nodes are gait-specific, meaning that they are uniquely associated to a single gait and therefore only active when such gait is active, and off otherwise. These nodes play the very important roles of keeping the phase between foot steps in each gait and activating a specific gait by stimulating one of them. These nodes are: nodes 9 to 12 which are there to augment the walk cliques, and so they are specific to walk; and nodes 13 to 23 which are specific to a unique gait, as color coded in Figure 4.7A.

Although the complex structure of the merged network makes it hard to analytically derive the $FP (G)$ , we found computationally that $| FP (G) | = 875$ (again, coming from up to 2²⁴ linear systems), and

{FP}_{core} (G) = {{1, 2, 23, 24}, {1, 4, 23, 24}, {2, 3, 23, 24}, {3, 4, 23, 24}, {1, 6, 7, 8, 13, 16, 18}, {2, 5, 7, 8, 14, 16, 17}, {3, 5, 6, 8, 14, 15, 18}, {4, 5, 6, 7, 13, 15, 17}} .

(4.1)

Although sadly none of the core motifs in Equation 4.1 there correspond to the gait attractors, in Figure 4.7C it can be seen that all gaits are actually preserved when embedded in the big network with all gaits, as the dynamics of each gait when embedded in the glued network are indeed remarkably the same as when isolated (cf. panel B). The greyscale and rate curves for each gait embedded within the glued network are plotted in the same column as its corresponding graph in Figure 4.7A. Also, it is important to note that even though the isolated gait networks (Fig. 4.7A) are not the same as the induced gait networks (Fig. 4.7D), they support the same (qualitatively) attractor, as seen in the rate curves of Figure 4.7E.

To see that each gait is accessible via different initial conditions when embedded in the five-gait network, notice in Figure 4.7B that when all neurons, but neuron 13, are initialized to zero, the system evolves into the bound gait. This is because neuron 13 corresponds uniquely to bound. Similarly, when the network is initialized in neuron 15, it goes into pace. When it is initialized in neuron 17, it goes into trot. And so on. Recall that walk has two different sets of gait-specific neurons (9 – 12 and 19 – 22), and both kinds will send the network into the walk gait. Hence, our five-gait network successfully encodes these coexistent dynamic attractors, each accessible via initial conditions, that have several overlapping active nodes (nodes 1 to 8), with no interference between attractors. Moreover, the attractors are not overly sensitive to the value of initial conditions, as long as the highest firing neuron in the initial condition is the one associated with the desired gait.

In particular, since all gaits coexist as distinct limit cycle attractors in the network, each accessible via different initial conditions, it is also possible to smoothly transition between gaits by means of a gait-specific external pulse $θ$ . To effectively change gaits, it suffices to stimulate a the appropriate gait-specific auxiliary neuron with a pulse. The network quickly settles into the dynamic attractor corresponding to the gait of the auxiliary neuron stimulated, as seen in Figure 4.7G. There, pulses of 2 time units are sequentially sent to neurons 17,19,15,13,23 and 17, producing the expected transitions between gaits. The network smoothly transitions to the corresponding gait. The order in which neurons are stimulated does not matter, any sequence of pulses sent to gait-specific auxiliary neurons will produce the respective transitions, regardless of which gait comes before or after.

However, not all gaits are created equal, and this becomes evident in some transitions. From the construction of the graph we suspect that bound, pace and trot are on equal footing, since they were created analogously, only differing by which pairs of legs are synchronized. Consequently, we expect some symmetry in their basins of attraction, so that transitioning to and from these gaits is easier, and this has been indeed observed in simulations. By contrast, we have observed that walk and pronk (both at the boundary of how many legs can be synchronized with 1 leg and 4 legs respectively) seem to have a bigger basins of attraction, since the network requires a higher $θ$ stimuli to transition out of these two gaits. All in all, this network provides a straightforward mechanism to switch between the desired attractors, fulfilling our initial goal of controlling coexistent dynamic attractors in a single network.

4.2.3. Parameter analyses

Good parameters.

A common drawback in many models is that sometimes they require very fine-tuned values of their parameters to perform as expected. Also since dynamic attractors are known to be less robust than fixed point attractors, it makes sense to assess the robustness of the network in Figure 4.7F. Individual gaits were designed as a cyclic union (Thm. 11) of building blocks (Fig. 4.5) whose $FP (G)$ is parameter independent [15], meaning that its $FP (G)$ set is preserved across the legal range of parameters $ε < \frac{δ}{δ + 1}$ . Since cyclic unions of parameter independent components are also parameter independent, all isolated gaits are parameter independent as well. Indeed, it is a simple consequence of Theorem 11:

Corollary 1.

Let $G$ be a cyclic union of component subgraphs ${G |}_{τ_{1}}, \dots, {G |}_{τ_{N}}$ . Then $FP (G)$ is parameter independent if and only if $FP ({G |}_{τ_{i}})$ is parameter independent for all $i \in [N]$ .

Proof. Suppose $σ \in FP (G, ε, δ, θ)$ for all $ε$ , $δ$ , $θ$ in the legal range. By Theorem 11, $σ \cap τ_{i} \in FP ({G |}_{τ_{i}})$ for all $ε$ , $δ$ , $θ$ in the legal range and all $i \in [N]$ , meaning that $FP ({G |}_{τ_{j}})$ is also parameter independent. The other implication is analogous. □

This implies that the $FP (G)$ set does not vary with parameter changes within the legal range, and so we can expect the gait attractors to be present in each individual gait network for a wide range of parameter values, so each gait, isolated, turns out to be very robust! The same unfortunately cannot be said about the glued five-gait network, since it is not truly a cyclic union, but rather a gluing of cyclic unions, for which we do not have theoretical results yet. So, how sensitive is our network to the choice of $ε$ , $δ$ , $θ$ parameters? Are the five attractors still present, and easily accessible, if we vary our parameter values?

To answer this question, we tried to reproduce the simulations of Figure 4.7C using different $ε$ , $δ$ , $θ$ parameters. First, we observed how the attractor corresponding to each gait behaved under several values of $ε$ , $δ$ in the five-gait network. We found that there were three different possibilities: either the attractor is preserved, meaning that it is identical in qualitative behavior to the attractors in Figure 4.7C (up to a change in amplitude or period); corrupted, indicating that the auxiliary neurons corresponding uniquely to that gait are still active and firing cyclically, but the leg nodes have lost its symmetry; or lost, when the auxiliary neurons corresponding uniquely to that gait are not firing cyclically (this usually translated into a gait becoming a different attractor, which could either be another gait, a stable fixed point or a chaotic-looking attractor).

An example of each of these outcomes is exemplified in Figure 4.8A, where only the last 200 time units of the simulation are shown, for clarity. In the left example, the network was initialized to the bound gait, and the gait is perfectly preserved. In the middle plot, the network was initialized to the pronk gait, and even though the auxiliary neurons corresponding to pronk are still active, the leg neurons lost their symmetry towards the end. In the right plot, the network was initialized to the trot gait, but near the end of the simulation the network settles into a corrupted form of pronk, as indicated by the auxiliary neurons of pronk being active. The results of all those simulations are summarized in Figure 4.8B. Green denotes preserved attractor, yellow denotes corrupted attractor and red denotes lost attractor. The intersection of the green dotted regions is the range in which all gaits are preserved: $0.05 < ε < 0.23$ , $0.2 < δ < 0.6$ , for at least the first 500 units of time. These are the “good parameters”.

We then did the same thing but varying the $θ$ parameter only. Figure 4.8C shows examples of what can happen in this case. In the first example, the network was initialized to the walk gait, and the gait is perfectly preserved. In the right example, on the other hand, the network was initialized to the pace gait, but immediately went into pronk. Figure 4.8D shows the same summary of results, but carried out for constant values $ε = 0.14$ , $δ = 0.40$ , and varying $θ$ . These values were arbitrarily chosen from the $0.05 < ε < 0.23$ , $0.2 < δ < 0.6$ good parameters from above.

In conclusion, we have computationally found that all the attractors corresponding to the five gaits in our network are preserved for all $0.05 < ε < 0.23$ , $0.2 < δ < 0.6$ and $0.5 < θ < 2.5$ , for at least 500 units of time. Having a range of parameters available raises the question: what part do they play in modulating the non-qualitative characteristics of our gaits such as period and amplitude? These are important questions because period and amplitude in firing rates affect muscle tension [1,75].

Parameter modulation.

In this section, we compute the period and amplitude, of each gait within the five-gait network, using the good parameters from above. To do so, we ran a single simulation for $T = 500$ time units and then we used the MATLAB function findpeaks to find the location and value of the peaks of a single rate curve. Examples of this can be seen in Figure 4.9I. The black bars in the rate curves show the amplitude, and the black triangles mark the time of the peaks, both calculated using only the first leg $(x_{1} (t) - LB)$ . We found these peaks only during the last half of the simulation to avoid taking into account the time before settling into the attractor. To find the periods we subtracted all the consecutive peak locations. We had to get rid of the first and last period and amplitude data point because they became corrupted by chopping the curve. Then we averaged the periods and peak values to obtain a single period and a single amplitude per simulation. This is the value shown in Figures 4.9A-H.

There, each point corresponds to a single simulation. The color of the point corresponds to a given gait, as specified in the label of Figure 4.9A. Note that, since bound, pace and trot have virtually the same structure, the data point for these are overlapped in Figures 4.9A-H. We also show crossections for Figures 4.9A,D. The blue shaded corner of the 3-dimensional plots corresponds to values outside the legal range. In Figures 4.9A-F we are varying only $ε$ and $δ$ , with a fixed value of $θ = 1$ . In Figures 4.9G-H, we use the fixed values of $ε$ , $δ$ in the slices above to vary the value of $θ$ .

Notice that for fixed values of $ε$ , $δ$ , $θ$ , period decreases with $ε$ and $δ$ ; whereas amplitude increases for walk, and decreases for all other gaits. Period is seen to be invariant to $θ$ changes, and amplitude increases with increasing values of $θ$ . It is known that an increased firing rate produces an increase in muscle tension by summation of more frequent successive motor contractions [1,75]. This implies that in our model, period would correspond to how fast/slow muscles contract and relax (bigger period means muscles contract and relax faster). Period therefore is controlling how fast the movement is executed, that is, the speed of the gait. Amplitude of firing rate corresponds to tension in muscle, because the more action potentials, the more muscle units are activated.

Great. But while we have showed that there is a fairly wide range of parameters under which our network behaves as expected, and where we can module the properties of the firing rates, there is still the assumption that all synaptic connections are perfect and identical: either $- 1 + ε$ or $- 1 - δ$ . Is our network robust to noise in the synapses? What about the inputs? Are transitions still possible in the light of noisy connections?

4.2.4. Robustness to noise

To explore these questions, we added uniform noise to $W$ and $θ$ and computationally tested whether the attractor was lost. A gait is said to be lost when the attractor corresponding to said gait degenerates into a different attractor, which is clear from which auxiliary neurons are active. We did not measure if the attractor was corrupt or not, because any amount of noise will break the symmetry of the legs. This was done in the same way as we did in Chapter 3. We ran a single noisy simulation and observed if the attractor was lost or not. Examples of the behaviors we observed can be found in Figure 4.10A-C. In A, the networks was initialized to trot, but it chaotically jumps between different attractors, until it settles to pronk. In B, the network was initialized to walk, and it remains there, as good as it can. This means, the legs lost their symmetry, but the high-firing neurons are the same as in walk, in the same order. This classifies as not lost in our analysis. In C, we observe the same situation but with pronk. Here, the asymmetry of the legs is more notable. These are wobbly gaits.

Figure 4.10D summarizes our findings. We simulated each gait 10 times, and then calculated the percentage of lost gaits in those 10 trials, for varying percentages of noise in $W$ and $θ$ . The color of the pixel in Figure 4.10D represents the percentage of lost gaits out of these 10 trials. A trial corresponds to a single noisy pair $(W, θ)$ , which was used to test all gaits once. All simulations used to quantify the failures were done with $ε = 0.25$ , $δ = 0.5$ and $θ$ -noise with $d t = 0.1$ .

In around 90% of trials, gaits were not lost at 2% noisy $W$ and $θ$ , indicating that that the perfect binary synapses of CTLNs are not necessary for this network to produce the desired gaits, but bigger levels of noise will break the network’s performance anyway. Interestingly, at bigger levels of noise, the gaits were lost most commonly to pronk, again. Pronk also happens to be one of the two most robust gaits (see pronk in Figure 4.10D). This supports our suspicion that pronk’s basin of attraction is bigger. It is also evident again the way in which the construction of the first three gaits differs from the last two gaits. This raises the question of what would happen to the robustness of the network if there was no pronk or walk, so that the network is perfectly symmetric.

Finally, we also tested the ability of the network to transition under 1% and 5% noise in $W$ , as shown in Figure 4.10E. Transitions are robust at 1% added noise, but past 5% gaits are commonly lost to pronk and thus transitions are not successful.

4.3. Molluskan hunting

A dynamically different example of a CPG is Clione’s hunting mechanism. Clione is a marine mollusk without a visual system, so when it chemically senses its prey, it must explore its surroundings by swimming in its vicinity until it finds it. The direction of swimming is controlled by Clione’s tail, which receives input from Clione’s gravitational sensory organs, the statocyst. The statocyst is covered by mechanoreceptors that receive inputs from the statolith, a stone-like structure moving inside the statocyst under the effect of gravity [58].

The model we propose here is inspired by the model in [71]. There, the authors propose a six-neuron model, with Lotka-Volterra-type units and inhibitory non-symmetric connections. This fosters competition and lead to a state of winnerless competition, where no single neuron dominates the activity. This in turns result in complex spatiotemporal patterns that mimic random changes in direction in the gravitational field. These patterns drive the unpredictable and random-like hunting behavior observed in Clione.

In that model, each node represents a receptor neuron in the gravity sensory organ, the statocyst. These receptor neurons are part of the sensory network that processes information related to the body’s position relative to the gravitational field and are influenced by the central hunting neuron during hunting behavior in Clione. Here we propose a very similar model, also consisting of six-units, connected via inhibitory non-symmetric connections. However, our units are not intrinsic oscillators, and so the mechanism of pattern generation in our case arises purely from connectivity.

Recall that cyclic unions are particularly well-suited to model CPGs, so here we construct our network as a cyclic union of three components: each component is an independent set of opposing directions, as shown in Figure 4.11A. This will create competition between opposing directions, and force the choice among them (thus avoiding invalid combinations where two opposing directions are simultaneously high-firing). This should result in the 8 possible combinations of directions in 3-dimensional space (up/down, right/left, front/back), as shown in Figure 4.11B.

Figure 4.11. — (A) Clione’s network as a cyclic union and as an octahedron. (B) All possible swimming directions in 3-dimensional space, each one corresponding to an attractor as labeled. (C) CTLN solution for Clione’s network. Attractor-specific pulses are sent to the network. Often producing a change in the attractor, labeled at the top.

Each neuron represents swimming in a given direction: up, down, right, left, front, and back. Any valid combination of these three directions will uniquely define a direction in 3-dim space in which Clione will find itself swimming. All possible swimming directions will then correspond to the eight octants of 3-dimensional space, as illustrated in Figure 4.11B. Antagonist directions are colored in the same color with different shades. A more intuitive view of the network is given by the octahedral shape in Figure 4.11A, hence we also sometimes refer to this network as “octahedral network”.

Since our network is a cyclic union of independent sets, the computation of core motifs becomes straightforward. By Theorem 11, the core motifs are obtained by choosing a core motif from each component. The core motifs of an independent set are each singleton node [15, Rule 3]. This results in the following $2^{3} = 8$ core motifs:

{FP}_{core} (G) = {{1, 2, 3}, {1, 2, 6}, {1, 3, 5}, {1, 5, 6}, {2, 3, 4}, {2, 4, 6}, {3, 4, 5}, {4, 5, 6}},

(4.2)

which correspond to the 8 possible swimming directions, as expected. Computationally, we also found that $| FP (G) | = 27$ , as expected from Theorem 11, as well.

Simulations show that each core motif in Equation 4.2 corresponds to an attractor, and that each attractor is accessible via initial conditions, as shown in the simulations of Figure 4.11. In addition, as in the five-gait network, it is possible for the octahedral network to transition between attractors by sending a $θ$ -pulse to the direction we want to switch to. For example, in Figure 4.11C, the network was was initialized in the attractor corresponding to up-right-back, and even though a first pulse fails to transition the network, the second pulse, sent to neuron 2 (left) changes the attractor in this single direction, that is from up-right-back to up-left-back. We observe that subsequent pulses do produce the desired effect of transitioning a single direction. The matrix and parameters needed to reproduce the simulations of Figure 4.11 are available in Appendix A, Equation A.10.

But why is it that the first pulse failed to make the network transition attractors? Figure 4.12 offers some insights. Note that in Figure 4.12B, neuron 2 (left) received a pulse, thereby sending the network into the down-left-back attractor, as wanted and expected. However in Figure 4.11C, a pulse was sent to switch from up to down, and nothing really happened.

Looking closely, we observe that in D, the dark green rate curve surpasses the light green rate curve, and the transition is successful. In E, by contrast, the dark blue rate curve was very close to surpassing the light blue rate curve, however, at the end of the pulse the dark rate curve was still below the light blue curve, so the transition fails. It can be proven that this is a general phenomenon: the network will transition attractors into an opposing direction only if the neuron that received the pulse “wins” at end of the pulse.

4.3.1. Analysis of basins of attraction

Indeed, the next theorem shows that the network will change from direction $i$ to direction $j$ at time $t$ only if $x_{i} (t) < x_{j} (t)$ , for $i$ , $j$ opposite directions. To formalize the opposite directions notation we introduce a permutation $π = (14) (25) (36)$ that prescribes which nodes are opposite. We then have:

Theorem 12.

Let $I$ denote the core-motifs of the octahedral network ( ${FP}_{c o r e} (G)$ above) and suppose there exists an attractor $A_{σ}$ of the octahedral network, corresponding to one of the core motifs $σ \in I$ , and that there are no attractors that do not correspond to a core motif. Then there are exactly 8 attractors, one for each core motif, and their respective basins of attraction $B_{σ}$ are contained in the sets

C_{i j k} = {(x_{1}, x_{2}, x_{3}, x_{4}, x_{5}, x_{6}) \in ℝ^{6} ∣ x_{i} \leq x_{π (i)}, x_{j} \leq x_{π (j)}, x_{k} \leq x_{π (k)}},

(4.3)

where $π = (14) (25) (36)$ .

Proof. Note that the octahedral network is symmetric under the following permutations in S₆:

π_{1} = (14) π_{2} = (25) π_{3} = (36)

so that $π = π_{1} π_{2} π_{3}$ . Therefore the group of symmetries of the network is $〈 π_{1}, π_{2}, π_{3} 〉 ≅ ℤ_{2} \times ℤ_{2} \times ℤ_{2}$ . As a consequence of these symmetries, the existence of an attractor $A_{σ}$ corresponding to a core motif $σ \in I$ implies the existence of all other attractors corresponding to the rest of the core motifs in $I$ . Since there are no other attractors by assumption, the state space must be partitioned into exactly eight symmetric basins of attraction $B_{σ}$ .

On the other hand, since $ℓ$ and $π (ℓ)$ receive input from the same neurons (for $ℓ = 1, \dots, 6$ ), it is impossible to cross the hyperplanes $x_{ℓ} = x_{π (ℓ)}$ , as we show next. Suppose that $x_{ℓ} (t_{0}) = x_{π (ℓ)} (t_{0})$ for some $t_{0}$ . The dynamic equations for neurons $ℓ$ and $π (ℓ)$ are

\frac{d x_{ℓ}}{d t} = - x_{ℓ} + {[\sum_{j \neq ℓ, π (ℓ)} W_{i j} x_{j} + (- 1 - δ) x_{π (ℓ)} + θ]}_{+} \frac{d x_{π (ℓ)}}{d t} = - x_{π (ℓ)} + {[\sum_{j \neq ℓ, π (ℓ)} W_{i j} x_{j} + (- 1 - δ) x_{ℓ} + θ]}_{+} .

Both summation terms are the same, since $ℓ$ and $π (ℓ)$ receive input from the same neurons. Since $x_{ℓ} (t_{0}) = x_{π (ℓ)} (t_{0})$ by assumption, we have that $\frac{d x_{ℓ}}{d t} = \frac{d x_{π (ℓ)}}{d t}$ , and therefore $x_{ℓ} (t) = x_{π (ℓ)} (t)$ for all $t \geq t_{0}$ . This proves that it is impossible to cross the hyperplanes $x_{ℓ} = x_{π (ℓ)}$ . This implies that the sets

C_{i j k} = {(x_{1}, x_{2}, x_{3}, x_{4}, x_{5}, x_{6}) \in ℝ^{6} ∣ x_{i} \leq x_{π (i)}, x_{j} \leq x_{π (j)}, x_{k} \leq x_{π (k)}}

are forward-invariant and also partition the state space into eight sets. Since the basins of attraction $B_{σ}$ are also forward-invariant and partition the state space into eight sets, we get the desired result. □

This theorem says that, under some reasonable assumptions, the basins of attraction partition state space symmetrically, a direct consequence of the symmetry of the graph. A notable consequence of the theorem is that the transitions depend on the intensity and duration of the pulse and in turn, of the timing of the pulse arrival. This would partially explain the observed randomness in direction switching of Clione. If Clione receives an external signal (from a prey encounter, for instance), then this signal must arrive at the right time and have the right duration and strength for Clione to change directions. This introduces a great amount of randomness into the system, because several conditions must align for Clione to effectively change directions.

Chapter 5 | Sequential control of dynamic attractors

Let us recall the path that brought us here, as re-told by Figure 5.1. Networks that support many stable fixed point attractors, each accessible via attractor-specific inputs, as in Figure 5.1A, have been around for a while. In Chapter 3, Section 3.1, we built two networks that internally encode a sequences of fixed point attractors, each accessible via identical inputs, as in Fig. 5.1B. In the same chapter, in Section 3.2, we also presented a network supporting several dynamic attractors, each accessible via both targeted and identical inputs, as in Fig. 5.1B,C. Those attractors were all of the same type, and so in Chapter 4 we gave an example model of a neural function that required coexistence and accessibility of several different dynamic attractors, also as in Fig. 5.1C.

Now, in this chapter, motivated by the hierarchical complex motor control of biological brains [66], we aim to combine Figs. 5.1B and 5.1C to obtain Fig. 5.1D, but whose attractors can be flexibly recombined. A key difference between Fig. 5.1D as done in Chapter 3, and the network we build in this chapter, is that in the dynamic attractor chain network of Fig. 3.7D, the order of the sequence was built into the network, meaning we could reorder attractors arbitrarily, and each new element of the sequence required storing the same pattern in the network again in the form of an extra motif.

By contrast, the construction we propose here encodes the order of the sequence in a network separate from that supporting the dynamic attractors that will be the sequence elements. This very akin to how birds and humans sequence complex movements [50,78]. This approach has many advantages, among which we have the capacity to recombine attractors in any order and that to add an attractor we do not need to make a whole new copy of it and embedded into the network.

The general construction of such a network, illustrated in Figure 5.2, goes as follows: the CTLN fixed point counter network from Chapter 3 will encode the order of the sequence. A separate network will encode the dynamic attractors that are to be the elements of the sequence. In Figure 5.2 we see this as several coexistent (dynamic) attractors, each one accessed via global $θ$ -pulses sent to the CTLN counter network (shaded in gray). We represent these global pulses by a hand pushing a button in Figure 5.2. Each pulse activates the next clique down the chain, which will activate a different attractor in the network above, as prescribed by the orange arrows. With this construction, the order of the sequence is encoded in the orange connections, separately from the attractors themselves. Notice that the only difference between panels A and B are the orange arrows, and each wiring will result in two different sequences, whose elements are the same, but accessed in different order.

To give a proof of concept of this idea, we illustrate the construction using both networks from Chapter 4 as our networks encoding the dynamic attractors that are to be the sequence elements.

5.1. Sequences of quadruped gaits

In this section, we apply the construction outlined in the previous paragraph by using our five-gait network from Section 4.2 as the network supporting the attractors will be the sequence elements. The resulting network, consisting of less than 50 units, is shown in Figure 5.3A. The network is composed of three layers: L1, the unsigned CTLN counter from Section 3.1, with as many cliques as sequence steps. L2, a intermediate layer with one node per attractor in the sequence. And L3, the five-gait network of Section 4.2, encoding the set of dynamic attractors that are the elements of the sequence.

Layers are connected in such a way that each node in L2 is connected to a node in L3 activating one of the sequence elements; and the connections from L1 to L2 determine the order in which each attractor in L3 will be accessed. The shaded squares in L1 indicate that all the shaded nodes send edges, which are drawn as a single thick arrow stemming from the shaded region. Each of these layers, and the connections between them, play an important role, as described below:

L1 in Figure 5.3A, the counter network layer, advances the sequence upon reception of each input pulse. Recall that when the neurons in the counter receive a uniform input pulse, the activity slides to the next stable fixed point in the chain, thus advancing the sequence one step by activation of the next clique. This shift in activity is effectively communicated as a pulse to the intermediate layer L2.

L2 in Figure 5.3A, because each of its nodes is connected to a single clique from L1, is sort of summarizing the input from L1 to communicate it to L3. This “summary” of activity will prevent L1, and its high firing rates, to interfere with L3, and its attractors. Neurons in this layer must also receive a pulse simultaneously with L1 to successfully advance the sequence, but not as strong.

L3’s job is to support the dynamic attractors. Recall that each of these can be accessed via $θ$ -stimulation of a specific neuron uniquely associated with each gait. This fact is precisely what makes this construction possible. Otherwise it would not be possible for L2 to select a specific attractor in L3 by communicating the “pulses” coming from L1 to the specific neuron associated with the desired attractor. In the five-gait network, these are the auxiliary nodes corresponding to different gaits, which are redrawn at the bottom of L1 for clarity in Figure 5.3A (nodes 32,34,36,38,42).

Finally, connections between L1 and L2 is where the order of the sequence is encoded: the order in which the nodes in L1 connect to the nodes in L2 will dictate the order in which the attractors are accessed. In Figure 5.3A, for instance, the cliques connect to L2, in such a way that nodes in L2 activate the auxiliary nodes of L3 corresponding to pace (34), bound (32), pronk (42), trot (36), walk (38), pronk (42), and trot (36), and so the sequence will be executed in that exact order.

The formal layering scheme is prescribed by the matrix in Figure 5.3C, and formalized in the next section. This matrix, along with Equations 2.2 and pulses in Figure 5.3B, is what we used to simulate the dynamics. The dynamics behave as we hoped when we first put together the layering scheme in our imaginations, as seen in Figure 5.3C, and as described below:

The first plot shows the greyscale of all neurons. Below the greyscale, the pulses that L1 and L2 receive are pictured, colored by layered. L1, pictured in blue, has a baseline of b = 1 and a pulse b = 6. L2’s pulse, colored in red has a baseline b = 0 and a pulse b = 12, double that of L1. The idea behind having a 0 baseline, is that neurons in L2, the only ones connecting to L3, will die out and won’t interfere with the dynamics of L3 for long. Finally, L3 has b = 1 constant, so this layer is not receiving external pulses at all. Instead, external pulses are sent to all neurons in layers L2 and L3, as opposed to specific neurons controlling each gait (as in Ch. 4). This means that the sequence is fully encoded within the network, and the pulse themselves carry no information about what the next step is. The “pulses” that L3 is receiving from L2 are attractor-specific, even though the pulses sent to the network are not.

All of these predictions about the behavior can be clearly seen in the rate curves of Figure 5.3B. Since the baseline b is zero for neurons in L2, the rate curves of L2 nodes show peaks only at pulse times, but otherwise neurons die off. We also observe the dynamics of L1, remarkably the same as in 3.1; each pulse moves L1 one step to the right, activating the next sequence element, L2 communicates this transition to L3. The sequence of gaits can be read off from the rate curves of the leg nodes, as labeled above the greyscale, and it is exactly the same sequence encoded in the connection from L1 to L2. Transitions seem as effective as sending specific pulses to the five-gait network on its own, as in Section 4.2. Note that the color of the rate curves of L1 and L2 match those of the auxiliary neurons on Figure 5.3A. The matrix needed to reproduce the simulation of Figure 5.3B is available in Appendix A, Equation A.11.

As L2 effectively acts as transient pulses to L3, it seems like all it takes to make this construction possible is to have the attractors in L3 be easily accessible via targeted pulses. If that is the case, can we generalize this architecture to other CPGs? As we will see below, the answers is yes, at least for the CPG all the CPGs we presented in Chapter 4.

5.2. Sequences of molluskan hunting

Here, we offer a proof of concept by using the molluskan hunting network of Section 4.3. Since all attractors of the molluskan hunting network are accessible via changes in initial conditions, or targeted (long or strong) $θ$ -pulses, we are able to use an analogous construction to internally encode a sequence of its attractors.

Indeed, in Figure 5.4A we layered our octahedral network with the same L1 and L2 layers from last section to obtain a network that can encode sequences of attractors in Clione’s swimming direction. We have prescribed the order of the attractors (orange arrows from L1 to L2) in such a way that a transition is always required, so that it is clearly seen that transitions are indeed taking place.

Again, network performs as in last section, as seen in Figure 5.4B. Attractors are labeled by the initials of up/down, left/right and front/back, as before, and it can be seen above the greyscale that they are accessed in the order prescribed by the orange arrows. Only the stimulated direction is making the switch. Also, in the rate curves it can be seen that there are no failed transitions, and this partly thanks to Theorem 12 as well; because now we can make sure to send pulses that are long/strong enough to make the transition happen. It is important here that other nodes in L2 are not overly active, because again by Theorem 12, it could mean that the wrong node is being stimulated. The matrix needed to reproduce the simulation of 5.4B is available in Appendix A, Equation A.11.

This second example shows that this construction is truly versatile, as long as attractors in L3 are easily accessible. This construction also comes with great advantages:

Sequential information is internally encoded.

Since our counter network from Section 3.1 sequentially steps through stable fixed points, we leveraged this sequence of fixed point activations to encode a sequence of pulses as wirings between layers that are to receive uniform pulses, and the CPG layer.

To transition CPG attractors, we simply send a uniform, non-specific pulse to all neurons in the counter layer (and cycle layer) so as to advance the counter, and therefore send a targeted pulse to the next gait down the chain.

This means that pulses are attractor-specific locally in the CPG layer, but not globally, as the counter and cycle layers are the ones encoding the sequential information, and they only require uniform, non-specific pulses. The sequence is therefore truly encoded within the network, and the pulse themselves carry no information about what the next step is. This fact contrasts with other models for sequential control where the information about the order of the sequence is not encoded in the network itself, and so it requires motif-specific inputs to activate each element in the sequence [48,49]. Having the sequence encoded within the network would be advantageous for highly stereotyped patterns, like songbird, choreographed dance or other sequential tasks that reuse motifs.

Sequential information is independently encoded from motor commands.

As mentioned before, the order of the sequence is encoded in the connections from L2 to L3, whereas the dynamic attractors are completely encoded within L3. This means that sequential information is independently encoded from motor commands and so to access an attractor multiple times in the sequence, it suffices to create a new connection from L2 to L3. In Figure 5.3C, for example, both pronk and trot are activated twice in the sequence, but there was no need to add a whole new set of neurons that are re-encoding pronk or trot. That is, we are re-using our attractors in an efficient and flexible manner. This is remarkable property of this construction, as dynamic attractors are usually very sensitive to any interference.

This also implies that to add a new element to the sequence, it is only necessary to add three extra neurons and a few connections between them (an extra 2-clique, one cycle node, and connect this to the appropriate neuron in the five-gait network). This greatly improves efficiency, since the repeated pattern does not have to be stored again, separately into the network. Also, new elements in the sequence do not do not interfere with other elements, controlled by the same network. This is because since the cycle neurons are active one at the time, the new inhibition stemming from the new sequence element will not affect the CPG dynamics whatsoever until the new neuron is active. When the new neuron is active, it will uniformly inhibit everyone in the CPG, except the aux neuron it is connected to. At pulse reception, when the new neuron is active, the jump in neural activity will simply act as pulse to transition the CPG into the appropriate gait. This model can thus generate all kinds of transitions, and the sequence can be as long as desired.

This mechanism is not unlike to how our brain represents complex sequences of movement [47,76]. Also, the structure of our model resembles very much they way in which songbird is encoded, where propagation of syllables is mediated by a synaptic chain of neurons [50]. Also, and although not anatomically separated, similar mechanisms of complex motor sequence generation have also been found in humans’ neocortex [78] and basal ganglia [33].

Timing and sequential information are independent.

Notice also that timing of the sequence is controlled by these external pulses and therefore a sequence’s execution can be sped up or slowed, or rhythm altered altogether, without interfering with attractors or sequence order. Moreover, there is no need to change synaptic time constants, physiological variables, to alter the timing of the sequence. This disassociation also implies efficiency. This is an important characteristic of sequential control in premotor cortex areas and serves, among others, the purpose of independently controlling the speed of movements from the order of execution [44]. Notice also that timing of the sequence is controlled by these external pulses and therefore a sequence’s execution can be sped up or slowed, or rhythm altered altogether, without interfering with attractors or sequence order.

Can this construction be generalized to other CPG networks? We conjecture that the five-gait network could potentially be replaced by any other network that has coexistent attractors, each accessible via changes in initial conditions or specific stimulation of neurons. The mechanism would be the same: the order of the sequence is determined by the connections from the cycle layer (L2) to the multi-attractor network, and any sequential ordering is possible, with as many sequence steps as desired.

From the matrices in Figures 5.3B, it can be seen that these networks are not CTLNs, but rather a TLN whose “layers” are CLTNs. Can we find theoretical grounds to support this seemingly generalizable construction? We should be able to leverage all those zeros!

Chapter 6 | New theoretical results

Why are the networks of Chapter 5 so neat? In this chapter, we provide theoretical explanations on why we see the “fusion attractors” of Chapter 5 arise so seamlessly. First, we introduce some technical background that will help us prove new theoretical results, along with some extra CTLN background theorems. Then, we present new CTLN results generalizing certain structures from [15] (these “certain structures” are reviewed in the technical background section). The new CTLN results are published in [59], and only my contributions to that paper are included here, with only small changes to the presentation. Finally, we generalize some of those results to TLNs. Since the new TLN results are generalizing the CTLN results, we have omitted the proofs for the following CTLN results: Lemma 20, Theorem 21, Lemma 30 and Theorem 31.

6.1. Technical background

A characterization of $FP (W, b)$ that we often use in the proofs that follow, developed in [15], relies on Cramer’s determinants. For any $σ \subseteq [n]$ , define $s_{i}^{σ}$ to be the relevant Cramer’s determinant:

s_{i}^{σ} \overset{def}{=} \det ({(I - W_{σ \cup {i}})}_{i}; b_{σ \cup {i}}), for each i \in [n],

(6.1)

where $det (A_{i}; b)$ denotes the determinant obtained by replacing the $i^{th}$ column of $A$ with the vector $b$ and $({(A_{σ})}_{i}; b_{σ})$ denotes the matrix obtained from the restricted matrix $A_{σ}$ by replacing the column corresponding to the index $i \in σ$ with the restricted vector $b_{σ}$ . Note that $i$ might ot might not be in $σ$ , both are possible.

In [15, Lemma 2], a formula for $s_{k}^{σ}$ was proven that directly connects it to the relevant quantity in the “off"-neuron condition:

s_{k}^{σ} = \sum_{i \in σ} W_{k i} s_{i}^{σ} + θ \det (I - W_{σ}) for any k \in [n] .

(6.2)

Combining this with Cramer’s rule, it was shown that $FP (G)$ can be fully characterized in terms of the signs of the $s_{i}^{σ}$ . It turns out these signs are also connected to the index of a fixed point. For each fixed point of a CTLN $W = W (G, ε, δ)$ , labeled by its support $σ \in FP (G)$ , we define the index as

idx (σ) \overset{def}{=} sgn
det (I - W_{σ}) .

Since we assume our CTLNs are nondegenerate, $(I - W_{σ}) \neq 0$ and thus $idx (σ) \in {\pm 1}$ .

For each fixed point of a TLN labeled by its support $σ \in FP (W, b)$ , we define the def index as $idx (σ) \overset{def}{=} sgn
det (I - W_{σ})$ . In [15] it was shown that $FP (W, b)$ can be fully characterized in terms of the signs of the $s_{i}^{σ}$ , which are also connected to the index of the fixed point. This is the tool we use constantly in this chapter to prove that some support $σ$ belongs to $FP (W, b)$ , so it is worth keeping in mind:

Theorem 13 (sign conditions, [15]).

Let $(W, b)$ be a TLN on n neurons. For any nonempty $σ \subseteq [n]$ ,

σ \in FP (W_{σ}, b_{σ}) \Leftrightarrow sgn s_{i}^{σ} = sgn s_{j}^{σ} f o r a l l i, j \in σ .

In that case we say that $σ$ is permitted (and forbidden otherwise), and $sgn s_{i}^{σ} = sgn \det (I - W_{σ}) = idx (σ)$ for all $i \in σ$ . Furthermore,

σ \in FP (W, b) \Leftrightarrow sgn s_{i}^{σ} = sgn s_{j}^{σ} = - sgn s_{k}^{σ} f o r a l l i, j \in σ, k \notin σ .

Also recall from Chapter 2:

Corollary 3.

Let $(W, b)$ be a TLN on n neurons, and let $σ \subseteq [n]$ . The following are equivalent:

$σ \in FP (W, b)$
$σ \in FP ({W |}_{τ}, {b |}_{τ})$ for all $σ \subseteq τ \subseteq [n]$
$σ \in FP ({W |}_{σ}, {b |}_{σ})$ and $σ \in FP ({W |}_{σ \cup k}, {b |}_{σ \cup k})$ for all $k \notin σ$
$σ \in FP ({W |}_{σ \cup k}, {b |}_{σ \cup k})$ for all $k \notin σ$

Note from the above corollary that a given $σ \subseteq [n]$ might be permitted, but it can be the case that $σ \notin FP (W, b)$ . This is because since $σ$ might not survive the addition of extra neurons to the network. In that case, we say that $σ$ dies in the larger network. To make this distinction formal, we define, for any $σ \subseteq [n]$ , the sets of surviving $(S_{σ})$ and dying fixed points supports $(D_{σ})$ :

S_{σ} \overset{def}{=} FP ({W |}_{σ}, {b |}_{σ}) \cap FP (W, b), and D_{σ} \overset{def}{=} FP ({W |}_{σ}, {b |}_{σ}) ∖ S_{σ} .

It is possible to characterize exactly the fixed points supports of some networks in terms of its dying and surviving fixed points. These networks are the transversal topic of this chapter. To begin exploring the special structures of those, we go back to when the simply-added structure was introduced in [15]:

Definition 14 (simply-added split).

Let $G$ be a graph on $n$ nodes. For any nonempty $ω, τ \subseteq [n]$ such that $ω \cap τ = \emptyset$ , we say $ω$ is simply-added onto $τ$ if for each $j \in ω$ , either $j$ is a projector onto $τ$ , i.e., $j ↛ k$ for all $k \in τ$ , or $j$ is a non-projector onto $τ$ , so $j \to k$ for all $k \in τ$ . In this case, we say that $τ$ is simply-embedded in $G$ , and we say that $(ω, τ)$ is a simply-added split of the subgraph ${G |}_{σ}$ , for $σ = ω \cup τ$ .

This structure is beneficial because when the graph has a simply-added split, the $s_{i}^{σ}$ ’s easily factor, which makes it straightforward to compute their signs and apply Theorem 13.

Theorem 15 ( [15]).

Let $G$ be a graph on $n$ nodes, and let $ω$ , $τ \subseteq [n]$ be such that $ω$ is simply-added to $τ$ . For $σ \subseteq ω \cup τ$ , define $σ_{ω} \overset{def}{=} σ \cap ω$ and $σ_{τ} \overset{def}{=} σ \cap τ$ . Then

s_{i}^{σ} = \frac{1}{θ} s_{i}^{σ_{ω}} s_{i}^{σ_{τ}} = α s_{i}^{σ_{τ}} f o r e a c h i \in τ,

where $α = \frac{1}{θ} s_{i}^{σ_{ω}}$ has the same value for every $i \in τ$ .

Notice that being simply-added is not a bidirectional property, and Theorem 15 only provides factorization for each $i \in τ$ . A factorization that holds for every node requires the simply-added property be bidirectional, i.e.:

Definition 16 (bidirectional simply-added split).

Let $G$ be a graph on $n$ nodes. For any nonempty $ω$ , $τ \subseteq [n]$ such that $[n] = ω \cup τ$ and $ω \cap τ = \emptyset$ , we say that $G$ has a bidirectional simply-added split $(ω, τ)$ if $ω$ is simply-added onto $τ$ and $τ$ is simply-added onto $ω$ . In other words, for all $j \in ω$ , either $j \to k$ for all $k \in τ$ or $j ↛ k$ for all $k \in τ$ , and for all $k \in τ$ , either $k \to j$ for all $j \in ω$ or $k ↛ j$ for all $j \in ω$ .

In this case, the $s_{i}^{σ}$ factor for every $i \in [n]$ ¹. With this structure, we can see exactly how the sets of fixed points supports is formed:

Theorem 17 ( [15]).

Let $G$ be a graph with bidirectional simply-added split $[n] = ω \cup τ$ . For any nonempty $σ \subseteq [n]$ , let $σ = σ_{ω} \cup σ_{τ}$ where $σ_{ω} \overset{def}{=} σ \cap ω$ and $σ_{τ} \overset{def}{=} σ \cap τ$ . Then $σ \in FP (G)$ if and only if one of the following holds:

$σ_{τ} \in S_{τ} \cup {\emptyset}$ and $σ_{ω} \in S_{ω} \cup {\emptyset}$ , or
$σ_{τ} \in D_{τ}$ and $σ_{ω} \in D_{ω}$ .

In other words, $σ \in FP (G)$ if and only if $σ$ is either a union of surviving fixed points $σ_{i}$ , at most one from $ω$ and at most one from $τ$ , or it is a union of dying fixed points, exactly one from $ω$ and one from $τ$ .

In the sections that follow, we generalize this theorem to more than two components, and to TLNs.

Finally, some of the results we use only hold for TLNs with uniform external input $θ$ . That is, $(W, b)$ is such that for every $i \in [n]$ , $b_{i} = θ$ . In this case, we abuse notation and denote the TLN $(W, θ 𝟙)$ by just $(W, θ)$ . The next theorem is one of those results and it relates the magnitudes of the $s_{i}^{σ}$ ’s to $FP (W, θ)$ . This is a TLN version of Rule 4. For this, [15] defines the domination quantity for $j \in [n]$ :

w_{j}^{σ} = \sum_{i \in σ} {\tilde{W}}_{j i} | s_{i}^{σ} |,

where $\tilde{W} = - I + W$ . We say that $k$ dominates $j$ with respect to $σ$ , if $w_{k}^{σ} > w_{j}^{σ}$ . The theorem then states that $σ \in FP (W, θ)$ precisely when these domination quantities are perfectly balanced within $σ$ , so that $σ$ is domination-free, and when every external node $k \notin σ$ is “inside-out” dominated by nodes inside $σ$ :

Theorem 18 (general domination, Theorem 15 in [15]).

Let $(W, θ)$ be a TLN with uniform input, and let $σ \subseteq [n]$ . Then

σ \in FP (W_{σ}, θ) \Leftrightarrow w_{i}^{σ} = w_{j}^{σ} f o r a l l i, j \in σ .

That is, $σ$ is permitted if and only if $σ$ is domination-free. If $σ \in FP (W_{σ}, θ)$ , then $σ \in FP (W, θ)$ if and only if for each $k \notin σ$ , there exists $j \in σ$ such that $w_{j}^{σ} > w_{k}^{σ}$ , i.e. such that $j$ inside-out dominates $k$ .

That is all the extra technical background that we will reference in the proofs below (in addition to that of Ch. 2).

6.2. Simply-embedded CLTN structures

The results in this section are my contribution to [59]. Again, Lemma 20, Theorem 21, Lemma 30 and Theorem 31 are presented with no proof, because the proofs now follow from more general theorems in Section 6.3. Old proofs can be found in the original publication. The exposition and figures below come from that paper, with small modifications.

The aim of this section is to generalize Theorem 11 by further constraining the simply-added architecture. The simply-added structure of cyclic unions was restrictive enough to allow us to completely characterize the fixed point supports of the graph in term of its components. However, it can be relaxed a little further to obtain similar results.

6.2.1. Simply-embedded partitions

We begin by generalizing simply-added splits (Def. 14) and introduce the more general notion of a simply-embedded partition. Recall that given a graph $G$ and a partition of its nodes into two components, ${ω ∣ τ}$ , we say that $τ$ is simply-embedded in $G$ if $ω$ is simply-added onto $τ$ . We can generalize this idea to more components by making every $τ_{i}$ simply-embedded in $G$ :

Definition 19 (simply-embedded partition).

Given a graph $G$ , a partition of its nodes ${τ_{1} | \dots | τ_{N}}$ is called a simply-embedded partition if every $τ_{i}$ is simply-embedded in $G$ . In other words, for each $τ_{i}$ and each $k \notin τ_{i}$ , either $k \to j$ for all $j \in τ_{i}$ or $k ↛ j$ for all $j \in τ_{i}$ .

Notice that the definition is trivially satisfied when there are no $k \notin τ_{i}$ or there is only a single $j \in τ_{i}$ for every $i$ . Thus, every graph has two trivial simply-embedded partitions: one where all the nodes are in one component and one where every node is in its own component. Neither of these partitions is useful for giving information about the structure of $G$ . But when a graph has a nontrivial simply-embedded partition, this structure is sufficient to dramatically constrain the possible fixed point supports of $G$ to unions of fixed points chosen from a menu of component fixed point supports, $FP ({G |}_{τ_{i}})$ . To prove this fact, we need a lemma connecting the $s_{j}^{σ}$ values to the $s_{j}^{σ_{i}}$ values from the component subgraphs.

Lemma 20.

Let $G$ have a simply-embedded partition ${τ_{1} | \dots | τ_{N}}$ , and consider $σ \subseteq [n]$ . Let $σ_{i} \overset{def}{=} σ \cap τ_{i}$ . Then for any $σ_{i} \neq \emptyset$ ,

sgn s_{j}^{σ} = sgn s_{k}^{σ} \Leftrightarrow sgn s_{j}^{σ_{i}} = sgn s_{k}^{σ_{i}}, f o r a l l j, k \in τ_{i} .

Combining this lemma with the sign conditions characterization of fixed point supports (Thm. 13), we get the desired result:

Theorem 21 ( $FP (G)$ menu for simply-embedded partitions).

Let $G$ have a simply-embedded partition ${τ_{1} | \dots | τ_{N}}$ . For any $σ \subseteq [n]$ , let $σ_{i} \overset{def}{=} σ \cap τ_{i}$ . Then

σ \in FP (G) \Rightarrow σ_{i} \in FP ({G |}_{τ_{i}}) \cup {\emptyset} f o r a l l i \in [N] .

In other words, every fixed point support of $G$ is a union of component fixed point supports $σ_{i}$ , at most one per component.

Example 22 (Example 3.1 in [59]).

Consider the component subgraphs shown in Figure 6.1A together with their $FP ({G |}_{τ_{i}})$ . By Theorem 21, any graph $G$ with a simply-embedded partition of these component subgraphs has a restricted menu for $FP (G)$ consisting of the component fixed point supports (the set of all possible supports derived from this menu is shown on the bottom of panel A). Note that an arbitrary graph on 7 nodes could have up to 2⁷ − 1 = 127 possible fixed point supports, but the simply-embedded partition structure narrows the options to only 15 candidate fixed points. Figure 6.1B-E show four possible graphs with simply-embedded partitions of these component subgraphs, together with $FP (G)$ for each of the graphs.

Observe that the graph in Figure 6.1B is a disjoint union of its component subgraphs. For this graph, $FP (G)$ consists of all possible unions of at most one fixed point support per component subgraph (see [15, Thm. 11]). Thus, every choice from the menu provided by Theorem 21 does in fact yield a fixed point for $G$ .

In contrast, the graph in Figure 6.1C is a cyclic union of the component subgraphs. For this graph, $FP (G)$ only has sets that contain a fixed point support from every component, i.e., $σ_{i} \neq \emptyset$ for all $i \in [N]$ (by Thm. 11). Thus, any subset from the menu of Theorem 21 that does not intersect every $τ_{i}$ does not produce a fixed point for $G$ .

Meanwhile, the graph in Figure 6.1D is a simply-embedded partition with heterogeneity in the outgoing edges from a component (notice different nodes in $τ_{1}$ treat $τ_{3}$ differently). $FP (G)$ has a mixture of types of supports: there are some $σ \in FP (G)$ that do not intersect every component, and others that do.

Finally, the graph in Figure 6.1E is another simply-embedded partition with heterogeneity (notice different nodes in $τ_{3}$ treat $τ_{1}$ and $τ_{2}$ differently). However, for this graph, there is a uniform rule for the fixed point supports: every fixed point consists of exactly one fixed point support per component subgraph (identical to $FP (G)$ for the graph in panel C).

Even though Theorem 21 is not sufficient to fully determine $FP (G)$ , it significantly limits the options for fixed point supports (menu). In particular, one direct consequence of Theorem 21 is that if there is some node $j \in τ_{i}$ in $G$ that does not participate in any fixed points of ${G |}_{τ_{i}}$ , then $j$ cannot participate in any fixed point of the full graph $G$ . Thus the supports of all the fixed points of $G$ are confined to $[n] ∖ {j}$ . It turns out that if the removal of node $j$ does not change the fixed points of the component subgraph, i.e. if $FP ({G |}_{τ_{i}}) = FP ({G |}_{τ_{i} ∖ {j}})$ , then we can actually remove $j$ from the full graph $G$ without changing $FP (G)$ . Thus we have the following theorem.

Theorem 23 (removable nodes).

Let $G$ have a simply-embedded partition ${τ_{1} | \dots | τ_{N}}$ . Suppose there exists a node $j \in τ_{i}$ such that $FP ({G |}_{τ_{i}}) = FP ({G |}_{τ_{i} ∖ {j}})$ . Then $FP (G) = FP ({G |}_{[n] ∖ {j}})$ .

Proof. To see that $FP (G) \subseteq FP ({G |}_{[n] ∖ {j}})$ , notice that for all $σ \in FP (G)$ , we have $σ \subseteq [n] ∖ {j}$ by Theorem 21. Then by Corollary 3(2), we must have $σ \in FP ({G |}_{[n] ∖ {j}})$ , and so $FP (G) \subseteq FP ({G |}_{[n] ∖ {j}})$ .

For the reverse containment, we will show that every fixed point in $FP ({G |}_{[n] ∖ {j}})$ survives the addition of node $j$ by appealing to Theorem 13 (sign conditions). There are two cases to consider: $σ_{i} = \emptyset$ and $σ_{i} \neq \emptyset$ , where $j \in τ_{i}$ and $σ_{i} \overset{def}{=} σ \cap τ_{i}$ .

Case 1: $σ_{i} = \emptyset$ . Since $j$ is not contained in the support of any fixed point of ${G |}_{τ_{i}}$ , there must be at least one other node $k$ in $τ_{i}$ , since $FP ({G |}_{τ_{i}})$ cannot be empty. Since $G$ is a simply-embedded partition, we have that $[n] ∖ τ_{i}$ is simply-embedded onto $τ_{i}$ meaning that every node in $τ_{i}$ receives identical inputs from the rest of the graph. Recall from Equation 6.2, that $s_{j}^{σ} = \sum_{ℓ \in σ} W_{j ℓ} s_{ℓ} + θ \det (I - W_{σ})$ . Then since $σ \subseteq [n] ∖ τ_{i}$ , we have that $j$ and $k$ receive identical inputs from $σ$ , so $W_{j ℓ} = W_{k ℓ}$ for all $ℓ \in σ$ , and thus $s_{j}^{σ} = s_{k}^{σ}$ . Since $σ \in FP ({G |}_{[n] ∖ {j}})$ , we have $sgn s_{k}^{σ} = - sgn s_{ℓ}^{σ}$ for all $ℓ \in σ$ by Theorem 13 (sign conditions). Thus, we also have $sgn s_{j}^{σ} = - sgn s_{ℓ}^{σ}$ and $σ$ survives the addition of node $j$ , so $σ \in FP (G)$ .

Case 2: $σ_{i} \neq \emptyset$ . First observe that ${G |}_{[n] ∖ {j}}$ has the same simply-embedded partition structure as $G$ , but with $τ_{i} ∖ {j}$ rather than $τ_{i}$ . Thus $σ \in FP ({G |}_{[n] ∖ {j}})$ implies that $σ \in FP ({G |}_{[n] ∖ {j}})$ by Theorem 21 (menu). By hypothesis, $FP ({G |}_{τ_{i} ∖ {j}}) = FP ({G |}_{τ_{i}})$ , and so $σ_{i} \in FP ({G |}_{τ_{i}})$ . Then by Theorem 13 (sign conditions), since $j \notin σ_{i}$ , we have $sgn s_{j}^{σ_{i}} = - sgn s_{ℓ}^{σ_{i}}$ for all $ℓ \in σ_{i}$ . And by Lemma 20, this ensures $sgn s_{j}^{σ} = - sgn s_{ℓ}^{σ}$ for all $ℓ \in σ_{i}$ . Since $σ \in FP ({G |}_{[n] ∖ {j}})$ , we have that $sgn s_{ℓ}^{σ}$ is identical for all $ℓ \in σ$ , not just $ℓ \in σ_{i}$ , and so $sgn s_{j}^{σ} = - sgn s_{ℓ}^{σ}$ for all $ℓ \in σ$ . Thus by Theorem 13 (sign conditions), $σ$ survives the addition of node $j$ , so $σ \in FP (G)$ . □

Theorem 23 shows that if a node $j$ is locally removable without altering fixed points of its component, then node $j$ is also globally removable without altering the fixed points of the full graph $G$ . This result gives a new tool for determining that two graphs have the same collection of fixed points.

Corollary 24.

Let $G$ have a simply-embedded partition ${τ_{1} | \dots | τ_{N}}$ and suppose there exists $j \in τ_{i}$ such that $FP ({G |}_{τ_{i}}) = FP ({G |}_{τ_{i} ∖ {j}})$ . Let $G'$ be any graph that can be obtained from $G$ by deleting or adding outgoing edges from $j$ to any other component without altering the simply-embedded structure of $G$ . Then $FP (G^{'}) = FP (G)$ .

Proof. Observe that by deleting all the outgoing edges from $j$ to a component $τ_{k}$ , node $j$ has simply changed from a projector onto $τ_{k}$ to a non-projector. Alternatively, by adding all the outgoing edges to $τ_{k}$ , node $j$ switches from being a non-projector onto $τ_{k}$ to being a projector. In either case, $j$ is still simply-added onto $τ_{k}$ , and so $G'$ has the same simply-embedded partition ${τ_{1} | \dots | τ_{N}}$ as $G$ had. Additionally, since no edges within $τ_{i}$ have been altered, we have that $FP ({G^{'} |}_{τ_{i}}) = FP ({G |}_{τ_{i}}) = FP ({G |}_{τ_{i} ∖ {j}}) = FP ({G^{'} |}_{τ_{i} ∖ {j}})$ . Thus both $G$ and $G'$ satisfy the hypotheses of Theorem 23. Moreover, ${G |}_{[n] ∖ {j}} = {G^{'} |}_{[n] ∖ {j}}$ since the only differences between $G$ and $G'$ were in edges involving node $j$ , which has been removed. Thus, by Theorem 23, $FP (G) = FP ({G |}_{[n] ∖ {j}}) = FP (G^{'})$ . □

The result below is new, but fits into the narrative of [59], which is the paper where the above results are published. That is why we include it here.

Theorem 25.

If $τ$ is simply-embedded in $G$ , then if $k$ dominates $j$ in ${G |}_{τ}, FP (G) = FP ({G |}_{[n ∖ j}})$

Proof. If $k$ dominates $j$ in ${G |}_{τ}$ , because $τ$ is simply-embedded, both $k$ and $j$ receive the same inputs from the rest of the graph. Thus, $k$ dominates $j$ with respect to $[n]$ and by Theorem 9 $FP (G) = FP ({G |}_{[n] ∖ {j}})$ . □

6.2.2. Simple linear chains

In this section we add a chain-like architecture to a simply-embedded partition, and term this new architecture simple linear chains:

Definition 26 (simple linear chain).

Let $G$ be a graph with node partition ${τ_{1} | \dots | τ_{N}}$ . We say that $G$ is a simple linear chain if the following two conditions hold:

the only edges between components go from nodes in $τ_{i}$ to $τ_{i + 1}$ , and
for every $j \in τ_{i}$ , either $j \to k$ for every $k \in τ_{i + 1}$ or $j ↛ k$ for every $k \in τ_{i + 1}$ .

A key structural advantage of linear chains is that if $σ_{i} \in FP ({G |}_{τ_{i} \cup τ_{i + 1}})$ , then it turns out that $σ_{i} \in FP (G)$ ; in other words, survival of the addition of the next component is sufficient to guarantee survival in the full network. This occurs because $σ_{i}$ has no outgoing edges to any nodes outside of $τ_{i} \cup τ_{i + 1}$ . Lemma 27 shows that whenever a permitted motif has no outgoing edges to a node $k$ , then it is guaranteed to survive the addition of node $k$ .

Lemma 27.

Let $G$ be a graph on $n$ nodes, let $σ \subseteq [n]$ be nonempty, and $k \in [n] ∖ σ$ . If $i ↛ k$ for all $i \in σ$ , then

σ \in FP ({G |}_{σ \cup {k}}) \Leftrightarrow σ \in FP ({G |}_{σ}) .

In other words, if $σ$ has no outgoing edges to node $k$ then $σ$ is guaranteed to survive the addition of node $k$ whenever $σ$ is permitted.

Proof. For any $j \in σ$ , we have that $j$ inside-out dominates $k$ . Thus by Rule 4c, $σ \in FP ({G |}_{σ \cup {k}})$ if and only if $σ \in FP ({G |}_{σ})$ .

It turns out that the simply-embedded partition structure of the simple linear chain with the added restriction that $τ_{i}$ does not send edges to any $τ_{k}$ other than $τ_{i + 1}$ gives significant structure to the values of $s_{i}^{σ}$ and thus to the domination quantities $w_{j}^{σ}$ . This structure is the key to the proof of the following theorem.

Theorem 28 (simple linear chains).

Let $G$ be a simple linear chain with components $τ_{1}, \dots, τ_{N}$ .

If $σ \in FP (G)$ , then $σ_{i} \in FP ({G |}_{τ_{i}}) \cup {\emptyset}$ for all $i \in [N]$ , where $σ_{i} = σ \cap τ_{i}$ .
Consider a collection ${σ_{i}}_{i \in [N]}$ of $σ_{i} \in FP ({G |}_{τ_{i}}) \cup {\emptyset}$ . If additionally $σ_{i} \in FP ({G |}_{τ_{i} \cup τ_{i + 1}}) \cup {\emptyset}$ for all $i \in [N]$ , then
$\underset{i \in [N]}{\cup} σ_{i} \in FP (G) .$
In other words, $FP (G)$ is closed under unions of component fixed point supports that survive in ${G |}_{τ_{i} \cup τ_{i + 1}}$ .

Proof. (i) follows directly from Theorem 21 by noting that the simple linear chain structure endows $G$ with a simply-embedded partition: for every $τ_{i}$ , the nodes in $τ_{i - 1}$ are each either a projector or non-projector onto $τ_{i}$ , while all nodes outside of $τ_{i - 1}$ are all non-projectors onto $τ_{i}$ .

To prove (ii), consider ${σ_{i}}_{i \in [N]}$ where $σ_{i} \in FP ({G |}_{τ_{i} \cup τ_{i + 1}}) \cup {\emptyset}$ for all $i \in [N]$ . Notice that by Lemma 27, the fact that $σ_{i} \in FP ({G |}_{τ_{i} \cup τ_{i + 1}})$ implies that $σ_{i} \in FP (G)$ since $σ_{i}$ has no outgoing edges to any external node $k$ outside of $τ_{i} \cup τ_{i + 1}$ . Thus, we may assume $σ_{i} \in FP (G) \cup {\emptyset}$ for all $i \in [N]$ . We will prove that this guarantees that $\cup_{i \in [N]} σ_{i} \in FP (G)$ by induction on the number $N$ of components of the simple linear chain.

For $N = 1$ , the result is trivially true. For $N = 2$ , observe that the simple linear chain on ${τ_{1} ∣ τ_{2}}$ actually has the structure of a bidirectional simply-embedded split $(τ_{1}, τ_{2})$ , and thus Theorem 17 gives the complete structure of $FP (G)$ in terms of the surviving fixed points of the component subgraphs $S_{τ_{i}}$ and the dying fixed points $D_{τ_{i}}$ . The sets of interest here, $σ_{i} \subseteq τ_{i}$ with $σ_{i} \in FP (G)$ , are precisely the elements of $S_{τ_{i}}$ . Theorem 17(1) then guarantees that $σ_{1} \cup σ_{2} \in FP (G)$ whenever $σ_{i} \in FP (G)$ , and so the result holds when $N = 2$ .

Now, suppose the result holds for any simple linear chain with $N - 1$ components. For ease of notation, denote $σ_{1 \dots N - 1} \overset{def}{=} σ_{1} \cup \dots \cup σ_{N - 1}$ and let $σ \overset{def}{=} \cup_{i \in [N]} σ_{i}$ . We will show the result holds for any simple linear chain $G$ with $N$ components.

Observe that if $σ_{N} = \emptyset$ , we have $σ = σ_{1 \dots N - 1} \in FP ({G |}_{τ_{1 \dots N - 1}})$ by the inductive hypothesis, and we need only show that this implies that $σ_{1 \dots N - 1} \in FP (G)$ . On the other hand, if $σ_{N} \neq \emptyset$ , then $σ = σ_{1 \dots N - 1} \cup σ_{N}$ , where $σ_{N} \in FP (G)$ by Lemma 27, since $σ_{N} \in FP ({G |}_{τ_{N}})$ and $σ_{N}$ has no outgoing edges to any external nodes outside of $τ_{N}$ . Notice that the simple linear chain structure of $G$ ensures that $(τ_{1 \dots N - 1}, τ_{N})$ is a bidirectional simply-embedded split. Thus by Theorem 17, since $σ_{N}$ is a surviving fixed point support, $σ_{1 \dots N - 1} \cup σ_{N} \in FP (G)$ if and only if $σ_{1 \dots N - 1} \in FP (G)$ . Therefore for any ${σ_{i}}_{i \in [N]}$ , it suffices to show that $σ_{1 \dots N - 1} \in FP (G)$ , and the result will follow.

Notice that by the inductive hypothesis, $σ_{1 \dots N - 1} \in FP ({G |}_{τ_{1 \dots N - 1}})$ , and thus to show $σ_{1 \dots N - 1} \in FP (G)$ , we need only show that $σ_{1 \dots N - 1}$ survives the addition of the nodes in $τ_{N}$ . There are two cases to consider here based on whether $σ_{1 \dots N - 1}$ intersects $τ_{N - 1}$ or not. Observe that if $σ_{1 \dots N - 1} \cap τ_{N - 1} = \emptyset$ , then $σ_{1 \dots N - 1}$ has no outgoing edges to $τ_{N}$ since only nodes in $τ_{N - 1}$ can send edges forward to $τ_{N}$ by the linear chain structure. In this case, we have $i ↛ k$ for all $i \in σ_{1 \dots N - 1}$ and all $k \in τ_{N}$ , and so Lemma 27 guarantees that $σ_{1 \dots N - 1} \in FP (G)$ since we already had $σ_{1 \dots N - 1} \in FP ({G |}_{τ_{1 \dots N - 1}})$ .

For the other case where $σ_{1 \dots N - 1} \cap τ_{N - 1} \neq \emptyset$ , we will prove $σ_{1 \dots N - 1} \in FP (G)$ by appealing to Theorem 18 (general domination) and demonstrating that each $k \in τ_{N}$ is inside-out dominated by some node $j \in σ_{1 \dots N - 1}$ . First notice that $σ_{1 \dots N - 1} = σ_{1 \dots N - 2} \cup σ_{N - 1}$ and by the simple linear chain structure of $G$ , we have that $τ_{1 \dots N - 2}$ is simply-embedded onto $τ_{N - 1}$ . Thus by Theorem 15,

s_{i}^{σ_{1} \dots N - 1} = \frac{1}{θ} s_{i}^{σ_{1} \dots N - 2} s_{i}^{σ_{N - 1}} = α s_{i}^{σ_{N - 1}} for all i \in σ_{N - 1},

(6.3)

where $α = \frac{1}{θ} s_{i}^{σ_{1 \dots N - 2}}$ has the same value for every $i \in σ_{N - 1}$ . Using this, we can now compute the domination quantities $w_{j}^{σ_{1 \dots N - 1}}$ and $w_{k}^{σ_{1} \dots N - 1}$ for $j \in σ_{N - 1}$ and $k \in τ_{N}$ . For $j \in σ_{N - 1}$ , we have:

w_{j}^{σ_{1} \dots N - 1} \overset{def}{=} \sum_{i \in σ_{1 \dots N - 1}} {\tilde{W}}_{j i} | s_{i}^{σ_{1} \dots N - 1} | = \sum_{i \in σ_{1} \dots N - 2} {\tilde{W}}_{j i} | s_{i}^{σ_{1 \dots N - 1}} | + \sum_{i \in σ_{N - 1}} {\tilde{W}}_{j i} | s_{i}^{σ_{1 \dots N - 1}} | = \sum_{i \in σ_{1} \dots N - 2} {\tilde{W}}_{j i} | s_{i}^{σ_{1} \dots N - 1} | + \sum_{i \in σ_{N - 1}} {\tilde{W}}_{j i} | α s_{i}^{σ_{N - 1}} | by (6.3) = \sum_{i \in σ_{1 \dots N - 2}} {\tilde{W}}_{j i} | s_{i}^{σ_{1} \dots N - 1} | + | α | \sum_{i \in σ_{N - 1}} {\tilde{W}}_{j i} | s_{i}^{σ_{N - 1}} | = \sum_{i \in σ_{1 \dots N - 2}} {\tilde{W}}_{j i} | s_{i}^{σ_{1} \dots N - 1} | + | α | w_{j}^{σ_{N - 1}}

On the other hand, for $k \in τ_{N}$ we have the following formula for $w_{k}^{σ_{1 \dots N - 1}}$ , where we use the fact that ${\tilde{W}}_{k i} = - 1 - δ$ for all $i \in σ_{1 \dots N - 2}$ since there are no edges from nodes in $τ_{1 \dots N - 2}$ to $τ_{N}$ :

w_{k}^{σ_{1} \dots N - 1} \overset{def}{=} \sum_{i \in σ_{1} \dots N - 1} {\tilde{W}}_{k i} | s_{i}^{σ_{1} \dots N - 1} | = \sum_{i \in σ_{1 \dots N - 2}} {\tilde{W}}_{k i} | s_{i}^{σ_{1 \dots N - 1}} | + \sum_{i \in σ_{N - 1}} {\tilde{W}}_{k i} | s_{i}^{σ_{1 \dots N - 1}} | = \sum_{i \in σ_{1 \dots N - 2}} (- 1 - δ) | s_{i}^{σ_{1} \dots N - 1} | + \sum_{i \in σ_{N - 1}} {\tilde{W}}_{k i} | α s_{i}^{σ_{N - 1}} | = \sum_{i \in σ_{1 \dots N - 2}} (- 1 - δ) | s_{i}^{σ_{1 \dots N - 1}} | + | α | \sum_{i \in σ_{N - 1}} {\tilde{W}}_{k i} | s_{i}^{σ_{N - 1}} | = \sum_{i \in σ_{1 \dots N - 2}} (- 1 - δ) | s_{i}^{σ_{1} \dots N - 1} | + | α | w_{k}^{σ_{N - 1}} .

Moreover, since $σ_{N - 1} \in FP (G)$ , we have that $j \in σ_{N - 1}$ must inside-out dominate the external node $k$ , so $w_{j}^{σ_{N - 1}} > w_{k}^{σ_{N - 1}}$ . Combining this with the fact that ${\tilde{W}}_{j i} \geq - 1 - δ$ , we see that

w_{k}^{σ_{1} \dots N - 1} \leq \sum_{i \in σ_{1} \dots N - 2} {\tilde{W}}_{j i} | s_{i}^{σ_{1} \dots N - 1} | + | α | w_{k}^{σ_{N - 1}} < \sum_{i \in σ_{1} \dots N - 2} {\tilde{W}}_{j i} | s_{i}^{σ_{1} \dots N - 1} | + | α | w_{j}^{σ_{N - 1}} = w_{j}^{σ_{1 \dots N - 1}}

Thus $w_{j}^{σ_{1 \dots N - 1}} > w_{k}^{σ_{1 \dots N - 1}}$ and so $j$ inside-out dominates $k$ for all $k \in τ_{N}$ . Thus by Theorem 18, $σ_{1 \dots N - 1} \in FP (G)$ , and so $\cup_{i \in [N]} σ_{i} = σ_{1 \dots N - 1} \cup σ_{N} \in FP (G)$ as desired. □

Figure 6.2 illustrates Theorem 28 with an example simple linear chain. By Theorem 28(i), every fixed point support in $FP (G)$ restricts to a fixed point in $FP ({G |}_{τ_{i}})$ . Next consider a collection of $σ_{i}$ such that $σ_{i} \in FP ({G |}_{τ_{i} \cup τ_{i + 1}}) \cup \emptyset$ for all $i \in [N]$ . First observe that each $σ_{i} \in FP ({G |}_{τ_{i} \cup τ_{i + 1}})$ actually survives to the full network, and so $σ_{i} \in FP (G)$ . This is guaranteed because $σ_{i}$ has no outgoing edges to nodes outside of $τ_{i} \cup τ_{i + 1}$ (Rule 4C). Moreover, by Theorem 28(ii), we see that every union of surviving component fixed points yields a fixed point of the full network, but additional fixed point supports are also possible.

6.2.3. Strongly simply-embedded partitions

Recall that the difference between a simply-added split and a bidirectional simply-added split is that, for bidirectional splits, not only $τ$ is simply-embedded in $G$ , but $[n] ∖ τ$ is also simply-embedded in $G$ . To achieve an analogous bidirectionality in the case of simply-embedded partitions we must now additionally require that every $[n] ∖ τ_{i}$ is simply-embedded in $G$ as well. This means that not only $τ_{i}$ is treated the same by the rest of the graph, but it must also treat the rest of the graph the same. We term this new, more rigid, partition structure a strongly simply-embedded partition.

Definition 29 (strongly simply-embedded partition).

Let $G$ be a graph with a partition of its nodes ${τ_{1} | \dots | τ_{N}}$ . The partition is called strongly simply-embedded if for every node $j$ in $G$ , either $j \to k$ for all $k \notin τ_{i}$ or $j ↛ k$ for all $k \notin τ_{i}$ , where $τ_{i}$ is the component containing $j$ .

The simplest examples of graphs with a strongly simply-embedded partition are disjoint unions and clique unions, which are building block constructions first studied in [15]. In a disjoint union of component subgraphs ${G |}_{τ_{1}}, \dots, {G |}_{τ_{N}}$ , there are no edges between components. In this case, every node in $G$ is a non-projector onto the rest of the graph. At the other extreme, a clique union has bidirectional edges between every pair of nodes in different components. In a clique union, every node is a projector onto the rest of the graph.

The key fact that allowed to characterize the fixed points of disjoint and clique unions in [15] was a complete complete factorization of the $s_{j}^{σ}$ values in terms of the $s_{j}^{σ_{i}}$ of the component fixed point supports. But recall that we can only get this complete factorization when the simply-added structure is bidirectional. Strongly simply-embedded partitions now satisfy that $(τ_{i}, [n] ∖ τ_{i})$ is also a bidirectional simply-added split. This means we can prove a result analogous to Theorem 15, but for a partition, and for every $j \in [n]$ . Moreover, the $s_{j}^{σ_{i}}$ values are fully determined by whether $σ_{i}$ is a surviving or a dying fixed point of ${G |}_{τ_{i}}$ . Recall that we denote the sets of surviving and dying fixed points as:

S_{τ_{i}} \overset{def}{=} FP ({G |}_{τ_{i}}) \cap FP (G) and D_{τ_{i}} \overset{def}{=} FP ({G |}_{τ_{i}}) ∖ S_{τ_{i}} .

Lemma 30.

Let $G$ be a graph on n nodes with a strongly simply-embedded partition ${τ_{1} | \dots | τ_{N}}$ . For any $σ \subseteq [n]$ , denote $σ_{i} \overset{def}{=} σ \cap τ_{i}$ , and $σ_{i_{1} \dots k} \overset{def}{=} σ_{i_{1}} \cup \dots \cup σ_{i_{k}}$ and let $I = {i \in [N] ∣ σ_{i} \neq \emptyset}$ . Then for every $j \in [n]$ ,

s_{j}^{σ} = \frac{1}{θ^{| I | - 1}} \prod_{i \in I} s_{j}^{σ_{i}},

where $s_{j}^{σ_{i}}$ has the same value for every $j \in [n] ∖ τ_{i}$ .

Moreover, for any $σ_{i} \in FP ({G |}_{τ_{i}})$ and $j \in τ_{i}$ :

sgn s_{j}^{σ_{i}} = {\begin{array}{l} idx (σ_{i}) & if j \in σ_{i} \\ - idx (σ_{i}) & if j \in τ_{i} ∖ σ_{i} \end{array}

while for any $k \notin τ_{i}$ ,

sgn s_{k}^{σ_{i}} = {\begin{matrix} - idx (σ_{i}) & if σ_{i} \in S_{τ_{i}} \\ idx (σ_{i}) & if σ_{i} \in D_{τ_{i}} \end{matrix}

Similar to simple linear chains, it turns out that strongly simply-embedded partitions also have the property that $FP (G)$ is closed under unions of surviving fixed point supports of the component subgraphs. With the added structure of the strongly simply-embedded partition, though, we can actually say something stronger – $FP (G)$ can be fully determined from knowledge of the component fixed point supports together with knowledge of which of those component fixed points survive in the full network. This complete characterization of $FP (G)$ is given in Theorem 31 below.

Theorem 31.

Suppose $G$ has a strongly simply-embedded partition ${τ_{1} | \dots | τ_{N}}$ , and let $σ_{i} \overset{def}{=} σ \cap τ_{i}$ for any $σ \subseteq [n]$ . Then $σ \in FP (G)$ if and only if $σ_{i} \in FP ({G |}_{τ_{i}}) \cup {\emptyset}$ for each $i \in [N]$ , and either

every $σ_{i}$ is in $FP (G) \cup {\emptyset}$ , or
none of the $σ_{i}$ are in $FP (G) \cup {\emptyset}$ .

In other words, $σ \in FP (G)$ if and only if $σ$ is either a union of surviving fixed points $σ_{i}$ , at most one per component, or it is a union of dying fixed points, exactly one from every component.

This theorem generalizes Theorem 17, characterizing every element of $FP (G)$ in terms of the sets of surviving and dying component fixed points supports, $S_{τ_{i}}$ and $D_{τ_{i}}$ . Notice that in the statement of Theorem 31, all the fixed point supports of type (a) have the form $\cup_{i \in I} σ_{i}$ for $σ_{i} \in S_{τ_{i}}$ and $I \subseteq [N]$ , while those of type (b) have the form $\cup_{i = 1}^{N} σ_{i}$ for $σ_{i} \in D_{τ_{i}}$ .

More generally, though, a strongly simply-embedded partition can have a mix of surviving and dying component fixed points, so that $FP (G)$ has a mix of both type (a) and type (b) fixed point supports. Figure 6.3A gives an example strongly simply-embedded partition, and panel B shows both the set of component fixed point supports, $FP ({G |}_{τ_{i}})$ , and the subset of those that survive to yield fixed points of the full network. Since there are dying fixed points in every component, we see that $FP (G)$ has a mix of both type (a) and type (b) fixed point supports.

Figure 6.3. — (A) A graph with a strongly simply-embedded partition ${τ_{1} | τ_{2} | τ_{3}}$ . Projector nodes are colored brown. (B) (Top) $FP ({G |}_{τ_{i}})$ for each component subgraph together with the supports from each component that survive within the full graph. (Bottom) $FP (G)$ for the strongly simply-embedded partition graph. The first two lines of $FP (G)$ consist of unions of surviving fixed points, at most one per component. The third line gives the fixed points that are unions of dying fixed point supports, exactly one from every component. Figure reproduced from [59].

A special case of a bidirectional simply-added split occurs whenever a graph contains a node that is projector/non-projector onto the rest of the graph. Specifically, since any subset is always simply-added onto a single node $j$ trivially, we see that we have a bidirectional simply-added split $({j}, [n] ∖ {j})$ whenever $j$ is either a projector or a non-projector onto the rest of the graph. Recall that if $j$ is a non-projector onto $[n] ∖ {j}$ , then $j$ has no outgoing edges in $G$ , and so it is a sink. Moreover, we have seen that sinks are the only single nodes that can support fixed points since a singleton ${j}$ is trivially uniform in-degree 0, and thus only survives when it has no outgoing edges, by Theorem 7. Combining this observation with the bidirectional simply-added split for a sink, we see there is certain internal structure that must be present in $FP (G)$ whenever it contains any singleton sets.

Proposition 32.

Let $G$ be a graph such that there is some singleton ${j} \in FP (G)$ . Then for any $σ \in FP (G)$ (with $σ \neq {j}$ ),

If $j \notin σ$ , then $σ \cup {j} \in FP (G)$ ; i.e., $FP (G)$ is closed under unions with singletons.
If $j \in σ$ , then $σ ∖ {j} \in FP (G)$ ; i.e., $FP (G)$ is closed under set differences with singletons.

Proof. First notice that since ${j} \in FP (G)$ , $j$ is a sink in $G$ by Theorem 7 (since a singleton is trivially uniform in-degree 0, and thus survives exactly when it has no outgoing edges), and therefore $({j}, [n] ∖ {j})$ is a bidirectional simply-added split.

To prove (1), suppose $j \notin σ$ . Since $({j}, [n] ∖ {j})$ is a bidirectional simply-added split, Theorem 17 guarantees that $σ \cup {j} \in FP (G)$ if and only if ${j}$ , $σ$ both survive or both die. By assumption, both sets are in $FP (G)$ , so both survive. Thus, $σ \cup {j} \in FP (G)$ .

To prove (2), suppose $j \in σ$ . By Theorem 17, $σ \in FP (G)$ if and only if ${j}, σ ∖ {j}$ both survive or both die. By assumption, ${j} \in FP (G)$ , and so $σ ∖ {j} \in FP (G)$ as well. □

Corollary 33.

Let $G$ be a graph such that $FP (G)$ contains singleton sets ${j_{1}}, {j_{2}}, \dots, {j_{ℓ}}$ , and let $S = {j_{1}, \dots, j_{ℓ}}$ be the set of singletons. Then for any $σ \in FP (G)$ and any $ω \subseteq S$

σ \cup ω \in FP (G) .

Moreover, let $τ = [n] ∖ S$ . Then $FP (G)$ has the direct product structure:

FP (G) \cup {\emptyset} ≅ ({σ \in FP ({G |}_{τ}) ∣ σ \in FP (G)} \cup {\emptyset}) \times P (S),

where $P (S)$ denotes the power set of $S$ . In other words, every fixed point support in $FP (G)$ has the form $σ \cup ω$ where $σ \in FP ({G |}_{τ}) \cup {\emptyset}$ and $ω \subseteq S$ .

Proof. The first statement follows by iterating Proposition 32(1) $| ω |$ times for each of the added singletons in $ω$ . To prove the second statement, we will show that every $ν \in FP (G)$ is the union of a surviving fixed point $σ \subseteq τ$ (or the empty set) with a subset of $S$ (including empty set); moreover, every such union yields a fixed point (other than $\emptyset \cup \emptyset$ ). The direct product structure of $FP (G)$ immediately follows from this decomposition of the fixed point supports. By the first result, we see that every such union is contained in $FP (G)$ . Thus, all that remains to show is that every element of $FP (G)$ is such a union. Let $ν \in FP (G)$ and let $σ = ν \cap τ$ and $ω = ν \cap S$ , so that $ν = σ \cup ω$ . If $σ$ or $ω$ are empty, then we’re done, so suppose both are nonempty. Then we can iteratively apply Proposition 32(2) $| ω |$ times to see that $σ \in FP (G)$ . Thus, every fixed point support arises as a union of some $σ \subseteq τ$ with an arbitrary subset of $S$ , where $σ \in FP (G) \cup {\emptyset}$ (and for every $σ \in FP (G)$ , we have $σ \in FP ({G |}_{τ})$ as well by Corollary 3(2)). □

As an application of Theorem 31, we can immediately recover characterizations of the fixed points of disjoint unions and clique unions previously given in [15, Theorems 11 and 12]. In a disjoint union, every component fixed point support survives to the full network since it has no outgoing edges (by Rule 4: inside-out domination). Thus, for a disjoint union, $FP (G)$ consists of all the fixed points of type (a) from Theorem 31: unions of (surviving) component fixed points $σ_{i}$ , at most one per component. In contrast, in a clique union, every component fixed point support dies in the full network since it has a target that outside-in dominates it (in fact, every node outside of $τ_{i}$ is a target of any subset of $τ_{i}$ ). Thus, for a clique union, $FP (G)$ consists of all the fixed points of type (b): unions of (dying) component fixed points $σ_{i}$ , exactly one from every component. Both the disjoint union and clique union characterizations of $FP (G)$ [15, Theorems 11 and 12] are now immediate corollaries of Theorem 31.

Corollary 34.

Let $G$ be a graph with partition ${τ_{1} | \dots | τ_{N}}$ .

If $G$ is a disjoint union of ${G |}_{τ_{1}}, \dots, {G |}_{τ_{N}}$ , then $σ \in FP (G)$ if and only if $σ_{i} \in FP ({G |}_{τ_{i}}) \cup {\emptyset}$ for all $i \in [N]$ .
If $G$ is a clique union of ${G |}_{τ_{1}}, \dots, {G |}_{τ_{N}}$ , then $σ \in FP (G)$ if and only if $σ_{i} \in FP ({G |}_{τ_{i}})$ for all $i \in [N]$ .

6.3. Layered CTLNs

The results here are inspired by the sequential construction of Chapter 5. Our goal is to provide theoretical explanations for why the networks of Chapter 5 work so well. Leaving the world of CTLNs, we allowed off-diagonal blocks to not be strictly constrained by a graph. Consequently, the networks in Chapter 5 are best described as layered CTLNs, because the diagonal blocks are CTLNs but the off-diagonal blocs are not. Here we derive Theorem 31 for TLNs in general, not only layered CTLNs. The diagonal blocks, or layers, can also be TLNs. It is convenient to choose CTLNs as the layers, because we have plenty of results to build in the desired attractors.

Throughout the chapter, TLNs with a vector $b$ of heterogeneous (but constant in time) inputs are denoted as $(W, b)$ , as usual. When the input is also constant across neurons, we abuse notation and denote it as $(W, θ)$ instead of $(W, 𝟙 θ)$ . We start by showing that any node $k$ with non-positive input $b_{k} \leq 0$ cannot be involved in any fixed points of the network. This will allow us to restrict, moving forward, to networks where all inputs are positive.

Definition 35.

We say that $(W, b)$ is a competitive TLN on $n$ neurons if $W_{i j} \leq 0$ and $W_{i i} = 0$ for all $i$ , $j \in [n]$ .

Attention! This definition is different from the one in [15], which requires $b_{i} \geq 0$ for all $i \in [n]$ too.

Lemma 36.

Let $(W, b)$ be a competitive non-degenerate TLN on $n$ neurons, and suppose that for some $k \in [n]$ we have $b_{k} \leq 0$ . Then, $x^{*}$ is a fixed point of $(W, b)$ if and only if $x_{k}^{*} = 0$ and ${\hat{x}}^{*}$ is a fixed point of $(W_{[n] ∖ {k}}, {b |}_{[n] ∖ {k}})$ , where ${\hat{x}}^{*} : = {x^{*} |}_{[n] ∖ {k}}$ .

Proof. Denote $\hat{W} = W_{[n] ∖ {k}}, \hat{b} = {b |}_{[n] ∖ {k}}$ and let ${\hat{x}}^{*} : = {x^{*} |}_{[n] ∖ {k}}$ .

(⇒) Suppose $x^{*}$ is a fixed point of $(W, b)$ . We have that for all $i \in [n]$ , ${\frac{d x_{i}}{d t} |}_{x = x^{*}} = 0$ , and thus

x_{i}^{*} = {[\sum_{j \in [n]} W_{i j} x_{j}^{*} + b_{i}]}_{+} .

Since $b_{k} \leq 0$ and $W_{i j} \leq 0$ , we obtain $x_{k}^{*} = {[\sum_{j \in [n]} W_{i j} x_{j} + b_{k}]}_{+} = 0$ , as desired.

Now let’s see that ${\hat{x}}^{*}$ is a fixed point of $(\hat{W}, \hat{b})$ . Indeed, for all $i \neq k$ we have that

{\frac{d x_{i}}{d t} |}_{x = {\hat{x}}^{*}} = - {\hat{x}}_{i}^{*} + {[\sum_{j \in [n] ∖ {k}} {\hat{W}}_{i j} {\hat{x}}_{j}^{*} + {\hat{b}}_{i}]}_{+} = - x_{i}^{*} + {[\sum_{j \in [n] ∖ {k}} W_{i j} x_{j}^{*} + b_{i}]}_{+} = - x_{i}^{*} + {[\sum_{j \in [n]} W_{i j} x_{j}^{*} + b_{i}]}_{+} = {\frac{d x_{i}}{d t} |}_{x = x^{*}} = 0

where the second equality follows from the definition of $\hat{W}$ , $\hat{b}$ , $\hat{x}$ and the third one because $x_{k}^{*} = 0$ .

$(\Leftarrow)$ Suppose ${\hat{x}}^{*}$ is a fixed point of $(\hat{W}, \hat{b})$ and $x_{k}^{*} = 0$ . Then we have that for all $i \neq k$ , ${\frac{d x_{i}}{d t} |}_{x = {\hat{x}}^{*}} = 0$ . Let’s see that $x^{*}$ is a fixed point of $(W, b)$ . Indeed,

{\frac{d x_{i}}{d t} |}_{x = x^{*}} = - x_{i}^{*} + {[\sum_{j \in [n]} W_{i j} x_{j}^{*} + b_{i}]}_{+} = - x_{i}^{*} + {[\sum_{j \in [n] ∖ k}} W_{i j} x_{j}^{*} + b_{i}]}_{+} = - {\hat{x}}_{i}^{*} + {[\sum_{j \in [n] ∖ {k}} {\hat{W}}_{i j} {\hat{x}}_{j}^{*} + {\hat{b}}_{i}]}_{+} = {\frac{d x_{i}}{d t} |}_{x = {\hat{x}}^{*}} = 0

where the second equality once again follows from the assumption that $x_{k}^{*} = 0$ , and the third equality from definition of $\hat{W}$ , $\hat{b}$ , $\hat{x}$ . □

We then easily see that neurons with negative input do not participate of any fixed point support:

Corollary 37.

Let $(W, b)$ be a competitive non-degenerate TLN. If $b_{k} \leq 0$ for some $k \in [n]$ , then $FP (W, b) = FP (W_{[n] ∖ {k}}, {b |}_{[n] ∖ {k}})$ . In particular, $k \notin σ$ for all $σ \in FP (W, b)$ .

Proof. Denote $\hat{W} = W_{[n] ∖ {k}}, \hat{b} = {b |}_{[n] ∖ {k}}$ and let ${\hat{x}}^{*} : = {x^{*} |}_{[n] ∖ {k}}$ . Observe that

$σ \in FP (W, b)$ $\Leftrightarrow$ there exists a fixed point $x^{*}$ of $(W, b)$ such that $σ = supp (x^{*})$

$\Leftrightarrow x_{k}^{*} = 0$ , and ${\hat{x}}^{*}$ is a fixed point of $(\hat{W}, \hat{b})$

$\Leftrightarrow k \notin σ$ , and $σ \in FP (\hat{W}, \hat{b})$ .

Where the second equivalence follows from Lemma 36. Thus, $FP (W, b) = FP (\hat{W}, \hat{b})$ and $k \notin σ$ for all $σ \in FP (W, b)$ , as desired. □

The above lemma explains the fusion attractors we saw in last chapter.

Examples from Chapter 5.

We begin by analyzing the network from Section 5.1, the sequential control of gaits, whose connectivity matrix is reproduced in Figure 6.4. We focus on deriving $FP (W, θ)$ outside of pulse times, which is precisely when the network settles in the attractors we observe. Since outside of pulses, $θ_{2} = 0$ , by Lemma 37, $FP (W, θ) = FP ({W |}_{L_{1} \cup L_{3}}, {b |}_{L_{1} \cup L_{3}})$ . But

graphic file with name nihpp-2410.11012v1-f0049.jpg

(6.4)

and thus layers are decoupled and it is now easy to see that attractors from $L_{1}$ and $L_{2}$ will be preserved, since they do not really affect each other’s dynamics.

Analogously, the same mechanism is doing the work in the network of Section 5.2, as once again we have $FP (W, θ) = FP ({W |}_{L_{1} \cup L_{3}}, {b |}_{L_{1} \cup L_{3}})$ , with $W$ as in Equation 6.4. Layers $L_{1}$ and $L_{3}$ are transiently connected during pulse times through $L_{2}$ , which allows $L_{1}$ to communicate its attractor transitions to $L_{3}$ . Once the transition is communicated, and pulse ends, the network will settle in the chosen attractor. Neat.

The results in this section were inspired by a combination of the networks of Chapter 5, and past results (in Secs. 6.1 and 6.2). One of the things the proofs have in common is the following simple linear algebra lemma, which allow us to factor determinants whose top right block is rank 1:

Lemma 38.

Let

graphic file with name nihpp-2410.11012v1-f0050.jpg

be an $(m + n - 1) \times (m + n - 1)$ matrix, consisting of the blocks $A (m \times m)$ , $B (m \times n)$ , $C (n \times m)$ and $D (n \times n)$ , where adjacent blocks overlap in one row/column. Note that $a_{m, m} = b_{m, 1} = c_{1 m} = d_{1, 1}$ is the single entry where all four matrices overlap. If $B$ or $C$ is rank 1 and $a_{m, m} \neq 0$ , then

\det M = \frac{1}{a_{m, m}} \det A \det D .

Proof. Let

A = [\begin{matrix} a_{1, 1} & \dots & a_{1, m} \\ ⋮ & ⋮ \\ a_{m, 1} & \dots & a_{m, m} \end{matrix}], B = [\begin{matrix} b_{1, 1} & \dots & b_{1, n} \\ ⋮ & ⋮ \\ b_{m, 1} & \dots & b_{m, n} \end{matrix}], C = [\begin{matrix} c_{1, 1} & \dots & c_{1, m} \\ ⋮ & \dots & ⋮ \\ c_{n, 1} & \dots & c_{n, m} \end{matrix}], D = [\begin{matrix} d_{1, 1} & \dots & d_{m, n} \\ ⋮ & \dots & ⋮ \\ d_{n, 1} & \dots & d_{n, n} \end{matrix}] .

Note that

[\begin{matrix} a_{1, m} \\ ⋮ \\ a_{m, m} \end{matrix}] = [\begin{matrix} b_{1, 1} \\ ⋮ \\ b_{m, 1} \end{matrix}], [\begin{matrix} c_{1, m} \\ ⋮ \\ c_{n, m} \end{matrix}] = [\begin{matrix} d_{1, 1} \\ ⋮ \\ d_{n, 1} \end{matrix}], [\begin{matrix} b_{m, 1} \dots b_{m, n} \end{matrix}] = [\begin{matrix} d_{1, 1} \dots d_{1, n} \end{matrix}]

and $[\begin{matrix} c_{1, 1} \dots c_{1, m} \end{matrix}] = [\begin{matrix} a_{m, 1} \dots a_{m, m} \end{matrix}]$ . Assume first that $B$ is rank 1, then we can take all columns of $B$ to be multiples of its first column. Let $β_{k}$ be such that

[\begin{matrix} b_{1, k} \\ ⋮ \\ b_{m, k} \end{matrix}] = β_{k} [\begin{matrix} b_{1, 1} \\ ⋮ \\ b_{m, 1} \end{matrix}],

with $β_{1} = 1$ . Subtracting $β_{k}$ times the $m$ -th column of $M$ from its $k$ -th column, we get

graphic file with name nihpp-2410.11012v1-f0051.jpg

because $[\begin{matrix} b_{m, 1} \dots b_{m, n} \end{matrix}] = [\begin{matrix} b_{m, 1} β_{2} b_{m, 1} \dots β_{n} b_{m, 1} \end{matrix}] = [\begin{matrix} d_{1, 1} \dots d_{1, n} \end{matrix}]$ and $b_{m, 1} = a_{m, m}$ . If C is rank 1, the proof is similar. □

This determinant lemma is a key technical tool for proofs in this section, as well as in [15,59], but also in Section 6.2. The fact that allows us to get a rank 1 block here, and in the past, is the simply-embeddedness of a given component network. This translates into said component receiving certain uniform inputs from other neurons. Below we make this notion precise by extending Definition 39 beyond CTLNs to apply more generally to TLNs:

Definition 39.

Let $(W, b)$ be a competitive non-degenerate TLN. We say that $τ \subseteq [n]$ is simply-embedded in the network if for every $j \notin τ$ , $W_{i j} = W_{k j} = γ_{j}$ for all $i$ , $k \in τ$ . That is, if $τ = {1, \dots, ℓ}$ and $[n] ∖ τ = {ℓ + 1, \dots, n}$ , then:

graphic file with name nihpp-2410.11012v1-f0052.jpg

(6.5)

With this definition, we can readily obtain the TLN version of Theorem 15, key in the proofs of [15,59], and in this section. The reasoning is as follows: if we can factor the $s_{i}$ ’s, we can inherit their signs to the component networks, if we can inherit their signs to the component networks then we can compare these signs to asses who is a fixed point support of the whole and parts via Theorem 13. This is why the next lemma is so important.

Theorem 40.

Let $τ$ be simply-embedded in $(W, b)$ and $b_{i} = θ$ for $i \in τ$ . Let $ω = [n] ∖ τ$ . Then for any $σ \subseteq [n]$ , we have

s_{i}^{σ} = \frac{1}{θ} s_{i}^{σ \cap ω} s_{i}^{σ \cap τ} = α s_{i}^{σ \cap τ} f o r e a c h i \in τ,

where $α = \frac{1}{θ} s_{i}^{σ \cap ω}$ has the same value for every $i \in τ$ .

Proof. Since $τ$ is simply-embedded with uniform input, $W$ has the form

graphic file with name nihpp-2410.11012v1-f0053.jpg

where $𝟙_{τ}$ is a column vector of all ones of size $| τ |$ , and $γ^{⊤} = [\begin{matrix} γ_{1} \dots γ_{m} \end{matrix}]$ so that

𝟙_{τ} γ^{⊤} = [\begin{matrix} γ_{1} & γ_{m} \\ ⋮ & \dots & ⋮ \\ γ_{1} & γ_{m} \end{matrix}] .

Let $i \in τ$ , then

graphic file with name nihpp-2410.11012v1-f0054.jpg

Since the upper right block $𝟙_{σ \cap τ} [\begin{matrix} θ γ_{1} \dots γ_{m} \end{matrix}]$ , is rank 1 and $θ \neq 0$ , by Lemma 38,

graphic file with name nihpp-2410.11012v1-f0055.jpg

note that the last matrix, equal to $α = \frac{s_{i}^{σ \cap ω}}{θ}$ , does not depend on $i$ . □

With the factorization, we can now see how the signs of the $s_{i}$ ’s are inherited to simply-embedded components:

Lemma 41.

Suppose $τ_{1}, \dots, τ_{N}$ is a partition of the vertices of a TLN $(W, θ)$ , with $θ$ constant input. For any $σ \subseteq [n]$ , let $σ_{ℓ} \overset{def}{=} σ \cap τ_{ℓ}$ . If for every $ℓ \in [N]$ we have $τ_{ℓ}$ simply-embedded in $(W, θ)$ , then for any $σ_{i} \neq \emptyset$ ,

sgn s_{i}^{σ} = sgn s_{j}^{σ} \Leftrightarrow sgn s_{i}^{σ_{ℓ}} = sgn s_{j}^{σ_{ℓ}}, f o r a l l i, j \in τ_{ℓ} .

Proof. Since each $τ_{ℓ}$ is simply-embedded in $(W, θ)$ , and input is assumed to be uniform across all components, by Theorem 40 we have $s_{i}^{σ} = α s_{i}^{σ_{ℓ}}$ for all $i \in τ_{ℓ}$ , where $α = \frac{1}{θ} s_{i}^{σ ∖ τ_{ℓ}}$ has the same value for every $i \in τ_{ℓ}$ . Hence, for all $i$ , $j \in τ_{ℓ}$ , we have that $sgn s_{i}^{σ} = sgn s_{j}^{σ}$ if and only if $sgn α s_{i}^{σ_{ℓ}} = sgn α s_{j}^{σ_{ℓ}}$ if and only if $sgn s_{i}^{σ_{ℓ}} = sgn s_{j}^{σ_{ℓ}}$ . □

With the inheritance, we can now start to see how the supports of the parts relate to supports of the whole. The next theorem generalizes Theorem 1.4 in [59] for TLNs with a simply-embedded structure, but constant input. Now we know that a fixed point of the whole network must have come from supports of the component parts.

Theorem 42 (TLN version of Theorem 1.4 in [59]).

Let $(W, θ)$ be composed of layers $τ_{1}, \dots, τ_{N}$ , such that each layer $τ_{ℓ}$ is simply-embedded in the network, with $θ$ uniform input. For any $σ \subseteq [n]$ , let $σ_{ℓ} \overset{def}{=} σ \cap τ_{ℓ}$ . Then

σ \in FP (W, θ) \Rightarrow σ_{ℓ} \in FP (W_{τ_{ℓ}}, θ) \cup {\emptyset} f o r a l l ℓ \in [N] .

In other words, every fixed point support of $(W, θ)$ is a union of component fixed point supports $σ_{ℓ}$ , at most one per component.

Proof. For $σ \in FP (W, θ)$ , we have

sgn s_{i}^{σ} = sgn s_{j}^{σ} = - sgn s_{k}^{σ}

for any $i$ , $j \in τ_{ℓ}$ and $k \in τ_{ℓ} ∖ σ_{ℓ}$ , by Theorem 13 (sign conditions). Then by Lemma 41, we see that whenever $σ_{ℓ} \neq \emptyset$ ,

sgn s_{i}^{σ_{ℓ}} = sgn s_{j}^{σ_{ℓ}} = - {sgn}_{k}^{σ_{ℓ}},

and so $σ_{ℓ}$ satisfies the sign conditions in $(W_{τ_{ℓ}}, θ)$ . Thus $σ_{ℓ} \in FP (W_{τ_{ℓ}}, θ)$ for every nonempty $σ_{ℓ}$ . □

The next lemma generalizes Lemma 30 to TLNs, the structure below is the analog of a strongly simply-embedded partition. Recall that this lemma assumed several simply-embeddednesses to obtain a full factorization of the $s_{i}$ ’s. Also recall the sets of surviving $(S_{σ} = FP ({W |}_{σ}, {b |}_{σ}) \cap FP (W, b))$ and dying fixed points supports $(D_{σ} = FP ({W |}_{σ}, {b |}_{σ}) ∖ S_{σ})$ .

Lemma 43.

Let $(W, θ)$ be a TLN on n nodes with uniform input. And suppose that for $j \in [n]$ , if $j \in τ_{ℓ}$ , we have $W_{i j} = γ_{j}$ for all $i \notin τ_{ℓ}$ . That is, if $τ_{j} = {t_{j - 1} + 1, \dots, t_{j}}$ , $W$ has the form

graphic file with name nihpp-2410.11012v1-f0056.jpg

(6.6)

For any $σ \subseteq [n]$ , denote $σ_{ℓ} \overset{def}{=} σ \cap τ_{ℓ}$ , and $σ_{ℓ_{1} \dots ℓ_{k}} \overset{def}{=} σ_{ℓ_{1}} \cup \dots \cup σ_{ℓ_{k}}$ and let $I = {ℓ \in [N] ∣ σ_{ℓ} \neq \emptyset}$ . Then for every $i \in [n]$ ,

s_{i}^{σ} = \frac{1}{θ^{| I | - 1}} \prod_{ℓ \in I} s_{i}^{σ_{ℓ}},

where $s_{i}^{σ_{ℓ}}$ has the same value for every $i \in [n] ∖ τ_{ℓ}$ . Moreover, for any $σ_{ℓ} \in FP ({G |}_{τ_{ℓ}})$ and $j \in τ_{ℓ}$ :

sgn s_{j}^{σ_{ℓ}} = {\begin{array}{l} idx (σ_{ℓ}) & i f j \in σ_{ℓ} \\ - idx (σ_{ℓ}) & i f j \in τ_{ℓ} ∖ σ_{ℓ} \end{array}

while for any $k \notin τ_{ℓ}$ ,

sgn s_{k}^{σ_{ℓ}} = {\begin{array}{l} - idx (σ_{ℓ}) & i f σ_{ℓ} \in S_{τ_{ℓ}} \\ idx (σ_{ℓ}) & i f σ_{ℓ} \in D_{τ_{ℓ}} \end{array}

Proof. Since $τ_{1}$ is simply-embedded in $[n] ∖ τ_{1}$ ,

s_{i}^{σ} = \frac{1}{θ} s_{i}^{σ_{2} \dots N} s_{i}^{σ_{1}} for all i \in τ_{1}

by Theorem 40. On the other hand, since $[n] ∖ τ_{1}$ is also simply-embedded, we also have

s_{i}^{σ} = \frac{1}{θ} s_{i}^{σ_{1}} s_{i}^{σ_{2} \dots N} for all i \in [n] ∖ τ_{1} .

Therefore, the above factorization holds for all $i \in [n]$ . Similarly, since both $τ_{2}$ and $[n] ∖ τ_{2}$ are simply-embedded,

s_{i}^{σ_{2} \dots N} = \frac{1}{θ} s_{i}^{σ_{2}} s_{i}^{σ_{3} \dots N} for all i \in [n]

by Theorem 40, and so $s_{i}^{σ} = \frac{1}{θ^{2}} s_{i}^{σ_{1}} s_{i}^{σ_{2}} s_{i}^{σ_{3 \dots N}}$ . Continuing in this fashion, we see that for any $i \in [n]$ ,

s_{i}^{σ} = \frac{1}{θ^{N - 1}} s_{i}^{σ_{1}} \dots s_{i}^{σ_{N}} .

Note that if $σ_{ℓ} = \emptyset$ , then $s_{i}^{σ_{ℓ}} = s_{i}^{\emptyset} = s_{i}^{{j}} = θ$ , and thus for all $i \in [n]$ ,

s_{i}^{σ} = \frac{θ^{N - | I |}}{θ^{N - 1}} \prod_{i \in I} s_{i}^{σ_{ℓ}} = \frac{1}{θ^{| I | - 1}} \prod_{i \in I} s_{i}^{σ_{ℓ}} .

The fact that $s_{i}^{σ_{ℓ}}$ has the same value for every $i \in [n] ∖ τ_{ℓ}$ is a direct consequence of Theorem 40.

Finally, to prove the last statements about the signs of $s_{i}^{σ_{ℓ}}$ , observe that for $i \in τ_{ℓ}$ , the values of $sgn s_{i}^{σ_{ℓ}}$ are fully determined by Theorem 13 (sign conditions) since $σ_{ℓ} \in FP ({G |}_{τ_{ℓ}})$ by hypothesis. In particular, if $σ_{ℓ} \in S_{τ_{ℓ}}$ , then $σ_{ℓ}$ survives the addition of every $k \notin τ_{ℓ}$ , and so $sgn s_{k}^{σ_{ℓ}} = idx (σ_{ℓ})$ by Theorem 13 (sign conditions). On the other hand, if $σ_{ℓ} \in D_{τ_{ℓ}}$ then $σ_{ℓ}$ dies in $G$ and so there is some $k \notin τ_{ℓ}$ for which $sgn s_{k}^{σ_{ℓ}} = - idx (σ_{ℓ})$ . But by the first part of the theorem, all the $s_{k}^{σ_{ℓ}}$ values are identical for $k \in [n] ∖ τ_{ℓ}$ , and thus $sgn s_{k}^{σ_{ℓ}} = - idx (σ_{ℓ})$ for all such $k$ .

Finally, we arrive to the main theorem of last section, Theorem 31, TLN version. This is the big result. It generalizes Theorem 31 from [59], and also Theorem 17 from [15]. The proof is a generalized version of what it was in both of those publications.

Theorem 44.

Let $(W, b)$ be such that

graphic file with name nihpp-2410.11012v1-f0057.jpg

(6.7)

and $b = θ 𝟙$ . Let $σ_{i} \overset{def}{=} σ \cap τ_{i}$ for any $σ \subseteq [n]$ . Then $σ \in FP (W, θ)$ if and only if $σ_{i} \in FP (W_{τ_{i}}, θ) \cup {\emptyset}$ for each $i \in [N]$ , and either

every $σ_{i}$ is in $FP (W, θ) \cup {\emptyset}$ , or
none of the $σ_{i}$ are in $FP (W, θ) \cup {\emptyset}$ .

In other words, $σ \in FP (W, θ)$ if and only if $σ$ is either a union of surviving fixed points $σ_{i}$ , at most one per component, or it is a union of dying fixed points, exactly one from every component.

Proof. Note that $(W, θ)$ satisfies the conditions for Lemma 43 above, and thus, assuming def without loss of generality that $θ = 1$ , we have that for $I \overset{def}{=} {i ∣ σ_{i} \neq \emptyset}$ and $σ \subseteq [n]$ ,

s_{j}^{σ} = \prod_{i \in I} s_{j}^{σ_{i}},

with $s_{j}^{σ_{i}}$ is constant across $j \in [n] ∖ τ_{i}$ for each $i \in [N]$ .

$(\Rightarrow)$ Suppose $σ \in FP (W, θ)$ . By Lemma 43, $sgn s_{i}^{σ} = \prod_{i \in I} sgn s_{j}^{σ_{i}}$ . Next, by Theorem 42,

6.3.

$σ_{i} \in FP (W_{τ_{i}}, θ)$ for every $i \in I$ , and by the sgn formula of Lemma 43. Denote by $S \overset{def}{=} {a \in I ∣ σ_{a} \in S_{a}}$ . For any $j \in σ$ , there exists $i \in I$ such that $j \in σ_{i}$ , then we have

sgn s_{j}^{σ} = idx (σ_{i}) \prod_{a \in S ∖ {i}} - idx (σ_{a}) \prod_{b \in S^{c} ∖ {i}} idx (σ_{b}) = {(- 1)}^{| S ∖ {i} |} \prod_{ℓ \in I} idx (σ_{ℓ}),

(6.8)

Now, observe that if $σ$ contained a mix of $σ_{a} \in S_{a}$ and $σ_{b} \in D_{b}$ , then there would be $i$ , $j \in σ$ such that $i \in σ_{a}$ for some $a \in S$ , while $j \in σ_{b}$ for some $b \notin S$ and thus $sgn s_{i}^{σ} = {(- 1)}^{| S | - 1} Π_{ℓ \in I} idx σ_{ℓ}$ and $sgn s_{j}^{σ} = {(- 1)}^{| S |} \prod_{ℓ \in I} idx σ_{ℓ} = - sgn s_{i}^{σ}$ . But this contradicts the fact that $σ \in FP (W, θ)$ by Theorem 13 (since we assumed $i$ , $j \in σ$ ). Thus, we must have either $σ_{i} \in S_{τ_{i}}$ for all $i \in I$ , as in (a), or $σ_{i} \in D_{τ_{i}}$ for all $i \in I$ as in (b).

Now to see that when $σ_{i} \in D_{τ_{i}}$ for all $i \in I$ , we must have $I = [N]$ , suppose to the contrary that $I ⊊ [N]$ so that there is some $m \in [N]$ such that $τ_{m} \cap σ = \emptyset$ . Then, for $k \in τ_{m} (so k \notin σ)$ , we have $sgn s_{k}^{σ_{ℓ}} = idx (σ_{ℓ})$ for all $ℓ \in I$ , by Lemma 43, since $σ_{ℓ} \in D_{τ_{ℓ}}$ . Thus $sgn s_{k}^{σ} = \prod_{ℓ \in I} idx (σ_{ℓ})$ .

On the other hand, if $j \in σ$ , with $j \in τ_{i}$ , we have

sgn s_{j}^{σ} = {(- 1)}^{| S ∖ {i} |} \prod_{ℓ \in I} idx (σ_{ℓ}) = \prod_{ℓ \in I} idx (σ_{ℓ}) .

Where the first equality follows from Equation 6.8, and the second because $S = \emptyset$ in this case.

6.3.

But then ${sgn}_{k}^{σ} = {sgn}_{j}^{σ}$ for $k \notin σ$ and $j \in σ$ , which contradicts $σ \in FP (W, θ)$ by Theorem 13. Thus $I = [N]$ when $σ_{ℓ} \in D_{ℓ}$ for all $ℓ \in I$ .

$(\Leftarrow)$ Suppose (a) holds and so $σ_{i} \in S_{τ_{i}}$ for all $i \in I$ . Let us check the sign conditions for $σ \overset{def}{=} \underset{i \in I}{\cup} σ_{i}$ .

For any $j \in σ$ , there exists $i \in I$ such that $j \in τ_{i}$ . Then by Equation 6.8, we have

sgn s_{j}^{σ} = {(- 1)}^{| S ∖ {i} |} \prod_{ℓ \in I} idx (σ_{ℓ}) = {(- 1)}^{| I | - 1} \prod_{ℓ \in I} idx σ_{ℓ},

since $S = I$ in this case. On the other hand, for $k \notin σ$ , we have $sgn s_{k}^{σ_{ℓ}} = - idx σ_{ℓ}$ for all $ℓ \in I$ , by Lemma 43, since $σ_{ℓ} \in S_{τ_{ℓ}}$ . Thus

sgn s_{k}^{σ} = \prod_{ℓ \in I} (- idx σ_{ℓ}) = {(- 1)}^{| I |} \prod_{ℓ \in I} idx σ_{ℓ} = - sgn s_{j}^{σ} .

Therefore $σ \in FP (W, θ)$ by Theorem 13 (sign conditions).

Next, suppose (b) holds so $σ_{ℓ} \in D_{τ_{ℓ}}$ for all $ℓ \in [N]$ (so $I = [N]$ ). Then for any $j \in σ$ ,

6.3.

there is $i \in [N]$ such that $j \in σ_{i}$ and by Equation 6.8, we have

sgn s_{j}^{σ} = {(- 1)}^{| S ∖ {i} |} \prod_{ℓ \in [N]} idx (σ_{ℓ}) = \prod_{ℓ \in [N]} idx (σ_{ℓ}),

since $S = \emptyset$ .

Now, let $k \notin σ$ , with $k \in τ_{m}$ . Since $σ_{m} \in FP (W_{τ_{m}}, θ)$ with $σ_{m} \neq \emptyset$ (because $I = [N]$ ), we have $sgn s_{k}^{σ_{m}} = - idx (σ_{m})$ and thus

sgn s_{k}^{σ} = sgn s_{k}^{σ_{m}} \prod_{ℓ \in [N ∖ {m}} sgn s_{k}^{σ_{ℓ}} = - idx (σ_{m}) \prod_{ℓ \in [N] ∖ {m}} idx (σ_{ℓ}) = - \prod_{ℓ \in [N]} idx (σ_{ℓ}) = - sgn s_{j}^{σ} .

Thus sign conditions are satisfied, and so $σ \in FP (W, θ)$ . □

Note that the structure of the network in Theorem 44 is such that the graph can be decomposed as shown in Figure 6.5B. In contrast, in Figure 6.5A we have the less restrictive structure of Theorem 42, where the edges to two different components might be different.

We can also consider a mid point between these two structures, where nested components are simply-embedded, to obtain:

Theorem 45.

Let $(W, θ)$ such that $ρ_{k} = \cup_{j = 1}^{k} τ_{j}$ is simply-embedded in $(W, θ)$ for every $k = 1, \dots, N$ . That is, if $τ_{j} = {t_{j - 1} + 1, \dots, t_{j}}$ then $W$ has the form

graphic file with name nihpp-2410.11012v1-f0058.jpg

(6.9)

Then for any $σ \subseteq [n]$ and any $k \in [N]$ we have

σ \in FP (W, θ) \Rightarrow σ \cap ρ_{k} \in FP (W_{ρ_{k}}, θ) \cup {\emptyset}

where $ρ_{k} = \cup_{j = 1}^{k} τ_{j}$ .

This means that the fixed point supports of the whole network must come from supports of “ordered, nested” parts.

Proof. Since $σ \in FP (W, θ)$ , by Theorem 13 we have that

\forall j, m \in σ \forall l \notin σ sgn s_{j}^{σ} = sgn s_{m}^{σ} = - sgn s_{l}^{σ} .

By Lemma 41, since for every $k, ρ_{k}$ is simply-embedded, we have

\forall j, m \in ρ_{k} \forall l \notin ρ_{k} - σ sgn s_{j}^{σ \cap ρ_{k}} = sgn s_{m}^{σ \cap ρ_{k}} = - sgn s_{l}^{σ \cap ρ_{k}} .

This means that $σ \cap ρ_{k}$ satisfies the sign conditions in $ρ_{k}$ and so $σ \cap ρ_{k} \in FP (W_{ρ_{k}}, θ) \cup {\emptyset}$ . □

Corollary 46.

Let $τ$ be simply-embedded in $(W, θ)$ . Then for any $σ \subseteq [n]$ ,

σ \in FP (W, θ) \Rightarrow σ \cap τ \in FP (W_{τ}, θ) \cup {\emptyset} .

6.3.1. Updated results on simply-embedded partitions

As mentioned before, the results in this section generalized those published in [59], for which we omitted the proofs when we presented them in Section 6.2. Here we include references to which theorems from Section 6.3 will prove theorems of Section 6.2.

First, note that Definition 39 is generalizing Definition 19 now. That’s how we straightforwardly get:

Lemma 20.

sgn s_{j}^{σ} = sgn s_{k}^{σ} \Leftrightarrow sgn s_{j}^{σ_{i}} = sgn s_{k}^{σ_{i}}, f o r a l l j, k \in τ_{i} .

Proof. Follows directly from Lemma 41. □

Theorem 21 ( $FP (G)$ menu for simply-embedded partitions).

Let $G$ have a simply-embedded partition ${τ_{1} | \dots | τ_{N}}$ . For any $σ \subseteq [n]$ , let $σ_{i} \overset{def}{=} σ \cap τ_{i}$ . Then

σ \in FP (G) \Rightarrow σ_{i} \in FP ({G |}_{τ_{i}}) \cup {\emptyset} f o r a l l i \in [N] .

In other words, every fixed point support of $G$ is a union of component fixed point supports $σ_{i}$ , at most one per component.

Proof. Follows directly from Theorem 42. □

Lemma 30.

s_{j}^{σ} = \frac{1}{θ^{| I | - 1}} \prod_{i \in I} s_{j}^{σ_{i}},

where $s_{j}^{σ_{i}}$ has the same value for every $j \in [n] ∖ τ_{i}$ .

Moreover, for any $σ_{i} \in FP ({G |}_{τ_{i}})$ and $j \in τ_{i}$ :

sgn s_{j}^{σ_{i}} = {\begin{array}{l} idx (σ_{i}) & i f j \in σ_{i} \\ - idx (σ_{i}) & i f j \in τ_{i} ∖ σ_{i} \end{array}

while for any $k \notin τ_{i}$ ,

sgn s_{k}^{σ_{i}} = {\begin{array}{l} - idx (σ_{i}) & i f σ_{i} \in S_{τ_{i}} \\ idx (σ_{i}) & i f σ_{i} \in D_{τ_{i}} \end{array}

Proof. Follows directly from Lemma 43. □

Theorem 31.

every $σ_{i}$ is in $FP (G) \cup {\emptyset}$ , or
none of the $σ_{i}$ are in $FP (G) \cup {\emptyset}$ .

Proof. Follows directly from Theorem 44. □

Chapter 7 | Further theoretical results, and open questions

In this chapter, we explore two seemingly unrelated results originating from our work in Chapters 4 and 6. The first part of this chapter emerges from our attempt to extend our five-gait quadruped network approach to model hexapod gaits. We wondered whether CLTNs could learn hexapod gaits using a combination of theoretical insights from CTLNs and computational machine learning techniques. While we did not ultimately answer this question, during our exploration, we made an interesting observation: many different networks could support the same attractor. This observation isn’t entirely novel [27,28,42], which prompted us to study this phenomenon using CTLNs from a dynamical systems perspective. What conditions are sufficient for two different networks to support the same attractor? Our progress in addressing this question is discussed in Section 7.1.

The second half of the chapter derived from Chapter 6. Recall that in that chapter, we repeatedly invoked the ability to fully factorize the s_i values from Equation 6.1 when using a simply-embedded structure. This has been the case in 2016 [15], 2020 [59], and 2023 6.3. Now we extend this approach to even more general structures. As it turns out, these s_i values are elements of a chirotope. Section 7.2 explores the concept of chirotopes, their relation to TLNs, and partially computes the chirotope of a TLN that has some simply-embedded subnetworks.

These projects are still in their early stages, which explains their inclusion, together, in the final chapter.

7.1. Degeneracy

One of the aims of neuroscience is to identify the relationship between function and the underlying structure responsible for that function. However, a wide variety of different structures can produce the same function, a concept known as degeneracy [42]. Degeneracy manifests in various contexts in neuroscience, ranging from single cells with different properties generating the same behavior [27] to different neuronal circuits producing similar circuit performance [28]. That is, there is not a unique, one-to-one correspondence between function and structure. This holds true for models as well. A myriad of different parameters in a single model can yield the same output [26,64]. What fraction of neurons need to be observed to reconstruct a network’s behavior?

In the context of TLNs, the degeneracy problem translates into: for a given pattern of firing $x (t)$ , how many TLNs $(W, b)$ can generate that pattern? Furthermore, is there a precise notion under which two distinct networks $(W^{(0)}, b^{(0)})$ and $(W^{(1)}, b^{(1)})$ are considered equivalent? Can we derive a general principle that yields a family of "equivalent" networks? The degeneracy problem also raises the question: if several different networks can reproduce the same attractor, how do we identify conditions for accurate model fitting or parameter estimation? This would also require a precise notion under which two attractors are the same, and then conditions that facilitate this verification.

In the special case of TLNs, we have observed that even networks as small as 3 neurons can reproduce the same limit cycle, under different parameters. Figure 7.1 shows an example. On top, there is a binary-synapse matrix $W^{(0)}$ , arising as the CTLN connectivity matrix of a 3-cycle graph, and an all uniform input $b^{(0)}$ defining a network that supports a limit cycle, as shown by the rate curves. Below, a slightly different matrix $W^{(1)}$ and a non-uniform input $b^{(1)}$ is reproducing (apparently) the same limit cycle.

One partial result in this direction states that if the vector fields of two TLNs match at some point, then they match at that point for all TLNs that are convex combinations of the original two TLNs. More precisely:

Definition 47.

Let $(W^{(0)}, b^{(0)})$ and $(W^{(1)}, b^{(1)})$ be two TLNs on $n$ nodes, with vector fields

\frac{d x}{d t} = v^{(a)} (x) \overset{def}{=} - x + {[W^{(a)} x + b^{(a)}]}_{+}

for $a = 0, 1$ . For any $s \in [0, 1]$ , define the convex combination $(W^{(s)}, b^{(s)})$ as the TLN with

W^{(s)} = (1 - s) W^{(0)} + s W^{(1)} and b^{(s)} = (1 - s) b^{(0)} + s b^{(1)} .

Lemma 48.

Let $v^{(s)} (x) = - x + {[W^{(s)} x + b^{(s)}]}_{+}$ be the vector field of a TLN convex combination $(W^{(s)}, b^{(s)})$ , for $s \in [0, 1]$ . Suppose there exists a certain point $x_{0} \in ℝ^{n}$ such that the vector fields match: $v^{(1)} (x_{0}) = v^{(0)} (x_{0})$ . Then, for all $s \in [0, 1]$ , $v^{(s)} (x_{0}) = v^{(0)} (x_{0})$ .

Proof. First, observe that if ${[y_{1}]}_{+} = {[y_{2}]}_{+}$ , then for any $c_{1}, c_{2} > 0$ we also have

{[c_{1} y_{1} + c_{2} y_{2}]}_{+} = c_{1} {[y_{1}]}_{+} + c_{2} {[y_{2}]}_{+} .

Moreover, this will hold for $y_{1}, y_{2} \in ℝ^{n}$ , where some entries are positive and others are negative, and $y_{1} \neq y_{2}$ (as negative entries need not agree). Now suppose $v^{(1)} (x_{0}) = v^{(0)} (x_{0})$ . It follows that

{[W^{(1)} x_{0} + b^{(1)}]}_{+} = {[W^{(0)} x_{0} + b^{(0)}]}_{+} .

and hence, for $s \in [0, 1]$ ,

{[W^{(s)} x_{0} + b^{(s)}]}_{+} = {[(1 - s) W^{(0)} x_{0} + s W^{(1)} x_{0} + (1 - s) b^{(0)} + s b^{(1)}]}_{+} = {[(1 - s) (W^{(0)} x_{0} + b^{(0)}) + s (W^{(1)} x_{0} + b^{(1)})]}_{+} = (1 - s) {[W^{(0)} x_{0} + b^{(0)}]}_{+} + s {[W^{(1)} x_{0} + b^{(1)}]}_{+} = {[W^{(0)} x_{0} + b^{(0)}]}_{+} .

This implies that $v^{(s)} (x_{0}) = - x_{0} + {[W^{(s)} x_{0} + b^{(s)}]}_{+} = - x_{0} + {[W^{(0)} x_{0} + b^{(0)}]}_{+} = v^{(0)} (x_{0})$ . □

This implies in particular that fixed points survive to interpolations in convex combinations:

Corollary 49.

If $x^{*}$ is a fixed point of both $(W^{(0)}, b^{(0)})$ and $(W^{(1)}, b^{(1)})$ , then it is a fixed point for all convex combinations $(W^{(s)}, b^{(s)})$ .

Proof. If $v^{(1)} (x^{*}) = v^{(0)} (x^{*}) = 0$ , then $v^{(s)} (x^{*}) = 0$ for all $s \in [0, 1]$ . □

Also, any attractor that appears identically in two distinct TLNs - that is, with identical vector field along all point in the attractor - must also appear in all “in between” TLNs formed as convex combinations of the original two.

Corollary 50.

If $x (t)$ , for $t \in (t_{0}, t_{1})$ , is a trajectory of both $(W^{(0)}, b^{(0)})$ and $(W^{(1)}, b^{(1)})$ , then it is also a trajectory for all convex combinations $(W^{(s)}, b^{(s)})$ .

Several questions arise from our exploration of attractor degeneracy in the context of TLNs. First, can any of these results be generalized to any convex hull, beyond line segments? If so, can we identify distinct "chunks" of TLN parameter space $(W, b)$ that yield the same attractor? If such chunks exist, can we systematically characterize them to obtain partitions of TLN space $(W, b)$ by attractors? We could begin tackling this from a computational perspective by developing algorithms to efficiently identify and characterize regions of parameter space that produce equivalent attractors.

Some answers could also help elucidate the relationship between structure and function. For example, are there specific structural properties or network configurations that lead to the emergence of these shared attractors? This is a challenging question, but computationally, we could at least attempt to answer how different network architectures impact the existence and distribution of shared attractors.

Further-reaching questions include: what implications do these findings have for our understanding of neural circuit function and plasticity? How might the presence of degenerate attractors influence information processing and computational capabilities within neural networks? Are there practical applications of understanding attractor degeneracy in the design of artificial neural networks? These questions go beyond the immediate scope of our current investigation but are worth exploring to gain deeper insights into neural network dynamics and their computational properties.

7.2. Chirotope

7.2.1. Background

Given any set of vectors $E = {z_{1}, \dots, z_{m}} \subseteq ℝ^{d}$ , the Grassmann-Plucker (G-P) relations give us a series of identities relating determinants formed from elements of $E$ . Specifically, given any $x_{1}, \dots, x_{d} \in E$ and $y_{1}, \dots, y_{d} \in E$ ,

\det (x_{1}, \dots, x_{d}) \det (y_{1}, \dots, y_{d}) = \sum_{ℓ = 1}^{d} \det (x_{1}, \dots, x_{d - 1}, y_{ℓ}) \det (y_{1}, \dots, y_{ℓ - 1}, x_{d}, y_{ℓ + 1}, \dots, y_{d}) .

(7.1)

For different choices of $x_{i}$ ’s and $y_{i}$ ’s, Equation 7.1 yields all possible G-P relations on $E$ . Note that if $x_{d} = y_{j}$ for some $j = 1, \dots, d,$ , then the G-P relation is trivial.

What do G-P relations tell us about TLNs? Recall that the rows of the $n \times (n + 1)$ matrix $[- I + W ∣ b]$ are the vectors

h_{i} = (W_{i 1}, W_{i 2}, \dots, W_{i, i - 1}, - 1, W_{i, i + 1}, \dots, W_{i n}, b_{i}) .

For a TLN $(W, b)$ of dimension $n$ , we are interested in the set of vectors

E = E (W, b) = {h_{1}, \dots, h_{n}, e_{1}, \dots, e_{n + 1}} \subseteq ℝ^{n + 1},

corresponding to $d = n + 1$ . G-P relations for TLNs will relate determinants formed from these vectors, and this in turn will produce relationships among elements of the associated chirotope $χ$ :

Definition 51.

The chirotope $χ = χ_{(W, b)}$ of a TLN $(W, b)$ is the map $χ : E_{(W, b)}^{n + 1} \to {- 1, 0, 1}$ , where

χ (v_{1}, \dots, v_{n + 1}) = sgn
det (v_{1}, \dots, v_{n + 1}),

for any $v_{1}, \dots, v_{n + 1} \in E_{(W, b)}$ .

The reason we are interested in the chirotope $χ$ is that it encodes the full combinatorial geometry of a TLN. In particular, $χ$ encodes all fixed point supports in $FP (W, b)$ .

Proposition 52.

Let $(W, b)$ be a TLN on n neurons, and let $χ : E_{(W, b)}^{n + 1} \to {- 1, 0, 1}$ be its associated chirotope. Then

σ \in FP (W, b) \Leftrightarrow χ (v_{1}^{σ}, \dots, v_{n}^{σ}, e_{i}) = χ (v_{1}^{σ}, \dots, v_{n}^{σ}, e_{n + 1}) f o r a l l i \in σ, and

χ (v_{1}^{σ}, \dots, v_{n}^{σ}, h_{k}) = - χ (v_{1}^{σ}, \dots, v_{n}^{σ}, e_{n + 1}) f o r a l l k \in [n] ∖ σ,

where

v_{ℓ}^{σ} \overset{def}{=} {\begin{matrix} h_{i} for ℓ \in σ, \\ e_{ℓ} for ℓ \notin σ . \end{matrix}

(7.2)

Recall that for $σ \subseteq [n]$ , we use the notation $v_{ℓ}^{σ}$ to indicate $h_{i}$ or $e_{i}$ , with $v_{i}^{σ} = h_{i}$ for $i \in σ$ and $v_{i}^{σ} = e_{i}$ for $i \notin σ$ (see Eq. 7.2). We will also use the facts:

0 = \det (v_{1}^{σ}, \dots, v_{n}^{σ}, e_{i}), for i \in [n] ∖ σ, 0 = \det (v_{1}^{σ}, \dots, v_{n}^{σ}, h_{i}), for i \in σ, W_{i j} = \det (e_{1}, \dots, e_{j - 1}, h_{i}, e_{j + 1}, \dots, e_{n + 1}), for i \in [n], i \neq j, - 1 = \det (e_{1}, \dots, e_{i - 1}, h_{i}, e_{i + 1}, \dots, e_{n + 1}), for i \in [n], b_{i} = \det (e_{1}, \dots, e_{n}, h_{i}), and 1 = \det (e_{1}, \dots, e_{n}, e_{n + 1}),

Also, recall that:

\det A_{j}^{{i, j}} > 0 \Leftrightarrow i \to j and \det A_{i}^{{i, j}} > 0 \Leftrightarrow j \to i .

Finally, we will use the notation:

Δ_{ℓ}^{i j} \overset{def}{=} \det (e_{1}, \dots, e_{ℓ - 1}, h_{i}, e_{ℓ + 1}, \dots, e_{n}, h_{j}) = - Δ_{ℓ}^{j i} = W_{i ℓ} b_{j} - W_{j ℓ} b_{i} if i, j \neq ℓ .

(7.3)

Note that for $i = j$ , we always have $Δ_{ℓ}^{i j} = 0$ . Assuming $i \neq j$ , then for $ℓ = i$ we obtain

Δ_{i}^{i j} = \det (e_{1}, \dots, e_{i - 1}, h_{i}, e_{i + 1}, \dots, e_{n}, h_{j}) = \det A_{j}^{{i}} = - \det A_{j}^{{i, j}} = - b_{i} W_{j i} - b_{j},

(7.4)

and for $ℓ = j$ we have

Δ_{j}^{i j} = \det (e_{1}, \dots, e_{j - 1}, h_{i}, e_{j + 1}, \dots, e_{n}, h_{j}) = - \det A_{i}^{{j}} = \det A_{i}^{{i, j}} = b_{j} W_{i j} + b_{i} .

(7.5)

So altogether, we get:

Δ_{ℓ}^{i j} = {\begin{matrix} b_{j} W_{i ℓ} - b_{i} W_{j ℓ}, & if i, j \neq ℓ \\ - b_{j} - b_{i} W_{j i}, & if i \neq j, ℓ = i \\ - b_{j} W_{i j} + b_{i}, & if i \neq j, ℓ = j \\ 0, & if i = j . \end{matrix}

For any $τ \subseteq [n]$ , and $i$ , $j$ , $k \in [n]$ , we can define:

Δ_{k}^{i j} (τ) \overset{def}{=} \det (v_{1}^{τ ∖ {i, j}}, v_{2}^{τ ∖ {i, j}}, \dots, v_{k - 1}^{τ ∖ {i, j}}, h_{i}, v_{k + 1}^{τ ∖ {i, j}}, \dots, v_{n}^{τ ∖ {i, j}}, h_{j}) .

Note that $v_{i}^{τ ∖ {i, j}} = e_{i}$ and $v_{j}^{τ ∖ {i, j}} = e_{j}$ , so the above determinant does not repeat $h_{i}$ or $h_{j}$ terms, provided $i \neq j$ , and is generically expected to be nonzero. For $τ = {i, j}$ , this new determinant reduces to:

Δ_{k}^{i j} ({i, j}) = \det (e_{1}, \dots, e_{k - 1}, h_{i}, e_{k + 1}, \dots, e_{n}, h_{j}) = Δ_{k}^{i j} .

7.2.2. New results

If you follow the proofs of Chapter 6, you will notice that there is a recurrent strategy: assume a component is simply-embedded to obtain a factorization of the s_i’s, whose signs greatly control the set of fixed points supports of a network. In this section we aim to leverage again the simply-embedded structure to see if we can easily completely compute the chirotope for networks that have some simply-embedded structure in them. The first results in this direction find that there is plenty of elements that end up vanishing:

Lemma 53.

Consider a CTLN with graph $G$ on $n$ nodes. Let $τ \subseteq [n]$ be simply-embedded in $G$ . Then

Δ_{k}^{i j} (τ) = 0,

for all $i$ , $j \in τ$ , and $k \notin τ$ .

Proof. If $i = j$ , then $Δ_{k}^{i j} = 0$ trivially ( $h_{i}$ gets repeated). So suppose $i \neq j$ . without loss of generality, let $τ = {1, \dots, i, i + 1}$ , let $j = i + 1$ , and $k > i + 1$ . Then,

Δ_{k}^{i j} (τ) = \det (h_{1}, \dots, h_{i - 1}, e_{i}, \dots, e_{k - 1}, h_{i}, e_{k + 1}, \dots, e_{n}, h_{j}) = det [\begin{array}{l} - 1 & W_{12} & \dots & W_{1, i - 1} & 0 & \dots & 0 & W_{1, k} & 0 & \dots & 0 & b_{1} \\ W_{21} & - 1 & \dots & W_{2, i - 1} & 0 & \dots & 0 & W_{2, k} & 0 & \dots & 0 & b_{2} \\ ⋮ & ⋱ & ⋮ & ⋱ & ⋮ & ⋱ & ⋮ \\ W_{i - 1, 1} & W_{i - 1, 2} & \dots & - 1 & 0 & \dots & 0 & W_{i - 1, k} & 0 & \dots & 0 & b_{i - 1} \\ 0 & 0 & \dots & 0 & 1 & \dots & 0 & 0 & 0 & \dots & 0 & 0 \\ ⋮ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋮ & ⋮ \\ 0 & 0 & \dots & 0 & 0 & \dots & 1 & 0 & 0 & \dots & 0 & 0 \\ W_{i, 1} & W_{i, 2} & \dots & W_{i, i - 1} & 0 & \dots & 0 & W_{i, k} & 0 & \dots & 0 & b_{i} \\ 0 & 0 & \dots & 0 & 0 & \dots & 0 & 0 & 1 & \dots & 0 & 0 \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 0 & 0 & \dots & 0 & 0 & 0 & \dots & 1 & 0 \\ W_{j, 1} & W_{j, 2} & \dots & W_{j, i - 1} & 0 & \dots & 0 & W_{j, k} & 0 & \dots & 0 & b_{j} \end{array}] .

Now, recall that for a CTLN we have $b_{ℓ} = θ$ for all $ℓ \in [n]$ , so the last column is of the form

θ {(1, 1, \dots, 1, 0, \dots, 0, 1, 0, \dots, 0, 1)}^{T} .

Furthermore, since $τ$ is simply-embedded and $k \notin τ$ all nonzero entries in the $k$ -th column are identical and equal to $W_{i, k}$ , so that this column has the form

W_{i, k} {(1, 1, \dots, 1, 0, \dots, 0, 1, 0, \dots, 0, 1)}^{T} .

In other words, the $k$ -th column is a scalar multiple of the last column, so $Δ_{k}^{i j} (τ) = 0$ . □

In fact, we can prove that there are far more elements that vanish:

Lemma 54.

Consider a CTLN with graph $G$ . Let $τ \subseteq [n]$ be simply-embedded in $G$ . Any chirotope element whose only $h_{i}$ elements come from $τ (i \in τ)$ , and does not contain $e_{p}$ , $e_{q}$ for some $p$ , $q \notin τ$ , will vanish.

Proof. Let $σ = {1, \dots, k} \subseteq τ$ . It suffices to prove:

\det (h_{1}, \dots, h_{k}, e_{s_{1}}, \dots, \hat{e_{p}}, \dots, \hat{e_{q}} \dots, e_{s_{n + 1 - k}}) = 0,

(7.6)

as other determinants satisfying the conditions in the theorem are either permutations or index relabelings of the LHS of Equation 7.6. Let $ρ = τ ∖ σ = {k + 1, \dots, l}$ , and $[n] ∖ τ = {l + 1, \dots, p, \dots, q, \dots, n}$ then

\det (h_{1}, \dots, h_{k}, e_{s_{1}}, \dots, \hat{e_{p}}, \dots, \hat{e_{q}} \dots, e_{s_{n + 1 - k}})

graphic file with name nihpp-2410.11012v1-f0059.jpg

But since $τ$ is simply-embedded, $W_{i, j} = W_{j}$ for all $i \in τ$ and $j \notin τ$ and therefore columns $p$ and $q$ are linearly dependent:

graphic file with name nihpp-2410.11012v1-f0060.jpg

Note however that the other columns may or may not be linearly dependent, as the * indicates the possible presence of a 1, so it is indeed necessary to have $e_{p}$ and $e_{q}$ not appear in the determinant. Also, note that the proof still holds if $q = n + 1$ . □

Lemma 53 now follows from Lemma 54:

Δ_{k}^{i j} (τ) = \det (h_{1}, \dots, h_{i - 1}, e_{i}, e_{i + 1}, e_{i + 2} \dots, e_{k - 1}, h_{i}, e_{k + 1}, \dots, e_{n}, h_{j}) = \pm \det (h_{1}, \dots, h_{i - 1}, h_{i}, h_{j}, e_{i + 2}, \dots, e_{k - 1}, e_{i}, e_{k + 1}, \dots, e_{n}, e_{i + 1}) = \pm \det (h_{1}, \dots, h_{j = i + 1}, e_{i}, e_{i + 1}, e_{i + 2} \dots, e_{k - 1}, e_{k + 1}, \dots, e_{n}) .

It is clear from the last determinant that $e_{k}$ and $e_{n + 1}$ are missing, with $k$ , $n + 1 \notin τ = {1, \dots, i, i + 1}$ . In this case, $σ = τ$ in the proof above. Note that the indices above give a total of $n + 1$ elements: $1, \dots, k - 1, k + 1, \dots, n$ with an extra $i$ , $i + 1$ for a total of $n - 1 + 2 = n + 1$ elements.

We can still push a little further:

Lemma 55.

Let $τ \subseteq [n]$ and suppose one of the following holds:

There exist $p$ , $q \in [n] ∖ τ$ , $(p \neq q)$ such that $τ$ is simply-embedded in $τ \cup {p, q}$ . That is, $W_{i, p} = W_{j, p}$ and $W_{i, q} = W_{j, q}$ for all $i$ , $j \in τ$ . (Assume no restrictions on b).
There exists $p \in [n] ∖ τ$ such that $τ$ is simply-embedded in $τ \cup {p}$ , $q = n + 1$ , and $b_{i} = θ$ for all $i \in τ$ (it can be anything outside of $τ$ ).

Then, any chirotope determinant whose only $h_{i}$ elements come from $τ$ (i.e. $i \in τ$ ), and does not contain $e_{p}$ or $e_{q}$ , will vanish.

Proof. Let $σ = {1, \dots, k} \subseteq τ$ . It suffices to prove:

\det (h_{1}, \dots, h_{k}, e_{s_{1}}, \dots, \hat{e_{p}}, \dots, \hat{e_{q}} \dots, e_{s_{n + 1 - k}}) = 0,

(7.7)

as other determinants satisfying the conditions in the theorem are either permutations or index relabelings of the LHS of Equation 7.7. First, suppose (i) holds. Let $ρ = τ ∖ σ = {k + 1, \dots, l}$ , and $[n] ∖ τ = {l + 1, \dots, p, \dots, q, \dots, n}$ , then

graphic file with name nihpp-2410.11012v1-f0061.jpg

where the ∗ are blocks of mostly zeros and some ones, coming from the $e_{s_{i}}$ rows. Now, since $τ$ is simply-embedded in $τ \cup {p, q}$ , it follows that columns $p$ and $q$ are linearly dependent and thus the determinant vanishes.

For (ii), let $ρ = τ ∖ σ = {k + 1, \dots, l}$ , and $[n] ∖ τ = {l + 1, \dots, p, \dots, n}$ . Then the determinant looks like

graphic file with name nihpp-2410.11012v1-f0062.jpg

and so in this case columns $p$ and $n + 1$ are linearly dependent too, and the determinant vanishes. □

As mentioned above, our goal was to completely compute the chirotope for networks that have some simply added structure. More specifically, if a given CTLN has a strongly simply-embedded partition, can the chirotope be factorized, just like the $s_{i}$ ’s can? We still don’t know.

Supplementary Material

Supplement 1

NIHPP2410.11012v1-supplement-1.pdf^{(2.3MB, pdf)}

Acknowledgments

I want to extend my deepest gratitude to my advisor, Prof. Carina Curto, for her constant support, guidance, and encouragement. Her expertise and mentorship have been invaluable in shaping not only my current research program but also my identity as a researcher. Her belief in me and the strong role model she is has been a truly transformative experience in this journey.

I would also like to express my appreciation to all my collaborators for their contributions to our shared projects, especially to Prof. Katie Morrison for her incredible support and valuable advice, which greatly contributed to the research presented in this dissertation.

Thanks also to labmates, but especially to Nikki Sanderson, for their continuous academic, professional, and personal support, which has been instrumental in navigating the challenges of graduate school.

I extend my appreciation to my committee members for taking the time to provide valuable feedback, constructive criticism, and insightful suggestions to this work. Their expertise has been instrumental in refining my ideas and strengthening the quality of my current and future work.

I would also like to acknowledge the Department of Mathematics for providing a nurturing environment for research, learning, and teaching. The resources offered by the department have played a crucial role in strengthening my professional profile. Special thanks to Profs. Cheryl Hile and Kathryn Stewart for their exceptional mentorship during my time as a graduate instructor. I am the most fortunate to have had such dedicated and inspiring teaching mentors.

I am also indebted to all those who have supported me personally during this time. Their presence in my life has been indispensable in keeping a healthy research-life balance.

Finally, I am immensely grateful for the funding provided by the Department of Mathematics, the NIH grant R01 EB022862 and NSF grant DMS-1951165, without which this research would not have been possible. The material in Chapter 7 is based upon work supported by the National Science Foundation under Grant No. DMS-1929284 while the author was in residence at the Institute for Computational and Experimental Research in Mathematics in Providence, RI, during the Math + Neuroscience program. The findings and conclusions in this dissertation do not necessarily reflect the view of the funding agencies.

Footnotes

The set $FP (G)$ does not depend on $θ$ , see [53]

This foreshadows the importance of this factorization and will come back to us in Chapter 7, in unexpected ways.

Bibliography

[1].Neuroscience. Sinauer Associates, 2 edition, 2001.
[2].McN R. Alexander. The gaits of bipedal and quadrupedal animals. The International Journal of Robotics Research, 3(2):49–59, 1984. [Google Scholar]
[3].Amit Daniel J. Modeling brain function: The world of attractor neural networks. Cambridge University Press, 1989. [Google Scholar]
[4].Arriaga M. and Han E. B. Dedicated hippocampal inhibitory networks for locomotion and immobility. J. Neurosci., 37:9222–9238, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
[5].Ashwin P., Coombes S., and Nicks R. Mathematical Frameworks for Oscillatory Network Dynamics in Neuroscience. J Math Neurosci, 6(1):2, Dec 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
[6].Bel A., Cobiaga R., Reartes W., and Rotstein H. G. Periodic solutions in threshold-linear networks and their entrainment. SIAM J. Appl. Dyn. Syst., 20(3):1177–1208, 2021. [Google Scholar]
[7].Biswas Tirthabir and Fitzgerald James E. Geometric framework to predict structure from function in neural networks. Phys. Rev. Res., 4:023255, Jun 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
[8].Buono P. L. and Golubitsky M. Models of central pattern generators for quadruped locomotion. I. Primary gaits. J Math Biol, 42(4):291–326, Apr 2001. [DOI] [PubMed] [Google Scholar]
[9].Büttner U. and Büttner-Ennever J.A. Present concepts of oculomotor organization. In Büttner-Ennever J.A, editor, Neuroanatomy of the Oculomotor System, volume 151 of Progress in Brain Research, pages 1–42. Elsevier, 2006. [DOI] [PubMed] [Google Scholar]
[10].Cain Nicholas and Shea-Brown Eric. Computational models of decision making: integration, stability, and noise. Current Opinion in Neurobiology, 22(6):1047–1053, 2012. Decision making. [DOI] [PubMed] [Google Scholar]
[11].Carlos Joaquín Castañeda Castro. Ctlns definidos por torneos, 2023.
[12].Collins J. J. and Stewart I. N. Coupled nonlinear oscillators and the symmetries of animal gaits. Journal of Nonlinear Science, 3(1):349–392, Dec 1993. [Google Scholar]
[13].Curto C., Degeratu A., and Itskov V. Encoding binary neural codes in networks of threshold-linear neurons. Neural Comput., 25:2858–2903, 2013. [DOI] [PubMed] [Google Scholar]
[14].Curto C., Geneson J., and Morrison K. Stable fixed points of combinatorial threshold-linear networks. Available at https://arxiv.org/abs/1909.02947 [DOI] [PMC free article] [PubMed]
[15].Curto C., Geneson J., and Morrison K. Fixed points of competitive threshold-linear networks. Neural Comput., 31(1):94–155, 2019. [DOI] [PubMed] [Google Scholar]
[16].Curto Carina and Morrison Katherine. Pattern completion in symmetric threshold-linear networks. Neural Computation, 28:2825–2852, 12 2016. [DOI] [PubMed] [Google Scholar]
[17].Curto Carina and Morrison Katherine. Graph rules for recurrent neural network dynamics. Notices of the American Mathematical Society, 70(04):536–551, 2023. [Google Scholar]
[18].Curto Carina and Morrison Katherine. Graph rules for recurrent neural network dynamics: extended version, 2023.
[19].1965 Dayan Peter and Abbott L. F. Theoretical neuroscience : computational and mathematical modeling of neural systems. Computational neuroscience. Massachusetts Institute of Technology Press, Cambridge, Mass., 2001. [Google Scholar]
[20].Durstewitz Daniel, Koppe Georgia, and Thurm Max Ingo. Reconstructing computational system dynamics from neural data with recurrent neural networks. Nature Reviews Neuroscience, 24(11):693–710, Nov 2023. [DOI] [PubMed] [Google Scholar]
[21].Dutta Sourav, Parihar Abhinav, Khanna Abhishek, Gomez Jorge, Chakraborty Wriddhi, Jerry Matthew, Grisafe Benjamin, Raychowdhury Arijit, and Datta Suman. Programmable coupled oscillators for synchronized locomotion. Nature Communications, 10(1):3299, Jul 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
[22].Dzeladini Florin, Ait-Bouziad Nadine, and Ijspeert Auke. CPG-Based Control of Humanoid Robot Locomotion, pages 1–35. Springer Netherlands, Dordrecht, 2018. [Google Scholar]
[23].Bard Ermentrout G. and David H. Terman. Firing Rate Models, pages 331–367. Springer; New York, New York, NY, 2010. [Google Scholar]
[24].Feudel Ulrike, Pisarchik Alexander N., and Showalter Kenneth. Multistability and tipping: From mathematics and physics to climate and brain—minireview and preface to the focus issue. Chaos: An Interdisciplinary Journal of Nonlinear Science, 28(3):033501, 2018. [DOI] [PubMed] [Google Scholar]
[25].Gambaryan P. P. How mammals run: anatomical adaptations. Wiley, New York, 1974. [Google Scholar]
[26].Gjorgjieva Julijana, Drion Guillaume, and Marder Eve. Computational implications of biophysical diversity and multiple timescales in neurons and synapses for circuit performance. Curr Opin Neurobiol, 37:44–52, 4 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
[27].Goaillard Jean and Marder Eve. Ion channel degeneracy, variability, and covariation in neuron and circuit resilience. Annu Rev Neurosci, 44:335–357, 7 2021. [DOI] [PubMed] [Google Scholar]
[28].Goaillard Jean, Taylor Adam, Schulz David, and Marder Eve. Functional consequences of animal-to-animal variation in circuit parameters. Nature Neuroscience, 12:1424–1430, 11 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
[29].Goldman M. S. Robust Persistent Neural Activity in a Model Integrator with Multiple Hysteretic Dendrites per Neuron. Cerebral Cortex, 13(11):1185–1195, November 2003. [DOI] [PubMed] [Google Scholar]
[30].Goldman Mark S, Compte A, and Wang X. J. Neural Integrator Models, pages 165–178. Elsevier Ltd, 2010. [Google Scholar]
[31].Golubitsky M., Stewart I., Buono P-L, and Collins J.J. Symmetry in locomotor central pattern generators and animal gaits. Nature, 401:693–695, 1999. [DOI] [PubMed] [Google Scholar]
[32].Golubitsky Martin, Stewart Ian, Buono Pietro-Luciano, and Collins J.J. A modular network for legged locomotion. Physica D: Nonlinear Phenomena, 115(1):56–72, 1998. [Google Scholar]
[33].Graybiel Ann M. The basal ganglia and chunking of action repertoires. Neurobiology of Learning and Memory, 70(1):119–136, 1998. [DOI] [PubMed] [Google Scholar]
[34].Grillner S. and Wallén P. Cellular bases of a vertebrate locomotor system – steering, intersegmental and segmental co-ordination and sensory control. Brain Res. Rev., 40:92–106, 2002. [DOI] [PubMed] [Google Scholar]
[35].Hahnloser R. H., Sarpeshkar R., Mahowald M.A., Douglas R.J., and Seung H.S. Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit. Nature, 405:947–951, 2000. [DOI] [PubMed] [Google Scholar]
[36].Hahnloser R. H., Seung H.S., and Slotine J.J. Permitted and forbidden sets in symmetric threshold-linear networks. Neural Comput., 15(3):621–638, 2003. [DOI] [PubMed] [Google Scholar]
[37].Richard Hahnloser H. Sebastian Seung, and Jean-Jacques Slotine. Permitted and forbidden sets in symmetric threshold-linear networks. Neural Comput, 15:621–638, 3 2003. [DOI] [PubMed] [Google Scholar]
[38].HARTLINE H. K. and RATLIFF F. Spatial summation of inhibitory influences in the eye of Limulus, and the mutual interaction of receptor units. J Gen Physiol, 41(5):1049–1066, May 1958. [DOI] [PMC free article] [PubMed] [Google Scholar]
[39].Hopfield J.J. Neural networks and physical systems with emergent collective computational abilities. Proc. Natl. Acad. Sci., 79(8):2554–2558, 1982. [DOI] [PMC free article] [PubMed] [Google Scholar]
[40].Hopfield J.J. Neurons with graded response have collective computational properties like those of two-sate neurons. Proc. Natl. Acad. Sci., 81:3088–3092, 1984. [DOI] [PMC free article] [PubMed] [Google Scholar]
[41].Ijspeert Auke Jan. Central pattern generators for locomotion control in animals and robots: A review. Neural Networks, 21(4):642–653, 2008. Robotics and Neuroscience. [DOI] [PubMed] [Google Scholar]
[42].Kamaleddin Mahdi. Degeneracy in the nervous system: from neuronal excitability to neural coding. Bioessays, 44:e2100148, 1 2022. [DOI] [PubMed] [Google Scholar]
[43].Khona Mikail and Fiete Ila R. Attractor and integrator networks in the brain. Nature Reviews Neuroscience, 23(12):744–766, Dec 2022. [DOI] [PubMed] [Google Scholar]
[44].Kornysheva Katja and Diedrichsen Jörn. Human premotor areas parse sequences into their spatial and temporal features. eLife, 3:e03043, aug 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
[45].Koulakov Alexei A., Raghavachari Sridhar, Kepecs Adam, and Lisman John E. Model for a robust neural integrator. Nature Neuroscience, 5(8):775–782, Aug 2002. [DOI] [PubMed] [Google Scholar]
[46].Lappalainen Janne K., Tschopp Fabian D., Prakhya Sridhama, Mason McGill Aljoscha Nern, Shinomiya Kazunori, Shin ya Takemura Eyal Gruntman, Macke Jakob H., and Turaga Srinivas C. Connectome-constrained deep mechanistic networks predict neural responses across the fly visual system at single-neuron resolution. bioRxiv, 2023. [DOI] [PMC free article] [PubMed]
[47].Lashley Karl S. The problem of serial order in behavior. 1951.
[48].Latorre Roberto, Varona Pablo, and Rabinovich Mikhail I. Rhythmic control of oscillatory sequential dynamics in heteroclinic motifs. Neurocomputing, 331:108–120, 2019. [Google Scholar]
[49].Laureline Logiaco L.F. Abbott, and Sean Escola. Thalamic control of cortical dynamics in a model of flexible motor sequencing. Cell Reports, 35(9):109090, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
[50].Long Michael A., Jin Dezhe Z., and Fee Michale S. Support for a synaptic chain model of neuronal sequence generation. Nature, 468(7322):394–399, Nov 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
[51].Marder E. and Bucher D. Central pattern generators and the control of rhythmic movements. Curr. Bio., 11(23):R986–996, 2001. [DOI] [PubMed] [Google Scholar]
[52].Mazurek Mark E., Roitman Jamie D., Ditterich Jochen, and Shadlen Michael N. A Role for Neural Integrators in Perceptual Decision Making. Cerebral Cortex, 13(11):1257–1269, 11 2003. [DOI] [PubMed] [Google Scholar]
[53].Morrison K. and Curto C. Predicting neural network dynamics via graphical analysis. Book chapter in Algebraic and Combinatorial Computational Biology, edited by Robeva R. and Macaulay M. Elsevier, 2018.
[54].Morrison K., Degeratu A., Itskov V., and Curto C. Diversity of emergent dynamics in competitive threshold-linear networks. Available at https://arxiv.org/abs/1605.04463
[55].Eadweard. Muybridge. Attitudes of Animals in Motion, Illustrated with the Zoopraxiscope. University of Pennsylvania, 1891.
[56].Nikitchenko Maxim and Koulakov Alexei. Neural integrator: a sandpile model. Neural computation, 20(10):2379–2417, Oct 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
[57].Sara Oliveira Santos Nils Tack, Su Yunxing, Cuenca-Jiménez Francisco, Morales-Lopez Oscar, Gomez-Valdez P. Antonio, and Wilhelmus Monica M . Pleobot: a modular robotic solution for metachronal swimming. Scientific Reports, 13(1):9574, Jun 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
[58].Panchin Y. V., Arshavsky Y. I., Deliagina T. G., Popova L. B., and Orlovsky G. N. Control of locomotion in marine mollusk clione limacina. ix. neuronal mechanisms of spatial orientation. Journal of Neurophysiology, 73(5):1924–1937, 1995. [DOI] [PubMed] [Google Scholar]
[59].Parmelee Caitlyn,Londono Alvarez Juliana, Curto Carina, and Morrison Katherine. Sequential attractors in combinatorial threshold-linear networks. SIAM Journal on Applied Dynamical Systems, 21(2):1597–1630, 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
[60].Parmelee Caitlyn, Castañeda Joaquin, Morrison Katherine, and Curto Carina. New core motifs paper.
[61].Parmelee Caitlyn, Moore Samantha, Morrison Katherine, and Curto Carina. Core motifs predict dynamic attractors in combinatorial threshold-linear networks. PLOS ONE, 17(3):1–21, 03 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
[62].Penn Yaron, Segal Menahem, and Moses Elisha. Network synchronization in hippocampal neurons. Proceedings of the National Academy of Sciences, 113(12):3341–3346, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
[63].Pisarchik Alexander N. and Feudel Ulrike. Control of multistability. Physics Reports, 540(4):167–218, 2014. Control of multistability. [Google Scholar]
[64].Prinz Astrid A., Bucher Dirk, and Marder Eve. Similar network activity from disparate circuit parameters. Nature Neuroscience, 7:1345–1352, 12 2004. [DOI] [PubMed] [Google Scholar]
[65].Rabinovich Mikhail I. and Varona Pablo. Discrete sequential information coding: Heteroclinic cognitive dynamics. Frontiers in Computational Neuroscience, 12, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
[66].Sakai Katsuyuki, Kitaguchi Katsuya, and Hikosaka Okihide. Chunking during human visuomotor sequence learning. Exp. Brain Res., 152(2):229–242, September 2003. [DOI] [PubMed] [Google Scholar]
[67].Seung H.S. and Yuste R. Principles of Neural Science, chapter Appendix E: Neural networks, pages 1581–1600. McGraw-Hill Education/Medical, 5th edition, 2012. [Google Scholar]
[68].Sebastian Seung H., Daniel D.Lee, Reis Ben Y, and Tank David W. Stability of the memory of eye position in a recurrent network of conductance-based model neurons. Neuron, 26(1):259–271, 2000. [DOI] [PubMed] [Google Scholar]
[69].Su Cui and Pang Jun. Sequential temporary and permanent control of boolean networks. In Computational Methods in Systems Biology: 18th International Confer-ence, CMSB 2020, Konstanz, Germany, September 23–25, 2020, Proceedings, page 234–251, Berlin, Heidelberg, 2020. Springer-Verlag. [Google Scholar]
[70].Tsodyks M. V., Skaggs W. E., Sejnowski T. J., and McNaughton B. L. Paradoxical effects of external modulation of inhibitory interneurons. J Neurosci, 17(11):4382–4388, Jun 1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
[71].Varona Pablo, Rabinovich Mikhail I., Selverston Allen I., and Arshavsky Yuri I. Winnerless competition between sensory neurons generates chaos: A possible mechanism for molluscan hunting behavior. Chaos: An Interdisciplinary Journal of Nonlinear Science, 12(3):672–677, 2002. [DOI] [PubMed] [Google Scholar]
[72].Vasa Suresh, Ma Tao, Byadarhaly Kiran V., Perdoor Mithun, and Minai Ali A. A spiking neural model for the spatial coding of cognitive response sequences. In 2010 IEEE 9th International Conference on Development and Learning, pages 140–146, 2010.
[73].White J.A., Chow C.C., Ritt J., Soto-Treviño C., and Kopell N. Synchronization and oscillatory dynamics in heterogeneous, mutually inhibited neurons. J. Comput. Neurosci., 5(1):5–16, 1998. [DOI] [PubMed] [Google Scholar]
[74].Whittington M.A., Traub R.D., Kopell N., Ermentrout B., and Buhl E.H. Inhibitionbased rhythms: experimental and mathematical observations on network dynamics. Int. J. Psychophysiol., 38(3):315–336, 2000. [DOI] [PubMed] [Google Scholar]
[75].Williams Thelma L. A new model for force generation by skeletal muscle, incorporating work-dependent deactivation. J. Exp. Biol., 213(4):643–650, February 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
[76].Wong Aaron L and Krakauer John W. Why are sequence representations in primary motor cortex so elusive? Neuron, 103(6):956–958, September 2019. [DOI] [PubMed] [Google Scholar]
[77].Xie X., Hahnloser R. H., and Seung H.S. Selectively grouping neurons in recurrent networks of lateral inhibition. Neural Comput., 14:2627–2646, 2002. [DOI] [PubMed] [Google Scholar]
[78].Yokoi Atsushi and Diedrichsen Jörn. Neural organization of hierarchical motor sequence representations in the human neocortex. Neuron, 103(6):1178–1190.e7, 2019. [DOI] [PubMed] [Google Scholar]
[79].Yuste R., MacLean J.N., Smith J., and Lansner A. The cortex as a central pattern generator. Nat. Rev. Neurosci., 6:477–483, 2005. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement 1

NIHPP2410.11012v1-supplement-1.pdf^{(2.3MB, pdf)}

[R1] [1].Neuroscience. Sinauer Associates, 2 edition, 2001.

[R2] [2].McN R. Alexander. The gaits of bipedal and quadrupedal animals. The International Journal of Robotics Research, 3(2):49–59, 1984. [Google Scholar]

[R3] [3].Amit Daniel J. Modeling brain function: The world of attractor neural networks. Cambridge University Press, 1989. [Google Scholar]

[R4] [4].Arriaga M. and Han E. B. Dedicated hippocampal inhibitory networks for locomotion and immobility. J. Neurosci., 37:9222–9238, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] [5].Ashwin P., Coombes S., and Nicks R. Mathematical Frameworks for Oscillatory Network Dynamics in Neuroscience. J Math Neurosci, 6(1):2, Dec 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] [6].Bel A., Cobiaga R., Reartes W., and Rotstein H. G. Periodic solutions in threshold-linear networks and their entrainment. SIAM J. Appl. Dyn. Syst., 20(3):1177–1208, 2021. [Google Scholar]

[R7] [7].Biswas Tirthabir and Fitzgerald James E. Geometric framework to predict structure from function in neural networks. Phys. Rev. Res., 4:023255, Jun 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] [8].Buono P. L. and Golubitsky M. Models of central pattern generators for quadruped locomotion. I. Primary gaits. J Math Biol, 42(4):291–326, Apr 2001. [DOI] [PubMed] [Google Scholar]

[R9] [9].Büttner U. and Büttner-Ennever J.A. Present concepts of oculomotor organization. In Büttner-Ennever J.A, editor, Neuroanatomy of the Oculomotor System, volume 151 of Progress in Brain Research, pages 1–42. Elsevier, 2006. [DOI] [PubMed] [Google Scholar]

[R10] [10].Cain Nicholas and Shea-Brown Eric. Computational models of decision making: integration, stability, and noise. Current Opinion in Neurobiology, 22(6):1047–1053, 2012. Decision making. [DOI] [PubMed] [Google Scholar]

[R11] [11].Carlos Joaquín Castañeda Castro. Ctlns definidos por torneos, 2023.

[R12] [12].Collins J. J. and Stewart I. N. Coupled nonlinear oscillators and the symmetries of animal gaits. Journal of Nonlinear Science, 3(1):349–392, Dec 1993. [Google Scholar]

[R13] [13].Curto C., Degeratu A., and Itskov V. Encoding binary neural codes in networks of threshold-linear neurons. Neural Comput., 25:2858–2903, 2013. [DOI] [PubMed] [Google Scholar]

[R14] [14].Curto C., Geneson J., and Morrison K. Stable fixed points of combinatorial threshold-linear networks. Available at https://arxiv.org/abs/1909.02947 [DOI] [PMC free article] [PubMed]

[R15] [15].Curto C., Geneson J., and Morrison K. Fixed points of competitive threshold-linear networks. Neural Comput., 31(1):94–155, 2019. [DOI] [PubMed] [Google Scholar]

[R16] [16].Curto Carina and Morrison Katherine. Pattern completion in symmetric threshold-linear networks. Neural Computation, 28:2825–2852, 12 2016. [DOI] [PubMed] [Google Scholar]

[R17] [17].Curto Carina and Morrison Katherine. Graph rules for recurrent neural network dynamics. Notices of the American Mathematical Society, 70(04):536–551, 2023. [Google Scholar]

[R18] [18].Curto Carina and Morrison Katherine. Graph rules for recurrent neural network dynamics: extended version, 2023.

[R19] [19].1965 Dayan Peter and Abbott L. F. Theoretical neuroscience : computational and mathematical modeling of neural systems. Computational neuroscience. Massachusetts Institute of Technology Press, Cambridge, Mass., 2001. [Google Scholar]

[R20] [20].Durstewitz Daniel, Koppe Georgia, and Thurm Max Ingo. Reconstructing computational system dynamics from neural data with recurrent neural networks. Nature Reviews Neuroscience, 24(11):693–710, Nov 2023. [DOI] [PubMed] [Google Scholar]

[R21] [21].Dutta Sourav, Parihar Abhinav, Khanna Abhishek, Gomez Jorge, Chakraborty Wriddhi, Jerry Matthew, Grisafe Benjamin, Raychowdhury Arijit, and Datta Suman. Programmable coupled oscillators for synchronized locomotion. Nature Communications, 10(1):3299, Jul 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] [22].Dzeladini Florin, Ait-Bouziad Nadine, and Ijspeert Auke. CPG-Based Control of Humanoid Robot Locomotion, pages 1–35. Springer Netherlands, Dordrecht, 2018. [Google Scholar]

[R23] [23].Bard Ermentrout G. and David H. Terman. Firing Rate Models, pages 331–367. Springer; New York, New York, NY, 2010. [Google Scholar]

[R24] [24].Feudel Ulrike, Pisarchik Alexander N., and Showalter Kenneth. Multistability and tipping: From mathematics and physics to climate and brain—minireview and preface to the focus issue. Chaos: An Interdisciplinary Journal of Nonlinear Science, 28(3):033501, 2018. [DOI] [PubMed] [Google Scholar]

[R25] [25].Gambaryan P. P. How mammals run: anatomical adaptations. Wiley, New York, 1974. [Google Scholar]

[R26] [26].Gjorgjieva Julijana, Drion Guillaume, and Marder Eve. Computational implications of biophysical diversity and multiple timescales in neurons and synapses for circuit performance. Curr Opin Neurobiol, 37:44–52, 4 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] [27].Goaillard Jean and Marder Eve. Ion channel degeneracy, variability, and covariation in neuron and circuit resilience. Annu Rev Neurosci, 44:335–357, 7 2021. [DOI] [PubMed] [Google Scholar]

[R28] [28].Goaillard Jean, Taylor Adam, Schulz David, and Marder Eve. Functional consequences of animal-to-animal variation in circuit parameters. Nature Neuroscience, 12:1424–1430, 11 2009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] [29].Goldman M. S. Robust Persistent Neural Activity in a Model Integrator with Multiple Hysteretic Dendrites per Neuron. Cerebral Cortex, 13(11):1185–1195, November 2003. [DOI] [PubMed] [Google Scholar]

[R30] [30].Goldman Mark S, Compte A, and Wang X. J. Neural Integrator Models, pages 165–178. Elsevier Ltd, 2010. [Google Scholar]

[R31] [31].Golubitsky M., Stewart I., Buono P-L, and Collins J.J. Symmetry in locomotor central pattern generators and animal gaits. Nature, 401:693–695, 1999. [DOI] [PubMed] [Google Scholar]

[R32] [32].Golubitsky Martin, Stewart Ian, Buono Pietro-Luciano, and Collins J.J. A modular network for legged locomotion. Physica D: Nonlinear Phenomena, 115(1):56–72, 1998. [Google Scholar]

[R33] [33].Graybiel Ann M. The basal ganglia and chunking of action repertoires. Neurobiology of Learning and Memory, 70(1):119–136, 1998. [DOI] [PubMed] [Google Scholar]

[R34] [34].Grillner S. and Wallén P. Cellular bases of a vertebrate locomotor system – steering, intersegmental and segmental co-ordination and sensory control. Brain Res. Rev., 40:92–106, 2002. [DOI] [PubMed] [Google Scholar]

[R35] [35].Hahnloser R. H., Sarpeshkar R., Mahowald M.A., Douglas R.J., and Seung H.S. Digital selection and analogue amplification coexist in a cortex-inspired silicon circuit. Nature, 405:947–951, 2000. [DOI] [PubMed] [Google Scholar]

[R36] [36].Hahnloser R. H., Seung H.S., and Slotine J.J. Permitted and forbidden sets in symmetric threshold-linear networks. Neural Comput., 15(3):621–638, 2003. [DOI] [PubMed] [Google Scholar]

[R37] [37].Richard Hahnloser H. Sebastian Seung, and Jean-Jacques Slotine. Permitted and forbidden sets in symmetric threshold-linear networks. Neural Comput, 15:621–638, 3 2003. [DOI] [PubMed] [Google Scholar]

[R38] [38].HARTLINE H. K. and RATLIFF F. Spatial summation of inhibitory influences in the eye of Limulus, and the mutual interaction of receptor units. J Gen Physiol, 41(5):1049–1066, May 1958. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] [39].Hopfield J.J. Neural networks and physical systems with emergent collective computational abilities. Proc. Natl. Acad. Sci., 79(8):2554–2558, 1982. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R40] [40].Hopfield J.J. Neurons with graded response have collective computational properties like those of two-sate neurons. Proc. Natl. Acad. Sci., 81:3088–3092, 1984. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R41] [41].Ijspeert Auke Jan. Central pattern generators for locomotion control in animals and robots: A review. Neural Networks, 21(4):642–653, 2008. Robotics and Neuroscience. [DOI] [PubMed] [Google Scholar]

[R42] [42].Kamaleddin Mahdi. Degeneracy in the nervous system: from neuronal excitability to neural coding. Bioessays, 44:e2100148, 1 2022. [DOI] [PubMed] [Google Scholar]

[R43] [43].Khona Mikail and Fiete Ila R. Attractor and integrator networks in the brain. Nature Reviews Neuroscience, 23(12):744–766, Dec 2022. [DOI] [PubMed] [Google Scholar]

[R44] [44].Kornysheva Katja and Diedrichsen Jörn. Human premotor areas parse sequences into their spatial and temporal features. eLife, 3:e03043, aug 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R45] [45].Koulakov Alexei A., Raghavachari Sridhar, Kepecs Adam, and Lisman John E. Model for a robust neural integrator. Nature Neuroscience, 5(8):775–782, Aug 2002. [DOI] [PubMed] [Google Scholar]

[R46] [46].Lappalainen Janne K., Tschopp Fabian D., Prakhya Sridhama, Mason McGill Aljoscha Nern, Shinomiya Kazunori, Shin ya Takemura Eyal Gruntman, Macke Jakob H., and Turaga Srinivas C. Connectome-constrained deep mechanistic networks predict neural responses across the fly visual system at single-neuron resolution. bioRxiv, 2023. [DOI] [PMC free article] [PubMed]

[R47] [47].Lashley Karl S. The problem of serial order in behavior. 1951.

[R48] [48].Latorre Roberto, Varona Pablo, and Rabinovich Mikhail I. Rhythmic control of oscillatory sequential dynamics in heteroclinic motifs. Neurocomputing, 331:108–120, 2019. [Google Scholar]

[R49] [49].Laureline Logiaco L.F. Abbott, and Sean Escola. Thalamic control of cortical dynamics in a model of flexible motor sequencing. Cell Reports, 35(9):109090, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R50] [50].Long Michael A., Jin Dezhe Z., and Fee Michale S. Support for a synaptic chain model of neuronal sequence generation. Nature, 468(7322):394–399, Nov 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R51] [51].Marder E. and Bucher D. Central pattern generators and the control of rhythmic movements. Curr. Bio., 11(23):R986–996, 2001. [DOI] [PubMed] [Google Scholar]

[R52] [52].Mazurek Mark E., Roitman Jamie D., Ditterich Jochen, and Shadlen Michael N. A Role for Neural Integrators in Perceptual Decision Making. Cerebral Cortex, 13(11):1257–1269, 11 2003. [DOI] [PubMed] [Google Scholar]

[R53] [53].Morrison K. and Curto C. Predicting neural network dynamics via graphical analysis. Book chapter in Algebraic and Combinatorial Computational Biology, edited by Robeva R. and Macaulay M. Elsevier, 2018.

[R54] [54].Morrison K., Degeratu A., Itskov V., and Curto C. Diversity of emergent dynamics in competitive threshold-linear networks. Available at https://arxiv.org/abs/1605.04463

[R55] [55].Eadweard. Muybridge. Attitudes of Animals in Motion, Illustrated with the Zoopraxiscope. University of Pennsylvania, 1891.

[R56] [56].Nikitchenko Maxim and Koulakov Alexei. Neural integrator: a sandpile model. Neural computation, 20(10):2379–2417, Oct 2008. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R57] [57].Sara Oliveira Santos Nils Tack, Su Yunxing, Cuenca-Jiménez Francisco, Morales-Lopez Oscar, Gomez-Valdez P. Antonio, and Wilhelmus Monica M . Pleobot: a modular robotic solution for metachronal swimming. Scientific Reports, 13(1):9574, Jun 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R58] [58].Panchin Y. V., Arshavsky Y. I., Deliagina T. G., Popova L. B., and Orlovsky G. N. Control of locomotion in marine mollusk clione limacina. ix. neuronal mechanisms of spatial orientation. Journal of Neurophysiology, 73(5):1924–1937, 1995. [DOI] [PubMed] [Google Scholar]

[R59] [59].Parmelee Caitlyn,Londono Alvarez Juliana, Curto Carina, and Morrison Katherine. Sequential attractors in combinatorial threshold-linear networks. SIAM Journal on Applied Dynamical Systems, 21(2):1597–1630, 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R60] [60].Parmelee Caitlyn, Castañeda Joaquin, Morrison Katherine, and Curto Carina. New core motifs paper.

[R61] [61].Parmelee Caitlyn, Moore Samantha, Morrison Katherine, and Curto Carina. Core motifs predict dynamic attractors in combinatorial threshold-linear networks. PLOS ONE, 17(3):1–21, 03 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R62] [62].Penn Yaron, Segal Menahem, and Moses Elisha. Network synchronization in hippocampal neurons. Proceedings of the National Academy of Sciences, 113(12):3341–3346, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R63] [63].Pisarchik Alexander N. and Feudel Ulrike. Control of multistability. Physics Reports, 540(4):167–218, 2014. Control of multistability. [Google Scholar]

[R64] [64].Prinz Astrid A., Bucher Dirk, and Marder Eve. Similar network activity from disparate circuit parameters. Nature Neuroscience, 7:1345–1352, 12 2004. [DOI] [PubMed] [Google Scholar]

[R65] [65].Rabinovich Mikhail I. and Varona Pablo. Discrete sequential information coding: Heteroclinic cognitive dynamics. Frontiers in Computational Neuroscience, 12, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R66] [66].Sakai Katsuyuki, Kitaguchi Katsuya, and Hikosaka Okihide. Chunking during human visuomotor sequence learning. Exp. Brain Res., 152(2):229–242, September 2003. [DOI] [PubMed] [Google Scholar]

[R67] [67].Seung H.S. and Yuste R. Principles of Neural Science, chapter Appendix E: Neural networks, pages 1581–1600. McGraw-Hill Education/Medical, 5th edition, 2012. [Google Scholar]

[R68] [68].Sebastian Seung H., Daniel D.Lee, Reis Ben Y, and Tank David W. Stability of the memory of eye position in a recurrent network of conductance-based model neurons. Neuron, 26(1):259–271, 2000. [DOI] [PubMed] [Google Scholar]

[R69] [69].Su Cui and Pang Jun. Sequential temporary and permanent control of boolean networks. In Computational Methods in Systems Biology: 18th International Confer-ence, CMSB 2020, Konstanz, Germany, September 23–25, 2020, Proceedings, page 234–251, Berlin, Heidelberg, 2020. Springer-Verlag. [Google Scholar]

[R70] [70].Tsodyks M. V., Skaggs W. E., Sejnowski T. J., and McNaughton B. L. Paradoxical effects of external modulation of inhibitory interneurons. J Neurosci, 17(11):4382–4388, Jun 1997. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R71] [71].Varona Pablo, Rabinovich Mikhail I., Selverston Allen I., and Arshavsky Yuri I. Winnerless competition between sensory neurons generates chaos: A possible mechanism for molluscan hunting behavior. Chaos: An Interdisciplinary Journal of Nonlinear Science, 12(3):672–677, 2002. [DOI] [PubMed] [Google Scholar]

[R72] [72].Vasa Suresh, Ma Tao, Byadarhaly Kiran V., Perdoor Mithun, and Minai Ali A. A spiking neural model for the spatial coding of cognitive response sequences. In 2010 IEEE 9th International Conference on Development and Learning, pages 140–146, 2010.

[R73] [73].White J.A., Chow C.C., Ritt J., Soto-Treviño C., and Kopell N. Synchronization and oscillatory dynamics in heterogeneous, mutually inhibited neurons. J. Comput. Neurosci., 5(1):5–16, 1998. [DOI] [PubMed] [Google Scholar]

[R74] [74].Whittington M.A., Traub R.D., Kopell N., Ermentrout B., and Buhl E.H. Inhibitionbased rhythms: experimental and mathematical observations on network dynamics. Int. J. Psychophysiol., 38(3):315–336, 2000. [DOI] [PubMed] [Google Scholar]

[R75] [75].Williams Thelma L. A new model for force generation by skeletal muscle, incorporating work-dependent deactivation. J. Exp. Biol., 213(4):643–650, February 2010. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R76] [76].Wong Aaron L and Krakauer John W. Why are sequence representations in primary motor cortex so elusive? Neuron, 103(6):956–958, September 2019. [DOI] [PubMed] [Google Scholar]

[R77] [77].Xie X., Hahnloser R. H., and Seung H.S. Selectively grouping neurons in recurrent networks of lateral inhibition. Neural Comput., 14:2627–2646, 2002. [DOI] [PubMed] [Google Scholar]

[R78] [78].Yokoi Atsushi and Diedrichsen Jörn. Neural organization of hierarchical motor sequence representations in the human neocortex. Neuron, 103(6):1178–1190.e7, 2019. [DOI] [PubMed] [Google Scholar]

[R79] [79].Yuste R., MacLean J.N., Smith J., and Lansner A. The cortex as a central pattern generator. Nat. Rev. Neurosci., 6:477–483, 2005. [DOI] [PubMed] [Google Scholar]

PERMALINK

This is a preprint.

ATTRACTOR-BASED MODELS FOR SEQUENCES AND PATTERN GENERATION IN NEURAL CIRCUITS

Juliana Londono Alvarez

Abstract

Chapter 1 | Introduction

Figure 1.1. Summary of models as attractor networks.

Sequences of sequential attractors.

Figure 1.2. Examples of sequential attractors.

Desired properties of models.

Summary of models.

Figure 1.3. Summary of models I.

Figure 1.4. Summary of models II.

New network theory.

Chapter 2 | Review of relevant background

2.1. Firing rate models and attractors

Figure 2.1.

Figure 2.2.

2.2. Threshold-linear networks (TLNs)

Definition 1.

Definition 2 ([15, Definitions 1 and 2]).

Figure 2.3. Hyperplane chambers cartoon.

Corollary 3 ([15]).

2.3. Combinatorial threshold-linear networks (CTLNs)

Figure 2.4. CTLNs.

Definition 4.

Figure 2.5. CTLN example.

2.3.1. Core motifs

Definition 5 ([17]).

Figure 2.6. CTLN attractors.

Theorem 6.

2.3.2. Graph rules

Theorem 7 (uniform in-degree, [15]).

Figure 2.7. Three families of uniform in-degree graphs.

Figure 2.8. Graphical domination.

Definition 8.

Theorem 9.

2.3.3. Cyclic unions and sequential attractors

Figure 2.9. Cyclic unions.

Definition 10.

Theorem 11 (cyclic unions, theorem 13 in [15]).

Figure 2.10. FP(G) of cyclic union.

2.3.4. Fusion attractors

Figure 2.11.

Chapter 3 | Sequences of attractors

Figure 3.1. Sequences of attractors.

3.1. Sequences of fixed point attractors

CTLN counter.

Figure 3.2. Construction of counter.

Figure 3.3. Hysteresis.

Figure 3.4. CTLN counters.

CTLN signed counter.

Robustness of CTLN counters.

Figure 3.5. Good parameter grid for CTLN counters.

Figure 3.6. Noisy CTLN counters.

3.2. Sequences of dynamic attractors of the same type

Figure 3.7. Dynamic attractor chains.

Robustness of dynamic attractor chain.

Figure 3.8. Noisy dynamic attractors chain.

Chapter 4 | Central pattern generators

Figure 4.1. Coexistent dynamic attractors.

4.1. Cyclic unions as pattern generators

Figure 4.2. Cyclic unions as pattern generators.

Figure 4.3. Cyclic unions and generalizations.

4.2. Quadruped gaits

Figure 4.4. Gait phase relations.

4.2.1. Construction of gaits

Single gaits

Figure 4.5. Building blocks.

Figure 4.6. Design of two different quadrupedal gaits.

Figure 4.7.

4.2.2. Coexistent gaits

4.2.3. Parameter analyses

Good parameters.

Corollary 1.

Figure 4.8. Gaits survival under several values of ε, δ, θ.

Parameter modulation.

Figure 4.9. Amplitude and period of the gaits in the five-gait network.

4.2.4. Robustness to noise

Figure 4.10. Percentage of lost gaits in noisy five-gait networks.

Figure 2.10. $FP (G)$ of cyclic union.

Figure 4.8. Gaits survival under several values of $ε$ , $δ$ , $θ$ .

Theorem 21 ( $FP (G)$ menu for simply-embedded partitions).

Figure 6.3. Strongly simply-embedded partition with $FP (G)$ .

Theorem 21 ( $FP (G)$ menu for simply-embedded partitions).