Author manuscript; available in PMC: 2017 Aug 16.
Published in final edited form as: Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Nov 14;84(5 Pt 1):051908. doi: 10.1103/PhysRevE.84.051908

Beyond the edge of chaos: Amplification and temporal integration by recurrent networks in the chaotic regime

T Toyoizumi 1,2,*, L F Abbott 1
PMCID: PMC5558624  NIHMSID: NIHMS795941  PMID: 22181445

Abstract

Randomly connected networks of neurons exhibit a transition from fixed-point to chaotic activity as the variance of their synaptic connection strengths is increased. In this study, we analytically evaluate how well a small external input can be reconstructed from a sparse linear readout of network activity. At the transition point, known as the edge of chaos, networks display a number of desirable features, including large gains and integration times. Away from this edge, in the nonchaotic regime that has been the focus of most models and studies, gains and integration times fall off dramatically, which implies that parameters must be fine tuned with considerable precision if high performance is required. Here we show that, near the edge, decoding performance is characterized by a critical exponent that takes a different value on the two sides. As a result, when the network units have an odd saturating nonlinear response function, the falloff in gains and integration times is much slower on the chaotic side of the transition. This means that, under appropriate conditions, good performance can be achieved with less fine tuning beyond the edge, within the chaotic regime.


The dynamic state of a network of neurons influences its information processing capabilities [1]. A network of recurrently connected neurons generates complex dynamics that have been utilized for various computational purposes [2–4]. Large networks of this type exhibit a sharp transition from nonchaotic to chaotic dynamics [5,6], and performance has been characterized as optimal at the edge of this transition [4,7–9]. The location of the transition depends on properties of the inputs to the network [10,11], so maintaining a network right at the edge of chaos would require finely tuning parameters for each input, which is impractical. Therefore, it is important to determine how performance degrades away from this optimal transition point. Here, using a model that is amenable to analytic calculation, we find that, under many circumstances, performance degrades more slowly on the chaotic side of the transition than on the nonchaotic side, showing that it is advantageous to work in the chaotic regime when fine tuning to the edge of chaos cannot be achieved.

Our study is based on a dynamic mean-field calculation applied to randomly connected networks. We compute the signal-to-noise ratio for reconstructing a small external input from a sparse linear readout, a standard network task. This ratio bounds decoding accuracy for both static [12] and dynamic [13] stimuli. To quantify the behavior of the signal-to-noise ratio near the transition point, we evaluate its critical exponents. The analytic expression for the signal-to-noise ratio provides an intuitive picture of the tradeoff between increasing the signal through a larger gain and increasing chaotic noise. The presence of observation noise emphasizes the importance of increasing the signal over decreasing the internally generated noise, providing an advantage to the chaotic state. As outlined above, in the presence of observation noise, the signal-to-noise ratio is maximized at the edge of chaos, where the network time constant shows a critical slowing and small inputs are highly amplified. In addition, at a given distance from the transition point, the chaotic state is often more informative than the nonchaotic state and provides longer-lasting memory.

I. MODEL AND METHOD

We use the dynamic mean-field method [5,10,11,14] to analyze responses of randomly connected networks to external input. For simplicity, we study a discrete-time model with N units. The dynamics of the recurrent input to unit i on trial a (where each trial starts from a different initial condition; what we call trials are also known as replicas) is described by

$$h_i^a(t) = \sum_{j=1}^{N} J_{ij}\,\phi_j^a(t), \qquad (1)$$

where $J_{ij}$ is the coupling strength from unit j to unit i,

$$\phi_j^a(t) \equiv \phi\!\left(\theta(t-1) + h_j^a(t-1)\right) \qquad (2)$$

is an abbreviation for a saturating response nonlinearity, ϕ, and θ(t) is a spatially uniform external input. Our goal is to determine how accurately and over what time period θ(t) can be decoded from a linear readout of network activity [15,16]. Each coupling strength is independently and randomly drawn from a distribution with zero mean and standard deviation $g/\sqrt{N}$. For the purposes of calculation, we use a Gaussian distribution, but distributions that include a δ function at zero, corresponding to sparse connections, or that have discrete support, corresponding to a finite number of possible connection strengths, give the same results in the limit of large N.
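As an illustration (not part of the original analysis), the following minimal Python sketch simulates the discrete-time dynamics of Eqs. (1) and (2) for a tanh nonlinearity and Gaussian couplings with standard deviation $g/\sqrt{N}$; all parameter values are arbitrary choices for the example.

```python
import numpy as np

# Minimal sketch of the network of Eqs. (1)-(2): h_i(t) = sum_j J_ij phi(theta(t-1) + h_j(t-1)).
rng = np.random.default_rng(0)
N, T, g = 1000, 200, 1.5                      # network size, number of time steps, coupling gain
J = rng.normal(0.0, g / np.sqrt(N), (N, N))   # J_ij: zero mean, variance g^2/N
theta = np.zeros(T)                           # spatially uniform external input theta(t)

h = rng.normal(0.0, 1.0, N)                   # one trial = one initial condition
for t in range(1, T):
    phi = np.tanh(theta[t - 1] + h)           # phi_j(t), Eq. (2)
    h = J @ phi                               # h_i(t), Eq. (1)

# For g > 1 the population variance of h settles near the order parameter q0 discussed below.
print("empirical variance of h:", h.var())
```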

We introduce a mean-field distribution for the set of state variables $h = \{h_i^a(t)\}$ through a Dirac delta function constraint,

$$P(h) = \left[\prod_{i,t,a}\delta\!\left(h_i^a(t) - \sum_{j=1}^{N} J_{ij}\,\phi_j^a(t)\right)\right]_J, \qquad (3)$$

with [·]J denoting an average over random Gaussian couplings. In the following, we make use of two other averages: E[·| J], which is an average over trials with a fixed {Jij}, and E[·] ≡ [E[·| J ]]J. In other words, to compute E[·| J], we average over h using the distribution inside the brackets in Eq. (3), removing the average over {Jij}. To compute E[·], we average over h weighted by the full J-averaged distribution of Eq. (3).

Calculations using the dynamic mean-field method [10,14], in the limit of large N, give the moment-generating function for h (see Appendix A),

$$E\!\left[\exp\!\left(\sum_{i,t,a}\xi_i^a(t)\, h_i^a(t)\right)\right] \approx \exp\!\left(N f\!\left(\xi, q(\xi), \hat q(\xi)\right)\right), \qquad (4)$$

where $\xi = \{\xi_i^a(t)\}$ is the parameter of the generating function, f is the free energy, and the order parameters $q = \{q^{ab}(t,s)\}$ and $\hat q = \{\hat q^{ab}(t,s)\}$ are determined self-consistently by the saddle-point equations

$$\frac{\partial f}{\partial q} = 0 \quad\text{and}\quad \frac{\partial f}{\partial \hat q} = 0. \qquad (5)$$

In principle, all the moments of h can be obtained by evaluating the derivatives of the generating function. In particular, the average is $E[h_i^a(t)] = 0$ and the correlation is

$$E\!\left[h_i^a(t)\, h_j^b(s)\right] = \delta_{ij}\, q^{ab}(t,s). \qquad (6)$$

All cumulants above second order are O(1/N).

When the input is constant in time, the system converges to a stationary state. In this case, the self-consistent solution for the order parameter is determined by only two parameters,

$$q^{ab}(t,s) = (q_0 - q)\,\delta_{ab}\delta_{ts} + q, \qquad (7)$$

satisfying, self-consistently,

$$q_0 = g^2\int Dx\,\phi\!\left(\theta + \sqrt{q_0}\,x\right)^2, \qquad (8)$$
$$q = g^2\int Dx\,Dy\,\phi\!\left(\theta + \sqrt{q_0}\,x\right)\phi\!\left(\theta + \frac{q}{\sqrt{q_0}}\,x + \sqrt{\frac{q_0^2 - q^2}{q_0}}\,y\right)$$

with

$$Dx \equiv \frac{dx\, e^{-x^2/2}}{\sqrt{2\pi}}. \qquad (9)$$

For θ = 0, using a hyperbolic tangent nonlinearity ϕ(x) = tanh(x), q = 0 is a stable solution, so the order parameter simplifies to

$$q^{ab}(t,s) = q_0\,\delta_{ab}\delta_{ts}, \qquad (10)$$

with q0 increasing from zero in the chaotic region, g>1 [Fig. 1(a)].
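For a concrete picture of Fig. 1(a), the self-consistency condition of Eq. (8) at θ = 0 (with q = 0) can be solved numerically. The sketch below is an illustration, not the authors' code; it uses Gauss–Hermite quadrature for the Gaussian measure Dx and a simple fixed-point iteration, and the helpers x, w, and q0_of_g are reused in later sketches.

```python
import numpy as np

# Solve q0 = g^2 * Int Dx tanh(sqrt(q0) x)^2 by fixed-point iteration (Eq. (8), theta = 0, q = 0).
x, w = np.polynomial.hermite_e.hermegauss(101)   # nodes/weights for the weight exp(-x^2/2)
w = w / np.sqrt(2.0 * np.pi)                     # normalize so that sum(w * f(x)) ~ Int Dx f(x)

def q0_of_g(g, iters=2000):
    q0 = 1.0                                     # start above the fixed point
    for _ in range(iters):
        q0 = g**2 * np.sum(w * np.tanh(np.sqrt(q0) * x)**2)
    return q0

for g in [0.8, 1.0, 1.2, 1.5]:
    print(g, q0_of_g(g))                         # q0 -> 0 for g <= 1, q0 > 0 for g > 1
```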

FIG. 1. (Color online) The self-consistent solution for the order parameters, q0 and q (which is 0), in the stationary state (a), and the Lyapunov exponent (b), plotted as functions of the synaptic variability, g. The dotted line at g = 1 indicates the edge of chaos.

The chaotic state is characterized by a Lyapunov exponent given in Ref. [10]

$$\frac{1}{2}\ln\!\left(g^2\int Dx\left[\phi'\!\left(\theta + \sqrt{q_0}\,x\right)\right]^2\right), \qquad (11)$$

which increases more rapidly below the transition to chaos (g<1) than above the transition [g>1; Fig. 1(b)]. This difference foreshadows the asymmetric behavior of the signal detection and integration properties analyzed below, but it is important to point out that our later results do not follow directly from this feature of the Lyapunov exponent. The Lyapunov exponent, which determines how two trajectories starting from nearby initial conditions diverge, and the parameter we use to characterize signal detection and integration measure different things and, depending on the nonlinearity used, can take dissimilar values.
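As an illustrative check (assuming the quadrature helpers x, w and the solver q0_of_g from the previous sketch are still in scope), the Lyapunov exponent of Eq. (11) can be evaluated for ϕ = tanh; it is negative for g < 1 and positive for g > 1, as in Fig. 1(b).

```python
import numpy as np

# Lyapunov exponent of Eq. (11) for phi = tanh, reusing x, w, q0_of_g from the previous sketch.
def lyapunov(g):
    q0 = q0_of_g(g)
    dphi = 1.0 / np.cosh(np.sqrt(q0) * x)**2          # phi'(u) = sech^2(u)
    return 0.5 * np.log(g**2 * np.sum(w * dphi**2))   # Eq. (11)

for g in [0.5, 0.9, 1.0, 1.1, 1.5]:
    print(g, lyapunov(g))                             # crosses zero at the edge of chaos, g = 1
```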

II. SIGNAL-TO-NOISE RATIO

We now evaluate the signal-to-noise ratio for sparse linear decoders designed to optimally read out a dynamic input θ(t) from a fixed subset of K (≪ N) units. We assume that the measurement of the total input to network unit i on trial a, $\theta(t) + h_i^a(t)$, is corrupted by Gaussian measurement noise, $\sigma_{\mathrm{obs}}\eta$, of mean zero and variance $\sigma_{\mathrm{obs}}^2$, so that the actual measured value is

$$v_i^a(t) = \theta(t) + h_i^a(t) + \sigma_{\mathrm{obs}}\,\eta_i^a(t). \qquad (12)$$

We limit our analysis to odd nonlinear functions, ϕ(x) = −ϕ(−x), because this simplifies the calculation. To evaluate the signal-to-noise ratio, we consider a small deviation of the external input from θ = 0 occurring at time t0. We could alternatively consider decoding information from the nonlinear function of the total input, $\phi(\theta + h) + \sigma_{\mathrm{obs}}\eta$, but the result is unaltered to leading order for finite $\sigma_{\mathrm{obs}}$ around the edge of chaos because ϕ(θ + h) ≈ θ + h there.

The average (over networks) signal-to-noise ratio for an optimal linear decoder reading out this perturbation after a measurement period lasting from t0 to T is [12]

$$R(t_0) \equiv \sum_{i,j}\sum_{t,s=t_0}^{T}\left[\frac{\partial\mu_i(t)}{\partial\theta(t_0)}\,D_{ij}(t,s)\,\frac{\partial\mu_j(s)}{\partial\theta(t_0)}\right]_J, \qquad (13)$$

where

$$\mu_i(t) = E\!\left[v_i^a(t)\,\middle|\,J\right], \qquad (14)$$

and the sums over i and j are restricted to the K units being used in the readout. The quantities in $R(t_0)$ are all evaluated at θ = 0. The matrix $D \equiv C^{-1}$ is the inverse of the trial-averaged covariance of the observed K units for a given network (i.e., for a specific {Jij}), whose elements are described by

$$C_{ij}(t,s) = \mathrm{Cov}\!\left[v_i^a(t),\,v_j^a(s)\,\middle|\,J\right] = E\!\left[\left(v_i^a(t)-\mu_i(t)\right)\left(v_j^a(s)-\mu_j(s)\right)\,\middle|\,J\right]. \qquad (15)$$

It is important for what follows that this covariance matrix has dimensions K×K, not N×N, and that D is the inverse of this K×K matrix.

The memory curve for an optimal linear decoder, which is sometimes used to quantify the ability of networks to buffer past input [15,16], is identical to Eq. (13) for small input. Equation (13) characterizes the accuracy of a readout based on the trial mean μ. Generally, information could also be read out from higher-order statistics by using nonlinear decoders [12,17,18]. In this sense, Eq. (13) is a lower bound on the information available from more general nonlinear decoders. Note that the optimal linear readout weights depend on the specific {Jij}, so it is necessary to adjust the decoder for each network.
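To make the decoding setup concrete, the sketch below (an assumed illustration, not the protocol used for the figures in this paper) fits a least-squares linear readout for one specific network, recovering a delayed input θ(t − τ) from K noisy observed units as in Eq. (12); the fitted weights are specific to the drawn {Jij}.

```python
import numpy as np

# Fit an optimal (least-squares) linear decoder for one network realization.
rng = np.random.default_rng(1)
N, K, g, sigma_obs, tau, T = 1000, 20, 0.9, 0.1, 3, 5000
J = rng.normal(0.0, g / np.sqrt(N), (N, N))
theta = 0.1 * rng.standard_normal(T)                 # small random input to be decoded

h = np.zeros(N)
V = np.empty((T, K))                                 # noisy observations of K units, Eq. (12)
for t in range(T):
    if t > 0:
        h = J @ np.tanh(theta[t - 1] + h)            # network dynamics, Eqs. (1)-(2)
    V[t] = theta[t] + h[:K] + sigma_obs * rng.standard_normal(K)

X, y = V[tau:], theta[:T - tau]                      # decode theta(t - tau) from v(t)
w_opt, *_ = np.linalg.lstsq(X, y, rcond=None)        # readout weights for this {J_ij}
r2 = 1.0 - np.mean((X @ w_opt - y)**2) / np.var(y)
print("decoding r^2 at delay tau =", tau, ":", r2)
```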

From the mean-field analysis, we find that each element of the covariance matrix converges to its averaged value in the limit of large N (see Appendix B), i.e.,

$$C_{ij}(t,s) = \left[C_{ij}(t,s)\right]_J + O\!\left(N^{-1/2}\right), \qquad (16)$$

with

$$\left[C_{ij}(t,s)\right]_J = \sigma_{\mathrm{obs}}^2\,\delta_{ij}\delta_{ts} + \left[\mathrm{Cov}\!\left[h_i^a(t),\,h_j^a(s)\,\middle|\,J\right]\right]_J = \delta_{ij}\delta_{ts}\left(\sigma_{\mathrm{obs}}^2 + q_0\right) \qquad (17)$$

evaluated at θ = 0. The O(N^{−1/2}) term in Eq. (16) introduces corrections of order K/N into R [Eq. (13)]. To avoid these corrections, we restrict our analysis to the case $K \ll O(N)$. This assures that the O(N^{−1/2}), J-specific residuals in Eq. (16) do not contribute to R for large N, and we find

$$R(t_0) = \frac{1}{\sigma_{\mathrm{obs}}^2 + q_0}\sum_i\sum_{t=t_0}^{T}\left[\frac{\partial\mu_i(t)}{\partial\theta(t_0)}\,\frac{\partial\mu_i(t)}{\partial\theta(t_0)}\right]_J, \qquad (18)$$

with

$$\left[\frac{\partial\mu_i(t)}{\partial\theta(t_0)}\,\frac{\partial\mu_i(t)}{\partial\theta(t_0)}\right]_J = \left[\frac{\partial^2 E\!\left[v_i^a(t)\middle|J\right]E\!\left[v_i^b(t)\middle|J\right]}{\partial\theta^a(t_0)\,\partial\theta^b(t_0)}\Bigg|_{\theta^a=\theta^b=\theta}\right]_J = \frac{\partial^2 E\!\left[v_i^a(t)\,v_i^b(t)\right]}{\partial\theta^a(t_0)\,\partial\theta^b(t_0)}\Bigg|_{\theta^a=\theta^b=\theta} = \frac{\partial^2\!\left(\theta^a(t)\,\theta^b(t) + q^{ab}(t,t)\right)}{\partial\theta^a(t_0)\,\partial\theta^b(t_0)}\Bigg|_{\theta^a=\theta^b=\theta} \qquad (19)$$

for $a \neq b$. This means that, to evaluate R, we need to evaluate the second derivative of the order parameter, q. This calculation simplifies for an odd nonlinear response function (see Appendix C), and we obtain

$$R(t_0) = \frac{K\sum_{t=t_0}^{T}\gamma^{\,t-t_0}}{\sigma_{\mathrm{obs}}^2 + q_0} \qquad (20)$$

with

$$\gamma \equiv \left(g\int Dx\,\phi'\!\left(\sqrt{q_0}\,x\right)\right)^{2}, \qquad (21)$$

which is the square of g times the effective gain (slope) of the response nonlinearity.

Equation (20) tells us that, at any particular time during the measurement period, R receives a contribution from the past input being detected that decays exponentially in time. The decay constant γ therefore determines the memory lifetime of the network, which is −1/ln(γ), and γ near 1 indicates a long memory lifetime. The denominator in Eq. (20) sums two sources of noise, the measurement noise $\sigma_{\mathrm{obs}}^2$ and the internal network noise quantified by q0. The best strategy for increasing R is to minimize the internally generated noise, q0, and to make γ as close to 1 as possible to allow long-time integration of the signal. In the presence of large observation noise, $\sigma_{\mathrm{obs}}^2 \gg q_0$, the value of R is dominated by how close γ is to 1.

The lifetime variable γ reaches its maximum value γ = 1 at the edge of chaos and, importantly, it decreases more slowly in the chaotic regime than in the nonchaotic regime (Fig. 2). This indicates that, although optimal performance occurs at the edge of chaos and requires fine tuning of g to 1, for a given magnitude of detuning from this value (i.e., a given |g −1|), γ is closer to 1 in the chaotic regime (g> 1; Fig. 2).
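The decay factor is easy to evaluate numerically. The sketch below (an illustration, again assuming x, w, and q0_of_g from the earlier sketches are in scope) computes γ from Eq. (21) and the memory lifetime −1/ln(γ) for ϕ = tanh on both sides of the edge; the helper gamma_of_g is reused later.

```python
import numpy as np

# Decay factor gamma of Eq. (21) and memory lifetime -1/ln(gamma) for phi = tanh.
def gamma_of_g(g):
    q0 = q0_of_g(g)
    mean_slope = np.sum(w / np.cosh(np.sqrt(q0) * x)**2)   # effective gain, Int Dx phi'(sqrt(q0) x)
    return (g * mean_slope)**2                             # Eq. (21); reduces to g^2 when q0 = 0

for g in [0.9, 0.99, 1.01, 1.1]:
    gam = gamma_of_g(g)
    print(g, gam, -1.0 / np.log(gam))                      # lifetime diverges as gamma -> 1
```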

FIG. 2. (Color online) The factor γ plotted for ϕ(x) = tanh(x) as a function of the synaptic variability, g. γ takes its maximum value of 1 at the edge of chaos (g = 1; dotted line) and falls off more slowly in the chaotic regime (g > 1) than for g < 1.

Assuming an infinitely long observation period,

$$R = \frac{K}{\sigma_{\mathrm{obs}}^2 + q_0}\,\frac{1}{1-\gamma}, \qquad (22)$$

which is plotted as a function of g in Fig. 3 for a fixed value of σobs. When the decay constant γ approaches 1, which happens at the edge of chaos, R diverges because any input perturbation causes a persistent change in network activity. These analytic results agree well with simulation results (Fig. 4).
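As a numerical illustration of Eq. (22) (the analytic curve of Figs. 3 and 4, not the network simulation itself), the sketch below evaluates R/K as a function of g, reusing q0_of_g and gamma_of_g from the sketches above.

```python
import numpy as np

# Analytic signal-to-noise ratio per readout unit, Eq. (22).
def R_over_K(g, sigma_obs=0.1):
    q0 = q0_of_g(g)
    return 1.0 / ((sigma_obs**2 + q0) * (1.0 - gamma_of_g(g)))

for g in [0.8, 0.9, 0.95, 1.05, 1.1, 1.2]:
    print(g, R_over_K(g))      # grows toward the edge; larger at g = 1 + d than at g = 1 - d
```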

FIG. 3. (Color online) The factor K/R plotted for ϕ(x) = tanh(x) as a function of the synaptic variability, g. In the presence of observation noise, R is maximized at the edge of chaos and falls off more slowly in the chaotic regime (g > 1) than for g < 1, reflecting the behavior of γ shown in Fig. 2.

FIG. 4. (Color online) Numerical calculation of Eq. (13) (circles) with σobs = 0.1, compared with the analytic result of Eq. (22) (solid line). The numerical results were obtained by linearly decoding a simulated network with N = 3000 and K = 20. Small circles show the performance of each network; large circles show the average performance across networks. The analytic results match the numerical results well.

III. CRITICAL BEHAVIOR NEAR THE EDGE OF CHAOS

We next analyze the critical behavior of the system near the edge of chaos. By definition, the derivative of ϕ at 0 is 1, so we can expand any odd, monotonically increasing ϕ as

$$\phi(x) = x + \alpha_3\frac{x^3}{3!} + \alpha_5\frac{x^5}{5!} + \cdots. \qquad (23)$$

The Landau expansion of Eq. (8) for small q0 shows that the sign of α3 determines the nature of the phase transition around q0 = 0. The system shows a first-order transition if the nonlinearity is accelerating (α3 > 0). In this case, q0 jumps discontinuously from zero to a positive value at g = 1 as g increases. The transition is second order if the nonlinearity is saturating (α3 < 0). In this case, q0 increases continuously from zero at g = 1 as g increases. The analysis of the critical behavior is much easier for the second-order transition (α3 < 0), the case we examine.
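The distinction between the two transition types can be seen numerically. The sketch below is an illustration with an arbitrarily chosen accelerating example (it assumes x and w from the earlier quadrature sketch are in scope): it iterates the self-consistency condition of Eq. (8) from a small positive seed for a saturating odd function (tanh, α3 < 0) and for an odd function that accelerates near the origin but still saturates, ϕ(x) = tanh(x + x³) (α3 > 0); as g crosses 1, q0 grows continuously from zero in the first case and jumps to a finite value in the second.

```python
import numpy as np

# Compare second-order (alpha_3 < 0) and first-order (alpha_3 > 0) behavior of q0 near g = 1.
def q0_fixed_point(phi, g, q0=1e-3, iters=4000):
    for _ in range(iters):
        q0 = g**2 * np.sum(w * phi(np.sqrt(q0) * x)**2)    # Eq. (8) at theta = 0
    return q0

def saturating(u):        # phi = tanh, alpha_3 = -2
    return np.tanh(u)

def accelerating(u):      # example choice with alpha_3 = +4
    return np.tanh(u + u**3)

for g in [0.99, 1.01, 1.05]:
    print(g, q0_fixed_point(saturating, g), q0_fixed_point(accelerating, g))
```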

We analyze the critical behavior of R near the edge of chaos, that is, for small $\Delta g \equiv g - 1$, using Eqs. (20) and (21). In the nonchaotic regime (Δg < 0), q0 = 0, so the decay factor is γ = g². From Eq. (22), we find that

$$R = \frac{K}{\sigma_{\mathrm{obs}}^2\left(1 - g^2\right)} \approx \frac{K}{2\sigma_{\mathrm{obs}}^2\,|\Delta g|}. \qquad (24)$$

In the chaotic regime (Δg > 0), we expand Eq. (8) for small q0 and find that the order parameter is

$$q_0 = \frac{2}{|\alpha_3|}\,\Delta g + \frac{\alpha_5/\alpha_3^2 - 4/3}{|\alpha_3|}\,\Delta g^2 + O\!\left(\Delta g^3\right). \qquad (25)$$

Based on this expression, the decay factor is $\gamma \approx 1 - \tfrac{2}{3}\,\Delta g^2$ to leading order. Hence, to leading order,

$$R = \frac{3K}{2\sigma_{\mathrm{obs}}^2\,(\Delta g)^2} \qquad (26)$$

near the edge but on the chaotic side. Interestingly, the dependencies of γ and R on ϕ (such as α3 and α5) disappear up to this order. In contrast to the nonchaotic regime, where $R \sim |\Delta g|^{-1}$, the divergence is stronger in the chaotic regime, where $R \sim \Delta g^{-2}$, yielding larger R at an equal distance, |Δg|, from the edge (Fig. 5).
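The two critical exponents can be checked directly against the full expression, reusing q0_of_g, gamma_of_g, and R_over_K from the sketches above; this is an illustrative check, and the asymptotic forms become accurate only as |Δg| → 0 with q0 ≪ σobs².

```python
import numpy as np

# Compare the exact R/K of Eq. (22) with the asymptotic forms of Eqs. (24) and (26).
sigma_obs = 0.3
for dg in [-0.02, -0.01, -0.005, 0.005, 0.01, 0.02]:
    g = 1.0 + dg
    exact = R_over_K(g, sigma_obs)
    asymptotic = (1.0 / (2.0 * sigma_obs**2 * abs(dg)) if dg < 0   # Eq. (24): ~ |dg|^-1
                  else 3.0 / (2.0 * sigma_obs**2 * dg**2))          # Eq. (26): ~ dg^-2
    print(dg, exact, asymptotic)
```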

FIG. 5. (Color online) The critical behavior of R does not depend on details of the nonlinearity or on the noise level. For any saturating odd nonlinear response function, R diverges linearly on the nonchaotic side and quadratically on the chaotic side of the transition. The solid line shows the asymptotic behavior; the dash-dotted line, ϕ(x) = tanh(x) with σobs = 0.1; and the dashed line, $\phi(x) = \mathrm{erf}(\sqrt{\pi}\,x/2)$ with σobs = 0.3.

We have determined analytically that the signal-to-noise ratio of large randomly connected networks diverges at the edge of chaos, where the memory lifetime of the network also diverges. Observation noise is an essential element of this result. Without observation noise, any network without internally generated noise yields an infinite R. The addition of observation noise, on the other hand, emphasizes the benefit of increasing the signal over decreasing the internally generated noise. Hence, if a deterministic network performs sensory or memory processing and a receiver of its output has limited observational resolution, it is advantageous to increase the signal by increasing the network gain. Generally, setting network parameters right at the edge of chaos requires fine tuning. We have shown that, at the same small distance from the edge, R is larger in the chaotic regime than in the nonchaotic regime for any saturating odd nonlinear function.

Although we have concentrated on a rather special situation in this paper for mathematical simplicity, several lines of generalization are possible without losing analytic tractability. First, we have neglected internal stochastic noise within the network. Although neurons behave irregularly in networks, they respond reliably in isolation. This observation has led to the speculation that the dominant apparent stochasticity of cortical circuits is generated by the chaotic dynamics of individually, essentially deterministic neurons [6]. The mean-field analysis with system noise has been studied previously [10]. With the addition of small system noise, R is peaked (but does not diverge) near the edge of chaos on the nonchaotic side. Second, we have concentrated on a class of odd nonlinear response functions. This assumption is a mathematical convenience that simplifies the final expression for R. For a general response nonlinearity, R depends not only on γ and q0 but on other factors as well. Third, although we considered discrete temporal dynamics, it is possible to analyze a continuous-time model in a similar way [5,11]. We believe that the qualitative aspects of the signal-to-noise ratio are common to the two models. Fourth, although we considered unstructured networks in this paper, it would be interesting to study how structured connections change chaotic dynamics [19] and influence signal extraction and integration [20].

Acknowledgments

We thank X. Pitkow, M. Tsodyks, K. Kang, and S. Amari for discussions. T.T. was supported by the Patterson Trust and the Special Postdoctoral Research Program at RIKEN. L.F.A. was supported by the National Institutes of Health (NIH) Director’s Pioneer program, part of the NIH Roadmap for Medical Research, through Grant No. 5-DP1-OD114-02, the Gatsby and Swartz Foundations, and the Kavli Institute for Brain Science at Columbia University.

APPENDIX A: MEAN-FIELD CALCULATION

In this appendix, we calculate the moment-generating function of Eq. (4). We denote $h_i^a(t) = h_{it}^a$ and $\phi(\theta(t-1) + h_j^a(t-1)) = \phi_{jt}^a$. In this appendix, we follow the convention that summation is implied when the same index appears twice in an expression (e.g., $\sum_j J_{ij}\phi_{jt}^a = J_{ij}\phi_{jt}^a$). A calculation of the moment-generating function (as a function of ξ) yields

$$Z(\xi) = \left[\int\!\left(\prod_{i,t,a} dh_{it}^a\right)\exp\!\left(\xi_{it}^a h_{it}^a\right)\prod_{i,t,a}\delta\!\left(h_{it}^a - J_{ij}\phi_{jt}^a\right)\right]_J$$
$$= \int dH\,\exp\!\left(\xi_{it}^a h_{it}^a + i\hat h_{it}^a h_{it}^a\right)\left[\exp\!\left(-i\hat h_{it}^a\,\phi_{jt}^a\,J_{ij}\right)\right]_J$$
$$= \int dH\,\exp\!\left(\xi_{it}^a h_{it}^a + i\hat h_{it}^a h_{it}^a + \frac{g^2}{2N}\, i\hat h_{it}^a\, i\hat h_{is}^b\,\phi_{jt}^a\phi_{js}^b\right)$$
$$= \int dH\int\!\left(\prod_{t,s,a,b} N\, dq_{ts}^{ab}\,\delta\!\left(N q_{ts}^{ab} - g^2\phi_{jt}^a\phi_{js}^b\right)\right)\exp\!\left(\xi_{it}^a h_{it}^a + i\hat h_{it}^a h_{it}^a + \tfrac{1}{2}\, q_{ts}^{ab}\, i\hat h_{it}^a\, i\hat h_{is}^b\right)$$
$$= \int\!\left(\prod_{t,s,a,b}\frac{N\, dq_{ts}^{ab}\, d\hat q_{ts}^{ab}}{2\pi}\right)\int dH\,\exp\!\left(i\hat q_{ts}^{ab}\!\left(N q_{ts}^{ab} - g^2\phi_{jt}^a\phi_{js}^b\right) + \xi_{it}^a h_{it}^a + i\hat h_{it}^a h_{it}^a + \tfrac{1}{2}\, q_{ts}^{ab}\, i\hat h_{it}^a\, i\hat h_{is}^b\right)$$
$$= \int\!\left(\prod_{t,s,a,b}\frac{N\, dq_{ts}^{ab}\, d\hat q_{ts}^{ab}}{2\pi}\right)\exp\!\left(N f(\xi, q, \hat q)\right), \qquad (A1)$$

where $dH \equiv \prod_{i,t,a}\left(dh_{it}^a\, d\hat h_{it}^a/2\pi\right)$,

$$f(\xi, q, \hat q) \equiv i\hat q_{ts}^{ab}\, q_{ts}^{ab} + \frac{1}{N}\log\int e^{\mathcal{L}}\, dH, \qquad (A2)$$

and

$$\mathcal{L} \equiv \xi_{it}^a h_{it}^a + i\hat h_{it}^a h_{it}^a + \tfrac{1}{2}\, q_{ts}^{ab}\, i\hat h_{it}^a\, i\hat h_{is}^b - g^2\, i\hat q_{ts}^{ab}\,\phi_{it}^a\phi_{is}^b. \qquad (A3)$$

Next, we use the saddle-point method to evaluate Z(ξ). To leading order, the integrals over q and $\hat q$ are approximated by their saddle-point values, i.e.,

$$\ln Z(\xi) \approx N f\!\left(\xi, q(\xi), \hat q(\xi)\right), \qquad (A4)$$

and the saddle point, $[q(\xi), \hat q(\xi)]$, is determined self-consistently by solving

$$0 = \frac{\partial f}{\partial q_{ts}^{ab}} = i\hat q_{ts}^{ab} + \frac{1}{2N}\left\langle i\hat h_{it}^a\, i\hat h_{is}^b\right\rangle, \qquad 0 = \frac{\partial f}{\partial\, i\hat q_{ts}^{ab}} = q_{ts}^{ab} - \frac{g^2}{N}\left\langle\phi_{it}^a\phi_{is}^b\right\rangle, \qquad (A5)$$

with the average, 〈·〉, defined as

$$\langle A\rangle \equiv \frac{\int A\, e^{\mathcal{L}}\, dH}{\int e^{\mathcal{L}}\, dH}. \qquad (A6)$$

Equation (A5) is especially easy to solve when ξ = 0 because $\hat q(0) = 0$ is a self-consistent solution of Eq. (A5). To confirm this, we define $\mathcal{L}_0 \equiv \mathcal{L}|_{\xi=0,\,\hat q=0} = i\hat h_{it}^a h_{it}^a + \tfrac{1}{2}\, q_{ts}^{ab}(0)\, i\hat h_{it}^a\, i\hat h_{is}^b$, and find that

$$\left\langle i\hat h_{it}^a\, i\hat h_{is}^b\right\rangle_0 = 2\frac{\partial}{\partial q_{ts}^{ab}}\int e^{\mathcal{L}_0}\, dH = 0. \qquad (A7)$$

Hence, when ξ = 0, a solution of the saddle-point condition is

$$q_{ts}^{ab} = g^2\left\langle\phi_t^a\phi_s^b\right\rangle_0 = g^2\int\!\left(\prod_{t,a}\frac{dh_t^a}{\sqrt{\det(2\pi q)}}\right)\phi_t^a\,\phi_s^b\,\exp\!\left(-\tfrac{1}{2}\left(q^{-1}\right)_{ts}^{ab} h_t^a h_s^b\right),$$
$$\hat q_{ts}^{ab} = 0. \qquad (A8)$$

Note that, in the above expression, $\langle\cdot\rangle_0$ describes a Gaussian average over h with mean zero and covariance $\delta_{ij}\, q_{ts}^{ab}$. The possibility of a $\hat q(0) \neq 0$ solution is not within the scope of this paper (see Ref. [14], for example). Thus, we concentrate on the $\hat q(0) = 0$ solution, Eq. (A8), in the following.

In principle, we can obtain all higher-order cumulants of h by differentiating the cumulant-generating function $\ln Z(\xi) = Nf(\xi, q(\xi), \hat q(\xi))$ with respect to ξ and then setting ξ = 0. From the normalization constraint, ln Z(0) = 0. The first derivative is

$$N\frac{df}{d\xi_{it}^a} = N\frac{\partial f}{\partial\xi_{it}^a} + N\frac{\partial f}{\partial q}\frac{\partial q}{\partial\xi_{it}^a} + N\frac{\partial f}{\partial\hat q}\frac{\partial\hat q}{\partial\xi_{it}^a} = N\frac{\partial f}{\partial\xi_{it}^a}, \qquad (A9)$$

because $\partial f/\partial q = 0$ and $\partial f/\partial\hat q = 0$ at the saddle point, Eq. (A5). Hence, the first-order cumulant is

$$E\!\left[h_{it}^a\right] = N\frac{\partial f}{\partial\xi_{it}^a}\bigg|_{\xi=0} = \left\langle h_{it}^a\right\rangle_0 = 0. \qquad (A10)$$

The calculation of higher-order cumulants becomes easier if we neglect O(1/N) factors. First, $\partial^n f/\partial\xi^n = O(1/N)$ for n ⩾ 1 at ξ = 0. Moreover, the nth (n ⩾ 1) derivatives of the order parameters satisfy $\partial^n q/\partial\xi^n = O(1/N)$ and $\partial^n\hat q/\partial\xi^n = O(1/N)$ at ξ = 0 from Eq. (A5). This means that perturbations to a single unit contribute only ~1/N to the mean-field variables, which are defined by averaging over N units. Hence, terms that contain derivatives of the order parameters contribute only at O(1/N). Thus, for the calculation of higher-order cumulants, the full derivatives with respect to ξ can be approximated by partial derivatives, d/dξ ≈ ∂/∂ξ, and the order parameters can be approximated by their ξ = 0 values, $q(\xi) \approx q(0)$ and $\hat q(\xi) \approx \hat q(0)$. Neglecting O(1/N) terms, we find

$$N f\!\left(\xi, q(0), \hat q(0)\right) = \ln\int dH\,\exp\!\left(\xi_{it}^a h_{it}^a + i\hat h_{it}^a h_{it}^a + \tfrac{1}{2}\, q_{ts}^{ab}(0)\, i\hat h_{it}^a\, i\hat h_{is}^b\right) = \tfrac{1}{2}\, q_{ts}^{ab}(0)\,\xi_{it}^a\,\xi_{is}^b. \qquad (A11)$$

This shows that the mean-field distribution P(h) is, up to O(1/N) terms, a Gaussian distribution for independent units with mean zero and covariance $\delta_{ij}\, q_{ts}^{ab}(0)$. Hence, to this precision, the two averages E[·] and $\langle\cdot\rangle_0$ are indistinguishable.

APPENDIX B: NETWORK SPECIFIC STATISTICS

In this appendix, we evaluate statistics of the state variable, h, for a given {Jij}. The trial mean of a quantity A(h) is written as E[A(h)| J], where each trial has different initial conditions at t → −∞. When the system is ergodic, this trial average does not depend on the specific set of initial conditions. From this definition and Eq. (A11), to leading order, we can derive the following:

$$\left[E\!\left[h_{it}\,\middle|\,J\right]\right]_J = E\!\left[h_{it}\right] = O(1/N) \quad\text{and}\quad \left[\mathrm{Cov}\!\left[h_{it}, h_{js}\,\middle|\,J\right]\right]_J = \left[E\!\left[h_{it}h_{js}\middle|J\right] - E\!\left[h_{it}\middle|J\right]E\!\left[h_{js}\middle|J\right]\right]_J = \left[E\!\left[h_{it}^a h_{js}^a\middle|J\right] - E\!\left[h_{it}^a h_{js}^b\middle|J\right]\right]_J = E\!\left[h_{it}^a h_{js}^a\right] - E\!\left[h_{it}^a h_{js}^b\right] = \delta_{ij}\left(q_{ts}^S - q_{ts}^D\right) + O(1/N), \qquad (B1)$$

where $a \neq b$, $q_{ts}^{aa} = q_{ts}^S$, and $q_{ts}^{ab} = q_{ts}^D$. We now define the network-specific covariance as $\Gamma_{ij;ts} \equiv \mathrm{Cov}[h_{it}, h_{js}\,|\,J]$ and evaluate how Γ differs from one realization of {Jij} to another. The variance of Γ across networks is, from Eq. (A11),

$$\left[\Gamma_{ij;ts}^2\right]_J - \left[\Gamma_{ij;ts}\right]_J^2 = \left[\left(E\!\left[h_{it}^a h_{js}^a\middle|J\right] - E\!\left[h_{it}^a h_{js}^b\middle|J\right]\right)^2\right]_J - \left(E\!\left[h_{it}^a h_{js}^a\right] - E\!\left[h_{it}^a h_{js}^b\right]\right)^2$$
$$= E\!\left[h_{it}^a h_{js}^a h_{it}^c h_{js}^c\right] - 2E\!\left[h_{it}^a h_{js}^a h_{it}^c h_{js}^d\right] + E\!\left[h_{it}^a h_{js}^b h_{it}^c h_{js}^d\right] - \left(E\!\left[h_{it}^a h_{js}^a\right] - E\!\left[h_{it}^a h_{js}^b\right]\right)^2$$
$$= \delta_{ij}\left\{\left(q_{ts}^S q_{ts}^S + q_{ts}^D q_{ts}^D + q_{tt}^D q_{ss}^D\right) - 2\left(q_{ts}^S q_{ts}^D + q_{ts}^D q_{ts}^D + q_{tt}^D q_{ss}^D\right) + \left(2 q_{ts}^D q_{ts}^D + q_{tt}^D q_{ss}^D\right) - \left(q_{ts}^S - q_{ts}^D\right)^2\right\} + \left(1-\delta_{ij}\right)\left\{q_{tt}^D q_{ss}^D - 2 q_{tt}^D q_{ss}^D + q_{tt}^D q_{ss}^D\right\} + O(1/N) = O(1/N), \qquad (B2)$$

where a,b,c,d are all different. Therefore, each component of Γ converges to its network average as

$$\Gamma_{ij;ts} = \left[\Gamma_{ij;ts}\right]_J + O\!\left(N^{-1/2}\right). \qquad (B3)$$
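As an illustration of these trial statistics (an assumed numerical setup with arbitrary parameter choices, not from the paper), the sketch below estimates the covariance of a pair of units across trials for one fixed network in the chaotic stationary state: the diagonal element is close to q0, while the off-diagonal element is small, fluctuating at order $N^{-1/2}$ from one realization of {Jij} to another, consistent with Eqs. (B1) and (B3).

```python
import numpy as np

# Trial statistics of h for one fixed network: Var[h_1 | J] ~ q0, Cov[h_1, h_2 | J] ~ 0.
rng = np.random.default_rng(2)
N, g, trials, T_relax = 1000, 1.5, 400, 50
J = rng.normal(0.0, g / np.sqrt(N), (N, N))

samples = np.empty((trials, 2))                  # stationary h_1, h_2 across trials, fixed J
for a in range(trials):
    h = rng.normal(0.0, 1.0, N)
    for _ in range(T_relax):
        h = J @ np.tanh(h)                       # relax onto the chaotic attractor
    samples[a] = h[:2]

C = np.cov(samples.T)
print("Var[h_1 | J] (close to q0):", C[0, 0])
print("Cov[h_1, h_2 | J] (close to 0):", C[0, 1])
```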

APPENDIX C: PERTURBATION EXPANSION OF THE ORDER PARAMETER

In this appendix, we evaluate the signal component of Eq. (19) by calculating the response of the order parameter, $\{q_{tt}^{ab}\}$, to perturbations in the external input, $\theta = \{\theta_t^a\}$. The dynamic evolution of the order parameter is described by the saddle-point equation (A8), which we repeat here for convenience,

$$q_{t+1,t+1}^{ab} = g^2\left\langle\phi_{t+1}^a\phi_{t+1}^b\right\rangle_0 = g^2\int\!\left(\prod_{t,a}\frac{dh_t^a}{\sqrt{\det(2\pi q)}}\right)\phi\!\left(\theta_t^a + h_t^a\right)\phi\!\left(\theta_t^b + h_t^b\right)\exp\!\left(-\tfrac{1}{2}\left(q^{-1}\right)_{ts}^{ab} h_t^a h_s^b\right),$$

where $\langle\cdot\rangle_0 \equiv \langle\cdot\rangle|_{\xi=0,\,\hat q=0}$. To simplify expressions, we omit the temporal index, so that $\theta_t^a = \theta^a$ and $q_{tt}^{ab} = q^{ab}$ in the following, and find that, for general smooth functions ϕ and ψ, the Gaussian integral is expressed as

$$\left\langle\phi^a\psi^b\right\rangle_0 \equiv \left\langle\phi\!\left(\theta_t^a + h_t^a\right)\psi\!\left(\theta_t^b + h_t^b\right)\right\rangle_0 = \int Dx\,Dy\;\phi\!\left(\theta^a + \sqrt{q^{aa}}\,x\right)\psi\!\left(\theta^b + \frac{q^{ab}}{\sqrt{q^{aa}}}\,x + \sqrt{\frac{q^{aa}q^{bb} - \left(q^{ab}\right)^2}{q^{aa}}}\,y\right), \qquad (C1)$$

with $Dx \equiv dx\, e^{-x^2/2}/\sqrt{2\pi}$. Note that this expression implies that $h^a$ and $h^b$ have variances $q^{aa}$ and $q^{bb}$, respectively, and covariance $q^{ab}$ under the average $\langle\cdot\rangle_0$. The first derivative of this average is

$$d\left\langle\phi^a\psi^b\right\rangle_0 = \left\langle\phi'^a\psi^b\right\rangle_0 d\theta^a + \left\langle\phi^a\psi'^b\right\rangle_0 d\theta^b + \frac{1}{2}\left(\frac{1}{\sqrt{q^{aa}}}\left\langle x\,\phi'^a\psi^b\right\rangle_0 - \frac{q^{ab}}{\sqrt{q^{aa}}^{\,3}}\left\langle x\,\phi^a\psi'^b\right\rangle_0 + \frac{\left(q^{ab}/q^{aa}\right)^2}{\sqrt{\frac{q^{aa}q^{bb} - (q^{ab})^2}{q^{aa}}}}\left\langle y\,\phi^a\psi'^b\right\rangle_0\right)dq^{aa} + \frac{1}{2\sqrt{\frac{q^{aa}q^{bb} - (q^{ab})^2}{q^{aa}}}}\left\langle y\,\phi^a\psi'^b\right\rangle_0\, dq^{bb} + \left(\frac{1}{\sqrt{q^{aa}}}\left\langle x\,\phi^a\psi'^b\right\rangle_0 - \frac{q^{ab}}{q^{aa}\sqrt{\frac{q^{aa}q^{bb} - (q^{ab})^2}{q^{aa}}}}\left\langle y\,\phi^a\psi'^b\right\rangle_0\right)dq^{ab} = \mathbf{a}^{\mathsf T} d\Theta \qquad (C2)$$

with vectors $\mathbf{a} = \left(\langle\phi'^a\psi^b\rangle_0,\ \langle\phi^a\psi'^b\rangle_0,\ \langle\phi''^a\psi^b\rangle_0,\ \langle\phi^a\psi''^b\rangle_0,\ \langle\phi'^a\psi'^b\rangle_0\right)^{\mathsf T}$ and $\Theta = \left(\theta^a,\ \theta^b,\ q^{aa}/2,\ q^{bb}/2,\ q^{ab}\right)^{\mathsf T}$.

In the final equality of Eq. (C2), we used the following relations,

$$\left\langle x\,\phi^a\psi^b\right\rangle_0 = \left\langle\frac{d}{dx}\!\left(\phi^a\psi^b\right)\right\rangle_0 = \sqrt{q^{aa}}\left\langle\phi'^a\psi^b\right\rangle_0 + \frac{q^{ab}}{\sqrt{q^{aa}}}\left\langle\phi^a\psi'^b\right\rangle_0, \qquad (C3)$$
$$\left\langle y\,\phi^a\psi^b\right\rangle_0 = \left\langle\frac{d}{dy}\!\left(\phi^a\psi^b\right)\right\rangle_0 = \sqrt{\frac{q^{aa}q^{bb} - \left(q^{ab}\right)^2}{q^{aa}}}\left\langle\phi^a\psi'^b\right\rangle_0, \qquad (C4)$$

obtained from integration by parts. Similarly, using Eq. (C2) repeatedly, the second derivative is

$$d^2\left\langle\phi^a\psi^b\right\rangle_0 = \mathbf{a}^{\mathsf T} d^2\Theta + \left(d\Theta\right)^{\mathsf T} d\mathbf{a} = \mathbf{a}^{\mathsf T} d^2\Theta + \left(d\Theta\right)^{\mathsf T}\mathbf{A}\, d\Theta, \qquad (C5)$$

where, applying Eq. (C2) once again to each component of $\mathbf{a}$, we find $d\mathbf{a} = \mathbf{A}\, d\Theta$ with

$$\mathbf{A} = \begin{pmatrix} \langle\phi''^a\psi^b\rangle_0 & \langle\phi'^a\psi'^b\rangle_0 & \langle\phi'''^a\psi^b\rangle_0 & \langle\phi'^a\psi''^b\rangle_0 & \langle\phi''^a\psi'^b\rangle_0 \\ \langle\phi'^a\psi'^b\rangle_0 & \langle\phi^a\psi''^b\rangle_0 & \langle\phi''^a\psi'^b\rangle_0 & \langle\phi^a\psi'''^b\rangle_0 & \langle\phi'^a\psi''^b\rangle_0 \\ \langle\phi'''^a\psi^b\rangle_0 & \langle\phi''^a\psi'^b\rangle_0 & \langle\phi''''^a\psi^b\rangle_0 & \langle\phi''^a\psi''^b\rangle_0 & \langle\phi'''^a\psi'^b\rangle_0 \\ \langle\phi'^a\psi''^b\rangle_0 & \langle\phi^a\psi'''^b\rangle_0 & \langle\phi''^a\psi''^b\rangle_0 & \langle\phi^a\psi''''^b\rangle_0 & \langle\phi'^a\psi'''^b\rangle_0 \\ \langle\phi''^a\psi'^b\rangle_0 & \langle\phi'^a\psi''^b\rangle_0 & \langle\phi'''^a\psi'^b\rangle_0 & \langle\phi'^a\psi'''^b\rangle_0 & \langle\phi''^a\psi''^b\rangle_0 \end{pmatrix}. \qquad (C6)$$

Although we had to distinguish ϕ and ψ to derive Eq. (C5), we only have to consider the derivatives of $\langle\phi^a\phi^b\rangle_0$ in the following, so we can replace ψ by ϕ. When the external input is constant in time, the order parameter takes only two distinct values, $q_{ts}^{ab} = (q_0 - q)\delta_{ab}\delta_{ts} + q$, in the stationary state [see Eq. (7)]. Moreover, when the response nonlinearity is an odd function and the input is zero, θ = 0, we know that q = 0 is a stable solution. Hence, in this case, it is easy to check that

$$\left\langle\phi^{(m)}(h^a)\,\phi^{(n)}(h^b)\right\rangle_0 = 0 \quad\text{if } m+n \text{ is odd} \qquad (C7)$$

for all integers m and n, where $\phi^{(n)}$ is the nth derivative of ϕ. When θ = 0, the order parameter $q^{ab} = 0$ for $a \neq b$, meaning that $h^a$ and $h^b$ are independent Gaussian random variables of mean zero. In this case, Eq. (C7) results because either $\phi^{(m)}$ or $\phi^{(n)}$ (the one with an even m or n) is an odd function. When a = b, on the other hand, $\phi^{(m)}(h^a)\,\phi^{(n)}(h^a)$ is an odd function of the zero-mean Gaussian random variable $h^a$. Using Eq. (C7), Eq. (C2) can be simplified to

$$d\left\langle\phi^a\phi^b\right\rangle_0 = \left\langle\phi''^a\phi^b\right\rangle_0\frac{dq^{aa}}{2} + \left\langle\phi^a\phi''^b\right\rangle_0\frac{dq^{bb}}{2} + \left\langle\phi'^a\phi'^b\right\rangle_0\, dq^{ab}. \qquad (C8)$$

Because $q_{t+1,t+1}^{ab} = g^2\langle\phi^a\phi^b\rangle_0$, we obtain the self-consistent update equations

$$dq_{t+1,t+1}^{aa} = g^2\left(\left\langle\phi''^a\phi^a\right\rangle_0 + \left\langle\phi'^a\phi'^a\right\rangle_0\right)dq_{tt}^{aa} \qquad (C9)$$

when a = b, and

$$dq_{t+1,t+1}^{ab} = g^2\left(\left\langle\phi''^a\phi^b\right\rangle_0\frac{dq_{tt}^{aa}}{2} + \left\langle\phi^a\phi''^b\right\rangle_0\frac{dq_{tt}^{bb}}{2} + \left\langle\phi'^a\phi'^b\right\rangle_0\, dq_{tt}^{ab}\right) \qquad (C10)$$

when $a \neq b$, respectively. We can see from Eqs. (C9) and (C10) that the stability condition of the order parameter is $g^2\left(\langle\phi''^a\phi^a\rangle_0 + \langle\phi'^a\phi'^a\rangle_0\right) < 1$ and $g^2\langle\phi'^a\phi'^b\rangle_0 < 1$. Under this stability condition, both $dq^{aa}$ and $dq^{ab}$ converge to zero in time.

Next, we evaluate the quantity of interest, $\partial^2 q_{t+1,t+1}^{ab}/\partial\theta_k^a\,\partial\theta_l^b$ for $a \neq b$ in Eq. (19). Using a simplified notation for partial derivatives (i.e., $\partial_t^a \equiv \partial/\partial\theta_t^a$), we find $\partial_k^a\Theta = \left(\delta_{tk},\, 0,\, \partial_k^a q^{aa}/2,\, 0,\, \partial_k^a q^{ab}\right)^{\mathsf T}$, $\partial_l^b\Theta = \left(0,\, \delta_{tl},\, 0,\, \partial_l^b q^{bb}/2,\, \partial_l^b q^{ab}\right)^{\mathsf T}$, and $\partial_k^a\partial_l^b\Theta = \left(0,\, 0,\, 0,\, 0,\, \partial_k^a\partial_l^b q^{ab}\right)^{\mathsf T}$. Furthermore, because $\partial q^{ab} \to 0$ for large t from Eqs. (C9) and (C10), we can use $\partial_k^a\Theta = \left(\delta_{tk},\, 0,\, 0,\, 0,\, 0\right)^{\mathsf T}$, $\partial_l^b\Theta = \left(0,\, \delta_{tl},\, 0,\, 0,\, 0\right)^{\mathsf T}$, and $\partial_k^a\partial_l^b\Theta = \left(0,\, 0,\, 0,\, 0,\, \partial_k^a\partial_l^b q^{ab}\right)^{\mathsf T}$ for sufficiently large t. Hence, from Eq. (C5), the dynamics of the second derivative of the order parameter is

$$\partial_k^a\partial_l^b\, q_{t+1,t+1}^{ab} = g^2\left[\mathbf{a}^{\mathsf T}\partial_k^a\partial_l^b\Theta + \left(\partial_k^a\Theta\right)^{\mathsf T}\mathbf{A}\,\partial_l^b\Theta\right] = g^2\left\langle\phi'^a\phi'^b\right\rangle_0\left[\partial_k^a\partial_l^b\, q_{tt}^{ab} + \delta_{tk}\delta_{tl}\right] = \left(g^2\left\langle\phi'^a\phi'^b\right\rangle_0\right)^{t-k+1}\delta_{kl}\,\Theta(t-k), \qquad (C11)$$

where Θ(x) is the step function, which is one for x ⩾ 0 and zero otherwise.
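The closed form in Eq. (C11) follows from iterating a simple linear recursion. The sketch below verifies this step numerically for an arbitrary value of the coefficient $c = g^2\langle\phi'^a\phi'^b\rangle_0$, which, at the symmetric point, equals the decay factor γ of Eq. (21); it is an illustrative check, not part of the derivation.

```python
import numpy as np

# Recursion behind Eq. (C11): u(t+1) = c * (u(t) + delta_{t,k}) with u = 0 before the pulse
# gives u(t) = c^(t-k) for t > k, the exponentially decaying signal summed in Eq. (20).
def second_derivative_of_q(c, k, T):
    u = np.zeros(T + 1)                           # u(t) stands for d^2 q^{ab}(t,t) / d theta_k^a d theta_k^b
    for t in range(T):
        u[t + 1] = c * (u[t] + (1.0 if t == k else 0.0))
    return u

c, k, T = 0.9, 5, 20
u = second_derivative_of_q(c, k, T)
print(np.allclose(u[k + 1:], c ** np.arange(1, T - k + 1)))   # True: u(t) = c^(t-k) for t > k
```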

References

1. Rabinovich MI, Varona P, Selverston AI, Abarbanel HDI. Rev Mod Phys. 2006;78:1213.
2. Jaeger H, Haas H. Science. 2004;304:78. doi: 10.1126/science.1091277.
3. Maass W, Natschlager T, Markram H. Neural Comput. 2002;14:2531. doi: 10.1162/089976602760407955.
4. Sussillo D, Abbott LF. Neuron. 2009;63:544. doi: 10.1016/j.neuron.2009.07.018.
5. Sompolinsky H, Crisanti A, Sommers HJ. Phys Rev Lett. 1988;61:259. doi: 10.1103/PhysRevLett.61.259.
6. van Vreeswijk C, Sompolinsky H. Science. 1996;274:1724. doi: 10.1126/science.274.5293.1724.
7. Bertschinger N, Natschlager T. Neural Comput. 2004;16:1413. doi: 10.1162/089976604323057443.
8. Büsing L, Schrauwen B, Legenstein R. Neural Comput. 2010;22:1272. doi: 10.1162/neco.2009.01-09-947.
9. Schweighofer N, Doya K, Fukai H, Chiron JV, Furukawa T, Kawato M. Proc Natl Acad Sci USA. 2004;101:4655. doi: 10.1073/pnas.0305966101.
10. Molgedey L, Schuchhardt J, Schuster HG. Phys Rev Lett. 1992;69:3717. doi: 10.1103/PhysRevLett.69.3717.
11. Rajan K, Abbott LF, Sompolinsky H. Phys Rev E. 2010;82:011903. doi: 10.1103/PhysRevE.82.011903.
12. Pouget A, Zhang K, Deneve S, Latham PE. Neural Comput. 1998;10:373. doi: 10.1162/089976698300017809.
13. Rieke F, Warland D, de Ruyter van Steveninck R, Bialek W. Spikes: Exploring the Neural Code. MIT Press; Cambridge: 1996.
14. Sompolinsky H, Zippelius A. Phys Rev B. 1982;25:6860.
15. White OL, Lee DD, Sompolinsky H. Phys Rev Lett. 2004;92:148102. doi: 10.1103/PhysRevLett.92.148102.
16. Ganguli S, Huh D, Sompolinsky H. Proc Natl Acad Sci USA. 2008;105:18970. doi: 10.1073/pnas.0804451105.
17. Latham PE, Deneve S, Pouget A. J Physiol Paris. 2003;97:683. doi: 10.1016/j.jphysparis.2004.01.022.
18. Shamir M, Sompolinsky H. Neural Comput. 2004;16:1105. doi: 10.1162/089976604773717559.
19. Tirozzi B, Tsodyks M. Europhys Lett. 1991;14:727.
20. Barrett DGT, Latham PE. COSYNE. 2010.
