Abstract
We demonstrate the mathematical equivalence of two commonly used forms of firing-rate model equations for neural networks. In addition, we show that what is commonly interpreted as the firing rate in one form of the model may be better interpreted as a low-pass-filtered firing rate, and we point out a conductance-based version of the firing-rate model.
Keywords: Firing rate, voltage, models
At least since the pioneering work of Wilson and Cowan (1972), it has been common to study neural circuit behavior using rate equations – equations that specify neural activities simply in terms of their rates of firing action potentials, as opposed to spiking models, in which the actual emissions of action potentials or “spikes” are modeled. Rate models can be derived as approximations to spiking models in a variety of ways (e.g. Aviel and Gerstner 2006, Ermentrout 1994, La Camera et al. 2004, Mattia and Del Giudice 2002, Ostojic and Brunel 2011, Shriki et al. 2003, Wilson and Cowan 1972; reviewed in Ermentrout and Terman 2010, Chapter 11; Gerstner and Kistler 2002, Chapter 6; Dayan and Abbott 2001, Chapter 7).
Two forms of rate model most commonly used to model neural circuits are the following, which we will refer to as the v-equation and r-equation respectively:
(1) τ dv/dt = −v + W f(v) + Ĩ
(2) τ dr/dt = −r + f(Wr + I)
Here, v and r are each vectors representing neural activity, with each element representing the activity of one neuron in the modeled circuit. v is commonly thought of as representing voltage, while r is commonly thought of as representing firing rate (probability of spiking per unit time). f(x) is a nonlinear input-output function that acts element-by-element on the elements of x, i.e. it has ith element (f(x))i = f(xi) for some nonlinear function of one variable f. f typically takes such forms as an exponential, a power law, or a sigmoid function, and f(vi) is typically regarded as a static nonlinearity converting the voltage of the ith cell vi to the cell’s instantaneous firing rate. W is the matrix of synaptic weights between the neurons in the modeled circuit. Ĩ and I are the vectors of external inputs to the neurons in the v or r networks, respectively, which may be time dependent. In the Appendix, we illustrate a simple heuristic “derivation” of the v-equation, starting from the biophysical equation for the voltages v. Along the way, we also point to a conductance-based version of the rate equation.
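To make the two dynamics concrete, the following sketch integrates both equations by forward Euler. This is our illustration, not part of the original analysis; the tanh nonlinearity, time constant, and step size are arbitrary choices.

```python
import numpy as np

def f(x):
    return np.tanh(x)  # illustrative static nonlinearity; exponentials, power laws, and sigmoids are also common

def simulate_v(W, I_tilde, v0, tau=10.0, dt=0.1):
    """Forward Euler for Eq. 1: tau dv/dt = -v + W f(v) + I_tilde(t). I_tilde has shape (T, D)."""
    v = np.empty_like(I_tilde)
    v[0] = v0
    for t in range(len(I_tilde) - 1):
        v[t + 1] = v[t] + (dt / tau) * (-v[t] + W @ f(v[t]) + I_tilde[t])
    return v

def simulate_r(W, I, r0, tau=10.0, dt=0.1):
    """Forward Euler for Eq. 2: tau dr/dt = -r + f(W r + I(t)). I has shape (T, D)."""
    r = np.empty_like(I)
    r[0] = r0
    for t in range(len(I) - 1):
        r[t + 1] = r[t] + (dt / tau) * (-r[t] + f(W @ r[t] + I[t]))
    return r
```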
When developing a rate model of a network, it can be unclear which form of equation to use, or whether the choice makes a difference. Here we demonstrate that it does not: the two models are mathematically equivalent, and so will display the same set of behaviors. It has been noted previously (Beer 2006) that, when I is constant and W is invertible, the two equations are equivalent under the relationship v = Wr + I, with Ĩ = I. We generalize this result to demonstrate the equivalence of the two equations when W is not invertible and inputs may be time dependent.
The v-equation is defined when we specify the input across time, Ĩ (t), and the initial condition v(0); we will call the combination of these and Eq. 1 a v-model. The r-equation is defined when we specify I(t) and r(0); we will call the combination of these and Eq. 2 an r-model. We will show that any v-model can be mapped to an r-model and any r-model can be mapped to a v-model such that the solutions to Eqs. 1–2 satisfy v = Wr + I.
As we will see, the inputs in equivalent models are related by τ dI/dt = −I + Ĩ, or I(t) = exp(−t/τ) I(0) + (1/τ) ∫0t exp(−(t−t′)/τ) Ĩ(t′) dt′. That is, I is a low-pass filtered version of Ĩ. Note that there is an equivalence class of I, parametrized by I(0), that all correspond to the same Ĩ under this equivalence. We assume that the equivalence class has been specified, i.e. Ĩ has been specified (if I has been specified, Ĩ can be found as Ĩ = τ dI/dt + I). Then a v-model is defined by specifying v(0), while an r-model is defined by specifying the set {r(0), I(0)}. If W is D × D, then v(0) is D-dimensional, while {r(0), I(0)} is 2D-dimensional, so we can guess that the map from r to v takes a D-dimensional space of r-models to a single v-model, and conversely that the map from v to r takes a single v-model back to a D-dimensional space of r-models; we will show that this is true.
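In code, this input relationship amounts to running Ĩ through a first-order low-pass filter with time constant τ, and Ĩ can be recovered by finite differences; a minimal sketch, under the same illustrative conventions as above:

```python
import numpy as np

def lowpass_input(I_tilde, I0, tau=10.0, dt=0.1):
    """Integrate tau dI/dt = -I + I_tilde; each choice of I0 gives one member of the equivalence class."""
    I = np.empty_like(I_tilde)
    I[0] = I0
    for t in range(len(I_tilde) - 1):
        I[t + 1] = I[t] + (dt / tau) * (-I[t] + I_tilde[t])
    return I

def recover_input(I, tau=10.0, dt=0.1):
    """Recover I_tilde = tau dI/dt + I (forward-difference approximation; one sample shorter)."""
    return tau * (I[1:] - I[:-1]) / dt + I[:-1]
```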
We first show that if r evolves according to the r-equation, then Wr + I evolves according to the v-equation. Setting v = Wr + I, we find:
(3) τ dv/dt = τW dr/dt + τ dI/dt
(4) = W(−r + f(Wr + I)) + (−I + Ĩ)
(5) = −(Wr + I) + W f(Wr + I) + Ĩ = −v + W f(v) + Ĩ
Therefore, if v evolves according to the v-equation, r evolves according to the r-equation, and v(0) = Wr(0) + I(0), then, since the v-equation propagates Wr + I forward in time, v = Wr + I at all times t > 0. We will thus have established the desired equivalence if we can solve v(0) = Wr(0) + I(0) for any v-model, specified by v(0), or for any r-model, specified by {r(0), I(0)}.
Note that, as expected, a D-dimensional space of r-models converges on the same v-model: {r(0), I(0)} forms a 2D-dimensional space, and the constraint v(0) = Wr(0) + I(0) is a D-dimensional equation, so the r-models {r(0), I(0)} that satisfy it form a D-dimensional subspace, all of whose members converge on the same v-model.
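The propagation argument can be checked numerically with the helper functions sketched above; in this hypothetical test W is rank-deficient, to emphasize that invertibility plays no role, and all parameter values are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)
D, T, dt, tau = 4, 5000, 0.1, 10.0

# A rank-1 (hence non-invertible) weight matrix.
a, b = rng.normal(size=(D, 1)), rng.normal(size=(D, 1))
W = a @ b.T

# Arbitrary time-dependent input and a matched pair of initial conditions.
I_tilde = np.sin(0.01 * np.arange(T))[:, None] * rng.normal(size=D)
I = lowpass_input(I_tilde, I0=rng.normal(size=D), tau=tau, dt=dt)
r0 = rng.normal(size=D)
v0 = W @ r0 + I[0]                 # v(0) = W r(0) + I(0)

v = simulate_v(W, I_tilde, v0, tau, dt)
r = simulate_r(W, I, r0, tau, dt)

# v = W r + I then holds at all times, up to floating-point rounding:
# forward Euler propagates the identity exactly, mirroring Eqs. 3-5.
print(np.max(np.abs(v - (r @ W.T + I))))
```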
To go from an r-model to a v-model is straightforward: we simply set v(0) = Wr(0) + I(0).
To go from a v-model to an r-model, we first define some useful notation:1
𝒩W is the nullspace of W, i.e. the subspace of all vectors that W maps to zero. PN is the projection operator onto 𝒩W.
𝒩W⊥ is the subspace perpendicular to 𝒩W. This is the subspace spanned by the rows of W. PN⊥ is the projection operator onto 𝒩W⊥.
ℛW is the range of W, i.e. the subspace of vectors that can be written Wx for some x. This is the subspace spanned by the columns of W. PR is the projection operator onto ℛW.
ℛW⊥ is the subspace perpendicular to ℛW, also called the left nullspace of W. PR⊥ is the projection operator onto ℛW⊥.
For any vector x, we define xN ≡ PN x, xN⊥ ≡ PN⊥x, xR ≡ PRx, xR⊥ ≡ PR⊥x. We rely on the fact that x = xN + xN⊥ = xR + xR⊥.
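The four projection operators can be constructed from the singular value decomposition of W; this construction is our own sketch (the text does not prescribe one) and is valid whether or not W is normal.

```python
import numpy as np

def projectors(W, tol=1e-10):
    """Orthogonal projectors onto N_W, (N_W)-perp, R_W, and (R_W)-perp, via W = U S V^T."""
    U, s, Vt = np.linalg.svd(W)
    k = int(np.sum(s > tol))              # numerical rank of W
    V_row = Vt[:k].T                      # orthonormal basis for the row space, (N_W)-perp
    U_col = U[:, :k]                      # orthonormal basis for the range, R_W
    P_Nperp = V_row @ V_row.T
    P_N = np.eye(W.shape[1]) - P_Nperp
    P_R = U_col @ U_col.T
    P_Rperp = np.eye(W.shape[0]) - P_R
    return P_N, P_Nperp, P_R, P_Rperp

# Sanity checks on the rank-1 W from the sketch above:
# W kills N_W, and nothing W produces lies in (R_W)-perp.
P_N, P_Nperp, P_R, P_Rperp = projectors(W)
assert np.allclose(W @ P_N, 0) and np.allclose(P_Rperp @ W, 0)
```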
Given a v-model, the equation v(0) = Wr(0)+I(0) has a solution if and only if v(0)−I(0) ∈ ℛW, which is true if and only if vR⊥ (0) − IR⊥ (0) = 0,2 so we must choose
(9) IR⊥ (0) = vR⊥ (0)
Letting DR be the dimension of ℛW and DN the dimension of 𝒩W, the fundamental theorem of linear algebra states that DR + DN = D. So IR⊥ (0) has dimension DN. This leaves unspecified IR(0), which has dimension DR.
To solve for rN⊥ (0), we note that the equation v = Wr + I can equivalently be written v = WrN⊥ + I (because WrN = 0, so Wr = WrN⊥). That is, knowledge of v only specifies rN⊥. We define W−1 to be the Moore-Penrose pseudoinverse of W. This is the matrix that gives the 1-to-1 mapping of ℛW onto 𝒩W⊥ that inverts the 1-to-1 mapping of 𝒩W⊥ to ℛW induced by W, and that maps all vectors in ℛW⊥ to zero.3 The pseudoinverse has the property that W−1W = PN⊥ while WW−1 = PR. Then we can solve for rN⊥ (0) as
(10) rN⊥ (0) = W−1(vR(0) − IR(0))
This is a DR-dimensional equation for the 2DR-dimensional set of unknowns {rN⊥ (0), IR(0)}, so it determines DR of these parameters and leaves DR free. For example, it could be solved by freely choosing IR(0) and then setting rN⊥ (0) = W−1(vR(0) − IR(0)), or by freely choosing rN⊥ (0) and then setting IR(0) = vR(0) − WrN⊥ (0).
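A sketch of this construction, making the (arbitrary, purely illustrative) free choices IR(0) = 0 and rN(0) = 0, and reusing the projectors function above:

```python
import numpy as np

def r_model_from_v_model(W, v0):
    """One equivalent r-model {r(0), I(0)} for a given v(0), per Eqs. 9 and 10.

    Free choices here (illustrative only): I_R(0) = 0 and r_N(0) = 0.
    """
    P_N, P_Nperp, P_R, P_Rperp = projectors(W)
    I0 = P_Rperp @ v0              # Eq. 9: I_Rperp(0) = v_Rperp(0), plus the choice I_R(0) = 0
    r0 = np.linalg.pinv(W) @ v0    # Eq. 10 with I_R(0) = 0; the pseudoinverse maps into (N_W)-perp, so r_N(0) = 0
    assert np.allclose(W @ r0 + I0, v0)
    return r0, I0
```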
Equations 9 and 10 together ensure the equality v(0) = Wr(0) + I(0). Applying W to both sides of Eq. 10 yields vR(0) = WrN⊥ (0) + IR(0) = Wr(0) + IR(0). This states that the equality holds within the range of W; orthogonal to the range of W, we have PR⊥ Wr = 0 and vR⊥ (0) = IR⊥ (0). Together these yield v(0) = Wr(0) + I(0).
Finally, we can freely choose rN (0), which has no effect on the equation v(0) = Wr(0) + I(0). rN (0) has DN dimensions, so we have freely chosen DR + DN = D dimensions in finding an r-model that is equivalent to the v-model. That is, we have found a D-dimensional subspace of such r-models, those that satisfy v(0) = Wr(0) + I(0).
To summarize, we have established the equivalence between r-models and v-models. For each fixed choice of W, τ, and Ĩ(t), an r-model is specified by {r(0), I(0)} and Eq. 2, while a v-model is specified by v(0) and Eq. 1. The equivalence is established by setting v(0) = Wr(0) + I(0), which yields a D-dimensional subspace of equivalent r-models for a given v-model. Under this equivalence, v obeys Eq. 1, r obeys Eq. 2, and the two are related at all times by v = Wr + I, with τ dI/dt = −I + Ĩ. To go from an r-model to its equivalent v-model, we simply set v(0) = Wr(0) + I(0). To go from a v-model to one of its equivalent r-models, we set IR⊥ (0) = vR⊥ (0); freely choose rN (0); and freely choose {rN⊥ (0), IR(0)} from the DR-dimensional subspace of such choices that satisfy rN⊥ (0) = W−1(vR(0) − IR(0)), where W−1 is the pseudoinverse of W.
Finally, note that Eq. 2 can be written τ dr/dt = −r + f(v). That is, if we regard v as a voltage and f(v) as a firing rate, as suggested by the “derivation” in the Appendix, then r is a low-pass-filtered version of the firing rate, just as I is a low-pass-filtered version of the input Ĩ.
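Numerically, this means that low-pass filtering the instantaneous rate f(v(t)) with the same τ reproduces r(t); a quick check, reusing v, r, f, dt, and tau from the earlier sketch:

```python
import numpy as np

# Filter the instantaneous firing rate f(v(t)) through tau dx/dt = -x + f(v),
# starting from x(0) = r(0); x then coincides with r at all times.
x = np.empty_like(r)
x[0] = r[0]
for t in range(len(r) - 1):
    x[t + 1] = x[t] + (dt / tau) * (-x[t] + f(v[t]))

print(np.max(np.abs(x - r)))  # ~0, up to floating-point rounding
```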
Acknowledgments
Supported by R01-EY11001 from the National Eye Institute and by the Gatsby Charitable Foundation through the Gatsby Initiative in Brain Circuitry at Columbia University.
Appendix
As an example of an unsophisticated and heuristic derivation of these equations (more sophisticated derivations can be found in the references in the main text), the v-equation can be “derived” as follows: we start with the equation for the membrane voltage of the ith neuron:
(11) Ci dvi/dt = Σj gij(t)(Eij − vi)
where Ci is the capacitance of the ith neuron and gij is the jth conductance onto the neuron, with reversal potential Eij. We assume that the gij’s are composed of an intrinsic conductance, giL, with reversal potential EiL; an extrinsic input conductance, giext(t), with reversal potential Eiext; and within-network synaptic conductances, with g̃ij representing input from neuron j with reversal potential Ẽij. Dividing by Σk gik and defining τi(t) = Ci/Σk gik gives
(12) τi(t) dvi/dt = −vi + (Σj gijEij)/(Σk gik)
We now make a number of further simplifying assumptions. We assume that g̃ij is proportional to the firing rate rj of neuron j, with proportionality constant W̃ij ≥ 0: g̃ij = W̃ijrj. This ignores synaptic time courses, among other things. We assume that rj is given by the static nonlinearity rj = f(vj) (e.g., see Hansel and van Vreeswijk 2002, Miller and Troyer 2002, Priebe et al. 2004 for such a relationship between firing rate and voltage averaged over a few tens of milliseconds). We assume synapses are either excitatory with reversal potential EE or inhibitory with reversal potential EI, and linearly transform the units of voltage so that EE = 1 and EI = −1. We define Wij = W̃ijEj, where Ej is the reversal potential of the synapses made by neuron j (EE = 1 if excitatory, EI = −1 if inhibitory): this is now a synaptic weight that is positive for excitatory synapses and negative for inhibitory synapses. We define gi = giL + giext(t), the total nonsynaptic conductance, and define Ĩi = giLEiL + giext(t)Eiext. This yields the “conductance-based rate equation”:
(13) τi(t) dvi/dt = −vi + (Ĩi + Σk Wik f(vk))/(gi + Σk |Wik| f(vk))
with τi(t) = Ci/(gi + Σk |Wik| f(vk)).
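A sketch of how Eq. 13 might be integrated, with its state-dependent time constant recomputed at every step; the rectified-linear f keeps the synaptic conductances non-negative, and all parameter values here are illustrative assumptions.

```python
import numpy as np

def simulate_conductance_rate(W, I_tilde, v0, C, g, dt=0.1):
    """Forward Euler for Eq. 13, with tau_i(t) = C_i / (g_i + sum_k |W_ik| f(v_k))."""
    v = np.empty_like(I_tilde)
    v[0] = v0
    absW = np.abs(W)
    for t in range(len(I_tilde) - 1):
        rates = np.maximum(v[t], 0.0)     # rectified-linear f: rates, hence conductances, stay >= 0
        total_g = g + absW @ rates        # denominator of Eq. 13; also sets tau_i(t)
        tau = C / total_g
        v[t + 1] = v[t] + (dt / tau) * (-v[t] + (I_tilde[t] + W @ rates) / total_g)
    return v
```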
Finally, we assume that the total conductance, represented by the denominator in the last term of Eq. 13, can be taken to be constant, e.g. if the intrinsic conductance giL is much larger than the synaptic and external conductances, or if inputs tend to be “push-pull”, with withdrawal of some inputs compensating for the addition of others. We absorb the constant denominator into the definitions of Ĩi and Wij, and note that this also implies that τi is constant, to arrive finally at the v-equation:
(14) τi dvi/dt = −vi + Σk Wik f(vk) + Ĩi
Footnotes
1. Note: if W is normal, the eigenvectors are orthogonal, so the nullspace is precisely the space orthogonal to the range: PN = PR⊥ and PN⊥ = PR. However, if W is nonnormal, then vectors orthogonal to the nullspace can be mapped into the nullspace; the range always has the dimension of the full space minus the dimension of the nullspace, but it need not be orthogonal to the nullspace.
2. The condition can be seen as follows:
(6) v(0) − I(0) ∈ ℛW
(7) ⟺ PR⊥ (v(0) − I(0)) = 0
(8) ⟺ vR⊥ (0) − IR⊥ (0) = 0
3. If the singular value decomposition of a matrix M is M = USV†, where S is the diagonal matrix of singular values and U and V are unitary matrices, then its pseudoinverse is M−1 = VS̃U†, where S̃ is the pseudoinverse of S, obtained by inverting all nonzero singular values in S.
References
- Aviel Y, Gerstner W. From spiking neurons to rate models: a cascade model as an approximation to spiking neuron models with refractoriness. Phys Rev E. 2006;73:051908. doi: 10.1103/PhysRevE.73.051908.
- Beer RD. Parameter space structure of continuous-time recurrent neural networks. Neural Comput. 2006;18:3009–3051. doi: 10.1162/neco.2006.18.12.3009.
- Dayan P, Abbott LF. Theoretical Neuroscience. MIT Press; Cambridge, MA: 2001.
- Ermentrout B. Reduction of conductance based models with slow synapses to neural nets. Neural Comput. 1994;6:679–695.
- Ermentrout GB, Terman DH. Mathematical Foundations of Neuroscience. Springer; New York: 2010.
- Gerstner W, Kistler W. Spiking Neuron Models. Cambridge University Press; Cambridge, UK: 2002.
- Hansel D, van Vreeswijk C. How noise contributes to contrast invariance of orientation tuning in cat visual cortex. J Neurosci. 2002;22:5118–5128. doi: 10.1523/JNEUROSCI.22-12-05118.2002.
- La Camera G, Rauch A, Luscher HR, Senn W, Fusi S. Minimal models of adapted neuronal response to in vivo-like input currents. Neural Comput. 2004;16:2101–2124. doi: 10.1162/0899766041732468.
- Mattia M, Del Giudice P. Population dynamics of interacting spiking neurons. Phys Rev E. 2002;66:051917. doi: 10.1103/PhysRevE.66.051917.
- Miller KD, Troyer TW. Neural noise can explain expansive, power-law nonlinearities in neural response functions. J Neurophysiol. 2002;87:653–659. doi: 10.1152/jn.00425.2001.
- Ostojic S, Brunel N. From spiking neuron models to linear-nonlinear models. PLoS Comput Biol. 2011;7:e1001056. doi: 10.1371/journal.pcbi.1001056.
- Priebe N, Mechler F, Carandini M, Ferster D. The contribution of spike threshold to the dichotomy of cortical simple and complex cells. Nat Neurosci. 2004;7(10):1113–1122. doi: 10.1038/nn1310.
- Shriki O, Hansel D, Sompolinsky H. Rate models for conductance-based cortical neuronal networks. Neural Comput. 2003;15:1809–1841. doi: 10.1162/08997660360675053.
- Wilson HR, Cowan JD. Excitatory and inhibitory interactions in localized populations of model neurons. Biophys J. 1972;12:1–24. doi: 10.1016/S0006-3495(72)86068-5.