eLife. 2024 Mar 12;12:RP89337. doi: 10.7554/eLife.89337

Neural criticality from effective latent variables

Mia C Morrell 1, Ilya Nemenman 2, Audrey Sederberg 3,†,
Editors: Peter Latham4, Timothy E Behrens5
PMCID: PMC10957169  PMID: 38470471

Abstract

Observations of power laws in neural activity data have raised the intriguing notion that brains may operate in a critical state. One example of this critical state is ‘avalanche criticality’, which has been observed in various systems, including cultured neurons, zebrafish, rodent cortex, and human EEG. More recently, power laws were also observed in neural populations in the mouse under an activity coarse-graining procedure, and they were explained as a consequence of the neural activity being coupled to multiple latent dynamical variables. An intriguing possibility is that avalanche criticality emerges due to a similar mechanism. Here, we determine the conditions under which latent dynamical variables give rise to avalanche criticality. We find that populations coupled to multiple latent variables produce critical behavior across a broader parameter range than those coupled to a single, quasi-static latent variable, but in both cases, avalanche criticality is observed without fine-tuning of model parameters. We identify two regimes of avalanches, both critical but differing in the amount of information carried about the latent variable. Our results suggest that avalanche criticality arises in neural systems in which activity is effectively modeled as a population driven by a few dynamical variables and these variables can be inferred from the population activity.

Research organism: None

Introduction

The neural criticality hypothesis – the idea that neural systems operate close to a phase transition, perhaps for optimal information processing – is both ambitious and banal. Measurements from biological systems are limited in the range of spatial and temporal scales that can be sampled, not only because of the limitations of recording techniques but also due to the fundamentally non-stationary behavior of most, if not all, biological systems. These limitations make proving that an observation indicates critical behavior difficult. At the same time, the idea that brain networks are critical echoes the anthropic principle: tuned another way, a network becomes quiescent or epileptic and in either state, seems unlikely to support perception, thought, or flexible behavior, yet these observations do not explain how such fine-tuning could be achieved. Further muddying the water, researchers have reported multiple kinds of criticality in neural networks, including through analysis of avalanches (Beggs and Plenz, 2003; Plenz et al., 2021; O’Byrne and Jerbi, 2022; Girardi-Schappo, 2021) and of coarse-grained activity (Meshulam et al., 2019), as well as of correlations (Dahmen et al., 2019). How these flavors of critical behavior relate to each other or any functional network mechanism is unknown.

The phenomenon that we will refer to as ‘avalanche criticality’ appears remarkably widespread. It was first observed in cultured neurons (Beggs and Plenz, 2003) and later studied in zebrafish (Ponce-Alvarez et al., 2018), turtles (Shew et al., 2015), rodents (Ma et al., 2019), monkeys (Petermann et al., 2009), and even humans (Poil et al., 2008). The standard analysis, described later, requires extracting power-law exponents from fits to the distributions of avalanche size and of duration and assessing the relationship between exponents. There is debate over whether these observations reflect true power laws, but within the resolution achievable from experiments, neural avalanches exhibit power laws with exponent relationships predicted from theory developed in physical systems (Perkovic et al., 1995).

Avalanche criticality is not the only form of criticality observed in neural systems. Zipf’s law, in which the frequency of a network state is inversely proportional to its rank, appears in systems as diverse as fly motion estimation and the salamander retina (Mora and Bialek, 2011; Schwab et al., 2014; Aitchison et al., 2016). More recently, Meshulam et al., 2019 reported various statistics of population activity in the mouse hippocampus, including the eigenvalue spectrum of the covariance matrix and the activity variance. These were found to scale as populations were ‘coarse-grained’ through a procedure in which neural activities were iteratively combined based on similarity. Similar observations have been reported in spontaneous activity recorded across a wide range of brain areas in the mouse (Morales et al., 2023). Simple neural network models of such data explain neither Zipf’s law nor coarse-grained criticality (Meshulam et al., 2019).

Even though these three forms of criticality are observed through different analyses, they may originate from similar mechanisms. Numerous studies have reported relatively low-dimensional structure in the activity of large populations of neurons (Mazor and Laurent, 2005; Ahrens et al., 2012; Mante et al., 2013; Pandarinath et al., 2018; Stringer et al., 2019; Nieh et al., 2021), which can be modeled by a population of neurons that are broadly and heterogeneously coupled to multiple latent (i.e. unobserved) dynamical variables. Using such a model, we previously reproduced scaling under coarse-graining analysis within experimental uncertainty (Morrell et al., 2021). Zipf’s law has been explained by a similar mechanism (Schwab et al., 2014; Aitchison et al., 2016; Humplik and Tkačik, 2017). A single quasi-static latent variable has been shown to produce avalanche power laws, but not the relationships expected between the critical exponents (Priesemann and Shriki, 2018), while a model including a global modulation of activity can generate avalanche criticality (Mariani et al., 2021), but has not demonstrated coarse-grained criticality (Morrell et al., 2021). It is not known under what conditions the more general latent dynamical variable model generates avalanche criticality.

Here, we examine avalanche criticality in the latent dynamical variable model of neural population activity. We find that avalanche criticality is observed over a wide range of parameters, some of which may be optimal for information representation. These results demonstrate how criticality in neural recordings can arise from latent dynamics in neural activity, without need for fine-tuning of network parameters.

Results

Critical exponent values and crackling noise

We begin by defining the metrics used to quantify avalanche statistics and briefly summarize experimental observations, which have been reviewed in detail elsewhere (Plenz et al., 2021; O’Byrne and Jerbi, 2022; Girardi-Schappo, 2021). Activity is recorded across a set of neurons and binned in time. Avalanches are then defined as contiguous time bins in which at least one neuron in the population is active. The duration of an avalanche is the number of contiguous time bins, and the size is the summed activity during the avalanche. The distributions of avalanche size and duration are fit to power laws (P(S) ∝ S^{−τ} for size S, and P(D) ∝ D^{−α} for duration D) using standard methods (Clauset et al., 2009).
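
To make this procedure concrete, here is a minimal sketch of the avalanche bookkeeping (in Python; the function name and array layout are our own choices, not taken from the paper's code):

```python
import numpy as np

def extract_avalanches(raster):
    """Sizes and durations of avalanches in a binary raster (neurons x bins).

    An avalanche is a maximal run of time bins in which at least one neuron
    is active, bounded by fully silent bins."""
    active = raster.sum(axis=0)                        # population count per bin
    busy = (active > 0).astype(int)                    # bins inside an avalanche
    edges = np.diff(np.concatenate(([0], busy, [0])))  # +1 at starts, -1 at ends
    starts, ends = np.where(edges == 1)[0], np.where(edges == -1)[0]
    durations = ends - starts                          # number of contiguous bins
    sizes = np.array([active[s:e].sum() for s, e in zip(starts, ends)])
    return sizes, durations
```

The resulting sizes and durations are then passed to the power-law fitting procedures described in Materials and methods.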

Power laws can be indicative of criticality, but they can also result from non-critical mechanisms (Touboul and Destexhe, 2017; Priesemann and Shriki, 2018). A more stringent test of criticality is the ‘crackling’ relationship (Perkovic et al., 1995; Touboul and Destexhe, 2017), which involves fitting a third power-law relationship, S̄(D) ∝ D^{γfit}, and comparing γfit to the predicted exponent γpred, derived from the size and duration exponents, τ and α:

\gamma_{\mathrm{fit}} \overset{?}{=} \gamma_{\mathrm{pred}} \equiv \frac{\alpha - 1}{\tau - 1}. (1)

Previous work demonstrating approximate power laws in size and duration distributions through the mechanism of a slowly changing latent variable did not generate crackling (Touboul and Destexhe, 2017; Priesemann and Shriki, 2018).

Measuring power laws in empirical data is challenging: it generally requires setting a lower cutoff in size and duration, and the power-law behavior has only a limited range due to the finite size and duration of the recording itself. Nonetheless, there is some consensus (Shew et al., 2015; Fontenele et al., 2019; Ma et al., 2019) that even if τ and α vary over a wide range (1.5 to about 3) across recordings, the values of γfit and γpred stay in a relatively narrow range, from about 1.1 to 1.3.

Avalanche scaling in a latent dynamical variable model

We study a model of a population of neurons that are not coupled to each other directly but are driven by a small number of latent dynamical variables – that is, slowly changing inputs that are not themselves measured (Figure 1A). We are agnostic as to the origin of these inputs: they may be externally driven from other brain areas, or they may arise from large fluctuations in local recurrent dynamics. The model was chosen for its simplicity, and because we have previously shown that this model with at least about five latent variables can produce power laws under the coarse-graining analysis (Morrell et al., 2021). In this paper, we examine avalanche criticality in the same model.

Figure 1. Latent dynamical variable model produces avalanche criticality.

Simulated network is N = 1024 neurons. Other parameters in Table 1. (A) Model structure. Latent dynamical variables hμ(t) are broadly coupled to neurons si(t) in the recorded population. (B) Raster plot of a sample of activity binned at 3 ms resolution across 128 neurons with five latent variables, each with correlation timescale τF = 15 s. (C) Projection of activity into a simulated field of view for illustration. (D-F) Avalanche analysis in a network (parameters NF = 5, τF = 10^4, η = 4, and ϵ = 12), showing size distribution (D), duration distribution (E), and size with duration scaling (F). Lower cutoffs used in fitting are shown with vertical lines, and their values are indicated in the figures. There are Nobs = 42,725 avalanches of size S ≥ Smin in this simulated dataset. Estimated values of the critical exponents are shown in the titles of the panels.

Specifically, we model the neurons as binary units (si) that are randomly (Jiμ ~ N(0,1)) coupled to the dynamical variables hμ(t). The probability of any pattern {si}, given the current state of the latent variables, is

P(\{s_i\} \mid \{h_\mu(t)\}) = \frac{1}{Z(\{h_\mu(t)\})} \exp\left( \eta \sum_{i} \sum_{\mu=1}^{N_F} s_i J_{i\mu} h_\mu(t) - \epsilon \sum_{i} s_i \right), (2)

where the parameter η controls the scaling of the variables and ϵ controls the overall activity level. We modeled each latent variable as an Ornstein-Uhlenbeck process with time scale τF (see Materials and methods). Thus, our model has four parameters: η (input scaling), ϵ (activity threshold), τF (dynamical timescale), and NF (number of latent variables).
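
For concreteness, a minimal simulation sketch of this generative model follows (Python; the Euler discretization of the Ornstein-Uhlenbeck process and the shortened run length are our own choices, and variable names are not from the paper's code):

```python
import numpy as np

rng = np.random.default_rng(0)

# Parameters as in Table 1 (eta: input scaling, eps: bias towards silence)
N, N_F, tau_F, eta, eps = 1024, 5, 1e4, 4.0, 12.0
T = 100_000                                  # shorter than the paper's 2e6 steps

J = rng.standard_normal((N, N_F))            # couplings J_i_mu ~ N(0, 1)
h = rng.standard_normal(N_F)                 # latent variables, stationary start
raster = np.zeros((N, T), dtype=np.int8)

for t in range(T):
    # Euler step of an Ornstein-Uhlenbeck process with zero mean, unit variance
    h += -h / tau_F + np.sqrt(2.0 / tau_F) * rng.standard_normal(N_F)
    # Conditionally independent binary neurons, Equation 2
    p_fire = 1.0 / (1.0 + np.exp(-(eta * (J @ h) - eps)))
    raster[:, t] = rng.random(N) < p_fire
```

Each column of raster is one time bin; avalanche statistics can then be read off with the bookkeeping sketched above.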

Distributions of avalanche size and avalanche duration within this model followed approximate power laws (Figure 1D-E; see Materials and methods). In the example shown (NF = 5, τF = 10^4, η = 4, and ϵ = 12), we found exponents τ = 1.89 ± 0.02 (size) and α = 2.11 ± 0.02 (duration). Further, the average size of avalanches of fixed duration scaled as S̄ ∝ D^γ, with the fitted γfit = 1.24 ± 0.02, in agreement with the predicted value γpred = 1.24 ± 0.02. Thus, our model could generate avalanche scaling, at least for some parameter choices. In the following sections, we examine how avalanche scaling depends on the model parameters (NF, τF, η, and ϵ; see Table 2). We first focus on two sets of simulations: one set with NF = 1 latent variable, which does not generate scaling under coarse-graining (Morrell et al., 2021), and one set with NF = 5 latent variables, which can generate such scaling for some values of parameters τF, η, and ϵ (Morrell et al., 2021; Table 1).

Table 1. Simulation parameters for Figure 1.

Parameter Description Value
ϵ bias towards silence ϵ=12
η variance multiplier η=4.0
NF number of latent fields NF=5
τF latent field time constant τF = 10^4
N number of cells N=1024

Avalanche scaling depends on the number of latent variables

We analyzed avalanches from one- and five-variable simulations, each with a fixed latent dynamical timescale (τF = 5×10^3 time steps; see Table 2 for parameters). In the following sections, time is measured in simulation time steps; see Materials and methods for converting time steps to seconds. We used established methods for measuring empirical power laws (Clauset et al., 2009). The minimum cutoffs for size (Smin) and duration (Dmin) are indicated by vertical lines in Figure 2. For the population coupled to a single latent variable, the avalanche size distribution was not well fit by a power law (Figure 2A). With a sufficiently high minimum cutoff (Dmin), the duration distribution was approximately power-law (Figure 2B).

Table 2. Simulation parameters for Figure 2.

Parameter Description Value
ϵ bias towards silence ϵ=8 (for NF=1) or ϵ=12 (for NF=5)
η variance multiplier η=4.0
NF number of latent fields NF=1 or 5
τF latent field time constant τF = 10^3, ..., 10^5
N number of cells N=1024

Figure 2. Multiple latent variables generate avalanche scaling at shorter timescales than a single latent variable.

Simulated network is N = 1024 neurons. Other parameters used for simulations for this figure are found in Table 2. (A-C) Scaling analysis for one-variable models where the dynamic timescale is equal to 5×10^3 time steps. (A) Distribution of avalanche sizes. MLE value of the exponent for the best-fit power law is τ = 1.98 (0.02 SE), with the lower cutoff indicated by the vertical line. (B) Distribution of avalanche duration. MLE value of α is 1.81 (0.02 SE). (C) Average size plotted against avalanche duration (blue points), with power-law fit (black line) and predicted relationship (yellow line) from MLE values for exponents in A and B. Gray bar on the horizontal axis indicates the range over which a power law with γ = 1.72 fits the data (see Materials and methods). (D-F) Analysis of avalanches from a simulation of a population coupled to five independent latent variables where the dynamic timescale is equal to 5×10^3 time steps. (G) Exponents τ for avalanche size distributions across timescales for one-variable (blue) and five-variable (red) simulations. Each circle is a simulation with independently drawn coupling parameters. Simulations had to show scaling over at least two decades to be included in panels (G-J). (H) Exponents α for avalanche duration distributions for simulations in G. (I) Fitted values of γ for simulations in G. (J) Difference between fitted and predicted γ values. Five-variable simulations produce crackling over a wider range of timescales than single-variable simulations.

Figure 2—figure supplement 1. Illustration of algorithm for determining τ and α, using one variable example in Figure 2.

(A-B) Probability density function for avalanche size (A) and duration (B) on a log-log scale, with the best power-law fit (red). (C-D) In blue: maximum likelihood exponent of a power-law model as a function of the minimum (lower cutoff) size (C) and duration (D). In red: KS statistics (see Materials and methods) for each fit. ‘Best fit’ is the power law with the minimum KS statistic. (E-F) Surrogate data procedure. To generate each surrogate, samples were drawn from a power law with the size/duration cutoff indicated (E, Smin = 3; F, Dmin = 18), and the KS statistic was computed. Histograms illustrate the KS statistic across surrogates (blue), while values derived from data are in red. Because the red line does not fall within the blue histogram in E, the hypothesis that the data are well fit by a power law was rejected. At the same time, since the red line falls within the blue histogram in F, the hypothesis was accepted.
Figure 2—figure supplement 2. Illustration of algorithm for determining τ and α, using example in Figure 2, five latent variables.

Notation the same as in Figure 2—figure supplement 1.
Figure 2—figure supplement 3. Illustration of algorithm for fitting the exponent γ and determining the range over which power-law scaling of average size with duration is observed, using the example in Figure 2(A–D).

(A-C) Determining the lower bound, the minimum duration Dmin. (A) The relation log S̄ = b + γ log D was fit using linear least squares, restricted to (overlapping) 1-decade ranges (blue, red: example decades). (B) Confidence intervals for fit parameters (γ, b) for fits starting at each value of Dmin. (C) The best value of Dmin was selected based on how many subsequent start points yielded consistent slope/intercept values. (D-F) Determining the upper bound, the maximum duration Dmax. (D) Keeping Dmin fixed at the value obtained in C, we test values of Dmax up to the maximum-duration event and fit over the range [Dmin, Dmax]. (E) Average residual over the first decade of the fit, calculated for each fit and plotted against the value of Dmax used for that fit. The largest value of Dmax without evidence of bias in the residual was then selected. (F) Final fit and range.
Figure 2—figure supplement 4. Illustration of algorithm for fitting the exponent γ and determining the range over which power-law scaling of average size with duration is observed, using example in Figure 2E–H.

See Figure 2—figure supplement 3 for caption. In this example, a lower value of Dmin was selected. Panel E, which was flat for Figure 2—figure supplement 3, now shows how extending the range to high values of Dmax can generate systematic errors at the low end of the fit range, even while having a high overall goodness-of-fit metric.

We next assessed whether the simulation produced crackling. If so, the value γfit obtained by fitting S̄(D) ∝ D^{γfit} would be similar to γpred = (α − 1)/(τ − 1). In many cases, such as the one-variable example shown in Figure 2C, the full range of avalanche durations was not fit by a single power law. Therefore, we determined the largest range over which a power law was a good fit to the simulated observations. In this case, slightly over two decades of apparent scaling were observed, starting from avalanches with minimum duration slightly less than 100 time steps (Figure 2C), with a best-fit value of γfit ∈ [1.69, 1.74]. As we did not find a power law in the size distribution, calculating γpred is meaningless. If we do it anyway, we obtain γpred = 0.83 ± 0.03 (yellow line in Figure 2C), which clearly deviates from the fitted value of γ. Thus, for the single latent dynamical variable model (τF = 5×10^3), power-law fits are poor, and there is no crackling.

The five-variable model produces a different picture. We now find avalanches for which the size and duration distributions are much better fit by power-law models, starting from very low minimum cutoffs (Figure 2D-E, Figure 2—figure supplement 2). Average size scaled with duration, again over more than two decades, with γfit = 1.27 ± 0.03, in close agreement with γpred = 1.25 ± 0.02 (Figure 2F). Holding other parameters constant, we thus found that scaling relationships and crackling arise in the multi-variable model but not in the single-variable model.

Avalanche scaling depends on the time scale of latent variables

Based on simulations in the previous section, we surmised that the five-variable simulation generated scaling more readily because it creates an ‘effective’ latent variable with slower dynamics than any individual latent variable. We reasoned that at any moment in time, the latent variable state hμ(t) is a vector in the NF-dimensional latent space. Because coupling to the latent variables is random throughout the population, only the length, and not the direction, of this vector matters, and the timescale of changes in this length is much slower than τF, the timescale of each of the components hμ(t). We therefore speculated that increasing the timescale of the latent dynamics should eventually lead to scaling and crackling in the single-variable model as well as the five-variable one. To examine the dependence of avalanche scaling on this timescale, we simulated one-variable and five-variable networks at fixed η and ϵ, coupled to latent variables with Ornstein-Uhlenbeck correlation times τF ∈ [10^3, 10^5] time steps, spanning from a factor of 10 faster to a factor of 10 slower than the original τF in Figure 1. Simulations were replicated five times at each combination of parameters by drawing new latent variable couplings (Jiμ), as well as new latent variable dynamics and instances of neural firing. For simulations that passed the criteria to be fitted by power laws, we plot the fitted values of τ, α, γfit, and γfit − γpred (Figure 2G-J). Missing points are those for which distributions did not pass the power-law fit criteria.

In the single-variable model, best-fit exponents changed abruptly for latent variable timescales around τF = 10^4 (Figure 2G and H), while in the five-variable model, exponents tended to increase gradually (Figure 2G and H, red). The discontinuity in the single-variable case reflected a change in the lower cutoff values in the power-law fits: size and duration distributions generated with faster latent dynamics could be fit reasonably well to a power law by using a high value of the lower cutoff (Figure 2—figure supplement 3). For timescales greater than ∼10^4, the minimum cutoffs dropped, and the single-variable model generated power-law distributed avalanches and crackling (Figure 2J), similar to the five-variable model. In summary, in the latent dynamical variable model, introducing multiple variables generated scaling at faster timescales. However, by slowing the timescale of the latent dynamics, the model generated signatures of critical avalanche scaling for both multi- and single-variable simulations.

Avalanche criticality, input scaling, and firing threshold

In the previous section, we found that a very slow single latent dynamical variable generated avalanche criticality in the simulated population. Thus, from now on, we simplify the model in order to characterize avalanche statistics across values of the input scaling η and firing threshold ϵ. Specifically, we modeled a population of N = 128 neurons coupled to a single quasi-static latent variable. We simulated 10^3 segments of 10^4 steps each and drew a new value of the latent variable (h ~ N(0,1)) for each segment. Ten replicates of the simulation were generated at each combination of η and ϵ (see Materials and methods).
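
A sketch of this quasi-static protocol (reusing the extract_avalanches helper sketched earlier; note that avalanches straddling segment boundaries are simply truncated here, a simplification of our own):

```python
import numpy as np

rng = np.random.default_rng(1)
N, eta, eps = 128, 4.0, 8.0                  # one (eta, eps) combination
n_segments, seg_len = 1000, 10_000

J = rng.standard_normal(N)                   # single latent variable: J_i ~ N(0, 1)
sizes, durations = [], []
for _ in range(n_segments):
    h = rng.standard_normal()                # one quasi-static draw per segment
    p_fire = 1.0 / (1.0 + np.exp(-(eta * J * h - eps)))
    raster = rng.random((N, seg_len)) < p_fire[:, None]   # i.i.d. bins given h
    s, d = extract_avalanches(raster)
    sizes.append(s)
    durations.append(d)
sizes, durations = np.concatenate(sizes), np.concatenate(durations)
```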

Almost independently of η and ϵ, we found good power-law fits and crackling. Figure 3 shows the average (across n = 10 network realizations) of the exponents extracted from size (τ, Figure 3A) and duration (α, Figure 3C) distributions. At a small firing threshold (ϵ = 2), we do not observe scaling because the system is always active, and all avalanches merge into one. At a large firing threshold ϵ and low input scaling η, we do not observe scaling because activity is so sparse that all avalanches are small. At intermediate values of the parameters, the simulations generated plausible scaling relationships in size and duration. The difference between γfit and γpred was typically less than 0.1 (Figure 3D-F), consistent with previously reported differences between fit and predicted exponents (Ma et al., 2019). Thus, there appears to be no need for fine-tuning to generate apparent scaling in this model, at least in the limit of (near) infinite observation time. Wherever η and ϵ generate avalanches, there are approximate power-law distributions and crackling.

Figure 3. Exponents across network simulations for networks of N=128 neurons.

Each parameter combination (η, ϵ) was simulated for ten replicates, each time randomly drawing the couplings Ji, the latent variable values, and the neural activities. Other parameters in Table 3. (A) Average across replicates of the size exponent τ. (B) Scatter plot of α vs. τ for each network replicate, for the parameter combinations indicated in A. Linear relationships between τ and α, corresponding to the minimum and maximum values of γfit from panel E, are shown to guide the eye. (C) Same as A, for the duration exponent α. (D) Predicted exponent γpred, derived from A and C. (E) Value of γfit from the fit to S̄ ∝ D^γ. (F) Difference between γpred and γfit.

Figure 4. Avalanches in the latent dynamical variable model with a single quasistatic variable.

Parameters in Table 3. (A) Number of avalanches in simulations from Figure 3 as a function of the calculated probability of avalanches at fixed η, across values of ϵ and latent variable h. Line indicates equality. (B) Probability of avalanches with η = 2 across values of ϵ and h. The latent variable h is normally distributed with mean 0 and variance 1. Where the distribution of h overlaps with regions of high probability (black), avalanches occur. (C) Probability of avalanches at ϵ = 8 across values of η and h. Increasing η narrows the range of h that generates avalanches. (D) Probability of avalanches at h = 0 for a population of 128 neurons as a function of ϵ (black line). Size distributions corresponding to the simulations marked by the green and orange crosses are in E, F. (E) Example of a size distribution with ϵ < ϵ0 (orange marker in D). Size cutoff is close to 100. (F) Example of a size distribution with ϵ > ϵ0 (green marker in D). Size cutoff is < 10.

Figure 4—figure supplement 1. Estimate of how long it takes to observe avalanche criticality at each combination of η and ϵ.

We took a parameter combination with a low rate of avalanches but good apparent scaling (η = 4 and ϵ = 14) and assumed that this provides a reasonable estimate of the minimum number of observations (approximately 10^6 avalanches) required to observe scaling. To translate to observation length (in hours), we divided the number of avalanches observed in each full-length simulation by this minimum count and converted to a time using a time bin of 10 ms. Simulations were for a recorded population of 128 neurons. For this size of population, ϵ0 = 5.2.

To determine where avalanches occur, we derive the avalanche rate across values of the latent variable h. In the quasi-static model, the probability of avalanche initiation is the probability of a transition from the quiet to an active state. Because all neurons are conditionally independent, this is Pava = Psilence(1 − Psilence). The expected number of avalanches N̂ava is then obtained by integrating Pava over h at each value of η and ϵ:

\hat{N}_{\mathrm{ava}} = \int P_{\mathrm{ava}}(\epsilon, \eta, h; \{J_i\}, N)\, p(h)\, dh = \int \prod_i \frac{1}{1 + e^{\eta J_i h - \epsilon}} \left( 1 - \prod_i \frac{1}{1 + e^{\eta J_i h - \epsilon}} \right) p(h)\, dh, (3)

where p(h) is the standard normal distribution. This probability tracks the observed number of avalanches across simulations (Figure 4A).
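
Equation 3 is straightforward to evaluate numerically; below is a sketch (the trapezoidal quadrature over a truncated Gaussian range is our own choice):

```python
import numpy as np

def p_silence(h, J, eta, eps):
    """P(all N conditionally independent neurons silent | h), cf. Equation 9."""
    return np.prod(1.0 / (1.0 + np.exp(eta * J * h - eps)))

def expected_avalanche_rate(J, eta, eps, n_grid=2001):
    """Trapezoidal evaluation of Equation 3: quiet-to-active transition rate."""
    hs = np.linspace(-6.0, 6.0, n_grid)            # covers essentially all of p(h)
    p_sil = np.array([p_silence(h, J, eta, eps) for h in hs])
    p_ava = p_sil * (1.0 - p_sil)                  # silence followed by activity
    gauss = np.exp(-hs ** 2 / 2.0) / np.sqrt(2.0 * np.pi)
    return np.trapz(p_ava * gauss, hs)             # avalanche rate per time bin
```

Multiplying this per-bin rate by the number of simulated time bins gives an expected count that can be compared with the simulations in Figure 4A.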

To gain an intuition for the conditions under which avalanches occur, we show two slices of the avalanche probability, at fixed η (Figure 4B) and at fixed ϵ (Figure 4C). Black regions indicate where avalanches are likely to occur. If, for a given value of ϵ and η, there is no overlap between high avalanche probability regions and the distribution of h, then there will be no avalanches. For large ϵ, avalanches occur because neurons with large coupling to the latent variable (η|Ji| ≫ 1; recall Ji ~ N(0,1)) are occasionally activated by a value of the latent variable h that is sufficient to exceed ϵ (Figure 4B). Thus, the scaling parameter η controls the value of h for which avalanches occur most frequently (Figure 4C). As ϵ decreases, avalanches occur for smaller and smaller h, until avalanches primarily occur when h = 0.

To calculate the probability of avalanches, we must integrate over all values of h, but we can gain a qualitative understanding of which avalanche regime the system is in by examining the probability of avalanches at h=0. At h=0, the avalanche probability (see Materials and methods) is

P_{\mathrm{ava}}(\epsilon, \eta, h = 0; \{J_i\}, N) = \left( \frac{1}{1 + e^{-\epsilon}} \right)^{N} \left( 1 - \left( \frac{1}{1 + e^{-\epsilon}} \right)^{N} \right), (4)

which is maximized at ϵ0 = −log(2^{1/N} − 1), independent of Ji and η. After some algebra, we find that ϵ0 ≈ log N for large N. The dependence on N reflects the fact that a larger threshold is required for larger networks: large networks (N → ∞) are unlikely to achieve complete network silence, which prevents avalanches from occurring. Similarly, small networks (N ~ 1) are unlikely to fire in consecutive bins and thus are unlikely to avalanche.
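
The expression for ϵ0 is easy to check numerically; the short sketch below (our own) reproduces the value ϵ0 ≈ 5.2 quoted for N = 128 in Figure 4—figure supplement 1 and shows the slow growth with N:

```python
import numpy as np

def eps_0(N):
    """Threshold maximizing P_ava at h = 0; solves P_silence(eps)^N = 1/2."""
    return -np.log(2.0 ** (1.0 / N) - 1.0)

for N in (128, 1024, 10_000):
    # eps_0 grows like log N (up to an additive constant) for large N
    print(N, round(eps_0(N), 2), round(np.log(N), 2))
```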

We plot Pava(ϵ, η, h = 0; {Ji}, N) as a function of ϵ in Figure 4D. The peak at ϵ0 divides the space into two regions. For ϵ < ϵ0, a power law is only observed in the large-size avalanches, which are rare (Figure 4E, orange). By contrast, when ϵ > ϵ0, minimum size cutoffs are low (Figure 4F, green). Both regions, ϵ < ϵ0 and ϵ > ϵ0, exhibit crackling noise scaling. If observation times are not sufficiently long (estimated in Figure 4—figure supplement 1), then scaling will not be observed in the ϵ < ϵ0 region, whose scaling relations arise from rare events. Insufficient observation times may explain experiments and simulations where avalanche scaling was not found.

Inferring the latent variable

Our analysis of Pava(ϵ, η, h) at h = 0 suggested that there are two types of avalanche regimes: one with high activity and high minimum cutoffs in the power-law fit (Type 1), and the other with lower activity and lower size cutoffs (Type 2). Further, when Pava drops to zero, avalanches disappear because the activity is too high or too low. We now examine how the information about the value of the latent variables represented in the network activity relates to the activity type. To delineate these types, we numerically calculated ϵ*(η), the value of ϵ for which the probability of avalanches is maximized at fixed η, and the contours of Pava (Figure 5A). Curves for ϵ*(η), ϵ0, and Pava = 10^{-3} are shown in Figure 5A and B.

Figure 5. Information in the neural activity about the latent variable is higher in the low-ϵ avalanche region, compared to high-ϵ avalanche or high-rate avalanche-free activity.

(A) Probability of an avalanche per time step across values of η and ϵ. Solid magenta curve follows ϵ*(η), the value of ϵ maximizing the probability of avalanches at fixed η. Dashed magenta line indicates ϵ0, calculated analytically, which matches ϵ*(η) at η = 0. (B) Information about the latent variable, calculated from the maximum likelihood estimate of h using population activity. The MLE approximation is invalid in the dark-blue region bounded by the gray curve. Magenta line marks the maximum values of Pava, reproduced from A. Dashed black curve indicates Pava = 0.001. The highest-information region falls between ϵ*(η) and the contour for Pava = 0.001. (C-E) Slices of B, showing IMLE(ϵ) for η = {2, 5, 9}. Magenta and dashed black lines indicate ϵ*(η) and Pava = 0.001, respectively, as in B. The dashed black line marks the approximate boundary between the high-activity/no-avalanche region and the high-cutoff avalanche region, and the magenta line marks the boundary between the high-cutoff and low-cutoff avalanche regions.

We expect that the more cells fire, the more information they convey, until the firing rate saturates and inferring the value of the latent variable becomes impossible. Figure 5B supports this prediction: generally, information is higher in regions with more activity (lower ϵ, higher η), but only up to a limit: as ϵ → 0, information decreases. This decrease begins approximately where the probability of avalanches drops to nearly zero (dashed black lines, Figure 5B-E), because all of the activity merges into a few very large avalanches. In other words, the Type-1 avalanche region coincides with the highest information about the latent variable.

The critical brain hypothesis suggests that the brain operates in a critical state, whose functional role may be to optimize information processing (Beggs, 2008; Chialvo, 2010). Under this hypothesis, we would expect the information conveyed by the network to be maximized in the regions where we observe avalanche criticality. However, we see that critical regions do not always have optimal information transmission. In Figure 5, the region that displays crackling noise is the region where avalanches exist (Pava > 0.001), which corresponds to any η value and ϵ ≳ 3. This avalanche region encompasses both networks with high information transmission and networks with low information transmission. In summary, observing avalanche criticality in a system does not imply a high-information network state. However, the scaling can be seen at smaller cutoffs, and hence with shorter recordings, in the high-information state. This parallels the discussion by Schwab et al., 2014, who noticed that Zipf’s law always emerges in neural populations driven by quasi-stationary latent fields, but it emerges at smaller system sizes when the information about the latent variable is high.

Discussion

Here, we studied systems with distributed, random coupling to latent dynamical variables and we found that avalanche criticality is nearly always observed, with no fine-tuning required. Avalanche criticality was surprisingly robust to changes in input gain and firing rate threshold. Loss of avalanche criticality could occur if the latent process was not well-sampled, either because the simulation was not long enough or the dynamics of the latent variables were too fast. Finally, while information about the latent variables in the network activity was higher where avalanches were generated compared to when they were not, there was a range of information values across the critical avalanche regime. Thus, avalanche criticality alone was not a predictor of optimal information transmission.

Explaining experimental exponents

A wide range of critical exponents has been found in ex vivo and in vivo recordings from various systems. For instance, the seminal work on avalanche statistics in cultured neuronal networks by Beggs and Plenz, 2003 found size and duration exponents of 1.5 and 2.0 respectively, along with γ=2, when time was discretized with a time bin equal to the average inter-event interval in the system. A subset of the in vivo recordings analyzed from anesthetized cat (Hahn et al., 2010) and macaque monkeys (Petermann et al., 2009) also exhibited a size distribution exponent close to 1.5. By contrast, a survey of many in vivo and ex vivo recordings found power-law size distributions with exponents ranging from 1 to 3 depending on the system (Fontenele et al., 2019). Separately, Ma et al., 2019 reported recordings in freely moving rats with size exponents ranging from 1.5 to 2.7. In these recordings, when the crackling relationship held, the reported value of γ was near 1.2 (Fontenele et al., 2019; Ma et al., 2019).

A model for generating avalanche criticality is a critical branching process (Beggs and Plenz, 2003), which predicts size and duration exponents of 1.5 and 2 and a scaling exponent γ of 2. However, there are alternatives: Lombardi et al., 2023 showed that avalanche criticality may also be produced by an adaptive Ising model in the sub-critical regime, and in this case, the scaling exponent γ was not 2 but close to 1.6. Our model, across the parameters we tested that produced exponents consistent with the scaling relationship, generated τ values that ranged from 1.9 to about 2.5. Across those simulations, we found values of γ within a narrow band from 1.1 to 1.3 (see Figure 2I and J and Figure 3E). While the exponent values our model produces are inconsistent with a critical branching process (γ = 2), they match the ranges of exponents estimated from experiments and reported by Fontenele et al., 2019. In this context, it might be useful to explore whether our model and that of Lombardi et al., 2023 might be related, with the adaptive feedback signal of the latter viewed as an effective latent variable of the former.

A genuine challenge in comparing exponents estimated from different experiments with different recording modalities (spiking activity, calcium imaging, LFP, EEG, or MEG) arises from differences in spatial and temporal scale specific to a particular recording, as well as the myriad decisions made in avalanche analysis, such as defining thresholds or binning in time. Thus, one possible reason for differences in exponents across studies may derive from how the system is sub-sampled in space or coarse-grained in time, both of which systematically change exponents τ and α (Beggs and Plenz, 2003; Shew et al., 2015) and could account for differences in γ (Capek et al., 2023). The model we presented here could be used as a test bed for examining how specific analysis choices affect exponents estimated from recordings.

A second possible explanation for differences in exponents is that different experiments study similar, but distinct biological phenomena. For instance, networks that were cultured in vitro may differ from those that were not, whether they are in vivo or ex vivo (i.e. brain slices), and sensory-processing networks may have different dynamics from networks with different processing demands. It is possible that certain networks develop connections between neurons such that they truly do produce dynamics that approximate a critical branching process, while other networks have different structure and resulting dynamics and thus can be better understood as coupled neurons receiving feedback (Lombardi et al., 2023) or as a system coupled to latent dynamical variables. This is especially true in sensory systems, where coupling to (latent) external stimuli in a way that the neural activity can be used to infer the stimuli is the reason for the networks’ existence (Schwab et al., 2014).

Relationship to past modeling work

Our model is not the first to produce approximate power-law size and duration distributions for avalanches from a latent variable process (Touboul and Destexhe, 2017; Priesemann and Shriki, 2018). In particular, Priesemann and Shriki, 2018 derived the conditions under which an inhomogeneous Poisson process could produce such approximate scaling. The basic idea is to generate a weighted sum of exponentially distributed event sizes, each of which is generated from a homogeneous Poisson process. How each process is weighted in this sum determines the approximate power-law exponent, allowing one to tune the system to obtain the critical values of 1.5 and 2. Interestingly, this model did not generate non-trivial scaling of size with duration (S̄ ∝ D^γ). Instead, they found γ = 1, not the predicted γ = 2. Our results differ significantly, in that γ was typically between 1.1 and 1.3 and was nearly always close to the prediction from α and τ. We speculate that this is due to the nonlinearity in the mapping from latent variable to spiking activity: doubling the latent field h does not double the population activity, but doubling the rate of a homogeneous Poisson process does double the expected spike count. As biological networks are likely to have such nonlinearities in their responses to common inputs, this scenario may be more applicable to certain kinds of recordings.

Summary

Latent variables – whether they are emergent from network dynamics (Clark et al., 2023; Sederberg and Nemenman, 2020) or derived from shared inputs – are ubiquitous in large-scale neural population recordings. This fact is reflected most directly in the relatively low-dimensional structure of large-scale population recordings (Stringer et al., 2019; Pandarinath et al., 2018; Nieh et al., 2021). We previously used a model based on this observation to examine signatures of neural criticality under a coarse-graining analysis and found that coarse-grained criticality is generated by systems driven by many latent variables (Morrell et al., 2021). Here, we showed that the same model also generates avalanche criticality, and that when information about the latent variables can be inferred from the network, avalanche criticality is also observed. Crucially, finding signatures of avalanche criticality required long observation times, such that the latent variable was well sampled. Previous studies showed that Zipf’s law appears generically in systems coupled to a latent variable that changes slowly relative to the sampling time, and that Zipf’s behavior is easier to observe in the higher-information regime (Schwab et al., 2014; Aitchison et al., 2016). However, this also suggests that observation of either type of scaling at modest dataset sizes indeed points to some fine-tuning: namely, to the increase of the information in the individual neurons (and, since neurons in these models are conditionally independent, also in the entire network) about the value of the latent variables. In other words, one would expect a sensory part of the brain, if adapted to the statistics of the external stimuli, to exhibit all of these critical signatures at relatively modest dataset sizes. In monocular deprivation experiments, when the activity in the visual cortex is transiently not adapted to its inputs, scaling disappears, at least for recordings of a typical duration, and is restored as the system adapts to the new stimulus (Ma et al., 2019). We speculate that the observed recovery of criticality by Ma et al., 2019 could be driven by neurons adapting to the reduced-stimuli state, for instance, by adjusting η (input scaling) and ϵ (firing rate threshold).

Taken together, these results suggest that critical behavior in neural systems – whether based on the Zipf’s law, avalanches, or coarse-graining analysis – is expected whenever neural recordings exhibit some latent structure in population dynamics and this latent structure can be inferred from observations of the population activity.

Materials and methods

Simulation of dynamic latent variable model

We study a model from Morrell et al., 2021, originally constructed as a model of large populations of neurons in mouse hippocampus. In the original version of the model, neurons are non-interacting, receiving inputs reflective of place-field selectivity as well as input current arising from a random projection from a small number of latent dynamical variables, representing inputs shared across the population of neurons that are not directly measured or controlled. In the current paper, we incorporate only the latent variables (no place variables), and we assume that every cell is coupled to every latent variable with some randomly drawn coupling strength.

The probability of observing a certain population state {si} given latent variables hμ(t) at time t is

P(\{s_i\} \mid \{h_\mu\}) = \frac{1}{Z(\{h_\mu\})} e^{-H(\{s_i\}, \{h_\mu\})}, (5)

where Z is the normalization, and H is the ‘energy’:

H = -\eta \sum_{i=1}^{N} \sum_{\mu=1}^{N_F} h_\mu(t) J_{i\mu} s_i + \epsilon \sum_{i=1}^{N} s_i. (6)

The latent variables hμ(t) are Ornstein-Uhlenbeck processes with zero mean, unit variance, and time constant τF. Couplings Jiμ are drawn from the standard normal distribution.

Parameters for each figure are laid out in Tables 1-3. For the infinite-time-constant (quasi-static) simulations, we draw a value h ~ N(0,1) and simulate for 10^4 time steps, then repeat for 10^3 draws of h.

Table 3. Simulation parameters for Figures 3 and 4.

Parameter Description Value
ϵ bias towards silence ϵ ∈ {2, 4, ..., 14}
η variance multiplier η ∈ {1, 2, ..., 10}
NF number of latent fields NF=1
τF latent field time constant quasistatic
N number of cells N=128

Time step units

Most results were presented using arbitrary time units: all times (i.e. τF and avalanche duration D) are measured in units of an unspecified time step. Specifying a time bin converts the probability of firing into actual firing rates, in spikes per second, and this choice determines which part of the η-ϵ phase space is most relevant to a given experiment.

The time step is the temporal resolution at which activity is discretized, which varies from several to hundreds of milliseconds across experimental studies (Beggs and Plenz, 2003; Fontenele et al., 2019; Ma et al., 2019). In physical units, and assuming a bin size of 3 ms to 10 ms, our choice of η and ϵ in Figure 2 would yield physiologically realistic firing rate ranges (Hengen et al., 2016), with high-firing neurons reaching average rates of 20-50 spikes/second and median firing-rate neurons around 1-2 spikes/second. The timescales of the latent variables examined range from about 3 s to 3000 s, assuming 3-ms bins. Inputs with such timescales may arise from external sources, such as sensory stimuli, or from internal sources, such as changes in physiological state.
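
The conversions described above are simple arithmetic; a short sketch (the bin size and example firing probability are assumptions for illustration):

```python
bin_s = 0.003                         # assumed 3-ms time bins
tau_F_steps = 1e4                     # latent timescale in simulation steps
print(tau_F_steps * bin_s)            # -> 30 s in physical units
p_fire_per_bin = 0.15                 # example per-bin firing probability
print(p_fire_per_bin / bin_s)         # -> 50 spikes/s, a high-firing neuron
```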

Simulations were carried out for the same number of time steps (2×10^6), which would be approximately 1 to 2 ‘hours’, a reasonable duration for in vivo neural recordings. Note that at large values of τF, the latent variable space is not well sampled during this time period.

Analysis of avalanche statistics

Setting the threshold for observing avalanches

In our model, we count avalanches as periods of continuous activity (in any subset of neurons) that are book-ended by time bins with no activity in the entire simulated neural network. For real neural populations of modest size, this method fails because there are no periods of quiescence. The typical solution is to set a threshold and to count avalanches only when the population activity exceeds that threshold, with the hope that results are relatively robust to this choice. In our model, this operation is equivalent to changing ϵ, which shifts the probability of firing up or down by a constant amount across all cells, independent of inputs. Our results in Figure 3 show that α and τ decrease as the threshold for detection is increased (equivalent to large |ϵ|), but that the scaling relationship is maintained. The model predicts that γpred − γfit would initially increase slightly with the detection threshold before decreasing back to near zero.

Following the algorithm laid out in Clauset et al., 2009, we fit power laws to the size and duration distributions from simulations that generated avalanches. We used least-squares fitting to estimate γfit, the scaling exponent of size with duration, assessing the consistency of the fit across decades.

Reading power laws from data

From each simulation, we want a quantification of the quality of scaling (minimally, over how many decades it holds) and an estimate of the scaling exponents (τ for the size distribution, α for the duration distribution). We first compile all avalanches observed in the simulation and, for each avalanche, calculate its size (total activity across the population during the avalanche) and its duration (number of time bins). Following the steps outlined by Clauset et al., 2009, we use the maximum-likelihood estimator to determine the scaling exponent. This is the solution to the transcendental equation

\frac{\zeta'(\hat{\alpha}, x_{\min})}{\zeta(\hat{\alpha}, x_{\min})} = -\frac{1}{n} \sum_{i=1}^{n} \ln x_i, (7)

where ζ(α, xmin) is the Hurwitz zeta function and the xi are the observations, that is, either the size or the duration of each avalanche i. For values of xmin < 6, a numerical look-up table based on the built-in Hurwitz zeta function in the symbolic math toolbox was used (MATLAB 2019b). For xmin > 6, we use an approximation (Clauset et al., 2009),

\hat{\alpha} = 1 + n \left( \sum_i \ln \frac{x_i}{x_{\min} - \frac{1}{2}} \right)^{-1}. (8)

To determine xmin, we computed the maximum absolute difference between the empirical cumulative distribution function S(x) and the model’s cumulative distribution function P(x) (the Kolmogorov-Smirnov (KS) statistic, D = max_{x ≥ xmin} |S(x) − P(x)|). The KS statistic was computed for power-law models with scaling parameter α̂ and cutoff xmin. The value of xmin that minimized the KS statistic was chosen. Occasionally, the KS statistic had two local minima (as in Figure 2—figure supplement 1), indicating two different power laws. In these cases, the minimum size and duration cutoffs were the smallest values that were within 10% of the absolute minimum of the KS statistic. Note that the statistic is computed for each model only on the power-law portion of the CDF (i.e. xi ≥ xmin). We do not attempt to determine an upper cutoff value.
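
A compact sketch of this estimator follows (in Python rather than the paper's MATLAB; we maximize the discrete power-law likelihood directly with scipy, which is equivalent to solving Equation 7, and the function names are our own):

```python
import numpy as np
from scipy.special import zeta
from scipy.optimize import minimize_scalar

def fit_discrete_power_law(x, x_min):
    """MLE exponent of a discrete power law P(x) ~ x^(-a) for x >= x_min.

    Maximizes log L(a) = -n log zeta(a, x_min) - a sum(ln x_i), equivalent
    to solving the transcendental condition in Equation 7."""
    arr = np.asarray(x, dtype=float)
    tail = arr[arr >= x_min]
    n, slog = len(tail), np.log(tail).sum()
    nll = lambda a: n * np.log(zeta(a, x_min)) + a * slog
    return minimize_scalar(nll, bounds=(1.01, 6.0), method="bounded").x, n

def ks_statistic(x, x_min, a):
    """KS distance between empirical and fitted CDFs on the tail x >= x_min."""
    arr = np.asarray(x, dtype=float)
    tail = np.sort(arr[arr >= x_min])
    vals = np.unique(tail)
    empirical = np.searchsorted(tail, vals, side="right") / len(tail)
    model = 1.0 - zeta(a, vals + 1.0) / zeta(a, x_min)   # model P(X <= v)
    return np.abs(empirical - model).max()
```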

To assess the quality of the power-law fit, Clauset et al., 2009 compared the empirical observations to surrogate data generated from a semi-parametric power-law model. The semi-parametric model sets the value of the CDF equal to the empirical CDF up to x = xmin and then follows the power-law model for x > xmin. If the KS statistic for the real data (relative to its fitted model) falls within the distribution of KS statistics for surrogate datasets (relative to their respective fitted models), the power-law model is considered a reasonable fit.
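
A sketch of the surrogate test under those assumptions (our own implementation, reusing the two helpers above; the inverse-CDF sampling of the discrete tail is truncated at an arbitrary upper bound for simplicity):

```python
import numpy as np
from scipy.special import zeta

def surrogate_ks_test(x, x_min, a_hat, n_surr=100, seed=2):
    """Semi-parametric surrogate test of Clauset et al., 2009.

    The body (x < x_min) is resampled from the data; the tail (x >= x_min)
    is drawn from the fitted discrete power law. Returns the fraction of
    surrogates whose KS statistic exceeds that of the data (a p-value)."""
    x = np.asarray(x)
    rng = np.random.default_rng(seed)
    body, n = x[x < x_min], len(x)
    p_tail = (x >= x_min).mean()
    d_data = ks_statistic(x, x_min, a_hat)
    grid = np.arange(x_min, 1000 * x_min)               # truncated tail support
    cdf = 1.0 - zeta(a_hat, grid + 1.0) / zeta(a_hat, x_min)
    worse = 0
    for _ in range(n_surr):
        n_tail = rng.binomial(n, p_tail)
        idx = np.minimum(np.searchsorted(cdf, rng.random(n_tail)), len(grid) - 1)
        surr = np.concatenate([rng.choice(body, n - n_tail), grid[idx]])
        a_s, _ = fit_discrete_power_law(surr, x_min)
        worse += ks_statistic(surr, x_min, a_s) >= d_data
    return worse / n_surr
```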

Strict application of this methodology can give misleading results. Much of this is due to the loss of statistical power when the minimum cutoff is so high that the number of observations drops. For instance, in the simulations shown in Figure 2, the one-variable duration distribution passed the Clauset et al., 2009 criterion, with a minimum KS statistic of 0.03 when the duration cutoff was 18 time steps. However, for the five-variable simulation in Figure 2, a power law would be narrowly rejected for both size and duration, despite much smaller KS statistics: for τ, the KS statistic was 0.0087 (simulation range: 0.0008 to 0.0082; number of avalanches observed: 58,787), and for α it was 0.0084 (simulation range: 0.0011 to 0.0075). Below, we discuss this problem in more detail.

Determining range over which avalanche size scales with duration

For fitting γ, our aim was to find the longest sampled range over which we have apparent power-law scaling of size with duration. Because our sampled duration values have linear spacing, error estimates are skewed if a naive goodness-of-fit criterion is used. We devised the following algorithm. First, the simulation must have at least one avalanche of size 500. We fit S̄ = cD^γ over one decade at a time. As the lower duration cutoff, we chose the value of minimum duration for which the largest number of subsequent (longer-duration) fits produced consistent fit parameters (Figure 2—figure supplements 3 and 4, top row). Next, with the minimum duration set, we gradually increased the maximum duration cutoff and determined whether there was a significant bias in the residual over the first decade of the fit. We selected the highest duration cutoff for which there was no bias. Finally, over this range, we re-fit the power-law relationship and extracted confidence intervals.
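
The first stage of this algorithm, decade-wise least-squares fits of log S̄ against log D, might look as follows (a sketch; the window spacing and the minimum number of points per window are our own choices):

```python
import numpy as np

def gamma_decade_fits(sizes, durations, n_starts=25):
    """Fits of log10(mean size) = b + gamma * log10(D), one decade at a time.

    Returns (D_min, gamma, b) for each candidate lower cutoff; consistency of
    the fits across subsequent start points is what selects D_min in the text."""
    d_vals = np.unique(durations)
    s_bar = np.array([sizes[durations == d].mean() for d in d_vals])
    log_d, log_s = np.log10(d_vals), np.log10(s_bar)
    fits = []
    for d_min in np.logspace(0, np.log10(d_vals.max()) - 1, n_starts):
        win = (d_vals >= d_min) & (d_vals < 10 * d_min)   # one-decade window
        if win.sum() < 5:                                 # need enough points
            continue
        gamma, b = np.polyfit(log_d[win], log_s[win], 1)
        fits.append((d_min, gamma, b))
    return fits
```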

Our analysis focused on finding the apparent power-law relationship that held over the largest log-scale range. A common feature across simulation parameters (τF, NF) was the existence of two distinct power-law regimes. This is apparent in Figure 2I, which shows that when NF = 1 at small τF, the best-fit γ (that showing the largest range with power-law-consistent scaling) is much larger (≈1.7), and then above τF ≈ 3000, the best-fit γ drops to around 1.3.

Statistical power of power-law tests

In several cases, we found examples of power-law fits that passed the rejection criteria commonly used to determine avalanche scaling relationships only because of the limited number of observations. A key example is the single latent variable simulation shown in Figure 2B, where we could not reject a power law for the duration distribution. Conversely, strict application of the surrogate criteria would reject a power law for distributions that were quantitatively much closer to a power law (i.e. had a lower KS statistic), but for which we had many more observations and thus a much stronger surrogate test (Figure 2). This points to the difficulty of applying a single criterion to determining a power-law fit. In this work, we adhere to the criteria set forth in Clauset et al., 2009, with a modification to control for the unreasonably high statistical power of simulated data. Specifically, the number of avalanches used for fitting and for surrogate analysis was capped at 500,000, drawn randomly from the entire pool of avalanches.

Additionally, we found examples in which a short simulation was rejected, but increasing the simulation time by a factor of five yielded excellent power-law fits. We speculate that this arises from insufficient sampling of the latent space. These observations raise an important biological point. Simulations provide the luxury of assuming the network is unchanging for as long as the simulator cares to keep drawing samples. In a biological network, this is not the case. Over the course of hours, the effective latent degrees of freedom could change drastically (e.g. due to circadian effects [Aton et al., 2009], changes in behavioral state [Fu et al., 2014], plasticity [Hooks and Chen, 2020], etc.), and the network itself (synaptic scaling, firing thresholds, etc.) could be plastic (Hengen et al., 2016). All these factors can be modeled in our framework by determining appropriate cutoffs (in duration of recording, in time-step size, for activity distributions) based on specific experimental timescales.

Calculation of avalanche regimes

In the quasistatic model, we derive the dependence of the avalanche rate on η, ϵ, and the number of neurons N, finding that there are two distinct regimes in which avalanches occur. Each time bin is independent, conditioned on the value of h. For an avalanche to occur, the probability of silence in the population (i.e. all si = 0) must not be too close to 0 (or there are no breaks in activity) nor too close to 1 (or there is no activity). At fixed h, the probability of silence is

P_{\mathrm{silence}}(\epsilon, \eta; \{J_i\}, N, h) = \prod_{i} \frac{1}{1 + \exp(\eta J_i h - \epsilon)}. (9)

An avalanche occurs when a silent time bin is followed by an active bin, which has probability Pava(ϵ, η; {Ji}, N, h) = Psilence(1 − Psilence).

Information calculation

Maximum-likelihood decoding

For large populations coupled to a single latent variable, we estimated the information between the population spiking activity and the latent variable as the information between the maximum-likelihood estimator h* of the latent variable and the latent variable h itself. This approximation fails at extremes of network activity levels (low or high).

Specifically, we approximated the log-likelihood of h near its maximum-likelihood value h* by log L(h) ≈ log Lmax − (h − h*)²/(2σ²_{h*}). Thus, we assume that h* is normally distributed about htrue with variance σ²(htrue), which is derived from the curvature of the log-likelihood at its maximum. The information between two Gaussian variables, here P(h*|h) = N(h, σ²_{h*}) and p(h) = N(0,1), is

I(h^*; \{\bar{s}_i\}, T) \approx \frac{1}{2} \left\langle \log \frac{T}{\sigma^2_{h_{\mathrm{true}}}} \right\rangle_{h_{\mathrm{true}}}, (10)

where the average is taken over htrue ~ N(0,1).

Given a set of T observations of the neurons {si}, the likelihood is

P(\{s_i\}_t \mid h) = \prod_{i,t}^{N,T} P(s_i^t \mid h) = \prod_{i,t}^{N,T} \frac{e^{\eta s_i^t J_i h - \epsilon s_i^t}}{1 + e^{\eta J_i h - \epsilon}}. (11)

Maximizing the log likelihood gives the following condition:

0 = \left. \frac{\partial (\log P)}{\partial h} \right|_{h^*} = \left. \frac{\partial}{\partial h} \left( \sum_{i,t} \left[ (\eta s_i J_i h - \epsilon s_i) - \log\left(1 + e^{\eta J_i h - \epsilon}\right) \right] \right) \right|_{h^*} (12)
= \sum_i \left[ \eta \bar{s}_i J_i T - \frac{T J_i \eta}{1 + e^{-\eta J_i h^* + \epsilon}} \right], (13)

where s̄i = (1/T) Σt si^t is the average over observations t. The uncertainty in h* is σ_{h*}, which was calculated from the second derivative of the log-likelihood:

\frac{1}{\sigma^2_{h^*}} = -\frac{\partial^2 (\log P)}{\partial h^2} (14)
= -\left. \frac{\partial}{\partial h} \left( \sum_i \left[ \eta \bar{s}_i J_i T - \frac{T J_i \eta}{1 + e^{-\eta J_i h + \epsilon}} \right] \right) \right|_{h^*} (15)
= \sum_i \frac{T (\eta J_i)^2 e^{-\eta J_i h^* + \epsilon}}{\left(1 + e^{-\eta J_i h^* + \epsilon}\right)^2} (16)
= \sum_i \frac{T (\eta J_i)^2}{4 \cosh^2\left( \frac{\eta J_i h^* - \epsilon}{2} \right)}. (17)

This expression depends on the observations s̄i only through the maximum-likelihood estimate h*. When h* ≈ htrue, the variance is

\frac{1}{\sigma^2_{h^*}} = \sum_i \frac{T (\eta J_i)^2}{4 \cosh^2\left( \frac{\eta J_i h_{\mathrm{true}} - \epsilon}{2} \right)} \equiv \frac{T}{\sigma^2_{h_{\mathrm{true}}}}. (18)

To generate Figure 5, we evaluated Equation 10 using Equation 18.
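
Evaluating Equation 10 via Equation 18 amounts to a Monte-Carlo average over htrue; a sketch of this calculation (our own, with an assumed sample count):

```python
import numpy as np

def information_mle(J, eta, eps, T, n_mc=10_000, seed=3):
    """Monte-Carlo evaluation of Equation 10 using the Fisher information
    of Equation 18. J: couplings (N,); T: number of observed time bins."""
    rng = np.random.default_rng(seed)
    h_true = rng.standard_normal(n_mc)                 # h_true ~ N(0, 1)
    arg = (eta * np.outer(h_true, J) - eps) / 2.0      # (n_mc, N) cosh argument
    inv_var = ((eta * J) ** 2 / (4.0 * np.cosh(arg) ** 2)).sum(axis=1)
    return 0.5 * np.log(T * inv_var).mean()            # information in nats
```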

Code availability

Simulation code was adapted from our previous work (Morrell et al., 2021). Code to run simulations and perform analyses presented in this paper is uploaded as Source code 1 and also available from https://github.com/ajsederberg/avalanche (copy archived at Sederberg, 2024).

Acknowledgements

IN was supported in part by the Simons Foundation Investigator program, the Simons-Emory Consortium on Motor Control, NSF grant BCS/1822677 and NIH grant 2R01NS084844. AS was supported in part by NIH grant 1RF1MH130413-01 and by startup funds from the University of Minnesota Medical School. The authors acknowledge the Minnesota Supercomputing Institute (MSI) at the University of Minnesota for providing resources that contributed to the research results reported within this paper.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Audrey Sederberg, Email: audrey.sederberg@gatech.edu.

Peter Latham, University College London, United Kingdom.

Timothy E Behrens, University of Oxford, United Kingdom.

Funding Information

This paper was supported by the following grants:

  • Simons Foundation Simons-Emory Consortium on Motor Control to Ilya Nemenman.

  • National Science Foundation BCS/1822677 to Ilya Nemenman.

  • National Institute of Neurological Disorders and Stroke 2R01NS084844 to Ilya Nemenman.

  • National Institute of Mental Health 1RF1MH130413 to Audrey Sederberg.

  • Simons Foundation Investigator Program to Ilya Nemenman.

Additional information

Competing interests

No competing interests declared.

Reviewing editor, eLife.

Author contributions

Conceptualization, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing.

Conceptualization, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing – review and editing.

Conceptualization, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing – original draft, Writing – review and editing.

Additional files

MDAR checklist
Source code 1. Code (Python and MATLAB) used to run simulations, analyses, and calculations.
elife-89337-code1.zip (1.1MB, zip)

Data availability

The current manuscript is a computational study, so no data were generated for this manuscript. Modelling code is uploaded as Source code 1.

References

  1. Ahrens MB, Li JM, Orger MB, Robson DN, Schier AF, Engert F, Portugues R. Brain-wide neuronal dynamics during motor adaptation in zebrafish. Nature. 2012;485:471–477. doi: 10.1038/nature11057.
  2. Aitchison L, Corradi N, Latham PE. Zipf’s law arises naturally when there are underlying, unobserved variables. PLOS Computational Biology. 2016;12:e1005110. doi: 10.1371/journal.pcbi.1005110.
  3. Aton SJ, Seibt J, Dumoulin M, Jha SK, Steinmetz N, Coleman T, Naidoo N, Frank MG. Mechanisms of sleep-dependent consolidation of cortical plasticity. Neuron. 2009;61:454–466. doi: 10.1016/j.neuron.2009.01.007.
  4. Beggs JM, Plenz D. Neuronal avalanches in neocortical circuits. The Journal of Neuroscience. 2003;23:11167–11177. doi: 10.1523/JNEUROSCI.23-35-11167.2003.
  5. Beggs JM. The criticality hypothesis: how local cortical networks might optimize information processing. Philosophical Transactions of the Royal Society A. 2008;366:329–343. doi: 10.1098/rsta.2007.2092.
  6. Capek E, Ribeiro TL, Kells P, Srinivasan K, Miller SR, Geist E, Victor M, Vakili A, Pajevic S, Chialvo DR, Plenz D. Parabolic avalanche scaling in the synchronization of cortical cell assemblies. Nature Communications. 2023;14:2555. doi: 10.1038/s41467-023-37976-x.
  7. Chialvo DR. Emergent complex neural dynamics. Nature Physics. 2010;6:744–750. doi: 10.1038/nphys1803.
  8. Clark DG, Abbott LF, Litwin-Kumar A. Dimension of activity in random neural networks. Physical Review Letters. 2023;131:118401. doi: 10.1103/PhysRevLett.131.118401.
  9. Clauset A, Shalizi CR, Newman MEJ. Power-law distributions in empirical data. SIAM Review. 2009;51:661–703. doi: 10.1137/070710111.
  10. Dahmen D, Grün S, Diesmann M, Helias M. Second type of criticality in the brain uncovers rich multiple-neuron dynamics. PNAS. 2019;116:13051–13060. doi: 10.1073/pnas.1818972116.
  11. Fontenele AJ, de Vasconcelos NAP, Feliciano T, Aguiar LAA, Soares-Cunha C, Coimbra B, Dalla Porta L, Ribeiro S, Rodrigues AJ, Sousa N, Carelli PV, Copelli M. Criticality between cortical states. Physical Review Letters. 2019;122:208101. doi: 10.1103/PhysRevLett.122.208101.
  12. Fu Y, Tucciarone JM, Espinosa JS, Sheng N, Darcy DP, Nicoll RA, Huang ZJ, Stryker MP. A cortical circuit for gain control by behavioral state. Cell. 2014;156:1139–1152. doi: 10.1016/j.cell.2014.01.050.
  13. Girardi-Schappo M. Brain criticality beyond avalanches: open problems and how to approach them. Journal of Physics: Complexity. 2021;2:031003. doi: 10.1088/2632-072X/ac2071.
  14. Hahn G, Petermann T, Havenith MN, Yu S, Singer W, Plenz D, Nikolic D. Neuronal avalanches in spontaneous activity in vivo. Journal of Neurophysiology. 2010;104:3312–3322. doi: 10.1152/jn.00953.2009.
  15. Hengen KB, Torrado Pacheco A, McGregor JN, Van Hooser SD, Turrigiano GG. Neuronal firing rate homeostasis is inhibited by sleep and promoted by wake. Cell. 2016;165:180–191. doi: 10.1016/j.cell.2016.01.046.
  16. Hooks BM, Chen C. Circuitry underlying experience-dependent plasticity in the mouse visual system. Neuron. 2020;106:21–36. doi: 10.1016/j.neuron.2020.01.031.
  17. Humplik J, Tkačik G. Probabilistic models for neural populations that naturally capture global coupling and criticality. PLOS Computational Biology. 2017;13:e1005763. doi: 10.1371/journal.pcbi.1005763.
  18. Lombardi F, Pepić S, Shriki O, Tkačik G, De Martino D. Statistical modeling of adaptive neural networks explains co-existence of avalanches and oscillations in resting human brain. Nature Computational Science. 2023;3:254–263. doi: 10.1038/s43588-023-00410-9.
  19. Ma Z, Turrigiano GG, Wessel R, Hengen KB. Cortical circuit dynamics are homeostatically tuned to criticality in vivo. Neuron. 2019;104:655–664. doi: 10.1016/j.neuron.2019.08.031.
  20. Mante V, Sussillo D, Shenoy KV, Newsome WT. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature. 2013;503:78–84. doi: 10.1038/nature12742.
  21. Mariani B, Nicoletti G, Bisio M, Maschietto M, Oboe R, Leparulo A, Suweis S, Vassanelli S. Neuronal avalanches across the rat somatosensory barrel cortex and the effect of single whisker stimulation. Frontiers in Systems Neuroscience. 2021;15:709677. doi: 10.3389/fnsys.2021.709677.
  22. Mazor O, Laurent G. Transient dynamics versus fixed points in odor representations by locust antennal lobe projection neurons. Neuron. 2005;48:661–673. doi: 10.1016/j.neuron.2005.09.032.
  23. Meshulam L, Gauthier JL, Brody CD, Tank DW, Bialek W. Coarse graining, fixed points, and scaling in a large population of neurons. Physical Review Letters. 2019;123:178103. doi: 10.1103/PhysRevLett.123.178103.
  24. Mora T, Bialek W. Are biological systems poised at criticality? Journal of Statistical Physics. 2011;144:268–302. doi: 10.1007/s10955-011-0229-4.
  25. Morales GB, di Santo S, Muñoz MA. Quasiuniversal scaling in mouse-brain neuronal activity stems from edge-of-instability critical dynamics. PNAS. 2023;120:e2208998120. doi: 10.1073/pnas.2208998120.
  26. Morrell MC, Sederberg AJ, Nemenman I. Latent dynamical variables produce signatures of spatiotemporal criticality in large biological systems. Physical Review Letters. 2021;126:118302. doi: 10.1103/PhysRevLett.126.118302.
  27. Nieh EH, Schottdorf M, Freeman NW, Low RJ, Lewallen S, Koay SA, Pinto L, Gauthier JL, Brody CD, Tank DW. Geometry of abstract learned knowledge in the hippocampus. Nature. 2021;595:80–84. doi: 10.1038/s41586-021-03652-7.
  28. O’Byrne J, Jerbi K. How critical is brain criticality? Trends in Neurosciences. 2022;45:820–837. doi: 10.1016/j.tins.2022.08.007.
  29. Pandarinath C, O’Shea DJ, Collins J, Jozefowicz R, Stavisky SD, Kao JC, Trautmann EM, Kaufman MT, Ryu SI, Hochberg LR, Henderson JM, Shenoy KV, Abbott LF, Sussillo D. Inferring single-trial neural population dynamics using sequential auto-encoders. Nature Methods. 2018;15:805–815. doi: 10.1038/s41592-018-0109-9.
  30. Perkovic O, Dahmen K, Sethna J. Avalanches, Barkhausen noise, and plain old criticality. Physical Review Letters. 1995;75:4528–4531. doi: 10.1103/PhysRevLett.75.4528.
  31. Petermann T, Thiagarajan TC, Lebedev MA, Nicolelis MAL, Chialvo DR, Plenz D. Spontaneous cortical activity in awake monkeys composed of neuronal avalanches. PNAS. 2009;106:15921–15926. doi: 10.1073/pnas.0904089106.
  32. Plenz D, Ribeiro TL, Miller SR, Kells PA, Vakili A, Capek EL. Self-organized criticality in the brain. Frontiers in Physics. 2021;9:639389. doi: 10.3389/fphy.2021.639389.
  33. Poil S-S, van Ooyen A, Linkenkaer-Hansen K. Avalanche dynamics of human brain oscillations: relation to critical branching processes and temporal correlations. Human Brain Mapping. 2008;29:770–777. doi: 10.1002/hbm.20590.
  34. Ponce-Alvarez A, Jouary A, Privat M, Deco G, Sumbre G. Whole-brain neuronal activity displays crackling noise dynamics. Neuron. 2018;100:1446–1459. doi: 10.1016/j.neuron.2018.10.045.
  35. Priesemann V, Shriki O. Can a time varying external drive give rise to apparent criticality in neural systems? PLOS Computational Biology. 2018;14:e1006081. doi: 10.1371/journal.pcbi.1006081.
  36. Schwab DJ, Nemenman I, Mehta P. Zipf’s law and criticality in multivariate data without fine-tuning. Physical Review Letters. 2014;113:068102. doi: 10.1103/PhysRevLett.113.068102.
  37. Sederberg A, Nemenman I. Randomly connected networks generate emergent selectivity and predict decoding properties of large populations of neurons. PLOS Computational Biology. 2020;16:e1007875. doi: 10.1371/journal.pcbi.1007875.
  38. Sederberg A. Avalanche. Software Heritage. 2024. swh:1:rev:c2e91a5341d1aa5978650cb2c22227c7e52997dc. https://archive.softwareheritage.org/swh:1:dir:de294ef532e0e07507488df691f10a229fd29665;origin=https://github.com/ajsederberg/avalanche;visit=swh:1:snp:83b5ae292da441d80eb6e99588c5dd3ac359a2fd;anchor=swh:1:rev:c2e91a5341d1aa5978650cb2c22227c7e52997dc
  39. Shew WL, Clawson WP, Pobst J, Karimipanah Y, Wright NC, Wessel R. Adaptation to sensory input tunes visual cortex to criticality. Nature Physics. 2015;11:659–663. doi: 10.1038/nphys3370.
  40. Stringer C, Pachitariu M, Steinmetz N, Reddy CB, Carandini M, Harris KD. Spontaneous behaviors drive multidimensional, brainwide activity. Science. 2019;364:255. doi: 10.1126/science.aav7893.
  41. Touboul J, Destexhe A. Power-law statistics and universal scaling in the absence of criticality. Physical Review E. 2017;95:012413. doi: 10.1103/PhysRevE.95.012413.

eLife assessment

Peter Latham 1

This paper provides a simple example of a neural-like system that displays criticality, but not for any deep reason; it's just because a population of neurons is driven (independently!) by a slowly varying latent variable, something that is common in the brain. Moreover, criticality does not imply optimal information transmission (one of its proposed functions). The work is likely to have an important impact on the study of criticality in neural systems and is convincingly supported by the experiments presented.

Joint Public Review

Anonymous

This paper shows that networks of binary neurons can exhibit power law behavior (including "crackling", which refers to a particular relationship among the power law exponents) without fine tuning. If, as is standard, we equate power law behavior to criticality, then criticality can arise in networks of neurons without fine tuning. The network model used to show this was extremely simple: a population of completely uncoupled neurons was driven by a small number of slowly varying "hidden" variables (either 1 or 5). This caused the firing rate of every neuron to change slowly over time, in a correlated fashion. Criticality was observed over a large range of couplings, time constants, and average firing rates.

This paper is extremely important in light of the hypothesis that criticality in the brain is both special, in the sense that it requires fine tuning, and that it leads to optimal information processing. As mentioned above, this paper shows that fine tuning is not required. It also shows that criticality does not imply optimal information transmission. This does not, of course, rule out the above "critical brain" hypothesis. But it does show that simply observing power law behavior is not enough to draw conclusions about either fine tuning or function.

These authors are not the first to show that slowly varying firing rates can give rise to power law behavior (see, for example, Touboul and Destexhe, 2017; Priesemann and Shriki, 2018). However, to our knowledge they are the first to show crackling, and to compute information transmission in, and out of, the critical state.

References:

Touboul and Destexhe, 2017: Touboul J, Destexhe A. Power-law statistics and universal scaling in the absence of criticality. Phys Rev E. 2017;95:012413.

Priesemann and Shriki, 2018: Priesemann V, Shriki O. Can a time varying external drive give rise to apparent criticality in neural systems? PLOS Computational Biology. 2018;14:e1006081.


Author Response:

Mia C Morrell 1, Ilya Nemenman 2, Audrey Sederberg 3

The following is the authors’ response to the original reviews.

Joint Public Review:

[…] While this does not rule out criticality in the brain, it decidedly weakens the evidence for it, which was based on the following logic: critical systems give rise to power law behavior; power law behavior is observed in cortical networks; therefore, cortical networks operate near a critical point. Given, as shown in this paper, that power laws can arise from noncritical processes, the logic breaks. Moreover, the authors show that criticality does not imply optimal information transmission (one of its proposed functions). This highlights the necessity for more rigorous analyses to affirm criticality in the brain. In particular, it suggests that attention should be focused on the question "does the brain implement a dynamical latent variable model?".

These authors are not the first to show that slowly varying firing rates can give rise to power law behavior (see, for example, Touboul and Destexhe, 2017; Priesemann and Shriki, 2018). However, to our knowledge they are the first to show crackling, and to compute information transmission in the critical state.

We thank the reviewers for their thoughtful assessment of our paper.

We would push back on the assessment that our model ‘has nothing to do with criticality,’ and that we observed ‘signatures of criticality [that] emerge through fundamentally non-critical mechanisms.’ This assessment partially stems from the definition of criticality provided in the Public Comment, that ‘criticality is a very specific set of phenomena in physics in which fundamentally local interactions produce unexpected long-range behavior.’

Our disagreement is largely focused on this definition, which we do not think is standard. Taking the favorite textbook example, the Ising model: criticality is characterized by a set of power-law divergences in thermodynamic quantities (e.g., susceptibility, specific heat, magnetization) at the critical temperature, with the exponents of these power laws governed by scaling laws. It is not defined by local interactions. The all-to-all Ising model is generally viewed as showing critical behavior at a certain temperature, even though interactions there are manifestly non-local. It is possible that, by “local” in the definition, the Public Comment meant that interactions are “collective” and among microscopic degrees of freedom. However, that same all-to-all Ising model is mathematically equivalent to the mean-field model, where criticality is achieved through large fluctuations of the mean field, not through microscopic interactions.

More commonly, criticality is defined by power laws and scaling relationships that emerge at a critical value of one or more parameters of the system. That is, criticality is defined by its signatures. What is crucial in all such definitions is that this atypical, critical state requires fine-tuning. For example, in the textbook example of the Ising model, a parameter (the temperature) must be tuned to a critical value for critical behavior to appear. In the branching-process model that generates avalanche criticality, criticality requires tuning the branching ratio to m = 1. The key result of our paper is that all signatures expected for avalanche criticality (power laws, crackling, and, as shown below, estimates of the branching rate m), and hence criticality itself, appear without fine-tuning.

As we discussed in our introduction, there are a few other instances of signatures of criticality (and hence of criticality itself) emerging without fine-tuning. The first we are aware of was the demonstration of Zipf’s Law (by Schwab, et al. 2014, and Aitchison et al. 2016), a power-law relationship between rank and frequency of states, which was shown to emerge generically in systems driven by a broadly distributed latent variable. A second example, arising from applications of coarse-graining analysis to neural data (cf., Meshulam et al. 2019; also, Morales et al., 2023), was demonstrated in our earlier paper (Morrell et al. 2021). Thus, here we have a third example: the model in this paper generates signatures of criticality in the statistics of avalanches of activity, and it does so without fine-tuning (cf., Fig. 2-3).

The rate at which these ‘criticality without fine-tuning' examples are piling up may inspire revisiting the requirement of fine-tuning in the definition of criticality, and our ongoing work (Ngampruetikorn et al. 2023) suggests that criticality may be more accurately defined through large fluctuations (variance > 1/N) rather than through fine-tuning or scaling relations.

References:

  • Schwab DJ, Nemenman I, Mehta P. “Zipf’s Law and Criticality in Multivariate Data without Fine-Tuning.” Phys Rev Lett. 2014 Aug; doi: 10.1103/PhysRevLett.113.068102.

  • Aitchison L, Corradi N, Latham PE. “Zipf’s Law Arising Naturally When There Are Underlying, Unobserved Variables.” PLOS Computational biology. 2016 12; 12(12):1-32. doi:10.1371/journal.pcbi.1005110

  • Meshulam L, Gauthier JL, Brody CD, Tank DW, Bialek W. “Coarse Graining, Fixed Points, and Scaling in a Large Population of Neurons.” Phys Rev Lett. 2019 Oct; doi: 10.1103/PhysRevLett.123.178103.

  • Morales GB, di Santo S, Muñoz MA. “Quasiuniversal scaling in mouse-brain neuronal activity stems from edge-of-instability critical dynamics.” Proceedings of the National Academy of Sciences. 2023; 120(9):e2208998120.

  • Morrell MC, Sederberg AJ, Nemenman I. “Latent Dynamical Variables Produce Signatures of Spatiotemporal Criticality in Large Biological Systems.” Phys Rev Lett. 2021 Mar; doi: 10.1103/PhysRevLett.126.118302.

  • Ngampruetikorn V, Nemenman I, Schwab D. “Extrinsic vs Intrinsic Criticality in Systems with Many Components.” arXiv:2309.13898 [physics.bio-ph].

Major comments:

1. For many readers, the essential messages of the paper may not be immediately clear. For example, is the paper criticizing the criticality hypothesis of cortical networks, or does the criticism extend deeper, to the theoretical predictions of "crackling" relationships in physical systems, as they can emerge without criticality? Statements like "We show that a system coupled to one or many dynamical latent variables can generate avalanche criticality ..." could be misinterpreted as affirming criticality. More accurate language is needed; for instance, the paper could state that the model generates relationships observed in critical systems. The paper should provide a clearer conclusion and interpretation of the findings in the context of the criticality hypothesis of cortical dynamics.

Please see the response to the Public Review, above. To clarify the essential message that the dynamical latent variable model produces avalanche criticality without fine-tuning, we have made revisions to the abstract and introduction. This point was already made in the discussion (first sentence).

Key sentences changed in the abstract:

"… We find that populations coupled to multiple latent variables produce critical behavior across a broader parameter range than those coupled to a single, quasi-static latent variable, but in both cases, avalanche criticality is observed without fine-tuning of model parameters. … Our results suggest that avalanche criticality arises in neural systems in which activity is effectively modeled as a population driven by a few dynamical variables and these variables can be inferred from the population activity."

In the introduction, we changed the final sentence to read:

"These results demonstrate how criticality in neural recordings can arise from latent dynamics in neural activity, without need for fine-tuning of network parameters."

2. On lines 97-99, the authors state that "We are agnostic as to the origin of these inputs: they may be externally driven from other brain areas, or they may arise from recurrent dynamics locally". This idea is also repeated at the beginning of the Summary section. Perhaps being agnostic isn't such a good idea: it's possible that the recurrent dynamics is in a critical regime, which would just push the problem upstream. Presumably you're thinking of recurrent dynamics with slow timescales that's not critical? Or are you happy if it's in the critical regime? This should be clarified.

We have amended this sentence to clarify that any latent dynamics with large fluctuations would suffice:

”We are agnostic as to the origin of these inputs: they may be externally driven from other brain areas, or they may arise from large fluctuations in local recurrent dynamics.”

3. Even though the model in Equation 2 has been described in a previous publication and the Methods section, more details regarding the origin and justification of this model in the context of cortical networks would be helpful in the Results section. Was it chosen just for simplicity, or was there a deeper reason?

This model was chosen for its simplicity: there are no direct interactions between neurons, coupling between neurons and latent variables is random, and simulation is straightforward. More complex latent dynamics or non-random structure in the coupling matrices could have been used, but our aim was to explore this model in the simplest setting possible.

We have revised the Results (“Avalanche scaling in a dynamical latent variable model,” first paragraph) to justify the choice of the model:

"We study a model of a population of neurons that are not coupled to each other directly but are driven by a small number of dynamical latent variables -- that is, slowly changing inputs that are not themselves measured (Fig.A). We are agnostic as to the origin of these inputs: they may be externally driven from other brain areas, or they may arise from large fluctuations in local recurrent dynamics. The model was chosen for its simplicity, and because we have previously shown that this model with at least about five latent variables can produce power laws under the coarse-graining analysis Morrell2021."

We have added the following to the beginning of the Methods section expanding on the reasons for this choice:

"We study a model from Morrell 2021, originally constructed as a model of large populations of neurons in mouse hippocampus. Neurons are non-interacting, receiving inputs reflective of place-field selectivity as well as input current arising from a random projection from a small number of dynamical latent variables, representing inputs shared across the population of neurons that are not directly measured or controlled. In the current paper, we incorporate only the latent variables (no place variables), and we assume that every cell is coupled to every latent variable with some randomly drawn coupling strength."

4. The Methods section (paragraph starting on line 340) connects the time scale to actual time scales in neuronal systems, stating that "The timescales of latent variables examined range from about 3 seconds to 3000 seconds, assuming 3-ms bins". While bins of 3 ms are relevant for electrophysiological data from LFPs or high-density EEG/MEG, time scales above 10 seconds are difficult to generate through biophysically clear processes like ionic channels and synaptic transmission. The paper suggests that slow time scales of the latent variables are crucial for obtaining power law behavior resembling criticality. Yet, one way to generate such slow time scales is via critical slowing down, implying that some brain areas providing input to the network under study may operate near criticality. This pushes the problem toward explaining the criticality of those external networks. Hence, discussing potential sources for slow time scales in latent variables is crucial. One possibility you might want to consider is sources external to the organism, which could easily have time scales in the 1-24 hour range.

As the reviewers note, it is a possibility that slow timescales arise from some other brain area in which dynamics are slow due to critical dynamics, but many other plausible sources exist. These include slowly varying sensory stimuli or external sources, as suggested by the reviewers. It is also possible to generate “effective” slow dynamics from non-critical internal sources. One example, from recordings in awake mice, is the slow change in the level of arousal that occurs on the scale of many seconds to minutes. These changes arise from release of neuromodulators that have broad effects on neural populations and correlations in activity (for a focused review, see Poulet and Crochet, 2019).

We have added the following sentence to the Methods section where timescales of latent variables was discussed:

"The timescales of latent variables examined range from about $3$ seconds to $3000$ seconds, assuming $3$-ms bins. Inputs with such timescales may arise from external sources, such as sensory stimuli, or from internal sources, such as changes in physiological state."

5. It is common in neuronal avalanche analysis to calculate the branching parameter using the ratio of events in consecutive bins. Near-critical systems should display values close to 1, especially in simulations without subsampling. Including the estimated values of the branching parameter for the different cases investigated in this study could provide more comprehensive data. While the paper acknowledges that the obtained exponents in the model differ from those in a critical branching process, it would still be beneficial to offer the branching parameter of the observed avalanches for comparison.

The reviewers requested that the branching parameter be computed in our model. We point out that, for the quasi-stationary latent variables (as in Fig. 3), a branching parameter of 1 is expected because the summed activity at time t+k is, on average, equal to the summed activity at time t, regardless of k. Numerics are consistent with this expectation. Following the methodology for an unbiased estimate of the branching parameter from Wilting and Priesemann (2018), we checked an example set of parameters (epsilon = 8, eta = 3) for quasi-stationary latent fields. We found that the naïve (biased) estimate of the branching parameter was 0.94, and that the unbiased estimate was exp(−1.4×10⁻⁸) ≈ 0.999999986.

For faster time scales, it is no longer true that summed activity is constant over time, as the temporal correlations in activity decay exponentially. Using the five-field simulation from Figure 2, we calculated the branching parameter for several values of tau. The biased estimates of m are 0.76 (τ=50), 0.79 (τ=500), and 0.79 (τ=5000). The corrected estimates are 0.98 (τ=50), 0.998 (τ=500), and 0.9998 (τ=5000).
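For readers who wish to reproduce such estimates, here is a minimal Python sketch of both estimators applied to a summed-activity time series (an illustrative implementation in the spirit of Wilting and Priesemann, 2018; the details of our analysis may differ):

```python
import numpy as np

def branching_estimates(A, k_max=40):
    """Estimate the branching parameter m from summed activity A(t).

    Naive estimate: slope of the regression of A(t+1) on A(t), which is
    biased under slow external drive. Multistep regression (MR): fit the
    lag-k slopes r_k = b * m**k, so log r_k is linear in k with slope log m.
    """
    A = np.asarray(A, dtype=float)
    var = np.var(A, ddof=1)
    r = np.array([np.cov(A[:-k], A[k:])[0, 1] / var
                  for k in range(1, k_max + 1)])
    m_naive = r[0]
    k = np.arange(1, k_max + 1)
    slope, _ = np.polyfit(k, np.log(r), 1)  # assumes r_k > 0 at all fitted lags
    return m_naive, np.exp(slope)
```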

6. In the Discussion (l 269), the paper suggests potential differences between networks cultured in vitro and in vivo. While significant differences indeed exist, it's worth noting that exponents consistent with a critical branching process have also been observed in vivo (Petermann et al 2009; Hahn et al. 2010), as well as in large-scale human data.

We thank the reviewers for pointing out these studies, and we have added the missing one (Hahn et al. 2010) to our reference list. The following was added to the discussion, in the section “Explaining Experimental Exponents:”

"A subset of the in vivo recordings analyzed from anesthetized cat (Hahn et al. 2010) and macaque monkeys (Petermann et al. 2009) exhibited a size distribution exponent close to 1.5."

Along these lines, we noted two additional studies of high relevance that have been published since our initial submission (Capek et al. 2023, Lombardi et al. 2023), and we have added these references to the discussion of experimental exponents.

Minor comments:

1. The term 'latent variable' should be rigorously explained, as it is likely to be unfamiliar to some readers.

Sentences and clauses have been added to the Introduction, Results and the Methods to clarify the term:

Intro: “Numerous studies have reported relatively low-dimensional structure in the activity of large populations of neurons [refs], which can be modeled by a population of neurons that are broadly and heterogeneously coupled to multiple dynamical latent (i.e., unobserved) variables.”

Results: “We studied a population of neurons that are not coupled to each other directly but are driven by a small number of dynamical latent variables -- that is, slowly changing inputs that are not themselves measured.”

Methods: “Neurons are non-interacting, receiving inputs reflective of place-field selectivity as well as input current reflecting a random projection from a small number of dynamical latent variables, representing inputs shared across the population of neurons that are not directly measured.”

2. There's a relatively important typo in the equations: Eq. 2 and Eq. 6 differ by a minus sign in the exponent. Eqs. 3 and 4 use the plus sign, but epsilon_0 on line 198 uses the minus sign. All very confusing until we figured out what was going on. But easy to fix.

Thank you for catching this. We have made the following corrections:

1. Figures adopted the sign convention that epsilon > 0, with larger values of epsilon decreasing the activity level. Signs in Eqs. 3 and 4 have been corrected to match.

2. Equation 5 was missing a minus sign in front of the Hamiltonian. Restoring this minus sign fixed the discrepancy between 2 and 6.

3. In Eq. 7, the left hand side is zeta'/zeta', which is equal to 1. Maybe it should be zeta'/zeta? Fixed, thank you.

Additional comments:

The authors are free to ignore these; they are meant to improve the paper.

We are extremely grateful for the close reading of our paper and note the actions taken below.

1. We personally would not use the abbreviation DLV; we find abbreviations extremely hard to remember. And DLV is not used that often.

Done, thank you for the suggestion.

2. l 198: epsilon_0 = -log(2^{1/N}-1) was kind of hard to picture -- we had to do a little algebra to make sense of it. Why not write e^{-epsilon_0} = 2^{1/N}-1 \approx log(2)/N, which in turn implies that epsilon_0 ~ log(N)?

Thank you, good point. We have added a sentence now to better explain:

"...which is maximized at $\epsilon_0 = - \log (2^{1/N} - 1)$, independent of $J_i$ and $\eta$. After some algebra, we find that $\epsilon_0 \sim \log N$ for large $N$."

3. Typo on l 202: "We plot P_ava as a function of epsilon in Fig. 4B". 4B --> 4D.

Done

4. It would be easier on the reader if the tables were all in one place. It would be even nicer to put the parameters in the figure captions. Or at least N; that one is kind of important.

Table placement was a Latex issue, which we have now fixed. We also have included links between tables and relevant figures and indicated network size.

5. What's x_i in Eqs. 7 and 8?

We added a sentence of explanation. These are the individual observations of avalanche sizes or durations, depending on what is being fit.

6. The latent variables evolve according to an Ornstein-Uhlenbeck process. But we might equally expect oscillations or non-normal behavior coupling dynamical modes, and these are likely to give different behavior with respect to avalanches. It might be worth commenting on this.

7. The model assumes a normal distribution of the coupling strengths between the latent variables and the binary units. Discussing the potential effects of different types of random coupling could provide interesting insights.

Both 6 and 7 are interesting questions. At this point, we could speculate that the main results would be qualitatively unchanged, provided dynamics are sufficiently slow and that the distribution of coupling strengths is sufficiently broad (that is, there is variance in the coupling matrix across individual neurons). Further studies would be needed to make these statements more precise.

8. In Fig 1, tau_f = 1E4 whereas in Fig 2 tau_f = 5E3. Why the difference?

For Figure 1, we chose a set of parameters that gave clear scaling. In Figure 2, we saw some value in showing more than one example of scaling, hence different parameters for the examples in Fig 2 than Fig 1. Note that the Fig 1 simulations are represented in Fig. 2 G-J, as the 5-field simulation with tau_F = 1e4.
