Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2020 Dec 14;16(12):e1008480. doi: 10.1371/journal.pcbi.1008480

A functional theory of bistable perception based on dynamical circular inference

Pantelis Leptourgos 1,*, Vincent Bouttier 2,3, Renaud Jardri 2,3,4,*, Sophie Denève 2
Editor: Ulrik R Beierholm5
PMCID: PMC7769606  PMID: 33315961

Abstract

When we face ambiguous images, the brain cannot commit to a single percept; instead, it switches between mutually exclusive interpretations every few seconds, a phenomenon known as bistable perception. While neuromechanistic models, e.g., adapting neural populations with lateral inhibition, may account for the dynamics of bistability, a larger question remains unresolved: how this phenomenon informs us on generic perceptual processes in less artificial contexts. Here, we propose that bistable perception is due to our prior beliefs being reverberated in the cortical hierarchy and corrupting the sensory evidence, a phenomenon known as “circular inference”. Such circularity could occur in a hierarchical brain where sensory responses trigger activity in higher-level areas but are also modulated by feedback projections from these same areas. We show that in the face of ambiguous sensory stimuli, circular inference can change the dynamics of the perceptual system and turn what should be an integrator of inputs into a bistable attractor switching between two highly trusted interpretations. The model captures various aspects of bistability, including Levelt’s laws and the stabilizing effects of intermittent presentation of the stimulus. Since it is related to the generic perceptual inference and belief updating mechanisms, this approach can be used to predict the tendency of individuals to form aberrant beliefs from their bistable perception behavior. Overall, we suggest that feedforward/feedback information loops in hierarchical neural networks, a phenomenon that could lead to psychotic symptoms when overly strong, could also underlie perception in nonclinical populations.

Author summary

In cases of high ambiguity, our perceptual system cannot commit to a single percept and switches between different interpretations, giving rise to bistable perception. In this paper we outline a computational model of bistability based on the notion of circular inference, i.e. a form of suboptimal hierarchical inference in which priors and / or sensory inputs are reverberated and over-counted. We suggest that descending loops (i.e. reverberated priors) transform our perceptual system from a simple accumulator of sensory inputs into a bistable attractor, that switches between two highly-trusted interpretations. Using analytical methods we derive the necessary conditions for bistable perception to occur. We show that our dynamical circular inference model is able to capture many features of bistability, such as Levelt’s laws and the stabilizing effects of intermittent presentation of the stimulus. Finally we make novel predictions about the behavior of psychotic patients.

Introduction

All perceptual systems have one fundamental goal: to interpret the surrounding environment based on unreliable sensory evidence. In most cases, this task is performed very accurately, and the correct interpretation is found. Sometimes, perceptual systems fail to detect any meaningful interpretation (e.g., when sensory evidence is too degraded) or converge to the wrong interpretation (e.g., visual illusions [1,2]). Finally, a third possibility occurs (mainly in lab conditions [3]) when ambiguity is high; the system detects more than one plausible interpretations but instead of committing to one interpretation, it switches every few seconds, a phenomenon known as bistable perception [4]. Despite ongoing scientific efforts, there has been no unanimous agreement either on the causes of bistability or on its functional role.

The dominant mechanistic view on bistable perception suggests that it results from the competition between different neuronal populations, each of them encoding a different interpretation of the sensory signal [5]. The two populations suppress each other via lateral inhibition, while some form of slow negative feedback (e.g., spike frequency adaptation or synaptic depression) acts on the dominant population, weakening the interpretation that is currently perceived [611]. Additionally, injected noise renders irregular switching and in some models, it can even be the driving force of oscillatory behavior [1215]. Although these models have proven quite successful in describing different experimental observations (and linking them to the underlying neural mechanisms), they do not address functional considerations about bistable perception.

To overcome this issue, other groups suggested functional models of bistability, largely based on the idea that the brain is an inference machine and perception is equivalent to a probabilistic process (e.g., [16]; see also [17,18] for predictive coding, or [1921] for sampling). However, some crucial questions remain largely unanswered from a purely normative perspective, namely, (1) why would a system form such strong percepts based on ambiguous sensory evidence, but only in some cases, and why do the percepts persist in such a way instead of switching rapidly, and (3) how the behavior of individuals in bistable perception tasks may predict their performance in other probabilistic inference tasks.

In the present paper, we address the problem of bistable perception by proposing a functional model with a well-defined interpretation in terms of generic neural processes. Based on previous experimental findings, we suggest that bistability could be a perceptual manifestation of circular inference (CI), a form of belief propagation in which priors and likelihoods are reverberated in the cortical hierarchy and consequently corrupted by each other [22,23]. More specifically, bistable perception could be imposed by the presence of “descending loops”, where high-level beliefs are combined with sensory representations (through feedback connections), and subsequently reinforce themselves (through feedforward connections). This results in the perceptual system “seeing what it expects” instead of the truly ambiguous image [24]. Of note, previous work from our group linked CI with pathological brain function, as in the case of schizophrenia [25] but also to a smaller extent with physiological functioning [26].

In the following sections, we derive the dynamics of inference in the presence of ambiguous sensory stimuli and inference loops. The consequence of CI is to replace what is normally a slow temporal integration of unreliable sensory evidence with a bistable attractor switching between two highly trusted interpretations. We demonstrate that such a model can reproduce well-known qualitative aspects of bistability, including the four Levelt’s laws and the stabilizing effect of intermittent presentation, while it also makes testable quantitative predictions (e.g., about the behavior of patients suffering from schizophrenia). Since circularity arises from an imbalance between neural excitation and inhibition in recurrent brain circuits [24,27], our approach bridges normative interpretations of bistable perception with plausible underlying neural mechanisms.

Methods

Here, we introduce a CI model of bistable perception and highlight its underlying functional assumptions. For reasons of clarity, we refer to the example of the Necker cube, an ambiguous 2D figure which is compatible with 2 different 3D cubes and generates bistability: a cube that is “seen from above” (later called the SFA interpretation) and a cube that is “seen from below” (later called the SFB interpretation) (Fig 1A). Note that the model can be generalized to any other stimuli inducing perceptual rivalry.

Fig 1. Normative model for how 3D objects result in particular sensory inputs, and putative neural implementation of the corresponding perceptual inference.

Fig 1

(A.) The internal model is a simple Bayesian generative model, where 3D objects predict the 2D image, and the 2D image predicts low-level sensory inputs. The brain interprets the depth cues (basic features) as indicative of real depth. Consequently, it first reconstructs the 2D figure and from that, it infers the 3D object. Note that in reality there is one single 2D stimulus (the Necker cube drawing) containing contradictory depth cues. (B.) Close-up on the assumed “basic feature” distributions (likelihood) compared to the real input distributions. The brain interprets the depth cues as meaningful, predicting separate input distributions for the two cubes (SFA, SFB; two objects cannot occupy the same space), which corresponds to two nonoverlapping likelihood distributions in the internal model (dotted red and blue distributions). In the totally ambiguous case (cube with no extra cues), the real input is sampled from a distribution with mean 0 (black). Visual cues shift this input distribution toward mostly positive or negative values. Crucially, there is a discrepancy between the real input and the input assumed by the internal model. This, together with the loops, predicts the suboptimal inference at the heart of bistable perception. (C.) A simplified neural implementation of hierarchical perceptual inference. Reciprocal connections can combine bottom-up sensory evidence with top-down priors at all levels of the hierarchical representation. Unfortunately, this also creates redundant information loops, ascending (magenta arrow) and descending (blue arrow). (D.) The brain can cancel these loops by using inhibitory interneurons and maintaining a tight E/I balance. If this balance is impaired, however, there will be some residual loops, parameterized by aP (descending loops, amplifying prior beliefs) and aS (ascending loops, amplifying the sensory evidence). L is the log-ratio of the belief. (E.) From the Bayesian model (A.) we derived an attractor model that performs inference in the presence of loops. The model accumulates noisy evidence while descending loops add positive feedback and ascending loops increase the sensory gain.

Generative model

Our model postulates that bistable perception is triggered by the same mechanisms and computations that underlie normal perception. There is accumulating evidence that the brain uses its cortical hierarchy to represent the causal structure of the world [28,29]. Brain circuits invert this “generative model” to find the most likely interpretation of the noisy sensory information. In other words, perception can be viewed as an instance of hierarchical Bayesian inference [28,30] (Fig 1A). A particularly striking example of this inferential process is 3D vision (such as the perception of the Necker cube). The brain has no direct access to the 3D structure of the perceived object. In contrast, it receives low-level 2D sensory information from the retina. In such a context, the task of the perceptual system is to extract valuable depth cues and combine them with high-level prior knowledge, to make “educated guesses” about the 3D object. Evidence suggests that this is a gradual process [31], with different brain regions representing features of different complexity; the lower levels of the visual cortex represent the basic features of the stimulus such as contours and orientations while higher levels are responsible for more abstract information such as the 3D organization of the stimulus [32,33].

In the case of the Necker cube, a veridical percept would correspond to a 2D drawing of crossing lines. The presence of illusory depth cues forces the brain to consider a 3D structure. Nonetheless, since the cues are ambiguous and contradictory, the 2D projection of the hypothetical 3D stimulus is compatible with different objects, including the SFA and SFB interpretations mentioned previously. The two interpretations are considered mutually exclusive, an assumption that corresponds with the epistemological truth that two different 3D objects cannot occupy the same space [17]. It is interesting to note that in a more general sense, the Necker cube is compatible with an infinity of 3D objects, among which the brain represents only the two symmetrical cubes. This reduction of possible causes could be the result of hyperpriors used by the brain and is not considered in the current model.

We formalize this inference problem with a simple graphical model, a chain with 2 latent variables and one sensory observation (Fig 1A). This “generative model” summarizes assumptions made by our sensory system on the underlying causes of natural inputs, which may significantly differ from the artificial data presented in a laboratory setting.

The sensory observation (S) represents the basic features extracted by visual receptors (edges, contrast, etc.). For simplicity, S is assumed to be a scalar drawn from two probability distributions, one for each configuration of the cube, as illustrated in Fig 1A and 1B (red and blue dotted distributions; P(S|X2D = 1) ≠ P(S|X2D = 0)). These distributions have different means ±μint and the same variance σint2. The difference in these two distributions considers the fact that natural 3D objects have true depth cues (disparities, shadows, occlusion, etc.), predicting different likelihoods for the two interpretations. Note that completely ambiguous stimuli (i.e., falling in the perfect overlap between the two distributions) are, in fact, rarely encountered in nature.

The next variable X2D is binary and represents an intermediate level of complexity in the perceptual hierarchy (e.g., the 2D surfaces and their orientation). Finally, the binary variable X3D represents the final 3D cube configuration, with values 0 and 1 corresponding to SFB and SFA respectively. wS corresponds to how reliably X3D predicts X2D.

wS=P(X3D=1|X2D=1)=P(X3D=0|X2D=0) (1)

We also assume that the environment has some volatility, e.g., objects are not permanently present, but occasionally appear or disappear. Thus, X3D can randomly switch at any time, as represented by two rates of change, from 0 to 1 (ron), and from 1 to 0 (roff). For the sake of simplicity in the notation, we will replace X3D at time t by Xt, representing the 3D configuration of the cube (SFA or SFB) at time t.

rondt=P(Xt=1|Xt-dt=0) (2)
roffdt=P(Xt=0|Xt-dt=1) (3)

Note that if we use ronroff, one of the two interpretations becomes more probable that the other. This is very useful in the case of the Necker cube, where people usually prefer the SFA interpretation, according to a general prior to view things from above (ron > roff) [34].

Now that we have described the generative model, i.e., the internal model used by the brain to perceive objects in the real world, we have to consider the artificial stimulus provided during a bistable perception experiment. The Necker cube is very unnatural in the sense that it contains no real depth cue. Thus, the sensory information it provides is assumed to be sampled (independently at each time step) from a Gaussian distribution with mean μnoise (μnoise = 0 (Gaussian process without drift) if the cube is completely unbiased and μnoise ≠ 0 (Gaussian process with drift), if there are visual cues supporting one of the two configurations, e.g., different contrast for the edges) and variance σnoise2 (Fig 1B; black and gray distributions).

The ultimate goal of the perceptual system is to infer X3D using the noisy measurements and any available prior knowledge (for more information about the generative model, see S1 Text).

Temporal dynamics of inference

We show in S1 Text that exact inference implements a leaky integration of the noisy sensory input (Fig 1E), i.e.

dLdt=-Φ(L)+wintS (4)

where wint=2μintσint2(2wS-1) represents the overall reliability of the sensory input (as assumed by the generative model). L is the log-odds (L=log(P(Xt=1|S0t)P(Xt=0|S0t))). The nonlinear leak term Φ(L) depends on the transition rates, i.e.,

-Φ(L)=(rone-L-roffeL)+(ron-roff) (5)

As a result of this leak, in the absence of sensory evidence, the log-odds go back to the constant prior value log(ronroff). This relaxation is faster for larger volatility in the environment (higher transition rates). In the presence of reliable and unambiguous sensory input (e.g., when adding visual cues, i.e., μnoise ≠ 0), L integrates out the noise and eventually reaches high (positive) or low (negative) values, corresponding to high levels of confidence in favor of the SFA or SFB configurations. However, in the presence of a completely ambiguous sensory input, L integrates unbiased noise (μnoise ≠ 0) and constantly hovers around the prior value, rarely reaching a sustained high level of confidence in either of the two configurations.

Dynamics notably change in the presence of CI. CI is defined in the context of hierarchical probabilistic inference but can also be understood intuitively as a consequence of feedforward/feedback loops in brain circuits (Fig 1C). Bottom-up sensory evidence (from S to X2D) and top-down prior information (from X3D to X2D) have to be combined to compute the probability of intermediate representations (X2D), a task presumably performed by feedforward (bottom-up) and feedback (top-down) connections converging on the same intermediate “2D” sensory area [35]. This hypothesis is supported by the experimentally observed top-down modulation of sensory neuron responses by higher-level interpretation of the image [3638]. However, feedforward connections between the “2D” and “3D” areas also communicate this modulated sensory response back to the “3D” areas. While this modulation does not bring any “new” information, it could nevertheless be mistaken for additional sensory evidence supporting the current interpretation. In fact, without dedicated control mechanisms, feedforward/feedback loops would systematically result in CI in the underlying perceptual process. We found previously that while this can, in theory, be avoided by maintaining a tight excitatory/inhibitory balance in brain circuits (Fig 1D), human subjects show some level of circularity in their probabilistic reasoning, which is aggravated in individuals suffering from schizophrenia [25,26].

Here, we quantify the strength of CI by two variables representing the level of “ascending” (also called “climbing” [22]; aS) and “descending” loops (aP). Descending loops represent to what extent top-down modulation of sensory responses is misinterpreted by upstream (higher-level) neurons as new sensory information, forcing the perceptual system to “see what it expects”. Vice-versa, ascending loops represent to what extent intermediate sensory responses are misinterpreted by downstream (lower-level) neurons as prior knowledge, even when they do not provide them with any new information (Fig 1C). This forces the perceptual system to “expect what it sees” and over-interpret weak sensory inputs.

If CI is introduced in the model, the dynamics of perceptual integration changes as follows (Fig 1E):

dLdt=-Φ(L)+aL+wint*S (6)

Note that the new auto-amplification term aL = 2aPwSL (due to the corruption of the sensory evidence by the prior belief) is proportional to the strength of descending loops aP and the assumed reliability of the sensory information, wS. If a is large enough, this amplification term may exceed the leak term, at least in a certain range of confidence near L = 0. This leads, as we will see, to bistable dynamics. Importantly, this term not only depends on the strength of the descending loops but also on the reliability of the sensory input (assumed by the generative model). Bistable dynamics occur only for large wS, which we may interpret as a typically highly reliable input (such as 2D drawings of 3D objects) as opposed to typically unreliable inputs (e.g., low contrast or degraded stimuli). This may explain in part why bistable perception is a relatively rare phenomenon in natural (nonlaboratory) settings.

In contrast, ascending loops amplify the weight of the sensory evidence according to their strength, i.e., wint*=wint(1+2wSaS). In particular, ascending loops affect the dynamics only if a sensory stimulus is present and tend to destabilize the percept by increasing the gain of the noise injected into the dynamical system.

Note that without loss of generality, this model of perceptual dynamics can be reduced to 4 free parameters: the two transition rates ron and roff, the auto-amplification a and the overall gain of the sensory inputs wint*.

Perceptual decision

Finally, we require a model of perceptual decision, which can predict the current percept from the confidence. For simplicity, we assume a maximum-a-posteriori (MAP) decision criterion, which means that decisions are made according to the sign of L (SFA if L > 0; SFB if L < 0). The MAP decision criterion results in optimal behavior when the goal of the system is to maximize accuracy, as in the case of perception.

Simulations

For all the simulations, we used the Euler–Maruyama algorithm. The time step was fixed at dt = 0.01s. Both the standard deviation of the noise σnoise (real model) and of the likelihood function σint (internal model) were equal to 1. The mean of the likelihood function ±μint was also fixed at ±1. μnoise = 0 for the completely ambiguous case and μnoise ≠ 0 when sensory evidence was biased. The initial belief in all simulations was L0 = 0. A summary of the parameters can be found in Table 1.

Table 1. Parameters of the model.

Variable Description Link to other variables
μnoise Drift of sensory evidence -
σnoise Standard deviation of sensory evidence -
μint Mean of likelihood function -
σint Standard deviation of likelihood function -
wS Feed-forward weight -
aP Descending loops -
aS Ascending Loops -
ron Transition rate (0➔1) -
roff Transition rate (1➔0) -
wint Sensory gain without ascending loops wint=2μintσint2(2wS-1)
wint* Sensory gain with ascending loops wint*=wint(1+2wSaS)
b Bias b = ronroff

Results

As a first step, we highlight the importance of the descending loops in the generation of bistable perception from a phenomenological and mechanistic point of view. Subsequently, we illustrate how CI replicates some of the most seminal features of bistable perception, such as Levelt’s laws but also some counterintuitive findings, including stabilization of perception after a brief disappearance of the stimulus. Finally, we present further consequences of the model, notable predictions about the performance of schizophrenia patients exposed to bistable stimuli.

Strong descending loops induce bistable perception

An example of model dynamics in response to a continuous presentation of a Necker cube, in the presence of strong descending loops is shown in Fig 2A and 2C.

Fig 2. Examples of model dynamics.

Fig 2

(A.) Model with descending loops (aP = 1.5), unbiased (ron = roff = 0.5), with sensory gain wint = 0.8. The model received an ongoing, ambiguous, white noise input with standard deviation σnoise = 1. Blue line: L (log-ratio of the belief / confidence), red line = percept, dashed line = decision threshold). (B.) Model with no descending loops (same parameters as in (A.) except aP = 0). (C.) The same model as (A.), but with a preference for the “SFA” configuration (transition rates changed to ron = 0.52, roff = 0.48). (D.) The same model as (B.), with ron = 0.6, roff = 0.4. (E.) Phase-duration histogram (No loops; unbiased). The dynamical circular inference model (with/without loops; with/without bias) predicts exponential distribution of phase-durations. Gamma-like distributions, often observed in bistable perception experiments, can be obtained by adding filtered noise, adaptation-like mechanisms or more complex decision criteria to the model (see Discussion).

With descending loops, the percept switches between two highly trusted interpretations (for example, L = 4 corresponds to probability 0.98 in favor of SFA; see also S3 Text). Periods with low confidence are short and limited to sudden perceptual switches, induced by the noisy input. These switches occur at apparently random times, resulting in an exponential decay observed in the distribution of dominance durations (Fig 2E). When there is a bias (e.g., ron > roff), one of the two configurations (e.g., SFA) becomes more likely and is perceived more often (Fig 2C). However, the shape of the dominance durations remains similar for the two configurations, even if the durations of the preferred configurations are longer overall. It’s worth-highlighting that the stronger interpretation is also perceived with higher confidence, a prediction that could be tested in future studies.

For comparison, we also show the dynamics of the model without descending loops (aP = 0) (Fig 2B and 2D). The resulting system is equivalent to a hidden Markov model (HMM), with transition rates ron and roff [39], and has only one stable state corresponding to the prior. As a result, the confidence behaves similarly to a leaky random walk. Since the leak maintains L close to zero, the system rarely attains high levels of trust in either configuration, which may preclude the emergence of strong and stable percepts in the absence of descending loops (instead, low confidence might give rise to mixed percepts [40]).

Dependency of bistability on the parameters

Due to its simplicity, the model dynamics can be analyzed more formally. This has the advantage of generalizing the model and providing a general view on the dependency of bistable perception on prior assumptions about the external world and on the strength of ascending and descending loops.

This dynamics can be represented by an energy landscape plotting the “potential” (the temporal integral of the dynamic Eqs (1)/(6)) as a function of the current state L. The relationship between the energy landscape and stability of a dynamical system is shown in Fig 3A and 3B, while the actual energy landscape of the model for different parameter settings is shown on Fig 3C and 3D. In the absence of inputs, L always decreases toward the lower potentials in these energy landscapes, until it reaches a stable fixed point corresponding to a local minimum in the potential, also called an “energy well” (Fig 3A). The presence of a noisy input introduces random perturbations which might allow L to temporarily climb the barrier between two wells, thus switching to a different stable state (Fig 3B).

Fig 3. Energy landscapes of the model with and without descending loops.

Fig 3

(A.) Schema illustrating the relationship between wells in the energy landscape (potential = integral of the dynamic equation, in blue) and stable states. Gray and black dots represent the initial and final state from two different initial states. In the absence of external input, dots can only decrease. (B.) Schema illustrating how noise can force the state to climb an energy barrier (a hill in the energy landscape) and switch to a different stable state. (C.) Energy landscape of the model with no descending loops (dashed, aP = 0), and two increasing levels of descending loops (red: aP = 1, blue: aP = 1.3). Descending loops generate a bistable attractor, whose stable fixed points correspond to (strong beliefs about) the two interpretations (blue). In contrast, a system with no loops has only one attractor, the prior, (equal to 0 in this unbiased scenario). (D.) Energy landscape for different biases, no bias (red: ron = roff = 0.5), weak bias (magenta:, ron = 0.55, roff = 0.45) and strong bias (light green, ron = 0.6, roff = 0.4). Note that for stronger biases, the nonpreferred configuration becomes unstable.

Without the descending loops, the model is equivalent to an HMM. Importantly, an HMM acts as a leaky integrator with only one stable fixed point (the prior) determined by the 2 rates (volatility):

LSt,a=0=log(ronroff) (7)

This can be visualized by observing that the corresponding energy landscape contains a single energy well (Fig 3C, dashed line). As long as the descending loops are weak compared to the leak, the prior remains the only fixed point of the system and is stable. For example, with ron = roff = r, this remains true up to the value:

aPPf=rwS (8)

At this value, the system undergoes a pitchfork bifurcation (Fig 4A; see also S2 Text). The preexisting fixed point becomes unstable and 2 additional attractors are generated, given by the 2 symmetrical, nonzero solutions of the equation −Φ(L) + aL = 0 (Figs 3C and 4A). The stronger the descending loops (or the weaker the leak), the further apart the 2 symmetrical attractors are, resulting in more highly trusted configurations, which are also more stable since the energy barrier is harder to cross.

Fig 4. Phase diagrams of the model dynamics.

Fig 4

(A.) Stable fixed point (plain), unstable fixed point (dashed) and bifurcation point (red dot) as a function of aP for an unbiased system (ron = roff = r). (B.) Stable fixed point, unstable fixed point and bifurcation points as a function of r. (C.) The same as (A.) for a biased system (ron> roff). (D.) The same as (B.) but as a function of ron, roff being fixed at 0.5. Note that bistability can exist in a narrow range around symmetry. (A.,B.) Pitchfork bifurcation for symmetrical systems. (C.,D.) Saddle-node bifurcation for asymmetrical systems.

Adding bias to the system (ronroff; e.g., SFA bias in Necker cube) creates an asymmetry in the energy landscape (Fig 3D). A saddle-node (SN) bifurcation occurs when the loops become strong enough to overcome the leak (Fig 4C; for a mathematical description of the SN bifurcation, see S2 Text). However, bistability can only exist in a narrow range of biases (i.e., the difference between the two transition rates ron and roff), more particularly in the range constrained by the 2 SN bifurcation points (one for ron > roff and one for ron < roff; Fig 4D). These two bifurcations represent points at which the bias becomes strong enough to ensure that only one of the two configurations (the most likely one a-priori) can be stably perceived.

Our analysis suggests that descending loops can constitute a crucial part of the machinery of a system exhibiting bistable perception. When they are strong enough to overcome the effect of the leak, they generate a bistable attractor, implementing a memory-like mechanism that pushes the belief toward more extreme values based on the previous observations. This helps the system make decisions and act upon them in the absence of fully convincing evidence.

Until now, our analysis focused mainly on the effects of the descending loops. However, ascending loops play an important role as well. According to (6), ascending loops increase the gain of the sensory evidence (noise) (Fig 3B), which consequently acts by destabilizing perception and reducing the effect of the bias on predominance.

In conclusion, this analysis demonstrates that robust bistable perception requires a very specific set of conditions. It can only exist if there is a combination of (1.) reliable sensory inputs (large wS), (2.) stimuli that are assumed to be stable (i.e., small transition rates ron and roff, that are dominated by descending loops), (3.) at least two probable interpretations, even if one can dominate the other (i.e., ron and roff relatively close to each other, leading to a weak bias). Given these stringent conditions, it is not surprising that bistability is rather uncommon in everyday life and occurs mainly for artificial stimuli chosen to obey these requirements.

In the next sections, we explore the predictions of the model regarding well-known psychophysical features of bistable perception.

Levelt’s laws

An important qualitative aspect of bistable perception is Levelt’s laws. These laws constitute a set of 4 psychophysical propositions relating the strength of the bistable stimulus to the phenomenology of binocular rivalry [41], and more generally of bistable perception [42]. Despite some recent modifications in their formulation (to account for new experimental data [43,44]), Levelt’s laws remain fundamental to our understanding of the machinery of bistability and an important crash-test for any potential model. We will present one by one the four revised propositions (as described in [42] and not in Levelt’s original monograph [41]) and will critically discuss them through the prism of the dynamical circular inference (dCI) model.

1st Levelt’s law

The first proposition links the stimulus strength with the predominance of each interpretation. It postulates that increasing the stimulus strength of one perceptual interpretation increases the predominance of this perceptual interpretation [42]. For example, adding a cue to the Necker cube helps the relevant interpretation gain more perceptual dominance compared to its rival. Although in modern terminology, proposition 1 sounds more like a tautology, it is still useful for detecting stimulus features (or parameters of the model) that affect the strength of an interpretation [44]. Within our model, we can parameterize the strength of the sensory evidence by adjusting the drift μnoise of the Gaussian noise, which biases the sampling of evidence (Fig 1B). As expected, the more positive the drift the closer the relative predominance goes to 1 (the opposite for negative drift) (Fig 5A), in agreement with the first proposition.

Fig 5. Levelt’s laws.

Fig 5

The circular inference model qualitatively reproduces the 4 Levelt’s propositions (here: wS = 0.9; aP = 1; ron = roff = 0.5). (A.) 1st proposition—increasing the stimulus strength of one perceptual interpretation increases the predominance of this perceptual interpretation. (B.) 2nd proposition—Manipulating the stimulus strength of one perceptual interpretation of a bistable stimulus does not equally influence the average dominance duration of both interpretations, but mainly affects the persistence of the stronger interpretation. (C.) 3rd proposition—Increasing the difference in the stimulus strength between the 2 perceptual interpretations should result in a decrease in the perceptual alternation rate (i.e., maximum number of switches at equi-dominance). (D.) 4th proposition—When we increase the strength of both interpretations, the number of switches increases.

2nd Levelt’s law

The second proposition is less intuitive than the first and posits that manipulating the stimulus strength of one perceptual interpretation of a bistable stimulus does not influence equally the average dominance duration of both interpretations, but mainly affects the persistence of the stronger interpretation [42,45]. For example, increasing the strength of a visual cue in the Necker cube example mainly affects the mean dominance duration of the corresponding interpretation. The dCI model is fully compatible with Levelt’s second law, as presented in Fig 5B; making the drift more positive (bias for SFA) predominantly affects the mean phase duration of the SFA interpretation (the opposite happens for a negative drift and the SFB interpretation). Indeed, the drift acts as an additional bias term in (4)/(6), which deepens the well of the strong interpretation, while making the other well shallower. This dual effect of the drift (not obvious in other models in which different variables represent the different interpretations, see also [12]), along with the model’s inherent nonlinearity can explain Levelt’s second law [45].

3rd Levelt’s law

Levelt’s third proposition is closely related to the second proposition [44] and suggests that increasing the difference in the stimulus strength between the 2 perceptual interpretations should result in a decrease in the perceptual alternation rate [42]. In the Necker cube example, this proposition implies that adding a visual cue results in fewer switches. Importantly, the dCI model behaves exactly as the third proposition dictates. As shown in Fig 5C, the alternation rate achieves its maximum value for drift = 0 (completely ambiguous stimulus) and decreases symmetrically as the drift becomes more positive or negative, a direct consequence of the third law [45].

4th Levelt’s law

Finally, the fourth proposition goes one step further and discusses what happens to the alternation rate if we equally increase the strength of both interpretations. In this case, the number of switches increases, resulting in a higher alternation rate. Contrary to the 3 first propositions, the fourth proposition illustrates the effect of a simultaneous and equal manipulation of both interpretations (global stimulus strength). In the model, this should result in an increase in the mean of the absolute value of the sensory evidence, while it should have no effect on the mean of the sensory evidence per se. In other words, this global manipulation can be captured by a change in the variance in the noise distribution σnoise. A higher variance results in more exploration of the energy landscape due to the noise. Consequently, as illustrated in Fig 5D, increasing σnoise results in more switches, in agreement with Levelt’s fourth law.

In conclusion, the model obeys Levelt’s laws regardless of the chosen parameters as long as

  1. The sensory gain is high enough to induce transitions.

  2. The bias is not strong enough to render one of the two configurations unstable.

Note that the respect of Levelt’s laws is not sufficient to prove the presence of descending loops since the model without loops can also reproduce them (as long as the decision threshold is set appropriately). However, definite support for the existence of descending loops is provided by the stabilization of the percept by intermittent presentations of the stimulus, as described in the next section.

Intermittent presentation

When an ambiguous stimulus is presented continuously, switches between competing interpretations occur randomly every few seconds, with consecutive phase durations being largely independent [46]. Based on this observation, many researchers concluded that bistable perception is principally a memoryless process ([47], see also [48,49]). Nevertheless, this conclusion contravenes another observation: the fact that people tend to perceive the same interpretation repeatedly when ambiguous stimuli are presented intermittently for a wide range of OFF-durations (intervals during which stimulus is absent) [50,51]. This second observation forced researchers to assume the presence of some perceptual memory [52], which manifests when the stimulus disappears from the screen. A variety of mechanisms implementing this memory have been proposed, including low-level mechanisms such as adaptation (combined with subthreshold effects; [9]), or high-level memory mechanisms located outside the extrastriate cortex [51,53,54]. The dCI model offers a different explanation for this stabilization effect, based on the descending loops.

In agreement with previously published experimental observations, our model predicts no significant correlation in the duration of successive phases [46,47], as expected from a model that does not contain adaptation (or adaptation-like) mechanisms [49]. However, the model should be able to predict a stabilization effect, when the stimulus disappears for brief durations. To quantify stabilization, many studies referred to the alternation rate, which is the number of switches in a time interval [50,51,55]. However, this measure is not ideal as it can be affected by various confounding factors including different presentation durations and switches occurring during ON-durations (interval during which stimulus is present). Moreover, the alternation rate considers both interpretations together and obscures any possible asymmetries. Instead, we used the survival probability (SP) of each interpretation, which is the probability that the dominant percept at the end of an ON-duration will be dominant again when the stimulus reappears after the OFF-duration. Fig 6A illustrates our interpretation of the phenomenon (5 ON-OFF cycles, aP > 0).

Fig 6. Continuous vs intermittent presentation.

Fig 6

(A.) An interpretation of the phenomenon, based on the circular inference framework. When the stimulus disappears, the belief converges to an attractor. The behavior of the system depends on the number and the value of the fixed points (here: wS = 1; aP = 1.2; ron = roff = 1 (symmetrical case) or ron = 1; roff = 0.9 (asymmetrical case)). (B.,C.,F.,G.) No loops—If there are no (descending) loops, when the stimulus disappears the beliefs converge to the prior ((B.) No implicit preference; (F.) Implicit preference). Consequently, for longer OFF-durations, the 2 survival probabilities (blue and red solid lines) either converge to 0.5 ((C.) No implicit preference) or to symmetrical values ((G.) Implicit preference). In both cases, the stimulus is not stabilized for longer intervals. Interestingly, it is more stable compared to a continuous presentation (dashed lines). (D.,E.,H.,I.) Descending loops–Descending loops generate a bistable attractor ((D.) No implicit preference (H.) Implicit preference). Crucially, when they are strong enough, they cause stabilization for longer intervals ((E.) No implicit preference (I.) Implicit preference). Furthermore, in the biased case, survival probabilities converge to asymmetrical values.

Without descending loops (aP = 0), and in the absence of input (i.e., when the stimulus is “OFF”), the belief progressively goes back to its prior value (log(ronroff)) due to the leak (Fig 6B and 6F). For the unbiased system, the model predicts that both survival probabilities (SP) will decrease toward 0.5 (chance) with a time constant that depends on the transition rates (Fig 6C). An SP in a biased system would reach symmetrical points above and below chance, with the values depending on the strength of the bias (Fig 6G). The longer the OFF-duration, the less temporal dependency there would be between subsequent percepts. Thus, without descending loops, there could not be any stabilization of the percept by an intermittent presentation for long “OFF” durations. For comparison, SP is shown for the continuous case (stimulation is not interrupted; in which case, we measure the survival probability in constant intervals; dashed lines).

The descending loops (aP > 0) change the behavior of the system. The phase portrait of this system is presented in Fig 6D and 6H. Instead of one single point where all the trajectories meet, now we observe 2 clearly distinct basins of attraction, symmetrical for an unbiased system and asymmetrical for a biased system. As a result, the temporal stability of the percept is drastically increased, especially for long “OFF” durations (Fig 6E). In biased systems, the level of stabilization depends on whether we consider the dominant or nondominant percept. The probability of persistence of the dominant percept (if biased) always converges to a higher probability than the nondominant percept. In the example shown in Fig 6I, only the dominant stimulus is stabilized by intermittent presentation, while the nondominant percept SP converges to a chance level. In other cases, both the dominant and nondominant percept can be stabilized. The stabilization of both percepts increases with the level of descending loops and decreases with sensory gain, as shown in the next section.

An important comment needs to be made. The current version of the model does not predict a destabilization occurring for small OFF-durations, usually for values below 500 ms, as reported in some studies [55]. Other models have attributed this observation to short-term sensory adaptation [9]. To keep the model as simple as possible, we did not introduce sensory adaptation. However, such a short-term effect, occurring only at the time of stimulus presentation, would not affect the stabilization for long OFF-durations as predicted by the model with descending loops.

To summarize, dCI predicts the stabilization of bistable perception for longer OFF-periods. In addition, it makes specific predictions about the persistence of each interpretation separately, which could help to experimentally validate (or invalidate) this model.

Bistable perception as a tool for investigating mental illness

So far, we have described a functional model of bistable perception, based on the notion of CI. Accumulating evidence supports the idea that circularity (and especially a small amount of descending loops) is a common property of the human brain, reflecting some inherent limitations of neural circuits [25,26]. However, it has also been suggested that CI could be the cause of several cognitive and/or perceptual disorders, including schizophrenia [22,24]. In a previous study, Jardri et al found that on average, patients with schizophrenia have stronger ascending loops compared to a group of matched healthy controls [25]. Additionally, it was evidenced that “positive” (i.e., psychotic) symptoms, including hallucinations and delusions, correlate with the amount of ascending loops (i.e., sensory evidence amplification), “negative” symptoms, including lack of motivation and anhedonia, correlate with the amount of descending loops (i.e., prior amplification), and finally, cognitive disorganization correlates with the total amount of loops (aS + aP). Considering these previous findings, an interesting question is what does the current dCI model predict the behavior of schizophrenia patients exposed to bistable stimuli?

Fig 7A and 7B illustrates the effect of ascending loops on the bias (relative predominance) and stability (mean phase duration). As previously shown, ascending loops increase the gain of the noise, facilitating the jumps between the 2 attractors. Consequently, our model predicts that patients with more severe hallucinations and delusions should be less biased in their responses (both due to inherent priors and visual cues) but also less stable (especially the interpretation that is supported by the visual cue). Specifically, the effect of ascending loops on relative predominance, although it might seem counterintuitive (over-counting of sensory evidence leads to a smaller effect of that evidence), illustrates the detrimental effect of the higher gain of noise on the accumulation of evidence.

Fig 7. Predicted effects of CI strength on bistable perception.

Fig 7

(A.) Relative predominance (RP) as a function of the strength of sensory evidence in favor (positive drift) or against (negative drift) the preferred configuration (i.e., μnoise) for increasing sensory gain (including ascending loops), from light to dark gray. (B.) Mean phase duration of the preferred and nonpreferred configuration. (C.) The same as (A.) but with no ascending loops and increasing descending loops, from light to dark blue. (D.) The same as (B.), with no ascending loops and increasing descending loops. (E.) The probability of persistence of the preferred (blue) and nonpreferred (red) configuration during the intermittent presentation of an ambiguous stimulus (stimulus duration 200 ms, OFF-duration 5 s) as a function of the ascending loops aS (aP = 0.5). (F.) The same as (E.), but as a function of the descending loops aP (aP = 0). All the other parameters were kept constant across simulations: wS = 1; ron = 0.5; roff = 0.48.

In contrast, descending loops deepen the wells of the energy landscape and consequently, they produce the exact opposite effects. As shown in Fig 7C and 7D, the prediction would be that they increase both the bias and the stability of schizophrenia patients with more severe negative symptoms.

Similar stabilization and destabilization effects as a function of the level of ascending and descending loops are predicted for intermittent presentation (Fig 7E and 7F). In particular, increasing ascending loops (and thus, the sensory gain), leads to destabilization of both the dominant and nondominant percept (more precisely, both SP get closer to 0.5; Fig 7E). This effect is in agreement with recent experimental results on schizophrenia patients [56,57]. In contrast, increasing descending loops stabilizes first the dominant percept, and then both the dominant and nondominant percepts (Fig 7F).

Finally, note that these predictions are not only qualitative but also quantitative. The results in Fig 7, as well as the shape of the stabilization curves in Fig 6, depend on 4 free parameters, the transition rates, overall descending loop strength a and sensory gain wint, all specifically related to generic parameters of perceptions applicable to many behavioral tasks. This could provide a foundation for parametric study of natural variation in the general population and psychiatric disorders, generalization over the results of different experiments (e.g., probabilistic decision tasks versus bistable perception), and raise the possibility of finding specific neural correlates of these variations (e.g., levels of E/I balance, effective connectivity between high-level and low-level areas, etc.) (see S4 Text).

Discussion

In the present paper, we demonstrated that bistable perception could arise in a perceptual system where feedback based on the current beliefs corrupts the sensory inputs. In this scenario, expectations are reverberated back up and considered several times (forming descending information-loops), suboptimally amplifying prior beliefs and causing the system to «see what it expects» [24]. The emerging dynamical system can explain various intriguing features of bistable perception, including its mere existence. It artificially inflates the accumulated noisy information, leading to a system that perceives clearly, persistently and in alternation the two potential interpretations, with high levels of conviction. Such a dCI model is compatible with Levelt’s laws and accounts for the stabilization of the percepts when the stimulus is presented intermittently.

Importantly, this model allowed us to make new predictions regarding bistable perception in physiological and pathological conditions. Each free parameter has a clear interpretation in terms of perceptual inference, can be directly estimated from behavioral data (see S4 Text), and can be generalized to predict behavior in other tasks (e.g., probabilistic decisions). Crucially, although descending loops could be necessary for bistability, they are not sufficient. Bistable stimuli need to lack crucial information that would clearly disambiguate them in a natural setting (such as depth cues). The perceptual system should expect the input distribution to differ between the two interpretations (otherwise they would be uninformative and disregarded) even if this is not the case for artificial stimuli used in bistable experiments (Fig 1B). Of note, completely ambiguous stimuli are, in fact, very rare [3,58] and unlikely to be learned from experience.

From the point of view of the underlying dynamics of perception, descending loops have important consequences beyond bistability. Due to their inherently stabilizing effect, a perceptual system can switch from a pure Bayesian integrator to a bistable attractor. By changing just the strength of descending loops, the perceptual system can transit between two decision-making strategies: Integration to bound [59,60] and attractor dynamics [61,62].

Beyond our model, various other implementations have been proposed to account for the unique characteristics of bistable perception. Mechanistic models have either focused on neural mechanisms [7,8,10] and/or on more abstract dynamical systems [6,9,12]. Nevertheless, those models are usually designed on an ad hoc basis and remain largely descriptive. With few exceptions (e.g., [45]), they are agnostic regarding the functional implication of bistability for perception and decision in general. In other words, although they may address the «what» questions (mechanisms and implementations), they are not addressing the «why» questions (epistemological questions).

To answer the second type of question, other groups have proposed functional models of bistable perception that approach the problem in a top-down fashion [1721,63,64]. Like ours, those approaches focus on the type of problems that perceptual systems usually encounter (e.g., deal with uncertainty) and impose functional limitations (e.g., Markovian statistics, approximate Bayesian inference [65]). However, some of these models are abstract and do not specify neural mechanisms. Others are more complex and contain large numbers of free parameters, rendering them difficult to (in)validate experimentally.

In particular, an interesting model that bears some similarity with the dCI model was described by Hohwy and Friston [17] and formalized by Weilnhammer and colleagues [18]. Like dCI, it relies on a message passing algorithm, but instead of belief propagation, it is largely based on a simplified version of predictive coding [28,66,67]—predictive coding postulates that priors explain away sensory inputs while residual prediction error signals are fed-forward to higher regions to update beliefs. Importantly, top-down effects play a crucial role in both explanations of bistability. Instead of adding (descending) loops, the predictive coding model suggests that perception is biased by a stabilization prior, which depends on the current interpretation. This prior is constantly weakened by prediction errors emerging from evidence for the suppressed percept, via an exponential decay mechanism. A switch occurs when the evidence for the suppressed percept surpasses that for the dominant percept. Despite their similarities, the two models are not identical. While dCI is derived from first principles (inference in a hidden markov model, corrupted by loops), the predictive coding model relies on a number of ad-hoc assumptions, that nuance its normative character. For example, the precision of the stabilization prior is renormalized after each switch, resulting in strong and stable percepts; this is an important assumption, yet it’s difficult to interpret it from a normative perspective.

Furthermore, several models were based on the idea that inference is approximated by a sampling process, without explicit calculation and knowledge of the exact posterior distribution [1921]. In that case, bistable perception occurs because the perceptual system is assumed to take only one sample at each time step, resulting in high temporal correlations between samples. This is, in fact, a nuisance in this kind of algorithm, predicting a highly suboptimal form of perceptual inference (e.g., it takes a very long time to infer the exact probability distribution, and the corresponding estimates are much more variable than a maximum-a-posteriori estimate). Because of this limitation, perceptual inference by sampling might be far less performant than belief propagation (even with loops), raising the question of why our perceptual system would choose such a strategy. Additionally, it remains unclear whether those models could account for less trivial experimental results, including stabilization under an intermittent presentation.

Note that in our case, bistable perception could also be seen as a suboptimality resulting from descending loops (i.e., the estimated probability are not the correct ones given the real sensory evidence and prior knowledge). However, we predict that it mostly affects perception in rather unusual cases, e.g., for a fixed level of descending loops, stimuli that are both expected to be very reliable (high wS) and in reality are highly ambiguous (μnoise close to zero). Consequently, this unusual stimulus does not fit our generative model [68]. The effects could be far more subtle otherwise. In agreement with this hypothesis, we found that CI only rarely affects choices in randomly selected probabilistic inference problems (i.e., random graphs, see [22]).

The dCI model presented in this paper is normative (i.e. derived from first principles; strictly speaking, normativity is violated due to the loops) but can also be seen as descriptive due to its closed-form solution. Switches in perceptual bistability are driven by noise in agreement with existing evidence [1315]. In contrast to models based on lateral inhibition between local populations, bistable perception is interpreted as a brain-wide phenomenon linked to inhibitory control of feedforward and feedback processes (as is generally required for hierarchical perceptual inference [22]). Its dynamical behavior has important similarities with that of other attractor models [12], but the bistable attractor is hereby not imposed to explain certain features of bistability, but instead a direct consequence of the descending loops. In the same vein, our model makes a clear distinction between a bias induced by sensory evidence and bias resulting from the system’s implicit preference (prior knowledge), thus enabling the generation of asymmetries in the absence of stimulation (intermittent presentation).

Another important feature of bistable perception, shared by human and nonhuman observers, is the distribution of dominance durations. Although there is considerable variability in the mean phase duration between participants (but also within participants and between conditions or stimuli), there is an impressive similarity in the shape of the distribution of phase durations, relatively well approximated by a gamma or log-normal distribution [6971] (but see also [72]). The dCI model, like all the noise-driven attractor models, generates exponential distributions of phase durations [12]. Several extensions of the model can engender gamma-like distributions, in which simple mechanisms are added on an ad-hoc basis. For example, one could assume that inference is preceded by filtering, which takes place at the very first levels of the sensory hierarchy (e.g. retina, LGN in case of visual inputs); filtered noise is smoother than gaussian noise and precludes the occurrence of fast switches. Alternatively, one could introduce an adaptation-like mechanism (see also [12]); in the dCI context, this could be implemented as time-dependent transition rates, e.g. as a form of learning. Finally, a third option is to replace MAP with a more complex decision criterion, e.g. a more conservative criterion, implemented as a moving threshold, where switches occur only when there is substantial evidence in favour of the opposite interpretation.

It has been argued that CI are linked at the neurophysiological level to an imbalance between neural excitation and inhibition in favor of excitation [24,27]. This imbalance might concern only local microcircuits, encompassing pyramidal cells and local interneurons (Fig 1D), or more global networks, potentially involving thalamocortical or corticostriatal long-range connections [24]. Although both are plausible implementations of loops, local interneurons make a better candidate in the particular case of bistable perception. Indeed, it has been argued that bistability is a rather low-level process mainly occurring within the visual cortex ([4,73,74]; but see [75,76], arguing for the involvement of high-level areas) while the involvement of local inhibition is also supported by pharmacological evidence [77].

Apart from normal brain functioning, CI has been used to account for clinical dimensions in schizophrenia [22,25]. Our model implies that generic mechanisms involved in hallucinations and delusions could also explain common perceptual phenomena, such as bistable perception, in agreement with the idea that psychosis may exist along a continuum with normal experience [7881]. Nevertheless, when and how exactly those mechanisms go awry and generate pathological symptoms remains an open question. In addition, the present model provides a dynamical system interpretation of CI models, relating them to other influential frameworks [8284].

Could circularity offer a relative advantage to perceptual systems or is it simply a manifestation of the inherent limitations of neural systems? Our present results suggest that a system performing exact inference with ambiguous information could be more vulnerable to noise and have difficulties in forming stable percepts. Moderate descending loops could improve the system, allowing rapid and robust decisions even when evidence is not conclusive (after all, both “fighting” and “fleeing” are better than standing still; a similar explanation was suggested by Moreno-Bote and colleagues, who interpreted bistability as exploratory behavior under uncertainty [45]). Moving a step further, a system with flexible descending loops (e.g., a system that can regulate its E/I balance through neuromodulators, such as dopamine, serotonin or acetylcholine [85,86]) could vary the perceptual strategy from impulsive to deliberative in accordance with task requirements. This suggestion, although speculative, could reconcile the present results with evidence showing a balance between excitation and inhibition at different scales [8789] and is furthermore easily testable (e.g., by measuring E/I balance during bistability and during stimulation with unambiguous stimuli).

In conclusion, we described bistable perception as a probabilistic inference process, under the influence of amplified priors due to the presence of descending loops in the cortical hierarchy. The model explains why bistable perception occurs in the first place and qualitatively predicts several of its properties. Additionally, it has important implications for the neural correlates of bistability and the relation between normal brain functioning and pathology, ultimately linking computation, behavior and neural implementation.

Supporting information

S1 Text. Mathematical derivations.

(DOCX)

S2 Text. Bifurcation analysis.

(DOCX)

S3 Text. Phenomenology of bistable perception.

(DOCX)

S4 Text. Parameter recovery.

(DOCX)

Data Availability

The Matlab codes can be found here: github.com/VincentBt/dynamical_CI_bistable.

Funding Statement

P.L. was supported by a PSL Research University PhD fellowship (https://www.psl.eu/en). V.B. was supported by a ANR-16-CE37-0015 PhD fellowship led by R.J (https://anr.fr/). S.D. was supported by an ERC consolidator grant ‘‘Predispike’’ (https://erc.europa.eu/) and by the James McDonnell Foundation award ‘‘Human Cognition” (https://www.jsmf.org/). This research was also supported by: ANR-17-EURE-0017 FrontCog and ANR-10-IDEX-0001-02 PSL grants (Département d’ Etudes Cognitives of the Ecole Normale Supérieure). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Weiss Y, Simoncelli EP, Adelson EH. Motion illusions as optimal percepts. Nat Neurosci. 2002;5: 598–604. 10.1038/nn0602-858 [DOI] [PubMed] [Google Scholar]
  • 2.Notredame C-E, Pins D, Denève S, Jardri R. What visual illusions teach us about schizophrenia. Front Integr Neurosci. 2014;8: 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Arnold DH. Why is binocular rivalry uncommon? Discrepant monocular images in the real world. Front Hum Neurosci. 2011;5: 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Blake R, Logothetis NK. Visual Competition. Nat Rev Neurosci. 2002;3: 1–11. 10.1038/nrn701 [DOI] [PubMed] [Google Scholar]
  • 5.Blake R. A Neural Theory of Binocular Rivalry. Psychol Rev. 1989;96: 145–167. 10.1037/0033-295x.96.1.145 [DOI] [PubMed] [Google Scholar]
  • 6.Lago-Fernandez LF, Deco G. A model of binocular rivalry based on competition in IT. Neurocomputing. 2002;44–46: 503–507. [Google Scholar]
  • 7.Laing CR, Chow CC. A Spiking Neuron Model for Binocular Rivalry. J Comput Neurosci. 2002;12: 39–53. 10.1023/a:1014942129705 [DOI] [PubMed] [Google Scholar]
  • 8.Wilson HR. Computational evidence for a rivalry hierarchy in vision. Proc Natl Acad Sci U S A. 2003;100: 14499–503. 10.1073/pnas.2333622100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Noest AJ, Van Ee R, Nijs MM, Van Wezel RJA. Percept-choice sequences driven by interrupted ambiguous stimuli: A low-level neural model. J Vis. 2007;7: 1–14. 10.1167/7.8.10 [DOI] [PubMed] [Google Scholar]
  • 10.Wilson HR. Minimal physiological conditions for binocular rivalry and rivalry memory. Vision Res. 2007;47: 2741–2750. 10.1016/j.visres.2007.07.007 [DOI] [PubMed] [Google Scholar]
  • 11.Vattikuti S, Thangaraj P, Xie HW, Gotts SJ, Martin A, Chow CC. Canonical Cortical Circuit Model Explains Rivalry, Intermittent Rivalry, and Rivalry Memory. PLoS Comput Biol. 2016;12: 1–22. 10.1371/journal.pcbi.1004903 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Moreno-Bote R, Rinzel J, Rubin N. Noise-Induced Alternations in an Attractor Network Model of Perceptual Bistability. J Neurophysiol. 2007;98: 1125–1139. 10.1152/jn.00116.2007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Shpiro A, Moreno-Bote R, Rubin N, Rinzel J. Balance between noise and adaptation in competition models of perceptual bistability. J Comput Neurosci. 2009;27: 37–54. 10.1007/s10827-008-0125-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Panagiotaropoulos TI, Kapoor V, Logothetis NK, Deco G. A Common Neurodynamical Mechanism Could Mediate Externally Induced and Intrinsically Generated Transitions in Visual Awareness. PLoS One. 2013;8 10.1371/journal.pone.0053833 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Huguet G, Rinzel J, Hupé J. Noise and adaptation in multistable perception: Noise drives when to switch, adaptation determines percept choice. J Vis. 2014;14: 1–24. 10.1167/14.3.19 [DOI] [PubMed] [Google Scholar]
  • 16.Brascamp J, Sterzer P, Blake R, Knapen T. Multistable Perception and the Role of the Frontoparietal Cortex in Perceptual Inference. Annu Rev Psychol. 2018;69: 1–27. [DOI] [PubMed] [Google Scholar]
  • 17.Hohwy J, Roepstorff A, Friston K. Predictive coding explains binocular rivalry: An epistemological review. Cognition. 2008;108: 687–701. 10.1016/j.cognition.2008.05.010 [DOI] [PubMed] [Google Scholar]
  • 18.Weilnhammer V, Stuke H, Hesselmann G, Sterzer P, Schmack K. A predictive coding account of bistable perception—a model-based fMRI study. PLoS Comput Biol. 2017;13: 1–21. 10.1371/journal.pcbi.1005536 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Sundareswara R, Schrater P. Perceptual multistability predicted by search model for Bayesian decisions. J Vis. 2008;8: 1–19. 10.1167/8.5.12 [DOI] [PubMed] [Google Scholar]
  • 20.Reichert D, Seriès P, Storkey A. Neuronal adaptation for sampling-based probabilistic inference in perceptual bistability. Adv Neural Inf …. 2011; 1–9. http://papers.nips.cc/paper/4404-neuronal-adaptation-for-sampling-based-probabilistic-inference-in-perceptual-bistability
  • 21.Gershman SJ, Vul E, Tenenbaum JB. Multistability and Perceptual Inference. Neural Comput. 2012;24: 1–24. 10.1162/NECO_a_00226 [DOI] [PubMed] [Google Scholar]
  • 22.Jardri R, Denève S. Circular inferences in schizophrenia. Brain. 2013;136: 3227–41. 10.1093/brain/awt257 [DOI] [PubMed] [Google Scholar]
  • 23.Deneve S, Jardri R. Circular inference: Mistaken belief, misplaced trust. Curr Opin Behav Sci. 2016;11: 40–48. 10.1016/j.cobeha.2016.04.001 [DOI] [Google Scholar]
  • 24.Leptourgos P, Denève S, Jardri R. Can circular inference relate the neuropathological and behavioral aspects of schizophrenia? Curr Opin Neurobiol. 2017;46: 154–161. 10.1016/j.conb.2017.08.012 [DOI] [PubMed] [Google Scholar]
  • 25.Jardri R, Duverne S, Litvinova AS, Denève S. Experimental evidence for circular inference in schizophrenia. Nat Commun. 2017;8: 14218 10.1038/ncomms14218 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Leptourgos P, Notredame CE, Eck M, Jardri R, Denève S. Circular inference in bistable perception. J Vis. 2020;20: 12 10.1167/jov.20.4.12 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Jardri R, Hugdahl K, Hughes M, Brunelin J, Waters F, Alderson-Day B, et al. Are Hallucinations Due to an Imbalance Between Excitatory and Inhibitory Influences on the Brain? Schizophr Bull. 2016;42: 1124–1134. 10.1093/schbul/sbw075 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Friston K. Hierarchical Models in the Brain. PLoS Comput Biol. 2008;4 10.1371/journal.pcbi.1000211 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Clark A. Whatever next? Predictive brains, situated agents, and the future of cognitive science. Behav Brain Sci. 2013;36: 181–204. 10.1017/S0140525X12000477 [DOI] [PubMed] [Google Scholar]
  • 30.Mathys CD, Lomakina EI, Daunizeau J, Iglesias S, Brodersen KH, Friston KJ, et al. Uncertainty in perception and the Hierarchical Gaussian Filter. Front Hum Neurosci. 2014;8: 1–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Finlayson NJ, Zhang X, Golomb JD. Differential patterns of 2D location versus depth decoding along the visual hierarchy. Neuroimage. 2017;147: 507–516. 10.1016/j.neuroimage.2016.12.039 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Felleman DJ, Van Essen DC. Distributed hierachical processing in the primate cerebral cortex. Cereb Cortex. 1991;1: 1–47. [DOI] [PubMed] [Google Scholar]
  • 33.Lee TS, Mumford D. Hierarchical bayesian inference in the visual cortex. J Opt Soc Am A. 2003;20: 1434–1448. 10.1364/josaa.20.001434 [DOI] [PubMed] [Google Scholar]
  • 34.Mamassian P, Landy MS. Observer biases in the 3D interpretation of line drawings. Vision Res. 1998;38: 2817–2832. 10.1016/s0042-6989(97)00438-0 [DOI] [PubMed] [Google Scholar]
  • 35.Douglas RJ, Koch C, Mahowald M, Martin KAC, Suarez HH. Recurrent excitation in neocortical circuits. Science (80-). 1995;269: 981–985. 10.1126/science.7638624 [DOI] [PubMed] [Google Scholar]
  • 36.Hupé J, James A, Payne B, Lomber S, Girard P, Bullier J. Cortical feedback improves discrimination between figure and background by V1, V2 and V3 neurons. Nature. 1998;394: 784–787. Available: https://search.proquest.com/openview/09bbe4a11f10a409727670219c4b017b/1?pq-origsite=gscholar&cbl=40569 [DOI] [PubMed] [Google Scholar]
  • 37.Bullier J, Hupé JM, James AC, Girard P. The role of feedback connections in shaping the responses of visual cortical neurons. Prog Brain Res. 2001;134: 193–204. 10.1016/s0079-6123(01)34014-1 [DOI] [PubMed] [Google Scholar]
  • 38.Manita S, Suzuki T, Homma C, Matsumoto T, Odagawa M, Yamada K, et al. A Top-Down Cortical Circuit for Accurate Sensory Perception. Neuron. 2015;86: 1304–1316. 10.1016/j.neuron.2015.05.006 [DOI] [PubMed] [Google Scholar]
  • 39.Deneve S. Bayesian Spiking Neurons I: Inference. Neural Comput. 2008;20: 91–117. 10.1162/neco.2008.20.1.91 [DOI] [PubMed] [Google Scholar]
  • 40.Knapen T, Brascamp J, Pearson J, van Ee R, Blake R. The role of frontal and parietal brain areas in bistable perception. J Neurosci. 2011;31: 10293–10301. 10.1523/JNEUROSCI.1727-11.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Levelt WJM. The Alternation Process in Binocular Rivalry. Br J Psychol. 1966;57: 225–238. 10.1111/j.2044-8295.1966.tb01023.x [DOI] [Google Scholar]
  • 42.Klink PC, van Ee R, van Wezel RJ a. General validity of Levelt’s propositions reveals common computational mechanisms for visual rivalry. PLoS One. 2008;3: e3473 10.1371/journal.pone.0003473 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Shpiro A, Curtu R, Rinzel J, Rubin N. Dynamical Characteristics Common to Neuronal Competition Models. J Neurophysiol. 2007;97: 462–473. 10.1152/jn.00604.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Brascamp JW, Klink PC, Levelt WJM. The ‘laws’ of binocular rivalry: 50 years of Levelt’s propositions. Vision Res. 2015;109: 20–37. 10.1016/j.visres.2015.02.019 [DOI] [PubMed] [Google Scholar]
  • 45.Moreno-Bote R, Shpiro A, Rinzel J, Rubin N. Alternation rate in perceptual bistability is maximal at and symmetric around equi-dominance. J Vis. 2010;10: 1–1. 10.1167/10.11.1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Walker P. Stochastic properties of binocular rivalry alternations. Percept Psychophys. 1975;18: 467–473. [Google Scholar]
  • 47.Lehky SR. Binocular rivalry is not chaotic. Proc R Soc London B Biol Sci. 1995;259: 71–76. [DOI] [PubMed] [Google Scholar]
  • 48.Nawrot M, Blake R. Neural Integration of Information Specifying Structure from Stereopsis and Motion. Science (80-). 1989;244: 716–718. 10.1126/science.2717948 [DOI] [PubMed] [Google Scholar]
  • 49.Pastukhov A, Braun J. Cumulative history quantifies the role of neural adaptation in multistable perception. J Vis. 2011;11: 12–12. 10.1167/11.10.12 [DOI] [PubMed] [Google Scholar]
  • 50.Orbach J, Ehrlich D, Heath HA. Reversibility of the Necker Cube: I. An examination of the concept of “satiation of orientation.” Percept Mot Skills. 1963;17: 439–458. 10.2466/pms.1963.17.2.439 [DOI] [PubMed] [Google Scholar]
  • 51.Leopold DA, Wilke M, Maier A, Logothetis NK. Stable perception of visually ambiguous patterns. Nat Neurosci. 2002;5: 605–609. 10.1038/nn0602-851 [DOI] [PubMed] [Google Scholar]
  • 52.Pearson J, Brascamp J. Sensory memory for ambiguous vision. Trends Cogn Sci. 2008;12: 334–41. 10.1016/j.tics.2008.05.006 [DOI] [PubMed] [Google Scholar]
  • 53.Maier A, Wilke M, Logothetis NK, Leopold DA. Perception of Temporally Interleaved Ambiguous Patterns. Curr Biol. 2003;13: 1076–1085. 10.1016/s0960-9822(03)00414-7 [DOI] [PubMed] [Google Scholar]
  • 54.Sterzer P, Rees G. A Neural Basis for Percept Stabilization in Binocular Rivalry. J Cogn Neurosci. 2008;20: 389–399. 10.1162/jocn.2008.20039 [DOI] [PubMed] [Google Scholar]
  • 55.Kornmeier J, Ehm W, Bigalke H, Bach M. Discontinuous presentation of ambiguous figures: How interstimulus-interval durations affect reversal dynamics and ERPs. Psychophysiology. 2007;44: 552–560. 10.1111/j.1469-8986.2007.00525.x [DOI] [PubMed] [Google Scholar]
  • 56.Schmack K, Gòmez-Carrillo de Castro A, Rothkirch M, Sekutowicz M, Rössler H, Haynes J-D, et al. Delusions and the role of beliefs in perceptual inference. J Neurosci. 2013;33: 13701–12. 10.1523/JNEUROSCI.1778-13.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Schmack K, Schnack A, Priller J, Sterzer P. Perceptual instability in schizophrenia: Probing predictive coding accounts of delusions with ambiguous stimuli. Schizophr Res Cogn. 2015;2: 72–77. 10.1016/j.scog.2015.03.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Kersten D, Mamassian P, Yuille A. Object perception as Bayesian inference. Annu Rev Psychol. 2004;55: 271–304. 10.1146/annurev.psych.55.090902.142005 [DOI] [PubMed] [Google Scholar]
  • 59.Ratcliff R, Smith PL, Brown SD, McKoon G. Diffusion Decision Model: Current Issues and History. Trends Cogn Sci. 2016;20: 260–281. 10.1016/j.tics.2016.01.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Palmer J, Huk AC, Shadlen MN. The effect of stimulus strength on the speed and accuracy of a perceptual decision. J Vis. 2005;5: 376–404. 10.1167/5.5.1 [DOI] [PubMed] [Google Scholar]
  • 61.Bitzer S, Bruineberg J, Kiebel SJ. A Bayesian Attractor Model for Perceptual Decision Making. PLoS Comput Biol. 2015;11: 1–35. 10.1371/journal.pcbi.1004442 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Wang XJ. Probabilistic decision making by slow reverberation in cortical circuits. Neuron. 2002;36: 955–968. 10.1016/s0896-6273(02)01092-9 [DOI] [PubMed] [Google Scholar]
  • 63.Dayan P. A Hierarchical Model of Binocular Rivalry. Neural Comput. 1998;10: 1119–1135. Available: http://www.scopus.com/inward/record.url?eid=2-s2.0-0032111193&partnerID=40&md5=2220a1a71a4cfd3e9066c68547e73897 [DOI] [PubMed] [Google Scholar]
  • 64.Albert S, Schmack K, Sterzer P, Schneider G. A hierarchical stochastic model for bistable perception. PLoS Computational Biology. 2017. 10.1371/journal.pcbi.1005856 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Bishop C. Pattern Recognition and Machine Learning. Springer; 2006. [Google Scholar]
  • 66.Rao RPN, Ballard DH. Predictive coding in the visual cortex: A functional interpretation of some extra-classical receptive-field effects. Nat Neurosci. 1999;2: 79–87. 10.1038/4580 [DOI] [PubMed] [Google Scholar]
  • 67.Spratling MW. A review of predictive coding algorithms. Brain Cogn. 2017;112: 92–97. 10.1016/j.bandc.2015.11.003 [DOI] [PubMed] [Google Scholar]
  • 68.Beck JM, Ma WJ, Pitkow X, Latham PE, Pouget A. Not Noisy, Just Wrong: The Role of Suboptimal Inference in Behavioral Variability. Neuron. 2012;74: 30–39. 10.1016/j.neuron.2012.03.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Levelt WJM. Note on the distribution of dominance times in binocular rivalry. Br J Psychol. 1967;58: 143–145. 10.1111/j.2044-8295.1967.tb01068.x [DOI] [PubMed] [Google Scholar]
  • 70.Zhou YH, Gao JB, White KD, Merk I, Yao K. Perceptual dominance time distributions in multistable visual perception. Biol Cybern. 2004;90: 256–263. 10.1007/s00422-004-0472-8 [DOI] [PubMed] [Google Scholar]
  • 71.Gigante G, Mattia M, Braun J, Del Giudice P. Bistable perception modeled as competing stochastic integrations at two levels. PLoS Comput Biol. 2009;5: 1–9. 10.1371/journal.pcbi.1000430 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Brascamp JW, van Ee R, Pestman WR, van den Berg A V. Distributions of alternation rates in various forms of bistable perception. J Vis. 2005;5: 287–98. 10.1167/5.4.1 [DOI] [PubMed] [Google Scholar]
  • 73.Brascamp J, Sohn H, Lee S-H, Blake R. A monocular contribution to stimulus rivalry. Proc Natl Acad Sci. 2013;110: 8337–8344. 10.1073/pnas.1305393110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Brascamp J, Blake R, Knapen T. Negligible fronto-parietal BOLD activity accompanying unreportable switches in bistable perception. Nat Neurosci. 2015;18: 1672–1678. 10.1038/nn.4130 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Lumer ED, Frsiton KJ, Rees G. Neural Correlates of Perceptual Rivalry in the Human Brain. Science (80-). 1998;280: 1930–1934. 10.1126/science.280.5371.1930 [DOI] [PubMed] [Google Scholar]
  • 76.Sterzer P, Kleinschmidt A. A neural basis for inference in perceptual ambiguity. Proc Natl Acad Sci U S A. 2007;104: 323–8. 10.1073/pnas.0609006104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Van Loon AM, Knapen T, Scholte HS, St. John-Saaltink E, Donner TH, Lamme VAF. GABA shapes the dynamics of bistable perception. Curr Biol. 2013;23: 823–827. 10.1016/j.cub.2013.03.067 [DOI] [PubMed] [Google Scholar]
  • 78.Waters F, Blom JD, Dang-Vu TT, Cheyne AJ, Alderson-Day B, Woodruff P, et al. What Is the Link Between Hallucinations, Dreams, and Hypnagogic-Hypnopompic Experiences? Schizophr Bull. 2016;42: 1098–1109. 10.1093/schbul/sbw076 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Alderson-Day B, Lima CF, Evans S, Krishnan S, Shanmugalingam P, Fernyhough C, et al. Distinct processing of ambiguous speech in people with non-clinical auditory verbal hallucinations. Brain. 2017;140: 2475–2489. 10.1093/brain/awx206 [DOI] [PubMed] [Google Scholar]
  • 80.Baumeister D, Sedgwick O, Howes O, Peters E. Auditory verbal hallucinations and continuum models of psychosis: A systematic review of the healthy voice-hearer literature. Clin Psychol Rev. 2017;51: 125–141. 10.1016/j.cpr.2016.10.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Powers AR, Mathys C, Corlett PR. Pavlovian conditioning–induced hallucinations result from overweighting of perceptual priors. Science (80-). 2017;357: 596–600. 10.1126/science.aan3458 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Loh M, Rolls ET, Deco G. A dynamical systems hypothesis of schizophrenia. PLoS Comput Biol. 2007;3: 2255–2265. 10.1371/journal.pcbi.0030228 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Rolls ET, Deco G. A computational neuroscience approach to schizophrenia and its onset. Neurosci Biobehav Rev. 2011;35: 1644–1653. 10.1016/j.neubiorev.2010.09.001 [DOI] [PubMed] [Google Scholar]
  • 84.Adams RA, Napier G, Roiser JP, Mathys C, Gilleen J. Attractor-like dynamics in belief updating in schizophrenia. J Neurosci. 2018;38: 9471–9485. 10.1523/JNEUROSCI.3163-17.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Lucas-Meunier E, Monier C, Amar M, Baux G, Frégnac Y, Fossier P. Involvement of nicotinic and muscarinic receptors in the endogenous cholinergic modulation of the balance between excitation and inhibition in the young rat visual cortex. Cereb Cortex. 2009;19: 2411–2427. 10.1093/cercor/bhn258 [DOI] [PubMed] [Google Scholar]
  • 86.Moreau WA, Amar M, Le Roux N, Morel N, Fossier P. Serotoninergic fine-tuning of the excitation-inhibition balance in rat visual cortical networks. Cereb Cortex. 2010;20: 456–467. 10.1093/cercor/bhp114 [DOI] [PubMed] [Google Scholar]
  • 87.Wehr M, Zador AM. Balanced inhibition underlies tuning and sharpens spike timing in auditory cortex. Nature. 2003;426: 442–446. 10.1038/nature02116 [DOI] [PubMed] [Google Scholar]
  • 88.Okun M, Lampl I. Instantaneous correlation of excitation and inhibition during ongoing and sensory-evoked activities. Nat Neurosci. 2008;11: 535–537. 10.1038/nn.2105 [DOI] [PubMed] [Google Scholar]
  • 89.Xue M, Atallah B V., Scanziani M. Equalizing excitation–inhibition ratios across visual cortical neurons. Nature. 2014;511: 596–600. 10.1038/nature13321 [DOI] [PMC free article] [PubMed] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008480.r001

Decision Letter 0

Ulrik R Beierholm, Samuel J Gershman

17 Apr 2020

Dear Dr Leptourgos,

Thank you very much for submitting your manuscript "A functional theory of bistable perception based on dynamical circular inference" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

From the reviewer comments it should be pretty clear that there are just a few, but serious concerns that need to be addressed. We would call your attention to reviewer 1's comment about model fitting, and especially reviewer 2's comments about Levelt’s laws. If you believe that you can address all the reviewers points we would be happy to reconsider the manuscript. We would like to emphasize that the issues raised by the reviewers concerning Levelt's laws possibly indicate fundamental problems with the modeling approach, so a revised manuscript might be rejected without being sent for re-review if we feel that these problems have not been adequately addressed (though we sincerely hope they can be addressed).

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Ulrik R. Beierholm

Associate Editor

PLOS Computational Biology

Samuel Gershman

Deputy Editor

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors present a new computational approach toward bistable perception. Based on the established algorithm of circular inference, they deduce minimal conditions under which bistable perception can occur in this framework. With this, the authors place bistable perception in the general context of perceptual inference. Lastly, they relate their model to alternations in perceptual inference related with psychotic symptoms.

Next to analytical methods, the authors performed simulation analyses and compared the model predictions to a number of empirical characteristics of bistable perception (Levelt’s laws and the stabilization of bistable perception by intermittent presentation of ambiguous stimuli).

I think that the circular-inference approach to bistable perception outlined in the manuscript is highly relevant for two reasons: Firstly, it deduces the perceptual dynamics of bistable perception from general considerations of perceptual inference. To my mind, such a functional take on bistability is important, since it may help translate perceptual phenomena observed largely in a laboratory context (eg, the frequent perceptual transitions experienced during perceptions) to the characteristics of perception in real-world scenarios. Secondly, by inverting the circular inference model based on behavior, researchers may be able to quantify the relative contribution sensory evidence and prior knowledge on perceptual inference. This may proffer new opportunities in the study of alterations in perceptual inference (eg, positive psychotic symptoms in schizophrenia).

Although I am very sympathetic toward this work, I have several concerns and wishes for clarification:

General Comments

First, broadly, the authors’ claim that this model can be fitted to behavioral data (and thus be useful to study perceptual function in health and disease) does not seem to be backed up empirically in the manuscript. Personally, I would think that the authors could strengthen their circular-inference model of bistable perception significantly if they could show that the latent variables of the model (eg, weights for ascending/descending loops, feed-forward weight ws, the response variable theta, rates of change ron/roff) can be reliably estimated from data. To my mind, this would mean that, when simulating behavioral responses for a set of latent variables, such latent variables could be recovered from the simulated data.

Second, I have some specific wishes for further clarification regarding the methods and analysis, which I outline in detail below. Lastly, I have a few recommendation on how to improve the connection of the authors’ findings to the existing literature. Specific comments are below.

Major comments:

1. I did not fully understand the role of the decision threshold theta. To my understanding, in this circular-inference (CI) model, setting theta to a sufficiently high value should be necessary to maintain stable perceptual states (view from above; view from below in the example of the Necker Cube). For low values of theta, I could imagine that spontaneous fluctuation in L should lead to frequent switches between dominant perceptual states. Yet, in the method section, the authors note that:

“(…) in the case of continuous presentation, it is necessary to set to a sufficiently high positive value to obtain robust perceptual switches. If = 0, the percept would switch multiple times (just because of the noisy input causing small random fluctuations in ) around the time of perceptual transitions.”

Specifically, I did not understand why the stabilizing effect of a high decision threshold stabilized perception only around the time of perceptual transitions,

2. On a more general view, I would find it helpful to see an illustration of the effects of theta on the model’s predictions.

“In our model, a nonzero decision threshold precludes percepts with very short durations. As a result, the distribution resembles a gamma or log-normal distribution. The rising phase and peak of this distribution depend on the decision threshold, but the tail of the distribution does not and is imposed by the mathematical properties of bistable dynamics driven by noise.”

Did I understand correctly that, in the presence of descending loops, it is only due to the decision threshold that simulated phase durations are distributed in a gamma/log-normal distribution? Is the location of the peak of the distribution uniquely determined by the value of the decision threshold? Is there any relation between the energy landscapes shown in Figure 3 and the theta parameter? Moreover, is the minimum value of the decision threshold that is necessary to induce stable perceptual inference correlated with the standard deviation of sensory evidence / the likelihood function?

3. In a related point, I would need additional clarifications on how the role of theta relates to the function of descending loops:

“Note that large values of can lead to a distribution of phase duration similar to the system of descending loops. However, while the distribution of phase duration cannot be considered proof of the presence of circular inference, the resulting confidence is often below the decision thresholds. This may preclude the emergence of strong and stable percepts in the absence of descending loops.”

Here, the authors introduce the concept of “confidence”. If I understood it correctly, high-confidence perceptual states only emerge in the presence of descending loops. I would find it helpful if the authors could contextualize this to the existing literature on bistable perception:

A number of studies has devoted a lot of attention of mixed percepts during bistability (eg., Knapen 2011). Do the authors assume that such mixed percepts (low-confidence/high-uncertainty perceptual states) arise at the time of state transitions between the energy wells in Figure 3c? How would the energy landscape look like for other types of bistable stimuli that show sudden transitions, such as discontinuous structure-from-motion stimuli (eg., Weilnhammer 2013)?

4. With regard to perceptual biases: From Figure 3d, should it be concluded that the CI model assumes a difference in confidence when there is a bias between perceptual alternatives? In the example of the Necker Cube, this would mean that the view-from-below is generally associated with reduced confidence. To my mind, this would be an important prediction of the CI model. Are the authors aware of any empirical evidence for this model prediction?

5. In the section on Levelt’s 4th law, the authors investigate the effect on an increase in the strength of both interpretations on the alternation rate of a bistable stimulus. They captured this increase in stimulus strength by increasing the variance of the noise distribution. This choice did not seem straightforward to me. Several alternatives would also seem plausible to me: Could an increase in stimulus strength be reflected by a decrease in variance of the noise distribution? Or by a modulation in variance of the likelihood distribution?

Minor comments

With regard to the Abstract, I would like to make a few suggestions that could render the content more accessible to the naïve reader:

1. While these points become clear after reading the manuscript, my personal impression was that they are difficult to understand on the basis of the abstract and general knowledge about bistable perception. Readers without a background in computational modelling of bistable perception might have a hard time understanding these points.

“We show that in the face of ambiguous sensory stimuli, circular inference can

turn what should be a leaky integrator into a bistable attractor switching between two highly trusted interpretations. (…) Since it is related to the generic perceptual inference mechanism, this approach can be used to predict the tendency of individuals to form aberrant beliefs

from their bistable perception behavior.”

2. Maybe I have overlooked something, but while the main text contains a section of psychotic symptoms in schizophrenia patients, I could not find a discussion on cognitive functions in non-clinical populations.

“Overall, we suggest that feedforward/feedback information loops in hierarchical neural networks, a phenomenon that could lead psychotic symptoms when overly strong, could also underlie cognitive functions in nonclinical populations.”

With regard to the introduction, I have a few additional comments:

3. If I understood correctly, the authors introduced perceptual inference during bistable perception as “suboptimal”:

“In most cases, this task is performed very accurately, and the correct interpretation is found. Sometimes, perceptual systems fail to detect any meaningful interpretation (e.g., when sensory evidence is too degraded) or converge to the wrong interpretation. Finally, a third possibility occurs (mainly in lab conditions [3]) when ambiguity is high; the system detects more than one plausible interpretations but instead of committing to one interpretation, it switches every few seconds, a phenomenon known as bistable perception [4].

(….) Crucially, there is a discrepancy between the real input and the input assumed by the internal model. This, together with the loops, predicts the suboptimal inference at the heart of

bistable perception (Figure 1; caption).

I was wondering whether the authors could add a little more detail as to why they view perceptual inference is suboptimal or incorrect. As they authors note throughout the manuscript, truly ambiguous images (eg. the line drawings of a Necker cube, disparate monocular inputs in case of binocular rivalry) are very rare.

Could it also be that, because of the extremely low probability of a fully ambiguous real-word cause of sensory input, committing to one highly trusted stimulus interpretation is indeed adaptive/optimal? This thought also appears in the discussion (“Moderate descending loops could improve the system, allowing rapid and robust decisions even when evidence is not conclusive”)

Reviewer #2: General comment

In this paper, Pantelis Leptourgos and colleagues propose 'circular inference', a model of hierarchical Bayesian inference, for the modeling of bistable perception. They suggest that bistability can be conceived as a process whereby sensory responses trigger activity in higher-level areas but are also modulated by feedback projections from these same areas, thus reverberating prior beliefs in the cortical hierarchy. They show that in the face of ambiguous sensory stimuli, circular inference can turn what should be a leaky integrator into a bistable attractor that switches between two alternative interpretations. Moreover, they report that their model is able to capture various aspects of bistable perception, including Levelt’s laws and the stabilizing effect of intermittent stimulus presentation. They speculate that feedforward/feedback information loops in hierarchical neural networks, a phenomenon that could lead to psychotic symptoms when overly strong, could also underlie cognitive functions in nonclinical populations.

Overall, this is a very interesting, timely and well-written paper that adds an interesting new approach to the modeling of bistable perception. Since this algorithmic model is also based on established theories of cortical neural circuit function, such as E/I balance it is a promising candidate to link modeling at the algorithm level to the level of neural implementation. In addition, it may be well suited to the modeling of altered perceptual inference in neuropsychiatric disorders such as schizophrenia, as the authors point out. I have a few concerns and questions that I would like to ask the authors to consider. In particular, there seem to be a few problems with the part regarding Levelt's laws that need to be addressed (see below for details).

Specific points

1. In the introduction, the authors state that a few important questions have remained unanswered by previous modeling approaches. The first is, why a system should form "such strong percepts based on unreliable sensory evidence". I'm not sure that the sensory evidence that typically gives rise to bistable perception is what I would call "unreliable". It is perceptually ambiguous, which means that it is compatible with two competing perceptual interpretations. However, as opposed to noisy stimuli, the evidence is quite clear-cut and reliable. It's not the sensory information that's unreliable, it's the perceptual system that has a hard time "making up it's mind" which of two clear-cut interpretations to go for. It is therefore not obvious to me what should be puzzling about the fact that two "strong" percepts are formed.

The second question is why percepts persist "instead of switching rapidly". To my mind, this question has been answered quite satisfactorily, from a normative perspective, by Hohwy et al. 2008. In their conceptual model (which was later implemented as a computational model, see Weilnhammer et al. 2017), it is the current percept that determines the prediction of what the sensory input is caused by. This prediction is fed back from higher to lower hierarchical levels and thereby stabilizes perception. This is very similar to what the authors suggest in their current work, so I'm struggling to see what is new here (again, from a normative perspective).

The third question is "how the behavior of individuals in bistable perception tasks may predict their performance in other probabilistic inference tasks". This is a potentially interesting topic, but I did not find it addressed in this paper.

2. Related to the second of the above questions: From a normative perspective and conceptually, the circular inference model of bistable perception is extremely similar to the predictive coding model proposed by Hohwy et al./Weilnhammer et al.. Both models assume top-down projections (feedback/descending loops) to stabilize and bottom-up projections (feedforward/ascending loops) to destabilize perception. Since these two models are so closely related, I think it would be worthwhile to add a paragraph to the discussion section where the circular inference model is discussed in the light of this previous model, how the new model differs from the predictive coding model, and how it may go beyond it?

3. Why is the theta parameter needed at all, as it only has a thresholding function to avoid fast reversals. It is not needed in the presence of descending loops, and is stated therefore not to be considered further (Lines 352 f.)? So why is it needed in the model at all? And what could be an equivalent of this parameter in terms of a neural mechanism or function? Please clarify.

4. Maybe I din't get the point here, but I'm not sure I understood why sensory evidence is referred to as "noise" (e.g. line 418). The sensory data do contain some information, not only noise, don't they?

5. Levelt's 2nd law is misrepresented and it therefore seems that the authors' conclusions in this regard are not tenable. It is stated that manipulating the strength of one perceptual interpretation "mainly affects the persistence of the stronger interpretation". This is not what Levelt's 2nd law says. In fact it states that increasing stimulus strength for one eye will NOT affect the average perceptual dominance duration of that eye’s stimulus. Instead, it will reduce the average perceptual dominance duration of the other eye’s stimulus. What results from the model presented here is exactly the opposite.

6. Levelt's 3rd law is similarly misrepresented. The authors state "… that increasing the difference in the stimulus strength between the two perceptual interpretations should result in a decrease in the perceptual alternation rate". Again this is not what Levelt's 3rd says. What it does state is that increasing the strength of one stimulus will increase the alternation rate. Increasing the strength of one stimulus effectively increases the difference in stimulus strength, therefore the authors' statement says exactly the opposite and, again, what results from the model does not conform to Levelt's 3rd law.

7. The model manipulation that is used to test Levelt's 4th law is inconsistent with what was done to test the other three laws. For the first three laws, which entail a difference in stimulus strength, stimulus differences are modeled by changing the noise drift parameter. This is obviously not possible if the strength of both stimuli needs to be changed in the same direction. Instead, the authors resort to another parameter, the noise distribution. In other words, this looks like the authors model changes in stimulus strength by changing ad libitum whichever model parameter suits them best. There may be a good reason to do it this way, but if there is, it should be explained.

8. Line 324: "resulting in an exponential decay observed in the distribution of dominance durations". It would be very helpful if the dominance distributions were presented in a figure.

9. Line 127: It is misleading to say that the Necker cube is "equally compatible with 2 different 3D cubes", as it is actually more compatible with the view from above (as the authors also state further down in the paper).

10. Line 348: "see the Results section". This IS the results section.

11. "Severity of loops" (used multiple times) should probably better read "Strength of loops".

12. Line 714: "flying" should read "fleeing".

Signed: Philipp Sterzer

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Veith Weilnhammer

Reviewer #2: Yes: Philipp Sterzer

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions, please see http://journals.plos.org/compbiol/s/submission-guidelines#loc-materials-and-methods

Attachment

Submitted filename: PlosComp_biol_L.pdf

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008480.r003

Decision Letter 1

Ulrik R Beierholm, Samuel J Gershman

30 Oct 2020

Dear Dr Leptourgos,

We are pleased to inform you that your manuscript 'A functional theory of bistable perception based on dynamical circular inference' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Ulrik R. Beierholm

Associate Editor

PLOS Computational Biology

Samuel Gershman

Deputy Editor

PLOS Computational Biology

***********************************************************

You will notice that reviewer 1 has some minor comments and recommendations that you should consider when submitting your final manuscript. However no response to the reviewer is required.

Reviewer's Responses to Questions

Comments to the Authors:

Reviewer #1: The review is uploaded as an attachment.

Reviewer #2: I thank the authors for the clarifications and the careful revision. I have no further comments.

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Veith Weilnhammer

Reviewer #2: No

Attachment

Submitted filename: PlosComp_biol_II.pdf

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1008480.r004

Acceptance letter

Ulrik R Beierholm, Samuel J Gershman

3 Dec 2020

PCOMPBIOL-D-20-00430R1

A functional theory of bistable perception based on dynamical circular inference

Dear Dr Leptourgos,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Nicola Davies

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Mathematical derivations.

    (DOCX)

    S2 Text. Bifurcation analysis.

    (DOCX)

    S3 Text. Phenomenology of bistable perception.

    (DOCX)

    S4 Text. Parameter recovery.

    (DOCX)

    Attachment

    Submitted filename: PlosComp_biol_L.pdf

    Attachment

    Submitted filename: Response to reviewers.docx

    Attachment

    Submitted filename: PlosComp_biol_II.pdf

    Data Availability Statement

    The Matlab codes can be found here: github.com/VincentBt/dynamical_CI_bistable.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES