Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2018 Mar;111:334–343. doi: 10.1016/j.neuropsychologia.2018.01.041

Active inference and the anatomy of oculomotion

Thomas Parr 1,, Karl J Friston 1
PMCID: PMC5884328  PMID: 29407941

Abstract

Given that eye movement control can be framed as an inferential process, how are the requisite forces generated to produce anticipated or desired fixation? Starting from a generative model based on simple Newtonian equations of motion, we derive a variational solution to this problem and illustrate the plausibility of its implementation in the oculomotor brainstem. We show, through simulation, that the Bayesian filtering equations that implement ‘planning as inference’ can generate both saccadic and smooth pursuit eye movements. Crucially, the associated message passing maps well onto the known connectivity and neuroanatomy of the brainstem – and the changes in these messages over time are strikingly similar to single unit recordings of neurons in the corresponding nuclei. Furthermore, we show that simulated lesions to axonal pathways reproduce eye movement patterns of neurological patients with damage to these tracts.

Keywords: Free energy, Saccades, Oculomotor, Brainstem, Predictive coding, Active inference

Highlights

  • Eye movement control is cast as an inferential process that induces Newtonian rotational forces.

  • A plausible computational anatomy of Bayesian filtering in the brainstem is proposed.

  • Smooth pursuit and saccadic eye movements are simulated.

  • Lesions to the model cause similar patterns of eye movements to those found in neurological patients.

1. Introduction

There are many neurological (Sereno and Holzman, 1995, Büttner et al., 1999, Perry and Zeki, 2000, Anderson and MacAskill, 2013) and psychiatric (Holzman and Levy, 1977, Lipton et al., 1983, Sereno and Holzman, 1995) conditions that cause impairments of eye movement control. As such, assessment of oculomotion forms a crucial part of any neurological examination. We aim to characterise the functional anatomy of eye movement control by appealing to active inference, a principled approach to describing Bayes optimal behaviour (Friston et al., 2009). Our agenda here is to try and understand the oculomotor system in terms of its computational anatomy, as a complement to similar attempts to understand the control of eye movements at higher levels of the visual system; e.g., (Itti and Koch, 2001, Bruce and Tsotsos, 2009).

Previous active inference accounts of eye movements have focused on saccadic target selection (Mirza et al., 2016, Friston et al., 2017b) and ignored the mechanics of oculomotion, or have made use of the simplifying assumption that the position of the eyes can be altered directly through simple attractor dynamics (Friston et al., 2012, Friston et al., 2017a). Here, we follow the example of models that have treated the eyes as physical objects, subject to Newton's laws (Robinson, 1964, Robinson, 1968, McSpadden, 1998, Adams et al., 2012, Perrinet et al., 2014). We build upon these models by equipping each eye with separate kinetics, which are predicted by the brain using a model that is common to both eyes. We emphasise the anatomy and electrophysiology that emerge from this theoretical treatment and their striking resemblance to the properties of the brainstem (Büttner-Ennever and Büttner, 1988, Büttner and Büttner-Ennever, 2006).

The oculomotor system is a crucial interface between inferential processes of the brain, and the Newtonian world that it inhabits. It forms a distributed network (Parr and Friston, 2017a) that involves the cerebral cortex (Paus, 1996, Corbetta et al., 1998), the cerebellum (Berretta et al., 1993), and the basal ganglia (Hikosaka and Wurtz, 1985b, Hikosaka et al., 2000). Ultimately, neuronal messages from these regions combine to generate signals to the extraocular muscles to move the eyes. It is the brainstem that performs the translation of these instructions into motor nerve signals (Sparks, 1986, Sparks, 2002, Sparks and Mays, 1990). In this paper, we seek to understand the computations that must be performed to do this, and their neurobiological substrates. We begin by describing the mechanics of the eyes. We then provide an overview of the principles of active inference, and use these to motivate a predictive (generative) model of eye movements. We demonstrate through simulation that this reproduces eye movements consistent with health and disease, and show the emergence of established electrophysiological observations from these simulations.

2. Mechanics of eye movements

Saccadic eye movements implement the transition from one stationary fixation to another. While we may select a new target for fixation, the physical world does not allow us to alter position directly. Instead, changes in position must be brought about by applying forces that accelerate the eyes towards their target. We will first discuss the influence of these forces, and consider the translation of a desired location into forces in the next section. For simplicity, we assume only two forces acting on each eye. These are resultant forces in the horizontal and vertical dimensions. Each force gives rise to a torque, made up of an active term (muscle contraction), an elastic term, and a viscous term. Using Newton's second law in its rotational form, we arrive at the equations of motion shown on the left of Fig. 1. These equations are relatively simple, but could in principle be replaced by a set of more realistic equations that take account of, among other things, the non-linear relationship between muscle elasticity and length (McSpadden, 1998).

Fig. 1.

Fig. 1

Equations of motion This schematic shows the equations used to determine the motion of the eyes, and the sensations they generate. On the left, the pair of equations defining the ‘real-world’ generative process are shown. On the right, the analogous equations are shown for a generative model of that process. Note that the dimension of the sensory data, y, is equal for both, but the dimensions of the hidden states, x, differ. In the generative process, x1,2,3,4 are the (2 × 2) angular horizontal and vertical positions for the right and left eye (components of the xθ vectors). x5,6,7,8 are the angular velocities (components of the xω vectors). Each of these is associated with a result torque involving the extraocular muscles, a1,2,3,4, an elastic torque with spring constant k1, and a viscous torque with a viscosity constant k2. The resultant torque is converted to acceleration through division by the moment of inertia of the eyeballs J. In the generative model, x1,2 are the horizontal and vertical positions of both eyes, which are crucially assumed to be the same. x3,4 are the velocities. v1,2 are the two components of the target fixation vector. w and z are random Gaussian fluctuations with means of zero and precisions of Πx and Πy respectively.

In addition to the equation describing the movement of the eyes themselves, it is necessary to specify how the angular position and velocity of each eye gives rise to sensory data. The information carried from the eye to the brainstem can be classified into two broad categories. Visual information is passed through the optic nerve (Cranial nerve II), while proprioceptive data from the extraocular muscles travels through afferent fibres in the oculomotor nerves (CN III, IV, VI). We have assumed a simple visual signal in this paper: it is generated through an identity mapping, with added noise, from the position of the eyes (Faisal et al., 2008). In other words, what the eyes see depends entirely on where they look.

The nature of proprioceptive signals from the extraocular muscles is a controversial topic (Donaldson, 2000), but the presence of muscle spindles – the sensory organs of proprioception – in human extraocular muscles has been convincingly demonstrated (Cooper and Daniel, 1949), as has the type of reflex associated with these spindles in other muscles (Sherrington, 1893). It is worth acknowledging that the structure of these spindles is simpler than those found in other muscles (Ruskell, 1989), but the density is comparable (Lukas et al., 1994). In most skeletal muscle, afferent nerve fibres from the muscle spindles carry data about the velocity (type Ia afferents) and instantaneous length of a muscle (type II afferents). Similar signals have been recorded from the oculomotor nerve (Cooper et al., 1951, Tomlinson and Schwarz, 1977), when the extraocular muscles are stretched. We therefore assume that there are two proprioceptive modalities from each eye, carrying signals analogous to the II (position) and Ia (velocity) afferent fibres. Each of these has a horizontal and a vertical component. The equations determining these outputs are shown on the left of Fig. 1. Having specified these primary afferents, we turn to the treatment of these sensory signals by the brain.

3. Active inference

The Free energy principle states that living systems must minimise their variational free energy over time (Friston et al., 2006, Friston, 2009). The Free energy is an upper bound on surprise – or negative log evidence – so this is equivalent to the (almost tautological) statement that organisms are ‘self-evidencing’ (Hohwy, 2016), and seek out the sensory data that maximises the evidence for their own existence. For example, humans exist only within narrow range of temperatures. Sensing a temperature that is comfortably within this range carries greater evidence for existence than one outside it, so the free energy principle mandates that humans should act to ensure the former (Bruineberg et al., 2016). Minimisation of free energy through action and perception is referred to as active inference. The equivalence between active inference and self-evidencing can be seen through Jensen's inequality (Beal, 2003):

F(y~,q)FreeEnergy=Eqlnp(y~,x~,v~)q(x~,v~)lnEqp(y~,x~,v~)q(x~,v~)Jensen'sinequality=lnp(y~)Negativelogevidence:q=argminF

In the above, p is a probability distribution that defines the beliefs an organism has about the way in which sensory data is generated. q is an arbitrary probability distribution that approximates a posterior probability distribution when the free energy is minimised. We refer to v as hidden causes, while x are latent or hidden states. The sensory data y is the only set of variables an organism has access to. The tilde notation implies generalised coordinates of motion (Friston et al., 2008), y˜=vec(y,y,y,), a vector of temporal derivatives. This defines the trajectory of a variable in the same way as a Taylor series. With these definitions, we can write the equations of active inference as gradient descents on the variational free energy.

μ˜˙v=Dμ˜vμ˜vFμ˜˙x=Dμ˜xμ˜xFa˙=aF

The notation uu is used to simplify the equations above. μv and μx are the means (expectations) of the approximate posterior distributions of v and x, respectively. D is a block diagonal matrix with components

Dij={1ifji=10otherwise.

The foregoing provides a brief account of a very general formulation of (self evidencing) systems that effectively infer the causes of their sensory input to suppress surprise – or maximise Bayesian model evidence. Technically, the first pair of equations above corresponds to a generalised (Bayesian) filter. In this setting, a ‘filter’ is a process that recovers latent or hidden states from observed signals. However, the last equation changes the game profoundly. This is because it describes action on the generative process – that changes the ‘filtered’ signals, as we will see below. It is clear from this formulation that we must compute the free energy gradients in the above equation to perform a gradient descent. To do this, we have to define the joint distribution p(y˜,x˜,v˜) that expresses an organism's beliefs about the processes that generate its sensations – its generative model.

4. Generative model

To specify the generative model, we factorise the joint distribution above to give

p(y˜,x˜,v˜)=p(y˜|x˜,v˜)p(x˜|v˜)p(v˜)p(y˜|x˜,v˜)=N(g˜,Π˜y)p(x˜|v˜)=N(f˜,Π˜x)p(v˜)=N(η˜,Π˜v)

This factorisation rests on a pair of equations, f and g, analogous to those in the generative process above: one that determines the temporal dynamics of the system, and one that determines how the system gives rise to sensory data. These are depicted on the right of Fig. 1. η˜ is the mean of the prior distribution over v˜, and Π˜v is its precision (i.e., inverse covariance matrix).

The interface between the generative model and process is illustrated in the Bayesian network in Fig. 2, and this highlights the important differences between the two. The model is much simpler than the process. This is because the model does not allow for each eye to move independently, whereas the position of one eye offers no constraint over that of the other in the physical world. The other key differences are that action is part of the generative process, while hidden causes are only found in the model. The former causes changes in angular velocity, while the latter changes angular position. The hidden cause acts as a point attractor, drawing the eyes towards this position.

Fig. 2.

Fig. 2

The interface between model and process This Bayesian network shows how the generative process (filled circles) gives rise to sensory data, and how the generative model (unfilled circles) proposes this data is generated. Arrows connecting two variables indicate that the second variable is conditionally dependent on the first. Note that, as described in the main text, action of the extraocular muscles (EOM) in the real world causes changes in velocity (i.e. accelerations); while fictive fixation locations cause changes in position in the generative model. The relationship between the vectors in this graph and the variables of Fig. 1 are shown on the right.

If we compute the free energy gradients using a generative model of the form outlined above (Appendix), they can be substituted into the gradient descent equations to arrive at the differential equations in Fig. 3 (Friston et al., 2010). On the right hand side of Fig. 3, we illustrate how these equations could be implemented by passing messages between populations of neurons (Friston and Kiebel, 2009, Bastos et al., 2012, Shipp, 2016). Ascending messages here are (excitatory) prediction errors, while descending messages are (inhibitory) predictions. It is this pattern that characterises predictive coding (Rao and Ballard, 1999, Friston and Kiebel, 2009).

Fig. 3.

Fig. 3

Neuronal message passing On the left are the equations describing a gradient descent on variational free energy. On the right, we show how these equations map to a neuronal message passing scheme for the generative model outlined above. To do so, we have simply assigned the terms on the left hand side of each equation to a neuronal population, and mapped the influences between each population with excitatory and inhibitory connections. We have separated the states representing positions and velocities into right and left components; for consistency with the representation of each hemifield on the contralateral side of the sagittal plane in the brain. The numbers in little blue circles refer to the anatomical designation of expectation and error units in Fig. 5.

Fig. 4 shows the results of applying these equations, with two different prior distributions over the trajectory of a fictive fixation location. The first is a discontinuous function that changes discretely to different values, inducing saccades. The second is a sinusoidal function that gives rise to smooth pursuit eye movements. For both priors, the active inference scheme successfully computes the forces required to fulfil these beliefs. The common generative model for both eyes ensures the eye movements are conjugate – i.e. the eyes move together. In summary, using a plausible generative model and standard (active inference or filtering) dynamics we can reproduce the control of eye movements. Notice that we have not appealed to any control theory: in active inference, motor control follows naturally from the suppression of prediction errors generated by prior expectations: see Fig. 3. In other words, the active filter has prior beliefs about where it should be looking and action fulfils those beliefs in a Bayes optimal fashion. The plausibility of this sort of scheme has been addressed in the context of visual search (Friston et al., 2012) and oculomotor delays (Perrinet et al., 2014).

Fig. 4.

Fig. 4

Simulated eye movements These plots show the changes in expectations (solid lines) and prediction errors (dotted lines) over time for the hidden causes and states during saccadic eye movements (upper), and smooth pursuit movements (lower). The eye positions at various times are shown on the left of each set of plots. The grey regions correspond to 90% Bayesian confidence intervals around the inferred hidden states; namely the vertical and horizontal angular positions and velocities. The legend in the lower right of each plot indicates the modality represented by each line (visual = V, type II afferent/position = θ, type Ia afferent/velocity = ω). For example, a dotted line with a colour associated with V represents a prediction error in the visual domain. To see the key variables plotted individually, please refer to Fig. 5, where these are represented in separate raster plots.

We now turn to the question of the biological substrates of the active filtering equations used to generate oculomotor behaviour per se.

5. Anatomy and electrophysiology

The biological implementation of the equations in Fig. 3 is anatomically constrained in several ways. First, sensory inputs must reach the brain by the cranial nerves that carry that information. The neuronal populations that receive these inputs directly must reside in regions of the brain that contain the terminals of the relevant sensory afferent fibres. Similarly, neurons encoding actions should be lower motor neurons that contribute efferent fibres to the cranial nerves. The abducens nucleus mediates movements in the horizontal dimension only, and all movements in the vertical dimension are mediated by cranial nerves originating in the midbrain. The computational anatomy shown in Fig. 5 satisfies these constraints, and is remarkably consistent with the patterns of excitatory and inhibitory connectivity of the brainstem (Parr and Friston, 2017b).

Fig. 5.

Fig. 5

The computational anatomy of oculomotion On the left of this schematic, we show a plausible anatomical implementation of the Bayesian filtering equations in Fig. 3. This satisfies the connectivity constraints described in the main text. Note that we have included motor neurons (grey) that represent action. As Fig. 3 indicates, these only receive direct influences from the prediction error units at the sensory level. On the right, we show the simulated neuronal activities, along with a horizontal electrooculographic (HEOG) trace indicating the eye position. Each of the numbered raster plots is associated with a particular neuronal population indicated by numbers in little blue circles. See the main text for a description of these units and Fig. 3 for their equivalent location in the computational architecture. SC = superior colliculus; riMLF = rostral interstitial nucleus of the medial longitudinal fasciculus; PPRF = parapontine reticular formation; RIP = raphe interpositus nucleus.

To illustrate the neuronal plausibility of this computational anatomy, electrophysiological responses of cells in each region were simulated by taking the representations of each variable, as shown in the plots in Fig. 4, and converting them into raster plots. The first two raster plots in Fig. 5 show the firing rates of two of the three neuronal populations in the superior colliculus. The colliculus contains cells with three distinct electrophysiological phenotypes: ‘burst’, ‘fixation’, and ‘build-up’ cells (Munoz and Wurtz, 1995a). Burst cells fire at the start of a saccade, as can be seen in the first raster plot. This cell type is known to disnaptically inhibit cells in the Raphe nucleus interpositus (RIP) (Yoshida et al., 2001). This is consistent with the computational anatomy here, as there is an excitatory connection to a second collicular population that has inhibitory connections to the RIP. Both physiologically and anatomically, this cell type appears to be consistent with prediction error units signalling visual prediction (or ‘retinal-slip’) errors of the type implicated in models of eye movement (Krauzlis and Lisberger, 1989).

Fixation cells are active while a fixation is maintained. The second firing rate plot shows a cell that is active maximally only during fixations in one direction. These cells are known to project directly to cells in the RIP (Gandhi and Keller, 1997), again showing consistency with our proposed anatomy. These cells appear to signal the expected hidden cause. Build-up cells have yet another distinct phenotype, and must be assigned to the only remaining collicular cell type in Fig. 5, which signals the error in the expected hidden cause. We discuss this cell type in more detail below, but first turn to a key target of projections from the superior colliculus.

The RIP contains a population of cells known as ‘omnipause’ cells (Büttner-Ennever et al., 1988). These cease firing at the start of a saccade, but are active during fixations. This corresponds well to the third raster plot that shows a decrease in activity locked to each saccade. This signal is the prediction error related to the hidden states encoding current eye position. Neurons in the RIP inhibit those in the rostral interstitial nucleus of the medial longitudinal fasciculus (thought to coordinate vertical saccades (Büttner-Ennever and Büttner, 1978)) and in the parapontine reticular formation (that coordinates horizontal saccades (Cohen et al., 1968, Henn, 1992)) (Strassman et al., 1986). The fourth and fifth rows of raster plots show neurons in the latter area. These neurons show bursting activity that triggers a saccade, here related to the error in positional (proprioceptive) sensations. We have simulated such neurons representing saccades to either side of space.

The pattern of activity of the build-up cells is very interesting, when viewed at a population level (Lee et al., 1988, Munoz and Wurtz, 1995b). To simulate the spatiotemporal characteristics of electrophysiological responses in collicular build-up cells during saccades, we treated the retinotopic location vectors (i.e. the horizontal and vertical components of the error) as encoding the peaks of activity in the superior colliculus. This enabled us to generate simulated responses of colliculus neurons in which (Gaussian) ‘bumps’ of activity moved over a retinotopic map, similar to those elicited in computational models of the superior colliculus (Bozis and Moschovakis, 1998, Seung, 1998, Seung et al., 2000, Trappenberg et al., 2001, Richert et al., 2013). In turn, this enabled us to simulate spatiotemporal responses that would have been observed (by assuming a fixed shape of bump); either by imaging perisaccadic population responses in the deep layers of the superior colliculus (see Fig. 6A) – or unit responses at any particular location – over time – in terms of perisaccadic time histograms (see Fig. 6B). The post stimulus (saccade) time histograms bear a remarkable similarity to empirical results of the sort shown in Fig. 6C (Munoz and Wurtz, 1995b).

Fig. 6.

Fig. 6

Collicular ‘build-up’ cells This shows the population activity in collicular build-up cells during one of the saccades illustrated in Fig. 4 (left). Our simulated build-up cells are those that signal the error in the hidden cause (target fixation location). A shows this as if we had imaged the right superior colliculus, which represents the left side of space. We have made use of the known retinotopy of the colliculus (Quaia et al., 1998) to plot this activity. B shows a set of simulated recordings of single cells from the onset to end of the saccade. Each cell represents a different retinotopic location, indicated by the angles given for each plot. Note that the eccentricity increases with each row. C shows real data (adapted from Munoz and Wurtz, 1995b) from single unit recordings of build-up cells in the superior colliculus.

6. Lesions

Having demonstrated the anatomical and physiological plausibility of an active inference formulation of oculomotor control, we used the anatomical constraints underwriting the computational anatomy in Fig. 5 to motivate simulated lesions. Our first lesion removed all the connections that travel in the oculomotor cranial nerves on the left. This is to demonstrate that the simulation reproduces sensible results; i.e. the paralysis of the left eye (Fig. 7, left). Computationally, this disconnection precludes the receipt of sensory data by proprioceptive prediction error units for the left eye, and disconnects action units from the extraocular muscles.

Fig. 7.

Fig. 7

Computational lesions These plots demonstrate the consequences of simulated lesions. The first is a lesion of all the connections between the brainstem and the extraocular muscles of the left eye. As both the plots and the simulated eyes show, this causes a paralysis of the left eye, in keeping with what we would expect. On the right, we show the consequences of a lesion to the medial longitudinal fasciculus. The images and the plot of ‘action’ show that rightward gaze occurs normally in both eyes, but that leftward gaze reveals a deficit. The right eye fails to adduct to the same degree as the left abducts, and this induces nystagmus in both eyes – primarily the left. This is known clinically as an internuclear ophthalmoplegia. Please see refer to Fig. 4 for an explanation of these plots.

The second simulation aims to model a subtler lesion: damage to the medial longitudinal fasciculus, that travels from the abducens (CN VI) nucleus in the pons to the contralateral oculomotor (CN III) nucleus in the midbrain, causes a clinical sign referred to as an ‘internuclear ophthalmoplegia’. This is commonly seen in demyelinating conditions, such as multiple sclerosis, that induce white matter lesions. This pathology represents a disconnection syndrome (Catani and ffytche, 2005) that manifests as a failure of conjugate control of eye movements.

Fig. 7 (right) shows the results of performing this lesion in silico. Our lesion disrupts the signal from the left CN VI to the right CN III (see Fig. 5). Computationally, this represents a disconnection between error and expectation units encoding horizontal positional error and angular velocity respectively. As in real patients, both eyes are able to look to the right normally. However, when looking to the left, the left eye is able to look laterally, but the right eye fails to keep up while moving medially. This violation of conjugacy induces nystagmus in the (healthy) left eye. In our simulation, nystagmus is seen in both eyes, but more the left than the right. The deficit is most obvious in the plot labelled ‘action’.

7. Discussion

We have demonstrated in the above that, given a prior belief about anticipated fixation locations, η˜, Bayesian filtering can be used to generate movements that fulfil these beliefs. An important outstanding issue relates to the source of these priors. In predictive coding, there are typically higher hierarchical levels in play that send descending messages (predictions) to the lower level (Kiebel et al., 2008). These are used to derive the (empirical) prior beliefs at the lower level. In short, in this paper we have focused on the lowest level of deep (hierarchical) active vision that translates predictions about "where I am going to look next" into oculomotion that realises these predictions. As the predictions η˜ enter the Bayesian filtering equations to form prediction errors ε˜v, any descending connections would have to target units encoding these prediction errors. The anatomy of connections to the superior colliculus therefore hints at the anatomy of higher levels generating top-down predictions (Parr and Friston, 2017a). This anatomy includes projections from the frontal eye fields (Fries, 1984), the parietal cortex, and the substantia nigra pars reticulata (Hikosaka and Wurtz, 1983). We will attempt to address the role of these connections in future work, and to link them to the decision processes we have previously attributed to cortical and subcortical regions (Parr and Friston, 2017a). This will be essential in order to account for more complex, oculomotor behaviour, including the spatial patterns of saccadic searches their resemblance to ‘Lèvy flights’ (Brockmann and Geisel, 1999, Roberts et al., 2013).

We note that there are some subtle differences in the neuronal responses we have simulated (Fig. 5) compared to those measured in real neurons. For example, our simulated burst neurons show not only an increase in firing before a saccade in a given direction, but also a decrease in firing rate before a contralateral saccade. When these neurons have been interrogated in vivo (Munoz and Wurtz, 1995a), a directional sensitivity of this type has been demonstrated. The firing rate of a burst neuron is higher when a saccade is performed in one direction compared to a saccade in the opposite direction. However, there is no clear decrease in activity, relative to baseline firing rate, in response to a saccade contralateral to the preferred direction of a burst neuron – as seen in our simulations. There are several possible explanations for this discrepancy. One is that, as firing rates cannot be negative, the positive and negative parts of the variables encoded by our synthetic neurons are actually represented by different groups of burst neurons. A second possibility is that the mapping between these variables and neuronal firing rates is a convex function. If this is the case, we would expect very small changes in firing rate for a change in a variable at the lower end of the scale compared to those induced by the same change at higher values. The low baseline firing rate of burst neurons (Munoz and Wurtz, 1995a) supports this interpretation.

In addition to the oculomotor syndromes simulated here, an interesting next step would be to consider a broader range of pathologies. For example, schizophrenia is a psychiatric disorder associated with subtle oculomotor abnormalities, including changes in smooth pursuit eye movements (Thaker et al., 1998). Previous research using this form of modelling has been useful in characterising this kind of deficit in terms of abnormal estimates of precision in the generative model (Adams et al., 2012). In addition, eye movement signs are ubiquitous in neurology (Anderson and MacAskill, 2013). To take this model forward – to address cardinal oculomotor deficits in psychiatry and neurology – we may need to develop a more complete model that, in addition to accounting for visual and proprioceptive data, accounts for vestibular inputs. This is likely to be important in the development of nystagmus due to cerebellar or brainstem damage (Troost, 1989).

8. Conclusion

In this paper, we have demonstrated that active inference provides a sufficient and principled account of oculomotor forces that fulfil prior beliefs about eye movements. By using a generative model that is common to both eyes, we enforce conjugate eye movements. When we map the ensuing Bayesian filtering equations to their associated process theory; namely, predictive coding, we find a connectivity structure that is remarkably consistent with the neuroanatomy of the oculomotor brainstem. Once this anatomical assignment is made, it is possible to simulate saccade-related responses we would expect to record from these regions with an electrode. These were formally very similar to recordings from the homologous anatomical regions in the electrophysiological literature. Finally, we showed that anatomically motivated computational lesions reproduced the eye movement deficits seen in neurological patients.

Acknowledgements

TP is supported by the Rosetrees Trust (Award Number 173346). KJF is a Wellcome Principal Research Fellow (Ref: 088130/Z/09/Z). We thank two anonymous reviewers for their helpful suggestions.

Acknowledgments

Disclosure statement

The authors have no disclosures or conflict of interest.

Contributor Information

Thomas Parr, Email: thomas.parr.12@ucl.ac.uk.

Karl J. Friston, Email: k.friston@ucl.ac.uk.

Appendix. Generalised (Bayesian) filtering

To derive the filtering equations (Friston et al., 2010) in Fig. 3, we first must specify the form of the approximate posterior distribution, q(x˜,v˜). We start by making a mean field approximation: this assumes the full distribution can be obtained through a product of marginal distributions for each temporal derivative (x[i] means ith derivative of x) of the hidden states and causes.

q(x˜,v˜)=iq(x[i])q(v[i])

If we take one of these marginal posterior distributions, we can relate this to the joint density given by the generative model through Bayes rule. We can then expand this, using a Taylor series, to find an appropriate form for the distribution.

q(x[i])p(x[i]|x~\i,v~,y~)p(x[i],x~\i,v~,y~)1Zexp(lnp(μx[i],x~\i,v~,y~)+12(μx[i]x[i])2Px[i]+...)=N(μx[i],Px[i])Px[i]=x[i]x[i]lnp(μx[i],x~\i,v~,y~)

The Taylor series expansion reveals the approximate equality between a marginal posterior and a Gaussian distribution (this is also true for q(v[i])). This is known as the Laplace approximation (Friston et al., 2007). Notably, the precision of this distribution is an analytic function of the mean. This means that we only need optimise the mean explicitly.

To find the free energy gradients, we can take the variational derivative with respect to each marginal, and set this to zero. Omitting terms constant terms, this gives:

δq(x[i])F=lnq(x[i])Eq(v~)q(x~\i)[lnp(y~,x~,v~)]δq(x[i])F=0q(x[i])=1Zexp(Eq(v~)q(x~\i)[lnp(y[i]|x[i],v[i])+lnp(x[i+1]|x[i],v[i])+lnp(x[i]|x[i1],v[i1])])δq(v[i])F=lnq(v[i])Eq(v~\i)q(x~)[lnp(y~,x~,v~)]δq(v[i])F=0q(v[i])=1Zexp(Eq(v~\i)q(x~)[lnp(y[i]|x[i],v[i])+lnp(x[i+1]|x[i],v[i])+lnp(v[i])])q(x~\i)jiq(x[j])

As the expectations above are with respect to Gaussian distributions, it follows that the optimal means of these distributions are the values that maximise the terms over which the expectation is taken. This allows us to substitute the means into the arguments of the expectations above, and to perform a generalised gradient ascent of the quantity within the expectation.

μx[i]μx[i+1]=x[i](lnp(y[i]|μx[i],μv[i])+lnp(μx[i+1]|μx[i],μv[i])+lnp(μx[i]|μx[i1],μv[i1]))=12x[i](εy[i]Πy[i]εy[i]+εx[i]Πx[i]εx[i]+εx[i+1]Πx[i+1]εx[i+1])=x[i]g[i]Πy[i]εy[i]+x[i]f[i]Πx[i]εx[i]x[i]Πx[i1]εx[i1]

The second line comes from expanding the log Gaussians, where εy[i]=y[i]g[i], εx[i]=μx[i+1]f[i], and εv[i]=μv[i]η[i]. Similarly,

μ˙v[i]μv[i+1]=v[i]g[i]Πy[i]εy[i]+v[i]f[i]Πx[i]εx[i]Πv[i]εv[i]

Expressing these equalities in terms of generalised coordinates of motion gives the generalised filtering equations shown in Fig. 3.

References

  1. Adams R.A., Perrinet L.U., Friston K. Smooth pursuit and visual occlusion: active inference and oculomotor control in schizophrenia. PLOS ONE. 2012;7(10):e47502. doi: 10.1371/journal.pone.0047502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Anderson T.J., MacAskill M.R. Eye movements in patients with neurodegenerative disorders. Nat. Rev. Neurol. 2013;9(2):74–85. doi: 10.1038/nrneurol.2012.273. [DOI] [PubMed] [Google Scholar]
  3. Bastos A.M., Usrey W.M., Adams R.A., Mangun G.R., Fries P., Friston K.J. Canonical microcircuits for predictive coding. Neuron. 2012;76(4):695–711. doi: 10.1016/j.neuron.2012.10.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Beal M.J. University of London United Kingdom; 2003. Variational Algorithms for Approximate Bayesian Inference. [Google Scholar]
  5. Berretta S., Bosco G., Giaquinta G., Smecca G., Perciavalle V. Cerebellar influences on accessory oculomotor nuclei of the rat: a neuroanatomical, immunohistochemical, and electrophysiological study. J. Comp. Neurol. 1993;338(1):50–66. doi: 10.1002/cne.903380105. [DOI] [PubMed] [Google Scholar]
  6. Bozis A., Moschovakis A.K. Neural network simulations of the primate oculomotor system III. An one-dimensional, one-directional model of the superior colliculus. Biol. Cybern. 1998;79(3):215–230. doi: 10.1007/s004220050472. [DOI] [PubMed] [Google Scholar]
  7. Brockmann, D., Geisel, T., 1999. Are human scanpaths Levy flights? IET Conference Proceedings, 263-268.
  8. Bruce N.D.B., Tsotsos J.K. Saliency, attention, and visual search: an information theoretic approach. J. Vis. 2009;9:3. doi: 10.1167/9.3.5. [DOI] [PubMed] [Google Scholar]
  9. Bruineberg J., Kiverstein J., Rietveld E. The anticipating brain is not a scientist: the free-energy principle from an ecological-enactive perspective. Synthese. 2016:1–28. doi: 10.1007/s11229-016-1239-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Büttner-Ennever J.A., Büttner U. A cell group associated with vertical eye movements in the rostral mesencephalic reticular formation of the monkey. Brain Res. 1978;151(1):31–47. doi: 10.1016/0006-8993(78)90948-4. [DOI] [PubMed] [Google Scholar]
  11. Büttner-Ennever J.A., Büttner U. Neuroanatomy of the oculomotor system. The reticular formation. Rev. Oculomot. Res. 1988;2:119–176. [PubMed] [Google Scholar]
  12. Büttner-Ennever J.A., Cohen B., Pause M., Fries W. Raphe nucleus of the pons containing omnipause neurons of the oculomotor system in the monkey, and Its homologue in man. J. Comp. Neurol. 1988;267(3):307–321. doi: 10.1002/cne.902670302. [DOI] [PubMed] [Google Scholar]
  13. Büttner U., Büttner-Ennever J.A. Present concepts of oculomotor organization. Prog. Brain Res. 2006;151:1–42. doi: 10.1016/S0079-6123(05)51001-X. [DOI] [PubMed] [Google Scholar]
  14. Büttner U., Helmchen C., Brandt T. Diagnostic criteria for central versus peripheral positioning nystagmus and vertigo: a review. Acta oto-Laryngol. 1999;119(1):1–5. doi: 10.1080/00016489950181855. [DOI] [PubMed] [Google Scholar]
  15. Catani M., ffytche D.H. The rises and falls of disconnection syndromes. Brain. 2005;128(10):2224–2239. doi: 10.1093/brain/awh622. [DOI] [PubMed] [Google Scholar]
  16. Cohen B., Komatsuzaki A., Bender M.B. Electrooculographic syndrome in monkeys after pontine reticular formation lesions. Arch. Neurol. 1968;18(1):78–92. doi: 10.1001/archneur.1968.00470310092008. [DOI] [PubMed] [Google Scholar]
  17. Cooper S., Daniel P.M. Muscle spindles in human extrinsic eye Muscles. Brain. 1949;72(1):1–24. doi: 10.1093/brain/72.1.1. [DOI] [PubMed] [Google Scholar]
  18. Cooper S., Daniel P.M., Whitteridge D. Afferent impulses in the oculomotor nerve, from the extrinsic eye muscles. J. Physiol. 1951;113(4):463–474. doi: 10.1113/jphysiol.1951.sp004588. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Corbetta M., Akbudak E., Conturo T.E., Snyder A.Z., Ollinger J.M., Drury H.A., Linenweber M.R., Petersen S.E., Raichle M.E., Van Essen D.C., Shulman G.L. A common network of functional areas for attention and eye movements. Neuron. 1998;21(4):761–773. doi: 10.1016/s0896-6273(00)80593-0. [DOI] [PubMed] [Google Scholar]
  20. Donaldson I.M. The functions of the proprioceptors of the eye muscles. Philos. Trans. R. Soc. B: Biol. Sci. 2000;355(1404):1685–1754. doi: 10.1098/rstb.2000.0732. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Faisal A.A., Selen L.P.J., Wolpert D.M. Noise in the nervous system. Nat. Rev. Neurosci. 2008;9(4):292–303. doi: 10.1038/nrn2258. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Fries W. Cortical projections to the superior colliculus in the macaque monkey: a retrograde study using horseradish peroxidase. J. Comp. Neurol. 1984;230(1):55–76. doi: 10.1002/cne.902300106. [DOI] [PubMed] [Google Scholar]
  23. Friston K. The free-energy principle: a rough guide to the brain? Trends Cogn. Sci. 2009;13(7):293–301. doi: 10.1016/j.tics.2009.04.005. [DOI] [PubMed] [Google Scholar]
  24. Friston K., Adams R.A., Perrinet L., Breakspear M. Perceptions as Hypotheses: saccades as experiments. Front. Psychol. 2012;3:151. doi: 10.3389/fpsyg.2012.00151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Friston K., Kiebel S. Predictive coding under the free-energy principle. Philos. Trans. R. Soc. B: Biol. Sci. 2009;364(1521):1211. doi: 10.1098/rstb.2008.0300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Friston K., Kilner J., Harrison L. A free energy principle for the brain. J. Physiol. 2006;100(1–3):70–87. doi: 10.1016/j.jphysparis.2006.10.001. [DOI] [PubMed] [Google Scholar]
  27. Friston K., Mattout J., Trujillo-Barreto N., Ashburner J., Penny W. Variational free energy and the Laplace approximation. Neuroimage. 2007;34(1):220–234. doi: 10.1016/j.neuroimage.2006.08.035. [DOI] [PubMed] [Google Scholar]
  28. Friston K., Stephan K., Li B., Daunizeau J. Generalised filtering. Math. Probl. Eng. 2010;2010 [Google Scholar]
  29. Friston K.J., Daunizeau J., Kiebel S.J. Reinforcement learning or active inference? PLOS ONE. 2009;4(7):e6421. doi: 10.1371/journal.pone.0006421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Friston K.J., Parr T., Vries B. d. The graphical brain: belief propagation and active inference. Netw. Neurosci. 2017:1–78. doi: 10.1162/NETN_a_00018. (0(ja)) [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Friston K.J., Rosch R., Parr T., Price C., Bowman H. Deep temporal models and active inference. Neurosci. Biobehav. Rev. 2017;77:388–402. doi: 10.1016/j.neubiorev.2017.04.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Friston K.J., Trujillo-Barreto N., Daunizeau J. DEM: a variational treatment of dynamic systems. NeuroImage. 2008;41(3):849–885. doi: 10.1016/j.neuroimage.2008.02.054. [DOI] [PubMed] [Google Scholar]
  33. Gandhi N.J., Keller E.L. Spatial distribution and discharge characteristics of superior colliculus neurons antidromically activated from the omnipause region in monkey. J. Neurophysiol. 1997;78(4):2221. doi: 10.1152/jn.1997.78.4.2221. [DOI] [PubMed] [Google Scholar]
  34. Henn V. Pathophysiology of rapid eye movements in the horizontal, vertical and torsional directions. Bailliere's. Clin. Neurol. 1992;1(2):373–391. [PubMed] [Google Scholar]
  35. Hikosaka O., Takikawa Y., Kawagoe R. Role of the basal ganglia in the control of purposive saccadic eye movements. Physiol. Rev. 2000;80(3):953. doi: 10.1152/physrev.2000.80.3.953. [DOI] [PubMed] [Google Scholar]
  36. Hikosaka O., Wurtz R.H. Visual and oculomotor functions of monkey substantia nigra pars reticulata. IV. Relation of substantia nigra to superior colliculus. J. Neurophysiol. 1983;49(5):1285. doi: 10.1152/jn.1983.49.5.1285. [DOI] [PubMed] [Google Scholar]
  37. Hikosaka O., Wurtz R.H. Modification of saccadic eye movements by GABA-related substances. II. Effects of muscimol in monkey substantia nigra pars reticulata. J. Neurophysiol. 1985;53(1):292. doi: 10.1152/jn.1985.53.1.292. [DOI] [PubMed] [Google Scholar]
  38. Hohwy J. The self-evidencing brain. Noûs. 2016;50(2):259–285. [Google Scholar]
  39. Holzman P.S., Levy D.L. Smooth pursuit eye movements and functional psychoses: a review. Schizophr. Bull. 1977;3(1):15. doi: 10.1093/schbul/3.1.15. [DOI] [PubMed] [Google Scholar]
  40. Itti L., Koch C. Computational modelling of visual attention. Nat. Rev. Neurosci. 2001;2(3):194–203. doi: 10.1038/35058500. [DOI] [PubMed] [Google Scholar]
  41. Kiebel S.J., Daunizeau J., Friston K.J. A hierarchy of time-scales and the brain. PLoS Comput. Biol. 2008;4(11):e1000209. doi: 10.1371/journal.pcbi.1000209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Krauzlis R.J., Lisberger S.G. A control systems model of smooth pursuit eye movements with realistic emergent properties. Neural Comput. 1989;1(1):116–122. [Google Scholar]
  43. Lee C., Rohrer W.H., Sparks D.L. Population coding of saccadic eye movements by neurons in the superior colliculus. Nature. 1988;332(6162):357–360. doi: 10.1038/332357a0. [DOI] [PubMed] [Google Scholar]
  44. Lipton R.B., Levy D.L., Holzman P.S., Levin S. Eye movement dysfunctions in psychiatric patients: a review. Schizophr. Bull. 1983;9(1):13–32. doi: 10.1093/schbul/9.1.13. [DOI] [PubMed] [Google Scholar]
  45. Lukas J.R., Aigner M., Blumer R., Heinzl H., Mayr R. Number and distribution of neuromuscular spindles in human extraocular muscles. Investig. Ophthalmol. Vis. Sci. 1994;35(13):4317–4327. [PubMed] [Google Scholar]
  46. McSpadden A. Texas Tech University; 1998. A Mathematical Model of Human Saccadic Eye Movement. [Google Scholar]
  47. Mirza M.B., Adams R.A., Mathys C.D., Friston K.J. Scene construction, visual foraging, and active inference. Front. Comput. Neurosci. 2016;10:56. doi: 10.3389/fncom.2016.00056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Munoz D.P., Wurtz R.H. Saccade-related activity in monkey superior colliculus. I. Characteristics of burst and buildup cells. J. Neurophysiol. 1995;73(6):2313. doi: 10.1152/jn.1995.73.6.2313. [DOI] [PubMed] [Google Scholar]
  49. Munoz D.P., Wurtz R.H. Saccade-related activity in monkey superior colliculus. II. Spread of activity during saccades. J. Neurophysiol. 1995;73(6):2334. doi: 10.1152/jn.1995.73.6.2334. [DOI] [PubMed] [Google Scholar]
  50. Parr T., Friston K.J. The active construction of the visual world. Neuropsychologia. 2017;104:92–101. doi: 10.1016/j.neuropsychologia.2017.08.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Parr T., Friston K.J. The computational anatomy of visual neglect. Cereb. Cortex. 2017:1–14. doi: 10.1093/cercor/bhx316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Paus T. Location and function of the human frontal eye-field: a selective review. Neuropsychologia. 1996;34(6):475–483. doi: 10.1016/0028-3932(95)00134-4. [DOI] [PubMed] [Google Scholar]
  53. Perrinet L.U., Adams R.A., Friston K.J. Active inference, eye movements and oculomotor delays. Biol. Cybern. 2014;108(6):777–801. doi: 10.1007/s00422-014-0620-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Perry R.J., Zeki S. The neurology of saccades and covert shifts in spatial attention: an event-related fMRI study. Brain. 2000;123(11):2273–2288. doi: 10.1093/brain/123.11.2273. [DOI] [PubMed] [Google Scholar]
  55. Quaia C., Aizawa H., Optican L.M., Wurtz R.H. Reversible inactivation of monkey superior colliculus. II. Maps of saccadic deficits. J. Neurophysiol. 1998;79(4):2097. doi: 10.1152/jn.1998.79.4.2097. [DOI] [PubMed] [Google Scholar]
  56. Rao R.P., Ballard D.H. Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat. Neurosci. 1999;2(1):79–87. doi: 10.1038/4580. [DOI] [PubMed] [Google Scholar]
  57. Richert M., Nageswaran J.M., Sokol S., Szatmary B., Petre C., Piekniewski F., Izhikevich E. A spiking model of superior colliculus for bottom-up saliency. BMC Neurosci. 2013;14(1):P185. [Google Scholar]
  58. Roberts J.A., Wallis G., Breakspear M. Fixational eye movements during viewing of dynamic natural scenes. Front. Psychol. 2013;4:797. doi: 10.3389/fpsyg.2013.00797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Robinson D. The mechanics of human saccadic eye movement. J. Physiol. 1964;174(2):245–264. doi: 10.1113/jphysiol.1964.sp007485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Robinson D.A. The oculomotor control system: a review. Proc. IEEE. 1968;56(6):1032–1049. [Google Scholar]
  61. Ruskell G.L. The fine structure of human extraocular muscle spindles and their potential proprioceptive capacity. J. Anat. 1989;167:199–214. [PMC free article] [PubMed] [Google Scholar]
  62. Sereno A.B., Holzman P.S. Antisaccades and smooth pursuit eye movements in schizophrenia. Biol. Psychiatry. 1995;37(6):394–401. doi: 10.1016/0006-3223(94)00127-O. [DOI] [PubMed] [Google Scholar]
  63. Seung H.S. Continuous attractors and oculomotor control. Neural Netw. 1998;11(7–8):1253–1258. doi: 10.1016/s0893-6080(98)00064-1. [DOI] [PubMed] [Google Scholar]
  64. Seung H.S., Lee D.D., Reis B.Y., Tank D.W. Stability of the memory of eye position in a recurrent network of conductance-based model neurons. Neuron. 2000;26(1):259–271. doi: 10.1016/s0896-6273(00)81155-1. [DOI] [PubMed] [Google Scholar]
  65. Sherrington C.S. Further experimental note on the correlation of action of antagonistic muscles. Br. Med. J. 1893;1(1693) doi: 10.1136/bmj.1.1693.1218. (1218-1218) [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Shipp S. Neural elements for predictive coding. Front. Psychol. 2016;7:1792. doi: 10.3389/fpsyg.2016.01792. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Sparks D.L. Translation of sensory signals into commands for control of saccadic eye movements: role of primate superior colliculus. Physiol. Rev. 1986;66(1):118. doi: 10.1152/physrev.1986.66.1.118. [DOI] [PubMed] [Google Scholar]
  68. Sparks D.L. The brainstem control of saccadic eye movements. Nat. Rev. Neurosci. 2002;3(12):952–964. doi: 10.1038/nrn986. [DOI] [PubMed] [Google Scholar]
  69. Sparks D.L., Mays L.E. Signal transformations required for the generation of saccadic eye movements. Annu. Rev. Neurosci. 1990;13(1):309–336. doi: 10.1146/annurev.ne.13.030190.001521. [DOI] [PubMed] [Google Scholar]
  70. Strassman A., Highstein S.M., McCrea R.A. Anatomy and physiology of saccadic burst neurons in the alert squirrel monkey. I. Excitatory burst neurons. J. Comp. Neurol. 1986;249(3):337–357. doi: 10.1002/cne.902490303. [DOI] [PubMed] [Google Scholar]
  71. Thaker G.K., Ross D.E., Cassady S.L. Smooth pursuit eye movements to extraretinal motion signals: deficits in relatives of patients with schizophrenia. Arch. Gen. Psychiatry. 1998;55(9):830–836. doi: 10.1001/archpsyc.55.9.830. [DOI] [PubMed] [Google Scholar]
  72. Tomlinson R.D., Schwarz D.W.F. Response of oculomotor neurons to eye muscle stretch. Can. J. Physiol. Pharmacol. 1977;55(3):568–573. doi: 10.1139/y77-079. [DOI] [PubMed] [Google Scholar]
  73. Trappenberg T.P., Dorris M.C., Munoz D.P., Klein R.M. A model of saccade initiation based on the competitive integration of exogenous and endogenous signals in the superior colliculus. J. Cogn. Neurosci. 2001;13(2):256–271. doi: 10.1162/089892901564306. [DOI] [PubMed] [Google Scholar]
  74. Troost B.T. Nystagmus: a clinical review. Rev. Neurol. 1989;145(6–7):417–428. [PubMed] [Google Scholar]
  75. Yoshida K., Iwamoto Y., Chimoto S., Shimazu H. Disynaptic Inhibition of Omnipause Neurons Following Electrical Stimulation of the Superior Colliculus in Alert Cats. J. Neurophysiol. 2001;85(6):2639. doi: 10.1152/jn.2001.85.6.2639. [DOI] [PubMed] [Google Scholar]

RESOURCES