Abstract
Many cognitive theories have described behavior as the summation of independent contributions from separate components. Contrasting views have emphasized the importance of multiplicative interactions and emergent structure. We describe a statistical approach to distinguishing additive and multiplicative processes and apply it to the dynamics of eye movements during classic visual cognitive tasks. The results reveal interaction-dominant dynamics in eye movements in each of the three tasks, and that fine-grained eye movements are modulated by task constraints. These findings reveal the interactive nature of cognitive processing and are consistent with theories that view cognition as an emergent property of processes that are broadly distributed over many scales of space and time rather than a componential assembly line.
Keywords: visual search, lognormal, interaction-dominance, eye movements
Interactions Dominate the Dynamics of Visual Cognition
Cognition is the product of a large, complex system. Like any complex system, the cognitive system consists of components (i.e., constituent parts) that interact with one another. Much of cognitive science has concerned itself with understanding the parts or components: Is there a face recognition module? Does language processing require a rule module? Etc. Other researchers have focused on the interactions and the structure that emerges from those interactions. Clearly, understanding cognition requires understanding both the components and their interactions, but where to begin? That is, should cognition be regarded as component-dominant or interaction-dominant?
The distinction between component-dominance and interaction-dominance is a matter of organization (Van Orden, Holden, & Turvey, 2003). In a system dominated by components, cognitive labor is divided among constituent parts. Specialized functions are locally encapsulated, and cognition is the summed activity of all constituent parts. When interactions dominate, organization is emergent and context-dependent. Functions are not encapsulated – the constituent parts arrange themselves according to the current demands of context. That is, components may flexibly bind together or break down to suit the changing conditions for a given task (Kay, 1988). Here, organization is an emergent coordination, and rather than obeying local divisions into parts, this coordination emerges in response to ongoing changes in information flow (Kelso, 1995).
Controversy in cognitive science
Cognitive science has traditionally focused on identifying the constituent parts of the cognitive architecture. For example, visual cognition would be divided among an attention component, a feature detection component, a recognition component, a feature integration component, etc., and each component would have a separate context-independent function (Treisman, 1993). Each component receives information from the last and, having processed the information in its specialized manner, passes the information on to the next component. The amount of time that it takes to find the target is the sum of processing time needed by each component along the way. Performance differences due to experimental manipulations or individual differences can be reduced to changes within specific components. This formulation remains an influential way to think about cognition (e.g., Coltheart, 1999; Dietrich & Markman, 2003; Marr, 1982; Wagenmakers, Farrell, & Ratcliff, 2004).
However, throughout the history of cognitive science, alternative perspectives have emphasized the role of emergence. Component-dominance makes for concise schematics of cognitive process, but the evidence suggests that components may not be so rigidly specialized and that the interactions may be more fundamental for cognition. For instance, Lashley (1950) and Hebb (1949) argued that learning could not be localized within a specific part of the brain but is encoded as distributed patterns of activation and connection among individual neurons. Based on neuropsychological data, Luria (1973a, 1973b) argued that cognition was based on distributed “functional systems”, rather than components, and that these systems could reorganize. More recently, Parallel Distributed Processing (PDP) has taken up the pursuit of interactivity and emergent structure as the foundation for cognition. PDP stresses interactions among simple processing elements (e.g., microfeatures) and the emergent properties of such systems (e.g., Elman, Bates, Johnson, Karmiloff-Smith, Parisi, & Plunkett, 1996; Rumelhart, McClelland, & the PDP Research Group, 1986). PDP networks begin with a finite set of microfeatures, such as “has wings”, “is yellow”, and “can sing” for canary (e.g., Rogers & McClelland, 2004; McRae, Cree, Seidenberg, & McNorgan, 2005). The emergent structure is limited to the interactions among these microfeatures; thus, the use of microfeatures in PDP may credit undue influence to components. Although the combinatoric variety of emergence is somewhat limited in this case (Boogerd, Bruggeman, Richardson, Stephan, & Westerhoff, 2005; Cariani, 1997), PDP serves as a well-known example of an interaction-based account of cognition.
Distinguishing component-dominance and interaction-dominance
Cognitive scientists interested in the role of interactions have gained new inspiration from work in statistical mechanics on the properties of complex systems. Indeed, even the PDP approach was significantly influenced by work in physics and thermodynamics (Ackley, Hinton, & Sejnowski, 1985; Hopfield, 1982, 1984). More recent complex-systems perspectives on cognition have adopted methods from complexity science in order to distinguish component-dominant dynamics from interaction-dominant dynamics in cognitive behavior. These methods involve assessing the statistical distributions of system behavior (Holden, Van Orden, & Turvey, 2009; Van Orden et al., 2003). We now briefly describe one such method, involving the test of a continuum of distributions. After briefly reviewing the existing evidence of interaction-dominance in cognition, we present new evidence of interaction-dominant dynamics in three classic visual cognitive tasks.
In order to use statistical mechanics techniques to distinguish between component-dominance and interaction-dominance, we take a step away from cognitive issues for a moment and consider the dynamics of random variables. Random variables represent the components composing a system and may assume a variety of values within an arbitrary domain. The behavior of the system depends on the current value of the random variables and the relationship that links them together. In a mathematical description, the two basic relationships are addition and multiplication. Hence, outcomes of system behavior are due to the addition or multiplication of random variables within the system.
These two different mathematical relations lead to different distributions of outcomes. To illustrate this difference, let us consider a system whose outcomes depend only on two random variables at a time, a and b. In the first case, a and b are each independently selected from the domain and then added together. The resulting numbers range from the sum of the two smallest possible values of a and b to the sum of the two largest possible values of a and b. After very many sums of pairs of random variables, the sums will tend to aggregate most heavily half-way between the two extremes. The extremely small sums and extremely large sums will both be rare compared to the median sums. A plot of these frequencies would give us a normal distribution. Most important for our purposes, the role of each participating number is always the same: to increase the sum by a specific amount. That is, addends invariantly serve the same role, much like the parts in a component-dominant theory of cognitive processes.
Now, consider the second case in which a and b are independently selected as before but are now multiplied rather than added together. As before with sums, the products range from the product of the smallest pair of random variables to the product of the largest pair of random variables. However, the landscape between these two extremes is now quite different from the normal distribution. Specifically, the mode and median have shifted to lower values and the tail has become longer: smaller products tend to be more frequent than the larger products and the upper limit on products is much higher than the upper limit on sums. The frequencies of products are best described by a lognormal distribution, a positively skewed curve with a relatively thick tail (e.g., Limpert, Stahel, & Abbt, 2001). This skewed result arises from the fact that the effect of multiplication depends nonlinearly on the values of the elements. Furthermore, the impact of one variable is not independent of the impact of the other: Outcomes due to either random variable can be radically different depending on the current value of the other variable. Just like in an interaction term for a regression, the magnitude of one random variable’s effect will be moderated by the value of the other (e.g., the effect of spelling-sound regularity depends on word frequency: Seidenberg, Waters, Barnes, & Tanenhaus, 1984; Taraban & McClelland, 1987). The behavior of the system in this interaction-dominated situation depends much more on context (i.e., the context of the other variable).
Thus, the test between component-dominance and interaction-dominance boils down to a distinction between additive and multiplicative distributions, respectively. We now describe a continuum from pure additivity/component-dominance to pure multiplicativity/interaction-dominance (Holden et al., 2009; West & Deering, 1995). This set of distributions is not exhaustive (see Nakatani & van Leeuwen, 2008) but captures a broad range of dynamics that might be found in cognition. We describe this continuum in mathematical terms of graded additivity and graded multiplicativity without a theoretical view as to whether systems may fundamentally belong to one or another distribution. At this point, whether cognitive dynamics are additive or multiplicative and how the distributions are influenced by context remain empirical questions. Thus, we take an agnostic position and consider the conventional range of possible distributions from various mixtures of additivity and multiplicativity (Edwards et al., 2007). For the purposes of illustration, Figure 1 shows the probability density functions (pdfs) for each distribution. In cases of strong skew, empirical distributions can be difficult to distinguish visually. Logarithmically scaling both axes can present a more easily legible picture; Figure 2 shows the same pdfs as in Figure 1, only on logarithmic axes.
Pure additivity: The normal distribution
The quintessential portrait of component-dominant dynamics is the normal distribution (Figure 1–Figure 2: top row, left panel). The normal distribution arises when independently-selected distinct components are added as described above. Because distributions of observed behavior are usually skewed (Engbert, Neuthman, & Kliegl, 2007; Andrews & Heathcote, 2001), the normal distribution serves here only as the anchor for the remaining continuum of distributions.
Skewed additivity: The gamma and exponential distributions
Modest multiplicative distortion will turn a symmetric additive model into a skewed distribution (Arellano-Valle, Branco, & Genton, 2006). For example, the exponential (Figure 1–Figure 2) and the gamma distributions (Figure 1–Figure 2: top row, middle and right panels) both reflect summations of independent components (Edwards et al., 2007; Kendal, 2001). The exponential distribution exhibits slightly greater skew. Both exponential and gamma distributions are often used in models of reaction times (Luce, 1986; Miller & Ulrich, 2003) as well as fixation durations (Engbert & Kliegl, 2001).
Multiplicativity among components: The lognormal distribution
The lognormal distribution describes a normal distribution of logarithmically scaled values (Figure 1–Figure 2: bottom row, left panel). Whereas the normal distribution expresses probability p as the sum of two random variables a and b
(1) |
the lognormal distribution expresses the probability p as the sum of logarithmically scaled random variables
(2) |
Because the sum of logarithms is equal to a logarithm of the product:
(3) |
the lognormal distribution arises from multiplicativity. The lognormal distribution retains the independence of individual components but reflects a constrained variety of interaction-dominant dynamics (Farmer, 1990; Holden et al., 2009). The increased multiplicativity is apparent in the greater skew of the lognormal across x in Figure 1–Figure 2 relative to the gamma and the exponential.
Multiplicativity with feedback: The (inverse) power-law distribution
This most-skewed distribution is a multiplicative process in which probabilities decay by a constant ratio (Figure 1–Figure 2: bottom row, right panel; Montroll & Shlesinger, 1982). Power laws entail the weakest role for independent components and the strongest role for interactions, exhibited in the shallowest decay compared to other distributions. The dependence of random variables in the power-law case can be considered a result of resonant feedback dynamics. Under feedback dynamics, the occurrence of an event increases the probability that it will occur again; that is, once a random value is selected, the same (or similar) values for a random variable will be selected multiple times in sequence. The power law distribution arises due to exponentiation of random variables (i.e., ab). Exponentiation relies on multiplication, but now the elements that are multiplied are not independent: small values are multiplied by small values and large values are multiplied by large values. Consequently, the range of the products is amplified. Power laws have been found in a variety of cognitive processes, from word naming reaction times (Van Orden et al., 2003), to self-paced tapping (Lemoine, Torre, & Delignières, 2006), to forgetting curves (Anderson & Schooler, 1991).
In summary, normal distributions reflect pure addition of independent variables, gamma and exponential distributions reflect addition of independent variables with slight multiplicative distortion, lognormal distributions reflect multiplication of independent variables, and power law distributions reflect multiplication of non-independent variables.
Cognitive behavior reflects interaction-dominant dynamics
Recent evidence suggests that interactions dominate cognitive behavior. For instance, variability in cognitive tasks repeated over time may exhibit power-law structure. Specifically, variability over time may exhibit 1/f noise, the power-law decay across frequencies in the power spectrum. That is, response times oscillate over long ranges, and the power (i.e., squared amplitude) of these oscillations diminishes very slowly as the frequency of these oscillations increases. This pattern has been found in response times for mental rotation (Gilden, 2001) and word naming (Van Orden et al., 2003; Holden et al., 2009). Speech appears to follow the same pattern: syllable utterance duration, acoustic power of vocalization at a variety of frequency bandwidths, and intervals between turns taken in a conversational setting all exhibit 1/f noise (Kello, Anderson, Holden, & Van Orden, 2008). In a set of findings convergent with the power-law structure in 1/f noise, the aggregate pdfs of response times in reading printed words aloud followed power-law and lognormal distributions, as well as mixtures of these two distributions (Holden et al., 2009). Task factors such as word ambiguity lengthened reaction times and shifted the distributions to be more like power law than lognormal.
Interaction-dominance in visual cognition
Aks and colleagues have found similar results in visual cognition. For instance, when participants looked at the reversible figure commonly known as the Necker cube, the time intervals between reversals (i.e., perceived changes in the Necker cube’s orientation) exhibited 1/f noise (Aks & Sprott, 2003). In order to manipulate depth cues and duration viewing conditions, participants viewed the Necker cube through red-blue anaglyph eyeglasses for a variety of durations. The Necker cube was presented for 15, 30, or 60 minutes in red and blue contours offset by 0, 1, 2, and 4 seconds of visual arc. Stronger depth cues and briefer viewing conditions increased the exponent of the power-law decay in the power spectrum. In a visual search task that required participants to find an upright “T” amidst rotated “Ts,” Euclidean distances between consecutive gaze positions (sampled at 500 Hz) exhibited 1/f noise (Aks, Zelinsky, & Sprott, 2002).
Stephen, Mirman, Magnuson, and Dixon (2009) examined the dynamics of eye movements while participants followed spoken instructions to click on one of four images presented on a computer screen (i.e., the visual world paradigm, or VWP; Tanenhaus, Spivey-Knowlton, Eberhard, & Sedivy, 1995). Eye movements were sampled at a relatively coarse grain (50 Hz). The aggregate pdf of Euclidean distances between consecutive gaze positions was better fit by a power-law distribution than by a gamma or exponential distribution. Such interaction-based, nonlinear dynamics have been critical to previous – namely PDP – accounts of visual search (e.g., Spivey & Dale, 2004) and VWP (e.g., Magnuson, Tanenhaus, Aslin, & Dahan, 2003).
Previous to Holden et al.’s (2009) inclusion of the lognormal distribution, interaction-dominance in cognitive tasks had been described largely in terms of power laws. So far, there has been a similar reliance upon the power law in tests of interaction-dominance in visual cognition (Aks & Sprott, 2003; Aks et al., 2002; Stephen et al., 2009). This focus on the power-law has led to a dichotomous view in which cognition is either interaction-dominant (and fits the power law distribution) or it is not. As reviewed above, there is a potential continuum of interaction dominance, and the lognormal distribution is a critical point in that continuum that reflects interaction-dominance under constraints, possibly from biological and physical properties. Indeed, lognormal distributions often appear to be more realistic models of biological and neural dynamics than power-law distributions (Hyman et al., 1995; Limpert et al., 2001; Qian & Bassingthwaighte, 2000).
The visual cognitive tasks
We examined where eye movement dynamics fall on a continuum of interaction-dominance in three classic visual cognitive tasks: two visual search tasks (e.g., Treisman & Gelade, 1980) and the VWP. We examined distributions of gaze steps: the Euclidean pixel distance displacements between consecutive gaze positions sampled by an eye-tracking device (as in previous studies: Aks & Sprott, 2003; Aks et al., 2002; Stephen et al., 2009). If cognitive dynamics are interaction-dominant, the observed distributions of gaze steps should be better fit by power-law and lognormal distributions than exponential and gamma distributions.
We also examined how task differences influenced the kind of distributions that unfold. The two visual search tasks were quite similar in nature, requiring the participant to search among distractors for a target that was distinguished by either a single visual feature or the conjunction of two features. The consistency among the targets and the distractors may constitute a constraint, pushing the system toward additive distributions in eye movements. A VWP task was chosen to be a considerably different visual cognitive task: participants were asked to mouse-click the picture of an animal among a set of four pictures (one target picture of an animal and three pictures of non-animal objects). The variety of targets (“animal” for VWP vs. “O” or “green N” for feature and conjunction search) and distractors (non-animal objects vs. brown “Ns” and green “Xs”) was much greater in the VWP task than in the visual search tasks. This wider variety of targets and distractors may place weaker constraints on the participant’s eye movements and thus allow greater interactivity to unfold in their visual exploration, much like ambiguity (spelling-pronunciation inconsistency) shifted distributions of reading times closer to power-law functions (Holden, 2002). Under this hypothesis, the distribution of gaze steps in the VWP task should tend closer to power-law distributions than those in the visual search tasks.
Methods
Participants
Six participants completed the experiment as one option to fulfill a course requirement. All participants reported normal or corrected-to-normal vision. Participants gave informed consent as required by the Institutional Review Board of the University of Connecticut.
Equipment and procedure
Participants were seated with their eyes approximately 27” from a 20” screen with 1024 × 768 resolution. Participants completed three tasks in counterbalanced order: two search tasks (single-feature and conjunction) and one VWP task. Gaze positions were recorded by a remote SR EyeLink 1000 eye-tracker at 500 Hz.
Visual search tasks
Participants completed 24 trials of each visual search task. In the single feature search task, participants had to indicate whether or not the letter “O” was present among distractors that were letters “N” and “X”. In the conjunction search task, participants had to indicate whether or not a green letter “N” was present among distractors that were green “X’s” and brown “N’s”. There were six trials for each set size (1, 5, 15, and 30 letters), and the target letter was present in half of the trials.
VWP task
Participants completed 30 trials of the VWP task. Participants were instructed at the start of the block to mouse-click on the image of an animal in each display. Each display consisted of images of 3 common non-animal objects (e.g., jukebox, mug, plum) and one image of an animal (e.g., gorilla). Each of the four images was displayed near one of the screen corners and the position of each image was randomized.
Data Analysis
Traditional inferential statistical tests (t-test and ANOVA) were used for behavioral data analysis. For distribution fitting, we used the MATLAB Statistics Toolbox for maximum likelihood (ML) estimation. Whereas ordinary least-squares (OLS) evaluates model fit with r-squared, ML evaluates model fit with log-likelihood. For a candidate distribution, ML estimation fits parameters that maximize log-likelihood. The candidate distribution with the highest log-likelihood is taken to be the best fitting distribution (Edwards et al., 2007; Sims et al., 2008). Of course, empirical data are unlikely to conform precisely to any model distribution; rather, we define a sample of candidate distributions and take the best-fitting (that is, most likely) distribution to be the best approximation of the empirical data (Burnham & Anderson, 2002; Schunn & Wallach, 2005). If the best-fitting distribution is additive, this is evidence of component-dominance. If it is multiplicative, it is evidence of interaction-dominance. The relative goodness-of-fit of one distribution over another is expressible by the difference in log-likelihoods of the distributions in question.1 Because the difference of logarithms equals the logarithm of a ratio (i.e., log(b) − log(c) = log(b/c)), this difference in log-likelihoods is called the log ratio (Singer & Willett, 2003).
Results
Behavioral results
Accuracy
All six participants completed all tasks at 87.5% accuracy or higher, and the differences in accuracy among participants were not statistically reliable, F(2, 15) = 2.39, p > .05. (see Table 1)
Table 1.
Task | |||
---|---|---|---|
Participant | VWP | Simple-Feature | Conjunction |
1 | 100.00% | 87.50% | 100.00% |
2 | 100.00% | 100.00% | 95.83% |
3 | 100.00% | 95.83% | 87.50% |
4 | 100.00% | 95.83% | 91.67% |
5 | 100.00% | 100.00% | 100.00% |
6 | 100.00% | 100.00% | 100.00% |
Reaction time
Reaction times were not significantly different between the VWP task and the visual search tasks, t(12) = .10, p > .05. For the visual search tasks, there were main effects of task, F(1, 5) = 39.67, p < .01, and set size, F(3, 15) = 45.11, p < .0001, and a significant interaction of task with set size, F(3, 15) = 20.66, p < .0001. Figure 3 shows the mean reaction times for each participant across the two different visual search tasks. The interaction reflects the strong effect of set size on conjunction-feature search and relatively weak effect of set size on single-feature search – a replication of the classic visual-search result (e.g., Triesman & Gelade, 1980).
Relative likelihood estimation
Gaze step size
Gaze steps for the VWP task (M = 2.45, SE = .14) were marginally (F(2, 10) = 3.73, p = .06) larger than for the single-feature search task (M = 1.98, SE = .35) or the conjunction search task (M = 1.84, SE = .23).
Probability distribution functions
Figure 4–Figure 6 show empirical pdfs by participant for the VWP task, the single-feature search task, and the conjunction search task, respectively. Logarithmic axes are used for ease of visualization.
Basic model fits
We tested the model fit of exponential, gamma, lognormal, and power-law distributions to the distributions of gaze steps for each task. Table 2 shows the log-likelihood for each fit by task and by participant. Higher log-likelihood indicates better model fit. For the VWP task, one participant exhibited power-law-distributed gaze steps, and five participants exhibited lognormal-distributed gaze steps. For visual search, gaze steps were all lognormal-distributed. These multiplicative distributions of eye movements suggest that interactions dominate cognition in these tasks. Fits for the four distributions tested are shown on the same axes as empirical pdfs in Figure 4–Figure 6: lognormal and power-law distributions were better fits than gamma and exponential distributions. Often, the best fitting lognormal and power-law distributions were quite similar on simply visual inspection.
Table 2.
Distribution | |||||
---|---|---|---|---|---|
Participant | Task | Gamma | Exponential | Lognormal | Pareto |
1 | VWP | −38312.27 | −50889.10 | 27925.59 | −27532.58* |
S-F | −25010.68 | −31945.15 | −10978.22* | −12496.94 | |
Conj | −19414.35 | −28503.78 | −3140.16* | −4973.41 | |
2 | VWP | −50987.68 | −57705.02 | −39835.88* | −40853.14 |
S-F | −65568.10 | −70883.54 | −48376.19* | −51439.16 | |
Conj | −58913.71 | −62164.20 | −44955.34* | −48016.11 | |
3 | VWP | −48481.12 | −54565.41 | −35671.84* | −36997.10 |
S-F | −68171.09 | −76503.19 | −49796.42* | −52119.44 | |
Conj | −63501.46 | −68963.30 | −47271.86* | −49988.98 | |
4 | VWP | −20099.90 | −23791.34 | −13920.05* | −14375.34 |
S-F | −50106.03 | −59003.13 | −24100.57* | −28385.39 | |
Conj | −45057.44 | −54031.50 | −19867.38* | −23923.36 | |
5 | VWP | −48844.32 | −57470.51 | −36892.35* | −37408.63 |
S-F | −62020.48 | −74280.42 | −41675.59* | −43407.09 | |
Conj | −38120.47 | −45336.28 | −24246.53* | −25622.05 | |
6 | VWP | −59391.24 | −78970.37 | −35907.80* | −36646.79 |
S-F | −72909.36 | −96165.02 | −42515.25* | −44195.32 | |
Conj | −69789.61 | −97055.82 | −42278.09* | −42790.18 |
Task differences
The distributions of gaze steps were influenced by task, suggesting that they were not simply generic products of oculomotor dynamics. We examined the log ratio between lognormal and power-law fit to reveal how task influenced the dynamics of eye movements.
Figure 7 shows that the difference in log-likelihoods favoring lognormal over power-law fit was significantly lower in the VWP task (M = 610.01, SE = 240.34) than in either the single-feature search task (M = 2259.10, SE = 519.48) or the conjunction search task (M = 2433.67, SE = 437.10). An F-test comparing these differences in average log-likelihood by task was significant, F(2, 10) = 11.37, p < .01. That is, relative to visual search, the cognitive dynamics underlying performance in the VWP task were closer to power-law. Far from being an outlier, Participant 1 reflected a general tendency for gaze-step distributions to move towards power-law structure in the VWP task. This result is consistent with the hypothesis that cognitive behavior appears more interaction-dominant when performing a less constrained task.
The distributions were equally lognormal for both single-feature and conjunction search (t(4) = .257, p > .40), consistent with interactive models that propose the same underlying process for both types of search (e.g., Spivey & Dale, 2004), rather than componential models that propose fundamentally different processes (e.g., Triesman & Gelade, 1980). Note also that the heavier tail of a power-law distribution would produce the slightly larger average gaze step size found for VWP than visual search.
Discussion
The present research examined the dynamics of the cognitive system during three visual cognitive tasks. The distributions of gaze steps were best fit by lognormal or power-law distributions, which is consistent with the hypothesis that cognitive dynamics are interaction-dominant. We also found task-specific differences in the shape of gaze step distributions, indicating that task differences shifted cognitive dynamics along a continuum of interaction-dominance between lognormal and power-law distributions, consistent with previous work suggesting that the apparent degree of interaction-dominance depends on task constraints (Aks & Sprott, 2003; Kello et al., 2008). These findings of lognormal distributions in gaze steps and task-modulation of distribution shape provide new evidence of system-wide interactivity in visual cognition.
Interactions appear to dominate the dynamics of visual cognition
The classic description of visual perception and cognition rests on a componential division of labor. Light hits the retina, progresses through a sequence of ever more abstract levels of processing that elicit the intention to shift gaze, and this intention is then executed by the peripheral oculomotor system. Under a strictly componential view, information from these different levels would be combined in an additive manner, eventually producing an additive distribution of eye-movements indicative of component-dominant processing. There is, however, tremendous evidence of interactivity in the visual system: from extra-classical receptive fields (e.g., Angelucci et al., 2002) to visual illusions (e.g., Lee & Nguyen, 2001) to context effects on object recognition (e.g., Bar, 2004). Interaction-dominant dynamics suggest a major departure from the classic description. Under interaction-dominant dynamics, the structure of the cognitive system is emergent and the organization is fluidly dependent on the ongoing flow of information through the system. The individual parts of the visual system assume functions and relationships that may change dramatically depending on context.
Borrowing analytical techniques from statistical mechanics and complexity science, we present a different perspective on components and interactions in cognitive systems. A system of stable, independent components – as described in traditional cognitive science theories, particularly those that emphasize modularity (e.g., Coltheart, 1999; Dietrich & Markman, 2003; Fodor, 1983; Marr, 1982; Wagenmakers, Farrell, & Ratcliff, 2004) – would produce additive normal distributions of behavior. A system of relatively stable but highly interactive components would produce multiplicative lognormal distributions; and an interactive system with broad interdependence and feedback among components would produce power-law distributions. The present findings of lognormal and power-law distributions and task sensitivity in eye movement dynamics suggest that cognition (at least visual cognition) is dominated by interactive processes and emergent structure. This view is closely related to perspectives in which cognition is an emergent property of interactions of simple processing components rather than the sum of independent sub-systems.
It is important to stress the conceptual distance between empirical and theoretical distributions. Although lognormal distributions were generally the best fitting distribution, there were small but systematic departures of the empirical distributions from the best-fitting theoretical distributions. The peaks fell short of the fitted curves (i.e. to the left), and the extreme ends of the tails tended to sprawl above and beyond the fitted curves (i.e., to the right). First, there is no guarantee that an empirical distribution will be fit perfectly by any theoretical distribution (Schunn & Wallach, 2005). Second, if we permit the possibility that the cognitive system may exhibit behavior falling along a continuum of distributions, there must necessarily be mixtures of distributions as the cognitive system moves from one mode to another. It is possible that, as Holden et al. (2009) found in distributions of response time, these distributions of gaze steps are mixtures of power law and lognormal functions. We were interested only in demonstrating the generally interaction-dominated structure. Future work might investigate the relative advantage of such mixtures, with the caveat that systems whose dynamics generally reflect multiplicativity may not simply be the added sum of two multiplicative distributions but may involve multiplications of constituent distributions.
Interactivity and strategies for computational models of cognition
An emphasis on emergent structure is found in a range of theories of cognitive processing, particularly those that fall within the PDP framework. Interactivity and nonlinear (multiplicative) processing are central principles in the PDP framework (e.g., McClelland, 1993) and one critical motivation for the development of PDP models of cognition has been to provide an account of cognition in which structure is emergent rather than stipulated (e.g., McClelland, Plaut, Gotts, & Maia, 2003). Although these models have typically been applied to task-specific performance scores, the approach we have taken here examined fine-grained (i.e., densely sampled, at 500 Hz) dynamics of eye-movements and yielded new evidence of interaction-dominance in visual cognition.
Despite the shared theme of interactivity, interaction-dominant dynamics makes a few substantial departures from the PDP approach. For instance, the framework of interaction-dominant dynamics is purposefully agnostic about specifying a preexisting architecture. The reason for this agnosticism is the very point that interaction-dominated systems are inherently context specific: architecture will evolve based on the flow of information. In the PDP framework, a baseline or input/output structure is specified and the focus is on the emergent mediating (often called “hidden level”) structure. Interaction-dominant dynamics take this one step further to allow the architecture itself to emerge. Information flow stirs a structurally uncommitted system into an orchestra of subsystems and parts. In this sense, interaction-dominant approaches are slightly more similar to computational modeling in liquid state machines (LSMs; Maass, Natschläger, & Markram, 2002). Whereas PDP networks are built to solve specific tasks, LSMs are neural nets that are built without predesigned functions or predetermined networks of connectivity between nodes (Morse & Aktius, 2009). Nodes in LSMs are connected randomly but recurrently, prone to any of a wide number of possible large-scale nonlinearities in the presence of activation. The architecture is uncommitted, and the flow of activation (i.e., information) governs the coordinations between nodes.
Concluding remarks
A componential division of labor would predict additive distributions and no modulation by task (e.g., oculomotor control mechanisms independently executing cognitive commands). Component-dominant theories describe cognition as the driver at the wheel of a biological automobile: the fine-grained mechanics of the engine, the valves, and the wheels remain the same no matter the destination, and the driver only changes the broader scales of dynamics. In contrast, we found multiplicative distributions and modulation by task. That is, cognition extended far beyond the intentional, abstract levels of processing -- even the biological nuts and bolts of motor periphery showed sensitivity to task. These results indicate that cognition is broadly distributed over many anatomical scales and that looking behavior is an emergent property of the entire system.
Acknowledgments
Partial support was provided by NIH grant HD052364 to D. M. We thank James Magnuson and James Dixon for their helpful comments and also Katie Haggans, Lindsey Maddern, and Lauren Long for their assistance.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Although the use of ML estimation and log-likelihood comparisons is common to other analyses such as generalized and mixed linear models (Breslow & Clayton, 1993; Singer & Willett, 2003), the likelihood ratio test for distribution-fitting reported here is a distinct analysis used for statistical mechanics.
References
- Ackley DH, Hinton GE, Sejnowski TJ. A learning algorithm for Boltzmann machines. Cognitive Science. 1985;9:147–169. [Google Scholar]
- Aks DJ, Sprott JC. Resolving perceptual ambiguity in the Necker Cube: A dynamical systems approach. Journal of Nonlinear Dynamics & the Life Sciences. 2003;7:159–178. [Google Scholar]
- Aks DJ, Zelinsky G, Sprott JC. Memory across eye-movements: 1/f dynamic in visual search. Journal of Nonlinear Dynamics & the Life Sciences. 2002;6:1–25. [Google Scholar]
- Anderson JR, Schooler LJ. Reflections of the environment in memory. Psychological Science. 1991;2:396–408. [Google Scholar]
- Andrews S, Heathcote A. Distinguishing common and task-specific processes in word identification: A matter of some moment? Journal of Experimental Psychology: Learning, Memory, & Cognition. 2001;27:514–544. doi: 10.1037/0278-7393.27.2.514. [DOI] [PubMed] [Google Scholar]
- Angelucci A, Levitt JB, Walton EJS, Hupe´ J-M, Bullier J, Lund JS. Circuits for local and global signal integration in primary visual cortex. Journal of Neuroscience. 2002;22(19):8633–8646. doi: 10.1523/JNEUROSCI.22-19-08633.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arellano-Valle RB, Branco MD, Genton MG. A unified view on skewed distributions arising from selections. Canadian Journal of Statistics. 2006;33:561–574. [Google Scholar]
- Bar M. Visual objects in context. Nature Reviews Neuroscience. 2004;5:617–629. doi: 10.1038/nrn1476. [DOI] [PubMed] [Google Scholar]
- Boogerd FC, Bruggeman FJ, Richardson RC, Stephen A, Westerhoff HV. Emergence and its place in nature: A case study of biochemical networks. Synthese. 2005;145:131–164. [Google Scholar]
- Burnham KP, Anderson DR. Model selection, and multimodel inference: A practical information-theoretic approach. New York: Springer; 2002. [Google Scholar]
- Cariani P. Emergence of new signal-primitives in neural systems. Intellectica. 1997;2:95–143. [Google Scholar]
- Coltheart M. Modularity and cognition. Trends in Cognitive Sciences. 1999;3:115–120. doi: 10.1016/s1364-6613(99)01289-9. [DOI] [PubMed] [Google Scholar]
- Dietrich E, Markman AB. Discrete thoughts: Why cognition must use discrete representations. Mind and Language. 2003;18:95–119. [Google Scholar]
- Edwards AM, Phillips RA, Watkins NW, Freeman MP, Murphy EJ, Afanasyev V, Buldyrev SV, da Luz MGE, Raposo EP, Stanley HE, Viswanathan GM. Revisiting Lévy flight search patterns of wandering albatrosses, bumblebees, and deer. Nature. 2007;449:1044–1048. doi: 10.1038/nature06199. [DOI] [PubMed] [Google Scholar]
- Elman JL, Bates EA, Johnson MH, Karmiloff-Smith A, Parisi D, Plunkett K. Rethinking innateness: A connectionist perspective on development. Cambridge, MA: MIT Press; 1996. [Google Scholar]
- Engbert R, Kliegl R. Mathematical models of eye movements in reading: A possible role for autonomous saccades. Biological Cybernetics. 2001;85:77–87. doi: 10.1007/PL00008001. [DOI] [PubMed] [Google Scholar]
- Engbert R, Nuthmann A, Kliegl R. An iterative algorithm for the estimation of the distribution of mislocated fixations during reading. In: Van Gompel RPG, Fischer MH, Murray WS, Hill RL, editors. Eye movements: A window on the mind and brain. Oxford: Elsevier; 2007. pp. 319–337. [Google Scholar]
- Farmer JD. A Rosetta stone for connectionism. Physica D. 1990;42:153–187. [Google Scholar]
- Fodor JA. Modularity of mind: An essay of faculty psychology. Cambridge, MA: MIT Press; 1983. [Google Scholar]
- Gilden DL. Cognitive emissions of 1/f noise. Psychological Review. 2001;108:33–56. doi: 10.1037/0033-295x.108.1.33. [DOI] [PubMed] [Google Scholar]
- Hebb D. The organization of behavior. New York: Wiley; 1949. [Google Scholar]
- Holden JG. Fractal characteristics of response time variability. Ecological Psychology. 2002;14:53–86. [Google Scholar]
- Holden JG, Van Orden GC, Turvey MT. Dispersion of response times reveals cognitive dynamics. Psychological Review. 2009;116:318–342. doi: 10.1037/a0014849. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hopfield JJ. Neural networks and physical systems with emergent collective computational abilities. Proceedings of the National Academy of Sciences. 1982;79:2554–2558. doi: 10.1073/pnas.79.8.2554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hopfield JJ. Neurons with graded response have collective computational properties like those of two-state neurons. Proceedings of the National Academy of Sciences. 1984;81:3088–3092. doi: 10.1073/pnas.81.10.3088. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hyman BT, West HL, Rebeck GW, Buldyrev SV, Mantegna RN, Uklejal M, Havlin S, Stanley HE. Quantitative analysis of senile plaques in Alzheimer disease: Observation of log-normal size distribution and molecular epidemiology of differences associated with apolipoprotein E genotype and trisomy 21 (Down syndrome) Proceedings of the National Academy of Science. 1995;92:3586–3590. doi: 10.1073/pnas.92.8.3586. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kay BA. The dimensionality of movement trajectories and the degrees of freedom problem: A tutorial. Human Movement Science. 1988;7:343–364. [Google Scholar]
- Kello CT, Anderson GG, Holden JG, Van Orden GC. The pervasiveness of 1/f scaling in speech reflects the metastable basis for cognition. Cognitive Science. 2008;32:1217–1231. doi: 10.1080/03640210801944898. [DOI] [PubMed] [Google Scholar]
- Kelso JAS. Dynamic patterns: The self-organization of brain and behavior. Cambridge, MA: MIT Press; 1995. [Google Scholar]
- Kendal WS. A stochastic model for the self-similar heterogeneity of regional organ blood flow. Proceedings of the National Academy of Sciences. 2001;98:837–841. doi: 10.1073/pnas.021347898. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lashley KS. In search of the engram. Society of Experimental Biology Symposium. 1950;4:454–482. [Google Scholar]
- Lee TS, Nguyen M. Dynamics of subjective contour formation in the early visual cortex. Proceedings of the National Academy of Sciences. 2001;98:1907–1911. doi: 10.1073/pnas.031579998. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lemoine L, Torre K, Delignières D. Testing for the presence of 1/f noise in continuation tapping. Canadian Journal of Experimental Psychology. 2006;60:247–257. doi: 10.1037/cjep2006023. [DOI] [PubMed] [Google Scholar]
- Limpert E, Stahel WA, Abbt M. Log-normal distributions across the sciences. Bioscience. 2001;51:341–352. [Google Scholar]
- Luce RD. Response times: Their role in inferring elementary mental organization. New York: Oxford University Press; 1986. [Google Scholar]
- Luria AR. Neuropsychological studies in the USSR: A review (part 1) Proceedings of the National Academy. 1973a;70:959–964. doi: 10.1073/pnas.70.3.959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luria AR. Neuropsychological studies in the USSR: A review (part 1) Proceedings of the National Academy. 1973a;70:1278–1283. doi: 10.1073/pnas.70.4.1278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maass W, Natschläger T, Markram H. Real-time computing without stable states: A new framework for neural computation based on perturbations. Neural Computation. 2002;14:2531–2560. doi: 10.1162/089976602760407955. [DOI] [PubMed] [Google Scholar]
- Magnuson JS, Tanenhaus MK, Aslin RN, Dahan D. The time course of spoken word learning and recognition: Studies with artificial lexicons. Journal of Experimental Psychology: General. 2003;132(2):202–227. doi: 10.1037/0096-3445.132.2.202. [DOI] [PubMed] [Google Scholar]
- Marr D. Vision. San Francisco: Freeman; 1982. [Google Scholar]
- McClelland JL. Towards a theory of information-processing in graded, random, interactive networks. In: Meyer DE, Kornblum S, editors. Attention & Performance XIV: Synergies in experimental psychology and cognitive neuroscience. Cambridge, MA: MIT Press; 1993. pp. 655–688. [Google Scholar]
- McClelland JL, Plaut DC, Gotts SJ, Maia TV. Developing a domain-general framework for cognition: What is the best approach? Commentary on a target article by Anderson and Lebiere. Behavioral & Brain Sciences. 2003;26:611–614. [Google Scholar]
- McRae K, Cree GS, Seidenberg MS, McNorgan C. Semantic feature production norms for a large set of living and nonliving things. Behavior Research Methods. 2005;37:547–559. doi: 10.3758/bf03192726. [DOI] [PubMed] [Google Scholar]
- Miller J, Ulrich R. Simple reaction time and statistical facilitation: A parallel-grains model. Cognitive Psychology. 2003;46:101–151. doi: 10.1016/s0010-0285(02)00517-0. [DOI] [PubMed] [Google Scholar]
- Montroll EW, Shlesinger MF. On 1/f noise and other distributions with long tails. Proceedings of the National Academy of the Sciences. 1982;79:3380–3383. doi: 10.1073/pnas.79.10.3380. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morse A, Aktius M. Dynamic liquid association: Complex learning without implausible guidance. Neural Networks. 2009;22:875–889. doi: 10.1016/j.neunet.2008.10.008. [DOI] [PubMed] [Google Scholar]
- Nakatani C, van Leeuwen C. Multi-modality and non-normality in fixation duration distribution. Journal of Eye Movement Research. 2008;1:1–12. [Google Scholar]
- Qian H, Bassingthwaighte JB. A class of flow bifurcation models with lognormal distribution and fractal dispersion. Journal of Theoretical Biology. 2000;205:261–268. doi: 10.1006/jtbi.2000.2060. [DOI] [PubMed] [Google Scholar]
- Rogers TT, McClelland JL. Semantic cognition: A parallel distributed processing approach. Cambridge, MA: MIT Press; 2004. [DOI] [PubMed] [Google Scholar]
- Rumelhart DE, McClelland JL the PDP Research Group. Parallel distributed processing: Explorations in the microstructure of cognition. Volume I. Cambridge, MA: MIT Press; 1986. [Google Scholar]
- Schunn CD, Wallach D. Evaluating goodness-of-fit in comparison of models to data. In: Tack W, editor. Psychologie der Kognition: Reden and Vortrage anlässlich der Emeritierung von Werner Tack. Saarbrueken: University of Saarland Press; 2005. pp. 115–154. [Google Scholar]
- Seidenberg MS, Waters GS, Barnes MA, Tanenhaus MK. When does irregular spelling or pronunciation influence word recognition? Journal of Verbal Learning & Verbal Behavior. 1984;23:383–404. [Google Scholar]
- Sims DW, Southall EJ, Humphries NE, Hays GC, Bradshaw CJA, Pitchford JW, James A, Ahmed MZ, Brierly AS, Hindell MA, Morritt D, Musyl MK, Righton D, Shepard ELC, Wearmouth VJ, Wilson RP, Witt MJ, Metcalfe JD. Scaling laws of marine predator search behavior. Nature. 2008;451:1098–1102. doi: 10.1038/nature06518. [DOI] [PubMed] [Google Scholar]
- Singer JD, Willett JB. Applied longitudinal data analysis: Modeling change and event occurrence. New York: Oxford University Press; 2003. [Google Scholar]
- Spivey M, Dale R. On the continuity of mind: Toward a dynamical account of cognition. In: Ross B, editor. The psychology of learning & motivation. Volume 45. San Diego: Elsevier; 2004. pp. 87–142. [Google Scholar]
- Stephen DG, Mirman D, Magnuson JS, Dixon JA. Lévy-like diffusion in eye movements during spoken-language comprehension. Physical Review E. 2009;79 doi: 10.1103/PhysRevE.79.056114. 056114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tanenhaus MK, Spivey-Knowlton MJ, Eberhard KM, Sedivy JC. Integration of visual and linguistic information in spoken language comprehension. Science. 1995;268:1632–1634. doi: 10.1126/science.7777863. [DOI] [PubMed] [Google Scholar]
- Taraban R, McClelland JL. Conspiracy effects in word pronunciation. Journal of Memory and Language. 1987;26:608–631. [Google Scholar]
- Treisman AM. The perception of features and objects. In: Baddeley AD, Weiskrantz L, editors. Attention: Selection, awareness, and control. A tribute to Donald Broadbent. New York, NY: Oxford University Press; 1993. pp. 5–35. [Google Scholar]
- Treisman AM, Gelade G. A feature-integration theory of attention. Cognitive Psychology. 1980;12:97–136. doi: 10.1016/0010-0285(80)90005-5. [DOI] [PubMed] [Google Scholar]
- Van Orden GC, Holden JG, Turvey MT. Self-organization of cognitive performance. Journal of Experimental Psychology: General. 2003;132:331–350. doi: 10.1037/0096-3445.132.3.331. [DOI] [PubMed] [Google Scholar]
- Wagenmakers E-J, Farrell S, Ratcliff R. Estimation and interpretation of 1/f noise in human cognition. Psychonomic Bulletin & Review. 2004;11:579–615. doi: 10.3758/bf03196615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- West BJ, Deering B. The lure of modern science. River Edge, NJ: World Scientific; 1995. [Google Scholar]