Abstract
Previous studies of global–local processing in autism spectrum disorders (ASDs) have indicated mixed findings, with some evidence of a local processing bias, or preference for detail-level information, and other results suggesting typical global advantage, or preference for the whole or gestalt. Findings resulting from this paradigm have been used to argue for or against a detail focused processing bias in ASDs, and thus have important theoretical implications. We applied Systems Factorial Technology, and the associated Double Factorial Paradigm (both defined in the text), to examine information processing characteristics during a divided attention global–local task in high-functioning individuals with an ASD and typically developing controls. Group data revealed global advantage for both groups, contrary to some current theories of ASDs. Information processing models applied to each participant revealed that task performance, although showing no differences at the group level, was supported by different cognitive mechanisms in ASD participants compared to controls. All control participants demonstrated inhibitory parallel processing and the majority demonstrated a minimum-time stopping rule. In contrast, ASD participants showed exhaustive parallel processing with mild facilitatory interactions between global and local information. Thus our results indicate fundamental differences in the stopping rules and channel dependencies in individuals with an ASD.
Keywords: Autism spectrum disorders, Information processing, Mathematical modeling, Serial, Parallel, Double factorial, Parallel interactive processing, Global and local perception, Divided attention, Weak Central Coherence
1. Introduction
Autism Spectrum Disorders (ASDs) are characterized by a triad of symptoms: impaired social interactions, deficits in communication, and the presence of repetitive behaviours and interests. This spectrum encompasses a wide range of clinical presentations, symptom severity, and associated strengths and disabilities. Autistic Disorder, Asperger’s Disorder, and Pervasive Developmental Disorder, Not Otherwise Specified (DSM-IV-TR American Psychiatric Association, 2000) typically comprise the autism spectrum. Although much research has focused on characterizing atypical behaviour in ASDs, there has been less progress in revealing the cognitive or perceptual mechanisms that underlie the core differences. Given the heterogeneity of the spectrum, observed symptoms and features may well have various explanations. There is growing discussion of multiple “autisms” and thus, an increased focus on identifying and characterizing meaningful subgroups, or “informative phenotypes” (see Szatmari et al., 2007). An important step toward this goal is the elucidation of mechanisms that underlie core features of behaviour. This knowledge may lead to the identification of an informative phenotype of ASD, as well as provide critical information for the development of interventions that can be individually tailored according to this information.
The current study characterizes information processing approaches in ASDs in one standard perceptual task using sophisticated, theoretically driven mathematical modeling techniques. Our central experimental paradigm, the global–local task, has proven extremely useful in basic cognitive and perceptual research, as well as in a wide array of applied studies, including the perceptual–cognitive functioning of ASD patients. Nonetheless, even in the ‘basic’ domain, the information processing mechanisms underpinning performance on this paradigm have never been uncovered. In this investigation, we utilized theory-driven methodologies that can identify these mechanisms, as well as how they differ across individuals and clinical groups. The methodologies are quite powerful and robust since the classes of mathematical models are parameter-free. The latter statement is analogous to being able to perform multivariate ANOVAs without having to assume normality, homogeneity of variances, and so on. Our data and theoretical analyses identify striking differences between ASD and control participants that fail to support previous claims about how individuals with an ASD approach global–local processing.
Atypical perceptual and cognitive processes are a central feature of ASDs and have been the focus of much research in recent years. Although social deficits are the hallmark of ASDs, many previous findings indicate that differences in autism extend beyond social dysfunction and that understanding non-social features may provide important insights into the nature of the disorder. Furthermore, recent research (Happé & Ronald, 2008) indicates that the social and non-social features of ASDs are “fractionable” (i.e., do not necessarily co-occur or have a common genetic basis), suggesting that examining specific non-social differences in ASDs is warranted. It is also possible that there are relationships between some aspects of non-social processing and social deficits that may be elucidated through characterization of basic perceptual and cognitive processes.
Clinically, individuals with an ASD are often highly attuned to details and demonstrate narrow interests and repetitive behaviours. In the laboratory, many studies have examined perceptual processing in ASDs, and results strongly suggest a “detail focused processing style” (for a review, see Happé & Frith, 2006). One explanation for these findings argues that attention is biased toward details, rather than the whole or gestalt, and is referred to as the Weak Central Coherence theory of ASD (WCC; Frith, 1989; Frith & Happé, 1994; Happé & Frith, 2006)). Central coherence refers to the ability to integrate various pieces of information into a meaningful whole. This ability, according to research findings, is “weaker” in those with an ASD relative to their typically developing peers. The bias toward attending to details may occur at the expense of the overall meaning or the whole. Alternatively, some researchers (Mottron, Burack, Iarocci, Belleville, & Enns, 2003; Plaisted, Dobler, Bell, & Davis, 2006) have stated that processing of configurations/gestalt is intact, but individuals with an ASD also demonstrate enhanced processing of features. Although these explanations are quite similar and both can account for many of the reported differences in the processing of details in ASDs, there is a subtle and important difference between them. The WCC theory assumes an atypical bias toward details rather than configurations or gestalt, whereas the “Enhanced Perceptual Functioning” view emphasizes superior processing of details in the context of typical gestalt/configural processing. It is important to note, however, that both theories predict atypical global–local processing in ASDs and that local processing, in particular, would be expected to differ relative to performance by controls. These claims were examined in the current study.
For some tasks, such as visual search, attention to detail lends itself to superior performance in those with an ASD (O’Riordan & Plaisted, 2001; Plaisted, O’Riordan, & Baron-Cohen, 1998). One example that contributed to the development of the WCC theory of ASDs was the finding of superior performance on the Children’s Embedded Figures Test (Shah & Frith, 1983). Participants with an ASD showed more accurate identification of a smaller figure embedded within a meaningful larger figure than control participants, presumably because the larger figure interfered less for the ASD group relative to the controls. It is important to note, however, that not all studies have demonstrated support for WCC (e.g. Rinehart, Bradshaw, Simon, Brereton, & Tonge, 2000; Rutherford, Richards, Moldes, & Sekuler, 2007). These studies suggest typical attention to details, rather than a bias for processing and attending to details, and suggest that there is still much to be learned about the contexts in which a local processing bias may be present or absent in ASDs. Furthermore, it is not yet clear if a detail focused bias or enhanced processing of details is a common feature of individuals with an ASD or present in only a subset.
Perceptual processing of parts versus wholes has been studied extensively in non-clinical populations using hierarchical figures. A robust set of findings supports the global-precedence hypothesis (Navon, 1977) which proposes that people process the global stimulus information prior to or without interference from the local stimulus details (for a review, see Kimchi, 1992). To examine this effect, hierarchical stimuli are employed in a cognitive paradigm often referred to as a global–local task (Navon, 1977). Hierarchical stimuli are visual objects consisting of a global shape formed by an appropriate configuration of local elements. For example, a set of local elements might be the typed character 2 (the size of this text) arranged into a single, larger number 2.
In a typical global–local selective attention task, participants are instructed to identify the local or global level stimuli from a set of two target choices (e.g., are the local elements 1’s or 2’s?). Global–local divided attention tasks require participants to attend to both the global and local levels simultaneously (e.g., is there a 1 present anywhere?). There are two typical findings on these tasks in control participants supporting the global-precedence hypothesis: global advantage and global interference. Global advantage, or global superiority, is the empirical finding that people make faster responses and fewer errors when identifying the global configuration, relative to local elements. Global interference arises from employing a Stroop-like or Garner (1974) interference task, in which the global and local levels may be congruent (e.g. global number 1 composed of local number 1 elements) or incongruent (e.g. global number 1 composed of local number 2 elements). Global interference is the empirical finding that conflicting information within incongruent stimuli results in more errors and slower responses to the local elements, but no such inhibitory interference is evident in the global responses. Together, global interference and global advantage are interpreted as evidence that image processing proceeds in a global-to-local fashion.
Returning to cognitive theories of ASDs, it was proposed that global–local processing in ASDs is characterized by an absence of global precedence, as well as a lack of interaction between global and local information (Mottron & Belleville, 1993). However, global–local experimental findings to date are inconsistent, which may reflect the variability and limitations of the methods employed. Studies that have examined global–local processing of hierarchical stimuli with selective attention tasks have generally found conflicting results. For example, Plaisted, Swettenham, and Rees (1999) found no differences between ASDs and typically developing participants, indicating global advantage and global interference for both groups. Rinehart and colleagues (Rinehart et al., 2000) also demonstrated global advantage and global interference in ASDs. However, they also found that two ASD subgroups (high-functioning autism and Asperger’s Disorder) exhibited slower global response times (RTs) when local information was incongruent than when local information was congruent, indicating a local interference effect. Rinehart and colleagues concluded that their findings provided evidence for a lack of global precedence, as described by Mottron and Belleville (1993). Studies using a divided attention global–local task have shown a lack of global advantage in ASDs (Plaisted et al., 1999) and a deficit in shifting attention from local to global, but not in the opposite direction (Rinehart, Bradshaw, Moss, Brereton, & Tonge, 2001).
Although the few studies that have been completed in ASDs suggest atypical global–local processing, perhaps consistent with a detail focused processing style, there are limitations with the foregoing studies. Importantly, certain essential ingredients of the paradigms used in cognitive psychological investigations (the discussion of which is beyond the scope of the current paper) appear to have been lost in the translation to studies of clinical samples. We are currently pursuing a deeper investigation of these paradigms and their impact on conclusions about global–local perception in ASDs. Furthermore, there is currently no information about the mechanisms that may contribute to atypical global–local processing.
The bias to attend to details appears to be relatively robust in ASDs, but, as we have described, is not always supported by research findings. The theory of WCC, as well as the concept of Enhanced Perceptual Functioning, appeals to ASD researchers and clinicians because these ideas capture important features of ASDs and help explain some of the core symptoms. Given the emphasis on WCC in the current literature, it is critical that we begin to examine this aspect of cognitive processing in ASDs by using rigorous empirical and theoretical approaches. Happé and Frith (2006) suggested that the bias for attending to details versus the gestalt or gist can be viewed as existing on a continuum. Although individuals with an ASD appear to be more likely to have a bias at the details end of this continuum, not all individuals demonstrate this bias (Jarrold & Russell, 1997). This is not surprising given the heterogeneity of the autism spectrum and suggests that studies with a focus on individual performance are warranted. It is important to understand which individuals prefer details and in what contexts, as this information may allow us to predict the situations in which details will be most salient.
The current study is our initial investigation of information processing mechanisms in high-functioning children and adolescents with an ASD. As stated, we utilized a global–local paradigm for this purpose. We investigated four specific features of information processing (see below for a description) using the models and measures of Townsend and colleagues’ Systems Factorial Technology (SFT) and the associated, well-developed Double Factorial Paradigm (DFP; Townsend & Ashby, 1983; Townsend & Nozawa, 1995). It is worth noting that despite the broad usefulness of the global–local paradigm and the multitude of studies that have employed it, up until now, the underlying processing mechanisms and how they operate have not been open to identification even in the general experimental milieu. The present investigation is the first to identify critical features of information processing associated with this valuable paradigm in a clinical sample.
1.1. Human information processing systems
Models of human information processing systems often address questions about four fundamental characteristics of cognitive (perceptual, memory, decision making, etc.) processing: architecture, stopping rule, (in)dependence, and capacity. In the following definitions, we assume that a given cognitive task engages a set of subprocesses, which we refer to as processing channels.2 Architecture defines the spatiotemporal arrangement of the channels. The two canonical model architecture classes we consider here are parallel (simultaneously operating channels) and serial (sequentially operating channels). The stopping rule specifies how many channels must complete processing before the system can make a response. Of particular interest to our current work are minimum-time, sometimes called first-terminating, and exhaustive stopping rules. Dependence within the system describes if and how the channels interact with each other. Finally, capacity describes how efficiently the channels/system responds to changes in workload demands. These characteristics will be defined more rigorously later.
The Systems Factorial Technology, and associated Double Factorial Paradigm, provide a rigorous, mathematically well-defined framework for characterizing the above aspects of information processing within a single experiment (Townsend & Ashby, 1983; Townsend & Nozawa, 1995; Townsend & Wenger, 2004). Within the SFT framework, we assume that each facet of a given task (e.g., identification of each object in a visual search display) is subserved by a corresponding processing channel. Given the nature of global–local processing tasks, we will restrict our consideration here to models consisting of two channels, one associated with global object identification and the other dedicated to local feature processing.3 We further assume that physical manipulations of global features and local features influence only the processing speed of the global channel and local channel, respectively. This latter assumption is known as the assumption of selective influence (Dzhafarov, 1997; Schweickert, 1978; Schweickert & Giorgini, 1999; Schweickert, Giorgini, & Dzhafarov, 2000; Townsend, 1984; Townsend & Nozawa, 1995) and we specify it in more detail below.
As noted above, one of the strengths of the SFT approach is its non-parametric nature. The measures that assess architecture, stopping rule, and capacity make no assumptions about the distributions underlying the response time processes. They are based on transformations of the empirical response time distributions. However, the SFT does operate under the aforementioned assumption of selective influence. Stemming from Donders’ (1869/1969) method of subtraction and Sternberg’s (1969) additive factors method, selective influence theoretically specifies that physical (stimulus) variables affect one and only one cognitive process. Let A be the cognitive channel processing physical factor α, and let B be the cognitive channel processing physical factor β. In our modern terminology selective influence means both that (1) physical manipulation of α affects the processing rate of Channel A and physical manipulation of β affects the processing rate of Channel B, and (2) the physical manipulation of α does not affect processing in Channel B, and the physical manipulation of β does not affect processing in Channel A (Ashby & Townsend, 1980; Townsend & Nozawa, 1995; Townsend & Thomas, 1994).
For example, in a simple visual detection task, we can present a single lighted dot to each visual field and further manipulate the luminance of each dot. If the assumption of selective influence is met, changing the brightness of the left dot would not influence the speed of detecting the right dot and vice versa (for empirical results on this task, see Townsend & Nozawa, 1995). Selective influence in a global–local task holds when changes to the global configuration influence only global processing and changes to the local elements affect only local processing. We might anticipate, given previous findings of global interference on local processing (Navon, 1977, 1981), that the assumption of selective influence may not be met under some global–local testing conditions.
More technically, we can empirically assess selective influence by examining response time data, because the actions of experimental factors that change the rates of processing manifest as shifts in the response time cumulative distribution functions (CDFs).4 We define influence per se as the measurable changes in response time distributions resulting from manipulation of experimental factors. Influence implies an ordering of the RT distributions, such that factors manipulated to slow processing result in slower response times across the entire distribution. When more than one channel is being assessed by multiple experimental factors, selective influence implies an ordering in the marginal CDF of a given channel while the distributions of the other channels remain unchanged. Let A1 and A2 denote the Channel A processing rates when α is at level 1 and level 2, respectively, and denote the marginal CDF of response times for Channel A at a given level by FAi(t), for i = 1, 2. If A1 is faster than A2, selective influence is supported by FA1(t) ≤ FA2(t) or equivalently SA1(t) ≤ SA2(t) where Sk(t) = 1 – Fk(t) is the survivor function of process k. We note that while the CDFs may be equal at some time, under selective influence, there exists at least one measurable time interval (t1, t2) with t1 < t2 where the ordering is strict, so that FA1(t) > FA2(t) on that interval.5
A necessary, but not sufficient, condition for the assumption of selective influence is the ordering of response time distributions. A system that shows crossings of its CDFs violates the assumption of selective influence and may have strong dependencies between its processing channels. Hence, we tested for violations of selective influence by examining orderings of our empirical CDFs.
There are two types of selective influence violations that can occur, direct and indirect, and both imply some form of dependency in the processing system. First, a direct violation of selective influence on, say, Channel A, occurs when manipulation of factor β affects Channel A without mediation through Channel B. In the global–local testing framework, direct non-selective influence would be present if changes to the global configuration directly affect the processing of local elements or vice versa (for example, at the early input or perceptual encoding stages of processing). Direct non-selective influence results in an input-based correlation between the channel processing times (Townsend & Thomas, 1994), and its presence also negates the assumptions of the Systems Factorial Technology theorems. Careful construction of the stimuli should prevent direct selective influence violations.
Non-selective influence can occur indirectly when physical manipulation of β directly affects only Channel B, but Channel B in turn affects Channel A via interchannel statistical dependence.6 Note that channel dependencies can be facilitatory in nature, with one channel accelerating the processing in another, or inhibitory, with one channel slowing another. Either form qualifies as indirect non-selective influence. Note also that stochastic dependencies can occur in both serial and parallel channels. If indirect non-selective influence occurred in a global–local experiment, it would mean that although the global channel processes only global inputs and the local channel processes only local inputs, the speed of processing in one channel affects the speed of the other channel.
As mentioned above, the architecture of a processing system defines the spatiotemporal arrangement of channels or subprocesses. The canonical architectures considered here are depicted in Fig. 1. If the channels proceed in a sequential manner, with one channel completing its processing before the next begins, then the system possesses a serial architecture (Fig. 1(a)). If, however, the channels operate simultaneously, with decisions made on each channel separately, then the system possesses a parallel architecture (Fig. 1(b)). We also consider a special case of parallel processing known as the coactive architecture, wherein channels operating in parallel feed into a single decision channel (Fig. 1(c)). Responses in the coactive system are based on the pooled information from all the channels rather than individual decisions being made on each channel as observed in standard parallel processing.
Logically distinct from architecture and often governed by task demands, the stopping rule specifies how many channels must reach completion for the system to make a correct response. If only a single channel needs to be completed, such as in the case of visual searching for one target item in a display of only target items, then the system is operating with a minimum-time stopping rule, also known as an OR-gate. When all subprocesses must be completed, such as visual search for a target item in a display of only distractor items, then the system is operating under an exhaustive stopping rule or AND-gate. Note that coactive architectures do not employ one of these traditional stopping rules. In this case, a single decisional threshold exists on the decision channel, and the system responds when the pooled information reaches that threshold.
Two measures have been developed to identify the processing architecture and stopping rule: the Mean Interaction Contrast (MIC) and the Survivor Interaction Contrast (SIC) functions (Townsend & Nozawa, 1995). These measures are based on the RT patterns from the processing of double targets whose processing speeds have been factorially manipulated. Factorially combining two targets at two levels of salience creates four critical conditions for assessing architecture. We denote these conditions by HH, HL, LH, and LL, where, in the present global–local task, the first letter refers to the manipulation of global target salience and the second letter indicates the salience manipulation of the local targets. An H “high” salience manipulation facilitates faster target processing, while an L “low” salience manipulation slows processing relative to the H manipulation. Under the assumption of selective influence, double-target processing speeds exhibit the following predictable orders: HH > {HL, LH} > LL. The MIC, then, is defined by the double difference of the double-target mean RTs:
(1) |
Analogous to the MIC, the SIC is defined by the double difference of survivor functions for the four factorially manipulated redundant target conditions:
(2) |
Note that while the MIC provides a single number, the SIC is a function of time and produces a curve for each model.7
Each architecture and stopping rule pair makes a unique prediction for the combination of MIC value and SIC shape. Aside from the coactive architecture, which uses a single decisional threshold, there are four distinct logical combinations of the architectures and stopping rules defined above. These are: (1) serial minimum-time, (2) serial exhaustive, (3) parallel minimum-time, and (4) parallel exhaustive. Fig. 2 illustrates the MIC and SIC signatures for each of these five models with independent channels (Townsend & Nozawa, 1995). Because of their non-parametric nature, these are descriptive model predictions, providing guidelines for the qualitative patterns that may be observed empirically in each participant’s RT data. Again, the five MIC/SIC combinations in Fig. 2 assume selective influence, and in previous applications of these techniques, empirical support for selective influence has been observed consistently (e.g., Fifić, unpublished; Ingvalson & Wenger, 2005; Townsend, Fifić, & Neufeld, 2007; Townsend & Nozawa, 1995; Wenger & Townsend, 2001). Anticipating our results for global–local processing, which appears to lend itself to channel dependencies as evidenced by global interference, we will encounter evidence of indirect non-selective influence through the crossover of a single empirical survivor function. Such violations, termed single-factor reversals, were developed theoretically by Townsend and Thomas (1994); we will return to the implications of the single-factor reversal in the results.
As discussed above, indirect non-selective influence occurs through channel dependencies. In order to then model the processing systems under non-selective influence, we applied the theory of parallel interactive processing (Townsend & Wenger, 2004). A lack of evidence from both our current results and a group of adult controls (Blaha, Johnson, & Townsend, 2007) for serial processing suggests we may reasonably restrict ourselves to parallel interactive models. Interactive parallel processing is defined by the presence of stochastic dependencies between parallel processing channels (Townsend & Wenger, 2004). These systems differ from the aforementioned coactive processing models in that the interactive parallel models are still defined with separate decision mechanisms for each channel. Allowing interactions between parallel channels results in a wide range of SIC predictions. For example, as can be seen in Fig. 3, both inhibitory and facilitatory exhaustive models with minimal to moderate interactions can produce entirely negative SIC functions or S-shaped SIC functions that have a large negativity followed by a small positivity. Under high interactions, exhaustive models produce an S-shaped SIC similar to the coactive shape in Fig. 2 (for more detailed model predictions, see Eidels, Houpt, Pei, Altieri, & Townsend, in preparation). More detail on the behaviours of interactive parallel processing models will be presented in the Results.
In addition to architecture, stopping rule and processing dependence, we can use SFT to evaluate the system’s workload capacity, which describes the efficiency of processing as the amount of work to be done (i.e., the number of channels operating) increases. If, as the number of active target-present channels increases, processing rates within each channel are unchanged, the system is described as exhibiting unlimited capacity at the channel level. If channel processing slows under increasing workload, the system is described as limited in capacity, and if processing speeds up, the system is said to exhibit super-capacity (Townsend, 1972, 1974; Townsend & Ashby, 1978, 1983; Wenger & Townsend, 2000).
We measure processing workload capacity by the capacity coefficient ratio (Townsend & Nozawa, 1995; Townsend & Wenger, 2004; Wenger & Townsend, 2000), defined for global–local processing by
(3) |
where is the cumulative hazard function for target condition i.8 The hazard function defines the conditional probability that processing completes at time t given that the system has not yet finished processing before time t. While the hazard function provides a measure of instantaneous energy (termed “power” in physics) in the system at each point of time, the cumulative hazard function provides a measure of the dispersed energy or total amount of work completed by the system at each point in time. The capacity coefficient, then, provides a comparison of the amount of work that is completed by the system while processing double targets (here, targets at both the global and local level) with the summed amount of work that is completed in the same amount of time by each single target processed individually. The denominator is the prediction made by an unlimited capacity, independent parallel (UCIP) model, providing an unlimited capacity efficiency benchmark: C(t) = 1. This type of processing seems closest to what is generally meant by the term “parallel processing”. Thus, we accept this variety as exemplifying a standard parallel model.
C(t) provides a fine-grained indicator of capacity over the entire range of RTs. As mentioned above, if C(t) = 1, the work completed in t time on double targets was equal to the summed amount of work completed on each of the single targets in the same amount of time.9 This is the case of unlimited capacity, and again it implies that the system is acting like a standard parallel model in terms of its workload efficiency. If C(t) < 1, processing is limited in capacity. Finally, if C(t) > 1, then processing is super-capacity. Note that capacity hovering around C(t) = ½ can be caused by a serial system, working on global, then local levels or vice versa; or, C(t) = ½ could be associated with a parallel system which divides up a fixed resource across the two operating channels, one local, and the other global (e.g. Townsend & Nozawa, 1995; Townsend & Wenger, 2004). Examining C(t) together with the MIC/SIC tests of architecture allows us to disambiguate these possibilities.
2. Method
To model the information processing system supporting hierarchical form processing, we adapted the Double Factorial Paradigm (DFP) for divided attention identification of global and local stimulus features. New compound arrows were designed according to the DFP criteria. The first target factor manipulated was the presence of a target right-pointing arrow at either the global or local stimulus level or both. The absence of a target at either level was the presence of a neutral (dash-shaped) distractor. This first factor, the number of targets present/absent, provides the workload manipulation to determine the capacity of the global–local processing system. The second factorial manipulation was the salience of the target arrows relative to the shape of the distractor dash. This second factor enables the assessment of processing architecture by modeling the responses to the redundant target trials. We employed a basic divided attention OR-decision task (see Townsend & Ashby, 1983; Townsend & Nozawa, 1995). We did not employ Stroop-like incongruent distractors (left-pointing arrows), as our goal was to minimize perceptual and cognitive interference (i.e., to minimize possible dependence) in modeling the basic structures of the global–local processing system.
2.1. Participants
Participants were 10 high-functioning adolescents and young adults with a diagnosis of an autism spectrum disorder (ASD; 6 males, 4 females) and 11 typically developing controls (5 males, 6 females) matched for age and cognitive ability (see Table 1). Diagnoses were established according to the Autism Diagnostic Interview-Revised (ADI-R; Lord, Rutter, & Le Couteur, 1994) and the Diagnostic and Statistical Manual for Psychiatric Disorders, Fourth Edition, Text Revision (DSM-IV-TR; American Psychiatric Association, 2000). The groups were similar in age, gender composition, and estimated intellectual abilities (Wechsler Abbreviated Scale of Intelligence; Wechsler, 1999). In addition, we utilized a parent-report (Baron-Cohen, Wheelwright, Skinner, Martin, & Clubley, 2001) and self-report (Baron-Cohen, Hoekstra, Knickmeyer, & Wheelwright, 2006) version of the Autism Spectrum Quotient (AQ) to gather information about autism symptom severity. As expected, the ASD group means were significantly higher than controls, indicating more symptoms of autism, on both versions of this measure. Exclusion criteria for both groups included parent- or self-report of histories of significant head trauma, major neurological disorders, lack of normal or corrected-to-normal vision, and systemic diseases that might affect the central nervous system.
Table 1.
Measure | ASD group |
Control group |
t-value | P-value | ||
---|---|---|---|---|---|---|
Mean | SD | Mean | SD | |||
Age (years) | 16.3 | 2.3 | 14.3 | 2.6 | 1.86 | .08 |
WASI two-subtest IQ | 112.4 | 9.6 | 114.6 | 10.3 | .45 | .66 |
Vocabulary (t-score) | 59.9 | 8.4 | 61.2 | 7.5 | .41 | .73 |
Matrix reasoning (t-score) | 54.2 | 6.3 | 55.4 | 6.4 | .36 | .69 |
AQ parent-report | 34.3 | 3.6 | 12.7 | 6.0 | 9.22 | <.001 |
AQ self-report | 21.8 | 7.5 | 15.1 | 3.8 | 2.54 | <.05 |
WASI = Wechsler Abbreviated Scale of Intelligence (Wechsler, 1999); AQ = Autism Spectrum Quotient (Baron-Cohen et al., 2006, 2001).
2.2. Stimuli and apparatus
Visual stimuli were presented on a 17 inch Dell Trinitron monitor (refresh rate ≈ 100 Hz), and responses were made using a standard two-button mouse. A Dell Intel Pentium II computer controlled stimulus presentation and recorded responses using DMDX software.10
The hierarchical images were large (global) arrows composed of smaller (local) arrows. The arrow shapes were either a right-pointing arrow or a dash. At the viewing distance of 50 cm, global elements subtended a horizontal visual angle ranging from 3.88 degrees (global dash) to 4.98 degrees (global high salience arrow). All global objects subtended a vertical visual angle of 1.48 degrees. Each local element subtended 0.55 by 0.37 horizontal by vertical degrees of visual angle.
The first factorial manipulation was the presence or absence of an arrow. Consequently, we used all combinations of global and local right-pointing arrows mixed with global and local dashes, as seen in Fig. 4. The absence of a right-pointing arrow was defined by the presence of a dash. Thus, stimuli either contained no targets (dash composed of dashes), a single target (global arrow with local dashes or global dash with local arrows), or double targets (global arrow consisting of local arrows).
The second factorial manipulation was the salience of the arrows relative to the dashes. Low salience arrows were right-pointing arrows visually similar to the shape of the dash; high salience arrows had a more distinct arrow shape, including a longer point. Salience manipulations influenced processing speed, which allowed us to directly assess both the processing architecture and stopping rule. High salience arrows encourage faster processing than low salience arrows, because they are easier to identify.
Fig. 4 depicts the full set of double-factorial stimuli, including all factorial combinations of arrows present/absent and arrow salience. As shown here, the stimuli were black figures presented on a white background.
2.3. Procedure
Participants were asked to complete a standard divided attention, or OR-gate, task in which all stimuli containing a target (a right-pointing arrow of any salience at either the global or local stimulus level) mapped to a target-present response, and the stimulus containing only distractors (dashes) mapped to a target-absent response. Participants were instructed to respond ‘yes’ with a right mouse button press if a right-pointing arrow was present at any level (global or local or both) in the stimulus, and ‘no’ with a left mouse button press if no arrows were present (the global dash composed of local dashes).
A single trial began with an approximately 80 ms tone and central fixation. Immediately following this, a single stimulus was randomly presented either at fixation or in one of four off-centered locations within 7.94 by 3.7 horizontal by vertical degrees of fixation. Stimuli were presented for 65 ms, and then replaced by a blank screen until a mouse button response was made. When an incorrect response was made, the word “wrong” was displayed for 2400 ms; no feedback was given when a correct response was made. The blank screen inter-stimulus interval (ISI) was 1000 ms.
All participants completed 2560 trials in total. These trials were presented in four separate runs, with each run containing 4 blocks of 160 trials. Typically, a participant completed the study in two separate testing sessions that comprised two runs (8 blocks) of the task. Within each block, the prior probabilities of target-absent, global target only, local target only, and double-target trials were equated, as were the four types of double-target stimuli and the high and low salience stimuli for the single-target trials.
At the beginning of each testing session, participants completed two blocks of practice trials. In the first block of 27 trials, stimuli were presented centrally and remained on screen until a response was made. In the second block of 27 practice trials, stimuli were presented in the testing manner described above. If participants had a practice error rate over 5%, they repeated the two practice blocks.
2.4. Data analysis
Global advantage at the group level was assessed by repeated measures analysis of variance (ANOVA) with the within-subjects factor of Target Location (Local Only, Global Only, Double Targets) and the between-subjects factor of Group (ASD, Controls). Repeated measures ANOVAs were applied to both the group-averaged RT data and the group-averaged error rates. Individual participant global RT advantage was assessed by three two-sample, two-tailed Kolmogorov–Smirnov tests for differences in CDFs (Double Targets v. Local Only, Double Targets v. Global Only, Local Only v. Global Only).11 Individual participant error rates were not analyzed because individual error rates were too low to permit meaningful comparisons.
Evidence for selective influence was assessed by a series of non-parametric two-sample, one-tailed Kolmogorov–Smirnov tests between pairs of double-target conditions (denoted HH, HL, LH, LL). Under the assumption of selective influence, double-target RT survivor functions exhibit the ordering SHH (t) ≤ {SHL(t), SLH (t)} ≤ SLL(t), which translates into five distinct orderings that must be tested. In the two-sample Kolmogorov–Smirnov test, the null hypothesis is that two empirical distributions are equal, and the alternative is that the two distributions are ordered in some way. To assess selective influence, we performed the following set of tests for each participant individually:
H0: SHH(t) = SHL(t)v:Ha: SHH(t) < SHL(t),
H0: SHH(t) = SLH (t)v:Ha: SHH(t) < SLH (t),
H0: SHH(t) = SLL(t)v:Ha: SHH(t) < SLL(t),
H0: SHL(t) = SLL(t)v:Ha: SHL(t) < SLL(t),
H0: SLH(t) = SLL(t)v:Ha: SLH(t) < SLL(t).
A significant Kolmogorov–Smirnov test statistic, rejecting the null hypothesis, provides evidence supporting the assumption of selective influence. Kolmogorov–Smirnov test statistics and exact p-values were calculated using MATLAB.
All Systems Factorial Technology measures, including C(t), mean interaction contrast and survivor interaction contrast defined above, were applied to individual participants’ response time data.
3. Results
3.1. Group-averaged results
Mean response time (RT) results are plotted in Fig. 5. There was a significant effect of Target Location (F (2, 38) = 57.14, p < .001). Post hoc paired t-tests indicated that responses to Global Only stimuli () were significantly faster than responses to Local Only stimuli (; t(20) = 3.54, p = 0.001). Responses to Double Targets () were significantly faster than both Local Only targets (t(20) = 9.12, p < 0.001) and Global Only targets (t(20) = 11.09, p = 0.001). There was no difference between the mean RTs of the groups (F (1, 19) = 0.67, p = 0.42), and there was not a significant interaction of Target Location and Group (F (2, 38) = 0.69, p = 0.51). Thus, at the group level, both the ASD and control groups demonstrated global advantage. Both groups also showed a decrease in RT, or redundancy gain, when targets were present at both the global and local levels compared to targets presented at either level alone.
Mean probabilities of error are plotted in Fig. 5. There was a significant effect of Target Location (F (2, 38) = 16.64, p < 0.001). Post hoc paired t-tests show that significantly more errors were made on Local Only trials () than on both Global Only trials (; t(20) = 3.78, p < 0.001) and Double-Target trials (; t(20) 4.48 p < 0.001) Significantly more errors were made on Global Only trials than on Double-Target trials (t(20) = 3.89, p < 0.001). There was no difference in the probabilities of error between Group (F (1, 19) = 0.01, p = 0.9), and there was not a significant interaction of Target Location and Group (F (2, 38) = 0.10, p = 0.91).
3.2. Individual analyses and models
Results of the individual participant analyses are summarized in Tables 2 and 3.
Table 2.
RT distribution ordering |
Capacity | |||
---|---|---|---|---|
Double v. Global Only | Double v. Local Only | Global Only v. Local Only | ||
ASD1 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)*** | Limited |
ASD2 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)*** | Super→Limited |
ASD3 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)** | Super→Limited |
ASD4 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)* | Limited |
ASD5 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)*** | Super→Limited |
ASD6 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) ≈ FLocal(t) | Super→Limited |
ASD7 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) ≈ FLocal(t) | Limited |
ASD8 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)*** | Super→Limited |
ASD9 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)*** | Super→Limited |
ASD10 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)*** | Super→Limited |
| ||||
CON1 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)*** | Limited |
CON2 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)*** | Limited |
CON3 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) ≈ FLocal(t) | Limited |
CON4 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)* | Limited |
CON5 | FDouble(t) > FGlobal(t)** | FDouble(t) > FLocal(t)*** | FGlobal(t) < FLocal(t)*** | Limited→Unlimited |
CON6 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)*** | Super→Limited |
CON7 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)* | Limited |
CON8 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) > FLocal(t)*** | Super→Limited |
CON9 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) < FLocal(t)* | Limited |
CON10 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) < FLocal(t)* | Limited |
CON11 | FDouble(t) > FGlobal(t)*** | FDouble(t) > FLocal(t)*** | FGlobal(t) < FLocal(t)* | Limited |
Note that the arrow→indicates the progression of time, t → ∞, over the range of RTs.
p < 0.15
p < 0.05
p < 0.01
Table 3.
S(t) ordering | MIC | SIC | Candidate systems | |
---|---|---|---|---|
ASD1 | Predicted ordering | Overadditive | S-shaped Overadditive |
|
ASD2 | Predicted ordering | Overadditive | S-shaped Overadditive |
|
ASD3 | Predicted ordering | Overadditive | S-shaped Overadditive |
|
ASD4 | Predicted ordering | Overadditive | S-shaped Overadditive |
|
ASD5 | Predicted ordering | Underadditive | S-shaped Underadditive |
|
ASD6 | Multiple S(t) crossings | Crossover | Positive |
|
ASD7 | Multiple S(t) crossings | Crossover | Positive |
|
ASD8 | Predicted ordering | Overadditive | Positive |
|
ASD9 | Predicted ordering | Underadditive | S-shaped Underadditive |
|
ASD10 | Multiple S(t) crossings | Crossover | Positive |
|
| ||||
CON1 | Crossing type II | Underadditive |
Pos→Neg Underadditive |
|
CON2 | Crossing type I | Crossover | Positive |
|
CON3 | Crossing type I | Crossover | Positive |
|
CON4 | Crossing type I | Crossover | Neg→Pos→Neg |
|
CON5 | Crossing type I | Overadditive |
Positive |
|
CON6 | Crossing type I | Crossover | Neg→Pos→Neg |
|
CON7 | Crossing type I | Crossover | Neg→Pos→Neg |
|
CON8 | Crossing type I | Crossover | Positive |
|
CON9 | Crossing type I | Overadditive |
S-shaped Overadditive |
|
CON10 | Crossing type I | Crossover | S-shaped Overadditive |
|
CON11 | Crossing type II | Crossover | Negative |
|
Note that the arrow→indicates the progression of time, t → ∞, over the range of RTs.
Predicted ordering: SHH(t) ≤ SHL(t) ≤ SLH(t) ≤ SLL(t).
Crossing type I: SHH(t) crosses SHL(t) ≤ SLH(t) ≤ SLL(t).
Crossing type II: SHH(t) ≤ SHL(t) ≤ SLH(t) crosses SLL(t).
Global advantage and redundancy gain
All participants exhibited a response time (RT) redundancy gain at the cumulative distribution level, where FDouble(t) > FGlobal(t) and FDouble(t) > FLocal(t). Eight of ten ASD participants exhibited a global RT advantage at the distribution level, with FGlobal(t) > FLocal(t). The remaining two participants showed at least a marginal global RT advantage at the mean level (ASD6: , , t(1154) = 1.13, p < 0.13; ASD7: , t(1222) = 0.83, p = 0.21). Only six of 11 controls exhibited a global RT advantage with at least marginal significance. One control exhibited FGlobal(t) = FLocal(t) at the distribution level, and a non-significant difference at the mean level (CON3: , , t(1222) = 0.83, p = 0.21). Four controls showed a reversal such that FGlobal(t) < FLocal(t) with at least marginal significance at the distribution level. Thus, global advantage did not hold for several individuals in the control group.
Capacity
Capacity coefficient values for ASD and control participants are plotted in Fig. 6 and summarized in Table 2. For both groups of participants, C(t) < 1 over most of the RT range, with three ASD participants exhibiting C(t) < 1 for all times t. Seven ASD participants showed super-capacity C(t) > 1 for early processing times (responses faster than approximately 450 ms) before dropping to limited capacity processing.
The majority (9 of 11) of the control participants exhibited C(t) < 1 for all times t, shown in Fig. 6. Two control participants showed C(t) ≥ 1 for very early RTs followed by limited capacity C(t) < 1 after 450 ms. For both groups, the lower bound of observed capacity coefficient values was C(t) = ½, approximately fixed capacity.
Selective influence
Seven of ten ASD participants exhibited evidence supporting selective influence with survivor function orderings SHH(t) ≤ SHL(t) ≤ SLH (t) ≤ SLL(t), with a strict ordering of all survivor functions holding for at least some range of time. In particular, these seven participants exhibited strict orderings of their fast RTs, in the range of 200–450 ms after the onset of stimulus. Following this early strict ordering, each participant had some period of time in which SHH (t) < SLL(t), but SHH(t) ≈ SHL(t) or SLH (t) ≈ SLL(t) or both, indicating that global dimension processing dominated local dimension processing. It is important to note that while the survivor functions were not strictly ordered at all times, they did not show violations (survivor function crossings) of the above ordering.12
Three of the ten ASD participants exhibited violations of the selective influence assumption, with response time distributions that were either equal or crossed. These survivor function crossings occurred between multiple conditions and at multiple points in time in no obviously predictable pattern. Consequently we were unable to examine the processing architecture employed by these individuals.
All control participants showed evidence of non-selective influence through distribution misorderings. Nine of 11 controls exhibited the ordering SHL(t) ≤ SLH(t) ≤ SLL(t) with SHH(t) crossing at least SHL(t) if not all the other distributions at some point in time. That is, SHH (t) was the smallest survivor function at early times but then slowed over processing to be larger than at least one of the other three double-target conditions.
Two controls showed a survivor function ordering such that SHH(t) ≤ SHL(t) ≤ SLH (t) all cross SLL(t) at some point in time. This type of misordering could indicate that channel processing rates within the LL condition increased over the course of a trial, but this is unlikely given the nature of the low salience stimulus manipulations. A more parsimonious explanation is that the above misorderings resulted from processing in the HH, HL and LH conditions that was slower than predicted by selective influence. That is, the channel processing rates decreased over the course of a trial through interchannel inhibition.
The systematic violations of selective influence exhibited by all controls are indicative of particular types of interactions between the processing channels. Specifically, orderings of early response times followed by the crossover of a single distribution suggests that the control participants were processing under single-factor reversal indirect non-selective influence (Townsend & Thomas, 1994). Mathematically, a single-factor reversal is defined at the mean level by E(THL) < E(THH), E(TLH) > E(TLL), or both. That is, for either the local or global arrows, the low salience arrows were processed faster than the high salience arrows, resulting in the reversal of a factor ordering. Single-factor reversals are generally evident as crossovers in the mean interaction contrast (MIC). The MIC results, summarized in Table 3 and depicted for two control participants in Fig. 7, reflected such reversals for 8 of 11 control participants and mean RT equalities for the three other control participants. For all participants, the reversal was observed on the salience of the local arrows, where responses to low salience local arrows were faster than high salience local arrows. According to Townsend and Thomas’ Theorem 1, a sufficient condition for the occurrence of a single-factor reversal within a parallel model is the presence of negative stochastic dependence between subprocesses.
Architecture and stopping rule
The survivor interaction contrast (SIC) functions for the seven ASD participants supporting selective influence, together with the pooled SIC curve, are plotted in Fig. 8 and summarized in Table 3. As depicted in the scatter points for the individual participants, mirrored in the group curve, the SIC functions exhibited an S-shaped characteristic of coactive or parallel interactive processing. The SIC functions for all participants were negative for early processing times and positive for later processing times. Exhaustive serial processing requires that the early negative region be equal to the subsequent positive area (see Fig. 2), and we did not observe this pattern for any participant.
Considering each individual participant, there was evidence for two different processing architectures within the ASD group. Examples of these trends from two typical participants are plotted in Fig. 9. Five ASD participants had overadditive (positive) MIC values with S-shaped SIC functions that were negative for short, early periods of time and then predominantly positive for later times (Fig. 9, ASD3). This pattern is evidence for a coactive processing architecture (see Fig. 2) or facilitatory parallel exhaustive processing.
The remaining two ASD participants showed the combination of underadditive (negative) MIC values with S-shaped SIC functions that were negative for a much longer period of time than they were positive (Fig. 9, ASD5). This pattern, together with ordered survivor functions, is predicted by parallel exhaustive processing with mild facilitatory interactions. The facilitation was not strong enough to cause any survivor function misorderings but resulted in late positivity in the SIC curves which would not be found in strictly independent parallel exhaustive processing (Townsend & Nozawa, 1995). Thus we can conclude that all ASD participants were using either coactive or facilitatory exhaustive parallel processing in our global–local task.
Non-selective influence was consistently observed for control participants, meaning that we were unable to determine architecture from our non-parametric MIC and SIC measures. While we have identified these violations as single-factor reversals, Townsend and Thomas (1994) did not have a state space model to consider processing models that might account for these patterns. We utilized a Poisson counter model instantiation of parallel interactive processing (Eidels et al., in preparation) to simulate and fit these data. These more intricate models, which do not assume selective influence and allow for stochastic dependencies, provided a means of explicating the differences in processing systems between the control and ASD participants. Details of the structure and implementation of the Poisson parallel interactive models can be found in the Appendix.
Poisson parallel interactive models (PPIMs)
Intuitively, PPIMs act as a system of parallel accumulators, building evidence for or against the presence of targets in each channel (Eidels et al. (in preparation); see also the Appendix and Fig. 10). Channels are able to stochastically interact with each other, either accelerating accumulation by sending evidence-supporting information between channels, or slowing accumulation by sending competing information between channels. We considered four classes of PPIMs: facilitatory minimum-time systems, facilitatory exhaustive systems, inhibitory minimum-time systems, and inhibitory exhaustive systems. Double-target conditions were modeled by two accumulators, one representing global information and one representing local information. The two accumulators could either mutually facilitate or mutually inhibit each other. Single-target conditions were modeled as an accumulator in the presence of a competing (inhibitory) accumulator representing distractor information. The presence of inhibitory distractor information bounded the speed of single-target processing. Consequently, both facilitatory and inhibitory PPIMs produced redundancy gains similar to the redundancy gains observed in the data. These models predict a variety of SIC patterns, depicted in Fig. 3, depending on the nature and degree of channel interactions. Despite the relative generality of the interactive models, the predicted SIC patterns are still highly constrained. At least one type of PPIM was predicted by each participant’s data.
Table 3 provides a summary of candidate PPIMs together with the empirical findings for all participants (control participants in the lower panel). Facilitatory minimum-time PPIMs exhibit capacity coefficient values C(t) ≥ 1, with C(t) = 1 occurring when facilitations have been minimized or are 0. Thus we eliminated them as candidate models. Facilitatory exhaustive models, on the other hand, can exhibit C(t) < 1.13 However, no facilitatory PPIMs, exhaustive or minimum-time, show survivor function crossovers (a correlate, proven by Townsend & Thomas, 1994, showed this at the mean level). Specifically, because the double targets are positively interacting with each other, these models do not exhibit a slowing of the HH condition, as was observed. Consequently, we ruled out facilitatory exhaustive PPIMs and inferred that all control participants were processing with inhibitory interactions.
Stopping rules were determined by the shapes of the survivor interaction contrast (SIC) function under inhibitory crosstalk. The SIC results for all control participants are plotted in Fig. 8, and the SICs of two typical control participants are plotted in Fig. 7. By comparing their results with the predictions of the interactive models in Fig. 3, we conclude that the seven controls who exhibited a strictly positive or W-shaped SIC function (Fig. 7, Control 8) likely employed a minimum-time stopping rule. Two control participants exhibited negative SIC functions indicative of inhibitory exhaustive processing (Fig. 7, Control 11). Two control participants exhibited S-shaped SIC functions that were predicted by both minimum-time and exhaustive inhibitory PPIMs; so we cannot conclusively determine these individuals’ stopping rules under inhibitory parallel processing.
PPIMs also offer a second candidate model for the ASD data. Those participants who exhibited ordered survivor functions, S-shaped SIC functions, and limited capacity processing may have been utilizing a facilitatory exhaustive parallel processing system.
4. Discussion
In a divided attention global–local perception task, both the ASD and control groups showed response time and accuracy data supporting global advantage. Consistent with previous studies of global–local processing (see Kimchi, 1992, for a review), both groups were faster at responding to global level stimulus information than local information. These data do not support previous claims that individuals with an ASD have a local bias or show a lack of global advantage (Plaisted et al., 1999).
Despite these group similarities, we found fundamental differences between the ASD and control groups when we employed information processing models of response time data to examine four characteristics of global–local processing in individual participants. Differences were not evident in architecture; all participants across both groups used a parallel processing architecture. This finding is consistent with our previous study (Blaha et al., 2007), which demonstrated that a parallel minimum-time processing system was typically employed by adults performing the same divided attention global–local task.
Differences between groups were evident in the type of interaction between the parallel channels and the stopping rule employed. Participants in the control group all exhibited strong inhibitory (negative) interactions between global and local information and the majority (7/11) employed a minimum-time stopping rule. Two control individuals employed an exhaustive stopping rule, and the data were ambiguous as to the stopping rule for two others. Inhibitory parallel processing means that the control participants were processing global and local information simultaneously, but that information in these two channels was slowing the processing of each other. Generally, control participants responded as soon as enough information had been processed to make a correct decision (i.e., minimum-time stopping rule). The limited capacity coefficient values exhibited by each control participant offer additional evidence that global and local information were processed inefficiently, which is not surprising given the inhibition between channels.
Two candidate models were found within the ASD participant group: coactive processing under the assumption of selective influence and facilitatory exhaustive parallel processing when the assumption of selective influence is relaxed. Both candidate models predict the double-target survivor function orderings observed in the ASD data (Table 3). However, only the facilitatory exhaustive parallel processing system would predict the limited capacity C(t) values observed for each ASD participant. Typically, coactive architectures, operating on independent channels, result in highly efficient processing, evidenced by C(t) ≥ 1 (Townsend & Nozawa, 1995) or violations of the race-model bound on standard parallel processing response time distributions (Miller, 1982). Thus, a facilitatory exhaustive parallel processing system is most consistent with the response time patterns of the ASD participants. This means that the individuals with an ASD processed global and local information simultaneously and, unlike most of the control participants, waited until they had processed all available information before responding. In contrast to the controls, the presence of both global and local information actually benefitted processing rates through facilitatory interactions. That is, global target information sped up the processing of local target information and vice versa.
Additional analyses are possible to confirm our conclusions about dependencies and stopping rule in the parallel processing systems. In particular, comparisons may be made between each of the nine stimulus types employed in this task. Investigation of the response time distributions for all conditions show that processing of each double-target condition (HH, HL, LH, LL) is consistently faster than processing of each single-target condition, with response times to H targets being faster than those to L targets. The non-target object (dash composed of dashes) is the slowest response time distribution overall. For many participants, these results hold at the level of the integrated hazard function (Townsend, 1990), implying significant differences at the mean. Thus, comparisons between the individual stimulus conditions are consistent with the resulting Poisson parallel interactive model conclusions. These comparisons also offer additional support for the single-factor reversals observed in the control participants; as we see high salience manipulations are consistently faster than low salience manipulations in the presence of a distractor, but that the processing speed for high salience manipulations slows in the presence of targets. Although a full accounting of the results of these analyses is beyond the scope of this paper, data can be made available to the reader upon request.
As described earlier, previous studies of global–local processing in ASD have yielded mixed results. Our findings of global advantage for the majority of ASD participants are inconsistent with some previous findings (Plaisted et al., 1999), but consistent with others (Ozonoff, Strayer, McMahon, & Filloux, 1994; Rinehart et al., 2000). Given our adherence to methodological features that have been identified in cognitive psychological studies as critical to the global–local paradigm, we feel confident that our findings of global advantage are valid. The modeling results offer further insights into ASD performance. Importantly, the observed parallel architectures appear to be inconsistent with the Weak Central Coherence (WCC) theory of ASDs because they refute the use of any sequential global-to-local or local-to-global processing strategies. A recent conceptualization of WCC (Happé & Booth, 2008) suggested that some individuals with an ASD have difficulty constructing the global or whole percept out of the local elements (i.e., poor integration), implying a local-to-global processing order. Our current findings support neither a preference for first processing local information nor faster identification of local targets for any of the ASD participants. Further, parallel processing of global and local information in ASD participants was characterized by facilitatory rather than inhibitory interactions, indicating that global information and local information were not only processed simultaneously but interacted to accelerate responses. This positive interaction would not be predicted by WCC. Although we did not directly examine interference effects in this study (via incongruent distractors), our results do not suggest a local bias or local interference.
The information processing mechanisms revealed here are consistent with some previous studies in ASDs that have examined attention in contexts other than global–local perception. The use of exhaustive processing when not required is consistent with studies that have reported an inability to ignore irrelevant stimulus features in individuals with an ASD (e.g. Brian, Tipper, Weaver, & Bryson, 2003). Similarly, clinical observations of some individuals with an ASD indicate difficulty filtering information in the environment, such as sensory information. Additionally, Rutherford et al. (2007) demonstrated an advantage (less cost) in individuals with autism when dividing attention between central and peripheral task demands. This divided attention advantage might be attributable to facilitation or a lack of inhibition between the channels processing the central task and those processing the peripheral task. More research is needed to determine if individual participants with ASD demonstrate different processing characteristics under distinct task demands or if these characteristics are stable across cognitive and perceptual paradigms. Given the heterogeneity in ASDs, it will be important for future studies to begin to map modeling results onto clinical symptoms and associated features. A finding such as exhaustive processing on a particular task may provide important clues about more general underlying information processing approaches in certain individuals and a clear target for intervention.
Although our findings provide some important insights regarding basic information processing in ASDs they do not address the hallmark symptoms of this spectrum, social impairments. As stated earlier, we do not claim that the processing of non-social information is necessarily relevant for understanding social deficits, nor do we feel it is essential to link our findings to social dysfunction. However, we acknowledge that it is possible that understanding basic perceptual and cognitive processes may further our understanding of social deficits. For example, it is not difficult to imagine how challenging it would be to navigate social situations if information were always processed exhaustively. Timing of exchanges and the ability to identify and respond to the salient features of social interactions would likely suffer. In future studies, we plan to use SFT to examine processing of social stimuli in ASDs. Townsend, Wenger, Fifić and colleagues have already developed and employed tasks that use the Double Factorial Paradigm (DFP) for examining face processing in controls (e.g. Fifić, unpublished; Ingvalson & Wenger, 2005; Innes-Kerr, unpublished; Wenger & Townsend, 2001). It will be important to determine if individuals, both with and without ASDs, demonstrate similar processing approaches for non-social and social stimuli. The current findings are the initial step in a larger set of studies that will examine information processing in ASD and other clinical groups.
Not only do our findings further our understanding of ASDs, but several results have bolstered our knowledge of information processing models and advanced the sophistication of the tools with which we measure and model response time data. Findings from this study highlight the integral role of the assumption of selective influence to human information processing modeling, particularly the strong relationship between failures of selective influence and interchannel interactions. It is noteworthy that the majority of non-selective influences observed across control participants were single-factor reversals, which to date have only been discussed and developed theoretically (Townsend & Thomas, 1994). To our knowledge, ours is the first study to empirically demonstrate single-factor reversals, thus confirming the psychological validity of this modeling construct. Additional investigation is now called for at the distribution function level to enhance our understanding of how single-factor reversals may affect our DFP analyses, particularly the survivor interaction contrast function. This is a clear example of how the nature of clinical research engenders an environment in which to test the limits of mathematical models and to consider alternative approaches to explaining behaviour.
As noted in the introduction, most previous applications of the DFP have observed selective influence, and thus there has not been a need to adjudicate between processing systems under non-selective influence. These previous studies, though, have often employed either simplified psychophysical detection tasks, such as dots presented in separate visual fields (Townsend & Nozawa, 1995), or visual or memory targets with separate physical locations and/or structures, including face features (e.g., eyes, distance between eyes, Fifić, unpublished; Ingvalson & Wenger, 2005; Wenger & Townsend, 2001), halves of split faces (Innes-Kerr, unpublished), and different characters in strings of letters (Fifić, 2001; Townsend & Fifić, 2004). In a recent study using configural line figures, Eidels, Townsend, and Pomerantz (2008) demonstrated that selective influence was consistently violated unless the features of the configural figures were spatially separated. Together, these studies imply that some spatial separation of stimulus features facilitates selective influence.
The global configurations utilized in global–local tasks would not exist without the local elements of which they are composed, and, consequently, the two types of objects are always presented in a spatially contiguous manner. In this sense, global–local processing invites interactions during the course of processing. Prior evidence of global interference on local processing confirms that some degree of interaction typically occurs between global and local information (for a review, see Kimchi, 1992). Thus, it is not surprising that even under conditions in which we eliminated competing/incongruent non-target information, participants were susceptible to processing interactions. The presence of processing interactions suggests that these stimuli may be more integral than separable in nature, a factor shown to affect the architecture used for image classification (Fifić, Nosofsky, & Townsend, 2008).
With the exception of three ASD participants, we confirmed that the observed violations of selective influences were not the result of direct interactions at the stimulus input level because the survivor functions for all participants were ordered for fast response times (less than approximately 450 ms). The global–local processing channels received independent global and local input information, and during this early period of time, the accumulating effects of interchannel interactions had only a minimal effect on processing rates. The use of the Poisson parallel interactive models (PPIMs) enabled us to capture these patterns with interchannel dependencies. Further theoretical work is under way to explicate response time patterns for parallel interactive models in which the inputs are correlated.
The fact that our data exhibited various types and degrees of interaction allowed us to advance the theory development of parallel interactive processing models, specifically the Poisson counter instantiation. Although we (L.M.B. and J.W.H.) have been exploring PPIMs through simulations, this project provided the first opportunity to apply these models to empirical data. Constraining model behaviour to match the empirical patterns necessitated additional theoretical development of the PPIMs. In particular, basic interactive parallel processing treated single-target conditions as channels operating in isolation, with no other channels operating simultaneously and no interactions affecting the single-target processing (Townsend & Wenger, 2004). In practical application, these models, particularly the inhibitory models, could not produce response time distributions showing a redundancy gain because the single channels in isolation were generally faster than the double-target channels inhibiting each other. Yet all ASD and most control participants consistently exhibited redundancy gains for the double-target conditions. This can be attributed, at least partially, to the fact that the single targets (Global Only Arrows and Local Only Arrows) in this experiment were never presented in isolation but always in the presence of non-target distractors (neutral dashes at the non-target level), representing some information competing for processing resources.
To capture this aspect of the data, we introduced an inhibitory distractor channel to the single-target condition, modifying the Markov chains to slow the single-target conditions. Note that like the target channels, the distractor channels processed until reaching their thresholds, but the system could not respond to the distractor information.14 This new feature of the parallel interactive models makes them more flexible and will aid in the future applications of these models to other tasks with complex or integral stimuli.
5. Conclusion
The results of this study highlight the value of employing well-established mathematical modeling techniques to examine perceptual and cognitive processes in clinical populations. In this study, we assessed global–local processing in individuals with an ASD using Systems Factorial Technology and the associated Double Factorial Paradigm (Townsend & Ashby, 1983; Townsend & Nozawa, 1995). Utilizing these tools for the first time in the field of autism research, we were able to begin characterizing the differences in the information processing mechanisms employed by individuals with an ASD and their typically developing counterparts at the individual participant level. The results did not support a bias toward local information, as predicted by the Weak Central Coherence theory and some previous global–local studies in ASDs (Plaisted et al., 1999), but instead indicated a difference in interactions between global and local information processing channels and the stopping rules preferred by each group. This study illustrates the feasibility of applying these methods and models to children and adolescents, as well as developmental clinical samples, expanding the initial clinical applications of SFT by Neufeld, Townsend, and Jetté (2007); Townsend et al. (2007). Furthermore, expanding the application of these methods to samples other than “normal” adults yielded unexpected advances in our modeling techniques. This underscores the synergy that can be obtained by merging mathematical psychology and clinical science, a common theme of this special issue.
Acknowledgments
This study was completed in the Clinical and Cognitive Neuroscience Laboratory of Julie C. Stout, Ph.D. at Indiana University and graciously supported by Dr. Stout’s start-up fund. This research was also supported by NIH-NIMH Training Grant T32 MH019879-11 and NSF Graduate Research Fellowship to Leslie M. Blaha, and NIH-NIMH grant R01MH57717 to James T. Townsend. We give special thanks to the participants and families who took part in this project. We thank Robin R. Murphy, Ph.D. for her key role in the diagnosis and recruitment of participants, and Ami Eidels, Ph.D. for helpful comments and insights on the parallel interactive models. We also thank Amanda Wolfe and Matt Primeau for assistance with data collection.
Appendix
A.1. Double-target processing
We model interactive parallel processing of double targets with two channels acting as simultaneous Poisson accumulators with some additional probability of information passing between the two channels. Fig. 10 provides a diagram of the two-channel system. Let Ui, i = 1, 2, be the random variable for the number of counts accumulated within a single channel i operating alone, and let Ui be distributed as a Poisson random variable with accumulation rate parameter λi, i = 1, 2, given by
Let γi, i = 1, 2, be the criterion for channel i, the number of counts that must accumulate for the channel to finish processing. Thus the model is defined over the γ1 γ2 state space. Channels are initialized at zero activation, denoted (0, 0); as processing proceeds, the density over the state space shifts toward the completion states.
Let Kj,i, i, j = 1, 2, i ≠ j be the random variable total amount of information shared from channel j to channel i. The probability that a single count is shared from channel j to channel i is distributed as a Bernoulli random variable:
The total amount of shared information, Kj,i, is distributed as a binomial distribution with
where kj,i and pj,i are defined as above. Note that when pj,i = pi,j = 0, the system is a standard, independent parallel processing model.
Let Xi, i = 1, 2, be the random variable representing the total activation (total number of counts) in a single channel. If the interchannel interactions are facilitatory, then kj,i(t) is added to the channel activation: Xi = Ui + Kj,i; if the interactions are inhibitory, then kj,i(t) is subtracted: Xi = Ui – Kj,i. Criterion γi acts as an absorbing state; so the channel will cease accumulating when xi(t) = γi. Thus, for a given channel i, its position in the state space at a given time t will be defined by 0 ≤ ui(t) ± kj,i(t) ≤ γi.
Responses of the whole system are made according to a predetermined stopping rule; here we implement the two canonical stopping rules possible within the global–local task: minimum-time (respond after processing global or local) and exhaustive (stop after processing both global and local). In the minimum-time (OR-gate) models, a response is made (the channels stop accumulating information and the state space is reset to (0, 0)) when any single channel reaches its criterion. In the exhaustive (AND-gate) models, both channels must reach their respective criteria before the system responds and the state space is reset to (0, 0).
Facilitatory exhaustive processing
Facilitatory exhaustive processing is modeled with two channels that mutually facilitate each other (i.e., xi = ui+kj,i), each of which must reach their criterion for the model to respond. Let Ti, i = 1, 2, be the channel completion time of channel i. We readily arrive at an analytic solution for the facilitatory exhaustive model, given by the joint cumulative distribution function shown in Box I. The four terms in the cumulative probability function give the probabilities for the four mutually exclusive ways the system might reach its exhaustive criteria. The first term is the probability of the system responding when both channels required counts from the other channel to reach their respective thresholds. The second term is the probability that Channel 1 required shared counts to reach its threshold and Channel 2 completed through the Poisson accumulator alone. Similarly, the third term reflects Channel 1 completing by the Poisson accumulator alone and Channel 2 requiring shared counts to reach threshold. Finally, the fourth term is the joint probability that both channels reach their respective thresholds by the Poisson processes without requiring shared counts.
Box I.
where u1, u2, λ1, λ2, γ1, γ2, p2,1, and p1,2 have been defined previously.
Facilitatory minimum-time processing
Facilitatory OR processing is modeled by the complement of the cumulative probability that neither channel has reached criterion at time t. We, again, readily arrive at an analytic formulation for facilitatory minimum-time processing as shown in Box II. The terms in this equation give first the probability that within-channel count ui(t) < γi, i = 1, 2, and second, the probability that the shared counts are not enough for the total to reach criterion, i.e., xi(t) = ui(t) + kj,i(t) < γi, i = 1, 2, i ≠ j.
Box II.
where u1, u2, λ1, λ2, γ1, γ2, p2,1 and p1,2 are defined as above.
Inhibitory processing
In defining the facilitatory models, we took advantage of the fact that the total activation in a channel is a monotonically increasing function of time. However, with inhibition it is possible for the total activation in a channel to decrease. To accommodate this behaviour, we utilize a different formalism: the Markov chain. See Diederich and Busemeyer (2003) for a tutorial on the use of Markov models of information accumulation.
Let Ui, Kj,i, Xi, and λi, i, j = 1, 2, i ≠ j, be defined as above, and let Δt be a sufficiently small increment of time. Note that we constrain λi and Δt such that P[ui(t + Δt) > 1] < 0.01 to bound the error on the Poisson approximation. Transitions in the two-dimensional state space are made as x1 and/or x2 increase or decrease; failure of both channels to increase or decrease their total activation in a single time step Δt results in a “stationary” transition. Let P[m, n] denote the transition probability such that, at time t + Δt, xi has changed by m and xj has changed by n. The transition probabilities for all possible transitions of the inhibitory AND Markov chain are defined, for i, j = 1, 2, i ≠ j, by
Note that P[0, 1], P[−1, 1] and P[−1, 0] are obtained by proper rearrangement of the subscripts i, j in the equations for P[1, 0], P[−1, 1] and P[0, −1], respectively.
To intuit the basis of these distributions, consider the possible ways the model would make a stationary transition (P[0, 0]). Either both channels gain a within-channel count (i.e., u1(t + Δt) = u1(t) + 1 and u2(t + Δt) = u2(t) + 1) and share that count, subtracting it from the other (i.e., k1,2(t + Δt) = k2,1(t + Δt) = −1) with probability (λiλj(Δt)2(1 – pi,j)(1 – pi,j)), or neither channel gains a within-channel count with probability (1 – λiΔt)(1 – λjΔt). As these are disjoint events, the probability of either occurring is just their sum as given in P[0, 0] above.
We assume xi(t) ≥ 0, i = 1, 2, for all t. Thus, if channel i shares an inhibitory count with channel j ≠ i, i.e., ki,j(t + Δt) = 1, when xj(t) = 0 and uj(t + Δt) = 0, then xj remains at 0. In order to ensure valid probability density functions over the rows of the transition matrix, we set the probability that x1 increases while x2 stays at zero to be P[1, 0]+P[1, −1]. Likewise, we set the probability that x2 increases while x1 stays at zero to be P[0, 1] + P[−1, 1].
Inhibitory exhaustive processing
Exhaustive processing requires that processing continues until both x1(t) = γ1 and x2(t) = γ2, which results in a single absorbing state (γ1, γ2) in the transition matrix. Assume that if at time tγ, xi(tγ) = γi, i = 1, 2, then for all t > tγ, xi(t) = γi. It follows that, for all remaining time steps, ki,j(t + Δt) = 0. Thus, if x2(t) = γ2 then the probability of x1 decreasing is 0 (k2,1(t + Δt) = 0), P[x1(t + Δt) = x1(t)] = P[0, 0] + P[0, 1] + P[−1, 1] = 1 – λ1Δt, and P[x1(t + Δt) = x1(t) + 1] = P[1, 0] + P[1, 1] + P[1, −1] = λ1Δt.
With the transition probabilities defined above and the absorbing state (γ1, γ2), we can state the model in terms of its transition matrix T. Table A.1 gives an example T for a system where γ1 = γ2 = 2. The transition matrix can be then partitioned into the transient [(γ1+1)(γ2+1)−1]×[(γ1+1)(γ2+1)−1] state matrix Q and the [(γ1+1)(γ2+1)−1]×1 absorbing vector R, such
Table A.1.
Activation state at time t | Activation state at time t + Δt |
||||||||
---|---|---|---|---|---|---|---|---|---|
(0, 0) | (0, 1) | (0, 2) | (1, 0) | (1, 1) | (1, 2) | (2, 0) | (2, 1) | (2, 2) | |
(0, 0) | P[0, 0] |
P[0, 1] + P[−1, 1] |
0 |
P[1,0] + P[1,−1] |
P[1, 1] | 0 | 0 | 0 | 0 |
(0, 1) | 0 | P[0, 0] |
P[0, 1] + P[−1, 1] |
P[1,−1] | P[1, 0] | P[1, 1] | 0 | 0 | 0 |
(0, 2) | 0 | 0 |
P[0, 0] + P[0, 1] + P[−1, 1] |
0 | 0 |
P[1, 0] + P[1,−1] + P[1, 1] |
0 | 0 | 0 |
(1, 0) | 0 | P[−1, 1] | 0 | P[0, 0] | P[0, 1] | 0 |
P[1, 0] + P[1, −1] |
P[1, 1] | 0 |
(1, 1) | 0 | 0 | P[−1, 1] | 0 | P[0, 0] | P[0, 1] | P[1, −1] | P[1, 0] | P[1, 1] |
(1, 2) | 0 | 0 | 0 | 0 | 0 |
P[0, 0] + P[0, 1] + P[−1, 1] |
0 | 0 |
P[1, 0] + P[1,−1] + P[1, 1] |
(2, 0) | 0 | 0 | 0 | 0 | 0 | 0 |
P[0, 0] + P[0, 1] + P[−1, 1] |
P[0, 1] + P[−1, 1] + P[1, 1] |
0 |
(2, 1) | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
P[0, 0] + P[0, 1] + P[−1, 1] |
P[0, 1] + P[−1, 1] + P[1, 1] |
(2, 2) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
Table A.2.
Activation state at time t | Activation state at time t + Δt |
||||||||
---|---|---|---|---|---|---|---|---|---|
(0, 0) | (0, 1) | (0, 2) | (1, 0) | (1, 1) | (1, 2) | (2, 0) | (2, 1) | (2, 2) | |
(0, 0) | P[0, 0] | P[0, 1] + P[−1, 1] | 0 | P[1, 0] + P[1,−1] | P[1, 1] | 0 | 0 | 0 | 0 |
(0, 1) | 0 | P[0, 0] | P[0, 1] + P[−1, 1] | P[1, −1] | P[1, 0] | P[1, 1] | 0 | 0 | 0 |
(0, 2) | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
(1, 0) | 0 | P[−1, 1] | 0 | P[0, 0] | P[0, 1] | 0 | P[1, 0] + P[1,−1] | P[1, 1] | 0 |
(1, 1) | 0 | 0 | P[−1, 1] | 0 | P[0, 0] | P[0, 1] | P[1,−1] | P[1, 0] | P[1, 1] |
(1, 2) | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
(2, 0) | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
(2, 1) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
(2, 2) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
that . R represents the transition probabilities from any state in which the model has not completed to an absorbing state, and Q represents the transitions between non-completion states in the Markov chain. Under the assumption of sufficiently small, equal, discrete time steps Δt, the cumulative distribution of finishing times is given by (Kemeny & Snell, 1976)
where N is the total number of transitions made in the Markov chain. Note that the cumulative distribution, given the initial activation state (0, 0), is the first entry of vector C(N).
Inhibitory minimum-time processing
The inhibitory minimum-time model finishes when either x1(t) = γ1 or x2(t) = γ2, resulting in multiple absorbing states. It follows that R is a [(γ1γ2)]×[(γ1 + γ2 + 1)] matrix with each column representing the probability of transitioning into a different completion state. Transient matrix Q has dimension [(γ1γ2)]×[(γ1γ2)]. An example transition matrix T for the γ1 = γ2 = 2 case is given in Table A.2.
Let C(N) be defined as above. Under the minimum-time stopping rule, the cumulative distribution of finish times is the sum across the row in the resulting matrix C(N) corresponding to the initial activation state (0, 0).
A.2. Single-target processing
We assume that processing of an uninformative distractor will either process independently or with inhibitory interactions with the target. We again implement our two-channel Markov chain. Let xi, i ∈ {1, 2}, be the total activation state of target channel i, with within-channel Poisson accumulation rate λi, probability pij of channel i sharing information with channel j, and target threshold γi. Note that λi, pij, and γi are properties of channel i, and are fixed regardless of the presence of a target or distractor in channel j. Again, let Δt be a sufficiently small increment of time.
Define dj, j ∈ {1, 2}, j ≠ i, to be the activation state of channel j when processing a distractor, with within-channel Poisson accumulation rate λdj and threshold dj. Let the probability of the distractor sharing information with target channel i be given by pdj. Although channel j can finish processing the distractor when dj(t) = γdj, the system can only respond when xi(t) = γi.
Let the transition matrix be defined analogously to the inhibitory exhaustive model above, but using transition probabilities calculated with λdj and pdj instead of λj and pji. The transient state matrix Q has dimension [(γi + 1)γdj × (γi + 1)γdj]. R is the [(γi + 1)γdj × (γdj + 1)] set of transition probability vectors from transient activation states to target completion activation states. C(N) is defined as above, and the cumulative distribution of total processing times for the initial activation state (0, 0) is the sum over the first row of C(N).
Note that, save for the error due to our approximation, this model reduces to a Poisson process on target channel i when either pdj = 0 or λdj = 0.
Footnotes
Note that the term “channel” is often used to refer to the subprocesses in a parallel system, and the term “stages” used to designate the subprocesses in a serial architecture. Here we use the term “channels” to refer to subprocesses in any system without bias to the architecture.
The SFT framework generalizes to any finite number n ≤ 2 of subprocesses or channels. For example, Fifić (2001) established the predictions and model patterns for exhaustive processing in systems of n = 3 and n = 4 channels within a memory search paradigm.
Let the random variable TI denote the processing time for Channel I. It can be shown that manipulating a factor to slow processing in a channel is equivalent to adding a random variable to the processing time of that channel. See Definition 8 of Townsend and Schweickert (1989) for more details.
There may be some cases in which selective influence operates at the marginal density function level, which Townsend and Schweickert (1989) referred to as marginal selective influence. However, Dzhafarov (2003) indicates that conditional independence of the marginal distributions may be the only way marginal selective influence can occur, and most processing systems will not deliver marginal selective influence.
More detailed descriptions of systems allow interdependencies to occur through their input explicitly rather than depicted through stochastic dependencies. These notions are closely related but their analysis goes beyond our present scope.
Survivor functions and means in general have a known intimate relationship: the integral of the survivor function for a distribution is equal to the mean of that distribution. It follows from this relationship that ∫t SIC(t)dt = MIC (see Townsend and Nozawa (1995), Appendix for more details).
Note further that , which can easily be directly estimated from survivor function, S(t), of the empirical RT CDF. Empirical estimates of H(t) are less noisy than estimates of h(t), which require estimates of both the empirical density f(t) and the empirical survivor function.
An important distinction must be made here between the summing of two integrated hazard functions, as in the C(t) ratio, e.g., HA(t) + HB(t), and the integrated hazard function that is appropriate for the summing of two random strings of input, as occurs in a true coactive model. Suppose the two, independent stochastic processes associated with channels A, B, are denoted XA(t) and XB(t). Then the hazard function for the coactive process must be computed from the density, etc., of XA(t) + XB(t). As noted, with the conspicuous exception of exponentially distributed random variables, the hazard function for the latter will not, in general, be the same as the former.
DMDX is available online at http://www.u.arizona.edu/~kforster/dmdx/dmdx.htm.
We note the important hierarchy of statistical inference (Townsend, 1990), according to which statistically significant differences at the level of the CDF imply significant differences at the level of the mean (i.e., significant t-tests, F -test, etc).
Fine-grained examination of the survivor functions did show some crossings for very late times, corresponding to the slowest 5% or less of the data. However, the limited number of such slow RTs makes survivor function estimation difficult in this range, and we cannot conclude that these are meaningful crossovers.
We stress that this statement is based on testing capacity with the OR capacity statistic, as we have done here. If we instead used the capacity coefficient geared for AND (i.e., exhaustive = conjunctive) processing, (see Townsend & Wenger, 2004), facilitatory interactions would produce a capacity value greater than 1.
Within the current version of the PPIMs, we did not simulate error trials for any conditions, reflecting the near-perfect performance of our participants. For single targets, the system continued to process target information until the target channel reached its threshold for a correct response. In the double-target conditions, in which there were only target channels operating, the system did not terminate prematurely, but waited until the appropriate number of channels (according to the stopping rule) reached threshold to respond. Importantly, note that the addition of non-target channels to these models allows for the possibility in future applications of generating error responses and, subsequently, error RT distributions, which will enable a full comparison of PPIMs with other classes of stochastic dynamic systems models (e.g. Link & Heath, 1975; Ratcliff, 1978; Townsend & Ashby, 1983)
References
- American Psychiatric Association . Diagnostic and statistical manual of mental disorders. 4th ed. American Psychiatric Association; Washington, DC: 2000. Text revision. [Google Scholar]
- Ashby FG, Townsend JT. Decomposing the reaction time distribution: Pure insertion and selective influence revisited. Journal of Mathematical Psychology. 1980;21:93–123. [Google Scholar]
- Baron-Cohen S, Hoekstra RA, Knickmeyer R, Wheelwright S. The autism-spectrum quotient (AQ) – Adolescent version. Journal of Autism and Developmental Disorders. 2006;36(3):343–350. doi: 10.1007/s10803-006-0073-6. [DOI] [PubMed] [Google Scholar]
- Baron-Cohen S, Wheelwright S, Skinner R, Martin J, Clubley E. The autism-spectrum quotient (AQ): Evidence from Asperger syndrome/high-functioning autism, males and females, scientists and mathematicians. Journal of Autism and Developmental Disorders. 2001;31(1):5–17. doi: 10.1023/a:1005653411471. [DOI] [PubMed] [Google Scholar]
- Blaha LM, Johnson SA, Townsend JT. An information processing investigation of hierarchical form perception: Evidence for parallel processing. Visual Cognition: Object Perception, Attention and Memory (OPAM) 2006 Conference Report. 2007;15(1):73–77. [Google Scholar]
- Brian JA, Tipper SP, Weaver B, Bryson SE. Inhibitory mechanisms in autism spectrum disorders: Typical selective inhibition of location versus facilitated perceptual processing. Journal of Child Psychology and Psychiatry. 2003;44(4):552–560. doi: 10.1111/1469-7610.00144. [DOI] [PubMed] [Google Scholar]
- Diederich A, Busemeyer JR. Simple matrix methods for analyzing diffusion models of choice probability, choice response time and simple response time. Journal of Mathematical Psychology. 2003;47:304–322. [Google Scholar]
- Donders FC. On the speed of mental processes. Acta Psychologica. 1969;30:412–431. doi: 10.1016/0001-6918(69)90065-1. Translation of Donders’ original paper Over de snelheid van psychische processen which appeared in Onderzoekingen gedaan in het Physiologisch Laboratorium der Utrechtsche Hoogeschool. pp. 1868–1869. Tweede reeks. II:92–120.
- Dzhafarov EN. Process representations and decompositions of response times. In: Marley AAJ, editor. Choice, decision and measurement: Essays in honor of R. Duncan Luce. Lawrence Erlbaum Associates; Mahwah, NJ: 1997. pp. 255–278. [Google Scholar]
- Dzhafarov EN. Selective influence through conditional independence. Psychometrika. 2003;68(1):7–25. [Google Scholar]
- Eidels A, Houpt JW, Pei L, Altieri N, Townsend JT. Nice guys finish fast and bad guys finish last: Facilitatory vs. inhibitory interaction in parallel systems. (in preparation) [DOI] [PMC free article] [PubMed]
- Eidels A, Townsend JT, Pomerantz JR. Where similarity beats redundancy: The importance of context, higher-order similarity, and response assignment. Journal of Experimental Psychology: Human Perception and Performance. 2008;34(6):1441–1463. doi: 10.1037/a0012857. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fifić M. Multiple-channel processing in short-term memory. Paper presented at the cognitive lunch series; Bloomington, IN. Indiana University; 2001. [Google Scholar]
- Fifić M. Emerging holistic properties at face value: Assessing characteristics of face perception. Doctoral dissertation. Indiana University; Bloomington, IN: 2006. (unpublished) [Google Scholar]
- Fifić M, Nosofsky RM, Townsend JT. Information-processing architectures in multidimensional classification: A validation test of the Systems Factorial Technology. Journal of Experimental Psychology: Human Perception and Performance. 2008;34(2):356–375. doi: 10.1037/0096-1523.34.2.356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frith U. Autism: Explaining the enigma. Blackwell; Oxford: 1989. [Google Scholar]
- Frith U, Happé F. Autism: Beyond theory of mind. Cognition. 1994;50:115–132. doi: 10.1016/0010-0277(94)90024-8. [DOI] [PubMed] [Google Scholar]
- Garner WR. The processing of information and structure. Lawrence Erlbaum, Associates; Potomac, Maryland: 1974. [Google Scholar]
- Happé F, Booth RDL. The power of the positive: Revisiting weak coherence in autism spectrum disorders. The Quarterly Journal of Experimental Psychology. 2008;61(1):50–63. doi: 10.1080/17470210701508731. [DOI] [PubMed] [Google Scholar]
- Happé F, Frith U. The weak coherence account: Detail-focused cognitive style in autism spectrum disorders. Journal of Autism and Developmental Disorders. 2006;36(1):5–25. doi: 10.1007/s10803-005-0039-0. [DOI] [PubMed] [Google Scholar]
- Happé F, Ronald A. The fractionable autism triad: A review of evidence from behavioural, genetic, cognitive and neural research. Neuropsychology Review. 2008;18(4):287–304. doi: 10.1007/s11065-008-9076-8. [DOI] [PubMed] [Google Scholar]
- Ingvalson EM, Wenger MJ. A strong test of the dual mode hypothesis. Perception & Psychophysics. 2005;67:14–35. doi: 10.3758/bf03195010. [DOI] [PubMed] [Google Scholar]
- Innes-Kerr AHK. Gestalt perception of emotional expressions. Doctoral dissertation. Indiana University; Bloomington, IN: 2003. (unpublished) [Google Scholar]
- Jarrold C, Russell J. Counting abilities in autism: Possible implications for central coherence theory. Journal of Autism and Developmental Disorders. 1997;27(1):25–37. doi: 10.1023/a:1025817121137. [DOI] [PubMed] [Google Scholar]
- Kemeny JG, Snell JL. Finite Markov chains. Springer-Verlag; New York: 1976. [Google Scholar]
- Kimchi R. Primacy of wholistic processing and global/local paradigm: A critical review. Psychological Bulletin. 1992;112(1):24–38. doi: 10.1037/0033-2909.112.1.24. [DOI] [PubMed] [Google Scholar]
- Link SW, Heath RA. A sequential theory of psychological discrimination. Psychometrika. 1975;40(1):77–105. [Google Scholar]
- Lord C, Rutter M, Le Couteur A. Autism diagnostic interview-revised: A revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders. Journal of Autism and Developmental Disorders. 1994;24:659–686. doi: 10.1007/BF02172145. [DOI] [PubMed] [Google Scholar]
- Miller J. Divided attention: Evidence for coactivation with redundant signals. Cognitive Psychology. 1982;14(2):247–279. doi: 10.1016/0010-0285(82)90010-x. [DOI] [PubMed] [Google Scholar]
- Mottron L, Belleville S. A study of perceptual analysis in a high-level autistic subject with exceptional graphic abilities. Brain and Cognition. 1993;23:279–309. doi: 10.1006/brcg.1993.1060. [DOI] [PubMed] [Google Scholar]
- Mottron L, Burack JA, Iarocci G, Belleville S, Enns JT. Locally oriented perception with intact global processing among adolescents with high-functioning autism: Evidence from multiple paradigms. Journal of Child Psychology and Psychiatry. 2003;44(6):904–913. doi: 10.1111/1469-7610.00174. [DOI] [PubMed] [Google Scholar]
- Navon D. Forest before trees: The precedence of global features in visual perception. Cognitive Psychology. 1977;9:353–383. [Google Scholar]
- Navon D. The forest revisited: More on global precedence. Psychological Research. 1981;43:1–32. [Google Scholar]
- Neufeld WJ, Townsend JT, Jetté J. Quantitative response time technology for measuring cognitive-processing capacity in clinical studies. In: Neufeld WJ, editor. Advances in clinical cognitive sciences: Formal modeling of processes and symptoms. American Psychological Association; Washington, DC: 2007. [Google Scholar]
- O’Riordan M, Plaisted K. Superior visual search in autism. Journal of Experimental Psychology: Human Perception and Performance. 2001;27(3):719–730. doi: 10.1037//0096-1523.27.3.719. [DOI] [PubMed] [Google Scholar]
- Ozonoff S, Strayer DL, McMahon WM, Filloux F. Executive function abilities in autism and Tourette syndrome: An information processing approach. Journal of Child Psychology and Psychiatry. 1994;35(6):1015–1032. doi: 10.1111/j.1469-7610.1994.tb01807.x. [DOI] [PubMed] [Google Scholar]
- Plaisted K, Dobler V, Bell S, Davis G. The microgenesis of global perception in autism. Journal of Autism and Developmental Disorders. 2006;36(1):107–116. doi: 10.1007/s10803-005-0047-0. [DOI] [PubMed] [Google Scholar]
- Plaisted K, O’Riordan M, Baron-Cohen S. Enhanced discrimination of novel, highly similar stimuli by adults with autism during a perceptual learning task. Journal of Child Psychology and Psychiatry, and Allied Disciplines. 1998;39(5):765–775. [PubMed] [Google Scholar]
- Plaisted K, Swettenham J, Rees L. Children with autism show local precedence in a divided attention task and global precedence in a selective attention task. Journal of Child Psychology and Psychiatry, and Allied Disciplines. 1999;40(5):733–742. [PubMed] [Google Scholar]
- Ratcliff R. A theory of memory retrieval. Psychological Review. 1978;85(2):59–108. [Google Scholar]
- Rinehart NJ, Bradshaw JL, Moss SA, Brereton AV, Tonge BJ. A deficit in shifting attention present in high-functioning autism but not Asperger’s disorder. Autism. 2001;5(1):67–80. doi: 10.1177/1362361301005001007. [DOI] [PubMed] [Google Scholar]
- Rinehart NJ, Bradshaw JL, Simon AM, Brereton AV, Tonge BJ. Atypical interference of local detail on global processing in high-functioning autism and Asperger’s disorder. Journal of Child Psychology, Psychiatry, and Allied Disciplines. 2000;41(6):769–778. [PubMed] [Google Scholar]
- Rutherford MD, Richards ED, Moldes V, Sekuler AB. Evidence of a divided-attention advantage in autism. Cognitive Neuropsychology. 2007;24(5):505–515. doi: 10.1080/02643290701508224. [DOI] [PubMed] [Google Scholar]
- Schweickert R. A critical path generalization of the additive factor method: Analysis of a Stroop task. Journal of Mathematical Psychology. 1978;18(2):105–139. [Google Scholar]
- Schweickert R, Giorgini M. Response time distributions: Some simple effects of factors selectively influencing mental processes. Psychonomic Bulletin & Review. 1999;6(2):269–288. doi: 10.3758/bf03212330. [DOI] [PubMed] [Google Scholar]
- Schweickert R, Giorgini M, Dzhafarov E. Selective influence and response time cumulate distribution functions in serial–parallel task networks. Journal of Mathematical Psychology. 2000;44:504–535. doi: 10.1006/jmps.1999.1268. [DOI] [PubMed] [Google Scholar]
- Shah A, Frith U. An islet of ability in autistic children: A research note. Journal of Child Psychology and Psychiatry. 1983;34:1351–1364. doi: 10.1111/j.1469-7610.1983.tb00137.x. [DOI] [PubMed] [Google Scholar]
- Sternberg S. Memory-scanning: Mental processes revealed by reaction–time experiments. American Scientist. 1969;57(4):421–457. [PubMed] [Google Scholar]
- Szatmari P, Maziade M, Zwaigenbaum L, Mérette C, Roy M, Joober R, et al. Informative phenotypes for genetic studies of psychiatric disorders. American Journal of Medical Genetics, Part B: Neuropsychiatric Genetics. 2007;144B:581–588. doi: 10.1002/ajmg.b.30426. [DOI] [PubMed] [Google Scholar]
- Townsend JT. Some results concerning the identifiability of parallel and serial processes. Journal of Mathematical and Statistical Psychology. 1972;25:168–199. [Google Scholar]
- Townsend JT. Issues and models concerning the processing of a finite number of inputs. In: Kantowitz BH, editor. Human information processing: Tutorials in performance and cognition. Erlbaum Press; Hillsdale, NJ: 1974. pp. 133–168. [Google Scholar]
- Townsend JT. Uncovering mental processes with factorial experiments. Journal of Mathematical Psychology. 1984;28(4):363–400. [Google Scholar]
- Townsend JT. Truth and consequences of ordinal differences in statistical distributions: Toward a theory of hierarchical inference. Psychological Bulletin. 1990;108(3):551–567. doi: 10.1037/0033-2909.108.3.551. [DOI] [PubMed] [Google Scholar]
- Townsend JT, Ashby FG. Methods of modeling capacity in simple processing systems. In: Castellan JN, Restle F, editors. Cognitive theory. Lawrence Erlbaum Associates; Hillsdale, NJ: 1978. [Google Scholar]
- Townsend JT, Ashby FG. Stochastic modeling of elementary psychological processes. Cambridge University Press; London: 1983. [Google Scholar]
- Townsend JT, Fifić M. Parallel and serial processing and individual differences in high-speed scanning in human memory. Perception and Psychophysics. 2004;66:953–962. doi: 10.3758/bf03194987. [DOI] [PubMed] [Google Scholar]
- Townsend JT, Fifić M, Neufeld WJ. Assessment of mental architecture in clinical/cognitive research. In: Treat BRRTA, Baker TB, editors. Psychological clinical science: Papers in honor of Richard M. McFall. Lawrence Erlbaum Associates; Mahwah, NJ: 2007. [Google Scholar]
- Townsend JT, Nozawa G. Spatio-temporal properties of elementary perception: An investigation of parallel, serial, and coactive theories. Journal of Mathematical Psychology. 1995;39(4):321–359. [Google Scholar]
- Townsend JT, Schweickert R. Toward the trichotomy method: Laying the foundation of stochastic mental networks. Journal of Mathematical Psychology. 1989;33:309–327. [Google Scholar]
- Townsend JT, Thomas RD. Stochastic dependencies in parallel and serial models: Effects on systems factorial interactions. Journal of Mathematical Psychology. 1994;38(1):1–34. [Google Scholar]
- Townsend JT, Wenger MJ. A theory of interactive parallel processing: New capacity measures and predictions for a response time inequality series. Psychological Review. 2004;111(4):1003–1035. doi: 10.1037/0033-295X.111.4.1003. [DOI] [PubMed] [Google Scholar]
- Wechsler D. Wechsler abbreviated scale of intelligence. The Psychological Corporation; San Antonio, TX: 1999. [Google Scholar]
- Wenger MJ, Townsend JT. Basic response time tools for studying general processing capacity in attention, perception, and cognition. Journal of General Psychology. 2000;127(1):67–99. doi: 10.1080/00221300009598571. [DOI] [PubMed] [Google Scholar]
- Wenger MJ, Townsend JT. Computational, geometric, and process perspectives on facial cognition: Contexts and challenges. Lawrence Erlbaum Associates; Mahwah, NJ: 2001. [Google Scholar]