Why do we miss rare targets? Exploring the boundaries of the low prevalence effect

Anina N Rich; Melina A Kunar; Michael J Van Wert; Barbara Hidalgo-Sotelo; Todd S Horowitz; Jeremy M Wolfe

doi:10.1167/8.15.15

. Author manuscript; available in PMC: 2011 Apr 1.

Published in final edited form as: J Vis. 2008 Nov 24;8(15):15.1–1517. doi: 10.1167/8.15.15

Why do we miss rare targets? Exploring the boundaries of the low prevalence effect

Anina N Rich ¹, Melina A Kunar ², Michael J Van Wert ³, Barbara Hidalgo-Sotelo ⁴, Todd S Horowitz ⁵, Jeremy M Wolfe ⁶

PMCID: PMC3069706 NIHMSID: NIHMS279570 PMID: 19146299

Abstract

Observers tend to miss a disproportionate number of targets in visual search tasks with rare targets. This ‘prevalence effect’ may have practical significance since many screening tasks (e.g., airport security, medical screening) are low prevalence searches. It may also shed light on the rules used to terminate search when a target is not found. Here, we use perceptually simple stimuli to explore the sources of this effect. Experiment 1 shows a prevalence effect in inefficient spatial configuration search. Experiment 2 demonstrates this effect occurs even in a highly efficient feature search. However, the two prevalence effects differ. In spatial configuration search, misses seem to result from ending the search prematurely, while in feature search, they seem due to response errors. In Experiment 3, a minimum delay before response eliminated the prevalence effect for feature but not spatial configuration search. In Experiment 4, a target was present on each trial in either two (2AFC) or four (4AFC) orientations. With only two response alternatives, low prevalence produced elevated errors. Providing four response alternatives eliminated this effect. Low target prevalence puts searchers under pressure that tends to increase miss errors. We conclude that the specific source of those errors depends on the nature of the search.

Keywords: attention, eye movements, search, target prevalence, feature search, conjunction search

Introduction

Visual search is an important component of everyday life. We look for keys, for cups, for a particular book on the shelf. As a general rule, we tend to look for things that are fairly common (e.g., milk in the fridge). When we look for things that are rare (e.g., caviar in the fridge), we can abandon the search fairly rapidly because the probability of success is low and the stakes are not particularly high. There are some searches, however, that combine low target prevalence with very high stakes. For example, medical screening tasks like mammography or cytopathology screening (‘Pap tests’) are critically important searches for targets that are only rarely present (typically under 1% of cases; Fenton et al., 2007; Gur et al., 2004; Smith & Turnbull, 1997). Similarly, in airport baggage screening, a serious threat in a bag is a highly unusual occurrence (Rubenstein, 2001). Nevertheless, missing these targets can have serious consequences.

Wolfe, Horowitz, and Kenner (2005) demonstrated that when a target is rare, participants are surprisingly poor at detecting it in complex visual displays. Their displays consisted of overlapping, semi-transparent grayscale photographs on a noise background. When the target (a tool) was present on 50% of trials, participants made few ‘miss’ errors (a proportion of 0.07 of target-present trials). In contrast, when the target was only present on 1% of trials, error rates increased dramatically (0.30 of target-present trials). In addition, reaction times (RTs) for target-absent responses were much faster at 1% than at 50% prevalence. At 1%, mean RT for correct target-absent trials was actually less than mean RT for correct target-present trials, a highly unusual pattern for visual search.

This result might suggest a simple speed-accuracy trade-off, but subsequent work points to a more complicated story. The task of Wolfe et al. (2005) produced very few false alarm errors. More recent experiments have used a more difficult task with simulated x-ray images of luggage (Wolfe et al., 2007). In this task, participants made false alarms as well as miss errors, allowing computation of signal detection performance measures. Wolfe et al. (2007) found that the dramatic increase in miss errors at low prevalence was accompanied by a decrease in the false alarm rate. In signal detection terms, this corresponded to a shift in criterion without a significant change in sensitivity (d′), whereas a straightforward speed-accuracy tradeoff would result in a loss of sensitivity as speed increased. This result is consistent with other work in the decision literature showing robust shifts in criterion with changes in the probability of one choice over another (Healy & Kubovy, 1981; Maddox, 2002).

Alternatively, observers might simply make the wrong response. In a low prevalence task of the sort described here, observers will make the same target-absent response on the vast majority of trials. The resulting response priming (Bertelson & Tisseyre, 1966) might cause observers to press the target-absent key when they meant to press the target-present key. Fleck and Mitroff (2007) and Li et al. (submitted) used tasks similar to the original Wolfe et al. (2005) task and found that the excess miss errors at low prevalence could be largely eliminated if observers were forced to slow down or permitted to correct their errors. Such correctable errors might occur either because the observer responded with the wrong motor act or because the observer decided on a target-absent response and then subsequently detected the target too late to withhold the response. In other studies, however, neither slowing participants down (Wolfe et al., 2007) nor permitting correction (Van Wert, Wolfe, & Horowitz, in press) eliminated the prevalence effect in a more difficult baggage screening task.

The diversity of low prevalence results reflects the fact that search termination is a complex process when a target is not found. How do you know when to quit? Miss errors occur when a search task is terminated before the target is found. From a signal detection vantage point, rather than being a single decision, a search task is really a sequence of decisions. For each item, the observer must decide whether or not it is a target. If the decision is ‘target’, then the observer can produce a target-present response. If not, the observer must make a second decision to continue searching or to terminate the search with a target-absent response.

It has proven difficult to produce a fully satisfying model of the decision to terminate an unsuccessful search with an absent response (Chun & Wolfe, 1996; Hong, 2005; Zenger & Fahle, 1997). The data suggest that participants engage in an adaptive process that leads them to terminate the next search sooner after a successful search, and later after an error. It is not clear what is being adjusted in this process. It could be search time (modulated by some estimate of set size), or the proportion of items examined during search. A threshold might be set in terms of some internal measure of salience or activation (i.e., examine all items above some criterion level of interest). It may be that several of these adjustments are made simultaneously. At low prevalence, the input to this adaptive process will be quite extreme. The vast majority of responses will be target-absent responses and most of these would be correct even if the observer did not bother to examine the display. This suggests that any search task will show effects of low prevalence.

In the present paper, we employ much simpler search stimuli than those used in the studies cited above. We show that the effects of prevalence are indeed ubiquitous, elevating miss errors in even the simplest search tasks, and that, as in the work of Fleck and Mitroff and Li et al., the manifestation of the prevalence effect differs depending on the nature of the search. Of course, the simple stimuli used here are far removed from the real-world search tasks that are an important motivation for this line of work. However, these well-controlled stimuli are useful to uncover the fundamental processes that govern search termination on target-absent trials.

In Experiment 1, observers searched for a T among Ls. This spatial configuration search is typically inefficient, with target-present RTs increasing about 20–30 ms for each additional distractor (e.g., Treisman & Gelade, 1980; Wolfe & DiMase, 2003). Nevertheless, once an item is attended, it is trivial to distinguish a T from an L. Such tasks produce very few false alarm errors. In Experiment 2, the task was a feature search for a horizontal line target among vertical line distractors. Here, there were virtually no false alarms and the task does not require much search—in fact, it is effectively a ‘pop-out’ target detection task. To anticipate our findings, we replicated the prevalence effect with these simple, easily discriminable stimuli. Both spatial configuration search (Experiment 1) and feature search (Experiment 2) produced higher miss error rates at low (2%) target prevalence than at high (50%) target prevalence.

Although the first two experiments both showed an effect of target prevalence, the pattern of results differed, suggesting different sources of the effect. Eye movement data in Experiment 1 are consistent with the idea that most of the additional low prevalence errors during spatial configuration search were produced when participants stopped the search early, before fixating the target. Relatively few appeared to be motor errors, where the participants fixated the target but inadvertently pressed the wrong button. In contrast, the additional low-prevalence errors in feature search seemed more likely to be due to motor response errors, since the search component is trivial in this task. This account was tested directly in Experiment 3, where we introduced a minimum duration before participants were allowed to respond. This should primarily reduce motor errors. Consistent with our hypothesis, an enforced delay eliminated the prevalence effect in feature search, but not spatial configuration search.

Finally, in Experiment 4, we show that it is not the simple prevalence of the rare item that determines the rate of miss errors for that target. The target, now present on every trial, was a T which could be upright, rotated 90° to the left or the right of vertical, or inverted. Participants were asked to identify the orientation of the target. When the rare orientation appeared on only 2% of trials, and a single alternative orientation was present on the other 98% of trials, there was a prevalence effect. However, when the rare orientation was presented among three equally common alternative orientations, there was no effect of the prevalence manipulation indicating an important role for the nature of the response, at least when the search task is perceptually simple.

The prevalence effect is not peculiar to perceptually challenging X-ray-like stimuli. Low target prevalence puts pressure on all types of search tasks, changing search behavior in several ways. The relative contribution of these factors differs depending on the structure of the task and the nature of the stimuli.

Experiment 1: Prevalence effects in spatial configuration search

In Fleck and Mitroff (2007) and Wolfe et al. (2005, 2007), the search stimuli were heterogeneous complex objects. In this first experiment, we explore whether the prevalence effect is a general property of visual search, or if it can only be observed when perceptual decisions about each item are relatively difficult. The aim of Experiment 1 was to determine whether the prevalence effect occurs when search is ‘inefficient’, apparently requiring attention to each item in turn, but where the perceptual difficulty of the target/distractor decision is minimized. Participants searched for a T among offset Ls (Figure 1). Once attended, a capital letter T is easily distinguished from an L, even when the lines of the L are slightly offset. Without focal attention, however, the difference in the spatial configuration of the line segments is not available to guide attentional deployment, making search for a T among Ls inefficient (e.g., Treisman & Gelade, 1980; Wolfe & DiMase, 2003). Here, we tested whether this perceptually simple task still produced a prevalence effect when observers had to search for a T that was present on only a small fraction of trials.

Sample target-present display for Experiment 1. Participants searched for a T in any orientation among offset L distractors, forming a difficult spatial configuration search.

We also measured eye position, making it possible to address a number of questions about the pattern of search under different prevalence conditions. First, by measuring number and duration of fixations, we can ascertain if faster response times under low prevalence conditions are due to a reduction in the number of items scanned, or in the time spent on each item. Second, by assessing the spatial distribution of fixations, we can ask if participants are searching the whole display or restricting themselves to a brief search around fixation. Finally, we can ask if, and when, the participants fixated the target item.

Methods

Participants

There were 10 participants (Mean age: 24 years; SD: 6.2), all with at least 20/25 visual acuity (with corrective lenses if necessary). All participants gave informed consent and were paid $15/hour.

Apparatus

Displays were presented on a 16″ Trinitron-Dell CRT monitor running at a refresh rate of 100 Hz, controlled by a Dell Optiplex GX270 Intel (R); Pentium (R) 4CPU 2.40 GHz computer. Two-dimensional movements of the right eye were recorded by an ISCAN RK-464 video-based eyetracker sampling at 240 Hz. A chin rest was used to stabilize head position. An initial calibration was performed prior to the beginning of each experimental session to ensure tracking accuracy within 0.5 degrees of visual angle. We used Matlab 5.2 and the Psychophysics Toolbox-2 (Brainard, 1997; Pelli, 1997) to control stimulus presentation and data collection.

Stimuli and procedure

The stimuli were white (42 cd/m-sq) rotated Ts and offset Ls of visual angle 1.0° × 1.0° presented on a mid-gray background (19 cd/m-sq; Figure 1). The stroke width was 0.23 deg. Viewing distance was 75 cm. Stimuli were presented on an imaginary 5 × 5 grid within which each cell was 150 pixels wide × 120 pixels high. Each stimulus was centered within the cell and then randomly jittered ±10 pixels.

Each trial began with a central fixation cross. When the participant had maintained fixation for 500 ms, the search display appeared, and remained until response. Within a block, set sizes of 6 or 12 items were randomly interleaved. A single target T in one of four possible orientations (upright, 90° to the left or the right of vertical, or inverted) was present on either 2% (low prevalence blocks) or 50% (high prevalence blocks) of trials. The remaining stimuli were offset Ls, also presented in the four possible orientations. Participants gave a present/absent judgment using two keys on the keyboard. Participants were informed as to whether targets would be common or rare and were given feedback regarding accuracy after each trial. In the low prevalence condition, it was emphasized that although the targets were rare, it was very important to detect them.

Each participant completed 2200 trials over two sessions. Following initial calibration, each session started with a high prevalence block of 20 practice trials. In Session 1, participants then completed a high prevalence block of 160 trials and a low prevalence block of 1000 trials (with optional breaks every 100 trials). The order of blocks was counterbalanced across participants. In Session 2, participants completed the high prevalence practice block followed by a low prevalence block of 1000 trials (again with optional breaks every 100 trials).

Eye-tracking analyses

Eye movement analyses were performed on smoothed eye position data, averaging the raw data (sampled at 240 Hz) within a moving window of 8 data points (33 ms). The beginning and end positions of saccades were detected using an algorithm implementing an acceleration criterion (Araujo, Kowler, & Pavel, 2001). Specifically, the velocity was calculated for two overlapping 17 ms intervals; the onset of the second interval was 4.17 ms after the onset of the first. The acceleration threshold was set at a velocity change of 6°/s between the two intervals. Saccade onset was defined as the time when acceleration exceeded the threshold, and saccade termination was defined as the time when acceleration dropped below the threshold. Fixations were defined as the periods between successive saccades, with a supplemental threshold criterion of minimum 50 ms fixation duration. Successive saccades that occurred within 50 ms of each other were considered to be a continuous saccade.

Results

Data from one participant had to be excluded due to poor eye-tracking calibration. We removed trials on which the duration of the initial fixation was less than 50 ms and trials on which there were more than 4 instances where the eye-tracker ‘lost’ the eye. This resulted in the exclusion of 0.08 (proportion) of high prevalence and 0.07 of low prevalence trials. We also discarded as outliers trials on which RTs were less than 100 ms or greater than 4000 ms. This resulted in the exclusion of a further 0.04 of high prevalence and 0.001 of low prevalence trials. We used arc-sine transformed accuracy data (y′= arcsin(sqrt(y))) in all statistical analyses to compensate for unequal variances present in binomial data (Hogg & Craig, 1995). Calculation of mean slope for all experiments was based on the group average of individual participant slopes.

Figure 2a shows the mean error rates for low and high prevalence trials. There were far more miss errors under low than high prevalence conditions. This was confirmed by a repeated-measures ANOVA on arcsine-transformed error data for target-present trials (‘misses’) with the factors of Prevalence (low, high), and Set Size (6, 12). There was a significant effect of prevalence (F(1,8) = 34.2, p < 0.001, partial eta² = 0.81), but no effect of set size (F(1,8) < 1, n.s.) and no interaction (F(1,8) < 1, n.s.). False alarm rates were too low for sensible analysis, constituting just 4 of 1285 trials at high prevalence and 12 out of 16656 trials at low prevalence.

Results from Experiment 1. (a) Mean error rate (proportion), (b) mean RT (ms), (c) mean number of fixations, and (d) mean fixation duration (ms), for correct and incorrect responses for each of the prevalence and target presence conditions, plotted by set size. Error bars represent within-subjects 95% confidence intervals. Note the fixation data are from search fixations only (excludes the initial central fixation for all conditions and the target fixation for target-present trials). HP = high (50%) prevalence, LP = low (2%) prevalence.

Figure 2b shows the mean RT for the two prevalence conditions. When targets were rare, participants were faster to respond on target-absent trials than when targets were frequent. The pattern evident in the figure was confirmed by a repeated-measures ANOVA with the factors of Prevalence (low, high), Target Presence (present, absent) and Set Size (6, 12). Although the three-way interaction did not reach significance (F(1,8) = 4.6, p = 0.064, partial eta² = 0.37), all two-way interactions were significant (Prevalence and Target Presence, F(1,8) = 72.5, p < 0.01, partial eta² = 0.90; Prevalence and Set Size, F(1,8) = 30.0, p < 0.001, partial eta² = 0.79; Target Presence and Set Size, F(1,8) = 52.4, p < 0.001, partial eta² = 0.87).

Post-hoc pairwise comparisons demonstrated that whereas there was no effect of prevalence on target-present RT (p > 0.1), participants were faster under low than high prevalence conditions on target-absent trials (p < 0.001). Further, while we observed the typical pattern of slower target-absent than target-present responses at high prevalence (p < 0.001), there was no difference between these conditions at low prevalence (p > 0.1). There were significant effects of set size on RT for both prevalence conditions and for both target-present and target-absent trials (all p < 0.01). Correct target-present slopes of the RT × Set Size functions were 50 ms/item for low prevalence and 81 ms/item for high prevalence. Correct target-absent slopes were 107 ms/item for low prevalence and 168 ms/item for high prevalence.

There are only a small number of trials contributing to the miss RT data in Figure 2b, so these means should be interpreted with caution. With this caveat, we can see that RTs for trials on which participants missed the target are similar to correct target-absent responses under both prevalence conditions. Misses were slower than correct target-present responses for high prevalence but not for low prevalence. This was confirmed by a repeated-measures ANOVA with the factors of Prevalence (low, high), Accuracy (hit, miss) and Set-Size (6, 12). There was a significant interaction between Prevalence and Accuracy (F(1,8) = 9.5, p < 0.02). This interaction was due to a significant difference between hits and misses for high prevalence (p < 0.001) but not for low prevalence (p > 0.3). (Note: In this analysis the group mean was substituted for one missing data point because one subject had no errors in the high prevalence, set size 12 condition).

Figures 2c and 2d show mean fixation count and duration data for high and low prevalence conditions for correct (upper) and miss (lower) trials. The number of fixations on correct trials follows the pattern of RT. As in other work, the longer the RT, the more fixations (Hooge & Erkelens, 1998; Motter & Belky, 1998; Shen & Reingold, 1999; Zelinsky, 1996; Zelinsky & Sheinberg, 1997). This pattern was confirmed with a repeated-measures ANOVA with factors of Prevalence (low, high), Target Presence (present, absent) and Set Size (6, 12).

Following a significant three-way interaction (F(1,8) = 7.0, p < 0.05, partial eta² = 0.47), we performed simple main effects analyses on each prevalence condition separately. For high prevalence data, there was a significant Target Presence by Set Size interaction (F(1,8) = 44.5, p < 0.001, partial eta² = 0.85). Pairwise comparisons demonstrated that, as one would expect, there were more fixations in set size 12 than set size 6 for both Target Presence conditions (p < 0.001). There were also significantly more fixations on target-absent than target-present trials (p < 0.001).

In contrast, for low prevalence data, while there was also a significant Target Presence by Set Size interaction (F(1,8) = 25.0, p < 0.01, partial eta² = 0.76), and an effect of set size for both target-present and target-absent conditions (p < 0.01 and p < 0.001, respectively), there was no difference in the number of fixations on target-present and target-absent trials (p > 0.2). There were no effects of these factors on the duration of search fixations for either prevalence condition (Figure 2d; for Prevalence, Target Presence, and all interaction terms, F(1,8) < 1, n.s.; for Set Size, F(1,8) = 1.7, p > 0.2).

What hints can we get regarding the cause of miss errors by looking at the pattern of fixations? Given that participants were making fewer saccades before giving target-absent responses at low prevalence, they may be systematically failing to examine some portion of the display at low prevalence. The periphery is the most obvious candidate.

In Figure 3, fixations in low and high prevalence conditions are binned by distance from the initial central fixation. The distributions look similar, but a two-tailed Kolmogorov–Smirnoff test revealed a significant difference between the distributions (p < 0.001). This difference, however, does not seem to be a systematic neglect of the periphery (or any other area), as one might expect, leaving any theoretical significance unclear.

Spatial distribution of fixations relative to the initial central fixation, plotted separately for high (blue) and low (red) prevalence, and collapsed across the other conditions. HP = high (50%) prevalence, LP = low (2%) prevalence.

Fleck and Mitroff (2007) have proposed that the prevalence effect is the consequence of execution errors where participants successfully find the targets under low prevalence conditions, but then make the wrong response. They suggest that these errors are either because participants cannot override the frequent (and therefore pre-potent) target-absent response, or because they only see the target (either directly on the screen or in a lingering representation in iconic or sensory memory) after initiating the incorrect target-absent response. If participants do indeed see the target but push the wrong button, either accidentally due to the pre-potent target-absent response, or due to seeing the target too late (after initiation of an target-absent response), the last eye movement on miss trials should be on, or close to, the target. Although eye movements can be dissociated from the locus of attention, in free-viewing studies like this they give a good indication of the region of space to which attention is deployed (e.g., Hoffman, 1996; Moore & Fallah, 2004; Zelinsky, Rao, Hayhoe, & Ballard, 1997).

To investigate this we sorted the miss trials into three categories (see Figure 4):

Example scan paths from low prevalence miss trials classified into trials where (a) the target was never fixated, (b) the target was fixated during the trial, and (c) the target was fixated at the end of the trial. Red dots indicate fixations, green dots identify data points that were indicative of the fixation and yellow dots trace the saccade. The red box denotes the region defined as ‘landing on the target’.

Trials on which the eyes never landed on or near the target (Figure 4a).
Trials on which the eyes landed on the target and then went elsewhere before the end of the trial (Figure 4b).
Trials on which the eyes landed on the target as the last fixation prior to response (Figure 4c).

In each case, the 150 × 120 pixel cell of the 5 × 5 stimulus grid was used to define the location of the target. Thus, an eye movement counted as ‘landing on the target’ if there was a fixation within the target cell (denoted by the red square in Figure 4).

Although we cannot simply infer causation from a pattern of fixations, different types of miss errors may well correlate with different patterns of fixation. For example, on trials when an observer commits a miss error because she quit searching before finding the target, it is likely she never fixated that target. Of course, there are alternative explanations for such misses (e.g., the observer found the target without ever fixating it, and then produced a motor error). Nevertheless, if we accept that some accounts are more plausible than others, we can use the eye movement record as an indication of the source of elevated miss errors at low prevalence.

If participants miss more targets at low prevalence because they quit searching earlier, then we should see an increase in the proportion of miss trials where the eyes never reached the target. If participants required more evidence to classify a stimulus as a target at low prevalence (a criterion shift), this might lead to an increase in the proportion of miss errors when the participant fixated the target during the trial, then continued on to fixate other stimuli. Finally, if the misses at low prevalence are due primarily to execution errors (either accidentally pushing the wrong button or finding the target after initiating a target-absent response), we should see an increase in proportion of trials where the target is fixated at the end of the trial, but an incorrect, target-absent response is produced. Errors due to noticing a target in iconic memory after the response has been initiated may also fall in the latter category, as the ‘last fixation’ is measured relative to the response. It is not clear how else we might use fixation patterns to identify this class of errors.

If we look at the breakdown of errors across the different miss categories (Table 1), it is clear that the distribution differs between high and low prevalence. Although the low numbers of trials contributing to some of the categories mean we have to be cautious, a chi-square test using the high prevalence values as predictors showed a significant deviation under low prevalence conditions (χ²(2) = 182, p < 0.001). The final column of Table 1 highlights that the largest change for low prevalence is an increase in the number of miss trials on which the participant did not fixate the target, consistent with termination of search before attention was ever directed to the target. Very few additional miss errors occurred where participants fixated the target and failed to recognize it, providing evidence that the criterion for each individual stimulus decision has not shifted. Finally, there is a small increase in the misses on which the participant fixated the target and then responded target-absent, suggesting some contribution from execution errors.

Table 1.

Miss errors classified by fixations on target, summed across participants, collapsed across set-size.

Fixations on target	Proportion of total target-present trials (N misses)		Additional errors due to low prevalence (low–high; proportion)
Fixations on target	High prevalence	Low prevalence
None	0.06 (39)	0.26 (82)	0.20
During trial	0.04 (27)	0.05 (16)	0.01
End of trial	0.02 (13)	0.08 (25)	0.06

Open in a new tab

To further examine the source of the errors in this task, we sorted the target-present RTs for each observer at each set size for low and high prevalence into quartiles¹ (i.e., Quartile 1 has the fastest 25% of RTs, Quartile 4 the slowest 25% of RTs). There are not many trials in each cell, but by summing across observers, we can calculate proportion correct as a function of RT quartile, as shown in Figure 5a.

(a) Proportion of correct target-present trials as a function of RT quartile. (b) Difference in proportion correct target-present trials between high and low prevalence. Positive values indicate increased miss error rate at low prevalence. HP = high (50%) prevalence, LP = low (2%) prevalence.

The critical question for present purposes concerns the source of the extra errors in the low prevalence condition. The data in Figure 5a suggest that, at high prevalence, faster RTs are more accurate. This pattern is less clear at low prevalence, especially for set size 6. What does this suggest about the additional miss errors at low prevalence? A prime source of miss errors is the abandonment of search before a target is found. If we subtract the low prevalence accuracy from the high prevalence accuracy shown in Figure 5a, we obtain a plot of the change in accuracy as a function of quartile (Figure 5b). The effect of prevalence on this measure is greater in the earlier quartiles. If miss errors occur when observers incorrectly conclude that search has gone on long enough, these data suggest that observers come to that conclusion more rapidly when they are operating at low prevalence. Note that a similar pattern could occur if participants made motor errors more frequently for targets discovered quickly. However, the eye movement data do not support this hypothesis.

We will briefly consider one more question that can be addressed by these data. Figure 6 presents RTs for correct target-absent trials broken into 100 trial bins. The data demonstrate that the low prevalence effect develops rapidly (within 100 trials for set size 12: target-absent RTs faster in low prevalence than high prevalence, p < 0.05). Within 100 trials of the start of the second session (on another day), participants were responding more quickly to low prevalence target-absent trials than to high prevalence target-absent trials, despite a practice block of 20 high prevalence trials (set size 6: p < 0.01; set size 12: p < 0.001).

Mean RT for correct target-absent trials separated by set size in 100 trial bins. Error bars represent one standard error of the mean. Data from the high prevalence condition are shown in blue for comparison. HP = high (50%) prevalence, LP = low (2%) prevalence.

Discussion

This experiment demonstrates that a large reliable prevalence effect across a group of participants can be obtained with easily discriminable targets. Even slightly offset Ls are easily discriminable from Ts. Nevertheless, under the pressure of low prevalence, participants missed around 40% of the rare targets, and responded more rapidly on target-absent trials than under high prevalence conditions. Thus, the effect is not limited to the perceptually complex stimuli used in previous work.

There was no change in the duration of participants’ fixations, and only subtle changes in the area of the display searched under low prevalence. However, we did observe a dramatic decrease in the number of items fixated on low relative to high prevalence trials. The effect of low prevalence on RT was evident within 100 trials, and was quickly reinstated after a one-day break.

The RT patterns suggest that when the target is rare, participants search for a shorter time before concluding the target is absent. At high prevalence, RTs for misses and correct target-absent responses are longer than the mean time required to find a target. At low prevalence, the RTs for target-absent responses and misses are markedly reduced. Similarly, the eye-tracking data suggest that many miss errors at low prevalence were attributable to participants terminating the search without ever fixating the target. Additionally, there was some evidence for errors occurring at the response stage. In the next experiment, we removed the effortful search component from the task to isolate the effect of prevalence on the decision and response stage.

Experiment 2: Prevalence effects in feature search

Experiment 1 demonstrated that the prevalence effect can be observed with perceptually simple stimuli. While most misses did not seem to be simple motor errors, there was at least a small contribution from errors of this sort. In Experiment 2, we look for a prevalence effect with the simplest of ‘search’ tasks: a ‘pop-out’ feature detection task where a target immediately attracts attention, minimizing the search component.