A cortical pooling model of spatial summation for perimetric stimuli

Fei Pan; William H Swanson

doi:10.1167/6.11.2

Author manuscript; available in PMC: 2013 Sep 19.

Published in final edited form as: J Vis. 2006 Oct 13;6(11):1159–1171. doi: 10.1167/6.11.2

PMCID: PMC3777700 NIHMSID: NIHMS513343 PMID: 17209726

Abstract

Contemporary models of perimetric sensitivity assume probability summation of retinal ganglion cell sensitivities, ignoring cortical processing. To assess the role of cortical processing in perimetric spatial summation, we used a common form of multiple-mechanism spatial vision model in which the stimulus is sampled by receptive fields analogous to those of simple cells in primary visual cortex. Psychophysical threshold was computed by probability summation across the receptive fields. When the receptive fields were nonoriented (like ganglion cells), the spatial summation function had a large nonmonotonic transitional region that was inconsistent with perimetric spatial summation data. When the receptive fields were orientation tuned (like cortical cells), the model was able to give good fits to perimetric spatial summation data. The predictions of the model were evaluated with a masking study, in which noise masks either enlarged the critical area or changed the shape of the spatial summation functions. We conclude that cortical pooling by multiple spatial mechanisms can account for perimetric spatial summation, whereas probability summation across ganglion cells cannot.

Keywords: spatial summation, contrast sensitivity, perimetry, glaucoma, color vision, spatial vision

Introduction

Automated static perimetry is one of the most frequently performed psychophysical tests, in that it is one of the primary diagnostic tools for glaucoma, a major blinding eye disease. Clinical perimetry adopted standardized stimuli 60 years ago: circular luminance increments presented on a uniform photopic background (Goldmann, 1999). Since then, basic vision research has moved on to more complex sinusoidal stimuli produced on computer-controlled displays. Over the past 25 years, basic spatial vision researchers have developed a wide range of models for visual thresholds, which, in general, agree on common features: detection is mediated by cortical processes that vary in spatial and orientation tuning and whose outputs are combined with a nonlinear summation process (Graham, 1989). However, the insights from the past 25 years of basic vision research have not yet been applied to perimetric studies. Models of sensitivity to perimetric stimuli to date have only considered ganglion cell responses (Gardiner, Demirel, & Johnson, 2006; Harwerth et al., 2004). As a result, clinical researchers have expressed great uncertainty about how to compare sensitivities for traditional stimuli versus more complex spatial stimuli and have based comparisons on the dynamic ranges of different devices rather than on visual processing of the stimuli (Spry, Johnson, McKendrick, & Turpin, 2001).

Contemporary perimetric theories analyze effects of varying stimulus size in two different ways. One approach (Garway-Heath, Caprioli, Fitzke, & Hitchings, 2000; Harwerth et al., 2004) uses an empirical equation with a “summation exponent” k, which varies with eccentricity: Sensitivity = cG^k, where G is the number of ganglion cell bodies in the region being tested and c and k are free parameters. The other approach (Inui, Mimura, & Kani, 1981; Wilson, 1970) uses Ricco's law for small stimuli (threshold is inversely proportional to stimulus area) and characterizes the effects of eccentricity in terms of increase in critical diameter (the largest stimulus for which Ricco's law holds). For the first approach, the empirical parameters have no straightforward theoretical interpretation and can vary dramatically depending on how the data are analyzed. For the second approach, there is no standard way of describing the effects of stimulus size for stimuli larger than the critical diameter. Both approaches assume that detection is mediated by ganglion cells, with little role for cortical processing (Gardiner et al., 2006; Glezer, 1965).

The ganglion-cell-based perimetric theories have limited utility for design of improved perimetric stimuli. In earlier studies, we have demonstrated several advantages of using low spatial frequency sinusoids for perimetric testing, including decreased variability without loss of ability to detect glaucomatous defect (Pan, Swanson, & Dul, 2006; Sun, Dul, & Swanson, 2006). These results cannot be explained by models based on detection by retinal ganglion cells. A recent review concluded that the field of perimetry would benefit greatly from better theoretical underpinnings (Anderson, 2006). At this point, a bridge is needed to connect the gap between the ganglion-cell-based approach that perimetric researchers use and the cortical-processing approach that has been used in basic spatial vision for decades.

The purpose of this study was to provide improved theoretical underpinnings through quantitative modeling of spatial summation for conventional perimetric stimuli (circular luminance increments). Our results are consistent with the critical diameter being determined by the peak spatial frequency of the cortical processes mediating detection rather than by the ganglion cell receptive field centers. The summation exponent is interpreted as reflecting the difference between stimulus size and peak spatial frequency, rather than as a pooling exponent for ganglion cell number. The results of the model provide a good description for normative spatial summation data from a wide range of perimetric studies. A masking experiment provided an example in which masking revealed responses to perimetric stimuli by mechanisms tuned to low spatial frequencies.

Part I. A model of spatial summation for circular increments

Methods

The model is a typical form of multiple-mechanism model for spatial vision (Graham, 1989), which assumes that the stimulus is sampled by an ensemble of spatial filters that are each characterized by their receptive field structure. A single filter is an array of filter-elements that all have the same receptive field structure (the “filter kernel”) and are centered at different locations in visual space. The model simulation was for retinal regions outside the fovea; thus, it was assumed that nearby filter-elements have similar spatial and temporal features. Sensitivity of a spatial mechanism was then computed with probability summation across spatial filters that are tuned to different orientations but to the same spatial frequency. In a degenerate form of spatial vision model, a single spatial mechanism with circularly symmetric filters was used to represent a ganglion-cell-based model.

Stimulus

Circular stimuli were used, with stimulus area that varied from –3.3 to +1.4 log deg² (0.025° to 5.64° in diameter) in steps of 0.1 log unit.
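The correspondence between log area and diameter for a circular stimulus can be checked numerically. The following is a minimal sketch, not the authors' Igor Pro implementation; the function names are illustrative.

```python
import math

def diameter_from_log_area(log_area_deg2):
    """Diameter (deg) of a circle whose area is 10**log_area_deg2 deg^2."""
    area = 10.0 ** log_area_deg2
    return 2.0 * math.sqrt(area / math.pi)

def log_area_from_diameter(diameter_deg):
    """Log10 area (log deg^2) of a circle with the given diameter (deg)."""
    return math.log10(math.pi * (diameter_deg / 2.0) ** 2)

# A log area of -1.6 log deg^2 corresponds to a diameter near 0.18 deg,
# matching the paired critical area / critical diameter entries in Table 1.
example_diameter = diameter_from_log_area(-1.6)
```

The same conversion applies to any log area or diameter pair quoted in the text.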

Spatial filters and filter-elements

The filter-elements are analogous to populations of cortical neurons, and the sensitivity of each filter-element was computed by multiplying the stimulus by the receptive field.

The filters were distinguished in terms of peak spatial frequency, spatial phase, spatial bandwidth, and orientation tuning of their receptive fields. Figure 1 shows receptive fields of the various types of spatial filters used in the model, together with their spatial-tuning and orientation-tuning functions. Bandwidths are summarized in Table 1.

Figure 1. (a) Two-dimensional images of the receptive fields for the different spatial filters. (b) Orientation-tuning functions for the weakly orientation-tuned (upper) and the strongly orientation-tuned (lower) filter-elements. (c) Spatial-tuning functions for the DN filter-elements (upper) and the DoG filter-elements with a range of surround strengths (lower).

Table 1.

Spatial filter characteristics and the spatial summation parameters from the predicted spatial summation functions for different spatial filters.

Spatial filters	Spatial bandwidth (octaves)	Orientation bandwidth (deg)	Space constant of the orthogonal Gaussian (deg)	Log critical area (log deg²)/critical diameter (deg)	Extended slope
D1 long	2.6	34	0.11	–1.6/0.18	0.15
D1 short	2.6	120	0.45	–1.6/0.18	0.13
D2 long	1.8	24	0.16	–1.5/0.19	0.19
D2 short	1.8	90	0.64	–1.6/0.18	0.13
D6 long	1.0	14	0.28	–1.4/0.23	0.24
D6 short	1.0	54	1.10	–1.5/0.20	0.15
DoG 4×	1.9	–	–	–1.6/0.18	0.13
DoG 2×	2.4	–	–	–1.7/0.16	0.11
DoG 1×	3.2	–	–	–1.7/0.16	0.11
Gaussian no surround	Low pass	–	–	–1.6/0.18	0.27

Receptive fields and tuning functions of the filter-elements are shown in Figure 1, and the predicted spatial summation functions are shown in Figure 2. All examples are for filters with a peak spatial frequency of 2 cpd.

Two main classes of filters were used. The ganglion-cell-based model used circularly symmetric Differences of Gaussians (DoG) filters. The width of the inhibitory surround of the DoG filters was varied to produce filters with four different spatial bandwidths ranging from low pass (no inhibitory surround) to 1.9 octaves. Cortical filters were represented by DN filters: Nth derivatives of Gaussian windowed by an orthogonal Gaussian, which provide both sine-phase and cosine-phase filters that integrate to zero and have a small number of zero crossings (and, therefore, only a few excitatory and inhibitory regions). The DN filters were orientation selective. More details on the choice of spatial filters can be found in Swanson, Felius, and Pan (2004). Quantitative expression for DN filter kernels is given in Swanson, Wilson, and Giese (1984).

For a given filter, the filter-element locations were arranged in hexagonal arrays centered on the stimulus. We have justified the use of a single-filter orientation in the Appendix. Generally speaking, because the stimuli were circular and the grid of filters was centered on the stimulus, increasing the number of filter orientations shifted the predicted spatial summation function vertically but had minimal impact on the shape of the function. Primary calculations were for filters with a peak spatial frequency of 2 cycles per degree (cpd), and filter-element center-to-center spacing was 0.125° (i.e., the centers of the six nearest filter-elements were 0.125° from the center of a given filter-element) to yield four filter-elements per spatial cycle. Secondary calculations showed that further decrease in spacing of the filter-elements had minimal effect on the results. When modeling responses of mechanisms tuned to other spatial frequencies, filter-element spacing was also set to four filter-elements per spatial cycle.
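The hexagonal arrangement of filter-element centers can be sketched as follows. This is an illustration under stated assumptions rather than the authors' code: the function name `hex_grid` and the 1° extent of the grid (`max_radius_deg`) are hypothetical, while the spacing rule (four filter-elements per spatial cycle) is taken from the text.

```python
import numpy as np

def hex_grid(spacing_deg, max_radius_deg):
    """Centers of a hexagonal lattice with one element at the stimulus center."""
    row_height = spacing_deg * np.sqrt(3.0) / 2.0
    n_rows = int(np.ceil(max_radius_deg / row_height))
    n_cols = int(np.ceil(max_radius_deg / spacing_deg)) + 1
    centers = []
    for j in range(-n_rows, n_rows + 1):
        x_shift = 0.5 * spacing_deg * (j % 2)  # odd rows are offset by half a step
        for i in range(-n_cols, n_cols + 1):
            x = i * spacing_deg + x_shift
            y = j * row_height
            if np.hypot(x, y) <= max_radius_deg + 1e-9:
                centers.append((x, y))
    return np.array(centers)

peak_sf_cpd = 2.0
spacing = 1.0 / (4.0 * peak_sf_cpd)   # four filter-elements per cycle: 0.125 deg
centers = hex_grid(spacing, max_radius_deg=1.0)  # grid extent is an assumption

# Each element has six nearest neighbors at exactly one spacing step.
distance_from_center = np.hypot(centers[:, 0], centers[:, 1])
n_nearest = int(np.sum(np.isclose(distance_from_center, spacing)))
```

Changing `peak_sf_cpd` rescales the lattice so that the sampling density stays at four filter-elements per cycle.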

Spatial probability summation

Psychophysical sensitivity of a filter was computed by probability summation across the sensitivities of the filter-elements, using Minkowski (vector) summation with an exponent of 4.0. This exponent was originally suggested by Quick (1974), consistent with contrast sensitivity increasing as the fourth root of stimulus area for gratings outside the parafovea (Robson & Graham, 1981) as well as for varying numbers of Gabors at different locations (Meese & Williams, 2000). An exponent of 4 is also consistent with effects of channel uncertainty for a fixed attentional field and minimal multiplicative noise (Tyler & Chen, 2000). Perimetric stimuli are presented throughout the central visual field; hence, the attentional field is greater than 1,500 deg². Multiplicative noise from eye movements should be minimal because the stimuli are briefly flashed.
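As an illustration of the pooling rule, the Minkowski sum with an exponent of 4.0 can be written in a few lines. This is a sketch under the assumption that the filter-element sensitivities are available as a list of linear (not log) values; the function name is illustrative.

```python
import numpy as np

def minkowski_pool(sensitivities, exponent=4.0):
    """Pool filter-element sensitivities with a Minkowski (vector) sum."""
    s = np.abs(np.asarray(sensitivities, dtype=float))
    return float(np.sum(s ** exponent) ** (1.0 / exponent))

# Sixteen equally sensitive filter-elements: pooled sensitivity is
# 16 ** (1/4) = 2 times that of a single element (about 0.3 log unit).
single = 1.0
pooled = minkowski_pool([single] * 16)
gain_log_units = float(np.log10(pooled / single))
```

With equal sensitivities the pooled value grows as the fourth root of the number of elements, which is the fourth-root behavior cited above for gratings and Gabor arrays.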

Analysis

Spatial summation functions for different spatial filters were characterized in terms of three aspects: critical area, transitional region, and extended slope. Critical area was defined as the largest stimulus area for which sensitivity remained within 0.1 log unit of Ricco's law (sensitivity increases linearly with stimulus area). Extended slope was defined as the slope of the best fit line for stimuli with areas at least 1 log unit larger than the critical area. The transitional region was the section of the function between the critical area and the extended slope.
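The definitions above can be turned into a short procedure. The sketch below is one possible reading, not the authors' analysis code: it assumes the Ricco line (slope 1 in log-log coordinates) is anchored at the smallest stimulus, and the function name and the synthetic test function are hypothetical.

```python
import numpy as np

def summarize_summation(log_area, log_sens, tol=0.1, offset=1.0):
    """Return (log critical area, extended slope) for a summation function."""
    log_area = np.asarray(log_area, dtype=float)
    log_sens = np.asarray(log_sens, dtype=float)
    # Ricco's law: slope of 1, anchored at the smallest stimulus (assumption).
    ricco = log_sens[0] + (log_area - log_area[0])
    within = np.abs(log_sens - ricco) <= tol
    # Largest area before sensitivity first departs from the Ricco line.
    first_fail = int(np.argmin(within)) if not within.all() else len(within)
    critical = log_area[first_fail - 1]
    # Best-fit line for areas at least `offset` log units above critical area.
    large = log_area >= critical + offset
    if np.sum(large) < 2:
        return critical, float("nan")
    slope = np.polyfit(log_area[large], log_sens[large], 1)[0]
    return critical, slope

# Synthetic check: slope 1 up to -1.5 log deg^2, then slope 0.25.
areas = np.linspace(-3.0, 2.0, 51)
sens = np.where(areas <= -1.5, areas, -1.5 + 0.25 * (areas + 1.5))
critical_area, extended_slope = summarize_summation(areas, sens)
```

For this synthetic function the estimated critical area falls slightly above the breakpoint, because the 0.1 log unit tolerance is only exceeded once the shallower limb has diverged far enough from the Ricco line.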

The model was implemented with Igor Pro software (versions 4.01 through 5.02, Wavemetrics, Inc., Lake Oswego, OR) on Macintosh G4 and G5 computers (Apple Computers, Cupertino, CA).

Results

Figure 2 shows predicted spatial summation functions for the different spatial filters whose receptive fields are represented in Figure 1 for primary calculations with a peak spatial frequency of 2.0 cpd (Figures 2a and 2b) and for secondary calculations with a 3-octave range of peak spatial frequencies (Figure 2c). Critical area and extended slope are listed in Table 1. Sensitivities were normalized to be equal at the smallest stimulus area to compare the shapes of different spatial summation functions. For all mechanisms, Ricco's law was obtained for small stimuli (line with a slope of 1). For the primary calculations, the critical area was similar for many different filters having the same peak spatial frequency, varying from –1.6 to –1.7 log deg² for the DoG filters and from –1.4 to –1.6 log deg² for the DN filters. Critical area increased systematically as peak spatial frequency was decreased, as shown in Figure 2c. The critical area increased by 1.8 log units as peak spatial frequency decreased from 4.0 to 0.5 cpd, corresponding to the square root of the critical area (and hence the critical diameter) being inversely proportional to peak spatial frequency. The extended slope varied from 0.11 to 0.13 for DoG filters and from 0.13 to 0.24 for DN filters. Critical area and extended slope were greatest for the mechanism whose filter-elements had the narrowest orientation and spatial frequency bandwidths (long D6 receptive fields).
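The scaling of critical area with peak spatial frequency can be verified with a short calculation. The symbols here are introduced only for this illustration: f is the peak spatial frequency, A_c the critical area, and d_c the critical diameter.

```latex
\begin{align}
d_c \propto \sqrt{A_c} &\propto \frac{1}{f}
  \;\Longrightarrow\; \log_{10} A_c = -2\log_{10} f + \text{const}, \\
\Delta \log_{10} A_c &= 2\log_{10}\!\left(\frac{4.0}{0.5}\right)
  = 2 \times 0.903 \approx 1.8 .
\end{align}
```

An 8-fold (3-octave) decrease in peak spatial frequency therefore corresponds to an increase of about 0.9 log unit in critical diameter and about 1.8 log units in critical area.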

Figure 2. Simulated spatial summation functions for single spatial mechanisms with the 2 cpd DN filters (a), 2 cpd DoG filters (b), and the weakly orientation-tuned D1 filters with four different peak spatial frequencies (c). Sensitivities for the smallest stimulus were normalized to –3.0. The functions could be described by a line with a slope of 1 for small stimuli and with lines of varying slopes for large stimuli.

For most of the primary calculations, the transitional region was nonmonotonic, in that sensitivity reached a local maximum near the critical area and then showed a moderate decline before increasing again with the extended slope. The only monotonic functions were for the D1 filters (whose receptive field has a single zero crossing) and the Gaussian filters with no inhibitory surround.

To interpret the predicted spatial summation functions, we show tuning functions for individual filter-elements as log sensitivity versus stimulus radius in the left column in each panel of Figure 3. For stimuli smaller than the critical area (thin vertical line), the filter-elements with the highest sensitivity were centered on the stimulus, whereas for large stimuli, the filter-elements with the highest sensitivity were centered near the edge of the stimulus. The right column in each panel of Figure 3 shows the tuning functions from the left column replotted as log sensitivity versus log stimulus area, with each tuning function scaled vertically to incorporate the effects of probability summation across multiple filter-elements at the same offset. At the top of each graph, the number of filter-elements contributing to detection is shown, which is defined as the smallest number of filter-elements over which probability summation produced sensitivity within 0.01 log unit of the sensitivity obtained when all filter-elements were included. The number of filter-elements mediating detection increased with stimulus size, with greater rate of increase for filters with broader orientation and spatial bandwidths. The greatest rate of increase was with the circular DoG filters.
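The count of filter-elements contributing to detection can be sketched as below. This is an illustrative reading of the definition, assuming that the smallest qualifying subset is obtained by adding filter-elements in order of decreasing sensitivity; the function name is hypothetical.

```python
import numpy as np

def n_contributing(sensitivities, exponent=4.0, tol_log=0.01):
    """Smallest number of elements whose pooled sensitivity is within
    tol_log log units of the sensitivity pooled over all elements."""
    # Most sensitive elements first (assumption: this gives the smallest set).
    s = np.sort(np.abs(np.asarray(sensitivities, dtype=float)))[::-1]
    cumulative = np.cumsum(s ** exponent) ** (1.0 / exponent)
    total = cumulative[-1]
    ok = (np.log10(total) - np.log10(cumulative)) <= tol_log
    return int(np.argmax(ok)) + 1  # index of first True, as a count

# One dominant element among weak ones: a single element suffices.
dominant = n_contributing([1.0] + [0.1] * 10)
# Ten equal elements: all ten are needed to come within 0.01 log unit.
equal = n_contributing([1.0] * 10)
```

The two cases illustrate why the count rises with stimulus size: as more filter-elements reach comparable sensitivity, fewer of them can be dropped without changing the pooled result.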

Figure 3. The left column in each panel shows log sensitivity versus stimulus radius for individual filter-elements at different offsets from the stimulus center. The black curve shows the tuning function for the filter-element centered on the stimulus. The dashed vertical line indicates the stimulus radius corresponding to 1/2 cycle of the peak spatial frequency. The right column in each panel shows reconstructed spatial summation functions for single mechanisms with filter-element (F-E) tuning functions scaled vertically to represent effects of probability summation; at the top of each figure, the asterisks show the number of filter-elements responding to each stimulus size. The filled circles are the probabilistic sums of the sensitivities of the filter-elements. The black curve shows the function for the filter-element centered on the stimulus. The dashed vertical line indicates the critical area. See text for details.

The effects of filter characteristics on spatial summation functions (illustrated in Figure 2 and Table 1) can be readily interpreted in terms of the effects of filter location on the heights of the tuning curves and on the numbers of filter-elements mediating detection (illustrated in Figure 3). Ricco's law reflects detection by filter-elements centered on stimuli with diameters smaller than the width of the receptive field center, and the critical area reflects the stimulus size at which the response of these filter-elements stops increasing linearly. The extended slope reflects detection by filter-elements centered near the edge of the stimulus, where probability summation results in sensitivity increasing with the fourth root of the number of filter-elements mediating detection. The transitional region represents the transition to sensitivity being mediated by filter-elements offset from stimulus center. The heights of the tuning functions for these filter-elements are usually lower than those for the filter-elements centered on the stimulus, except for filter-elements with a single zero crossing (D1). Therefore, in the transitional region, the sensitivity of a filter usually shows a slight decline until the increase in number of filter-elements offsets the decline in peak of the tuning functions.

Empirical template

The strongly oriented D1 filters yielded spatial summation functions similar to perimetric data. Therefore, we used this spatial summation function as an empirical template for analyzing perimetric spatial summation data, scaling it horizontally by varying critical area and scaling it vertically by varying sensitivity at the critical area. For a given spatial summation function, the template was derived from filter-elements that were identical in sensitivity and in spatial and orientation tuning. Across different spatial summation functions, the only two parameters that varied were the peak spatial frequency and the peak sensitivity of the filter. The suitability of the template was evaluated by fitting 59 data sets from six classic perimetric studies of spatial summation in normal eyes (Dannheim & Drance, 1971; Johnson, Keltner, & Balestrery, 1978; Kasai, Takahashi, Koyama, & Kitahara, 1993; Latham, Whitaker, Wild, & Elliott, 1993; Sloan, 1961; Wilson, 1970), as shown in Figure 4. We used decibel rather than log units in this figure to be consistent with the units used in perimetry, where 1 dB is equal to 0.1 log unit. For each data set, the two parameters were varied independently. The empirical template accounted for at least 97% of the variance in perimetric sensitivity for 50 of the data sets and at least 90% in the remaining data sets.
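The two-parameter fitting procedure can be illustrated with a small sketch. The template shape used here is a hypothetical stand-in (a smooth transition from a slope of 1 to a slope of 0.25), not the D1-derived template of the paper, and the grid search over critical area is an assumption about how such a fit might be carried out.

```python
import numpy as np

def template_db(log_area, log_critical_area):
    """Hypothetical template in dB: slope 1 below and 0.25 above critical area."""
    x = np.asarray(log_area, dtype=float) - log_critical_area
    log_sens = -np.log10(10.0 ** (-x) + 10.0 ** (-0.25 * x))
    return 10.0 * log_sens  # 1 dB = 0.1 log unit

def fit_template(log_area, sens_db, candidates=np.arange(-3.0, 1.01, 0.01)):
    """Grid search over horizontal position; vertical shift in closed form."""
    sens_db = np.asarray(sens_db, dtype=float)
    best = None
    for c in candidates:
        shape = template_db(log_area, c)
        v = np.mean(sens_db - shape)  # least-squares vertical offset
        sse = np.sum((sens_db - shape - v) ** 2)
        if best is None or sse < best[2]:
            best = (c, v, sse)
    c, v, sse = best
    sst = np.sum((sens_db - sens_db.mean()) ** 2)
    return c, v, 1.0 - sse / sst  # last value: proportion of variance explained

# Synthetic data generated from the template itself (not perimetric data).
areas = np.linspace(-2.0, 1.0, 13)
data_db = template_db(areas, -1.0) + 25.0
fit_c, fit_v, fit_r2 = fit_template(areas, data_db)
```

Only the horizontal position and the vertical offset vary between data sets, which mirrors the two free parameters described above.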

Ganglion cells versus cortical pooling

The predictions in Figures 2 and 3 are for single spatial mechanisms, where all filter-elements have the same peak spatial frequency and where the shape of the spatial summation function is determined by the properties of the receptive field (e.g., orientation and spatial bandwidths, number of zero crossings). Unlike most of the computed spatial summation functions, perimetric spatial summation functions appear to have a monotonic transitional region and an extended slope of 0.25. Only the strongly oriented D1 filters yielded similar properties. The DoG filters and the rest of the DN filters all yielded nonmonotonic transitional regions and shallow extended slopes and are not consistent with perimetric data. Therefore, these initial comparisons between single-mechanism models and perimetric data are consistent with a cortical-processing approach but not with ganglion-cell-based models.

If we assume that detection is mediated by multiple mechanisms tuned to different spatial frequencies, then perimetric spatial summation data are consistent with a wide range of spatial and orientation bandwidths for the filter-elements. Figure 5 shows examples of this approach, where the shape of the spatial summation function is determined by the relative sensitivities of the spatial filters rather than by the form used for the receptive fields. In this example, there are five different spatial filters, with peak spatial frequencies from 0.2 to 1.25 cpd (insets). As with the single-mechanism predictions, Ricco's law for small stimuli is found when the most sensitive filter-elements are those with peaks near the center of the stimulus, and shallower extended slopes for large stimuli are found when detection is mediated by filter-elements centered near the edge of the stimulus.

Figure 5. Examples of multiple-mechanism models. The colored curves show spatial summation functions for individual spatial mechanisms. The inset shows the peak sensitivity for each of the five spatial mechanisms used in each figure. The filled circles represent psychophysical spatial summation, which is the probabilistic sum of the five underlying spatial mechanisms and is fit with the empirical template. A short blue line in the bottom panel marks complete summation for a mechanism tuned to low spatial frequencies, which is revealed due to reduction in sensitivity of mechanisms tuned to higher spatial frequencies. This example shows results for mechanisms whose filter-elements have identical phase (cosine), spatial bandwidth (1.0 octave), and orientation tuning and vary only in peak spatial frequency (0.20, 0.45, 0.65, 0.9, and 1.25 cpd) and relative sensitivity (insets).

The top graph shows a good reproduction of the empirical function, with a small critical area. The middle graph shows a good reproduction of the empirical function, with a 0.3 log unit larger critical area achieved by a 0.3 log unit decrease in sensitivity of the two mechanisms tuned to the highest spatial frequencies and a 0.1 log unit decrease in sensitivity of a mechanism tuned to intermediate spatial frequencies. The lower graph shows failure to reproduce the empirical function, with a second region of complete summation beyond the critical area, produced by a 0.4 log unit decrease in sensitivity of all mechanisms except for the mechanism tuned to the lowest spatial frequencies. These graphs illustrate how masking could increase the critical area and/or change the shape of the spatial summation by reducing sensitivities of the spatial filters tuned to higher spatial frequencies.
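The effect of selectively reducing the sensitivity of the higher-frequency mechanisms can be sketched with a toy multiple-mechanism model. Everything except the five peak spatial frequencies and the sizes of the sensitivity reductions is an assumption made for illustration: the single-mechanism shape, the rule placing the critical area at –1.6 log deg² for 2 cpd with inverse-square scaling in frequency, and the baseline gains are hypothetical and are not the values shown in the insets of Figure 5.

```python
import numpy as np

peak_sf = np.array([0.20, 0.45, 0.65, 0.90, 1.25])  # cpd, as in Figure 5

def mechanism_log_sens(log_area, sf_cpd, log_gain):
    """Hypothetical single-mechanism function: slope 1, then slope 0.25."""
    log_ac = -1.0 - 2.0 * np.log10(sf_cpd)  # assumed: -1.6 log deg^2 at 2 cpd
    x = log_area - log_ac
    return log_gain - np.log10(10.0 ** (-x) + 10.0 ** (-0.25 * x))

def pooled_log_sens(log_area, log_gains, exponent=4.0):
    """Minkowski pooling across mechanisms, computed on log sensitivities."""
    per_mech = np.array([mechanism_log_sens(log_area, f, g)
                         for f, g in zip(peak_sf, log_gains)])
    return np.log10(np.sum(10.0 ** (exponent * per_mech), axis=0)) / exponent

log_area = np.linspace(-3.0, 2.0, 51)
baseline_gains = np.array([1.0, 0.8, 0.65, 0.5, 0.3])  # hypothetical values
# Reductions like the middle graph: 0.3 for the two highest frequencies,
# 0.1 for an intermediate frequency.
masked_gains = baseline_gains - np.array([0.0, 0.0, 0.1, 0.3, 0.3])

loss = (pooled_log_sens(log_area, baseline_gains)
        - pooled_log_sens(log_area, masked_gains))
```

In this toy model the loss is larger for small stimuli than for large ones, so the pooled function changes shape rather than simply shifting vertically, which is the qualitative behavior described for the masked conditions.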

Minkowski exponent

The primary calculations used an exponent of 4.0 for the vector sum. Some researchers have obtained values of 2 to 3 for the Minkowski exponent in the fovea, where sensitivity can decrease rapidly with offset from fixation and where models can assume that the subject has a small attentional aperture (Watson & Ahumada, 2005). The use of a foveal aperture is inappropriate for perimetry, where the attentional field is large and primarily outside the fovea. Nonetheless, to demonstrate the role of the Minkowski exponent, we performed secondary calculations using Minkowski exponents of 2 and 3.

We found that reducing the Minkowski exponent had little impact on the results and had effects that were similar to increasing the spatial bandwidth or decreasing the orientation bandwidth: larger critical diameter, smoother transitional region, and steeper extended slope. With a Minkowski exponent of 2, the empirical template was obtained with a D1 filter having an orientation bandwidth of 54° rather than 14°.
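The link between the Minkowski exponent and the steepness of the extended slope can be seen in the idealized case of n equally sensitive filter-elements. The symbols β (Minkowski exponent), s (sensitivity of one filter-element), and S (pooled sensitivity) are introduced here for illustration only.

```latex
\begin{align}
S &= \Bigl(\sum_{i=1}^{n} s_i^{\beta}\Bigr)^{1/\beta}
   = n^{1/\beta}\, s \quad \text{when } s_i = s \text{ for all } i, \\
\log_{10} S &= \log_{10} s + \frac{1}{\beta}\,\log_{10} n .
\end{align}
```

In this idealized case, pooled sensitivity rises by 0.25 log unit per log unit of n for β = 4, by 0.33 for β = 3, and by 0.5 for β = 2, which is in the same direction as the steeper extended slopes reported for smaller exponents. The slopes in the full model also depend on how the number and relative sensitivity of the contributing filter-elements change with stimulus area, so these values are not predictions of the extended slope itself.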

Evaluation

Classical perimetric spatial summation data for circular increments can be described well by an empirical template, with only vertical and horizontal scaling. The empirical template is compatible with detection by multiple mechanisms tuned to different spatial frequencies as well as with detection by a single mechanism composed of highly oriented filters with only one zero crossing. However, the template is not compatible with contemporary perimetric models that ignore cortical processing and assume probability summation of ganglion cell responses (Gardiner et al., 2006; Harwerth et al., 2004).

Classic perimetric spatial summation functions cannot distinguish whether detection is mediated by more than one mechanism. In the following experiment, we used masking to look for evidence of multiple mechanisms. If detection is mediated by a single mechanism, then the mask should shift the spatial summation functions vertically, decreasing sensitivity equally for all stimulus sizes. In contrast, when multiple spatial mechanisms mediate detection, then spatial masks have the potential to also shift the functions horizontally (change the critical area) and/or to change the shape of the spatial summation function, as demonstrated in Figure 5. Detection of perimetric stimuli by multiple spatial mechanisms is consistent with visual processing at the level of the cortex rather than at the level of the retinal ganglion cells.