Abstract
What parts of a visual stimulus produce the greatest neural signal? Previous studies have explored this question and found that the onset of a stimulus's edge is what excites early visual neurons most strongly. The role of inhibition at the edges of stimuli has remained less clear, however, and the importance of neural responses associated with the termination of stimuli has only recently been examined. Understanding all of these spatiotemporal parameters (the excitation and inhibition evoked by the stimulus's onset and termination, as well as its spatial edges) is crucial if we are to develop a general principle concerning the relationship between neural signals and the parts of the stimulus that generate them. Here, we use visual masking illusions to explore this issue, in combination with human psychophysics, awake behaving primate neurophysiology in the lateral geniculate nucleus of the thalamus, and optical recording in the primary visual cortex of anesthetized monkeys. The edges of the stimulus, rather than its interior, generate the strongest excitatory and inhibitory responses both perceptually and physiologically. These edges can be imaged directly by using optical recording techniques. Excitation and inhibition are moreover most powerful when the stimulus turns both on and off (what might be thought of as the stimulus's temporal edges). We thus conclude that there is a general principle that relates the generation of neural signals (excitatory and inhibitory) to the spatiotemporal edges of stimuli in the early visual system.
This is a study of the neural and perceptual responses to visual spatiotemporal edges. It is well known that in a lateral inhibitory network such as the retina, the lateral geniculate nucleus of the thalamus (LGN), or the primary visual cortex (area V-1), the spatial edges of stimuli excite neurons strongly (1–6), whereas the interiors of stimuli evoke relatively little response. This fact introduces a paradox, however: despite the fact that early visual neurons respond most strongly to the edges of a solid object (such as a rectangle), the interior of the object appears perceptually to be filled-in. This suggests the presence of a mechanism that produces the perception of the filled-in interior of an object by using responses generated at spatial edges (7). If this is true, then it may be possible to create a stimulus that can excite early visual neurons in a fashion that does not sustain the filling-in computation. One way to do this would be to display the stimulus for a short duration; long enough for early visual neurons to respond, but too short to provide meaningful information for the subsequent filling-in computation. From this concept, we predicted the following visual illusion: a stimulus that flickers slowly (i.e., with long duration) appears to be filled in, but if it flickers fast enough, the filling-in effect is defeated to some extent, and the edges of the object are more salient than its interior. This “Unfilled Flicker” illusion can be viewed on the worldwide web at http://cortex.med.harvard.edu/∼macknik.
We previously observed that both the onset-discharge and the termination-discharge of a stimulus are important for the stimulus's visibility (8). This is made evident when we consider that an image that is stabilized on the retina disappears after a short period (9–13). Edges must therefore be renewed in normal vision by eye movements, stimulus motion across visual space, or stimulus temporal modulation (i.e., flicker). It is not yet clear, however, which parts of the stimulus's lifetime are most effective in generating inhibitory signals. Here, we probe the role of inhibition at spatiotemporal edges using a combination of psychophysics and electrophysiology. We moreover used an optical imaging technique (14, 15) to observe activity-correlated signals derived by spatial edges on the surface of the primary visual cortex.
Visual masking is a phenomenon in which an otherwise visible stimulus, called a target, is rendered invisible (or less visible) by the suppressive action of a different stimulus, called the mask. Targets and masks can come in any number of shapes and colors, and here we have used oriented bars for both targets and masks. As we will discuss in detail, the temporal sequence of targets and masks is critically important in determining the target's resultant level of visibility. By manipulating the distance between targets and masks, as well as their relative size, we measured the excitatory and suppressive strengths of the stimuli's edges as opposed to their interiors. By manipulating the duration and temporal overlap of targets and masks, we measured the strength of excitation and suppression as a function of stimulus lifetime.
Materials and Methods
Psychophysical Stimuli.
Psychophysical stimuli and procedures (approved by the Harvard Medical School Committee on Human Studies, docket numbers D-082994-1 and X-62896-2) have been described previously (8).
Electrophysiological Stimuli.
Physiological stimuli were presented on a NEC 5FG monitor at a refresh rate of 100 Hz. The monitor subtended 59° by 40° at a viewing distance of 28 cm. The dimensions of the target (a single oriented black or white bar on a background with opposite contrast) were optimized for each cell. Masks were given the same characteristics as the targets and flanked, but never spatially overlapped, the targets.
Electrophysiological Procedure.
Standard electrophysiological techniques for recording from awake behaving primates and for minimizing the monkeys' discomfort were used (6, 8, 16). The Harvard Medical Area Standing Committee On Animals (protocol no. 02078) approved all electrophysiological experimentation.
Optical Recording Stimuli.
Stimuli were displayed at 100% contrast on a Mitsubishi Diamond Scan monitor. White stimuli on a black background were used in the images shown, and black on white stimuli generated similar images.
Optical Recording Procedures.
Optical imaging and electrophysiology techniques for recording from anesthetized paralyzed animals were conducted by using standard methods described previously (15, 17). Our images were generated in three anesthetized rhesus monkeys by viewing the reflectance of 720 nm light (the light source that was powered by a DC-regulated power supply) from the cortex with a charge-coupled device camera, which collected images at a rate of 32 Hz. Images were sampled using a custom 8-bit analog video system that had a focal length of 50 mm. We stimulated the visual field with each stimulus and recorded 20 s of cortical intrinsic signal (changes in reflectance from the cortex). We then averaged the 640 images that were collected (20 s × 32 Hz) and subtracted the average of 640 other frames that were recorded while the brain was unstimulated. Each image had a resolution of 512 × 480 pixels and was approximately 1.2 cm across on the cortical surface. The Duke University Institutional Animal Care and Use Committee approved all optical recording experiments, protocol no. A471-97-10R2.
Optical Image Processing.
Images shown here have been processed with matlab using standard techniques (14, 15, 17–20). Specifically, they have been cropped to a size of about 1 cm2, smoothed with a Gaussian filter having a standard deviation of 4 pixels, and their look-up tables have been normalized and equalized.
Results
The Effect of Distance on Visual Masking.
Werner (21) showed psychophysically that a mask's inhibitory effect on a visual target depends critically on the spatial separation between the mask and the target. This fall off in suppression with distance has not (as far as we know) been measured physiologically. To address this, we recorded from 26 neurons in the LGN of the awake rhesus monkey (Macaca mulatta), while presenting an illusion in which a flickering target (a white or black bar of 60 ms duration) is rendered invisible (to human observers) by a mask (two bars of 110 ms duration that flank the target to either side). The mask flickered in counterphase alternation to the target (8). (This illusion, “Standing Wave of Invisibility” can be viewed on the worldwide web at http://cortex.med.harvard.edu/∼macknik.) By moving the mask away from the target, the target becomes more visible, and responses in the LGN show accordingly that physiological inhibition from the mask decreases as its distance from the target increases (Fig. 1).
Psychophysical studies of spatial edges.
The spatial edges of stimuli produce the strongest excitatory neural signals (2, 4–6). Ratliff (22) moreover suggested that physiological inhibition should also be strongest at the edge of a stimulus, and several perceptual studies have suggested that inhibition is stronger near the edge of a mask rather than within the mask's interior, at least when the mask and the target are presented simultaneously (3, 23–25). Here, we expand on these experiments by testing the suppressive effect of a mask stimulus that varies in size and onset time relative to the onset of the target, while using high-contrast suprathreshold stimuli in a two-alternative forced choice discrimination task. Subjects were required to determine the taller of two target bars of 30 ms duration, presented at various stimulus onset asynchronies (SOAs) with respect to spatially overlapping bars (masks) of 50 ms duration and various sizes (Fig. 2). Because the masks completely overlapped the target in every condition, any change in the magnitude of visual masking as a function of mask size was because of the distance between the mask's edge and the target (Fig. 2, Inset). As the mask increases in size, so does the subject's ability to discriminate the length of the two targets. That is, as the edges of the mask become more distant from the target, visual masking decreases, and counterintuitively, thinner masks have the larger effect, in congruence with earlier reports (3). Thus, the spatial edges of the mask convey greater inhibition than the mask's interior. Masking was furthermore strongest when the mask turned off just before the target turned on (forward masking) and when the mask turned off about 110 ms after the target turned off (backward masking), in agreement with masking studies using spatially nonoverlapping stimuli (8). [Lateral inhibition radiating from the mask can be seen directly by viewing the Stoper– Mansfield (26) illusion on the worldwide web at http://cortex.med.harvard.edu/∼macknik.]
Optical Images of Edges in the Primary Visual Cortex.
If the edges of stimuli generate the strongest signals, then an optical image of the cortex (14, 15, 27), generated with a flickering bar stimulus, should reveal stronger activity at the bar's edge than within its interior. When we flickered a thin bar (with 50 ms duration and 100 ms interstimulus interval), we obtained a single stripe of activation in area V-1 of anesthetized rhesus monkeys (Fig. 3a). When we widened the bar, the optical signal split into two separate stripes of activity corresponding to the edges of the stimulus (Fig. 3b). This confirms that the neural signals that represent a stimulus in the early visual system are strongest at the spatial edges of the stimulus.
The Perception of Visual Masking at the Temporal Edges of Stimuli.
Responses to a stimulus in primate area V-1 are strongest when the stimulus turns both on and off (8). But it remains unclear how the strength of inhibition varies with stimulus lifetime. Is inhibition produced most strongly at the onset and termination of a mask stimulus, is inhibition instead constant over the lifetime of the mask, or does inhibition build up over time? Earlier studies examined this question by testing the brightness detection threshold of a target while it was suppressed visually with a spatially overlapping conditioning field (or mask), that sometimes had a different brightness or color (3, 23–25). Because the mask overlapped spatially the target, adaptation effects, rather than lateral inhibition signals, could have played a role in these results. To better isolate the effects of lateral inhibition from intrinsic adaptation effects, we conducted a psychophysical length-discrimination experiment in humans, similar to that shown in Fig. 2. Here, two 10 ms duration targets were presented at various temporal onset asynchronies with respect to nonoverlapping masks of varied duration (Fig. 4). The results show that target length is more difficult to discriminate when the targets are presented simultaneously to either the onset or the termination of the mask, rather than to the midlife of the mask, suggesting that the suppression generated by the mask occurs at the temporal edges of the mask.
Discussion
The question we have addressed here is: “What parts of a visual stimulus produce the greatest neural signal?” Previous studies have shown that excitatory signals for a stimulus are localized at its edges (1, 2, 4–6, 28). It has also been suggested that inhibitory signals are strongest at the spatial edges (21–25, 29). We expanded on the previous psychophysical studies concerning inhibition at the edge by carrying out visual masking experiments, perceptually and physiologically, in which we increased the distance of the mask's edge from the target, using both overlapping and nonoverlapping masks (Figs. 1 and 2). In the psychophysical experiments, the masks varied in size and timing with respect to the target. We also directly measured the strength of neural signals (presumably both excitatory and inhibitory) generated at a stimulus's edges by optically recording the responses to flickering targets in primate area V-1 (Fig. 3). Most previous optical imaging studies used sweeping stimuli to generate cortical activity, and so they could not view directly the cortical activation from simple stationary edges. Because the optical images generated here were made with stationary stimuli, they moreover represent the finest resolution pictures of the minimal spread of activity across the surface of the cortex (the cortical point-spread function; refs. 30–32). These images correlate well with similar images we published previously (15).
Having confirmed that the strongest excitatory and inhibitory neural signals occur at the spatial edges of a stimulus, we examined the variation in signal strength over the entire time course of the stimulus. Our previous research with visual masking illusions revealed that excitatory neural signals for target visibility were strongest at the temporal edges, or onset and termination times, of the target (8), but it remained unclear whether inhibitory signals followed the same temporal dynamics as excitatory signals. Crawford (33) found that the visibility of a stimulus was inhibited most completely when a full-field mask turned on, and he moreover showed a slight increase in masking magnitude when the mask turned off. The target inhibition he saw with mask termination was small, perhaps, because the spatial edges of his mask were far (11.5°) from the target. Furthermore, he used only one duration for his mask stimulus, so one cannot be sure that the inhibition at the mask's termination was truly caused by the termination of the mask, or whether it instead built up subsequent to the onset of the mask. Battersby and Wagman (3) controlled for some of these concerns by extending Crawford's experiment and adding a second condition that had a mask stimulus of longer duration, and found that, for each masking stimulus duration, the termination point of the stimulus generated more inhibition of the target than did mask midlife. Both of these studies, however, used spatially overlapping target and masks, and so the suppression caused by the mask could have been mediated by either extracellular lateral inhibition circuits (a neural network) or intracellular adaptation mechanisms (an intrinsic cell property). To address this issue, we performed the perceptual experiment shown in Fig. 4, in which we inhibited a target with masks of varied durations that did not spatially overlap the targets. The results show that masks suppress targets most strongly at the mask's onset and termination, possibly through a lateral inhibition circuit.
Our interests have also been to determine, within a spike train, those elements most significant to representing a stimulus. Many studies have suggested that the transient bursty firing generated by the onset of a stimulus is most important for neural information processing (6, 34–36). We previously showed that transient responses derived by both the onset and termination of a stimulus in area V-1 cells are most responsible for that stimulus's visibility; when these transient events were suppressed, the stimulus became invisible (8). We have also shown that V-1 neurons produce bursty firing after microsaccades, thereby preventing adaptation and regenerating visibility (13). The perceptual experiments in Figs. 2 and 4 show that the timing of visual masking correlates with transient responses generated by the mask's onset and termination. This supports the concept that transient bursty firing produces strong neural signaling.
These results bring us to the conclusion that a stimulus's spatiotemporal edges evoke the strongest neural signals (both excitatory and inhibitory). This is accomplished by generating transient activity in response to both the stimulus's onset and termination (its temporal edges) in neurons with receptive fields lying at the stimulus's spatial edges. The excitatory and inhibitory interplay that occurs between early visual neurons during visual masking also helps us to understand the seemingly mysterious spatiotemporal dynamics of visual masking.
Although these results seem to go far in answering what parts of the stimulus produce the strongest signals, there may be cognitive factors that also play a role in stimulus visibility. Several recently discovered contextual and shading effects may or may not be explained by the spatiotemporal nature of edges (37, 38). Attention has also been shown to play a role in illusions of invisibility (39, 40).
Acknowledgments
We thank Wei Ying Gao, Carlos Bagley, Michael LaFratta, Gail Robertson, and Frederic Russo for technical and administrative assistance. We thank Dr. Peter Schiller for his advice concerning constructing our stimulus in the periphery, and we are also indebted to Drs. Gary Blasdel, William Bosking, and David Fitzpatrick for their advice concerning image processing. We furthermore thank Drs. Margaret Livingstone, Martin Usrey, Pamela Reinagel, Rok Cerne, Ty Olson, and David Hubel for their comments on the manuscript. This project was funded by a National Institutes of Health Clinical Investigator Developments Award, Charles A. Dana Foundation Grant, and a Sloan Fellowship (to M.M.H.), as well as an individual National Research Service Award grant (to S.L.M.) from the National Eye Institute (NEI). The postdoctoral advisor to S.M.-C. and S.L.M., Dr. David Hubel, contributed support from his NEI R01 grant, as well as from an NEI Core Grant. S.M.-C. is a fellow from the Ministerio de Educación y Cultura, Formación de Personal Investigador program (Spain).
Abbreviations
- LGN
lateral geniculate nucleus of the thalamus
- SOAs
stimulus onset asynchronies
Footnotes
Article published online before print: Proc. Natl. Acad. Sci. USA, 10.1073/pnas.110142097.
Article and publication date are at www.pnas.org/cgi/doi/10.1073/pnas.110142097
References
- 1.Hubel D H, Wiesel T N. J Physiol (London) 1959;148:574–591. [Google Scholar]
- 2.Ratliff F, Hartline H K. J Gen Physiol. 1959;42:1241–1255. doi: 10.1085/jgp.42.6.1241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Battersby W S, Wagman I H. Am J Physiol. 1962;203:359–365. doi: 10.1152/ajplegacy.1962.203.2.359. [DOI] [PubMed] [Google Scholar]
- 4.De Weerd P, Gattass R, Desimone R, Ungerleider L G. Nature (London) 1995;377:731–734. doi: 10.1038/377731a0. [DOI] [PubMed] [Google Scholar]
- 5.Paradiso M A, Hahn S. Vision Res. 1996;36:2657–2663. doi: 10.1016/0042-6989(96)00033-8. [DOI] [PubMed] [Google Scholar]
- 6.Livingstone M S, Freeman D C, Hubel D H. Cold Spring Harbor Symp Quant Biol. 1996;61:27–37. [PubMed] [Google Scholar]
- 7.Paradiso M A, Nakayama K. Vision Res. 1991;31:1221–1236. doi: 10.1016/0042-6989(91)90047-9. [DOI] [PubMed] [Google Scholar]
- 8.Macknik S L, Livingstone M S. Nat Neurosci. 1998;1:144–149. doi: 10.1038/393. [DOI] [PubMed] [Google Scholar]
- 9.Day E C. Am J Physiol. 1915;38:369–398. [Google Scholar]
- 10.Riggs L A, Ratliff F. J Opt Soc Am. 1952;42:872–873. [Google Scholar]
- 11.Ditchburn R W, Ginsborg B L. Nature (London) 1952;170:36–37. doi: 10.1038/170036a0. [DOI] [PubMed] [Google Scholar]
- 12.Coppola D, Purves D. Proc Natl Acad Sci USA. 1996;93:8001–8004. doi: 10.1073/pnas.93.15.8001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Martinez-Conde S, Macknik S L, Hubel D H. Nat Neurosci. 2000;3:251–258. doi: 10.1038/72961. [DOI] [PubMed] [Google Scholar]
- 14.Blasdel G G, Salama G. Nature (London) 1986;321:579–585. doi: 10.1038/321579a0. [DOI] [PubMed] [Google Scholar]
- 15.Macknik S L, Haglund M M. Proc Natl Acad Sci USA. 1999;96:15208–15210. doi: 10.1073/pnas.96.26.15208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Livingstone M S. Neuron. 1998;20:509–526. doi: 10.1016/s0896-6273(00)80991-5. [DOI] [PubMed] [Google Scholar]
- 17.Haglund M M, Blasdel G G. In: Monitoring Neuronal Activity. Stamford J A, editor. New York: Oxford Univ. Press; 1992. pp. 85–111. [Google Scholar]
- 18.Weliky M, Bosking W H, Fitzpatrick D. Nature (London) 1996;379:725–728. doi: 10.1038/379725a0. [DOI] [PubMed] [Google Scholar]
- 19.Blasdel G G. J Neurosci. 1992;12:3115–3138. doi: 10.1523/JNEUROSCI.12-08-03115.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Blasdel G G. J Neurosci. 1992;12:3139–3161. doi: 10.1523/JNEUROSCI.12-08-03139.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Werner H. Am J Psychol. 1935;47:40–64. [Google Scholar]
- 22.Ratliff F. MACH BANDS: Quantitative Studies on Neural Networks in the Retina. San Francisco: Holden-Day; 1965. [Google Scholar]
- 23.Crawford B H. Proc R Soc London Ser B. 1940;129:94–106. [Google Scholar]
- 24.Rushton W A H, Westheimer G. J Physiol (London) 1962;164:318–329. doi: 10.1113/jphysiol.1962.sp007024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Westheimer G. J Physiol (London) 1965;181:881–894. doi: 10.1113/jphysiol.1965.sp007803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Stoper A E, Mansfield J G. Vision Res. 1978;18:1669–1674. doi: 10.1016/0042-6989(78)90259-6. [DOI] [PubMed] [Google Scholar]
- 27.Grinvald A, Lieke E, Frostig R D, Gilbert C D, Wiesel T N. Nature (London) 1986;324:361–364. doi: 10.1038/324361a0. [DOI] [PubMed] [Google Scholar]
- 28.Mach E. The Monist. 1890;1:48–68. [Google Scholar]
- 29.Nakayama K. Vision Res. 1971;11:501–509. doi: 10.1016/0042-6989(71)90073-3. [DOI] [PubMed] [Google Scholar]
- 30.Das A, Gilbert C D. Nature (London) 1995;375:780–784. doi: 10.1038/375780a0. [DOI] [PubMed] [Google Scholar]
- 31.Hubel D H, Wiesel T N. J Comp Neurol. 1974;158:295–305. doi: 10.1002/cne.901580305. [DOI] [PubMed] [Google Scholar]
- 32.Hubel D H, Wiesel T N. J Comp Neurol. 1974;158:267–293. doi: 10.1002/cne.901580304. [DOI] [PubMed] [Google Scholar]
- 33.Crawford B H. Proc R Soc London Ser B. 1947;134:283–302. doi: 10.1098/rspb.1947.0015. [DOI] [PubMed] [Google Scholar]
- 34.Adrian E D, Matthews R. J Physiol. 1927;63:378–414. doi: 10.1113/jphysiol.1927.sp002410. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.deCharms R C, Merzenich M M. Nature (London) 1996;381:610–613. doi: 10.1038/381610a0. [DOI] [PubMed] [Google Scholar]
- 36.Lisman J E. Trends Neurosci. 1997;20:38–43. doi: 10.1016/S0166-2236(96)10070-9. [DOI] [PubMed] [Google Scholar]
- 37.Adelson E H, Pentland A P. In: Perception as Bayesian Inference. Knill D, Richards W, editors. New York: Cambridge Univ. Press; 1996. pp. 409–423. [Google Scholar]
- 38.Lotto R B, Purves D. Nat Neurosci. 1999;2:1010–1014. doi: 10.1038/14808. [DOI] [PubMed] [Google Scholar]
- 39.Enns J T, Dilollo V. Psychol Sci. 1997;8:135–139. [Google Scholar]
- 40.Ramachandran V S, Cobb S. Nature (London) 1995;373:66–68. doi: 10.1038/373066a0. [DOI] [PubMed] [Google Scholar]
- 41.Hubel D H, Wiesel T N. Proc Natl Acad Sci USA. 1966;55:1345–1346. doi: 10.1073/pnas.55.6.1345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Wiesel T N, Hubel D H. J Neurophysiol. 1966;29:1115–1156. doi: 10.1152/jn.1966.29.6.1115. [DOI] [PubMed] [Google Scholar]
- 43.Wässle H, Grunert U, Rohrenbeck J, Boycott B B. Vision Res. 1990;30:1897–1911. doi: 10.1016/0042-6989(90)90166-i. [DOI] [PubMed] [Google Scholar]
- 44.Ahrelt P K. Eye. 1998;12:531–540. [Google Scholar]