Significance
Motion perception is important for primates, and direction selectivity (DS), the ability to perceive the direction a target is moving, is an essential part of motion perception. Yet no satisfactory mechanistic explanation has been proposed for the origin of DS in the primate visual cortex up until now. In this paper, we hypothesize that DS is initiated in feed-forward LGN input as a result of the dynamic differences between the ON and OFF pathways. The mechanisms we propose are biology based, and our theory explains experimental data for all spatial and temporal frequencies in visual stimuli. Exploiting temporal biases in parallel pathways is relevant beyond visual neuroscience; similar ideas likely apply to other types of neural signal processing.
Keywords: direction selectivity, primary visual cortex, ON/OFF pathways, temporal frequency, spatial frequency
Abstract
This paper offers a theory for the origin of direction selectivity (DS) in the macaque primary visual cortex, V1. DS is essential for the perception of motion and control of pursuit eye movements. In the macaque visual pathway, neurons with DS first appear in V1, in the Simple cell population of the Magnocellular input layer 4C. The lateral geniculate nucleus (LGN) cells that project to these cortical neurons, however, are not direction selective. We hypothesize that DS is initiated in feed-forward LGN input, in the summed responses of LGN cells afferent to a cortical cell, and it is achieved through the interplay of 1) different visual response dynamics of ON and OFF LGN cells and 2) the wiring of ON and OFF LGN neurons to cortex. We identify specific temporal differences in the ON/OFF pathways that, together with item 2, produce distinct response time courses in separated subregions; analysis and simulations confirm the efficacy of the mechanisms proposed. To constrain the theory, we present data on Simple cells in layer 4C in response to drifting gratings. About half of the cells were found to have high DS, and the DS was broadband in spatial and temporal frequency (SF and TF). The proposed theory includes a complete analysis of how stimulus features such as SF and TF interact with ON/OFF dynamics and LGN-to-cortex wiring to determine the preferred direction and magnitude of DS.
This paper proposes a solution to a longstanding question in visual neuroscience, namely, the origin of direction selectivity (DS) in the visual cortex of macaque monkeys. Motion perception is a vital visual capability well developed in primates. As perceiving motion requires perceiving the direction in which a target moves, DS, the ability of visual neurons to sense the direction of movement, is essential for motion perception (1) and for the control of pursuit eye movements (2). For these reasons, understanding DS is an important first step toward understanding how the cortex processes motion signals.
DS in cortical neurons was first documented in the cat (3). Since then, it has been found in neurons all along the visual dorsal stream (an area associated with motion processing) in primates like macaque monkeys (4–7), whose vision is like that of humans. Neurons with DS are, in fact, present across species; they are widespread among visual mammals, an experimental fact that testifies to their biological significance.
In the visual pathway of macaques, DS appears first in the primary visual cortex (V1), in the Simple cell population of the input layer 4C (8). These neurons provide feed-forward direction-selective signals to subsequent cortical layers and brain regions in the dorsal pathway. Thus, to discover the origin of DS, one is led to examining how neurons in layer 4C acquire their DS—and that is where it gets interesting: The neurons that provide visual signals to layer 4C, the Magnocellular cells in the lateral geniculate nucleus (LGN), are not direction selective (9–12). Yet many of the cells in the input layer of V1 to which they project are direction selective. A fundamental scientific question, therefore, is how 4C neurons acquire their DS. That is the question we would like to answer in this paper.
Although many papers have been written on DS since its discovery over half a century ago, and there is continued interest in the subject (13–16), no satisfactory mechanistic explanation for the origin of DS in primate cortex has been proposed before now: Early conceptual models of how DS may arise, such as the Reichardt multiplier (17) or the motion energy model (18), were not concerned with biological mechanisms. Later work proposed neural mechanisms for the motion energy model (19), but they are not sufficient for explaining DS in primate cortex. See Discussion for comparisons of different model mechanisms.
It is widely accepted that the DS computation requires spatiotemporal inseparability (STI); that is, different subregions of the receptive field have different time courses of response (18, 20, 21). What were lacking were biological mechanisms that could produce STI, and a clear understanding of how DS depends on the interaction between STI and the spatial and temporal character of the visual stimulus. These are the issues we address in this paper.
We hypothesize that a plausible biological mechanism is the interplay between 1) the different dynamics of ON and OFF LGN cells and 2) the specific wiring that connects ON and OFF cells to V1. Item 2 refers here to the well-known fact that OFF and ON LGN cells are wired to segregated V1 receptive field subregions (3, 22, 23). Our main contribution is item 1: We identify, in Results, dynamic differences in the ON/OFF pathways that, together with item 2, produce distinct response time courses in separated receptive-field subregions. The mechanisms we propose are biologically grounded, and, as we show, they are sufficient for initiating DS in the feed-forward LGN input to cortical cells.
To constrain our theory, we present experimental results on the responses of macaque 4C Simple cells to drifting gratings. Most Simple cells we recorded in 4C were unambiguously direction selective, preferring, consistently, the same direction over their entire visible ranges of spatial frequency (SF) and temporal frequency (TF); about half of the cells had high DS. Our data reveal also an important characteristic of neurons with DS, namely, the approximate invariance of DS with SF and TF. Explaining the broadband character of DS (in TF and SF) is a challenge for all previous theories. Our theory includes a complete analysis of how stimulus features like SF and TF interact with ON/OFF dynamics and LGN-to-cortex wiring to explain the broadband character of DS. The theoretical predictions are in good agreement with data.
With regard to broader implications, although the theory as described in this paper is specifically about DS, an important message is that, when combining information from multiple channels, slight biases in their temporal filters can greatly enhance the capability of a system. Thus, it may be possible to exploit the temporal axis further in the processing of biological and nonbiological signals, especially in the neural processing of sensory inputs and, possibly, in computer vision.
Results
As discussed in the Introduction, many neurons in layer 4C of the macaque are known to be direction selective, but the LGN cells which provide them with feed-forward inputs are not, and the challenge is to explain the origin of DS. The LGN input to layer 4C is sparse; the total LGN input to thousands of 4C cells preferring the full range of orientations and directions comes from only LGN cells (24–27). Hypothesizing that DS appears already in the summed responses of groups of LGN cells afferent to a cortical cell, we reason that mechanisms for DS likely arise from visual response properties of the LGN cells and from the way outputs of LGN cells are combined. The problem is, which LGN properties are implicated, and how do they combine to give DS? As each cortical cell receives input from two or three rows of ON and OFF cells, an idea going back to Hubel and Wiesel (3, 22), a natural candidate for the origin of DS is ON−OFF interaction. In Proposed Mechanisms for DS for ON−OFF LGN Pairs, below, we show, in the simplified setting of a pair of ON−OFF LGN cells, that DS in their summed responses can be deduced from known biological factors. These ideas are extended to more realistic LGN configurations in DS in Typical Feed-Forward LGN Inputs. But first we present, in Experimental Data, the experimental data to document DS properties of cortical neurons, setting conditions that must be met for the proposed theory of DS to follow.
Experimental Data.
First, we show that most Simple cells in macaque 4C are cells with DS. The data that show it are the responses of a population of Simple cells that were recorded with microelectrodes. The visual receptive fields of the cells were located at 1° to eccentricity. For each cell, responses were recorded for high-contrast, drifting sinusoidal gratings in different orientations (ori), SFs, and TFs. All measurements of Simple cell responses are in terms of the amplitude of the f1 component of neural firing rate in response to the drifting grating (14, 28). We also recorded visual responses of Complex cells in layer 4C, but they did not exhibit DS and are not considered in this paper.
In the experiments, after the optimal orientation of each cell was determined, responses to the two opposite directions at the optimal orientation were measured with what we thought were stimuli that would produce the maximum firing rate. If response in one direction was larger, that direction was called preferred (abbreviated as Pref), and the other was the opposite (to Pref, or Opp) direction. Pref and Opp directions defined by this procedure were fixed, and we studied responses in these two directions over a range of TF and SF.
Using the ratio of Pref and Opp responses as a measure of DS, we present, below, statistics for two groups of cells: a “highly direction selective” (denoted “highly DS”) group consisting of cells with Pref/Opp > 3, and a group with Pref/Opp > 2 which we refer to as “DS.” We required Pref/Opp > 2 to ensure that the preferred direction could be meaningfully identified.
DS as a function of TF.
We have complete TF data for 53 Simple cells in layer 4C. Of these 53 cells, about half (25/53) were highly DS, while 75% (40/53) were DS. Fig. 1 A, Left shows the medians of the Pref and Opp responses for these two groups as functions of TF. Observe that, for both groups, the median Pref response is larger than the median Opp response consistently across the entire TF range. The Pref response for the highly DS cells is temporally band passed, with a much smaller response at 1 Hz than at 10 Hz. Notice also that response amplitude drops off steeply at high TF, so that, at 32 Hz, both Pref and Opp responses are very small and often below the noise level.
Fig. 1 A, Right shows, for the same two groups of cells, the medians of the Pref/Opp ratio as functions of TF. For both groups, these ratios are roughly constant throughout the visible range of TF, demonstrating that, in macaque layer 4C, DS is broadband in TF.
DS as a function of SF.
We have complete SF data for 54 Simple cells, of which 26 were highly DS and 42 were DS. Fig. 1B shows the same plots as in Fig. 1A but as functions of SF, where the TF was at or near optimal for the neuron. The Pref/Opp ratios were uniformly above 1 throughout the visible range of SF, implying that DS in layer 4C neurons is broadband in SF, although Pref/Opp declined gradually as SF increased from 1 to 5 cycles per degree (c/d) in the DS group.
Aligning cells on their preferred SF and TF gives similar results; see SI Appendix, Fig. S1.
To summarize, our data show that most of the Simple cells recorded in layer 4C had some DS, and roughly half had high DS, with Pref/Opp > 3. DS was found to be broadband in both TF and SF; each cell had a consistent preferred direction and roughly constant Pref/Opp ratios over the entire visible ranges of both TF and SF.
Proposed Mechanisms for DS for ON−OFF LGN Pairs.
This section considers the summed responses of two LGN cells, one ON and one OFF. We propose mechanisms for how DS could be produced, giving a complete analysis in this simplified setting under idealized conditions. Predictions from the theory developed here are then verified in DS in Typical Feed-Forward LGN Inputs against typical configurations of LGN afferents to cortical cells.
ON−OFF pair setup.
We study here the sum of the responses of two LGN cells, one ON and one OFF, representing two rows of ON and OFF LGN cells. Throughout the analysis, the OFF cell is placed on the left, the ON cell is on the right, and they are separated spatially by degrees, as shown in Fig. 2A. The stimuli considered are moving sinusoidal grating patterns; most experimental data about DS in cortical neurons have been obtained with such stimuli, including those in Experimental Data. Our theory generalizes to drifting bars and edges, but that will not be discussed here.
We consider only gratings aligned with the rows of ON and OFF cells, so that, in the simplified setting of an ON−OFF pair, they are either left or right moving. The following notation will be used: Drifting gratings are Right (from left to right) when , and Left (from right to left) when ; here (units: c/d) is SF, (units: Hertz) is TF, and is contrast. Throughout the discussion, we assume the responses and of the ON and OFF cells are given by convolutions of against the cell’s spatial and temporal kernels, that is,
where is the temporal kernel and is the spatial kernel. The cells are located at ; for OFF, and for ON. The forms of the temporal and spatial kernels are illustrated in Fig. 2B; exact formulas and other details are given in Materials and Methods.
Of interest is , the sum of the ON and OFF responses.
DS via differential in phase differences.
We start with the following questions: Does the summed response of the ON−OFF pair have a preferred direction, and what is the preferred direction? The answers are clear in the frequency domain, an insight that goes back to Watson and Ahumada (20, 21). DS depends on the phase difference between and , denoted , which takes values between zero and . The closer is to zero, the larger the amplitude of , because, then, ON and OFF responses are in phase and reinforce each other. The closer is to , the smaller is ’s amplitude, because, then, ON and OFF are in antiphase and cancel each other.
We define to be the ON−OFF phase difference for a right-moving grating, and use for motion to the left. If ON and OFF cells have the same response time course, then , due to a left−right symmetry, and there is no DS (18, 20, 21). (See the analysis under Mechanism #1 for more detail.) But, if the symmetry of ON and OFF is broken and , the phase difference for one of the directions is closer to zero, and that is the preferred direction.
Mechanism #1: Delay of ON response relative to OFF.
A delay in the ON response with respect to OFF is consistent with retinal circuitry required for the sign inversion in the ON pathway (29) and with direct measurements (30, 31). We consider a delay of ms in the response of the ON cell, and explain how that leads to unequal phase differences for the right- and left-moving gratings.
To illustrate how the introduction of a delay breaks the symmetry between and , we show, in Fig. 3, numerical simulations of the response of an ON−OFF pair to drifting gratings. For those simulations, =10 Hz, around the peak TF for Magnocellular cells (12) and = 2.5 c/d, at or near an optimal SF for many cells in V1 representing the parafovea (32), which is the region we are modeling. The distance (Fig. 2A) between ON and OFF is set at , a first approximation based on retinal topography (25).
In Fig. 3A, there is zero delay. ON and OFF responses are shown separately (in white and black), and their sum is shown in red. The responses to left and right motion are identical with the roles of ON and OFF interchanged, resulting in the same summed response amplitude for Left and Right. In Fig. 3 B, Left, a delay between ON and OFF is introduced. Data (30, 31) suggest that is likely between 5 ms and 15 ms. Here we use ms. The summed response is larger for Right motion, for which the ON and OFF responses are closer to being in phase (Fig. 3B); Right is the preferred direction.
To understand the effect of the delay, we analyze ON/OFF phase differences in the Right and Left directions. For Right, ON lags OFF because of the separation and the delay ; there is also a phase difference of (=) because of the sign difference between ON and OFF. Thus, , where we use the formalism that denotes the shortest distance between and the set . (The phase difference is the distance between two points on a circle, and all multiples of are equivalent to zero.) For Left, ; in the Left direction, the ON cell has a phase lead due to the separation , but a phase lag due to the delay. Response amplitudes in the two directions are predictable from the phase differences. In Fig. 3B, is smaller, so Right is the preferred direction. The formulas above also show that, when , ; hence there is no preferred direction.
Because the delay enters as , ON time delays are effective in producing DS at higher TF, but not at lower TF (Fig. 3D). For instance, a grating with TF = 10 Hz travels 1/10 of a cycle in 10 ms, whereas, at TF = 2 Hz, 10 ms is only 1/50 of a cycle. Fig. 3C shows the phase shifts of an ON cell due to a 10-ms delay in response to gratings with TF = 2 and 10 Hz. The 10-Hz grating produced significant DS (Fig. 3B). However, for the 2-Hz grating, the phase shift is negligible, and the picture is closer to that of no delay; it does not produce much DS (Fig. 3D).
The data in Fig. 1A show, however, that DS is present at both high and low TF in the macaque. We therefore need another mechanism in addition to the time delay.
Mechanism #2: Modified structure of ON kernels.
Another suggestion from LGN data (30) is that the waveforms of ON and OFF time-kernels have different temporal structure. We use the following notation: Let the OFF-kernel where is as in Fig. 2B (see Materials and Methods for more detail), and say a temporal kernel has structure if
where and are the positive and negative parts of the function , respectively. In this notation, has the structure (1, 1). It can be shown mathematically (see Materials and Methods) that a kernel with produces a phase shift relative to . For , it is a phase lag, and the lag is more prominent at low TF and declines as TF increases (Fig. 4A). Other kernels with produce similar results. In general, the larger the , the larger the shift. Ratios of are consistent with data for ON kernels (30).
Combining mechanisms #1 and #2, that is, letting , and letting the ON kernel for some with a delay of ms, we may write and as
and
where is the phase lag contributed by the temporal structure of the ON kernel, and has the same meaning as before.
From the discussion above, it follows that mechanisms #1 and #2 combined would produce DS in the summed response of an ON−OFF pair across a wide range of TF with a consistent preferred direction. Simulations confirm the correctness of this reasoning. Fig. 4B shows the individual and summed responses for left- and right-moving gratings, where the ON kernel has temporal structure in addition to a time delay of 10 ms, for both = 2 Hz and = 10 Hz. The response shows a clear preference for Right motion.
Fig. 4C confirms these results across the full range of TF. We compare this panel to Fig. 1A: Fig. 4C is our theoretical prediction of the summed feed-forward inputs to a cortical cell (for Left and Right gratings), while Fig. 1 A, Left, shows data on the responses of DS cortical cells. Similarities in the shapes of the input and output curves as functions of TF and the fact that Pref/Opp for both are roughly constant over the entire range of TF (Figs. 1 A, Right and 4C) lend credibility to the theory proposed.
Preferred direction depends on and SF (equivalently, receptive field size).
In Figs. 3 and 4, the preferred direction is from OFF to ON (Right), but is that always the case? If not, then is there a direction that is consistently preferred by an ON−OFF pair, or does the preference vary with stimulus? To answer these questions systematically involves examining the relation between the SF of the grating and the physical separation between the ON and OFF pair.
A crucial observation is that, if we fix and vary SF, then the preferred direction switches at , where is ON/OFF separation and is SF. Specifically, in our OFF−ON configuration with OFF on the left, when , that is, when the spatial cycle of the grating, Right motion is preferred (this was the case in the examples shown in Figs. 3 and 4). For , Left motion is preferred, and directional preference is reversed again as increases past one. We found that preferred directions are determined solely by the relation between and SF, and are independent of the TF of the grating, provided that the temporal shift is , a condition easily satisfied by the mechanisms proposed. A mathematical proof for the assertions in this paragraph is given in Materials and Methods.
Fig. 5A illustrates the consequences of these predictions by showing Right and Left responses as functions of SF, for four different values of , and five TFs. In the rows in which ° or °, the predicted directional switch can be seen as SF increases in each plot: When , the preferred direction is from OFF to ON (Right), and, when , it is from ON to OFF (Left), and this is true across the entire range of TF. Reversals for are less visible because ON/OFF separation of corresponds to reversals at 5 c/d, where the curves have steep slopes. As for , reversals are predicted to occur at 10 c/d, a high enough SF that the response is too small to be visible.
The following points are illustrated in Fig. 5A. One is that, while, mathematically, direction reversals occur at , as a visual response, a reversal is registered only if the SF at which it occurs falls within the animal’s range of visible SF, which is known to scale with receptive field size (32, 33). A second point is that as important as the phenomenon of reversal itself is the loss of DS near reversal points. The lowering of Pref/Opp for an entire region of (, SF) near reversal occurs as a result of the gradual interchange of preferred directions.
That leads to our third point, the prediction that DS is stronger for smaller ON/OFF separation. In Fig. 5A, peak firing rate occurs at TF = 10 Hz and SF = 2 c/d to 3 c/d, a good approximation to data (Fig. 1). Observe that, at these values of TF and SF, Pref/Opp is close to one at , is a little larger at , and continues to increase as gets smaller. The Pref/Opp ratio is larger for smaller because, with decreasing , reversal occurs farther from SF = 2 c/d to 3 c/d.
Implications for the macaque.
In macaque V1, we have not observed evidence for reversals of preferred direction (Fig. 1 B, Right), suggesting that reversal occurs outside of the range of SF that produces a response. This range can be seen in Fig. 1 B, Left. The discussion above together with the results in Fig. 1 predict therefore that, to provide feed-forward input with robust DS across the full range of SF, ON−OFF rows that project to the same cortical cell are likely apart at the eccentricities of the cells recorded. The decline in DS for SF between 1 c/d and 5 c/d seen in Fig. 1B suggests also that we may be heading for a reversal point, so this ON−OFF separation is likely .
Comparing theory to data, our theoretical prediction of the summed LGN feed-forward inputs to cortical cells at TF = 10 Hz (the optimal TF) and -, as shown in Fig. 5, shows a remarkable resemblance to the data on cortical responses shown in Fig. 1 B, Left.
DS in Typical Feed-Forward LGN Inputs.
In Proposed Mechanisms for DS for ON−OFF LGN Pairs, we developed a theory for the DS of an ON−OFF pair. We now extend this theory to typical feed-forward LGN inputs to layer 4C, assuming that the set of LGN cells that project to a Simple cell in layer 4C is organized in two or three rows of ON and OFF cells, alternating in ON/OFF in the case of three rows. These widely accepted rules of LGN-to-cortex connections were proposed by Hubel and Wiesel (3) and have been confirmed in experiments (23, 34–36). The setup is as described at the beginning of Proposed Mechanisms for DS for ON−OFF LGN Pairs, but for arbitrary collections of LGN cells. The response of each LGN cell is computed as a convolution of the light intensity function with spatial and temporal kernels as before, and DS for the summed responses from the entire group is evaluated.
LGN sampling of visual space is known to be very sparse; two independent estimates based on M-retinal ganglion cell density (25) and on macaque LGN data (24) point to an average of about nine LGN cells projecting to an area of in the visual field at eccentricity. Respecting this density, we modeled a sheet of LGN cells using either a randomly perturbed hexagonal lattice (Fig. 6 A–C) or a randomly perturbed rectangular lattice (Fig. 6 D–E). Once the ON and OFF mosaics were fixed, the temporal kernel of each LGN cell was drawn randomly and independently following the prescription from Proposed Mechanisms for DS for ON−OFF LGN Pairs.
We consider, in Fig. 6, some typical configurations of LGN cells that form the inputs to a cortical cell. Following ref. 37, it is likely that no more than four to six LGN cells project to a Simple cell, and we learned from our earlier discussion of ON/OFF pairs that ON/OFF separation d should be , with stronger DS predicted for smaller separation .
The configurations in Fig. 6 A and B translate easily into our setting of ON–OFF pairs: projecting along a direction orthogonal to the rows of ON and OFF cells, we arrive at precisely the picture treated earlier with near-optimal spacing between the two cells. As expected, DSs of the summed responses of these configurations are strong, and, as predicted, the preferred direction is from OFF to ON.
Fig. 6 C and D requires extensions of the ideas developed for ON−OFF pairs. In Fig. 6C, the projected configuration has three cells, an ON cell flanked by two OFFs. DS for this three-cell configuration can be thought of as competition for DS dominance between the OFF−ON pair on the left and the ON−OFF pair on the right; one should also expect cancellation, as these two pairs will prefer opposite directions. In the configuration shown in Fig. 6C, one would predict that the left OFF−ON pair should prevail, because it carries more weight, and ON/OFF distance is smaller. The orientation tuning curve shown confirms these predictions: The left ON−OFF pair is dominant, and DS is weaker due to partial cancellation. Fig. 6D projects to an ON−OFF pair, with the ON cell carrying significantly more weight. That weakens the DS, although the preferred direction is still quite evident.
The configuration in Fig. 6E projects to an OFF−ON−OFF triplet that is left−right symmetric; no DS can be expected for such a configuration, and little is observed.
Fig. 6 and other examples not shown illustrate the fact that the mechanisms proposed in Proposed Mechanisms for DS for ON−OFF LGN Pairs lead to variable amounts of DS in feed-forward LGN inputs, depending on the spatial alignment of the group of LGN cells (and their temporal kernels). The conclusion about variable amounts of DS is consistent with the data presented in Experimental Data, which show that not all 4C neurons are direction selective, and, among those that are direction sensitive, Pref/Opp can take on a range of values; see Fig. 1.
Discussion
The main contributions of this paper can be summarized as follows:
-
1)
We presented experimental data that document the abundance of high DS cells in layer 4C of macaque V1 and the dependence of DS on TF and SF for these cells. We then proposed a theory the predictions of which are consistent with these data.
-
2)
Our theory asserts that DS could originate from dynamic differences between ON and OFF cells and should therefore already present in feed-forward LGN inputs to cortex. Two dynamic differences, time delay and kernel shape, are sufficient for initiating DS across the full spectrum of TF.
-
3)
We made precise how stimulus features such as TF and SF, the dynamics of ON/OFF LGN cells, and LGN-to-cortex wiring interact to determine the preferred direction and magnitude of DS in feed-forward LGN inputs.
These results should be understood in the following light: First, Proposed Mechanisms for DS for ON−OFF LGN Pairs discusses mechanisms for DS under idealized conditions. Not all groups of LGN cells afferent to a cortical cell can be projected neatly to an ON−OFF pair, as we have explained in DS in Typical Feed-Forward LGN Inputs. The time delay for ON cells in mechanism #1 can, in reality, take on a range of values, and the kernels for ON cells in mechanism #2 are not always of type with as suggested in ref. 30. We expect that data in relation to the properties in mechanisms #1 and #2 are more variable than in the theory described in Proposed Mechanisms for DS for ON−OFF LGN Pairs, consistent with the fact that not all cells in 4C have high DS (Fig. 1).
Second, we do not claim that the mechanisms proposed here are the only mechanisms that contribute to DS. The theory is about DS properties in feed-forward LGN inputs to cortex, and it asserts only that DS could originate from dynamic differences between ON and OFF LGN cells. Cortico−cortical interaction can modify DS properties. We will report on that in a separate paper in which we implement the ideas of the present paper to build a large-scale cortical model. It will be shown that mechanisms #1 and #2 alone are sufficient for producing a degree of DS comparable to that found in data. The presence of specific intracortical connections would not contradict our theory, however, especially in layers beyond 4C where longer-range connections are present; see, for example, ref. 13.
Previous theoretical work has proposed STI in − plots as a characterization of DS (18, 20, 21). Findings of STI have been obtained for V1 neurons in experimental studies (38, 39). While − plots in and of themselves are not intended to shed light on mechanisms, STI is an indication of DS. In SI Appendix, Fig. S2, we show that, when the mechanisms proposed in Proposed Mechanisms for DS for ON−OFF LGN Pairs are implemented, the summed responses of ON−OFF pairs of LGN cells generate STI. The converse need not be true, however: STI need not imply DS. The amount of DS depends also on the spatial and temporal properties of the visual stimulus, a fact we also demonstrate in SI Appendix.
We finish with comparisons of our theory for DS in macaque layer 4C with theories of DS offered for other species. One neural mechanism for DS proposed in previous modeling studies was for specific connections between V1 cells and LGN cells that are either Sustained or Transient in their step responses (40, 41). The Sustained−Transient theory was proposed originally as a mechanism for DS in cat V1 cortex. The concept was revived recently in models of DS in mouse V1, by Lien and Scanziani (14) and Billeh et al. (15). We consider the Sustained−Transient theory to be implausible for macaque because there are not clearly separate classes of Sustained and Transient Magnocellular LGN neurons. In our proposed theory, ON vs. OFF LGN cells have different temporal kernels, but these differences are much smaller and subtler than those observed in Sustained and Transient neurons in rodent or cat retinal and LGN cells (16). Even accepting Sustained/Transient differences as underlying mechanisms of DS for cat and mouse V1, the story there is incomplete without information on the wiring of these two subpopulations and how that leads to the DS properties observed. In the mouse, estimates are that there are either 10 (14) or 40 (15) LGN cells converging into one subfield. The problem of how to place all (or most) of the Sustained inputs into one subfield and Transient inputs into the other remains to be clarified. Our theory solves this problem by postulating that ON and OFF cells have different temporal impulse responses. That, together with the consensus view on how to segregate ON and OFF inputs from LGN into different subfields (22, 23), is sufficient for generating DS.
A different mechanism for DS is the idea that there are different types of LGN cells, so-called lagged and nonlagged cells, with widely different temporal kernels. This mechanism has been invoked to explain DS data in cat (19) and ferret (42) visual cortical cells, and to be the source of STI in the motion energy model (19). It is important to note that the data on the TF dependence of DS in ferret V1 (42) are very different from the macaque data in Fig. 1, suggesting that the temporal dependence of cortical DS in ferret and cat may be different from that in macaque. The lagged vs. nonlagged theory does not predict the broadband character of DS found in macaque 4C cells; it predicts that DS should weaken markedly at high TF (42) as in the ferret V1 data. Therefore, the mechanism of DS in macaque V1 must be sought elsewhere. In support of that conclusion, lagged cells have not been found in macaque LGN (9–12).
Other DS models invoked intracortical inhibitory mechanisms for DS in cat V1 (16, 43, 44), in particular, cortical inhibition tuned to the nonpreferred direction, but intracellular recording in cat V1 (39) showed such models also to be implausible. Our theory indicates that direction-tuned cortical inhibition is not necessary for the initiation of DS in primate V1.
Materials and Methods
Modeling LGN Response.
We model the input current to an LGN cell the center of whose receptive field is located at by
where
Here the sign is for ON and OFF cells, respectively, is background current, and are the temporal and spatial kernels of the LGN cell, is the light intensity map, and is a constant to be adjusted, and denotes the maximum of the bracketed value and zero.
In Proposed Mechanisms for DS for ON−OFF LGN Pairs, the OFF cell is placed at , the ON cell is placed at , and depends only on and . The more general formulas above are used in DS in Typical Feed-Forward LGN Inputs. In both Proposed Mechanisms for DS for ON−OFF LGN Pairs and DS in Typical Feed-Forward LGN Inputs, we have used as our ON/OFF LGN response, conflating the input current to LGN with its output (in addition to omitting the background firing and the part of the operation). This conflation was on purpose: The two quantities are roughly proportional, and the exact sinusoidal form of makes the analysis much more transparent.
Formulas for the spatial and temporal kernels are as follows: The spatial kernel is a difference of Gaussians,
with , , , and (45); and are chosen to make the center part of the receptive field correspond to a Gaussian with SD ° (13). The preferred SF of the spatial kernel is 2.5 c/d. The reference temporal kernel , also referred to as in Mechanism #2: Modified Structure of ON Kernels, has the form [adapted from Zhu et al. (45)]
with ms and ms. Parameters are chosen following guidance from temporal kernels derived in experiments (30); specifically, the cross-over point from positive to negative was chosen to lie at 36 ms so that the peak response occurs at about TF = 10 Hz, following the data reported in Fig. 1 A, Left. Note that is positive before the zero crossing, as the polarity of the LGN cell is implemented in the sign of in the expression for , not in the kernel.
Theoretical basis for mechanism #2.
Mechanism #2 is based on the observation that, at low TF, the response to a sinusoidal signal is effectively a cosine function for kernels, and modifying it to an kernel with adds a sine function to the response, causing a phase displacement.
In more detail, we consider a signal of the form . For , let and denote the responses of an LGN cell with a and kernels, respectively. If the kernel is , then the kernel is
At low TF, convolving a function with a kernel is similar to computing its derivative—without dividing by the distance between the centers of mass of the positive and negative parts of the kernel (here that distance is s). Convolving with a nonnegative kernel is simply averaging. Assuming , we obtain
For example, for a 2-Hz grating, . For a kernel, this gives a phase difference between and . The smaller the TF, the larger the phase displacement caused by the sine term, providing DS at low TF when a Simple ON delay (mechanism #1) does not. (For larger TF, this argument loses its validity when kernel widths are as large as temporal cycles.)
It remains to explain why an kernel with causes a delayed response in the ON cell. This is because the maximum of occurs at . For , ; that is, is still increasing when is at its maximum, and hence peaks later, as illustrated in Fig. 4A.
Mathematical analysis of reversal of directional preference.
In Proposed Mechanisms for DS for ON−OFF LGN Pairs, we asserted that preferred direction switches at , where is ON/OFF separation and is the SF of the grating. A mathematical proof goes as follows. Look at the quantities inside the brackets in the formal expression for phase differences derived in Proposed Mechanisms for DS for ON−OFF LGN Pairs, and recall that, without the temporal phase lag, the spatial phase lead for ON is for Right-moving gratings and for Left-moving gratings. Consider first the case (Fig. 7A), where is between zero and and is between and . These phase leads are depicted as dashed red arrows. The effect of the temporal phase lag for ON is to shift these arrows to the left, to the solid red arrows depicting the net phase leads for ON. Provided that the temporal phase shift is less than (the mechanisms proposed produce shifts ), we have because the red arrow for Right is closer to zero than the red arrow for Left is to . Hence Right motion is preferred.
The picture is different for (Fig. 7B). For this range of , lies between and zero, and lies between and . Here the temporal phase lag causes , resulting in a preference for Left motion.
A similar argument shows that preference is reversed again at .
Experimental Methods.
Animal preparation.
Adult male old-world monkeys (Macaca fascicularis) were used in acute experiments in compliance with NIH and New York University Animal Use Committee regulations. The animal preparation and recording were performed as described in detail previously (46, 47). Anesthesia was initially induced using ketamine (5 mg/kg to 20 mg/kg, intramuscularly) and was maintained with isofluorane (1 to 3%) during venous cannulation and intubation. For the remainder of the surgery and recording, anesthesia maintained with sufentanil citrate (6 gkg−1h−1 to 18 gkg−1h−1, intravenously [i.v.]). After surgery was completed, muscle paralysis was induced and maintained with vecuronium bromide (Norcuron, 0.1 mgkg−1h−1, i.v.), and anesthetic state was assessed by continuously monitoring the animals’ heart rate, EKG, blood pressure, expired CO2, and EEG.
After the completion of each electrode penetration, three to five small electrolytic lesions (3 A for 3 s) were made at separate locations along the electrode track. At the end of the experiments, the animals were deeply anesthetized with sodium pentobarbital (60 mg/kg, i.v.) and transcardially exsanguinated with heparinized lactated Ringer’s solution, followed by 4 L of chilled fresh 4% paraformaldehyde in 0.1 M phosphate buffer, pH 7.4. The electrolytic lesions were located in the fixed tissue, and electrode tracks were reconstructed to assign the recorded neurons to cortical layers as described previously (8). The data from the current study were obtained during the course of the basic characterization of neuronal receptive field properties that is standard for many visual cortical experiments. In 49 animals, more than 700 neurons were recorded in oblique penetrations. For this paper, a full dataset was obtained for 94 neurons localized to layer 4C.
Characterization of visual properties of V1 neurons.
We recorded action potentials (spikes) extracellularly from single units in V1 using glass-coated tungsten microelectrodes. Each single neuron was stimulated monocularly through the dominant eye (the nondominant eye occluded). Receptive fields were located in the visual field between 1° and 6° from the center of gaze. Stimuli were displayed at a screen resolution of 1,024 × 768 pixels, a refresh rate of 100 Hz, and a viewing distance of 115 cm on either a Sony Trinitron GDM-F520 CRT monitor (mean luminance 90 cd/ to 100 cd/) or an Iiyama HM204DT-A CRT monitor (mean luminance 60 cd/). Monitor luminance was calibrated using a Photo Research PR-650 spectroradiometer and linearized via a lookup table.
The response to drifting gratings was used to characterize visual response properties. In this paper, we report measurements of orientation tuning, SF tuning, and TF tuning. We used the f1 response for neurons (traditionally called Simple cells) with f1/f0 ratios .
Orientation tuning.
The responses of each neuron were recorded for different orientations between 0° and , in either 20° or 15° steps. The stimuli were achromatic gratings at the preferred SF and TF, at a contrast of 64% or greater. All stimuli were presented in a circular window confined to the classical receptive field.
SF tuning.
Each neuron was presented with a range of spatial frequencies, usually in -octave steps from 0.1 c/d to around 10 c/d. For some neurons, the upper limit was extended if the neurons had responses to higher spatial frequencies. For almost all neurons that were orientation selective, we measured SF tuning at the preferred orientation and drift direction as well as at the nonpreferred drift direction.
TF tuning.
Each neuron was presented with a range of temporal frequencies, usually in one-octave steps from 0.5 Hz to 32 Hz. Measurements were made for drifting gratings at the optimal orientation and in the preferred and nonpreferred directions.
Supplementary Material
Acknowledgments
This research was supported by NIH Grant R01 EY001472 (R.S.), NIH Grants R01 EY008300 and NIH R01 EY 15549 (M.H.), NSF Grant 1734854 (R.S. and L.-S.Y.), NIH Core Grant P30 EY13079, and NIH Training Grant T32 EY7136. We thank the graduate students and postdoctoral fellows who participated in the experiments—Dario Ringach, Michael Sceniak, Elizabeth Johnson, Siddhartha Joshi, J. A. Henrie, Patrick Williams, Dajun Xing, Christopher Henry, and Anita Disney—and Madhura Joglekar for early numerical explorations.
Footnotes
The authors declare no competing interest.
This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2105062118/-/DCSupplemental.
Data Availability
Simulation code used to generate the results shown have been deposited to GitHub and are publicly accessible at https://github.com/LoganChariker/PNAS_2021.
References
- 1.Parker A. J., Newsome W. T., Sense and the single neuron: Probing the physiology of perception. Annu. Rev. Neurosci. 21, 227–277 (1998). [DOI] [PubMed] [Google Scholar]
- 2.Newsome W. T., Wurtz R. H., Dürsteler M. R., Mikami A., Deficits in visual motion processing following ibotenic acid lesions of the middle temporal visual area of the macaque monkey. J. Neurosci. 5, 825–840 (1985). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Hubel D. H., Wiesel T. N., Receptive fields, binocular interaction and functional architecture in the cat’s visual cortex. J. Physiol. 160, 106–154 (1962). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zeki S. M., Functional specialisation in the visual cortex of the rhesus monkey. Nature 274, 423–428 (1978). [DOI] [PubMed] [Google Scholar]
- 5.Movshon J. A., Adelson E., Gizzi M., Newsome W. T., “The analysis of moving visual patterns” in Pontificiae Academiae Scientiarum Scripta Varia, Chagas C., Gattass R.Gross C., Eds. (Pontificia Academiae Scientiarum, Rome, Italy, 1985) vol. 54, pp. 117–151. [Google Scholar]
- 6.Kiper D. C., Levitt J. B., Gegenfurtner K. R., “Extrastriate visual areas” in Color Vision, from Genes to Perception, Gegenfurtner K. R., Sharpe L. B., Eds. (Cambridge Univ. Press, Cambridge, UK, 2001), chap. 13. [Google Scholar]
- 7.Pack C. C., Born R. T., Livingstone M. S., Two-dimensional substructure of stereo and motion interactions in macaque visual cortex. Neuron 37, 525–535 (2003). [DOI] [PubMed] [Google Scholar]
- 8.Hawken M. J., Parker A. J., Lund J. S., Laminar organization and contrast sensitivity of direction-selective cells in the striate cortex of the Old World monkey. J. Neurosci. 8, 3541–3548 (1988). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wiesel T. N., Hubel D. H., Spatial and chromatic interactions in the lateral geniculate body of the rhesus monkey. J. Neurophysiol. 29, 1115–1156 (1966). [DOI] [PubMed] [Google Scholar]
- 10.Kaplan E., Shapley R. M., X and Y cells in the lateral geniculate nucleus of macaque monkeys. J. Physiol. 330, 125–143 (1982). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hicks T. P., Lee B. B., Vidyasagar T. R., The responses of cells in macaque lateral geniculate nucleus to sinusoidal gratings. J. Physiol. 337, 183–200 (1983). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Derrington A. M., Lennie P., Spatial and temporal contrast sensitivities of neurones in lateral geniculate nucleus of macaque. J. Physiol. 357, 219–240 (1984). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wilson D. E., Scholl B., Fitzpatrick D., Differential tuning of excitation and inhibition shapes direction selectivity in ferret visual cortex. Nature 560, 97–101 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Lien A. D., Scanziani M., Cortical direction selectivity emerges at convergence of thalamic synapses. Nature 558, 80–86 (2018). [DOI] [PubMed] [Google Scholar]
- 15.Billeh Y. N., et al., Systematic integration of structural and functional data into multi-scale models of mouse primary visual cortex. Neuron 106, 388–403.e18 (2020). [DOI] [PubMed] [Google Scholar]
- 16.Freeman A. W., A model for the origin of motion direction selectivity in visual cortex. J. Neurosci. 41, 89–102 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Reichardt W., “Autocorrelation, a principle for the evaluation of sensory information by the central nervous system” in Sensory Communication, Rosenblith W., Ed. (Wiley, New York, NY, 1961), pp. 303−317. [Google Scholar]
- 18.Adelson E. H., Bergen J. R., Spatiotemporal energy models for the perception of motion. J. Opt. Soc. Am. A 2, 284–299 (1985). [DOI] [PubMed] [Google Scholar]
- 19.Emerson R. C., Bergen J. R., Adelson E. H., Directionally selective complex cells and the computation of motion energy in cat visual cortex. Vision Res. 32, 203–218 (1992). [DOI] [PubMed] [Google Scholar]
- 20.Watson A., Ahumada A., “A look at motion in the frequency domain” (NASA Tech. Memo. 84352, NASA, 1983). [Google Scholar]
- 21.Watson A. B., Ahumada A. J. Jr, Model of human visual-motion sensing. J. Opt. Soc. Am. A 2, 322–341 (1985). [DOI] [PubMed] [Google Scholar]
- 22.Hubel D. H., Wiesel T. N., Receptive fields and functional architecture of monkey striate cortex. J. Physiol. 195, 215–243 (1968). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Reid R. C., Alonso J. M., Specificity of monosynaptic connections from thalamus to visual cortex. Nature 378, 281–284 (1995). [DOI] [PubMed] [Google Scholar]
- 24.Connolly M., Van Essen D., The representation of the visual field in parvicellular and magnocellular layers of the lateral geniculate nucleus in the macaque monkey. J. Comp. Neurol. 226, 544–564 (1984). [DOI] [PubMed] [Google Scholar]
- 25.Silveira L. C., Perry V. H., The topography of magnocellular projecting ganglion cells (M-ganglion cells) in the primate retina. Neuroscience 40, 217–237 (1991). [DOI] [PubMed] [Google Scholar]
- 26.Chariker L., Shapley R., Young L. S., Orientation selectivity from very sparse LGN inputs in a comprehensive model of macaque V1 cortex. J. Neurosci. 36, 12368–12384 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Garcia-Marin V., Kelly J. G., Hawken M. J., Major feedforward thalamic input into layer 4c of primary visual cortex in primate. Cereb. Cortex 29, 134–149 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Hawken M. J., Shapley R. M., Grosof D. H., Temporal-frequency selectivity in monkey visual cortex. Vis. Neurosci. 13, 477–492 (1996). [DOI] [PubMed] [Google Scholar]
- 29.Masland R. H., The neuronal organization of the retina. Neuron 76, 266–280 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Reid R. C., Shapley R. M., Space and time maps of cone photoreceptor signals in macaque lateral geniculate nucleus. J. Neurosci. 22, 6158–6175 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Jin J., Wang Y., Lashgari R., Swadlow H. A., Alonso J. M., Faster thalamocortical processing for dark than light visual targets. J. Neurosci. 31, 17471–17479 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.DeValois R., DeValois K., Spatial Vision (Oxford University Press, New York, NY, 1988). [Google Scholar]
- 33.van Santen J. P., Sperling G., Temporal covariance model of human motion perception. J. Opt. Soc. Am. A 1, 451–473 (1984). [DOI] [PubMed] [Google Scholar]
- 34.Tanaka K., Cross-correlation analysis of geniculostriate neuronal relationships in cats. J. Neurophysiol. 49, 1303–1318 (1983). [DOI] [PubMed] [Google Scholar]
- 35.DeAngelis G. C., Ohzawa I., Freeman R. D., Receptive-field dynamics in the central visual pathways. Trends Neurosci. 18, 451–458 (1995). [DOI] [PubMed] [Google Scholar]
- 36.Yeh C. I., Xing D., Williams P. E., Shapley R. M., Stimulus ensemble and cortical layer determine V1 spatial receptive fields. Proc. Natl. Acad. Sci. U.S.A. 106, 14652–14657 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Angelucci A., Sainsbury K., Contribution of feedforward thalamic afferents and corticogeniculate feedback to the spatial summation area of macaque V1 and LGN. J. Comp. Neurol. 498, 330–351 (2006). [DOI] [PubMed] [Google Scholar]
- 38.McLean J., Palmer L. A., Contribution of linear spatiotemporal receptive field structure to velocity selectivity of simple cells in area 17 of cat. Vision Res. 29, 675–679 (1989). [DOI] [PubMed] [Google Scholar]
- 39.Priebe N. J., Ferster D., Direction selectivity of excitation and inhibition in simple cells of the cat primary visual cortex. Neuron 45, 133–145 (2005). [DOI] [PubMed] [Google Scholar]
- 40.Marr D., Ullman S., Directional selectivity and its use in early visual processing. Proc. R. Soc. Lond. B Biol. Sci. 211, 151–180 (1981). [DOI] [PubMed] [Google Scholar]
- 41.Baker P. M., Bair W., Inter-neuronal correlation distinguishes mechanisms of direction selectivity in cortical circuit models. J. Neurosci. 32, 8800–8816 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Moore B. D. 4th, Alitto H. J., Usrey W. M., Orientation tuning, but not direction selectivity, is invariant to temporal frequency in primary visual cortex. J. Neurophysiol. 94, 1336–1345 (2005). [DOI] [PubMed] [Google Scholar]
- 43.Suarez H., Koch C., Douglas R., Modeling direction selectivity of simple cells in striate visual cortex within the framework of the canonical microcircuit. J. Neurosci. 15, 6700–6719 (1995). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Maex R., Orban G. A., Model circuit of spiking neurons generating directional selectivity in simple cells. J. Neurophysiol. 75, 1515–1545 (1996). [DOI] [PubMed] [Google Scholar]
- 45.Zhu W., Shelley M., Shapley R., A neuronal network model of primary visual cortex explains spatial frequency selectivity. J. Comput. Neurosci. 26, 271–287 (2009). [DOI] [PubMed] [Google Scholar]
- 46.Ringach D. L., Shapley R. M., Hawken M. J., Orientation selectivity in macaque V1: Diversity and laminar dependence. J. Neurosci. 22, 5639–5651 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Henry C. A., Joshi S., Xing D., Shapley R. M., Hawken M. J., Functional characterization of the extraclassical receptive field in macaque V1: Contrast, orientation, and temporal dynamics. J. Neurosci. 33, 6230–6242 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Simulation code used to generate the results shown have been deposited to GitHub and are publicly accessible at https://github.com/LoganChariker/PNAS_2021.