Abstract
Although investigations in computational neuroscience have been extensive, the opportunity (that has made such a marked difference in physical sciences) to test detailed and subtle quantitative consequences of a theory against experimental results is rare. In this paper, we outline a testable consequence of two contrasting theories of synaptic plasticity applied to the disconnection in visual cortex of the closed eye in monocular deprivation. This disconnection is sometimes thought to be the consequence of a process that stems from a competition of inputs for a limited resource such as neurotrophin. Such a process leads to what we call spatial competition, or heterosynaptic synaptic modification. A contrasting view—exemplified by the Bienenstock, Cooper, and Munro (BCM) theory—is that patterns of input activity compete in the temporal domain. This temporal competition is homosynaptic and does not require a conserved resource. The two mechanisms, homosynaptic and heterosynaptic, are the distinguishing characteristics of two general classes of learning rules we explore by using a realistic environment composed of natural scenes. These alternative views lead to opposite dependence on the level of presynaptic activity of the rate of disconnection of the closed eye in monocular deprivation. This strong and testable consequence sets the stage for a critical distinguishing experiment. This experiment has been done and supports the second view. These results have important implications for the processes of learning and memory storage in neocortex.
Although investigations in computational neuroscience have been extensive, the opportunity (that has made such a marked difference in physical sciences) to test detailed and subtle quantitative consequences of a theory against experimental results is rare (1). In this paper, we outline a testable consequence of two contrasting theories of synaptic plasticity applied to the disconnection in visual cortex of the closed eye in monocular deprivation (MD). This disconnection is sometimes thought to be the consequence of a process that stems from a competition of inputs for a limited resource (2, 3) such as neurotrophin (4). Such a process leads to what we call spatial competition, or heterosynaptic synaptic modification. A contrasting view—exemplified by the Bienenstock, Cooper, and Munro (BCM) theory (5)—is that patterns of input activity compete in the temporal domain. This temporal competition is homosynaptic and does not require a conserved resource. These alternative views lead to opposite dependence on the level of presynaptic activity of the rate of disconnection of the closed eye in MD. This strong and testable consequence sets the stage for a critical distinguishing experiment. This experiment has been done (6) and supports the second view. These results have important implications for the processes of learning and memory storage in neocortex.
In this paper we present results using two different classes of learning rules, one that exhibits heterosynaptic synaptic modification and one (of which BCM is a member) that exhibits homosynaptic synaptic modification. These general classes of learning rules are characterized by their dominant method of stabilization—weight decay for the heterosynaptic class and a sliding threshold for the homosynaptic class. The dynamics of the closed-eye disconnection in MD enables us to draw an experimentally testable distinction between the two different classes. We use a visual environment composed of natural scenes that is realistic enough so that the simulations can meaningfully be compared with the experiment.
METHODS
We use 13 × 13 circular patches from 12 images of natural scenes to represent the visual environment. The images are processed either by a retinal difference of Gaussians, with the biologically observed ≈3:1 ratio of the surround to the center of the ganglion receptive field (8), or a whitening filter corresponding to the lateral geniculate nucleus response properties (9). The response of the cortical cell is given by c = s(m⋅d), where s(⋅) is a rectifying sigmoid, which sets the minimum and maximum values of the postsynaptic response. Neurons with a particular learning rule are trained with natural scene stimulus to both eyes until we obtain binocular oriented receptive fields. To model deprivation, we continue training but present uniform noise to the deprived eye(s). To quantitatively measure the timing of the deprivation experiments, we measure the response of the neurons by using oriented stimuli and then estimate the characteristic half-time for the decay of neuronal response. The specific learning rules used are shown in Learning Rules. The details of the different classes of learning rules and how they are derived are given in Blais et al. (10).
RESULTS
In MD, one eye is deprived of patterned stimuli; this results in the loss of responsiveness of cells in kitten′s striate cortex to stimulation through the deprived eye (11, 12). Because the loss of response is much more striking in monocular, as opposed to binocular, deprivation (13, 14), it is sometimes thought to be the consequence of a competitive process. We call such a process (generally arising through a subtractive term in the synaptic modification equation) spatially competitive. Mathematically it has the form
1 |
where m is the vector of synaptic weights, d and c are the pre- and postsynaptic activity of the cell, respectively, and f(c) and g(c) are functions of the postsynaptic activity. This form includes most stabilized Hebb-like rules as well as some others we have explored (see Learning Rules and ref. 10). When the subtractive term plays the crucial role in stabilizing the system, a heterosynaptic learning rule is at work. This has the effect of decreasing some synapses if others increase to keep some measure (such as Σ mj or |m|2) constant. Thus, the strength of synapses can change even in the absence of presynaptic activity to those synapses. Not all heterosynaptic processes lead to the observed receptive fields or ocular dominance distributions (15, 16). When they do (as they have in all of the cases we have analyzed), the disconnection of the closed eye either decreases or remains constant as the presynaptic activity from the closed eye increases, and presynaptic activity from the closed eye is not required for the disconnection (Fig. 1a).
In contrast, in the BCM theory (5) and other homosynaptic rules we explored (see Learning Rules and ref. 10), synaptic modification leads to what we call temporal competition between input patterns. This results in the disconnection of the closed eye because of homosynaptic long-term depression driven by presynaptic activity from that eye. As the presynaptic activity from the closed eye increases the rate of disconnection increases (Fig. 1b). To illuminate the origin of this counterintuitive result, as well as to display further consequences and precise parameter dependence, we present below a simplified analysis of MD according to the BCM theory.
The BCM theory (5) postulates synaptic modification of the form
2 |
where θM is the moving modification threshold and φ(c,θM) is shown in Fig. 2. The moving threshold stabilizes the weights. The general form of BCM modification has been directly verified experimentally (17–19).
In normal visual environments, extensive simulations have shown that BCM synaptic modification leads to experimentally observed receptive fields (20) as well as to cells with varying ocular dominance (15, 16) and direction selectivity (21). Furthermore, such simulations have shown that in various deprived-visual environments, there is qualitative agreement between theory and experiment on the kinetics and final states of the receptive fields (21). Examples of BCM simulations in a visual environment composed of natural images (20) are shown in Fig. 3. These simulations illustrate how binocular BCM neurons develop in normal (Fig. 3, NR) and deprived visual conditions (see Methods). In these simulations, the closed-eye input is taken to be noise corresponding to the levels of lateral geniculate nucleus activity driven by input from the closed eye. A single set of parameters can be found such that both the final weight configurations and the time sequences are in qualitative agreement with experimental results for the various rearing conditions.
The effects of binocular deprivation (BD) are much less severe than those observed for MD (13, 14). This has led to the hypothesis that a spatially competitive process is at work. However, BD also can be produced by a homosynaptic mechanism. In the BCM theory, the difference between the rates of disconnection in BD and MD is the result of a moving threshold. This effect is illustrated in Fig. 4. During MD, the patterned input into the open eye keeps the modification threshold (θM) high, resulting in a large depression region of the modification function (Fig. 4a). In BD, the unpatterned activity entering both eyes is insufficient to maintain the threshold at a high value, resulting in a significantly reduced depression (Fig. 4b). Eventually, binocularly deprived cells lose orientation selectivity, but the time scale is much longer than for the loss of response to the closed eye in MD.
The noise dependence of the rate of disconnection of the closed eye in MD according to the BCM theory is obtained as follows. BCM modification (Eq. 2) for the deprived channel in MD is.
3 |
where n is the presynaptic input from the closed eye and c is the postsynaptic response. By using a linear approximation, c becomes
4 |
To best illustrate the effect, we calculate in the simplified situation in which the neuron has become selective to open eye inputs, so that mopen·dopen ≃ θM or zero, and c ≃ θM + mclosed⋅n or mclosed⋅n, depending on whether the input to the open eye is a preferred or a nonpreferred pattern.
Near the two zero crossings, we can make a linear approximation to φ so that φ ≃ −ɛ2c near c = 0 and φ ≃ +ɛ1(c − θM) near c ≃ θM. Here ɛ2 and ɛ1 are the slopes near zero and θM. Averaging over the noise input, for mean zero noise, we obtain for non-preferred input to the open eye
5 |
and for preferred input to the open eye
6 |
For selective neurons, nonpreferred inputs greatly outnumber preferred inputs, so that
7 |
Thus, the rate of disconnection of the closed eye increases exponentially with increasing noise from that channel; faster disconnection occurs if the noise input from this deprived eye is greater. This is a subtle and counterintuitive consequence of the BCM theory. The results of this simplified analysis carry over into the more complex case of simulations using real images as input and learning rules of the homosynaptic class (Figs. 1 and 5).
DISCUSSION
We are thus capable of distinguishing between two contrasting points of view, both of which can account for the disconnection of the closed eye in MD but differ fundamentally in their dynamics and parameter dependence. Other rules have been proposed to account for segregation of ocular dominance columns (22, 23). These studies used environments and constraints that are not easily compared with the current work without the addition of extra assumptions. In most cases, however, when applied to the case of MD, these rules can be understood as correlational rules (24) and be categorized into the heterosynaptic class of learning rules. Recent experimental evidence (6) clearly is consistent with the predictions of the BCM theory and the other rules in the homosynaptic class.
It is important to note that these two classes of rules require different underlying molecular and physiological mechanisms. It is equally important that evolving experimental techniques will make it possible to determine the parameters of the theory with increasing precision. This will lead to deep insights into the mechanisms underlying learning and memory storage. In addition, these results may have clinical implications in the treatment of amblyopia (28).
LEARNING RULES
The following learning rules (derived in ref. 10) were explored.
Homosynaptic Class.
Quadratic BCM (25).
8 |
where E[⋅] is approximated by the temporal average:
Skewness 1.
This learning rule is based on the statistical measure of skewness.
9 |
Kurtosis 1.
This learning is based on the statistical measure of kurtosis.
10 |
Heterosynaptic Class.
Skewness 2. This learning rule is based on a different form of the statistical measure of skewness.
11 |
Kurtosis 2.
This learning is based on a different form of the statistical measure of kurtosis.
12 |
Linear and nonlinear principal component analysis (PCA).
This learning rule is a stabilized Hebb rule introduced by Oja (26) for performing PCA. The nonlinear version is a straightforward generalization of the original PCA rule.
13 |
In all of the above learning rules, except for both versions of the PCA rule, the value of the activity of the cell is given by
14 |
where s(⋅) is a nonlinear function of its argument. For linear, PCA the activity is given simply as c = m⋅d. The nonlinear PCA rule uses a polynomial function, such as s(m⋅d)3, which is not biologically plausible but has some nice mathematical properties (27). All of the other rules use the more realistic function
15 |
where s+ and s− set the maximum and minimum activity levels, respectively.
Acknowledgments
This work supported in part by the Charles A. Dana Foundation and the Office of Naval Research.
ABBREVIATIONS
- BCM
Bienenstock, Cooper, and Munro
- MD
monocular deprivation
- BD
binocular deprivation
- PCA
principal component analysis
References
- 1.Mountcastle V B. Daedalus (Boston) 1998;127:1–36. [Google Scholar]
- 2.Stent G. Proc Natl Acad Sci USA. 1973;70:997–1001. doi: 10.1073/pnas.70.4.997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.von der Malsburg C. Kybernetik. 1973;14:85–100. doi: 10.1007/BF00288907. [DOI] [PubMed] [Google Scholar]
- 4.Cabelli R J, Hohn A, Shatz C J. Science. 1995;267:1662–1666. doi: 10.1126/science.7886458. [DOI] [PubMed] [Google Scholar]
- 5.Bienenstock E L, Cooper L N, Munro P W. J Neurosci. 1982;2:32–48. doi: 10.1523/JNEUROSCI.02-01-00032.1982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Rittenhouse C D, Shouval H Z, Paradiso M A, Bear M F. Nature (London) 1999;397:347–350. doi: 10.1038/16922. [DOI] [PubMed] [Google Scholar]
- 7.Clothiaux E E, Cooper L N, Bear M F. J Neurophysiol. 1991;66:1785–1804. doi: 10.1152/jn.1991.66.5.1785. [DOI] [PubMed] [Google Scholar]
- 8.Linsenmeier R, Frishman L J, Jakiela H G, Enroth-Cugell C. Vision Res. 1982;22:1173–1183. doi: 10.1016/0042-6989(82)90082-7. [DOI] [PubMed] [Google Scholar]
- 9.Atick J J, Redlich A. Neural Comput. 1992;4:196–211. [Google Scholar]
- 10.Blais B S, Intrator N, Shouval H, Cooper L N. Neural Computation. 1998;10:1797–1813. doi: 10.1162/089976698300017142. [DOI] [PubMed] [Google Scholar]
- 11.Mioche L, Singer W. J Neurophysiol. 1989;62:185–197. doi: 10.1152/jn.1989.62.1.185. [DOI] [PubMed] [Google Scholar]
- 12.Wiesel T N, Hubel D H. J Neurophysiol. 1963;26:1003–1017. doi: 10.1152/jn.1963.26.6.1003. [DOI] [PubMed] [Google Scholar]
- 13.Wiesel T, Hubel D. J Physiol. 1962;180:106–154. doi: 10.1113/jphysiol.1962.sp006837. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Freeman R, Mallach R, Hartley S. J Neurophysiol. 1981;45:1074–1084. doi: 10.1152/jn.1981.45.6.1074. [DOI] [PubMed] [Google Scholar]
- 15.Shouval H, Intrator N, Law C C, Cooper L N. Neural Comp. 1996;8:1021–1040. doi: 10.1162/neco.1996.8.5.1021. [DOI] [PubMed] [Google Scholar]
- 16.Shouval H, Intrator N, Cooper L N. Vision Res. 1997;37:3339–3342. doi: 10.1016/s0042-6989(97)00087-4. [DOI] [PubMed] [Google Scholar]
- 17.Dudek S M, Bear M F. Proc Natl Acad Sci USA. 1992;89:4363–4367. doi: 10.1073/pnas.89.10.4363. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Kirkwood A, Rioult M G, Bear M F. Nature (London) 1996;381:526–528. doi: 10.1038/381526a0. [DOI] [PubMed] [Google Scholar]
- 19.Mayford M, Wang J, Kandel E, O′Dell T. Cell. 1995;81:1–20. doi: 10.1016/0092-8674(95)90009-8. [DOI] [PubMed] [Google Scholar]
- 20.Law C, Cooper L. Proc Natl Acad Sci USA. 1994;91:7797–7801. doi: 10.1073/pnas.91.16.7797. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Blais B S. Ph.D. thesis. Providence, RI: Brown; 1998. [Google Scholar]
- 22.Miller K D, Keller J, Stryker M P. Science. 1989;240:605–615. doi: 10.1126/science.2762813. [DOI] [PubMed] [Google Scholar]
- 23.Elliott T, Shadbolt N R. Neural Comp. 1998;10:1939–1981. doi: 10.1162/089976698300016927. [DOI] [PubMed] [Google Scholar]
- 24.Miller K D. Neural Comp. 1998;10:529–547. doi: 10.1162/089976698300017647. [DOI] [PubMed] [Google Scholar]
- 25.Intrator N, Cooper L N. Neural Networks. 1992;5:3–17. [Google Scholar]
- 26.Oja E. J Math Biol. 1982;15:267–273. doi: 10.1007/BF00275687. [DOI] [PubMed] [Google Scholar]
- 27.Olshausen B A, Field D J. Nature (London) 1996;381:607–609. doi: 10.1038/381607a0. [DOI] [PubMed] [Google Scholar]
- 28.Jampolsky A. A Trans Am Ophthalmol Soc. 1994;92:349–373. [PMC free article] [PubMed] [Google Scholar]