Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Nov 27.
Published in final edited form as: Cell. 2019 Nov 14;179(6):1382–1392.e10. doi: 10.1016/j.cell.2019.10.020

Continual Learning in a Multi-Layer Network of an Electric Fish

Salomon Z Muller 1,2, Abigail N Zadina 1, LF Abbott 1,3, Nathaniel B Sawtell 1,*
PMCID: PMC6986370  NIHMSID: NIHMS1542038  PMID: 31735497

Summary

Distributing learning across multiple layers has proven extremely powerful in artificial neural networks. However, little is known about how multi-layer learning is implemented in the brain. Here we provide an account of learning across multiple processing layers in the electrosensory lobe (ELL) of mormyrid fish and report how it solves problems well-known from machine learning. Because the ELL operates and learns continuously, it must reconcile learning and signaling functions without switching its mode of operation. We show that this is accomplished through a functional compartmentalization within intermediate layer neurons in which inputs driving learning differentially affect dendritic and axonal spikes. We also find that connectivity based on learning rather than sensory response selectivity assures that plasticity at synapses onto intermediate-layer neurons is matched to the requirements of output neurons. The mechanisms we uncover have relevance to learning in the cerebellum, hippocampus and cerebral cortex, as well as in artificial systems.

Graphical Abstract

graphic file with name nihms-1542038-f0001.jpg

In Brief

Using a cerebellum-like structure in an electric fish as a model system for investigating mechanisms of learning in multi-layer networks, it is observed that functional compartmentalization within individual neurons allows synaptic plasticity at an intermediate processing layer to adaptively shape network output.

Introduction

Work on learning in neural systems has focused largely on the effects of plasticity at synapses that provide direct input to the neurons being studied (Bear and Malenka, 1994; Buonomano and Merzenich, 1998; Caporale and Dan, 2008; Knudsen, 1994). Learning a model of the environment or a complex skill, however, relies on plasticity that is widely distributed and may occur at synapses far from the neurons driving decisions or actions. As is well-known from multi-layer (or 'deep') artificial networks, distributing learning over multiple layers is substantially more powerful but also more difficult to implement than learning at a single layer (LeCun et al., 2015; Marblestone et al., 2016). In this study, we leverage a tractable system, the cerebellum-like circuitry of the mormyrid electrosensory lobe (ELL) (Bell et al., 2008), to identify neural mechanisms that implement multi-layer network learning.

Weakly electric mormyrid fish use passive electrosensing to detect electric fields in their environment, including the minute fields emitted by their prey. This task is made more difficult because mormyrids produce their own much larger electric fields by discharging an electric organ in their tail. Though vital for communication and active electrosensing, the fish’s electric organ discharge (EOD) induces large, long-lasting responses in passive electroreceptors that mask behaviorally relevant signals (Figure 1A) (Bell and Russell, 1978). How do mormyrid fish distinguish self-generated from external sensory stimuli (Sperry, 1950; von Holst and Mittelstaedt, 1950)? Past studies suggest that this problem is solved in the ELL by the convergence of sensory input from electroreceptors on the body surface (Figure 1B,D, red) with corollary discharge signals triggered by the motor command nucleus that generates the EOD (Figure 1B,D, blue). Corollary discharge signals cancel self-generated sensory input from the EOD, allowing behaviorally relevant external signals to be detected and processed more effectively (Enikolopov et al., 2018). Cancellation is a continual dynamic learning process; the ELL can adapt to changes in the EOD signal within minutes and fine-tunes cancellation over a period of an hour or more (Bell, 1981, 1982).

Figure 1. Function and circuitry of the mormyrid ELL.

Figure 1.

(A) Signals (red) conveyed by electroreceptor afferents contain both, behaviorally relevant, prey-related and uninformative, EOD-related components that must be separated through learning in the ELL. (B) Schematic of cancellation of the EOD-evoked component of the sensory input by a negative image constructed from corollary discharge (CD) inputs conveyed by granule cells (blue). (C) Micrograph of the ventrolateral zone (VLZ) of the ELL. Output cells retrogradely labeled after a neurobiotin injection into the midbrain (green). MG cells stained with a parvalbumin antibody (pseudo-color magenta). Scale bar: 100 μM (mol - molecular layer, ga - ganglion layer, pl - plexiform layer, gr - granular layer). (D) Schematic of ELL circuitry. Major inputs to the ELL are sensory input from electroreceptors on the skin (red) and CD inputs (blue) time-locked to the motor command (cmnd) that discharges the electric organ. Electrosensory input is relayed via excitatory and inhibitory interneurons, allowing some MG and output cells to be excited by sensory input (circles) and others to be inhibited (lines). CD inputs are relayed to the apical dendrites of MG and output cells via granule cells (GCs), the axons of which form the parallel fibers of the ELL molecular layer. Circles with arrows represent plastic synapses. Dashed boxes indicate the two distinct processing layers that are the focus of the present study.

To perform cancellation, the ELL must predict the sensory input produced by the EOD, subtract this prediction from the total sensory input, and transmit the difference, e.g. the prey signal, from its output layer (Figure 1B). The EOD cancelling signal, known as a negative image, is constructed from a temporally distributed set of corollary discharge inputs that are conveyed to the ELL through a granule cell-parallel fiber system (similar to that of cerebellum) and modified by anti-Hebbian synaptic plasticity (Bell et al., 1997b; Kennedy et al., 2014; Roberts and Bell, 2000).

Past studies have primarily treated sensory cancellation as cell autonomous, but ELL anatomy suggests a more complex, network-level process. The ELL is composed of two layers (Figure 1C,D) both of which receive electrosensory (red) and corollary discharge (blue) input. The first, the MG layer, consists of GABAergic neurons known as medium ganglion (MG) cells, and the second, the output layer, consists of output cells that project to higher processing stages (Bell et al., 1981). Output cells comprise two distinct classes known as E (excited) and I (inhibited) cells which respond with opposite polarity to sensory input. MG cells inhibit output cells and thus act as an intermediate or 'hidden' processing layer. Both MG and output cells receive the sensory and corollary discharge inputs needed for cancellation but, critically, ~90% of granule cell synapses are made onto MG cells (Bell et al., 2005; Meek et al., 1996). This suggest that the main site of plasticity in the ELL is at the intermediate MG layer.

Despite obvious differences between this biological system and artificial neural networks, processing and learning at an intermediate layer raise a number of questions relevant to machine learning. The first concerns the nature of the processing that occurs at the intermediate layer of the ELL; what signals are the intermediate-layer MG cells conveying to the output layer and how do these signals contribute to ELL function? It has previously been proposed that MG cell output conveys EOD-subtracted information about relevant stimuli such as prey to the output cells (Mohr et al., 2003; Sugawara et al., 1999). Our results do not support this hypothesis. We argue, instead, that MG cells convey an estimate of the negative image output cells need for cancellation. This requires that MG cells simultaneously learn and transmit a negative image, raising a second basic question.

Learning and signaling can be conflicting requirements in networks because the activity needed to drive learning may not match the activity required for signaling. In machine learning, this problem is typically solved by cycling between separate phases of learning and signaling (Rumelhart et al., 1988). Such a solution may not be viable for brain structures like the ELL that must continuously transmit sensory information. In this study, we show that the ELL divides learning and signaling functions into separate neuronal compartments that operate simultaneously, obviating the need for temporal cycling.

The third question we address is related to the infamous credit assignment problem in machine learning: what assures that learning at intermediate layer synapses has a favorable impact on the performance of the output layer? We show that this issue is resolved in the ELL by an organization of synaptic connections between MG and output cells based on what MG cells learn through plasticity.

Results

Examining MG cell output

In studying MG cell responses, it is important to note that MG cells fire two distinct types of action potentials: broad spikes, likely initiated in the apical dendrites, have a high threshold, are emitted at low spontaneous rates (~1 Hz), and drive plasticity and narrow spikes, initiated in the axon, have a low threshold, are emitted at high rates (~50 Hz), and do not induce plasticity (Bell et al., 1997b; Han et al., 2000). The output of MG cells is dominated by narrow spikes because of their high rates. Importantly, MG cells do not receive a single powerful excitatory input that evokes broad spikes (like climbing fiber input to cerebellar Purkinje cells). Instead, in vitro studies indicate that both broad and narrow spikes can be evoked by granule cell and electrosensory inputs (Grant et al., 1998). The signals conveyed by broad and narrow spikes in vivo have not previously been examined in the region of the ELL involved in passive electrosensation.

We used both intracellular and extracellular recordings to measure MG broad and narrow spike responses as well as output cell responses in awake, paralyzed fish (see STAR Methods) (Bell, 1982; Enikolopov et al., 2018). Under these conditions, the fish continue to generate EOD motor commands at 2-5 Hz, but the EOD itself is blocked by the paralytic. This allows responses to corollary discharge inputs to be studied in isolation from electrosensory input (Figure 2A-C, blue). In addition, the electric field normally produced by the electric organ can be replaced, in the paralyzed condition, by a laboratory-generated mimic. Delivery of the mimic can be temporally uncorrelated with the recorded EOD command (Figure 2A-C, red) producing 'unpaired' sensory input, which allows the effects of the sensory input to be measured in isolation from corollary discharge. Alternatively, the mimic can be delivered after a fixed (4.5 ms) delay that matches the normal interval between the motor command and the EOD (Figure 2A-C, magenta). This condition, which produces 'paired' sensory input, is used to induce learning of the negative image (the pairing-induced change in response to corollary discharge input alone) and to study the resulting sensory cancellation (the decrease in response to paired sensory and corollary discharge inputs).

Figure 2. Cancellation of output cell and MG cell broad spike, but not MG cell narrow spike, responses.

Figure 2.

(A-C) Example output, MG broad spike (Bspk) and MG narrow spike (Nspk) firing rate responses after >1 hour of paired sensory input (magenta), as well as responses to unpaired sensory input (red), and to corollary discharge input in the absence of sensory input (CD, blue). Dotted lines indicate SEM. Blue and red triangles indicate the times of the EOD command and of the sensory input, respectively. All recordings in this and subsequent figures were performed in the ventrolateral zone of the ELL which receives input from electroreceptors mediating passive electrosensation. Inset, example Bspk (gray) and Nspk (black) waveforms. Scale: 20 mV, 5 ms. (D) Cancellation of paired sensory input (residual variance < 1) was observed in output (n = 95) and Bspk (n = 49) but not Nspk (n = 26; P < 0.0001) responses. Error bars are SEM. (E) Negative images, indicated by negative correlations between responses to CD and unpaired sensory input, were observed in output (n = 95) and Bspk (n = 48) but not Nspk responses (n = 26; P < 0.001).

In our studies, we refer to the EOD mimic as the sensory input. Because we do not include prey-like electric fields, the sensory input we discuss is entirely predictable on the basis of the EOD command signal and is therefore entirely uninformative to the fish. Thus, we consider a situation in which the ELL attempts to cancel all of its sensory input. It is important to appreciate that, in a natural setting, the mechanisms we analyze would only cancel the predictable self-generated component of the sensory input, leaving intact the unpredictable inputs of interest to the fish. Note that the ELL cannot construct the negative image directly from sensory input because this would cancel the desired (prey) response along with the EOD response. Instead sensory input is used as a teaching signal to construct a prediction of the EOD response based purely on motor corollary discharge.

Consistent with previous results (Bell, 1981, 1982), we found that output cell responses to paired sensory input exhibit cancellation (Figure 2A, magenta), and their responses to corollary discharge without any mimic resemble negative images (Figure 2A, blue) of their responses to unpaired sensory input (Figure 2A, red). We also found that MG cell broad spike responses to paired sensory input are canceled by negative images (Figure 2B) in a manner similar to the cancellation seen in output cells. In contrast, MG narrow spike responses remain strongly modulated by paired sensory input (Figure 2C), even after >1 hour of pairing when most MG cell broad spike and output cell responses show full cancellation (Figure 2D, Figure S1B-E). Consistent with this lack of cancellation, MG narrow spike responses to corollary discharge input alone do not resemble negative images of narrow spike sensory responses (Figure 2C,E). These results conflict with the idea that MG cell narrow spikes convey information about external signals (e.g. prey) to output cells because this would require cancellation. In addition, we found that MG cells have low sensitivity to prey-like sensory stimuli (Figure S2), prompting us to further investigate the nature of the signals carried by MG cell narrow spikes.

Sensory input affects broad and narrow spikes differently

The differences between cancelled broad spike and uncancelled narrow spike responses to paired sensory input could arise because they react differently to sensory input, to corollary discharge input, or to both. We began by examining sensory input. Interneurons convey both excitatory and inhibitory sensory input to the MG layer, as they do to the output layer. However, we found that the classification of MG cells into subtypes is more complex than for output cells. Surprisingly, broad and narrow spike responses to unpaired sensory input often have different polarities (Figure 3A-C, Figure S3). In fact, narrow and broad spike sensory responses are no more correlated in the same MG cell than they are in randomly chosen pairs of different cells (Figure 3C). For reasons that will become apparent, we chose to classify MG cells on the basis of their broad spike responses, defining BS+ (broad spikes excited by sensory input; Figure 3A) and BS− (broad spikes inhibited by sensory input; Figure 3B) subtypes.

Figure 3. Effects of sensory and corollary discharge input on broad and narrow spikes in MG cells.

Figure 3.

(A) Subthreshold membrane potential (Vm; gray), narrow spike (Nspk, black) and broad spike (Bspk, magenta) rates, and broad spike raster for 2 example BS+ MG cells in response to unpaired sensory input (red triangles). (B) Same as in A but for 2 example BS− MG cells. BS± cells are classified according to the polarity of the initial phase of their Bspk response. (C) Left, MG cell Nspk and Vm sensory responses are correlated (Pearson r = 0.72, n = 30). Nspk and Bspk responses recorded in the same cells (middle) are no more correlated than pairs of responses selected randomly from different cells (right; r = 0.12 versus r = 0.11, n = 37, P = 0.52). Red crosses indicate Vm responses and gray circles indicate Nspk responses. Error bars are SEM. (D) Overlaid traces from a BS− cell showing responses to corollary discharge alone (blue triangle), unpaired sensory input (red triangle) and intracellular current injections (bottom traces). The sensory input is excitatory for Nspks (lower panel, left column of traces), but it inhibits the generation of Bspks by current injection (compare two panels, right column of traces). Scale indicates range −50 to −60 mV. Inset (top left): Effect of sensory input on the membrane potential (left points and axis) and on the probability of evoking a Bspk with current injection (right points and axis). Sensory input reduces the probability of evoking Bspks via current injection (n = 7 BS− MG cells). (E) Responses of BS− (left) and BS+ (right) MG cells before, during, and after a 4 min delivery of paired sensory input. (F) Pairing-induced changes in responses to the corollary discharge (CD) are positively correlated between Nspks and Bspks (circles; r = 0.52, n = 38) and between Vm and Bspks (crosses; r = 0.56, n = 31). (G) Changes in Bspk CD responses are negatively correlated with Bspk sensory responses to the mimic (left, r = −0.42, n = 73). Changes in Nspk and Vm CD responses are also negatively correlated with Bspk sensory responses (middle; r = −0.36, n = 38 for Nspks and r = −0.57, n = 28 for Vm). However, changes in Nspk or Vm CD responses are uncorrelated, on average, with their sensory responses (r = −0.003, n = 39 for Nspk and r = −0.06, n = 30 for Vm; P < 0.0001). (H) Schematic of a simplified two-compartment model of an MG cell. Sensory input selectively affects Bspks through local inhibition. Nspks transmit a learned copy (blue) of the predictable, self-generated component of the sensory input (red).

In a subset of intracellular recordings from BS− MG cells, we also examined the effects of sensory input on broad spikes evoked by intracellular current injection. We found cells in which sensory input evoked membrane potential depolarization and increased narrow spike firing even though it dramatically reduced the probability of evoking a broad spike (Figure 3D). This provides further evidence that sensory input can affect broad and narrow spikes differently.

Granule to MG cell plasticity affects broad and narrow spikes similarly

We next examined how learning-induced changes in the corollary discharge input conveyed by granule cells affect broad and narrow spikes. A brief (4 min) pairing period was used so that we could compare responses in the same MG cell before, during, and after learning (Figure 3E). Plastic changes measured both during and after this period are highly correlated between narrow spikes and broad spikes (Figure 3F, Figure S4A,C), consistent with corollary discharge inputs affecting both responses similarly. Because plasticity is controlled by broad spikes, it constructs a negative image of the EOD sensory input affecting broad spikes. Narrow-spikes are affected equally by this plasticity, so the broad-spike negative image is 'inherited' by the narrow spikes (Figure 3G, Figure S4B,D). This inherited negative image does not cancel narrow spike responses (Figure 2D, Figure S4G) due to the lack of correlation between the effects of sensory input on narrow and broad spikes (Figure 3C).

A two-layer, two-compartment model of learning in the ELL

Our results suggest that sensory input to an MG cell affects broad and narrow spikes differently. Past studies in zones of the ELL involved in active electrolocation have described a local inhibition of MG cell proximal apical dendrites that potently and selectively modulates broad spikes (Sawtell et al., 2007). We therefore assume that there are two types of sensory input to MG cells, one, Iboth, that affects both broad and narrow spikes (presumably due to synapses onto basal dendrites), and another, IBS, that potently affects broad spikes only (presumably due to synapses onto proximal apical dendrites). In addition, as shown above, broad and narrow spikes are affected equally by input, Igran, from granule cells. Thus, the total broad spike input is Iboth + IBS + Igran, and narrow spikes, being unaffected by IBS, are driven by Iboth + Igran. We assume that IBS is larger in magnitude than Iboth.

Anti-Hebbian plasticity at granule cell synapses onto MG cells forces broad spikes rates to a constant value, such that after learning broad spikes are unmodulated by sensory input (Roberts and Bell, 2000). This implies that, after learning, the current driving broad spikes, Iboth + IBS + Igran, is equal to a constant value C. As a result, the current driving narrow spikes, Iboth + Igran, is equal to C - IBS. The first term in this expression generates tonic firing and the second modulates narrow spikes in a pattern that is a negative image of the broad-spike sensory input. Using a linear approximation for fluctuations in the narrow spike rate around its high baseline, the narrow spike modulation by learned granule cell input matches the shape and timing of EOD sensory input, making it a negative image that would be suitable for helping output cells cancel their sensory input.

Our analysis suggest that MG cells have two functional compartments: one that generates broad spikes and drives learning, and the other that generates narrow spikes and transmits a negative image to other neurons (Figure 3H). This further suggests that the cancellation of self-generated sensory input in the ELL is a two-layer, target-based computation. The sensory input to broad spikes provides a target function, and broad spike mediated plasticity causes granule cell input to produce a negative image of this target that is transmitted to output cells by MG cell narrow spikes.

Because output cells come in E and I varieties, MG cells must generate and transmit two different types of negative images to the output layer. Taking into account that MG cells inhibit output cells, BS+ cells transmit negative images appropriate for output I cells and BS− cell output is appropriate for output E cells (Figure 4A). Ten to twenty MG cells converge onto each output cell (Bell et al., 2005), which may allow imperfections in the negative images carried by individual MG cells to be averaged out. This two-compartment, two-layer model makes two key predictions: 1) Summed narrow spike responses from BS− MG cells must match the timing and shape of paired sensory input to output E cells, and narrow spike responses summed across BS+ MG cells should similarly match sensory input to output I cells. 2) The system must be wired so that BS− MG cells inhibit output E cells and BS+ MG cells inhibit output I cells. If true, these two predictions assure that learned MG narrow-spike-mediated inhibition is of the appropriate sign and shape to cancel unwanted sensory input to ELL output cells.

Figure 4. Grouping MG cells according to broad spike sensory response polarity reveals narrow spike signals appropriate for output cell cancellation.

Figure 4.

(A) Hypothesized two-layer model of output cell cancellation. Negative images (dashed lines) appropriate for cancelling self-generated sensory input to output cells (solid lines) are transmitted from BS− MG cells (green) to E output cells and from BS+ MG cells (magenta) to I output cells. Arrows indicate that MG cells transmit learned granule cell (as opposed to sensory) responses. (B) Pooled change in narrow spike (Nspk) corollary discharge responses induced by 4 minutes (top, BS+, n = 15; BS−, n = 9) or >1 hour (bottom, BS+, n = 5; BS−, n = 14) of paired sensory input. Traces are averages across MG cells pooled according to the polarity of their broad spike sensory responses (BS+ or BS−). Dashed gray lines in B and C show the polarity and temporal profile of average sensory input recorded intracellularly in E and I output cells (arbitrary scale). (C) Pooled change in paired Nspk responses (EOD mimic + command) after 4 minutes (top, BS+, n = 13; BS−, n = 9) or >1 hour (bottom, BS+, n = 5; BS−, n = 14) of paired sensory input.

Physiological and anatomical evidence for the two-layer model

To test the first model prediction, we summed narrow spike output from recorded MG cells, either BS− and BS+, and compared the result to the subthreshold responses to unpaired sensory input in the appropriate type of output cell, either E or I. Learning-induced changes in narrow-spike responses pooled across BS− cell have a polarity and temporal profile that matches the sensory response of E cells. Similarly, response changes pooled across BS+ cells match the sensory response of I cells (Figure 4B). This indicates that learning in each type of MG cell is appropriate for the requirements of the corresponding output cells. Importantly, the narrow-spike outputs of BS− and BS+ cells to paired sensory input also matched negative images appropriate for E and I output cells (Figure 4C; Figure S5A,B). These results indicate that MG cells transmit appropriate cancellation signals to output cells provided that the connectivity we have assumed is correct. We therefore tested this essential prediction.

E and I output cells are located in different layers of the ELL, and MG cell synapses target their somata (Grant et al., 1996; Meek et al., 1996). These features allowed us to check connectivity by performing morphological reconstructions of physiologically identified BS+ and BS− MG cells recorded in vivo and filled with biocytin (Figure 5A,B, Figure S6A,B). Consistent with the two-layer model, we found that the axonal arbors of BS+ cells are largely restricted to the ganglion layer where output I cell somata are located, while the axonal arbors of BS− MG cells are mainly found below the ganglion layer where output E cell somata are located (Figure 5C). These results are consistent with past studies demonstrating two anatomical classes of MG cells (Han et al., 1999; Mohr et al., 2003), but are novel in showing that these classes correspond functionally to MG cells with different polarities of broad spike sensory input. Cells grouped by the polarity or strength of narrow spike sensory responses do not exhibit differences in axon location (Figure 5D,E, Figure S6C,D). Thus, as required by the two-layer model, ELL connectivity is organized on the basis of MG broad spike sensory responses or, by extension, on the basis of what the MG cells 'learn' through broad spike evoked plasticity.

Figure 5. Laminar distribution of MG cell axons reflects broad spike sensory response polarity and is appropriate for sensory cancellation in output cells.

Figure 5.

(A) Broad spike (Bspk) sensory responses used to classify BS+ (n = 5) and BS− (n = 9) MG cells that were then morphologically reconstructed. Due to the low spontaneous firing rate of Bspks, reductions in current-evoked Bspk firing relative to baseline were used to classify some BS− cells. Bspk responses of some cells are truncated for clarity. (B) Camera lucida reconstructions of physiologically identified BS+ (left, n = 5) and BS− cells (right, n = 9). Axon is drawn in red and ELL layer boundaries are indicated by dashed lines (mol - molecular layer, ga - ganglion layer, pl - plexiform layer, gr - granular layer). Inset, photomicrograph of a reconstructed BS+ cell. Red and black arrows indicate axonal and basal dendritic processes, respectively. (C) Axon distribution across layers is different for BS+ and BS− cells (P < 0.0001, Skillings-Mack test). Arrows show laminar locations of the somata of E and I output cells. (D-E) Unsupervised clustering of MG cells based on the laminar distribution of their axon areas by using k-means clustering with k=2 (distance to centroid (circle with cross) was measured by squared Euclidean). The average silhouette value (STAR Methods) of the cluster denoted by triangles is 0.9038 and average silhouette value of the cluster denoted by squares is 0.6147. For visualization purposes we show values for 3 layers. (D) There is a complete overlap of cluster assignment with electrophysiologically defined Bspk response classes (BS+ in magenta and BS− in green). (E) In contrast, there is no clear relationship between cluster assignment and narrow spike (Nspk) sensory responses.

A biophysical model of MG cell function

Our model relies on an assumption that MG cell broad spikes are affected by an input (which we called IBS) that has no appreciable impact on narrow spikes. We constructed a biophysical model to see if such an input is plausible. Moreover, we wanted to test whether two types of sensory input can produce opposite responses, as we observe in the data, and whether granule cell input to the apical dendrite can propagate to affect narrow spikes despite the local dendritic inhibition that suppresses broad spikes. We built a conductance-based multi-compartment model with morphology taken from a Neurolucida reconstruction of a biocytin-filled MG cell (see STAR Methods) (Figure 6A). Voltage-gated sodium and potassium channels in axonal and apical dendritic compartments of the model cell generated dendritic and axonal spikes resembling broad and narrow spikes in MG cells (broad and narrow spikes are both blocked by tetrodotoxin and hence considered sodium based) (Engelmann et al., 2008; Roberts and Leen, 2010). The densities of dendritic and axonal voltage-gated channels were adjusted to yield ~10 mV difference in threshold between broad and narrow spikes (Engelmann et al., 2008; Grant et al., 1998; Sugawara et al., 1999). Baseline excitatory and inhibitory synaptic inputs were adjusted to evoke broad and narrow spikes at rates consistent with those observed in vivo.

Figure 6. Biophysical basis for compartmentalization of learning and signaling in MG cells.

Figure 6.

(A) Left, Neurolucida reconstruction of the BS+ MG cell used to construct a multi-compartment model. Distal apical dendrites are shown in blue, proximal apical dendrites in orange, basal dendrites in green, and a partial reconstruction of the axon in red. Right, sample voltage traces from a recorded MG cell compared to the model. Arrows indicate broad spikes (Bspks). Scale: 10 mV, 20 ms. (B-D) Model Bspk (magenta) and narrow spike (Nspk) (black) rates evoked by different combinations of excitatory (circles) and inhibitory (lines) input onto the compartments indicated in the schematics. Gray patch in B and D indicate mean (black line) and standard deviation of inhibitory onset timing across proximal apical compartments. Gray patch in C indicates when tonic inhibitory input was removed. (B) Model BS− cell with opposite Bspk and Nspk responses. (C) Model BS+ cell with opposite Bspk and Nspk responses. (D) (top) Bspks, but not Nspks, are inhibited by local dendritic inhibition (IBS). (bottom) Granule cell input accompanied by dendritic inhibition cancels Bspk modulation but drives Nspk modulation, reflecting the negative image.

Sensory input was delivered to the model MG cell through excitatory inputs to basilar dendrites and inhibitory inputs to the proximal apical dendrites. Under these conditions, sensory input can robustly increase narrow spike firing while decreasing broad spike firing, consistent with experimental results from BS− cells (Figure 6B). An increase in broad spiking together with a decrease in narrow spikes, as in BS+ cells, can also be obtained if sensory input inhibits the basilar dendrites and diminishes inhibition in the proximal dendrites (Figure 6C). Thus, broad- and narrow-spike responses of opposite polarity are easily obtained.

We next explored the effect of granule-cell input in the model. Granule-cell excitation was adjusted to cancel the inhibitory effect of sensory input on the broad spike rate, simulating the effects of plasticity in a BS− cell. Under these conditions, narrow spikes are strongly modulated by granule cell excitation despite inhibitory input to the proximal dendrites, consistent with narrow spikes transmitting a negative image (Figure 6D). The selective effects of inhibition on broad spikes in the model are due, in part, to their higher threshold. Inhibition has a stronger effect at the higher broad spike threshold than at the lower narrow spike threshold due to its greater difference from the inhibitory reversal potential. These modeling results show that standard neuronal biophysics can support the compartmentalization of learning and signaling in MG cells.

Two sites of learning in the ELL

In our two-layer model, MG cells learn to extract a negative image from the large number of granule cell inputs they receive (~20,000), and transmit it to output cells. Granule cells also synapse directly onto output cells, although in smaller numbers (~5,000). In vivo studies suggest that these synapses are also plastic (Bell et al., 1997a) and, although this plasticity has not been characterized in vitro, it appears to operate in a manner similar to the broad spike evoked plasticity in MG cells, except that it is driven by conventional spikes (output cells do not fire broad spikes). Thus, the negative images that cancel paired sensory input in output cells appear to come from two sources: MG cells and granule cells.

To isolate each of these contributions to the negative image in output cells, we performed manipulations that prevented plasticity from occurring either in the output cell being recorded (isolating the MG contribution to the negative image) or in the MG cells (isolating the direct granule cell contribution). These two conditions were compared with normal command-mimic pairings in the same cells (all pairing were 4 minutes in duration). In the first condition, intracellular current injections were used to counteract the effects of the paired sensory input on the output cell being recorded, thereby preventing anti-Hebbian plasticity from acting at its granule-cell synapses (Figure 7A, magenta). Because this manipulation only affected the output cell being recorded, plasticity acted normally in MG cells. For both E and I output cells, a substantial fraction of the negative image remained intact, suggesting that much of the negative image in output cells is transmitted from the MG layer, at least in the context of the brief pairings tested here (Figure 7B,C). In the second condition, we replaced the paired sensory input with a command-paired intracellular current injection into the recorded output cell. Because the MG cells received no sensory input in this case, plasticity was restricted to the recorded output cell (Bell et al., 1997a; Bell et al., 1993). This manipulation, in which the MG contribution to the negative image is absent, caused a significant reduction in the magnitude of the negative image in output cells compared to controls (Figure 7D; Figure S5C). Both results indicate that a substantial fraction of the negative image in output cells is relayed from MG cells rather than being generated by direct granule cell input. This suggest that MG cells provide a 'generic' negative image to output cells, while plasticity at granule cell to output cell synapses 'fine-tunes' the negative image, cell by cell.

Figure 7. Isolating single-cell and network contributions to negative images in output cells.

Figure 7.

(A) Rasters from an I output cell illustrating responses during control (black) and experimental pairing conditions (magenta, blue). Timing of current injections are indicated by traces beneath the rasters. Bottom trace in the middle panel shows overlaid traces of the output cell membrane potential recorded during pairing. Scale bar: 25 mV. (B) Average traces from output I cells (n = 7) showing responses during pairing (middle) and the resulting changes in corollary discharge (CD) responses (negative images; bottom). Black indicates control pairing and magenta indicates pairing in the same cell but with depolarizing current used to counteract effects of the paired sensory input. Dashed lines indicate SEM. The magnitude of negative images was not different in the two conditions (P = 0.66). (C) Average traces from output E cells (n = 14) in which same manipulation and control as in B were performed (top). The magnitude of negative images was again not different in the two conditions (P = 0.32). (D) Average traces from experiments (n = 7) in which control pairings of commands with an inhibitory sensory input were compared to pairings with hyperpolarizing current injections that evoked similar responses. Negative images were significantly larger under control conditions (P < 0.001). Pairings with depolarizing current were not performed because of the difficulty of matching current- and sensory-evoked spiking patterns.

Discussion

This study leverages a tractable model system, the mormyrid ELL, to address the general question of how synaptic plasticity at an intermediate layer of a network supports behaviorally relevant computation at the output layer. We show that intermediate layer plasticity sculpts corollary discharge responses that aid in the cancellation of unwanted self-generated sensory input at the critical output stage of the ELL network. Accomplishing this requires that intermediate layer MG cells solve two problems, both of which are broadly relevant to multilayer learning in biological and artificial systems. First, we demonstrated that MG cells compartmentalize two functions: learning via dendritic broad spikes and transmitting cancellation signals via axonal narrow spikes. Without such separation, corollary discharge responses would cancel both narrow- as well as broad-spike responses, preventing MG cells from transmitting a cancellation signal to output cells. Second, we provide anatomical and electrophysiological evidence that opposite polarity signals learned by two distinct classes of MG cells are appropriately routed such that they contribute to cancellation in two opponent classes of output neurons. This implies that the ELL solves a version of the error credit assignment problem by organizing the connectivity between intermediate and output layer neurons on the basis of learning.

Mechanisms for compartmentalization of function and credit assignment in MG cells

MG cells exhibit a striking separation of activity related to learning (broad spikes) and signaling (narrow spikes) despite lacking an obvious anatomical specialization for doing so, such as the climbing fiber input to Purkinje cells, or an electrotonically remote site of dendritic spike initiation, as exists in cortical pyramidal neurons (Larkum et al., 1999; Schiller et al., 1997). Our compartmental model showed that the separation of learning and signaling observed in MG cells does not require highly specialized mechanisms or fine-tuning of biophysical parameters. In addition to the high sensitivity of broad spikes to inhibition due to the distance of their threshold from the inhibitory reversal potential, several other factors likely contribute to this separation, including inhibitory inputs spatially localized to the site of broad spike initiation and sensitivity of broad spike to suppression arising from the spatially and temporally distributed dynamics of broad (versus narrow) spike initiation (Engelmann et al., 2008). In vitro experiments are needed to reveal the relative importance and interplay between these factors.

We have argued that MG cell activity is driven by two classes of inputs, one that affects both broad and narrow spikes (Iboth) and one that affects only broad spikes (IBS). The second of these inputs, IBS, drives learning and is the source of the negative image transmitted by MG cell narrow spikes. The shared input, Iboth, is a curious feature according to our current understanding. After learning, Iboth is cancelled by granule cell input and thus has no effect on either form of spiking. Its functional role, if any, is hence unclear.

Our proposed solution to the credit problem in the ELL involves two functionally separate classes of MG cells defined by opposite responses to the sensory input that drives learning. How do these arise? It is known from past work that input from electroreceptor afferents on the skin is relayed to MG cells via a heterogeneous layer of small cells comprising distinct GABAergic and glutamatergic subpopulations and projecting to different layers within the ELL, including the ventral molecular layer where broad spikes are likely initiated (Bell et al., 2005; Hollmann et al., 2016; Zhang et al., 2007). The ventral molecular layer also contains GABAergic interneurons and excitatory feedback projections from the preeminential nucleus (Meek et al., 1999). Further delineating these circuits, for example by using connectomics, could provide an anatomical basis for the opposite and selective effects of sensory input on broad spikes in BS+ and BS− cells.

Learning in MG cells without output error backpropagation

Most multilayer error-correcting systems require some form of output-error feedback to guide learning. For example, output of the cerebellum that is correlated with errors has long been hypothesized to produce activity in the inferior olive, generating climbing fiber input to intermediate-layer Purkinje cells. This, in turn, drives complex-spike-mediated plasticity that is error correcting. In artificial networks, output errors are backpropagated to intermediate layers to control learning. In contrast, learning at the intermediate layer of the ELL, appears not to require feedback.

'Error', in the ELL, corresponds to unwanted output neuron responses to the EOD. We have shown that MG cells provide a major contribution to error reduction at the output layer through the negative images they transmit. The error signal that drives learning in MG cells does not arise from feedback; there is no analog of the climbing fiber system in the ELL. Instead, MG cells generate their own error signal, internally, in the form of broad spikes that drive error-correcting plasticity. This avoids the need for error backpropagation, but it introduces a potential problem.

We have shown that the signals carried by the narrow-spike outputs of BS+ and BS− MG cells are of the right shape to provide negative images to output I and E cells, respectively. However, in the absence of an error signal from the output of the ELL, there is no guarantee that these transmitted negative images will be of the right amplitude to minimize the output response to the EOD, which is the output error. Nevertheless, our data strongly indicate that negative images conveyed by MG cells are, at least approximately, of the required strength. After 4 minutes of pairing, the putative MG cell component of the output cell negative image that we measured was ~1 mV (Figure 7) compared to an average 1.5 mV response evoked by sensory input prior to cancellation (Figure S1F). Furthermore, an increase in the amplitude of sensory responses due to pairing was extremely rare in output cell responses (Figure S1B,D), arguing that cancellation signals are not too large. Plasticity at granule cell to output cell synapses could correct for moderately sized mismatches between the amplitude of MG inhibition and the required negative image. Alternatively, or in addition, there may be other active mechanisms for matching the scale of MG inhibition. Feedback connections from the preeminential nucleus to the ventral molecular layer of the ELL or plasticity (as of yet undescribed) at the synapses between MG cells and output cells are two possibilities for implementing such adjustments. Thus, although error correction at the MG layer does not directly rely on feedback, there may be some feedback-driven regulation of the strength of MG input onto the output layer. Whether or not such mechanisms exist, the absence of error backpropagation, either through a climbing fiber or any other mechanism, is an interesting feature given the difficulty of implementing more general backpropagation schemes in biological networks.

Broader implications for network learning

The realization that negative images in the ELL are computed by both MG and output cells raises an interesting question: what are the advantages of having plasticity at two circuit layers rather than one? Definitive answers to this question will require addition work, but two possible advantages are suggested by features of the ELL. First, plasticity at granule cell synapses onto MG and output cells may operate at different rates, with MG plasticity providing a rapid source of approximate cancelation and output cell plasticity serving to fine-tune the cancelation over longer timescales. Second, estimates of the maximum amplitude of granule cell input onto output cells suggest that output cells could be stretched beyond the dynamic range of their plasticity if they were required to produce negative images without a contribution from MG cells.

Our results illuminate a number of key issues relevant to multi-layer computation and learning beyond the ELL. First, the ELL reconciles learning with processing through a compartmental separation of learning and signaling functions in MG cells. Related two-compartment neuronal models have been proposed on theoretical grounds as biologically plausible implementations of machine-learning schemes in cerebral cortical circuits (Guerguiev et al., 2017; Kording and Konig, 2001; Marblestone et al., 2016; Schiess et al., 2016; Urbanczik and Senn, 2014). Hippocampal and neocortical neurons are increasingly implicated in complex and diverse forms of learning, including learning based on error or reward (Bittner et al., 2015; Hangya et al., 2015; Lacefield et al., 2019 ; Larkum et al., 2009). Our results may be relevant to understanding the plasticity that drives such learning.

Second, ELL wiring is organized on the basis of the effect of learning on MG cells, not according to their conventionally defined (narrow spike) response selectivities. This scheme ensures that plastic changes at synapses onto MG cells (at an intermediate layer of the ELL) are in the appropriate direction to aid cancellation at the output layer and, thus, represents a solution to the credit assignment problem in this circuit. More complex schemes are likely required to solve 'deep' credit assignment problems, such as those that may exist in cerebral cortical networks. Nevertheless, the 'hard-wired' solution we describe may be sufficient for circuits like the cerebellum, where similar wiring-based solutions have been proposed (Apps and Garwicz, 2005; Herzfeld et al., 2015; Oscarsson, 1979).

Finally, it is notable that learning in the ELL appears to operate continuously without any mechanism for gating it on and off. This may be possible because anti-Hebbian learning assures stability. Like the ELL, hippocampal and cortical circuits may use internally-generated error signals, such as dendritic plateau potentials, but these appear to drive Hebbian forms of plasticity. The complexity of inhibition in these circuits, much of it directed toward dendrites, may be required to gate learning (Basu et al., 2016; Lovett-Barron et al., 2014; Wilmes et al., 2016), assuring circuit stability despite the presence of unstable Hebbian forms of plasticity.

STAR Methods

LEAD CONTACT AND MATERIALS AVAILABILITY

Data and data analysis code will be available at http://dx.doi.Org/10.17632/4pxbpc6d5c.1 Morphology of the MG cell as well as code of biophysical model will be available at http://modeldb.yale.edu/259261

Further information and requests for data should be directed to and will be fulfilled by the Lead Contact, Nate Sawtell (ns2635@columbia.edu).

EXPERIMENTAL MODEL AND SUBJECT DETAILS

Male and female Mormyrid fish (7-12 cm in length) of the species Gnathonemus petersii were used in these experiments. Fish were housed in 60 gallon tanks in groups of 5-20. Water conductivity was maintained between 40-65 microsiemens both in the fish’s home tanks and during experiments. All experiments performed in this study adhere to the American Physiological Society’s Guiding Principles in the Care and Use of Animals and were approved by the Institutional Animal Care and Use Committee of Columbia University.

For surgery to expose the brain for recording, fish were anesthetized (MS:222, 1:25,000) and held against a foam pad. Skin on the dorsal surface of the head was removed and a long-lasting local anesthetic (0.75% Bupivacaine) was applied to the wound margins. A plastic rod was cemented to the anterior portion of the skull to secure the head. The posterior portion of the skull overlying the ELL was removed. In a subset of experiments, the valvula cerebelli was reflected laterally to expose the eminentia granularis posterior (EGp) and the molecular layer of the ELL, facilitating whole-cell recordings. Gallamine triethiodide (Flaxedil) was given at the end of the surgery (~20 μg/cm of body length) and the anesthetic was removed. Aerated water was passed over the fish’s gills for respiration. Paralysis blocks the effect of electromotoneurons on the electric organ, preventing the EOD, but the motor command signal that would normally elicit an EOD continues to be emitted at a rate of 2 to 5 Hz.

METHOD DETAILS

Electrophysiology

The EOD motor command signal was recorded with a Ag-AgCl electrode placed over the electric organ. The command signal is the synchronized volley of electromotoneurons that would normally elicit an EOD in the absence of neuromuscular blockade. The command signal lasts about 3 ms and consists of a small negative wave followed by three larger biphasic waves. Onset of EOD command was defined as the negative peak of the first large biphasic wave in the command signal. For pairing experiments, the EOD mimic was presented 4.5 ms following EOD command onset. Recordings were started ~1 hour after paralysis.

Extracellular single-unit recordings were made using glass microelectrodes (2-10 MΩ) filled with 2M NaCl. The location of the ventrolateral zone (VLZ) was established using characteristic field potentials evoked by the EOD command. The precise location of the recording pipette with respect to the VLZ somatotopic map was subsequently determined by finding the skin region for which low-frequency electrosensory stimulation delivered via a local dipole electrode evoked multi-unit responses. Ampullary electroreceptor afferents, E cells and I cells are located in different layers of ELL and have distinctive electrophysiological characteristics (Bell, 1981, 1982; Enikolopov et al., 2018). Ampullary afferents terminate in the deep layers of the ELL, exhibit highly regular spontaneous firing at ~50 Hz, and increase firing rate in response to an electrosensory stimulus that makes the pore of the receptor positive with respect to the basal face within the body. E cells are located in the plexiform layer and I cell in the ganglion layer. Spontaneous firing in E and I output cells is much more irregular and lower rate (~15 Hz) than in afferents. E cells are excited by the same stimulus polarity as ampullary afferents, while I cells are excited by the opposite polarity. Previous studies using intracellular recording and biocytin labeling and antidromic stimulation from the midbrain have shown that E and I cells correspond to two morphologically distinct types of ELL efferent cells known as large fusiform and large ganglion cells (Bell et al., 1997; Bell et al., 1993). In addition to efferent cells, the other major large cells of the VLZ are the MG cells, also located in the ganglion layer. As in previous studies, recordings from MG cells were identified based on the presence of two distinct spike types (broad and narrow spikes) differing in both their waveform and frequency of occurrence (Bell et al., 1997; Bell et al., 1993; Mohr et al., 2003; Sawtell et al., 2007). MG broad spikes with characteristics identical to those described above, could also be recorded in the molecular layer in the absence of narrow spikes, consistent with their separate initiation sites and the failure of narrow spikes to actively propagate to the soma or apical dendrites of MG cells (Engelmann et al., 2008; Grant et al., 1998).

For in vivo whole-cell recordings electrodes (8-15 MΩ) were filled with an internal solution containing, in mM: K-gluconate (122); KCl (7); HEPES (10); Na2GTP (0.4); MgATP (4); EGTA (0.5), and 0.5-1% biocytin (pH 7.2, 280-290 mOsm). No correction was made for liquid junction potentials. Only cells with stable membrane potentials more hyperpolarized than −45 mV and spike amplitudes >40 mV were analyzed. Membrane potentials were recorded and filtered at 3-10 kHz (Axoclamp 2B amplifier, Axon Instruments) and digitized at 20 kHz (CED micro1401 hardware and Spike2 software; Cambridge Electronics Design, Cambridge, UK).

Electrosensory stimulation

The EOD mimic was a 0.2 ms duration square pulse delivered between an electrode in the stomach and another positioned near the electric organ in the tail. The amplitude was 25 μA at the output of the stimulus isolation unit (stomach electrode negative). Recordings from ampullary afferents showed that firing rate modulations evoked by this mimic are within the range of those induced by the fish’s natural EOD (Bell and Russell, 1978). We use the terms sensory input or sensory response to refer to the effect of the mimicked electric field on the ELL. Because we do not include prey-like electric fields (except for in Figure S2) the sensory input we discuss is entirely predictable on the basis of the EOD command signal and is therefore entirely uninformative to and 'unwanted' by the fish. Thus, we consider a situation where the ELL attempts to cancel all of its sensory input. It is important to appreciate that, in a natural setting, the mechanisms we analyze would only cancel the predictable self-generated component of the sensory input, leaving the unpredictable inputs of interest to the fish intact. For the experiments in Figure S2 low-frequency stimuli (white noise to which a 5-20 Hz band-pass Butterworth filter was applied) was delivered between electrodes attached to the edges of the recording chamber in front of and behind the fish on the side ipsilateral to the recording. For all of the cells included in the analysis of prey-like responses we confirmed the presence of typical responses to the EOD mimic. Stimulus amplitudes were chosen such that electroreceptor afferents were driven through their entire dynamic range.

Intracellular pairing experiments (Figure 7)

Effects of pairing on corollary discharge responses of output neurons were evaluated based on averages of 30 seconds of membrane potential data taken immediately before and after a 4 minute pairing period. Spikes were removed prior to averaging using a median filter. Recordings in which spike height changed by >10% over the course of the pairing or in which apparent access resistance changed abruptly were excluded from the analysis. The amplitude and timing of current injection needed to counteract the effects of the EOD mimic was adjusted during the experiment based on online spike rasters. When recording time allowed, several different current strengths were tested sequentially. In such cases, we selected for analysis the case in which the current injection most effectively counteracted the sensory response or averaged the results in cases in which sensory responses were equivalent. Data was only analyzed for cells in which at least one control and one experimental pairing were obtained. As in previous studies of the ELL, we observed that washout of plasticity is rapid (occurring over roughly the same time course as the pairing) and no evidence for order effects, such as savings, was observed. Nevertheless, to minimize possible order effects, control and experimental pairing were performed in pseudo-random order across cells. Paired responses shown in the middle panel of Figure 7B-D were computed by subtracting the pre pairing response.

Histology and morphological reconstructions

After recording, fish were deeply anesthetized with a concentrated solution of MS:222 (1:10,000) and perfused through the heart with a teleost Ringer solution followed by a fixative, consisting of 4% paraformaldehyde and 0.2% glutaraldehyde in 0.1 M phosphate buffer. The brains were postfixed for 12-24 hours, cryoprotected with 30% sucrose, and sectioned at 50 μM on a cryostat. Sections were subsequently processed with the nickel-intensified avidin-biotin-peroxidase method (Vectastain Elite ABC kit) to reveal the biocytin filled cells, dehydrated, cleared in xylenes, and counterstained with neutral red to visualize the layers of ELL. Morphologically recovered neurons were inspected and reconstructed using a 100x oil immersion objective and a camera lucida system. Only well-filled cells in which fine processes, including apical dendritic spines, were clearly visible were selected for reconstruction. As described previously in a combined light and electron microscopy study (Meek et al., 1996), MG axons can be distinguished from basal dendrites based on their initial lateral direction from the soma and looping course. Axons also were generally thinner than basal dendrites and exhibited numerous swellings along their length suggestive of terminal boutons. Processes were only considered to be axon (as opposed to basal dendrite) if they could be traced continuously back to their origination at the soma. The laminar distribution of axon was quantified as pixel area using a custom routine written in FIJI. Laminar boundaries were determined by superimposing photomicrographs of the neutral red counterstained section containing the soma of the labeled neuron onto the digitized camera lucida drawing.

Biophysical model

The morphology of the model MG cell was taken from a reconstruction of a biocytin-stained BS+ cell made using Neurolucida software. Care was taken to accurately represent the thickness of dendritic processes. No attempt was made to reconstruct dendritic spines. The reconstructed cell consists of 78 compartments, and further divided to 230 segments. Simulation of cell activity was done using the simulation software NEURON (Carnevale and Hines, 2006), along with a Python 3 wrapper. Voltage gated Na+ and K+ channels inserted in the apical dendrites and axon are Hodgkin-Huxley type channels. Temperature was set to 20° Celsius. To broaden the dendritic spikes, we added L-type calcium channels (Jaffe et al., 1994), consistent with experimental evidence (Engelmann et al., 2008; Sugawara et al., 1999). Narrow spikes were attenuated at the soma due to passive decay from their site of initiation at the thin axon initial segment. Next, we adjusted the conductance of these channels as described in the following table to achieve higher spike threshold for broad spikes. We matched distance between narrow and broad thresholds to values measured experimentally (Sugawara et al., 1999; Engelmann et al., 2008).

gl
(S/cm2)
leakage
reversal
potential
(mV)
axial
resistance
(Ωcm)
capacitance
(μF/cm2)
g¯Na
(S/cm2)
g¯K
(S/cm2)
g¯Ca
(S/cm2)
axon 0.0003 −70 100 1 4 0.5 0
apical 0.0003 −65 100 1 0.1 0.008 0.073
rest 0.0003 −65 100 1 0 0 0

In the table, ‘rest’ includes the soma, the somatic-connected axonal and apical compartments and all basal dendrite compartments. gl is leakage conductance. g¯Na, g¯K, and g¯Ca are the maximal conductances of the sodium, potassium and L-type calcium channels, respectively.

To drive activity in the model cell, we inserted AMPA and GABAA receptor channels (Destexhe et al., 1994). Reversal potential of the AMPA and GABAA channel are 0 mV and −65 mV, respectively. Each reported simulation was run for 100 seconds and averaged. Broad and narrow spike rate traces were smoothed using a Gaussian filter with 8 ms and 5 ms standard deviation, respectively. Proximal compartments were defined as those whose center is within 100 μm of the center of the soma. For the rest of the section, units of time are in ms and units of maximal conductance, g¯, are in μS.

Simulation of BS− cell (Figure 6B,D).

To achieve baseline firing of ~60 Hz for narrow spikes, and firing of ~3.5 Hz for broad spikes (we set firing of broad spikes ~ 2 Hz higher than observed in data in order to visualize inhibitory effects clearly) we activated AMPA channels in a compartment in the basal dendrites with timing onset N(ti,62) where ti = 18 · i; g¯ = 0.004 + 0.005 · Poisson(0.074). To simulate inhibitory inputs (Figure 6D top), we activated a GABAA channel in each proximal compartment with timing onset N(ti,72) where ti = 200 · i + 97; g¯ = 0.0027 To simulate cancellation of inhibitory input by parallel fiber input (Figure 6D bottom), we added activation of an AMPA channel in each apical dendrite with timing onset N(ti,32) , where ti = 200 · i + 97.3; g¯ = 0.00023. To simulate opposite responses to sensory inputs (Figure 6B) we activated 1) a GABAA channel in each proximal compartment with timing onset N(ti,22) , where ti = 200 · i + 97; g¯ = 0.2 and 2) AMPA channel in a compartment in the basal dendrites with timing onset N(ti,42) , where ti = 200 · i + 100; g¯ = 0.021.

Simulation of BS+ cell (Figure 6C).

Baseline firing in BS+ was achieved by inserting 3 inputs; 1) AMPA input in a basal dendrite compartment with timing onset N(ti,102) , where ti = 36 · i + 10; g¯ = 0.004 + 0.0053 · Poisson(0.3). 2) AMPA input in an apical dendrite compartment (the apical compartment connected to the soma) with timing onset ~ N(ti,102) , where ti = 36 · i + 28; g¯ = 0.002 + 0.0055 · Poisson(0.3). And 3) GABAA input in each proximal apical dendrite compartment with timing onset ~ N(ti,52) , where ti = 10 · i + 10; g¯N(0.009,0.0012). This tonic inhibition only affected broad spike rate (not shown). To simulate opposite responses to sensory inputs we removed the tonic inhibition from ti – 10 to ti + 10 where ti = 200 · i + 100, and added GABAA input in a basal dendrite compartment with timing onset ~ N(ti,72), where ti = 200 · i + 97; g¯ = 0.1.

QUANTIFICATION AND STATISTICAL ANALYSIS

Software used and general statistical methods

Data were analyzed off-line using Spike2 (Cambridge Electronic Design) and Matlab (MathWorks) software. Biophysical model was simulated using NEURON module in Python 3 and analyzed using Python 3. No statistical methods were used to predetermine sample size. The experimenters were not blinded to the condition during data collection or analysis. Nonparametric tests were used for testing statistical significance. Unless otherwise indicated, we used the two-sided Wilcoxon rank sum test for unpaired samples and the Wilcoxon signed ranks test for paired samples. Differences were considered significant at P < 0.05. 3 stars indicate that P < 0.001.

Shuffled Responses

In Figure 3C we used bootstrap analysis with 1,000 repetitions to measure the mean and SEM of the shuffled response.

Smoothing

Output neuron spike trains and MG cell narrow spike trains were smoothed using a 10 ms Gaussian kernel. MG cell broad spikes (which have a much lower baseline rate of ~1 Hz) were smoothed using a 25 ms Gaussian kernel.

Subtracting pre-pairing response

MG cell narrow spikes and membrane potential tend to have an excitatory response to the command alone prior to any pairing. Therefore, to isolate negative image responses (or sensory responses from experiments where the mimic was paired with EOD commands) we subtracted the pre-pairing response. For pooling MG narrow spike responses after >1 hour pairing (Figure 4B,C, >1hr) we subtracted the average pre-pairing response taken from a different set of MG cells for which data was obtained prior to pairing, i.e. those from the 4 minutes pairing experiments.

Residual Variance

(Figure 2D; Figure S1A)

ResidualVariance=<pair2><pair>2<EOD2><EOD>2

<x > ≡ E[x]. “Pair” is the response after some period of pairing/learning (cmnd+mimic) and “EOD” is the response to the EOD mimic.

Cross Correlation

We first define,

Rxy(m)=n=1n=Nmxn+mynform0.
Rxy(m)=Ryx(m)form<0.

m is the lag, N is sample size.

Cross correlation for lag m was measured as xcorr(m)=Rxy(m)Rxx(0)Ryy(0). The normalization limits the range of values to be [−1 1] akin to Pearson Coefficient. The traces were generally mean subtracted. However, for processed traces (i.e. when pre pairing was subtracted) the median was subtracted. Cross correlation was measured across lag of 8 ms, and the value reported is maxxm(∣xcorr(x)∣). However, for Figure 3C showing lack of relationship in the mimic response between broad spikes and narrow spikes we calculated lag up to 15 ms.

Analysis of magnitude of response related to Figures S3&S4

To calculate the magnitude of responses we focused our analysis on the area around the peak response, restricted to a range 0.5-45 ms following the command. We report the average change from pre pairing in a window of 16 ms around the peak response. Peak response time is the index of max(∣EOD∣, ∣CD∣). For broad spikes and output cells, the traces were mean subtracted. For narrow spikes and membrane potential of MG cells we first subtracted prepairing from the trace and then the trace was baseline subtracted (baseline was approximated by the median). In all the plots using this method, the time of peak response found for the data plotted in the x axis was used for the data plotted in the y axis.

DATA AND CODE AVAILABILITY

Data and data analysis code will be available at http://dx.doi.Org/10.17632/4pxbpc6d5c.1 Morphology of the MG cell as well as code of biophysical model will be available at http://modeldb.yale.edu/259261

Supplementary Material

1

Figure S1 related to Figure 2: Time course of sensory cancellation. (A) Similar to Figure 2D, but from brief (~4 min) periods of sensory pairing. In this data set we also recorded corollary discharge responses before pairing (pre pairing) and subtracted them from responses after pairing in order to reveal the negative image. Cancellation of the effects of the mimic (residual variance < 1), was observed in output (n = 78) and MG-Bspk (n = 78) but not in MG-Nspk responses (n = 44; P < 0.0001). (B) Cancelation of the effects of the mimic in E output neurons measured as the standard deviation of the firing rate response during pairing. After >30 minutes (n = 51) and after 4 minutes (n = 32) of pairing, responses are significantly reduced relative to the standard deviation of the response to sensory input alone (n = 78; P < 0.0001 and P < 0.001, respectively). After >30 minutes of pairing most cells are within the range of prepairing responses (n = 21). Note, the rate and accuracy of cancellation is qualitatively similar in I cells. However, quantifying cancellation in I cells is hindered by firing rate rectification. Hence data from I cells are not shown here. (C) Cancellation of the effects of the mimic in broad spikes of BS+ MG cells. After >30 minutes of pairing (n = 22) and after 4 minutes (n = 43), paired responses to the mimic are significantly reduced relative to the standard deviation of the response to the sensory input alone (n = 65; P < 0.0001). Most cells are within the range of prepairing responses (n = 44). (D) Average responses of E cells under same conditions as described above. Note, that responses to the paired stimulus are markedly reduced after pairing but are still larger than pre-pairing responses, consistent with under- rather than over-cancellation. (E) Same as in D but for BS+ cells. (F) Distribution of peak amplitudes of subthreshold sensory responses on membrane voltage measured intracellularly in E and I output cells. Spikes were blocked by hyperpolarization or removed off-line using a median filter.

2

Figure S2 related to Figure 2: Responses of output and MG cells to prey-like stimuli. (A) Raw voltage trace (top), stimulus waveform (middle, 5-20 Hz band-passed noise stimulus), and smoothed firing rate (bottom, 35 ms Gaussian kernel) for an example E cell. (B) Same display for an example MG cell. Arrow indicates the occurrence of a broad spike. Firing rate analysis was performed only on the narrow spikes. (C) Cross-correlation between the firing rate and the stimulus waveform for the example output cell (black) and MG cell narrow spikes (red) shown in A and B. (D) Summary plot of the magnitude of the peak cross-correlation value for output cells (E cells: n = 7-16 cell per data point; I cells: n = 6-17 cell per data point) versus MG cells (n = 4-14 cell per data point) across a range of stimulus amplitudes. *** indicates P < 0.001, ** indicates P < 0.01, * indicates P < 0.05, Student’s t-test with Bonferroni correction for multiple comparisons.

3

Figure S3 related to Figure 3: EOD responses of MG cell broad and narrow spikes are uncorrelated. Additional analysis related to Figure 3C. Data points indicate the average change in response relative to the pre-pairing condition calculated in a 16 ms window centered on the peak response (see STAR Methods). (A) Sensory responses of narrow spikes (Nspks) (circles) and subthreshold membrane potential (Vm; crosses) recorded in the same cell exhibit an approximately linear relationship (gray line, linear regression; n = 41, r2 = 0.64). (B) In contrast, sensory responses of Nspk (circles) and Vm (crosses) exhibit no clear relationship with the sensory responses of broad spikes (Bspks) recorded in the same cell (Nspk: n = 60, Vm: n = 42).

4

Figure S4 related to Figure 3: MG cells transmit a negative image of the response of broad spikes to the EOD mimic. All data are from experiments in which paired sensory input was delivered for 4 minutes. Data points in the scatter plots indicate the average change in response relative to the pre-pairing condition calculated in a 16 ms window centered on the peak response (see STAR Methods). (A) Changes in corollary discharge (CD) response of narrow spikes (Nspks) (circles; n = 30) and Vm (crosses; n = 38) have the same polarity as changes in the CD response of broad spikes (Bspks) recorded in the same cell and (B) opposite polarity to the sensory response of Bspks recorded in the same cell. (C) Related to Figure 3F, but here responses are changes during the pairing period, i.e. subtracting early (initial ~30 sec) from late (last ~30 sec) pairing responses, rather than subtracting pre from post pairing responses. Changes in CD responses during pairing were positively correlated between Nspks and Bspks (circles, r = 0.33, n = 41) and between Vm and Bspks, (crosses, r = 0.37, n = 30). (D) Related to the middle bar in Figure 3G, changes in Nspk and Vm CD responses during pairing were negatively correlated with Bspk sensory responses (r = −0.30, n = 41 for Nspks and r = −0.40, n = 30 for Vm). (E) Changes in CD responses are opposite to sensory responses in output neurons (n = 52). (F) Same as in (E) but for Bspks (n = 74). (G) Changes in CD responses of Nspks (circles, n = 41) and Vm (crosses, n = 33) are uncorrelated with the Nspk response to sensory input. Note, sensory response can be excitatory or inhibitory. Sensory-evoked firing rate response of I cells, BS− cells as well as CD response of E cells and BS+, are rectified at 0.

5

Figure S5 related to Figure 4. Negative images during pairing with sensory input. (A-B) Analysis related to Figure 4B,C, but showing pooled changes in corollary discharge (CD) responses during (as opposed to after) pairing measured by subtracting early (initial ~30 sec) from late (last ~30 sec) pairing responses (BS+, n = 13; BS−, n = 9). BS+ and BS− cells show approximately opposite changes, similar to those observed after pairing (Figure 4B). For BS− the average change in the CD response during pairing is smaller than the change relative to the pre pairing condition (Figure 4C). This is likely due to the fact that the initial narrow spike (Nspk) response to the mimic is, on average, in the same direction as the negative image. Also, for BS− cells the average change in the Nspk CD response during pairing is smaller than the average change in the CD response observed after pairing (Figure 4B). This may be due to the fact that during pairing there is some shunting of the CD input by the sensory input. (C) Related to Figure 7D. Average traces from experiments (n = 7) in which control pairings with an inhibitory stimulus were compared with pairing with hyperpolarizing current injections which evoked similar responses. Negative images measured under control conditions during pairing (measured by subtracting the early response from the late response during pairing) (black trace) were larger than changes induced by current injections (blue trace). The latter was measured after pairing because the current injection obscures the membrane potential trace during pairing.

6

Figure S6. Related to Figure 5: BS+ and BS− cells correspond to distinct anatomical classes. (A) Camera lucida reconstructions of remaining BS− cells not shown in Figure 5. Axon is drawn in red and ELL layer boundaries indicated by dashed lines. mol, molecular layer, ga, ganglion layer, pl, plexiform layer, gr, granular layer. (B) Camera lucida reconstructions of remaining BS+ cells not shown in Figure 5. (C) Narrow spike (Nspk) firing evoked by the EOD in morphologically reconstructed BS+ (n = 5) and BS− (n = 7) cells. Nspk responses of two BS− cells recorded in the dorsolateral zone of ELL were excluded from the clustering analysis in Figure 5E because the absolute magnitude of Nspk responses is expected to differ systematically across the zones. However, the results are in qualitative agreement with those from VLZ MG cells. A few cells show negative firing rates because we computed the Nspk response to sensory input by subtracting pre pairing from early pairing responses.

  • Biological solutions to general problems in multi-layer learning are shown.

  • Intermediate layer function requires compartmentalization of learning and signaling.

  • Circuit organization based on learning solves the credit assignment problem.

Acknowledgments:

We thank R. Bruno for assistance with Neurolucida reconstruction, C. Dempsey for contributing data analysis code, H. Hou for FIJI scripts used for quantifying MG axon distributions, M. Hibi for the parvalbumin antibody, P. Roberts for advice on biophysical modeling, and H. Hou, K. Key, K. Perks, A. Wallach, and R. Warren for comments on the manuscript. This work was supported by grants from the NIH (NS075023) and Irma T. Hirschl Trust to N.B.S. and by a grant from the NSF (1025849) to N.B.S. and L.F.A. L.F.A. was further supported by the Simons and Gatsby Foundations and by NSF NeuroNex Award DBI-1707398.

Footnotes

Declaration of interests: The authors declare no competing interests.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Apps R, and Garwicz M (2005). Anatomical and physiological foundations of cerebellar information processing. Nat Rev Neurosci 6, 297–311. [DOI] [PubMed] [Google Scholar]
  2. Basu J, Zaremba JD, Cheung SK, Hitti FL, Zemelman BV, Losonczy A, and Siegelbaum SA (2016). Gating of hippocampal activity, plasticity, and memory by entorhinal cortex long-range inhibition. Science 351, aaa5694. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bear MF, and Malenka RC (1994). Synaptic plasticity: LTP and LTD. Curr Opin Neurobiol 4, 389–399. [DOI] [PubMed] [Google Scholar]
  4. Bell CC (1981). An efference copy modified by reafferent input. Science 214, 450–453. [DOI] [PubMed] [Google Scholar]
  5. Bell CC (1982). Properties of a modifiable efference copy in electric fish. J Neurophysiol 47, 1043–1056. [DOI] [PubMed] [Google Scholar]
  6. Bell CC, Caputi A, and Grant K (1997a). Physiology and plasticity of morphologically identified cells in the mormyrid electrosensory lobe. J Neurosci 17, 6409–6422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bell CC, Caputi A, Grant K, and Serrier J (1993). Storage of a sensory pattern by anti-Hebbian synaptic plasticity in an electric fish. Proc Nat Acad Sci 90, 4650–4654. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Bell CC, Finger TE, and Russell CJ (1981). Central connections of the posterior lateral line lobe in mormyrid fish. Exp Brain Res 42, 9–22. [DOI] [PubMed] [Google Scholar]
  9. Bell CC, Han V, and Sawtell NB (2008). Cerebellum-like structures and their implications for cerebellar function. Annu Rev Neurosci 31, 1–24. [DOI] [PubMed] [Google Scholar]
  10. Bell CC, Han VZ, Sugawara S, and Grant K (1997b). Synaptic plasticity in a cerebellum-like structure depends on temporal order. Nature 387, 278–281. [DOI] [PubMed] [Google Scholar]
  11. Bell CC, Meek J, and Yang JY (2005). Immunocytochemical identification of cell types in the mormyrid electrosensory lobe. J Comp Neurol 483, 124–142. [DOI] [PubMed] [Google Scholar]
  12. Bell CC, and Russell CJ (1978). Effect of electric organ discharge on ampullary receptors in a mormyrid. Brain Res 145, 85–96. [DOI] [PubMed] [Google Scholar]
  13. Bittner KC, Grienberger C, Vaidya SP, Milstein AD, Macklin JJ, Suh J, Tonegawa S, and Magee JC (2015). Conjunctive input processing drives feature selectivity in hippocampal CA1 neurons. Nat Neurosci 18, 1133–1142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Buonomano DV, and Merzenich MM (1998). Cortical plasticity: from synapses to maps. Annu Rev Neurosci 21, 149–186. [DOI] [PubMed] [Google Scholar]
  15. Caporale N, and Dan Y (2008). Spike timing-dependent plasticity: a Hebbian learning rule. Annu Rev Neurosci 31, 25–46. [DOI] [PubMed] [Google Scholar]
  16. Carnevale NT, and Hines ML (2006). The NEURON book (Cambridge, UK ; New York: Cambridge University Press; ). [Google Scholar]
  17. Destexhe A, Mainen ZF, and Sejnowski TJ (1994). An Efficient Method for Computing Synaptic Conductances Based on a Kinetic-Model of Receptor-Binding. Neural Computation 6, 14–18. [Google Scholar]
  18. Engelmann J, van den BE, Bacelo J, de Ruijters M, Kuwana S, Sugawara Y, and Grant K (2008). Dendritic backpropagation and synaptic plasticity in the mormyrid electrosensory lobe. J Physiol Paris 102, 233–245. [DOI] [PubMed] [Google Scholar]
  19. Enikolopov AG, Abbott LF, and Sawtell NB (2018). Internally Generated Predictions Enhance Neural and Behavioral Detection of Sensory Stimuli in an Electric Fish. Neuron 99, 135–146 e133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Grant K, Meek J, Sugawara Y, Veron M, Denizot JP, Hafmans J, Serrier J, and Szabo T (1996). Projection neurons of the mormyrid electrosensory lateral line lobe:morphology, immunocytochemistry and synaptology. J Comp Neurol 375, 18–42. [DOI] [PubMed] [Google Scholar]
  21. Grant K, Sugawara S, Gomez L, Han VZ, and Bell CC (1998). The Mormyrid Electrosensory Lobe In Vitro: Physiology and Pharmacology of Cells and Circuits. J Neurosci 18, 6009–6025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Guerguiev J, Lillicrap TP, and Richards BA (2017). Towards deep learning with segregated dendrites. Elife 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Han VZ, Bell CC, Grant G, and Sugawara Y (1999). Mormyrid electrosensory lobe in vitro: morphology of cells and circuits. Journal of Comparative Neurology 404, 359–374. [DOI] [PubMed] [Google Scholar]
  24. Han VZ, Grant G, and Bell CC (2000). Reversible associative depression and nonassociative potentiation at a parallel fiber synapse. Neuron 27, 611–622. [DOI] [PubMed] [Google Scholar]
  25. Hangya B, Ranade SP, Lorenc M, and Kepecs A (2015). Central Cholinergic Neurons Are Rapidly Recruited by Reinforcement Feedback. Cell 162, 1155–1168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Herzfeld DJ, Kojima Y, Soetedjo R, and Shadmehr R (2015). Encoding of action by the Purkinje cells of the cerebellum. Nature 526, 439–442. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hollmann V, Engelmann J, and Gomez-Sena L (2016). A quest for excitation: Theoretical arguments and immunohistochemical evidence of excitatory granular cells in the ELL of Gnathonemus petersii. J Physiol Paris 110, 190–199. [DOI] [PubMed] [Google Scholar]
  28. Jaffe DB, Ross WN, Lisman JE, Lasserross N, Miyakawa H, and Johnston D (1994). A Model for Dendritic Ca2+ Accumulation in Hippocampal Pyramidal Neurons Based on Fluorescence Imaging Measurements. Journal of Neurophysiology 71, 1065–1077. [DOI] [PubMed] [Google Scholar]
  29. Kennedy A, Wayne G, Kaifosh P, Alvina K, Abbott LF, and Sawtell NB (2014). A temporal basis for predicting the sensory consequences of motor commands in an electric fish. Nat Neurosci 17, 416–422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Knudsen EI (1994). Supervised learning in the brain. J Neurosci 14, 3985–3997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kording KP, and Konig P (2001). Supervised and unsupervised learning with two sites of synaptic integration. J Comput Neurosci 11, 207–215. [DOI] [PubMed] [Google Scholar]
  32. Lacefield CO, Pnevmatikakis EA, Paninski L, and Bruno RM (2019). Reinforcement Learning Recruits Somata and Apical Dendrites across Layers of Primary Sensory Cortex. Cell Rep 26, 2000–2008 e2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Larkum ME, Nevian T, Sandler M, Polsky A, and Schiller J (2009). Synaptic integration in tuft dendrites of layer 5 pyramidal neurons: a new unifying principle. Science 325, 756–760. [DOI] [PubMed] [Google Scholar]
  34. Larkum ME, Zhu JJ, and Sakmann B (1999). A new cellular mechanism for coupling inputs arriving at different cortical layers. Nature 398, 338–341. [DOI] [PubMed] [Google Scholar]
  35. LeCun Y, Bengio Y, and Hinton G (2015). Deep learning. Nature 521, 436–444. [DOI] [PubMed] [Google Scholar]
  36. Lovett-Barron M, Kaifosh P, Kheirbek MA, Danielson N, Zaremba JD, Reardon TR, Turi GF, Hen R, Zemelman BV, and Losonczy A (2014). Dendritic inhibition in the hippocampus supports fear learning. Science 343, 857–863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Marblestone AH, Wayne G, and Kording KP (2016). Toward an Integration of Deep Learning and Neuroscience. Front Comput Neurosci 10, 94. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Meek J, Grant K, and Bell C (1999). Structural organization of the mormyrid electrosensory lateral line lobe. J Exp Biol 202, 1291–1300. [DOI] [PubMed] [Google Scholar]
  39. Meek J, Grant K, Sugawara S, Hafmans TGM, Veron M, and Denizot JP (1996). Interneurons of the ganglionic layer in the mormyrid electrosensory lateral line lobe: morphology,immunocytochemistry,and synaptology. J Comp Neurol 375, 43–65. [DOI] [PubMed] [Google Scholar]
  40. Mohr C, Roberts PD, and Bell CC (2003). Cells of the mormyromast region of the mormyrid electrosensory lobe: I. Responses to the electric organ corollary discharge and to electrosensory stimuli. J Neurophysiol 90, 1193–1210. [DOI] [PubMed] [Google Scholar]
  41. Oscarsson O (1979). Functional Units of the Cerebellum - Sagittal Zones and Microzones. Trends in Neurosciences 2, 143–145. [Google Scholar]
  42. Roberts PD, and Bell CC (2000). Computational consequences of temporally asymmetric learning rules: II. sensory image cancellation. J Comput Neurosci 9, 67–83. [DOI] [PubMed] [Google Scholar]
  43. Roberts PD, and Leen TK (2010). Anti-hebbian spike-timing-dependent plasticity and adaptive sensory processing. Front Comput Neurosci 4, 156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Rumelhart DE, Hinton GE, and Williams RJ (1988). Learning representations by back-propagating errors. Cognitive modeling 5, 1. [Google Scholar]
  45. Sawtell NB, Williams A, and Bell CC (2007). Central control of dendritic spikes shapes the responses of Purkinje-like cells through spike timing-dependent synaptic plasticity. J Neurosci 27, 1552–1565. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Schiess M, Urbanczik R, and Senn W (2016). Somato-dendritic Synaptic Plasticity and Error-backpropagation in Active Dendrites. PLoS Comput Biol 12, e1004638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Schiller J, Schiller Y, Stuart G, and Sakmann B (1997). Calcium action potentials restricted to distal apical dendrites of rat neocortical pyramidal neurons. J Physiol 505 (Pt 3), 605–616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Sperry RW (1950). Neural basis of the spontaneous optokinetic response produced by visual inversion. J Comp Physiol Psychol 43, 482–489. [DOI] [PubMed] [Google Scholar]
  49. Sugawara Y, Grant K, Han V, and Bell CC (1999). Physiology of electrosensory lateral line lobe neurons in Gnathonemus petersii. J Exp Biol 202, 1301–1309. [DOI] [PubMed] [Google Scholar]
  50. Urbanczik R, and Senn W (2014). Learning by the dendritic prediction of somatic spiking. Neuron 81, 521–528. [DOI] [PubMed] [Google Scholar]
  51. von Holst E, and Mittelstaedt H (1950). The reafference principle. Naturwissenschaften 37, 464–476. [Google Scholar]
  52. Wilmes KA, Sprekeler H, and Schreiber S (2016). Inhibition as a Binary Switch for Excitatory Plasticity in Pyramidal Neurons. PLoS Comput Biol 12, e1004768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Zhang J, Han VZ, Meek J, and Bell CC (2007). Granular cells of the mormyrid electrosensory lobe and postsynaptic control over presynaptic spike occurrence and amplitude through an electrical synapse. J Neurophysiol 97, 2191–2203. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

Figure S1 related to Figure 2: Time course of sensory cancellation. (A) Similar to Figure 2D, but from brief (~4 min) periods of sensory pairing. In this data set we also recorded corollary discharge responses before pairing (pre pairing) and subtracted them from responses after pairing in order to reveal the negative image. Cancellation of the effects of the mimic (residual variance < 1), was observed in output (n = 78) and MG-Bspk (n = 78) but not in MG-Nspk responses (n = 44; P < 0.0001). (B) Cancelation of the effects of the mimic in E output neurons measured as the standard deviation of the firing rate response during pairing. After >30 minutes (n = 51) and after 4 minutes (n = 32) of pairing, responses are significantly reduced relative to the standard deviation of the response to sensory input alone (n = 78; P < 0.0001 and P < 0.001, respectively). After >30 minutes of pairing most cells are within the range of prepairing responses (n = 21). Note, the rate and accuracy of cancellation is qualitatively similar in I cells. However, quantifying cancellation in I cells is hindered by firing rate rectification. Hence data from I cells are not shown here. (C) Cancellation of the effects of the mimic in broad spikes of BS+ MG cells. After >30 minutes of pairing (n = 22) and after 4 minutes (n = 43), paired responses to the mimic are significantly reduced relative to the standard deviation of the response to the sensory input alone (n = 65; P < 0.0001). Most cells are within the range of prepairing responses (n = 44). (D) Average responses of E cells under same conditions as described above. Note, that responses to the paired stimulus are markedly reduced after pairing but are still larger than pre-pairing responses, consistent with under- rather than over-cancellation. (E) Same as in D but for BS+ cells. (F) Distribution of peak amplitudes of subthreshold sensory responses on membrane voltage measured intracellularly in E and I output cells. Spikes were blocked by hyperpolarization or removed off-line using a median filter.

2

Figure S2 related to Figure 2: Responses of output and MG cells to prey-like stimuli. (A) Raw voltage trace (top), stimulus waveform (middle, 5-20 Hz band-passed noise stimulus), and smoothed firing rate (bottom, 35 ms Gaussian kernel) for an example E cell. (B) Same display for an example MG cell. Arrow indicates the occurrence of a broad spike. Firing rate analysis was performed only on the narrow spikes. (C) Cross-correlation between the firing rate and the stimulus waveform for the example output cell (black) and MG cell narrow spikes (red) shown in A and B. (D) Summary plot of the magnitude of the peak cross-correlation value for output cells (E cells: n = 7-16 cell per data point; I cells: n = 6-17 cell per data point) versus MG cells (n = 4-14 cell per data point) across a range of stimulus amplitudes. *** indicates P < 0.001, ** indicates P < 0.01, * indicates P < 0.05, Student’s t-test with Bonferroni correction for multiple comparisons.

3

Figure S3 related to Figure 3: EOD responses of MG cell broad and narrow spikes are uncorrelated. Additional analysis related to Figure 3C. Data points indicate the average change in response relative to the pre-pairing condition calculated in a 16 ms window centered on the peak response (see STAR Methods). (A) Sensory responses of narrow spikes (Nspks) (circles) and subthreshold membrane potential (Vm; crosses) recorded in the same cell exhibit an approximately linear relationship (gray line, linear regression; n = 41, r2 = 0.64). (B) In contrast, sensory responses of Nspk (circles) and Vm (crosses) exhibit no clear relationship with the sensory responses of broad spikes (Bspks) recorded in the same cell (Nspk: n = 60, Vm: n = 42).

4

Figure S4 related to Figure 3: MG cells transmit a negative image of the response of broad spikes to the EOD mimic. All data are from experiments in which paired sensory input was delivered for 4 minutes. Data points in the scatter plots indicate the average change in response relative to the pre-pairing condition calculated in a 16 ms window centered on the peak response (see STAR Methods). (A) Changes in corollary discharge (CD) response of narrow spikes (Nspks) (circles; n = 30) and Vm (crosses; n = 38) have the same polarity as changes in the CD response of broad spikes (Bspks) recorded in the same cell and (B) opposite polarity to the sensory response of Bspks recorded in the same cell. (C) Related to Figure 3F, but here responses are changes during the pairing period, i.e. subtracting early (initial ~30 sec) from late (last ~30 sec) pairing responses, rather than subtracting pre from post pairing responses. Changes in CD responses during pairing were positively correlated between Nspks and Bspks (circles, r = 0.33, n = 41) and between Vm and Bspks, (crosses, r = 0.37, n = 30). (D) Related to the middle bar in Figure 3G, changes in Nspk and Vm CD responses during pairing were negatively correlated with Bspk sensory responses (r = −0.30, n = 41 for Nspks and r = −0.40, n = 30 for Vm). (E) Changes in CD responses are opposite to sensory responses in output neurons (n = 52). (F) Same as in (E) but for Bspks (n = 74). (G) Changes in CD responses of Nspks (circles, n = 41) and Vm (crosses, n = 33) are uncorrelated with the Nspk response to sensory input. Note, sensory response can be excitatory or inhibitory. Sensory-evoked firing rate response of I cells, BS− cells as well as CD response of E cells and BS+, are rectified at 0.

5

Figure S5 related to Figure 4. Negative images during pairing with sensory input. (A-B) Analysis related to Figure 4B,C, but showing pooled changes in corollary discharge (CD) responses during (as opposed to after) pairing measured by subtracting early (initial ~30 sec) from late (last ~30 sec) pairing responses (BS+, n = 13; BS−, n = 9). BS+ and BS− cells show approximately opposite changes, similar to those observed after pairing (Figure 4B). For BS− the average change in the CD response during pairing is smaller than the change relative to the pre pairing condition (Figure 4C). This is likely due to the fact that the initial narrow spike (Nspk) response to the mimic is, on average, in the same direction as the negative image. Also, for BS− cells the average change in the Nspk CD response during pairing is smaller than the average change in the CD response observed after pairing (Figure 4B). This may be due to the fact that during pairing there is some shunting of the CD input by the sensory input. (C) Related to Figure 7D. Average traces from experiments (n = 7) in which control pairings with an inhibitory stimulus were compared with pairing with hyperpolarizing current injections which evoked similar responses. Negative images measured under control conditions during pairing (measured by subtracting the early response from the late response during pairing) (black trace) were larger than changes induced by current injections (blue trace). The latter was measured after pairing because the current injection obscures the membrane potential trace during pairing.

6

Figure S6. Related to Figure 5: BS+ and BS− cells correspond to distinct anatomical classes. (A) Camera lucida reconstructions of remaining BS− cells not shown in Figure 5. Axon is drawn in red and ELL layer boundaries indicated by dashed lines. mol, molecular layer, ga, ganglion layer, pl, plexiform layer, gr, granular layer. (B) Camera lucida reconstructions of remaining BS+ cells not shown in Figure 5. (C) Narrow spike (Nspk) firing evoked by the EOD in morphologically reconstructed BS+ (n = 5) and BS− (n = 7) cells. Nspk responses of two BS− cells recorded in the dorsolateral zone of ELL were excluded from the clustering analysis in Figure 5E because the absolute magnitude of Nspk responses is expected to differ systematically across the zones. However, the results are in qualitative agreement with those from VLZ MG cells. A few cells show negative firing rates because we computed the Nspk response to sensory input by subtracting pre pairing from early pairing responses.

Data Availability Statement

Data and data analysis code will be available at http://dx.doi.Org/10.17632/4pxbpc6d5c.1 Morphology of the MG cell as well as code of biophysical model will be available at http://modeldb.yale.edu/259261

RESOURCES