Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2010 Apr 29;6(4):e1000759. doi: 10.1371/journal.pcbi.1000759

How Informative Are Spatial CA3 Representations Established by the Dentate Gyrus?

Erika Cerasti 1, Alessandro Treves 1,2,*
Editor: Lyle J Graham3
PMCID: PMC2861628  PMID: 20454678

Abstract

In the mammalian hippocampus, the dentate gyrus (DG) is characterized by sparse and powerful unidirectional projections to CA3 pyramidal cells, the so-called mossy fibers. Mossy fiber synapses appear to duplicate, in terms of the information they convey, what CA3 cells already receive from entorhinal cortex layer II cells, which project both to the dentate gyrus and to CA3. Computational models of episodic memory have hypothesized that the function of the mossy fibers is to enforce a new, well separated pattern of activity onto CA3 cells, to represent a new memory, prevailing over the interference produced by the traces of older memories already stored on CA3 recurrent collateral connections. Can this hypothesis apply also to spatial representations, as described by recent neurophysiological recordings in rats? To address this issue quantitatively, we estimate the amount of information DG can impart on a new CA3 pattern of spatial activity, using both mathematical analysis and computer simulations of a simplified model. We confirm that, also in the spatial case, the observed sparse connectivity and level of activity are most appropriate for driving memory storage – and not to initiate retrieval. Surprisingly, the model also indicates that even when DG codes just for space, much of the information it passes on to CA3 acquires a non-spatial and episodic character, akin to that of a random number generator. It is suggested that further hippocampal processing is required to make full spatial use of DG inputs.

Author Summary

The CA3 region at the core of the hippocampus, a structure crucial to memory formation, presents one striking anatomical feature. Its neurons receive many thousands of weak inputs from other sources, but only a few tens of very strong inputs from the neurons in the directly preceding region, the dentate gyrus. It had been proposed that such sparse connectivity helps the dentate gyrus to drive CA3 activity during the storage of new memories, but why it needs to be so sparse had remained unclear. Recent recordings of neuronal activity in the dentate gyrus (Leutgeb, et al. 2007) show the firing maps of granule cells of rodents engaged in exploration: the few cells active in a given environment, about 3% of the total, present multiple firing fields. Following these findings, we could now construct a network model that addresses the question quantitatively. Both mathematical analysis and computer simulations of the model show that, while the memory system would function also otherwise, connections as sparse as those observed make it function optimally, in terms of the bits of information new memories contain. Much of this information, we show, is encoded however in a difficult format, suggesting that other regions of the hippocampus, until now with no clear role, may contribute to decode it.

Introduction

The hippocampus presents the same organizaton across mammals, and distinct ones in reptiles and in birds. A most prominent and intriguing feature of the mammalian hippocampus is the dentate gyrus (DG). As reviewed in [1], the dentate gyrus is positioned as a sort of intermediate station in the information flow between the entorhinal cortex and the CA3 region of the hippocampus proper. Since CA3 receives also direct, perforant path connections from entorhinal cortex, the DG inputs to CA3, called mossy fibers, appear to essentially duplicate the information that CA3 can already receive directly from the source. What may be the function of such a duplication?

Within the view that the recurrent CA3 network operates as an autoassociative memory [2], [3], it has been suggested that the mossy fibers (MF) inputs are those that drive the storage of new representations, whereas the perforant path (PP) inputs relay the cue that initiates the retrieval of a previously stored representation, through attractor dynamics, due largely to recurrent connections (RC). Such a proposal is supported by a mathematical model which allows a rough estimate of the amount of information, in bits, that different inputs may impart to a new CA3 representation [4]. That model, however, is formulated in the Marr [5] framework of discrete memory states, each of which is represented by a single activity configuration or firing pattern.

Conversely, the prediction that MF inputs may be important for storage and not for retrieval has received tentative experimental support from experiments with spatial tasks, either the Morris water maze [6] or a dry maze [7]. Two-dimensional spatial representations, to be compatible with the attractor dynamics scenario, require a multiplicity of memory states, which approximate a 2D continuous manifold, isomorphic to the spatial environment to be represented. Moreover, there has to be of course a multiplicity of manifolds, to represent distinct environments with complete remapping from one to the other [8]. Attractor dynamics then occurs along the dimensions locally orthogonal to each manifold, as in the simplified “multi-chart” model [9], [10], whereas tangentially one expects marginal stability, allowing for small signals related to the movement of the animal, reflecting changing sensory cues as well as path integration, to displace a “bump” of activity on the manifold, as appropriate [9], [11].

Although the notion of a really continuous attractor manifold appears as a limit case, which can only be approximated by a network of finite size [12], [13], [14], [15], even the limit case raises the issue of how a 2D attractor manifold can be established. In the rodent hippocampus, the above theoretical suggestion and experimental evidence point at a dominant role of the dentate gyrus, but it has remained unclear how the dentate gyrus, with its MF projections to CA3, can drive the establishment not just of a discrete pattern of activity, as envisaged by [4], but of an entire spatial representation, in its full 2D glory. This paper reports the analysis of a simplified mathematical model aimed at addressing this issue in a quantitative, information theoretical fashion.

Such an analysis would have been difficult even only a few years ago, before the experimental discoveries that largely clarified, in the rodent, the nature of the spatial representations in the regions that feed into CA3. First, roughly half of the entorhinal PP inputs, those coming from layer II of the medial portion of entorhinal cortex, were found to be often in the form of grid cells, i.e. units that are activated when the animal is in one of multiple regions, arranged on a regular triangular grid [16]. Second, the sparse activity earlier described in DG granule cells [17] was found to be concentrated on cells also with multiple fields, but irregularly arranged in the environment [18]. These discoveries can now inform a simplified mathematical model, which would have earlier been based on ill-defined assumptions. Third, over the last decade neurogenesis in the adult dentate gyrus has been established as a quantitatively constrained but still significant phenomenon, stimulating novel ideas about its functional role [19]. The first and third of these phenomena will be considered in extended versions of our model, to be analysed elsewhere; here, we focus on the role of the multiple DG place fields in establishing novel CA3 representations.

A simplified mathematical model

The complete model considers the firing rate of a CA3 pyramidal cell, Inline graphic, to be determined by the firing rates Inline graphic of other cells in CA3, which influence it through RC connections; by the firing rates Inline graphic of DG granule cells, which feed into it through MF connections; by the firing rates Inline graphic of layer II pyramidal cells in entorhinal cortex (medial and lateral), which project to CA3 through PP axons; and by various feedforward and feedback inhibitory units. A most important simplification is that the fine temporal dynamics, e.g. on theta and gamma time scales, is neglected altogether, so that with “firing rate” we mean an average over a time of order the theta period, a hundred msec or so. Very recent evidence indicates, in fact, that only one of two competing spatial representations tends to be active in CA3 within each theta period [Jezek et al, SfN abstract, 2009]. Information coding over shorter time scales would require anyway a more complex analysis, which is left to future refinements of the model.

For the different systems of connections, we assume the existence of anatomical synapses between any two cells to be represented by fixed binary matrices Inline graphic taking 0 or 1 values, whereas the efficacy of those synapses to be described by matrices Inline graphic. Since they have been argued to have a minor influence on coding properties and storage capacity [20], consistent with the diffuse spatial firing of inhibitory interneurons [21], the effect of inhibition and of the current threshold for activating a cell are summarized into a subtractive term, of which we denote with Inline graphic the mean value across CA3 cells, and with Inline graphic the deviation from the mean for a particular cell Inline graphic.

Assuming finally a simple threshold-linear activation function [22] for the relation between the activating current and the output firing rate, we write

graphic file with name pcbi.1000759.e010.jpg (1)

where Inline graphic indicates taking the sum inside the brackets if positive in value, and zero if negative, and Inline graphic is a gain factor. The firing rates of the various populations are all assumed to depend on the position Inline graphic of the animal, and the notation is chosen to minimize differences with our previous analyses of other components of the hippocampal system (e.g. [22], [23]).

The storage of a new representation

When the animal is exposed to a new environment, we make the drastic modelling assumption that the new CA3 representation be driven solely by MF inputs, while PP and RC inputs provide interfering information, reflecting the storage of previous representations on those synaptic systems, i.e., noise. Such “noise” can in fact act as an undesired signal and bring about the retrieval of a previous, “wrong” representation, an interesting process which is not however analysed here. We reabsorb the mean of such noise into the mean of the “threshold+inhibition” term Inline graphic and similarly for the deviation from the mean. We use the same symbols for the new variables incorporating RC and PP interference, but removing in both cases the “Inline graphic” sign, thus writing

graphic file with name pcbi.1000759.e016.jpg (2)

where the gain has been set to Inline graphic, without loss of generality, by an appropriate choice of the units in which to measure Inline graphic (pure numbers) and Inline graphic (Inline graphic).

As for the MF inputs, we consider a couple of simplified models that capture the essential finding by [18], of the irregularly arranged multiple fields, as well as the observed low activity level of DG granule cells [24], while retaining the mathematical simplicity that favours an analytical treatment. We thus assume that only a randomly selected fraction Inline graphic of the granule cells are active in a new environment, of size Inline graphic, and that those units are active in a variable number Inline graphic of locations, with Inline graphic drawn from a distribution with mean Inline graphic. In model A, which we take as our reference, the distribution is taken to be Poisson (the data reported by Leutgeb et al [18] are fit very well by a Poisson distribution with Inline graphic, but their sampling is limited). In model B, which we use as a variant, the distribution is taken to be exponential (this better describes the results of the simulations in [25], though that simple model may well be inappropriate). Therefore, in either model, the firing rate Inline graphic of DG unit Inline graphic is a combination of Inline graphic gaussian “bumps”, or fields, of equal effective size Inline graphic and equal height Inline graphic, centered at random points Inline graphic in the new environment

graphic file with name pcbi.1000759.e033.jpg (3)

The informative inputs driving the firing of a CA3 pyramidal cell, during storage of a new representation, result therefore from a combination of three distributions, in the model. The first, Poisson but close to normal, determines the MF connectivity, that is how it is that each CA3 unit receives only a few tens of connections out of Inline graphic granule cells (in the rat), whereby Inline graphic with Inline graphic. The second, Poisson, determines which of the DG units presynaptic to a CA3 unit is active in the new environment, with Inline graphic. The third, either Poisson or exponential (and see model C below), determines how many fields an active DG unit has in the new environment. Note that in the rat Inline graphic [26] whereas Inline graphic, even when considering presumed newborn neurons [24]. As a result, the total number of active DG units presynaptic to a given CA3 unit, Inline graphic, is of order one, Inline graphic, so that the second Poisson distribution effectively dominates over the first, and the number of active MF impinging on a CA3 unit can approximately be taken to be itself a Poisson variable with mean Inline graphic. As a qualification to such an approximation, one has to consider that different CA3 pyramidal cells, among the Inline graphic present in the rat (on each side), occasionally receive inputs from the same active DG granule cells, but rarely, as Inline graphic, hence the pool of active units Inline graphic is only one order of magnitude smaller than the population of receiving units Inline graphic.

In a further simplification, we consider the MF synaptic weights to be uniform in value, Inline graphic. This assumption, like those of equal height and width of the DG firing fields, is convenient for the analytical treatment but not necessary for the simulations. It will be relaxed later, in the computer simulations addressing the effect of MF synaptic plasticity.

The new representation is therefore taken to be established by an informative signal coming from the dentate gyrus

graphic file with name pcbi.1000759.e048.jpg (4)

modulated, independently for each CA3 unit, by a noise term Inline graphic, reflecting recurrent and perforant path inputs as well as other sources of variability, and which we take to be normally distributed with zero mean and standard deviation Inline graphic.

The position Inline graphic of the animal determines the firing Inline graphic of DG units, which in turn determine the probability distribution for the firing rate of any given CA3 pyramidal unit

graphic file with name pcbi.1000759.e053.jpg

where

graphic file with name pcbi.1000759.e054.jpg

is the integral of the gaussian noise up to given signal-to-noise ratio

graphic file with name pcbi.1000759.e055.jpg

and Inline graphic is Heaviside's function vanishing for negative values of its argument. The first term, multiplying Dirac's Inline graphic, expresses the fact that negative activation values result in zero firing rates, rather than negative rates.

Note that the resulting sparsity, i.e. how many of the CA3 units end up firing significantly at each position, which is a main factor affecting memory storage [21], is determined by the threshold Inline graphic, once the other parameters have been set. The approach taken here is to assume that the system requires the new representation to be sparse and regulates the threshold accordingly. We therefore set the sparsity parameter Inline graphic, in broad agreement with experimental data [14], and adjust Inline graphic (as shown, for the mathematical analysis, in the third section of the Methods).

The distribution of fields per DG unit is given in model A by the Poisson form

graphic file with name pcbi.1000759.e061.jpg

in model B by the exponential form

graphic file with name pcbi.1000759.e062.jpg

and we also consider, as another variant, model C, where each DG unit has one and only one field

graphic file with name pcbi.1000759.e063.jpg

Assessing spatial information content

In the model, spatial position Inline graphic is represented by CA3 units, whose activity is informed about position by the activity of DG units. The activity of each DG unit is determined independently of others by its place fields

graphic file with name pcbi.1000759.e065.jpg

with

graphic file with name pcbi.1000759.e066.jpg

where each contributing field is a gaussian bump

graphic file with name pcbi.1000759.e067.jpg

The Mutual Information Inline graphic quantifies the efficiency with which CA3 activity codes for position, on average, as

graphic file with name pcbi.1000759.e069.jpg (5)

where the outer brackets Inline graphic indicate that the average is not just over the noise Inline graphic, as usual in the estimation of mutual information, but also, in our case, over the quenched, i.e. constant but unknown values of the microscopic quantities Inline graphic, the connectivity matrix, Inline graphic, the number of fields per active unit, and Inline graphic, their centers. For given values of the quenched variables, the total entropy Inline graphic and the (average) equivocation Inline graphic are defined as

graphic file with name pcbi.1000759.e077.jpg (6)
graphic file with name pcbi.1000759.e078.jpg (7)

where Inline graphic is the area of the given environment; the Inline graphics are intended in base 2, to yield information values in bits.

The estimation of the mutual information can be approached analytically directly from these formulas, using the replica trick (see [27]), as shown by [28] and [29], and briefly described in the first section of the Methods. As in those two studies, however, here too we are only able to complete the derivation in the limit of low signal-to-noise, or more precisely of limited variation, across space, of the signal-to-noise around its mean, that is Inline graphic. In this case we obtain, to first order in Inline graphic, an expression that can be shown to be equivalent to

graphic file with name pcbi.1000759.e083.jpg (8)

where we use the notation Inline graphic (cp. [29], Eqs.17, 45).

Being limited to the first order in Inline graphic, the expression above can be obtained in a straightforward manner by directly expanding the logarithms, in the large noise limit Inline graphic, in the simpler formula quantifying the information conveyed by a single CA3 unit

graphic file with name pcbi.1000759.e087.jpg (9)

This single-unit formula cannot quantify the higher-order contributions in Inline graphic, which decrease the information conveyed by a population in which some of the units inevitably convey some of the same information. The replica derivation, instead, in principle would allow one to take into proper account such correlated selectivity, which ultimately results in the information conveyed by large CA3 populations not scaling up linearly with Inline graphic, and saturating instead once enough CA3 units have been sampled, as shown in related models by [28], [29]. In our case however the calculation of e.g. the second order terms in Inline graphic is further complicated by the fact that different CA3 units receive inputs coming from partially overlapping subsets of DG units. This may cause saturation at a lower level, once all DG units have been effectively sampled. The interested reader can follow the derivation sketched in the Methods.

Having to take, in any case, the large noise limit implies that the resulting formula is not really applicable to neuronally plausible values of the parameters, but only to the uninteresting case in which DG units impart very little information onto CA3 units. Therefore we use only the single-unit formula, and resort to computer simulations to assess the effects of correlated DG inputs. The second and third sections of the Methods indicate how to obtain numerical results by evaluating the expression in Eq. 9.

Computer simulations can be used to estimate the information present in samples of CA3 units of arbitrary size, and at arbitrary levels of noise, but at the price of an indirect decoding procedure. A decoding step is required because the dimensionality of the space spanned by the CA3 activity Inline graphic is too high. It increases in fact exponentially with the number Inline graphic of neurons sampled, as Inline graphic, where Inline graphic is the number of possible responses of each neuron. The decoding method we use, described in the fourth section of the Methods, leads to two different types of information estimates, based on either the full or reduced localization matrix. The difference between the two, and between them and the analytical estimate, is illustrated under Results and further discussed at the end of the paper.

Results

The essential mechanism described by the model is very simple, as illustrated in Fig. 1. CA3 units which happen to receive a few DG overlapping fields combine them in a resulting field of their own, that can survive thresholding. The devil is in the quantitative details: what proportion of CA3 cells express place fields, how large are the fields, and how strong are the fields compared with the noise, all factors that determine the information contained in the spatial representation. Note that a given CA3 unit can express multiple fields.

Figure 1. Network scheme.

Figure 1

The DG-CA3 system indicating examples of the fields attributed to DG units and of those resulting in CA3 units, the connectivity between the two populations, and the noise Inline graphic that replaces, in the model, also the effect of recurrent connections in CA3.

It is convenient to discuss such quantitative details with reference to a standard set of parameters. Our model of reference is a network of DG units with fields represented by Gaussian-like functions of space, with the number of fields per each DG units given by a Poisson distribution with mean value Inline graphic, and parameters as specified in Table 1.

Table 1. Parameters: Values used in the standard version of the model.

Parameter Symbol Standard Value
probability a DG unit is active in one environment Inline graphic 0.033
number of DG inputs to a CA3 unit Inline graphic 50
mean number of fields per active DG unit Inline graphic 1.7
mean number of fields activating a CA3 unit Inline graphic Inline graphic
strength of MF inputs Inline graphic 1, otherwise Inline graphic
noise affecting CA3 activity Inline graphic 1 (in units in which Inline graphic)
sparsity of CA3 activity Inline graphic 0.1

In general, the stronger the mean DG input, the more it dominates over the noise, and also the higher the threshold has to be set in CA3 to make the pattern of activity as sparse as required, by fixing Inline graphic. To control for the trivial advantage of a higher signal-to-noise, we perform comparisons in which it is kept fixed, by adjusting e.g. the MF synaptic strength Inline graphic.

Multiple input cells vs. multiple fields per cell

The first parameter we considered is Inline graphic, the average number of fields for each DG unit, in light of the recent finding that DG units active in a restricted environment are more likely to have multiple fields than CA3 units, and much more often than expected, given their weak probability of being active [18]. We wondered whether receiving multiple fields from the same input units would be advantageous for CA3, and if so whether there is an optimal Inline graphic value. We therefore estimated the mutual information when Inline graphic varies and Inline graphic, the total mean number of DG fields that each CA3 cell receives as input, is kept fixed, by varying Inline graphic correspondigly. As shown in Fig. 2, varying Inline graphic in this manner makes very little difference in the bits conveyed by each CA3 cell. This figure reports the results of computer simulations, that illustrate also the dependence of the mutual information on Inline graphic, the number of cells sampled. The dependence is sub-linear, but rather smooth, with significant fluctuations from sample-to-sample which are largely averaged out in the graph. The different lines correspond to different distributions of the input DG fields among active DG cells projecting to CA3, that is different combinations of values for Inline graphic and Inline graphic, with Inline graphic kept constant; these different distributions do not affect much the information in the representation.

Figure 2. The exact multiplicity of fields in DG units is irrelevant.

Figure 2

Information about position plotted versus the number of CA3 units, Inline graphic from which it is decoded, with the mean number of fields in the input to each CA3 unit constant at the value Inline graphic. Different lines correspond to a different mean number of fields per DG input units, balanced by different mean number of input units per CA3 unit. Inset: analytical estimate of the information per CA3 unit, from numerically integrating Eq. 9.

The analytical estimate of the information per CA3 unit confirms that there is no dependence on Inline graphic (Fig. 2, inset). This is not a trivial result, as it would be if only the parameter Inline graphic entered the analytical expression. Instead, the second section of the Methods shows that the parameters Inline graphic of the Inline graphic-field decomposition depend separately on Inline graphic and Inline graphic, so the fact that the two separate dependencies almost cancel out in a single dependence on their product, Inline graphic, is remarkable. Moreover, such analytical estimate of the information conveyed by one unit does not match the first datapoints, for Inline graphic, extracted from the computer simulation; it is not higher, as might have been expected considering that the simulation requires an additional information loosing decoding step, but lower, by over a factor of 2. The finding that the analytical estimate differs from, and is in fact much lower than, the slope parameter extracted from the simulations, after the decoding step, is further discussed below. Despite their incongruity in absolute values, neither the estimate derived from the simulations nor the analytical estimate have separate dependencies on Inline graphic and Inline graphic, as shown in Fig. 2.

More MF connections, but weaker

Motivated by the striking sparsity of MF connections, compared to the thousands of RC and PP synaptic connections impinging on CA3 cells in the rat, we have then tested the effect of changing Inline graphic without changing Inline graphic. In order to vary the mean number of DG units that project to a single CA3 unit, while keeping constant the total mean input strength, assumed to be an independent biophysically constrained parameter, we varied inversely to Inline graphic the synaptic strength parameter Inline graphic. As shown in Fig. 3, the information presents a maximum at some intermediate value Inline graphic, which is observed both in simulations and in the analytical estimate, despite the fact that again they differ by more than a factor of two.

Figure 3. A sparse MF connectivity is optimal, but not too sparse.

Figure 3

Left: information plotted versus the number of CA3 cells, with different colors for different values of Inline graphic. Dots represent information values obtained from simulations, while curves are exponentially saturating fits to the data points, as described in Methods. Right: plot of the two parameters of the fit curves. Main figure: slope parameter describing the slope of the linear part of the curve (for low Inline graphic), constrasted with the analytical estimate of the term proportional to Inline graphic (Eq.9); inset: total information parameter, describing the saturation level reached by the curve.

Again we find that the analytical estimate differs from, and is in fact much lower than, the slope parameter extracted from the simulations, after the decoding ste. Both measures, however, show that the standard model is not indifferent to how sparse are the MF connections. If they are very sparse, most CA3 units receive no inputs from active DG units, and the competition induced by the sparsity constraint tends to be won, at any point in space, by those few CA3 units that are receiving input from just one active DG unit. The resulting mapping is effectively one-to-one, unit-to-unit, and this is not optimal information-wise, because too few CA3 units are active – many of them in fact have multiple fields (Fig. 4, right), reflecting the multiple fields of their “parent” units in DG. As Inline graphic increases (with a corresponding decrease in MF synaptic weight), the units that win the competition tend to be those that summate inputs from two or more concurrently active DG units. The mapping ceases to be one-to-one, and this increases the amount of information, up to a point. When Inline graphic is large enough that CA3 units begin to sample more effectively DG activity, those that win the competition tend to be the “happy few” that happen to summate several active DG inputs, and this tends to occur at only one place in the environment. As a result, an ever smaller fraction of CA3 units have place fields, and those tend to have just one, often very irregular, as shown in Fig. 4, right. From that point on, the information in the representation decreases monotonically. The optimal MF connectivity is then in the range which maximizes the fraction of CA3 units that have a field in the newly learned environment, at a value, roughly one third, broadly consistent with experimental data (see e.g. [30]).

Figure 4. Information vs. connectivity.

Figure 4

Left: Examples of CA3 firing rate maps for Inline graphic (top row); Inline graphic (middle) and Inline graphic (bottom); Right: Histogram that shows the fraction of CA3 units active somewhere in the environment, left, and the fraction of active CA3 units with more than one field, right, for different Inline graphic values.

It is important to emphasize that what we are reporting is a quantitative effect: the underlying mechanism is always the same, the random summation of inputs from active DG units. DG in the model effectively operates as a sort of random number generator, whatever the values of the various parameters. How informative are the CA3 representations established by that random number generator, however, depends on the values of the parameters.

Other DG field distribution models

We repeated the simulations using other models for the DG fields distribution, the exponential (model B) and the single field one (model C), and the results are similar to those obtained for model A: the information has a maximum when varying Inline graphic on its own, and is instead roughly constant if the parameter Inline graphic is held constant (by varying Inline graphic inversely to Inline graphic). Fig. 5 reports the comparison, as Inline graphic varies, between models A and B, with Inline graphic, and model C, where Inline graphic, so that in this latter case the inputs are 1/1.7 times weaker (we did not compensate by multiplying Inline graphic by 1.7). Information measures are obtained by decoding several samples of 10 units, averaging and dividing by 10, and not by extracting the fit parameters. As one can see, the lower mean input for model C leads to lower information values, but the trend with Inline graphic is the same in all three models. This further indicates that the multiplicity of fields in DG units, as well as its exact distribution, is of no major consequence, if comparisons are made keeping constant the mean number of fields in the input to a CA3 unit.

Figure 5. Information vs. connectivity.

Figure 5

Information plotted versus different values of connectivity between DG and CA3. Solid lines are all from simulations (localization information from samples of 10 units, divided by 10), as follows: for the blue line, the distribution defining the number of fields in DG cells is Poisson (model A); for the green line, it is exponential (model B); and for the red line, each DG active unit has one field only (model C).

Sparsity of DG activity

We study also how the level of DG activity affects the information flow. We choose diffferent values for the probability Inline graphic that a single DG unit fires in the given environment, and again we adjust the synaptic weight Inline graphic to keep the mean DG input per CA3 cell constant across the comparisons.

Results are simular to those obtained varying the sparsity of the MF connections (Fig. 6). Indeed, the analytical estimate in the two conditions would be exactly the same, within the approximation with which we compute it, because the two parameters Inline graphic and Inline graphic enter the calculation in equivalent form, as a product. The actual difference between the two parameters stems from the fact that increasing Inline graphic, CA3 units end up sampling more and more the same limited population of active DG units, while increasing Inline graphic this population increases in size. This difference can only be appreciated from the simulations, which however show that the main effect remains the same: an information maximum for rather sparse DG activity (and sparse MF connections), The subtle difference between varying the two parameters can be seen better in the saturation information value: with reference to the standard case, in the center of the graph in the inset, to the right increasing Inline graphic leads to more information than increasing Inline graphic, while to the left the opposite is the case, as expected.

Figure 6. Sparse DG activity is effective at driving CA3.

Figure 6

Left: Information plotted versus the number of CA3 units, different colors correspond to different values for Inline graphic. Dots represent information values obtained from simulations, while the curves are exponentially saturating fits to the data points, as described in Methods. Right: Plot of the two parameters of the fits. Main figure: slope parameter describing the slope of the linear part of the information curve (for low Inline graphic); inset: total information parameter describing the saturation level reached by the information - both are contrasted with the corresponding measures (dashed lines) obtained varying Inline graphic instead of Inline graphic.

Full and simplified decoding procedures

As noted above, we find that the analytical estimate of the information per unit is always considerably lower than the slope parameter of the fit to the measures extracted from the simulations, contrary to expectations, since the latter require an additional decoding step, which implies some loss of information. We also find, however, that the measures of mutual information that we extract from the simulations are strongly dependent on the method used, in the decoding step, to construct the “localization matrix”, i.e. the matrix which compiles the frequency with which the virtual rat was decoded as being in position Inline graphic when it was actually in position Inline graphic. All measures reported so far, from simulations, are obtained constructing what we call the full localization matrix Inline graphic which, if the square environment is discretized into Inline graphic spatial bins, is a large Inline graphic matrix, which requires of order 160,000 decoding events to be effectively sampled. We run simulations with trajectories of 400,000 steps, and additionally corrected the information measures to avoid the limited sampling bias [31].

An alternative, that allows extracting unbiased measures from much shorter simulations, is to construct a simplified matrix Inline graphic, which averages over decoding events with the same vector displacement between actual and decoded positions. Inline graphic is easily constructed on the torus we used in all simulations, and being a much smaller Inline graphic matrix it is effectively sampled in just a few thousand steps.

The two decoding procedures, given that the simplified matrix is the shifted average of the rows of the full matrix, might be expected to yield similar measures, but they do not, as shown in Fig. 7. The simplified matrix, by assuming translation invariance of the errors in decoding, is unable to quantify the information implicitly present in the full distribution of errors around each actual position. Such errors are of an “episodic” nature: the local view from position Inline graphic might happen to be similar to that from position Inline graphic, hence neural activity reflecting in part local views might lead to confuse the two positions, but this does not imply that another position Inline graphic has anything in common with Inline graphic. Our little network model captures this discrepancy, in showing, in Fig. 7, that for any actual position there are a few selected position that are likely to be erroneously decoded from the activity of a given sample of units; when constructing instead the translationally invariant simplified matrix, all average errors are distributed smoothly around the correct position (zero error), in a roughly Gaussian bell. The upper right panel in Fig. 7 shows that such episodic information always prevails, whatever the connectivity, i.e. in all three parameter regimes illustrated in Fig. 4. The lower right panel in Fig. 7 compares, instead, the entropies of the decoded positions with the two matrices, conditioned on the actual position – that is, the equivocation values. Unlike the mutual information, such equivocation is much higher for the simplified matrix; for this matrix, it is simply a measure of how widely displaced are decoded positions, with respect to the actual positions, represented at the center of the square; and for small samples of units, which are not very informative, the “displacement” entropy approaches that of a flat distribution of decoded positions, i.e. Inline graphic bits. For larger samples, which enable better localization, the simplified localization matrix begins to be clustered in a Gaussian bell around zero displacement, so that the equivocation gradually decreases (the list of displacements, with their frequencies, is computed for each sample, and it is the equivocation, not the list itself, which is averaged across samples). In contrast, the entropy of each row of the full localization matrix, i.e. the entropy of decoded positions conditioned on any actual position, is lower, and also decreasing more steeply with sample size; it differs from the full entropy, in fact, by the mutual information between decoded and actual positions, which increases with sample size. The two equivocation measures therefore both add up to the two mutual information measures to yield the same full entropy of about 8.64 bits (a bit less in the case of the full matrix, where the sampling is more limited), and thus serve as controls that the difference in mutual information is not due, for example, to inaccuracy. As a third crucial control, we calculated also the average conditional entropy of the full localization matrix, when the matrix is averaged across samples of a given size: the resulting entropy is virtually identical to the displacement entropy (which implies instead an average of the full matrix across rows, i.e. across actual positions). This indicates that different samples of units express distinct episodic content at each location, such that averaging across samples is equivalent to averaging across locations.

Figure 7. Localization matrices.

Figure 7

Left: the rows of the full matrix represent the actual positions of the virtual rat while its columns represent decoded positions (the full matrix is actually Inline graphic); three examples of rows are shown, rendered here as Inline graphic squares, all from decoding by a given sample of 10 units. The simplified matrix is a single Inline graphic matrix obtained (from the same sample) as the average of the full matrix taking into account traslation invariance. Right, top: the two procedures lead to large quantitative differences in information (here, the measures from samples of 10 units, divided by 10, from the full matrix, cyan, and from the simplified matrix, black), but with the same dependence on Inline graphic. Right, bottom: The conditional entropies of the full and simplified localization matrices (cyan and black, dashed) in both cases add up to the respective mutual information measure (cyan and black, solid) to give the full entropy of Inline graphic bits (green line). The conditional entropy calculated from the full matrix averaged across samples (red, dashed) is equivalent to that calculated from the displacements, for each sample (black, dashed).

Apparently, also the analytical estimate is unable to capture the spatial information implicit in such “episodic” errors, as its values are well below those obtained with the full matrix, and somewhat above those obtained with the simplified matrix (consistent with some loss with decoding). One may wonder how can the information from the full localization matrix (which also requires a decoding step) be higher than the decoding-free analytical estimate, without violating the basic information processing theorem. The solution to the riddle, as we understand it, is subtle: when decoding, one takes essentially a maximum likelihood estimate, assigning a unique decoded position per trial, or time step. This leads to a “quantized” localization matrix, which in general tends to have substantially higher information content than the “smoothed” matrix based on probabilities [32]. In the analytical derivation there is no concept of trial, time step or maximal likelihood, and the matrix expresses smoothly varying probabilities. The more technical implications are discussed further at the end of the Methods. These differences do not alter the other results of our study, since they affect the height of the curves, not their shape, however they have important implications. The simplified matrix has the advantage of requiring much less data, i.e. less simulation time, but also less real data if applied to neurophysiological recordings, than the full matrix, and in most situations it might be the only feasible measure of spatial information (the analytical estimate is not available of course for real data). So in most cases it is only practical to measure spatial information with methods that, our model suggests, miss out much of the information present in neuronal activity, what we may refer to as “dark information”, not easily revealed. One might conjecture that the prevalence of dark information is linked to the random nature of the spatial code established by DG inputs. It might be that additional stages of hippocampal processing, either with the refinement of recurrent CA3 connections or in CA1, are instrumental in making dark information more transparent.

Effect of learning on the mossy fibers

While the results reported this far assume that MF weights are fixed, Inline graphic, we have also conducted a preliminary analysis of how the amount of spatial information in CA3 might change as a consequence of plasticity on the mossy fibers. In an extension of the standard model, we allow the weights of the connections between DG and CA3 to change with a model “Hebbian” rule. This is not an attempt to capture the nature of MF plasticity, which is not NMDA-dependent and might not be associative [33], but only the adoption of a simple plasticity model that we use in other simulations. At each time step (that corresponds to a different place in space) weights are taken to change as follows:

graphic file with name pcbi.1000759.e185.jpg (10)

where Inline graphic is a plasticity factor that regulates the amount of learning. Modifying in this way the MF weights has the general effect of increasing information values, so that they approach saturation levels for lower number of CA3 cells; in particular this is true for the information extracted from both full and simplified matrices. In Fig. 8, the effect of such “learning” is shown for different values of the parameter Inline graphic, as a function of connectivity.

Figure 8. Information vs. connectivity for different levels of learning.

Figure 8

Information is plotted as a function of the connectivity level between DG and CA3, different colors correspond to different values of the learning factor Inline graphic. Simulations run for 100,000 training steps, during a fraction Inline graphic of which each postsynaptic units is strongly activated, and its incoming weights liable to be modified. The Inline graphic values tested hence span the range from minor modification of the existing weight, for Inline graphic, to major restructuring of all available weights for Inline graphic.

We see that allowing for this type of plasticity on mossy fibers leads to shift the maximum of information as a function of the connectivity level. The structuring of the weights effectively results in the selection of favorite input connections, for each CA3 unit, among a pool of availables ones; the remaining strong connections are a subset of those “anatomically” present originally. It is logical, then, that starting with a larger pool of connnections, among which to pick the “right” ones, leads to more information than starting with few connections, which further decrease in effective number with plasticity. We expect better models of the details of MF plasticity to preserve this main effect.

A further effect of learning, along with the disappearance of some CA3 fields and the strengthening of others, is the refinement of their shape, as illustrated in Fig. 9. It is likely that also this effect will be observed even when using more biologically accurate models of MF plasticity.

Figure 9. MF plasticity can suppress, enlarge and in general refine CA3 place fields.

Figure 9

The place fields of five example units are shown before plasticity is turned on (top row) and after 100,000 steps with a large plasticity factor Inline graphic (bottom row). The rounding and regularization of the fields was observed also for several other units in the simulation.

Retrieval abilities

Finally, all simulations reported so far involved a full complement of DG inputs at each time step in the simulation. We have also tested the ability of the MF network to retrieve a spatial representation when fed with a degraded input signal, with and without MF plasticity. The input is degraded, in our simulation, simply by turning on only a given fraction, randomly selected, of the DG units that would normally be active in the environment. The information extracted after decoding by a sample of units (in Fig. 10, 10 units) is then contrasted with the size of the cue itself. In the absence of MF plasticity, there is obviously no real retrieval process to talk about, and the DG-CA3 network simply relays partial information. When Hebbian plasticity is turned on, the expectation from similar network models (see e.g. [34], Fig. 9) is that there would be some pattern completion, i.e. some tendency for the network to express nearly complete output information when the input is partial, resulting in a more sigmoidal input-output curve (the exact shape of the curve depends of course also on the particular measure used).

Figure 10. Information reconstructed from a degraded input signal.

Figure 10

Slope parameter Inline graphic of the information curve as a function of the percentage of the DG input that CA3 receives. Inset: the same plot for the total information parameter Inline graphic. The same training protocol was run as for Figs. 89.

It is apparent from Fig. 10 that while, in the absence of plasticity, both parameters characterizing the information that can be extracted from CA3 grow roughly linearly with the size of the cue, with plasticity the growth is supralinear. This amounts to the statement that the beneficial effects of plasticity require a full cue to be felt – the conceptual opposite to pattern completion, the process of integrating a partial cue using information stored on modified synaptic weights. This result suggests that the sparse MF connectivity is sub-optimal for the associative storage that leads to pattern completion, a role that current perspectives ascribe instead to perforant path and recurrent connections to CA3. The role of the mossy fibers, even if plastic, may be limited to the establishment of new spatial representations.

Discussion

Ours is a minimal model, which by design overlooks several of the elements likely to play an important role in the functions of the dentate gyrus - perhaps foremost, neurogenesis [35]. Nevertheless, by virtue of its simplicity, the model helps clarify a number of quantitative issues that are important in refining a theoretical perspective of how the dentate gyrus may work.

First, the model indicates that the recently discovered multiplicity of place fields by active dentate granule cells [18] might be just a “fact of life”, with no major computational implications for dentate information processing. Still, requiring that active granule cells express multiple fields seems to lead, in another simple network model (of how dentate activity may result from entorhinal cortex input [25]), to the necessity of inputs coming from lateral EC, as well as from medial EC. The lateral EC inputs need not carry any spatial information but help to select the DG cells active in one environment. Thus the multiplicity of DG fields refines the computational constraints on the operation of hippocampal circuits.

Second, the model shows that, assuming a fixed total MF input strength on CA3 units, it is beneficial in information terms for the MF connectivity to be very sparse; but not vanishingly sparse. The optimal number of anatomical MF connections on CA3 units, designated as Inline graphic in the model, depends somewhat on the various parameters (the noise in the system, how sparse is the activity in DG and CA3, etc.) and it may increase slightly when taking MF plasticity into account, but it appears within the range of the number, 46, reported for the rat by [26]. It will be interesting to see whether future measures of MF connectivity in other species correspond to those “predicted” by our model once the appropriate values of the other parameters are also experimentally measured and inserted into the model. A similar set of consideration applies to the fraction of granule cells active in a given environment, Inline graphic, which in the model plays a similar, though not completely identical, role to Inline graphic in determining information content.

Third, the model confirms that the sparse MF connections, even when endowed with associative plasticity, are not appropriate as devices to store associations between input and output patterns of activity – they are just too sparse. This reinforces the earlier theoretical view [2], [4], which was not based however on an analysis of spatial representations, that the role of the dentate gyrus is in establishing new CA3 representations and not in associating them to representations expressed elsewhere in the system. Availing itself of more precise experimental paramaters, and based on the spatial analysis, the current model can refine the earlier theoretical view and correct, for example, the notion that “detonator” synapses, firing CA3 cells on a one-to-one basis, would be optimal for the mossy fiber system. The optimal situation turns out to be the one in which CA3 units are fired by the combination of a couple of DG input units, although this is only a statistical statement. Whatever the exact distribution of the number of coincident inputs to CA3, DG can be seen as a sort of random pattern generator, that sets up a CA3 pattern of activity without any structure that can be related to its anatomical lay-out [36], or to the identity of the entorhinal cortex units that have activated the dentate gyrus. As with random number generators in digital computers, once the product has been spit out, the exact process that led to it can be forgotten. This is consistent with experimental evidence that inactivating MF transmission or lesioning the DG does not lead to hippocampal memory impairments once the information has already been stored, but leads to impairments in the storage of new information [6], [7]. The inability of MF connection to subserve pattern completion is also consistent with suggestive evidence from imaging studies with human subjects [37].

Fourth, and more novel, our findings imply that a substantial fraction of the information content of a spatial CA3 representation, over half when sampling limited subsets of CA3 units, can neither be extracted through the simplified method which assumes translation invariance, nor assessed through the analytical method (which anyway requires an underlying model of neuronal firing, and is hence only indirectly applicable to real neuronal data). This large fraction of the information content is only extracted through the time-consuming construction of the full localization matrix. To avoid the limited sampling bias [38] this would require, in our hands, the equivalent of a ten hour session of recording from a running rat (!), with a square box sampled in Inline graphic spatial bins. We have hence labeled this large fraction as dark information, which requires a special effort to reveal. Although we know little of how the real system decodes its own activity, e.g. in downstream neuronal populations, we may hypothesize that the difficulty at extracting dark information affects the real system as well, and that successive stages of hippocampal processing have evolved to address this issue. If so, qualitatively this could be characterized as the representation established in CA3 being episodic, i.e. based on an effectively random process that is functionally forgotten once completed, and later processing, e.g. in CA1, may be thought to gradually endow the representations with their appropriate continuous spatial character. Another network model, intended to elucidate how CA1 could operate in this respect, is the object of our on-going analysis.

The model analysed here does not include neurogenesis, a most striking dentate phenomenon, and thus it cannot comment on several intriguing models that have been put forward about the role of neurogenesis in the adult mammalian hippocampus [39], [40], [41]. Nevertheless, presenting a simple and readily expandable model of dentate operation can facilitate the development of further models that address neurogenesis, and help interpret puzzling experimental observations. For example, the idea that once matured newborn cells may temporally “label” memories of episodes occurring over a few weeks [42], [43], [44], [45] has been weakened by the observation that apparently even young adult-born cells, which are not that many [45], [46], [47], are very sparsely active, perhaps only a factor of two or so more active than older granule cells [24]. Maybe such skepticism should be reconsidered, and the issue reanalysed using a quantitative model like ours. One could then investigate the notion that the new cells link together, rather than separating, patterns of activity with common elements (such as the temporal label). To do that clearly requires extending the model to include a description not only of neurogenesis, but also of plasticity within DG itself [48] and of its role in the establishment of successive representations one after the other.

Methods

Replica calculation

Estimation of the equivocation

Calculating the equivocation from its definition in Eq.7 is straightforward, thanks to the simplifying assumption of independent noise in CA3 units. We get

graphic file with name pcbi.1000759.e200.jpg (11)

where

graphic file with name pcbi.1000759.e201.jpg

although the spatial integral remains to be carried out.

Estimation of the entropy

For the entropy, Eq.6, the calculation is more complicated. Starting from

graphic file with name pcbi.1000759.e202.jpg

we remove the logarithm using the replica trick (see [27])

graphic file with name pcbi.1000759.e203.jpg (12)

which can be rewritten (Nadal and Parga [49] have shown how to use the replica trick in the Inline graphic limit, a suggestion used in [50] to analyse information transfer in the CA3-CA1 system)

graphic file with name pcbi.1000759.e205.jpg (13)

using the spatial averages, defined for an arbitrary real-valued number Inline graphic of replicas

graphic file with name pcbi.1000759.e207.jpg (14)

where we have defined a quantity dependent on both the number Inline graphic of replicas and on the position in space, later to be integrated over, of each replica Inline graphic:

graphic file with name pcbi.1000759.e210.jpg

We need therefore to carry out integrals over the firing rate of each CA3 unit, Inline graphic, in order to estimate Inline graphic, while keeping in mind that in the end we want to take Inline graphic. Carrying out the integrals yields a below-threshold and an above-threshold term

graphic file with name pcbi.1000759.e214.jpg (15)

where we have defined the quantities

graphic file with name pcbi.1000759.e215.jpg (16)

and Inline graphic, while Inline graphic.

One might think that Inline graphic, hence in the product over cells, that defines the entropy Inline graphic, the only terms that survive in the limit Inline graphic would just be the summed single-unit contributions obtained from the first derivatives with respect to Inline graphic. This is not true, however, as taking the replica limit produces the counterintuitive effect that replica-tensor products of terms, which individually disappear for Inline graphic, only vanish to first order in Inline graphic, as shown by [29]. The replica method is therefore able, in principle, to quantify the effect of correlations among units, expressed in entropy terms stemming from the product of Inline graphic across units.

Briefly, one has

graphic file with name pcbi.1000759.e225.jpg (17)

where the first two rows come from the term below threshold, and the last two from the one above threshold. Then, following [29],

graphic file with name pcbi.1000759.e226.jpg (18)

where

graphic file with name pcbi.1000759.e227.jpg (19)

and where we have considered that in the limit Inline graphic we have Inline graphic appear in all terms of finite weight.

The products between the matrices Inline graphic attached to each CA3 unit generate the higher order terms in Inline graphic. Calculating them in our case, in which different CA3 units can receive partially overlapping inputs from DG units, is extremely complex (see [51], where information transmission across a network is also considered), and we do not pursue here the analysis of such higher order terms. One can retrieve the result of the TG model in Ref. [29] by taking the further limit Inline graphic, which implies Inline graphic and Inline graphic. A further subtlety is that, in taking the Inline graphic limit, there is a single replica, say Inline graphic, which is counted once in the limit, but also several different replicas, denoted Inline graphic, whose weights vanish, but which remain to determine e.g. the terms proportional to Inline graphic emerging from the derivatives. Thus, in the very last term of Eq. 17, one has to derive Inline graphic with respect to Inline graphic to produce the Inline graphic term of Eq. 19, which is absent in [29] because it vanishes with Inline graphic. In the off-diagonal terms of the Inline graphic matrix there are Inline graphic entries dependent on replicas Inline graphic and Inline graphic, and Inline graphic entries dependent on replicas Inline graphic and Inline graphic.

Focusing now solely on terms of order Inline graphic, note that the term Inline graphic is effectively a spatial signal. In the Inline graphic limit it can be rewritten, using Inline graphic for the single surviving replica, as

graphic file with name pcbi.1000759.e254.jpg

This allows us to derive, to order Inline graphic, our result for the spatial information content, Eq. 8.

Note that when the threshold of each unit tends to Inline graphic, and therefore its mean activation Inline graphic, our units behave as threshold-less linear units with gaussian noise, and the information they convey tends to

graphic file with name pcbi.1000759.e258.jpg (20)

which is simply expressed in terms of a spatial signal-to-noise ratio, and coincides with the results in Refs. [28], [29].

Inline graphic-Field decomposition

Eqs. 8 and 9 simply sum equivalent average contributions from each CA3 unit. Each such contribution can then be calculated as a series in Inline graphic, the number of DG fields feeding into the CA3 unit. One can in fact write, for example,

graphic file with name pcbi.1000759.e261.jpg

where in each term there are Inline graphic active DG units, indexed by Inline graphic, presynaptic to CA3 unit Inline graphic, and each has Inline graphic fields (including the possibility that Inline graphic), indexed by Inline graphic. A similar expansion can be written for the other terms. One then realizes that the spatial component reduces to integrals that depend solely on the total number of fields Inline graphic, no matter how many DG active units they come from, and the expansion can be rearranged into an expansion in Inline graphic

graphic file with name pcbi.1000759.e270.jpg (21)

where one of the components in each term is, for example,

graphic file with name pcbi.1000759.e271.jpg (22)

with Inline graphic the mean signal-to-noise at position Inline graphic produced by Inline graphic fields, from no matter how many DG units. The numerical coefficient Inline graphic, instead, stems from the combination of the distribution for the number of fields for each presynaptic DG unit active in the environment, which differs between models A, B and C, and the Poisson distribution for the number of such units

graphic file with name pcbi.1000759.e276.jpg

The sum extends in principle to Inline graphic, but in practice it can be truncated after checking that successive terms give vanishing contributions. The appropriate truncation point obviously depends on the mean number of fields Inline graphic, as well as on the model distribution of fields per unit. Note that the first few terms (e.g. for Inline graphic) may give negative but not necessarily negligible contributions if the effective threshold Inline graphic is high.

For model A,

graphic file with name pcbi.1000759.e281.jpg

and combining the two Poisson series one finds

graphic file with name pcbi.1000759.e282.jpg (23)

where Inline graphic and the other Inline graphic are the polynomials

graphic file with name pcbi.1000759.e285.jpg

given by the modified Khayyam-Tartaglia recursion relation

graphic file with name pcbi.1000759.e286.jpg

and where Inline graphic.

For model B,

graphic file with name pcbi.1000759.e288.jpg

and combining the Poisson with the exponential series one finds

graphic file with name pcbi.1000759.e289.jpg (24)

where again Inline graphic, while the other Inline graphic are the distinct polynomials

graphic file with name pcbi.1000759.e292.jpg

given by the further modified Khayyam-Tartaglia recursion relation

graphic file with name pcbi.1000759.e293.jpg

and where Inline graphic.

For model C,

graphic file with name pcbi.1000759.e295.jpg

there is no parameter Inline graphic (i.e., Inline graphic), and one simply finds

graphic file with name pcbi.1000759.e298.jpg (25)

Note that in the limit Inline graphic, when the mean input per CA3 unit Inline graphic remains finite, for both models A and B one finds

graphic file with name pcbi.1000759.e301.jpg

which is equivalent to Eq. 25, in line with the fact that both models A and B reduce, in the Inline graphic limit, to single-field distributions, but even units with single fields become vanishingly rare, so formally one has to scale up the mean number of active presynaptic units, Inline graphic, to keep Inline graphic finite and establish the correct comparison to model C.

Sparsity and threshold

The analytical relation between the threshold Inline graphic of CA3 units and the sparsity Inline graphic of the layer is obtained starting from the formula defining the sparsity Inline graphic (see below) which can be rewritten

graphic file with name pcbi.1000759.e308.jpg (26)

Since in the analytical calculation we have Inline graphic as parameter, this equation can be taken as a relation Inline graphic which has to be inverted to allow a comparison with the simulations, which are run controlling the sparsity level at a predefined level (in our case Inline graphic) and adjusting the threshold parameter accordingly. The inversion requires using the Inline graphic-field decomposition and numerical integration. A graphical example of the numerical relation is given in Fig. 11.

Figure 11. Sparsity-threshold relation.

Figure 11

The sparsity Inline graphic of CA3 layer vs. the threshold Inline graphic of CA3 units, from the numerical integration of Eq. 26. Different lines correspond to different degrees of connectivity between DG and CA3.

Simulations

The mathematical model described above was simulated with a network of 15000 DG cells and 500 CA3 cells. A virtual rat explores a continuous two dimensional space, intended to represent a Inline graphic square environment but realized as a torus, with periodic boundary conditions. For the numerical estimation of mutal information, the environment is discretized in a grid of Inline graphic locations, whereas trajectories are in continuous space, but in discretized time steps. In each time step (intended to correspond to roughly Inline graphic, half a theta cycle) the virtual rat moves half a grid unit (Inline graphic) in a direction similar to the direction of the previous time step, with a small amount of noise. To allow construction of a full localization matrix with good statistics, simulations are run for typically 400,000 time steps (while for the simplified translationally invariant matrix 5,000 steps would be sufficient). The space has periodic boundary conditions, as in a torus, to avoid border effects; the longest possible distance between any two locations is hence equal to 14.1 grid units, or Inline graphic.

DG place fields

After assigning a number of firing fields for each DG units, according to the distributions of models A, B and C, we assign to each field a randomly chosen center. The shape of the field is then given by a Gaussian bell with that center. The tails of the Gausssian function are truncated to zero when the distance from the center is larger than a fixed radius Inline graphic, with Inline graphic the ratio between the area of the field and the environment area Inline graphic. In the standard model, only about 3 percent of the DG units on average are active in a given environment, in agreement with experimental findings [24]; i.e. the DG firing probability is Inline graphic. The firing of DG units is not affected by noise, nor by any further threshold. Peak firing is conventionally set, in the center of the field, at the value Inline graphic, but DG units can fire at higher levels if they are assigned two or more overlapping fields.

CA3 activation

CA3 units fire according to Eq. 2: the firing of a CA3 unit is a linear function of the total incoming DG input, distorted by a noise term. This term is taken from a gaussian distribution centered on zero, with variance Inline graphic, and it changes for each unit and each time step. A threshold is imposed in the simulations to model the action of inhibition, hypothesizing that it serves to adjust the sparsity Inline graphic of CA3 activity to its required value. The sparsity is defined as

graphic file with name pcbi.1000759.e327.jpg

and it is fixed to Inline graphic. This implies that the activity of the CA3 cells population is under tight inhibitory control.

The decoding procedure and information extraction

At each time step, the firing vector of a set of CA3 units is compared to all the average vectors recorded at each position in the Inline graphic grid, for the same sample, in a test trial (these are called template vectors). The comparison is made calculating the Euclidean distance between the current vector and each template, and the position of the closest template is taken to be the decoded position at that time step, for that sample. This procedure has been termed maximum likelihood Euclidean distance decoding [32]. The frequency of each pair of decoded and real positions are compiled in a so-called “confusion matrix”, or localization matrix, that reflects the ensemble of conditional probabilities Inline graphic for that set of units. Should decoding “work” in a perfect manner, in the sense of always detecting the correct position in space of the virtual rat, the confusion matrix would be the identity matrix. From the confusion matrix obtained at the end of the simulation, the amount of information is extracted, and plotted versus the number of CA3 units present in the set. We averaged extensively over CA3 samples, as there are large fluctuations from sample to sample, i.e. for each given number of CA3 units we randomly picked several different groups of CA3 units and then averaged the mutual information values obtained. In all the results reported we averaged also over 3–4 simulation run with a different random number generator, i.e. over different trajectories. The same procedure leading to the information curve was repeated for different values of the parameters. In all the information measures we reported, we also corrected for the limited sampling bias, as discussed by [31]. In our case of spatial information, the bias is essentially determined by the spatial binning we used (Inline graphic) and by the decoding method [52].

One should note the maximum likelihood decoding procedure to better understand the discrepancy between the information estimated from simulations (with the procedure based on the full matrix) and that calculated analytically. The analytical calculation distinguishes in a clear-cut manner so called annealed variables, which are interpreted as “fast” noise and are averaged in computing the relation between position and neuronal activity, and so called quenched variables, which are interpreted as frozen disorder and are averaged over only later, in computing average the entropy, free-energy or mutual information [27]. In using maximum likelihood decoding, instead, the localization matrix that relates actual and decoding position effectively averages only trial-to-trial variability, i.e. the noise that occurs on intermediate time scales. The variability on genuinely fast time scales is suppressed, in fact, by the maximum likelihood operation, which acts as a sort of temporal low pass filter with a cut-off time equal to one time step. This suppression of part of the annealed noise leads to larger information values extracted from the simulations, and hence to the notion of “dark” information. In the real system, the spiking nature of neuronal activity may induce a similar cut-off, although its quantitative relation to the one-time-step cut-off in the simulations (here intended to be half a theta cycle) remains to be firmly established.

Fitting

We fit the information curves obtained in simulations to exponentially saturating curves as a function of Inline graphic in order to get the values of the two most relevant parameter that describe their shape: the initial slope Inline graphic (i.e. the average information conveyed by the activity of individual units) and the total amount of information Inline graphic (i.e. the asymptotic saturation value). The function we used for the fit is the following

graphic file with name pcbi.1000759.e335.jpg (27)

In most cases the fit was in excellent agreement with individual data points, as expected on the basis of previous analyses [28].

Acknowledgments

We had valuable discussion with Jill Leutgeb, Bailu Si and Federico Stella.

Footnotes

The authors have declared that no competing interests exist.

This work was partially supported by the EU Spacebrain grant. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Treves A, Tashiro A, Witter MP, Moser EI. What is the mammalian dentate gyrus good for? Neuroscience. 2008;154:1155–1172. doi: 10.1016/j.neuroscience.2008.04.073. [DOI] [PubMed] [Google Scholar]
  • 2.McNaughton BL, Morris RGM. Hippocampal synaptic enhancement and information storage within a distributed memory system. Trends Neurosci. 1987;10:408–415. [Google Scholar]
  • 3.Rolls ET. Functions of neuronal networks in the hippocampus and neocortex in memory. In: Byrne JH, Berry WO, editors. Neural Models of Plasticity: Experimental and Theoretical Approaches. San Diego: Academic Press; 1989. pp. 240–265. [Google Scholar]
  • 4.Treves A, Rolls ET. Computational constraints suggest the need for two distinct input systems to the hippocampal CA3 network. Hippocampus. 1992;2:189–199. doi: 10.1002/hipo.450020209. [DOI] [PubMed] [Google Scholar]
  • 5.Marr D. Simple memory: A theory for archicortex. Philos Trans R Soc Lond B Biol Sci. 1971;262:23–81. doi: 10.1098/rstb.1971.0078. [DOI] [PubMed] [Google Scholar]
  • 6.Lassalle JM, Bataille T, Halley H. Reversible inactivation of the hippocampal mossy fiber synapses in mice impairs spatial learning, but neither consolidation nor memory retrieval, in the Morris navigation task. Neurobiol Learn Mem. 2000;73:243–257. doi: 10.1006/nlme.1999.3931. [DOI] [PubMed] [Google Scholar]
  • 7.Lee I, Kesner RP. Encoding versus retrieval of spatial memory: Double dissociation between the dentate gyrus and the perforant path inputs into CA3 in the dorsal hippocampus. Hippocampus. 2004;14:66–76. doi: 10.1002/hipo.10167. [DOI] [PubMed] [Google Scholar]
  • 8.Leutgeb S, Leutgeb JK, Barnes CA, Moser EI, McNaughton BL, et al. Independent codes for spatial and episodic memory in hippocampal neuronal ensembles. Science. 2005;309:619–623. doi: 10.1126/science.1114037. [DOI] [PubMed] [Google Scholar]
  • 9.Samsonovich A, McNaughton BL. Path integration and cognitive mapping in a continuous attractor neural network model. J Neurosci. 1997;17:5900–5920. doi: 10.1523/JNEUROSCI.17-15-05900.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Battaglia FP, Treves A. Attractor neural networks storing multiple space representations: A model for hippocampal place fields. Phys Rev E. 1998;58:7738–7753. [Google Scholar]
  • 11.Stringer MS, Rolls ET. Invariant object recognition in the visual system with novel views of 3D objects. Neural Comput. 2002;14:2585–2596. doi: 10.1162/089976602760407982. [DOI] [PubMed] [Google Scholar]
  • 12.Tsodyks M, Sejnowski T. Associative memory and hippocampal place cells. Int J Neural Syst. 1995;6:81–86. [Google Scholar]
  • 13.Hamaguchi K, Hatchett JPL. Analytic solution of neural network with disordered lateral inhibition. Phys Rev E Stat Nonlin Soft Matter Phys. 2006;73:art. 051104. doi: 10.1103/PhysRevE.73.051104. [DOI] [PubMed] [Google Scholar]
  • 14.Papp G, Witter MP, A. Treves A. The CA3 network as a memory store for spatial representations. Learn Mem. 2007;14:732–744. doi: 10.1101/lm.687407. [DOI] [PubMed] [Google Scholar]
  • 15.Roudi Y, Treves A. Representing where along with what information in a model of a cortical patch. PLoS Comput Biol. 2008;4:e1000012. doi: 10.1371/journal.pcbi.1000012. doi: 10.1371/journal.pcbi.1000012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hafting T, Fyhn M, Molden S, Moser MB, Moser EI. Microstructure of a spatial map in the entorhinal cortex. Science. 2005;436:801–806. doi: 10.1038/nature03721. [DOI] [PubMed] [Google Scholar]
  • 17.Jung MW, Wiener SI, McNaughton BL. Comparison of spatial firing characteristics of units in dorsal and ventral hippocampus of the rat. J Neurosci. 1994;14:7347–7356. doi: 10.1523/JNEUROSCI.14-12-07347.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Leutgeb JK, Leutgeb S, Moser MB, Moser EI. Pattern separation in the dentate gyrus and CA3 of the hippocampus. Science. 2007;315:961–966. doi: 10.1126/science.1135801. [DOI] [PubMed] [Google Scholar]
  • 19.Aimone JB, Wiskott L. Computational modeling of neurogenesis. Adult Neurogenesis. 2008;52:463–481. [Google Scholar]
  • 20.Treves A, Rolls ET. What determines the capacity of autoassociative memories in the brain? Network: Computation in Neural Systems. 1991;2:371–397. doi: 10.1088/0954-898X/2/4/004. [Google Scholar]
  • 21.Wilson MA, McNaughton BL. Dynamics of the hippocampal ensemble code for space. Science. 1993;261:1055–1058. doi: 10.1126/science.8351520. [DOI] [PubMed] [Google Scholar]
  • 22.Treves A. Graded-response neurons and information encodings in autoassociative memories. Phys Rev A Gen Phys. 1990;42:2418–2430. doi: 10.1103/physreva.42.2418. [DOI] [PubMed] [Google Scholar]
  • 23.Kropff E, Treves A. The emergence of grid cells: Intelligent design or just adaptation? Hippocampus. 2008;18:1256–1269. doi: 10.1002/hipo.20520. [DOI] [PubMed] [Google Scholar]
  • 24.Chawla MK, Guzowski JF, Ramirez-Amaya V, Lipa P, Hoffman KL, et al. Sparse, environmentally selective expression of arc rna in the upper blade of the rodent fascia dentata by brief spatial experience. Hippocampus. 2005;15:579–586. doi: 10.1002/hipo.20091. [DOI] [PubMed] [Google Scholar]
  • 25.Si B, Treves A. The role of competitive learning in the generation of DG fields from EC inputs. Cogn Neurodyn. 2009;3:119–187. doi: 10.1007/s11571-009-9079-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Amaral DG, Ishizuka N, Claiborne B. Neurons, numbers and the hippocampal network. Prog Brain Res. 1990;83:1–11. doi: 10.1016/s0079-6123(08)61237-6. [DOI] [PubMed] [Google Scholar]
  • 27.Mezard M, Parisi G, Virasoro . Spin glasses and beyond. World Scientific; 1986. [Google Scholar]
  • 28.Samengo I, Treves A. Representational capacity of a set of independent neurons. Phys Rev E Stat Nonlin Soft Matter Phys. 2000;63:art. 011910, 2000. doi: 10.1103/PhysRevE.63.011910. [DOI] [PubMed] [Google Scholar]
  • 29.DelPrete V, Treves A. Theoretical model of neuronal population coding of stimuli with both continuous and discrete dimensions. Phys Rev E Stat Nonlin Soft Matter Phys. 2001;64:art. 021912, Jul 2001. doi: 10.1103/PhysRevE.64.021912. [DOI] [PubMed] [Google Scholar]
  • 30.Leutgeb S, Leutgeb JK, Treves A, Moser MB, Moser EI. Distinct ensemble codes in hippocampal areas CA3 and CA1. Science. 2004;305:1295–1298. doi: 10.1126/science.1100265. [DOI] [PubMed] [Google Scholar]
  • 31.Treves A, Panzeri S. The upward bias in measures of information derived from limited data samples. Neural Comput. 1995;2:399–407. [Google Scholar]
  • 32.Rolls ET, Treves A, Tovee MJ. The representational capacity of the distributed encoding of information provided by populations of neurons in primate temporal visual cortex. Exp Brain Res. 1997;114:149–162. doi: 10.1007/pl00005615. [DOI] [PubMed] [Google Scholar]
  • 33.Nicoll RA, Schmitz D. Synaptic plasticity at hippocampal mossy fibre synapses. Nat Rev Neurosci. 2005;6:863–876. doi: 10.1038/nrn1786. [DOI] [PubMed] [Google Scholar]
  • 34.Treves A. Computational constraints between retrieving the past and predicting the future, and the CA3-CA1 differentiation. Hippocampus. 2004;14:539–556. doi: 10.1002/hipo.10187. [DOI] [PubMed] [Google Scholar]
  • 35.Kuhn HG, Dickinson-Anson H, Gage FH. Neurogenesis in the dentate gyrus of the adult rat: age-related decrease of neuronal progenitor proliferation. J Neurosci. 1996;16:2027–2033. doi: 10.1523/JNEUROSCI.16-06-02027.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Redish AD, Battaglia FP, Chawla MK, Ekstrom AD, Gerrard JL, et al. Independence of firing correlates of anatomically proximate hippocampal pyramidal cell. J Neurosci. 2001;21: RC134:1–6. doi: 10.1523/JNEUROSCI.21-05-j0004.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Bakker A, Kirwan CB, Miller M, Stark CEL. Pattern separation in the human hippocampal CA3 and dentate gyrus. Science. 2008;319:1640–1642. doi: 10.1126/science.1152882. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Panzeri S, Treves A. Analytical estimates of limited sampling biases in different information measures. Network: Computation in Neural Systems. 1996;7:87–107. doi: 10.1080/0954898X.1996.11978656. [DOI] [PubMed] [Google Scholar]
  • 39.Aimone JB, Wiles J, Gage FH. Potential role for adult neurogenesis in the encoding of time in new memories. Nat Neurosci. 2006;9:723–727. doi: 10.1038/nn1707. [DOI] [PubMed] [Google Scholar]
  • 40.Becker S. A computational principle for hippocampal learning and neurogenesis. Hippocampus. 2005;15:722–738. doi: 10.1002/hipo.20095. [DOI] [PubMed] [Google Scholar]
  • 41.Wiskott L, Rasch MJ, Kempermann G. A functional hypothesis for adult hippocampal neurogenesis: Avoidance of catastrophic interference in the dentate gyrus. Hippocampus. 2006;16:329–343. doi: 10.1002/hipo.20167. [DOI] [PubMed] [Google Scholar]
  • 42.Kee N, Teixeira CM, Wang AH, Frankland PW. Preferential incorporation of adult-generated granule cells into spatial memory networks in the dentate gyrus. Nat Neurosci. 2007;10:355–362. doi: 10.1038/nn1847. [DOI] [PubMed] [Google Scholar]
  • 43.Ge S, Yang C, Hsu K, Ming G, Song H. A critical period for enhanced synaptic plasticity in newly generated neurons of the adult brain. Neuron. 2007;54:559–566. doi: 10.1016/j.neuron.2007.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Buzzetti RA, Marrone DF, Schaner MJ, Chawla MK, Bohanick JD, et al. Do dentate gyrus granule cells tag time-specific experiences? Soc Neurosci Abstr. 2007;744:16. [Google Scholar]
  • 45.Tashiro A, Makino H, Gage FH. Experience-specific functional modification of the dentate gyrus through adult neurogenesis: A critical period during an immature stage. J Neurosci. 2007;27:3252–3259. doi: 10.1523/JNEUROSCI.4941-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Cameron HA, Mckay RDG. Adult neurogenesis produces a large pool of new granule cells in the dentate gyrus. J Comp Neurol. 2001;435:406–417. doi: 10.1002/cne.1040. [DOI] [PubMed] [Google Scholar]
  • 47.McDonald HY, Wojtowicz JM. Dynamics of neurogenesis in the dentate gyrus of adult rats. Neurosci Lett. 2005;385:70–75. doi: 10.1016/j.neulet.2005.05.022. [DOI] [PubMed] [Google Scholar]
  • 48.McHugh TJ, Jones MW, Quinn JJ, Balthasar N, Coppari R, et al. Dentate gyrus nmda receptors mediate rapid pattern separation in the hippocampal network. Science. 2007;317:94–99. doi: 10.1126/science.1140263. [DOI] [PubMed] [Google Scholar]
  • 49.Nadal J, Parga N. Information processing by a perceptron in an unsupervised learning task. Network: Computation in Neural Systems. 1993;4:295–312. doi: 10.1088/0954-898X/4/3/004. [Google Scholar]
  • 50.Treves A. Quantitative estimate of the information relayed by the schaffer collaterals. J Comput Neurosci. 1995;2:259–272. doi: 10.1007/BF00961437. [DOI] [PubMed] [Google Scholar]
  • 51.DelPrete V, Treves A. Replica symmetric evaluation of the information transfer in a two-layer network in the presence of continuous and discrete stimuli. Phys Rev E Stat Nonlin Soft Matter Phys. 2002;65:art. 041918. doi: 10.1103/PhysRevE.65.041918. [DOI] [PubMed] [Google Scholar]
  • 52.Panzeri S, Treves A, Schultz S, Rolls ET. On decoding the responses of a population of neurons from short time windows. Neural Comput. 1999;11:1553–1577. doi: 10.1162/089976699300016142. [DOI] [PubMed] [Google Scholar]

Articles from PLoS Computational Biology are provided here courtesy of PLOS

RESOURCES