Significance
False memories can arise in daily life through a mixture of factors, including misinformation and prior conceptual knowledge. This can have serious consequences in settings, such as legal eyewitness testimony, which depend on the accuracy of memory. We investigated the brain basis of false memory with fMRI, and found that patterns of activity in the temporal pole region of the brain can predict false memories. Furthermore, we show that each individual has unique patterns of brain activation that can predict their own idiosyncratic set of false-memory errors. Together, these results suggest that the temporal pole may be responsible for the conceptual component of illusory memories.
Keywords: false memory, semantic, temporal pole, fMRI, pattern similarity
Abstract
Recent advances in neuroscience have given us unprecedented insight into the neural mechanisms of false memory, showing that artificial memories can be inserted into the memory cells of the hippocampus in a way that is indistinguishable from true memories. However, this alone is not enough to explain how false memories can arise naturally in the course of our daily lives. Cognitive psychology has demonstrated that many instances of false memory, both in the laboratory and the real world, can be attributed to semantic interference. Whereas previous studies have found that a diverse set of regions show some involvement in semantic false memory, none have revealed the nature of the semantic representations underpinning the phenomenon. Here we use fMRI with representational similarity analysis to search for a neural code consistent with semantic false memory. We find clear evidence that false memories emerge from a similarity-based neural code in the temporal pole, a region that has been called the “semantic hub” of the brain. We further show that each individual has a partially unique semantic code within the temporal pole, and this unique code can predict idiosyncratic patterns of memory errors. Finally, we show that the same neural code can also predict variation in true-memory performance, consistent with an adaptive perspective on false memory. Taken together, our findings reveal the underlying structure of neural representations of semantic knowledge, and how this semantic structure can both enhance and distort our memories.
Each of us has a vast store of semantic knowledge that we apply to incoming sensory data to extract meaning from the world around us. Semantic representations are capable of capturing important structural features of the world at many different levels of abstraction, which allows for rapid and flexible responses to a diverse array of environmental challenges. This preexisting knowledge structure guides ongoing cognition, which usually aids performance, but under some circumstances can lead us into error (1–3). A striking example is the widely studied DRM (Deese, Roediger, and McDermott) false-memory illusion (4, 5). In a typical DRM task, subjects are asked to memorize a set of words such as “snow,” “winter,” “ice,” and “warm.” After a delay, subjects will typically falsely remember having seen the semantically related word “cold.” It is widely agreed that this memory illusion is driven by the semantic relatedness between words contained in the encoding list (e.g., “snow”) and falsely remembered words that were not actually presented (e.g., “cold”). As such, it is thought that each list item automatically, but weakly, activates the semantically related concept (Fig. 1A). This activation leads to memory confusion, either through a cumulative priming of the related lure (5, 6) or the encoding of the semantic overlap as a “gist” memory (3), resulting in a false memory unless the error is detected by some internal monitoring process (7). As such, the DRM effect provides a powerful method for investigating the nature of false memories, as well as the structure of semantic knowledge and its effects on cognition.
Despite the well-characterized cognitive mechanisms involved in the DRM effect (7), its neural basis is currently not well understood. Previous neuroimaging and patient studies have provided robust evidence that a core network of regions in the medial and lateral temporal lobe, as well as frontal and parietal regions (8–16), is involved when encoding or retrieving semantic false memories. However, a mechanistic understanding of how these regions generate false memories is lacking. In particular, although it is known that the semantic relatedness between the different words drives the illusion (3, 6, 7), little is known about the neural basis of this semantic relatedness. Computational models of semantic cognition propose that concepts are represented by a similarity-based code in an amodal “semantic hub,” situated in the apex of the ventral processing stream in the temporal pole (TP) (17, 18). Although other regions, such as the temporo-parietal cortex (19), have also been linked to the representation of abstract conceptual knowledge, the TP is most consistently implicated in both patient and neuroimaging studies (17, 20, 21).
These computational models therefore make clear predictions about the expected neural basis of semantic false memory. Namely, the TP semantic hub should contain a similarity-based code, such that the neural representations of DRM words reflect the known semantic relatedness between those words. Furthermore, the likelihood that a given word list will generate a false memory should be directly related to the degree of neural overlap. This prediction has not previously been investigated, despite the clear implications for understanding both false memory and the structure of semantic knowledge. Here we used fMRI to measure the neural overlap between DRM lists and related lures, allowing us to directly test this prediction.
Results
We used a representational similarity analysis approach, which uses the neural pattern similarity between pairs of stimuli to infer the representational similarity (22). This method is therefore well-suited for assessing neural overlap between semantic representations (23–25), as the degree of overlap should be directly reflected in the representational similarity. We used this approach to measure the degree of neural overlap between each set of DRM words and their related lure word (Fig. 1). Crucially, each DRM list is known to have a different probability of inducing a false memory, with some much greater than others (26, 27). If our prediction is correct, then we should find a brain region displaying a direct correspondence between the degree of neural overlap and false-memory likelihood across the different DRM word lists. To measure the neural representations of the DRM word lists, 18 participants viewed 40 separate four-word DRM lists, along with the 40 associated lure words (Table S1), while we collected fMRI data. While viewing the words, subjects performed an incidental categorization task (man-made or natural) to ensure that all words were processed at the semantic level. We used the canonical false-recognition scores reported in ref. 27 as our measure of false-memory likelihood, and applied a searchlight analysis (28) across the whole brain to establish whether any brain region displayed the predicted positive correlation between neural overlap and false-memory likelihood.
Table S1.
Critical lure | List items | |||
Cold | Snow | Warm | Winter | Ice |
Smell | Nose | Breathe | Sniff | Aroma |
Window | Door | Glass | Pane | Ledge |
Rough | Smooth | Bumpy | Road | Tough |
Cup | Mug | Saucer | Tea | Coaster |
Soft | Hard | Pillow | Plush | Loud |
Sleep | Bed | Rest | Awake | Tired |
Wish | Want | Dream | Desire | Hope |
Anger | Mad | Fear | Hate | Rage |
Sweet | Sour | Candy | Sugar | Bitter |
Trash | Garbage | Waste | Can | Refuse |
Justice | Peace | Law | Courts | Judge |
Chair | Table | Sit | Legs | Seat |
Smoke | Cigarette | Puff | Blaze | Billows |
High | Low | Clouds | Up | Tall |
Doctor | Nurse | Sick | Lawyer | Medicine |
Stove | Hot | Heat | Pipe | Cook |
Thief | Steal | Robber | Crook | Burglar |
Mountain | Hill | Valley | Climb | Summit |
Music | Note | Sound | Piano | Sing |
Slow | Fast | Lethargic | Stop | Listless |
Needle | Thread | Pin | Eye | Sewing |
River | Water | Stream | Lake | Boat |
Rubber | Elastic | Bounce | Gloves | Tire |
Bread | Butter | Food | Eat | Sandwich |
City | Town | Crowded | State | Capital |
Lamp | Light | Shade | Bulb | Post |
Foot | Shoe | Hand | Toe | Kick |
Man | Woman | Husband | Uncle | Lady |
Flag | Banner | Symbol | Anthem | Stripes |
Girl | Boy | Dolls | Female | Young |
Spider | Web | Insect | Bug | Fright |
Pen | Pencil | Write | Fountain | Leak |
Command | Order | Obey | Officer | Performance |
Health | Sickness | Good | Happiness | Wealth |
Shirt | Blouse | Sleeves | Pants | Tie |
Trouble | Bad | Shooter | Worry | Danger |
Army | Navy | Soldier | Rifle | Draft |
Whiskey | Drink | Drunk | Beer | Liquor |
Black | White | Dark | Cat | Charred |
The left-hand column lists the 40 critical lure items, and the right-hand columns list the four list items associated with each of the critical lures. The lists are ranked from high to low false-memory likelihood.
This analysis revealed a significant cluster in the left TP, with no other significant information anywhere else in the brain. This result provides evidence that this specific region is responsible for encoding the semantic relatedness between thematically related words (Fig. 2). Furthermore, this result shows that the precise level of neural overlap in the TP predicts the probability that a false memory will be constructed for a given DRM list. This result is therefore fully consistent with the computational accounts of semantic cognition, and demonstrates that a similarity-based code in the TP is capable of generating false memories. Strikingly, our measure of false-memory likelihood is a canonical measure taken from an independent set of subjects (26, 27), yet we can nevertheless successfully predict this information based purely on the neural data of our group of subjects. This result clearly demonstrates a robust level of agreement across different individuals in the neural representation of the concepts contained within the DRM lists. Thus, it appears that the TP is responsible for representing a shared conceptual space, which is a vital component of successful communication. For completeness, we also looked for regions displaying a negative correlation between neural overlap and false-memory likelihood, although it is not clear that any such correlation is theoretically meaningful. This analysis revealed a single significant cluster in the right superior frontal gyrus [peak Montreal Neurological Institute (MNI) coordinates: 24, 20, 50; T = 4.71; cluster extent = 163 voxels].
To ensure that TP neural data are really capturing meaningful semantic representations, we ran an additional set of control analyses based on the functionally defined TP region of interest (ROI). We examined four issues. First, if the neural data are capturing semantic relatedness between the lure and list items, we should find that each lure is more similar to its own list than any other list, regardless of any differences in false-memory strength. To assess this hypothesis, we directly compared the neural similarity within and across the 40 DRM sets. As expected, this analysis revealed a significant within-set increase in similarity (Z = 3.11, P < 0.001). Second, to ensure that the neural effects were not driven by extraneous factors, such as the word frequency or the visual similarity of the lure and list words, we investigated whether either of these factors correlated with the TP neural overlap. Neither of these variables significantly predicted the neural data (word frequency: Z = 0.02, P = 0.98; visual similarity: Z = 1.72, P = 0.085), suggesting that they are not significant drivers of neural similarity in this region. Third, the task performed in the scanner while subjects viewed the words was a semantic category judgment task (man-made or natural). Although the categorical nature of the encoding task was incidental to the effect of interest, it is nevertheless possible that neural representations related to the semantic categories are present in the TP. To investigate this possibility, we derived a subject-specific measure of task category similarity between the lure and list of each DRM set based on the pattern of responses to each word (Methods). We found no evidence for a correlation between this variable and the neural data (Z = 0.46, P = 0.65), suggesting that task-driven categorical representations are not present in the TP. Finally, we investigated whether the correlation between neural overlap and canonical false-memory strength was still present after controlling for the three additional variables (word frequency, visual similarity, and categorical representation). Using a cross-validated ROI approach to avoid issues of statistical circularity (29), we found clear evidence for a significant correlation between neural overlap and canonical false memory even after partialling out the control variables (Z = 2.03, P = 0.021). Thus, our result cannot be explained by extraneous factors, such as word frequency or visual similarity. We further explored each of these three control variables using a searchlight analysis across the whole brain, but none of these analyses revealed any significant results. Given that our measure of neural overlap is in each case based on the average pattern expressed over four list items, this will greatly reduce the power of any analysis that is not explicitly based on some shared representation, such as the semantic gist. Thus, it is not surprising that these additional analyses did not find any significant results.
Although our initial analysis focused on shared semantic representations, it is also likely that each of us forms some idiosyncratic semantic associations through our own individual experience. Such quirks of experience could lead to measurable differences in neural overlap in the TP, and consequently to unique patterns of false-memory errors. To investigate this possibility, our subjects participated in a DRM false-memory recognition task in a separate session that took place several weeks before the scanning session. The same 40 DRM word lists were used in both the behavioral and scanning sessions, which allowed us to directly compare each individual’s neural data to their behavioral data. As expected, the subjects displayed the typical false-memory effect, and committed a large number of high-confidence false alarms to the critical lure stimuli (Fig. 3). As a further quality-control check, we explored the consistency of our group’s behavioral data compared with the canonical false-memory data (26, 27). The group false-memory likelihood correlated positively with both the canonical data r(39) = 0.53, P < 0.001 and with the neural data (Z = 1.68, P = 0.047), demonstrating that this subject group’s data conform to the canonical data, as expected.
However, the key question was whether there might be a subject-specific mapping between the TP neural overlap and the pattern of false-memory errors, over and above any shared semantic representations common to all subjects. To assess this issue we used an individuation analysis (30), comparing the within-subject neural-behavioral correlations (unique semantic information) to the between-subject neural-behavioral correlations (shared semantic information), in each case controlling for the canonical false-memory strength to remove additional shared semantic information. This analysis revealed a significantly higher within- than between-subject correlation (Z = 2.63, P = 0.0042). This result was still significant after additionally partialling out the influence of the three control variables discussed above (Z = 2.55, P = 0.0054). This result provides clear evidence that each individual has a partially unique set of semantic representations within the TP that have a direct impact on memory distortions (Fig. 4). Such a result cannot be explained by incidental differences in TP physiology or anatomy alone (30), as these more basic properties would not predict each subject’s false-memory behavior.
Importantly, the fMRI and behavioral data for each subject were collected in separate sessions separated by many weeks, which demonstrates that the structure of neural overlap must be stable over at least this length of time, and plausibly for much longer than this. This long delay also minimized any possible influence of the initial behavioral session on the neural representations expressed during scanning. To further ensure that there was no such influence, we leveraged the wide range of intersession delay lengths across subjects (minimum = 21 d, maximum = 239 d) that emerged as a consequence of differences in subject availability. If there were an effect of the behavioral session due to memory for the items experienced in this session, we would expect this effect to degrade over time. We would therefore expect a negative correlation between the length of delay and the strength of neural-behavioral mapping. In fact, we find a nonsignificant positive correlation instead (r = 0.26, P = 0.30), which clearly shows that memory is not enhancing the neural overlap data.
The main focus of this study was to investigate the neural basis of false memory. However, an adaptive perspective on false memory (1) would suggest that our semantic knowledge should aid cognition under most circumstances (31), rather than purely acting as a source of memory distortion. This hypothesis would therefore suggest that we ought to also find a positive correlation between neural overlap and true-memory performance for the list items that were actually presented during encoding. As predicted, we found a significant mapping between TP neural overlap and true-memory strength (Z = 2.33, P = 0.0099), which remained significant after partialling out the three control variables (Z = 2.29, P = 0.011). We also investigated the possibility that subject-specific TP neural coding might predict true memory performance, using an individuation analysis (30). This analysis demonstrated a significant individuation effect in the TP for true memories (Z = 2.16, P = 0.016) as well as false memories, and this result remained significant after partialling out the three control variables (Z = 2.11, P = 0.017). These results provide clear evidence that the semantic similarity code within this region can be beneficial for memory systems, as well as potentially leading to memory distortions.
Discussion
Our study demonstrates that the left TP contains partially overlapping neural representations of related concepts, and that the extent of this neural overlap directly reflects the degree of semantic similarity between the concepts. Furthermore, the neural overlap between sets of related words predicts the likelihood of making a false-memory error. Together, these findings provide support for neural network models of semantic cognition that posit that the TP uses a similarity-based coding scheme to sustain amodal-distributed representations of individual concepts and their relationships within an abstract semantic space (17, 18). The similarity-based coding of concepts has significant computational advantages in allowing efficient generalization of existing knowledge to novel situations, while still allowing for the “grounding” of each concept in a full set of domain-specific cortical regions mediated by the hub-like connectivity of the TP (17, 18), consistent with previous studies showing that domain-specific semantic features are indeed distributed across a wide range of cortical regions (32). Nevertheless, our results suggest that the type of coding scheme underpinning the representation of concepts within the TP has a potential cost, specifically the emergent property of false memories.
Although the focus of our experiment was on the structure of semantic representations, it is likely that interactions between regions in the medial temporal lobe and the TP are critical to the generation of false memories. Although our data cannot speak directly to this issue, there are two clear lines of evidence that are suggestive. First, results from both rodent (33) and human research (34) demonstrate that false memories can be established as memory “engrams” within the hippocampus through artificial manipulation or reconsolidation processes. Second, recent studies have shown that recognition memory performance can be driven by a dynamic similarity-based computation within the medial temporal lobe at retrieval (35, 36). It is likely that both sets of processes interact with the semantic representations in the TP to produce semantic false memories (3, 6, 37).
Although we found clear evidence for a semantic code within the left TP, other neighboring regions have also been found to contain semantic-like representations, particularly the anterior ventral temporal cortex (23, 25). We suggest that these differing locations are likely a result of the level of semantic abstraction, as the set of studies with results located in anterior ventral temporal cortex all used highly concrete stimuli (either concrete words, or direct use of pictures). In contrast, our set of words included many abstract concepts, such as “justice” and “desire” (see Table S1 for full list). This suggestion is supported by a meta-analysis that contrasted regions involved in abstract versus concrete words, and that found clear evidence that the TP was more active in response to abstract words, whereas ventral temporal regions showed a preference for concrete words (20).
Finally, our results show that each individual’s unique TP representations predict idiosyncratic patterns of false-memory errors. Given that we rely on shared semantic representation to communicate with one another, this individual variation is perhaps surprising. However, it does converge with other recent reports of individual differences in semantic (30) and episodic representation (38), and suggests that divergent personal experience is sufficient to create individually unique representations in higher-level semantic regions. This striking finding suggests that it will be important to further characterize both the shared and individually unique aspects of semantic cognition to better understand the nature of conceptual knowledge.
Methods
Participants.
Eighteen participants (11 female; mean age 22, range 18–29) took part in this study. All were right-handed native English speakers and had normal or corrected-to-normal vision. The study was approved by the University College London Research Ethics Committee, and all participants gave written consent to take part in the study.
Stimuli.
For both testing sessions, the stimuli were drawn from 40 standard DRM lists (26, 27). Because of time-constraints in the fMRI scanning session, we used only four list items from each DRM list (see Fig. 3B for a comparison with previous results based on the full set of 15 list items). Where possible, these were four list items with the highest associative strength with the lure word. However, this full set of 40 lists contains some words that were repeated across lists, and some words that we considered to be culturally specific in semantic relatedness to the related concept, such as “United States” in the “army” list. Any unsuitable list words were therefore excluded, and alternative, lower list associates were included instead. The full set of stimuli used is displayed in Table S1. In total, the stimulus set included 40 DRM related concept lures and 160 associated DRM list items. For the behavioral DRM task, we also used an additional 160 unrelated novel words, which were matched to the DRM lists in average concreteness and frequency.
Overall Task Structure.
All participants took part in two separate experimental sessions separated by several weeks (mean 65 d, minimum = 21, maximum = 239). The first session was a behavioral session involving a standard DRM recognition paradigm, providing subject-specific false-memory data. The second session was a fMRI session, where subjects viewed words taken from the DRM lists during an incidental task. Both sessions are explained in more detail below. We elected to run the behavioral session before the scanning session to ensure that repeated exposure to the DRM words during scanning did not impact the behavioral false-memory effects. This procedure therefore provided us with a “pure” measure of individual false memories. However, we acknowledge that this design could still have the reverse problem, in that there could be carryover effects from the behavioral to the fMRI session. The long delay between sessions was built in to minimize any such issues, and further control analyses were conducted to further rule this out as a problem (Results).
Behavioral DRM Task.
The first testing session was purely behavioral and involved a standard DRM recognition paradigm. During an encoding phase, subjects were presented with 40 sets of four-word DRM lists. Participants were instructed to memorize as many of the presented words as possible. For each list, the four words were presented consecutively for 500 ms each, with a 3-s interval between each list. The order of the 40 lists was randomized across subjects. Subjects were then required to perform an incidental visual discrimination task for 15 min, to minimize explicit rehearsal of the list words. Following this distraction period, subjects were given a recognition memory test for the previously presented words. All 160 DRM list words were presented, along with the 40 related concept lure words and 160 unrelated novel words. This set of 360 words was presented one at a time, in a randomized order. Subjects were required to decide whether they thought the word was old or new, along with a confidence judgment (sure or unsure). The task was self-paced, with no time limit.
Behavioral DRM Analysis.
The recognition-memory test data were analyzed to determine whether the subjects displayed the expected false-memory effect. To do this analysis, for each subject we calculated the proportion of words categories as “old” for each of the three conditions (old items, new items, and lure items). This approach provided us with a measure of the hit rate, false-alarm rate, and for the related concept lure items, false-memory rate. To assess whether the expected false-memory effects were present, we conducted a series of planned pairwise comparisons between conditions using two-tailed Wilcoxon signed rank tests. First, we tested for a basic recognition memory effect by comparing the hit rate to the false-alarm rate. Next, we investigated whether our subjects displayed the expected false-memory effect, by comparing the false-memory rate with the false-alarm rate. Following this process, we compared false-memory and false-alarm rate using just high-confidence trials to determine whether the task induced robust false memories. Finally, we investigated whether the pattern of false-memory errors across the 40 DRM lists correlated with the false-memory rate reported by ref. 27, despite the fact that we used only the first four list associates from each DRM list, as opposed to the full set of 15. All behavioral results are reported in Fig. 3.
fMRI Task.
Several weeks later, the same subjects came in for a second testing session in the fMRI scanner. During each of four functional runs, the 40 DRM related concepts and 160 DRM list words were presented one at a time for 3 s each. Thus, in total, each word was presented four times, to allow a more stable estimate of the neural pattern. The order of presentation was randomized, with the additional constraint that words from the same DRM set were never presented consecutively. The behavioral task involved a semantic category decision for each presented word. Specifically, subjects had to decide whether they thought each word was more related to the category of “man-made” or “natural,” and indicate this by pressing the relevant button. Because we were simply interested in measuring the neural patterns expressed for each DRM concept, the task itself was incidental. However, we reasoned that a semantic categorization task would require the subjects to fully process the semantic meaning of each word. The fact that each word would be repeated four times during scanning introduces a possible source of noise because of novelty effects on the first presentation. We therefore allowed the subjects one practice block before entering the scanner to reduce any novelty signals in the subsequent scanning session.
MRI Scan Details.
It is well known that fMRI of the TPs can be problematic because of susceptibility artifacts (signal dropout) in this region, which can substantially reduce blood-oxygen level-dependent (BOLD) sensitivity (39). We therefore elected to use a 1.5 Tesla MRI scanner, which suffers from less-pronounced dropout in this region, and therefore can actually have greater BOLD sensitivity than higher field-strength scanners (40). The precise sequences used were further optimized to reduce signal dropout in ventral anterior regions, including the TP. All MRI data were collected using a Siemens Avanto 1.5 Tesla MRI scanner with a 32-channel head coil at the Birkbeck-UCL Centre for Neuroimaging in London. The functional data were acquired using a gradient-echo echo-planar imaging sequence in an ascending sequence, with a slice thickness of 2 mm and a 1-mm gap, TR = 85 ms, TE = 50 ms, slice tilt = −30°, field-of-view 192 mm, and matrix size 64 × 64. The whole brain was acquired with 40 slices, leading to a volume acquisition time of 3.4 s. The precise slice tilt was chosen as a compromise between sensitivity, coverage, and speed (40, 41). Following functional imaging, an anatomical image was acquired for each participant (T1-weighted FLASH, TR = 12 ms, TE = 5.6 ms, 1-mm3 resolution).
fMRI Preprocessing.
The first six functional volumes were discarded to allow for T1 equilibration. The remaining data were slice-time corrected and spatially realigned. Each participant’s structural image was coregistered to the first functional image. The structural images were segmented, and the deformations estimated during this step were applied to both the structural and functional images to normalize them into MNI space. All preprocessing steps were conducted using SPM12. Default parameters were chosen for each step.
Pattern Estimation.
We were interested in investigating neural overlap between the neural representations of each set of DRM list words and their related concept. Our hypothesis was that each DRM list word should have a neural representation that overlaps with that related concept. To assess this hypothesis, we estimated two patterns for each DRM set: one pattern for the related concept itself, and another for all four of the list words combined. This latter pattern captured the neural pattern that was common across all four list items, which should therefore capture the representational overlap. If our hypothesis is correct, then this pattern should correlate with the related concept pattern. To estimate this set of patterns, we used the GLMdenoise toolbox (42), which implements a de-noising step in addition to estimating the β-weights for each regressor. Each pattern was estimated using an event-related regressor indicating the onset of the related concept/set of list words across the four functional sessions. This procedure resulted in a set of 80 β-weight images (40 DRM list patterns and 40 related lure concept patterns). These were converted to t-statistics by dividing the parameter estimate by the estimate of the SE, thereby normalizing the responses of each voxel (43). The resulting t-statistic images were left unsmoothed to preserve any fine-grained spatial information (44).
Searchlight Analysis.
We used a searchlight representational similarity analysis (22, 28) to search for brain regions containing the predicted neural code. Representational similarity analysis uses the neural pattern similarity between pairs of stimuli to infer the representational similarity. This approach is therefore highly appropriate for assessing neural overlap between semantic representations, as the degree of overlap should be directly reflected in the representational similarity. We first assessed the neural overlap between each DRM list and its related lure concept by measuring the Pearson correlation between the pair of voxel patterns. In each case, we normalized the similarity data by subtracting the mean Pearson correlation between the DRM list and each unrelated lure concept. This procedure removed any general effects of similarity that were not driven by semantic relatedness, and resulted in a vector of 40 neural overlap scores for the 40 DRM lists. Our prediction is that the degree of neural overlap within the TP should reflect semantic relatedness, and therefore predict false-memory likelihood across the 40 DRM lists. We used the canonical false-recognition scores reported in (26, 27), as our measure of false-memory likelihood for the 40 DRM lists. We used a searchlight approach (28) to search across the whole brain for regions containing a neural code consistent with our predictions. This approach involves stepping through each voxel in the brain, and in each case running a representational similarity analysis on the cluster of voxels surrounding that central voxel (for all analyses, we used a spherical searchlight with 10-mm radius). We used a variation of this approach, where the value at each voxel was the average value of all searchlight analyses that included that voxel. This information-averaging approach more accurately reflects the multivariate nature of the analysis, and results in a smoother image (45). For computational efficiency, we restricted our analysis to a whole-brain gray-matter mask, created by averaging the normalized, segmented gray-matter images, and applying a threshold of 0.5. This searchlight approach was applied to the analysis described above, using a Fisher-transformed Pearson correlation to assess the mapping between neural overlap and false-memory likelihood in each searchlight. This approach was repeated for all subjects, and statistical significance at each voxel was assessed at the group level using a nonparametric permutation approach (46). This procedure provides a means of applying strict family-wise error correction for multiple comparisons without any parametric assumptions. For this analysis, 10,000 permutations were applied with 10-mm variance smoothing, and a standard cluster threshold of T > 3 was used to assess statistical significance (46). Only regions that are significant at P < 0.05 with family-wise error correction are reported.
TP ROI.
To further explore the representations contained within the TP, a ROI was created based on the initial searchlight results. The ROI included all voxels within the TP that passed the searchlight cluster threshold of t > 3. The resulting ROI consisted of 92 voxels.
Correlation Between Canonical False Memory and Neural Overlap.
In Fig. 2B we report the correlation between the group average TP neural overlap for each DRM list and the canonical false-memory likelihood (26, 27), to illustrate the strength of the relationship. To avoid an artificial inflation of the effect size estimate due to nonindependence in the choice of ROI, we used a leave-one-subject-out cross-validation approach (29). On a given cross-validation fold, we took the searchlight maps for 17 of 18 subjects, averaged the maps, and selected the top 200 voxels. We then used these voxels as an ROI to measure the neural overlap in the 40 DRM lists for the remaining subject (note that in this case we did not normalize the neural overlap score by subtracting the between-list correlation, as we consider the raw correlation values to be descriptively more informative). This procedure was repeated 18 times, each time leaving out a different subject. This analysis resulted in neural overlap data for all 18 subjects based on an independently selected ROI, thereby avoiding statistical “double-dipping” (29). We averaged the neural overlap across the subjects to create a single summary neural overlap score for each of the 40 DRM lists. This score was then correlated with the canonical false-memory scores, as reported in Fig. 2B. An additional correlation was conducted after removing two potential outliers, identified using a bootstrapped Mahalanobis distance, and a threshold of Ds > 6. Note that whereas this cross-validation approach is guaranteed to provide an unbiased correlation, it is not guaranteed that the voxels used will be based on the same TP region as reported in the searchlight results. We therefore investigated the number of voxels falling with the TP region [defined using the Harvard–Oxford Atlas (47)] for each fold of the cross-validation. Every single fold included at least some voxels within this region, ranging from 5 to 42, with an average of 25.5. Thus, a good proportion of the neural information going into this analysis was indeed based on the TP.
False Memory Individuation Analysis.
To test for the presence of unique TP neural information that predicts subject-specific false memories, we used an individuation analysis (30). The logic here is that if an individual has a unique set of TP neural representations that meaningfully influences cognition, then that individual’s neural overlap data should predict their own pattern of false-memory errors better than any other subject’s false-memory errors. To assess this possibility, we created a false-memory vector for each individual subject based on their specific pattern of false-memory errors in the behavioral DRM session. Given that we were specifically interested in genuine false memories rather than mistakes driven by uncertainty, we defined a false memory as a high-confidence old response to a related concept lure. The false-memory vector was in each case a binary vector of 40 values for the 40 DRM lists, with a 1 indicating the DRM lists that result in a false memory, and 0 elsewhere. For each subject, we calculated the Spearman correlation between their TP neural overlap and false-memory data (within-subject correlation). We then calculated the Spearman correlation between that subject’s neural overlap and each other subject’s false-memory data, and averaged across these correlation values to provide a summary between-subject correlation. This procedure resulted in a within-subject and between-subject correlation for every subject. To assess whether there was significantly greater within-subject predictive information in the neural data, we compared the within- and between-subject correlations with a Wilcoxon sign rank test. A one-tailed test was used a result of our one-sided hypothesis that the correlation should be greater within- than between-subject.
True-Memory Individuation Analysis.
To be consistent with the false-memory analysis, we defined a “true memory” as a high-confidence old response to a previously presented DRM list item. The true-memory vector was created by calculating for each DRM list the proportion of words that were judged to be old with high confidence. This resulted in a true-memory vector of length 40 for each individual subject. The true-memory individuation analysis was otherwise identical to the false-memory individuation analysis described above.
Control Analyses.
To ensure that the results were driven by neural similarity that was specific to the false-memory strength and not additional extraneous factors, we conducted three control analyses. First, we established that word frequency (48) was not contributing to the neural data. Given that the neural overlap was based on the similarity between each lure and the respective list items, the absolute word frequency of the critical lure could not by itself explain this measure. Instead, we calculated the average difference in word frequency between each DRM lure and the four DRM list items. Second, we investigated whether visual similarity between the words could be contributing to the effects. We used the Levenshtein edit distance to assess the similarity between each pair of words, as this has been shown to be a good predictor of various lexical effects (49) and is therefore appropriate for assessing low-level word similarity. For each DRM set, the visual similarity was defined as the average edit distance between the lure and list items. Finally, we explored whether the incidental task performed in the scanner could be driving the neural similarity results. To investigate this possibility, for each of the 40 DRM lists we quantified the number of list words where the subject had indicated the same category as the related lure concept for that list. The stronger the degree of correspondence, the stronger any task category representation should be for that particular DRM list. Each of these three control variables was correlated with the neural overlap data to determine whether each significantly contributed to the neural data. Additionally, all three variables were controlled for in each analysis reported in Results.
Acknowledgments
H.J.S. is funded by the James S. McDonnell Foundation and The Wellcome Trust; D.L.S. is supported by National Institute of Mental Health Grant MH060941.
Footnotes
The authors declare no conflict of interest.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1610686113/-/DCSupplemental.
References
- 1.Schacter DL, Guerin SA, St Jacques PL. Memory distortion: An adaptive perspective. Trends Cogn Sci. 2011;15(10):467–474. doi: 10.1016/j.tics.2011.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Qin S, Hermans EJ, van Marle HJF, Fernández G. Understanding low reliability of memories for neutral information encoded under stress: Alterations in memory-related activation in the hippocampus and midbrain. J Neurosci. 2012;32(12):4032–4041. doi: 10.1523/JNEUROSCI.3101-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Reyna VF, Corbin JC, Weldon RB, Brainerd CJ. How fuzzy-trace theory predicts true and false memories for words, sentences, and narratives. J Appl Res Mem Cogn. 2016;5(1):1–9. doi: 10.1016/j.jarmac.2015.12.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Deese J. On the prediction of occurrence of particular verbal intrusions in immediate recall. J Exp Psychol. 1959;58(1):17–22. doi: 10.1037/h0046671. [DOI] [PubMed] [Google Scholar]
- 5.Roediger HL, McDermott KB. Creating false memories: Remembering words not presented in lists. J Exp Psychol Learn Mem Cogn. 1995;21(4):803–814. [Google Scholar]
- 6.Roediger HL, III, Balota DA, Watson JM. Spreading activation and arousal of false memories. In: Roediger HL III, Nairne JS, Neath I, Surprenant AM, editors. The Nature of Remembering: Essays in Honor of Robert G. Crowder. APA; Washington, DC: 2001. pp. 95–115. [Google Scholar]
- 7.Gallo DA. False memories and fantastic beliefs: 15 years of the DRM illusion. Mem Cognit. 2010;38(7):833–848. doi: 10.3758/MC.38.7.833. [DOI] [PubMed] [Google Scholar]
- 8.Schacter DL, Verfaellie M, Pradere D. The neuropsychology of memory illusions: False recall and recognition in amnesic patients. J Mem Lang. 1996;35(2):319–334. [Google Scholar]
- 9.Cabeza R, Rao SM, Wagner AD, Mayer AR, Schacter DL. Can medial temporal lobe regions distinguish true from false? An event-related functional MRI study of veridical and illusory recognition memory. Proc Natl Acad Sci USA. 2001;98(8):4805–4810. doi: 10.1073/pnas.081082698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.McDermott KB, Watson JM, Ojemann JG. Presurgical language mapping. Curr Dir Psychol Sci. 2005;14(6):291–295. [Google Scholar]
- 11.Kim H, Cabeza R. Differential contributions of prefrontal, medial temporal, and sensory-perceptual regions to true and false memory formation. Cereb Cortex. 2007;17(9):2143–2150. doi: 10.1093/cercor/bhl122. [DOI] [PubMed] [Google Scholar]
- 12.Boggio PS, et al. Temporal lobe cortical electrical stimulation during the encoding and retrieval phase reduces false memories. PLoS One. 2009;4(3):e4959. doi: 10.1371/journal.pone.0004959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Gallate J, Chi R, Ellwood S, Snyder A. Reducing false memories by magnetic pulse stimulation. Neurosci Lett. 2009;449(3):151–154. doi: 10.1016/j.neulet.2008.11.021. [DOI] [PubMed] [Google Scholar]
- 14.Drowos DB, Berryhill M, André JM, Olson IR. True memory, false memory, and subjective recollection deficits after focal parietal lobe lesions. Neuropsychology. 2010;24(4):465–475. doi: 10.1037/a0018902. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Atkins AS, Reuter-Lorenz PA. Neural mechanisms of semantic interference and false recognition in short-term memory. Neuroimage. 2011;56(3):1726–1734. doi: 10.1016/j.neuroimage.2011.02.048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Warren DE, Jones SH, Duff MC, Tranel D. False recall is reduced by damage to the ventromedial prefrontal cortex: Implications for understanding the neural correlates of schematic memory. J Neurosci. 2014;34(22):7677–7682. doi: 10.1523/JNEUROSCI.0119-14.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Patterson K, Nestor PJ, Rogers TT. Where do you know what you know? The representation of semantic knowledge in the human brain. Nat Rev Neurosci. 2007;8(12):976–987. doi: 10.1038/nrn2277. [DOI] [PubMed] [Google Scholar]
- 18.McClelland JL, Rogers TT. The parallel distributed processing approach to semantic cognition. Nat Rev Neurosci. 2003;4(4):310–322. doi: 10.1038/nrn1076. [DOI] [PubMed] [Google Scholar]
- 19.Skipper-Kallal LM, Mirman D, Olson IR. Converging evidence from fMRI and aphasia that the left temporoparietal cortex has an essential role in representing abstract semantic knowledge. Cortex. 2015;69:104–120. doi: 10.1016/j.cortex.2015.04.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Wang J, Conder JA, Blitzer DN, Shinkareva SV. Neural representation of abstract and concrete concepts: A meta-analysis of neuroimaging studies. Hum Brain Mapp. 2010;31(10):1459–1468. doi: 10.1002/hbm.20950. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Clarke A, Tyler LK. Understanding what we see: How we derive meaning from vision. Trends Cogn Sci. 2015;19(11):677–687. doi: 10.1016/j.tics.2015.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kriegeskorte N, Kievit RA. Representational geometry: Integrating cognition, computation, and the brain. Trends Cogn Sci. 2013;17(8):401–412. doi: 10.1016/j.tics.2013.06.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Bruffaerts R, et al. Similarity of fMRI activity patterns in left perirhinal cortex reflects semantic similarity between words. J Neurosci. 2013;33(47):18597–18607. doi: 10.1523/JNEUROSCI.1548-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Devereux BJ, Clarke A, Marouchos A, Tyler LK. Representational similarity analysis reveals commonalities and differences in the semantic processing of words and objects. J Neurosci. 2013;33(48):18906–18916. doi: 10.1523/JNEUROSCI.3809-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Carlson TA, Simmons RA, Kriegeskorte N, Slevc LR. The emergence of semantic meaning in the ventral temporal pathway. J Cogn Neurosci. 2014;26(1):120–131. doi: 10.1162/jocn_a_00458. [DOI] [PubMed] [Google Scholar]
- 26.Stadler MA, Roediger HL, 3rd, McDermott KB. Norms for word lists that create false memories. Mem Cognit. 1999;27(3):494–500. doi: 10.3758/bf03211543. [DOI] [PubMed] [Google Scholar]
- 27.Roediger HL, 3rd, Watson JM, McDermott KB, Gallo DA. Factors that determine false recall: A multiple regression analysis. Psychon Bull Rev. 2001;8(3):385–407. doi: 10.3758/bf03196177. [DOI] [PubMed] [Google Scholar]
- 28.Kriegeskorte N, Goebel R, Bandettini P. Information-based functional brain mapping. Proc Natl Acad Sci USA. 2006;103(10):3863–3868. doi: 10.1073/pnas.0600244103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kriegeskorte N, Simmons WK, Bellgowan PSF, Baker CI. Circular analysis in systems neuroscience: The dangers of double dipping. Nat Neurosci. 2009;12(5):535–540. doi: 10.1038/nn.2303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Charest I, Kievit RA, Schmitz TW, Deca D, Kriegeskorte N. Unique semantic space in the brain of each beholder predicts perceived similarity. Proc Natl Acad Sci USA. 2014;111(40):14565–14570. doi: 10.1073/pnas.1402594111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.van Kesteren MTR, Ruiter DJ, Fernández G, Henson RN. How schema and novelty augment memory formation. Trends Neurosci. 2012;35(4):211–219. doi: 10.1016/j.tins.2012.02.001. [DOI] [PubMed] [Google Scholar]
- 32.Mitchell TM, et al. Predicting human brain activity associated with the meanings of nouns. Science. 2008;320(5880):1191–1195. doi: 10.1126/science.1152876. [DOI] [PubMed] [Google Scholar]
- 33.Ramirez S, et al. Creating a false memory in the hippocampus. Science. 2013;341(6144):387–391. doi: 10.1126/science.1239073. [DOI] [PubMed] [Google Scholar]
- 34.Edelson M, Sharot T, Dolan RJ, Dudai Y. Following the crowd: Brain substrates of long-term memory conformity. Science. 2011;333(6038):108–111. doi: 10.1126/science.1203557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.LaRocque KF, et al. Global similarity and pattern separation in the human medial temporal lobe predict subsequent memory. J Neurosci. 2013;33(13):5466–5474. doi: 10.1523/JNEUROSCI.4293-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Davis T, Xue G, Love BC, Preston AR, Poldrack RA. Global neural pattern similarity as a common basis for categorization and recognition memory. J Neurosci. 2014;34(22):7472–7484. doi: 10.1523/JNEUROSCI.3376-13.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Arndt J, Hirshman E. True and false recognition in MINERVA2: Explanations from a global matching perspective. J Mem Lang. 1998;39(3):371–391. [Google Scholar]
- 38.Chadwick MJ, Bonnici HM, Maguire EA. CA3 size predicts the precision of memory recall. Proc Natl Acad Sci USA. 2014;111(29):10720–10725. doi: 10.1073/pnas.1319641111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Devlin JT, et al. Susceptibility-induced loss of signal: Comparing PET and fMRI on a semantic task. Neuroimage. 2000;11(6 Pt 1):589–600. doi: 10.1006/nimg.2000.0595. [DOI] [PubMed] [Google Scholar]
- 40.Weiskopf N, Hutton C, Josephs O, Deichmann R. Optimal EPI parameters for reduction of susceptibility-induced BOLD sensitivity losses: A whole-brain analysis at 3 T and 1.5 T. Neuroimage. 2006;33(2):493–504. doi: 10.1016/j.neuroimage.2006.07.029. [DOI] [PubMed] [Google Scholar]
- 41.Weiskopf N, Hutton C, Josephs O, Turner R, Deichmann R. Optimized EPI for fMRI studies of the orbitofrontal cortex: Compensation of susceptibility-induced gradients in the readout direction. MAGMA. 2007;20(1):39–49. doi: 10.1007/s10334-006-0067-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Kay KN, Rokem A, Winawer J, Dougherty RF, Wandell BA. GLMdenoise: A fast, automated technique for denoising task-based fMRI data. Front Neurosci. 2013;7:247. doi: 10.3389/fnins.2013.00247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Misaki M, Kim Y, Bandettini PA, Kriegeskorte N. Comparison of multivariate classifiers and response normalizations for pattern-information fMRI. Neuroimage. 2010;53(1):103–118. doi: 10.1016/j.neuroimage.2010.05.051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Chadwick MJ, Bonnici HM, Maguire EA. Decoding information in the human hippocampus: a user’s guide. Neuropsychologia. 2012;50(13):3107–3121. doi: 10.1016/j.neuropsychologia.2012.07.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Björnsdotter M, Rylander K, Wessberg J. A Monte Carlo method for locally multivariate brain mapping. Neuroimage. 2011;56(2):508–516. doi: 10.1016/j.neuroimage.2010.07.044. [DOI] [PubMed] [Google Scholar]
- 46.Nichols TE, Holmes AP. Nonparametric permutation tests for functional neuroimaging: A primer with examples. Hum Brain Mapp. 2002;15(1):1–25. doi: 10.1002/hbm.1058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Desikan RS, et al. An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. Neuroimage. 2006;31(3):968–980. doi: 10.1016/j.neuroimage.2006.01.021. [DOI] [PubMed] [Google Scholar]
- 48.Brysbaert M, New B. Moving beyond Kucera and Francis: A critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. Behav Res Methods. 2009;41(4):977–990. doi: 10.3758/BRM.41.4.977. [DOI] [PubMed] [Google Scholar]
- 49.Yarkoni T, Balota D, Yap M. Moving beyond Coltheart’s N: A new measure of orthographic similarity. Psychon Bull Rev. 2008;15(5):971–979. doi: 10.3758/PBR.15.5.971. [DOI] [PubMed] [Google Scholar]
- 50.Xia M, Wang J, He Y. BrainNet Viewer: A network visualization tool for human brain connectomics. PLoS One. 2013;8(7):e68910. doi: 10.1371/journal.pone.0068910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Morey R. Confidence intervals from normalized data: A correction to Cousineau (2005) Tutor Quant Methods Psychol. 2008;4(2):61–64. [Google Scholar]