Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Jun 1.
Published in final edited form as: Brain Res. 2015 Nov 2;1640(Pt B):264–277. doi: 10.1016/j.brainres.2015.10.048

Auditory short-term memory in the primate auditory cortex

Brian H Scott a, Mortimer Mishkin a
PMCID: PMC4853305  NIHMSID: NIHMS735009  PMID: 26541581

Abstract

Sounds are fleeting, and assembling the sequence of inputs at the ear into a coherent percept requires auditory memory across various time scales. Auditory short-term memory comprises at least two components: an active ‘working memory’ bolstered by rehearsal, and a sensory trace that may be passively retained. Working memory relies on representations recalled from long-term memory, and their rehearsal may require phonological mechanisms unique to humans. The sensory component, passive short-term memory (pSTM), is tractable to study in nonhuman primates, whose brain architecture and behavioral repertoire are comparable to our own. This review discusses recent advances in the behavioral and neurophysiological study of auditory memory with a focus on single-unit recordings from macaque monkeys performing delayed-match-to-sample (DMS) tasks. Monkeys appear to employ pSTM to solve these tasks, as evidenced by the impact of interfering stimuli on memory performance. In several regards, pSTM in monkeys resembles pitch memory in humans, and may engage similar neural mechanisms. Neural correlates of DMS performance have been observed throughout the auditory and prefrontal cortex, defining a network of areas supporting auditory STM with parallels to that supporting visual STM. These correlates include persistent neural firing, or a suppression of firing, during the delay period of the memory task, as well as suppression or (less commonly) enhancement of sensory responses when a sound is repeated as a ‘match’ stimulus. Auditory STM is supported by a distributed temporo-frontal network in which sensitivity to stimulus history is an intrinsic feature of auditory processing.

Keywords: working memory, short-term memory, macaque, cortex, hearing

1. Introduction

Nonhuman primates navigate a rich auditory world in which sounds guide social interactions and signal environmental dangers. Their repertoire of acoustically driven behaviors suggests monkeys should perceive and remember sounds much as humans do. However, accumulating evidence suggests this is only half true: perceptually, hearing in macaques is comparable to our own (Jackson et al., 1999; Stebbins and Moody, 1994), but this does not seem to be true of auditory memory, a domain in which monkeys are slow to learn and their performance is consistently poor relative to that in the visual modality (Colombo and D'Amato, 1986; Colombo and Graziano, 1994; Cowey and Weiskrantz, 1976; Fritz et al., 2005; Kojima, 1985; Wright, 1999; Wright, 2007). Does this discrepancy represent a species difference between monkeys and humans, a modality difference between vision and audition, or some combination of both? How does the faculty of long-term memory (LTM) constrain possible models of short-term memory (STM), and their instantiation in the cortex? And in light of these discrepancies, is the nonhuman primate a valid model for the neural mechanisms of human audition?

This review summarizes recent work on auditory memory in the nonhuman primate and addresses outstanding questions about the application of these findings to human hearing. A key distinction will be the separation of auditory STM into two components: a sensory trace, which may be passively retained, and a phonological rehearsal mechanism, as would subserve active (or ‘working’) memory (Baddeley, 2003; Cowan, 1984; Cowan, 2008). These components may be mapped roughly to the ventral and dorsal streams (Hickok and Poeppel, 2007; Rauschecker and Scott, 2009) of auditory cortex, respectively. We will propose that nonhuman primates offer a tractable model for the perceptual and mnemonic processes of the ventral stream, including the trace that underlies passive STM (pSTM). By contrast, the contribution of the dorsal stream to auditory memory in humans has likely evolved in tandem with language, and may exist in nonhuman primates in only a rudimentary form.

2. Behavioral studies of auditory memory in monkeys

2.1 Do monkeys possess long-term auditory memory?

The nonhuman primate has been a productive model for the study of memory, establishing that long-term visual recognition relies upon cortical areas in the medial temporal lobe (MTL), particularly the rhinal cortex (Mishkin, 1982). This finding was soon extended to tactile memory (Murray and Mishkin, 1983), setting the expectation that the MTL may also support auditory LTM. To test whether selective ablation of the MTL disrupted auditory LTM, monkeys were trained to perform an auditory variant of the classic test for recognition memory, delayed-match-to-sample (DMS). Training monkeys to perform auditory memory tasks has been shown to be slow and arduous relative to training on visual tasks (Colombo and D'Amato, 1986; Colombo and Graziano, 1994; Cowey and Weiskrantz, 1976; Kojima, 1985; Wegener, 1964; Wright, 1999; Wright, 2007). This problem may have been exacerbated in earlier studies that used small stimulus sets, thereby creating confusion as to whether a sound was heard in the current trial or a previous one (Colombo et al., 1990; Colombo et al., 1996; the role of stimulus set size is discussed in section 2.5). However, despite using a large (nearly trial-unique) stimulus set, many thousands of trials were required for monkeys to reach criterion performance, after which they showed no evidence of forming long-term memories: their forgetting thresholds were between 30–50 s (Fritz et al., 2005). This was consistent with studies of auditory list memory that found monkeys could retain only about two sounds for delays up to 30s, the longest delays tested (Wright, 2007). Curiously, selective ablation of the rhinal cortex had no effect on auditory memory, and the auditory forgetting curves of normal monkeys resembled the visual forgetting curves of monkeys with bilateral ablations of the rhinal cortex (Fritz et al., 2005), suggesting that monkeys are natural born ‘auditory amnesiacs’.

The claim that monkeys do not possess long-term auditory memory is seemingly at odds with the ethological evidence for long-term auditory stores in nonhuman primates. Monkeys communicate through a repertoire of vocalizations that appear to carry representational meaning (e.g., food availability or the presence of a predator), and recognize conspecific vocalizations with respect to the kinship and social rank of the caller (Gil-da-Costa et al., 2003; Gouzoules et al., 1984; Rendall et al., 1996; Seyfarth et al., 1980). The auditory abilities of nonhuman primates could be mediated by at least two other types of memory that require a form of long-term storage, but are not sufficient to enable DMS. First, stimulus-response (or ‘habit’) learning and perceptual learning have been demonstrated by classical and instrumental conditioning in many species, and are associated with plasticity in the auditory cortex (Weinberger, 2004; Weinberger, 2015) and neostriatum (Xiong et al., 2015). Habit learning in the monkey depends upon the neostriatum but not the MTL (Mishkin, 1982). A second alternative is cross-modal association memory, in which a sound activates an associated image that is stored, presumably, in the cortical visual system. For instance, the association of individual voices to faces has been shown to arise spontaneously in monkeys for familiar conspecifics (Adachi and Hampton, 2011; Habbershon et al., 2013) as well as familiar humans (Sliwa et al., 2011), and vocalizations are associated with the facial expressions corresponding to their production (Ghazanfar and Logothetis, 2003; Payne and Bachevalier, 2013; Romanski, 2012). Monkeys can also learn arbitrary sound-picture associations by instrumental conditioning, in which a sound cues the monkey to select the associated image after a delay (Colombo and Gross, 1994; Fuster et al., 2000; Gaffan and Harrison, 1991); the reverse condition, in which an image cues the recollection of an associated sound, has not been demonstrated to our knowledge. That the sound stimulus activates an associated visual memory is supported by the finding that matching to the subsequent image was impaired by the presentation of visual – but not auditory – distractors during the delay (Colombo and Graziano, 1994).

The inability of monkeys to perform DMS beyond a few tens of seconds suggests that although some form of long-term trace exists in the brain (to support habit or associative learning), monkeys can not reactivate that trace for use in an identity comparison. A possible anatomical correlate of this inability lies in the paucity of auditory inputs to the MTL, which receives sensory input predominantly from higher visual cortex (Munoz-Lopez et al., 2010; Munoz-Lopez et al., 2015; Suzuki and Amaral, 1994). As the MTL is necessary for the formation of long-term visual memories (Mishkin, 1978), purely auditory stimuli may not have direct access to this mechanism of storage. This peculiarity of long-term auditory memory places important constraints upon models of short-term auditory memory.

2.2 Do monkeys possess auditory working memory?

Working memory (WM) refers to the ability to temporarily maintain and manipulate information. The term WM is sometimes used interchangeably with STM, and definitions of the two vary (Baddeley, 2007; Cowan, 2008), but WM is generally held to be an active process requiring attentional control and manipulation. By contrast, the simple storage of sensory information that is no longer present in the external world falls under the purview of STM, which we distinguish as passive STM (pSTM). Discrimination of two stimuli that differ along a single parameter (e.g., tone frequency, image contrast, or vibratory flutter frequency) across a brief delay is often referred to as a WM task (Pasternak and Greenlee, 2005; Romo and Salinas, 2003), but would fall under our definition of pSTM.

Working memory is believed to operate not on the maintained representation of the recently presented stimulus, but on the activation of a representation of that stimulus previously stored in LTM (Cowan, 1988; Cowan, 2008; Fuster, 1995; Jacob and Nieder, 2014; Norman, 1968). This would confer several advantages over pSTM. For one, stimuli could be classified into categories based on low-level or abstract properties, and that category could be held in memory rather than the full representation of the stimulus itself. This seems to be a critical factor in the capacity of visual memory, which is reduced from roughly 4–7 items to one or two items if those images can not be categorized (Olsson and Poom, 2005). Interestingly, when non-categorical auditory and visual stimuli are carefully equated in discriminability, the capacities of visual and auditory STM are equally limited (Visscher et al., 2007; but see Bigelow and Poremba, 2014). A second advantage of WM over pSTM is that the representation of an actively maintained item can be continually reactivated (or 'rehearsed'), which confers resistance to decay and to interference from other sensory stimuli encountered during the delay (Lewandowsky et al., 2009; Saults and Cowan, 2007).

2.3 Properties of auditory STM in monkeys: active WM or pSTM?

Without evidence of long-term recognition memory, the ability of monkeys to perform auditory DMS at short delays (5–30 s) was hypothesized to depend upon WM or STM (Fritz et al., 2005). This capacity was explored in later experiments using a serial DMS task with short (~1 s) delays (Scott et al., 2012; Fig 1), an auditory analog of an established visual paradigm (Eskandar et al., 1992; Miller et al., 1993; Miller et al., 1996). By the definition of WM above (section 2.2), the apparent absence of LTM would preclude the use of active WM, because there is no LTM trace to reactivate for on-line maintenance. If monkeys are instead dependent upon pSTM, this would lead to two predictions about their performance of serial DMS: (i) matching accuracy should be compromised by interference from nonmatch stimuli presented during the delay, and (ii) sound category should not play a role in matching behavior because categories themselves require LTM (Olsson and Poom, 2005).

Figure 1.

Figure 1

Timing and trial types for two versions of the auditory delayed-match-to-sample (DMS) paradigm. A correct trial is depicted for each trial type. In the task used by the Poremba group, a sample sound was followed by a test sound after a 5-s delay; after a mandatory wait period, the monkey pressed a button (and received a food reward) if the test sound matched the sample, or withheld response for a nonmatching test stimulus. Below, the serial DMS paradigm, in which sounds were separated by a 1-s delay and the matching test sound may appear after zero, one, or two intervening nonmatches. Release of a touch-bar following the match earned a liquid reward. Note that a match/nonmatch decision was required for the second and (on some trials) third stimulus in the sequence.

Monkeys’ performance of serial DMS confirmed both predictions, suggesting that auditory memory was reliant on a passively retained sensory trace (Scott et al., 2012). Accuracy was strongly dependent on trial type, such that performance was ~90% correct with no nonmatch stimulus intervening between sample and match but dropped to ~40% with two intervening nonmatch stimuli. Varying the interstimulus interval confirmed that performance was affected by the intervening stimuli, rather than by temporal decay. Active WM should be resistant to these nonmatch stimuli, but the strong effect of interference favors the interpretation that the monkeys used pSTM.

The brief (~300 ms) sounds used in this task were drawn from seven categories, including synthetic and natural sounds. In an attempt to make the task easier, the nonmatch sound(s) on a given trial were not drawn from the same category as the sample; e.g., if the sample was a pure tone, the nonmatching test sound could be a vocalization or modulated noise, but not another pure tone. Thus, while 'identity' was the intended rule for DMS performance, matching to category would have been sufficient. There was no evidence that monkeys extracted these categories in order to exploit this strategy. Surprisingly, their performance was slightly better for synthetic than for natural sound categories, including macaque vocalizations (Scott et al., 2012), which may be a general feature of auditory DMS with small stimulus sets (21 in this case; see also Gaffan and Harrison, 1991; Ng, 2011; effect of stimulus set size is discussed below in Section 2.5). Quantifying the spectral and temporal complexity of the stimuli confirmed that performance was better for sounds that were simpler in either dimension, particularly tones, noise, and frequency sweeps (Scott et al., 2013). From these data it appears that monkeys solved auditory DMS using a sensory or feature-based memory trace, which favors the retention of acoustically simple sounds, rather than a categorical or object-based representation.

2.4 Role of interference and acoustic features in pSTM

Whereas human subjects can be questioned as to the strategy they employed in solving a task, the strategy used by nonhuman subjects at best can be inferred from their behavior. The monkeys’ mediocre performance in the serial DMS task afforded the opportunity to infer, post-hoc, what criteria contributed to identifying a ‘match’ (Scott et al., 2013). In both monkey subjects, a select subset of sample/nonmatch pairs was particularly likely to elicit ‘false alarm’ (incorrect match) responses. Tellingly, this pattern was much stronger for the first nonmatch in the sequence, when the false alarm rate was relatively low, than for the second nonmatch, when the false alarm rate was higher and errors seemed to occur randomly among sample/nonmatch pairs. This would be predicted if the mnemonic trace forming a basis for the match comparison were degraded by the presentation of the intervening nonmatch. This strong effect of interference is more characteristic of sensory memory (pSTM) than of an actively maintained WM.

The pairs of stimuli eliciting high error rates seemed to share common spectral characteristics; for example, pure tones and band-pass noise of equal center frequency were often confused, despite their obvious qualitative difference to human ears (Scott et al., 2013). The error rate for a given sample/nonmatch pair was predicted by the spectral similarity (correlation of frequency spectra) between sounds; temporal similarity (between amplitude envelopes) was a much less effective predictor. The predominance of spectral cues in DMS behavior was confirmed by selectively removing spectral or temporal information from the original memoranda: performance with only temporal cues was at or near chance, whereas performance with only spectral cues was only slightly reduced from that for intact stimuli (Scott et al., 2013). The particular spectral feature that best predicted error rate was the spectral centroid, which correlates closely with "pitch" as estimated by analytical software (Boersma and Weenink, 2012). Rhesus monkeys may perceive pitch much as humans do (Joly et al., 2014), lending credibility to the idea that monkeys performed auditory DMS by relying primarily on a pitch cue.

Parallels between DMS performance in monkeys and pitch memory in humans suggest that the same pSTM mechanism underlies both faculties. Pitch is a continuous sound quality that is largely immune to verbal labels or categorization in most listeners. Pitch memory begins to decay at intervals ≥3 s (Harris, 1952), consistent with the temporal limits of DMS performance in macaques (Fritz et al., 2005; Scott et al., 2012). Pitch memory is independent of verbal memory, in that it is not affected by interference from verbal material presented during the delay (Deutsch, 1970) but is nevertheless highly susceptible to interference from tones close in frequency to the sample (Deutsch, 1972; Zatorre et al., 1994). Similar manipulations of interference indicate that attributes like pitch, loudness, and timbre are represented independently in STM (Caclin et al., 2006; Clement et al., 1999; Nousak et al., 1996; Semal and Demany, 1991). This feature-based representation distinguishes pSTM from WM, which operates on configural or item-based representations (Joseph et al., 2015).

2.5 Cross-trial interference in auditory and visual STM

We have proposed that an advantage of WM over pSTM is resistance to interference. Thus far we have referred to retroactive interference within a trial, in which a nonmatch stimulus interferes with the previously presented sample being held in memory (referred to as ‘inhibition’ by (Wright, 2007). A related process is proactive interference, in which a previously presented sound affects the maintenance of a subsequently presented sample. Susceptibility to proactive interference offers a glimpse into the strategy that an animal may be employing to solve a DMS task.

If the stimulus set is large, then items will seldom re-appear from trial to trial, and successful performance could be achieved merely by judging if an item had been presented recently. In this case, a passive memory mechanism would suffice to identify the match. But if the stimulus set is small, then the match stimulus from a given trial may reappear as the nonmatch on a subsequent trial. A recency criterion, perhaps supported by pSTM, would lead to false-alarm errors in response to nonmatch stimuli that had appeared in a previous trial. By contrast, an active WM process would allow for the memory buffer to be ‘reset’ between trials (Yakovlev et al., 2013), and for the sample on the current trial to be selectively attended and maintained. Under these conditions, cross-trial proactive interference would be minimal.

The effect of proactive interference on auditory memory in the monkey was addressed by Wright using a list memory paradigm (Wright, 1999; Wright, 2007), and by Poremba and colleagues using a non-serial DMS paradigm (Bigelow and Poremba, 2013a; Bigelow and Poremba, 2013b; Bigelow and Poremba, 2015). Consistent with the attentional demand induced by repeating stimuli between trials, accuracy decreased with stimulus set size (Bigelow and Poremba, 2013b; Wright, 1999; Wright, 2007). Effects were detectable for stimuli that had appeared several trials back, spanning up to 3 minutes, suggesting that the effects of interference endured beyond the memory span that could be inferred from the current performance of the DMS task alone (Bigelow and Poremba, 2013a). However, we would argue that proactive interference evident in our data and others’ reflects a stimulus-response association that is distinct from the recognition memory that DMS is designed to test (Scott et al., 2012); if the response to a stimulus on the previous trial was rewarded, that reinforcement increases the probability of response when the same stimulus is presented as a nonmatch in the current trial. Overall, the effects of proactive interference are consistent with the idea that monkeys rely on a passive short-term sensory store to perform auditory STM tasks. Recent evidence suggests that this may not be unique to audition.

It has been argued that the monkey’s cortical memory mechanisms in audition are intrinsically different from those in vision and touch (Fritz et al., 2005), and that this difference in LTM extends to their reliance on pSTM rather than active WM. This distinction between modalities rests on prior evidence from monkeys performing visual DMS, in which active WM is used to selectively memorize a sample image while interfering distractor images are ignored or forgotten (Basile and Hampton, 2013; Miller et al., 1993; Miller and Desimone, 1994; Yakovlev et al., 2013). However, more recent evidence challenges the assumption that this necessarily reflects true WM, suggesting instead that monkeys rely on a passive recency criterion to solve serial DMS in the visual domain (Wittig and Richmond, 2014). Even when required to memorize only the first item in a sequence, residual effects of proactive interference reveal that monkeys will not use selective memory exclusively (Wittig and Richmond, 2014). Our data align with those of Wittig and Richmond to suggest that passive memory for stimulus recency is the default mechanism for STM in nonhuman primates across both audition and vision.

2.6 Differences across monkeys, apes, and humans

What endows humans with our ability to store and recognize words, songs, and other sounds? One theory is that humans enlist the inferior frontal motor areas underlying speech production, which could transform a fluctuating acoustic stimulus into a subvocal oromotor sequence (Galantucci et al., 2006; Hickok and Poeppel, 2007; Liberman and Mattingly, 1985; Rauschecker and Scott, 2009). This motor theory of auditory memory is supported by the finding that recognition memory for reversed words, which cannot be pronounced, is inferior to that for pronounceable pseudo-words (Schulze et al., 2012). This transformation is likely to involve Broca’s area (Flinker et al., 2015) and thus to depend upon the arcuate fasciculus, the pathway connecting posterior temporal auditory regions to the inferior frontal cortex (Catani and Mesulam, 2008; Petrides and Pandya, 2009). Temporary disruption of the posterior auditory cortex by transcranial magnetic stimulation impairs recognition (but not discrimination) of pseudowords, implicating the caudal terminus of the arcuate fasciculus in the storage of speech sounds (Karabanov et al., 2015).

The arcuate fasciculus is prominent in humans, of intermediate strength in chimpanzees, and very weak in macaques (Rilling et al., 2008). Despite their stronger arcuate fasciculus, chimpanzees perform no better than macaques on tests of auditory memory (Hashiya and Kojima, 2001; Siebert et al., 2013), suggesting that the mnemonic advantage conferred by the human dorsal-stream language pathway is all-or-none. The extent to which the dorsal auditory-motor pathway in humans is specialized relative to that in nonhuman primates is a topic of debate (Bornkessel-Schlesewsky et al., 2015a), but to compare auditory memory systems between species any linguistic advantage (e.g., labeling or mimicry) must be carefully circumvented to study sensory STM in isolation from verbal memory.

3. Cortical systems for auditory STM

Where in the brain should we expect to find the neural signature of STM? Contemporary theories favor the view that memory is supported by the same systems that enable perception (i.e., sensory and association cortex) as opposed to a redundant memory-specific buffer elsewhere in the brain (Cowan, 2008; Postle, 2006). This would focus the search for neural correlates of pSTM in the monkey on the auditory cortex, from the core areas (primary recipients of thalamic input) to the belt, parabelt, and higher-order regions comprising the ventral sound processing stream (Hackett, 2011; Kikuchi et al., 2010; Munoz-Lopez et al., 2010; Rauschecker and Scott, 2009). Beginning with the secondary belt regions, every stage of the auditory cortical hierarchy is interconnected with the prefrontal cortex (PFC; Plakke and Romanski, 2014; Romanski et al., 1999a; Romanski et al., 1999b), which is known to support visual STM through its interactions with higher-order visual cortex (Fuster, 1973; Fuster et al., 1985; Miller et al., 1996). This section will review an emerging body of work that identifies a similar temporo-frontal network that may support auditory STM in the nonhuman primate.

3.1 Neural correlates of pSTM

Studies of single-unit activity during DMS performance have identified two general classes of effects potentially associated with STM. First, the sensory response (i.e., firing rate) of a neuron to a given stimulus may be modulated by the context in which that stimulus appears (sample, match, or nonmatch). This modulation is typically referred to as match suppression (MS) or match enhancement (ME; Miller and Desimone, 1994), though as discussed below, effects of this type (particularly suppression; section 3.1.2) can be observed outside of an explicit memory task. Second, activity may be modulated (elevated or suppressed) during the delay period between the sample and test stimuli. These phenomena were initially identified in inferotemporal (IT) visual cortex during visual STM (Baylis and Rolls, 1987; Brown et al., 1987; Fuster et al., 1981; Fuster and Jervey, 1982; Miyashita and Chang, 1988). Contemporaneous experiments in the auditory domain reported similar effects in the secondary auditory cortex of a baboon comparing tone frequencies across a 1-s delay (Gottlieb et al., 1989). Half of the neurons recorded exhibited activity during the delay that was dependent on the frequency of the sample tone, and a quarter exhibited a match/nonmatch difference.

Recent studies have re-opened the question of how cortical neurons represent auditory STM in the nonhuman primate model (Fig 2; Bigelow et al., 2014; Ng et al., 2014; Plakke et al., 2013; Scott et al., 2014). As described above (section 2.2), Scott et al. (2012; 2012; 2014) applied a serial DMS paradigm using stimuli and delays of a similar duration to those of Gottlieb et al. (1989) but with two key differences: stimuli comprised a range of acoustic categories, and intervening nonmatch stimuli could appear between the sample and match (Fig. 1). This created a direct auditory analog to a class of serial visual DMS tasks used in studies of IT visual cortex (Eskandar et al., 1992; Miller et al., 1993). Neurons were recorded from areas referred to collectively as the rostral superior temporal cortex (rSTC), including rostral core, belt, parabelt, and higher-order areas (Scott et al., 2014). Part of the recorded region was co-extensive with the rostral superior temporal gyrus lesion applied by Fritz et al. (2005; see section 2.1), which reduced forgetting thresholds from 30–50 s to <10 s, implicating this region in supporting auditory DMS. A complementary set of studies from Poremba and colleagues recorded from areas bracketing the rSTC at lower and higher stages of cortical processing: the primary auditory cortex (AI; Bigelow et al., 2014), and the dorsal temporal pole (dTP: Ng et al., 2014), as well as the dorsolateral PFC (Plakke et al., 2013). The following sections (3.1–3.3) integrate these findings to outline a temporal-frontal network engaged during auditory STM in the nonhuman primate.

Figure 2.

Figure 2

Cortical regions recorded in neurophysiological studies of auditory STM in monkeys, collected on a schematic depiction of the macaque brain. Top, the bold black line highlights the lateral sulcus (ls), which has been opened up to reveal the auditory areas of the supratemporal plane (STP) illustrated below. The ascending hierarchical levels of core, belt, and parabelt are shaded from dark to light gray. Areas recorded in frequently cited studies are indicated in bold type and outlined in blue: the primary auditory cortex (AI, a subdivision of the core), caudal belt, rostral superior temporal cortex (rSTC), and dorsal temporal pole (dTP). Note that rSTC includes rostral core and belt, as well as the superior temporal gyrus (STGr), rostral STP, and some overlap with the dTP. The dorsolateral (dlPFC) and ventrolateral (vlPFC) prefrontal cortex is indicated on the lateral view at the top. A right-hemisphere view is used for convenience, but in actuality recordings were obtained from the right hemisphere in caudal belt (N=1 baboon; Gottlieb et al., 1989), one right and three left hemispheres in rSTC (N=3 rhesus monkeys; Scott et al., 2014), one left and two right hemispheres in vlPFC (N=2 rhesus, Hwang and Romanski, 2015; and the left hemisphere in AI, dTP, and PFC (N=2 rhesus; Bigelow et al., 2014; Ng et al., 2014; Plakke et al., 2013).

3.1.1 Delay-period activity

Spiking activity during the inter-stimulus delay period has been consistently identified across studies of auditory DMS. Activity during the delay period was shifted from the baseline rate in about one third of rSTC neurons, with an equal split between delay enhancement (elevation of spike rate) and delay suppression (Scott et al., 2014). Similar responses were seen in the adjacent dTP region and in AI: among the 20–25% of units that showed an effect, a roughly equal proportion showed suppression or enhancement (Bigelow et al., 2014; Ng et al., 2014). Delay enhancement and suppression in rSTC showed different dynamics across the trial: enhancement and suppression were equally common in the first delay period, but in the second delay period (after the presentation of a nonmatching stimulus) enhancement was diminished; by contrast, suppression was robust across two intervening nonmatch stimuli (Scott et al., 2014). The decrement in the proportion of units showing delay excitation coincided with a drop in behavioral performance (matching was much more accurate for the first test stimulus than for the second test stimulus, which followed an intervening nonmatch). This is suggestive of a role for delay enhancement in maintaining the sensory trace upon which DMS performance depends, but clarifying the role of this activity will require further study.

Delay activity could be conceived as a continuation of the sensory response evoked by the sample, bridging the temporal gap between the sample and test stimuli. However, two aspects of the delay activity observed during auditory DMS are incompatible with this idea. First, delay activity in rSTC was only weakly selective for the preceding sample, even among the subset of neurons exhibiting delay enhancement (Scott et. al, 2014). This contrasts with the sample-selective delay activity reported previously (Gottlieb et al., 1989). (A possible reason for the discrepancy is that the earlier study used a small set of tones selected to elicit strong responses from tonotopically-organized cortex, whereas the rSTC recordings were made with a larger set of complex sounds that was not tailored to each neuron.) Second, delay activity in AI and dTP was not stable across the full 5-s delay used in those studies (Bigelow et al., 2014; Ng et al., 2014); this instability could not be observed in studies that used a shorter 1-s delay (Gottlieb et al., 1989; Scott et al., 2014).

It is generally believed that selective delay activity in IT visual cortex is not the basis for serial DMS performance, but the reasons behind that claim are difficult to interpret in the case of auditory DMS. For example, delay activity in IT cortex does not survive an intervening nonmatch stimulus, yet matching behavior remains accurate (Miller et al., 1993), which rules out delay activity as the relevant signal (Sugase-Miyamoto et al., 2008). In the auditory domain, delay enhancement and DMS performance drop off in tandem as intervening stimuli appear in the trial (Scott et al., 2014), so the same argument does not apply. In addition, delay activity in visual cortex was reported only for familiar stimuli (those presumably retrieved from LTM), yet monkeys were capable of performing DMS for novel visual stimuli (Miyashita and Chang, 1988). Similarly, disruption of delay activity in PFC by microstimulation only affected visual DMS for familiar stimuli, linking this activity to recall of a stored memory rather than to sensory maintenance (Sobotka and Ringo, 1993). If monkeys are unable to temporarily reactivate representations of sounds from LTM for use in WM (see sections 2.1 and 2.2, above), as humans appear to do with visual images (Lewis-Peacock and Postle, 2008), then delay activity in auditory DMS may represent a different process.

Some of the evidence above suggests that delay activity, despite its ubiquity, does not carry the sustained sensory signal that we propose is the basis for auditory DMS performance. However, the relatively recent application of multivariate analysis methods to fMRI data has shown that information about a stimulus in STM can be decoded from the activity of sensory cortex, both visual (Harrison and Tong, 2009; Serences et al., 2009) and auditory (Linke et al., 2011; Linke and Cusack, 2015); for a recent review, see (Sreenivasan et al., 2014). Before the role of delay activity is dismissed, similar decoding methods should be applied to spiking activity from macaques during auditory STM to determine whether a representation exists at the population level that is not evident in single neurons (Machens et al., 2010; Mante et al., 2013; Meyers et al., 2008; Rigotti et al., 2013).

3.1.2 Match response modulation

In the absence of persistent spiking activity in single units, a memory trace could lie in a subthreshold shift in synaptic weights (Sugase-Miyamoto et al., 2008), which may manifest as match suppression (see above, section 3.1). Averaged over the population of neurons, responses in all three cortical regions (AI, rSTC, and dTP) were diminished for match stimuli relative to the same stimuli presented as nonmatch or sample (Bigelow et al., 2014; Ng et al., 2014; Scott et al., 2014). In all three areas, a transient MS emerged as early as 30–50 ms from stimulus onset. The proportion of individual units showing MS varied across areas: 23% in AI vs. 9% in dTP across a 5-s delay, and 25% in caudal belt vs. 12 % in rSTC across a 1-s delay (Bigelow et al., 2014; Ng et al., 2014; Gottlieb et al., 1989; Scott et al., 2014). However, these proportions may not be directly comparable due either to differences in statistical criteria across studies, and/or to different firing rates between neurons in primary and higher-level cortex.

The most parsimonious explanation of MS is response adaptation, a ubiquitous feature of sensory neurons that is present without any explicit memory task, and even under anesthesia (Perez-Gonzalez and Malmierca, 2014). Adaptation effects lasting 1 s are commonplace in AI (Bartlett and Wang, 2005; Brosch and Scheich, 2008), and effects lasting up to 5 s have been reported (Werner-Reiss et al., 2006), a period long enough to span the interval in the Bigelow et al. (2014) task. The time scale of adaptation in higher-order auditory areas (rSTC and dTP) has not been systematically studied, but is likely to be longer than that at more caudal stations in the ventral pathway (Bendor and Wang, 2008; Lu et al., 1992; Scott et al., 2011). However, MS during serial DMS was observed even with one or two intervening nonmatch stimuli (Scott et al., 2014), suggesting that rSTC neurons may show a particularly robust form of adaptation that is inherent to these areas, or recruited during DMS performance. This latter idea, that some form of attentional control may enhance inherent adaptive processes, is supported by the observation that MS (and delay activity) were observed to a greater degree during STM behavior than during passive listening (Gottlieb et al., 1989). Until MS is demonstrated to be selectively engaged during a STM task, the most conservative stance is to follow Miller and Desimone (1994) and consider MS a passive mechanism that is intrinsic to higher stages of cortical sensory processing.

Although stimulus repetition caused suppression of the population response, a small subset of neurons in rSTC (7%) exhibited the opposite effect, match enhancement (Scott et al., 2014). Whereas MS appeared at the same latency as the sensory response itself, ME lagged MS by >50 ms. Unlike MS, ME is not easily explained as a form of adaptation. And much like the difference between excitatory and suppressive delay-period effects, ME and MS displayed different dynamics across the trial: the prevalence of ME was significantly reduced after an intervening nonmatch, but MS was robust across all trial lengths (Scott et al., 2014). Thus the dynamics of ME matched that of the monkeys’ behavioral accuracy, which declined after an intervening stimulus. Match enhancement arose considerably later in area AI (~250 ms after test stimulus onset) than in rSTC (~100 ms), and was evident only in the multi-unit response. (The single-unit population in AI exhibited ME only around the time of the behavioral response, which may result from top-down feedback related to response selection [Bigelow et al., 2014; Brosch et al., 2005] but not the match per se.) The relative timing of the ME signal across areas suggests that the relatively early ME in the rSTC originates locally and propagates back to AI via recurrent connections; alternatively, ME originating in PFC (see section 3.3, below) may propagate back to AI via the higher-order fields of the rSTC. Simultaneous recordings from the temporal and prefrontal cortex could clarify the dynamics of this network during auditory STM.

In IT visual cortex, robust ME was observed only under a particularly demanding DMS task in which nonmatch stimuli were repeated within the trial (e.g., ‘ABBA’), requiring the monkey to detect the repetition of the first item exclusively, not any recently presented stimulus (Miller and Desimone, 1994). Because ME was seen only for the match, but not the repeated nonmatch, Miller and Desimone (1994) associate ME with ‘active’ memory (selective memorization; but see Wittig and Richmond, 2014). By contrast, MS was also observed for the repeated nonmatch and was interpreted as a passive mechanism signaling the recency of stimulus appearance irrespective of task demands (Miller and Desimone, 1994). To our knowledge monkeys have never been trained on this variant of auditory DMS, but from the available evidence an ‘active’ WM robust to intervening distractors (and its associated ME) is weak or absent in the auditory domain. Studies of auditory DMS in humans should use a comparable serial paradigm, including distractor stimuli, to determine whether match enhancement and delay activity in the human cortex is robust to interference, and capable of supporting active auditory WM.

3.2 Neural effects of stimulus repetition

The enhancement and suppression of match responses in single neurons during DMS may be related to more general effects of stimulus repetition that are evident in population measures of neural activity. In studies using functional MRI, a reduction in response magnitude to a repeated stimulus is typically called repetition suppression, and is thought to reflect a correlate of priming that may underlie other memory processes as well (Grill-Spector et al., 2006). Importantly, repetition suppression is stimulus-specific, and that specificity has been used to infer the stimulus selectivity of the underlying neuron population (Krekelberg et al., 2006; but see Sawamura et al., 2006). Several neural models could plausibly explain repetition suppression (Grill-Spector et al., 2006), including a population-wide decrease in activation (fatigue), a selective decrease among a subset of neurons that results in a sparser representation (sharpening), or a decrease in the latency and duration of activation that does not necessarily affect the response magnitude (facilitation). These models are not mutually exclusive, but aspects of MS in single neurons of auditory cortex are most compatible with the sharpening model: only a subset of neurons exhibit MS (not all, as the fatigue model would predict), and match responses have not been shown to have shorter latencies (as the facilitation model would predict).

Less widely discussed than repetition suppression is the complementary effect of repetition enhancement of the fMRI signal. Whereas repetition suppression may be explained by adaptation, repetition enhancement has been associated with several cognitive variables including stimulus recognition, learning, attention, and expectation (Segaert et al., 2013). Auditory-verbal STM elicits repetition suppression localized to the rostral STG, as well as repetition enhancement in caudal STG (Buchsbaum and D'Esposito, 2009). Suppression in rostral areas is consistent with single-unit data from macaques (Ng et al., 2014) and points to a common ventral-stream mechanism for sensory STM. Match enhancement was also observed in single units in rostral STG (Scott et al., 2014) but was less common than suppression and may not be evident in a spatially averaged fMRI signal. Repetition suppression and enhancement can be detected concurrently in extrastriate visual cortex using multivariate analysis methods (de Gardelle et al., 2013), which may reveal a similarly mixed response in the human auditory cortex. Whether the enhancement effects indicated by fMRI (Buchsbaum and D'Esposito, 2009) are evident among single units in caudal STG is unknown, which underscores the need for further studies of the dorsal auditory pathway in nonhuman primates.

The spatial resolution of fMRI allows for the cortical locus of suppression and enhancement to be distinguished (Buchsbaum and D'Esposito, 2009), but the temporal resolution of fMRI is coarse. The reverse is true of electro-and magneto-encephalography (EEG and MEG), which can identify differences in timing between enhancement and suppression that may parallel those observed in single units. Evidence from MEG identifies both repetition suppression and repetition enhancement in the establishment of auditory memory traces in humans (Recasens et al., 2015; Rong et al., 2011); like the MS and ME observed in single units during DMS tasks in monkeys, these effects show a difference in timing: initial suppression, followed by later enhancement (Recasens et al., 2015).

The effect of repetition in auditory sequences is most often studied using an oddball paradigm, in which the response to a given sound is dependent upon its appearance as a ‘standard’ within a repeating sequence (which induces adaptation), or as an infrequent ‘deviant’ within that sequence. These repetition effects, namely the ‘mismatch negativity’ (MMN, the difference between the evoked potential under the deviant and standard condition) are interpreted as deviance detection, i.e. a form of sensory memory or predictive coding within the auditory system (Baldeweg, 2006; Friston, 2005; Naatanen and Winkler, 1999). However, a strong case has been made that the MMN can be explained through simple adaptation in the auditory cortical network (May and Tiitinen, 2010). This would position the MMN as the EEG/MEG correlate of the stimulus-specific adaptation that is evident in single units of the anesthetized auditory cortex under similar oddball paradigms (Nelken and Ulanovsky, 2007; Ulanovsky et al., 2003; Ulanovsky et al., 2004). Stimulus-specific adaptation has been demonstrated in AI of the awake macaque (Fishman and Steinschneider, 2012), and may be related to the match suppression that we propose to underlie passive short-term memory in the nonhuman primate. Although the oddball paradigm is quite different from the DMS tasks that are the focus of this review, the same mechanism of pSTM may support DMS as well as other auditory memory tasks (Brosch et al., 2004; Hwang and Romanski, 2015; Russ et al., 2008).

3.3 Is auditory STM distributed or localized?

If STM maintenance is supported by the same areas that support perception, then a pitch memory task should engage pitch processing centers (Grimault et al., 2014). Pitch sensitivity in human auditory cortex appears to follow a caudorostral gradient (Norman-Haignere et al., 2013), which would predict some degree of specialization for pitch memory in the fields rostral to AI, for which the rSTC in the monkey may be a homolog. But in general the representation of pitch, timbre, and other auditory features appears to be widely distributed in the auditory cortex (Bizley et al., 2009), which would predict a similarly distributed memory trace if subjects were relying on a pitch-based cue as monkeys’ behavior suggests (section 2.4, above). Accordingly, neural effects related to STM have been observed across nearly the entire supratemporal plane, from the caudal belt (Gottlieb et al., 1989) to the dTP (Ng et al., 2014).

Neural correlates of auditory STM have also been described in the same regions of PFC that are active during visual STM (Fig. 2). In dorsolateral PFC, the population response exhibits a sustained ME effect, appearing at a latency that is possibly short enough to drive ME in lower cortical regions by top-down modulation (Plakke et al., 2013). Neurons in ventrolateral PFC exhibit match and delay effects during a multimodal (face/voice) memory task (Hwang and Romanski, 2015), and inactivation of ventrolateral PFC impairs auditory and auditory-visual memory performance (Plakke et al., 2015). Although ventrolateral PFC is strongly interconnected with the ventral stream of auditory cortex (i.e., the rSTC and dTP; Romanski et al., 1999a; Romanski et al., 1999b), neurons in ventrolateral PFC were predominantly sensitive to changes the visual component of the memoranda (Hwang and Romanski, 2015). This identifies ventrolateral PFC as a site of multimodal integration, rather than a a specialized module for auditory STM. Taken together, the available data support a distributed temporo-frontal network that subserves auditory STM.

An advantage of a distributed representation would be resilience to cortical damage, as was observed by Fritz et al. (2005): removal of the rostral STG reduced the monkeys’ forgetting threshold, but did not affect performance at the shortest delays tested (5 s). The hierarchical organization of the auditory system may serve to build a lengthened sensitivity to stimulus history (Hasson et al., 2015), which would manifest as MS or ME across progressively longer time scales at higher stages of processing. Within each stage, the differential response of populations of neurons that do or do not display MS (or ME) would signal the relative recency of stimulus occurrence, as has been proposed to operate in IT visual cortex (Miller et al., 1993).

4. Conclusions

The argument that auditory LTM is absent in the monkey (Fritz et al., 2005), and that STM is limited relative to that in vision (Scott et al., 2012), appears to undermine a key rationale for studying hearing in the nonhuman primate: viz., that the behavior, neuroanatomy, and perceptual ability of monkeys closely resemble our own (Belmonte et al., 2015; Seyfarth and Cheney, 2014; Stebbins and Moody, 1994). However, several lines of evidence argue that studies in nonhuman primates (and animal models in general) are applicable to the human auditory system, including auditory memory.

First, if retention of a pitch trace is independent of vocal processes, then the pSTM evident in the nonhuman primate offers a tenable model of the neural mechanisms supporting this component of auditory memory. Sensitivity to stimulus history is intrinsic to auditory perception and appears to occur automatically (Demany and Semal, 2007). Pitch memory is largely divorced from verbal and oromotor processes, as demonstrated by the fact that memory is not improved by allowing subjects to hum the remembered pitch, and that articulatory suppression does not interfere with tone retention (Demany and Semal, 2007; Deutsch, 1970; Guimond et al., 2011). Although active rehearsal bolsters verbal STM, it is largely irrelevant to auditory sensory STM.

Second, neuroimaging studies have identified similarities between the cortical systems supporting auditory STM in human and in nonhuman primates (see section 3.2, above). Suppression of responses to match stimuli was observed in rostral STG of human subjects (Buchsbaum and D'Esposito, 2009) as well as in single units of monkeys (Ng et al., 2014; Scott et al., 2014). Suppression occurs at a shorter latency than enhancement in MEG recordings from humans, and in single neuron recordings from monkeys (Recasens et al., 2015; Scott et al., 2014). These parallels suggest at least some overlap in pSTM mechanisms between the auditory ventral pathway in humans and monkeys.

Third, the disparity between visual and auditory memory may not be unique to monkeys. Recent evidence shows that human auditory memory is inferior to visual memory, whether for colors and tones in STM or complex objects in LTM (Bigelow and Poremba, 2014; see also Cohen et al., 2009). Connectional neuroanatomy in the monkey reveals an impoverished pathway from the auditory cortex to the medial temporal lobe, which may underlie the relative difficulty of forming and recalling auditory memories relative to visual memories (Munoz-Lopez et al., 2010; Munoz-Lopez et al., 2015). The homology between auditory memory pathways in the human and nonhuman primate is a critical open question, but both species may be operating under similar constraints.

The advantage for humans is likely to lie not in the ventral stream, but in the dorsal stream associated with mapping sounds to articulatory movements. Whether the dorsal stream in monkeys differs qualitatively or only quantitatively from that in humans remains a subject of debate (Bornkessel-Schlesewsky et al., 2015a; Bornkessel-Schlesewsky et al., 2015b; Skeide and Friederici, 2015). But the parallels in behavior, anatomy, and physiology between monkeys and humans suggest that the mechanism of pSTM in the nonhuman primate offers a tenable model of the ventral stream mechanisms supporting auditory sensory memory in humans (Kaiser, 2015), in isolation from related mechanisms of verbal memory.

Highlights.

  • Mechanisms of auditory memory may differ between species and between modalities

  • Auditory passive short-term memory (pSTM) is comparable between humans and monkeys

  • Correlates of pSTM are evident in neurons throughout auditory and prefrontal cortex

  • Auditory and visual STM may share some common neural mechanisms

Acknowledgements

The authors would like to acknowledge Pingbo Yin for establishing the auditory serial DMS paradigm while at the Laboratory of Neuropsychology. We thank Corrie Camalier and John Wittig, Jr. for comments on the manuscript. This work was supported by the Division of Intramural Research Programs, NIMH/NIH.

Abbreviations

pSTM

passive (auditory) short-term memory

DMS

delayed match-to-sample

LTM

long-term memory

STM

short-term memory

DE

delay enhancement

DS

delay suppression

ME

match enhancement

MS

match suppression

MTL

medial temporal lobe

AI

primary auditory cortex

rSTC

rostral superior temporal cortex

dTP

dorsal temporal pole

PFC

prefrontal cortex

STGr

rostral superior temporal gyrus

EEG

electroencephalography

MEG

magnetoencephalography

MMN

mismatch negativity

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Adachi I, Hampton RR. Rhesus monkeys see who they hear: spontaneous cross-modal memory for familiar conspecifics. PLoS One. 2011;6:e23345. doi: 10.1371/journal.pone.0023345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Baddeley A. Working memory: looking back and looking forward. Nat Rev Neurosci. 2003;4:829–839. doi: 10.1038/nrn1201. [DOI] [PubMed] [Google Scholar]
  3. Baddeley A. Working Memory, Thought, and Action. Vol. New York: Oxford University Press; 2007. [Google Scholar]
  4. Baldeweg T. Repetition effects to sounds: evidence for predictive coding in the auditory system. Trends Cogn Sci. 2006;10:93–94. doi: 10.1016/j.tics.2006.01.010. [DOI] [PubMed] [Google Scholar]
  5. Bartlett EL, Wang X. Long-lasting modulation by stimulus context in primate auditory cortex. J Neurophysiol. 2005;94:83–104. doi: 10.1152/jn.01124.2004. [DOI] [PubMed] [Google Scholar]
  6. Basile BM, Hampton RR. Dissociation of active working memory and passive recognition in rhesus monkeys. Cognition. 2013;126:391–396. doi: 10.1016/j.cognition.2012.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Baylis GC, Rolls ET. Responses of neurons in the inferior temporal cortex in short term and serial recognition memory tasks. Exp Brain Res. 1987;65:614–622. doi: 10.1007/BF00235984. [DOI] [PubMed] [Google Scholar]
  8. Belmonte JC, Callaway EM, Churchland P, Caddick SJ, Feng G, Homanics GE, Lee KF, Leopold DA, Miller CT, Mitchell JF, Mitalipov S, Moutri AR, Movshon JA, Okano H, Reynolds JH, Ringach D, Sejnowski TJ, Silva AC, Strick PL, Wu J, Zhang F. Brains, Genes, and Primates. Neuron. 2015;86:617–631. doi: 10.1016/j.neuron.2015.03.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Bendor D, Wang X. Neural response properties of primary, rostral, and rostrotemporal core fields in the auditory cortex of marmoset monkeys. J Neurophysiol. 2008;100:888–906. doi: 10.1152/jn.00884.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Bigelow J, Poremba A. Auditory memory in monkeys: costs and benefits of proactive interference. Am J Primatol. 2013a;75:425–434. doi: 10.1002/ajp.22076. [DOI] [PubMed] [Google Scholar]
  11. Bigelow J, Poremba A. Auditory proactive interference in monkeys: the roles of stimulus set size and intertrial interval. Learn Behav. 2013b;41:319–332. doi: 10.3758/s13420-013-0107-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bigelow J, Poremba A. Achilles' ear? Inferior human short-term and recognition memory in the auditory modality. PLoS One. 2014;9:e89914. doi: 10.1371/journal.pone.0089914. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bigelow J, Rossi B, Poremba A. Neural correlates of short-term memory in primate auditory cortex. Front Neurosci. 2014;8:250. doi: 10.3389/fnins.2014.00250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bigelow J, Poremba A. Item-nonspecific proactive interference in monkeys' auditory short-term memory. Hear Res. 2015;327:69–77. doi: 10.1016/j.heares.2015.05.002. [DOI] [PubMed] [Google Scholar]
  15. Bizley JK, Walker KM, Silverman BW, King AJ, Schnupp JW. Interdependent encoding of pitch, timbre, and spatial location in auditory cortex. J Neurosci. 2009;29:2064–2075. doi: 10.1523/JNEUROSCI.4755-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Boersma P, Weenink D. Praat: doing phonetics by computer. 2012 Vol., ed.^eds. [Google Scholar]
  17. Bornkessel-Schlesewsky I, Schlesewsky M, Small SL, Rauschecker JP. Neurobiological roots of language in primate audition: common computational properties. Trends Cogn Sci. 2015a;19:142–150. doi: 10.1016/j.tics.2014.12.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Bornkessel-Schlesewsky I, Schlesewsky M, Small SL, Rauschecker JP. Response to Skeide and Friederici: the myth of the uniquely human 'direct' dorsal pathway. Trends Cogn Sci. 2015b doi: 10.1016/j.tics.2015.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Brosch M, Selezneva E, Bucks C, Scheich H. Macaque monkeys discriminate pitch relationships. Cognition. 2004;91:259–272. doi: 10.1016/j.cognition.2003.09.005. [DOI] [PubMed] [Google Scholar]
  20. Brosch M, Selezneva E, Scheich H. Nonauditory events of a behavioral procedure activate auditory cortex of highly trained monkeys. J Neurosci. 2005;25:6797–6806. doi: 10.1523/JNEUROSCI.1571-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Brosch M, Scheich H. Tone-sequence analysis in the auditory cortex of awake macaque monkeys. Exp Brain Res. 2008;184:349–361. doi: 10.1007/s00221-007-1109-7. [DOI] [PubMed] [Google Scholar]
  22. Brown MW, Wilson FA, Riches IP. Neuronal evidence that inferomedial temporal cortex is more important than hippocampus in certain processes underlying recognition memory. Brain Res. 1987;409:158–162. doi: 10.1016/0006-8993(87)90753-0. [DOI] [PubMed] [Google Scholar]
  23. Buchsbaum BR, D'Esposito M. Repetition suppression and reactivation in auditory-verbal short-term recognition memory. Cereb Cortex. 2009;19:1474–1485. doi: 10.1093/cercor/bhn186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Caclin A, Brattico E, Tervaniemi M, Naatanen R, Morlet D, Giard MH, McAdams S. Separate neural processing of timbre dimensions in auditory sensory memory. J Cogn Neurosci. 2006;18:1959–1972. doi: 10.1162/jocn.2006.18.12.1959. [DOI] [PubMed] [Google Scholar]
  25. Catani M, Mesulam M. The arcuate fasciculus and the disconnection theme in language and aphasia: history and current state. Cortex. 2008;44:953–961. doi: 10.1016/j.cortex.2008.04.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Clement S, Demany L, Semal C. Memory for pitch versus memory for loudness. J Acoust Soc Am. 1999;106:2805–2811. doi: 10.1121/1.428106. [DOI] [PubMed] [Google Scholar]
  27. Cohen MA, Horowitz TS, Wolfe JM. Auditory recognition memory is inferior to visual recognition memory. Proc Natl Acad Sci U S A. 2009;106:6008–6010. doi: 10.1073/pnas.0811884106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Colombo M, D'Amato MR. A comparison of visual and auditory short-term memory in monkeys (Cebus apella) Q J Exp Psychol B. 1986;38:425–448. [PubMed] [Google Scholar]
  29. Colombo M, D'Amato MR, Rodman HR, Gross CG. Auditory association cortex lesions impair auditory short-term memory in monkeys. Science. 1990;247:336–338. doi: 10.1126/science.2296723. [DOI] [PubMed] [Google Scholar]
  30. Colombo M, Graziano M. Effects of auditory and visual interference on auditory-visual delayed matching to sample in monkeys (Macaca fascicularis) Behav Neurosci. 1994;108:636–639. doi: 10.1037//0735-7044.108.3.636. [DOI] [PubMed] [Google Scholar]
  31. Colombo M, Gross CG. Responses of inferior temporal cortex and hippocampal neurons during delayed matching to sample in monkeys (Macaca fascicularis) Behav Neurosci. 1994;108:443–455. doi: 10.1037//0735-7044.108.3.443. [DOI] [PubMed] [Google Scholar]
  32. Colombo M, Rodman HR, Gross CG. The effects of superior temporal cortex lesions on the processing and retention of auditory information in monkeys (Cebus apella) J Neurosci. 1996;16:4501–4517. doi: 10.1523/JNEUROSCI.16-14-04501.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Cowan N. On short and long auditory stores. Psychol Bull. 1984;96:341–370. [PubMed] [Google Scholar]
  34. Cowan N. Evolving conceptions of memory storage, selective attention, and their mutual constraints within the human information-processing system. Psychol Bull. 1988;104:163–191. doi: 10.1037/0033-2909.104.2.163. [DOI] [PubMed] [Google Scholar]
  35. Cowan N. What are the differences between long-term, short-term, and working memory? Prog Brain Res. 2008;169:323–338. doi: 10.1016/S0079-6123(07)00020-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Cowey A, Weiskrantz L. Auditory sequence discrimination in Macaca mulatta: the role of the superior temporal cortex. Neuropsychologia. 1976;14:1–10. doi: 10.1016/0028-3932(76)90002-6. [DOI] [PubMed] [Google Scholar]
  37. de Gardelle V, Waszczuk M, Egner T, Summerfield C. Concurrent repetition enhancement and suppression responses in extrastriate visual cortex. Cereb Cortex. 2013;23:2235–2244. doi: 10.1093/cercor/bhs211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Demany L, Semal C. The Role of Memory in Auditory Perception. In: Yost WA, Poppe AN, Fay RR, editors. Auditory Perception of Sound Sources. Springer Handbook of Auditory Research. Vol. 29. New York: Springer US; 2007. pp. 77–113. ^eds. [Google Scholar]
  39. Deutsch D. Tones and numbers: specificity of interference in immediate memory. Science. 1970;168:1604–1605. doi: 10.1126/science.168.3939.1604. [DOI] [PubMed] [Google Scholar]
  40. Deutsch D. Mapping of interactions in the pitch memory store. Science. 1972;175:1020–1022. doi: 10.1126/science.175.4025.1020. [DOI] [PubMed] [Google Scholar]
  41. Eskandar EN, Richmond BJ, Optican LM. Role of inferior temporal neurons in visual memory. I. Temporal encoding of information about visual images, recalled images, and behavioral context. J Neurophysiol. 1992;68:1277–1295. doi: 10.1152/jn.1992.68.4.1277. [DOI] [PubMed] [Google Scholar]
  42. Fishman YI, Steinschneider M. Searching for the mismatch negativity in primary auditory cortex of the awake monkey: deviance detection or stimulus specific adaptation? J Neurosci. 2012;32:15747–15758. doi: 10.1523/JNEUROSCI.2835-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Friston K. A theory of cortical responses. Philos Trans R Soc Lond B Biol Sci. 2005;360:815–836. doi: 10.1098/rstb.2005.1622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Fritz J, Mishkin M, Saunders RC. In search of an auditory engram. Proc Natl Acad Sci U S A. 2005;102:9359–9364. doi: 10.1073/pnas.0503998102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Fuster JM. Unit activity in prefrontal cortex during delayed-response performance: neuronal correlates of transient memory. J Neurophysiol. 1973;36:61–78. doi: 10.1152/jn.1973.36.1.61. [DOI] [PubMed] [Google Scholar]
  46. Fuster JM, Bauer RH, Jervey JP. Effects of cooling inferotemporal cortex on performance of visual memory tasks. Exp Neurol. 1981;71:398–409. doi: 10.1016/0014-4886(81)90098-4. [DOI] [PubMed] [Google Scholar]
  47. Fuster JM, Jervey JP. Neuronal firing in the inferotemporal cortex of the monkey in a visual memory task. J Neurosci. 1982;2:361–375. doi: 10.1523/JNEUROSCI.02-03-00361.1982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Fuster JM, Bauer RH, Jervey JP. Functional interactions between inferotemporal and prefrontal cortex in a cognitive task. Brain Res. 1985;330:299–307. doi: 10.1016/0006-8993(85)90689-4. [DOI] [PubMed] [Google Scholar]
  49. Fuster JM. Memory in the Cerebral Cortex. Vol. Cambridge, MA: MIT Press; 1995. [Google Scholar]
  50. Fuster JM, Bodner M, Kroger JK. Cross-modal and cross-temporal association in neurons of frontal cortex. Nature. 2000;405:347–351. doi: 10.1038/35012613. [DOI] [PubMed] [Google Scholar]
  51. Gaffan D, Harrison S. Auditory-visual associations, hemispheric specialization and temporal-frontal interaction in the rhesus monkey. Brain. 1991;114(Pt 5):2133–2144. doi: 10.1093/brain/114.5.2133. [DOI] [PubMed] [Google Scholar]
  52. Galantucci B, Fowler CA, Turvey MT. The motor theory of speech perception reviewed. Psychon Bull Rev. 2006;13:361–377. doi: 10.3758/bf03193857. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Ghazanfar AA, Logothetis NK. Neuroperception: facial expressions linked to monkey calls. Nature. 2003;423:937–938. doi: 10.1038/423937a. [DOI] [PubMed] [Google Scholar]
  54. Gil-da-Costa R, Palleroni A, Hauser MD, Touchton J, Kelley JP. Rapid acquisition of an alarm response by a neotropical primate to a newly introduced avian predator. Proceedings of the Royal Society B-Biological Sciences. 2003;270:605–610. doi: 10.1098/rspb.2002.2281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Gottlieb Y, Vaadia E, Abeles M. Single unit activity in the auditory cortex of a monkey performing a short term memory task. Exp Brain Res. 1989;74:139–148. doi: 10.1007/BF00248287. [DOI] [PubMed] [Google Scholar]
  56. Gouzoules S, Gouzoules H, Marler P. Rhesus-Monkey (Macaca-Mulatta) Screams - Representational Signaling in the Recruitment of Agonistic Aid. Animal Behaviour. 1984;32:182-&. [Google Scholar]
  57. Grill-Spector K, Henson R, Martin A. Repetition and the brain: neural models of stimulus-specific effects. Trends Cogn Sci. 2006;10:14–23. doi: 10.1016/j.tics.2005.11.006. [DOI] [PubMed] [Google Scholar]
  58. Grimault S, Nolden S, Lefebvre C, Vachon F, Hyde K, Peretz I, Zatorre R, Robitaille N, Jolicoeur P. Brain activity is related to individual differences in the number of items stored in auditory short-term memory for pitch: evidence from magnetoencephalography. Neuroimage. 2014;94:96–106. doi: 10.1016/j.neuroimage.2014.03.020. [DOI] [PubMed] [Google Scholar]
  59. Guimond S, Vachon F, Nolden S, Lefebvre C, Grimault S, Jolicoeur P. Electrophysiological correlates of the maintenance of the representation of pitch objects in acoustic short-term memory. Psychophysiology. 2011;48:1500–1509. doi: 10.1111/j.1469-8986.2011.01234.x. [DOI] [PubMed] [Google Scholar]
  60. Habbershon HM, Ahmed SZ, Cohen YE. Rhesus macaques recognize unique multimodal face-voice relations of familiar individuals and not of unfamiliar ones. Brain Behav Evol. 2013;81:219–225. doi: 10.1159/000351203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Hackett TA. Information flow in the auditory cortical network. Hear Res. 2011;271:133–146. doi: 10.1016/j.heares.2010.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Harris JD. The decline of pitch discrimination with time. J Exp Psychol. 1952;43:96–99. doi: 10.1037/h0057373. [DOI] [PubMed] [Google Scholar]
  63. Harrison SA, Tong F. Decoding reveals the contents of visual working memory in early visual areas. Nature. 2009;458:632–635. doi: 10.1038/nature07832. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Hashiya K, Kojima S. Acquisition of auditory-visual intermodal matching-to-sample by a chimpanzee (Pan troglodytes): comparison with visual-visual intramodal matching. Anim Cogn. 2001;4:231–239. doi: 10.1007/s10071-001-0118-3. [DOI] [PubMed] [Google Scholar]
  65. Hasson U, Chen J, Honey CJ. Hierarchical process memory: memory as an integral component of information processing. Trends Cogn Sci. 2015;19:304–313. doi: 10.1016/j.tics.2015.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Hickok G, Poeppel D. The cortical organization of speech processing. Nat Rev Neurosci. 2007;8:393–402. doi: 10.1038/nrn2113. [DOI] [PubMed] [Google Scholar]
  67. Hwang J, Romanski LM. Prefrontal neuronal responses during audiovisual mnemonic processing. J Neurosci. 2015;35:960–971. doi: 10.1523/JNEUROSCI.1328-14.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Jackson LL, Heffner RS, Heffner HE. Free-field audiogram of the Japanese macaque (Macaca fuscata) J Acoust Soc Am. 1999;106:3017–3023. doi: 10.1121/1.428121. [DOI] [PubMed] [Google Scholar]
  69. Jacob SN, Nieder A. Complementary roles for primate frontal and parietal cortex in guarding working memory from distractor stimuli. Neuron. 2014;83:226–237. doi: 10.1016/j.neuron.2014.05.009. [DOI] [PubMed] [Google Scholar]
  70. Joly O, Baumann S, Poirier C, Patterson RD, Thiele A, Griffiths TD. A perceptual pitch boundary in a non-human primate. Front Psychol. 2014;5:998. doi: 10.3389/fpsyg.2014.00998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Joseph S, Kumar S, Husain M, Griffiths TD. Auditory working memory for objects vs. features. Front Neurosci. 2015;9:13. doi: 10.3389/fnins.2015.00013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Kaiser J. Dynamics of auditory working memory. Front Psychol. 2015;6:613. doi: 10.3389/fpsyg.2015.00613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Karabanov AN, Paine R, Chao CC, Schulze K, Scott B, Hallett M, Mishkin M. Participation of the classical speech areas in auditory long-term memory. PLoS One. 2015;10:e0119472. doi: 10.1371/journal.pone.0119472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Kikuchi Y, Horwitz B, Mishkin M. Hierarchical auditory processing directed rostrally along the monkey's supratemporal plane. J Neurosci. 2010;30:13021–13030. doi: 10.1523/JNEUROSCI.2267-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Kojima S. Auditory short-term memory in the Japanese monkey. Int J Neurosci. 1985;25:255–262. doi: 10.3109/00207458508985378. [DOI] [PubMed] [Google Scholar]
  76. Krekelberg B, Boynton GM, van Wezel RJ. Adaptation: from single cells to BOLD signals. Trends Neurosci. 2006;29:250–256. doi: 10.1016/j.tins.2006.02.008. [DOI] [PubMed] [Google Scholar]
  77. Lewandowsky S, Oberauer K, Brown GD. No temporal decay in verbal short-term memory. Trends Cogn Sci. 2009;13:120–126. doi: 10.1016/j.tics.2008.12.003. [DOI] [PubMed] [Google Scholar]
  78. Lewis-Peacock JA, Postle BR. Temporary activation of long-term memory supports working memory. J Neurosci. 2008;28:8765–8771. doi: 10.1523/JNEUROSCI.1953-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Liberman AM, Mattingly IG. The motor theory of speech perception revised. Cognition. 1985;21:1–36. doi: 10.1016/0010-0277(85)90021-6. [DOI] [PubMed] [Google Scholar]
  80. Linke AC, Vicente-Grabovetsky A, Cusack R. Stimulus-specific suppression preserves information in auditory short-term memory. Proc Natl Acad Sci U S A. 2011;108:12961–12966. doi: 10.1073/pnas.1102118108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Linke AC, Cusack R. Flexible Information Coding in Human Auditory Cortex during Perception, Imagery, and STM of Complex Sounds. J Cogn Neurosci. 2015;27:1322–1333. doi: 10.1162/jocn_a_00780. [DOI] [PubMed] [Google Scholar]
  82. Lu ZL, Williamson SJ, Kaufman L. Human auditory primary and association cortex have differing lifetimes for activation traces. Brain Res. 1992;572:236–241. doi: 10.1016/0006-8993(92)90475-o. [DOI] [PubMed] [Google Scholar]
  83. Machens CK, Romo R, Brody CD. Functional, but not anatomical, separation of "what" and "when" in prefrontal cortex. J Neurosci. 2010;30:350–360. doi: 10.1523/JNEUROSCI.3276-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Mante V, Sussillo D, Shenoy KV, Newsome WT. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature. 2013;503:78–84. doi: 10.1038/nature12742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. May PJ, Tiitinen H. Mismatch negativity (MMN), the deviance-elicited auditory deflection, explained. Psychophysiology. 2010;47:66–122. doi: 10.1111/j.1469-8986.2009.00856.x. [DOI] [PubMed] [Google Scholar]
  86. Meyers EM, Freedman DJ, Kreiman G, Miller EK, Poggio T. Dynamic population coding of category information in inferior temporal and prefrontal cortex. J Neurophysiol. 2008;100:1407–1419. doi: 10.1152/jn.90248.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Miller EK, Li L, Desimone R. Activity of neurons in anterior inferior temporal cortex during a short-term memory task. J Neurosci. 1993;13:1460–1478. doi: 10.1523/JNEUROSCI.13-04-01460.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Miller EK, Desimone R. Parallel neuronal mechanisms for short-term memory. Science. 1994;263:520–522. doi: 10.1126/science.8290960. [DOI] [PubMed] [Google Scholar]
  89. Miller EK, Erickson CA, Desimone R. Neural mechanisms of visual working memory in prefrontal cortex of the macaque. J Neurosci. 1996;16:5154–5167. doi: 10.1523/JNEUROSCI.16-16-05154.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Mishkin M. Memory in monkeys severely impaired by combined but not by separate removal of amygdala and hippocampus. Nature. 1978;273:297–298. doi: 10.1038/273297a0. [DOI] [PubMed] [Google Scholar]
  91. Mishkin M. A memory system in the monkey. Philos Trans R Soc Lond B Biol Sci. 1982;298:83–95. doi: 10.1098/rstb.1982.0074. [DOI] [PubMed] [Google Scholar]
  92. Miyashita Y, Chang HS. Neuronal correlate of pictorial short-term memory in the primate temporal cortex. Nature. 1988;331:68–70. doi: 10.1038/331068a0. [DOI] [PubMed] [Google Scholar]
  93. Munoz-Lopez M, Mohedano-Moriano A, Insausti R. Anatomical Pathways for Auditory Memory in Primates. Frontiers in Neuroanatomy. 2010;4:129. doi: 10.3389/fnana.2010.00129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Munoz-Lopez M, Insausti R, Mohedano-Moriano A, Mishkin M, Saunders RC. Anatomical pathways for auditory memory II: information from rostral superior temporal gyrus to dorsolateral temporal pole and medial temporal cortex. Front Neurosci. 2015;9:158. doi: 10.3389/fnins.2015.00158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Murray EA, Mishkin M. Severe tactual memory deficits in monkeys after combined removal of the amygdala and hippocampus. Brain Research. 1983;270:340–344. doi: 10.1016/0006-8993(83)90610-8. [DOI] [PubMed] [Google Scholar]
  96. Naatanen R, Winkler I. The concept of auditory stimulus representation in cognitive neuroscience. Psychol Bull. 1999;125:826–859. doi: 10.1037/0033-2909.125.6.826. [DOI] [PubMed] [Google Scholar]
  97. Nelken I, Ulanovsky N. Mismatch negativity and stimulus-specific adaptation in animal models. Journal of Psychophysiology. 2007;21:214–223. [Google Scholar]
  98. Ng CW. Psychology. University of Iowa; 2011. Behavioral and neural correlates of auditory encoding and memory functions in Rhesus Macaques. Vol. Ph.D., ed.^eds. [Google Scholar]
  99. Ng CW, Plakke B, Poremba A. Neural correlates of auditory recognition memory in the primate dorsal temporal pole. J Neurophysiol. 2014;111:455–469. doi: 10.1152/jn.00401.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Norman DA. Toward a Theory of Memory and Attention. Psychological Review. 1968;75:522-&. [Google Scholar]
  101. Norman-Haignere S, Kanwisher N, McDermott JH. Cortical pitch regions in humans respond primarily to resolved harmonics and are located in specific tonotopic regions of anterior auditory cortex. J Neurosci. 2013;33:19451–19469. doi: 10.1523/JNEUROSCI.2880-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Nousak JM, Deacon D, Ritter W, Vaughan HG., Jr Storage of information in transient auditory memory. Brain Res Cogn Brain Res. 1996;4:305–317. doi: 10.1016/s0926-6410(96)00068-7. [DOI] [PubMed] [Google Scholar]
  103. Olsson H, Poom L. Visual memory needs categories. Proc Natl Acad Sci U S A. 2005;102:8776–8780. doi: 10.1073/pnas.0500810102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Pasternak T, Greenlee MW. Working memory in primate sensory systems. Nat Rev Neurosci. 2005;6:97–107. doi: 10.1038/nrn1603. [DOI] [PubMed] [Google Scholar]
  105. Payne C, Bachevalier J. Crossmodal integration of conspecific vocalizations in rhesus macaques. PLoS One. 2013;8:e81825. doi: 10.1371/journal.pone.0081825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Perez-Gonzalez D, Malmierca MS. Adaptation in the auditory system: an overview. Front Integr Neurosci. 2014;8:19. doi: 10.3389/fnint.2014.00019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Petrides M, Pandya DN. Distinct parietal and temporal pathways to the homologues of Broca's area in the monkey. PLoS Biol. 2009;7:e1000170. doi: 10.1371/journal.pbio.1000170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Plakke B, Ng CW, Poremba A. Neural correlates of auditory recognition memory in primate lateral prefrontal cortex. Neuroscience. 2013;244:62–76. doi: 10.1016/j.neuroscience.2013.04.002. [DOI] [PubMed] [Google Scholar]
  109. Plakke B, Romanski LM. Auditory connections and functions of prefrontal cortex. Front Neurosci. 2014;8:199. doi: 10.3389/fnins.2014.00199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Plakke B, Hwang J, Romanski LM. Inactivation of Primate Prefrontal Cortex Impairs Auditory and Audiovisual Working Memory. J Neurosci. 2015;35:9666–9675. doi: 10.1523/JNEUROSCI.1218-15.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Postle BR. Working memory as an emergent property of the mind and brain. Neuroscience. 2006;139:23–38. doi: 10.1016/j.neuroscience.2005.06.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  112. Rauschecker JP, Scott SK. Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing. Nat Neurosci. 2009;12:718–724. doi: 10.1038/nn.2331. [DOI] [PMC free article] [PubMed] [Google Scholar]
  113. Recasens M, Leung S, Grimm S, Nowak R, Escera C. Repetition suppression and repetition enhancement underlie auditory memory-trace formation in the human brain: an MEG study. Neuroimage. 2015;108:75–86. doi: 10.1016/j.neuroimage.2014.12.031. [DOI] [PubMed] [Google Scholar]
  114. Rendall D, Rodman PS, Emond RE. Vocal recognition of individuals and kin in free-ranging rhesus monkeys. Animal Behaviour. 1996;51:1007–1015. [Google Scholar]
  115. Rigotti M, Barak O, Warden MR, Wang XJ, Daw ND, Miller EK, Fusi S. The importance of mixed selectivity in complex cognitive tasks. Nature. 2013;497:585–590. doi: 10.1038/nature12160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  116. Rilling JK, Glasser MF, Preuss TM, Ma X, Zhao T, Hu X, Behrens TE. The evolution of the arcuate fasciculus revealed with comparative DTI. Nat Neurosci. 2008;11:426–428. doi: 10.1038/nn2072. [DOI] [PubMed] [Google Scholar]
  117. Romanski LM, Bates JF, Goldman-Rakic PS. Auditory belt and parabelt projections to the prefrontal cortex in the rhesus monkey. J Comp Neurol. 1999a;403:141–157. doi: 10.1002/(sici)1096-9861(19990111)403:2<141::aid-cne1>3.0.co;2-v. [DOI] [PubMed] [Google Scholar]
  118. Romanski LM, Tian B, Fritz J, Mishkin M, Goldman-Rakic PS, Rauschecker JP. Dual streams of auditory afferents target multiple domains in the primate prefrontal cortex. Nat Neurosci. 1999b;2:1131–1136. doi: 10.1038/16056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  119. Romanski LM. Integration of faces and vocalizations in ventral prefrontal cortex: implications for the evolution of audiovisual speech. Proc Natl Acad Sci U S A. 2012;109(Suppl 1):10717–10724. doi: 10.1073/pnas.1204335109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  120. Romo R, Salinas E. Flutter discrimination: neural codes, perception, memory and decision making. Nat Rev Neurosci. 2003;4:203–218. doi: 10.1038/nrn1058. [DOI] [PubMed] [Google Scholar]
  121. Rong F, Holroyd T, Husain FT, Contreras-Vidal JL, Horwitz B. Task-specific modulation of human auditory evoked response in a delayed-match-to-sample task. Front Psychol. 2011;2:85. doi: 10.3389/fpsyg.2011.00085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  122. Russ BE, Orr LE, Cohen YE. Prefrontal neurons predict choices during an auditory same-different task. Curr Biol. 2008;18:1483–1488. doi: 10.1016/j.cub.2008.08.054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  123. Saults JS, Cowan N. A central capacity limit to the simultaneous storage of visual and auditory arrays in working memory. J Exp Psychol Gen. 2007;136:663–684. doi: 10.1037/0096-3445.136.4.663. [DOI] [PMC free article] [PubMed] [Google Scholar]
  124. Sawamura H, Orban GA, Vogels R. Selectivity of neuronal adaptation does not match response selectivity: a single-cell study of the FMRI adaptation paradigm. Neuron. 2006;49:307–318. doi: 10.1016/j.neuron.2005.11.028. [DOI] [PubMed] [Google Scholar]
  125. Scott BH, Malone BJ, Semple MN. Transformation of temporal processing across auditory cortex of awake macaques. J Neurophysiol. 2011;105:712–730. doi: 10.1152/jn.01120.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  126. Scott BH, Mishkin M, Yin P. Monkeys have a limited form of short-term memory in audition. Proc Natl Acad Sci U S A. 2012;109:12237–12241. doi: 10.1073/pnas.1209685109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  127. Scott BH, Mishkin M, Yin P. Effect of acoustic similarity on short-term auditory memory in the monkey. Hear Res. 2013;298:36–48. doi: 10.1016/j.heares.2013.01.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  128. Scott BH, Mishkin M, Yin P. Neural correlates of auditory short-term memory in rostral superior temporal cortex. Curr Biol. 2014;24:2767–2775. doi: 10.1016/j.cub.2014.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  129. Segaert K, Weber K, de Lange FP, Petersson KM, Hagoort P. The suppression of repetition enhancement: a review of fMRI studies. Neuropsychologia. 2013;51:59–66. doi: 10.1016/j.neuropsychologia.2012.11.006. [DOI] [PubMed] [Google Scholar]
  130. Semal C, Demany L. Dissociation of Pitch from Timbre in Auditory Short-Term-Memory. Journal of the Acoustical Society of America. 1991;89:2404–2410. doi: 10.1121/1.400928. [DOI] [PubMed] [Google Scholar]
  131. Serences JT, Ester EF, Vogel EK, Awh E. Stimulus-specific delay activity in human primary visual cortex. Psychol Sci. 2009;20:207–214. doi: 10.1111/j.1467-9280.2009.02276.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  132. Seyfarth RM, Cheney DL, Marler P. Monkey responses to three different alarm calls: evidence of predator classification and semantic communication. Science. 1980;210:801–803. doi: 10.1126/science.7433999. [DOI] [PubMed] [Google Scholar]
  133. Seyfarth RM, Cheney DL. The evolution of language from social cognition. Curr Opin Neurobiol. 2014;28:5–9. doi: 10.1016/j.conb.2014.04.003. [DOI] [PubMed] [Google Scholar]
  134. Siebert E, Parr L, Mishkin M, Saunders RC. Chimpanzees, like monkeys, appear to lack long-term auditory memory. Society for Neuroscience. 2013 Abstracts 768.08. [Google Scholar]
  135. Skeide MA, Friederici AD. Response to Bornkessel-Schlesewsky et al. - towards a nonhuman primate model of language? Trends Cogn Sci. 2015 doi: 10.1016/j.tics.2015.05.011. [DOI] [PubMed] [Google Scholar]
  136. Sliwa J, Duhamel JR, Pascalis O, Wirth S. Spontaneous voice-face identity matching by rhesus monkeys for familiar conspecifics and humans. Proc Natl Acad Sci U S A. 2011;108:1735–1740. doi: 10.1073/pnas.1008169108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  137. Sobotka S, Ringo JL. Investigation of long-term recognition and association memory in unit responses from inferotemporal cortex. Exp Brain Res. 1993;96:28–38. doi: 10.1007/BF00230436. [DOI] [PubMed] [Google Scholar]
  138. Sreenivasan KK, Curtis CE, D'Esposito M. Revisiting the role of persistent neural activity during working memory. Trends Cogn Sci. 2014;18:82–89. doi: 10.1016/j.tics.2013.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  139. Stebbins WC, Moody DB. How monkeys hear the world: auditory perception in nonhuman primates. In: Fay RR, Popper AN, editors. Comparative Hearing: Mammals. Springer Handbook of Auditory Research, Vol. New York: Springer-Verlag; 1994. pp. 97–133. ^eds. [Google Scholar]
  140. Sugase-Miyamoto Y, Liu Z, Wiener MC, Optican LM, Richmond BJ. Short-term memory trace in rapidly adapting synapses of inferior temporal cortex. PLoS Comput Biol. 2008;4:e1000073. doi: 10.1371/journal.pcbi.1000073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  141. Suzuki WL, Amaral DG. Perirhinal and parahippocampal cortices of the macaque monkey: Cortical afferents. The Journal of Comparative Neurology. 1994;350:497–533. doi: 10.1002/cne.903500402. [DOI] [PubMed] [Google Scholar]
  142. Ulanovsky N, Las L, Nelken I. Processing of low-probability sounds by cortical neurons. Nat Neurosci. 2003;6:391–398. doi: 10.1038/nn1032. [DOI] [PubMed] [Google Scholar]
  143. Ulanovsky N, Las L, Farkas D, Nelken I. Multiple time scales of adaptation in auditory cortex neurons. J Neurosci. 2004;24:10440–10453. doi: 10.1523/JNEUROSCI.1905-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  144. Visscher KM, Kaplan E, Kahana MJ, Sekuler R. Auditory short-term memory behaves like visual short-term memory. PLoS Biol. 2007;5:e56. doi: 10.1371/journal.pbio.0050056. [DOI] [PMC free article] [PubMed] [Google Scholar]
  145. Wegener JG. Auditory discrimination behavior of normal monkeys. Journal of Auditory Research. 1964;4:81–106. [Google Scholar]
  146. Weinberger NM. Specific long-term memory traces in primary auditory cortex. Nat Rev Neurosci. 2004;5:279–290. doi: 10.1038/nrn1366. [DOI] [PMC free article] [PubMed] [Google Scholar]
  147. Weinberger NM. New perspectives on the auditory cortex: learning and memory. In: Celesia GG, Hickok G, editors. The Human Auditory System. Handbook of Clinical Neurology. Vol. 129. Amsterdam: Elsevier B.V.; 2015. pp. 117–147. ^eds. [DOI] [PubMed] [Google Scholar]
  148. Werner-Reiss U, Porter KK, Underhill AM, Groh JM. Long lasting attenuation by prior sounds in auditory cortex of awake primates. Exp Brain Res. 2006;168:272–276. doi: 10.1007/s00221-005-0184-x. [DOI] [PubMed] [Google Scholar]
  149. Wittig JH, Jr, Richmond BJ. Monkeys rely on recency of stimulus repetition when solving short-term memory tasks. Learn Mem. 2014;21:325–333. doi: 10.1101/lm.034181.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  150. Wright AA. Auditory list memory and interference processes in monkeys. J Exp Psychol Anim Behav Process. 1999;25:284–296. [PubMed] [Google Scholar]
  151. Wright AA. An experimental analysis of memory processing. J Exp Anal Behav. 2007;88:405–433. doi: 10.1901/jeab.2007.88-405. [DOI] [PMC free article] [PubMed] [Google Scholar]
  152. Xiong Q, Znamenskiy P, Zador AM. Selective corticostriatal plasticity during acquisition of an auditory discrimination task. Nature. 2015;521:348–351. doi: 10.1038/nature14225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  153. Yakovlev V, Amit Y, Hochstein S. It's hard to forget: resetting memory in delay-match-to-multiple-image tasks. Front Hum Neurosci. 2013;7:765. doi: 10.3389/fnhum.2013.00765. [DOI] [PMC free article] [PubMed] [Google Scholar]
  154. Zatorre RJ, Evans AC, Meyer E. Neural mechanisms underlying melodic perception and memory for pitch. J Neurosci. 1994;14:1908–1919. doi: 10.1523/JNEUROSCI.14-04-01908.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES