Abstract
Linguistic meaning has long been recognized to be highly context-dependent. Quantifiers like many and some provide a particularly clear example of context-dependence. For example, the interpretation of quantifiers requires listeners to determine the relevant domain and scale. We focus on another type of context-dependence that quantifiers share with other lexical items: talker variability. Different talkers might use quantifiers with different interpretations in mind. We used a web-based crowdsourcing paradigm to study participants’ expectations about the use of many and some based on recent exposure. We first established that the mapping of some and many onto quantities (candies in a bowl) is variable both within and between participants. We then examined whether and how listeners’ expectations about quantifier use adapt with exposure to talkers who use quantifiers in different ways. The results demonstrate that listeners can adapt to talker-specific biases in both how often and with what intended meaning many and some are used.
Keywords: adaptation, talker-specificity, quantifiers, semantics, pragmatics
Introduction
The meaning of many, if not all, words is context-dependent. For example, whether we want to say that John is tall depends on whether John is being compared to other boys his age, professional basketball players, dwarves, etc. (e.g., Halff, Ortony, & Anderson, 1976; Kamp, 1995; Kennedy & McNally, 2005; Klein, 1980). Other words whose interpretation requires reference to context are pronouns and quantifiers (Bach, 2012). For example, the interpretation of a quantifier like many depends on the class of objects that is being quantified over: the number of crumbs that many crumbs refers to is judged to be higher than the number of mountains that many mountains refers to (Hörmann, 1983).
A less-well studied aspect of context-dependence is how a given talker uses quantifiers like many and some. Talkers exhibit individual variability at just about any linguistic level investigated –including, for example, pronunciation (e.g., Allen, Miller, & DeSteno, 2003; Bauer, 1985; Harrington, Palethorpe, & Watson, 2000; Yaeger-Dror, 1994), lexical preferences (e.g., Finegan & Biber, 2001; Roland, Dick, & Elman, 2007; Tagliamonte & Smith, 2005), and syntactic preferences (e.g., the frequency with which they use passives, Weiner & Labov, 1983). Therefore, talkers are also likely to differ in how they use quantifiers. For example, talkers may differ in how many crumbs they consider to be many crumbs, and these differences would consequently be reflected in their productions. In this case, listeners would be well served by taking into account talker-specific knowledge in order to successfully infer what the talker intended to convey.
Talker-specific knowledge has been observed experimentally in cases of variation in pronunciation and syntactic production (e.g., Clayards, Tanenhaus, Aslin, & Jacobs, 2008; Creel & Bregman, 2011; Creel & Tumlin, 2009; Fine, Jaeger, Farmer, & Qian, 2013; Kamide, 2012; Kraljic & Samuel, 2007). While this question has received less attention in lexical processing, there is some evidence that listeners can learn to anticipate talker-specific biases in the frequency with which referents are being referred to (Metzing & Brennan, 2003) and that these talker-specific expectations are reflected in online processing (e.g., Creel, Aslin, & Tanenhaus, 2008). These studies complement classic work on conceptual pacts in which interlocutors adjust their use of referential expressions to create temporary, shared context-specific names (Brennan & Clark, 1996).
Previous work on talker-specific lexical expectations has focused on open class, semantically rich, content words – typically nouns (Brennan & Clark, 1996; Creel et al., 2008; Metzing & Brennan, 2003). This raises the question of whether listeners are capable of adapting to talker-specific differences in the use of words that convey more abstract meanings, such as those of quantifiers. If listeners do in fact adapt to talker-specific differences, what specifically are listeners adapting to, i.e., what is the nature of the representations that are being updated and what are the underlying mechanisms?
The current paper begins to address these questions by studying adaptation to talker-specific differences in the use of the quantifiers some and many. We present four experiments that investigate lexical adaptation. Taken together, these experiments establish i) that listeners can adapt to talker-specific differences in the usage of even abstract lexical items, such as quantifiers; ii) that, provided sufficient exposure, such adaptation can be achieved even for multiple talkers simultaneously; iii) that lexical adaptation is observed both to talker-specific differences in the frequency with which lexical items are used and to talker-specific differences in how they are being used; and thus, finally, iv) that lexical adaptation, although often studied as a separate phenomenon, exhibits many of the hallmarks of adaptation observed for other linguistic domains. Next, we elaborate on these points, while introducing the four experiments presented below. In doing so, we relate our research to previous work and highlight the contributions of the current work.
Before we investigate lexical adaptation to talker-specific quantifier use, we first assess whether the premise for adaptation holds: Experiment 1 demonstrates that listeners differ in their initial expectations about a talker’s use of a variety of quantifiers, including some and many. This shows that if listeners want to arrive at an interpretation of an utterance that is close to the talker’s intended meaning, they might sometimes need to adapt their expectations about quantifier use to match those of the current talker. Experiment 1 thus provides the first direct evidence that there would potentially be a benefit to adaptation to talker-specific differences in quantifier use.
This raises the question of whether listeners do, in fact, adapt to such talker-specific differences. This is the central motivation for Experiment 2. Going beyond this question and previous work, Experiment 2 also begins to investigate the nature of the changes in expectations that result from exposure to a novel talker. Specifically, we ask whether lexical adaptation can be talker-specific. The answer to this question is of theoretical relevance, as it speaks to the nature of the mechanisms underlying lexical adaptation. We briefly elaborate on this point, as it has so far received relatively little attention in the literature on lexical adaptation (but see Brennan & Clark, 1996; Pickering & Garrod, 2004).
A priori, there are several ways in which a listener can treat experience with a novel talker. A listener might treat new experience as evidence that can be used to sharpen prior expectations about quantifier use without taking into account the specific context, including the talker. Any adaptation would then be to talkers in general. At the other extreme, adaptation might be completely context-specific. If that were the case, then adaptation would be specific to a particular talker in a particular context and would not at all generalize to other talkers. A more likely possibility is that listeners strike a subtle balance between context-general and context-specific adaptation (cf. Kleinschmidt & Jaeger, 2015). Prima facie, it would seem undesirable for a language processing system to allow a small amount of recent exposure to overwrite life-long experience with language. At the same time, it is beneficial to be able to rapidly adapt to talker-specific lexical preferences, potentially increasing the efficiency of communication (for related discussion, see Brennan & Clark, 1996; McCloskey & Cohen, 1989; McRae & Hetherington, 1993; Pickering & Garrod, 2004; Seidenberg, 1994).
One way to meet both the need for adaptation and the need to maintain previously acquired knowledge is to learn and maintain talker-specific expectations, so that adaptation to a novel talker does not imply loss of previously acquired knowledge. Research in speech perception has explored and found support for this hypothesis (Goldinger, 1996; Johnson, 2006; Kraljic & Samuel, 2007; for review, see Kleinschmidt & Jaeger, 2015). More recent research has found support for this idea in other domains of language processing (e.g., prosodic processing, Kurumada, Brown, Bibyk, Pontillo, & Tanenhaus, 2014; Kurumada, Brown, & Tanenhaus, 2012; and sentence processing, Fine et al., 2013; Jaeger & Snider, 2013). For example, in episodic and exemplar-based models, linguistic experiences are assumed to be stored along with knowledge about the context in which they occurred (Goldinger, 1996; Johnson, 2006; Pierrehumbert, 2001). This is how these models capture talker-specific expectations. (Similar reasoning applies to Bayesian models of adaptation that assume generative processes over hierarchically organized indexical alignment, Kleinschmidt & Jaeger, 2015.) Similarly, memory-based models of lexical alignment (Horton & Gerrig, 2005, 2015) can in principle account for talker-specific expectations if talkers are included as contexts (Brown-Schmidt, Yoon, & Ryskin, 2015).
Changes in the use of lexical forms and structures due to exposure are often attributed to temporary changes in expectation within a spreading-activation framework. These “priming-based” accounts assume that exposure increases the activation of a particular word or structure, and perhaps of conceptually related words and structures (e.g., Arai, van Gompel, & Scheepers, 2007; Branigan, Pickering, & McLean, 2005; Chang, Dell, & Bock, 2006; Goudbeek & Krahmer, 2012; Pickering & Branigan, 1998; Reitter, Keller, & Moore, 2011; Traxler & Tooley, 2008). Although lexical priming accounts have not been applied to the issues we are exploring, the simplest version of these models would most naturally predict that changes in expectations apply across talkers and are thus talker-independent. In contrast, the models discussed above, while compatible with generalization across talkers, also predict talker-specific expectations (as we discuss later, some generalization is, in fact, expected under these alternative accounts).
In Experiment 2a and Experiment 2b, respectively, we ask listeners to either make judgments about “a talker”, which leaves open the possibility that we are referring to any talker, or about “the talker”, referring to the specific talker to whom they were exposed. Comparing the two experiments allows us to ask whether listeners adapt, at least in part, to a specific talker, rather than changing their expectations across the board to reflect how any new talker might use some and many.
Building on the basic effect observed in Experiments 2a and 2b, Experiment 3 then asks whether listeners adjust not only to changes in the frequency with which quantifiers are used by a given talker, but also to changes in how quantifiers are used to refer to specific quantities by a given talker. Both types of difference are of theoretical interest: talkers might differ in either or both of these aspects, so that the ability to adapt to such differences is potentially beneficial for listeners. Additionally, if lexical adaptation at least qualitatively follows the principles of rational inference and learning (as has been proposed for phonetic adaptation, Kleinschmidt & Jaeger, 2011, 2015, 2015b, and syntactic adaptation, Fine, Qian, Jaeger, & Jacobs, 2010; Kleinschmidt, Fine, & Jaeger, 2012), listeners are expected to be sensitive to both the prior probability of quantifiers (i.e., their frequency of use) and the likelihood of quantifiers given an intended interpretation (i.e., how quantifiers are used). Although not framed in these terms, previous work has exclusively focused on adaptation to changes in the frequency (and only for content words, e.g., Creel et al., 2008; Metzing & Brennan, 2003), leaving open whether listeners can adapt to changes in the likelihood. Experiment 3 tests whether listeners can also adapt to changes in the likelihood.
Finally, in Experiment 4 we return to the question of talker-specificity and ask whether listeners can adapt to multiple talkers simultaneously, when these talkers differ in how they use some and many. This prediction is made by episodic (Goldinger, 1996), exemplar-based (Johnson, 1997, 2006; Pierrehumbert, 2001), and certain Bayesian models (Kleinschmidt & Jaeger, 2015) of adaptation in speech perception. Talker-specific adaptation to multiple talkers has been observed in experiments on speech perception (Kraljic & Samuel, 2007) and, more recently, during syntactic processing (Kamide, 2012). To the best of our knowledge, it has not previously been tested for lexical processing. Experiment 4 exposes listeners to two talkers with different usage of some and many.
The studies presented here thus extend previous research on lexical adaptation and alignment in comprehension both methodologically, by establishing the exposure-test paradigm frequently used in research on speech perception as suitable for research on lexical adaptation, and empirically. We find that listeners can adapt to both how often and with what intended interpretation specific talkers use some and many, and that, at least in simple situations like those investigated here, listeners can adapt to talker-specific quantifier use of multiple talkers from very little input. The experiments presented here establish a novel paradigm to investigate lexical adaptation in ways parallel to research on adaptation to talker variability in speech perception. This makes our results comparable to research in these other fields. Indeed, we find several parallels between lexical adaptation and adaptation at other levels of language processing. We close by discussing avenues for future research on lexical adaptation that, we think, are facilitated by the current paradigm.
Experiment 1: Variability in quantifier interpretation
It is well-known that there are gradient context-dependent differences in the interpretation of quantifiers (e.g., Hörmann, 1983; Newstead, 1988; Pepper & Prytulak, 1974). It is less clear, however, whether talkers differ in their use of quantifiers. For example, talkers could differ in the overall frequency with which they use a certain quantifier, in their interpretation of a quantifier (i.e., when they will use it), or both. If there is such variation, different listeners, who have been exposed to different talkers, are expected to vary in their assumptions about how quantifiers are used. If, on the other hand, listeners do not vary in their assumptions, this would suggest that there is little talker variability and thus no reason to expect that listeners should adapt to talker-specific usage of quantifiers. Thus, Experiment 1 seeks to establish whether listeners have different expectations about talkers’ usage of quantifiers. As our plan going into Experiment 1 was to investigate talker-specific adaptation in quantifier use in subsequent experiments, we explored listener-specific expectations for five quantifiers: few, many, most, several, and some.
Methods
Participants
A total of 200 participants were recruited via Amazon’s crowdsourcing platform Mechanical Turk (20 per list; see below). All participants were self-reported native speakers of English. The experiment took about 10 minutes to complete. Participants were paid $1.00 ($6.00/hour).
Materials and Procedure
On each trial, participants saw a candy scene in the center of the display (example trial Figure 1(a)). The bowl always contained a mixture of green and blue candies. The total number of candies in the bowl was constant at 25 but the distribution of green and blue candies and the spatial configuration of the candies differed between scenes. At the bottom of the scene, participants saw three alternative descriptions. One of the alternatives was always “Other”. The two other alternatives were two sentences that differed only in their choice of quantifier (e.g., Some of the candies are green and Many of the candies are green). The alternatives a given participant saw remained the same throughout the experiment. For the five English quantifiers we were interested in (few, many, most, several, and some), there were ten possible pairwise combinations: (1) many and most; (2) many and several; (3) many and few; (4) many and some; (5) most and several; (6) most and few; (7) most and some; (8) several and few; (9) several and some; and (10) few and some.
Each participant saw only one of these 10 possible combinations and each combination was seen by equally many participants (20 each). Between participants and within quantifier combinations, the order of presentation of the quantifiers was balanced (e.g., 10 participants saw Some of the candies are green on the top and Many of the candies are green on the bottom and 10 other participants saw these two sentences in the opposite order).
Participants were asked to rate how likely they thought a talker would be to describe the scene using each of the alternative descriptions. They performed this task by distributing a total of 100 points across the two alternatives (the first and the second slider bars in Figure 1(a)) and “Other”, which reflected how likely they thought it was that neither alternative would be used to describe the scene (the third slider bar). Sliders adjusted automatically to guarantee that a total of 100 points were used. An example display for the two quantifiers some and many is shown in Figure 1(a).
To assess participants’ beliefs about talkers’ use of all the five quantifiers, we sampled scenes representing the entire scale – a scene could contain any number of green candies from none to 25. Over 78 test trials, participants rated each possible number of green candies 3 times. The order of the scenes was pseudo-randomized, and the mapping from alternative descriptions to slider bars was counterbalanced.
Exclusions
To ensure that participants were attending to the task, the experiment contained catch trials after about every 6 trials, totaling 13 catch trials. Catch trial occurrence was randomized so as to rule out strategic allocation of attention. On about half of the catch trials, a gray cross appeared at a random location in the scene. After the scene was removed from the screen and before the next scene was shown, participants were asked if they had seen a gray cross in the previous scene. In all experiments reported in this paper, we excluded participants who did not respond correctly on at least 75% of the catch trials. We also excluded participants who never adjusted the slider bars over the course of the experiment. We excluded five of the 200 participants, all on the basis of their catch trial performance: one participant in some vs. many, one participant in few vs. many, one participant in few vs. some, one participant in many vs. several, and one participant in several vs. some.
Results and Discussion
In Figure 2 we show participants’ marginal expectations about quantifier use for the five quantifiers. These expectations were obtained by pooling the ratings for each quantifier (e.g., ratings for some across the four pairs in which it appeared), thereby averaging across contrasts (quantifier pairs) and a total of about 80 participants per quantifier.
Analyses revealed considerable individual variation in participants’ expectations about the use of these five quantifiers. Here we focus on the assessment of individual variability in participants’ expectations about many and some, the two quantifiers that the rest of the paper will be concerned with. We chose to focus on these two quantifiers because the paradigm we introduce in Experiment 2 aims to ‘shift’ listeners’ expectations about quantifier use through exposure. We thus focused on quantifiers that, across participants, had peaks in their distributions that were clearly distinct from the edges of our scale (i.e., 1 and 25 candies). Among the three quantifiers that fulfilled this criterion (some, several, and many), we chose to focus on the two more frequent ones (some and many).
We illustrate the variability in listeners’ expectations about the use of some and many by fitting a linear mixed model (Baayen, Davidson, & Bates, 2008) using the lme4 package (Bates, Maechler, Bolker, & Walker, 2014) in R to the data of the 19 participants who rated many against some (recall that one participant was excluded because of poor catch trial performance). The distributions of ratings of some and many (cf. Figure 2) were fit separately using natural splines (Harrell, 2014) with two degrees of freedom (locations of knots were determined automatically by the rms package, Harrell, 2014). Random by-participant slopes were included for both spline parameters, along with random by-participant intercepts. The results of this procedure are shown for three representative participants in Figure 3, which illustrates the considerable variability across participants. This variability was also evident in the estimated variation in the by-participant slopes for the two parameters of the natural splines (e.g., in the case of the many distributions: σ1 = 24.4, σ2 = 23.9, compared to σresidual = 15.7). Inclusion of these random slopes was clearly justified by model comparison (χ2 = 67.8, p < .0001), indicating that there was significant variation across participants’ quantifier belief distributions.
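The following sketch illustrates, in R, the type of mixed model described above. It is not the authors’ analysis script: the data frame and its column names (rating, n_green, quantifier, participant) are hypothetical, and it uses splines::ns in place of the rms-based knot placement cited above.

library(lme4)
library(splines)
# Long-format ratings: one row per participant x scene x quantifier (hypothetical).
d_many <- subset(d, quantifier == "many")
# 2-df natural spline over the number of green candies, with by-participant
# random intercepts and random slopes for both spline parameters:
m_full <- lmer(rating ~ ns(n_green, df = 2) +
                 (1 + ns(n_green, df = 2) | participant),
               data = d_many)
# The same model without the random spline slopes, for the likelihood-ratio test:
m_reduced <- lmer(rating ~ ns(n_green, df = 2) + (1 | participant), data = d_many)
anova(m_reduced, m_full)   # tests whether the by-participant slopes are justified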
Although it is well established that context-dependent gradient expectations are ubiquitous in quantifier use, we are not aware of earlier studies that quantify between-talker differences in the usage of quantifiers. The results establish that there is variation in listeners’ expectations of talkers’ quantifier usage even when the context is held constant (see also Budescu & Wallsten, 1985, for evidence of between-participant variability in the interpretation of probability terms). This sets the stage for Experiment 2, which asks whether listeners’ expectations about how a talker uses quantifiers adapt.
Experiment 2: Adaptation of beliefs about quantifier use based on recent input
Experiment 2 investigates whether listeners can adjust their beliefs about the use of some and many based on recent input specific to the current context. We used a variation of the exposure-and-test paradigm frequently used in research on perceptual learning, including research on speech perception (e.g., Eisner & McQueen, 2006; Kraljic & Samuel, 2007; Norris, McQueen, & Cutler, 2003; van Linden & Vroomen, 2007). A post-exposure test assessed participants’ beliefs about the typical use of some and many. Before this test, participants watched videos of a talker describing various visual scenes with sentences like Some of the candies are green. This procedure is illustrated in Figure 1.1
Exposure was manipulated between participants. Half of the participants were exposed to a novel talker’s use of the word some (some-biased group). Paralleling perceptual recalibration experiments (e.g., Norris et al., 2003), this talker used the quantifier some to describe the scene that was maximally ambiguous as to whether it fell in the some or the many category. This scene (13 green candies), which we refer to as the Maximally Ambiguous Scene, was determined on the basis of the ratings from Experiment 1. Using the (fixed effect) parameter estimates from the natural spline fitting procedure described in Experiment 1, we obtained the population-level some and many curves for all values between 1 and 25. The closest integer to the intersection point of these two curves, i.e., the point that was equally likely to give rise to an expectation for some and many (13 green candies), was considered the Maximally Ambiguous Scene. The other half of the participants were exposed to the same novel talker describing the Maximally Ambiguous Scene with the quantifier many (many-biased group). This manipulation, with minor modifications, was employed in all experiments reported below.
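As a rough illustration, the Maximally Ambiguous Scene could be located from the Experiment 1 fits along the following lines; some_fit and many_fit are hypothetical names for the mixed models fit to the some and many ratings, and re.form = NA restricts predictions to the fixed (population-level) effects.

scenes <- data.frame(n_green = 1:25)
pred_some <- predict(some_fit, newdata = scenes, re.form = NA)
pred_many <- predict(many_fit, newdata = scenes, re.form = NA)
# The integer scene closest to where the two population-level curves cross:
mas <- scenes$n_green[which.min(abs(pred_some - pred_many))]
mas   # 13 green candies in the present data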
If passive exposure to a specific talker’s use of many or some is sufficient for listeners to adapt their expectations about the use of many and some, adaptation should be reflected in shifted belief distributions in the post-test compared to the pre-test. The direction of this shift should depend on the exposure condition. We elaborate on this prediction after introducing the paradigm in more detail below.
As outlined in the introduction, Experiment 2 further aims to assess exactly what expectations are affected by exposure to a novel talker’s use of some and many. Specifically, we ask whether exposure to a novel talker leads listeners to develop talker-specific expectations, rather than just changes in expectations that could apply across any type of talker. Episodic (Goldinger, 1996), exemplar-based (Johnson, 1997; Pierrehumbert, 2001), and certain Bayesian models of adaptation (Kleinschmidt & Jaeger, 2015) predict talker-specific expectations. These models were originally developed to account for adaptation in speech perception, but their logic straightforwardly extends to lexical processing. Indeed, although not necessarily framed as such, memory-based alignment accounts of lexical processing (e.g., Horton & Gerrig, 2005) are essentially exactly such an extension.
To begin to answer whether lexical adaptation to quantifiers can be talker-specific, we conducted two versions of Experiment 2. In Experiment 2a, we asked participants “How likely do you think it is that a speaker will describe this scene with each of these alternatives?”. Using the indefinite “a speaker” leaves ambiguous whether we are referring to a generic talker or to the specific talker they were exposed to. In Experiment 2b, we changed the wording to “How likely do you think it is that the speaker will describe this scene with each of these alternatives?”. Using the definite “the speaker” makes it clear that we are referring to the specific talker they were exposed to. If exposure leads to adaptation globally, i.e., to talkers in general, we expect the adaptation effect to be similar across Experiments 2a and 2b. If instead adaptation is local and specific to the exposure talker (or a mixture of local and global adaptation), there should be a difference in the size of the adaptation effect such that a larger effect should be observed when participants are asked about “the speaker” compared to “a speaker”.
Experiment 2a
Methods
Participants
A total of 79 participants were recruited for Experiment 2a via Amazon’s crowdsourcing platform Mechanical Turk. All participants were self-reported native speakers of English. The experiment took about 15 minutes to complete. Participants were paid $1.50 ($6.00/hour).
Materials and Procedure
The experiment proceeded in two phases, illustrated in Figure 1: the exposure phase (Panel (a)), and the post-exposure test (Panel (b)). The post-exposure test assessed participants’ expectations—the quantifier belief distributions—about talkers’ use of some and many. Participants saw a bowl of blue and green candies in the center of the scene, and their task was to distribute a sum of 100 points across the three alternative descriptions in response to the question “How likely do you think it is that a speaker will describe this scene with each of these alternatives?”. All participants saw the same set of three alternative descriptions: Some of the candies are green, Many of the candies are green, and “Other”. Assignment of sliders to alternative descriptions was counterbalanced, except that the “Other” alternative was always paired with the right-most slider.
To assess participants’ beliefs about talkers’ use of some and many, we sub-sampled scenes representing the entire scale. Specifically, a scene could contain one of the following numbers of green candies out of 25: {1, 3, 6, 9, 11, 12, 13, 14, 15, 17, 20, 23}. Over 39 test trials, participants rated each possible number of green candies 3 times. Different instances of the scenes with the same number of green candies differed in the spatial configuration of the blue and green candies. The order of the scenes was pseudo-randomized.
On an exposure phase trial, participants saw a video (Figure 1(a) illustrates a snapshot of one such video). We recorded utterances from two talkers (one male and one female) and randomly assigned participants in each of the two exposure groups to one of the two talkers (half to each talker). The video showed a bowl of 25 candies embedded in the bottom right corner of the video frame. As on post-exposure trials, the bowl always contained a mixture of green and blue candies, but the number and spatial configuration of the candies differed between trials. The video showed a talker describing that scene in a single sentence. The videos played automatically at the start of the trial, and the scene remained visible even when the video finished playing. Participants clicked the “Next” button to proceed. The “Next” button was invisible until the video finished playing to ensure that participants could not skip a video.
Exposure consisted of 10 critical and 10 filler trials. On critical trials, participants saw the Maximally Ambiguous Scene being described by the talker as Some of the candies are green (some-biased group) or Many of the candies are green (many-biased group). On filler trials, participants observed the talker correctly describing a scene with no green candies as None of the candies are green (5 trials) and a scene with no blue candies as All of the candies are green (5 trials). Filler trials were included to: (a) make the manipulation less obvious; and (b) encourage participants to believe that the talker was indeed intending to accurately describe the scene. The order of critical and filler trials was pseudo-randomized. Following exposure, participants entered the post-exposure test.
Catch trials
In Experiment 2a and in the following experiments, both phases of the experiment contained catch trials after every 2 to 7 (mean = 5) trials, totaling 15 catch trials (6 during exposure and 9 during post-exposure). Catch trial occurrence was randomized so as to rule out strategic allocation of attention. In this experiment, we excluded one participant whose accuracy was below 75% on the catch trials.
Predictions
Exposure to a many-biased talker should lead participants to change their beliefs about how this talker uses many. These changes could include (i) shifting the many category mean towards the center of the scale (i.e., towards the Maximally Ambiguous Scene), (ii) broadening the many category to include more scenes (towards the Maximally Ambiguous Scene), (iii) increasing the overall probability attributed to that category, or (iv) all of (i)-(iii). Mutatis mutandis, the same predictions hold for the some-biased condition.
Changed beliefs about the biased category (e.g., many in the many-biased condition) should also affect participants’ beliefs about how talkers use the alternative lexical categories (e.g., some and “Other” in the many-biased condition). Specifically, since participants distributed a fixed number of points across the three alternatives, increased ratings for, e.g., many will necessarily affect the other two alternatives (i.e., some and “Other”). Given the nature of the exposure phase, which focuses on many and some, we predict that the trade-off between lexical categories will mostly involve the two quantifiers, rather than the “Other” response.
Results
Figure 4 (a) illustrates participants’ mean ratings (with 95% confidence intervals) for many and some in the post-exposure phase. The solid lines show the many-biased group results, and the dotted lines show the some-biased group results. Some-biased participants became more likely to expect some to refer to the scenes at the expense of many; whereas the many-biased participants became more likely to expect many to refer to scenes at the expense of some (see Figure 4 (a)).
To test whether there indeed was significant adaptation, we quantified the differences between the some-biased and the many-biased groups as the difference in the area under the curve (AUC) in the post-exposure test. We explain this technique in the following.
Area under the curve
We fit the category distributions separately for each participant. Based on the mean ratings of a participant, we fit linear models with natural splines with 2 degrees of freedom (Harrell, 2014) independently to the post-exposure many ratings and post-exposure some ratings. All analyses were conducted using the R statistics software package (R Core Team, 2014). This yielded separate many and some category distributions for each participant. Compared to the approach taken in Experiment 1, fitting splines separately to each participant does not assume that differences between participants are normally distributed (though that approach yields the same results).
We then calculated the AUC for the many curve and the AUC for the some curve. Next, we determined each participant’s AUC difference by subtracting the AUC for the some category from the AUC for the many category (we could have conducted the reverse subtraction, which would only change the direction of effects). The resulting difference score is predicted to be higher for the some-biased group than for the many-biased group, if there is adaptation. As shown in Figure 4(b), this prediction was borne out (t(61.6) = 3.0, p < .01, two-tailed t-test allowing for unequal variances).
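To make the procedure concrete, a minimal R sketch of the per-participant AUC computation is given below; the helper and column names (auc_for, n_green, rating, group, auc_diff) are illustrative rather than the authors’ code, and the fitted spline curve is integrated with a simple trapezoidal rule.

library(splines)
# Fit a 2-df natural spline to one participant's mean ratings for one quantifier
# and return the area under the fitted curve (trapezoidal approximation):
auc_for <- function(n_green, rating) {
  fit <- lm(rating ~ ns(n_green, df = 2))
  x <- seq(min(n_green), max(n_green), length.out = 200)
  y <- predict(fit, newdata = data.frame(n_green = x))
  sum(diff(x) * (head(y, -1) + tail(y, -1)) / 2)
}
# Per participant: auc_diff <- auc_for(n_green, many_ratings) - auc_for(n_green, some_ratings)
# Group comparison (Welch's t-test; unequal variances is the default in R):
# t.test(auc_diff ~ group, data = auc_by_participant)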
Discussion
The results of Experiment 2a demonstrate that just 10 informative critical exposure trials (out of 20 exposure trials) are sufficient to induce lexical adaptation: participants adjusted their beliefs about the use of many and some based on recent input. Experiment 2a used a paradigm closely modeled on perceptual learning studies.2
Recall that in the post-exposure assessment of their beliefs about quantifier use, participants were asked “How likely do you think it is that a speaker will describe this scene with each of these alternatives?” In this question, the referent of “a speaker” is ambiguous between a generic talker and the specific exposure talker. This raises the question of whether the observed adaptation was a result of participants updating their expectations about the exposure talker’s use of some and many, or whether expectations changed about talkers in general. In Experiment 2b, we used the definite noun phrase “the speaker” to emphasize to participants that we were interested in their beliefs about how the talker they were exposed to would use many and some.
We present the results of Experiment 2b before further examining the nature of the observed adaptation effect using variants on this basic paradigm in the remainder of the paper.
Experiment 2b: Talker- vs. experiment-specific adaptation
Experiment 2b assessed whether adaptation occurred to the specific exposure talker, or whether more general expectations about quantifier use changed.
Methods
Participants
We recruited 64 participants via Mechanical Turk. The duration and payment were identical to that of Experiment 2a. Three participants were excluded based on their catch trial performance.
Materials and Procedure
Materials and the procedure for the exposure phase were identical to that of Experiment 2a but the post-exposure test differed in the following two ways. First, each of the alternative descriptions (Many of the candies are green and Some of the candies are green) was paired with an identical picture of the talker, as shown in Figure 5. Second, the instructions given prior to the post-exposure test were re-worded so that the indefinite “a speaker” was replaced by the definite “the speaker” in order to emphasize that it is expectations about the specific exposure talker’s likely utterances that are of interest. That is, instead of being asked “How likely do you think it is that a speaker will describe this scene with each of these sentences?” participants were now asked “How likely do you think it is that the speaker will describe this scene with each of these sentences?” (see Figure 5).
Results
Figure 6(a) illustrates participants’ mean ratings (with 95% confidence intervals) for many and some. The solid lines show the many-biased group results, and the dotted lines show the some-biased group results. We qualitatively replicate the adaptation effect from Experiment 2a: some-biased participants became more likely to expect some to refer to the scenes at the expense of many; whereas the many-biased participants became more likely to expect many to refer to scenes at the expense of some. This difference was significant (t(43.8) = 5.4, p <.0001, two-tailed t-test allowing for unequal variances). Results are shown in Figure 6(b).
Comparison of Experiments 2a and 2b: Talker-specific adaptation
The main question of interest in this experiment was whether the observed adaptation effect is specific to the exposure speaker or generalizes to generic speakers. If some of the participants in Experiment 2a interpreted the instructions to be about a generic talker, we would expect less transfer from exposure in the post-test compared to Experiment 2b, where instructions emphasized that the post-test judgments are about the exposure talker. We can directly test this prediction by comparing the results of Experiment 2a to the results of Experiment 2b.
We compared the effect sizes in Experiments 2a and 2b by calculating Cohen’s D for both experiments. Cohen’s D increased from Experiment 2a (0.7) to Experiment 2b (1.4). We also regressed participants’ AUC difference scores against a full factorial of bias (some- vs. many-bias) and talker-specificity (Experiment 2b: same talker in exposure and test vs. Experiment 2a: generic talker during test). Replicating the t-tests reported for Experiments 2a and 2b, there was a main effect of bias (β = 324.9, t = 6.2, p < .0001). There was no main effect of talker-specificity (ps > 0.5). Crucially, the interaction between bias and talker-specificity was significant, in that the bias effect was larger when the test speaker was explicitly the same as the exposure speaker (Experiment 2b, compared to Experiment 2a; β = 121.4, t = 2.3, p < 0.05).
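A sketch of this type of comparison in R is given below; the combined data frame d_2a_2b and its columns (auc_diff, bias, talker_specificity) are hypothetical, and cohens_d is a generic pooled-standard-deviation implementation rather than the authors’ code.

# Pooled-SD Cohen's D for two independent groups:
cohens_d <- function(x, y) {
  sp <- sqrt(((length(x) - 1) * var(x) + (length(y) - 1) * var(y)) /
               (length(x) + length(y) - 2))
  (mean(x) - mean(y)) / sp
}
# Full factorial regression over the combined Experiment 2a/2b data:
m <- lm(auc_diff ~ bias * talker_specificity, data = d_2a_2b)
summary(m)   # main effect of bias and the bias x talker-specificity interaction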
This suggests that at least some participants either took Experiment 2a to be about generalization to a generic talker or maintained uncertainty about whether they were asked to generalize to the specific exposure talker or to a generic talker. The comparison of Experiments 2a and 2b further suggests that exposure to the quantifier use of a specific talker primarily leads to adapted expectations about that talker: when we removed uncertainty about whether participants were asked about the specific exposure talker (Experiment 2b), the adaptation effect (measured here as Cohen’s D) doubled. Taken together, Experiments 2a and 2b thus provide evidence that listeners store lexical experiences along with information about the context in which they occur and that this context includes the talker (Horton & Gerrig, 2005; Brown-Schmidt et al., 2015). This result extends previous evidence for talker-specific lexical expectations from open class words (e.g., Creel et al., 2008; Metzing & Brennan, 2003; also Walker & Hay, 2011). These results are expected under episodic, exemplar-based, and other memory-based alignment accounts (Brown-Schmidt et al., 2015; Horton & Gerrig, 2005; Johnson, 1997; Pierrehumbert, 2011) as well as the rational adaptor account (Kleinschmidt & Jaeger, 2015), but not lexical priming accounts.
Experiment 3: Adapting to the frequency vs. use of quantifiers
Experiment 3 employed the identical procedure as Experiment 2b, with only one change: participants were exposed to an equal number of many and some trials. Specifically, the exposure talker produced one of the quantifiers in its prototypical usage (based on Experiment 1 results and confirmed below). For the other quantifier, the exposure talker had the same ‘biased’ usage employed in Experiment 2. We describe this manipulation in more detail below.
By equating the frequency of many and some during exposure, Experiment 3 allows us to address whether the adaptation effects observed thus far reflect adaptation of beliefs about the frequency of a particular quantifier (the prior of a quantifier expression), or adaptation of beliefs about the way a given quantifier is used (the likelihood of a quantifier conditional on a set size), or a combination of the two.
There is evidence that listeners can adapt their prior expectations about the frequency with which a given talker uses a word (Creel et al., 2008). A priori, we would expect listeners to be able to adapt their beliefs about both the prior and the likelihood, as both of them are critical in making robust inferences about the intended meaning. If the source of adaptation is only updated prior expectations about many and some, then Experiment 3, in which the two quantifiers are equally frequent during exposure, should yield little or no adaptation effect, compared to Experiments 2. If, on the other hand, the sole source of adaptation in Experiments 2 was changes in the way many and some are used, then Experiment 3 should continue to yield the same magnitude of adaptation effects observed in Experiments 2. Finally, if the source of adaptation consists of both updated prior expectations and adaptation to the usage, then Experiment 3 should still yield adaptation, but to a lesser extent than Experiments 2.
Methods
Participants
We recruited 71 participants via Mechanical Turk. The experiment took about 20 minutes to complete. Participants were paid $2.00 ($6.00/hour). One participant was excluded due to catch trial performance.
Materials and Procedures
The procedure was identical to that of Experiment 2b with the exception of additional exposure phase trials. Participants saw a total of 30 exposure trials. Twenty of these trials were identical to those in Experiment 2a. The additional 10 trials exposed participants to a highly typical usage of the other quantifier (typical uses of many for the some-biased group and typical uses of some for the many-biased group).
The typical trials were generated in the following way. Based on Experiment 1, we selected the scenes with the highest ratings (i.e., the mode and its neighbors) for many (the scenes with 20 and 23 green candies) and for some (the scenes with 3, 6, and 9 green candies) on the basis of the some vs. many list results. We did not include the scenes with 25 (i.e., only) green candies for many (the other neighbor of the mode for many), because the ratings for many dropped sharply for that scene. The 10 typical many trials were then obtained by embedding scenes with 23 green candies (in 5 of the trials) or with 20 green candies (in the remaining 5 trials) in a video of the talker saying Many of the candies are green. Likewise, the 10 typical some trials were obtained by embedding scenes with 3 green candies (in 3 of the trials), 6 green candies (in 5 of the trials), or 9 green candies (in the remaining 2 trials) in a video of the talker saying Some of the candies are green.
Results and Discussion
Figure 7(a) shows the mean post-exposure ratings (error bars indicating the 95% confidence intervals), suggesting an effect in the predicted direction: some-biased participants were more likely to expect some to refer to the scenes at the expense of many; whereas the many-biased participants were more likely to expect many to refer to scenes at the expense of some. The difference was significant (t(66.8) = 3.1, p < 0.01, two-tailed t-test allowing for unequal variances). Quantified AUC results are visualized in Figure 7(b).
These results, namely that listeners show the same type of adaptation effect observed in Experiments 2, demonstrate that listeners are adapting, at least in part, to changes in the likelihood of quantifier use conditional on set size, rather than simply to the frequency with which a speaker uses a quantifier.
This leaves open whether the adaptation is only of the likelihood, or a mixture of prior and likelihood adaptation. We can begin to address this question by comparing the magnitude of adaptation in Experiments 2b and 3. If the adaptation effect size is not distinguishable across the two experiments, this suggests that the adaptation occurs only in the likelihood. In contrast, a smaller adaptation effect in Experiment 3 than in Experiment 2b would provide evidence for a mixture of prior and likelihood adaptation.
To assess the change in effect size, we computed Cohen’s D for both experiments. Cohen’s D decreased from Experiment 2b (1.4) to Experiment 3 (0.7). To test whether this change was significant, we regressed AUC results against the full factorial design of bias (some- vs. many-bias) and experiment (Experiment 2b: repeated exposure to only shifted quantifier use vs. Experiment 3: equi-frequent exposure to both quantifiers). Replicating the t-tests reported for Experiments 2b and 3, there was a main effect of bias (β = 293.6, t = 6.4, p < .0001). This effect interacted significantly with experiment, in that it was smaller when participants saw both quantifiers equally often (Experiment 3; β = 152.7, t = 3.3, p < 0.01). This suggests that changes in participants’ beliefs about the prior frequency of many and some contribute to the adaptation effects observed in Experiments 2. At the same time, Experiment 3 extends previous research on lexical adaptation (Creel et al., 2008; Metzing & Brennan, 2003): To the best of our knowledge, Experiment 3 is the first to suggest that listeners can adapt their beliefs about how (i.e., with what intended interpretation) specific talkers use a given word, in our case many and some. We believe that research addressing this question will be particularly important for further quantitative investigations and modeling.3
There was also a main effect of experiment (Experiment 3; β = 122.1, t = 2.7, p < 0.01). We had no specific expectations about this main effect, but it could point to asymmetries in the strengths of prior beliefs about the typical distribution of many and some (specifically, asymmetries in the beliefs about how the use of the two quantifiers differ across talkers). The main effect could also point to prior beliefs about how the type of exposure talker we used in our experiments differs from generic talkers (e.g., based on how they were dressed in the exposure video, their speech style or dialectal background). We leave these questions to future research.
Experiment 4: Adapting to Multiple Talkers
Taken together, Experiments 2 and 3 suggest that exposure to relatively few trials is sufficient for listeners to adapt their expectations about how a given talker uses many and some, at least when, as in the current experiments, the talker is observed producing highly informative descriptions across multiple critical trials within the same domain.
As we noted earlier, the need for adaptation and the need to maintain previously acquired knowledge can be balanced by learning and maintaining talker-specific expectations. Experiment 4 investigates whether listeners can adapt to the lexical preferences of multiple talkers simultaneously. Building on the paradigm used in Experiment 2b, participants observed the lexical preferences of two different talkers in a blocked exposure phase (e.g., exposure to a some-biased talker, followed by exposure to a many-biased talker). Participants then rated descriptions by each talker in a blocked post-exposure test. If participants adapt their expectations of quantifier use in a talker-specific manner, we should observe adaptation effects in opposite directions for the two speakers.
Methods
Participants
We recruited 54 participants via Mechanical Turk. The experiment took about 25 minutes to complete. Participants were paid $2.50 ($6.00/hour). Two participants were excluded because of their catch trial performance.
Materials and Procedure
Unlike in Experiments 2 and 3, there were two exposure blocks and two post-exposure test blocks. Each exposure block featured a different talker. Each post-exposure test block tested one of the exposure talkers. The two talkers, now used as a within-participant manipulation, were the same male and female talkers used between participants in Experiments 2 and 3.
Materials and the procedure for each pair of exposure and post-exposure test blocks (i.e., blocks playing and testing the same talker) were identical to that of Experiment 2b (see Figure 5 above). One of the exposure talkers was many-biased. The other one was some-biased. Both exposure blocks preceded both post-exposure test blocks. For example, a participant might see an exposure block with the many-biased male talker, followed by an exposure block with the some-biased female talker, followed by a post-exposure test block for the male talker, and finally a post-exposure test block for the female talker.
Across participants, we counter-balanced a) the order of talker-gender in the exposure blocks (male talker first vs. female talker first), b) the order of talker-bias in the exposure blocks (many-biased first vs. some-biased first), which also balanced the talker-bias to talker-gender assignment (whether the male or the female talker was many-biased and, hence, whether the male or female talker was some-biased), and c) whether the order of post-exposure test blocks was the same as or the inverse of the order of exposure blocks. All eight factorial combinations of these 2 × 2 × 2 nuisance variables occurred equally often across participants.
Results and Discussion
We first analyze the overall adaptation effect across all orders of talker-gender and talker-bias using the same analysis as in Experiments 2 and 3. After establishing that the basic adaptation effect observed in Experiments 2 and 3 is also observed when two exposure speakers are used, we assess whether these talker-specific adaptation effects were affected by the order of presentation (e.g., due to recency or interference effects). Such order effects would begin to point to some of the limits of talker-specific lexical adaptation.
Overall adaptation effect
Figure 8(a) illustrates mean post-test ratings for many and some, collapsing over all orders of talker-bias and talker-gender in exposure and post-test. Participants’ expectations about quantifier use adapted in response to exposure in the predicted direction: when tested on the some-biased talker, listeners were more likely to expect some to refer to the scenes at the expense of many, whereas when tested on the many-biased talker, they were more likely to expect many to refer to scenes at the expense of some. Importantly, they did so even though there were two different talkers with different preferences in the way they used quantifiers, indicating that participants adapted separately to each talker’s preferences. We quantified the overall adaptation by again collapsing data from both post-exposure test blocks across all participants. The results shown in Figure 8(b) indicate that listeners tracked each talker’s lexical preferences and adapted their interpretations accordingly (t(98.3) = 5.9, p < .0001, two-tailed t-test allowing for unequal variances).
Thus, participants showed talker-specific adaptation to two talkers. This leaves open whether adaptation to multiple talkers in any way reduces the individual adaptation effects. For example, it is possible that the experiences with the two talkers interfere with one another in memory or that more recent exposure overrides less recent experience. To assess these questions, we conducted a regression analysis.
Analysis of order effects
Using linear regression, we regressed AUC values against the factorial design of talker-bias (some- vs. many-bias), talker-exposure order (1st vs. 2nd, i.e., whether the current post-test talker was seen during the 1st or the 2nd exposure block), and talker-test order (1st vs. 2nd, i.e., whether the current post-exposure test block was the 1st or the 2nd). If exposure to the two different talkers leads to interference between the learned preferences, the effect of talker-bias should interact with talker-exposure order, talker-test order, or their interaction.
There was a main effect of talker-bias (β = 467.8, t = 5.8, p < .0001), paralleling our t-test above. There was no main effect of any of the other variables or their interactions (ps > 0.25).
These results suggest that listeners can develop talker-specific expectations about quantifier use for at least two talkers. The regression analysis did not reveal evidence of recency or interference effects, consistent with similar findings in phonetic adaptation (e.g., Kraljic & Samuel, 2007). This suggests that, at least when provided with highly informative signals about talker-specific preferences in quantifier use, listeners can readily adapt to two talkers. We note, however, that Experiment 4 might not have sufficient power to detect relatively subtle interference effects: while talker-bias was manipulated within participants, talker-exposure and talker-test order were manipulated between participants, reducing the power to detect effects of these factors or their interaction with talker-bias. We thus consider it an open question for future work whether, or to what extent, adaptation to talker-specific quantifier use decays over time or interferes with adaptation to other talkers.
General Discussion
The studies reported in this paper used a web-based paradigm to explore a specific type of context-dependence that has received comparatively little attention in the literature: talker-specific differences in how quantifiers are used. In particular, we focused on adaptation to talker-specific use of some and many, drawing parallels to recent work on adaptation in other domains of language, with a special focus on phonetic adaptation, the domain that has been most widely investigated to date. Experiment 1 demonstrated that listeners vary in their expectations for how a given talker will use quantifiers. This establishes that adaptation would be useful for efficient communication. Experiment 2a used an exposure and post-exposure test design, modeled on work in perceptual learning for phonetic categories, finding that listeners who were exposed to a speaker who used some to describe the most ambiguous scene (13 of 25 candies) exhibited a different quantifier belief distribution than listeners exposed to a speaker who used many to describe the most ambiguous scene. Experiment 2b used a variation on the paradigm used in Experiment 2a to establish that listeners were primarily adapting to the specific talker they were exposed to, rather than a generic talker. Experiment 3 demonstrated that adaptation occurred even when the frequency of quantifier use in the exposure phase was equated. Comparisons of the effect sizes in Experiments 2b and 3 demonstrated that listeners were adapting both to the frequency of quantifier use by a talker and to the likelihood of quantifier use for a particular scene. Finally, Experiment 4 demonstrated that listeners learned and maintained expectations about quantifier use for two different talkers.
In the remainder of this section, we discuss the implications that this work has for the role of adaptation in language use and the modeling frameworks that are likely to have the capability of capturing the reported data.
Taken together, the results of Experiments 2 through 4 demonstrate that, based on brief exposure, listeners update their expectations about how a talker will use quantifiers to refer to entities in simple displays. These findings contribute to a growing body of work suggesting that listeners rapidly adapt to talker-specific information at multiple linguistic levels, including phonetic categorization, use of prosody, lexical choice, and use of syntactic structures (e.g., Creel & Bregman, 2011; Fine & Jaeger, 2013; Kamide, 2012; Kraljic & Samuel, 2007; Kurumada et al., 2014; Norris et al., 2003). This work implicates adaptation as a fundamental process by which listeners cope with the well-documented variability in language use both between and within talkers. For example, talker-specific information affects spoken word recognition (Creel & Bregman, 2011; Creel & Tumlin, 2009; Goldinger, 1998) as well as listeners’ expectations about a specific talker’s use of concrete nouns to refer to entities in a scene (Creel et al., 2008). More recent studies also suggest that talker-specific expectations are even observed during syntactic processing (Kamide, 2012). The current studies build upon this work by extending it to quantifier use and interpretation.
One consequence of these studies, including the current experiments, is that standard, non-strategic, priming accounts of alignment, be it in speech perception, lexical, or syntactic processing (e.g., Goudbeek & Krahmer, 2012; Pickering & Garrod, 2004; Reitter et al., 2011; Traxler & Tooley, 2008), are insufficient to account for existing data. This includes even some implicit learning accounts of priming (Chang et al., 2006). None of these accounts predicts talker-specific expectations (for related discussion of lexical priming accounts, see also Heller & Chambers, 2014). Instead, the current results, in particular Experiments 2 and 4, provide further evidence that listeners can learn and store (at least for the period of an experiment) linguistic experiences along with rich knowledge about the context these experiences occurred in. This result is in line with memory-based accounts (Goldinger, 1996; Johnson, 1997; Pierrehumbert, 2001) that consider talkers to be part of the contextual information that linguistic experiences are stored with (for further discussion, see Brown-Schmidt et al., 2015; Kleinschmidt & Jaeger, 2015). The memory processes evoked by these accounts are taken to be typically automatic and implicit (see also Horton & Gerrig, 2005). An interesting question for future research is whether the same type of memory-based explanations can also explain those priming effects that are often taken to require separate explanations in terms of “non-strategic” processes (Pickering & Ferreira, 2008; Traxler & Tooley, 2008).
Second, the current work establishes a foundation for future empirical and computational investigations of how listeners interpret quantifiers and other linguistic expressions with abstract meanings. It will be important to establish the conditions under which listeners adapt. For example, one important set of open questions concerns generalization. Quantifier use varies with context: compare an utterance such as Bill has many cars to Bill has many antiques. About three or more cars could plausibly qualify as many cars, whereas a considerably higher number of antiques would likely be necessary to qualify as many antiques (see Hörmann, 1983, for many other examples). This raises the question of how adaptation in one domain (e.g., cars) generalizes to other domains (e.g., antiques). Two important questions will be the degree to which results obtained with one type of quantity generalize to another, and the degree to which listeners assume that a talker who, for instance, uses some to refer to greater quantities of candies than a typical speaker is also likely to use some to refer to larger quantities in general, or only across similar types of domains.
Similar questions arise about talker-specificity and generalization across groups of talkers. Experiments 2b to 4 suggest that lexical adaptation can be talker-specific. However, the comparison between Experiments 2a and 2b leaves open the question of whether adaptation also generalizes beyond the specific talker. Recall that Experiment 2a left it to participants whether they took the post-exposure test to be about the specific exposure talker or about talkers in general, whereas Experiment 2b unambiguously asked about the specific exposure talker. We observed significantly stronger adaptation effects in Experiment 2b (i.e., when listeners were asked about the specific talker they were exposed to), but we also observed adaptation when participants were asked about talkers in general (Experiment 2a). On the one hand, it is possible that exposure to a novel talker only affects expectations about that talker; on the other, it is possible that such exposure also generalizes to expectations about other talkers. We briefly elaborate on these two possibilities, as we take them to be an interesting avenue for future research (for related discussion, see also Gorman, Gegg-Harrison, Marsh, & Tanenhaus, 2013).
Generalization is, in fact, explicitly predicted under the accounts cited above (Horton & Gerrig, 2005; Johnson, 1997; Pierrehumbert, 2001; Kleinschmidt & Jaeger, 2015), though for slightly different reasons. In the model proposed by Horton and Gerrig (2005), generalization to other talkers follows because memory is faulty. In episodic and exemplar-based accounts, generalization takes place because any experience is assumed to become part of the cloud of stored knowledge that future processing draws on (see also Kleinschmidt & Jaeger, 2015; Weatherholtz & Jaeger, 2015). These accounts also explicitly predict that generalization from an exposure talker to other talkers should be strongest when listeners have little other previous experience with the broader context in which the exposure talker was encountered (as is arguably the case in Experiment 2a).
In pursuing these and related questions, we believe it will be necessary to take a two-pronged approach, combining behavioral paradigms like the one introduced here with computational models that provide clear quantitative predictions about how listeners integrate prior linguistic experience with recent experience in a specific linguistic environment. Although considerable progress has been made both in the development of computational frameworks (for recent overviews, see e.g., Clark, 2013; Friston, 2005) and in the development of paradigms suitable for the study of incremental adaptation (e.g., Fine & Jaeger, 2013; Vroomen et al., 2007), it is only recently that these two approaches have begun to be integrated. To the best of our knowledge, computational modeling of adaptation behavior has so far mostly been limited to adaptation in speech perception (though see Chang et al., 2006; Fine et al., 2010; Kleinschmidt et al., 2012; Reitter et al., 2011 for models of syntactic adaptation), including Bayesian models (Kleinschmidt & Jaeger, 2015), connectionist models (Lancia & Winter, 2013; Mirman, McClelland, & Holt, 2006), and exemplar-based approaches (e.g., Johnson, 1997; Pierrehumbert, 2001).4 Developing and applying these and related models to the domain of quantifier use will allow for formal tests of hypotheses about the principles that listeners use to generalize word meanings across speakers.
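As a concrete illustration of the kind of model we have in mind, the sketch below implements a minimal Bayesian belief-updating learner for talker-specific quantifier use. It is our own simplified illustration, not a model from this paper or from the work cited above: the logistic mapping from quantity to quantifier choice, the Gaussian prior over the talker's threshold, and the grid-approximation inference are all assumptions chosen for brevity.

# Minimal sketch (illustrative only) of a Bayesian belief-updating learner for
# a talker's "some"/"many" threshold over displays of up to 25 candies.
import numpy as np

SLOPE = 1.0                          # assumed steepness of the quantity -> "many" mapping
THETAS = np.linspace(1, 25, 200)     # candidate thresholds (candies out of 25)

def p_many(n, theta, slope=SLOPE):
    """Probability that the talker says 'many' (rather than 'some') for n candies."""
    return 1.0 / (1.0 + np.exp(-slope * (n - theta)))

def update(prior, observations):
    """Posterior over thresholds after observing (quantity, quantifier) pairs."""
    posterior = prior.copy()
    for n, word in observations:
        likelihood = p_many(n, THETAS) if word == "many" else 1.0 - p_many(n, THETAS)
        posterior = posterior * likelihood
        posterior = posterior / posterior.sum()   # renormalize over the threshold grid
    return posterior

# Prior belief: threshold centered on the most ambiguous display (13 of 25 candies).
prior = np.exp(-0.5 * ((THETAS - 13.0) / 4.0) ** 2)
prior = prior / prior.sum()

# Exposure to a "some-biased" talker who repeatedly calls the 13-candy display 'some'.
posterior = update(prior, [(13, "some")] * 10)
print("prior mean threshold:    ", round(float((THETAS * prior).sum()), 2))
print("posterior mean threshold:", round(float((THETAS * posterior).sum()), 2))

Under these assumptions, a handful of trials in which the talker calls the ambiguous 13-of-25 display some shifts the inferred threshold upward, qualitatively mirroring the expansion of the some category observed after some-biased exposure.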
Given the strength of the signal that participants were exposed to, it will be important for future work to explore the limits on adaptation in more naturalistic settings. A further interesting open question is whether the adaptation effects observed here are reflected in online language understanding. Recent research on adaptation during speech perception (see, e.g., Trude & Brown-Schmidt, 2012; Creel et al., 2008), syntactic processing (Fine & Jaeger, 2013; Kamide, 2012), and prosodic processing (Kurumada et al., 2014) provides examples of how these questions can be addressed.
Conclusion
The experiments reported in this paper suggest that even minimal exposure to a speaker whose use of quantifiers differs from a listener's expectations can result in a talker-specific shift in that listener's beliefs about future quantifier use. Our results further suggest that listeners adapt to both the frequency with which a talker uses certain words and the specific interpretation intended by the talker. This complements work on adaptation in other domains, for example, adaptation in response to phonetically or syntactically deviant input and talker-specificity in linguistic processing. The work reported here provides further evidence that listeners can adapt to individual speakers' language use, remember these talker-specific preferences, and use this knowledge to guide utterance interpretation.
Highlights.
We study how listeners adjust to talker-specific biases in the use of words.
We focus on the interpretation of the quantifiers some and many.
We develop a crowdsourcing paradigm to study this and other types of lexical adaptation.
We find that listeners’ expectations reflect recent exposure to talkers' quantifier use.
Acknowledgments
We thank Andrew Watts for technical support. We thank the editor and the four anonymous reviewers for their helpful comments on earlier versions of this manuscript. Parts of this study were presented at the 35th Annual Conference of the Cognitive Science Society, the 20th Architectures and Mechanisms for Language Processing conference, and at Discourse Expectations: Theoretical, Experimental, and Computational Perspectives 2015. This work was partially supported by a post-doctoral fellowship to IY (through the Center for Brains, Minds, and Machines, funded by NSF STC award CCF-1231216), by an SNSF Early Postdoc.Mobility fellowship to JD, by NSF CAREER award IIS-1150028 as well as an Alfred P. Sloan Research Fellowship to TFJ, and by NIH grant HD 27206 to MKT. The views expressed here are those of the authors and do not necessarily reflect those of any of these funding agencies.
Footnotes
We replicated this effect in an experiment that was identical to this one, but included (as an additional baseline) a pre-test prior to exposure that was identical to the post-test. This experiment exactly replicates the findings reported here. A write-up of this replication is available upon request.
We replicated this result in a separate experiment not reported here. This experiment was identical to Experiment 2a, but added a pre-exposure test that was identical to the post-exposure test of Experiment 2a. This also allowed us to directly compare changes in the some and many ratings from the pre- to post-exposure test between the two bias conditions. We found that some-biased exposure resulted in an expansion of the some category (and vice versa for the many-biased group). These changes were highly significant, providing a conceptual replication of Experiment 2a.
For example, we note that even under very general assumptions about adaptation, exposure to equally many typical uses of many and some is not sufficient to completely rule out adaptation of beliefs about the prior frequencies of the two quantifiers. One reason for this is that exposure to equally many trials is expected to affect an a priori less frequent quantifier more strongly (cf. the ideal adapter framework presented in Kleinschmidt & Jaeger, 2015). Indeed, many and some are not equally frequent in participants’ previous experience. For example, corpus counts indicate that the bigram some of occurs at least twice as often as many of (27,601 vs. 12,919 occurrences in the British National Corpus, respectively). However, adaptation of only the prior seems an unlikely explanation for the current results, given that the AUC change in the many- vs. some-biased conditions was overall symmetrical around 0 (see Figures 6 and 7), suggesting relatively similar changes to the representations of many and some.
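To make the differential effect of equal exposure concrete, the short sketch below works through a hypothetical Beta-Binomial learner. The specific pseudo-counts (loosely scaled from the BNC frequencies above) and the assumption of an initially 0.5 probability that each quantifier applies to the ambiguous display are our own illustrative choices, not analyses from this paper.

# Illustrative sketch (ours): with the same number of new observations,
# beliefs about an a priori less frequent quantifier shift more, because its
# prior is backed by fewer (pseudo-)observations.
def posterior_mean(prior_yes, prior_total, new_yes, new_total):
    """Posterior mean of P(quantifier applies to the ambiguous display)."""
    return (prior_yes + new_yes) / (prior_total + new_total)

prior_total = {"some": 28.0, "many": 13.0}   # assumed roughly 2:1 prior experience
for quantifier, n in prior_total.items():
    before = posterior_mean(n / 2, n, 0, 0)      # assumed prior mean of 0.5
    after = posterior_mean(n / 2, n, 10, 10)     # 10 consistent exposure trials each
    print(f"{quantifier}: {before:.2f} -> {after:.2f}")
# Prints roughly: some: 0.50 -> 0.63, many: 0.50 -> 0.72

With identical evidence, the estimate for the less frequent quantifier (many) moves further, consistent with the point made above.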
Kleinschmidt and Jaeger (2015) discuss many of these different models and the extent to which they capture existing data on talker-specificity, adaptation, and generalization in speech perception.
Contributor Information
Ilker Yildirim, University of Rochester Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology Department of Brain and Cognitive Sciences, The Rockefeller University Laboratory of Neural Systems.
Judith Degen, Stanford University Department of Psychology.
Michael K. Tanenhaus, University of Rochester Department of Brain and Cognitive Sciences, Department of Linguistics.
T. Florian Jaeger, University of Rochester Department of Brain and Cognitive Sciences, Department of Computer Science, Department of Linguistics.
References
- Allen JS, Miller JL, DeSteno D. Individual talker differences in voice-onset-time. Journal of the Acoustical Society of America. 2003;113(1):544–552. doi: 10.1121/1.1528172. [DOI] [PubMed] [Google Scholar]
- Arai M, van Gompel RPG, Scheepers C. Priming ditransitive structures in comprehension. Cognitive Psychology. 2007;54(3):218–50. doi: 10.1016/j.cogpsych.2006.07.001. http://doi.org/10.1016/j.cogpsych.2006.07.001. [DOI] [PubMed] [Google Scholar]
- Baayen RH, Davidson DJ, Bates DM. Mixed-effects modeling with crossed random effects for subjects and items. Journal of Memory and Language. 2008;59(4):390–412. [Google Scholar]
- Bach K. Saying, meaning, and implicating. In: Allan K, Jaszczolt K, editors. The Cambridge Handbook of Pragmatics. Cambridge University Press; 2012. [Google Scholar]
- Bates D, Maechler M, Bolker B, Walker S. lme4: Linear mixed-effects models using Eigen and S4. 2014 Retrieved from http://cran.r-project.org/package=lme4.
- Bauer L. Tracing phonetic change in the received pronunciation of British English. Journal of Phonetics. 1985;13(1):61–81. Retrieved from http://cat.inist.fr/?aModele=afficheN&cpsidt=8639296. [Google Scholar]
- Branigan HP, Pickering MJ, McLean JF. Priming prepositional-phrase attachment during comprehension. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2005;31(3):468. doi: 10.1037/0278-7393.31.3.468. [DOI] [PubMed] [Google Scholar]
- Brennan SE, Clark HH. Conceptual pacts and lexical choice in conversation. Journal of Experimental Psychology. Learning, Memory, and Cognition. 1996;22(6):1482–1493. doi: 10.1037//0278-7393.22.6.1482. Retrieved from http://www.ncbi.nlm.nih.gov/pubmed/8921603. [DOI] [PubMed] [Google Scholar]
- Brown-Schmidt S, Yoon SO, Ryskin RA. Chapter Three – People as Contexts in Conversation. Psychology of Learning and Motivation. 2015;62:59–99. http://doi.org/10.1016/bs.plm.2014.09.003. [Google Scholar]
- Budescu DV, Wallsten TS. Consistency in Interpretation of Probabilistic Phrases. Organizational Behavior and Human Decision Processes. 1985;36:391–405. [Google Scholar]
- Chang F, Dell GS, Bock K. Becoming syntactic. Psychological Review. 2006;113(2):234. doi: 10.1037/0033-295X.113.2.234. [DOI] [PubMed] [Google Scholar]
- Clark A. Whatever next? Predictive brains, situated agents, and the future of Cognitive Science. Behavioral and Brain Sciences. 2013;36(03):181–204. doi: 10.1017/S0140525X12000477. [DOI] [PubMed] [Google Scholar]
- Clayards M, Tanenhaus MK, Aslin RN, Jacobs RA. Perception of speech reflects optimal use of probabilistic speech cues. Cognition. 2008;108(3):804–809. doi: 10.1016/j.cognition.2008.04.004. http://doi.org/10.1016/j.cognition.2008.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Creel SC, Aslin RN, Tanenhaus MK. Heeding the voice of experience: The role of talker variation in lexical access. Cognition. 2008;106(2):633–664. doi: 10.1016/j.cognition.2007.03.013. http://doi.org/10.1016/j.cognition.2007.03.013. [DOI] [PubMed] [Google Scholar]
- Creel SC, Bregman MR. How Talker Identity Relates to Language Processing. Language and Linguistics Compass. 2011;5(5):190–204. http://doi.org/10.1111/j.1749-818X.2011.00276.x. [Google Scholar]
- Creel SC, Tumlin MA. Talker information is not normalized in fluent speech: Evidence from on-line processing of spoken words. 31st Annual Conference of the Cognitive Science Society; Amsterdam, The Netherlands. 2009. [Google Scholar]
- Eisner F, McQueen JM. Perceptual learning in speech: Stability over time. The Journal of the Acoustical Society of America. 2006;119(4):1950. doi: 10.1121/1.2178721. http://doi.org/10.1121/1.2178721. [DOI] [PubMed] [Google Scholar]
- Fine AB, Jaeger TF. Evidence for Implicit Learning in Syntactic Comprehension. Cognitive Science. 2013;37:578–591. doi: 10.1111/cogs.12022. http://doi.org/10.1111/cogs.12022. [DOI] [PubMed] [Google Scholar]
- Fine AB, Jaeger TF, Farmer TA, Qian T. Rapid Expectation Adaptation during Syntactic Comprehension. PloS One. 2013;8(10):e77661. doi: 10.1371/journal.pone.0077661. http://doi.org/10.1371/journal.pone.0077661. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fine AB, Qian T, Jaeger TF, Jacobs RA. Is there syntactic adaptation in language comprehension? In: Hale JT, editor. Proceedings of ACL: Workshop on Cognitive Modeling and Computational Linguistics. Association for Computational Linguistics; Stroudsburg, PA: 2010. pp. 18–26. [Google Scholar]
- Finegan E, Biber D. Style and Sociolinguistic Variation. Cambridge University Press; 2001. Register variation and social dialect variation: The register axiom. [Google Scholar]
- Friston K. A theory of cortical responses. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences. 2005;360(1456):815–36. doi: 10.1098/rstb.2005.1622. http://doi.org/10.1098/rstb.2005.1622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goldinger SD. Words and voices: episodic traces in spoken word identification and recognition memory. Journal of Experimental Psychology: Learning, Memory, and Cognition. 1996;22(5):1166. doi: 10.1037//0278-7393.22.5.1166. [DOI] [PubMed] [Google Scholar]
- Goldinger SD. Echoes of echoes? An episodic theory of lexical access. Psychological Review. 1998;105(2):251. doi: 10.1037/0033-295x.105.2.251. [DOI] [PubMed] [Google Scholar]
- Gorman KS, Gegg-Harrison W, Marsh CR, Tanenhaus MK. What’s learned together stays together: Speakers' choice of referring expression reflects shared experience. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2013;39(3):843. doi: 10.1037/a0029467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goudbeek M, Krahmer E. Alignment in interactive reference production: Content planning, modifier ordering, and referential overspecification. Topics in Cognitive Science. 2012;4(2):269–289. doi: 10.1111/j.1756-8765.2012.01186.x. [DOI] [PubMed] [Google Scholar]
- Halff HM, Ortony A, Anderson RC. A context-sensitive representation of word meanings. Memory & Cognition. 1976;4(4):378–83. doi: 10.3758/BF03213193. http://doi.org/10.3758/BF03213193. [DOI] [PubMed] [Google Scholar]
- Harrell FEJ. rms: Regression modeling strategies. 2014 Retrieved from http://cran.r-project.org/package=rms.
- Harrington J, Palethorpe S, Watson C. Monophthongal vowel changes in Received Pronunciation: An acoustic analysis of the Queen’s Christmas broadcasts. Journal of the International Phonetic Association. 2000;30(1-2):63–78. [Google Scholar]
- Heller D, Chambers CG. Would a blue kite by any other name be just as blue? Effects of descriptive choices on subsequent referential behavior. Journal of Memory and Language. 2014;70:53–67. [Google Scholar]
- Hörmann H. Was tun die Wörter miteinander im Satz? oder wieviele sind einige, mehrere und ein paar? Verlag für Psychologie, Dr. C.J. Hogrefe; Göttingen: 1983. [Google Scholar]
- Horton WS, Gerrig RJ. The impact of memory demands on audience design during language production. Cognition. 2005;96(2):127–42. doi: 10.1016/j.cognition.2004.07.001. http://doi.org/10.1016/j.cognition.2004.07.001. [DOI] [PubMed] [Google Scholar]
- Horton WS, Gerrig RJ. Revisiting the memory-based processing approach to common ground. 2015. Manuscript submitted for publication. [DOI] [PubMed]
- Jaeger TF, Snider NE. Alignment as a consequence of expectation adaptation: Syntactic priming is affected by the prime’s prediction error given both prior and recent experience. Cognition. 2013;127(1):57–83. doi: 10.1016/j.cognition.2012.10.013. http://doi.org/10.1016/j.cognition.2012.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson K. Speech perception without speaker normalization: An exemplar model. In: Johnson K, Mullenix J, editors. Talker Variability in Speech Processing. Academic Press; 1997. pp. 145–166. [Google Scholar]
- Johnson K. Resonance in an exemplar-based lexicon: The emergence of social identity and phonology. Journal of Phonetics. 2006;34(4):485–499. [Google Scholar]
- Kamide Y. Learning individual talkers’ structural preferences. Cognition. 2012;124(1):66–71. doi: 10.1016/j.cognition.2012.03.001. [DOI] [PubMed] [Google Scholar]
- Kamp H. Prototype theory and compositionality. Cognition. 1995;57(2):129–191. doi: 10.1016/0010-0277(94)00659-9. http://doi.org/10.1016/0010-0277(94)00659-9. [DOI] [PubMed] [Google Scholar]
- Kennedy C, McNally L. Scale Structure, Degree Modification, and the Semantics of Gradable Predicates. Language. 2005;81(2):345–381. http://doi.org/10.2307/4489896. [Google Scholar]
- Klein E. A semantics for positive and comparative adjectives. Linguistics and Philosophy. 1980;4(1):1–45. http://doi.org/10.1007/BF00351812. [Google Scholar]
- Kleinschmidt DF, Fine AB, Jaeger TF. A belief-updating model of adaptation and cue combination in syntactic comprehension. Proceedings of the 34th Annual Meeting of the Cognitive Science Society (CogSci12); Austin, TX: Cognitive Science Society; 2012. pp. 599–604. [Google Scholar]
- Kleinschmidt DF, Jaeger TF. Robust speech perception: Recognize the familiar, generalize to the similar, and adapt to the novel. Psychological Review. 2015;122(2):148. doi: 10.1037/a0038695. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kleinschmidt DF, Jaeger TF. Re-examining selective adaptation: Fatiguing feature detectors, or distributional learning? University of Rochester; 2015b. Manuscript submitted for review. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kleinschmidt DF, Jaeger TF. A Bayesian belief updating model of phonetic recalibration and selective adaptation. Proceedings of the Cognitive Modeling and Computational Linguistics Workshop at ACL; 2011. pp. 10–19. [Google Scholar]
- Kraljic T, Samuel AG. Perceptual adjustments to multiple speakers. Journal of Memory and Language. 2007;56(1):1–15. [Google Scholar]
- Kurumada C, Brown M, Bibyk S, Pontillo D, Tanenhaus MK. Rapid adaptation in online pragmatic interpretation of contrastive prosody. Proceedings of the 37th Annual Meeting of the Cognitive Science Society; 2014. [Google Scholar]
- Kurumada C, Brown M, Tanenhaus MK. Pragmatic interpretation of contrastive prosody: It looks like speech adaptation. Proceedings of the 34th Annual Conference of the Cognitive Science Society; 2012. pp. 647–652. [Google Scholar]
- Lancia L, Winter B. The interaction between competition, learning, and habituation dynamics in speech perception. Laboratory Phonology. 2013;4(1):221–257. [Google Scholar]
- McCloskey M, Cohen NJ. Catastrophic interference in connectionist networks: The sequential learning problem. In: Bower GH, editor. Psychology of Learning and Motivation. Academic Press; New York: 1989. [Google Scholar]
- McRae K, Hetherington P. Catastrophic interference is eliminated in pre-trained nets. Proceedings of the 15th Annual Conference of the Cognitive Science Society; Hillsdale, NJ: Erlbaum; 1993. [Google Scholar]
- Metzing C, Brennan SE. When conceptual pacts are broken: Partner-specific effects on the comprehension of referring expressions. Journal of Memory and Language. 2003;49(2):201–213. http://doi.org/10.1016/S0749-596X(03)00028-7. [Google Scholar]
- Mirman D, McClelland JL, Holt LL. An interactive Hebbian account of lexically guided tuning of speech perception. Psychonomic Bulletin & Review. 2006;13(6):958–965. doi: 10.3758/bf03213909. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Newstead SE. Quantifiers as fuzzy concepts. In: Zetenyi T, editor. Fuzzy Sets in Psychology. Elsevier Science; Amsterdam: 1988. pp. 51–72. [Google Scholar]
- Norris D, McQueen JM, Cutler A. Perceptual learning in speech. Cognitive Psychology. 2003;47(2):204–238. doi: 10.1016/s0010-0285(03)00006-9. [DOI] [PubMed] [Google Scholar]
- Pepper S, Prytulak LS. Sometimes Frequently Means Seldom: Context Effects in the Interpretation of Quantitative Expressions. Journal of Research in Personality. 1974;8:95–101. [Google Scholar]
- Pickering MJ, Branigan HP. The representation of verbs: Evidence from syntactic priming in language production. Journal of Memory and Language. 1998;39(4):633–651. [Google Scholar]
- Pickering MJ, Ferreira VS. Structural priming: a critical review. Psychological Bulletin. 2008;134(3):427. doi: 10.1037/0033-2909.134.3.427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pickering MJ, Garrod S. Toward a mechanistic psychology of dialogue. Behavioral and Brain Sciences. 2004;27(02):169–190. doi: 10.1017/s0140525x04000056. [DOI] [PubMed] [Google Scholar]
- Pierrehumbert JB. Exemplar dynamics: Word frequency, lenition and contrast. In: Bybee J, Hopper P, editors. Frequency effects and the emergence of linguistic structure. John Benjamins; Amsterdam: 2001. pp. 137–157. [Google Scholar]
- R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: 2014. Retrieved from http://www.r-project.org. [Google Scholar]
- Reitter D, Keller F, Moore JD. A computational cognitive model of syntactic priming. Cognitive Science. 2011;35(4):587–637. doi: 10.1111/j.1551-6709.2010.01165.x. [DOI] [PubMed] [Google Scholar]
- Roland D, Dick F, Elman JL. Frequency of basic English grammatical structures: A corpus analysis. Journal of Memory and Language. 2007;57(3):348–379. doi: 10.1016/j.jml.2007.03.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seidenberg MS. Language and connectionism: The developing interface. Cognition. 1994;50:385–401. doi: 10.1016/0010-0277(94)90037-x. [DOI] [PubMed] [Google Scholar]
- Tagliamonte S, Smith J. No momentary fancy! The zero “complementizer” in English dialects. English Language and Linguistics. 2005;9(02):289–309. [Google Scholar]
- Traxler MJ, Tooley KM. Priming in sentence comprehension: Strategic or syntactic? Language and Cognitive Processes. 2008;23(5):609–645. doi: 10.1080/01690965.2013.770892. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Linden S, Vroomen J. Recalibration of phonetic categories by lipread speech versus lexical information. Journal of Experimental Psychology: Human Perception and Performance. 2007;33(6):1483–1494. doi: 10.1037/0096-1523.33.6.1483. http://doi.org/10.1037/0096-1523.33.6.1483. [DOI] [PubMed] [Google Scholar]
- Weatherholtz K, Jaeger TF. Speech perception and generalization across talkers and accents. 2015. Manuscript submitted for publication.
- Weiner EJ, Labov W. Constraints on the agentless passive. Journal of Linguistics. 1983;19(01):29–58. [Google Scholar]
- Yaeger-Dror M. Phonetic evidence for sound change in Quebec French. Phonological Structure and Phonetic Form. 1994:267–293. [Google Scholar]