Reference as an Interactive Achievement: Sequential and Longitudinal Analyses of Labeling Interactions in Shared Book Reading and Free Play

Vivien Heller; Katharina J Rohlfing

2017 Feb 14;8:139. doi: 10.3389/fpsyg.2017.00139

PMCID: PMC5306378 PMID: 28261122

Abstract

The present study examines how young children and their caregivers establish reference by jointly developing stable patterns of bodily, perceptual, and interactive coordination. Our longitudinal investigation focuses on two mother–child dyads engaged in picture-book reading and play. The dyads were videotaped at home once every 6 weeks as the children aged from 9 to 24 months. Inspired by conversation analysis and multimodal analysis, our developmental approach builds on the insight that the situated and embodied production of reference is fundamentally an interactive achievement. To examine the acquisition of reference, we developed a descriptive instrument that takes account of not only the dyad's joint accomplishment but also each participant's contributions to it. The instrument is based on the sequential reconstruction of the jobs that both participants have to accomplish jointly in order to achieve reference: establishing visual perception as a relevant resource, constituting a domain of scrutiny, locating a target, and construing the (meaning of the) referent. Methodologically, these jobs serve as a tertium comparationis for the longitudinal comparison of both the adult's and the child's contributions to establishing reference. We used this instrument to examine (1) what bodily and verbal resources the participants employed, and (2) how their contributions to accomplishing the jobs changed over time. Findings showed that the acquisition of reference was closely related to the child's increasing ability to recognize, fulfill, and set up conditional relevancies. We conclude that the adult's dynamic and contextualized use of conditional relevancies, recipient design, and observability is a crucial driving force in the acquisition of reference.

Keywords: reference, sequential organization, conditional relevance, observability, coordination, interaction, language acquisition, joint attention

Introduction

Determining how young children come to understand that words refer to something has been a continuous topic in language acquisition research. For Bruner (1976, p. 69), the acquisition of reference entails the problem of “how one individual manages to get another to share, attend to, zero in upon a topic that is occupying him.” Arriving at a shared understanding of a referent is a substantial challenge when reference is conceived merely as words being mapped onto their referents, because in the real world, there are simply too many options when it comes to selecting one of the numerous potential referents (Trueswell et al., 2016). Considering the fact that speakers often produce “proxy” or “dummy” noun phrases (e.g., “what's-his-name”) for the referent, Clark and Wilkes-Gibbs (1986) asked how it is possible for participants to be sufficiently sure of having achieved a mutual understanding of the referent—a problem that Clark and Marshall (1981) referred to as the “mutual knowledge paradox.” This paradox also exists when reference is established non-verbally by, for example, pointing to an object within the coparticipants' joint perceptual space. Pointing is usually understood as a “communicative body movement that projects a vector from a body part” and “indicates a certain direction, location, or object” (Kita, 2003, p. 1). At first sight, the meaning of pointing seems to be self-evident in that it requires only the recipient to “trace, by symbolic extrapolation, a path from the gesture to the thing” (Fillmore, 1997, p. 6). Yet the mutual knowledge paradox remains, because pointing gestures only roughly indicate a certain area that may be populated by various persons, objects, and so forth. 
Even if the recipient manages to locate the pointed-to target and thus to resolve this perceptual ambiguity, she or he still needs to sort out another problem: Does the pointing refer to the object as such, or to one of its features; or does it simply predicate that the object is located in a particular area (see Kita, 2003, p. 3)? The meaning of the pointed-to target—the actual referent—still remains ambiguous. And yet, in everyday interaction, reference is usually achieved without problems.

In this article, we assume that participants themselves have developed procedural and linguistic solutions for dealing with perceptual and semantic ambiguities. Acquiring reference would then mean acquiring these procedural and linguistic solutions. Following a pragmatic perspective (Rohlfing et al., 2016), we assume that for a situation to become “shared,” interactants have to arrive at a joint understanding of the purpose of their activity. As a result, children need to learn “as much about the rules of dialogue” as they learn about the “lexical labels” (Bruner, 1976, p. 74).

A number of answers have been proposed in response to the question of when and how children engage in establishing joint reference. In the following, we give a rough overview of the relevant streams of research and show how existing studies have mapped out the cognitive and communicative resources as well as the external resources that are necessary for the acquisition of reference.

Cognitive and communicative resources for establishing reference

Children have been found to engage in joint attention (JA) from 9 months onward. JA is achieved when both partners manage to engage with the same referent. However, it was results reported by Baldwin (1991, 1993) that first motivated a closer investigation of the child's sociocognitive abilities. She demonstrated that infants “are not just passive in the joint reference enterprise” (Baldwin, 1993, p. 398). They have a range of communicative means at their disposal with which not only to display their interest in objects, persons, and so forth but also to direct their coparticipant's attention (e.g., Liszkowski et al., 2004; Liszkowski, 2005; Begus and Southgate, 2012). They use these resources for both imperative and declarative purposes (Bates et al., 1976; Franco and Butterworth, 1996; Liszkowski et al., 2004, 2007). Moreover, they understand that their actions have a bearing on their partner, and they use this knowledge to elicit a label or further talk (Begus and Southgate, 2012; Begus et al., 2014). Pointing is among the first communicative means for directing the coparticipant's attention to objects and events (Bruner, 1983; Franco and Butterworth, 1991; Marcos, 1991; Butterworth and Itakura, 2000; Behne et al., 2012). At around 14 months of age, children accompany their pointing with the local deictic “da!” or “there” (Clark, 1978; Clark and Sengul, 1978; Murphy, 1978). Clark (1978) has proposed four stages in the development from deictic gestures to deictic words:

At the first stage, children use gestures like pointing to pick out an object for their “listeners.” At the second, they add to their gesture their first deictic word, often in the form eh (from adult there) or da (from adult that). Later still, at a third stage, they combine a deictic word with other words to form longer utterances like That shoe… Finally, at a fourth stage, they learn how to use deictic words in utterances without any accompanying gesture (p. 96).

Whereas the stages capture a progression in the child's use of deictic means, they do not reflect the need for deixis to also be embedded in the ongoing interaction. Yet to be successful, the child has to make sure that the partner is ready to perceive the pointing (“visual checking,” see Franco and Butterworth, 1996). In other words, pointing must be prepared interactively. Likewise, pointing makes a certain reaction by the recipient relevant. Filipi (2013, p. 145) has shown that children first learn to establish joint attention and are then held “accountable for ‘doing’ something with that attention when it is provided.” Hence, it seems that the “recognition of a situation as communication” (Gliga and Csibra, 2009, p. 352) and the child's sensitivity to the organization and the purpose of the task are important for acquiring reference. Studies applying sequential analyses to young children's interactions stress the public nature or “observability” of each participant's actions as a crucial resource (Wootton, 1997; Kidwell and Zimmerman, 2006, 2007). What is lacking, however, are studies of early interactions showing how this “observability” is achieved and adapted to children's communicative and cognitive abilities.

External resources for the acquisition of reference

Input-oriented approaches have examined how adults facilitate JA; how they modify their talk in episodes of JA; and how adult feedback affects developments in referential communication (see Ateş-Şen and Küntay, 2015, for an overview). Mothers have been found to point and refer to objects verbally more often in episodes of JA (e.g., Bruner, 1981; Tomasello and Farrar, 1986; Marcos, 1991). Furthermore, parameters for “referential transparency” (Trueswell et al., 2016, p. 11; Schmidt, 1996) have been identified that help children to attend to novel objects visually and thus to resolve ambiguities when linking objects with words (Pruden et al., 2006; Horst and Samuelson, 2008; Axelsson et al., 2012; Liszkowski, 2014; Trueswell et al., 2016; Yu and Smith, 2016). Adult coparticipants often present objects and actions in salient ways. They bring objects into the child's visual focus, shake them, and thus exploit the child's sensitivity to human movement (e.g., Rader and Zukow-Goldring, 2010; Pitsch et al., 2014; Yu and Smith, 2016). In interactions with older children, mothers rely on verbal behavior to initiate and maintain their child's attention (Estigarribia and Clark, 2007). Although the caregiver's “input” in episodes of JA has been shown to correlate positively with the child's use of pointing (Murphy, 1978; Marcos, 1991) and vocabulary (Tomasello and Farrar, 1986), these studies do not fully explain how participants actually arrive at a shared situation and a mutual understanding of the referent, a demand that clearly goes beyond joint attention to a particular target and requires the solving of semantic tasks.

Another strand of research investigating external resources looks beyond the phenomenon of JA. These studies take a broader view on the interactive contexts in which reference is established, and examine how interaction forms a source in the child's cognitive development (Vygotsky, 1998). A number of studies taking this approach have examined how the sequential structure of routines such as games or joint book readings is established (Ninio and Bruner, 1978; Snow and Goldfield, 1983; Filipi, 2009, 2013; Fantasia et al., 2014; Rossmanith et al., 2014; Heller and Rohlfing, 2015; Rohlfing et al., 2015, 2016). Based on a longitudinal study of one mother–child dyad, Ninio and Bruner (1978, p. 8) demonstrated that picture-book reading takes the form of a “standard action format” that consists of recurring dialogue cycles, each comprising an orderly sequence of moves. From a conversation analytic perspective, the structure is underpinned by “conditional relevancies” (Schegloff and Sacks, 1973); that is, normative expectations regarding what type of “relevant next” should follow a move of a certain type. In interactions with young children, adults have been found to “plan ahead” for conditional relevancies, thus guiding the child and creating “an interactional context that is most likely to occasion a desired response” (Mehus, 2011, p. 133). Such stable organization helps children to identify and predict recurring semantic-pragmatic elements in a sequence (Ratner and Bruner, 1978; Snow and Goldfield, 1983). Drawing on microanalyses, Rossmanith and colleagues have examined how caregivers structure book reading routines by shaping parts of activities into bigger or smaller dynamic “action arcs” with a beginning, build up, climax, and resolution (Rossmanith et al., 2014, p. 8). These render the structure of the routine visible for the child. 
By providing a recurring pattern, they facilitate the coordination of not only visible behaviors but also cognitive and perceptual operations (Rohlfing et al., 2016).

Focusing on adult–adult interactions, multimodal and sequential approaches have examined which “practical problems” participants have to solve when establishing reference. They have shown that joint reference is a sequentially organized process that requires participants' coordination of body posture, gaze, movements and verbal resources (Hanks, 2000; Hindmarsh and Heath, 2000; Goodwin, 2003b; Stukenbrock, 2009; Mondada, 2012; Sidnell and Enfield, 2016). The present study examines how children become involved in this interactive and sequentially organized process and how stable patterns of bodily, perceptual, and interactive coordination emerge over time. In the following section, we present an analytical instrument with which to describe this process. The instrument is based on the sequential reconstruction of the interactive jobs (see next section) that are constitutive for establishing reference. Using these jobs as a tertium comparationis, we examine how each job is achieved interactively at different data points and relate changes in the devices available to children and their shares in performing the jobs to changes in the adult's interactive demands and support. In the last section, we develop an explanatory account of what drives the acquisition of reference. We argue that fundamental features of interaction—sequential organization, recipient design, and observability—inform the supportive practices that adults employ to achieve joint reference in interactions with young children.

A descriptive instrument for analyzing reference and its acquisition as interactive achievements

Interactive jobs of establishing reference

When establishing reference, participants have to solve at least two problems: First, they have to deal with the perceptual problem of locating a target. Second, they have to solve the semantic problem of identifying or, rather, construing the referent. Establishing reference thus involves recurrent practical problems that require the ongoing and dynamic coordination of the participants' bodily and visual conduct. This is why participants rely on procedural solutions or “practical methods” (Garfinkel, 1967) that enable them to treat and perform “establishing reference” as an “unproblematic” activity in their everyday lives. Building on a framework based on sequential analyses of establishing reference in different settings such as dinner talk, guided tours, self-defense classes, physician–patient consultations (Stukenbrock, 2009, 2015), and picture-book reading (Heller and Rohlfing, 2015), we assume that the procedural solution to establishing reference entails four sequentially ordered jobs.

Job 1: establishing visual perception as a relevant resource

To make a pointing gesture perceptible, the pointing person has to establish her or his body as a perceptually relevant resource (Hindmarsh and Heath, 2000; Goodwin, 2003b; Stukenbrock, 2009; Mondada, 2012). Therefore, bodily displays must be coordinated with the recipient's visual attention. Hindmarsh and Heath (2000) have shown that speakers employ verbal resources such as deictic terms (“here!”) to highlight the very moment at which visual orientation becomes relevant—a resource that is also employed in interactions with children (Estigarribia and Clark, 2007, p. 804). The recipient, on the other hand, is required to direct her or his visual attention toward the speaker and to understand that the partner's arm or index finger is not relevant in itself but should be interpreted as an instrument referring to something else and thus serving as an intermediary locus of attention (Stukenbrock, 2009; Rader and Zukow-Goldring, 2010).

Job 2: constituting a domain of scrutiny

Next, the recipient needs to understand what space the speaker is orienting toward. It is important to emphasize that the speaker's display of attention (her or his orientation toward a certain space by posture, pointing, or local deictics) does not yet indicate a particular object in space. Rather than transparently locating the target itself, it “specifies…a domain of scrutiny, a region where the addressee should begin to search for something that might count as target” (Goodwin, 2003a, p. 73). The coparticipant is thus required to reorient her or his visual attention; that is, to shift it from the body of the speaker to a “search space” (Stukenbrock, 2009, p. 304). At the same time, the speaker needs to monitor whether the coparticipant construes the search space in the same way as she or he does. Hence, this job is accomplished when both participants have established a particular space as a shared focus of attention.

Job 3: locating the target

This job requires the recipient to determine the particular target of the pointing gesture. Unlike Butterworth, we do not assume that the act of locating coincides with the identification of the referent. Butterworth (2003) suggests that certain ecological mechanisms enable a “‘meeting of minds’ in the selfsame object” (p. 22). Likewise, other studies have assumed that locating a target already implies understanding its meaning (e.g., Pruden et al., 2006; Axelsson et al., 2012; Trueswell et al., 2016). Admittedly, locating the target and construing the referent are often achieved at one go. Yet, misunderstandings and repairs do occur in the process of establishing reference (see below), suggesting that locating a target and construing the referent are in fact different achievements (Stukenbrock, 2009, 2015). Whereas locating a target requires a perceptual effort (which may lead to shared perception), construing the referent is a semantic process (occasioning shared understanding). Our own analyses of the ways in which not yet competent members are involved in establishing reference (Heller and Rohlfing, 2015) provide further evidence for the need to distinguish between the two.

Job 4: construing the referent

Once the target is located, the recipient needs to disambiguate its meaning. Therefore, she or he needs to tie acts of pointing or verbal deictics and labels “to the construals of entities and events provided by other meaning-making resources as participants work to carry out courses of collaborative action with each other” (Goodwin, 2003b, p. 218). Hence, to identify the referent, the coparticipant draws on contextual resources; that is, her or his understanding of the joint activity (e.g., book reading, building a tower) in which the reference is embedded (Hindmarsh and Heath, 2000; Liszkowski, 2014). She or he then develops hypotheses about the meaning of the pointed-to target (Stukenbrock, 2009, p. 307). This semantic work is conducted visibly and verbally: Adult recipients often display their understanding that can then be confirmed, specified, or repaired by the speaker (Stukenbrock, 2015, p. 316).

To summarize, we conceptualize reference as an interactive and sequentially organized process that requires participants to observably and methodically orient themselves toward four jobs. Whereas previous developmental research has focused mainly on Jobs 1 and 3 (Estigarribia and Clark, 2007), sequential analyses provide evidence that establishing reference also requires participants to constitute a domain of scrutiny and to construe the referent. The four sequentially ordered jobs thus serve as a procedural solution to practical problems of perceptual and semantic ambiguity. Note that the scope of our descriptive instrument covers basic forms of reference; that is, activities in which participants refer to something in their immediate surroundings. It does not apply to references to past, future, or fictitious events.

Descriptive levels of the instrument

Starting from the perspective that reference is fundamentally an interactive achievement, a developmental approach to reference has to tackle the question of how individual abilities can be described without ignoring the fact that reference is a collaboratively organized process. Our solution to this problem is to view the interactive process itself as a part of the analysis. Therefore, we build on an analytical approach developed by Hausendorf and Quasthoff (2005) that was originally designed to examine the acquisition of narrative competence. Adapting this instrument to the acquisition of reference, we distinguish two levels of description: the level of jobs and the level of the devices needed to get the jobs done.

Jobs represent the organizational tasks (Sacks, 1995; Quasthoff et al., 2017) the participants orient toward in the joint achievement of reference. Because these jobs follow a sequential logic, this level of description captures the sequential organization of reference. Furthermore, the present analysis will demonstrate that each of the four jobs is organized as a two-part exchange or adjacency pair in which a move of type A establishes a “conditional relevance” for a move of type B (Schegloff and Sacks, 1973). Hence, the second move is functionally dependent on (or made normatively expectable by) the first. Each job has been achieved when the second pair part of the expected type has been produced. Reference, then, is successfully established when each of the four jobs has been fulfilled regardless of how and by whom. The jobs thus serve as a tertium comparationis for the longitudinal comparison of both the adult's and the child's contributions to establishing reference.
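The logic of the job level can be sketched as a small data structure. The following Python sketch is purely illustrative and is not part of the instrument itself: the class names `PairPart` and `Job`, the participant labels, and the free-text device descriptions are assumptions introduced here. It only encodes what the paragraph above states: each job is a two-part exchange, a job counts as achieved once the second pair part has been produced, and reference counts as established when all four jobs are fulfilled, regardless of how and by whom.

```python
from dataclasses import dataclass
from typing import Optional

# The four sequentially ordered jobs named in the text.
JOBS = (
    "establishing visual perception as a relevant resource",
    "constituting a domain of scrutiny",
    "locating the target",
    "construing the referent",
)


@dataclass
class PairPart:
    participant: str  # illustrative label, e.g. "adult" or "child"
    device: str       # free-text description of the bodily or verbal device


@dataclass
class Job:
    name: str
    first: Optional[PairPart] = None   # move that sets up the conditional relevance
    second: Optional[PairPart] = None  # move that fulfils the conditional relevance

    @property
    def achieved(self) -> bool:
        # A job is achieved once the expected second pair part has been produced.
        return self.second is not None


def reference_established(jobs: list[Job]) -> bool:
    """True if all four jobs, in their sequential order, have been achieved."""
    in_order = [job.name for job in jobs] == list(JOBS)
    return in_order and all(job.achieved for job in jobs)


# Hypothetical coding of a single episode: only Job 1 has been completed so far.
episode = [Job(name) for name in JOBS]
episode[0].first = PairPart("adult", "what question")
episode[0].second = PairPart("child", "looks at picture")
```

Because `reference_established` ignores who produced each pair part, the same check applies whether the adult or the child takes over a job, which is what allows the jobs to serve as a fixed point of comparison across data points.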

Devices is the term given to the bodily, prosodic, and verbal means or resources with which the jobs are accomplished. They describe each participant's contributions to the jobs. Moreover, different devices can be deployed to accomplish the jobs.

By distinguishing between interactive jobs and devices, the instrument takes into account both the dyad's joint accomplishment and each participant's contributions to establishing reference. It thus provides the basis for a longitudinal comparison of the adult's and the child's contributions without losing sight of the fact that reference is coconstructed. This allows us to examine (1) what bodily-visual and verbal resources participants employ to accomplish the jobs and (2) how their shares in the jobs change over time.

Materials and methods

Participants

The longitudinal analysis is based on video recordings of face-to-face interactions between caregivers and two typically developing children as they aged from 9 to 24 months. These dyads were selected from a larger corpus (e.g., Rohlfing et al., 2015) and include children of both genders. Based on our corpus, they represent “typical” courses of language acquisition. Participants were recruited in the German city of Bielefeld and its surroundings. The mothers' educational background was comparable; both had university degrees.

Data collection and transcription

Each family was visited at home once every 6 weeks (12 data points). Two different activities were videotaped, free play (lasting 20–25 min) and picture-book reading (lasting 5–10 min). For the latter activity, the dyads were given a colorful folder: Each page presented photographs showing, for example, a spoon on a mug or a child on a swing. Altogether, the corpus comprises 10.5 h of video recordings. For each point of data collection, three to eight episodes were transcribed in Elan (EUDICO Linguistic Annotator; Lausberg and Sloetjes, 2009). The 93 transcripts cover 42 min of interaction. The transcription follows the notation conventions of Gesprächsanalytisches Transkriptionssystem 2 (GAT 2, Couper-Kuhlen and Barth-Weingarten, 2011). It depicts participants' verbal, non-verbal (e.g., pointings, depictive gestures, gaze), and paraverbal actions (e.g., accentuation, pitch movement, loudness) in their sequential order (see Appendix). All transcripts were checked by two research assistants. Parents provided written informed consent for the study as well as specific consent for the publication of images in the transcripts. The names used in the transcripts are pseudonyms. The first number in the transcript title refers to the dyad (01 and 07); “BR” and “FP” refer to “book reading” and “free play.”
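The transcript titles can be decoded mechanically. The sketch below is a hypothetical helper, not a tool used in the study. It assumes that titles follow the pattern visible in the excerpts of this article (dyad number, activity code, a short label, and the child's age in months in parentheses), as in "07-BR-spoon (9 months)". Only the meaning of the dyad number and of "BR" and "FP" is stated in the text; the reading of the label and age fields is an assumption based on the excerpts.

```python
import re
from typing import NamedTuple

# Activity codes as explained in the text.
ACTIVITIES = {"BR": "book reading", "FP": "free play"}


class TranscriptTitle(NamedTuple):
    dyad: str        # "01" or "07" in this article
    activity: str    # "book reading" or "free play"
    label: str       # short episode label, e.g. "spoon" (assumed meaning)
    age_months: int  # child's age in months (assumed meaning)


# Assumed title pattern: <dyad>-<BR|FP>-<label> (<age> months)
_TITLE = re.compile(
    r"^(?P<dyad>\d{2})-(?P<code>BR|FP)-(?P<label>.+?)\s*\((?P<age>\d+) months\)$"
)


def parse_title(title: str) -> TranscriptTitle:
    """Split a transcript title into dyad, activity, label, and age."""
    match = _TITLE.match(title.strip())
    if match is None:
        raise ValueError(f"unrecognised transcript title: {title!r}")
    return TranscriptTitle(
        dyad=match["dyad"],
        activity=ACTIVITIES[match["code"]],
        label=match["label"],
        age_months=int(match["age"]),
    )
```

For example, `parse_title("01-FP-bag (10 months)")` yields dyad "01", activity "free play", label "bag", and an age of 10 months.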

Analytical procedure

The analysis entailed two steps: Drawing on conversation analysis (Sacks, 1995) and multimodal analysis (Streeck et al., 2011), we first examined how each job was achieved by the dyad in different interaction episodes (section Age-Related Sequential Analyses). This sequential analysis focused on the devices adults and children employed to get the jobs done. Examples are presented for four age spans (9–14, 15–17, 18–22, and 23–24 months). The age spans were not determined a priori, but are based on our analyses. They reflect changes in the adults' interactive demands and/or the children's contributions to establishing reference. In the second step, we related changes in the children's devices and shares in the jobs to changes in the adult's interactive demands and support (sections Longitudinal Comparison: Children's Devices and Shares in the Jobs and Longitudinal Comparison: Adults' Devices and Shares in the Jobs).
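The second step, relating shares in the jobs to age, can be illustrated with a simple tally. The sketch below is a minimal illustration under stated assumptions, not the procedure used in the study: it assumes that each coded episode has been reduced to a record of the child's age, the job number, and the participant who initiated the job. The four age spans are taken from the text, the function names are hypothetical, and the example records are invented for illustration.

```python
from collections import Counter
from typing import Iterable, Tuple

# Age spans (in months) as reported in the text. They were derived from the
# analyses rather than fixed in advance, so hard-coding them is illustrative.
AGE_SPANS = ((9, 14), (15, 17), (18, 22), (23, 24))


def age_span(age_months: int) -> str:
    """Return the label of the age span that contains age_months."""
    for low, high in AGE_SPANS:
        if low <= age_months <= high:
            return f"{low}-{high} months"
    raise ValueError(f"age outside the observed range: {age_months}")


def initiation_counts(records: Iterable[Tuple[int, int, str]]) -> Counter:
    """Count job initiations per (age span, job number, initiator)."""
    counts: Counter = Counter()
    for age_months, job, initiator in records:
        counts[(age_span(age_months), job, initiator)] += 1
    return counts


# Hypothetical records (age in months, job number, initiator); not study data.
example = [(9, 1, "adult"), (10, 1, "adult"), (17, 1, "child"), (24, 1, "child")]
counts = initiation_counts(example)
```

A tally of this kind only summarizes who initiated a job; the devices used and the way the adult's demands change over time still have to be read from the sequential analyses.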

Analyses and findings

Age-related sequential analyses

Establishing visual perception as a relevant resource (Job 1)

9–14 months

How visual perception is established as a relevant resource depends decisively on the participants' bodily arrangements. For book reading with young children, mothers typically arrange a nested configuration (Ochs et al., 2005) and position the child on their lap facing outwards (Figure 1). Thus, the child shares a visual field with the mother and does not need to redirect her or his gaze from the mother's body to the pointed-to domain of scrutiny (Job 2). When the mother points to the book, both her finger and the domain of scrutiny can be perceived simultaneously (see Yu and Smith, 2013). During play, participants sit face to face or side by side (Figure 2). This arrangement requires the pointing person to first draw the coparticipant's visual attention to her or his own body.

In the first sequence, Lea (9 months) is in a nested position.

(1) 07-BR-spoon (9 months)
001  L   [((turns page, looks at rings))]
002  M   [AH:::: was ham wir denn DA:::;]
         AH::: what do we have the::re;
003  L   ((looks at picture))

(2) 07-BR-spoon (9 months)
002  M   [AH:::: was ham wir denn DA:::;]
         AH::: what do we have the::re;

(3) 01-BR-book (10 months)
006  M   OAH (.) was ham_wa denn (-) ↑DA::;
         OAH (.) what do we have THE::RE;

(4) 01-FP-bag (10 months)
001  M:  |KOMM her ole; |
         COME here ole;
002      |°hhh SCHAU mal. |
         °hhh LOOK.
         |((opens bag))|

(5) 07-BR-red flower (15 months)
001  L   ((turns page))
002      °h-
003  M   BLUmen;
         FLOwers;

(6) 07-BR-mug (17 months)
001  M   |U:::ND, |
         A:::ND,
         |((turns page))|
002  L   oh;
003      ((rIF points to book))
004  M   ein LÖFfel,
         a SPOON,

(7) 01-BR-dino (19 months)
001  M:  ((turns page))
002  O:  |!Ä!O; |
         |((points to tiger…)) |
003      |DAS- |
         THAT-
         |((…points to tiger))|
004  M:  <<p> was ist DAS;>
         what is THAT;

(8) 07-BR-star (24 months)
001  L   |IST das? |
         IS that?
         |((points to picture))|
002  M   SAG_s mir.
         TELL me.

| | 9–14 months | 15–17 months | 18–22 months | 23–24 months |
|---|---|---|---|---|
| Adult | Initiates job by setting up a relevance for visual coordination. Devices: breathing in or interjection; what question or summons | | Higher expectation: verbal cues are omitted | |
| Child | Responds by coordinating visual attention | Initiates job by setting up a relevance for visual coordination. Devices: breathing in; interjections | | Initiates job by setting up a relevance for visual coordination. Devices: what questions and summons |

(9) 01-BR-dog (10 months)
001  M:  GUCK mal;
         LOOK;
002      |HIER; |
         HERE;
         |((holds book above Ole's head)) |
003  O:  ((touches book))

(10) 07-BR-spoon (9 months)
004  M   |WO: is der lÖffel; |
         WHE:RE is the spoon;
         |((moves book, lifts it up)) |
005      WO:: ist der lÖffel;
         WHE::RE is the spoon;
006  L   ((touches book with face))
007  M   WO ist der lÖffel?
         WHERE is the spoon?

(11) 07-BR-pen (14 months)
005  M   [WO ist der stift; ]
         WHERE is the pen;
006  L   [((tries to grasp pen)) ]
007  M   ah den möchtste wieder GREIfen;=ne,
         you wanna take it again;=right,
008      =GEHT nich;
         doesn't work;
009      |DA is der stift. |
         THERE is the pen.
         |((traces pen with rIF)) |
((…))
019  L   [((strokes with rIF over picture)) ]
020  M   [ja is ganz GLATT; (-) ]
         yes it's completely SMOOTH;

(12) 07-BR-fishing rod (15 months)
002  L   ((turns page))
003  M   !OH!;
004      eine ANGel;
         a fishing rod;
005  L   [((lifts lh, [holds it)) ]
006  M   [EIne ANGel; ]
         a fishing rod;

(13) 01-BR-thinking (17 months)
021  O:  ((stands up, moves into M's visual focus))
021      |!DA!- |
         !THERE!-
         |((looks at M, points to person standing behind the wall)) |
022  M:  sanDAlen;
         sandals
023  O:  |!DA!- |
         !THERE!-
         |((looks at M, points to place behind him)) |
024  M:  wollts nochma GUCKen geh:n,
         wanna go looking again,
025  O:  ((thinking face))

(14) 01-BR-dog (10 months)
003  O:  ((touches book with rH))
004  M:  °hhh;
005      |bs::::t, |
         |((moves IF over picture)) |
006      |OH::: eine MAUS;|
         a MOUSE;
         |((turns page)) |
007      |bs:::t, |
         |((moves IF over picture)) |
008      eine KATze;
         a CAT;

(16) 07-BR-mug (11 months)
019      (2.5)
020  M   |DA:: ist der becher; |
         THE::RE is the mug;
         |((guides Lea's hand, [taps on picture)) |]
021  L   [((looks at picture)) ]
022  M   |DA: ist der becher; |
         THE:RE is the mug;
         |((taps on picture)) |

(17) 01-BR-lion (17 months)
011  M   wo ist das AUge,
         where is the EYE,
012  L   [|!DA:!; |]
         !THERE!;
         [|((points to eye)) |]
013  M   [DA::] is das AUge vom kleinen
         THE::RE is the eye of the little
014      löwen;=genau;
         lion;=exactly;

| | 9–14 months | 15–17 months | 18–22 months | 23–24 months |
|---|---|---|---|---|
| Adult | Initiates job by setting up a conditional relevance for orienting toward the domain of scrutiny. Devices: where question (prosodic emphasis on interrogative/search) or summons (“HERE”); marking search space (book) by moving and lifting it; providing time for exploring the materiality of the book/scrutinizing the search space; rendering general features of depictions visible; demonstrating the use of the book | Book-reading setting: job is skipped as soon as the child understands the book as a potential domain of scrutiny | | |
| Child | Responds by orienting toward and exploring the domain of scrutiny | Play setting: initiates job by setting up a conditional relevance. Devices: directing the adult's attention toward distant entities by establishing diverging focuses; pursuing a response/reestablishing conditional relevancies | | |

(15) 07-BR-spoon (9 months)
`004`	`M`	`\|WO:`	`is`	`der`	`lÖffel;`	`\|`
		`WHE:RE`	`is`	`the`	`spoon;`
		`\|((moves book, lifts it up))`				`\|`
005		WO::	ist	der	lÖffel;
		WHE::RE	is	the	spoon;
`006`	`L`	`((touches book with face))`
`007`	`M`	`WO`	`ist`	`der`	`lÖffel?`
		`WHERE`	`is`	`the`	`spoon?`
`008`		`[\|<<breathy> DA:> ist der löffel.\|]`
				`THE:RE is the spoon.`
		`\|((points to spoon))`				`\|`
`009`	`L`	`[((places rh on picture))]`

`010`	`L`	`[((lh touches picture, fingers splayed))]`

`011`	`M`	`[<<☺> DA:]`	`ist`	`der`	`LÖFfel->`	`=`
		`THE:RE`	`is`	`the`	`spoon-`

(18) 07-BR-peg (17 months)
005		LEa	wo	ist	der	TISCH.
		LEa	where	is	the	TABle.
`006`	`L`	`((points to table))`

007	M	und	wo	ist	die	KLAMmer?
		and	where	is	the	PIN?
`008`	`L`	`((points to other part of the table))`
`009`		`<<nodding> WUW;>`
010	M	die	!WÄ!scheklammer;
		the	!CLOTHES!pin;
011		zeig	mir	mal	die WÄscheklammer.
		show	me	the	clothesPIN.
`012`	`L`	`((points to pin))`
`013`	`M`	`<<creaky> AH::> die wäscheklammer`
		`the clothespin is`
		`ist am TISCH-`
		`on the TAble-`

(19) 01-BR-stirring (16 months)
`001`	`O:`	`((turns page))`
002		\|((points to spoon))\|
		\|mh::; \|

	9–14 months	15–17 months	18–22 months	23–24 months
Adult	Initiates job by setting up a conditional relevance for locating a target. Devices: Where questions (prosodic emphasis on target); Taking over the task of locating (in place of child); Demonstrating the action by making their own perception observable; Manual guiding; Distinguishing between “meaningful” and “not meaningful” movements, formulating the child's action (temporally aligned)	Initiates job/responds to child's initiations. Devices: Where questions in the context of three-part sequences (→ contextualizes activity as instruction); Other-initiating self-correction
Child	Responds by coordinating visual attention	Responds to/Initiates conditional relevance. Devices: Pointing; Pointing + emphatic DA/THERE

(19) 01-BR-spoon (17 months)
`001`	`O`	`((turns page))`
002		\|((points to mug in the book))\|
		\|pf::\|
`003`	`M`	°`h:::;`
004		was ist DAS? =
		what is that
`005`		`[=ne TASse] mit EInem?`
		`a mug with a`
`006`	`O`	`[((points to mug))]`
007		\|ÖFfel;\|
		oon
		\|((circular movement))\|

`008`	`M`	`LÖFfel;`
		`spoon`
`009`		`[geNAU::;]`
		`exactly`
`010`	`O`	`[((repeats circling movement))]`

(20) 07-BR-mug, hearts (22 months)
`011`	`M`	`was ham wa DA?`
		`what do we have THERE?`
`012`	`L`	`gε:`
`013`	`M`	`SAG ma,`
		`SAY,`
`014`		`=was IS das?`
		`what IS that?`
`015`	`L`	`TASse.`
		`MUG.`
`016`	`M`	`ne TASse-`
		`a MUG-`
`017`		`=und WAS is obendrauf?`
		`and WHAT is on top of it?`
`018`	`L`	`\|LÖFfel.`	`\|`
		`SPOON.`
		`\|((points to picture))`	`\|`
`020`	`M`	`\|und was is das hier AUF der TASse?`	`\|`
		`and what is that here ON the mug?`
		`\|((taps on picture))`	`\|`
`021`	`L`	`pεtseɐ;`
`022`	`M`	↑`HERzen;`
		`HEARTS;`
`023`	`M`	`[der LÖFfel is auf der HERZtasse.`	`]`
		`the SPOON is on the HEART mug.`
`024`	`L`	`[\|((turns page))\|`	`]`
		`\|ja\|`
		`yes.`

(21) 07-BR-dino (24 months)
`001`	`L`	`\|((points to picture))\|`
		`\|DInoSAUrier;\|`
		`DInoSAUR;`
`002`	`M`	`ja::,`
		`yes::,`
`003`		`und was ist oben AUF dem`
		`and what is there ON top`
		`dinoSAUrier?`
		`of the dinosaur?`
`004`	`L`	`LÖwe;`
		`LION;`

	9–14 months	15–17 months	18–22 months	23–24 months
Adult	As long as where questions are asked, Jobs 3 and 4 merge. For the devices, see Table 3	Initiates job by setting up a conditional relevance for labeling familiar objects. Device: What questions	Initiates job by setting up a series of conditional relevancies for labeling. Devices: Reestablishing conditional relevance or initiating self-corrections; Reformulating the child's utterance; Asking series of questions
Child	Fulfills conditional relevance. Device: Pointing	Fulfills conditional relevance. Devices: Acting gestures; Handling gestures; Pointing + verbal label	Fulfills conditional relevance and initiates job. Devices: For responding: pointing + verbal label; verbal label. For initiating: what question
