Anticipatory coarticulation facilitates word recognition in toddlers

doi:10.1016/j.cognition.2015.05.009

Cognition

Volume 142, September 2015, Pages 345-350

https://doi.org/10.1016/j.cognition.2015.05.009

Highlights

• We report a looking-while-listening eyetracking study with 18–24-month-olds.
• We manipulated the coarticulatory cues on the word “the”.
• Under facilitating coarticulation, the cues predicted the following noun.
• Looking patterns were compared for facilitating vs. neutral coarticulation.
• Toddlers looked to target sooner when “the” contained facilitating coarticulation.

Abstract

Children learn from their environments and their caregivers. To capitalize on learning opportunities, young children have to recognize familiar words efficiently by integrating contextual cues across word boundaries. Previous research has shown that adults can use phonetic cues from anticipatory coarticulation during word recognition. We asked whether 18–24-month-olds (n = 29) used coarticulatory cues on the word “the” when recognizing the following noun. We performed a looking-while-listening eyetracking experiment to examine word recognition in neutral vs. facilitating coarticulatory conditions. Participants looked to the target image significantly sooner when the determiner contained facilitating coarticulatory cues. These results provide the first evidence that novice word-learners can take advantage of anticipatory sub-phonemic cues during word recognition.

Introduction

To learn from their environment, young children must be able to process familiar words efficiently. Word recognition mediates toddlers’ ability to learn words from caregivers (Weisleder & Fernald, 2013), and efficiency of lexical processing during the first two years predicts vocabulary and working memory later in childhood (Marchman & Fernald, 2008). Grammatical, pragmatic and phonetic contextual cues can constrain word recognition by simplifying the search space, but many such cues to word identification are not word-internal. Therefore, integrating contextual cues across word boundaries is essential for efficient word recognition.

One of the best-established context-sensitive phenomena in phonetics is coarticulation: the overlap of articulatory gestures in neighboring sounds. Coarticulation influences the production of sound patterns both within and across word boundaries. Typical English examples include coronal place assimilation (e.g., saying “in case” with a velar nasal consonant) and fronting of /k/ in “keep” (cf. backing and lip-rounding on /k/ in “coop”). A coarticulated sound carries acoustic information about neighboring sounds, introducing redundant and locally coherent information into the speech signal. In this respect, coarticulation provides regularity or “lawful variability” that can support speech perception (Elman & McClelland, 1986).

Indeed, adult listeners access and exploit coarticulatory cues during speech perception and word recognition (Gow, 2002; Gow & McMurray, 2007). Adults are slower to recognize words when there is a mismatch between coarticulatory cues in a vowel and the following consonant (e.g., Dahan et al., 2001; McQueen et al., 1999; Tobin et al., 2010). Conversely, appropriate coarticulation can facilitate spoken word recognition (e.g., Mattys, White, & Melhorn, 2005). For example, adult English listeners are faster to recognize a noun when the preceding determiner “the” carries information about the onset of the noun (Salverda, Kleinschmidt, & Tanenhaus, 2014).

It is not known whether young children can take advantage of coarticulatory cues during word recognition. Toddlers encode subsegmental details in their lexical representations (Fisher, Church, & Chambers, 2004), so coarticulatory cues should be accessible to these listeners in principle. In addition, toddlers recognize spoken words incrementally, using acoustic cues as they become available as a word unfolds over the speech signal (e.g., Fernald et al., 2001; Swingley et al., 1999). Moreover, toddlers rely on contextual cues when recognizing words produced in fluent speech (Plunkett, 2006). These findings raise an important question: Can young listeners use coarticulatory cues to facilitate recognition of a following word?

This question is important given the longstanding debate concerning the nature of early phonological representations. One point of view holds that these representations are under-specified and that children differentiate between words using relatively holistic phonological representations (Charles-Luce & Luce, 1990; Charles-Luce & Luce, 1995; Jusczyk, 1993). Based on a corpus analysis, Charles-Luce and Luce argued that young children do not need the same phonological detail in their lexical representations as adults do because children’s phonological neighborhoods are much sparser. Researchers supporting this point of view have hypothesized that children’s phonological representations gradually become more detailed as vocabulary size increases (Edwards et al., 2004; Metsala, 1999; Werker & Curtin, 2005; Werker et al., 2002). An opposing point of view posits that children’s phonological representations are segmental from very early in development (Dollaghan, 1994; Magnuson et al., 2003). This view is supported by studies showing that infants are sensitive to one-feature mispronunciations of familiar words (e.g., Swingley & Aslin, 2000; Swingley & Aslin, 2002; White & Morgan, 2008; see also review in Mayor & Plunkett, 2014). If toddlers use anticipatory coarticulation for word recognition, this finding would provide additional support for the viewpoint that children’s phonological representations are well specified even when their vocabularies are relatively small.

In the present study, we investigated whether toddlers took advantage of sub-phonemic anticipatory coarticulatory cues between words. Specifically, we asked whether coarticulatory acoustic cues on the determiner “the” facilitate recognition of the following word. We used a looking-while-listening task (Fernald, Zangl, Portillo, & Marchman, 2008) to determine whether toddlers looked more quickly to a named image in facilitating vs. neutral coarticulatory contexts (manipulated within subjects). Crucially, all of the items were cross-spliced to ensure that the recordings were otherwise comparable. We hypothesized that if toddlers are sensitive to coarticulation, we should see earlier recognition of the target noun in facilitating contexts relative to neutral contexts.

Participants

Twenty-nine 18–24-month-olds (M = 20.8 months, range = 18.1–23.8 months, 13 male) participated in this study. An additional 11 toddlers were excluded from the analyses due to inattentiveness (10) or having more than 50% missing data during non-filler trials (1). Caregivers completed the short version of the Words and Sentences Form of the MacArthur–Bates Communicative Development Inventory (MBCDI; Fenson et al., 2007).
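The missing-data exclusion rule described above can be illustrated with a short sketch. This is not the authors' analysis code: the sample layout (participant ID, filler flag, gaze label with `None` for a missing sample), the function names, and the toy data are all assumptions made for illustration. Only the 50% threshold and the restriction to non-filler trials come from the text.

```python
from collections import defaultdict


def missing_proportion(samples):
    """Proportion of missing gaze samples per participant, non-filler trials only.

    `samples` is an iterable of (participant_id, is_filler, gaze) tuples.
    This layout is hypothetical; `gaze` is None when no look was recorded.
    """
    total = defaultdict(int)
    missing = defaultdict(int)
    for participant, is_filler, gaze in samples:
        if is_filler:
            continue  # filler trials do not count toward the criterion
        total[participant] += 1
        if gaze is None:
            missing[participant] += 1
    return {p: missing[p] / total[p] for p in total}


def excluded_participants(samples, threshold=0.5):
    """Participants whose missing-data proportion exceeds `threshold`."""
    proportions = missing_proportion(samples)
    return sorted(p for p, prop in proportions.items() if prop > threshold)


# Toy data, invented for illustration only.
samples = [
    ("p01", False, "target"), ("p01", False, None),
    ("p01", False, "distractor"), ("p01", True, None),
    ("p02", False, None), ("p02", False, None), ("p02", False, "target"),
]
print(excluded_participants(samples))
```

In this toy data, participant p02 is missing two of three non-filler samples and is flagged, while p01 is missing one of three and is retained. The comparison is strict (`>`), matching the wording "more than 50%".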

Materials and stimuli

We selected target words that are familiar to toddlers in this age group. For the facilitating

Results

Overall looking patterns are presented in Fig. 2. Accuracy hovers around chance performance over the course of the determiner and approximately 250 ms into the target word. Accuracy increases from 250 to 1000 ms, and after 1000 ms accuracy begins to plateau then decline. Time clearly predicts accuracy; the probability of looking to target increases as the word unfolds. Importantly, coarticulatory information also predicts accuracy because participants have a noticeable head-start on the

Discussion

The present study provides the first evidence that toddlers take advantage of coarticulatory cues across word boundaries when recognizing familiar words. Participants on average looked to a named image approximately 100 ms earlier when the determiner “the” contained coarticulatory cues about the onset of the following noun. These results indicate that novice word-learners can take advantage of anticipatory coarticulatory information across word boundaries to support recognition of familiar words.

Acknowledgements

This research was supported by NIDCD Grant R01 DC012513 to Susan Ellis Weismer, Jan Edwards, and Jenny R. Saffran, NICHD Grant R37-HD037466 to Jenny R. Saffran, a grant from the James S. McDonnell Foundation to Jenny R. Saffran, NIDCD Grant R01-02932 to Jan Edwards, Mary E. Beckman, and Benjamin Munson, NICHD Grant 2-T32-HD049899 to Maryellen MacDonald, and NICHD Grant P30-HD03352 to the Waisman Center. We thank Eileen Haebig, Franzo Law II, Erin Long, Courtney Venker, and Matt Winn for help

References (32)

D.J. Barr. Analyzing “visual world” eyetracking data using multilevel logistic regression. Journal of Memory and Language (2008)
P.W. Jusczyk. From general to language-specific capacities: The WRAPSA model of how speech perception develops. Journal of Phonetics (1993)
J. Mayor et al. Infant word recognition: Insights from TRACE simulations. Journal of Memory and Language (2014)
A.P. Salverda et al. Immediate effects of anticipatory coarticulation in spoken-word recognition. Journal of Memory and Language (2014)
D. Swingley et al. Spoken word recognition and lexical representation in very young children. Cognition (2000)
D. Swingley et al. Continuous processing in word recognition at 24 months. Cognition (1999)
K.S. White et al. Sub-segmental detail in early lexical representations. Journal of Memory and Language (2008)
Bates, D., Mächler, M., Bolker, B., & Walker, S. (2014). lme4: linear mixed-effects models using Eigen and S4. Retrieved...
J. Charles-Luce et al. Similarity neighbourhoods of words in young children’s lexicons. Journal of Child Language (1990)
J. Charles-Luce et al. An examination of similarity neighbourhoods in young children’s receptive vocabularies. Journal of Child Language (1995)
D. Dahan et al. Subcategorical mismatches and the time course of lexical access: Evidence for lexical competition. Language and Cognitive Processes (2001)
C.A. Dollaghan. Children’s phonological neighbourhoods: Half empty or half full? Journal of Child Language (1994)
J. Edwards et al. The interaction between vocabulary size and phonotactic probability effects on children’s production accuracy and fluency in nonword repetition. Journal of Speech, Language, and Hearing Research (2004)
J.L. Elman et al. Exploiting lawful variability in the speech wave
L. Fenson et al. MacArthur-Bates communicative development inventories: User’s guide and technical manual (2007)
A. Fernald et al. When half a word is enough: infants can recognize spoken words using partial phonetic information. Child Development (2001)
