Published in: Psychological Research 5/2017

31-08-2016 | Original Article

Familiar units prevail over statistical cues in word segmentation

Authors: Bénédicte Poulin-Charronnat, Pierre Perruchet, Barbara Tillmann, Ronald Peereman

Abstract

In language acquisition research, the prevailing position is that listeners exploit statistical cues, in particular transitional probabilities between syllables, to discover words of a language. However, other cues are also involved in word discovery. Assessing the weight learners give to these different cues leads to a better understanding of the processes underlying speech segmentation. The present study evaluated whether adult learners preferentially used known units or statistical cues for segmenting continuous speech. Before the exposure phase, participants were familiarized with part-words of a three-word artificial language. This design allowed the dissociation of the influence of statistical cues and familiar units, with statistical cues favoring word segmentation and familiar units favoring (nonoptimal) part-word segmentation. In Experiment 1, performance in a two-alternative forced choice (2AFC) task between words and part-words revealed part-word segmentation (even though part-words were less cohesive in terms of transitional probabilities and less frequent than words). By contrast, an unfamiliarized group exhibited word segmentation, as usually observed in standard conditions. Experiment 2 used a syllable-detection task to remove the likely contamination of performance by memory and strategy effects in the 2AFC task. Overall, the results suggest that familiar units overrode statistical cues, ultimately questioning the need for computation mechanisms of transitional probabilities (TPs) in natural language speech segmentation.
Footnotes
1
Although decreasing the number of words to learn might seem to make learning easier (think, for instance, of a list of to-be-memorized items, or of the smaller number of to-be-learned words used for infants than for adults in the work of Saffran et al., 1996a), decreasing the number of words composing the artificial language in fact reduces the difference between word-internal and word-external TPs. With three words, the TPs within words were 1 and the TPs between words were .5 (compared with 1 and .33, respectively, for a four-word language), and the difference in frequency of occurrence between words and part-words also decreases.
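
The TP values quoted in this footnote can be checked with a small simulation. The following is only a minimal sketch, not the authors' stimulus-generation procedure: the syllable labels A to L are hypothetical placeholders, the stream length and seed are arbitrary, and the rule that a word never immediately follows itself is an assumption inferred from the between-word TP of .5 reported for the three-word language.

```python
import random
from collections import Counter


def make_stream(words, n_tokens, seed=0):
    """Concatenate randomly chosen words, never repeating a word immediately.

    The no-immediate-repetition rule is an assumption (see lead-in).
    """
    rng = random.Random(seed)
    stream, previous = [], None
    for _ in range(n_tokens):
        word = rng.choice([w for w in words if w != previous])
        stream.extend(word)
        previous = word
    return stream


def transitional_probabilities(stream):
    """TP(x -> y) = count(x followed by y) / count(x followed by anything)."""
    pair_counts = Counter(zip(stream, stream[1:]))
    first_counts = Counter(stream[:-1])
    return {(x, y): c / first_counts[x] for (x, y), c in pair_counts.items()}


# Placeholder syllables; each tuple is one trisyllabic word.
three_words = [("A", "B", "C"), ("D", "E", "F"), ("G", "H", "I")]
four_words = three_words + [("J", "K", "L")]

tps3 = transitional_probabilities(make_stream(three_words, 30000))
tps4 = transitional_probabilities(make_stream(four_words, 30000))

# Within-word transition (A -> B) versus between-word transition (C -> D).
print("3 words:", tps3[("A", "B")], round(tps3[("C", "D")], 2))
print("4 words:", tps4[("A", "B")], round(tps4[("C", "D")], 2))
```

Under these assumptions the within-word TP is exactly 1 in both languages, while the between-word TP comes out close to .5 with three words and close to .33 with four, so the contrast between word-internal and word-external TPs is smaller in the three-word language.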
 
2
For completeness, we fitted a linear mixed model (LMM) to the data, with participants and items as random effects and Group as a fixed effect. The LMM showed a significant effect of Group, F(2, 474) = 40.21, p < .001, and a significant difference between the unfamiliarized group and both the part-word familiarized group, F(1, 313) = 44.71, p < .001, and the nonword-familiarized group, F(1, 313) = 77.01, p < .001, whereas the difference between the part-word familiarized and nonword-familiarized groups failed to reach the conventional significance threshold, F(1, 322) = 2.80, p = .095. These results thus lead to the same conclusions as the ANOVA.
 
3
Taking the last two syllables of a word and the first syllable of another word would be another possible segmentation, also requiring six part-words. However, in that case, the TP of the last syllable of the part-words would be .50 and not 1.00 (as for the words). For the sake of equality, only the part-words composed of the last syllable of a word and the first two syllables of another word were used.
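
The two ways of building part-words described in this footnote can be enumerated as follows. This is an illustrative sketch only: the syllables are hypothetical placeholders, the function names are invented for the example, and the between-word TP is taken to be 1/(n - 1), which assumes that a word never immediately follows itself in the stream.

```python
from itertools import permutations

# Placeholder syllables; each tuple is one trisyllabic word.
words = [("A", "B", "C"), ("D", "E", "F"), ("G", "H", "I")]


def part_words_1_2(words):
    """Last syllable of one word + first two syllables of a different word."""
    return [(w1[2], w2[0], w2[1]) for w1, w2 in permutations(words, 2)]


def part_words_2_1(words):
    """Last two syllables of one word + first syllable of a different word."""
    return [(w1[1], w1[2], w2[0]) for w1, w2 in permutations(words, 2)]


def unit_tps(unit, words):
    """TPs of the two transitions inside a three-syllable unit.

    A within-word transition has TP 1; a transition across a word boundary
    has TP 1 / (n - 1), assuming no immediate word repetition.
    """
    within = {(w[i], w[i + 1]) for w in words for i in range(len(w) - 1)}
    boundary_tp = 1 / (len(words) - 1)
    return [1.0 if pair in within else boundary_tp
            for pair in zip(unit, unit[1:])]


print(len(part_words_1_2(words)), len(part_words_2_1(words)))
print(unit_tps(("A", "B", "C"), words))  # a word
print(unit_tps(("C", "D", "E"), words))  # last syllable + first two syllables
print(unit_tps(("B", "C", "D"), words))  # last two syllables + first syllable
```

Both constructions yield six part-words for a three-word language. Only the first keeps a TP of 1 on the transition into the final syllable, as in the words, which is the property the footnote gives for preferring it.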
 
4
To address the possibility that the preselected range may have been too narrow, the analyses were run again with a [−300, 1000] ms range, ensuring very broad coverage. This new range was in fact the largest possible one, because a still larger range would have generated overlaps between the response windows surrounding two successive target syllables. The resulting changes were minor. Means differed by only a few milliseconds, and the p values of the statistical tests reported in the main text differed only in the third or fourth decimal place, never affecting their interpretation in terms of significance. Unsurprisingly, the rates of false alarms and misses decreased, but remained substantial. The mean rate of false alarms was 5.34% for the part-word familiarized group and 5.30% for the nonword-familiarized group. The mean rates of misses were 14.10% and 19.91%, respectively. These analyses suggest that the relatively high rates of false alarms and misses were due not to ill-fitted exclusion criteria but to genuine detection errors.
 
5
We additionally fitted an LMM with participants as a random effect and Group and Target as fixed effects. There was no effect of Group, F(1, 3723) = 0.295, p = 0.587, but a significant main effect of Target, F(1, 3723) = 59.37, p < 0.001, which was qualified by a significant Group × Target interaction, F(1, 3723) = 28.70, p < 0.001. Subsequent analyses took into account the division between familiar and unfamiliar part-words. For the part-word familiarized group, response times were faster for the last syllables of both the familiar part-words, F(1, 1461) = 69.84, p < 0.001, and the unfamiliar part-words, F(1, 1425) = 64.15, p < 0.001, than for the last syllables of the words. There was no significant difference between familiar and unfamiliar part-words, F(1, 998) = 0.02, p = 0.874. For the nonword-familiarized group, a significant difference was observed between familiar part-words and words, F(1, 1366) = 4.91, p = 0.027, whereas no significant difference was observed between unfamiliar part-words and words, F(1, 1348) = 2.29, p = 0.131, or between familiar and unfamiliar part-words, F(1, 902) = 0.40, p = 0.528. These results thus lead to conclusions very similar to those obtained with the ANOVA.
 
6
According to the UP interpretation, the mean response times for the last syllables of the part-words should not differ between the nonword-familiarized group (i.e., 2nd syllable of the segmented unit if this group segments the speech stream into words) and the part-word familiarized group (i.e., last syllable of the segmented unit if this group segments the speech stream into part-words). Contrary to this prediction, the mean RTs were numerically slower for the nonword-familiarized group than for the part-word familiarized group. However, this difference did not reach significance (p = 0.25).
 
References
Bertels, J., Franco, A., & Destrebecqz, A. (2012). How implicit is visual statistical learning? Journal of Experimental Psychology: Learning, Memory, and Cognition, 38(5), 1425–1431. doi:10.1037/a0027210.
Christiansen, M. H., Allen, J., & Seidenberg, M. S. (1998). Learning to segment speech using multiple cues: a connectionist model. Language and Cognitive Processes, 13, 221–268. doi:10.1080/016909698386528.
Cutler, A., & Norris, D. (1988). The role of strong syllables in segmentation for lexical access. Journal of Experimental Psychology: Human Perception and Performance, 14, 113–121. doi:10.1037/0096-1523.14.1.113.
Dahan, D., & Brent, M. R. (1999). On the discovery of novel wordlike units from utterances: an artificial-language study with implications for native-language acquisition. Journal of Experimental Psychology: General, 128, 165–185. doi:10.1037/0096-3445.128.2.165.
Dutoit, T., Pagel, N., Pierret, F., Bataille, O., & Van Der Vrecken, O. (1996). The MBROLA Project: towards a Set of High-Quality Speech Synthesizers Free of Use for Non-Commercial Purposes. Proc. ICSLP’96. Philadelphia, 3, 1393–1396.
Franco, A., Eberlen, J., Destrebecqz, A., Cleeremans, A., & Bertels, J. (2015a). Rapid serial auditory presentation: A new measure of statistical learning in speech segmentation. Experimental Psychology. doi:10.1027/1618-3169/a000295.
Franco, A., Gaillard, V., Cleeremans, A., & Destrebecqz, A. (2015b). Assessing segmentation processes by click detection: online measure of statistical learning, or simple interference? Behavior Research Methods. doi:10.3758/s13428-014-0548-x.
French, R. M., Addyman, C., & Mareschal, D. (2011). TRACX: a recognition-based connectionist framework for sequence segmentation and chunk extraction. Psychological Review, 118, 614–636. doi:10.1037/a0025255.
Gómez, R. (2007). Statistical learning in infant language development. In M. G. Gaskell (Ed.), The Oxford handbook of psycholinguistics (pp. 601–616). New York: Oxford University Press.
Hunt, R. H., & Aslin, R. N. (2001). Statistical learning in a serial reaction time task: access to separable statistical cues by individual learners. Journal of Experimental Psychology: General, 130, 658–680. doi:10.1037/0096-3445.130.4.658.
Johnson, E. K. (2012). Bootstrapping language: Are infant statisticians up to the job? In P. Rebuschat & J. N. Williams (Eds.), Statistical learning and language acquisition (pp. 55–89). Berlin: De Gruyter Mouton.
Perruchet, P., & Poulin-Charronnat, B. (2012). Beyond transitional probability computations: extracting word-like units when only statistical information is available. Journal of Memory and Language, 66, 807–818. doi:10.1016/j.jml.2012.02.010.
Perruchet, P., Tyler, M. D., Galland, N., & Peereman, R. (2004). Learning nonadjacent dependencies: no need for algebraic-like computations. Journal of Experimental Psychology: General, 133(4), 573–583.
Metadata
Title
Familiar units prevail over statistical cues in word segmentation
Authors
Bénédicte Poulin-Charronnat
Pierre Perruchet
Barbara Tillmann
Ronald Peereman
Publication date
31-08-2016
Publisher
Springer Berlin Heidelberg
Published in
Psychological Research / Issue 5/2017
Print ISSN: 0340-0727
Electronic ISSN: 1430-2772
DOI
https://doi.org/10.1007/s00426-016-0793-y