Skip to main content
Top
Gepubliceerd in: Psychological Research 1/2010

01-01-2010 | Original Article

Perceptual scaling of voice identity: common dimensions for different vowels and speakers

Auteurs: Oliver Baumann, Pascal Belin

Gepubliceerd in: Psychological Research | Uitgave 1/2010

Log in om toegang te krijgen
share
DELEN

Deel dit onderdeel of sectie (kopieer de link)

  • Optie A:
    Klik op de rechtermuisknop op de link en selecteer de optie “linkadres kopiëren”
  • Optie B:
    Deel de link per e-mail

Abstract

The aims of our study were: (1) to determine if the acoustical parameters used by normal subjects to discriminate between different speakers vary when comparisons are made between pairs of two of the same or different vowels, and if they are different for male and female voices; (2) to ask whether individual voices can reasonably be represented as points in a low-dimensional perceptual space such that similarly sounding voices are located close to one another. Subjects were presented with pairs of voices from 16 male and 16 female speakers uttering the three French vowels “a”, “i” and “u” and asked to give speaker similarity judgments. Multidimensional analyses of the similarity matrices were performed separately for male and female voices and for three types of comparisons: same vowels, different vowels and overall average. The resulting dimensions were then interpreted a posteriori in terms of relevant acoustical measures. For both male and female voices, a two-dimensional perceptual space was found to be most appropriate, with axes largely corresponding to contributions of the larynx (pitch) and supra-laryngeal vocal tract (formants), mirroring the two largely independent components of source and filter in voice production. These perceptual spaces of male and female voices and their corresponding voice samples are available at: http://​vnl.​psy.​gla.​ac.​uk section Resources.
Literatuur
go back to reference Aronovitch, D. S. (1976). The voice of personality: Stereotyped judgments and their relation to voice quality and sex of speaker. The Journal of Social Psychology, 99, 207–220.PubMed Aronovitch, D. S. (1976). The voice of personality: Stereotyped judgments and their relation to voice quality and sex of speaker. The Journal of Social Psychology, 99, 207–220.PubMed
go back to reference Bachorowski, J. A., & Owren, M. J. (1999). Acoustic correlates of talker sex and individual talker identity are present in a short vowel segment produced in running speech. The Journal of the Acoustical Society of America, 106, 1054–1063.CrossRefPubMed Bachorowski, J. A., & Owren, M. J. (1999). Acoustic correlates of talker sex and individual talker identity are present in a short vowel segment produced in running speech. The Journal of the Acoustical Society of America, 106, 1054–1063.CrossRefPubMed
go back to reference Belin, P., Fecteau, S., & Bédard, C. (2004). Thinking the voice: neural correlates of voice perception. Trends in Cognitive Science, 8, 129–135.CrossRef Belin, P., Fecteau, S., & Bédard, C. (2004). Thinking the voice: neural correlates of voice perception. Trends in Cognitive Science, 8, 129–135.CrossRef
go back to reference Borg, I., & Staufenbiel, T. (1989). Theorien und Methoden der Skalierung. Bern: Huber. Borg, I., & Staufenbiel, T. (1989). Theorien und Methoden der Skalierung. Bern: Huber.
go back to reference Bricker, P. D., & Pruzansky, S. (1976). Speaker recognition. In N. J. Lass (Ed.), Contemporary issues in experimental phonetics (pp. 295–326). New York: Academic. Bricker, P. D., & Pruzansky, S. (1976). Speaker recognition. In N. J. Lass (Ed.), Contemporary issues in experimental phonetics (pp. 295–326). New York: Academic.
go back to reference Bruckert, L., Liénard, J. S., Lacroix, A., Kreutzer, M., & Leboucher, G. (2006). Women use voice parameters to assess men’s characteristics. In: Proceedings of the royal society. Biological sciences (Vol. 273, pp. 83–89). Bruckert, L., Liénard, J. S., Lacroix, A., Kreutzer, M., & Leboucher, G. (2006). Women use voice parameters to assess men’s characteristics. In: Proceedings of the royal society. Biological sciences (Vol. 273, pp. 83–89).
go back to reference Carroll, J. D., & Chang, J. (1970). An analysis of individual differences in multidimensional scaling via an N-way generalization of Eckart-Young decomposition. Psychometrica, 35, 283–319.CrossRef Carroll, J. D., & Chang, J. (1970). An analysis of individual differences in multidimensional scaling via an N-way generalization of Eckart-Young decomposition. Psychometrica, 35, 283–319.CrossRef
go back to reference Clarke, F. R., & Becker, R. W. (1969). Comparison of techniques for discriminating among talkers. Journal of Speech and Hearing Research, 12, 747–762.PubMed Clarke, F. R., & Becker, R. W. (1969). Comparison of techniques for discriminating among talkers. Journal of Speech and Hearing Research, 12, 747–762.PubMed
go back to reference Coleman, R. O. (1976). A comparison of the contributions of two voice quality characteristics to the perception of maleness and femaleness in the voice. Journal of Speech and Hearing Research, 19, 168–180.PubMed Coleman, R. O. (1976). A comparison of the contributions of two voice quality characteristics to the perception of maleness and femaleness in the voice. Journal of Speech and Hearing Research, 19, 168–180.PubMed
go back to reference Collins, S. A. (2000). Men’s voices and women’s choices. Animal Behaviour, 40, 773–780.CrossRef Collins, S. A. (2000). Men’s voices and women’s choices. Animal Behaviour, 40, 773–780.CrossRef
go back to reference Collins, S. A., & Missing, C. (2003). Vocal and visual attractiveness are related in women. Animal Behaviour, 65, 997–1004.CrossRef Collins, S. A., & Missing, C. (2003). Vocal and visual attractiveness are related in women. Animal Behaviour, 65, 997–1004.CrossRef
go back to reference Endres, W., Bambach, W., & Flösser, G. (1971). Voice spectrograms as a function of age, voice disguise, and voice imitation. The Journal of the Acoustical Society of America, 49, 1842–1848.CrossRefPubMed Endres, W., Bambach, W., & Flösser, G. (1971). Voice spectrograms as a function of age, voice disguise, and voice imitation. The Journal of the Acoustical Society of America, 49, 1842–1848.CrossRefPubMed
go back to reference Fant, G. (1960). Acoustic theory of speech production. The Hague: Mouton & Co. Fant, G. (1960). Acoustic theory of speech production. The Hague: Mouton & Co.
go back to reference Hanson, H. (1997). Glottal characteristics of female speakers: Acoustic correlates. The Journal of the Acoustical Society of America, 101, 466–481.CrossRefPubMed Hanson, H. (1997). Glottal characteristics of female speakers: Acoustic correlates. The Journal of the Acoustical Society of America, 101, 466–481.CrossRefPubMed
go back to reference Hecker, M. H. L. (1971). Speaker recognition: An interpretive survey of the literature. ASHA Monographs No. 16 Hecker, M. H. L. (1971). Speaker recognition: An interpretive survey of the literature. ASHA Monographs No. 16
go back to reference Holmgren, G. (1967). Physical and psychological correlates of speaker recognition. Journal of Speech and Hearing Reserch, 10, 57–66. Holmgren, G. (1967). Physical and psychological correlates of speaker recognition. Journal of Speech and Hearing Reserch, 10, 57–66.
go back to reference Horii, Y. (1980). Vocal shimmer in sustained phonation. Journal of Speech and Hearing Research, 23, 202–209.PubMed Horii, Y. (1980). Vocal shimmer in sustained phonation. Journal of Speech and Hearing Research, 23, 202–209.PubMed
go back to reference Kreiman, J., Gerratt, B. R., Precoda, K., & Berke, G. S. (1992). Individual differences in voice quality perception. Journal of Speech and Hearing Research, 35, 512–520.PubMed Kreiman, J., Gerratt, B. R., Precoda, K., & Berke, G. S. (1992). Individual differences in voice quality perception. Journal of Speech and Hearing Research, 35, 512–520.PubMed
go back to reference Matsumoto, H., Hiki, S., Sone, T., & Nimura, T. (1973). Multidimensional representation of personal quality of vowels and its acoustical correlates. IEEE Transactions on Audio and Electroacoustics, 21, 428–436.CrossRef Matsumoto, H., Hiki, S., Sone, T., & Nimura, T. (1973). Multidimensional representation of personal quality of vowels and its acoustical correlates. IEEE Transactions on Audio and Electroacoustics, 21, 428–436.CrossRef
go back to reference Moore, B. C. J. (2003). An introduction to the psychology of hearing. Amsterdam: Academic Press. Moore, B. C. J. (2003). An introduction to the psychology of hearing. Amsterdam: Academic Press.
go back to reference Murry, T., & Singh, S. (1980). Multidimensional analysis of male and female voices. The Journal of the Acoustical Society of America, 68, 1294–1300.CrossRefPubMed Murry, T., & Singh, S. (1980). Multidimensional analysis of male and female voices. The Journal of the Acoustical Society of America, 68, 1294–1300.CrossRefPubMed
go back to reference Singer, H., & Sagayama, S. (1992). Pitch dependent phone modelling for HMM based speech recognition. Acoustics, Speech, and Signal Processing, 1, 273–276. Singer, H., & Sagayama, S. (1992). Pitch dependent phone modelling for HMM based speech recognition. Acoustics, Speech, and Signal Processing, 1, 273–276.
go back to reference Singh, S., & Murry, T. (1978). Multidimensional classification of normal voice qualities. The Journal of the Acoustical Society of America, 64, 81–87.CrossRefPubMed Singh, S., & Murry, T. (1978). Multidimensional classification of normal voice qualities. The Journal of the Acoustical Society of America, 64, 81–87.CrossRefPubMed
go back to reference Tabachnick, B. G., & Fidell, L. S. (1996). Using multivariate statistics. New York: HarperCollins. Tabachnick, B. G., & Fidell, L. S. (1996). Using multivariate statistics. New York: HarperCollins.
go back to reference van Dommelen, W. A. (1990). Acoustic parameters in human speaker recognition. Language and Speech, 33, 259–272.PubMed van Dommelen, W. A. (1990). Acoustic parameters in human speaker recognition. Language and Speech, 33, 259–272.PubMed
go back to reference Voiers, W. D. (1964). Perceptual bases of speaker identity. The Journal of the Acoustical Society of America, 36, 1065–1073.CrossRef Voiers, W. D. (1964). Perceptual bases of speaker identity. The Journal of the Acoustical Society of America, 36, 1065–1073.CrossRef
go back to reference Walden, B. E., Montgomery, A. A., Gibeily, G. T., Prosek, R. A., & Schwartz, D. M. (1978). Correlates of psychological dimensions in talker similarity. Journal of Speech and Hearing Research, 21, 265–275.PubMed Walden, B. E., Montgomery, A. A., Gibeily, G. T., Prosek, R. A., & Schwartz, D. M. (1978). Correlates of psychological dimensions in talker similarity. Journal of Speech and Hearing Research, 21, 265–275.PubMed
go back to reference Yumoto, E., Sasaki, Y., & Okamura, H. (1984). Harmonics-to-noise ratio and psychophysical measurement of the degree of hoarseness. Journal of Speech and Hearing Research, 27, 2–6.PubMed Yumoto, E., Sasaki, Y., & Okamura, H. (1984). Harmonics-to-noise ratio and psychophysical measurement of the degree of hoarseness. Journal of Speech and Hearing Research, 27, 2–6.PubMed
Metagegevens
Titel
Perceptual scaling of voice identity: common dimensions for different vowels and speakers
Auteurs
Oliver Baumann
Pascal Belin
Publicatiedatum
01-01-2010
Uitgeverij
Springer-Verlag
Gepubliceerd in
Psychological Research / Uitgave 1/2010
Print ISSN: 0340-0727
Elektronisch ISSN: 1430-2772
DOI
https://doi.org/10.1007/s00426-008-0185-z

Andere artikelen Uitgave 1/2010

Psychological Research 1/2010 Naar de uitgave