In a Nervous Voice: Acoustic Analysis and Perception of Anxiety in Social Phobics’ Speech

  • Original paper
Journal of Nonverbal Behavior

Abstract

This study investigated the effects of anxiety on nonverbal aspects of speech using data collected in the framework of a large study of social phobia treatment. The speech of social phobics (N = 71) was recorded during an anxiogenic public speaking task both before and after treatment. The speech samples were analyzed with respect to various acoustic parameters related to pitch, loudness, voice quality, and temporal aspects of speech. The samples were further content-masked by low-pass filtering (which obscures the linguistic content of the speech but preserves nonverbal affective cues) and subjected to listening tests. Results showed that a decrease in experienced state anxiety after treatment was accompanied by corresponding decreases in (a) several acoustic parameters (i.e., mean and maximum voice pitch, high-frequency components in the energy spectrum, and proportion of silent pauses), and (b) listeners’ perceived level of nervousness. Both speakers’ self-ratings of state anxiety and listeners’ ratings of perceived nervousness were further correlated with similar acoustic parameters. The results complement earlier studies on vocal affect expression which have been conducted on posed, rather than authentic, emotional speech.
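
The content-masking step mentioned above can be illustrated with a minimal sketch of a low-pass filter. The abstract does not state the cutoff frequency or the filter design used in the study, so the 400 Hz cutoff, the Hamming-windowed sinc design, and every function name below are assumptions chosen only for illustration.

```python
import math


def lowpass_fir(cutoff_hz, sample_rate_hz, num_taps=101):
    """Design a Hamming-windowed sinc low-pass filter (illustrative design)."""
    if num_taps % 2 == 0:
        raise ValueError("num_taps must be odd so the filter is symmetric")
    fc = cutoff_hz / sample_rate_hz  # normalized cutoff, cycles per sample
    mid = (num_taps - 1) // 2
    taps = []
    for n in range(num_taps):
        k = n - mid
        # Ideal low-pass impulse response (sinc), with the k == 0 limit handled.
        ideal = 2 * fc if k == 0 else math.sin(2 * math.pi * fc * k) / (math.pi * k)
        # Hamming window to reduce ripple from truncating the sinc.
        window = 0.54 - 0.46 * math.cos(2 * math.pi * n / (num_taps - 1))
        taps.append(ideal * window)
    total = sum(taps)
    return [t / total for t in taps]  # normalize to unity gain at 0 Hz


def apply_fir(signal, taps):
    """Convolve signal with taps; output has the same length, edges zero-padded."""
    mid = (len(taps) - 1) // 2
    out = []
    for i in range(len(signal)):
        acc = 0.0
        for j, t in enumerate(taps):
            idx = i + mid - j
            if 0 <= idx < len(signal):
                acc += t * signal[idx]
        out.append(acc)
    return out


def rms(samples):
    """Root-mean-square level of a list of samples."""
    return math.sqrt(sum(v * v for v in samples) / len(samples))


def tone(freq_hz, sample_rate_hz, num_samples):
    """Pure sine tone, used here as a stand-in for recorded speech."""
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate_hz)
            for i in range(num_samples)]


if __name__ == "__main__":
    rate = 8000.0                            # hypothetical sample rate
    taps = lowpass_fir(400.0, rate)          # hypothetical 400 Hz cutoff
    pitch_like = tone(150.0, rate, 1000)     # roughly in the voice-pitch range
    detail_like = tone(2000.0, rate, 1000)   # region carrying consonant detail
    print("150 Hz level after filtering:", rms(apply_fir(pitch_like, taps)[200:800]))
    print("2000 Hz level after filtering:", rms(apply_fir(detail_like, taps)[200:800]))
```

In this sketch the low-frequency tone passes nearly unchanged while the high-frequency tone is strongly attenuated, which is the general mechanism by which low-pass filtering keeps pitch and rhythm cues but removes the detail needed to understand words.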

Acknowledgments

This research was supported by the Swedish Research Council and the Ryoichi Sasakawa Young Leaders Fellowship Fund (SYLFF) through grants to the first author.

Author information

Corresponding author

Correspondence to Petri Laukka.

About this article

Cite this article

Laukka, P., Linnman, C., Åhs, F. et al. In a Nervous Voice: Acoustic Analysis and Perception of Anxiety in Social Phobics’ Speech. J Nonverbal Behav 32, 195–214 (2008). https://doi.org/10.1007/s10919-008-0055-9

  • DOI: https://doi.org/10.1007/s10919-008-0055-9
