Modulation of the startle reflex by pleasant and unpleasant music

https://doi.org/10.1016/j.ijpsycho.2008.07.010Get rights and content

Abstract

The issue of emotional feelings to music is the object of a classic debate in music psychology. Emotivists argue that emotions are really felt in response to music, whereas cognitivists believe that music is only representative of emotions. Psychophysiological recordings of emotional feelings to music might help to resolve the debate, but past studies have failed to show clear and consistent differences between musical excerpts of different emotional valence. Here, we compared the effects of pleasant and unpleasant musical excerpts on the startle eye blink reflex and associated body markers (such as the corrugator and zygomatic activity, skin conductance level and heart rate). The startle eye blink amplitude was larger and its latency was shorter during unpleasant compared with pleasant music, suggesting that the defensive emotional system was indeed modulated by music. Corrugator activity was also enhanced during unpleasant music, whereas skin conductance level was higher for pleasant excerpts. The startle reflex was the response that contributed the most in distinguishing pleasant and unpleasant music. Taken together, these results provide strong evidence that emotions were felt in response to music, supporting the emotivist stance.

Introduction

The emotional power of music remains a mystery. Unlike most emotional inducers, music is not a sentient being nor does it seem to have any obvious adaptive value (Pinker, 1997). Yet, most people affirm that they feel strong emotions when they listen to music (Sloboda and O'Neill, 2001). This paradox led many music scholars to believe that music is only iconic or representative of emotion, a position coined as ‘cognitivist’ by Kivy (1990). Opponents to this view, known as ‘emotivists’, feel that the cognitivist position does not render justice to the direct and unmediated fashion in which emotions are experienced by listeners (Davies, 2001). Although the debate is at a theoretical level, its resolution has practical implications for interpreting music effects. Indeed, if music is only representative of emotion, its therapeutic value could be seriously questioned. Studies measuring physiological, endocrine and brain responses to music as indices of emotional reactivity have supported the emotivist view, but the nature of these emotional responses and their resemblance with emotions induced by other stimuli is unclear.

In order to show that people not only recognize but feel emotions in response to music, emotional reactions should be measured by techniques that are independent of voluntary subject control, such as psychophysiological measures. Following this line of research, Krumhansl (1997) compared the autonomic responses elicited by different musical emotions and found that sad, happy, and fearful music could be differentiated by their autonomic activation patterns: Sad music was most strongly associated with changes in heart rate, blood pressure, skin conductance and skin temperature, fearful music was mostly associated with changes in the rate and amplitude of blood flow, and happy music principally produced changes in respiratory activity and showed the highest skin conductance level (SCL). However, subsequent studies have failed to replicate many of these findings. Khalfa et al. (2002) found that skin conductance responses (SCR) were highest during the listening of fearful music, Baumgartner et al. (2006) observed increased SCL during sad and fearful music compared to happy music, and Nater et al. (2006) found higher SCL during the listening of unpleasant compared to pleasant music. Moreover, Nater et al. (2006) found higher heart rates during unpleasant compared to pleasant music, whereas Sammler et al. (2007), Witvliet and Vrana (2007), and Krumhansl (1997) found the opposite. Therefore, there are inconsistent findings of the intensity and direction of these autonomic responses between studies.

Such inconsistencies across psychophysiological emotion studies are relatively common (Cacioppo et al., 2000), and the outcomes may be related to some context-bound patterns of actions that allow the same emotion to be associated with a wide range of behavior and varying patterns of somatovisceral activation (Lang et al., 1990). However, it should be noted that some psychophysiological measures appear more reliable than others. For example, respiration rate appears to be consistently higher during happy and fearful music than during sad music (Baumgartner et al., 2006, Etzel et al., 2006, Krumhansl, 1997, Nyklicek et al., 1997), although this effect may reflect differences in arousal that differentiate happiness and fear from sadness, and not musical emotions per se (Nyklicek et al., 1997). Indeed, cognitive theories of emotion have criticised the use of autonomic measures as indexes of felt emotions due to the non-specific nature of arousal (Schacter and Singer, 1962). For example, high arousal characterizes both fear and happiness. Moreover, in music, arousal is known to be mainly driven by its tempo (Gomez and Danuser, 2007). The fact that respiration rate has been linked to tempo through what appears to be a general entrainment mechanism further contributes to discredit respiration rate as a clear index of musical emotions (Etzel et al., 2006). Although tempo is one of the main determinants of musical emotions, musical emotions depend on many other factors than simple tempo perception (Peretz et al., 1998). Thus, until the context-bound patterns of action that affect the autonomic responses to musical emotions understood and controlled, more specific measures of emotional reactions to music are needed to convince the sceptical cognitivist that music effectively induces emotions in the listener.

Neuroendocrine and hormonal responses constitute yet another type of involuntary response that can be linked to emotional feelings. Contrary to physiological responses, some hormones can be more readily associated with positive or negative emotion (Barak, 2006), such as cortisol with stress and negative emotions, or immunoglobin A (S-IgA) with relaxation and positive emotions (Watanuki and Kim, 2005). A few studies have found that listening to relaxing and pleasant music was associated with lower levels of cortisol (Khalfa et al., 2003, Miluk-Kolasa et al., 1994), lower plasmatic levels of β-endorphins (McKinney et al., 1997) and higher mu-opiate receptor expression (Stefano et al., 2004). However, those studies only compared music with a silent control condition. Therefore, the observed effect may be attributed to non-emotional aspects of the musical condition, such as distraction. Indeed, when two musical conditions are compared, no differences were found between music inducing positive or negative moods on levels of cortisol (Clark et al., 2001), nor between up- or down-lifting musical excerpts on levels of S-Iga, dopamine, norepinephrenine, epinephrine or number of lymphocites (Hirokawa and Ohira, 2003), suggesting that the differences previously observed were mainly related to non-specific aspects of the task. One exception is the study by Gerra et al. (1998), who observed higher levels of β-endorphins, adrenocorticotropic hormone (ACTH), cortisol, norepinephrine and growth hormone in youngsters listening to techno-music compared to classical music. However, these changes in neuroendocrine responses appeared to be mainly linked to the high arousal induced by the techno-music, combined with the novelty-seeking temperament of the participants. Neuroendocrine responses, although promising, appear to have the same limitations as autonomic responses.

Brain imaging techniques provide yet another way to measure emotional reactions objectively. Studies using such techniques have shown that pleasant emotional reactions to music activate regions previously known to be involved in approach-related behaviors, such as the prefrontal cortex (Blood and Zatorre, 2001, Blood et al., 1999, Koelsch et al., 2006, Menon and Levitin, 2005), periacqueductal gray matter (Blood and Zatorre, 2001), and the nucleus accumbens (Blood and Zatorre, 2001, Menon and Levitin, 2005). Negative emotions in contrast activate regions involved in withdrawal-related behavior, such as the parahippocampal gyrus (Blood et al., 1999) and amygdala (Koelsch et al., 2006). Although these observations are fairly consistent with activations observed with other emotional inducers, brain activations alone do not allow for the distinction between processes involved in emotional perception and emotional feeling. Physiological changes that affect the body and its responses are necessary to demonstrate the induction of emotional feelings.

Although these studies demonstrate that some emotions are felt in response to music, the results do not definitely refute the cognitivist viewpoint, as many psychophysiological responses are inconsistent, and the responses that appear to induce the most stable responses (e.g., respiration rate or hormonal responses) may be influenced by other confounding factors, such as arousal or distraction. Finally, brain imaging techniques cannot solely discriminate emotional feelings from other aspects of emotional processing.

In order to demonstrate the induction of emotional feelings, involuntary changes that affect the body and emotional processing have to be observed in response to musical excerpts conveying different emotions. In this context, the startle reflex is a good candidate measure, as it has been extensively and successfully used to probe emotional reactions. It is an automatic defensive reaction to surprising stimuli and can be measured by the magnitude of the eye blink triggered by a loud white noise. As a response of the defensive emotional system, it is frequently used to test the efficacy of anxiolytic drugs (Winslow et al., 2007) or to explore emotional reactivity in affective disorders (Grillon and Baas, 2003). In normal individuals, it is typically enhanced by negative emotions and diminished by positive ones, using pictures (Lang et al., 1998), films (Kaviani et al., 2004), or sounds (Bradley and Lang, 2000) to induce emotions. The present study applied an affective startle modulation paradigm to musical stimuli and compared the effects of pleasant and unpleasant musical excerpts on the acoustic startle blink reflex. If emotions are induced during music listening, then the startle reflex should be larger and of shorter latency during unpleasant music compared to pleasant music.

Moreover, in order to measure music effects on emotional reactions, heart rate and skin conductance responses were also obtained along with facial expressions by assessing electromyographic (EMG) activity of the zygomaticus major (smiling) and the corrugator supercilii (frowning). Previous studies have shown that the activity of these muscles discriminated well between pleasant and unpleasant emotions elicited by pictures (Lang et al., 1998). Thus, it was expected that zygomatic activity would be higher during pleasant music, and corrugator activity to be more noticeable during unpleasant music (Witvliet and Vrana, 2007).

Section snippets

Participants

Sixteen participants (9F, 7M), aged between 20 and 40 years (M = 25.1 ± 9.3 years) took part in this study. None were musicians, all reported fewer than five years of musical training, and none claimed any regular practice of a musical instrument.

Musical excerpts

The musical excerpts used in this study were adapted from a prior study on pain modulation (Roy et al., 2008). Three 100 s excerpts of pleasant music and three 100 s excerpts of unpleasant music were selected from a pool of 30 musical excerpts. Each of the

Self-reported emotions

The mean valence and arousal ratings were calculated for the pleasant and unpleasant excerpts. The t-tests performed on these average ratings confirmed that the intended emotions of the musical excerpts were well recognized. The pleasant and unpleasant excerpts differed significantly on the dimension of valence (with a mean rating of 8.49 and 1.91, respectively; t (15) = 13.04, p < 0.001). In contrast, pleasant and unpleasant musical excerpts did not differ on the dimension of arousal (with a

Induction of emotional feelings by music

The startle reflex was of higher amplitude and shorter latency during the listening of unpleasant in comparison with pleasant excerpts, suggesting that different emotional states were effectively induced by music. As the musical excerpts were manipulated to vary on the dimension of valence independently of arousal or loudness, the observed effects are likely to reflect the induction of positive and negative emotional states in response to music, thereby supporting the emotivist's stance in

Acknowledgements

The work was supported by a grant from the Natural Science and Engineering Research Council of Canada (NSERC) to Isabelle Peretz, and by a doctoral scholarship from the NSERC to Mathieu Roy. We thank Amee Baird for English editing, Francine Giroux for statistical advice and Pierre Rainville for his suggestions on physiological signal analysis.

References (47)

  • NaterU.M. et al.

    Sex differences in emotional and psychophysiological responses to musical stimuli

    Int. J. Psychophysiol.

    (2006)
  • PeretzI. et al.

    Music and emotion: perceptual determinants, immediacy, and isolation after brain damage

    Cognition

    (1998)
  • RoyM. et al.

    Emotional valence contributes to music-induced analgesia

    Pain

    (2008)
  • WinslowJ.T. et al.

    Modulation of fear-potentiated startle and vocalizations in juvenile rhesus monkeys by morphine, diazepam, and buspirone

    Biol. Psych.

    (2007)
  • BalabanM.T. et al.

    Off-line latency and amplitude scoring of the human reflex eye blink with Fortran IV

    Psychophysiology

    (1986)
  • BloodA.J. et al.

    Intensely pleasurable responses to music correlate with activity in brain regions implicated in reward and emotion

    Proc. Natl. Acad. Sci. U. S. A.

    (2001)
  • BloodA.J. et al.

    Emotional responses to pleasant and unpleasant music correlate with activity in paralimbic brain regions

    Nat. Neurosci.

    (1999)
  • BlumenthalT.D. et al.

    Committee report: guidelines for human startle eyeblink electromyographic studies

    Psychophysiology

    (2005)
  • BradleyM.M. et al.

    Affective reactions to acoustic stimuli

    Psychophysiology

    (2000)
  • CacioppoJ.T. et al.

    The psychophysiology of emotion

  • ComreyA.L. et al.

    A First Course in Factor Analysis

    (1992)
  • DaviesS.

    Philosophical perspectives on music's expressiveness

  • FowlesD.C. et al.

    Publication recommendations for electrodermal measurements

    Psychophysiology

    (1981)
  • Cited by (67)

    • Isn't There Room for Music in Chronic Pain Management?

      2022, Journal of Pain
      Citation Excerpt :

      Anxiety often precedes the pain onset, whereas depressive disorders usually follow the pain.22 Music-induced analgesia has been demonstrated in healthy participants exposed to experimental pain30,31,40,41 and in patients with acute19 and chronic pain.15,16 Yet, the mode of action of music on pain has remained elusive until recently.

    • “Stopping for knowledge”: The sense of beauty in the perception-action cycle

      2020, Neuroscience and Biobehavioral Reviews
      Citation Excerpt :

      Moreover, the existence of a negative correlation between motor responses to sounds and their pleasantness has been recently hypothesized in the auditory domain (Brattico et al., 2013). Coherently with this hypothesis, startle eye blink reactions registered with EmG were larger for unpleasant than for pleasant consonant intervals (Roy et al., 2009). More recently, our research group demonstrated the presence of a positive correlation between motor inhibition and pleasant sounds in a series of EEG experiments (Sarasso et al., 2019): more appreciated musical intervals induced slower response times in a detection task and the concomitant enhancement of the motor inhibition N2-P3 complex in a go-nogo task.

    • Rhythm and blues: Influence of CLOCK T3111C on peripheral electrophysiological indicators of negative affective processing

      2020, Physiology and Behavior
      Citation Excerpt :

      ASR is enhanced when additional unpleasant stimuli are present (fear potentiated startle; FPS) and reduced in pleasant conditions (pleasure attenuated startle; PAS; [46,41]). In human studies, ASR modulation is often induced with affective pictures [42] although various other stimuli have been used including videos [47], music [48] or odours [49,50]. The neural circuit underlying ASR comprises sensory receptors, the auditory nerve, the cochlear nucleus, the ventrolateral lemniscus, the nucleus reticularis pontis caudalis (PnC), and spinal motoneurons, while ASR modulation additionally involves the amygdala [46,41].

    View all citing articles on Scopus
    View full text