Introduction

Emotions play an essential role in our life. The processing of emotional stimuli is vital for survival because they provide crucial social and environmental information/cues. When many stimuli in the environment compete for processing resources, emotional stimuli receive prioritization over neutral stimuli and therefore affect cognitive processes (Kalanthroff et al., 2013). Maladaptive processing of emotional information can lead to various psychopathological conditions (e.g., depression, anxiety, post-traumatic stress disorder). Therefore, understanding the nature of the interaction between cognition processes and emotions is essential (Cisler et al., 2010; Gupta, 2007; Gupta & Kar, 2008, 2012; Sippel & Marshall, 2013).

Attention and emotion (positive and negative)

Ample research indicates that both “emotional states” and “emotional stimuli” modulate attention processes (see Gupta, 2019; Pessoa, 2009; Vuilleumier, 2005, for a review). For example, positive emotional states (e.g., joy, contempt) and positive stimuli (e.g., happy faces, reward) broaden and distribute the scope of attention. In contrast, negative states (e.g., depression, anxiety) and negative stimuli (e.g., sad and angry faces) narrow or focus the scope of attention (Fredrickson, 2004; Gupta, 2019; Gupta & Srinivasan, 2015; Gupta et al., 2016; Srinivasan & Gupta, 2010, 2011). These studies indicate that positive and negative emotions interact with attention differently.

Emotional stimuli enhance (when used as a target) and interfere (when used as a distractor) with various cognitive processes. For example, emotional stimuli, especially negative emotional stimuli, capture attention compared to positive and neutral stimuli (Eastwood, Smilek, & Merikle, 2001; Frischen, Eastwood, & Smilek, 2008). Shallower slopes of visual search functions reflect higher guidance on the focus of attention, which indicates that angry faces guide focal attention more effectively than happy faces (Fox et al., 2000). However, the preferential detection of happy faces was observed over angry faces in some studies (Becker et al., 2011; Savage et al., 2016). Moreover, when less attentional resources are available, happy faces capture attention compared to angry faces because fewer attention resources are required to process it (Gupta, 2019, for a review; Gupta et al., 2016; Gupta & Srinivasan, 2015; Gupta, 2016; Gupta et al., 2019; Gupta & Raymond, 2012; Leppänen & Hietanen, 2004; Mack & Rock 1998; Pandey & Gupta, 2019).

In line with this view, Gupta et al. (2016) conducted a series of experiments where positive (happy faces, erotic images, and neutral faces associated with the gain outcome) and negative (angry faces, mutilated bodies, and neutral faces associated with loss outcome) images were used as distractors in a primary letter-search task (low and high load). Participants were required to ignore distractor images and detect a target letter (X or N) among five circular letters (all Os: low-load condition) or five angular letters (Z, W, H, K, and M: high-load condition). The authors found that both positive and negative distractors captured the attention and interfered with the letter-search performance in the low-load task. However, when attention was constrained (high-load condition), only positive (not negative) distractors interfered with the primary letter search task. Consistent with this finding, the higher recognition accuracy of happy faces (relative to sad faces) under high-load conditions in an inattentional blindness task was reported (Gupta & Srinivasan 2015; Mack & Rock, 1998).

The prioritization of positive stimuli over negative stimuli was also found in the temporal selection paradigm. For example, using rapid serial visual presentation (RSVP) task, Most et al. (2007) presented a series of neutral images where one image (positive or negative) was used as a distractor that was followed by a neutral target image. Only the target image was rotated 90° to the left or right. Participants were required to detect the orientation of the target image while ignoring the distractor. They found that positive distractors involuntarily oriented attention and interfered with identifying the target more than negative distractors did.

Response inhibition and emotion (positive and negative)

It is essential to ignore/inhibit irrelevant emotional information to perform well on a primary task (Kalanthroff et al., 2013; Verbruggen et al., 2004). Response inhibition has been defined as “the suppression of inappropriate actions in a given context, and that interfere with goal-driven behavior” (Mostofsky & Simmonds, 2008, p. 751). Attention plays an essential role in inhibiting irrelevant information (Lavie, 2005, for a review). Both “emotion” (Kalanthroff et al., 2013; Pessoa et al., 2012; Verbruggen & de Houwer, 2007) and “attention” (Scalzo et al., 2016; Tian et al., 2014) modulate response inhibition processes. Given a link between attention, emotion, and response inhibition, the present study examines the role of irrelevant positive and negative emotions in response inhibition/inhibitory control.

A handful of studies have explored the link between irrelevant emotional information and response inhibition using a stop-signal paradigm, but the results are inconsistent and inconclusive (Kalanthroff et al., 2013; Pawliczek et al., 2013; Pessoa et al., 2012; Sagaspe et al., 2011; Verbruggen & de Houwer, 2007). For example, using a modified version of the stop-signal task, Kalanthroff et al. (2013) presented negative or neutral images as a prime that was followed by a visual go-signal (@ or #) to which participants were required to press the corresponding keys. Occasionally, an auditory stop-signal was presented on which participants were required to withhold their motor response. Results indicated that the presentation of negative primes (relative to neutral) increased both response and stopping latencies. In another study, faces with irrelevant fearful and neutral expressions were used as go-signals, and participants were required to detect the gender (male vs. female) (Sagaspe et al., 2011). The authors found that irrelevant, fearful information prolonged response latencies but not stopping latencies. Fearful information also did not modulate the neural circuits involved in response inhibition processes.

Notably, in most of the studies, only negative emotion was incorporated and was compared against neutral stimuli. Negative and neutral stimuli not only differ on valence but also on the arousal level. For example, negative emotional stimuli are more arousing compared to neutral stimuli (Lohani et al., 2013). Therefore, prolonged stopping latencies could be due to the arousal difference between negative and neutral information (see Pessoa et al., 2012, Experiment 1 for similar effect and interpretation).

The role of positive emotion in response inhibition is unclear. To get a complete picture of emotion-based modulation in response inhibition, it is crucial to compare positive and negative emotions directly (Gupta, 2019; Pandey & Gupta, 2019) while controlling their arousal difference. There is only one study where authors have manipulated the valence (positive vs. negative) and the arousal value (high vs. low) of the stimuli orthogonally in the stop-signal task. They found that irrelevant, high-arousing positive and negative emotional stimuli (relative to low arousing positive and negative stimuli) impaired the stopping latencies. In this study, irrelevant emotional stimuli were not part of the stop-signals; instead, they were presented as primes before the go and stop-trials. Also, in this study, IAPS pictures were used as stop-signals, whereas in previous (Pessoa et al., 2012) and the current study, faces were used as stop-signals. Notably, IAPS pictures are quite complex. Also, low-level visual features varied a lot between positive and negative pictures (e.g., negative images such as mutilated bodies are dominated by red color, whereas the skin color dominates positive images such as erotic pictures). Therefore, processing of these complex images could have taken more resources, thereby interfering with the response inhibition in Verbruggen and de Houwer’s (2007) study.

There is another study where both positive and negative stimuli were used as stop-signals (Pessoa et al., 2012, Experiment 1). However, the arousal level of happy, fearful, and neutral faces was not measured and controlled. They used low-threat stimuli (happy or fearful faces in Experiment 1) and high-threat stimuli (digits paired with shock in Experiment 2) as stop-signals. They found that a low-threat stop-signal improved response inhibition, whereas a high-threat stop-signal impaired response inhibition. Authors have speculated that both positive and fearful faces are low in arousal level, and therefore, generated enhanced sensory representations of the stop-signal in the visual cortex (Pessoa et al., 2002; Vuilleumier, 2005), which in turn facilitated response inhibition in Experiment 1. In contrast, stop-signals impaired response inhibition in Experiment 2. Authors argued that in Experiment 2, negative stop-signals (words associated with shock) were threatening and high arousing that might have consumed most of the central processing resources required for inhibition (as suggested by the “dual competition framework”; Pessoa, 2009). Notably, in Experiment 2, stop-signals with positive emotion were not included in their study. Therefore, it is unclear whether stop-signals with a positive emotion would modulate the response inhibition. Moreover, without measuring and controlling the arousal level of stop-signals, Pessoa et al. (2012) concluded that the arousal but not valence of stop-signals modulates response inhibition. Therefore, to what extent is this conclusion justifiable?

Notably, Pessoa et al. (2012) compared happy faces with fearful faces and did not find the difference. This could be attributed to the motivational tendencies (approach/avoidance) modulated by these faces as the perception of both faces activates approach-related behavior (Hammer & Marsh, 2015; Fusar-Poli et al., 2009) and a similar neural response (Fischer-Shofty et al., 2010; Marsh et al., 2010). In addition, these faces do not signal danger and can easily be approached (Nikitin & Freund, 2019; Grose-Fifer et al., 2013). Neuroimaging studies further strengthen this claim. For example, the administration of oxytocin in healthy adults increases approach-related behavior sensitivity to both fearful and happy facial expressions (Fischer-Shofty et al., 2010; Marsh et al., 2010). A quantitative voxel-based meta-analysis of the brain has shown that happy and fearful faces specifically activated the amygdala brain region. Therefore, to what extent is the comparison between happy faces and fearful faces meaningful?

Given these similar motivational and neural responses evoked by happy and fearful faces, we argue that it is better to compare happy and angry faces against each other in the stop-signal paradigm because they each produce a specific and different motivational and neural response. For example, the perception of happy and angry faces strongly activates approach and avoidance related behaviors, respectively (Marsh et al., 2005; Nikitin & Freund, 2019). Neuroimaging studies further strengthen this claim. The amygdala was activated in response to happy faces, whereas the insula was activated in response to angry faces (Fusar-Poli et al., 2009). We also discussed above that angry and happy faces interact with attention differently (see Gupta, 2019, for a review; Gupta et al., 2016; Mack & Rock 1998; Srinivasan & Gupta, 2011; Gupta & Srinivasan, 2015; Gupta & Deák, 2015).

There is only one study where angry emotional faces were compared against neutral faces in the stop-signal paradigm (Pawliczek et al., 2013). The authors found that angry faces facilitated response inhibition compared to neutral faces. Similar to previous studies, this study also did not control the arousal level of emotional faces; the arousal level of angry faces could be more in comparison to neutral faces. Notably, happy faces were not used in this study. Therefore, this study was also unable to comment on the effect of valence (positive vs. negative) on response inhibition.

The present study

Given a difference in the interaction between positive and negative emotion with attention, as well as the role of attention in response inhibition, the present study aimed to examine the role of valence (happy vs. angry) in response inhibition. Notably, happy and angry faces used in the present study were matched on the arousal level. Also, we aimed to better understand the role of valence rather than low-level visual differences between emotional faces that are visually similar but vary in emotional expressions (happy vs. angry). These are the strengths of the present study.

Based on previous findings, two contrary hypotheses are formulated. “Dual competition framework” (Pessoa, 2009) predicted that happy faces would facilitate the response inhibition (smaller SSRT value) because it requires fewer resources to process and leaving enough resources required for response inhibition. In contrast, an angry stop-signal would interfere with the response inhibition (higher SSRT value) because it consumes more resources to process and leaves fewer resources for response inhibition. In contrast to the prediction of the “Dual competition framework,” “Approach and avoidance framework” (Hammer & Marsh, 2015; Marsh et al., 2005) predicted that happy stop-signal would reduce the response inhibition (higher SSRT value) because it facilitates approach-related behavior. Also, happy faces are appetitive in nature. In contrast, angry stop-signal would facilitate response inhibition (smaller SSRT value) because it facilitates avoidance-related behavior and is perceived as aversive. In line with this framework, it has been reported that “it is difficult to disengage attention from an angry face once it is seen” (Becker et al., 2019, p. 107). Authors found that angry faces hold the attention and resist attentional disengagement more than happy faces. This may suggest that angry faces produce freezing behavior, and therefore, are more likely to facilitate response inhibition than happy faces are.

Since the arousal level of both happy and angry faces was matched in the present study, therefore, the difference between happy and angry faces in response inhibition would be completely attributed to valence difference.

Methods

Participants

Sixty young adults (43 males, Mage = 26.48 years, SD = 5.58 years) participated after giving informed consent. We estimated (using G-Power) a necessary sample size of 45 to detect a medium-size effect of 0.25 (Cohen, 1988), and to obtain a power level of 0.95. All participants were right-handed with normal or corrected-to-normal vision and with no past neuropsychiatric disorder. The ethics committee of the Indian Institute of Technology Bombay approved the study.

Apparatus

Stimuli were displayed on a 15.6-in. color monitor (60 Hz; resolution 1,920 × 1,080 pixels) viewed binocularly from a distance of 60 cm. Octave (an open-source version of the MATLAB interpreter) (Brainard, 1997; Kleiner et al., 2007) operating on a computer with a 2.0 GHz Ryzen 5 processor generated stimuli and recorded responses acquired via a number pad.

Stimuli

A total of four static grey-scale photographs of young adults (two male and two female identities with three emotions) were selected as stop-signals from the Karolinska Directed Emotional Faces (KDEF) (Lundqvist, Flykt, & Öhman, 1998), and the Nimstim Face Stimulus Set (Tottenham et al., 2009). Each face displayed all three emotions (happy, angry, and neutral) and subtended approximately 2.1° × 2.9°. 31 (17 males, Mage = 28.8 years, SD = 4.17 years) healthy participants, who did not take part in the main experiment, rated these original faces on arousal level. Each emotion of a face was presented at a time on a computer screen during the rating phase. Participants were required to make a response by selecting one of the three emotion options of emotion: happy, angry, and neutral. After this, participants had to choose a number that corresponded with the arousal level of a target face from a Likert scale ranging from 1 to 9, where one indicated “not at all aroused” and nine indicated “very aroused.”

A two-way ANOVA was performed on arousal score using gender of the faces (male vs. female) and emotion (happy, angry, neutral) as within-group variables. The main effect of emotion was significant, F(2, 60) = 15.57, MSE = 5.52, p < .0001, ηp2 = .342. The arousal score was significantly lower for the neutral faces (M = 4.37, SD = 2.26) compared to angry (M = 6.60, SD = 1.72), t(1, 30) = 4.19, p < .0001, and happy faces (M = 6.13, SD = 1.55), t(1, 30) = 4.38, p < .0001. There was no significant difference in the arousal score between angry and happy faces, t(1, 30) = 1.57, p = .126. There was no significant main effect of gender of the faces, F(1, 30) = 0.580, MSE = 1.56, p = .452, ηp2 = .019, which indicates that arousal score was similar for male (M = 5.63, SD = 1.32) and female faces (M = 5.77, SD = 1.45). There was no significant interaction between face gender and emotion on arousal ratings, F(2, 60) = 0.491, MSE = 1.43, p = .615, ηp2 = .016.

Procedure for the stop-signal task

The stop-signal paradigm (Gupta et al., 2011; Gupta & Kar, 2009) was used to measure response inhibition (see Fig. 1). There were two types of trials: go and stop-trials. Each trial starts with a fixation cross (0.49 × 0.49°) at the center of the screen for 500 ms, which was followed by the presentation of a letter “X” (2.2° × 2.4°) or “O” (2.3° × 2.5°) for 1,000 ms. In go-trials subjects were required to indicate “X” or “O” via key-press, using their right-hand, which stayed on the screen for 1,000 ms irrespective of participants’ response and was followed by a blank screen for 2,000 ms. Stop-trials were similar to the go-trials except that a brief picture of a face (happy, angry, or neutral) was presented above the letter “X” or “O” for 250 ms after a random duration with a variable stop-signal delay (SSD) relative to the onset of the go stimulus. Each face was separated from the letter “X” or “O” by 0.20°. The initial value of the SSD was set at 250 ms for all three stop conditions and adjusted by a staircase tracking procedure. After each successful stopping the SSD was increased by 50 ms on a subsequent stop trial and after each unsuccessful stopping the SSD was reduced by 50 ms on a subsequent stop trial (Logan et al., 1997; Pessoa et al., 2012). This staircase was run separately for each condition to ensure successful inhibition of approximately 50% of the stop trials in each condition. Participants were required to respond to letters as quickly and as accurately as possible and withhold their motor response upon viewing a face that followed the letters. Equal emphasis was given on executing and withholding the motor response. A full practice run was given before the experimental session. In the practice run, a separate set of neutral faces was used as a stop-signal.

Fig. 1
figure 1

An example of go and stop-trials. During go trials, participants responded to go-signals (X or N?). During stop-trials, they were required to withhold their motor response (signaled by a face with irrelevant happy, angry, or neutral emotions). The stop-signals followed the go-signals after a variable delay called stop-signal delay. The SSD was updated based on a staircase procedure separately for each stop-signal condition to get stop-performance at approximately 50% correct. SSD Stop-Signal Delay

There were a total of 192 trials in the experiment. A total of four blocks was completed. Each block contained a total of 48 trials (36 go-trials (75%) and 12 stop-trials (25%; four for each of the three-stop conditions). Each stop-trial was followed by at least one go-trial. “X” and “O” appeared for the equal number of times in both go and stop trials.

Data analysis

Data of ten participants were removed from the analysis due to excessive omission errors (inability to respond to go-stimulus on go-trials) rate (≥13%; omission error rate range = 13–6%; mean omission error rate = 21.87%, SD = 14.24%), which indicates that these participants were waiting for the stop-signals for a more extended period and missed to respond on go-trials. Also, the data of five participants were removed from the analysis due to excessive commission error (inability to withhold motor response on stop-trials), rate (≥75%, range = 89–100%), which indicates that these participants were not able to pay attention in the task (see footnoteFootnote 1, for detailed discussion).

In the final analysis, there were 45 participants (33 males, Mage = 26.02 years, SD = 5.37 years; mean omission error rate = 3.47%, SD = 3.54%). Stop-signal reaction time (SSRT), which is a measure of response inhibition time, was calculated (see footnoteFootnote 2) for each participant in each condition as median RT during correct go-trials minus median SSD (SSD values: happy: M = 268.88, ms, SD = 135.38 ms; angry: M = 291.66 ms, SD = 133.56 ms; neutral: M = 280.55, SD = 136.08 ms) (Logan & Cowan, 1984; Verbruggen & Logan, 2009). A one-way repeated-measure ANOVA was performed on SRRT values using stop-signal type (happy, angry, and neutral) as a within-group factor. The SSD was adjusted using the stair-casing method for each stop-signal type separately to achieve an approximately 50% success rate of inhibition. Likewise, one-way repeated-measure ANOVAs was performed on commission error and stop-respond RT (unsuccessful stop-trials (UNSUCC): stop trials on which participants failed to inhibit the motor response) (Pessoa et al., 2012). Alpha level was set to 0.05.

Results

The one-way ANOVA on SRRT score yields a significant main effect of stop-signal type, F(2, 88) = 3.52, MSE = 1657.09, p = 0.034, ηp2 = .074, which suggests that irrelevant emotional information modulates response inhibition (see Fig. 2). The SSRT score was significantly lower for angry (M = 307.6 ms, SD = 50.52 ms) compared to happy (M = 330.4, ms, SD = 61.3 ms), t(44) = -2.35, p = 0.02, but close to significantly different for neutral (M = 318.8, SD = 52.4 ms), t(44) = 1.86, p = 0.070, stop-signal. This may indicate that an angry stop-signal facilitated the response inhibition. There was no significant difference in SSRT between the happy and neutral conditions, t(44) = -1.22, p = .22., which may indicate that both happy and neutral stop-signal modulate response inhibition similarly. As expected, the one-way ANOVA on commission error rate yields a non-significant main effect of stop-signal type, F(2, 88) = 1.26, MSE = 19.27, p = 0.28, ηp2 = .268, (neutral: M = 41.52%, SD = 13.98%; happy: M = 42.63%, SD = 13.34%; angry: M = 41.25%, SD = 13.41%), which indicates that stop performance was approximately 50% correct during all three emotional conditions. Finally, there was no significant main effect of a stop-signal type on stop-respond RT, F(2, 88) = .402, MSE = 2999.42, p = .67, ηp2 = .009. The reaction times of stop-respond (UNSUCC) trials were significantly faster than those of correct go trials (angry: t(44) = 6.80, p = .001; happy: t(44) = 8.09, p = .001; neutral: t(44) = 5.97, p = .001), as predicted by the race model (Logan & Cowan, 1984).

Fig. 2
figure 2

Mean SSRT was plotted as a function of emotion. Vertical lines represent ± 1 within-subject standard error. SSRT Stop-Signal Reaction Time

Discussion

The present study is the first study where faces with irrelevant positive (happy) and negative (angry) expressions were used as stop-signals and compared with the neutral faces to examine the response inhibition process. Moreover, both happy and angry faces were matched on the arousal level. The SSRT value was smaller for angry face stop-signals than for happy and neutral face stop-signals; therefore, we argue that the valence (positive and negative) of the stop-signals matter and that they modulate response inhibition.

The present study holds many similarities to Pessoa et al.’s (2012) study (see footnoteFootnote 3). However, the results are quite different. Pessoa et al. (2012) found a facilitatory effect of the irrelevant emotional information on response inhibition, irrespective of the positive and negative valence of the stop-signals (see Experiment 1). However, in the present study, the facilitatory effect of irrelevant emotional information was only observed for angry faces compared to happy and neutral faces. This result is consistent with the prediction of “approach and avoidance framework,” but inconsistent with the prediction of “dual competition framework.” Why only irrelevant angry face stop-signals, but not happy face stop-signals, facilitated response inhibition in the present study? There could be several explanations, which are discussed next.

First, it has been observed that angry emotion elicits avoidance reactions if contrasted with the expression of happiness. However, when angry emotion was contrasted with other negative emotions like fear and sadness, it produces an approach tendency (Paulus & Wentura, 2016). In the present study, angry faces were contrasted with happy faces, whicht may have activated the avoidance response and defensive motivational system that facilitates freezing behavior (“a statue-like inhibition of movement” (Facchinetti et al., 2006; Lang & Bradley, 2013; Lang et al., 2000) and may have held attention (Becker et al., 2019), which in turn facilitated response inhibition. These explanations are consistent with the prediction of the “Approach and avoidance framework.” In the present study, no other negative emotional faces (e.g., fearful, sadness) were included; therefore, we were not able to examine the differential effect of different negative emotions on response inhibition. Future studies should include arousal matched different negative emotional faces (e.g., angry, fearful, sad) in the same study to examine this.

Second, it has been observed that when fewer attentional resources are available, happy faces capture attention and interfere with the primary task more compared to angry faces (Gupta, 2019, for a review; Gupta et al., 2016). In the present study, we found the same results that irrelevant happy emotion of the stop-signals reduced response inhibition (primary task) more compared to angry face stop-signals.

Third, the behavioral results of the present study may indicate that the processing of irrelevant angry faces may have modulated neural processing pathways involved in response inhibition (Pawliczek et al., 2013). Neuroimaging studies indicated that the stop-signal on angry face trials activated the occipital brain areas compared to the neutral face. It may reflect higher representation or higher attention processing of angry faces, leading to a facilitation effect in response inhibition compared to happy and neutral faces (Pawliczek et al., 2013). In line with this view, it has been suggested that the detection of angry faces is fast and efficient (Fox et al., 2000), which may indicate an enhanced representation of angry faces in the visual cortex compared to happy and neutral faces. Also, Pawliczek et al. (2013) have found increased activation in the bilateral fusiform gyrus (see also Jackson et al., 2008; Kesler-West et al., 2001) and posterior cingulate cortex (PCC) (see also Sprengelmeyer et al., 1998) in response to angry faces. Activation in both brain areas has also been found for emotional and non-emotional inhibition tasks. Higher activation in the PCC supports that major attentional resources were allocated to process angry faces (Boehler et al., 2009). Together, higher activation in all three brain areas (visual cortex, fusiform gyrus, and PCC) in response to angry face stimuli may have facilitated response inhibition (Pawliczek et al., 2013). The present study should be replicated with neuroimaging studies to test these explanations.

To summarize, the results of the present study indicate that stop-signals with irrelevant angry facial expressions facilitate response inhibition compared to stop-signals with irrelevant happy and neutral facial expressions. Our results extend previous findings by suggesting the valence of the stop-signals modulates inhibitory control. We propose comparing happy faces against angry faces to examine the role of valence (positive and negative) on response inhibition. The “approach and avoidance framework” and “freezing account” of emotion can explain the results of the present study compared to the “dual competition framework.” Therefore, it is essential to test the “dual competition framework” using other facial expressions. Future studies should include arousal matched angry, happy, and fearful faces as stop-signals in a paradigm used in Pessoa et al.'s (2012) study to strengthen the results of the present study, which will further help examine the role of valence in response inhibition.