Decades of research firmly support the idea that the automatic capture of attention is influenced both by the current goals of the observer (i.e., goal-contingent attentional capture; e.g., Folk, Remington, & Johnston, 1992; Serences et al., 2005) and by the physical salience of stimuli (e.g., Theeuwes, 1992, 2010). The idea that attentional capture can be uniquely driven by learned value (i.e., value-driven) is much more recent (Anderson, Laurent, & Yantis, 2011b). To study value-driven attention, Anderson and colleagues (2011b) developed what has come to be referred to as the value-driven attentional capture paradigm, which uses a training phase-test phase design. Specifically, in the training phase, participants receive rewards (often monetary) for finding feature-defined (often color-defined) targets; participants then complete an unrewarded test phase in which the prior target-defining feature(s) are explicitly task-irrelevant. Critically, in the test phase, a non-target is rendered in the color of a formerly reward-predictive target (referred to as a valuable distractor) on some trials. Slowing of response time (RT) on trials in which a valuable distractor is present, relative to distractor-absent trials, is taken as evidence of attentional capture by the distractors (see Anderson, 2013).

Value-driven attention has become a topic of great interest in the field, popularizing the use of the value-driven attentional capture paradigm (see Anderson, 2016b, for a recent review). However, the usefulness of this paradigm in measuring attentional capture that is value-driven, the very thing it purports to measure, has recently been questioned (Sha & Jiang, 2016). Given the widespread use of the paradigm in attention research and the claims that rest on its assumptions, this is a serious criticism that demands careful consideration. The criticism has two primary components, which will be addressed in turn:

Does the presence of reward during training actually matter?

Without any explicit reward feedback, simply locating a target repeatedly over trials can give rise to attentional biases that mirror value-driven attention (e.g., Kyllingsbaek, Schneider, & Bundesen, 2001; Kyllingsbaek, Van Lommel, Sorensen, & Bundesen, 2014; Qu, Hillyard, & Ding, in press; Shiffrin & Schneider, 1977). It was traditionally thought that such selection history biases require substantial training to develop, typically thousands of trials over multiple days (Kyllingsbaek et al., 2001, 2014; Shiffrin & Schneider, 1977). However, statistically significant attentional biases for former targets have more recently been measured using much shorter single-session training, the length of which was more comparable to the length of training used in value-driven attentional capture studies (Lin, Lu, & He, 2016; Sha & Jiang, 2016; Wang et al., 2013). If significant attentional biases can be measured without reward feedback following a single session of training, the question arises as to whether the reward feedback actually modulates attentional capture in the value-driven attentional capture paradigm.

Earlier studies on value-driven attention were sensitive to this potential criticism (e.g., Anderson et al., 2011a, 2011b), although rigorous tests of value-dependence were often lacking. No significant attentional capture was observed using an unrewarded but otherwise identical version of the value-driven attentional capture paradigm in the original demonstration (Anderson et al., 2011b), although it has been suggested that this could be due to the study's small sample size (Sha & Jiang, 2016). Furthermore, a direct comparison between the (lack of) capture in the unrewarded version and the purportedly value-driven attentional capture in the rewarded version of the task was lacking (although see Reanalysis of Anderson et al. (2011b) section).

Other studies including an unrewarded version of the training phase were subsequently published, showing either no (Anderson, 2016c; Anderson et al., 2012, 2014a; Qi et al., 2013; see also Anderson, 2015b) or small but reliable (Wang et al., 2013) capture by former targets. In some studies, direct comparisons between rewarded and unrewarded training were shown to be significant (Anderson et al., 2011a; Roper & Vecera, 2016; Sali, Anderson, & Yantis, 2014; Wang et al., 2013). However, in these studies with direct comparisons, modifications to the original paradigm were used, leading some to question the degree to which such findings might generalize to the specific conditions frequently used in the traditional implementation of the paradigm (Sha & Jiang, 2016). Using a design similar to the original value-driven attentional capture paradigm (although, see General discussion for some differences), Sha and Jiang (2016) showed significant attentional capture by former target colors following unrewarded training that did not differ in magnitude from capture following otherwise equivalent rewarded training, compounding this potential criticism.

Does the magnitude of reward during training actually matter?

Perhaps the most clear-cut evidence in favor of value-dependence is a difference in the magnitude of attentional capture that parallels a difference in learned value between stimuli. If distractors previously associated with high reward capture attention to a greater degree than distractors previously associated with comparatively low reward, the difference between the two must be attributed to the difference in value. Both stimuli served as targets in the same context, such that target/selection history and even global motivational factors linked to the availability of reward (although see Sali et al., 2014) cannot explain any difference in capture.

However, the difference in capture between distractors of different specific values during training is notoriously small, especially in the most common implementations of the value-driven attentional capture paradigm. Capture by the high-value distractor is often on the order of 10–20 ms, leaving little room for variation along this metric (see Anderson, 2013). The small magnitude of the capture effect in general is perhaps unsurprising, as the value-driven attentional capture paradigm intentionally "stacks the deck" in favor of no capture by the distractor in order to make strong claims about automaticity. That is, the target is more physically salient than the distractor, whose defining feature is explicitly task-irrelevant and never (not even incidentally) coincides with the target (Anderson et al., 2011b). This powerful design is perhaps what gives the paradigm its widespread appeal, but, as Sha and Jiang (2016) indicate, it can also create ambiguity in the interpretation of capture, particularly when the former criterion for value-dependence has not been explicitly met.

The original demonstration of value-driven attentional capture (Anderson et al., 2011b) did not contain a direct comparison of distractors of relative value (although see Reanalysis of Anderson et al. (2011b) section), relying instead on the unrewarded control condition. In certain other experiments utilizing this paradigm, the comparison was explicitly not significant (e.g., Anderson & Yantis, 2012; Anderson et al., 2013b; Laurent, Hall, Anderson, & Yantis, 2015). However, many clear cases of value dependence have been observed using the training phase-test phase design along the lines of Anderson and colleagues (e.g., Anderson, 2015a, 2015b, 2016a, 2016b; Anderson & Yantis, 2013; Anderson et al., 2011a, 2012, 2016b; Theeuwes & Belopolsky, 2012; Failing & Theeuwes, 2014; Hickey & Peelen, 2015; Jiao et al., 2015; Mine & Saiki, 2015; Moher, Anderson, & Song, 2015; Pool, Brosch, Delplanque, & Sander, 2014; Roper, Vecera, & Vaidya, 2014), demonstrating a difference in attentional capture that corresponds with a difference in the learned value of the distractor; at least three of these studies have involved a near-identical replication of the original paradigm (Anderson et al., 2016b; Jiao et al., 2015; Roper et al., 2014; see also Anderson et al., 2016c, for correlations between striatal dopamine and this measure of value-dependence). As Sha and Jiang (2016) point out, though, these compelling demonstrations use a variety of specific design features, dependent measures, reward manipulations, and study populations that differ from the original study of Anderson et al. (2011b).

So, although the concept of value-dependence in the control of attention is robustly supported as a theoretical principle, the utility of the traditional value-driven attentional capture paradigm in its assessment (at least of college-age participants) can, understandably, be questioned. Given the widespread use of this paradigm and the claims that have been made on its basis, such criticism should be seriously considered, especially in light of Sha and Jiang's (2016) recent study, which failed to observe an influence of the relative value of the distractors on the capture of attention.

Rationale for the present study

Given the large number of studies that have rested their conclusions concerning value-dependence on the assumptions of the traditional value-driven attentional capture paradigm (see Anderson, 2016b), we thought it necessary to firmly establish its ability to measure attentional capture that is truly value-dependent. To this end, we replicated the original value-driven attentional capture study (Anderson et al., 2011b; see also Anderson et al., 2014b) with a larger sample size in an effort to establish whether a critical difference in capture between high- and low-value distractors is present. Evidence affirming such a difference would support the theory that learned value indeed contributes to the magnitude of capture observed for high-value distractors.

Experiment 1

Experiment 1 was a direct replication of the paradigm introduced by Anderson et al. (2011b), with a larger sample size (n = 40). Given its substantially wider use in the field, we used the shorter version of the task (Experiment 3 of Anderson et al., 2011b; 240 trials in each phase).

Methods

Participants

Forty participants were recruited from the Johns Hopkins University community. All reported normal or corrected-to-normal visual acuity and normal color vision.

Apparatus

A Mac Mini equipped with Matlab software and Psychophysics Toolbox extensions (Brainard, 1997) was used to present the stimuli on an Asus VE247 monitor. The participants viewed the monitor from a distance of approximately 50 cm in a dimly lit room. Manual responses were entered using a standard keyboard.

Training phase

Stimuli

Each trial consisted of a fixation display, a search array, and a feedback display (Fig. 1A). The fixation display contained a white fixation cross (0.5° × 0.5° visual angle) presented in the center of the screen against a black background, and the search array consisted of the fixation cross surrounded by six colored circles (each 2.3° × 2.3°) placed at equal intervals on an imaginary circle with a radius of 5°. The target was defined as the red or green circle, exactly one of which was presented on each trial; the color of each non-target circle was drawn from the set {blue, cyan, pink, orange, yellow, white} without replacement. Inside the target circle, a white bar was oriented either vertically or horizontally, and inside each of the non-targets, a white bar was tilted at 45° to the left or to the right (randomly determined for each non-target). The feedback display indicated the amount of monetary reward earned on the current trial, as well as the total accumulated reward.
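
For concreteness, the following minimal sketch (in Python; not part of the original methods) computes the six placeholder positions in degrees of visual angle relative to fixation. The starting angle of the array and the coordinate convention are assumptions, and conversion to screen pixels would additionally depend on monitor geometry and viewing distance.

```python
import math

# Six placeholder positions at equal intervals on an imaginary circle
# of radius 5 degrees of visual angle, centered on fixation at (0, 0).
RADIUS_DEG = 5.0
N_ITEMS = 6

positions_deg = [
    (RADIUS_DEG * math.cos(2 * math.pi * i / N_ITEMS),
     RADIUS_DEG * math.sin(2 * math.pi * i / N_ITEMS))
    for i in range(N_ITEMS)
]
# e.g., [(5.0, 0.0), (2.5, 4.33), (-2.5, 4.33), (-5.0, 0.0), (-2.5, -4.33), (2.5, -4.33)]
```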

Fig. 1

Sequence of events for a trial. (A) Example trial for the training phase of Experiment 1. Participants search for a target that is equally often red or green, and report the orientation of the bar within the target with a button press. Correct responses are followed by feedback in which a small amount of money is added to a running bank total that participants are paid at the end of the experiment. One color target is more likely to yield a high reward (80% high, 20% low) than the other (20% high, 80% low). (B) Example trial for the test phase of Experiments 1 and 2a. Participants now search for a shape singleton target (diamond among circles or circle among diamonds), and the color of the shapes is irrelevant to the task. No monetary rewards are available. On a subset of trials, one of the non-targets (distractor) is rendered in the color of a former target. (C) Example trial for the training phase of Experiments 2a and 2b. Participants search for a color-defined target, and no monetary rewards are provided. (D) Example trial for the test phase of Experiment 2b. Participants search for a different color-defined target (red if they previously searched for green and vice versa). On a subset of trials, one of the non-targets (distractor) is rendered in the prior target color from training.

Design

One of the two color targets (counterbalanced across participants) was followed by a high reward of 10¢ on 80% of correct trials and a low reward of 2¢ on the remaining 20% (high-reward target); for the other color target, these percentages were reversed (low-reward target). Each color target appeared in each location equally often, and trials were presented in a random order.
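
A minimal sketch of this reward contingency is given below. The function name and the use of trial-wise random sampling (rather than a pre-assigned 80/20 schedule) are illustrative assumptions, and the counterbalancing of which color serves as the high-reward target is not shown.

```python
import random

# Hypothetical sketch: the high-reward target yields 10 cents on 80% of
# correct trials and 2 cents on the remaining 20%; the contingency is
# reversed for the low-reward target.
P_HIGH_OUTCOME = {"high": 0.8, "low": 0.2}  # probability of the 10-cent outcome

def reward_for_correct_trial(target_value: str) -> int:
    """Return the reward in cents for a correct response."""
    return 10 if random.random() < P_HIGH_OUTCOME[target_value] else 2
```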

Procedure

The training phase consisted of 240 trials, which were preceded by 50 practice trials. Each trial began with the presentation of the fixation display for a randomly varying interval of 400, 500, or 600 ms. The search array then appeared and remained on-screen until a response was made or until 800 ms had elapsed, at which point the trial timed out. The search array was followed by a blank screen for 1,000 ms, the reward feedback display for 1,500 ms, and a 1,000-ms inter-trial interval (ITI).

Participants made a forced-choice target identification by pressing the "z" and the "m" keys for the vertically and horizontally oriented bars within the targets, respectively. Correct responses were followed by monetary reward feedback in which a small amount of money was added to the participant's total earnings. Incorrect responses or responses that were too slow were followed by feedback indicating 0¢ had been earned. If the trial timed out, the computer emitted a 500-ms, 1,000-Hz tone.

Test phase

Stimuli

Each trial consisted of a fixation display, a search array, and a feedback display (Fig. 1B). The six shapes now consisted of either a diamond among circles or a circle among diamonds, and the target was defined as the unique shape. On a subset of the trials, one of the non-target shapes was rendered in the color of a formerly reward-associated target from the training phase (referred to as the valuable distractor); the target was never red or green. The feedback display informed participants only whether their prior response was correct or incorrect.

Design

Target identity, target location, distractor identity, and distractor location were fully crossed and counterbalanced, and trials were presented in a random order. Valuable distractors were presented on 50% of the trials, half of which were high-value distractors and half of which were low-value distractors (high- and low-reward color from the training phase, respectively).
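
The sketch below illustrates one way such a balanced trial list could be constructed. The factor labels and the exact counterbalancing scheme are assumptions made for illustration (not the authors' code), and assignment of the distractor to one of the five non-target positions on distractor-present trials is omitted for brevity.

```python
import itertools
import random

# Illustrative construction of a balanced test-phase trial list. The key
# constraints from the text: distractors appear on 50% of trials, split
# evenly between high- and low-value colors, crossed with target factors.
target_shapes = ["circle_among_diamonds", "diamond_among_circles"]
target_locations = range(6)
distractor_conditions = ["absent", "absent", "high", "low"]  # 50% present overall

trials = [
    {"target_shape": shape, "target_loc": loc, "distractor": dist}
    for shape, loc, dist in itertools.product(
        target_shapes, target_locations, distractor_conditions
    )
]
random.shuffle(trials)
print(len(trials))  # 48 trials per counterbalancing cycle; five cycles give 240
```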

Procedure

Participants were instructed to ignore the color of the shapes and to identify the unique shape, reporting the orientation of the bar within it using the same orientation-to-response mapping as in training. The test phase consisted of 240 trials, which were preceded by 20 practice (distractor-absent) trials. The search array was followed immediately by error feedback for 1,000 ms in the event of an incorrect response (this display was omitted following a correct response) and then by a 500-ms ITI; no monetary rewards were given. Trials timed out after 1,200 ms. As in the training phase, if the trial timed out, the computer emitted a 500-ms, 1,000-Hz tone. Upon completion of the experiment, participants were paid the cumulative reward they had earned in the training phase.

Data analysis

Only correct responses were included in the computation of mean RT for each participant, and RTs more than 3 standard deviations (SDs) from the mean of each condition for each participant were trimmed. The RT trimming procedure resulted in the exclusion of 0.4% of trials.
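
A minimal sketch of this trimming procedure is shown below, assuming a long-format trial-level data set with hypothetical column names ("participant", "condition", "rt", "accuracy"); it is an illustration of the rule described above, not the authors' analysis code.

```python
import pandas as pd

def trim_rts(df: pd.DataFrame) -> pd.DataFrame:
    """Keep correct trials whose RT lies within 3 SDs of the
    participant-by-condition mean (column names are assumptions)."""
    correct = df[df["accuracy"] == 1].copy()
    grouped = correct.groupby(["participant", "condition"])["rt"]
    zscores = (correct["rt"] - grouped.transform("mean")) / grouped.transform("std")
    return correct[zscores.abs() <= 3]
```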

Results

Training phase

Mean RTs were 568 ms for high-value targets and 564 ms for low-value targets, which did not significantly differ, t(39) = –1.28, p = .207. Mean accuracy was 83.0% for high-value targets and 81.5% for low-value targets, which did not significantly differ, t(39) = 1.39, p = .171. Even when the analysis was restricted to the second half of trials, no reliable effects of reward were detected in either RT or accuracy (550 ms and 85.1% vs. 548 ms and 83.9% for high- and low-value targets, respectively, ts < 0.87, ps > .38).

Test phase

Mean RTs for the distractor-absent, low-value, and high-value conditions were 688, 689, and 698 ms, respectively (see Fig. 2). An ANOVA revealed a main effect of distractor condition, F(2,78) = 4.48, p = .014, ηp² = .103. RTs in the high-value distractor condition were significantly slower than RTs in both the distractor-absent, t(39) = 2.73, p = .009, d = .43, and critically, the low-value distractor conditions, t(39) = 2.30, p = .027, d = .36. Accuracy did not significantly differ by distractor condition (absent: 83.9%, low-value: 83.5%, high-value: 82.9%), F(2,78) = 0.67, p = .517 (see Fig. 2).
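
For readers who wish to mirror this style of analysis, the sketch below illustrates the general approach (a one-way repeated-measures ANOVA on distractor condition, followed by a paired t-test and a paired-samples Cohen's d). The column names, condition labels, and data layout are assumptions; this is not the authors' original analysis code, and it does not reproduce the reported partial eta squared.

```python
import pandas as pd
from scipy import stats
from statsmodels.stats.anova import AnovaRM

# `means` is assumed to be a long-format frame with one row per participant
# per distractor condition: columns "participant", "condition", "rt",
# with condition levels "absent", "low", and "high".
def test_phase_analysis(means: pd.DataFrame):
    # One-way repeated-measures ANOVA on distractor condition.
    anova = AnovaRM(means, depvar="rt", subject="participant",
                    within=["condition"]).fit()
    print(anova)

    # Follow-up paired comparison, e.g., high-value vs. low-value distractors.
    wide = means.pivot(index="participant", columns="condition", values="rt")
    t, p = stats.ttest_rel(wide["high"], wide["low"])
    diff = wide["high"] - wide["low"]
    cohens_d = diff.mean() / diff.std(ddof=1)  # d for paired designs
    print(t, p, cohens_d)
```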

Fig. 2

Response time by distractor condition in the test phase of the three experiments. Error bars reflect the standard error of the mean. *p<.05, **p<.01

Discussion

In contrast to the results of Sha and Jiang (2016), but consistent with a large number of prior reports (e.g., Anderson, 2016a, 2017c; Anderson et al., 2011a, 2012, 2013a, 2013b, 2014b, 2016b; Miranda & Palmer, 2014; Roper et al., 2014), reward did not significantly modulate performance in the training phase. However, and most critically, performance was significantly affected by the distractors in the test phase. Unlike in Sha and Jiang (2016), clear value-dependence was observed in that a significant difference emerged between the high-value and low-value distractor conditions. The results provide direct support for the value-dependence of attentional capture as measured in the value-driven attentional capture paradigm.

Experiment 2a

Another means of establishing value-dependence is to demonstrate significantly weaker capture following training without reward feedback. Experiment 2a examined the role of selection history, divorced from prior reward, in biasing attention. Participants performed a task that was identical to that in Experiment 1, with the exception that no monetary reward feedback was provided (see Fig. 1C).

Methods

Participants

Forty new participants were recruited from the Texas A&M University community. All reported normal or corrected-to-normal visual acuity and normal color vision.

Experimental task

The training phase was identical to that of Experiment 1, with the exception that the reward feedback display was removed. Instead, participants were only informed whether their prior response was incorrect or too slow. The test phase was identical to that of Experiment 1.

Data analysis

The analysis parameters were identical to Experiment 1. The 3 SD RT cutoff resulted in the exclusion of 0.5% of trials.

Results

In the test phase, the mean RT was 674 ms in the distractor-absent condition and 673 ms in the distractor-present condition, which did not significantly differ, t(39) = –0.35, p = .731 (Fig. 2). Furthermore, the magnitude of attentional capture by the high-value distractor in Experiment 1 was significantly greater than the magnitude of attentional capture by unrewarded former targets in the present experiment, t(78) = 2.29, p = .025, d = .51 (equal variances not assumed). Mean accuracy was 84.9% in the distractor-absent and 84.4% in the distractor-present conditions, which also did not significantly differ, t(39) = 0.75, p = .458.
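
The between-experiments comparison can be sketched as follows. The sketch assumes per-participant capture scores (distractor-present minus distractor-absent RT) from each experiment, uses Welch's t-test (equal variances not assumed), and computes an effect size from the average of the two group variances; the exact effect-size convention used in the reported analyses is not specified, so this is only an approximation.

```python
import numpy as np
from scipy import stats

# capture_exp1 and capture_exp2a: per-participant capture scores (in ms)
# from the rewarded (Exp. 1) and unrewarded (Exp. 2a) groups, respectively.
def compare_capture(capture_exp1: np.ndarray, capture_exp2a: np.ndarray):
    t, p = stats.ttest_ind(capture_exp1, capture_exp2a, equal_var=False)  # Welch
    pooled_sd = np.sqrt((capture_exp1.var(ddof=1) + capture_exp2a.var(ddof=1)) / 2)
    d = (capture_exp1.mean() - capture_exp2a.mean()) / pooled_sd  # assumes equal ns
    return t, p, d
```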

One possibility is that reliable capture by prior target-color distractors did occur earlier in the test phase but had extinguished by its conclusion, masking a larger and more robust selection history effect. However, even when considering only the first half of trials, there was still no evidence of capture by former target-color distractors (674 vs. 676 ms), t(39) = 0.45, p = .656.

Discussion

Unlike Sha and Jiang (2016), we did not find evidence of attentional capture by former target-colored distractors following unrewarded training. Also unlike Sha and Jiang (2016), a significant difference between the magnitude of attentional capture following rewarded compared to unrewarded training was observed. Our findings are further consistent with value-dependence in attentional capture (Anderson et al., 2011b).

Experiment 2b

Experiment 2b aimed to provide an even stronger test of the role of selection history in the guidance of attention, at least to the degree that this influence might reflect a residual consequence of task goals misguiding selection. Following unrewarded visual search for a single target color, participants performed a second version of the same color-search task in which they now searched for a different color (see Fig. 1D). Thus, in this version of the test phase, the stimulus displays looked essentially identical to those of the training phase (all colored circles). The former target could therefore easily be mistaken for the current target if participants did not efficiently update their goals, providing a robust opportunity for residual goal-directed influences to misguide selection.

Methods

Participants

Forty new participants were recruited, 20 from the Johns Hopkins University community and 20 from the Texas A&M University community. All reported normal or corrected-to-normal visual acuity and normal color vision.

Experimental task

The training phase was identical to Experiment 2a, except that the target was always the same color (red or green, counterbalanced across participants) and training lasted for 300 trials. The test phase was identical to the training phase, with the exception that the color of the target changed from red to green or vice versa, and one of the non-targets was rendered in the color of the prior target from the training phase on half of the trials (distractor-present trials, see Fig. 1D).

Data analysis

The analysis parameters were identical to Experiment 1. The 3 SD RT cutoff resulted in the exclusion of 1.5% of trials.

Results

Mean RT was 550 ms in the distractor-absent condition and 548 ms in the distractor-present condition of the test phase. As in Experiment 2a, this difference was not reliable, t(39) = –1.13, p = .266 (Fig. 2), and was significantly smaller than the magnitude of attentional capture evident for high-value distractors in Experiment 1, t(78) = 2.95, p = .005, d = .66 (equal variances not assumed). Mean accuracy was 93.0% in the distractor-absent and 93.2% in the distractor-present conditions, which also did not significantly differ, t(39) = –0.30, p = .765. There was no evidence of capture by former target-color distractors even in the first half of trials (555 vs. 556 ms), t(39) = 0.16, p = .871.

Discussion

Even when the search goals of the training phase and test phase were designed to be maximally confusable, potentially facilitating the residual influence of a top-down control setting (Folk et al., 1992), no evidence for perseverating attentional selection of the former target color was observed. Once again, Experiment 1 was found to have produced significantly greater distraction following training with rewards.

Reanalysis of Anderson et al. (2011b)

Combining across Experiments 1 and 3 of Anderson et al. (2011b), in which a rewarded training phase was used, the difference between the high- and low-value distractors, which was not previously reported, was in fact statistically significant, t(49) = 2.19, p = .034, d = .31. Mean RTs for the distractor-absent, low-value, and high-value conditions were 666, 674, and 682 ms, respectively (Fig. 3). Furthermore, the magnitude of attentional capture by the high-value distractor was significantly larger than the (non-significant) attentional capture by the former target-colored distractors in Experiment 2 of that study (unrewarded control), t(58) = 2.73, p = .016, d = .89 (equal variances not assumed). Thus, the original data provided by Anderson et al. (2011b) also meet the criteria for value-dependent attentional capture as outlined by Sha and Jiang (2016).

Fig. 3

Response time by distractor condition in the original data from Anderson et al. (2011b), combining across Experiments 1 and 3. Error bars reflect the standard error of the mean. *p<.05, **p<.01

General discussion

Across three experiments, we find clear statistical evidence for value-dependence using each of the two criteria advocated by Sha and Jiang (2016). This was further confirmed by a reanalysis of the original Anderson et al. (2011b) data. We therefore conclude that the traditional implementation of the value-driven attentional capture paradigm is indeed an appropriate means of assessing value-driven attentional processes, and that the reward manipulation in this paradigm plays an important role in determining subsequent behavioral performance. In the remainder of the General discussion, we explore considerations arising from the findings of Sha and Jiang (2016) that the present study speaks to, along with broader considerations concerning the use of the value-driven attentional capture paradigm.

Lessons from Sha and Jiang (2016)

Although the evidence for value-dependence is now substantial, as described above and further supported in the context of the original value-driven attentional capture paradigm in the present study, the negative findings of Sha and Jiang (2016) highlight a broader issue that demands consideration in the value-driven attention literature. Namely, purely value-dependent effects on attention are often subtle, at least under circumstances in which physically non-salient and completely task-irrelevant distractors are used (as is the case in the most popular implementation of the paradigm). While it is possible that the null findings of Sha and Jiang (2016) reflect Type II error, it would be disadvantageous not to examine differences between experiments that might have contributed to reduced sensitivity to value-dependent effects. We explore some of these possibilities here.

Reward manipulation

The overall amount of reward offered in the experiments of Sha and Jiang (2016) was noticeably reduced compared to typical methods employed in the value-driven attention literature. Cumulative rewards in their study amounted to a US$2–4 bonus spread over a greater number of trials, compared to the approximately US$13 earned by participants in the present study. Participants in Sha and Jiang (2016) also included individuals recruited for experiment credit, who may have been less concerned about the magnitude of their earnings than individuals for whom monetary compensation was the only incentive, as in the present study. Although it is clear that the magnitude of value-driven attentional capture does not scale with the raw magnitude of rewards and instead appears to be more sensitive to relative value (i.e., high vs. low; see Anderson, 2016b), it may be the case that rewards need to reach a certain minimal threshold before reliable consequences on the development of attentional bias can be measured. This may be especially the case when converting points to money, as was done in Sha and Jiang (2016), which adds a layer of abstraction.

Unrewarded control condition

Attentional capture by previously unrewarded target features was substantially larger in Sha and Jiang (2016) than in the present study and in several other studies using a similar paradigm (Anderson, 2017a; Anderson et al., 2011a, 2011b, 2014a; Qi et al., 2013; Roper & Vecera, 2016; Wang et al., 2013; see also Anderson, 2015b; Sali et al., 2014). There are a few differences in the experimental protocol that might explain the unusually high sensitivity to attentional capture based entirely on selection history in Sha and Jiang (2016) that are worth noting. First, training lasted 768 trials in the unrewarded version of their task, more than three times the length used in the typical implementation of the value-driven attentional capture paradigm (240 trials; see Sali et al., 2014, for even fewer). Such an increase in the length of training may have strengthened reward-independent habit learning to levels not typically seen in the value-driven attention literature (although see Experiment 2 of Anderson et al., 2011b). To maximize the robustness of value-dependent effects across rewarded and unrewarded training, it might be advisable to limit unrewarded training to the amount sufficient to produce significant attentional capture by high-value distractors following rewarded training, which is no more than 240 trials (the number used in the most common implementation of the paradigm). Additionally, in considering a comparison between rewarded and unrewarded attentional capture, it is certainly advisable not to use longer training for the unrewarded compared to the rewarded version of the task, as was the case in one pair of experiments that were compared in Sha and Jiang (2016).

Another potential factor contributing to enhanced attentional capture following unrewarded training in Sha and Jiang (2016) was the nature of the performance feedback. In typical implementations of the paradigm, participants are simply informed via text feedback if their response was incorrect. In Sha and Jiang (2016), a voice recording additionally told participants their response was wrong and that they should try to be accurate. The increased salience of this feedback may have accentuated the association of target color with internal reward (negative reinforcement) and/or punishment signals evoked in response to the feedback, producing capture that was not a pure reflection of selection history per se.

Finally, in the unrewarded experiment of Sha and Jiang (2016), the test phase was performed the day following training. This was not the case in prior unrewarded control experiments performed in the value-driven attention literature. The consequences of this spacing are not known, but it could facilitate memory for former targets via consolidation with sleep.

Time pressure

The present study utilized a stringent time-out criterion (1,200 ms in the test phase) that encouraged fast responses. The reduced overall accuracy and faster RTs in the present study compared to Sha and Jiang (2016) attest to the added difficulty associated with this design feature, which was employed in the original demonstration of value-driven attentional capture (Anderson et al., 2011b) and has been subsequently adopted in many other implementations of the paradigm. In the oculomotor capture literature, attentional capture is most prominent on fast response trials (e.g., van Zoest, Donk, & Theeuwes, 2004), including in the case of value-driven attentional capture (Pearson et al., 2016). For studies looking to measure value-dependence, implementing this design feature may be advantageous.

Other design considerations

Since the original study was reported (Anderson et al., 2011b), many variations on the value-driven attentional capture paradigm have been explored. Several of these may be useful in improving sensitivity to value-dependent effects on attention. Here, we briefly highlight a few of these variations.

Low-value or unrewarded targets

Robust value-dependent effects have been observed when a high-reward target is paired with a target that never yields reward during the same training phase (e.g., Failing & Theeuwes, 2014; Pool et al., 2014). This previously unrewarded target is still experienced in the context of visual search in which rewards are available, equating broad contextual effects of reward-related motivation. Swapping an unrewarded target for the low-value target in the design employed by the present study might produce more robust value-dependence (i.e., comparison of high-value vs. previously unrewarded former target color in the same experiment).

Eye movements as a dependent measure

Although most studies of value-driven attention have employed performance-related measures of attentional capture as in the present study and Sha and Jiang (2016), several have included eye-tracking measures as well (e.g., Anderson & Yantis, 2012; Failing et al., 2015; Le Pelley et al., 2015; Theeuwes & Belopolsky, 2012). Reflecting ballistic responses that provide a direct window into spatial selection, the probability of a saccade landing on or near a distractor has proven a highly reliable measure of value dependence that may have advantages over performance measures such as response time and accuracy. A similar argument regarding sensitivity to spatial selection can be made concerning studies that adopt a spatial cuing approach in which the cues can be rendered in a previously reward-associated feature (e.g., Failing & Theeuwes, 2014; Pool et al., 2014); however, this approach has a disadvantage in that the previously reward-associated stimulus sometimes predicts the target location, reducing the strength of claims that can be made concerning the automaticity of selection.

Study sample considerations

Value-driven attentional capture has been shown to be more robust in individuals who are more impulsive (Anderson et al., 2011b, 2016b; see also Anderson et al., 2013a; Qi et al., 2013), including individuals with substance abuse issues (see Anderson, 2016d, for a review), and much less robust in individuals who are depressed (Anderson et al., 2014b). It might be useful for studies of value-driven attention to measure and account for such variability, potentially as a covariate.

Considerations regarding the use of a separate training and test phase

Although the results of the present study support the validity of the traditional value-driven attentional capture paradigm, it is not the only paradigm available for studying value-dependent attention. Another paradigm has recently been developed that measures value-dependent effects on attention while circumventing selection history effects altogether (Le Pelley et al., 2015). In this paradigm, task-irrelevant distractors predict the reward outcome for correctly identifying the target. These distractors similarly come to capture attention, even though participants have never been explicitly rewarded for selecting them. If attentional capture in the value-driven attentional capture paradigm reflects some combination of value-dependent effects and selection history effects, and is perhaps particular to their interaction, why not just avoid the complexity entirely in favor of a purer measure of value-dependence?

There are at least three salient reasons why the training phase-test phase design should remain a powerful tool in the study of value-driven attention. The first involves its translational appeal. In everyday life, reward and goals are intricately linked: we pursue that which we find rewarding. Most reward learning happens in the context of goal-directed behavior. Value-driven attention, as measured using the training phase-test phase model, speaks to such broadly applicable learning processes. This can be prominently seen in the case of drug addiction, in which drug use is initially a voluntary and goal-directed behavioral process. Attentional biases for drug cues mirror value-driven attentional biases following non-drug reward learning very closely (Anderson, 2016d).

The second reason involves the pure and compelling test of automaticity provided by the test phase. The previously reward-associated distractor is in every way task-irrelevant and decoupled from current reward considerations, and can also be made to be otherwise intrinsically non-salient (i.e., physically inconspicuous) aside from its training history (Anderson et al., 2011b). This contrasts with a single-phase approach involving task-irrelevant but reward-predictive distractors (Le Pelley et al., 2015), which by their predictive nature possess some degree of relevance, pertinence, or informational value. In the typical implementation of the single-phase paradigm, the distractors are also physically salient, pairing reward with a corresponding automatic orienting response and confounding salience-dependent effects with value-dependent effects (e.g., Bucker, Belopolsky, & Theeuwes, 2015; Le Pelley et al., 2015; Pearson et al., 2015, 2016); when the distractors are non-salient, it appears that participants need to be explicitly informed of the relationship between reward and color, further suggesting that actively monitoring for reward-predictive information may play a role (Failing, Nissens, Pearson, Le Pelley, & Theeuwes, 2015; see also Munneke, Belopolsky, & Theeuwes, 2016).

The third reason why the training phase-test phase model provides a powerful tool in the study of value-driven attention concerns its flexibility in measuring a range of automatic biases. Because the distractors are entirely task-irrelevant, they can be implemented in basically any task, including tasks that examine response biases (Anderson et al., 2012, 2016a; see also Krebs, Boehler, Egner, & Woldorff, 2011; Krebs, Boehler, & Woldorff, 2010; Anderson, 2017b, for a review). This allows for a broader assessment of the impact of value-driven attention on information processing, and also allows for assessment of generalizability across contexts and situations (Anderson et al., 2012; Anderson, 2015a, 2015b) as well as the enduring nature of the learning (which is only possible under conditions of extinction; Anderson & Yantis, 2013).

We would conclude that each of these two paradigms provides a unique and valuable window into value-dependent attention, and that the choice of which paradigm to use should depend on the specific hypothesis under investigation. If the hypothesis is specifically concerned with separating value-dependent influences from broader consequences of selection history, or examining the consequences of reward learning on attention as the learning process unfolds, then the single-phase model introduced by Le Pelley and colleagues (2015) is the best choice. If the hypothesis involves making strong claims about automaticity, about purely history-related priority (independent of physical salience), or involves questions concerning broader consequences of learning (e.g., robustness to extinction, generalizability, extension to biases in other information processing domains), the training phase-test phase model offers distinct advantages. The training phase-test phase model may also have broader translational potential, for example, to our understanding of addiction-related processes (Anderson, 2016d), although the translational utility of the single-phase model has not yet been thoroughly examined as the paradigm is newer. To the degree that performance in these two paradigms offers similar insights into psychopathology and other real-world behaviors, this would further argue that they are predominantly measuring the same (value-dependent) attentional process. A hybrid approach, involving training as in the single-phase model and a test phase as in the present study, combines certain strengths of each of these paradigms (see Mine & Saiki, 2015).

Power considerations

Using the effect size measures from the present study (pooling across Experiment 1 and the reanalysis of Anderson et al., 2011b), a sample size of 40 yields statistical power of .55 to detect a difference between the high- and low-value distractor conditions using a two-tailed test at α = .05, and power of .85 to detect a difference in the magnitude of capture following rewarded and otherwise equivalent unrewarded training, as in Experiment 2a. The effect size for the at times elusive high- versus low-value distractor comparison is in the small-to-medium range, which contrasts with the highly robust measures of contingent attentional capture (Folk et al., 1992) and stimulus-driven attentional capture (Theeuwes, 1992, 2010) that attention researchers may be more familiar with; as a result, many value-driven attention studies have been underpowered to detect this measure of value-dependence. The more ambiguous but more robust comparison between the high-value distractor present and distractor absent conditions yields substantially greater power (.98); the power difference between the high-value versus absent and high- versus low-value comparisons might contribute to several of the ambiguities in the literature raised by Sha and Jiang (2016). Power to detect uniquely value-dependent effects might be enhanced by implementing some of the design features described in the Other design considerations section above.
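
Such power values can be approximated with standard tools. The sketch below uses statsmodels with illustrative effect sizes of roughly 0.34 (within-subject high- vs. low-value contrast) and 0.68 (between-experiments contrast); these are approximations inferred from the effect sizes and power values reported above, not exact inputs from the analyses.

```python
from statsmodels.stats.power import TTestPower, TTestIndPower

# Illustrative pooled effect sizes; chosen to approximate the values implied
# by the power figures reported above.
within_power = TTestPower().power(effect_size=0.34, nobs=40, alpha=0.05,
                                  alternative="two-sided")
between_power = TTestIndPower().power(effect_size=0.68, nobs1=40, alpha=0.05,
                                      ratio=1.0, alternative="two-sided")
print(round(within_power, 2), round(between_power, 2))  # approximately 0.55 and 0.85
```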

Conclusions

In conclusion, the value-driven attentional capture paradigm provides a useful tool for measuring the effects of reward learning on involuntary attentional capture. Attentional capture, as measured in this paradigm, is not reducible to target history effects divorced from reward feedback and related learning. The training phase-test phase model used by this paradigm provides a unique window into value-based attention that should continue to be leveraged to further our understanding of reward-related attentional processes. Experimental design and statistical power considerations should be taken into account in order to maximize the ability to detect unequivocally value-dependent effects.