Top-down control is critical in essentially every theory of attention (Bundesen, 1990; Desimone & Duncan, 1995; Duncan & Humphreys, 1989; Wolfe, Cave, & Franzel, 1989). Top-down control directs the mechanisms of perceptual attention to relevant items in the visual field, interacting with bottom-up factors, such as stimulus salience, to determine what stimuli are processed most efficiently. The biased-competition theory of attention (Desimone & Duncan, 1995) specifies a direct connection between the contents of working memory (WM) and top-down attention. According to biased competition, maintaining target information in WM leads to elevated activity in the cells selective for target features. This creates a competitive advantage for target-matching items in the visual field, increasing the probability that task-relevant information will be represented. This elegantly simple mechanism suggests a mandatory link between WM storage and top-down attentional control.

Conflicting findings in studies that have focused on whether WM maintenance is sufficient to control attention have spurred a growing debate (for reviews, see Olivers, Peters, Houtkamp, & Roelfsema, 2011; Soto, Hodsoll, Rotshtein, & Humphreys, 2008). In these studies, participants’ WM is typically loaded, and then they perform a visual search task that occasionally includes a memory-matching distractor. If WM maintenance is sufficient to provide top-down attentional control, attention should involuntarily be guided to WM-matching items in the visual field. Many studies have reported that attention is drawn to WM-matching distractors, which is most frequently evidenced by increased search reaction times (RTs) when a WM-matching distractor is present in the array (Olivers, Meijer, & Theeuwes, 2006; Soto, Heinke, Humphreys, & Blanco, 2005; Soto & Humphreys, 2007, 2009; Soto, Humphreys, & Heinke, 2006). However, notable exceptions (Dalvit & Eimer, 2011; Dombrowe, Olivers, & Donk, 2010; Houtkamp & Roelfsema, 2006), including reports of faster RTs when WM-matching distractors are present (Downing & Dodds, 2004; Han & Kim, 2009; Woodman & Luck, 2007), suggest that task-irrelevant WM matches do not always compete for attention and can be effectively ignored (e.g., Arita, Carlisle, & Woodman, 2013).

Due to the contradictory findings from behavioral measures, converging evidence from event-related potentials (ERPs) has been sought in order to provide definitive tests of WM’s guidance of attention (Carlisle & Woodman, 2011b; Kumar, Soto, & Humphreys, 2009; Peters, Goebel, & Roelfsema, 2009). The temporal resolution of ERPs can show early differences in attentional selection that might not be evident in behavioral output. However, two previous ERP studies with similar tasks have reported contradictory results. Kumar, Soto, and Humphreys loaded WM with a colored shape (Fig. 1A) that could be present as a distractor in a subsequent search task for a tilted cyan line. The N2pc component was measured to determine whether the WM matches captured attention. The N2pc (for N2-posterior-contralateral) can track the focus of covert attention (Woodman & Luck, 1999, 2003), with the electrodes contralateral to the focus of perceptual attention becoming more negative than ipsilateral electrodes (Woodman & Luck, 2003). When the memory-matching distractor was in the opposite hemifield from the search target, Kumar and colleagues reported that targets elicited a reduced-amplitude N2pc relative to target-only trials, whereas trials with a memory-matching distractor in the same hemifield as the target elicited an increased-amplitude N2pc relative to target-only trials. This pattern was taken as evidence that the WM matches captured attention, leading to the conclusion that maintaining a representation in WM is sufficient to create top-down control of attention.

Fig. 1
figure 1

(A) Stimulus sequences from Kumar, Soto, and Humphreys (2009). (B) Stimulus sequences from Carlisle and Woodman (2011b). The stimuli are not to scale, but represent the overall stimulus differences. Note that in the present work, a fixation point was added to the stimuli of Kumar et al

As is shown in Fig. 1B, Carlisle and Woodman (2011b) used a highly similar paradigm. However, they reported a very different pattern of results. Across three experiments in which they manipulated features of the search display, they found no evidence of an N2pc to the memory-matching distractors. Instead, they found evidence that memory-matching items were suppressed, because they elicited a lateralized positivity (Hickey, Di Lollo, & McDonald, 2009). In Experiment 4, the memory-matching item was the search target. When the memory match was task-relevant because it indicated the search target, it elicited a large N2pc, demonstrating that the stimuli were capable of producing early and robust N2pc effects. Overall, this suggested that goals were necessary for creating a top-down attentional bias from the memory representation.

The conflicting findings and conclusions from the Kumar et al. (2009) and Carlisle and Woodman (2011b) studies, as well as from previous behavioral work, have done little to resolve the debate in the literature about the mechanism of top-down attentional control. Throughout this debate, different laboratories have consistently used different stimuli, with few exceptions (Olivers, 2009). Here, we sought to reach across this divide by using the stimuli of Kumar et al., while manipulating the goal relevance of the memory-matching items across experiments, as in Carlisle and Woodman (2011b). Specifically, we felt it necessary to use the stimuli from the Kumar et al. study because they consisted of large colored shapes, one of which could match the memory representation, with a small tilted line as the search target element. This difference in the saliency between the memory-matching information and the search target might have contributed importantly to the ERP and behavioral effects observed. We then manipulated the goal relevance of the memory-matching items for search, as in Carlisle and Woodman’s (2011b) study. The memory-matching items never contained the search target element in Experiment 1A, whereas the memory-matching items always contained the search target element in Experiment 1B. This allowed us to determine whether the strengths of the attentional bias to the salient memory-matching items of Kumar et al. were always similar, regardless of goal of the search task at hand.

If a match between the bottom-up input and the memory representation is sufficient to control attention with the Kumar stimuli, we would expect to find similar N2pcs to the salient memory-matching items, regardless of their task relevance (i.e., the same N2pc to memory matches across Exp. 1A, in which the match was irrelevant for search, and Exp. 1B, it which it was relevant for search). In contrast, if the control of attention by WM is dependent on goals, we would expect to find a larger N2pc for the task-relevant memory matches in Experiment 1B, relative to the task-irrelevant memory matches in Experiment 1A.

Method

Participants

A group of 13 individuals from the Vanderbilt community participated in these experiments. All gave informed consent and were compensated at a rate of $10/h. Participants reported no history of neurological problems, normal or corrected-to-normal visual acuity, and normal color vision, and were between the ages of 18 and 35.

Stimuli and procedure

In Experiment 1A, participants performed the task of Kumar et al. (2009) (see Fig. 1A). The memory and search items were combinations of five possible colors (red, yellow, blue, green, or pink) and five possible outlined shapes (a circle, subtending 1.80º × 1.80º of visual angle; hexagon, 2.38º × 0.95º; square, 1.50º × 1.50º; diamond, 1.91º × 1.91º; or triangle, 2.00º × 1.50º, with each line width 0.24º). The memory cue and test arrays contained one item centered on the screen. The search arrays contained four objects presented on an imaginary circle 6º from the screen center, with one object per quadrant either 30º or 60º from vertical. The search target was a cyan line (0.57º × 0.12º, tilted 38º from vertical) centered in one of the objects. All other objects had a black vertical line (0.57º × 0.12º) presented in their center. A memory-matching distractor was present on half of the search trials. We only deviated from the methods of Kumar et al. in that we added a black fixation point (0.18º × 0.18º) beginning 800–1,200 ms before the memory cue until the end of the trial. This was necessary in order to help participants maintain fixation throughout each trial. Otherwise, all stimulus and timing parameters were identical to those of Kumar et al.

Participants were first shown an object to keep in memory (133 ms on, 133 ms off, and 500 ms on, followed by a 800-ms fixation-only period). Participants were then presented with the search array (2,500 ms) and reported whether the top of the cyan line tilted to the left or the right of vertical by pressing a left or right directional arrow on a gamepad with their left hand. On 20% of trials, a memory test was presented after the search array (800-ms blank screen followed by 2,000-ms memory test), with change and no-change tests being equally likely. Participants reported “change” or “no change” by pressing one of two buttons on the gamepad with their right hand. They were explicitly instructed that the search target line would never be within the memory match. After a short practice, they completed 400 experimental trials.

In Experiment 1B, all of the stimuli and procedures were the same, except as follows. The memory-matching item was now present on each trial and contained a black line tilted 38º to the left or the right of vertical, which served as the search target. Participants’ task was to report the tilt of this black line, and thus the tilted cyan line in one of the other colored shapes was now a distractor. Participants were informed that the black tilted line would always appear on the memory-matching item.

ERP recording and analysis

The electroencephalogram (EEG) was recorded from electrodes in an elastic cap (Electrocap International) using a subset of the International 10–20 sites (Fz, Cz, Pz, F3/F4, C3/C4, P3/P4, PO3/PO4, T3/T4, T5/T6, and O1/O2) and sites OL/OR (between O1 and T5 and O2 and T6, respectively) using our standard methods (i.e., Carlisle, Arita, Pardo, & Woodman, 2011; Carlisle & Woodman, 2011b). Trials containing ocular or myogenic artifacts were excluded from the averages (see Woodman & Luck, 2003). Participants with artifacts on more than 25% of correct search trials or residual systematic eye movements greater than 3.2 μV after averaging would have been replaced, but no participants met these criteria (one participant did not complete the study due to being unable to maintain fixation, and was replaced). Averages of 10.3% and 9.3% of trials per participant were rejected in Experiments 1A and 1B, respectively.

We measured the N2pc amplitude to the target as the difference in mean amplitudes from 200 to 300 ms after the search onset between electrode sites O1/2, OL/R, and T5/6 contralateral versus ipsilateral to the target location on correct search trials. Similarly, the N2pc to the memory-matching item was the difference between electrodes contralateral versus ipsilateral to the memory-match location. In Experiment 1B, the memory-match location was also the search target location.

The fractional-area latency method was used to measure the onset of the N2pc in order to provide an empirical metric of the salience of the memory-matching item relative to the tilted-line search target (Luck, 2005; Woodman, 2010). The N2pc onset was measured as the time point at which 25% of the area under the curve had occurred for each participant’s averaged contralateral-minus-ipsilateral waveforms, relative to the memory match. The measurement window for the N2pc was set to match the window used in all of the other N2pc analyses, resulting in a window that spanned 200–300 ms poststimulus. Only trials on which no memory item was present were included in the analysis of the tilted-line search target, to remove any influence of the memory-matching distractor on our fractional area latency measures. Greenhouse–Geisser epsilon corrections for nonsphericity (Jennings & Wood, 1976) were applied to the p values when appropriate, for all analyses.

Results

In Experiment 1A, participants were 99% correct on the search task, and responses were faster when the memory match was absent (727 ms) than when it was present (764 ms), t(12) = 4.51, p < .001. On the 20% of trials that contained a memory test, participants were 94% correct on trials (with 3% of the trials being incorrect due to no response). We found no significant difference based on whether the memory match had been present versus absent from the search array (95% vs. 94% correct, p = .33)

Figure 2A shows the N2pcs to the search target. The amplitudes of the target N2pc were similar when no memory-matching item was present (0.83 μV) and when the memory-matching item was in the same hemifield as the search target (0.82 μV), and smaller when the memory-matching item was in the opposite hemifield (0.15 μV). We performed an analysis of variance (ANOVA) with the factors Contralaterality (ipsilateral vs. contralateral to the target), Array Type (no memory vs. memory same hemifield as target vs. memory opposite hemifield from target), and Electrode Site (O1/2 vs. OL/R vs. T5/6). This showed significant main effects of contralaterality [F(1, 12) = 11.33, MSE = 1.86, p < .01], indicating that the N2pc to the search target was significant, and electrode [F(2, 24) = 4.75, MSE = 13.75, p < .05], due to generally more positive waveforms at OL/R, followed by T5/6 and then O1/2. We found an interaction of contralaterality and electrode [F(2, 24) = 4.57, MSE = 0.08, p < .05], with a larger N2pc at OL/R than at O1/2 and T5/6. Critically, a significant Array Type × Contralaterality interaction emerged [F(2, 24) = 5.57, MSE = 0.54, p < .05]. Follow-up analyses showed that this interaction was driven by the N2pc to the search target in the opposite hemifield from the memory match being significantly smaller than when the memory match was in the same hemifield [t(12) = 2.39, p < .05] or when no memory match was present [t(12) = 3.29, p < .01]. No other main effects or interactions were significant.

Fig. 2
figure 2

Event-related potentials (ERPs) recorded at electrodes OL/R. (A) N2pcs to the search target, as a function of the array type in Experiment 1A. (B) N2pc to the task-irrelevant memory-matching distractor in Experiment 1A. (C) N2pc to the task-relevant memory match in Experiment 1B. (D) N2pc difference waves (contralateral – ipsilateral) to the tilted-line target in Experiment 1A and the memory-matching target in Experiment 1B. Vertical lines indicate the points of 25% fractional-area latency of the N2pcs from 200 to 300 ms after search array onset

Next, we examined the N2pc to the memory-matching item. Figure 2B shows that the waveforms contralateral to the memory match were more negative than the ipsilateral waveforms (i.e., a 0.34-μV difference). We performed an ANOVA with the factors Contralaterality (ipsilateral vs. contralateral to the memory match) and Electrode Site (O1/2 vs. OL/R vs. T5/6). We found significant effects of electrode site [F(2, 24) = 4.55, MSE = 4.73, p < .05] and contralaterality [F(1, 12) = 5.82, MSE = 0.38, p < .05], indicating that participants were attending to the memory-matching distractor.

In Experiment 1B, participants were highly accurate on the search task (99% correct) and the memory task (96% correct, including no response recorded on 3% of the trials). As is shown in Fig. 2C, the task-relevant memory-matching item elicited a large N2pc (i.e., 1.30 μV). An ANOVA with the factors Contralaterality and Electrode showed significant effects of contralaterality [F(1, 12) = 31.84, MSE = 1.03, p < .001] and electrode site [F(2, 24) = 8.47, MSE = 3.85, p < .05], as well as a Contralaterality × Electrode interaction [F(2, 24) = 9.46, MSE = 0.05, p < .01], due to N2pc amplitudes being largest at OL/R, followed by T5/6 and then O1/2. A planned comparison of the N2pcs elicited by memory matches in Experiment 1A relative to Experiment 1B confirmed that the N2pc to the memory matches was significantly larger when the match was task-relevant rather than task-irrelevant [t(12) = 4.08, p < .01].

Finally, we performed a 25% fractional-area latency measure to contrast the speeds of orienting to the tilted-line target in Experiment 1A and the memory-matching search target in Experiment 1B. As is shown in Fig. 2D, the 25% fractional-area latency occurred at 250 ms when participants were orienting to the tilted-line search target and at 228 ms when participants were orienting to the memory-matching search target. A paired-samples t test indicated that the onset of the N2pc was significantly earlier in Experiment 1B than in Experiment 1A [t(12) = 3.84, p < .01].

Discussion

The memory-matching distractors did significantly modulate the amplitude of the N2pc to the search target, replicating the pattern reported by Kumar et al. (2009). In addition, the significant N2pc elicited by the memory match throughout Experiment 1 showed that the memory-matching item was attended even when it was a distractor. In Experiment 1B, the task-relevant memory matches elicited a robust N2pc, approaching 4 times larger than the N2pc to the memory-matching distractor in Experiment 1A.Footnote 1 This pattern replicated the goal-dependent influence of WM on the deployment of attention, as measured using the N2pcs described in Carlisle and Woodman (2011b). As we will discuss next, this does not mean that the bottom-up characteristics of the stimuli were not important. Indeed, the present finding of a significant N2pc to memory-matching items indicates that a highly salient memory-matching input may be necessary in order to observe attentional biases to such items.

Our findings provide a reconciliation of seemingly contradictory findings in the literature. It appears that the stimuli used in the study of Kumar et al. (2009) and in most of the experiments from their group (e.g., Soto et al., 2005; 2008) do bias attention to memory-matching items more strongly than do the stimuli in Carlisle and Woodman (2011b). This is not surprising when we consider the physical characteristics of these stimuli. The memory and search stimuli used here and in Kumar et al.’s study consisted of a larger memory-matching item (i.e., a large colored shape outline) and a less salient target feature (i.e., a small, oriented line). Our fractional-area latency measures confirmed that participants could orient more quickly to the memory item in Experiment 1B than to the oriented line target in Experiment 1A. In Carlisle and Woodman (2011b)—and in many of the previous reports in which memory-matching items had no effect, or even facilitated visual search performance—the features that defined the search target and the memory match were similarly salient (the same size, shape, average luminance, etc.; see Downing & Dodds, 2004; Peters et al., 2009; Woodman & Luck, 2007). The present study demonstrates that a highly salient memory-matching input will have a stronger influence on attention than stimuli that are matched for saliency (see also Olivers, 2009).

In addition to suggesting that the physical stimuli matter, the present study shows that goals matter, too. That is, even with a memory-matching stimulus that was much more salient than the search target, the task relevance of the stimuli modulated the size of the attention effects approximately fourfold. This shows that simply representing a target in WM is not sufficient to control attention at the level of a search template. Instead, top-down control is contingent on higher-level control settings in the brain that can make use of the information in WM when it is consistent with the goals of the task.

The present findings are in line with a growing body of evidence that the relationship between WM and attentional guidance is flexible and dependent on the current task goals (Carlisle & Woodman, 2011a; Han & Kim, 2009; Moher, Abrams, Egeth, Yantis, & Stuphorn, 2011; Peters et al., 2009). The present findings are also consistent with recent behavioral work showing that representations maintained in WM create a weaker attentional influence than does maintaining an attentional set for an item in WM (Olivers & Eimer, 2011). Unfortunately, it seems that the elegantly simple mechanism of top-down control proposed by biased competition (Desimone & Duncan, 1995) cannot explain the variable relationship between WM and attention present in the empirical literature. WM representations can be used to guide attention, but WM maintenance alone does not lead to the same attentional influence during the processing of complex visual scenes as does a goal-related attentional template.