Limits in feature-based attention to multiple colors

Liu, Taosheng; Jigo, Michael

doi:10.3758/s13414-017-1390-x

Limits in feature-based attention to multiple colors

Published: 21 July 2017

Volume 79, pages 2327–2337, (2017)
Cite this article

Download PDF

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

Limits in feature-based attention to multiple colors

Download PDF

Taosheng Liu^1,2 &
Michael Jigo¹

1767 Accesses
14 Citations
Explore all metrics

Abstract

Attention to a feature enhances the sensory representation of that feature. Although much has been learned about the properties of attentional modulation when attending to a single feature, the effectiveness of attending to multiple features is not well understood. We investigated this question in a series of experiments using a color-detection task while varying the number of attended colors in a cueing paradigm. Observers were shown either a single cue, two cues, or no cue (baseline) before detecting a coherent color target. We measured detection threshold by varying the coherence level of the target. Compared to the baseline condition, we found consistent facilitation of detection performance in the one-cue and two-cue conditions, but performance in the two-cue condition was lower than that in the one-cue condition. In the final experiment, we presented a 50% valid cue to emulate the situation in which observers were only able to attend a single color in the two-cue condition, and found equivalent detection thresholds with the standard two-cue condition. These results indicate a limit in attending to two colors and further imply that observers could effectively attend a single color at a time. Such a limit is likely due to an inability to maintain multiple active attentional templates for colors.

Feature-based attention warps the perception of visual features

Article Open access 20 April 2023

We can guide search by a set of colors, but are reluctant to do it

Article 06 November 2018

Flexible weighting of target features based on distractor context

Article 18 November 2019

Visual attention allows us to selectively process a limited set of visual stimuli from the multitude of sensory input. Voluntary attentional selection can be based on spatial locations (Carrasco, 2006; Posner, 1980) and nonspatial features (Egeth & Yantis, 1997; Theeuwes, 2010). Here, we focus on a particular type of nonspatial attention, namely feature-based attention, in which selection is based on specific values within a dimension (e.g., selecting the color red among other colors) without a change in focus of spatial attention (Maunsell & Treue, 2006; Scolari, Ester, & Serences, 2014).

It is now well-established that attending to a feature can enhance its early sensory representations, as shown by a variety of studies employing psychophysical (Boynton, Ciaramitaro, & Arman, 2006; Liu & Hou, 2011; Liu & Mance, 2011; Saenz, Buraĉas, & Boynton, 2003; White & Carrasco, 2011), neurophysiological (Cohen & Maunsell, 2011; Martinez-Trujillo & Treue, 2004) and brain imaging measures (Liu, Larsson, & Carrasco, 2007; Saenz, Buraĉas, & Boynton, 2002). An enhanced feature representation would be useful for other cognitive operations requiring the selection of that feature (e.g., during visual search for a specific feature). This body of work generally tested attention to a single feature, thus leaving open an important question regarding attentional capacity—that is, how many features can be attended simultaneously? Answering this question would deepen our understanding of the mechanisms of attention and have practical implications on optimizing human performance in visually guided tasks.

Importantly, the question on attentional capacity is distinct from questions regarding the capacity to process multiple features (Shiffrin & Gardner, 1972; Townsend, 1990),^{Footnote 1} or the storage capacity in short-term memory (Cowan, 2001). Instead, we focus on attentional templates/attentional sets, which have been theorized to underlie successful visual selection (Duncan & Humphreys, 1992; Folk, Remington, & Johnston, 1992; Wolfe, 2007). Specifically, our question concerns the limits in actively maintaining multiple attentional templates. This question has been addressed in visual search studies where the number of possible targets was varied. For example, Wolfe (2012) found that as the number of possible targets increased, search reaction time also increased (Wolfe, 2012). In particular, searching for two targets lead to lower performance than searching for a single target (Dombrowe, Donk, & Olivers, 2011; Stroud, Menneer, Cave, Donnelly, & Rayner, 2011). These results thus suggest that the number of active attentional templates is severely limited (possibly limited to one). However, other studies have found evidence that there could be multiple (at least two) active attentional templates (Adamo, Pun, & Ferber, 2010; Beck, Hollingworth, & Luck, 2012; Becker, Ravizza, & Peltier, 2015; Irons, Folk, & Remington, 2012; Moore & Weissman, 2010). No apparent consensus has emerged from these studies, likely due to the complex nature of the visual-search task. First, search is inherently spatial as the locus of attention needs to be moved in space. In difficult searches and ones involving eye movements, search is likely serial, making it difficult to infer the number of concurrently active templates. Second, search performance is usually measured by reaction time, which reflects both attentional selection and postselection decisional processes. These factors complicate the interpretation of results in terms of the quality of attentional templates.

To achieve a more mechanistic understanding of the limit in feature-based attention, we used a psychophysical approach to examine the quality of feature representation when the locus of spatial attention is fixed (i.e., nonsearch task). A small number of studies have used this approach to test the limit of feature-based attention to motion directions. Two previous studies used directional cues to direct attention to motion and manipulated the reliability of the cue (Ball & Sekuler, 1981; Herrmann, Heeger, & Carrasco, 2012). A reliable cue indicated a narrow range of possible directions for an upcoming moving target, whereas an unreliable cue indicated a wide range of possible target directions. It was found that performance deteriorated as the cue became less reliable. This implies a limit in feature-based attention in that attention cannot be directed to more directions as effectively as to fewer directions. However, these studies do not provide a precise estimate of the limit of feature-based attention, nor were they designed to achieve such an objective. A recent study by us addressed this question by manipulating the number of discrete directional cues in a motion-detection task (Liu, Becker, & Jigo, 2013). Compared to a baseline neutral condition, performance was improved when observers attended to a single direction as well as when they attended to two orthogonal directions. However, there was a significant performance decrement when attending to two directions compared to attending to a single direction, thus revealing a limitation in our ability to attend to multiple directions.

An important question is whether this previously demonstrated limit is specific to the motion feature, or if it is a general property of feature-based attention. Here we extend this work by investigating attention to colors. A priori, color is an important visual feature and has been shown to be particularly effective in guiding attention (Motter & Belky, 1998; Williams, 1966). In addition, the aforementioned studies of visual search all examined the color feature. Hence, it is important to know whether results obtained for motion direction can be generalized to color. These considerations prompted us to investigate the limit of feature-based attention to color. To directly assess the quality of color representation during feature-based attention, we manipulated the number of color precues and measured the detection threshold of a color target in a psychophysical task. This allowed us to assess changes in the sensitivity to color when observers attended one or two colors.

Experiment 1

In this experiment, we used a two-interval forced-choice (2-IFC) task to assess the behavioral consequence of attending one versus two colors. Observers viewed noisy color stimuli and were instructed to report the temporal interval that contained a coherent color target. Three cueing conditions were employed to manipulate feature-based attention. In the no-cue (baseline) condition, observers were provided with no prior information about the color target. Whereas the one-cue and two-cue conditions contained one and two precues, respectively, that indicated the color target. These cues were always valid, thus prompting observers to attend to the cued colors.

Method

Observers

Six observers (1 male and five female; mean age = 22 years; SD = 3) participated in the experiment and were naïve to its purpose (except one author, M.J.). All observers had normal or corrected-to-normal acuity, and their color vision were assessed with the Dvorine Pseudo-Isochromatic Plates (Dvorine, 1953). Observers gave written informed consent under the study protocol approved by the Institutional Review Board at Michigan State University and were remunerated at a rate of $10/hour (except the author). We based our sample size on our previous study on feature-based attention to motion (Experiment 1 of Liu et al., 2013), which used a similar experimental design and analytical approach. The effect size for comparison between one-cue and no-cue condition in that experiment was 1.88. Assuming that cueing color feature would yield similar effects, we found that a sample size of six would yield a power of .90 given α = .05 for a paired-samples t test (Faul, Erdfelder, Lang, & Buchner, 2007).

Apparatus

Visual stimuli were generated using MGL (http://justingardner.net/mgl), a set of OpenGL libraries running in MATLAB (MathWorks, Natick, MA), and displayed on a 21-in. CRT monitor with a refresh rate of 100 Hz and a resolution of 1024 × 768. Observers rested their heads on a chin rest positioned 68 cm away from the monitor.

Stimuli

Stimuli comprised of static arrays of 240 chromatic dots (size: 0.1°) that were drawn in an annulus (inner radius = 1°, outer radius = 5°) and centered on a central fixation disc (white; size: 0.3°; luminance: 14.8 cd/m²). On each trial, each dot was drawn in one of six isoluminant colors (see Isoluminance Task section) that was selected from a pool of seven colors (red, green, blue, yellow, purple, orange, or cyan) and randomly positioned within the annulus. During no-cue and one-cue conditions, the six colors were randomly selected on each trial. During the two-cue condition, the colors were pseudorandomly selected such that the cued nontarget color was excluded from the dot display. For example, if the target color was red and an observer was cued to “red” and “green”, green dots were not presented on that trial. Observers were cued to the target color by colored discs (cues; size: 0.5°) that preceded the dot displays. Cues were positioned 1.5° to the left or right of fixation.

Color coherence

Color coherence refers to the proportion of dots drawn in a particular color (the target color) relative to the other five colors in the display (note that there were six colors in each dot stimulus). Numerically, coherence was defined by the following equation:

$$ color\; coherence={P}_t-{P}_n $$

(1)

where P_t is the proportion of dots drawn in the target color and P_n is the proportion of dots drawn in the other five colors (noise) with the following constraint:

$$ {P}_n=\frac{1-{P}_t}{5}\cdot $$

(2)

This ensured that the noise colors were equally proportioned after accounting for the target color. Displays with zero color coherence had an equal number of dots for each of the six colors (i.e., 40 dots per color) whereas in nonzero coherence displays, a disproportionately large number of dots were drawn in the target color. This measure of coherence is a color analog of motion coherence implemented in the classic random-dot motion stimulus (Newsome & Pare, 1988) and has been used in our previous study (Wang, Miller, & Liu, 2015).

Procedure

Isoluminance task

Prior to participating in the experiment, observers equated the perceived brightness across all seven colors with heterochromatic flicker photometry (Kaiser, 1991; B. B. Lee, Martin, & Valberg, 1988). Observers viewed gray (luminance: 6.3 cd/m²) and chromatic square tiles (size: 1.8° × 1.8°) that were arranged in a checkerboard pattern and constrained within an annulus (inner radius = 1.5°; outer radius = 6°) that was centered on a central fixation cross (white; size: 0.5°; luminance: 21.1 cd/m²). The gray and chromatic tiles flickered at 8 Hz in a counterphase fashion, and observers adjusted the luminance of the chromatic tiles until the flicker was minimized. The resulting luminance was an estimate of the color’s isoluminance value relative to the constant gray. Thresholds for each of the seven colors were obtained in separate blocks of four trials and the average value across the four trials served as the final luminance value for that color in the attention experiment.

Attention task

Observers performed a 2-IFC task (see Fig. 1a) at six fixed levels of color coherence. At trial onset, the fixation disc dimmed for 0.5 s (luminance: 4.2 cd/m²) to signal observers of the upcoming stimuli. During cued blocks (one-cue or two-cue), one or two cues appeared in this interval. During one-cue blocks, the cue was always drawn in the target color. During two-cue blocks, one cue was drawn in the target color while the other was drawn in the color that was absent from the display (see Stimuli section). A 0.7-s fixation period followed the cue interval.

After the fixation period, two intervals of chromatic stimuli were displayed. Each interval lasted 0.1 s and was separated by a 0.7-s fixation period. One interval contained a zero-coherence stimulus while the other contained a color target at one of six possible coherence levels (0.025, 0.05, 0.1, 0.15, 0.2, or 0.4; see Fig. 1b). These coherence values were chosen because they met the criteria of producing numbers of target and noise dots that were integers that summed to 240 total dots and adequately sampling the range of the psychometric function, based on pilot data. Following the second chromatic stimulus, observers reported the interval that contained the coherent color target by using 1 or 2 on the keyboard’s numeric keypad for the first and second interval, respectively. We explicitly instructed observers to report the interval that they perceived to contain a dominant color (i.e., a color that was disproportionately represented). An intertrial interval that varied between 1 and 1.5 s followed the observer’s response.

The task was performed in blocks of 48 trials with cue condition (no-cue, one-cue, and two-cue) held constant in each block. Within a block, target color, coherence level, and the location of the target-colored cue (left or right of fixation) were randomized. Each observer performed 14 blocks (672 trials) of each cue condition with their order pseudorandomized such that each occurred once every three blocks. The experiment spanned 2 hour-long sessions that were completed on separate days.

Training

Prior to the main experiment, observers familiarized themselves with the task in a separate practice session. In this session, observers performed blocks of each cueing condition until their performance increased monotonically as a function of color coherence. On average, observers performed 1.7 blocks of each cueing condition (five blocks total; SD = 3). The practice session always took place on a different day.

Analysis

For each observer, performance was assessed separately for each cueing condition and fit with a Weibull function:

$$ P(c)=\gamma +\left(1-\gamma -\lambda \right)\cdot \left(1-{e}^{-{\left(\frac{c}{\alpha}\right)}^{\beta }}\right) $$

(3)

where P(c) represents performance as a function of color coherence, γ is the lower asymptote, λ is the deviation from one at the upper asymptote, c is color coherence, α is the range of the Weibull function, and β is its slope. The function was fit using maximum-likelihood estimation as implemented in the Palamedes Toolbox (Prins & Kingdon, 2009).

Performance, P(c), was evaluated as the proportion of correct responses. When fitting performance, γ was fixed at 0.5 and λ was constrained between zero and 0.1. Color coherence threshold was evaluated at a proportion correct of 0.75, and planned t tests were conducted between the thresholds for each cueing condition.

Results and discussion

To visualize overall task performance, we fit the aggregate data across observers for each cue (see Fig. 2a). One and two cues improved performance relative to baseline (no cue), as evidenced by a leftward shift of both psychometric functions. To quantify these effects, we fit the Weibull function to individual observer data and obtained threshold estimates for each observer. Group-averaged thresholds are shown in Fig. 2b and individual thresholds were compared with planned t tests (see Fig. 2b). Color coherence thresholds were significantly lower for one-cue, t(5) = 4.2, p < .01, and two-cue conditions, t(5) = 3.5, p < .05, relative to baseline. In addition, the one-cue was lower than the two-cue threshold, t(5) = 4.5, p < .01. We also separated our data, conditioning on whether the target occurred in the first or second interval, and computed separate thresholds for all conditions. A two-way repeated-measures ANOVA, with cue condition (no, one, two) and target interval (first, second) as factors revealed a main effect of cue condition, F(2, 10) = 11.8, p < .01, but no main effect or interaction for target interval (both ps > .1). Therefore, the effect we observed was consistent for targets occurring in both intervals.

The reduced cueing effect in the two-cue condition demonstrates a limit in the ability to attend multiple colors, which is similar to our previous finding on attention to motion directions (Liu et al., 2013). We note that two colors are well within the storage capacity of working memory, which is estimated to be three to four items (Cowan, 2001; Luck & Vogel, 1997). In addition, we also queried observers after the experiment and none reported any confusion about which colors they needed to attend. Thus, the weaker cueing effect cannot be attributed to a failure in memory.

Another important consideration is whether the cue simply reduced decisional uncertainty (Lawrence & Coles, 1954; Shiu & Pashler, 1994), and, in particular, our observed effects could be attributed to a greater uncertainty reduction in one-cue versus two-cue conditions. Here, we highlight that our experimental design minimized such contributions of variable uncertainty reduction across cueing conditions. Importantly, in the two-cue condition, one of the cued colors was the target color while the other color was never presented in either stimulus on that trial. This should have prevented observers from basing their decision on the cued nontarget color. Had we presented both cued colors in the stimuli in the two-cue condition, for example, by presenting both red and green dots when “red” and “green” were cued and red was the target color, the presence of green dots could have caused confusion and biased observers to choose the noise (incorrect) interval. However, because green was never presented in either interval, such uncertainty should have been greatly reduced.

Nevertheless, the 2-IFC task does require an explicit comparison between the two stimuli, and it also requires consistent attentional deployment across both intervals. The temporally extended nature of the task is somewhat atypical in feature cueing studies, which tend to contain a single interval of stimuli. Therefore, we simplified the task demand in the next experiment and assessed whether our results can be generalized to a single-interval detection task. Because the single-interval task does not require comparison between two stimuli, this should further reduce the impact of decisional uncertainty on performance.

Experiment 2

Here, we used a single-interval detection task to examine the limit of attention to colors. Observers viewed a single stimulus whose color coherence varied on a trial-by-trial basis and were instructed to report whether or not a target was present.