Eyetracking and selective attention in category learning☆
Introduction
Selective attention has played a prominent role in theories of categorization ever since Roger Shepard’s influential work (Shepard, Hovland & Jenkins, 1961) demonstrated that a simple stimulus generalization account of category learning is untenable. The stimulus generalization account took category learning to be a process of simple associations between stimuli and category labels. This account predicted that it should be easy for participants to associate stimuli that shared many features with one category label, and difficult to associate such stimuli with different labels. Unexpectedly, one important determiner of difficulty was the number of stimulus dimensions needed for correct classification. It has been generally accepted that this pattern of results is best understood in terms of learners optimally allocating their selective attention to those dimensions diagnostic of category membership (Medin and Schaffer, 1978, Nosofsky, 1984, Shepard et al., 1961).
Currently, selective attention is an integral component of all major categorization theories. For example, in both exemplar models (Hampton, 1995; Medin and Schaffer, 1978, Nosofsky, 1986) and prototype models (Nosofsky, 1992; Smith & Minda, 1998), selective attention is formalized in terms of the influence, or weight, that different stimulus dimensions have on a classification decision. Rule-based models also implicitly assume the operation of selective attention to those stimulus dimensions referred to by the current hypothesis (i.e., rule) being tested (Smith, Patalano, & Jonides, 1998).
Moreover, in more recent years, these theories have been extended to include the mechanisms by which selective attention changes with learning. One prominent example is Kruschke’s (1992) ALCOVE, a connectionist exemplar model that changes attention weights as a function of error feedback. Another is Nosofsky, Palmeri, and McKinley’s (1994) rule-plus-exception (RULEX) model, which first performs hypothesis (rule) testing on single dimensions, then on multi-dimensional rules and exceptions to those rules if needed.
Despite its prominence in modern categorization theory, however, evidence for the operation of selective attention has always amounted to demonstrations that dimensions vary in their influence on explicit categorization judgments (or same-different judgments, Goldstone, 1994), but not on the operation of selective attention per se (Lamberts, 1998). Accordingly, this study had two main goals. The first was to determine if eyetracking data would support the claim that learners allocate their attention to optimize classification performance. To this end, we replicated the Shepard et al. (1961) category learning experiment with an eyetracker. Specifically, we asked whether Shepard et al.’s claims regarding learners’ reallocation of attention to only those stimulus dimensions relevant to producing correct classification decisions would be directly corroborated by eyetracking data.
To our knowledge, the current work is the first to apply eyetracking to the domain of categorization research. At the outset then, one concern that must be addressed is the interpretation of eye movements as a surrogate measure of attention during category learning. It is of course well known that attention can dissociate from eye gaze under certain circumstances (Posner, 1980). However, in many cases changes in attention are immediately followed by the corresponding eye movements (e.g., Kowler, Anderson, Dosher, & Blaser, 1995), and there is evidence that attention and eye movements are tightly coupled for all but the simplest stimuli (Deubel & Schneider, 1996). Not surprisingly then, eye tracking has proven to be an effective tool in many areas of research, most notably of course reading (Ferreira and Clifton, 1986, Just and Carpenter, 1984, Makie et al., 2002, Rayner, 1998, Tanenhaus et al., 1995) but also language production (Griffin and Bock, 2000, Meyer et al., 1998), scene perception (Biederman et al., 1982, Henderson, 1999, Loftus and Mackworth, 1978), problem solving (Grant and Spivey, 2003, Hegarty and Just, 1993), skill acquisition (Haider & Frensch, 1999), and face perception (Althoff & Cohen, 1999), to name a few. In the current study, we will take the presence of eye fixations to spatially separated stimulus dimensions as a proxy measure of attention to those dimensions, and predict that fixations to dimensions irrelevant to correct classification will cease as a result of classification experience. An important feature of the category learning task is the availability of an overt behavioral measure (the elimination of classification errors) as a source of converging evidence about which aspects of stimuli are being attended. Specifically, learning entails that a participant attend to those stimulus dimensions needed to discriminate members of the categories. Thus, confirmation that learners primarily attend to relevant dimensions will not only corroborate the basic claim of Shepard et al.’s, it will also cross-validate the use of eyetracking as an index of attention in category learning.
The second goal of our study was to use eyetracking data to determine whether the manner in which attention changes during the course of learning was well described by ALCOVE, RULEX, or either model. Of course, these models were not specifically designed to account for eye movements. Nevertheless, eye movement predictions for each can be derived if we assume, on the basis of the research reviewed above, that the mapping between selective attention and eye movements is roughly one-to-one (an assumption we revisit later). For example, according to ALCOVE, learners will generally start off attending to all stimulus dimensions equally (or perhaps in a manner that reflects differences in their perceptual salience), and then gradually shift attention to only relevant dimensions as a result of error feedback. In the experiment which follows, dimensions will be of roughly equal salience, and thus the prediction we derive from ALCOVE is that learners will initially spend an equal amount of time fixating each stimulus dimension. As learning proceeds, fixations to irrelevant dimensions will gradually decrease until they are eliminated altogether.
In contrast, a hypothesis-testing model like RULEX makes very different predictions regarding how selective attention changes during learning. According to RULEX, learners first search for a single-dimension rule that successfully discriminates members of the two categories. Thus, our RULEX-derived prediction is that learners will fixate single dimensions early in learning. When no single-dimension rule is found, learners will fixate multiple dimensions as they attempt to form more complex rules (e.g., conjunctions, disjunctions, etc.), or to memorize exceptions to an imperfect rule. That is, whereas the ALCOVE-derived predictions are that learners will initially fixate all dimensions and then gradually reduce the number fixated to the minimum needed, the RULEX-derived predictions are that they will first fixate one dimension, and then increase the number fixated as needed.
Once again, an important characteristic of the category learning task is the presence of an overt behavioral measure in the form of classification errors that can corroborate any conclusions we reach regarding changes in selective attention on the basis of eye movements. For example, one diagnostic feature of hypothesis-testing models is the all-or-none learning (i.e., the sudden elimination of classification errors) that obtains when a learner discovers a correct single-dimension rule (Bower & Trabasso, 1963). Thus, the RULEX-derived prediction is that the fixations to a single dimension which are supposed to reflect rule application should be closely accompanied by the elimination of classification errors when that dimension is one which can be used to discriminate category members. Similarly, an important characteristic of associationist learning models like ALCOVE is the gradual learning that obtains as a result of the incremental adjustment of connection weights on the basis of error feedback.1 Thus, the ALCOVE-derived prediction is that a gradual shift of eye movements should be accompanied by a gradual decrease of errors. More generally, a close correspondence between error reduction and changes in eye movements will not only provide evidence for one or the other model of learning, it would also validate eyetracking as an effective measure of the changes in selective attention during category learning.
Although we believe our predictions provide a useful initial framework for the evaluation of eye movements in category learning, we acknowledge at the outset that there are a number of reasons to expect something other than a simple one-to-one mapping between eye movements and the construct of “selective attention” as operationalized by categorization models. One reason of course is that eye fixations may be influenced by low-level perceptual characteristics of stimuli which do not necessarily have any bearing on how those items are classified. Another is that participants may attempt to learn more about the categories than just how to classify correctly (e.g., they might try to learn how to predict features given a category label rather than just vice versa). Because effects such as these are not part of category learning per se, they are beyond the purview of models such as ALCOVE or RULEX as currently formulated. Additionally, it is important to note that eye movements most directly measure a learner’s selective attention to spatial locations (on a computer screen), a construct which is theoretically distinct from their selective attention to stimulus dimensions (see, e.g., Logan, 2004, for a discussion). Nevertheless, the application of eyetracking to category learning is new, and thus we believe that for now our (perhaps overly simplistic) predictions provide a useful initial framework for the evaluation of eye movements in the Shepard et al. (1961) category structures. In the general discussion, we will reevaluate the relationship between eye movements and selective attention to stimulus dimensions in light of our experimental results.
Section snippets
The Shepard et al. (1961) study
Shepard et al. (1961) constructed stimuli with three binary-valued dimensions, resulting in eight stimuli split into two categories. There were six unique divisions of stimuli into categories, four of which are shown in Fig. 1A. Here, the dimensions have been arbitrarily instantiated by shape, color, and size.
Type I is the most basic category structure, requiring information from a single dimension for classification (the shape dimension in Fig. 1A). The Type II structure is an exclusive-or
Participants
A total of 72 New York University undergraduates were randomly assigned to one of the four category structures.
Materials
The characters which composed the stimuli ($ and ¢, ? and !, and x and o) were presented in a light gray (RGB: 128, 128, 128) and within ∼1/2 by ∼1 degree of visual angle. The three symbols were situated ∼20° apart on the CRT at ∼12° eccentricity, forming an equilateral triangle. An example stimulus is presented in Fig. 2. The assignment of physical dimensions and location to the
Results
We first set out to establish that we replicated the basic ordering of problem difficulty found by Shepard et al. (1961). The number of participants out of 18 that reached the learning criterion of four perfect blocks in a row was 18, 18, 16, and 10 for Types I, II, IV, and VI, respectively. We also analyzed the number of blocks to criterion (assuming, conservatively, that nonlearners would have reached perfect performance by block 29). The average number of blocks to criterion was 7.1, 14.1,
Discussion
Since Shepard et al.’s (1961) seminal study a core assumption of categorization theory has been that category learning involves learning to attend to those stimulus dimensions necessary for category discrimination. However, evidence for this claim has consisted of demonstrations that dimensions vary in their influence on explicit categorization (and similarity) judgments, not on the operation of selective attention per se. Our findings provide strong support for the claim that categorizers
References (64)
- et al.
A two-stage model of category construction
Cognitive Science
(1992) - et al.
A general learning theory and its applications to schema abstraction
- et al.
Scene perception: Detecting and judging objects undergoing relational violations
Cognitive Psychology
(1982) - et al.
Contextual cueing: Implicit learning and memory of visual context guides spatial attention
Cognitive Psychology
(1998) - et al.
Saccade target selection and object recognition: Evidence for a common attentional mechanism
Vision Research
(1996) - et al.
The independence of syntactic processing
Journal of Memory and Language
(1986) - et al.
The role of attention in the programming of saccades
Vision Research
(1995) Toward a unified model of attention in associative learning
Journal of Mathematical Psychology
(2001)- et al.
Family resemblance, conceptual cohesiveness, and category construction
Cognitive Psychology
(1987) - et al.
Viewing and naming objects: Eye movements during noun phrase production
Cognition
(1998)
Reminding-based category learning
Cognitive Psychology
Alternative strategies of categorization
Cognition
Specializing the operation of the explicit rule
Journal of Experimental Psychology: General
Eye-movement-based memory effect: A reprocessing effect in face perception
Journal of Experimental Psychology: Learning, Memory, and Cognition
The architecture of cognition
The adaptive nature of human categorization
Psychological Review
A neuropsychological model of multiple systems in category learning
Psychological Review
Decision rules in the perception and categorization of multi-dimensional stimuli
Journal of Experimental Psychology: Learning, Memory, and Cognition
Theories of learning
Reversals prior to solution in concept identification
Journal of Experimental Psychology
Explaining basic categories: Feature predictability and information
Psychological Bulletin
Rules and exemplars in category learning
Journal of Experimental Psychology: General
Eye movement during skill acquisition: More evidence for the information-reduction hypothesis
Journal of Experimental Psychology: Learning, Memory, and Cognition
The effects of semantic consistency on eye movements during complex scene viewing
Journal of Experimental Psychology: Human Perception and Performance
A systematic application of the concepts of generalization and differentiation to verbal learning
Psychological Review
Influences of categorization on perceptual discrimination
Journal of Experimental Psychology: General
Eye movements and problem solving: Guiding attention guides thoughts
Psychological Science
What the eyes say about speaking
Psychological Science
Constructing mental models of machines from text and diagrams
Journal of Experimenal Psychology : Learning, Memory, and Cognition
Using eye fixations to study reading comprehension
The acquisition of category structure in unsupervised learning
Memory & Cognition
Category learning with minimal prior knowledge
Journal of Experimental Psychology: Learning, Memory, and Cognition
Cited by (249)
A dynamic, relational approach to B2B customer experience: A customer-centric perspective from a longitudinal investigation
2024, Journal of Business ResearchCognition: The power of simple associative learning
2023, Current BiologyCategory learning is shaped by the multifaceted development of selective attention
2023, Journal of Experimental Child PsychologyLooking fear in the face: Adults but not adolescents gaze at social threat during observational learning
2022, International Journal of PsychophysiologyThe quest for simplicity in human learning: Identifying the constraints on attention
2022, Cognitive PsychologyCitation Excerpt :Importantly, our analysis is the first to use both choice and eye tracking data to quantitatively adjudicate between different theories of attention during learning. Although other researchers have used eye tracking data to provide qualitative evidence for selective attention mechanisms in other contexts (Rehder & Hoffman, 2005a, 2005b), we determined that additional quantified insights from model comparison was necessary for inferring computational goals from eye tracking data in the current investigation. To gain insight into the strategic and capacity-related constraints that humans engage during learning, we conducted an investigation in three parts.
The Role of Attention in Category Representation
2024, Cognitive Science
- ☆
We thank John K. Kruschke, Bradley C. Love, Gregory L. Murphy, Robert M. Nosofsky, and Jonathon Nelson for their comments on a previous version of this manuscript.