Eyetracking and selective attention in category learning

https://doi.org/10.1016/j.cogpsych.2004.11.001Get rights and content

Abstract

An eyetracking version of the classic Shepard, Hovland, and Jenkins (1961) experiment was conducted. Forty years of research has assumed that category learning often involves learning to selectively attend to only those stimulus dimensions useful for classification. We confirmed that participants learned to allocate their attention optimally. We also found that learners tend to fixate all stimulus dimensions early in learning. This result obtained despite evidence that participants were also testing one-dimensional rules during this period. Finally, the restriction of eye movements to only relevant dimensions tended to occur only after errors were largely (or completely) eliminated. We interpret these findings as consistent with multiple-systems theories of learning which maximize information input in order to maximize the number of learning modules involved, and which focus solely on relevant information only after one module has solved the learning problem.

Introduction

Selective attention has played a prominent role in theories of categorization ever since Roger Shepard’s influential work (Shepard, Hovland & Jenkins, 1961) demonstrated that a simple stimulus generalization account of category learning is untenable. The stimulus generalization account took category learning to be a process of simple associations between stimuli and category labels. This account predicted that it should be easy for participants to associate stimuli that shared many features with one category label, and difficult to associate such stimuli with different labels. Unexpectedly, one important determiner of difficulty was the number of stimulus dimensions needed for correct classification. It has been generally accepted that this pattern of results is best understood in terms of learners optimally allocating their selective attention to those dimensions diagnostic of category membership (Medin and Schaffer, 1978, Nosofsky, 1984, Shepard et al., 1961).

Currently, selective attention is an integral component of all major categorization theories. For example, in both exemplar models (Hampton, 1995; Medin and Schaffer, 1978, Nosofsky, 1986) and prototype models (Nosofsky, 1992; Smith & Minda, 1998), selective attention is formalized in terms of the influence, or weight, that different stimulus dimensions have on a classification decision. Rule-based models also implicitly assume the operation of selective attention to those stimulus dimensions referred to by the current hypothesis (i.e., rule) being tested (Smith, Patalano, & Jonides, 1998).

Moreover, in more recent years, these theories have been extended to include the mechanisms by which selective attention changes with learning. One prominent example is Kruschke’s (1992) ALCOVE, a connectionist exemplar model that changes attention weights as a function of error feedback. Another is Nosofsky, Palmeri, and McKinley’s (1994) rule-plus-exception (RULEX) model, which first performs hypothesis (rule) testing on single dimensions, then on multi-dimensional rules and exceptions to those rules if needed.

Despite its prominence in modern categorization theory, however, evidence for the operation of selective attention has always amounted to demonstrations that dimensions vary in their influence on explicit categorization judgments (or same-different judgments, Goldstone, 1994), but not on the operation of selective attention per se (Lamberts, 1998). Accordingly, this study had two main goals. The first was to determine if eyetracking data would support the claim that learners allocate their attention to optimize classification performance. To this end, we replicated the Shepard et al. (1961) category learning experiment with an eyetracker. Specifically, we asked whether Shepard et al.’s claims regarding learners’ reallocation of attention to only those stimulus dimensions relevant to producing correct classification decisions would be directly corroborated by eyetracking data.

To our knowledge, the current work is the first to apply eyetracking to the domain of categorization research. At the outset then, one concern that must be addressed is the interpretation of eye movements as a surrogate measure of attention during category learning. It is of course well known that attention can dissociate from eye gaze under certain circumstances (Posner, 1980). However, in many cases changes in attention are immediately followed by the corresponding eye movements (e.g., Kowler, Anderson, Dosher, & Blaser, 1995), and there is evidence that attention and eye movements are tightly coupled for all but the simplest stimuli (Deubel & Schneider, 1996). Not surprisingly then, eye tracking has proven to be an effective tool in many areas of research, most notably of course reading (Ferreira and Clifton, 1986, Just and Carpenter, 1984, Makie et al., 2002, Rayner, 1998, Tanenhaus et al., 1995) but also language production (Griffin and Bock, 2000, Meyer et al., 1998), scene perception (Biederman et al., 1982, Henderson, 1999, Loftus and Mackworth, 1978), problem solving (Grant and Spivey, 2003, Hegarty and Just, 1993), skill acquisition (Haider & Frensch, 1999), and face perception (Althoff & Cohen, 1999), to name a few. In the current study, we will take the presence of eye fixations to spatially separated stimulus dimensions as a proxy measure of attention to those dimensions, and predict that fixations to dimensions irrelevant to correct classification will cease as a result of classification experience. An important feature of the category learning task is the availability of an overt behavioral measure (the elimination of classification errors) as a source of converging evidence about which aspects of stimuli are being attended. Specifically, learning entails that a participant attend to those stimulus dimensions needed to discriminate members of the categories. Thus, confirmation that learners primarily attend to relevant dimensions will not only corroborate the basic claim of Shepard et al.’s, it will also cross-validate the use of eyetracking as an index of attention in category learning.

The second goal of our study was to use eyetracking data to determine whether the manner in which attention changes during the course of learning was well described by ALCOVE, RULEX, or either model. Of course, these models were not specifically designed to account for eye movements. Nevertheless, eye movement predictions for each can be derived if we assume, on the basis of the research reviewed above, that the mapping between selective attention and eye movements is roughly one-to-one (an assumption we revisit later). For example, according to ALCOVE, learners will generally start off attending to all stimulus dimensions equally (or perhaps in a manner that reflects differences in their perceptual salience), and then gradually shift attention to only relevant dimensions as a result of error feedback. In the experiment which follows, dimensions will be of roughly equal salience, and thus the prediction we derive from ALCOVE is that learners will initially spend an equal amount of time fixating each stimulus dimension. As learning proceeds, fixations to irrelevant dimensions will gradually decrease until they are eliminated altogether.

In contrast, a hypothesis-testing model like RULEX makes very different predictions regarding how selective attention changes during learning. According to RULEX, learners first search for a single-dimension rule that successfully discriminates members of the two categories. Thus, our RULEX-derived prediction is that learners will fixate single dimensions early in learning. When no single-dimension rule is found, learners will fixate multiple dimensions as they attempt to form more complex rules (e.g., conjunctions, disjunctions, etc.), or to memorize exceptions to an imperfect rule. That is, whereas the ALCOVE-derived predictions are that learners will initially fixate all dimensions and then gradually reduce the number fixated to the minimum needed, the RULEX-derived predictions are that they will first fixate one dimension, and then increase the number fixated as needed.

Once again, an important characteristic of the category learning task is the presence of an overt behavioral measure in the form of classification errors that can corroborate any conclusions we reach regarding changes in selective attention on the basis of eye movements. For example, one diagnostic feature of hypothesis-testing models is the all-or-none learning (i.e., the sudden elimination of classification errors) that obtains when a learner discovers a correct single-dimension rule (Bower & Trabasso, 1963). Thus, the RULEX-derived prediction is that the fixations to a single dimension which are supposed to reflect rule application should be closely accompanied by the elimination of classification errors when that dimension is one which can be used to discriminate category members. Similarly, an important characteristic of associationist learning models like ALCOVE is the gradual learning that obtains as a result of the incremental adjustment of connection weights on the basis of error feedback.1 Thus, the ALCOVE-derived prediction is that a gradual shift of eye movements should be accompanied by a gradual decrease of errors. More generally, a close correspondence between error reduction and changes in eye movements will not only provide evidence for one or the other model of learning, it would also validate eyetracking as an effective measure of the changes in selective attention during category learning.

Although we believe our predictions provide a useful initial framework for the evaluation of eye movements in category learning, we acknowledge at the outset that there are a number of reasons to expect something other than a simple one-to-one mapping between eye movements and the construct of “selective attention” as operationalized by categorization models. One reason of course is that eye fixations may be influenced by low-level perceptual characteristics of stimuli which do not necessarily have any bearing on how those items are classified. Another is that participants may attempt to learn more about the categories than just how to classify correctly (e.g., they might try to learn how to predict features given a category label rather than just vice versa). Because effects such as these are not part of category learning per se, they are beyond the purview of models such as ALCOVE or RULEX as currently formulated. Additionally, it is important to note that eye movements most directly measure a learner’s selective attention to spatial locations (on a computer screen), a construct which is theoretically distinct from their selective attention to stimulus dimensions (see, e.g., Logan, 2004, for a discussion). Nevertheless, the application of eyetracking to category learning is new, and thus we believe that for now our (perhaps overly simplistic) predictions provide a useful initial framework for the evaluation of eye movements in the Shepard et al. (1961) category structures. In the general discussion, we will reevaluate the relationship between eye movements and selective attention to stimulus dimensions in light of our experimental results.

Section snippets

The Shepard et al. (1961) study

Shepard et al. (1961) constructed stimuli with three binary-valued dimensions, resulting in eight stimuli split into two categories. There were six unique divisions of stimuli into categories, four of which are shown in Fig. 1A. Here, the dimensions have been arbitrarily instantiated by shape, color, and size.

Type I is the most basic category structure, requiring information from a single dimension for classification (the shape dimension in Fig. 1A). The Type II structure is an exclusive-or

Participants

A total of 72 New York University undergraduates were randomly assigned to one of the four category structures.

Materials

The characters which composed the stimuli ($ and ¢, ? and !, and x and o) were presented in a light gray (RGB: 128, 128, 128) and within ∼1/2 by ∼1 degree of visual angle. The three symbols were situated ∼20° apart on the CRT at ∼12° eccentricity, forming an equilateral triangle. An example stimulus is presented in Fig. 2. The assignment of physical dimensions and location to the

Results

We first set out to establish that we replicated the basic ordering of problem difficulty found by Shepard et al. (1961). The number of participants out of 18 that reached the learning criterion of four perfect blocks in a row was 18, 18, 16, and 10 for Types I, II, IV, and VI, respectively. We also analyzed the number of blocks to criterion (assuming, conservatively, that nonlearners would have reached perfect performance by block 29). The average number of blocks to criterion was 7.1, 14.1,

Discussion

Since Shepard et al.’s (1961) seminal study a core assumption of categorization theory has been that category learning involves learning to attend to those stimulus dimensions necessary for category discrimination. However, evidence for this claim has consisted of demonstrations that dimensions vary in their influence on explicit categorization (and similarity) judgments, not on the operation of selective attention per se. Our findings provide strong support for the claim that categorizers

References (64)

  • B.H. Ross et al.

    Reminding-based category learning

    Cognitive Psychology

    (1990)
  • E.E. Smith et al.

    Alternative strategies of categorization

    Cognition

    (1998)
  • S.W. Allen et al.

    Specializing the operation of the explicit rule

    Journal of Experimental Psychology: General

    (1991)
  • R.R. Althoff et al.

    Eye-movement-based memory effect: A reprocessing effect in face perception

    Journal of Experimental Psychology: Learning, Memory, and Cognition

    (1999)
  • J.R. Anderson

    The architecture of cognition

    (1983)
  • J.R. Anderson

    The adaptive nature of human categorization

    Psychological Review

    (1991)
  • F.G. Ashby et al.

    A neuropsychological model of multiple systems in category learning

    Psychological Review

    (1998)
  • F.G. Ashby et al.

    Decision rules in the perception and categorization of multi-dimensional stimuli

    Journal of Experimental Psychology: Learning, Memory, and Cognition

    (1988)
  • G.H. Bower et al.

    Theories of learning

    (1981)
  • G.H. Bower et al.

    Reversals prior to solution in concept identification

    Journal of Experimental Psychology

    (1963)
  • J.E. Corter et al.

    Explaining basic categories: Feature predictability and information

    Psychological Bulletin

    (1992)
  • M.A. Erickson et al.

    Rules and exemplars in category learning

    Journal of Experimental Psychology: General

    (1998)
  • H. Haider et al.

    Eye movement during skill acquisition: More evidence for the information-reduction hypothesis

    Journal of Experimental Psychology: Learning, Memory, and Cognition

    (1999)
  • J.M. Henderson

    The effects of semantic consistency on eye movements during complex scene viewing

    Journal of Experimental Psychology: Human Perception and Performance

    (1999)
  • E.J. Gibson

    A systematic application of the concepts of generalization and differentiation to verbal learning

    Psychological Review

    (1940)
  • R.L. Goldstone

    Influences of categorization on perceptual discrimination

    Journal of Experimental Psychology: General

    (1994)
  • E.R. Grant et al.

    Eye movements and problem solving: Guiding attention guides thoughts

    Psychological Science

    (2003)
  • Z. Griffin et al.

    What the eyes say about speaking

    Psychological Science

    (2000)
  • M. Hegarty et al.

    Constructing mental models of machines from text and diagrams

    Journal of Experimenal Psychology : Learning, Memory, and Cognition

    (1993)
  • M.A. Just et al.

    Using eye fixations to study reading comprehension

  • A.S. Kaplan et al.

    The acquisition of category structure in unsupervised learning

    Memory & Cognition

    (1999)
  • A.S. Kaplan et al.

    Category learning with minimal prior knowledge

    Journal of Experimental Psychology: Learning, Memory, and Cognition

    (2000)
  • Cited by (249)

    • The quest for simplicity in human learning: Identifying the constraints on attention

      2022, Cognitive Psychology
      Citation Excerpt :

      Importantly, our analysis is the first to use both choice and eye tracking data to quantitatively adjudicate between different theories of attention during learning. Although other researchers have used eye tracking data to provide qualitative evidence for selective attention mechanisms in other contexts (Rehder & Hoffman, 2005a, 2005b), we determined that additional quantified insights from model comparison was necessary for inferring computational goals from eye tracking data in the current investigation. To gain insight into the strategic and capacity-related constraints that humans engage during learning, we conducted an investigation in three parts.

    View all citing articles on Scopus

    We thank John K. Kruschke, Bradley C. Love, Gregory L. Murphy, Robert M. Nosofsky, and Jonathon Nelson for their comments on a previous version of this manuscript.

    View full text