The impact of category structure and training methodology on learning and generalizing within-category representations

Ell, Shawn W.; Smith, David B.; Peralta, Gabriela; Hélie, Sébastien

doi:10.3758/s13414-017-1345-2


Published: 05 June 2017

Volume 79, pages 1777–1794, (2017)

Attention, Perception, & Psychophysics


Abstract

When interacting with categories, representations focused on within-category relationships are often learned, but the conditions promoting within-category representations and their generalizability are unclear. We report the results of three experiments investigating the impact of category structure and training methodology on the learning and generalization of within-category representations (i.e., correlational structure). Participants were trained on either rule-based or information-integration structures using classification (Is the stimulus a member of Category A or Category B?), concept (e.g., Is the stimulus a member of Category A, Yes or No?), or inference (infer the missing component of the stimulus from a given category) and then tested on either an inference task (Experiments 1 and 2) or a classification task (Experiment 3). For the information-integration structure, within-category representations were consistently learned, could be generalized to novel stimuli, and could be generalized to support inference at test. For the rule-based structure, extended inference training resulted in generalization to novel stimuli (Experiment 2) and inference training resulted in generalization to classification (Experiment 3). These data help to clarify the conditions under which within-category representations can be learned. Moreover, these results make an important contribution in highlighting the impact of category structure and training methodology on the generalization of categorical knowledge.


The ability to learn categorical representations is foundational for cognition. Categories enable the navigation of familiar situations with increasing efficiency and can also be generalized to facilitate function in novel situations. Not surprisingly, much research has been dedicated to understanding categorical representations and how they are learned. This research has been fertile ground for a vigorous and healthy debate regarding the nature of category representations. Throughout this debate, most research groups advocating for one theory or another have tended to focus on a single paradigm, suggesting that some theoretical disagreements may be driven by methodological differences. This article investigates the impact of two methodological factors on the learning of category representations that focus on within-category similarities. Namely, how do variability in the structure of the categories and the training methodology that dictates how participants interact with the to-be-learned information impact within-category representations? Furthermore, once learned, what are some of the limits on the generalization of within-category representations?

Category representations

Category representations are largely dependent upon the goal of the task (Goldstone, 1996; Hoffman & Rehder, 2010; Markman & Ross, 2003; Minda & Ross, 2004; Yamauchi & Markman, 1998). For instance, in the typical category learning experiment, participants are presented with stimuli (each drawn from one of a number of contrasting categories) and instructed to make a decision about the category membership of each stimulus. Such classification instructions have often been argued to lead to the development of a representation that focuses on between-category differences (e.g., learn what dimensions are relevant for classification, along with decision criteria or category boundaries; Ashby, Alfonso-Reese, Turken, & Waldron, 1998; Erickson & Kruschke, 1998; Maddox & Ashby, 1993; Nosofsky, Palmeri, & McKinley, 1994; Smith & Minda, 2002). In a slightly different paradigm, participants are presented with a subset of the stimulus features as well as a category label and instructed to infer the missing feature. Such inference instructions lead to the development of a category representation that focuses on within-category similarities (e.g., the correlational structure of the stimulus dimensions; Chin-Parker & Ross, 2002; Markman & Ross, 2003). Thus, the goal of classifying the stimuli into one of a number of contrasting categories may lead to a between-category representation, whereas the goal of inferring missing information for stimuli from a known category may lead to a within-category representation.

Task goal is clearly an important factor, but it is not the only factor in producing within-category representations. For instance, observational training (Carvalho & Goldstone, 2015; Levering & Kurtz, 2015), training emphasizing the comparison of members from the same category (Hammer, Diesendruck, Weinshall, & Hochstein, 2009), and blocked training (Carvalho & Goldstone, 2014; Goldstone, 1996) can promote within-category representations. Another factor that is investigated in the present article involves a seemingly minor tweak of the typical classification instructions to emphasize concept learning called the yes/no task (i.e., participants learn categories by classifying stimuli as a member/nonmember of a target category; Maddox, Bohil, & Ing, 2004; Posner & Keele, 1968; Reber, 1998; Smith & Minda, 2002; Zeithamova, Maddox, & Schnyer, 2008). Both classification and concept training are active tasks and have the goal of classification on a trial-by-trial basis. Concept training, however, has been argued to shift the emphasis from between-category differences to within-category similarities (Casale & Ashby, 2008; Hélie, Shamloo, & Ell, 2017).

The very structure of the categories themselves can influence category representations (Ashby et al., 1998; Carvalho & Goldstone, 2014). Consider, for example, the distinction between rule-based (RB) and information-integration (II) category structures (Ashby & Ell, 2001). RB structures can be learned using logical rules. Although logical rules can be based on either within- or between-category representations (e.g., large or larger than), the subset of logical rules learned with RB structures tends to depend upon between-category representations (Casale, Roeder, & Ashby, 2012; Ell & Ashby, 2012; Ell, Ing, & Maddox, 2009; Hélie et al., 2017). In contrast, II structures are those in which information from multiple dimensions needs to be integrated prior to making a categorization response. Unlike RB structures, II structures generally promote within-category representations (Ashby & Waldron, 1999; Hélie et al., 2017; Thomas, 1998). Again, even when classification is the goal, RB structures would be expected to promote between-category representations, whereas II structures would be expected to promote within-category representations. Neurocomputational models that have been applied to RB and II structures implicitly echo this between- versus within-category distinction (Ashby et al., 1998; Ashby & Crossley, 2011).

Utility of within- and between-category representations

Categorical representations in and of themselves have little value. Rather, it is the efficiencies afforded by categories that are a better measure of their cognitive utility (Hoffman & Rehder, 2010; Markman & Ross, 2003). Category representations can facilitate interactions with category members (e.g., Rosch & Mervis, 1975). Arguably more important is that category representations can also facilitate interactions with novel stimuli. Indeed, the field has a well-established tradition of probing the extent to which learned category representations can support the classification of novel stimuli (e.g., Smith & Minda, 1998). Clearly this is an important function of category representations.

Importantly, we argue that the generalizability of category representations depends upon the nature of the representation itself (Carvalho & Goldstone, 2014; Hoffman & Rehder, 2010; Levering & Kurtz, 2015). For instance, between-category representations may be better suited to generalize to novel stimuli that are beyond the range of the previously encountered stimuli (e.g., because the representation is not tied to the stimuli themselves but rather between-category differences; Casale et al., 2012; Hoffman & Rehder, 2010; Maddox, Filoteo, Lauritzen, Connally, & Hejl, 2005). Similarly, within-category representations may be better suited for generalization that would benefit from knowledge of within-category regularities, such as prototypicality or the covariation of stimulus dimensions (Chin-Parker & Ross, 2002, 2004; Yamauchi & Markman, 1998).

The ability to generalize between-category representations, however, may be task dependent. Although knowledge of between-category differences would facilitate classification of novel stimuli, such knowledge is inextricably tied to the goal of classification. When successful generalization depends upon the ability to reconfigure knowledge acquired during training to solve a new decision-making problem, within-category representations would seem to have far greater utility than between-category representations. Indeed, within-category representations can support generalization to novel tasks (Chin-Parker & Ross, 2002). Within-category representations are also better able than between-category representations to support the reconfiguration of categorical knowledge (Hélie et al., 2017; Hoffman & Rehder, 2010).

Formal models of categorization have been successful in accounting for generalization of learned representations to support the classification of novel stimuli. Attempts to test the ability of formal models to account for generalization to novel tasks, however, are not as common (see Maddox & Bogdanov, 2000; Nosofsky & Zaki, 1998; Smith & Minda, 2001, for notable examples). Thus, the approach taken in the current study—investigating generalization to novel stimuli and tasks—will provide an important test bed for the development and testing of formal models.

The current study

Although participants often demonstrate an initial bias toward between-category representations (Ashby, Queller, & Berretty, 1999; Ell & Ashby, 2006; Medin, Wattenmaker, & Hampson, 1987; Smith, Beran, Crossley, Boomer, & Ashby, 2010), within-category representations may be a common outcome of interacting with categories (e.g., Anderson & Fincham, 1996; Hélie et al., 2017; Hoffman & Rehder, 2010; Thomas, 1998). Previous work suggests that numerous methodological factors can promote within-category representations, but there is variability in how within-category representations were measured, if measured at all. For example, some studies used a two-alternative, forced-choice procedure (e.g., Hoffman & Rehder, 2010) while others asked for typicality ratings (e.g., Levering & Kurtz, 2015).

Studies using inference training consistently demonstrate the development of within-category representations but have not given much attention to the impact of category structure. For example, when trained by inference, participants learn the correlational structure of the categories despite such information possibly being irrelevant to category membership (e.g., Chin-Parker & Ross, 2002). Motivated by this work, we employ knowledge of correlational structure as our primary dependent measure of within-category representation and extend this work by considering variability in training methodology and category structure.

Using a transfer task that required the reconfiguration of within-category representations, Hélie and colleagues (2017) showed that learning an II structure resulted in successful transfer with both concept and classification training. In contrast, learning a RB structure resulted in successful transfer with concept training, but not classification training. Although these data are consistent with the claim that within-category representations may be a more common outcome of categorization, this claim would be bolstered by using a more traditional measure of within-category representations (i.e., knowledge of the correlational structure).

A second goal of the current study is to investigate the extent to which within-category representations can be generalized to support performance with novel stimuli and/or novel tasks. Within-category representations developed with inference training appear to be quite versatile and can support generalization to novel tasks (Chin-Parker & Ross, 2002). Although the within-category representations developed with concept and classification training can support knowledge reconfiguration (Hélie et al., 2017), it is unclear if these within-category representations can also support generalization to novel stimuli and tasks. For example, some researchers have demonstrated that within-category correlations can be learned during a classification task (Anderson & Fincham, 1996; Thomas, 1998), whereas others have argued that such demonstrations are a byproduct of simplistic stimuli, overtraining, and/or classification tasks that incorporate additional inference-like training (e.g., Chin-Parker & Ross, 2002).

The current study tests these hypotheses using classification, concept, and inference training methodologies to learn RB and II structures. For classification training, participants were instructed to distinguish between members of contrasting categories (e.g., Is the image a member of Category A or Category B?—hereafter referred to as A/B training). For concept training, participants were instructed to distinguish between category members and nonmembers (e.g., Is the image a member of Category A?—hereafter referred to as YES/NO training; Hélie et al., 2017; Maddox, Bohil, et al., 2004). For inference training, participants were instructed to produce the missing stimulus feature given the category label and another stimulus feature (hereafter referred to as INF training; Chin-Parker & Ross, 2002; Thomas, 1998; Zotov, Jones, & Mewhort, 2011).

In Experiment 1, participants learned RB or II structures that incorporated a correlation between the stimulus dimensions using either A/B, YES/NO, or INF training. Knowledge of the correlation between the stimulus dimensions was subsequently tested using inference. The test phase included stimuli that were consistent with the training categories (allowing for the assessment of within-category representations developed during training) and novel stimuli (allowing for the assessment of generalization of within-category representations beyond the trained stimuli). Importantly, the design also enabled an analysis of the extent to which knowledge could be generalized across methodologies (e.g., from classification to inference).

Following Hélie et al. (2017), we hypothesized that within-category representations would be learned in all but the RB-A/B condition, and that within-category representations could be generalized to support inference across stimuli and methodologies. Experiments 2 and 3 were designed to replicate and extend Experiment 1. Experiment 2 investigated the impact of extended training on the ability to generalize within-category representations. Experiment 3 aimed to investigate if the generalization results were specific to using an inference procedure at test by testing participants on A/B classification rather than inference.

To anticipate, the II structure consistently resulted in within-category representations that could be generalized to novel stimuli and across methodologies (Experiments 1 and 2). The RB structure, however, resulted in within-category representations only when paired with INF training (Experiments 1 and 2). The within-category representations acquired in the RB-INF condition could be generalized to novel stimuli and across methodologies, but only when participants were provided with extended INF training (Experiment 2). Furthermore, generalization across methodologies was asymmetric: the within-category representations acquired with INF training could be generalized to support A/B classification, but only with the RB structure (Experiment 3).

Experiment 1

The goals of Experiment 1 were twofold. First, Experiment 1 investigated the extent to which training methodology and category structure promote the learning of within-category representations. Second, Experiment 1 investigated the ability of within-category representations to support knowledge generalization across stimuli and tasks. Specifically, participants were trained on either RB or II category structures using classification training (A/B), concept training (YES/NO), or inference training (INF). The stimulus dimensions were correlated within each category, thereby allowing the use of knowledge of the correlational structure of the categories as a probe for within-category representations. All participants were subsequently tested using an inference procedure that included exemplars from the training categories as well as novel exemplars. Knowledge of the within-category correlations for training exemplars indexed learning of the within-category representations, whereas knowledge of the within-category correlations for transfer exemplars indexed generalization to novel stimuli. Successful test performance for participants in the A/B and YES/NO conditions provided a measure of generalization across methodologies. It was predicted that all but the RB-A/B condition would evidence within-category representations and that these representations could be generalized across stimuli and methodologies.

Method

Participants and design

In all experiments, a target sample size of approximately 30 participants in each experimental condition was determined a priori (based upon previous experience with similar experiments). Participants (193 total) were recruited from the University of Maine community and received partial course credit for participation. Participants were randomly assigned to one of six experimental conditions in the 2 category structure (RB vs. II) × 3 training methodology (A/B, YES/NO, INF) design. A total of seven participants were excluded from analysis: two because of a software error (RB-INF: 1; II-INF: 1), three because they did not complete the task within the hour-long experimental session (II-A/B: 1; II-YES/NO: 1; II-INF: 1), and two because they were statistical outliers (i.e., more than three standard deviations from the mean on both average training accuracy and accuracy during the final training block; RB-YES/NO: 2). The resulting sample sizes by condition were RB-A/B: 32; RB-YES/NO: 29; RB-INF: 32; II-A/B: 32; II-YES/NO: 30; II-INF: 31. All participants reported normal (20/20) or corrected-to-normal vision. Each participant completed one session of approximately 60 minutes duration.

Stimuli and apparatus

The stimuli in all experiments comprised circles and lines that varied continuously in diameter and orientation, respectively (see Fig. 1). These dimensions were selected in an effort to facilitate the ability of participants to complete the inference task. The training categories were generated using a variation of the randomization technique introduced by Ashby and Gott (1988), in which the stimuli were generated by sampling from bivariate normal distributions defined in a Diameter × Angle (from horizontal) space in arbitrary units. For the II structure, the category means were $ \mu_A=\left[485,-20\right] $ and $ \mu_B=\left[415,40\right] $. For the RB structure, the category means were $ \mu_A=\left[635,-20\right] $ and $ \mu_B=\left[265,40\right] $. The covariance matrix $ \Sigma =\left[\begin{array}{cc}3125& 2875\\ 2875& 3175\end{array}\right] $ (i.e., a correlation of approximately .91 between diameter and angle) was the same for all tasks and categories. Recall that the primary dependent measure of within-category knowledge was the extent to which participants learned the diameter-angle correlation. As a consequence, it was necessary to have a nonzero covariance within each category and to increase the category separation in the RB task in order to allow for a unidimensional rule on diameter to produce optimal accuracy.
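The correlation implied by a covariance matrix is the covariance divided by the product of the two standard deviations. The following is a minimal sketch, using only the Python standard library, that computes this value for Σ and draws one point from the II Category A distribution with a hand-written 2 × 2 Cholesky factor. The function names and the fixed seed are illustrative assumptions; the original stimuli were generated in MATLAB, and this is not the authors' code.

```python
import math
import random

# Covariance matrix and II category means as reported in the text
# (Diameter x Angle space, arbitrary units).
SIGMA = [[3125.0, 2875.0], [2875.0, 3175.0]]
MU_A_II = (485.0, -20.0)
MU_B_II = (415.0, 40.0)


def implied_correlation(cov):
    """Pearson correlation implied by a 2 x 2 covariance matrix."""
    return cov[0][1] / math.sqrt(cov[0][0] * cov[1][1])


def sample_bivariate_normal(mean, cov, rng):
    """Draw one (x, y) pair using the Cholesky factor of a 2 x 2 covariance."""
    l11 = math.sqrt(cov[0][0])
    l21 = cov[1][0] / l11
    l22 = math.sqrt(cov[1][1] - l21 ** 2)
    z1, z2 = rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)
    return (mean[0] + l11 * z1, mean[1] + l21 * z1 + l22 * z2)


rng = random.Random(0)  # hypothetical seed, used only for reproducibility
rho = implied_correlation(SIGMA)
stimulus_a = sample_bivariate_normal(MU_A_II, SIGMA, rng)
print(f"implied correlation: {rho:.3f}")
```

For the reported Σ the implied diameter-angle correlation is about .91.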

On each trial, a random sample (x, y) was drawn from the Category A or B distribution, and these values were used to construct a stimulus consisting of a circle $ \frac{x}{2} $ pixels in diameter and a line oriented at $ \frac{180 y}{800} $ degrees (counterclockwise from horizontal) with a length of 200 pixels. The line was always connected at the circle’s highest point. The scaling factors were selected in an effort to equate the perceived salience of the stimulus dimensions. Eighty stimuli (40 from each category) were generated for each of the four blocks of trials. All stimuli were generated off-line, and a linear transformation was applied to ensure that the sample statistics matched the population parameters. The experiment was run using the Psychophysics Toolbox (Brainard, 1997; Kleiner et al., 2007; Pelli, 1997) in the MATLAB computing environment. Each stimulus was displayed on a 20-inch LCD with 1600 × 1200 pixel resolution at a viewing distance of 20 inches in a dimly lit room.
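The mapping from the arbitrary-unit space to display units can be sketched as a small helper. The function name `to_display_units` is hypothetical; only the two scaling factors (x/2 for diameter, 180y/800 for angle) come from the text.

```python
def to_display_units(x, y):
    """Map a sampled (x, y) point to circle diameter (pixels) and line angle (degrees)."""
    diameter_px = x / 2.0
    angle_deg = 180.0 * y / 800.0
    return diameter_px, angle_deg


# Category A means from the text, converted to display units.
print(to_display_units(485, -20))  # II structure
print(to_display_units(635, -20))  # RB structure
```

Under this mapping the Category B mean angle of 40 units corresponds to 9 degrees counterclockwise from horizontal.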

Two sets of test phase stimuli (112 total) were selected to assess the learning and generalization of the within-category correlations. The training set was selected to approximate the training categories and was used to assess learning of the within-category correlations (red circles in Fig. 1). The transfer set was selected to broadly sample the untrained region of the stimulus space while maintaining the within-category correlation from the training categories and was used to assess generalization of the within-category correlations (blue circles in Fig. 1). The coordinates of the test phase stimuli are presented in Appendix 1.

Consistent with previous work, participants were expected to learn unidimensional rules in the RB task (Ell & Ashby, 2006). Given the large category separation, however, there are many alternative strategies that would also yield perfect performance (e.g., the optimal strategy for the II task). Thus, probe stimuli were included to differentiate between unidimensional and integration strategies (e.g., the solid lines in Fig. 1). A subset (14) of the test stimuli that lie between Category A and B were included as probe stimuli during the final block of training (resulting in a total of 94 trials during the final block). In an effort to increase the similarity between the RB and II conditions, these same probe stimuli were also included during the final block with the II structure. Because the probe stimuli do not aid in the identifiability of the decision strategy used with the II structure, the probe stimuli were excluded from the analysis of the II training data. No feedback was provided for probe trials. The coordinates of the probe stimuli are presented in Appendix 1.

Procedure

Each participant was run individually. At the beginning of the training phase, participants were told that stimuli would comprise a circle with a line connected at the top and that the stimuli would be presented individually but would vary across trials in circle diameter and line angle. In the A/B condition, participants were instructed that their goal was to learn, by trial and error, to distinguish between members of Category A and B. On each trial, a stimulus was presented, and participants were prompted “Is this image a member of Category A or Category B?” and responded by pressing the button labeled “A” or “B” on the keyboard. In the YES/NO condition, participants were instructed that their goal was to learn, by trial and error, if each image is a member of a particular category or not. On each trial, a stimulus was presented and participants were prompted with either “Is this image a member of Category A?” or “Is this image a member of Category B?” (with equal probability) and responded by pressing the button labeled “Yes” or “No” on the keyboard. In the INF condition, participants were instructed that their goal was to learn, by trial and error, to draw the missing stimulus component. Example stimulus displays are shown in Fig. 1. On each trial, a partial stimulus (i.e., line or circle along with the category label) was presented and participants were prompted to draw the missing component—that is, “Draw the circle that goes with this line angle” or “Draw the line angle that goes with this circle” (with equal probability). Participants initially responded by using the mouse to select the location of either the bottom of the circle (indicating the diameter of the circle relative to the dot at the beginning of the line) or the end of the line (indicating the orientation of the line relative to horizontal). 
The circle or line was then drawn by the computer based upon the participant’s selection (lines began at the dot at the top of the circle and had a constant length of 200 pixels). After the missing component was drawn, participants were able to adjust the diameter or angle using the arrow keys, pressing the space bar when satisfied. Any selected stimulus values outside the allowable range were reset to the nearest allowable value (allowable range: diameter, 10 to 600 pixels; angle, −50 to 110 degrees).
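Resetting an out-of-range response to the nearest allowable value amounts to clamping. A minimal sketch follows; the constant and function names are assumptions introduced for illustration, and only the numeric ranges are taken from the text.

```python
# Allowable response ranges reported in the text.
DIAMETER_RANGE = (10.0, 600.0)   # pixels
ANGLE_RANGE = (-50.0, 110.0)     # degrees from horizontal


def clamp(value, bounds):
    """Return value, reset to the nearest bound if it falls outside bounds."""
    low, high = bounds
    return max(low, min(high, value))


print(clamp(650.0, DIAMETER_RANGE))
print(clamp(-60.0, ANGLE_RANGE))
```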

Stimulus presentation was response terminated with an upper limit of 60 s. After responding, feedback was provided. In the A/B and YES/NO conditions, the screen was blanked and the word “CORRECT” (in green, accompanied by a 500 Hz tone) or “WRONG” (in red, accompanied by a 200 Hz tone) was displayed. In the INF condition, the correct circle or line was overlaid upon the participant’s response. In all conditions, feedback duration was 2 s and the screen was then blanked for 1 s prior to the appearance of the next stimulus.

In addition to trial-by-trial feedback, summary feedback was given at the end of each 80-trial block, indicating percentage correct for that block (A/B and YES/NO, participants were informed that higher numbers are better) or the root mean square error between the drawn and correct stimulus components (INF, participants were informed that lower numbers are better). The presentation order of the stimuli was randomized within each block, separately for each participant. Prior to starting the training phase, participants completed several practice trials to familiarize themselves with the task using stimuli randomly sampled (with equal probability) from the training categories.
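The two block-level summary statistics can be sketched as follows. The function names are hypothetical, and the text does not state the units in which the root mean square error was computed, so the inputs below are treated as generic numeric values.

```python
import math


def percent_correct(responses, labels):
    """Percentage of trials on which the response matched the correct label."""
    hits = sum(r == l for r, l in zip(responses, labels))
    return 100.0 * hits / len(labels)


def rmse(drawn, correct):
    """Root mean square error between drawn and correct stimulus components."""
    squared = [(d - c) ** 2 for d, c in zip(drawn, correct)]
    return math.sqrt(sum(squared) / len(squared))


# Toy data, for illustration only.
print(percent_correct(["A", "B", "A", "A"], ["A", "B", "B", "A"]))
print(rmse([3.0, 3.0], [0.0, 0.0]))
```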

During the test phase, all participants performed the inference task (one block of 112 trials). Instruction was provided for all conditions and participants completed several practice trials prior to beginning the test phase using stimuli randomly sampled (with equal probability) from all test phase stimuli. No feedback was provided during the test phase.

Results

Training phase

The dependent measure varied across training methods; thus, the training phase data from the A/B, YES/NO, and INF conditions were analyzed separately. Performance generally improved across blocks for all training methodologies (Fig. 2). A 2 category structure × 4 block mixed ANOVA conducted on the data from the A/B condition revealed significant main effects, structure: $ F\left(1,62\right)=282.14, p<.05,{\upeta}_{\mathrm{p}}^2=.82 $; block: $ F\left(2.62,162.41\right)=27.37, p<.05,{\upeta}_{\mathrm{p}}^2=.31 $, and a significant interaction, $ F\left(2.62,162.41\right)=4.16, p<.05,{\upeta}_{\mathrm{p}}^2=.06 $ (see Footnote 1). To decompose the interaction, a series of pairwise comparisons was conducted within each structure. For the II structure, accuracy increased across the first three blocks (ps < .05), but not from Block 3 to Block 4 (p = .80). For the RB structure, there was no significant block-over-block increase in accuracy (ps > .10), but there was a more general increase, with Block 4 accuracy being higher than Block 1 accuracy (p < .05). These results suggest that with A/B training, there was more consistent improvement across blocks in the II structure, but caution is warranted given a possible ceiling effect in the RB structure.
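Partial eta squared can be recovered from an F ratio and its degrees of freedom as F × df_effect / (F × df_effect + df_error). The short sketch below applies that standard identity to the A/B statistics above as a consistency check; the function name is an assumption.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared computed from an F ratio and its degrees of freedom."""
    return (f_value * df_effect) / (f_value * df_effect + df_error)


# A/B condition effects reported in the text.
eta_structure = partial_eta_squared(282.14, 1, 62)
eta_block = partial_eta_squared(27.37, 2.62, 162.41)
eta_interaction = partial_eta_squared(4.16, 2.62, 162.41)
print(round(eta_structure, 2), round(eta_block, 2), round(eta_interaction, 2))
```

Rounded to two decimals, the three values agree with the reported effect sizes (.82, .31, and .06).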

A 2 category structure × 4 block mixed ANOVA conducted on the data from the YES/NO condition revealed significant main effects, structure: $ F\left(1,57\right)=261.62, p<.05,{\upeta}_{\mathrm{p}}^2=.82 $; block: $ F\left(2.14,122.06\right)=46.97, p<.05,{\upeta}_{\mathrm{p}}^2=.45 $, but the interaction was not significant, $ F\left(2.14,122.06\right)=2.85, p=.06,{\upeta}_{\mathrm{p}}^2=.05 $. Pairwise comparisons indicated that the main effect of block was driven by an increase in accuracy across the first three blocks (ps < .05), but not from Block 3 to Block 4 (p = .63). With YES/NO training, participants in both structures learned, but accuracy was higher in the RB structure.

To analyze the data from the INF condition, the correlation between the presented and produced dimensions was computed separately for each category, then averaged across categories (Fig. 2, right panel). A 2 category structure × 4 block mixed ANOVA indicated a significant effect of block, $ F\left(2.89,176.22\right)=16.26, p<.05,{\upeta}_{\mathrm{p}}^2=.21 $, with consistent improvement across Blocks 1–3 (ps < .05). No other effects were statistically significant, category structure: $ F\left(1,61\right)=3.06, p=.08,\;{\upeta}_{\mathrm{p}}^2=.05 $; Category Structure × Block: $ F\left(2.89,176.22\right)=1.23, p=.3,{\upeta}_{\mathrm{p}}^2=.02 $. In sum, participants in the two inference training conditions evidenced learning of the within-category correlation, although this learning was modest, only being statistically greater than zero in Blocks 2–4 (ps < .05) and asymptoting near a correlation of .2.
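The dependent measure for the INF condition can be sketched as a Pearson correlation computed within each category and then averaged. The trial format and function names below are assumptions, and a simple arithmetic mean of the two correlations is assumed because the text does not describe any other averaging scheme.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def mean_within_category_r(trials):
    """Average the presented-produced correlation across categories.

    trials: iterable of (category, presented_value, produced_value).
    """
    by_category = {}
    for category, presented, produced in trials:
        pair = by_category.setdefault(category, ([], []))
        pair[0].append(presented)
        pair[1].append(produced)
    rs = [pearson_r(p, q) for p, q in by_category.values()]
    return sum(rs) / len(rs)


# Toy data: Category A responses track the presented value, Category B reverse it.
toy_trials = [("A", 1, 2), ("A", 2, 4), ("A", 3, 6),
              ("B", 1, 3), ("B", 2, 2), ("B", 3, 1)]
average_r = mean_within_category_r(toy_trials)
```

In the toy data the two within-category correlations are +1 and −1, so the average is 0, which illustrates why the correlation is computed per category before averaging rather than on the pooled trials.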

Categorization performance in the RB task was expected to be mediated by unidimensional decision strategies (Ell & Ashby, 2006), but given the large separation between the RB categories, a number of qualitatively different decision strategies could have produced high accuracy. In order to confirm that participants were using unidimensional strategies in the RB task, a number of decision-bound models (Ashby, 1992a; Maddox & Ashby, 1993) were fit to the individual participant data from the A/B and YES/NO conditions. Three different types of models were evaluated, each based on a different assumption concerning the participant’s strategy. Rule-based models assume that the participant sets decision criteria on one (or both) stimulus dimensions (e.g., unidimensional model: If the circle is large, respond A; otherwise respond B). Information-integration models assume that the participant integrates the stimulus information from both dimensions prior to making a categorization decision. Finally, random responder models assume that the participant guessed. Each of these models was fit separately to the data from the final block, for each participant, using a standard maximum likelihood procedure for parameter estimation (Ashby, 1992b; Wickens, 1982) and the Bayesian information criterion for goodness of fit (Schwarz, 1978; see Appendix 2 for a more detailed description of the models and fitting procedure).
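The logic of the model comparison can be illustrated with a deliberately simplified sketch: a unidimensional rule on diameter with a criterion and a noise parameter is compared with a guessing model using BIC = k ln(n) − 2 ln(L), where lower values indicate a better fit. Everything specific here is an assumption made for illustration, including the toy data, the grid search used in place of a numerical optimizer, and the exact model parameterization; the models actually fit are described in Appendix 2 of the article.

```python
import math


def phi(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def log_likelihood(prob_a, responses):
    """Sum of log probabilities of the observed 'A'/'B' responses."""
    eps = 1e-9  # keeps log() finite when a predicted probability is 0 or 1
    total = 0.0
    for p, r in zip(prob_a, responses):
        p = min(max(p, eps), 1.0 - eps)
        total += math.log(p if r == "A" else 1.0 - p)
    return total


def bic(log_lik, n_params, n_trials):
    """Bayesian information criterion (lower is better)."""
    return n_params * math.log(n_trials) - 2.0 * log_lik


def fit_unidimensional(diameters, responses, criteria, noise_sds):
    """Grid-search maximum likelihood fit of 'respond A if diameter > criterion'."""
    best = None
    for c in criteria:
        for sd in noise_sds:
            probs = [phi((d - c) / sd) for d in diameters]
            ll = log_likelihood(probs, responses)
            if best is None or ll > best[0]:
                best = (ll, c, sd)
    return best


def fit_random_responder(responses):
    """Guessing model with a fixed response probability of .5 (no free parameters)."""
    return log_likelihood([0.5] * len(responses), responses)


# Hypothetical final-block data: larger circles tend to be called "A".
diameters = [150, 180, 210, 240, 270, 300, 330, 360]
responses = ["B", "B", "B", "A", "B", "A", "A", "A"]

ll_uni, criterion, noise_sd = fit_unidimensional(
    diameters, responses, criteria=range(150, 361, 10), noise_sds=[10, 20, 40, 80])
ll_rand = fit_random_responder(responses)
n_trials = len(responses)
bic_uni = bic(ll_uni, 2, n_trials)    # two free parameters: criterion, noise
bic_rand = bic(ll_rand, 0, n_trials)  # no free parameters
best_model = "unidimensional" if bic_uni < bic_rand else "random responder"
print(best_model)
```

An information-integration model would be added in the same way, with a linear bound in the two-dimensional space and correspondingly more free parameters penalized by the k ln(n) term.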

As expected, most participants in the RB task were best fit by a unidimensional model assuming that participants attended selectively to diameter (A/B: 91%, YES/NO: 86%). Similarly, most participants in the II task were best fit by information-integration models (A/B: 63%, YES/NO: 67%). The results of the model-based analysis indicate that the majority of participants used task-appropriate strategies at the end of training.

Test phase

Correlations between the presented and produced dimensions were computed separately for each cluster of test stimuli in Fig. 1. Preliminary analyses were conducted on the correlations to determine if the data could be safely aggregated across clusters. For test phase data from the two training clusters, a 2 cluster × 2 category structure × 3 training methodology mixed ANOVA did not reveal any significant effects of cluster (main effect and interactions: all $ F < 1, p\ge .38,\;{\upeta}_{\mathrm{p}}^2\le .01 $). Similarly, for test phase data from the six transfer clusters, a 6 cluster × 2 category structure × 3 training methodology mixed ANOVA did not reveal any significant effects of cluster (main effect and interactions: all $ F\le 1.6, p\ge .1,{\upeta}_{\mathrm{p}}^2\le .018 $). Thus, the subsequent analyses average across clusters within the two sets of test stimuli (i.e., training and transfer).

Inspection of the correlations during the test phase (see Fig. 3) suggests more consistent learning and generalization of the correlational structure of the training categories for the II structure. A series of one-sample t tests (see Table 1) was consistent with this observation. For the RB structure, neither the correlations for the training stimuli nor those for the transfer stimuli were significantly greater than zero. In contrast, for the II structure, almost all of the correlations were significantly greater than zero; the one exception was the correlation for training items in the YES/NO condition, which did not survive the correction for multiple comparisons. Consistent with the previous analysis, a 2 stimulus set (training, transfer) × 2 category structure × 3 training methodology mixed ANOVA comparing the magnitude of the correlation across conditions indicated only a significant main effect of category structure, $ F\left(1,180\right)=9.98, p<.05,\;{\upeta}_{\mathrm{p}}^2=.05 $. None of the other effects were statistically significant (all $ F\le 2.43, p\ge .12,\;{\upeta}_{\mathrm{p}}^2\le .02 $). In sum, these data suggest learning and generalization of the within-category correlations, but only for the II category structure.
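For readers who wish to check the reported effect sizes, partial eta squared can be recovered from an F ratio and its degrees of freedom using the standard identity ηp² = (F × df_effect) / (F × df_effect + df_error). The short Python sketch below applies this identity. The function name is hypothetical, and the identity is offered as a consistency check on the reported values rather than as a description of how the values were originally computed.

```python
def partial_eta_squared(f_value, df_effect, df_error):
    """Partial eta squared from an F ratio and its degrees of freedom.

    Uses the standard identity
        eta_p^2 = (F * df_effect) / (F * df_effect + df_error),
    which follows from F = (SS_effect / df_effect) / (SS_error / df_error).
    """
    numerator = f_value * df_effect
    return numerator / (numerator + df_error)


# Main effect of category structure reported above: F(1, 180) = 9.98
effect_size = partial_eta_squared(9.98, 1, 180)
```

Applied to the main effect of category structure, F(1, 180) = 9.98, the identity gives a value that rounds to the reported .05.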

Table 1 Knowledge of the within-category correlational structure during the test phase of Experiment 1

Summary

The goal of Experiment 1 was to investigate the impact of category structure and training methodology on the ability to learn and generalize within-category representations (i.e., correlational structure of the categories). Structure and methodology were predicted to interact such that within-category representations would be learned in all but the RB-A/B condition. The results, however, did not support these predictions. First, although participants demonstrated some evidence of learning within-category representations during the training phase of the INF condition, this information was only significantly maintained for the II structure. That being said, there may have been learning in the RB-INF condition that did not survive the statistical correction for multiple comparisons given the small-to-moderate effect sizes for the training and transfer stimuli during the test phase. Second, YES/NO training did not generally result in the learning of within-category representations. Instead, the results suggest that the II structure consistently resulted in the learning of within-category representations, regardless of training methodology. Moreover, the within-category representations could be generalized to a novel task (i.e., from categorization to inference) and to novel stimuli.

Experiment 2

The results of Experiment 1 suggest that learning and generalization of within-category representations may be limited to II category structures. The inference task, however, was fairly challenging. Thus, extended training on the inference task might yield more robust evidence of within-category knowledge. In addition, extended training may give participants who receive categorization training more of an opportunity to learn the within-category representations. The goal of Experiment 2 was to investigate the impact of extended training on the ability to learn and generalize within-category representations. The design of Experiment 2 was identical to that of Experiment 1 with two exceptions. First, the amount of training was doubled (across two training sessions). Second, given the similarity of the results in the A/B and YES/NO conditions of Experiment 1, only A/B training was included in Experiment 2.