Elsevier

Intelligence

Volume 38, Issue 6, November–December 2010, Pages 625-635
Intelligence

The relationship between n-back performance and matrix reasoning — implications for training and transfer

https://doi.org/10.1016/j.intell.2010.09.001Get rights and content

Abstract

We have previously demonstrated that training on a dual n-back task results in improvements in fluid intelligence (Gf) as measured by matrix reasoning tasks. Here, we explored the underlying mechanisms of this transfer effect in two studies, and we evaluated the transfer potential of a single n-back task. In the first study, we demonstrated that dual and single n-back task performances are approximately equally correlated with performance on two different tasks measuring Gf, whereas the correlation with a task assessing working memory capacity was smaller. Based on these results, the second study was aimed on testing the hypothesis that training on a single n-back task yields the same improvement in Gf as training on a dual n-back task, but that there should be less transfer to working memory capacity. We trained two groups of students for four weeks with either a single or a dual n-back intervention. We investigated transfer effects on working memory capacity and Gf comparing the two training groups' performance to controls who received no training of any kind. Our results showed that both training groups improved more on Gf than controls, thereby replicating and extending our prior results.

Introduction

Fluid intelligence (Gf) is defined as a complex human ability that allows us to adapt our thinking to new cognitive problems or situations for which we cannot rely on previously acquired knowledge (e.g. Carpenter, Just, & Shell, 1990). Gf is considered critical for a wide variety of cognitive tasks (Engle et al., 1999, Gray and Thompson, 2004), and it seems to be one of the most important factors in learning (Deary et al., 2007, Neisser et al., 1996, Rohde and Thompson, 2007, te Nijenhuis et al., 2007). There is considerable agreement that a substantial proportion of the variance in Gf is hereditary (Baltes et al., 1999, Cattell, 1963, Gray and Thompson, 2004), but it has also been shown that social class and age moderate the heritability of Gf (Haworth et al., 2009, Turkheimer et al., 2003). Although high heritability in principle does not preclude alteration of Gf through environmental factors or interventions (Jensen, 1981), evidence of such alteration has been sparse. However, there is now accumulating evidence showing that certain interventions seem to increase performance in Gf tasks (Buschkuehl & Jaeggi, 2010), although the mechanisms that underlie such change are not well understood (Basak et al., 2008, Jaeggi et al., 2008, Klingberg et al., 2005, Klingberg et al., 2002, Rueda et al., 2005, Tranter and Koutstaal, 2007). For example, we have shown that the n-back task can be used as a training vehicle to improve performance on matrix reasoning tasks which are commonly used as a typical measure of Gf (e.g. Gray and Thompson, 2004, Kane and Engle, 2002, Snow et al., 1984). In our study, subjects were pretested on measures of Gf, after which they were given up to four weeks of daily training on a dual n-back task (Jaeggi, Buschkuehl, Jonides, & Perrig, 2008). The dual n-back task consisted of a position that was pseudo-randomly marked on a computer screen in each stimulus frame which subjects had to match for spatial position to the stimulus presented n frames back in the sequence. Simultaneously with the spatial task, subjects had to process an auditory stream of stimuli in which a single letter was presented in each auditory frame that had to be matched to the letter that appeared n items ago. The value of n was matched for the spatial and verbal tasks, both of which required responses. The level of n changed during the experiment according to the participants' performance to keep overall task difficulty approximately constant. Following training, subjects were given non-overlapping items from an instrument measuring Gf. The results showed that training on a dual n-back task yielded improvements in Gf relative to a control group that did not train.

Why was this training regimen successful? That is, what mechanisms drive such transfer effects? We believe it is critical that the training and the transfer tasks share overlapping cognitive processes for transfer to succeed. Thus, we think that the gain in Gf emerges because the processes that are engaged by the training task also mediate performance in Gf tasks. We proposed that the framework by Halford, Cowan, and Andrews (2007) might serve as a useful model to understand why Gf can be improved by means of a working memory task. Their claim is that working memory and intelligence share a common capacity constraint, which is driven by attentional control processes. Other authors have come to a related conclusion (Gray et al., 2003, Kane et al., 2004), and in particular, Carpenter, Just, and Shell (1990) have proposed that the ability to derive abstract relations and to maintain a large set of possible goals in working memory accounts for individual differences in typical tasks that measure Gf. The underlying neural circuitries provide additional evidence for the shared variance between working memory and Gf in that both seem to rely on similar neural networks, most consistently located in lateral prefrontal and parietal cortices (Gray et al., 2003, Kane and Engle, 2002). Thus, it seems plausible that the training of a certain neural circuit might lead to transfer to other tasks that engage similar or at least overlapping neural circuits. Indeed, recent evidence shows that transfer occurs if the training and the transfer task engage overlapping brain regions, but not if they engage different regions (Dahlin, Neely, Larsson, Backman, & Nyberg, 2008; see also Persson & Reuter-Lorenz, 2008).

But overlapping processes and neural circuits might not be the only prerequisites for transfer. We believe that the training task has to be very carefully designed in a certain way to promote transfer. First, a successful training task must minimize the development of strategies that are specific to the task in question because the object of training must be changes in the information processing system, not changes in the way one particular task is performed (cf. Ericsson & Delaney, 1998). Second, we think that it is very important to keep a persistently high level of training demand while also considering inter-individual performance differences. This can be achieved by using an adaptive training method that continuously adjusts the current training difficulty to the actual performance of each subject. Third, we argue that it is necessary to stress the information processing system during training, for example by taxing more than one input modality at a time or by having the subject engage in two tasks simultaneously (Oberauer, Lange, & Engle, 2004). As we have shown in our work, the dual n-back training paradigm is a task that fulfills these requirements and subsequently leads to the predicted transfer effects (Jaeggi et al., 2007, Jaeggi et al., 2008). Nevertheless, although various versions of the n-back task are widely used in research, only few studies have examined the processes involved in n-back performance (e.g. Hockey and Geffen, 2004, Jaeggi et al., 2010, Kane et al., 2007). Therefore, little knowledge is available about the cognitive processes that mediate performance in this task and consequentially, about the processes underlying n-back training that eventually promote transfer to Gf. In addition, although the n-back task is commonly regarded as a measure of working memory, its concurrent validity is still open to question (Jaeggi et al., 2010, Jarrold and Towse, 2006, Kane et al., 2007, Oberauer, 2005). For example, research from Kane's lab as well as our own work suggests that the n-back task and more traditional measures of working memory capacity (e.g. reading span or operation span tasks) do not share a great deal of common variance, although they independently predict performance in Gf tasks (Jaeggi et al., 2010, Kane et al., 2007). This is in line with findings from training on the n-back task which leads to improvements in Gf (Jaeggi, Buschkuehl, Jonides, & Perrig, 2008), but not in measures of working memory capacity (Jaeggi et al., 2008, Li et al., 2008). Therefore, we do not know whether training on an n-back task results in transfer to Gf due to an improvement in basic working memory processes, or whether there are other processes that are better predictive of such transfer.

Study 1

The main goal of Study 1 was to document the results of a correlational study investigating the relationship between the n-back task and selected cognitive tasks chosen so that they might reveal factors that underlie the transfer effect that we have observed by training on the dual n-back task. Based on our own work and Kane's work, we included measures of matrix reasoning and a measure of working memory capacity (Jaeggi et al., 2010, Kane et al., 2007). Further, as we were interested in investigating the transfer potential of a simpler n-back task version as well as the dual n-back task, we included both single and dual n-back task versions. There are four reasons to investigate the transfer potential of a single n-back task: First, the dual n-back task is relatively new and not much is known about its constituent processes (Jaeggi et al., 2007, Jaeggi et al., 2008, Jaeggi et al., 2009, Jaeggi et al., 2003). Second, the dual n-back task is inherently complex, and so it is not easy to disentangle the underlying processes. Third, the dual n-back task includes an obvious task-switching component (i.e., going back and forth between the two stimulus streams that must be tracked). This task-switching component might contribute to increased reasoning performance because in many matrix reasoning problems, it seems important to be able to switch back and forth between different representations. However, it is not at all clear that task-switching processes are an essential component of Gf; thus, if task-switching processes are not critical to matrix reasoning, then a single n-back task should correlate just as well with Gf as a dual n-back task. Finally, the dual n-back task is very challenging for participants, thereby restricting its range of application mainly to healthy young adults. We know from our previous research that a frequently used and well-established single n-back task recruits similar neural networks to a dual n-back task (Jaeggi et al., 2003), and also, that single n-back tasks share common variance with Gf tasks (e.g. Gray et al., 2003, Hockey and Geffen, 2004, Jaeggi et al., 2010, Kane et al., 2007). Thus, we investigated the relationship between single n-back performance and measures of Gf, and whether and how this relationship is different from that of the dual n-back task and Gf. We also investigated the role of working memory capacity, hypothesizing that working memory capacity predicts performance in n-back tasks, however, to a lesser extent than Gf.

Section snippets

Subjects

A total of 104 participants (65 women) with a mean age of 21.3 years (SD = 2.2) were tested. Subjects were recruited from the student population of the University of Michigan and were paid $14 per hour for participation.

Single n-back task

Participants were shown a sequence of visual stimuli and they had to respond each time the current stimulus was identical to the one presented n positions back in the sequence. The stimulus material consisted of 8 random shapes (Vanderplas & Garvin, 1959) which we have used

Results

Means, standard deviations, and reliability estimates for all measures are reported in Table 1. The Pearson's correlations among the variables used for the regression analyses are reported in Table 2.

Overall, our data revealed a strong relationship between the single and dual n-back tasks with a correlation of r = .72, indicating that the two versions share a considerable amount of common variance (see Table 2). Further, the correlation of both n-back tasks with the matrices tasks were stronger

Discussion

The findings of Study 1 confirm other findings from the literature (Jaeggi et al., 2010, Kane et al., 2007): Consistent with our hypotheses, both n-back task variants were highly correlated, and both were best predicted by Gf.

In general, matrix reasoning tasks seem to be better predictors for both the single and the dual n-back tasks than a measure of working memory capacity. As the reliability estimates were appropriate for the n-back tasks, the lack of correlation between the n-back tasks and

Participants

Ninety-nine undergraduates (mean age = 19.4 years; SD = 1.5; 76 women) from the National Taiwan Normal University in Taipei volunteered to take part in the study. Fifty-two (41 women) were assigned to the control group and 47 (35 women) were assigned to the experimental group. In return for participation, participants earned course credit. In addition, the training groups received NT$ 600 (about US$20) as well as the training software after study completion. After the pretest, participants in the

Results

Descriptive data for each of the intervention groups and test session are reported in Table 9. Note that there were no significant group differences at pre-test in any of the criterion measures.

Discussion

The goal of Study 2 was to investigate whether a single n-back intervention is a useful alternative to the complex dual n-back task that we used previously to demonstrate a transfer effect on tests of Gf. We based our assumption regarding the effectiveness of the single n-back task on our earlier findings showing that dual and single n-back tasks recruit similar neural networks (Jaeggi et al., 2003), and on the fact that single n-back performance correlates with Gf as well as dual n-back

References (58)

  • N. Unsworth et al.

    Working memory capacity and fluid abilities: examining the correlation between Operation Span and Raven

    Intelligence

    (2005)
  • P.B. Baltes et al.

    Lifespan psychology: theory and application to intellectual functioning

    Annual Review of Psychology

    (1999)
  • S.M. Barnett et al.

    When and where do we apply what we learn? A taxonomy for far transfer

    Psychological Bulletin

    (2002)
  • C. Basak et al.

    Can training in a real-time strategy video game attenuate cognitive decline in older adults?

    Psychology and Aging

    (2008)
  • M. Buschkuehl et al.

    Improving intelligence: a literature review

    Swiss Medical Weekly

    (2010)
  • P.A. Carpenter et al.

    What one intelligence test measures: a theoretical account of the processing in the Raven Progressive Matrices Test

    Psychological Review

    (1990)
  • R.B. Cattell

    Theory of fluid and crystallized intelligence: a critical experiment

    Journal of Educational Psychology

    (1963)
  • J.M. Chein et al.

    Expanding the mind's workspace: training and transfer effects with a complex working memory span task

    Psychonomic Bulletin & Review

    (2010)
  • A.R.A. Conway et al.

    Working memory span tasks: a methodological review and user's guide

    Psychonomic Bulletin & Review

    (2005)
  • E. Dahlin et al.

    Transfer of learning after updating training mediated by the striatum

    Science

    (2008)
  • R.W. Engle et al.

    Working memory, short-term memory, and general fluid intelligence: a latent-variable approach

    Journal of Experimental Psychology: General

    (1999)
  • K.A. Ericsson et al.

    Working memory and expert performance

  • W. Frearson et al.

    Intelligence, reaction time (RT), and a new odd-man-out RT paradigm

    Personality and Individual Differences

    (1986)
  • J.R. Gray et al.

    Neural mechanisms of general fluid intelligence

    Nature Neuroscience

    (2003)
  • J.R. Gray et al.

    Neurobiology of intelligence: science and ethics

    Nature Reviews. Neuroscience

    (2004)
  • R. Hamel et al.

    The 20-minute version as a predictor of the Raven Advanced Progressive Matrices Test

    Educational and Psychological Measurement

    (2006)
  • C.M. Haworth et al.

    The heritability of general cognitive ability increases linearly from childhood to young adulthood

    Molecular Psychiatry

    (2009)
  • A. Heron et al.

    Age and Function

    (1967)
  • R. Hossiep et al.

    Bochumer Matrizentest. BOMAT-advanced-short version

    (1999)
  • Cited by (377)

    View all citing articles on Scopus

    This work was supported by a grant of The Michigan Center for Advancing Safe Transportation throughout the Lifespan (MCASTL #DTRT07-G-0058) to MB, a fellowship from the Swiss National Science Foundation (PA001-117473) to SMJ, and a grant from the Office of Naval Research to JJ (N00014-09-0213). Study 2 was conducted as a part of BSL's master thesis with the guidance of the other co-authors and with the support of the University of Bern and the National Taiwan Normal University. The authors wish to thank Chao-Yi Ho and Philip Cheng for their help with the construction of the Mandarin stimuli and the help translating the instructions, as well as Courtney Behnke, Kirti Thummala, and Patrick Bissett, Wu Shan-Yun, Yi Han Chiu, and David Studer for their help with data collection, and finally, Randall Engle's group for providing us with the automated version of their automated operation span task.

    View full text