Compatibility between stimulus and response is a major determinant of the efficiency of response selection (Fitts & Seeger, 1953; Proctor & Vu, 2006; Welford, 1960). In choice-reaction tasks, responses are faster and more accurate when stimuli and responses are spatially compatible than when they are spatially incompatible (e.g., pressing a left key to stimuli on the left, as opposed to pressing the same key to stimuli on the right). This stimulus-response compatibility (SRC) effect occurs when spatial attributes of stimuli are task-relevant or -irrelevant (e.g., when responses are determined by the stimulus color rather than the location). When spatial attributes of stimuli are task-irrelevant, the effect is known as the Simon effect (see Lu & Proctor, 1995, for a review).

The Simon effect has been an important phenomenon of investigation due to its implications about the mechanisms of response selection (Hommel, 2011). For instance, the Simon effect has been a useful tool to investigate the principles underlying transfer of learning (e.g., Baroni, Yamaguchi, Chen, & Proctor, 2013; Proctor & Lu, 1999; Tagliabue, Zorzi, Umiltà, & Bassignani, 2000). In this line of research, participants are first asked to perform a choice-reaction task in which they respond to spatial stimuli (e.g., circles that appear on the left or right of the display) by pressing a key whose location is incompatible with the stimulus location. After performing this incompatible-mapping task, participants transfer to the Simon task, in which they press the left or right key according to non-spatial attributes of stimuli, such as colors. Proctor and Lu (1999) found that the Simon effect was reversed (i.e., responses were faster when the stimulus and response locations were incompatible) after participants performed three sessions of 600 trials (a total of 1,800 trials) on the incompatible-mapping task. Similarly, Tagliabue et al. (2000) found that the Simon effect was eliminated after performing less than 100 trials on the incompatible-mapping task. These results indicate that participants acquire the incompatible stimulus-response (S-R) associations through practice and transfer these incompatible associations to the Simon task even when these associations are no longer relevant to the current task.

This transfer method is useful for identifying factors that affect transfer of learned associations to a novel task. For instance, Tagliabue, Zorzi, and Umiltà (2002) eliminated the Simon effect when the practice task used auditory stimuli and the transfer task used visual stimuli, implying that the transfer effect is not specific to the stimulus modality (also see Proctor, Yamaguchi, & Vu, 2007; Vu, Proctor, & Urcuioli, 2003). Proctor, Yamaguchi, Zhang, and Vu (2009) tested three types of spatial stimuli; physical stimulus location (circles that appeared on the left and right), spatial words (“LEFT” and “RIGHT”), and arrows pointing to the left and right, and found that the incompatible mapping transferred to the Simon task when stimulus type matched between the practice and transfer tasks but not when it did not match. For instance, the Simon effect was eliminated when stimuli in the practice and transfer tasks varied in physical locations, whereas the Simon effect was not modulated when stimuli in the practice task were physical locations but those in the transfer task were spatial words. These results indicate that although transfer of learned associations is independent of stimulus modality as long as stimuli convey physical location information (also see Vu, 2007), it is sensitive to how spatial information is presented.

The specificity of transfer of learning to the learning context is consistent with the idea that knowledge and skill are utilized most effectively when the test context resembles the context in which learning has taken place (Bouton, 1993; Godden & Baddeley, 1975; Healy, Wohldmann, Sutton, & Bourne, 2006; Tulving & Thomson, 1973). However, the previous studies all used manual responses (e.g., pressing keys, deflecting a joystick), and it is not known how much the mode of responding contributes to transfer of learned associations in this paradigm. Therefore, the present study focused on this factor, using vocal responses. By doing so, it allows examining whether the degree of set-level compatibility affects the acquisition and transfer of newly learned associations to a novel task context.

Set-level compatibility for learning and transfer

Set-level compatibility refers to the compatibility between stimuli and responses at the level of stimulus and response sets, as opposed to individual elements in these sets (Kornblum, Hasbroucq, & Osman, 1990; Proctor & Wang, 1997). The degree of set-level compatibility is usually defined in terms of the similarity between stimulus and response sets. For instance, the stimulus set consisting of the words LEFT and RIGHT would be more compatible with the response set consisting of vocal utterance of “left” and “right” than the response set consisting of pressing of left and right keys. A higher degree of set-level compatibility is associated with a larger SRC effect (Kornblum & Lee, 1995; Wang & Proctor, 1996) or Simon effect (Baldo, Shimamura, & Prinzmetal, 1998; Lu & Proctor, 1994). A higher degree of set-level compatibility, particularly referring to “modality compatibility,” is also associated with smaller task-switching cost (Stephan & Koch, 2010) and dual-task interference (Hazeltine, Ruthruff, & Remington, 2006). Therefore, set-level compatibility appears to play a significant role in performing cognitive tasks. Nevertheless, less is known about the role of set-level compatibility in learning of new S-R associations (Yamaguchi & Proctor, 2009).

When responses are made manually, many more practice trials are required to eliminate the Simon effect with spatial words (600 trials) than with physical location stimuli (<100 trials; Proctor et al., 2009). These results may reflect the influence of set-level compatibility in the acquisition of new S-R associations; the acquisition may be more efficient when the practice task involves a high degree of set-level compatibility, or it may be due to the type of stimuli; the acquisition of spatial mappings is more efficient with physical location stimuli than spatial words. Also, using a similar transfer method but with the Stroop task (Marini, Iani, Nicoletti, & Rubichi, 2011), the Stroop effect was reduced after the incompatible-mapping task when the practice and transfer tasks both required manual responses but not when the practice task required manual responses and the transfer task required vocal responses. This outcome is consistent with the finding that contextual match between the practice and transfer tasks with respect to the response mode plays a major role in the transfer of incompatible mappings (Yamaguchi & Proctor, 2009), but it may also reflect the importance of the type of response itself: the transfer effect may occur only with manual responses. Hence, it is necessary to examine the contribution of the mode of responding to transfer of learned associations to dissociate these possibilities.

Present study

The present study used the transfer paradigm (Proctor & Lu, 1999; Tagliabue et al., 2000) to examine the roles of stimulus type, response mode, and set-level compatibility in transfer of incompatible mappings to the Simon task. The main findings in previous studies are that the Simon effect is reduced, or eliminated, after participants practice with the incompatible spatial mappings for less than 100 trials (Tagliabue et al., 2000) and that the modulation is sensitive to the match of stimulus sets between the practice and transfer tasks (Proctor et al., 2009). Also, compared to physical location stimuli (stimuli that appear on the left or right of the display), spatial words (“left” and “right”) require more practice trials to modulate the Simon effect (Proctor et al., 2009). As previous studies all used manual responses (e.g., pressing keys) that are more compatible with physical location stimuli than with spatial words, it is not clear whether these results reflect the effects of stimulus type alone or the effect of the set-level compatibility between stimuli and response sets.

In the first two experiments, responses were made vocally, and the type of stimuli (physical location vs. spatial words) was varied between the practice and transfer tasks to examine whether set-level compatibility plays a role in the acquisition and transfer of associations. In both experiments, stimuli in the practice task were either physical locations or spatial words, and stimuli in the transfer task were physical locations in Experiment 1 and were spatial words in Experiment 2. These experiments allow dissociating the effect of stimulus type from that of set-level compatibility: If the previous finding, that physical location stimuli modulates the Simon effect with fewer practice trials than spatial words do, is due to the higher degree of set-level compatibility, spatial words should modulate the Simon effect with a small number of practices (<100 trials) when responses are made vocally, but physical locations stimuli should not.

In the subsequent two experiments, the mode of responding was manipulated. The design of these experiments was complementary to the design of the first two experiments; stimuli were of the same type between the practice and transfer tasks (physical location stimuli in Experiment 3, and spatial words in Experiment 4), but responses were changed from manual to vocal, or vice versa, between the practice and transfer tasks. Thus, Experiments 3 and 4 dissociated the effect of set-level compatibility from that of the mode of responding.

Experiment 1

In Experiment 1, participants performed the incompatible-mapping tasks using physical location stimuli (circles that appeared in left and right locations on the display) or spatial words (LEFT and RIGHT that appeared in the center of the display) in the practice task. Participants in both conditions performed the Simon task with the physical location stimuli in the transfer task. Responses were always made by saying “left” or “right” into a microphone. In previous studies using manual keypresses, the Simon effect was reduced when stimuli in the practice task were physical locations but not when they were spatial words if stimuli in the transfer task were physical locations (Proctor et al., 2009). Experiment 1 examined whether these outcomes were due to the degree of set-level compatibility or due to the type of stimuli. If the match between the two tasks is the primary determinant of the transfer effect, then practice with physical location stimuli should produce a larger modulation of the Simon effect. However, if the higher degree of set-level compatibility is the primary determinant of the transfer effect, practice with spatial words should produce a larger modulation of the Simon effect.

Also, a previous study (Marini et al., 2011) found that practice with a spatially incompatible mapping task can modulate the Stroop effect in the transfer task when using manual keypresses but not when using vocal responses. The lack of the transfer effect could have been due to the altered task context between the practice and transfer tasks (see also Yamaguchi & Proctor, 2009), but it could also be due to the nature of vocal responses used in the transfer task (e.g., keypress responses are spatially distinct, whereas vocal responses are not. Hence, the present experiment assessed whether the transfer of incompatible mapping can occur when both practice and transfer tasks used vocal responses) as no studies have shown transfer of incompatible mappings with vocal responses.

Method

Participants

Forty-eight undergraduate students at Purdue University participated in the experiment. They received experiment credits toward their introductory psychology courses for participation. All participants reported having normal or corrected-to-normal visual acuity and normal color vision. They were randomly assigned to one of the three groups (see Procedure).

Apparatus and stimuli

The apparatus consisted of a 19-in LCD monitor and a personal computer. For the practice task, the stimuli were white circles (2.5 cm in diameter) that appeared at the left or right side of the screen (12.5 cm from the screen center), or the words LEFT (1.5 cm in height, 3 cm in width) and RIGHT (1.5 cm in height, 4 cm in width) printed in white that appeared at the center of the screen. For the transfer task, the imperative stimuli were 2.5-cm circles colored in green or red, which appeared on the left or right side of the screen. Responses were collected by speaking into a microphone, saying “left” or “right.” The experimenter pressed one of two keys on the keyboard to register participants’ responses. Response time (RT) was the interval between onset of a circle and a trigger of voice key.

Procedure

The experiment was conducted individually in a dimly lit cubicle. Participants were seated in front of the computer monitor at an unrestricted distance of approximately 60 cm from the monitor and read on-screen instructions. The experimenter sat in the experiment room in order to register participants’ vocal responses.

Participants were assigned randomly to one of the three conditions. In the first condition, participants performed the practice task in which they responded to circles that appeared on the left or right side of the screen (location practice). In the second condition, participants performed the practice task in which they responded to the words LEFT and RIGHT (word practice). In the third condition, participants did not perform the practice task (control). For the former two conditions, there was a mandatory break for 5 min after the practice task. All participants performed the same transfer task in which they responded to colored circles that appeared on the left or right side of the screen.

For the practice session, a trial started with a cross (0.8 cm in height and width) presented at the center of the screen, which lasted for 1,000 ms. The cross was replaced by an imperative stimulus. Participants then responded by speaking into the microphone. They said “left” if a circle occurred on the right or the word RIGHT appeared at the center; they said “right” if a circle occurred on the left or the word LEFT appeared. The stimulus stayed on the screen until the microphone detected a vocal response, and a blank display replaced the screen. The experimenter pressed one of two keys to indicate whether participants said “left” or “right” within a 1,000-ms time window after the onset of response. If the experimenter failed to register the response within this time window, the trial was recorded as no response. For error or no response, a 400-Hz tone was presented for 500 ms from the speaker positioned on both sides of the screen. For correct response, a blank screen was presented for 500 ms. The trial ended with another blank screen that lasted for 1,000 ms.

The procedure of the transfer task was essentially the same as that of the practice task, except that the imperative stimuli were color-filled circles that occurred on the left or right of the screen. Participants were encouraged to ignore the circle location and responded to the color by saying “left” and “right.” For half the participants in each condition, the “left” and “right” responses were mapped to green and red stimuli, respectively; the mapping was reversed for the other half.

The practice and transfer tasks consisted of 84 and 156 trials, respectively. For both sessions, the first 12 trials were considered warm-up and excluded from the analysis. A session lasted for less than half an hour.

Results

Trials with RTs of less than 150 ms or greater than 1,500 ms were discarded (3.30 % for the practice task; 2.03 % of all trials for the transfer task). For each participant, mean RTs for correct responses and percentage errors (PEs) were computed. Figure 1 depicts RTs in the transfer task. Table 1 summarizes RTs and PEs in the practice task, and Table 2 summarizes PEs in the transfer task. RTs in the transfer task are shown in Fig. 1.

Fig. 1
figure 1

Response time (RT) as a function of Practice Mode (control, physical location, spatial word) in Experiment 1; the numbers are the Simon effects and an asterisk indicates a significant reduction of the Simon effect compared to the No Practice condition; error bars are standard errors of the means

Table 1 Response time (RT) and percentage errors (PE) in the practice tasks of Experiments 1–4
Table 2 Percentage errors (PE) in the transfer tasks of Experiments 1–4

For the practice task, RT was shorter for the location practice group (M = 539 ms) than for the word practice group (M = 671 ms), t(30) = 4.15, SE = 31.67, p < .001, but PE was not significantly different, t(30) < 0.2, SE = .73 (see Table 1).

For the transfer task, RT and PE were submitted to separate analyses of variance (ANOVAs) as a function of Practice Mode (location vs. word vs. control; between-subject) and Compatibility (compatible vs. incompatible; within-subject). For RT, the main effect of Compatibility was significant, F(1, 45) = 41.77, MSE = 230, p < .001, ηp 2 = .481, yielding a 20-ms Simon effect (Ms = 566 ms and 586 ms for compatible and incompatible trials, respectively). Compatibility interacted with Practice Mode, F(2, 45) = 11.89, MSE = 223, p < .001, ηp 2 = .346. The Simon effect was 31 ms for the control group and 30 ms for the word practice group, compared to –1 ms for the location practice group. There was no main effect of Practice Mode, F(2, 45) = 1.92, MSE = 16,301, p = .159, ηp 2 = .078. For PE, there was no significant effect, Fs < 1.9, ps > .1.Footnote 1

Discussion

The Simon effect was modulated when stimuli in the practice task were physical locations but not when they were spatial words. The results indicate the importance of contextual match between the practice and transfer tasks in terms of stimulus type, providing little evidence suggesting the importance of set-level compatibility. The modulation of the Simon effect also indicates that incompatible spatial associations do transfer to the Simon task even when responses are not spatially distinct (cf. Marini et al., 2011). Therefore, the present results reinforce the idea that newly acquired spatial associations are utilized more effectively when contextual features present during the acquisition are also present at test (Yamaguchi & Proctor, 2009). Nevertheless, it is still possible that incompatible associations were acquired quickly with spatial words due to the higher degree of set-level compatibility but were not expressed in the transfer task due to the altered task context. Thus, Experiment 2 tested the same manipulations in the practice task but with spatial words in the transfer task.

Experiment 2

Experiment 2 examined transfer of incompatible spatial mapping to the Simon task using spatial words with vocal responses. Stimuli in the practice task were physical location stimuli or spatial words. Set-level compatibility was higher for the transfer task of the present experiment than that of Experiment 1, and there was a contextual match between the practice and transfer tasks when spatial words were used in the practice task as well. Thus, if set-level compatibility plays a major role, practice with spatial words should produce a larger modulation of the Simon effect than practice with physical location stimuli.

The present experiment also examined whether the degree of set-level compatibility allowed more efficient acquisition of incompatible associations. With manual responses, practice with spatial words required more trials to yield the transfer effect than did practice with physical location stimuli (Proctor et al., 2009). If this outcome was due to high set-level compatibility between physical locations and manual responses, practice with spatial words would require a small number of practice trials (<100 trials) to modulate the Simon effect. If learning depends on stimulus type, a larger number of practice trials would be required for spatial words than for physical locations even when responses are made vocally. Thus, the present experiment also manipulated the number of practice trials across groups.

Method

Participants

Eighty undergraduate students at Purdue University participated in the present study. They were divided into five groups of 16 participants (see Procedure). All participants received experimental course credits toward their psychology courses, and they reported having normal or corrected-to-normal visual acuity and normal color vision. None had participated in Experiment 1.

Apparatus, Stimuli, and Procedure

Experiment 2 followed the procedure of Experiment 1, with the following changes: Two groups of participants were assigned to the conditions in which there were 84 practice trials with spatial words or physical locations, and another two groups were assigned to the conditions in which there were 168 practice trials with spatial words or physical locations. The final group had no practice trials. The stimuli were identical with those used in Experiment 1. All groups performed the same Simon task with spatial words, the words LEFT and RIGHT colored in green or red. Half the participants in each group said “left” to green words and “right” to red words, and the other half said “right” to green words and “left” to red words. Experiment 2 was essentially the same as Experiment 1 in other respects.

Results

The percentages of discarded trials were 2.27 % for the practice task and 2.21 % for the transfer task, with the same filtering criteria as in Experiment 1. RTs and PEs for the practice task are summarized in Table 1, and RTs and PEs for the transfer task are given in Fig. 2 and Table 2, respectively.

Fig. 2
figure 2

Response time (RT) as a function of Practice Mode (control, location, word) with 84 practice trials and 168 practice trials in Experiment 2; the numbers are the Simon effects and an asterisk indicates a significant reduction of the Simon effect compared to the No Practice condition; error bars are standard errors of the means

RTs and PEs for the practice task were submitted to separate ANOVAs as a function of Practice Mode (physical location vs. spatial word) and Practice Length (84 vs. 168). Both factors were between-subject variables. For RT, there was a significant main effect of Practice Mode, F(1, 60) = 10.63, MSE = 8781, p = .002, ηp 2 = .150, reflecting shorter RT for physical locations (M = 555 ms) than for spatial words (M = 631 ms). No other effects were significant, Fs < 1. For PE, no effects were significant, Fs < 1.

For the transfer task, separate ANOVAs were conducted for the two groups with a different number of practice trials, each of which involved Practice Mode (physical location vs. spatial word vs. control; between-subject) and Compatibility (compatible vs. incompatible; within-subject) as factors. For RT of the groups with 84 practice trials, the only significant effect was that of Compatibility, F(1, 45) = 72.14, MSE = 646, p < .001, ηp 2 = .616, yielding a 44-ms Simon effect (Ms = 611 ms and 655 ms for compatible and incompatible trials). The interaction was not significant, F(2, 45) = 2.13, MSE = 646, p = .131, ηp 2 = .086, which indicates that the Simon effect was not modulated by the prior incompatible mappings with word or location stimuli (the Simon effect was 44 ms for control, 57 ms for location practice, and 31 ms for word practice). For PE, the main effect of Compatibility was significant, F(1, 45) = 27.03, MSE = 3.81, p < .001, ηp 2 = .375, reflecting a Simon effect of 2.07% (Ms = 2.21% and 4.29% for compatible and incompatible trials). No other effects were significant, Fs < 1.6, ps > .2.

For RT of the groups with 168 practice trials, a main effect of Compatibility was significant, F(1, 45) = 62.00, MSE = 310, p < .001, ηp 2 = .579, yielding a 28-ms Simon effect (Ms = 579 ms and 607 ms for compatible and incompatible trials), and compatibility interacted with Practice Mode, F(2, 45) = 7.03, MSE = 310, p < .002, ηp 2 = .238. As compared to the Simon effect for the control group (M = 44 ms), the effect was significantly smaller for those who practiced with spatial words (M = 11 ms; t(30) = 3.92, p < .001) but not for those who practiced with physical locations (M = 30 ms; t(30) = 1.35, p = .187). The main effect of Practice Mode was not significant, F(2, 45) < 1. For PEs, the main effect of Compatibility was significant, F(1, 45) = 8.02, MSE = 5.28, p < .007, ηp 2 = .151, yielding the Simon effect (Ms = 2.05 % and 3.38 % for compatible and incompatible trials). The main effect of Practice Mode approached the .05 level, F(2, 45) = 2.73, MSE = 8.36, p = .076, reflecting higher error rate for the control group (M = 3.58%) than for the other two (Ms = 2.68 % and 1.89 % for physical location and spatial words, respectively). These two factors did not interact, F(2, 45) = 2.43, p > .1.

Including the data from Experiment 1 and the groups with 168 practice trials in Experiment 2, an additional 2 (Practice Mode: physical location vs. spatial word) × 2 (Transfer Mode: physical location vs. spatial word) × 2 (Compatibility) ANOVA was conducted, of which the main focus was the interaction among the three factors. As expected, it revealed a significant 3-way interaction, F(1, 60) = 20.66, MSE = 257, p < .001, η p 2 = .256, reflecting the outcomes that the Simon effect was reduced when the stimulus type matched between the practice and transfer tasks but not when they mismatched.

Discussion

Two outcomes of the present experiment are particularly relevant. First, the incompatible spatial mappings were transferred to the Simon task after practice with spatial words for 168 trials but not for 84 trials. Spatial words required more practice trials than physical location stimuli, which produced the transfer effect only after 84 trials in Experiment 1. The results are consistent with those obtained with manual responses (Proctor et al., 2009). Therefore, the acquisition of spatial associations depends on the type of stimuli, not the degree of set-level compatibility. Second, with longer practice, the incompatible spatial mappings transferred to the Simon task only after practice with spatial words but not after practice with physical locations when stimuli were spatial words in the transfer task. This outcome corroborates the result of Experiment 1, supporting the conclusion that transfer of new spatial associations depends on contextual overlap between the practice and transfer tasks with respect to the type of stimuli, but set-level compatibility plays no major role.

Experiment 3

The preceding two experiments varied the type of stimuli used in the practice and transfer tasks, while maintaining the response mode between the tasks. The following two experiments varied the response modes in the practice and transfer tasks while maintaining the type of stimuli. These manipulations allowed dissociating the role of the response mode from that of set-level compatibility. For both manual and vocal response modes, only short practice is required to yield the transfer effect when participants responded to physical location stimuli. Also, for both manual and vocal responses, contextual match between the practice and transfer tasks determined transfer of spatial associations. The similarity between the outcomes with manual and vocal responses raises the possibility that the differences between the two response modes are superficial and a common mechanism underlies both situations (see e.g., Stoet & Hommel, 1999; Wühr, 2006). Therefore, Experiments 3 and 4 examined whether acquired spatial associations transfer between the two modes of responding.

In the present experiment, physical location stimuli were used for the practice and transfer tasks, and half of the participants responded vocally in the practice task and manually (by pressing left and right keys) in the transfer task, whereas the other half responded manually in the practice task and vocally in the transfer task. As short practice is sufficient to acquire incompatible spatial associations for physical location stimuli with manual and vocal responses, all participants had only 84 practice trials. If a common mechanism underlies the two modes of responding, transfer of incompatible spatial associations would be expected in the Simon task regardless of the response modes used in the two tasks.

Method

Participants

Forty-eight undergraduate students at Purdue University were newly recruited for the present experiment. They received experimental credits for their psychology courses. None had participated in the preceding experiments, and all reported having normal or corrected-to-normal visual acuity and normal color vision.

Apparatus, stimuli, and procedure

Stimuli were the location stimuli (white circles in the practice task, and red and green circles in the transfer task) used in Experiment 1. The practice task consisted of 84 trials. The apparatus was the same as that used in Experiment 1. Participants were assigned randomly to one of three groups of 16 participants each. The first group performed the practice task using a keyboard, pressing the “z” (left) and “/” (right) keys; in the transfer task, this group performed the Simon task by speaking into a microphone. The second group used these response modes in the reversed order: they spoke into a microphone in the practice task and pressed keys in the transfer task. The third group served as the control group for the second group (who performed the Simon task with keypresses); those in the control group only performed the Simon task. For those who performed the transfer task with vocal responses, the no-practice group in Experiment 1 served as the control.

Results

Percentages of discarded trials were 0.30 % for the practice task and 2.32 % for the transfer task. The data were analyzed separately for those who used the keyboard in the transfer task and those who spoke into a microphone. RTs and PEs for the practice task are shown in Table 1.

Vocal practice to manual Simon

For the transfer task, RTs and PEs were submitted to 2 (Practice Mode: keyboard vs. control) × 2 (Compatibility: compatible vs. incompatible) ANOVAs. RT showed a main effect of Compatibility, F(1, 30) = 10.39, MSE = 346, p = .003, ηp 2 = .257, yielding a 15-ms Simon effect (Ms = 568 ms and 583 ms for compatible and incompatible trials; see Fig. 3). Although the magnitude of the Simon effect appears small as compared to that obtained with vocal responses (see Experiments 1 and 2), this is a typical size when responses are keypresses (e.g., Proctor et al., 2009; Yamaguchi & Proctor, 2009). No other effects were significant, Fs < 1. PE did not yield any significant effect, Fs < 1.4, ps > .2 (see Table 2).

Fig. 3
figure 3

Response time (RT) as a function of Practice Mode (control vs. keypress or vocal) for the keypress and vocal Simon conditions of Experiment 3; the numbers are the Simon effects and an asterisk indicates a significant reduction of the Simon effect compared to the No Practice condition (the data for No Practice Vocal Transfer are those from the corresponding group of Experiment 1); error bars are standard errors of the means

Manual practice to vocal Simon

For the transfer task, RTs and PEs were analyzed in the same manner as for the Keypress Simon group. RT showed a significant main effect of Compatibility, F(1, 30) = 30.00, MSE = 221, p < .001, ηp 2 = .500, yielding a 20-ms Simon effect (Ms = 550 ms and 571 ms for compatible and incompatible trials; see Fig. 3). The factor interacted with Practice Mode, F(1, 30) = 7.27, MSE = 221, p = .011, ηp 2 = .195. The Simon effect was smaller for those who performed the practice task with the keyboard (M = 10 ms) than for the control group (M = 30 ms). The main effect of Practice Mode was not significant, F(1, 30) < 1.4. PE did not involve any significant effect, Fs < 2.1, ps > .1 (see Table 2).

Discussion

The present results showed little evidence that prior spatial associations transferred to the Simon task when the response mode was vocal in the practice task and manual in the transfer task. However, transfer of incompatible mappings was observed when the response mode was manual in the practice task and vocal in the transfer task. Although this transfer effect is consistent with the idea that a common mechanism underlies the Simon task with manual and vocal responses, the lack of the transfer effect in the other group complicates the issue. This curious pattern of results may be because responses were represented differently according to the mode of responding in the practice task (e.g., Ansorge & Wühr, 2009). For instance, participants might have continued using response codes that they adopted in the practice task in order to represent responses in the transfer task when they switched from manual responses to vocal responses, thus representing vocal responses based on spatial codes. Participants might have adopted lexical codes to represent vocal responses in the practice task, but it may have been more difficult to continue using the same lexical codes to represent keypresses in the transfer task because stimuli were physical locations. Although speculative, such asymmetric transfer of response codes from the practice task to the transfer task could explain the present results. If so, it is not always possible to predict transfer of learned associations merely by looking at contextual overlap at the surface features of the tasks. However, it is noteworthy that the 20-ms reduction of the Simon effect in the present experiment was smaller than the 31-ms reduction obtained in Experiment 1 in which the response mode was vocal in both tasks. Thus, although spatial associations transferred to the Simon task, contextual match still appeared to have played a role in the present experiment.

Experiment 4

Experiment 4 was essentially the same as Experiment 3 but used spatial words in the practice and transfer tasks. The practice task consisted of 168 trials, as Experiment 2 indicated that spatial words required more practice trials to produce the transfer effect. If set-level compatibility played any role in the results of Experiment 3 (e.g., asymmetric transfer effects), the Simon effect with manual responses should be reduced after practice with vocal responses. If contextual match with respect to the response mode is the primary determinant of the transfer effect, practice with the incompatible-mapping task should have little influence on the Simon effect because the mode of responding differed between the practice and transfer tasks.

Method

Participants

A new group of 48 participants were recruited from the same subject pool as in the preceding experiments. None had participated in the preceding experiments.

Apparatus, stimuli, and procedure

The procedure was the same as that in Experiment 3, but stimuli were spatial words in both practice and transfer tasks. The practice task consisted of 168 trials. A group of participants was assigned to the condition in which they responded vocally in the practice task and manually in the transfer task; another group was assigned to the condition in which they responded manually in the practice task and vocally in the transfer task; and the last group was assigned to the condition in which they performed only the transfer task with manual responses. The last group served as the control group for those who used keypresses in the transfer task. The control group in Experiment 2, who only performed the transfer task with vocal responses, served as the control for those who used vocal responses in the transfer task. Experiment 4 was identical with Experiment 3 in other respects.

Results

Percentages of discarded trials were 1.36 % for the practice task and 1.76 % for the transfer task. As in Experiment 3, the data were analyzed separately for those who used the keyboard in the transfer task and those who spoke into a microphone. RTs and PEs are shown in Table 1.

Vocal practice to manual Simon

For the transfer task, RTs and PEs were submitted to 2 (Practice Mode: keyboard vs. control) × 2 (Compatibility: compatible vs. incompatible) ANOVAs. For RT, there was a main effect of Compatibility, F(1, 30) = 15.09, MSE = 574, p < .001, ηp 2 = .335, yielding a 23-ms Simon effect (Ms = 468 ms and 491 ms for compatible and incompatible trials; see Fig. 4), but no other effect was significant, Fs < 1. Also, for PE, there was a main effect of Compatibility, F(1, 30) = 9.56, MSE = 6.46, p = .004, ηp 2 = .242, yielding a 1.96% Simon effect (see Table 2); no other effect reached significance, Fs < 1.

Fig. 4
figure 4

Response time (RT) as a function of Practice Mode (control vs. keypress or vocal) for the keypress and vocal Simon conditions of Experiment 3; the numbers are the Simon effects (the data for No Practice Vocal Transfer are those from the corresponding group of Experiment 2); error bars are standard errors of the means

Manual practice to vocal Simon

RTs and PEs were analyzed in the same manner as for the Keypress Simon group. For RT, there was a main effect of Compatibility, F(1, 30) = 60.43, MSE = 379, p < .001, ηp 2 = .668, yielding a 36-ms Simon effect (Ms = 587 ms and 623 ms for compatible and incompatible trials; see Fig. 4). No other effect was significant, Fs < 1.3, ps > .2. Also, for PE, the main effect of Compatibility was significant, F(1, 30) = 16.24, MSE = 4.73, p < .001, ηp 2 = .351, reflecting a 2.19% Simon effect (see Table 2). There was also a significant main effect of Practice Mode, F(1, 30) = 5.16, MSE = 6.52, p = .030, ηp 2 = .147, reflecting more accurate response for those who performed the practice task (M = 2.12 %) than for the control group (M = 3.58 %). However, there was no significant interaction between the two factors, F < 1.

Discussion

In contrast to Experiment 3, the present experiment did not show any reduction of the Simon effect in either group. Again, the outcome indicates the importance of contextual match between the practice and transfer tasks in terms of response mode. If the modulation of the Simon effect in Experiment 3 was due to the degree of set-level compatibility being higher for manual responses (because stimuli were physical locations), the Simon effect should have been modulated after practice with vocal responses that were highly compatible with spatial words. The lack of such outcome corroborates the conclusion that transfer of newly acquired spatial associations depends primarily on contextual match between the practice and transfer tasks, but set-level compatibility plays no major role in transfer. Although this conclusion rests on the null effects of the present experiment, it is consistent with the findings of Experiments 1 and 2, in which there were significant interactions between compatibility and stimulus type, indicating that the match of stimulus type between the practice and transfer tasks is a strong determinant of the transfer effect, but set-level compatibility is not.

General discussion

The Simon effect has been a useful tool in investigating the fundamental mechanisms underlying response selection (Hommel, 2011; Kornblum et al., 1990; Yamaguchi & Proctor, 2012). In the present study, the transfer paradigm was used to examine the role of set-level compatibility and response mode in transfer of learning in choice reactions. The Simon effect can be modulated when participants perform an incompatible-mapping task prior to the Simon task (e.g., Proctor & Lu, 1999; Tagliabue et al. 2000), but little has been known about the role of set-level compatibility in the acquisition of incompatible spatial associations and transfer of these associations to the Simon task. The present study extended the previous findings by dissociating the contributions of stimulus type, response mode, and set-level compatibility between them, and also by assessing the contextual match between the practice and transfer tasks using vocal responses. The results of the four experiments are consistent, showing that contextual match with respect to the type of stimuli and the mode of responding is the primary determinant of transfer of incompatible mappings to the Simon task. There was little evidence that set-level compatibility plays a major role in the acquisition or transfer of associations.

The first main finding of the present study is that newly acquired incompatible spatial associations can transfer to the Simon task when responses are made vocally: this is the first demonstration of transfer using vocal responses. A previous study has used the Stroop task after practice with the incompatible-mapping task using manual responses (Marini et al., 2011). The Stroop effect was modulated after the incompatible-mapping task if responses were made manually but not when responses were made vocally. This finding led to the proposal that transfer of incompatible mapping is based on the acquisition of an abstract rule that depends on the spatial distinction between alternative responses (e.g., left vs. right response keys). However, the present finding that the Simon effect can be modulated with vocal responses indicates that it is not due to the mode of responding or the lack of spatial distinction within the vocal response set that prevented the incompatible mapping to transfer to the subsequent task. Instead, transfer of associations is prevented due to the change in the response mode between the practice and transfer tasks.

The second main finding is that the Simon effect with vocal responses was modulated more strongly when both stimuli in the practice and transfer sessions were physical locations (Experiment 1) or spatial words (Experiment 2), as compared to when stimuli were physical locations in one task and spatial words in the other task. These results are consistent with the findings in the previous study using manual responses (Proctor et al., 2009), highlighting the importance of contextual overlap. On the other hand, spatial words were more compatible with vocal responses than were physical location stimuli, but there was little evidence that this higher set-level compatibility plays any role in transfer of spatial associations. There is evidence indicating that similarity between two stimulus events facilitates the association between them (e.g., Yamaguchi & Proctor, 2009), but the present results clearly showed that such a finding does not generalize to the acquisition of incompatible associations between stimulus and response.

The third main finding is that the ease of acquiring new spatial associations depended on the type of stimuli; in particular, associations are acquired more quickly with physical location stimuli than with spatial words. The importance of stimulus type in the acquisition of spatial associations was also indicated by a previous study with manual responses (Proctor et al., 2009). The present study reinforces such a finding by showing that even with vocal responses, physical locations required fewer practice trials than did spatial words. Tagliabue et al. (2000) suggested that it was an acquisition of temporary S-R associations during the practice task that counters pre-existing S-R associations representing compatible mappings. This account would suggest a critical role of associability between stimulus and response. Marini et al. (2011) suggested that it may be a general rule of “emit an alternative response” that is acquired during practice. This account would suggest a critical role of response mode in transfer of new associations. However, neither account speaks much to the finding that incompatible mappings transfer more easily with physical locations than spatial words, regardless of the mode of responding. A tentative explanation for this outcome is that spatial information is conveyed more effectively with physical locations than with spatial words, so it is associated with responses better. This possibility needs to be subjected to further investigation.

Finally, the present study also demonstrated the importance of contextual overlap between the practice and transfer tasks with respect to response mode. In Experiments 3 and 4, there was little influence of the prior incompatible-mapping task on the Simon task when response mode differed between the two tasks, despite the fact that stimuli were of the same type between the tasks. Thus, contextual overlap in the mode of responding is also an important determinant of the transfer effect (also see Yamaguchi & Proctor, 2009). A puzzling finding is the outcome that the Simon effect was modulated in Experiment 3 when responses were made manually in the practice task and vocally in the transfer task. There is a study indicating that response representation adopted in a previous task can transfer to a subsequent task (Ansorge & Wühr, 2009); thus, it is possible that spatial codes used to represent manual responses in the practice task were also adopted to represent vocal responses in the transfer task. However, the results were not replicated when stimuli were spatial words in Experiment 4. Hence, this aspect of the study requires further scrutiny in future investigations.

A picture that emerges from the present study is that transfer of learned associations depends on contextual match between the practice and transfer tasks with respect to stimulus type and response mode, but not much on the relation between the two factors (set-level compatibility). Also, the results indicate that the acquisition of spatial associations depends primarily on stimulus type: spatial associations can be acquired more easily based on physical locations than on the spatial meanings of words. This makes sense because spatial information is extracted from the actual stimulus locations more quickly than from the concepts of locations, as implied by faster responses to physical location stimuli than to spatial words in the practice task even if responses were made vocally (see Table 1). A question still remains as to what was actually learned during practice, whether it is some abstract rule of stimulus-to-response translation or simple link between stimulus and response codes (Baroni et al., 2013). However, it is clear from the present findings that acquisition depends on the manner in which spatial information is represented.

The contextual dependence of transfer of spatial associations is consistent with general findings in the studies of animal and human learning (Bouton, 1993; Godden & Baddeley, 1975; Healy et al., 2006; Tulving & Thomson, 1973), and it is well supported by the previous studies using the transfer paradigm (Baroni et al., 2013; Proctor et al., 2009; Marini et al., 2011; Yamaguchi & Proctor, 2009). Although transfer can occur in the face of some changes in task contexts, such as sensory modality (Tagliabue et al., 2002) and type of manual responses (Yamaguchi & Proctor, 2009), especially after extensive training (Proctor et al., 2007; Vu, 2007), all of these cases are situations in which different stimuli and responses are used to convey physical locations; thus, all stimuli and responses are likely represented by the same spatial codes. Hence, it would be interesting to see whether transfer of spatial associations can be induced between different stimulus types (e.g., words vs. locations) and response mode (e.g., keypress vs. vocal) after more extensive training. If more extensive training allows transfer between different stimulus types or between response modes, then it would be important to determine whether any qualitative changes emerge in the way these different stimuli and responses are represented cognitively. Such studies would provide important implications as to how transfer of learning can be promoted in more practical task settings, such as in educational or occupational settings.