Distinguishing the time- and magnitude-difference accounts of the Simon effect: Evidence from the reach-to-touch paradigm

Finkbeiner, Matthew; Heathcote, Andrew

doi:10.3758/s13414-015-1044-9

Distinguishing the time- and magnitude-difference accounts of the Simon effect: Evidence from the reach-to-touch paradigm

Published: 29 December 2015

Volume 78, pages 848–867, (2016)
Cite this article

Download PDF

Attention, Perception, & Psychophysics Aims and scope Submit manuscript

Distinguishing the time- and magnitude-difference accounts of the Simon effect: Evidence from the reach-to-touch paradigm

Download PDF

Matthew Finkbeiner¹ &
Andrew Heathcote^2,3

2431 Accesses
14 Citations
1 Altmetric
Explore all metrics

Abstract

A Simon effect occurs when the irrelevant spatial attributes of a stimulus conflict with choice responses based on non-spatial stimulus attributes. Many theories of the Simon effect assume that activation from task-irrelevant spatial attributes becomes available before the activation from task-relevant attributes. We refer to this as the time-difference account. Other theories follow a magnitude-difference account, assuming activation from relevant and irrelevant attributes becomes available at the same time, but with the activation from irrelevant attributes initially being stronger. To distinguish these two accounts, we incorporated the response-signal procedure into the reach-to-touch paradigm to map out the emergence of the Simon effect. We also used a carefully calibrated neutral condition to reveal differences in the initial onset of the influence of relevant and irrelevant information. Our results establish that irrelevant spatial information becomes available earlier than relevant non-spatial information. This finding is consistent with the time-difference account and inconsistent with the magnitude-difference account. However, we did find a magnitude effect, in the form of reduced interference from irrelevant information, for the second of a sequence of two incongruent trials.

Action selection by temporally distal goal states

Article 20 June 2016

Markus Janczyk, Moritz Durst & Rolf Ulrich

Reaching trajectories unravel modality-dependent temporal dynamics of the automatic process in the Simon task: a model-based approach

Article 12 April 2019

Yael Salzer & Jason Friedman

Trajectory deviations in spatial compatibility tasks with peripheral and central stimuli

Article 10 July 2014

Timothy N. Welsh, Sandra M. Pacione, … Jerry Ou

Introduction

Being fast and accurate in a choice task often requires selective attention to relevant stimulus attributes, particularly when irrelevant attributes are associated with a conflicting response. In the laboratory, choice conflict is commonly studied using variants of the Stroop (1935) task, where the irrelevant dimension is usually linguistic, and the Simon task (Simon & Rudell, 1967), where interference arises from correspondence between the irrelevant spatial attributes of stimuli and responses. MacLeod’s (1991) review of the Stroop literature endorses two models, Cohen, Dunbar and McClelland (1990) – where interference depends on the relative strengths of relevant and irrelevant stimulus attributes, and Logan (1980) – which also allows for differences in the relative speed-of-processing of relevant and irrelevant attributes. Subsequent, highly influential papers have elaborated the strength-only perspective into a general behavioral and neural theory of conflict resolution and cognitive control (e.g., Botvinick, Braver, Barch, Carter, & Cohen, 2001; Miller & Cohen, 2001). Theories of the Simon task, in contrast, have in the main emphasized the importance of relative speed-of-processing.

Hommel (1993a) proposed that the task-irrelevant spatial information in the Simon task is encoded rapidly, but that it also decays rapidly. The magnitude of interference depends on the temporal overlap between response activation caused by the transient spatial information and response activation caused by a slower and more persistent encoding of task-relevant information. De Jong, Liang, and Lauber (1994) proposed a dual-route model (see also Frith & Done, 1986; Kornblum, Hasbroucq & Osman, 1990; Sanders, 1967) combining a fast direct route that transiently encodes spatial information with a second slower route that can, by itself, also be a source of interference. The slower route performs a transformation of the stimulus that is supposed to follow task instructions, but it can also produce interference because both relevant and irrelevant attributes are encoded (Hedge & Marsh, 1975). This stimulus-compatibility interference broadened earlier purely stimulus-response compatibility conceptualizations, leading Hasbroucq and Guiard (1991) to characterize the Simon effect as a spatial variant of the Stroop effect. They located the stimulus compatibility effect in a stimulus-identification stage but subsequently Lu and Proctor (1994) reported evidence consistent with both compatibility effects influencing the same response-selection stage.

Dual-route architectures are now common in models of the Simon effect (e.g., Ridderinkhof, 2002a; Tagliabue, Zorzi, Umilta, & Bassignani, 2000). The fast and slow routes are often referred to as, respectively, “direct” and “cognitive” routes, and the magnitude of Simon-effect interference is assumed to be a function of both the relative speed and strength of the two routes (Lu & Proctor, 1995). A direct route instantiating “an initial tendency to react towards the source of stimulation” (Craft & Simon, 1970, p.415) has strong face validity. This is particularly so in the most commonly used “horizontal” version of the Simon task – where the stimuli and response keys are located on the left and right – given the tendency to reach for a source of lateral stimulation with the corresponding hand that is learned early in life and is perhaps to some degree innate (Umilta & Zorzi, 1997).

The primary aim of the present study is to determine whether, as is usually assumed, activation from task-irrelevant attributes becomes available earlier than activation from task-relevant attributes. We will refer to this possibility as the “time-difference account.” Alternatively, it is equally possible that activation from both relevant and irrelevant attributes become available at the same time, but with the activation from irrelevant attributes initially being stronger. We will refer to this as the “magnitude-difference account.” The a priori plausibility of the time difference account can be validated by consulting the neuroanatomical paths supporting the processing of the different attributes and independently tested, such as by measuring simple reaction times to each, but the same cannot be said for the magnitude approach. Our analysis aims to address this imbalance.

Figure 1 schematically depicts the magnitude-difference and time-difference explanations for conflict trials, where positive values indicate activation favoring the correct response (see Ivanoff, Klein, & Lupianez, 2002, and Ivanoff, 2003, for related depictions of models of the time course of the factors underlying the Simon effect). Critically, the total activation, which is the sum of relevant and irrelevant activation, is similar in both cases, varying between −1 and 1, demonstrating that both explanations yield the same (correct) predictions for the time course of the conflict effect. This is especially true for measures, like response latencies or accuracy rates, insofar as both accounts correctly predict larger interference effects in the earliest responses and attenuated interference effects in later responses. Thus, to date it has been very difficult to distinguish the time- and magnitude-difference accounts of the Simon effect. Fortunately, it is possible to adjudicate these two accounts by including a neutral condition in a task that incorporates a measure that can reveal the emergence of the Simon effect over time.

To distinguish the time- and magnitude-difference accounts, one needs to distinguish the relative differences in time between when the task-relevant and task-irrelevant information first begin to influence the behavioral response. To isolate the influence of task-relevant information, we use a neutral stimulus, which is presented along the vertical meridian and, therefore, does not carry with it any task-irrelevant spatial information. We then take advantage of the temporal resolution afforded by the reach-to-touch paradigm (Buetti & Kerzel, 2008, 2009; Finkbeiner, Coltheart & Coltheart, 2014) to identify the points in time that the task-relevant and task-irrelevant spatial information first begin to influence the response. If these are the same points in time, the magnitude-difference account should be favored. If, though, one finds that the task-irrelevant spatial information begins to influence the response earlier than the task-relevant information, the time-difference account should be favored. In what follows, we take advantage of the ability of the reach-to-touch paradigm to reveal the emergence of experimental effects across time to identify the points in time that the task-relevant and irrelevant information first begin to influence the response. To anticipate our results, our findings support the time-difference account – insofar as we find that the task-irrelevant spatial position of the target stimulus begins to influence responses before the task-relevant information does – but that a magnitude-difference account has a role in modulation of the Simon effect by trial history (i.e., sequential effects, Hommel, Proctor, & Vu, 2004; Stürmer et al., 2002).

In the next section we examine three major sources of evidence about the time course of interference in the Simon task: (1) manipulations of discriminability and stimulus-onset asynchrony that affect the time at which relevant and irrelevant information becomes available, (2) differences in the magnitude of interference for fast and slow responses, and (3) hemispheric asymmetries in evoked response potentials (ERPs). We then report the results of two experiments that use quantitative characteristics of reaching movements to provide a fine-grained characterization of the time course of the influence of relevant and irrelevant information in the Simon effect.

The Simon effect

The Simon effect was first quantified in terms of an increase in mean error rates and response time (RT) for incongruent trials (e.g., a stimulus requiring a left-hand response presented on the right) relative to congruent trials (e.g., a stimulus requiring a left hand response presented on the left). In Simon and Rudell’s (1967) original study auditory stimuli were presented either on the left or right and participants were instructed to press a button with their left hand when they heard the word “left” and press a button with their right hand when they heard the word “right.” Interference can be defined as a positive quantity corresponding to RT or error rates for congruent trials subtracted from the corresponding measure for incongruent trials. Interference occurs both when stimulus-response associations are pre-existing, as in Simon and Rudell’s study, and also when they are defined by task instructions (e.g., left and right responses to, respectively, high and low tones, Simon & Small, 1969, or red and green patches, Hedge & Marsh, 1975).

Simon, Acosta, Mewaldt, and Speidel (1976) investigated the time course of the Simon effect by delaying responding. Their first experiment used what is now referred to as a “standard” design (Proctor Miles & Baroni’s, 2011), with responses made by either the left or right hand to visual stimuli presented on either the left or right. However, it added an extra requirement, that participants delayed their response until they heard a tone that could occur at durations of 0 ms, 150 ms, 250 ms, or 350 ms after the visual stimulus. The Simon effect was eliminated for the latter two intervals (see also Burle, van den Wildenberg & Ridderinkhof, 2005; Vallesi & Umiltà, 2009; Wiegand & Wascher, 2005; but see Ivanoff, 2003 for a different pattern when the cue could also indicate that the response be withheld). A second experiment using high- and low-tone stimuli sought to directly control when response selection could begin. Participants were either given the same stimulus-response mapping on all trials or the mapping was varied, being specified on each trial 1 s before, simultaneously with, or 150 ms, 250 ms, or 350 ms after the visual stimulus. The Simon effect disappeared for the latter three intervals. Simon et al. concluded that interference affects the response selection stage, but that it persists for only around 250 ms.

Several authors have shown that the Simon effect is reduced by manipulations that slow discrimination of the relevant attribute. Hommel (1993a) found a reduced effect when he slowed discrimination through greater retinal eccentricity, reduced contrast, adding overlaid patterns, or gradually building up the stimulus over time. Hommel (1994a) found the Simon effect completely disappeared for a difficult square versus rectangle discrimination, but was present in the same paradigm for an easier color discrimination. Roswarski and Proctor (1996) compared color, circle versus Square, and rectangle versus square discriminations that took increasingly longer times to perform, and found an associated decrease in the Simon effect. Vallesi and Umiltà (2009) also found that a decrease in the standard Simon task when the difficulty of a color discrimination was increased. All of these results are consistent with the idea that the slowed discrimination reduces the Simon effect by decreasing the temporal overlap of activation in the cognitive route and the early occurring – but then quickly reducing – activation in the direct route. Ivanoff, Klein, and Lupianez (2002) reported an apparent exception to this pattern, where the Simon effect was increased when responding was slowed by an inhibition of return (IOR: Taylor & Klein, 1998). However, this was attributed to IOR producing a large increase in interference (by delaying, increasing, or slowing the decay of activation in the direct route) rather than as evidence against the overlap hypothesis.

De Jong et al.’s (1994) seminal paper proposed a more fine-grained analysis of the Simon effect than is provided by mean RT, by quantifying interference as a function of overall response speed. They introduced the delta function (a term coined later by Ridderinkhof, 2002b), which is created by calculating a set of quantiles (e.g., deciles) for incongruent and congruent RT distributions and then plotting the difference between corresponding values (see Speckman, Rouder, Morey, & Pratte, 2008, for technical discussion). De Jong et al. found decreasing delta functions (i.e., a larger interference effect for fast than for slow responses), which they interpreted as indicative of a tendency to respond toward the source of stimulation that arose quickly after stimulus onset but which also dissipated quickly. They speculated that dissipation might be due to either passive decay or active suppression. Subsequently, support has been reported for both the decay (e.g., Hommel, 1994b) and suppression (e.g., Ridderinkhof, 2002a, 2002b; Forstmann, van den Wildenberg, & Ridderinkhof, 2008) accounts.

De Jong et al. (1994) came to the strong conclusion that “the results of the distributional analyses of spatial SRC [Simon] effects in the experiments reported here almost certainly provide a reliable estimate of the actual time course of these effects” (p.733). Burle et al. (2005) provided converging evidence for this claim, finding that delta function slopes were strongly correlated with changes in the Simon effect due to a direct manipulation of the relative times at which location and colour information became available. However, Zhang and Kornblum (1997; see also Roswarski & Proctor, 2003) challenged De Jong et al.’s conclusion. They pointed out that delta-function slopes could simply be due to differences between incongruent and congruent RT variance. If the slower incongruent responses are also less variable than the faster congruent responses, a decreasing delta function results. Zhang and Kornblum provided a detailed analysis of Kornblum’s (1994) Simon-effect data where variance was larger for incongruent than congruent conditions and the slope of the delta function was positive. They also pointed to a variety of other data sets where the ordering of congruent and incongruent variances was inconsistent.

In addition to these cued versions of the Simon task, there have been numerous reports with un-cued Simon tasks where the Simon interference effect does not decrease with increasing RTs, such as in a horizontal Simon task with auditory stimuli and also with visual stimuli and response hands crossed (Wascher, Schatz, Kuder, & Verleger, 2001), and when responses and visual stimuli are presented in a vertical arrangement (Wiegand & Wascher, 2005). To explain the different patterns of results obtained with horizontal and vertical displays, Wascher et al. (2001) proposed that the fast direct route corresponds to a specialized brain network associated with grasping that is only active with visual stimuli presented horizontally and bi-manual uncrossed responding (i.e., the standard Simon task). Wiegand and Wascher (2005, 2007) refined this position to address decreasing delta function found with uni-manual (i.e., different fingers on the same hand) responding, and vertical presentations with variable response rules. However, controversy surrounds Wascher et al.’s (2001) original explanation (cf. Roswarski & Proctor, 2003) and subsequent developments of it, with Proctor et al. (2011) saying that these developments need further testing.

The dual-route theories proposed by both De Jong et al. (1994) and Wascher et al. (2001) have also drawn support from ERPs, and in particular the lateralized readiness potential (LRP), which is calculated from the average asymmetry in ERPs over left and right primary motor cortex. In a vertical Simon task, De Jong et al. found that the LRP on congruent trials reached significant amplitudes at 170 ms after stimulus onset. On incongruent trials, they observed an initial asymmetry corresponding to the wrong choice in the same time frame. This initial asymmetry or “LRP dip” (Leuthold, 2011) in the wrong direction on incongruent trials generally reaches significant amplitudes in the same time window (or slightly later) as the LRP in the correct direction on congruent trials (see also Stürmer, Leuthold, Soetens, Schröter, & Sommer, 2002; Valle-Inclán, 1996). While this finding is clear in establishing that the task-irrelevant spatial information is able to gain control of the response formulation process, we are not aware of any LRP studies that have attempted to distinguish between the time- and magnitude-difference accounts. In fact, we are aware of only one LRP study that tried to quantify the response to the task-relevant source of information in isolation (Cespón, Galdo-Álvarez, & Díaz, 2012). Somewhat surprisingly, Cespón and colleagues found in this study that the LRP for incompatible trials (task-irrelevant information) was delayed relative to the LRP for neutral trials (task-relevant information). Taking this result at face value, it would appear to go against the widely-held view that task-irrelevant information gains control of the response formulation process earlier than the task-relevant information.

It is not clear at this point how this result should be interpreted, but we note that the neutral stimuli were presented at fixation in this study, meaning that the neutral stimuli were potentially better attended than the peripherally presented stimuli. Furthermore, and more generally, it is well known that the LRP dip in the horizontal Simon task is very difficult to disentangle from early sensory ERP components associated with laterally presented stimuli (Praamstra, 2007). Indeed, Leuthold (2011) concluded that further work is required to clarify the conclusions that can be drawn from findings about an LRP dip given the inconsistent results for vertical Simon task and the potential for confounds by sensory components in the horizontal Simon task. In the present paper, we use statistics measured from reaching movements in a horizontal Simon task to provide the same sort of fine-grained temporal analysis of the time course of the Simon effect afforded by electroencephalograms (EEGs) while avoiding the confounds noted by Praamstra and Leuthold.

Reach-to-touch paradigm

The reach-to-touch paradigm is becoming an increasingly popular tool for cognitive scientists. Reaching trajectories offer researchers a continuous behavioral measure that promises to reveal the unfolding of decision processes with much better temporal resolution than standard behavioral measures. Not surprisingly, the reach-to-touch paradigm has already been used to investigate temporal properties of the Simon task. For example, Buetti and Kerzel (2008, 2009) had subjects perform the Simon task by reaching out and touching one of two response regions, depending on the color of the target stimulus. In their studies they used “initial-movement angle” as their dependent measure. Initial-movement angle is calculated by taking the angle between the linear path connecting the start and end positions and the position of the hand at the point corresponding to 20 % of the total distance traveled. Buetti and Kerzel argued that the initial-movement angle reveals important details about the motor plan that is assembled just prior to movement initiation.

Buetti and Kerzel (2008, 2009) found an average deviation in the initial-movement angle towards the incorrect response for incongruent trials relative to congruent trials. Consistent with the decreasing delta function reported by De Jong et al. (1994), the interference effect in the initial-movement angle decreased as the period of time between stimulus onset and the movement initiation increased. We will refer to this period of time as “movement initiation time” (MIT). More specifically, they found the distribution of initial movement angles to be bimodal in the incongruent (but not congruent) condition for trials with the fastest MITs (first 20 % of the MIT distribution). In contrast, the distribution of initial movement angles was unimodal for trials with longer MITs in both congruent and incongruent conditions. This finding suggests that the direction of the initial reaching response on some trials was strongly affected by the location of the target stimulus, but only if the reaching movement was initiated relatively soon after stimulus onset.

Although these findings from the reach-to-touch paradigm are encouraging in their consistency with the RT data, we note that Buetti and Kerzel (2008, 2009; see also Kerzel & Buetti, 2012) consistently observed strong Simon effects in subjects’ fastest set of responses (first quantile). As a result, the findings reported by Buetti and Kerzel are very similar to the RT effects reported elsewhere in that they reveal a strong Simon effect early on in stimulus processing followed by a weak to non-existent effect later on, but fail to reveal the gradual emergence of the effect. To distinguish the time- and magnitude-difference accounts, one needs to identify when the task-relevant and irrelevant information first begins to influence responses. Thus, the goal of the present study was to take full advantage of the temporal resolution offered by the reach-to-touch paradigm to establish when in stimulus-processing time (i.e., relative to stimulus onset) the Simon effect first emerges.

The potential of reaching movements to reveal the real-time unfolding of mental processes hinges on their co-occurrence. That is, a reaching movement can only reasonably be thought to provide a real-time window on internal cognitive processes if it is made while those processes are still ongoing. To achieve this end, in the present study we made use of the response-signal procedure (Finkbeiner et al., 2014; Ghez, Favilla, Ghilardi, Gordon, Bermejo, & Pullman, 1997; Reed, 1973, 1976), whereby subjects are trained to initiate their movement within a 300-ms wide response window that “opens” 100 ms before an imperative go signal and “closes” 200 ms after the go signal. The imperative “go” signal is the final beep in a sequence of three beeps (cf. Kello & Plaut, 2000 for a similar procedure). Importantly, we positioned the train of three beeps on each trial so that the final “go” beep was presented at specific points in time relative to the onset of the target stimulus. In this way, we elicited reaching responses that commenced across a range of stimulus viewing times, from before target onset to ~400 ms after target onset. Note that the response-signal procedure alone with a button-press response is unable to measure early effects, as stated by Hilchey, Ivanoff, Klein, and Taylor (2011 see also Ivanoff, Blagdon, Feener, McNeil, & Muir, 2014): “In this and other efforts … we have been unable to get participants to respond much earlier than about 250 ms post target. Thus, we have been unable to visualize information processing dynamics reflecting the hypothesized accumulation of the irrelevant location information.” (p. 240)

In terms of the dependent measure, ours is very similar to the initial movement angle used by Buetti and Kerzel (2008, 2009), but differs in that we use the mean lateral or “x” velocity (see Fig. 2 for the orientation of the x-axis in our paradigm) calculated across the first 150 ms of the reaching response (cf. Finkbeiner, et al., 2014; Quek & Finkbeiner, 2013; 2014). Similar to Buetti and Kerzel’s use of the 20 % point in the movement, the reason for limiting the dependent measure to the initial segment of the trajectory is because the initial segment of the response is informative of the motor plan that participants had assembled just prior to movement initiation. To establish the time course of the Simon effect, we grouped the reaching movements according to when they were initiated relative to the onset of the target stimulus (MIT) and analyzed the initial x-velocities as a function of this period of target viewing time. As we demonstrate, this analysis allows us to map out the onset, growth, and subsequent decrease of the Simon effect in stimulus-processing time. To anticipate our results once again, we find that subjects’ movements early on in stimulus processing (0~200 ms) are strongly influenced by the stimulus’ task-irrelevant spatial information but not its task-relevant properties. In contrast, movements initiated >200 ms after stimulus onset reveal how the task-relevant information begins to gain control of the responses. Taken together, this pattern of results is consistent with the time-difference account and presents a serious challenge to the magnitude-difference account.

Experiment 1

In Experiment 1, subjects were instructed to respond to the meaning of a target stimulus (the word “left” or “right”) while ignoring its position.^{Footnote 1} Subjects responded by reaching out and touching the appropriate response panel that was positioned to the left or right of the computer monitor (see Fig. 2). Note that the monitor was far enough behind the response panels (see “Apparatus” section below) that participants were able to reach out and touch both response panels without their arm obscuring the monitor. The target stimuli were presented at one of four positions, at equal eccentricities, either to the left or right or above or below the center of the screen. Left and right positioned targets constituted the congruent and incongruent conditions. The stimuli presented above and below fixation constituted the neutral condition.

A neutral condition was included to provide a baseline that enabled us to establish at what points in time the different stimulus attributes (word meaning vs. position) begin to affect the decision process. We did not present a neutral stimulus at fixation as in some earlier studies (e.g., Cespón, et al., 2012; Wiegand & Wascher, 2005) to avoid confounding the neutral condition, and hence differences between it and the congruent or incongruent condition, with the effects of a difference in eccentricity.

Methods

Subjects

Eighteen undergraduates (12 female) from the psychology subject pool at Macquarie University participated in return for course credit. All subjects had normal or corrected-to-normal vision and all reported being right-handed.

Apparatus

Subjects sat at a desk with a CRT monitor placed 75 cm from the front edge of the desk (see Fig. 2). Two lateral response panels (30 cm × 9 cm) were positioned 50 cm from the front of the desk, one on the left edge of the desk, and one on the right edge. The desk was 75 cm wide. Subjects initiated a trial by moving their right index finger to the “start” position, which was located in the middle of the desk and close to its front edge. To respond, subjects reached out and touched the appropriate response panel; the left response panel for the target word LEFT, and the right response panel for the target word RIGHT.

We used a Polhemus Liberty (240 Hz) motion capture system with a small sensor taped to the tip of the subject’s right index finger to record the subject’s movements during the experiment. The advantage of using a motion-capture device, as opposed to a computer mouse (e.g., Freeman, Ambady, Rule, & Johnson, 2008; Koop & Johnson, 2011), is that it allows subjects to move freely and naturally in three dimensions. This is important to obtain movements with bell-shaped velocity profiles that are typical of natural movements and which are critical to the analyses that we report below.

Subjects wore headphones (Sennheiser 280 Pro), which we used to present the three beeps guiding response initiation, the final of which constituted the go signal. In Experiment 1 we used two different stimulus-onset asynchronies (SOA) between the target stimulus and go signal: 0 ms and 300 ms relative to the onset of the target. As mentioned, subjects were asked to initiate their response within a 300-ms wide window that encompassed the go signal. If subjects did not initiate their movement within the response window, the trial was terminated with a buzz and the appropriate visual feedback (e.g., “Too Early!” or “Too Late!”) was presented. Trials that were terminated due to a response window failure were re-presented at a later point in the experiment. Upon initiating a movement, subjects were required to maintain a continuous forward movement over the first 50 recorded samples (~208 ms) and trials were terminated with a buzz and visual feedback if this criterion was not met.

Design and procedure

Each trial began with a fixation cross for 500 ms at which point the sequence of three beeps began. The final go beep was presented 1,200 ms after the onset of the fixation cross, either simultaneously with the target stimulus (0-ms SOA) or 300 ms after the target stimulus (300-ms SOA). The target stimulus appeared in one of four different positions (right, top, left, bottom), with the innermost edge 6° from fixation. The two target words (LEFT, RIGHT) were fully crossed with the three levels of Trial Type (Congruent, Incongruent, Neutral), to yield six different Trial-Type × Target-Word combinations. These six combinations were repeated eight times per block for a total of 48 trials per block. The target appeared above fixation on half of the neutral trials and below fixation on the other half of neutral trials. Thus 33 % of the trials were congruent (e.g., the word LEFT presented on the left), 33 % were incongruent (e.g., the word LEFT presented on the right), and 33 % were neutral (e.g., the word LEFT presented at the top or bottom position). The experiment began with two blocks of practice trials (N = 96), followed by ten blocks of experimental trials (N = 480).

Data analysis

Perhaps the most common way to analyze reaching trajectories is to calculate the “path offset” or “curvature” of the trajectory (Atkeson & Hollerbach, 1985; Smit & van Gisbergen, 1990). However, this dependent measure relies on movement corrections that occur relatively late within the reaching movement and, hence, is relatively insensitive as a measure of how responses systematically vary with stimulus-viewing times. Because our primary interest was in establishing how the Simon effect unfolds during the course of stimulus processing, our approach is different. As we alluded to above, we couple the response-signal procedure with an analysis of the initial x-velocity, which reveals information about the motor plan that participants had assembled just prior to movement initiation. We then employ a distribution analysis whereby we bin the trajectories according to their MIT to see how the Simon effect emerges over stimulus viewing time.

Note that in a small percentage of trials, the MIT value is negative due to subjects having initiated their movement within the 100 ms of the response window that precedes target onset (this occurs only on 0-ms SOA trials). Having a small percentage of trials with negative target-viewing times is intended. Because the initial segment of movements that are initiated prior to target onset should not reveal any experimental effects, these trials provide a baseline.

To calculate x-velocity, we first filtered the position data with a two-way low-pass Butterworth filter at 7 Hz and then we calculated the derivatives (velocity, acceleration) through numerical differentiation. We then identified the movement’s onset and offset. Movement onset was defined as the first of 20 consecutive samples that exceeded 10 cm/s in tangential velocity^{Footnote 2}; movement offset was defined as the first of 20 consecutive samples that occurred after peak velocity and that fell below 10 cm/s in tangential velocity. We then used a modified orthogonal polynomial trend analysis (OPTA) to improve the signal to noise ratio (cf. Woestenburg Verbaten, Van Hees, and Slangen’s (1983; see also Karayanidis Provost, Brown, Paton, & Heathcote, 2011). In our modified version of this approach (cf. Finkbeiner et al., 2014; Quek & Finkbeiner, 2014), the MITs for each trial are used as the only covariate in a polynomial regression model of the observed x-velocity profiles. With this approach, one can use the regression model to analyze the x-velocity profile for any given MIT, allowing for a very fine-grained analysis of changes in the reaching response as a function of target-viewing time. Because all trials from each cell of the experimental design contribute to the regression model, this approach is very effective in increasing the signal-to-noise ratio. For example, Woestenburg et al. (1983) reported an increase in signal-to-noise ratio over simple averaging and Wiener filtering by up to a factor of ten and Karayanidis et al. (2011) found that OPTA improved the signal-to-noise ratio by 2.5 times over simple averaging. In our OPTA procedure, trials within each individual design cell were ordered by MIT latency from 1 to n, such that the trial with the shortest MIT was ranked first and the trial with the longest MIT was ranked nth (where n is the number of trials in that design cell). We then fitted a polynomial regression model to the x-velocity profiles that included MIT rank as the only covariate and polynomial terms up to the 15th order. Terms that did not explain significant variance were dropped from the model so that only significant coefficients were used to generate predicted x-velocity profiles (one for each trial). The specific OPTA procedure used here has been described elsewhere and we point the interested reader to those publications for a detailed description (cf., Finkbeiner et al., 2014; Quek & Finkbeiner, 2014). Following the OPTA procedure, we averaged the fitted x-velocity values from the first 150 ms of the reaching movement and submitted this mean value, which we refer to as initial x-velocity, to a linear mixed-effects model (LMM, Bates, 2005), with MIT semi-decile included as a fixed effect.

To visualize the effect of target-viewing time (i.e., MIT latency) on the complete reaching response (see the upper panel of Fig. 3 for the distribution of MIT latencies in Experiment 1), we used the corresponding MIT rank values to group the x-velocity profiles into 20 bins of equal proportion (i.e., semi-deciles). The mean trajectory (averaged across subjects) from each inter-decile range of the MIT ranks is presented in the bottom panel of Fig. 3 for the incongruent condition. It shows that the longer subjects delay their movement, the more quickly they reach peak positive x-velocity (i.e., movement in the correct direction). The same was true for congruent and neutral conditions.

Results

The final (i.e., movement end-point) accuracy of subjects’ reaching responses was very high, 99.6 % overall, 99.9 % in the congruent condition, and 99.2 % in the incongruent condition. Presumably this was because even if the initial movement is in the wrong direction, subjects take approximately 1 second to finish their movement, which appears to be sufficient time to correct any initial waywardness in the hand’s flight path and arrive at the correct location. Despite the very high overall accuracy, a paired t-test indicated a reliable difference between the congruent and incongruent conditions (t(17) = 2.66, p = 0.016). The accuracy in the neutral condition (99.7 %) was reliably different from the incongruent condition (t(17) = 2.36, p = 0.03), but the difference between the congruent and neutral conditions was not reliable (t(17) < 2).

The analyses of initial x-velocity was limited to correct responses. We used an incremental model comparison approach in which we evaluated the contribution of each term by comparing a model that included the term of interest with a model that did not include that term. In each case, likelihood tests were used to indicate which model should be preferred. Only terms that significantly improved the fit of the model were included. We included subjects as a random effect and Trial Type (congruent, neutral, incongruent) and MIT percentile (semi-decile) as fixed effects. Given the curvilinear relationship between initial x-velocity and MIT, we included up to fourth order terms for MIT percentile in the model (including higher-order terms did not improve the fit of the model further).

We report the coefficients (b), standard errors (SE), and t-values for the model that provided the best fit to the data. As is typical in reporting the results of LMM analyses, we take a coefficient magnitude of at least twice its standard error (i.e. |t| > 2) as our criterion for significance (cf., Kliegl, Masson, & Richter, 2010; Kliegl, Wei, Dambacher, Yan, & Zhou, 2010). For the purposes of exposition we only report terms that are linear in MIT percentile, but note that including the higher-order terms leads to the same interpretations. Coefficients for the trial-type factor used the congruent condition as a baseline, so that negative coefficients indicate smaller x-velocities relative to the congruent condition.

There was a strong effect trial type with a larger x-velocity for the congruent condition than the incongruent (b = -0.58, SE = 0.01, t = 52.48) and neutral (b = -0.30, SE = 0.01, t = 28.67) conditions. There was also a significant increase in x-velocity with MIT percentile (b = 42.67, SE = 0.68, t = 61.94) and an interaction between trial type and MIT percentile (incongruent: b = -15.03, SE = 0.98, t = 15.27; neutral: MIT b = -6.93, SE = 0.96, t = 7.20). The nature of this interaction is clear in Fig. 4. Movements initiated within the first ~100 ms after target onset were, on average, “down the middle.” That is, the net movement was neither in the correct (positive x-velocities) nor in the incorrect (negative x-velocities) direction. In contrast, movements that commenced after ~100 ms were dramatically different depending on Trial Type. On congruent trials, initial x-velocity steadily increased with target viewing time; on incongruent trials, initial x-velocity became increasingly negative (i.e., in the wrong direction) for a period of time (~150 ms) before becoming positive again. On neutral trials, the net movement was “down the middle” (x-velocities were not different from zero) for movements that were initiated within the first ~300 ms after target onset.^{Footnote 3}

To better understand the unfolding of the effect of trial type, we then conducted a series of one-sample t-tests (Bonferroni corrected, 17 degrees of freedom) to determine when (i.e., at which MIT percentile) the net movements were significantly different from zero. On congruent trials, the initial x-velocities were all reliably greater than zero (net movement was in the correct direction) beginning with movements that commenced in the seventh MIT percentile (~109 ms). On incongruent trials, the pattern was biphasic. First, the initial x-velocities were reliably less than zero (net movement was in the wrong direction) for movements that commenced in the eighth through the 12th MIT percentile (~128 ms through ~287 ms). They then became reliably greater than zero for movements that commenced in the 17th through the 20th MIT percentile (~356 ms through ~430 ms). On neutral trials, initial x-velocities were reliably greater than zero beginning with movements that commenced in the 13th MIT percentile (~300 ms).

Discussion

Experiment 1 revealed that the initial direction of movements that commence ~150 ms after target onset in both the congruent and incongruent conditions is strongly influenced by the location of the stimulus. This clear influence of the task-irrelevant spatial information stands in sharp contrast to the lack of any influence by the task-relevant information during this early phase of stimulus processing in the neutral condition (where the stimulus’ position is orthogonal to the response). However, the influence of stimulus location on the initial direction of the reaching response is short-lived. The initial direction of movements that commence just 150 ms later (~300 ms after target onset) is clearly controlled by the meaning of the target word, not its location. This is seen most clearly in the comparison between the incongruent condition and the neutral condition, where the target meaning should be the only influence on x-velocity. This finding appears to clearly reject a pure magnitude-difference explanation, which assumes that both task-relevant and task-irrelevant information are available at the same time.

However, it is possible that encoding of task-relevant information was slowed in the neutral condition because participants paid less attention to the stimulus locations above and below fixation than to the locations to the left and right of fixation. This may have occurred because, even though the overall proportion of neutral trials was the same as the proportion of congruent and incongruent trials (one-third each), the proportion of stimuli in the upper and lower locations was less than it was in the left and right locations (one-sixth vs. one-third). Experiment 2 addresses this potential confound by using each of the four possible stimulus locations equally often, so that neutral trials were twice as common as congruent and incongruent trials.

Experiment 2

An additional motivation for Experiment 2 was to establish the generality of the pattern of findings observed in Experiment 1 by using red and green squares (as opposed to words) as targets. This allows us to compare the dynamics of the Simon effect that we have observed with targets that enjoy over-learned response mappings (i.e., the words “LEFT” and “RIGHT” in Experiment 1) with the Simon effect that arises with targets that share an arbitrary relationship with response location (i.e., the colors red and green mapping onto “left” and “right” responses). We also implemented an improved SOA manipulation in order to eliminate the bimodal distribution of MITs shown in Fig. 3 and to obtain a more even spread concentrated around the critical middle region of the MIT range.