Task relevance determines binding of effect features in action planning

Mocke, Viola; Weller, Lisa; Frings, Christian; Rothermund, Klaus; Kunde, Wilfried

doi:10.3758/s13414-020-02123-x


Open access
Published: 10 September 2020

Volume 82, pages 3811–3831, (2020)

Attention, Perception, & Psychophysics

Viola Mocke ORCID: orcid.org/0000-0003-0474-6566¹,
Lisa Weller¹,
Christian Frings²,
Klaus Rothermund³ &
…
Wilfried Kunde¹

Abstract

Action planning can be construed as the temporary binding of features of perceptual action effects. While previous research demonstrated binding for task-relevant, body-related effect features, the role of task-irrelevant or environment-related effect features in action planning is less clear. Here, we studied whether task-relevance or body-relatedness determines feature binding in action planning. Participants planned an action A, but before executing it initiated an intermediate action B. Each action relied on a body-related effect feature (index vs. middle finger movement) and an environment-related effect feature (cursor movement towards vs. away from a reference object). In Experiments 1 and 2, both effects were task-relevant. Performance in action B suffered from partial feature overlap with action A compared to full feature repetition or alternation, which is in line with binding of both features while planning action A. Importantly, this cost disappeared when all features were available but only body-related features were task-relevant (Experiment 3). When only the environment-related effect of action A was known in advance, action B benefitted when it aimed at the same (vs. a different) environment-related effect (Experiment 4). Consequently, the present results support the idea that task relevance determines whether binding of body-related and environment-related effect features takes place while the pre-activation of environment-related features without binding them primes feature-overlapping actions.

Introduction

How do humans plan motor actions? A possible, so-called ideo-motor, view on this process originates from the idea that we generate motor activities by setting up a mental representation of the perceptual effects that a certain motor activity will produce (Greenwald, 1970; James, 1981; Shin, Proctor, & Capaldi, 2010; Stock & Stock, 2004). Anticipating an action effect, which is “a change of sensory input that is triggered by a bodily movement” (Pfister, 2019, p. 154), should reactivate the bodily movement to which the action effect has been associated through previous experience. Indeed, there is now ample evidence that such perceptual representations mediate action production (Elsner & Hommel, 2001; Kunde, Koch, & Hoffmann, 2004; Pfister, 2019; Pfister & Kunde, 2013; Shin & Proctor, 2012). In other words, motor activities seem to be mentally represented and planned in terms of those perceptual events that a to-be-accessed motor activity will foreseeably produce.

Feature codes in action planning

Perceptual events in general and perceptual effects that mediate action planning in particular are likely coded in terms of features (Frings, Hommel, et al., 2020; Frings, Koch, et al., 2020; Hommel, 1998, 2004, 2009; Hommel, Müsseler, Aschersleben, & Prinz, 2001). The idea that action planning is based on features is not new. For example, Rosenbaum (1980) argued that motor activities are prepared as motor programs with free slots that are filled by specific feature values during action planning or, put differently, that filling in these feature values is itself the planning process (see Leuthold & Jentzsch, 2011, for corresponding evidence). According to the theory of event coding (TEC), specifying a single feature is usually not sufficient to plan an action (Hommel et al., 2001). Rather, codes of multiple action effect features would become activated in a first step. At this point, the merely activated features should prime other action plans with overlapping features. As a second step, the activated feature codes would become integrated, or bound, through associative connections, which now results in interference with, rather than facilitation of, partly overlapping action plans.

Importantly, the authors describe action plans as temporary composites of feature codes that describe to-be-produced external, or distal, events. That is, while they clearly rule out the possibility of event coding on the basis of proximal information (that is, neural codes or muscular innervation patterns), they argue that, depending on the intended action effect, feature codes can represent attributes of any kind of perceptual effect. Following this logic, it should be possible to plan actions by means of anticipated body-related feedback as well as feature codes of environment-related action effects that have an even more distal and oftentimes more artificial nature (Hommel et al., 2001). To clarify, planning a right index finger keypress might not only comprise representations of the anticipated tactile or proprioceptive impression of the right index finger movement (body-related action effects), but also of the sound (e.g., the click of the keyboard) or of some visual consequences that this movement might reliably produce (e.g., a certain letter on a computer screen, environment-related action effects). This basic underlying idea of action planning by means of to-be-produced effects should be kept in mind for the remainder of the present work. That is because, when using the term “feature” in the present work, we consistently refer to features of to-be-produced action effects, rather than features of the motor responses that produce these effects. Also, for the sake of brevity, we refer to features of body-related or environment-related effects as body-related and environment-related features.

Pfister (2019) argued that in many experimental tasks representing actions by body-related features (e.g., which finger to use to press a certain key) should be sufficient to achieve the task goal. By contrast, using feature codes relating to environmental action effects might only be favorable if representing the action with feature codes of its body-related effects alone is in some way disadvantageous. For instance, this seems to hold true when the environmental effects are highly similar to the corresponding body movements (e.g., a cursor movement on screen mirroring a hand movement; Shin & Proctor, 2012, Experiment 2). In that case, environment-related features even become part of action representations when instructions demand that participants attend to body-related effects and ignore environment-related effects (see also Janczyk, Pfister, & Kunde, 2012, for related findings on hand-tool compatibility effects). Furthermore, when instructions explicitly demand that participants produce environment-related events, actions are likely to be represented by environment-related features (Hommel, 1993). Such instructions can, for example, force participants to pay attention to changes in a display instead of their own movements, hence making the environment-related effects more salient (Janczyk et al., 2015, Experiment 2).

In particular, the latter observation, that increased attention to (or salience of) environment-related effects promotes the integration of such effects into action representations, raises the question of which role task-relevance of action effects plays in event coding (i.e., whether a feature relating to a particular action effect has to be part of the action plan for the actor to perform the correct action). According to the authors of TEC, all features on task-relevant effect dimensions should have a higher basic activation level than those on irrelevant effect dimensions due to a process called intentional weighting (see Memelink & Hommel, 2013). As a result, if planning an action activates such a task-relevant feature code, its activation level should be higher than that of activated irrelevant feature codes. A higher activation level of a feature might increase the chances of being bound to other features. Consequently, task-relevance might determine the integration of feature codes in action plans. In other words, the question remains whether a task-irrelevant feature is less likely than a relevant feature to be bound into the action plan, if it is bound at all.

Partial feature overlap costs

As mentioned above, the unique idea of the feature approach pursued here is that features are temporarily bound together. Such binding should have consequences for other actions that occur in close temporal proximity to the planning process. Firstly, binding a feature while planning a certain action might render this feature less accessible for other actions, which require this feature as well. A second, but surely not incompatible, possibility is that this feature is still accessible for other actions but reactivates other, unwanted features to which it is still bound (Frings, Hommel, et al., 2020a; Frings, Koch, et al., 2020b; see General discussion section for a more thorough discussion of the different mechanisms).

To illustrate such costs, consider a study by Stoet and Hommel (1999, Experiment 2). These authors asked participants to plan an index finger movement with the left or right hand (which likely involves binding of the features left or right and hand, action A). While participants were planning this movement, that is, before its eventual execution, a pedal action with the left or right foot was requested (which likely involves binding the features left or right and foot, action B). In line with the binding hypothesis, initiating the pedal action was delayed when it relied on a feature also used to concurrently plan the hand action, that is, in the partial feature-overlap condition (e.g., a left foot action while a left hand action was planned), as compared to a pedal action in the no feature-overlap condition (e.g., a right foot action while a left hand action was planned). Crucially, in the design by Stoet and Hommel (1999), both feature dimensions available to plan the actions (left versus right and hand versus foot) were body-related, as they referred to spatio-anatomical characteristics, and task-relevant.

Other work that adopted this design (Fournier, Behmer, & Stubblefield, 2014; Mattson & Fournier, 2008; Mattson, Fournier, & Behmer, 2012) also used the required hand (left or right) as a potentially overlapping feature dimension (notably, again body-related and task-relevant). These studies also revealed performance costs for action B in the partial feature-overlap condition (i.e., when it required the same hand as a previously planned action A) compared to a no-overlap condition in which action B required the other hand (see also Fournier & Gallimore, 2013; Fournier, Gallimore, Feiszli, & Logan, 2014, for similar observations with movement direction as an overlapping feature dimension). Remarkably, such partial repetition costs even occur when both actions make use of different modalities, with action A being a manual response with the right or left hand and action B a vocal response that imposes a demand on working memory (uttering "right" or "left" as a response to a visual stimulus, Fournier et al., 2010, Experiments 1 and 3). Therefore, both action plans can overlap regarding the same task-relevant feature dimension although the features refer to entirely different body-related effects depending on the action (the experience of pressing a key vs. uttering a word).

Partial feature overlap benefits

Interestingly, such costs of partial feature overlap as compared to no feature overlap have not always been obtained. For example, Kunde, Hoffmann, and Zellmann (2002, Experiment 3) asked participants to plan a left- or right-hand movement (task-relevant, body-related feature), which would foreseeably produce a high or low tone (task-irrelevant, environment-related feature) for action A. However, before executing this movement, participants had to execute action B, a weak or forceful finger press (task-relevant, body-related feature) that equally foreseeably produced a high or low tone. Consequently, both actions could predictably either produce the same or different tones. This design resulted in a partial overlap condition (e.g., when participants planned a right hand movement, which would produce a high tone, and executed a weak keypress, which resulted in a high tone) and a no overlap condition (e.g., when participants instead executed a weak keypress, which resulted in a low tone). If binding took place in the same way as in the above-described experiments, a similar pattern of results, that is, feature overlap costs, should occur. By contrast, the weak or forceful actions were initiated faster if they resulted in the same tone as the planned (left- or right-hand) action rather than a different one. That is, participants’ performance was superior with partial feature overlap as compared to no feature overlap. Following TEC, this finding suggests that while features of the to-be-produced tones did affect performance in concurrently executed actions, these features were apparently not bound into an action plan, as they did not interfere with the executed actions. The authors argued that facilitation of actions that share features of a certain environment-related outcome (a tone in this case) with a currently planned action might be quite useful as this would allow quick replacement of an initially planned action with a functionally equivalent one if, for some unforeseen reason, the initially planned action cannot be carried out (which might thus be termed a “functional equivalence” benefit). This interpretation is in line with a study by Janczyk and Kunde (2014) in which participants first planned an index or middle finger keypress, which would foreseeably produce a certain action effect. In some trials, they were then asked for a freely chosen keypress with the middle or index finger of the other hand. Importantly, participants tended to overcome their preference for using the homologous finger when switching fingers produced the same action effect as planned (vs. a different one).

In fact, partial overlap benefits as observed by Kunde et al. (2002) seem to occur whenever feature activation, but not binding, takes place for action A or action B. Regarding the former, Stoet and Hommel (1999, Experiment 3) showed that a lack of time and incentive for planning action A in advance led to better performance in the partial overlap condition than the no overlap condition. The authors explained this pattern in the sense that feature codes were cued and hence pre-activated, but not yet bound when participants executed action B. Regarding the lack of integration of action B features, Wiediger and Fournier (2008) and Fournier, Wiediger, and Taddese (2015) found partial overlap benefits when they had participants perform a visually guided reach action with the right or left hand (action B) while holding a right- or left-hand action in preparation (action A). According to the authors, visually guided reach actions carried out online (i.e., adjusted while moving according to the discrepancy between current and intended hand position) invoke automatic visuomotor mechanisms and thus, unlike actions that are based on advance planning, do not interfere with concurrently held action plans (for reviews of motorvisual facilitation and impairment see Thomaschke, Hopkins, & Miall, 2012a, 2012b).

Two possible explanations: Task-relevance versus body-relatedness

What are the reasons for observing partial feature overlap costs in some cases but partial feature overlap benefits in other cases despite sufficient planning of action A and cognitive control required for action B? Based on the previous literature review we see two not mutually exclusive explanations. First, whether features are integrated into an action plan might be a matter of feature relevance (task-relevance hypothesis). Specifically, only those features that are used to distinguish between action alternatives might be bound to form an action plan. It seems likely that features like left and hand are rather relevant to distinguish a left hand movement from a right foot movement as used by Stoet and Hommel (1999). By contrast, the tones that were produced by certain movements in the study by Kunde et al. (2002) were task-irrelevant. They were consistently produced by certain finger movements, but neither did the instructions emphasize these tones, nor did they ask to produce them. Participants might thus have relied on features other than those that relate to the produced tones to distinguish between action alternatives.

It should be noted that this hypothesis is not at all trivial, as in many situations other than action planning there is clear evidence for the binding of even irrelevant features into event files, such as bindings of responses and task-irrelevant distractor features (Frings, Rothermund, & Wentura, 2007; Rothermund, Wentura, & De Houwer, 2005) or short-term binding of responses to response-evoked perceptual feedback (Dutzi & Hommel, 2009; Elsner & Hommel, 2001; Hommel, 2005, Experiment 2; Moeller, Pfister, Kunde, & Frings, 2019). However, the bindings in all of these studies resulted from actual action execution and not action planning (i.e., a top-down process resulting from the internally driven anticipation of action effects instead of a bottom-up process arising from the experience of an actual response-effect episode). So, the task relevance hypothesis requires empirical testing in the case of feature-based action planning.

Another possible explanation relates to the type of events that such features describe. The participants in the study by Stoet and Hommel (1999) produced by their movement nothing perceptible other than the observable body movement itself. Thus, the perceptual event that represents that movement likely included only features that relate to the body itself. By contrast, in the study by Kunde et al. (2002), the participants’ movements produced a tone, which, like other (e.g., visual) effects, has a body-external nature. Consequently, features like high and low, when relating to a tone, can code a body-external event. Perhaps only features that relate to body-related events, but less so features of body-external events, become bound when it comes to planning an action (body-relatedness hypothesis).

In a recent study, Moeller, Pfister, Kunde, and Frings (2019) studied stimulus-response-effect (S-R-E) episodes in their entirety. In their design, a task-relevant stimulus (e.g., a letter) was accompanied by an irrelevant distractor (e.g., a color) and prompted a certain response (i.e., a task-relevant body-related effect such as a left index finger movement), which in turn produced a certain task-irrelevant perceptual effect (e.g., a tone). The authors found that features of the irrelevant distractor (the color) were bound to features of the task-relevant body-related effect (the finger movement) but not of the task-irrelevant environment-related effect (the tone). Bindings between task-relevant (body-related) and task-irrelevant (environment-related) effect features (the finger movement and the tone) were also clearly demonstrated. The authors explain the fact that there was binding of irrelevant distractor features to body-related effect features, but no binding to environment-related effect features, with the task-relevance of the former and the irrelevance of the latter. However, one could equally well view this as preliminary evidence for different potentials of features of body-related and body-external effects to become bound. To sum up, it is still unclear whether task-relevance and/or body-relatedness of action effects determine feature binding in action planning.

The present research

The present research aims to clarify under which conditions binding of action effect features occurs during action planning. Specifically, we aim to test the task relevance hypothesis and the body-relatedness hypothesis described above. To do so, we asked participants to plan and carry out actions that comprised both a body-related feature and an environment-related feature (adopted from Giesen & Rothermund, 2016). Importantly, this paradigm allowed us to orthogonally combine body- and environment-related features, and to render the environment-related effect features task-relevant or irrelevant. To illustrate this, consider Fig. 1. Participants were asked to move a round cursor on a computer screen either towards or away from a reference object (i.e., a stick figure). Thus, each of the participants’ correct responses to certain stimuli (i.e., pressing the correct one of two response keys) produced one of two environment-related effects. These effects (i.e., cursor movements on screen) encompassed the features towards and away, respectively, and the color of the cursor indicated which kind of environment-related effect participants should produce. Depending on the position of the stick figure, this required a keypress with either the index or the middle finger. Please note that, following ideo-motor theory, action plans rely on features of perceptual effects rather than muscular innervation patterns. From that perspective, the features index and middle are meant to describe the origin of the sensory changes that a motor pattern of a corresponding finger will bring about (i.e., the body-related action effects, such as the change in visual motion or tactile stimulation that comes with a corresponding finger movement), rather than the muscular activity, which, according to this theory, is mentally inaccessible.

Participants were asked to plan such an action (action A). However, before its eventual execution, that is, at varying time points after the announcement of the to-be-planned action A, another action (B) with varying degrees of feature overlap was requested for immediate execution. Specifically, the first initiated action B could share neither feature with the concurrently planned action A, or it could share one feature, or both (see Table 1). Please note, for the first time, this study allows for the full design including no, partial, and full-feature overlap, whereas previous research relied on the comparison of conditions with no and partial overlap alone (e.g., Fournier et al., 2015; Kunde et al., 2002; Stoet & Hommel, 1999).

Table 1 Example of effect features for action B and the resulting feature-overlap conditions while planning a middle finger keypress with a cursor movement towards the stick figure for action A (i.e., A: towards, middle)
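The overlap conditions follow mechanically from comparing the two features of action A and action B. As a minimal sketch (the names `Action` and `overlap_condition` and the condition labels are hypothetical and not taken from the authors' materials), the four combinations for the example in the caption can be enumerated like this:

```python
from typing import NamedTuple


class Action(NamedTuple):
    finger: str     # body-related feature: "index" or "middle"
    direction: str  # environment-related feature: "towards" or "away"


def overlap_condition(planned: Action, executed: Action) -> str:
    """Classify feature overlap between planned action A and executed action B."""
    same_finger = planned.finger == executed.finger
    same_direction = planned.direction == executed.direction
    if same_finger and same_direction:
        return "full overlap"
    if same_finger:
        return "partial overlap (body-related feature)"
    if same_direction:
        return "partial overlap (environment-related feature)"
    return "no overlap"


# Example from the caption: action A is a middle finger keypress, cursor towards.
action_a = Action(finger="middle", direction="towards")
for finger in ("middle", "index"):
    for direction in ("towards", "away"):
        action_b = Action(finger, direction)
        print(f"B: {direction}, {finger} -> {overlap_condition(action_a, action_b)}")
```

Because the two feature dimensions are crossed, exactly two of the four action B alternatives fall into a partial overlap condition.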

Unlike in most previous studies, we decided to present the stimulus display for action A not only during the initial planning phase but again when action A is actually to be executed. This could potentially reduce participants’ incentive for planning action A compared to a design without second stimulus presentation. However, to foreshadow the results, we show in various ways that planning of action A actually occurred. More importantly, though, this design enabled us to prevent participants from simply memorizing the finger to be used in action A (in which case only the body-related feature would become part of the action plan). Specifically, to ensure that participants properly planned action A at the beginning of each trial (a prerequisite for influences of feature overlap on action B), participants had to detect catch-trials (10% of all trials) in which one of the to-be-produced action effects at the end of the trial (i.e., either the finger that is to be used or whether the cursor will move towards or away from the stick figure) differed from the initial planning phase (see Procedure section for details). By doing so, participants had to include both features in their action plan. This way, we ensured equal task relevance of the body- and the environment-related action effect, which is crucial for disentangling the influences of body-relatedness and task relevance in binding. Also, introducing catch-trials enabled us to identify those participants who refrained from planning action A at all.

We further manipulated the time interval available for planning action A for exploratory purposes as well as to prevent participants from responding prematurely. Stoet and Hommel (1999) found significant binding effects when they let participants plan a more complex action for 3,350 ms. Hence, the time interval during which our participants could plan action A (1,500 vs. 2,000 ms, i.e., 1,000-ms presentation of the cue for action A and subsequently 500- or 1,000-ms interstimulus interval, ISI) should be sufficient for action planning in the current design.

Our analyses focused on performance in action B, which was emitted before any other efferent activity had occurred. Performance in action A was also assessed, but performance in this task is less easy to interpret, as it might be affected by feature overlap as well as by peripheral biomechanical factors from having just executed action B before (e.g., muscular priming or fatigue due to using the same finger twice in a row).

According to TEC, observing inferior performance in the first initiated action B, if it partly shares features with the concurrently planned action A as compared to a condition with full or no overlap, would suggest binding of these features in action planning. Previous work on bindings between stimulus and response features further suggests that performance in full-feature-overlap conditions is equivalent to no feature overlap (Hommel, 1998, 2004). Thus, if binding occurs, a characteristic interaction of repetition/alternation of the features of actions A and B is predicted. More specifically, we expect both reaction times (RTs) and error rates to be higher when either the body-related feature or the environment-related feature overlaps between action A and action B than when either both or none of the features overlap.
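The predicted interaction can be expressed as a single contrast: the mean of the two partial overlap cells minus the mean of the full repetition and full alternation cells. The sketch below illustrates that arithmetic; the function name `binding_effect` is hypothetical, and the RT values are invented for illustration only and are not data from the study.

```python
from statistics import mean


def binding_effect(cell_means: dict) -> float:
    """Partial-overlap cost: mean of partial overlap cells minus mean of
    full repetition / full alternation cells.

    Keys are (body_related_overlap, environment_related_overlap), each
    "same" or "different". A positive value is in line with binding.
    """
    partial = mean([cell_means[("same", "different")],
                    cell_means[("different", "same")]])
    non_partial = mean([cell_means[("same", "same")],
                        cell_means[("different", "different")]])
    return partial - non_partial


# Hypothetical mean RTs in ms for action B (illustrative values, NOT study data).
example_rts = {
    ("same", "same"): 600.0,            # full feature repetition
    ("same", "different"): 640.0,       # partial overlap (body-related)
    ("different", "same"): 630.0,       # partial overlap (environment-related)
    ("different", "different"): 610.0,  # full feature alternation
}
print(binding_effect(example_rts))
```

The same contrast applies to error rates; a value near zero would indicate no partial-overlap cost.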

It should be noted that we took great care to avoid any sort of binding to certain stimulus characteristics that might otherwise occur. Specifically, a set of three different colors cued every towards (e.g., red, green, or purple) or away movement (e.g., blue, gray, or yellow), respectively. By doing so, every display in a trial (the cue for the planned action A, the stimulus for the first initiated action B, and the stimulus for the finally requested action A) contained a different cursor color so that no retrieval of features by stimulus color was possible.
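One way to satisfy this constraint on regular (non-catch) trials is sketched below. The text does not describe the authors' actual randomization procedure, so the function `assign_colors` and its sampling scheme are assumptions; the color sets follow one of the two counterbalanced mappings.

```python
import random

# Color sets for one counterbalancing group (reversed for the other group).
COLORS = {
    "towards": ["red", "green", "purple"],
    "away": ["blue", "gray", "yellow"],
}


def assign_colors(direction_a: str, direction_b: str, rng: random.Random) -> dict:
    """Pick three mutually different cursor colors for one non-catch trial.

    The cue and the final stimulus for action A signal the same direction
    but use different colors; the stimulus for action B uses a color from
    its own set that has not yet appeared in the trial.
    """
    cue_a, final_a = rng.sample(COLORS[direction_a], 2)
    remaining = [c for c in COLORS[direction_b] if c not in (cue_a, final_a)]
    stimulus_b = rng.choice(remaining)
    return {"cue_a": cue_a, "stimulus_b": stimulus_b, "final_a": final_a}


rng = random.Random(1)  # seeded only to make the example reproducible
print(assign_colors("towards", "towards", rng))
print(assign_colors("towards", "away", rng))
```

With three colors per direction, a trial in which actions A and B share the environment-related feature uses up the entire set, which is why a smaller set would not suffice.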

With this basic paradigm, we conducted four experiments. Experiments 1 and 2 tested whether there is binding of features of environment- and body-related effects when both are equally task-relevant. In Experiment 3, we examined whether binding of features of environment-related effects occurs if they become task-irrelevant. Experiment 4 tested whether features of task-relevant environment-related events can be activated in advance even if they cannot be bound because the body-related features that would be necessary to form a full-fledged action plan are missing.

Experiment 1

Experiment 1 aimed to test whether binding of features of environment-related action effects occurs, provided that these features are as task-relevant as features of body-related effects. If binding of features occurred, the initiation of action B should suffer when there is partial overlap with the features of the concurrently planned action A, as compared to full feature repetition or full feature alternation. On top of this interaction, there might be main influences of repetition of either body- or environment-related features. For example, responding for action B might be generally faster and/or more accurate if the body- or environment-related features overlap with the planned action A, respectively.

Method

Participants

We conducted an a priori power analysis for the described binding effect, that is, the mean difference between the partial overlap conditions and the full alternation/repetition conditions for RTs of action B, by means of a two-tailed paired samples t-test using G*Power (Faul, Erdfelder, Lang, & Buchner, 2007). This yielded a minimum required sample size of n = 34 to detect a medium-sized effect (d_z = 0.50) with a power of 1–β = .80 and α = .05. Please note, the effect of feature overlap in related studies is typically larger (e.g., d = 1.01 in Stoet and Hommel, 1999), and thus our assumption of a medium-sized effect is rather conservative. We recruited a total of 34 participants via an online participant pool management platform of the University of Würzburg. The study was performed in accordance with the Declaration of Helsinki (Rickham, 1964) and had been approved by the local ethics committee (Ethikkommission des Institutes für Psychologie der Humanwissenschaftlichen Fakultät der Julius-Maximilians-Universität Würzburg, GZEK 2019-39). All participants gave their written informed consent before participation and received financial compensation of 10€. We excluded all participants who did not detect any catch-trials throughout the experiment (i.e., trials in which either the body- or the environment-related feature changed, n = 7) to ensure a sufficient degree of planning action A. Furthermore, we excluded one additional participant who failed to produce ten correct trials in one or more experimental cells. As a result, with the final sample size of n = 26 (17 females, M_age = 29.1 years, range_age = 20–55 years) the smallest possible effect size that could be detected with a power of 1–β = .80 and α = .05 was slightly higher than initially planned (i.e., d_z = 0.57).
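The reported sample size and sensitivity values can be checked with a rough calculation. G*Power bases its results on the noncentral t distribution; the sketch below instead uses a common normal approximation with a small-sample correction term, so it is an approximation under that assumption and not the authors' computation. The function names are hypothetical.

```python
from math import ceil, sqrt
from statistics import NormalDist


def required_n(dz: float, alpha: float = 0.05, power: float = 0.80) -> int:
    """Approximate n for a two-tailed paired-samples t-test.

    Normal approximation plus the correction term z_alpha**2 / 2.
    """
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_power = NormalDist().inv_cdf(power)
    return ceil(((z_alpha + z_power) / dz) ** 2 + z_alpha ** 2 / 2)


def min_detectable_dz(n: int, alpha: float = 0.05, power: float = 0.80) -> float:
    """Approximate smallest d_z detectable with n participants (same formula, inverted)."""
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_power = NormalDist().inv_cdf(power)
    return (z_alpha + z_power) / sqrt(n - z_alpha ** 2 / 2)


print(required_n(0.50))                   # planned: medium-sized effect
print(round(min_detectable_dz(26), 2))    # sensitivity with the final sample
```

Under this approximation, the planned analysis and the sensitivity analysis for the final sample both come out at the values reported above.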

Apparatus and stimuli

Participants sat in front of an LCD monitor (24-in., BenQXL2411, BenQ) with a resolution of 1,920 × 1,080 pixels and a 100-Hz refresh rate. Stimuli were presented on screen using the E-Prime 2.0 software (Psychology Software Tools, 2002). Each display presented a round cursor in the center of the black screen that could move upwards or downwards (see Fig. 2). This resulted in a movement towards or away from a white stick figure, which could appear either on the top or bottom of the screen. To emphasize the distinctness of action A and action B, different stick figures were presented as reference objects for these two actions.

The cursor color signified which action participants had to plan and perform. For half of the participants, a red, green, and purple cursor indicated a towards movement, and a blue, gray, and yellow cursor indicated an away movement. For the other half of the participants, the mapping was reversed. The colored stimuli could trigger a keypress with the index or the middle finger, depending on the position of the stick figure (above or below the cursor).

Responses were given on a standard QWERTZ keyboard with the K key (pressed with the right middle finger) resulting in an upwards cursor movement and the M key (pressed with the right index finger) resulting in a downwards movement. Furthermore, participants responded to catch trials by pressing the D key.
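The correct response therefore depends jointly on the cursor color and the stick figure position. The following sketch spells out that mapping for the first counterbalancing group; the function `required_response` and the `reversed_mapping` flag are hypothetical names introduced for illustration.

```python
TOWARDS_COLORS = {"red", "green", "purple"}
AWAY_COLORS = {"blue", "gray", "yellow"}

# Key and finger for each cursor movement, as described in the text.
RESPONSE_FOR_MOVEMENT = {"up": ("K", "middle"), "down": ("M", "index")}


def required_response(cursor_color: str, figure_position: str,
                      reversed_mapping: bool = False) -> tuple:
    """Return (key, finger) for a cursor color and figure position ("above"/"below")."""
    if cursor_color in TOWARDS_COLORS:
        direction = "towards"
    elif cursor_color in AWAY_COLORS:
        direction = "away"
    else:
        raise ValueError(f"unknown cursor color: {cursor_color}")
    if reversed_mapping:  # second counterbalancing group
        direction = "away" if direction == "towards" else "towards"
    if figure_position not in ("above", "below"):
        raise ValueError(f"unknown figure position: {figure_position}")
    # Moving towards a figure above, or away from a figure below, means moving up.
    moves_up = (direction == "towards") == (figure_position == "above")
    return RESPONSE_FOR_MOVEMENT["up" if moves_up else "down"]


print(required_response("green", "above"))   # towards a figure above the cursor
print(required_response("yellow", "below"))  # away from a figure below the cursor
```

Because the figure position varies independently of the color, the same environment-related effect can require either finger, which is what allows the two features to be combined orthogonally.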

Procedure

The experiment started with two practice instances, in which the participants were familiarized with the setup. Then, they worked through ten experimental blocks of 64 trials each with breaks after each block. Figure 2a illustrates one exemplary trial. Each trial started with a white fixation cross appearing for 300 ms, which was followed by display A (i.e., a colored round cursor in the middle of the screen and a stick figure above or below the cursor), during which participants should plan action A (e.g., a middle finger keypress that would produce a cursor movement towards the stick figure when seeing a stick figure above a green cursor). After 1,000 ms, the cursor turned white for a certain ISI (500 or 1,000 ms). Thereafter, display B appeared with another stick figure above or below a differently colored cursor. At this point, participants had 2,000 ms to respond to this display (e.g., with a middle finger keypress that made the cursor move away from the figure as a response to the figure being below a yellow cursor). After a correct keypress with the index or middle finger (i.e., the body-related effect) as instructed by the cursor color and dependent on the stick figure position, they observed the respective cursor movement towards or away from the stick figure for 500 ms (i.e., the environment-related effect). Subsequently, display A appeared again for a maximum of 2,000 ms and participants now had to execute the pre-planned action A. After pressing the correct key, they observed the respective cursor movement for 500 ms. Importantly, while the last display showed a stick figure and a colored cursor that asked for the pre-planned action A, the color of the cursor always differed from the color presented at the beginning of the trial (e.g., as planned, a middle finger keypress that would produce a cursor movement towards the stick figure now as a result of seeing a stick figure above a purple cursor). This was achieved by having three colors that all signified the same towards or away movement. Thus, the stick figure appeared in the same position and the new cursor color entailed the same environment-related effect as in the beginning. Yet, in about 10% of the trials (catch-trials), display A suggested a different action than the pre-planned action A, because either the stick figure switched position as compared to the beginning of the trial or the requested environment-related effect changed (towards instead of away or vice versa) as suggested by the cursor color. Participants had to detect these catch-trials by responding with a separate key (D).

In case of an erroneous response (i.e., responding during the first display or ISI, pressing the wrong response key for action A or B, not responding in time for action A or B, missing a catch-trial or incorrectly indicating a catch-trial; see Table 2 for frequencies), an error message appeared for 1,000 ms and the trial terminated without later replacement. The subsequent trial started after an intertrial interval of 500 ms.
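The trial sequence described above can be summarized as a simple timeline. The following is a minimal sketch for illustration only, not the experiment code; the names `TRIAL_PHASES`, `planning_time_ms`, and `max_trial_duration_ms` are hypothetical, and response-terminated displays are represented by their maximum durations.

```python
# Phases of one error-free trial, in order, with durations in ms taken from the Procedure.
# For response-terminated displays the value is the response deadline (an upper bound).
TRIAL_PHASES = [
    ("fixation cross", 300),
    ("display A (plan action A)", 1000),
    ("ISI (white cursor)", None),                   # 500 or 1000 ms, varied trial-wise
    ("display B (execute action B)", 2000),         # response deadline
    ("cursor movement for action B", 500),
    ("display A again (execute action A)", 2000),   # response deadline
    ("cursor movement for action A", 500),
]
INTERTRIAL_INTERVAL_MS = 500


def planning_time_ms(isi_ms):
    """Time from the onset of display A to the onset of display B."""
    if isi_ms not in (500, 1000):
        raise ValueError("ISI was either 500 or 1000 ms")
    return 1000 + isi_ms


def max_trial_duration_ms(isi_ms):
    """Upper bound on the duration of an error-free trial, excluding the intertrial interval."""
    if isi_ms not in (500, 1000):
        raise ValueError("ISI was either 500 or 1000 ms")
    return sum(isi_ms if duration is None else duration for _, duration in TRIAL_PHASES)


print(planning_time_ms(500), planning_time_ms(1000))
print(max_trial_duration_ms(500), max_trial_duration_ms(1000))
```

With the short ISI, display B therefore appears 1,500 ms after action A was first announced.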

Table 2 Percentage of error trials within total trials for each experiment. Trials terminated as soon as an error occurred; therefore, errors are mutually exclusive. Ten percent of trials were catch-trials. Correct trials comprise correct non-catch-trials and correctly detected catch-trials. Values in a row might not add up to 100% due to rounding errors


Design

The experiment followed a 2 × 2 × 2 repeated-measures design, with trial-wise manipulation of the three within-subject factors body-related feature overlap (same vs. different), environment-related feature overlap (same vs. different) and ISI (500 ms vs. 1,000 ms). Dependent variables were RTs and error rates for actions A and B.
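To make the factorial structure concrete, the eight design cells can be enumerated and classified by overlap type. This is an illustrative sketch only; the factor labels and the helper names `design_cells` and `overlap_type` are hypothetical and not taken from the analysis scripts.

```python
from itertools import product

# The three within-subject factors and their levels, as described in the Design section.
FACTORS = {
    "body_overlap": ("same", "different"),
    "environment_overlap": ("same", "different"),
    "isi_ms": (500, 1000),
}


def design_cells():
    """Return all 2 x 2 x 2 factor-level combinations as dictionaries."""
    names = list(FACTORS)
    return [dict(zip(names, levels)) for levels in product(*FACTORS.values())]


def overlap_type(cell):
    """Classify a cell as full repetition, full alternation, or partial overlap."""
    body, env = cell["body_overlap"], cell["environment_overlap"]
    if body == env:
        return "full repetition" if body == "same" else "full alternation"
    return "partial overlap"


for cell in design_cells():
    print(cell, "->", overlap_type(cell))
```

Half of the cells are partial-overlap cells, which is the contrast of main interest in the analyses.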

Data analysis

The data and analysis syntax for all experiments, as well as the preregistrations for Experiments 3 and 4, adhere to the disclosure requirements and are publicly available on the Open Science Framework (https://osf.io/3xush/). The first two blocks served to familiarize participants with the task and were not analyzed further. Moreover, we excluded from all analyses all trials in which participants responded prematurely.

For the RT analysis of action B, we considered only trials in which the responses for both action B and action A were correct (the latter to ensure sufficient planning of action A). For the action A analysis, we additionally excluded all correctly detected catch-trials. To account for outliers, we then excluded, separately for each action, all trials with RTs deviating more than 2.5 standard deviations from the participant’s respective cell mean.
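The outlier criterion can be sketched as follows for the RTs of one participant in one design cell. This is a hedged illustration, not the authors' syntax: the function name `trim_rts` is hypothetical, and the use of the sample standard deviation, a single trimming pass, and an inclusive boundary are assumptions that the text does not specify. The RT values are invented.

```python
from statistics import mean, stdev


def trim_rts(rts, criterion=2.5):
    """Keep RTs that lie within `criterion` sample SDs of the mean of one design cell."""
    if len(rts) < 2:
        return list(rts)  # SD is undefined for fewer than two values; nothing to trim
    cell_mean, cell_sd = mean(rts), stdev(rts)
    return [rt for rt in rts if abs(rt - cell_mean) <= criterion * cell_sd]


# Invented RTs (ms) for one participant and one cell; 1900 ms is an obvious outlier.
cell_rts = [780, 790, 800, 810, 820, 795, 805, 815, 785, 800, 810, 1900]
print(trim_rts(cell_rts))
```

Note that with very few trials per cell no value can exceed 2.5 sample standard deviations, so the criterion only becomes effective with a sufficient number of trials.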

For the error-rate analysis of actions A and B, the respective RT outliers were discarded again and, for action B, all trials with erroneous responses for action A were excluded (again, to omit trials without sufficient planning). Error rates were then calculated as the percentage of relevant errors within the sum of erroneous and correct responses. For action B, relevant errors were commission errors (i.e., wrong keypresses) and delayed responses, and for action A additionally missed and incorrectly indicated catch-trials. Subsequently, we conducted repeated-measures analyses of variance (ANOVAs) on both RTs and error rates with the factors body-related feature overlap, environment-related feature overlap, and ISI.
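The error-rate definition for action B can be illustrated with a small sketch. The outcome labels (`"wrong_key"`, `"too_late"`, `"error"`) and the convention that action A is coded as `None` when a trial ended before action A are assumptions made for this example; the prior removal of RT outliers is not modeled here.

```python
# Relevant action B errors: commission errors and delayed responses (labels are hypothetical).
B_ERRORS = {"wrong_key", "too_late"}


def action_b_error_rate(trials):
    """Percentage of relevant action B errors among erroneous and correct action B responses.

    Each trial is a dict with 'b' in {"correct", "wrong_key", "too_late"} and
    'a' in {"correct", "error", None}; None means the trial ended before action A.
    """
    kept = [t for t in trials if t["a"] != "error"]  # drop trials with action A errors
    n_error = sum(t["b"] in B_ERRORS for t in kept)
    n_correct = sum(t["b"] == "correct" for t in kept)
    if n_error + n_correct == 0:
        raise ValueError("no usable trials in this cell")
    return 100.0 * n_error / (n_error + n_correct)


# Invented example: 18 fully correct trials, 1 trial with an action A error, 2 action B errors.
example_trials = (
    [{"b": "correct", "a": "correct"}] * 18
    + [{"b": "correct", "a": "error"}]
    + [{"b": "wrong_key", "a": None}, {"b": "too_late", "a": None}]
)
print(action_b_error_rate(example_trials))
```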

For comparability with previous work that mainly reported this effect-size measure, we additionally report Cohen's d_z for the effect of main interest, that is, the RT difference between the partial feature-overlap conditions and the full/no feature-overlap conditions in action B, collapsed over both ISI conditions and obtained from a post hoc two-tailed paired-samples t-test. Lastly, as an additional check of whether participants planned action A in advance, we compared the RTs for action A and action B with a post hoc paired-samples t-test (see Table 4).
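For readers less familiar with this measure, Cohen's d_z is the mean of the paired differences divided by the standard deviation of those differences. The sketch below uses the standard definition; the function name `cohens_dz` and the participant means are invented for illustration and are not data from the study.

```python
from statistics import mean, stdev


def cohens_dz(x, y):
    """Cohen's d_z for paired samples: mean difference divided by the SD of the differences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two paired samples of equal length (at least 2)")
    diffs = [a - b for a, b in zip(x, y)]
    return mean(diffs) / stdev(diffs)


# Invented per-participant mean RTs (ms): partial overlap vs. full/no overlap.
partial_overlap = [850, 830, 845, 860]
full_or_no_overlap = [810, 815, 820, 805]
print(cohens_dz(partial_overlap, full_or_no_overlap))
```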

Results

Action B

Table 3 and the upper panel of Fig. 3 show the results for action B. Regarding RTs, the main effects of environment-related, F(1,25) = 2.79, p = .11, η_p² = .10, and of body-related feature overlap with action A, F(1,25) = 2.06, p = .16, η_p² = .08, failed to reach significance. However, responding was generally faster with a long compared to a short ISI, F(1,25) = 40.99, p < .001, η_p² = .62. Most importantly, there was a significant cross-over interaction between environment- and body-related feature overlap, F(1,25) = 17.38, p < .001, η_p² = .41. Specifically, reactions were slower in the partial feature-overlap conditions (averaged over both conditions: M = 841 ms) than in the full-alternation and full-repetition conditions combined (M = 809 ms), t(25) = 4.17, p < .001, d_z = .82. Furthermore, neither the environment-related, F(1,25) = 0.34, p = .57, η_p² = .01, nor the body-related feature overlap, F(1,25) = 0.05, p = .82, η_p² < .01, interacted significantly with ISI. Similarly, the three-way interaction did not reach significance, F(1,25) = 1.18, p = .29, η_p² = .05.
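The reported effect size and the interaction test are arithmetically linked: for a paired-samples t-test, d_z equals t divided by the square root of the sample size, and a 1-df within-subject contrast yields F = t². The short check below uses only the values reported above; small deviations are expected from rounding of the reported statistics.

```python
from math import sqrt

n = 26                  # participants in Experiment 1
t_value = 4.17          # reported t(25), partial vs. full/no feature overlap
f_interaction = 17.38   # reported F(1,25) for the two-way interaction

d_z = t_value / sqrt(n)
print(round(d_z, 2))           # 0.82, matching the reported d_z
print(round(t_value ** 2, 2))  # 17.39, close to the reported F of 17.38
```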

Table 3 Means (and standard errors of the means) of reaction times (RTs) and error rates according to interstimulus interval (ISI), and environment-related and body-related feature overlap for actions A and B for Experiment 1 (n = 26)


The analysis of error rates yielded no significant main effect of environment-related feature overlap, F(1,25) = 3.60, p = .07, η_p² = .13, or of ISI, F(1,25) = 2.20, p = .15, η_p² = .08. In contrast, responding was more accurate when action B relied on a different finger than the planned action A rather than on the same finger, F(1,25) = 16.06, p < .001, η_p² = .39. Importantly, the interaction between environment- and body-related feature overlap was again significant, F(1,25) = 14.20, p = .001, η_p² = .36. Again, neither the interaction of ISI and environment-related feature overlap, F(1,25) = 0.09, p = .77, η_p² < .01, nor that of ISI and body-related feature overlap, F(1,25) = 0.36, p = .55, η_p² = .01, nor the three-way interaction, F(1,25) = 2.42, p = .13, η_p² = .09, was significant.

Action A

The RTs of action A (see Fig. 3, lower panel) were significantly faster when both actions produced the same as opposed to different environment-related effects, F(1,25) = 38.10, p < .001, η_p² = .60, and with finger alternation compared to repetition, F(1,25) = 8.86, p < .01, η_p² = .26. It should be noted though that, as for action B, environment- and body-related feature overlap interacted in the sense that responses were slower in the partial compared to the full or no feature-overlap conditions, F(1,25) = 59.24, p < .001, η_p² = .70. The main effect of ISI was not significant, F(1,25) = 1.93, p = .18, η_p² = .07, and ISI did not significantly interact with environment-related feature overlap, F(1,25) = 1.42, p = .24, η_p² = .05, or body-related feature overlap, F(1,25) = 0.86, p = .36, η_p² = .03. There was also no three-way interaction, F(1,25) = 0.46, p = .51, η_p² = .02.

Error rates for action A were lower with repetition rather than alternation of the environment-related effect with action B, F(1,25) = 18.98, p < .001, η_p² = .43, and with alternation rather than repetition of the body-related effect, F(1,25) = 8.64, p < .01, η_p² = .26. Importantly, as in the RT analysis, the interaction between environment- and body-related feature overlap was again significant, F(1,25) = 28.95, p < .001, η_p² = .54. There was no significant main effect of ISI, F(1,25) = 0.17, p = .69, η_p² < .01, and neither ISI and environment-related feature overlap, F(1,25) = 1.05, p = .32, η_p² = .04, nor ISI and body-related feature overlap, F(1,25) = 0.92, p = .35, η_p² = .04, nor all factors, F(1,25) = 0.77, p = .39, η_p² = .03, interacted.

Discussion

Experiment 1 revealed clear evidence for binding of features of body-related and environment-related effects in action planning. Initiating action B was facilitated if that action shared both or none of its features with a concurrently planned action A, as compared to partial feature sharing. Importantly, in this experiment, both the environment- and body-related features were equally task-relevant. This shows that even features of environment-related effects become integrated into action plans, provided they are task-relevant, thereby supporting the task relevance hypothesis.

On top of this indication of feature binding, there was a tendency (action B) for a general benefit if both actions shared the same environment-related feature (significantly so for action A). This is preliminary support for the idea that an action that aims at the same environment-related effect as another prepared action generally benefits from the environment-related feature overlap with that action.

It should be noted that, although the task was the same for both actions, the RTs for action A were lower than those for action B (see Table 4). This strongly indicates that participants indeed planned action A in advance. Also, the data pattern did not heavily depend on the ISI, that is, the time interval between the offset of the first display and the request to initiate action B. Most importantly, the specific interaction pattern of body-related and environment-related feature overlap was already present 1,500 ms after announcement of action A (i.e., with an ISI of 500 ms). This suggests that binding of the relevant features of action A occurred rather quickly.

Table 4 Paired samples t-tests of the mean differences between action A and action B reaction times for each experiment. Cohen’s d calculated according to Dunlop, Cortina, Vaslow, and Burke (1996)


Before considering these results in more detail, however, we have to deal with a possibly problematic aspect of the design: Whenever there was a partial feature repetition, the position of the stick figure on the screen, which determined whether the prepared towards or away movement required an index or middle finger keypress, did not repeat (see Fig. 2a). For example, when a movement towards the stick figure with the index finger was prepared for action A, the stick figure was at the bottom of the screen. If now a towards movement with the middle finger was requested for action B, the stick figure changed its position to the top. However, when both features repeated or both changed, the stick figure position always remained the same. For example, when a movement towards the stick figure with the index finger was prepared for action A, the stick figure was at the bottom of the screen. If now an away movement with the middle finger was requested for action B, the stick figure remained at the bottom of the screen. In other words, the benefit for full feature repetition or alternation might have arisen because the displays signaling the requested actions contained fewer changes.
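This confound follows directly from the key mapping (middle finger moves the cursor up, index finger moves it down) and can be verified by enumerating all combinations of actions A and B. The sketch below is illustrative; the function and variable names are hypothetical.

```python
from itertools import product

FINGERS = ("index", "middle")
DIRECTIONS = ("towards", "away")
# Key mapping from the Method: middle finger -> cursor up, index finger -> cursor down.
CURSOR_MOVE = {"middle": "up", "index": "down"}


def figure_position(finger, direction):
    """Stick figure position (relative to the cursor) for which `finger` produces `direction`."""
    moves_up = CURSOR_MOVE[finger] == "up"
    if direction == "towards":
        return "above" if moves_up else "below"
    return "below" if moves_up else "above"


def overlap_and_position_change():
    """For every A/B pair, return (is_partial_overlap, figure_position_changed)."""
    actions = list(product(FINGERS, DIRECTIONS))
    rows = []
    for (finger_a, dir_a), (finger_b, dir_b) in product(actions, actions):
        partial = (finger_a == finger_b) != (dir_a == dir_b)
        changed = figure_position(finger_a, dir_a) != figure_position(finger_b, dir_b)
        rows.append((partial, changed))
    return rows


# Partial overlap and a change of stick figure position coincide in all 16 combinations.
print(all(partial == changed for partial, changed in overlap_and_position_change()))
```

In Experiment 1, partial feature overlap is thus perfectly confounded with a change in stick figure position between the two displays.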

Experiment 2

To consolidate the obtained results with higher power and to address the problem that the displays signaling actions A and B differed in the number of changes between them, we conducted a second experiment and modified the setup in one respect. For action A, the towards or away movement of the cursor was always prepared in the vertical dimension, while the towards or away movement for action B was always requested in the horizontal dimension (see Fig. 2b). Consequently, the stick figures always changed their positions between the displays used for preparing action A and requesting action B, from a vertical to a horizontal position, independent of whether action B came with full, partial, or no feature repetition of action A.