Validating a visual version of the metronome response task

Laflamme, Patrick; Seli, Paul; Smilek, Daniel

doi:10.3758/s13428-018-1020-0

Validating a visual version of the metronome response task

Published: 12 February 2018

Volume 50, pages 1503–1514, (2018)
Cite this article

Download PDF

Behavior Research Methods Aims and scope Submit manuscript

Validating a visual version of the metronome response task

Download PDF

Patrick Laflamme¹,
Paul Seli² &
Daniel Smilek³

1717 Accesses
16 Citations
1 Altmetric
Explore all metrics

Abstract

The metronome response task (MRT)—a sustained-attention task that requires participants to produce a response in synchrony with an audible metronome—was recently developed to index response variability in the context of studies on mind wandering. In the present studies, we report on the development and validation of a visual version of the MRT (the visual metronome response task; vMRT), which uses the rhythmic presentation of visual, rather than auditory, stimuli. Participants completed the vMRT (Studies 1 and 2) and the original (auditory-based) MRT (Study 2) while also responding to intermittent thought probes asking them to report the depth of their mind wandering. The results showed that (1) individual differences in response variability during the vMRT are highly reliable; (2) prior to thought probes, response variability increases with increasing depth of mind wandering; (3) response variability is highly consistent between the vMRT and the original MRT; and (4) both response variability and depth of mind wandering increase with increasing time on task. Our results indicate that the original MRT findings are consistent across the visual and auditory modalities, and that the response variability measured in both tasks indexes a non-modality-specific tendency toward behavioral variability. The vMRT will be useful in the place of the MRT in experimental contexts in which researchers’ designs require a visual-based primary task.

Slow and steady: Validating the rhythmic visual response task as a marker for attentional states

Article 09 May 2024

Effects of the Visual and Auditory Components of a Brief Mindfulness Intervention on Mood State and on Visual and Auditory Attention and Memory Task Performance

Article 20 October 2016

The metronome response task for measuring mind wandering: Replication attempt and extension of three studies by Seli et al

Article 30 September 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Recently, there has been rapidly growing interest in the mental state of “mind wandering”—a phenomenon often defined as task-unrelated thought—with a particular focus on how mind wandering influences ongoing primary-task performance (see Smallwood & Schooler, 2006, 2015, for reviews). The available evidence indicates that mind wandering interferes with performance on numerous tasks, ranging from basic continuous-response tasks in which participants respond to the presentation of frequently presented “go stimuli” and withhold responses to infrequently presented “no-go stimuli” (e.g., Christoff, Gordon, Smallwood, Smith, & Schooler, 2009; McVay & Kane, 2009; Seli, 2016), to more complex tasks such as those assessing reading comprehension (e.g., Feng, D’Mello, & Graesser, 2013; Unsworth & McMillan, 2013). Notably, mind-wandering rates during some tasks have been shown to increase as time on task progresses, with commensurately increasing costs to performance (Thomson, Seli, Besner, & Smilek, 2014).

As research on mind wandering has progressed, studies have begun to reveal that, in some situations (particularly those requiring constrained responding), mind wandering is associated with behavioral variability (e.g., Seli, Carriere, et al., 2014; Seli, Cheyne, & Smilek, 2013). Specifically, relative to moments of on-task attentiveness, moments of mind wandering have been associated with increased levels of fidgeting (Seli, Carriere, et al., 2014) and increased response time variability (e.g., Seli, Cheyne, & Smilek, 2013). The relation between mind wandering and behavioral variability has also been observed at the level of individual differences: People who report more fidgeting in their daily lives also tend to report higher rates of everyday spontaneous mind wandering (Carriere, Seli, & Smilek, 2013).

The foregoing findings are intriguing at a theoretical level because they could be interpreted as suggesting that mind wandering is associated with a general failure of control, not only over one’s stream of consciousness, but also over one’s body. Relatedly, these findings could be taken to suggest that mind wandering is related to an underlying tendency toward experiencing variability, both in terms of one’s thoughts and one’s actions/behaviors. Indeed, it could be argued that variability in one’s thoughts and behaviors is the default state of human beings, and that limiting thought and behavior to a focal task or goal is the more unusual and remarkable ability (Seli, Carriere, et al., 2014). Along these lines, mind wandering and response variability during a constrained task (such as one requiring button presses to a target stimulus) might reflect a retreat to one’s natural state of variability.

One task commonly used to assess mind wandering, and one that has been used to examine the link between mind wandering and response variability (McVay & Kane, 2009), is the Sustained Attention to Response Task (SART; Robertson, Manly, Andrade, Baddeley, & Yiend, 1997). In this task, single digits are presented on a screen, one at a time, and participants are instructed to respond (via button press) to each of the digits except for one infrequently occurring target digit, which, historically, is often the digit 3. Studies employing the SART have shown that, relative to periods of on-task performance, periods of mind wandering are associated with increased failures to withhold a response to the infrequent target digit (i.e., errors of commission). More relevant for present purposes, individual differences in everyday inattention (as measured by subjective-report measures) have also been shown to relate to individual differences in response time variability on non-target trials in the SART (e.g., Cheyne, Solman, Carriere, & Smilek, 2009; McVay & Kane, 2009; Seli, Cheyne, Barton, & Smilek, 2012).

Although the SART’s measure of response variability has been touted as an index of inattention and mind wandering (Cheyne et al., 2009; McVay & Kane, 2009; Seli, Cheyne, Barton, & Smilek, 2012), unfortunately, this task has some noteworthy limitations that render this measure problematic. One general limitation concerns the instructions used in the SART. Typically, participants are instructed to respond, “as quickly and accurately as possible” (Robertson et al., 1997). As has been highlighted in numerous studies (e.g., Seli, Cheyne, & Smilek, 2012; Seli, Jonker, Cheyne, & Smilek, 2013; Seli, Jonker, Solman, Cheyne, & Smilek, 2013), the weakness of such instructions is that they could be interpreted as emphasizing the importance of either the speed of responding or the accuracy of one’s responses (Seli, Cheyne, & Smilek, 2012). Given that the SART inherently includes a speed–accuracy trade-off (Helton, Kern, & Walker, 2009; Seli, Cheyne, Barton, & Smilek, 2012; Seli, Jonker, Solman, et al., 2013), it is unclear whether moments of inattention manifest in reaction times and/or in accuracy scores. To deal with this issue, researchers have applied various statistical methods allowing them to better individuate or model the reaction time data in the context of the SART (e.g., Seli, 2016; Seli, Jonker, Cheyne, & Smilek, 2013).

Perhaps the most substantive problem with the use of the SART as an index of response variability is that the presence of rare target trials may induce perturbations in response variability that are conflated with perturbations in response variability that owe specifically to mind wandering/inattention (that latter of which are often of key interest to researchers; Cheyne et al., 2009; Smallwood et al., 2004). In particular, response time variability might be affected by posttarget processing, such as posterror slowing, which has been well-documented in the SART (e.g., Jonker, Seli, Cheyne, & Smilek, 2013). In addition, target expectation effects have been observed in the SART, as evidenced by participants’ tendency to slow down their responses as the time since the appearance of the last target increases and the presentation of the next target becomes more imminent (Cheyne, Carriere, Solman, & Smilek, 2011). Critically, these target-related determinants of response variability might be quite different from the response variability caused by inattention, and it is therefore unclear how statistical methods might tease apart these different types of response variability.

Here, we focus on one particular task that was developed to overcome the aforementioned shortcomings of the SART: the metronome response task (MRT; Seli, Cheyne, & Smilek, 2013). In the standard version of the MRT, participants are required to respond to an auditory tone that is presented once every 1,300 ms. More specifically, participants are instructed to respond (via button press) synchronously with the onset of each tone so that they produce a button press at the exact moment at which each tone is presented (Seli, Cheyne, & Smilek, 2013). The primary measure of interest yielded by the MRT is the variability in participants’ responses to the metronome tone. Also of interest in the MRT are participants’ reports of mind wandering, which have typically been obtained via the intermittent presentation of “thought probes” that require participants to report on the content of their thoughts (e.g., “on task” or “mind wandering”) just prior to the onset of each probe.

Numerous studies have now reliably shown that, as participants’ minds wander away from the MRT, the variability in their responses to the metronome tones tends to increase (e.g., Seli, Carriere, et al., 2013; Seli, Carriere, et al., 2014; Seli, Cheyne, & Smilek, 2013; Seli, Cheyne, Xu, Purdon, & Smilek, 2015; Seli, Jonker, Cheyne, Cortes, & Smilek, 2015). The relation between mind wandering and response variability has been demonstrated in two complementary ways. First, the variability in responses on trials immediately preceding thought probes has been found increase as a function of participants’ self-reported depth of mind wandering (e.g., Seli, Carriere, et al., 2014). Second, at the level of individual differences, correlational analyses have shown that people who report higher rates of mind wandering also tend to exhibit greater response variability throughout the MRT (e.g., Seli, Carriere, et al., 2013; Seli, Cheyne, et al., 2015).

The general conclusion regarding response variability and its co-occurrence with reported mind wandering in the MRT is that it does not simply reflect a modality-specific tendency to synchronize responding with an auditory tone, but that it instead reflects a more general tendency toward behavioral variability that generalizes across modalities. However, to date, the MRT has not been validated across modalities. This lack of cross-modality validation of the MRT is problematic because responding in synchrony to an auditory tone might be a rather unique behavior. Indeed, given the similarity between MRT responses and the common behavior of tapping along to the beat of a song, it is possible that participants can produce a stable pattern of responses to the MRT tones independently of their level of attention to the task. If synchronizing with an auditory tone can be done automatically, then the MRT might not accurately estimate the relation between mind wandering and performance variability. However, tapping in synchrony to a visual metronome is much less common, and as such, a task requiring such tapping should help to mitigate this problem. For this reason, a visual version of the MRT may help to increase the task’s sensitivity to states of mind wandering.

In the present article, we report on the development and validation of a visual version of the MRT. Developing and validating a visual version of the MRT (a vMRT) is important for two primary reasons: First, it allows for the assessment of the generality of the original MRT findings across auditory and visual modalities. Second, the ability to administer a visual task that does not suffer from the shortcomings of the SART allows for greater experimental flexibility. For instance, accessibility to a visual version of the MRT would allow researchers to examine MRT performance in the context of auditory distractions, which are common in everyday life, and hence, a visual version of the MRT could permit more ecologically valid research on mind wandering.

To adapt the MRT to a visual form, we replaced each metronome tone with a gray square, which was presented on a computer monitor. To confirm that mind wandering was linked with increased response variability in the vMRT—as in the standard version of the MRT (see Seli, Cheyne, et al., 2013)—throughout the vMRT, we intermittently presented thought probes that required participants to report on the depth of their mind wandering (as in Seli, Carriere, et al., 2014).

Study 1

Method

Participants

Forty-two participants (mean age = 20.4, 28 female) were recruited from the undergraduate Research Experiences Group (REG) at the University of Waterloo. As per the conditions of recruitment, all participants reported that they had normal or corrected-to-normal vision, had normal or corrected-to-normal hearing, and could read and write fluently in English. In this validation study, up to five participants were tested in the same room at a time. As in Seli, Cheyne, et al. (2013), we first identified participants whose rates of omissions (i.e., failures to respond on a given trial) were greater than 10%, which indicates a failure to comply with the task instructions. These participants’ data were then removed from all subsequent analyses. In total, seven participants’ data were excluded for this reason, leaving data from 35 participants for the subsequent analyses.

vMRT stimuli and procedures

The vMRT was designed to closely match the presentation parameters of the standard MRT (see Seli, Cheyne, et al., 2013). The vMRT consisted of 900 trials, and each trial began with the presentation of a blank screen for 650 ms, followed by a gray square for 150 ms, followed by another blank screen for 500 ms. From the participants’ perspective, the onsets of the gray squares were separated by 1,300 ms (see Fig. 1). Critically, the visual stimuli were presented for a longer period of time than are the auditory stimuli in the standard MRT. This was done in order to ensure that the stimulus would synchronize with the frame rate of the monitor being used (60 Hz), with the stimulus being presented for exactly nine frames. As a result, the tempo of the metronome remained the same as in the original MRT, but the interstimulus interval was reduced by 75 ms. The gray box measured 1.5 cm × 1.5 cm and was located in the center of the screen. The square was set to RGB values of 126, 126, 126 (i.e., gray), and the background RGB values were set to 0, 0, 0 (i.e., black). Participants were instructed to “press the spacebar in synchrony with the flashing box so that you press the spacebar exactly when each box is presented.”

Thought probes

To assess participants’ depth of mind wandering, 18 thought probes were pseudorandomly presented throughout the vMRT. One thought probe was presented in each block of 50 trials, with the constraint that no two thought probes were presented within ten trials of each other (Seli, Carriere, et al., 2013). When a thought probe was presented, the vMRT temporarily stopped and participants were instructed to select the degree to which they were focused on the vMRT or were thinking about task-unrelated concerns. The response options were as follows: (1) “completely on task,” (2) “mostly on task,” (3) “equally on task and thinking about unrelated concerns,” (4) “mostly thinking about unrelated concerns,” and (5) “completely thinking about unrelated concerns” (e.g., Mrazek, Franklin, Phillips, Baird, & Schooler, 2013; Seli, Carriere, et al., 2014). After a response had been provided to each probe, the vMRT resumed. Participants were instructed that being “on task” meant they were thinking about things related to the task (e.g., their performance on the task, the gray box, or their response), whereas thinking about unrelated concerns meant that they were thinking about things that were not related to the task at all (e.g., plans with friends, an upcoming test, plans for dinner, etc.).

Measures

Rhythmic response times (RRTs) were calculated as the relative time difference (in milliseconds) between the moment at which the response was recorded and the moment of stimulus onset (Seli, Cheyne, & Smilek, 2013). Because participants’ responses could precede or follow stimulus onset, the RRT value was negative if a participant responded prior to the stimulus onset, and positive if a participant responded following the stimulus onset. Figure 1 shows a few example trials and the corresponding time periods represented by the RRT. Since variability in response times is the main measure of attention yielded by the MRT, three measures of the variability of RRTs were calculated for the vMRT. Our first measure of variability, overall mean RRT variability, was computed using a moving window of the current and preceding four trials across all trials throughout the task (see Seli, Jonker, Cheyne, & Smilek, 2013).^{Footnote 1} Our second measure of variability, odd/even RRT variability, was obtained by separately computing the variance of RRTs on all nonoverlapping even and odd five-trial windows throughout the task (i.e., the overall mean RRT variance across trials 5–9, 15–19, 25–29, etc., and across trials 10–14, 20–24, 30–34, etc., respectively). Our third measure of variability was computed as the variance in RRTs produced on the five trials preceding each of the five thought-probe responses. As in previous work (e.g., Seli, Carriere, et al., 2013), the variance data were highly positively skewed, so we adjusted each variance measure using a natural-logarithm transform. All analyses were performed using the R statistical language (R Core Team, 2015).

Results

In Study 1, we conducted four primary analyses. The first analysis focused on establishing the reliability of the vMRT. The second explored whether vMRT response variability on the trials immediately preceding thought probes increased linearly as a function of participants’ depth of mind wandering (see Seli Cheyne, & Smilek, 2013, and Seli, Carriere, et al., 2014, for similar analyses in the context of the MRT). Third, a time-course analysis was performed in order to assess the changes in depth of mind wandering and MRT task performance as the task progressed (Thomson et al., 2014). Finally, at the level of individual differences, we explored the relation between the average response variability across the entire vMRT and the average depth of mind wandering (see Seli, Carriere, et al., 2013, and Seli, Cheyne, et al., 2015, for similar analyses in the context of the original MRT).

Moment-to-moment reliability

As in Seli, Carriere, et al. (2014), to estimate the reliability of the RRT variance measure, we conducted a correlational analysis examining the relation between the log transformed RRT variance on all the nonoverlapping even and odd five-trial windows throughout the task. This analysis yielded a strong significant positive correlation coefficient, r(33) = .96, p < .001, indicating good reliability of the vMRT variance measure.

Split-half reliability

In addition to the moment-to-moment measure of reliability, we examined the reliability of mean overall RRT variance between the first and second halves of the task. The goal of this secondary reliability analysis was to provide an overall estimate of the reliability of changes in response variability during the task, across participants. This reliability score was quantified with a Pearson product–moment correlation analysis examining the relation of the mean overall log-transformed RRT variance scores between the first and second halves. As we observed in the moment-to-moment measure of reliability, we found a very strong, positive relationship between the mean overall log-transformed RRT variance of the first and second halves of the task, r(33) = .86, p < .001.

Performance prior to thought probes

Next we sought to determine whether the vMRT variance on the five trials immediately preceding thought probes varied as a function of each of the five possible probe reports (i.e., depth of mind wandering). To explore this possibility, we conducted a linear mixed-effects analysis with depth of mind wandering (1–5) as a fixed factor and participant as a random factor, which allowed both the intercept and the effect of depth of mind wandering to vary by participant. Importantly, this analysis allowed for the inclusion of data from participants who did not report at least one instance of each level of mind-wandering depth (i.e., 1, 2, 3, 4, and 5) across the 18 thought probes. In addition, this analysis permitted the inclusion of each observation for each participant, which thereby provided an estimate of within-subjects variability. To evaluate the significance of a term within a linear mixed-effects model, we compared the performance of the complete model, with all effects, with the performance of a model will all but the effect of interest (Magezi, 2015). The depth-of-mind-wandering measure contributed significantly to the model [estimate = 0.21; χ²(1) = 12.132, p < .001], indicating that vMRT variance increased as a function of increasing depth of mind wandering (see Fig. 2).

Performance and mind wandering over time on task

To further test the convergent validity of the vMRT, we sought to test for decrements in task performance as a function of time on task. If the vMRT assesses inattentiveness, then we would expect to see a performance decrement over time and for this decrement to be associated with an increase in mind-wandering rates (as reported by Thomson et al., 2014). We examined this possibility by splitting the task into six blocks, each with 150 vMRT trials and three thought probes, with Block 1 corresponding to the first 150 trials, Block 2 corresponding to the second 150 trials, and so on. A linear mixed-effects model was fit to the data to assess the linear effect of block on the reported depth of mind wandering. The slope associated with a block and the intercept were permitted to vary by participant. We found a significant positive effect of block on depth of mind wandering [estimate = 0.27; χ²(1) = 56.20, p < .001], as is shown in Fig. 3. A similar model was fit in order to assess the linear effect of block on the measured log-transformed RRT variance. The slope associated with a block and the intercept were again permitted to vary by participant. There was again a significant positive effect of block on the log-transformed RRT variance [estimate = 0.14; χ²(1) = 42.00, p < .001], shown in Fig. 4.

Individual differences (correlational) analysis

Finally, we examined the relation between depth of mind wandering and vMRT performance by conducting a Pearson product–moment correlation, entering in each participant’s mean overall variance across the entire task and their average depth of mind wandering. This analysis revealed that these measures were significantly positively correlated, r(33) = .266, p = .002 (see Fig. 5). Thus, as in previous work employing the standard MRT (e.g., Seli, Carriere, et al., 2013), participants who reported higher rates of mind wandering tended to produce greater response variability.

Discussion

The primary purpose of Study 1 was to verify that the measures associated with the MRT extended beyond the auditory domain. This was achieved by testing for similarities between the MRT and vMRT in terms of their behavioral outcomes and their relationships to mind wandering. Our results suggest that the vMRT has four key similarities with the MRT. First, the vMRT shows very high moment-to-moment reliability; a property that is mirrored in the MRT (Seli, Cheyne, et al., 2013). This reliability measure suggests that two separate measurements of behavioral variability within close temporal proximity are very similar in nature, and the strong relationship between the two measurements implies that the measure of behavioral variability is highly reliable.

Second, periods of greater self-reported depth of mind wandering were associated with greater response variability. This finding suggests that, as has been observed with the SART and MRT, behavioral variability is reliably associated with mind wandering in the vMRT (Cheyne et al., 2009; McVay & Kane, 2009; Seli, Cheyne, Barton, & Smilek, 2012). The consistency of this effect across modalities and tasks provides strong evidence that self-reported mind wandering and behavioral variability are linked at a level that is not modality- nor task-specific.

Third, the results showed that both depth of mind wandering and response variability increased as a function of time on task. Time on task has been associated with increased mind wandering and decreased performance on a variety of tasks (e.g., Thomson et al., 2014). That both mind wandering and response variability follow this trend supports the notion that fluctuations in the vMRT response variability metric do indeed reflect fluctuations in depth of mind wandering, thus lending further support for the validity of the behavioral measure.

Finally, at the level of individual differences, participants who reported greater depths of mind wandering also tended to produce greater response variability. This, too, is in agreement with previous findings, with similar results being observed in the original MRT (Seli et al., 2013). The relatively small magnitude of the effect in this study is not altogether surprising. Such crude measures of mind wandering and response variability would be unlikely to be strongly associated, since many additional variables that are not associated with mind wandering may influence average response variability across individuals. For instance, individual differences in experience with holding a rhythm may account for some variance in response variability, with individuals with greater experience with rhythms having a lower overall response variability. In any case, the small relation between response variability and depth of mind wandering across individuals adds to the previously mentioned findings pointing to the similarity between the MRT and vMRT.

Overall, the results of Study 1 suggest a high degree of similarity among the behavioral outcomes of the MRT and vMRT. This, in turn, suggests that the two tasks measure a common construct. However, to verify this conclusion, direct comparison of the two tasks was required.

Study 2

Study 2 was designed to extend the results of Study 1 in two ways. First, to confirm our initial findings, we attempted to directly replicate the results of Study 1 with a larger sample size. Second, we sought to directly compare the results from the vMRT to those from the original (auditory) MRT by having participants complete each of these tasks (within subjects). As in Study 1, participants again responded to periodically presented thought probes assessing their depth of mind wandering.