Research Report

A new method for detecting interactions between the senses in event-related potentials
Introduction
Perception relies heavily on the integration of input from different sensory systems (Welch and Warren, 1986). Basic mechanisms of multisensory integration have been studied with event-related potentials (ERPs). In many of these studies, unimodal and bimodal stimuli were used (e.g. auditory, visual, and auditory–visual stimuli), and the ERP to the bimodal stimulus (AV) was compared to the sum of the ERPs to the unimodal stimuli (A, V): if the senses operate independently (that is, they form separate ‘mental modules’; Sternberg, 2001), the ERP to the bimodal stimulus should equal the sum of the ERPs to the unimodal stimuli (Barth et al., 1995): AV = A + V, or AV − (A + V) = 0. By contrast, if the ERP to the bimodal stimulus differs from the sum of the ERPs to the unimodal stimuli (AV ≠ A + V), it is concluded that the senses interact (Barth et al., 1995). The time point at which the expression AV − (A + V) starts to differ from zero is thought to indicate the processing stage at which the inputs of the different sensory systems are integrated. Using this approach, several studies have demonstrated interactions of the auditory and the visual system (Fort et al., 2002; Giard and Peronnet, 1999; Molholm et al., 2002), the auditory and the somatosensory system (Foxe et al., 2000; Gobbelé et al., 2003), and the visual and the somatosensory system (Schürmann et al., 2002). In some of these studies, AV − (A + V) differed from zero as early as 50 ms after stimulus onset (e.g. Foxe et al., 2000; Giard and Peronnet, 1999; Molholm et al., 2002), which has been interpreted as evidence for multisensory interactions at early sensory processing stages.
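The additive-model logic lends itself to a short numerical sketch (all waveforms and amplitudes below are hypothetical and chosen only for illustration, not data from the cited studies): under independence the difference wave AV − (A + V) is flat, and its first deviation from zero marks the onset of an interaction.

```python
import numpy as np

t = np.arange(-100, 500)  # time in ms relative to stimulus onset

# Toy unimodal ERPs (hypothetical Gaussian-shaped components).
erp_a = 2.0 * np.exp(-((t - 100) / 30.0) ** 2)   # "auditory" component
erp_v = 1.5 * np.exp(-((t - 150) / 40.0) ** 2)   # "visual" component

# Case 1: independent senses -- the bimodal ERP is exactly additive,
# so the difference wave is zero throughout.
diff_additive = (erp_a + erp_v) - (erp_a + erp_v)

# Case 2: an interaction term switched on at 80 ms post-stimulus.
interaction = 0.5 * np.exp(-((t - 120) / 25.0) ** 2) * (t >= 80)
erp_av = erp_a + erp_v + interaction
diff = erp_av - (erp_a + erp_v)

# First time point where the difference wave deviates from zero.
onset = t[np.flatnonzero(np.abs(diff) > 0.01)[0]]
print(onset)  # -> 80
```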
This analysis method has recently been criticized by Teder-Sälejärvi et al. (2002): the authors emphasized that the AV − (A + V) subtraction requires that A, V, and AV do not elicit any common ERP activity (C). If such common activity exists, it is subtracted twice (AC, VC) from the bimodal ERP (AVC): AVC − (AC + VC) = −C. Therefore, the resulting term not only reflects interactions of the auditory and visual system, but also contains the inverse of this common activity. Two types of common activity can be distinguished: if the auditory and the visual processing pathways converge at jointly used neural structures, this might be considered a ‘real’ multisensory interaction. Problems arise if A, V, and AV contain unspecific common activity, e.g. activity related to the expectation of the stimulus, or motor preparation. Teder-Sälejärvi et al. demonstrated that the onset of a significant multisensory interaction in ERPs is influenced systematically by the duration of the baseline: using a baseline correction interval of −100 ms to 0 ms before stimulus onset, AV differed from A + V starting at 60 ms after stimulus onset. By contrast, a baseline of −100 ms to −50 ms before stimulus onset moved the first significant auditory–visual interaction to 18 ms after stimulus onset. The authors suggested that these signs of multisensory interactions did not originate from multisensory processes proper but were instead due to superimposed slow waves such as the contingent negative variation (CNV; Walter et al., 1964). Since the CNV is equally present in A, V, and AV, it affects the result of AV − (A + V) during the entire ERP interval, even before stimulus onset. Teder-Sälejärvi et al. suggested the use of a high-pass filter that eliminates the stimulus-preceding slow deflections in the ERPs to A, V, and AV.
Indeed, after high-pass filtering, they found a first significant AV − (A + V) interaction at central and parietal electrodes, starting at around 160 ms after stimulus onset rather than at 50 ms as reported in earlier studies.
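The contamination by common slow activity is easy to reproduce numerically. In the sketch below (illustrative values only), the same CNV-like ramp C is added to A, V, and AV; the difference wave then equals −C and deviates from zero even before stimulus onset, although no interaction is present.

```python
import numpy as np

t = np.arange(-100, 500)  # time in ms relative to stimulus onset

# Modality-specific evoked activity (hypothetical shapes).
a = 2.0 * np.exp(-((t - 100) / 30.0) ** 2)
v = 1.5 * np.exp(-((t - 150) / 40.0) ** 2)

# CNV-like slow wave common to A, V, and AV: a ramp that already
# builds up before stimulus onset.
c = -0.002 * (t + 100)

erp_a, erp_v, erp_av = a + c, v + c, (a + v) + c  # no true interaction

diff = erp_av - (erp_a + erp_v)  # AVC - (AC + VC) = -C

# The difference wave is nonzero even in the pre-stimulus interval --
# a telltale sign of superimposed common activity, not of integration.
print(np.allclose(diff, -c), np.abs(diff[t < 0]).max() > 0.05)
```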
Nevertheless, the stimulus-preceding CNV activity is only one candidate for common activity. For example, it is plausible to assume that processes associated with the P300 (e.g. stimulus evaluation) are active during target detection. These and other processes are part of ‘C’, and only a subset of them is eliminated by the high-pass filter. Moreover, the high-pass filter might eliminate low-frequency ERP components unique to A, V, or AV, which are not part of C.
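This trade-off can be sketched with a standard zero-phase high-pass filter (the sampling rate, cutoff, and component shapes below are assumptions chosen for illustration): the CNV-like drift is largely suppressed, but a genuinely condition-specific slow component is attenuated along with it.

```python
import numpy as np
from scipy.signal import butter, filtfilt

fs = 500.0                                  # sampling rate in Hz (assumed)
t = np.arange(-0.2, 1.0, 1 / fs)            # time in s

drift = -2.0 * (t + 0.2)                          # CNV-like slow wave (part of C)
fast = 1.0 * np.exp(-((t - 0.1) / 0.02) ** 2)     # fast condition-specific component
slow = 0.8 * np.exp(-((t - 0.4) / 0.15) ** 2)     # slow condition-specific component

erp = drift + fast + slow

# 2 Hz second-order Butterworth high-pass, applied forward and
# backward (zero phase); the cutoff is illustrative.
b, a = butter(2, 2.0 / (fs / 2), btype='highpass')
erp_filtered = filtfilt(b, a, erp)

drift_out = filtfilt(b, a, drift)   # the drift is strongly suppressed ...
slow_out = filtfilt(b, a, slow)     # ... but so is the genuine slow component
print(np.abs(drift_out).max() < np.abs(drift).max(),
      np.abs(slow_out).max() < 0.9 * slow.max())
```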
The central aim of the present study is to introduce a new approach to assess auditory–visual interactions in ERPs. In a first step, the ERP to an omitted stimulus (O, ‘null-stimulus’) is added to the minuend side of the comparison: (O + AV) − (A + V). If the omitted stimulus elicited only C, the common activity would cancel, because two ERPs are now subtracted from two others. Unfortunately, omitted stimuli may elicit rather specific ERP deflections (a prolonged CNV and the so-called ‘missing stimulus related potential’; Busse and Woldorff, 2003; Simson et al., 1976). Therefore, in a second step, each stimulus is presented together with an additional tactile stimulus (T).
The ERP comparison is now (T + TAV) − (TA + TV). Unisensory ERP activity and common activity are eliminated in this comparison. Theoretically, the trimodal stimulus elicits additional ERP activity due to interactions of the auditory and the visual system, the auditory and the tactile system, and the visual and the tactile system, and possibly even trisensory interactions (Table 1). However, auditory–tactile and visuo-tactile interactions should also be eliminated, because each appears once on either side of the comparison (auditory–tactile in TAV and TA, visuo-tactile in TAV and TV). Therefore, auditory–visual and—if present—trisensory interactions are isolated in the comparison.
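The cancellation argument can be checked symbolically. The sketch below writes each ERP as a sum of unisensory activity, common activity C, and interaction terms (a simplified additive decomposition adopted for illustration, not a formal model of the underlying neural activity):

```python
import sympy as sp

# Unisensory activity (A, V, T), common activity (C), and
# interaction terms (iAV, iAT, iVT, iAVT) as free symbols.
A, V, T, C, iAV, iAT, iVT, iAVT = sp.symbols('A V T C iAV iAT iVT iAVT')

erp_T = T + C
erp_TA = T + A + iAT + C
erp_TV = T + V + iVT + C
erp_TAV = T + A + V + iAT + iVT + iAV + iAVT + C

# New comparison: only the auditory-visual and trisensory terms survive.
new = sp.expand((erp_T + erp_TAV) - (erp_TA + erp_TV))
print(new)

# Classical comparison: the common activity C remains (with inverted sign).
erp_A, erp_V, erp_AV = A + C, V + C, A + V + iAV + C
classic = sp.expand(erp_AV - (erp_A + erp_V))
print(classic)
```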
An ERP study was run to compare the two methods. ERPs to uni-, bi-, and trimodal auditory, visual, and tactile stimuli were recorded, while participants had to make speeded responses to infrequent target stimuli. Multisensory interactions were investigated using the new comparison (T + TAV) − (TA + TV). The results were compared with those of the ‘classical’ analysis approach, AV − (A + V). A new variant of the race model test was developed to check for redundancy gains due to trisensory interactions (Gondan and Röder, in revision; for the general procedure, see Miller, 1982).
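For reference, the standard bimodal race model test (Miller, 1982) compares the empirical distribution function of redundant-target reaction times with the bound implied by separate activation: F_AV(t) ≤ F_A(t) + F_V(t). The sketch below applies this standard inequality to simulated reaction times (all values hypothetical); the trisensory variant mentioned above follows the same logic with additional terms, but its exact form is given by Gondan and Röder and is not reproduced here.

```python
import numpy as np

def ecdf(rts, t_grid):
    """Empirical cumulative distribution of reaction times on a time grid."""
    rts = np.sort(np.asarray(rts))
    return np.searchsorted(rts, t_grid, side='right') / len(rts)

def race_violation(rt_av, rt_a, rt_v, t_grid):
    """Miller's inequality: under separate activation (a race model),
    F_AV(t) <= F_A(t) + F_V(t) for all t.  Positive return values
    mark time points where the bound is violated."""
    bound = np.minimum(ecdf(rt_a, t_grid) + ecdf(rt_v, t_grid), 1.0)
    return ecdf(rt_av, t_grid) - bound

# Simulated RT samples in ms; bimodal responses are faster than any
# race between the unimodal channels would allow.
rng = np.random.default_rng(1)
rt_a = rng.normal(320, 40, 200)
rt_v = rng.normal(340, 40, 200)
rt_av = rng.normal(240, 30, 200)

t_grid = np.arange(150, 500, 10)
violation = race_violation(rt_av, rt_a, rt_v, t_grid)
print(violation.max() > 0)  # True: evidence for coactivation
```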
Reaction time data
False alarms and misses were below 10% on average (i.e., less than 4 misses per target condition) and were not further analyzed. Reaction times for the seven types of target stimuli are shown in Table 2: responses to trimodal targets were fastest followed by responses to bimodal targets, and responses to unimodal targets were slowest (unimodal vs. bimodal: t18 = 12.3, P < 0.01; bimodal vs. trimodal: t18 = 7.9, P < 0.01). The reaction time gain in bimodal redundant targets (AV, TA, TV) was
Acknowledgment
This study was supported by the Emmy Noether grant Ro 1226/4-1/2/3 to BR of the German Research Foundation (DFG).
References (23)

- Barth et al. (1995). The spatiotemporal organization of auditory, visual, and auditory–visual evoked potentials in rat cortex. Brain Res.
- Busse and Woldorff (2003). The ERP omitted stimulus response to “no-stim” events and its implications for fast-rate event-related fMRI designs. NeuroImage.
- Fort et al. (2002). Early auditory–visual interactions in human cortex during nonredundant target identification. Cogn. Brain Res.
- Foxe et al. (2000). Multisensory auditory–somatosensory interactions in early cortical processing revealed by high-density electrical mapping. Cogn. Brain Res.
- Gobbelé et al. (2003). Activation of the human posterior parietal and temporoparietal cortices during audiotactile interaction. NeuroImage.
- Miller (1982). Divided attention: evidence for coactivation with redundant signals. Cognit. Psychol.
- Molholm et al. (2002). Multisensory auditory–visual interactions during early sensory processing in humans: a high-density electrical mapping study. Cogn. Brain Res.
- Simson et al. (1976). The scalp topography of potentials associated with missing visual or auditory stimuli. Electroencephalogr. Clin. Neurophysiol.
- Sternberg (2001). Separate modifiability, mental modules, and the use of pure and composite measures to reveal them. Acta Psychol.
- Teder-Sälejärvi et al. (2002). An analysis of audio–visual crossmodal integration by means of event-related potential (ERP) recordings. Cogn. Brain Res.
- Probability inequalities for testing separate activation models of divided attention. Percept. Psychophys.