Introduction
Several stimulus dimensions show a horizontal, spatial association (Macnamara et al.,
2018). Probably the most prominent example is numbers: Small numbers are assumed to be represented on the left and large numbers on the right of a spatial mental number line (Dehaene et al.,
1993; Feigenson et al.,
2004; Restle,
1970). Empirical evidence for this assumption stems from the SNARC effect (Spatial-Numerical Association of Response Codes) first investigated by Dehaene et al. (
1993). In that study, participants decided whether a presented number was odd or even by pressing a left-sided or right-sided response key. Participants responded faster to small numbers with the left-sided response key than with the right-sided response key, and vice versa for large numbers. In recent decades, a comparable effect has been found for other stimulus dimensions, which led to the general term SARC effect (Spatial Association of Response Codes; Macnamara et al.,
2018).
Another term widely used is the abbreviation SQUARC effect (Spatial-Quantity Association of Response Codes) introduced by Walsh (
2003) in the proposal of A Theory of Magnitude (ATOM; Bueti & Walsh,
2009). According to ATOM, the three domains of time, space, and quantity are represented on a common cortical metric, which can also be interpreted as a generalized magnitude representation system (Bonn & Cantlon,
2012). An important prediction of ATOM is the existence of spatial associations for each magnitude dimension. This association is reflected in the SQUARC effect, that is, shorter reaction times to small quantities with a left-sided response than with a right-sided response and vice versa for large quantities (Walsh,
2003,
2015).
In addition, SARC effects have also been shown for the auditory dimensions loudness and pitch. In general, participants respond faster to soft or low tones with a left-sided response key compared to a right-sided response key and vice versa for loud or high tones (Fairhurst & Deroy,
2017; Guilbert,
2020; Hartmann & Mast,
2017; Lega et al.,
2020; Lidji et al.,
2007; Rusconi et al.,
2006). The SARC effects for loudness and pitch also occur in the vertical dimension indicating that soft or low tones are associated with the spatial information ‘bottom’ while loud or high tones are associated with the spatial information ‘top’ (Bruzzi et al.,
2017; Fernandez-Prieto et al.,
2017; Lega et al.,
2020; Lidji et al.,
2007; Pitteri et al.,
2017; Rusconi et al.,
2006). This is in line with the observation that spatial associations for different stimulus dimensions can occur along several spatial axes, such as vertical or radial (i.e., near-far) axes (see Winter et al.,
2015 for a review).
SARC effects for pitch and loudness are typically explained by assuming a spatial representation of the corresponding auditory dimension. However, the structure of the assumed spatial representation differs between pitch and loudness. Pitch is assumed to be represented on a two-dimensional spatial helix structure (Shepard,
1982; Ueda & Ohgushi,
1987). In contrast, the SARC effect for loudness is explained by loudness being represented as a magnitude dimension with a linear spatial association (e.g., Bruzzi et al.,
2017). This indicates that the effects rely on similar mechanisms but separate representations with different spatial structures. However, little is known about how these assumed separate representations relate to each other, that is, whether the SARC effects for pitch and loudness can occur simultaneously and independently of each other. Therefore, this study aimed to investigate the interrelation between the SARC effects for pitch and for loudness. To this end, we tested whether both SARC effects interact with each other, which would be reflected in a larger SARC effect for pitch for loud tones compared to soft tones and a larger SARC effect for loudness for high tones compared to low tones. For a better distinction between both effects, we will refer to the effects as SPARC effect (Spatial-Pitch Association of Response Codes, Lidji et al.,
2007) and as SLARC effect (Spatial-Loudness Association of Response Codes).
Before outlining the assumed interrelation in more detail, we will first describe the main findings for the SPARC and SLARC effects separately. The SPARC effect depends on the interplay of various factors, namely, the musical experience of the participants, the spatial arrangement of response keys, and whether pitch is the task-relevant dimension or not. When participants have to classify the pitch of a presented tone relative to a standard pitch, a SPARC effect occurs regardless of the spatial arrangement of the response keys and musical experience (Guilbert,
2020; Lega et al.,
2020; Lidji et al.,
2007; Rusconi et al.,
2006). In contrast, when pitch is not relevant for the task and participants have to respond to another attribute of the tone, for example its timbre, non-musicians show a SPARC effect only with vertically but not with horizontally aligned response keys. Musicians, by contrast, still show a SPARC effect in a timbre discrimination task in the horizontal dimension as well as in the vertical dimension (Lega et al.,
2020; Lidji et al.,
2007; Rusconi et al.,
2006).
The occurrence of the vertical SPARC effect is typically explained by the assumption of a mental spatial representation of pitch on a bottom-to-top helix structure (Shepard,
1982; Ueda & Ohgushi,
1987). Low pitches are assumed to be represented bottom while high pitches are assumed to be represented top. Additionally, the assumed representation model takes into account that the relationship between the physical dimension frequency and the perception of pitch is non-linear: An increasing frequency does not only lead to the impression of an increasing pitch height but does also change the perceived quality of the tone. This is referred to as pitch chroma and is indicated by the circular organization of tones within a helix plane. The occurrence of the vertical SPARC effect even under conditions in which pitch is not relevant for the task further indicates that the spatial information is automatically co-activated comparable to the automatic activation in case of the SNARC effect in a parity judgment task (Dehaene et al.,
1993), which is in line with the assumption of an innate spatial representation of pitch. Note that an alternative explanation could be the semantic overlap between the response codes and the stimulus codes, as pitch is described in spatial terms in some languages. However, the SPARC effect also occurs in participants whose native language does not describe pitch via spatial terms (Fernandez-Prieto et al.,
2017), thus invalidating the semantic overlap explanation.
Contrary to the SPARC effect, the SLARC effect does not depend on the arrangement of response keys or the relevance of loudness for the task. Although earlier studies did not find a SLARC effect in the horizontal dimension (Ren et al.,
2011), the effect was later found by other studies (Chang & Cho,
2015; Fairhurst & Deroy,
2017; Hartmann & Mast,
2017). Furthermore, several studies found the SLARC effect in the vertical dimension (Bruzzi et al.,
2017; Fernandez-Prieto et al.,
2017). The SLARC effect is not only present when participants have to judge the loudness of a tone relative to a standard tone (Bruzzi et al.,
2017; Hartmann & Mast,
2017) but does also occur in timbre discrimination tasks with horizontally (Chang & Cho,
2015) and vertically arranged response keys (Koch et al.,
2023). This indicates an automatic activation of the spatial information of loudness comparable to the automatic activation of the spatial information in the case of pitch.
Typically, the SLARC effect is explained by assuming that loudness is represented as a magnitude according to ATOM (Bueti & Walsh,
2009; Walsh,
2003), as suggested for instance by Bruzzi et al. (
2017). The SLARC effect found in previous studies is in line with this prediction as loud tones, that is tones with high intensity, are associated with right and soft tones associated with left (Hartmann & Mast,
2017). Furthermore, ATOM predicts interactions between magnitude dimensions (Walsh,
2003,
2015) and several studies found that loudness interacts with other magnitude dimensions like numerical magnitude (Alards-Tomalin et al.,
2015; Hartmann & Mast,
2017; Heinemann et al.,
2013) or physical size (Smith & Sera,
1992; Sutherland et al.,
2014; Takeshima & Gyoba,
2013) which also supports the assumption that loudness is represented as a magnitude. Furthermore, loudness is a prothetic or quantitative dimension (Stevens,
1957; Stevens & Galanter,
1957), which is an important theoretical prerequisite for a dimension to be considered part of ATOM (Walsh,
2003,
2015). Taken together, assuming that loudness is represented as a magnitude in the sense of ATOM, the SLARC effect could be considered as an instance of the general SQUARC effect. This assumption is also supported by the notion that the SLARC effect seems to be continuous (Koch et al.,
2023) which contradicts the most prominent alternative explanation, namely the polarity correspondence principle (Chang & Cho,
2015; Proctor & Cho,
2006).
So far, empirical evidence suggests that the SPARC and SLARC effects are due to two different spatial representations but a direct empirical test of this assumption is still missing. In addition, previous studies investigating spatial associations for spoken number words have found that the SPARC and SLARC effects interact with the already mentioned SNARC effect in a way that contradicts several theoretical assumptions. Numbers are assumed to be represented as a magnitude in terms of ATOM (Bueti & Walsh,
2009; Walsh,
2003,
2015) and therefore would share a magnitude representation with loudness. Pitch, on the other hand, is explicitly excluded from the conceptualization of ATOM because it is a metathetic or qualitative dimension (Stevens,
1957; Stevens & Galanter,
1957). Based on these assumptions, two interaction patterns between the SLARC, SPARC, and SNARC effects are plausible. First, from the premise that an interaction between spatial associations indicates a common origin, one would expect that the SLARC and SNARC effects should interact. This interaction could be reflected in a larger SLARC effect for large spoken numbers compared to small numbers as well as a larger SNARC effect for loud spoken number words compared to soft spoken number words. The SPARC and SNARC effects should be independent of each other. Alternatively, since a shared representation does not rule out purely additive effects (Sternberg,
1969), a second possible scenario could be that loudness and numbers share a common representation, but that the SLARC and SNARC effect simply do not interact. Importantly, the SPARC and SNARC effects should still not interact. Results from previous studies contradict both scenarios: While the SLARC effect does not interact with the SNARC effect (Hartmann & Mast,
2017), the SPARC effect does interact with the SNARC effect (Fischer et al.,
2013; Weis et al.,
2015,
2016).
In a study by Hartmann and Mast (
2017), participants heard spoken number words and had to classify the numerical value, loudness level, or parity. There was no interaction between the SNARC effect and the SLARC effect. Additionally, both effects were limited to the dimension-related task. The SNARC effect only occurred in the parity and number judgment tasks while the SLARC effect was limited to the loudness judgment task. However, both effects are known to occur even when number magnitude or loudness is irrelevant (Chang & Cho,
2015; Fias,
2001; Koch et al.,
2023; for a review for the SNARC effect see Wood et al.,
2008). This raises the question of whether two SARC effects can generally occur simultaneously.
Indeed, studies that found an interaction between the SPARC effect and the SNARC effect also found a simultaneous occurrence of both effects (Weis et al.,
2015,
2016). However, the results are not entirely consistent, which might be due to different experimental setups. For example, Fischer et al. (
2013) investigated the SPARC effect and the SNARC effect in a pitch discrimination task and a number discrimination task with diagonally arranged response keys. Both effects only occurred when the corresponding dimension was task-relevant. Crucially, there was a reversed SNARC effect for the SPARC incompatible trials but not for the SPARC compatible trials. In the studies conducted by Weis and colleagues (Weis et al.,
2015,
2016), participants had to classify either the numerical value, pitch, or parity. Participants responded faster in SNARC compatible and SPARC compatible trials compared to incompatible trials regardless of the task. Furthermore, there was a significant interaction between SPARC compatibility and SNARC compatibility. The authors concluded that both effects share a common automatic decision mechanism and further suggested that this mechanism might be based on a common representation of pitch and numbers in the sense of ATOM (Weis et al.,
2016). However, as already mentioned, pitch is explicitly excluded from the conceptualization of ATOM due to its metathetic or qualitative characteristic (Walsh,
2003,
2015). From a theoretical point of view, the interaction between SPARC and SNARC compatibility cannot be explained in terms of a common representation in the sense of ATOM.
Taken together, previous findings contradict predictions regarding potential interactions between SARC effects. In addition, it is unclear under which circumstances and for which dimensions two SARC effects can occur simultaneously. Therefore, the aim of our study was twofold. First, we wanted to investigate whether the SLARC effect occurs simultaneously with another spatial association, namely the SPARC effect. We used a timbre discrimination task in which participants had to decide whether a single tone was a violin tone or an organ tone while pitch and loudness were varied orthogonally. Neither loudness nor pitch was relevant for the task, which allowed us to investigate whether both effects would show an automatic, simultaneous occurrence. Furthermore, timbre is equally strongly related to both task-irrelevant dimensions, which is not the case for parity as used in previous studies (Hartmann & Mast,
2017; Weis et al.,
2015,
2016), which is more strongly related to the numerical value than to pitch or loudness. As a second aim, we wanted to test whether the interaction between the SPARC and SNARC effects (Weis et al.,
2015,
2016) generalizes to the interrelation between the SLARC and the SPARC effects. The interaction between the SPARC and SNARC effects is explained by a shared representation according to ATOM (Weis et al.,
2016). If this is the case, then the interaction should generalize to other magnitude dimensions as well. The SLARC effect is explained by an assumed magnitude representation of loudness (Bruzzi et al.,
2017), and therefore one would also expect an interaction between the SLARC effect and the SPARC effect. This would be reflected in a larger SPARC effect for loud tones compared to soft tones as well as a larger SLARC effect for high tones compared to low tones.
To test the simultaneous occurrence as well as a possible interaction between both effects, we conducted a multiple linear regression, with the difference of reaction time between top-sided and bottom-sided responses (dRT) as dependent variable and loudness and pitch as predictors (Fias et al.,
1996). We predicted that the SPARC effect and the SLARC effect would occur simultaneously indicated by a negative regression coefficient for loudness as well as for pitch. Previous studies found that SARC effects for two prothetic dimensions did not occur simultaneously (Hartmann & Mast,
2017; Vellan & Leth-Steensen,
2022; Weis et al.,
2018) whereas SARC effects for one prothetic and one metathetic dimension occurred simultaneously (Weis et al.,
2015,
2016). Because pitch and loudness are regarded as metathetic and prothetic dimensions, respectively (Stevens & Galanter,
1957), SARC effects for both dimensions should occur at the same time. With regard to our second research aim, we predicted that if an interaction between both effects occurred, it should be reflected in larger dRT differences between soft and loud tones which are high in pitch compared to tones which are low in pitch.
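The logic of the dRT regression can be illustrated with a minimal sketch. It assumes that dRT is computed as the reaction time for top-sided responses minus the reaction time for bottom-sided responses, so that negative slopes indicate SPARC and SLARC effects in the predicted direction. The function name, the 4 x 4 design, the centring of the predictors, and all numerical values are hypothetical and serve illustration only; they are not the analysis code or the data of the present study.

```python
import numpy as np

def fit_drt_regression(loudness, pitch, drt):
    """Least-squares fit of dRT = b0 + b1*loudness + b2*pitch + b3*loudness*pitch.

    Predictors are mean-centred (an assumption of this sketch) so that each
    main-effect slope is evaluated at the average level of the other predictor.
    """
    loudness = np.asarray(loudness, dtype=float)
    pitch = np.asarray(pitch, dtype=float)
    drt = np.asarray(drt, dtype=float)
    l_c = loudness - loudness.mean()
    p_c = pitch - pitch.mean()
    # Design matrix: intercept, two main effects, and their product term
    X = np.column_stack([np.ones_like(l_c), l_c, p_c, l_c * p_c])
    coef, *_ = np.linalg.lstsq(X, drt, rcond=None)
    return dict(zip(["intercept", "loudness", "pitch", "interaction"], coef))

# Hypothetical design: 4 loudness levels crossed with 4 pitch levels
levels = np.arange(1, 5)
loud, pit = np.meshgrid(levels, levels, indexing="ij")
loud, pit = loud.ravel(), pit.ravel()

# Made-up, noise-free dRT values (ms) that are purely additive
drt = 30.0 - 8.0 * loud - 5.0 * pit

coefs = fit_drt_regression(loud, pit, drt)
print(coefs)
```

In this made-up example, negative slopes for both predictors together with an interaction coefficient of zero correspond to simultaneous but purely additive effects, whereas a non-zero interaction coefficient would correspond to the interaction scenario described above.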
Discussion
The first aim of this study was to investigate whether the SPARC effect and the SLARC effect occur simultaneously in a timbre discrimination task, that is, when loudness and pitch are irrelevant for the task. Indeed, loudness as well as pitch interacted with response side: Participants responded faster to high and loud tones when responding with the top-sided response key compared to the bottom-sided response key and vice versa for soft and low tones. The dRT analyses further revealed that mean dRT decreased linearly with increasing loudness as well as with increasing pitch. These results show that both the SPARC effect and the SLARC effect occurred, supporting our first hypothesis regarding the simultaneous occurrence of both effects. A second aim of this study was to investigate a potential interrelation between the SPARC effect and the SLARC effect, indicated by an interaction between both effects. Contrary to our second hypothesis, the predictors loudness and pitch did not interact in the dRT analyses and the effects were purely additive.
Previous studies investigated either the SLARC effect or the SPARC effect in a timbre discrimination task (Koch et al.,
2023; Lega et al.,
2020; Lidji et al.,
2007; Rusconi et al.,
2006). The results from our study not only replicated these effects but also showed that both effects can occur simultaneously. Our SPARC effect was numerically smaller compared to results from other studies. This may be explained by the more limited pitch range in our experiment compared to the pitch ranges used in previous studies (e.g., Lidji et al.,
2007; Rusconi et al.,
2006). As loudness and pitch were both task-irrelevant, the results indicate a simultaneous and automatic activation of the spatial information in both dimensions. Additionally, the continuous linear decrease of dRT with increasing loudness level indicates a continuous spatial representation rather than a categorization as it would be predicted by, for example, the polarity correspondence principle (Proctor & Cho,
2006).
The occurrence of SARC effects even when the corresponding dimension is not relevant for the task is generally considered an indication of automatic activation of the implicit spatial information for the corresponding dimension (Dehaene et al.,
1993; Weis et al.,
2015). However, the use of bimanual responses may induce a spatial bias, and the processing of the spatial information would no longer be considered implicit (Shaki & Fischer,
2018; Sixtus et al.,
2019). These studies used non-lateralized responses, meaning participants responded with a single response key in a go/no-go task, and therefore processed the magnitude information and spatial information implicitly. In these paradigms, participants did not respond faster when the number magnitude and horizontal spatial information matched. Therefore, it was interpreted that the horizontal SNARC effect may not reflect a spatial representation but rather a spatial processing bias. In contrast to the horizontal spatial information, a reaction time benefit was observed when number magnitude and vertical spatial information matched. This suggests that the vertical association may be inherently linked to the concept of magnitude (Shaki & Fischer,
2018; Sixtus et al.,
2019). Regarding pitch and loudness, a next step may be to investigate whether the vertical associations of these auditory dimensions still occur when the spatial information is processed implicitly in a setting with non-lateralized responses.
The results show that the SLARC effect can indeed occur simultaneously with another SARC effect, in this case the SPARC effect. The question remains why this was not the case when investigating the SLARC effect and the SNARC effect (Hartmann & Mast,
2017). One possible explanation might be an influence of the task. Fischer et al. (
2013) argued that in the case of two competing dimensions, a potential SARC effect might only arise for the dimension which is relevant for the task. Although loudness and number magnitude are both irrelevant in a parity judgment task as used by Hartmann and Mast (
2017), parity is more strongly related to the numerical value of a number than to loudness. However, in the present experiment, the task-relevant dimension timbre might be equally strongly related to pitch and loudness and therefore a comparably strong automatic activation of the spatial information in both dimensions might have been possible. On the other hand, Weis and colleagues (
2015,
2016) also used a parity judgment task and did find simultaneous SARC effects.
For our study, separate frequentist and Bayesian analyses showed that timbre had only a negligible influence on the SPARC and SLARC effect: Both effects occurred in most of the timbre conditions. Nevertheless, the interaction patterns involving timbre partially differed for pitch and loudness. There was a significant interaction between timbre and pitch with shorter reaction times for high violin and low organ tones compared to low violin and high organ tones. This interaction pattern is comparable to the timbre-pitch interaction found by Melara and Marks (
1990). Loudness and timbre did not interact in our study, indicating a slightly different influence of timbre on the processing of pitch and loudness. However, even though timbre might not be completely equally related to pitch and loudness in our study, it did not influence the SPARC and SLARC effect.
Another explanation for discrepancies with regard to the simultaneous occurrence of SARC effects could be that SARC effects for prothetic dimensions do not occur simultaneously in general. Previous studies suggest that this is at least the case for the SNARC effect and the SARC effect for physical size (Vellan & Leth-Steensen,
2022; Weis et al.,
2018). However, as these studies used either a number or a size discrimination task, the non-simultaneous occurrence might be due to the different relevance of the dimensions for the task (Fischer et al.,
2013). Therefore, and because empirical evidence of concurrent SARC effects is rare, this explanation should be taken with caution. Further research is needed on the simultaneous occurrence of different SARC effects and how this relates to metathetic and prothetic dimensions.
The interaction between the SPARC effect and SNARC effect found by prior studies (Fischer et al.,
2013; Weis et al.,
2015,
2016) did not generalize to the SLARC effect in our study. This indicates that the interaction between the SPARC effect and the SNARC effect was not due to a shared representation in the sense of ATOM as some authors suggested (Weis et al.,
2016). If this had been the case, the interaction should have generalized to the SLARC effect, as loudness is suggested to be represented as a magnitude in the sense of ATOM (Bruzzi et al.,
2017; Hartmann & Mast,
2017). Instead, other mechanisms might have been responsible for the interdependence between the SPARC effect and SNARC effect, for example, sharing common central processes as already mentioned by Weis et al. (
2016).
The lack of an interaction between the SPARC and the SLARC effect supports the assumption that loudness and pitch are represented separately. In addition, the continuous linear decrease of dRT with increasing pitch and loudness indicates that both distinct representations may be continuous. According to Lidji et al. (
2007), the SPARC effect may rely on a spatial representation as proposed in former representational models of musical pitch (Shepard,
1982; Ueda & Ohgushi,
1987), while Bruzzi et al. (
2017) suggest that the SLARC effect is due to a generalized magnitude representation of loudness according to ATOM (Walsh,
2003).
Models aiming to describe the mental representation of pitch assume that pitch is represented spatially on a helix structure (Shepard,
1982; Ueda & Ohgushi,
1987). This assumption takes into account that an increase in frequency does not only lead to an increase in perceived pitch height but also to a change in the perceived pitch chroma. Two pitches with the same pitch chroma but from different octaves, for example C4 (261 Hz) and C5 (523 Hz), are considered subjectively more similar than two pitches with different chromas but closer frequencies, such as C4 (261 Hz) and F4 (349 Hz). Nevertheless, while pitch chroma is assumed to be represented circularly, the helix structure comprises a constant vertical increase in pitch height. Therefore, even musical tones with the same pitch chroma differ in their pitch height. Thus, a spatial helix representation would still predict a continuous decrease of dRT with increasing frequency, similar to the result pattern in the current study.
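The distinction between pitch height and pitch chroma can be made concrete with a simplified sketch of a helix. It assumes that one octave corresponds to one full turn around the helix and to one unit of height, with C4 (261 Hz) as an arbitrary reference and a unit radius. These parameters and the function names are illustrative assumptions, not the parametrization used by Shepard (1982) or Ueda and Ohgushi (1987).

```python
import math

def helix_coordinates(freq_hz, ref_hz=261.0):
    """Map a frequency to (x, y, height) on a simplified pitch helix.

    Height is the distance from the reference in octaves; the angle within
    the horizontal plane encodes pitch chroma (one full turn per octave).
    """
    octaves = math.log2(freq_hz / ref_hz)
    angle = 2.0 * math.pi * (octaves % 1.0)
    return math.cos(angle), math.sin(angle), octaves

def chroma_distance(f1, f2):
    """Distance within the horizontal plane only (difference in chroma)."""
    x1, y1, _ = helix_coordinates(f1)
    x2, y2, _ = helix_coordinates(f2)
    return math.hypot(x1 - x2, y1 - y2)

def height_distance(f1, f2):
    """Distance along the vertical axis only (difference in pitch height)."""
    return abs(helix_coordinates(f1)[2] - helix_coordinates(f2)[2])

c4, f4, c5 = 261.0, 349.0, 523.0
print("chroma C4-C5:", chroma_distance(c4, c5), "chroma C4-F4:", chroma_distance(c4, f4))
print("height C4-C5:", height_distance(c4, c5), "height C4-F4:", height_distance(c4, f4))
```

Under these assumptions, C4 and C5 lie almost directly above each other (nearly identical chroma) although they are separated by a full octave in height, whereas C4 and F4 are closer in height but far apart in chroma. Height nevertheless increases monotonically with frequency, which is why the helix remains compatible with a continuous decrease of dRT.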
A continuous decrease of dRT would also be in line with the assumption of a one-dimensional, linear spatial representation of pitch, comparable to the representation of loudness. However, this representation would not be considered a magnitude representation according to ATOM, because pitch is a metathetic dimension (Stevens,
1957; Stevens & Galanter,
1957), and therefore not part of the generalized magnitude representation system according to ATOM (Walsh,
2003,
2015). Nevertheless, the question remains whether the SPARC effect relies on a helix structure or on a one-dimensional spatial representation, and the results of the current study do not allow us to distinguish between these spatial representation structures. Future studies should address the question of whether reaction times indicating a SPARC effect also indicate a spatially organized helix structure of the underlying representation, for example by taking into account the influence of pitch similarity on reaction times in same-different judgments (Cohen Kadosh et al.,
2008).
In contrast to pitch, loudness may be represented as a magnitude in the sense of ATOM (Bueti & Walsh,
2009; Walsh,
2003). In this case, the SLARC effect would be an instance of the more general SQUARC effect. This assumption is supported by the prothetic character of loudness (Stevens,
1957; Stevens & Galanter,
1957) and by interactions between loudness and other ATOM-related magnitudes (Alards-Tomalin et al.,
2015; Hartmann & Mast,
2017; Heinemann et al.,
2013; Takeshima & Gyoba,
2013). The question remains whether a vertical SLARC effect is in line with this interpretation. The direction of the spatial association in the context of ATOM is not restricted to the horizontal dimension. Furthermore, it is assumed that numbers, one of ATOM’s most prominent quantity dimensions, are also spatially represented in the vertical dimension (Aleotti et al.,
2023; Ito & Hatta,
2004; see Winter et al.,
2015 for a review). This vertical spatial association might be present in other magnitudes as well.
In conclusion, our study has shown that the SPARC effect and the SLARC effect occur simultaneously but appear to be independent of each other. This supports the interpretation that both effects are due to separate spatial representations. Furthermore, our study extended the findings on simultaneous SARC effects and showed that the implicit spatial information of two dimensions can be automatically activated simultaneously. Whether and how this is influenced by task characteristics or by specific characteristics of the dimensions (e.g., the distinction between prothetic and metathetic dimensions) needs to be investigated in further research. In addition, future research could help to understand the complex patterns of interaction between different SARC effects and what leads to interdependencies between spatial associations in different dimensions.