Introduction

The relation between numbers and space is ubiquitous in everyday life. In Western cultures, small numbers are linked to the left side of space, whereas large numbers are linked to the right side of space (Toomarian & Hubbard, 2018). The pervasiveness of this association can be experienced in our environment. For instance, the arrangement of the numbers on our keyboard has a left-to-right orientation, and the same is true for the numbers on the speed indicator in our cars.

The relation between numbers and space has been experimentally investigated by means of studies on Spatial-Numerical Associations (SNAs; Fischer & Shaki, 2015). The Spatial-Numerical Association of Response Codes (SNARC) effect is a paradigmatic example of SNA. For instance, in magnitude comparison tasks, participants must judge whether a number is larger or smaller than a reference one. Participants’ responses to larger numbers are consistently faster when executed on the right rather than on the left side of the space. By contrast, participants’ responses to smaller numbers are faster when executed in the left side rather than in the right side of the space (Dehaene, Dupoux, & Mehler, 1990). The SNARC effect has also been observed in parity judgment tasks, where participants must judge whether a number is odd or even (Dehaene, Bossini, & Giraux, 1993). Thus, the SNARC effect suggests that numbers are coded along a left-to-right linear vector (i.e., the mental number line: MNL), where small numbers occupy the leftmost positions of the vector and large numbers the rightmost positions. Alternative accounts invoke the presence of polarity coding (Proctor & Cho, 2006) for all tasks requiring the performance of speeded binary classification tasks (e.g., the SNARC effect, the linguistic markedness of response codes: MARC effect, Nuerk, Iversen, & Willmes, 2004a). Nevertheless, in contrast to the polarity-coding account, a dissociation between the MARC and the SNARC effect has been recently reported (Di Rosa et al., 2017). Another influential theory argues that the SNARC effect might be the result of a neural overlap between the regions that process magnitudes, such as space, numbers, and time (Walsh, 2003). It should be noted, however, that these theories are not mutually exclusive. Indeed, recent evidence has shown that the SNARC effect is characterized by a multicomponential nature (Basso Moro, Dell’Acqua, & Cutini, 2018).

Interestingly, the SNARC effect has been consistently replicated with negative numbers (Fischer, 2003; Shaki & Petrusic, 2005) and with different representation formats, such as number-words (Fias, 2001; Nuerk, Kaufmann, Zoppoth, & Willmes, 2004b). With Arabic numerals, the SNARC effect has been also observed in tasks that do not require the processing of arithmetic information, such as phoneme monitoring, where participants are asked to judge whether a phoneme is present or not in the corresponding spoken number word of the presented Arabic digit (Fias, Brysbaert, Geypens, & d’Ydewalle, 1996). All these findings indicate that the SNARC effect is probably the most robust and widely replicated evidence of SNAs (for reviews, see Fischer & Shaki, 2014; Hubbard, Piazza, Pinel, & Dehaene, 2005; Wood, Nuerk, Willmes, & Fischer, 2008).

Although Arabic numerals have been employed in the vast majority of studies on the SNARC effect, numeric information can also be conveyed by means of other numerals (e.g., Roman numerals, written number words, spoken number words, etc.). In addition, number information (e.g., numerosity) can also be conveyed by sets of perceived stimuli (e.g., three fingers, seven tones, etc.) presented in different sensory modalities (e.g., visual, tactile, or auditory). To this extent, Nuerk, Wood, and Willmes (2005) tested the SNARC effect, with different stimuli, beside Arabic numerals, such as written number words, sequences of acoustic stimuli, and sets of dots spatially arranged to resemble dice patterns. The results revealed that the activation of an abstract representation of numbers (i.e., MNL) can be elicited by using different sensory modalities. Indeed, the SNARC effect with the auditory stimuli did not differ from that found with visual stimuli. Notably, dots arranged in dice-like patterns yielded the SNARC effect in the same way that Arabic numbers did.

The core feature of dots arranged in dice-like patterns, used by Nuerk et al. (2005), was their structured spatial arrangement. Indeed, the spatial arrangement of the stimuli was characterized by symmetry and equal inter-dots distances. Note, however, that the overlearned arrangement of dice-like patterns might guide access to the MNL not by means of visual enumeration of the displayed dots (i.e., subitizing and/or estimation; Cutini, Scatturin, Basso Moro, & Zorzi, 2014), but by means of automatic pattern recognition, in a way similar to that of Arabic digit recognition, letter (e.g., see Schmitt, 2016) and non-letter shapes (for a review, see Lachmann & van Leeuwen, 2014).

In the present study, we extended the conceptual framework of Nuerk et al. (2005) by investigating how the spatial configuration of the displayed stimuli could influence the emergence of the SNARC effect. Hence, we presented participants with visual stimuli, constructed according to different configurational criteria. Beside the structured arrangement (i.e., dice-like patterns), we used an unstructured arrangement of the stimuli (i.e., random pattern) whose numerosity can only be processed by means of visual enumeration. With this extension, we aimed to explore the presence of the SNARC effect in relation to the structure of the displayed visual stimuli. If the presence of the SNARC effect relied uniquely on pattern recognition (e.g., the dice-like patterns), we expected to find a SNARC effect with structured stimuli, but not with unstructured stimuli. Alternatively, if the SNARC effect did not depend entirely on the structure of the visual stimuli, then the SNARC effect should be observed with both structured and unstructured stimuli, suggesting that both stimulus arrangements have access to a common MNL.

Method

Participants

Forty students (20 males, 20 females; age: M = 22.6 years; SD = 1.7) from the University of Padua participated in this study. Thirty-nine participants were right-handed and one participant was left-handed. All participants had normal or corrected-to-normal visual acuity and they were naïve about the purpose of the experiment. The number of participants was defined a priori, by means of the software G*POWER 3 (Power = .95 α = .001, and effect size = .25; Faul, Erdfelder, Lang, & Buchner, 2007). The study was approved by the institutional ethics committee.

Apparatus

The experiment was conducted on an INTEL(R) CORE(TM) 2 PC (operating system: Microsoft Windows XP Professional, version, 2002, Service Pack 3; CPU: 6320, 1,86G Hz; RAM: 1 GB), connected to a Cathodic Ray Tube monitor (screen dimension: 21-in.; colors: 32 bits; refresh rate: 100 Hz; resolution: 1,024 x 768 pixels; video card: Intel(R) GMA 3000, NAC: Internal memory: 256 MB).

Participants were seated 57 cm from the computer screen with their head placed on a chinrest. Participants’ responses were collected using two keys on a response box (https://www.pstnet.com/hardware.cfm?ID=102) centrally positioned in front of the participants. A felt pad was applied over the right key of the response box, while the left key was left smooth. The keys were, hence, referred to as the “key with the felt pad” and “the key without the felt pad”, in order to avoid the explicit expression “right-sided key” and “left-sided key” (the position of the felt pad was not counterbalanced across participants). The experiment was programmed and administered by means of E-Prime2 ProRC 8 (Version: 2.0.8.90; Psychology Software Tools, Inc., 2012).

Stimuli

Stimuli were a central cross (white, dimension: 0.5°) followed by a set of white rectangles (width: 0.5°; height: 3°), included in a 12° x 12° area, and presented in the center of a black background screen. The rectangles varied in number size (from one to nine; five was excluded), orientation (0° or 90°; this manipulation was added in order to increase variability, to avoid ceiling effects due to perceptual learning), and configuration (structured or unstructured). Stimuli were “structured” when rectangles were organized following constant and regular distances in space, similar to the dice pattern. In contrast, stimuli were “unstructured” when rectangles were organized randomly, without any regularity in the distances among them (Fig. 1).

Fig. 1
figure 1

The complete set of stimuli

Procedure

Each participant was seated in front of the computer screen and his/her index fingers were placed on the respective response box’s keys (right and left). Participants had to decide whether the number of rectangles was smaller or larger than the reference number (five), by pressing, as fast and accurately as possible, one of the two response keys. In the first block, the “smaller-than-five” response was assigned to the “key without felt pad” and the “larger-than-five” response to the “key with felt pad”; in the second block, the response-key assignment was reversed. The abovementioned response-key assignment was counterbalanced across participants (i.e., half of the participants performed the sequence first block ➔ second block and half of the participants performed the sequence second block ➔ first block).

Each trial started with the presentation of the fixation cross, lasting for 500 ms. Then, the fixation cross disappeared and, after a variable inter-stimulus interval (ISI = 200–500 ms) in order to avoid automatic responses, the target stimulus (i.e., a set of rectangles) appeared for 100 ms. Afterwards, a white mask appeared (until the response was executed or 500 ms elapsed). Finally, a central cross appeared, lasting for a variable duration (i.e., 750–1,250 ms) before the next trial began. Responses were recorded from the target stimulus onset over a maximum period of 1,500 ms (Fig. 2).

Fig. 2
figure 2

Trial sequence

Design

We used a within-participants experimental design. Independent variables were number size (two levels: small: 1–4/large: 6–9), response hand assignment (two levels: left/right), and stimulus configuration (two levels: structured/unstructured). Therefore, independent variables were manipulated according to a 2 X 2 X 2 design. The dependent variable was the reaction time (RT) of each participant’s response. Eight conditions resulted by crossing all levels of independent variables with each other. Half of them were compatible with the SNARC effect (i.e., number size smaller than five – left-key response, and number size larger than five – right-key response), whereas the other half were incompatible (i.e., number size smaller than five – right-key response, and number size larger than five – left-key response).

The experiment consisted of two blocks of 160 trials each. The response-key assignment was counterbalanced across participants, and stimulus configuration was randomized. All participants received eight practice trials before the experimental blocks.

Results

We performed statistical analyses using IBM-SPSS for Windows (version 23). Responses on the practice trials were excluded from further analyses. The percentage of correct trials and errors was calculated for each participant. Error rate (i.e., incorrect responses and omissions) was 4.2% of all the trials. Participants with error percentages lower than 2.5 SDs from the mean of all participants would have been excluded from the analyses. Nevertheless, none of the participants fulfilled this criterion; thus, all participants were included in the following analyses. For correct trials only, we then calculated mean RTs and SDs, separately for each participant and each experimental condition. Finally, RTs above and below 2.5 SDs from each participant/condition mean (i.e., 4.09% of correct trials), were excluded from further analyses.

A three-way repeated-measures analysis of variance (ANOVA) with Number Size (small/large), Response Hand Assignment (left/right), and Stimulus Configuration (structured/unstructured) was carried out. The main effect of Number Size was significant, F(1, 39) = 4.765, p = .035, ηp2 = .109. The main effect of Response Hand Assignment was not significant, F(1, 39) = .439, p = .511, ηp2 = .011, nor was the main effect of Stimulus Configuration, F(1, 39) = .601, p = .443, ηp2 = .015.

The interaction Number Size X Response Hand Assignment was significant, indexing the SNARC effect, F(1, 39) = 10.427, p = .003, ηp2 = .211 (Fig. 3). A priori simple effects analyses, by means of one-tailed, paired-samples t-tests, revealed significant differences (Table 1). In particular, RTs for the small-left condition were significantly faster than RTs for the large-left condition. In addition, RTs for the large-right condition were significantly faster than RTs for the small-right condition (Table 1).

Fig. 3
figure 3

The interaction Number Size X Response Hand Assignment. Error bars represent confidence intervals (95%) corrected for within-participants designs (O’Brien & Cousineau, 2014; single method)

Table 1 Paired samples t-tests on the interaction Number Size X Response Hand Assignment

The interaction Number Size X Stimulus Configuration was significant F(1, 39) = 30.987, p = .000002, ηp2 = .443 (Fig. 4). A posteriori simple-effect analyses, by means of two-tailed, paired-samples t-tests, corrected with Bonferroni, revealed significant differences (Table 2). In particular, RTs for the large-unstructured condition were significantly faster than RTs on the small-unstructured condition. In contrast, RTs for the small-structured condition were significantly faster than RTs on the large-structured condition.

Fig. 4
figure 4

The interaction Number Size X Stimulus Configuration. Error bars represent confidence intervals (95%) corrected for within-participants designs (O’Brien & Cousineau, 2014; single method)

Table 2 Paired samples t-tests on the interaction Number Size X Stimulus Configuration

The interaction Response Hand Assignment X Stimulus Configuration was not significant, F(1, 39) = 1.384, p = .247, ηp2 = .034, as well as the crucial interaction Number Size X Response Hand Assignment X Stimulus Configuration F(1, 39) = .734, p = .397, ηp2 = .018.

Pearson’s correlation coefficient was computed to assess the relation between the SNARC effect for the structured condition and the SNARC effect for the unstructured condition. First, for each participant, the structured and unstructured conditions were considered separately. Then, the strength of the SNARC effect was tested by calculating the difference between RTs (i.e., dRT) for small numbers and RTs for large numbers, separately. For small numbers, RTs of the left hand (i.e., compatible condition with the SNARC effect) were subtracted from RTs of the right hand (i.e., incompatible condition with the SNARC effect). In contrast, for large numbers, RTs of the right hand (i.e., compatible condition with the SNARC effect) were subtracted from RTs of the left hand (i.e., incompatible condition with the SNARC effect). Afterwards, the mean dRTs for small and large numbers were calculated, for each participant, separately for the unstructured and the structured conditions. These mean dRTs were finally subjected to Pearson’s correlation analysis. The positive correlation between the dRTs of the unstructured and the mean dRTs of the structured stimulus configuration was remarkably high, r = .895, p = 6.4525e-15.

In order to establish whether the results provided evidence for or against the null hypothesis (H0), a Bayesian factor (BF01) hypothesis test was carried out using a prior distribution of .707. A Bayesian paired-sample t-test was performed to compare the strength of the SNARC effect between structured and unstructured conditions. The results indicated that the observed data were 4.161 times more likely under H0 than under the alternative hypothesis (H1; Table 3).

Table 3 Bayesian Pearson’s correlation results

Moreover, a Bayesian Pearson correlation test between structured and unstructured conditions was performed. The results were in line with the frequentist inference analysis results.

Results were further analyzed by means of a regression for repeated-measures designs (Method 3, Lorch & Myers, 1990). For each participant, a linear regression analysis was performed, both for structured and for unstructured stimuli. The independent variable was Number (i.e., 1–9, except five), whereas the dependent variable was the difference between the RTs for the right hand and the RTs for the left hand (dRT: RTs right hand − RTs left hand; note that negative dRTs indicate faster responses with the right hand and vice versa). The betas obtained with the regression were significantly different from zero for both structured (Mβ = 40.68, SD = 77.11, t(39) = 3.336, p = .002, 95% CI [16.02, 65.34], d = .528, CI [.193, .856]) and unstructured stimuli (Mβ = 43.18, SD = 99.24, t(39) = 2.752, p = .009, 95% CI [11.45, 74.92], d = .435, CI [.108, .757]).

Furthermore, the betas for the structured and unstructured stimuli were strongly correlated (r = .801, p = 5.5925e-10; Fig. 5), consistent with the positive correlation found between dRTs of the structured stimulus configuration and dRTs of the unstructured stimulus configuration.

Fig. 5
figure 5

Scatterplot of the correlation between the betas for unstructured and structured stimuli

Discussion

Our working hypothesis stemmed from the pioneering study of Nuerk et al. (2005), who found that structured stimuli, with dots arranged in a dice-like format, elicited a SNARC effect. Furthermore, Nemeh, Humberstone, Yates, and Reeve (2018) and Zhou, Shen, Li, and Cui (2016) have reported the SNARC effect with unstructured stimuli, by means of large numerosities (i.e., arrays with numerosities of between 11 and 60 elements). Our results support and extend these findings, by testing the specific effect of stimulus arrangement, with small numerosities (i.e., 1–9) that are typically used in classic SNARC paradigms. We have indeed shown that the SNARC effect is independent of the stimulus format, because both structured and unstructured stimuli elicited a consistent SNARC effect. If the SNARC effect was guided by the specific structure of the stimuli, the three-way ANOVA would have been significant. This was not the case: the effect size of the three-way non-significant interaction was very small (i.e., 0.018), and such a null effect could not be attributed to low statistical power, because our sample was a priori calculated to guarantee suitable power (.95). In the same vein, the Bayesian analysis confirmed that structured and unstructured stimuli yielded a comparable SNARC effect. As further proof, the SNARC effect for structured stimuli was remarkably correlated with that for unstructured stimuli.

On the other hand, the unexpected interaction between numerosity and stimulus configuration is difficult to interpret. For instance, it might originate from the specific spatial arrangement of the elements on each stimulus. Strikingly, while the size effect for structured stimuli shows the typical pattern observed for Arabic numerals (i.e., slower RTs as numerosity increases), we found the opposite pattern for unstructured stimuli: RTs for small numbers were slower than RTs for large numbers. This effect might be explained in terms of the peculiar aspects of the stimuli: in this vein, structured stimuli can be processed in a way similar to that of symbolic numerals (e.g., Arabic digits), whereas unstructured stimuli do not necessarily convey a symbolic connotation. An alternative explanation might reside in the influence exerted by visual features on the perceived numerosity of the stimuli: usually, a dense arrangement of the elements may trigger an overestimation while a sparse arrangement elicits an underestimation of numerosity (Gebuis & Reynvoet, 2012). Since unstructured stimuli are characterized by more densely arranged elements than structured ones (see Fig. 1), unstructured stimuli with large numerosity should yield faster RTs for the right hand than for the left hand (and vice versa for structured stimuli with low numerosity). Although this speculation is consistent with the present results, it deserves to be further investigated in future studies, in which the spatial arrangement of the stimuli should be further manipulated.

In conclusion, in the present study we highlighted for the first time that the spatial arrangement and structure of the array of stimuli does not impact visual enumeration processes leading to the SNARC effect, since we found that familiarity with the stimuli (i.e., dice), does not seem to influence the access to the MNL. Thus, we speculate that the presence of a similar SNARC effect, both with structured and with unstructured stimuli, indicates the existence of common access to the MNL.