Introduction

Our typical visual environment is overflowing with complexity, yet we still efficiently extract the gist of it, presumably by grouping redundant information. Specifically, it has been hypothesized that observers represent a set of similar items as an averaged group or ensemble, enabling the condensing of multiple individual elements into a single, higher level of description (Alvarez, 2011). This process would entail perceptually extracting a course-grained representation of the visual world (e.g., forest), rather than the fine-grained details of each object (e.g., individual trees). Prior research has demonstrated that ensemble, or summary, statistics implicitly bias the representation of individual items held in visual working memory, such that the remembered size of an individual item was biased toward the probed mean size of items sharing the same color (Brady & Alvarez, 2011). Given the capacity limits of visual working memory, this processing strategy may be critical for capturing the richness of our perceptual experience (Cohen et al., 2016). Recently, the integration of ensemble statistics with items held in visual working memory has been shown to correlate with individual differences in sensory processing patterns (Lowe et al., 2016b), which capture variability in behavioral strategies and neurological responses to general sensory experience (Dunn, 1997). Specifically, individuals with lower sensory thresholds were less likely to integrate mean size information across a set of similar items, suggesting a possible mechanism through which disordered perception may occur.

A growing body of evidence has shown that sensory processing abnormalities are inherently pervasive in individuals diagnosed with autism spectrum disorder (ASD) (Kern et al., 2006; for reviews, see Baum et al., 2015 and Simmons et al., 2009), suggesting that autistic traits may influence the processing of ensemble information from the visual environment. Many theories have emerged that may explain different perceptual processing in individuals with ASD, such as the weak central coherence cognitive theory (Frith, 1989), the temporal binding hypothesis (Brock et al., 2002), the theory of imbalanced excitation and inhibition (Rubenstein & Merzenich, 2003), and a Bayesian observer theory (Pellicano & Burr, 2012). One of the earlier frameworks, the weak central coherence cognitive theory, called attention to the tendency for individuals with ASD to show a processing bias for featural and local information relative to the ability to extract overall gist information. Indeed, individuals within the autism spectrum demonstrate superior performance on tasks requiring detail-focused processing compared with typically-developed (TD) individuals (Happé & Frith, 2006). For instance, autistic individuals may fail to integrate multiple pieces of visual information across a unified context, instead making more accurate judgments of individual items than TD individuals (Happé, 1996). Since the representation of ensemble statistics is dependent on the ability to integrate visual information within the context of similar items, the perceptual differences associated with ASD could compromise the ability to represent ensemble information accurately.

The Bayesian observer framework, which builds on the tenets of weak central coherence, further predicts that autistic individuals may be less able to utilize summary statistics from scenes due to either a reduced ability to integrate prior experience with current perception or a reduced ability to learn the statistics of the environment (Pellicano & Burr, 2012; Powell et al., 2016; Lawson et al., 2017). A number of studies have shown that the ability to utilize ensemble representations is reduced in ASD individuals relative to TD controls. For instance, global perceptual averaging of ensemble information, such as color, may be compromised in individuals with ASD (Maul et al. 2016), and ASD individuals show a reduced ability to average a set of faces (Rhodes et al., 2014). Likewise, individuals with ASD relative to their typically developed peers exhibit a reduced ability to estimate the mean size of a set of 16 circles via a two-alternative forced-choice task (it should be noted that this comparison was not directly reported, but calculations based on reported data show an effect size of d = 0.61; Corbett, Venuti, & Melcher, 2016). A second study exploring mean size estimation in ASD showed mixed results. Participants were presented with an array of ten circles followed by a probe and asked if the probe was smaller or larger than the mean. Disrupted ensemble processing in ASD was observed when the distributions of sizes were highly heterogeneous (ten different dot sizes), but not when more homogeneous (two different dot sizes) (Van der Hallen et al., 2017). With that said, individuals with ASD have demonstrated similar ensemble processing of arrays of emotional faces (Karaminis et al., 2017). Thus, while the majority of studies show some disruption in ensemble processing in ASD, there is some variability in findings, making it critical to investigate further how ensemble processing interacts with the autism spectrum.

As a spectrum disorder, traits associated with ASD are seen in the general population, and the study of autistic traits in non-clinical samples affords a practical way to explore aspects of the autism spectrum, and is an approach that has successfully generalized experimental findings in TD to similar findings in ASD populations (Bölte et al., 2011; Chandler et al., 2007; Constantino et al., 2003; Stevenson et al., 2018a). The present study examines the relationship between autistic traits and the representation of ensemble statistics in visual working memory in a TD population by combining an ensemble processing task (Brady & Alvarez, 2011), which has been previously used to measure individual differences in perception (Lowe et al., 2016b), with the self-assessed Autism-spectrum Quotient (AQ) questionnaire (Baron-Cohen et al., 2001). In this ensemble paradigm, participants are presented with an array of circles of varying colors, and asked to remember the size of particular colors of circles while ignoring others. When probed to recall the size of a single circle, participants often show a bias toward the mean circle size of the probed color, reflecting ensemble processing of that color set of circles (Fig. 1). Examining the encoding of ensemble bias for size estimates allows us to explore automatic encoding of simple ensemble statistics, and whether individual differences in autism traits are associated with perceptual biases. We predicted that individuals with higher levels of ASD traits would be less likely to integrate mean ensemble information with individual items, manifesting as a decreased likelihood that their estimates for individual items will be biased toward the probed mean of an ensemble display.

Fig. 1
figure 1

(a) Stimuli and trial order used in the ensemble statistics task. Participants were asked to remember the size of the red and blue circles, and ignore the green circles. After a short delay, they were asked to recall and select the size of a previously seen circle, which was cued by the presentation of a black probe circle. (b) Bias was determined from a pair of matched displays. Two matched displays had the same items, but the tested item’s colors (highlighted here with dashed lines) were swapped between the displays. Bias was calculated by dividing the reported size for probed items presented in the color of the larger circles by the reported size for probed items presented in the color of the smaller circles (see Brady & Alvarez, 2011). Note that the size of the circles is not to scale to illustrate how bias is determined

Method

Participants

Sixty-eight undergraduate psychology students in the University of Toronto community (18 male) ranging in age from 18 to 54 years (M = 21.3) participated and were compensated for participation with course credit. All participants had self-reported normal or corrected-to-normal visual acuity, no history of color-blindness or abnormal neurologic conditions, and gave informed consent in accordance with the University of Toronto Ethics Review Board.

Display and apparatus

The ensemble statistics task was presented electronically using Matlab (MathWorks, Natick, MA, USA), on a ViewSonic 21-in. CRT monitor (1,280 x 1,024; 85-Hz refresh rate) at a viewing distance of 57 cm (subtending 38.70° x 29.49° of visual angle) held constant by using a desk-mounted chinrest. The AQ was completed by pen and paper.

Ensemble statistics task

To examine the bias of ensemble statistics for a set of items sharing common features, we conducted a direct replication of the first experiment in Brady and Alvarez (2011).

Participants were presented with a total of 30 displays each consisting of nine circles of varying size (three red, three blue, and three green), and were asked to remember the size of the red and blue circles, while ignoring the green circles. By asking participants to ignore a subset of circles based on color, it was ensured that participants attended to the color of the items, as previous research has shown that when color was not task-relevant, ensemble biases are reduced or absent. That is, when only two sets of colored circles are presented in the array, participants can simply remember the size of all items, and thus fail to show a bias toward the probed mean as a function of color (Brady & Alvarez, 2011; Huang et al., 2007). It should also be noted that previous research has demonstrated that the inclusion of a third, unattended color set does not influence estimations of the attended color sets as long as participants are instructed as to which colors they must attend to in an a priori manner (Halberda et al., 2006). Each display appeared on screen for 1.5 s and was then followed by a 1-s blank period, after which participants were cued to a target location by a randomly-sized circle appearing in black at the location of a previously presented blue or red circle. Participants were then instructed to use the computer mouse to resize the black circle so that it matched the remembered size of the circle they had previously viewed in that location (see Fig. 1a).

The nine circles appeared on a gray background that measured 18° x 12° of visual angle, and each circle was randomly positioned in an unseen 6 x 4 grid, with a jitter of ± 0.3° of visual angle added to the circles’ locations to prevent collinearities. The mean diameter for the circles of each color (red, blue, and green) were randomly chosen from a uniform distribution ranging from 0.45° to 2.85° of visual angle. Each individual circle’s size was then chosen by sampling randomly from a distribution with that color’s chosen mean and an SD equal to one-eighths of this mean.

To directly test bias in reported size, 15 displays were generated, and then another 15 matched displays were created by switching the color of the to-be-tested item with the other non-distractor color (red and blue). Thus, two matched displays had the same items, but the tested item colors were swapped between the displays. The 30 displays were then randomly interleaved, with the constraint that paired displays could not appear one after the other.

Autism-spectrum quotient (AQ)

As a spectrum disorder, specific traits associated with ASD are not only found in individuals with a clinical diagnosis, but are also found in the general population. Such traits are commonly measured by using the Autism Quotient Questionnaire (AQ; Baron-Cohen et al. 2001), a 50-item, self-report measure, which all participants completed. The AQ has high internal consistency (Cronbach’s α = .97) and test-retest reliability (r = .85). The AQ has been known to reliably discriminate between individuals with and without ASD (sensitivity = 95%, specificity = 95%) at a cut-off score of 34. All participants were below this cut-off and reflected distributions previously collected with typically-developed samples (Baron-Cohen et al. 2001; Stevenson et al., 2017) (mean = 21.14, SD = 8.07, range = 9 to 31).

Results

Ensemble bias

To test ensemble bias, we examined whether participants’ size estimates were biased toward or away from the mean size representation of circles within the same color set. Each matched pair (consisting of two displays that had the same items, but the tested item’s colors were swapped between the displays) was divided on the basis of which of the pair contained a tested item the same color as the circles that were smaller on average and which contained a tested item the same color as the circles that were larger on average:

$$ \frac{{Estimated\ size}_{\left( large\ target\ set\right)}}{{Estimated\ size}_{\left( small\ target\ set\right)}} $$

A ratio of 1.0 would indicate participants’ size estimates were not biased. However, if participants’ size estimates were biased toward the probed mean size of the circles in the same color as the tested item, this ratio would be greater than 1.0 (see Fig. 1B), with a maximum bias calculated as the value if each participant reported the ensemble mean itself of 1.481. This latter outcome was predicted based on previous evidence (Brady & Alvarez, 2011). On average, the reported size of the tested circle was 1.03 times greater on trials with the larger same-colored circles than on trials with the smaller same-colored circles (Fig. 2a), and this ratio was significantly greater than 1.0 (t(67) = 2.00, p = 0.05, d = 0.49). As is commonly found, variation in the directionality and extent of ensemble bias across participants was observed (see Fig. 2b), and we explore the source of this variation below.

Fig. 2
figure 2

(a) Mean bias averaged across participants. (b) The distribution and direction of mean size bias for all participants. For both graphs, a bias greater than 1.00 represents a bias toward the probed mean, and a bias less than 1.00 represents a bias away from the mean. Error bars represent the standard error of the mean. *p = 0.05

Ensemble bias and ASD traits

We first examined whether demographic variables (sex; age) correlated with ensemble bias. This investigation revealed no significant correlations (Pearson; two-tailed) between ensemble bias and either sex (r(68) = -0.034, confidence interval (CI) = [-0.270, 0.206], p = 0.782) or age (r(68) = -0.049, CI = [-0.284, 0.191], p = 0.691). To investigate our main hypothesis, we examined the correlation between ensemble bias and level of ASD traits as measured by the AQ (Fig. 3). We found a significant correlation between these two scores (r(68) = -0.269, CI = [-0.476, -0.033], p = 0.027. These results reveal a significant inverse relationship between ensemble bias and ASD traits; higher levels of ASD traits were associated with a decreased bias toward the probed mean ensemble representation. In contrast, no relationship was observed between ensemble bias and demographic factors.

Fig. 3
figure 3

Scatter plot (with best-fitting regression line) showing the correlation between ensemble bias (x-axis) and the autism-spectrum quotient (y-axis). An ensemble bias greater than 1.00 represents a bias toward the probed mean, and a bias less than 1.00 represents a bias away from the mean. CI confidence interval

We also conducted a multiple linear regression analysis to confirm a significant relationship between these two factors when demographic variables were accounted for, to investigate whether AQ scores could still predict ensemble bias. Indeed, this analysis revealed that scores on the AQ significantly predicted ensemble bias (t(67) = 2.34, p = 0.022, d = 0.57) with both sex (t(67) = 0.43, p = 0.668, d =0.11) and age (t(67) = 0.70, p = 0.484, d =0.17) accounted for.

Finally, a number of analyses were conducted to ensure that extraneous variability did not account for these reported effects. First, with randomly selected stimulus sizes on each trial, the variability of size within the target set on each trial also randomly varied. To ensure that this variability did not affect our results, we calculated the variance within each target ensemble and correlated this variance with ensemble bias, finding no relationship between the two (r = -0.007, p = 0.82). Second, the task-irrelevant green circle sizes also varied randomly. A mixed linear model was conducted to measure the contribution that the mean size of each ensemble, in addition to the target items size, contributed to participants’ responses, while controlling for the demographic variables of age, gender, and IQ. As expected, the target item’s size was the most significant predictor of participants’ response (t(1009) = 10.07, p < 0.001), followed by the target ensembles mean size (t(1000) = 2.26, p = 0.02). Neither the mean size of the distractor ensemble (t(1005) = 0.32, p = 0.75) nor the mean size of the task-irrelevant ensemble (t(1010) = -1.23, p = 0.22) significantly contributed to participant responses. Likewise, no demographic variable, including age (t(67) = 0.47, p = 0.64), gender (t(67) = -0.45, p = 0.66), or IQ (t(67) = 0.36, p = 0.72), contributed significantly to the model.

Discussion

Sensory and perceptual issues associated with autism have garnered a great deal of research attention, with a primary hypothesis that individuals with ASD often exhibit difficulties integrating individual pieces of sensory information into a single, unified perception. Our current findings test this hypothesis using a particular instantiation of this processes, namely ensemble perception, and show that individuals with higher levels of autistic traits show decreased ensemble perception. This relationship holds even when controlling for demographic factors (i.e., age, sex). Specifically, individuals with levels of ASD traits were less likely to represent individual items of a certain color as biased toward the probed mean of an ensemble representation with items containing that color, and thus were less likely to integrate similar perceptual information from the environment. These results are consistent with previous findings linking sensory processing patterns with measures of ensemble bias (Lowe et al., 2016b), yet extend this work by providing a more direct investigation of the relationship between ensemble processing and autistic traits.

The visual world is crowded with objects sharing similar attributes (e.g., faces in a crowd; trees in a forest), and ensemble encoding can be used to extract a tremendous amount of redundant information to obtain an almost immediate sense of the visual environment (Cohen et al., 2016). In fact, neural regions involved in the processing of ensemble information overlap with scene-selective areas (Cant & Xu, 2012, 2015), and these areas may represent multiple properties of the visual environment that influence our perceptual experience of the world (Lowe et al., 2016a). The present findings may therefore shed light on the cognitive mechanisms underlying the different, and often detail-focused, experience of the visual world observed in individuals with ASD, and also build on existing evidence demonstrating how individual differences influence the processing of ensemble features in the environment (Yang et al., 2013; Haberman, Brady, & Alvarez, 2015; Lowe et al., 2016b).

Previous research has suggested a decreased ability of individuals with ASD to integrate multiple pieces of information into a coherent Gestalt more broadly (Frith, 1989), particularly pertaining to the integration of sensory information (Baum et al., 2015; Stevenson et al. 2014b; Wallace and Stevenson, 2014), as such disturbances in sensory integration have been linked to symptomology in ASD (Stevenson et al., 2014a, 2018b; Woynaroski et al., 2013). This impaired ability to integrate global information across a unified context has been pivotal to theoretical accounts of ASD, particularly the weak central coherence theory (Frith, 1989), yet there has been much debate about this claim. In fact, there is growing support that, while individuals with ASD may show a preference for local information, they do not show an impairment in the ability to process global information (Koldewyn et al., 2013; Van der Hallen, 2015). In a similar vein, the integration of current sensory inputs and prior experiences has been demonstrated to be reduced in ASD (Powell et al., 2016; Karaminis et al., 2016; Lawson et al., 2017; Noel et al., 2017, 2018; Sevgi, Diaconescu, Tittgeeyer, & Schilback, 2016, 2012; Turi et al., 2015; Stevenson et al., 2017). Specifically, it has been proposed that these findings reflect difficulties in the ability to extract summary statistics, difficulties in the ability to integrate sensory representations, or both (Maul, Stanworth, Pellicano, & Franklin, 2016). Our data here, showing that individuals with high levels of ASD traits exhibit a decreased influence of ensemble statistics, which are by nature reliant upon the integration of information across individual item presentations, support these hypotheses.

Individuals with ASD are able to represent some level of ensemble information, but this representation may be compromised when compared to TD individuals (Corbett, Venuti, & Melcher, 2016; Maul, Stanworth, Pellicano, & Franklin, 2016; Rhodes, Neumann, Ewing, & Palermo, 2014), though experimental variables such as ensemble heterogeneity may influence this effect (Van der Hallen et al., 2017). The results of the present study expand these latter findings to suggest that these findings are not constrained to individuals with a clinical diagnosis, but can also be observed in the general population relative to their individual level of autistic traits. The precise mechanisms underlying this relationship are presently unclear. One suggestion previously put forth suggests a reduced ability to integrate prior experience with current perception may affect the extraction of ensemble statistics in ASD individuals (Pellicano & Burr, 2012). Individual differences in sensory processing patterns predict the bias of ensemble statistics held in visual working memory (Lowe et al., 2016b). Since ASD traits as measured by the AQ have been shown to correlate significantly with sensory scales (Black et al., 2017; Holder, Wilson, Mendez, & Murphy, 2014; Schulz & Stevenson, in press), the reduced ensemble bias in individuals with high AQ scores seen here may be related to general sensory experience. Other mechanisms, including but not limited to global versus local deployment of attention, the ability to supress distracting information, and the ability to form perceptual gestalts for similar items, may also influence this relationship. These possibilities should be explored in greater detail in future studies. Moreover, since the present study investigated TD individuals, future research should also explore whether individuals clinically diagnosed with ASD exhibit similar patterns of ensemble processing, through tasks assessing not only the overall ability to represent ensemble information, but also the sensitivity to ensemble information across multiple sensory domains, both explicit and implicit.

It should also be noted that our results here pertain directly to ensemble processing of the size of objects in an array, and these results may not generalize to ensemble processing of all types of stimulus features. Though these results are similar to previous findings of decreased ensemble processing of size, color, and facial identity (Corbett et al. 2016; Maul et al., 2016; Rhodes et al., 2014; Van der Hallen et al., 2017), this decrease was not observed with ensemble processing of emotional expression (Karaminis et al., 2017) or with more homogeneous arrays (Van der Hallen et al., 2017). As such, future research incorporating multiple measures of ensemble processing across an array of stimulus features would be quite beneficial.

In summary, our findings reveal a relationship between autistic traits as measured by the AQ and the bias of ensemble statistics for perceptually similar information held in visual working memory. These findings may shed light on different perceptual processing within the autism spectrum, and provide insight into the relationship between individual differences and ensemble encoding.