Introduction
Pervasive developmental disorders, of which the most common are autistic disorder, Asperger Syndrome (AS) and pervasive developmental disorder not otherwise specified (PDD-NOS), are characterized by a triad of impairment: difficulties in reciprocal social interaction, communication, and the presence of stereotyped behavior, interests, and activities (American Psychiatric Association
2000). Together, these conditions are referred to as autism spectrum conditions (ASCs). This term reflects the assumption that (high functioning) autism and AS lie on a continuum reflecting severity of social communication disability, from classical autism at the most severe end of the spectrum, decreasing via high functioning autism (HFA), AS, and PPD-NOS into normal behavior. Twin and family studies have shown that genetic factors play a major role in the risk for ASC (Rutter
2000). A conservative estimate of the concordance rates of autism in monozygotic twins is 60%. Concordance in dizygotic twins, however, is only 0–5% (Folstein and Rutter
1977; Bailey et al.
1995). Moreover, if one MZ twin has autism, not only is the likelihood of an autism diagnosis in the co-twin increased, but also the risk of other neurodevelopmental difficulties affecting language and social interaction (Le Couteur et al.
1996; Bailey et al.
1995). This notion led to the idea that the same genetic variants affecting the risk for autism may lead to a broader phenotype of autistic traits, as reflected in an increased rate of social deficits, impairments in communication and language, a preference for routines and difficulty with change in non-autistic relatives of autistic individuals (Bolton et al.
1994; Bailey et al.
1998). Rather than a distinct disorder, it is now thought that the autism spectrum conditions as defined in the DSM-IV represent the upper extreme of one or more quantitative traits, and these traits may be continuously distributed in the population (Constantino and Todd
2003; Piven et al.
1997; Spiker et al.
2002). Studies using quantitative measurements of autistic traits found elevated scores in relatives of autistic and pervasive developmental disorder patients (Constantino et al.
2006; Bishop et al.
2004) and high scores in children whose parents showed high (but sub-diagnosis) endorsement on autistic traits (Constantino and Todd
2005).
The quantitative approach to autistic traits has led to the development of the Autism-Spectrum Quotient (AQ; Baron-Cohen et al.
2001). This self-administered questionnaire was developed to quantify autistic traits in individuals with normal intelligence. The AQ consists of 50 items, assessing personal preferences and habits. Subjects rate to what extent they agree or disagree with the statements on a 4-point Likert scale, with answer categories “definitely agree”; “slightly agree”; “slightly disagree” and “definitely disagree”. For approximately half the items an “agree” response is in line with autistic traits (e.g. item 23: “I notice patterns in things all the time”); for the other half a “disagree” response is indicative of an autistic trait (e.g. item 11: “I find social situations easy”). All the item scores are summed; a high AQ score indicates a high autistic load, close to the autistic end of the autism spectrum. In the original version of the AQ (Baron-Cohen et al.
2001), the 50 items were divided into five theoretically derived subscales of 10 items each: Social skill; Communication; Imagination; Attention to detail; and Attention switching.
Both the British (Baron-Cohen et al.
2001) and the Japanese version (Kurita et al.
2005) of the AQ reported good test-retest reliability and moderate internal consistency (Cronbach’s α varying from .63 to .78). However, some aspects of the AQ merit further study. Firstly, the five domains of the AQ have been derived on a theoretical basis and have undergone little empirical testing. Austin (
2005) conducted an exploratory factor analysis of the AQ in a group of 201 undergraduate students and found evidence for three, rather than five factors, with a focus on Social skill, Details/patterns, and Communication/mind reading. In a principal component analysis using data on the child-version of the AQ, Auyeung et al. (
2008) found support for 4 rather than 5 factors. Confirmatory factor analyses can further extend our knowledge on the psychometric qualities of the AQ subscales and empirically test the goodness of fit of the 5-domain model. The first aim of the current study was to examine the model fit of the 5 domain structure proposed by Baron-Cohen et al. (
2001) in a large student sample and a general population sample and compare the fit with alternative models. Some of the 5 domains assess traits that may show considerable overlap (e.g. satisfactory social skills are needed for good communication). Therefore we expected some of the domains to be substantially correlated and predicted that a model in which the highly correlated factors are clustered may be preferred.
Secondly, more research needs to be done on the criterion validity of the AQ. Studies in England (Baron-Cohen et al.
2001), and Japan (Wakabayashi et al.
2006) found significantly higher AQ scores in subjects with an AS or HFA diagnosis, compared to scores in a student sample and a general population sample. A preliminary study (Woodbury-Smith et al.
2005) reported satisfying ability of the AQ to distinguish between subjects with and without an AS/HFA diagnosis, in a group of 100 referrals to a diagnostic clinic for adults suspected of having AS or HFA. However, a Dutch study comparing AQ scores in mild ASC patients with scores in patients referred to a general outpatient clinic found little differences between the two groups (Ketelaars et al.
2007). The subjects in the latter study were not matched on sex, education and age. It thus remains unclear whether high AQ scores are specific to ASCs, or may be common to psychiatric disorders in general. Apart from a general population and a student sample, the current paper includes a small sample of three different patient groups: one group with AS/HFA and PPD-NOS patients, a group of patients diagnosed with obsessive-compulsive disorder (OCD), and a group with social anxiety disorder (SAD). Because SAD patients demonstrate problems with social interaction, and OCD patients show symptoms of repetitive behavior, we predicted that the AQ scores in these patient groups will be higher than the general population mean. Extremely high AQ scores were expected to be specific to ASC patients.
Lastly, this paper is the first to examine the psychometric properties of the AQ in a Dutch population. The original version of the AQ was translated to Dutch using the backward translation procedure. The characteristics of the Dutch AQ, including test–retest reliability and internal consistency, were studied in a large sample of students and subjects from the general population. In line with the empathizing–systemizing theory of autism (Baron-Cohen
2002; Baron-Cohen et al.
2005) and findings from previous studies (Baron-Cohen et al.
2001; Hoekstra et al.
2007; Wakabayashi et al.
2006) we expected: (a) AQ scores to be continuously distributed in the general population; (b) significantly higher mean AQ scores in men compared to women; and (c) higher AQ scores in science students than in students in the field of humanities and social sciences.
Methods
Participants
This study encompassed four different samples. The first group consisted of 961 students from the VU university in Amsterdam (n = 809) and the University of Twente in Enschede (n = 152). During the break of one of their classes, the students were asked to complete the AQ. Students were recruited from the fields of humanities (history and law; n = 128), the social sciences (psychology, education, and communication science; n = 594) and natural and technical sciences (including mathematics, physics and information sciences; n = 239). Participation rates varied from 65 to 100%. The mean age of the students was 21.19 years (SD 3.69).
The general population sample (n = 302) consisted of parents of twins who visited an information day for parents of multiples. They were asked to either fill out the AQ immediately or to return the questionnaire to our research group by mail. The participation rate for this data collection was 62%; mean age of the participants was 35.68 years (SD = 6.33). The student and general population groups were not matched on age and IQ. However, the two groups were included for separate research purposes. The parent group served as a normative sample, whereas the student sample was included to address differences in AQ scores in different fields of study.
To obtain data on test–retest reliability, a group of 18-year old twins and their brothers and sisters filled out the AQ. These twin families participated in an ongoing study on cognitive development in late puberty and completing the AQ was part of the test protocol (see Hoekstra et al.
2007 for more information on this study). The first 117 participants of the study were re-contacted 1–6 months later (the mean time interval between the first and second assessment was 3.9 months), and were asked to fill out the AQ for a second time. Seventy-five participants returned the questionnaire for the second time (64%). AQ scores of the responders in the retest did not significantly differ from the subjects who did not respond in the retest (
F(1, 115) = .066,
p = .797).
The last group consisted of three subgroups of psychiatric patients, who were all adult outpatients recruited from the anxiety outpatient services of GGZ Buitenamstel in Amsterdam (see also Cath et al.
2008). All subjects were administered the Structured Clinical Interview on DSM-IV diagnoses (SCID-I; First
1996) to establish in-, and exclusion criteria. Subjects suffering from co-morbid depression, psychosis, substance abuse, mental deficiency or inability to read or speak Dutch were excluded. To exclude any risk of cognitive deficit and/ or below average intelligence, only patients who had successfully completed an educational degree were included in the study, and patient groups were matched on age (range 19–57 years), sex (10 males; 2 females in all groups) and educational level. The SCID-I does not contain a section on autism disorders, and at the time of data collection no validated Dutch version of either the Autism Diagnostic Observation Schedule-Generic (ADOS-G; Lord et al.
2000) or the ADI-R (Lord et al.
1994) was available for adult subjects. Therefore all subjects were assessed on presence of ASCs according to DSM-IV criteria by two independent experienced clinicians and with the aid of a structured retrospective interview taken from one of the parents of the patients on early infant development in all domains of the spectrum of autistic conditions. The structured interview encompassed the following topics: age at onset of problem behavior, contact and communication skills, stereotyped behavior, development of speech and language, motor and sensory development, particular interests and skills, ability to display imagination, resistance against change and unexpected events, and impulse control. Only subjects who had independently been diagnosed with an ASC by the two clinicians were included in the study; diagnoses were made independent of the AQ responses. Subjects meeting the inclusion criteria completed the AQ at home, after they had given written informed consent, and returned the questionnaire during their next visit to the outpatient service. The patient groups encompassed (a) 12 patients with an autism spectrum condition (
n = 2 HFA;
n = 4 AS;
n = 6 PDD-NOS); (b) 12 patients with a “pure” obsessive-compulsive disorder, 3) 12 patients with a “pure” generalized social anxiety disorder.
The Dutch Autism-Spectrum Quotient
The AQ was translated after permission from prof. Simon Baron-Cohen (SBC). The translation into Dutch was conducted by an official translator. Subsequently, a second translator translated the Dutch version back into English. After comparing the outcome of the retranslated version to the original text, and discussing discrepancies in the retranslation with SBC, a final version was established (the Dutch version of the questionnaire is obtainable from the first author upon request). Total AQ and domain scores were based on the original 4-point Likert scale scores (1 = “definitely agree” up to 4 = “definitely disagree”). For the items in which an “agree” response is characteristic for autism, the scoring was reversed (“definitely agree” scored 4 points; “slightly agree” 3 points, etc.; This was the case in item 2, 4, 5, 6, 7, 9, 12, 13, 16, 18, 19, 20, 21, 22, 23, 26, 33, 35, 39, 41, 42, 43, 45, 46). All item scores were summed, resulting in a minimum total AQ score of 50 (no autistic traits) and a maximum score of 200 (full endorsement on all autistic items). Note that in most reports using the British version of the AQ (Baron-Cohen et al.
2001) the answer categories have been dichotomized into “agree”/disagree” scores. In these studies, all item responses in line with the autism phenotype scored one point, resulting in a maximum total AQ score of 50.
Missing Answers and Outliers
If more than five items were left blank (10% of the total number of items), the AQ was considered incomplete and the data were discarded in analyses (n = 2 in the student group; n = 7 in the general population group; none in the twin family or patient groups). Two social sciences students and one subject from the general population group obtained a score >160. Since these subjects completed the AQ anonymously, it could not be verified whether this reflected a true score. These outliers (deviation >4 SD’s of the mean) were therefore left out of the analyses.
Statistical Analyses
To examine the factor structure of the AQ, models were fitted on the student group data using confirmatory factor analyses in LISREL. The items of the AQ were measured on an ordinal scale, therefore the diagonally weighted least square procedure was used. First, a five-factor model, based on the 5 domains put forward by Baron-Cohen et al. (
2001) was fitted. The factors were allowed to correlate since it is reasonable to assume that different aspects of autistic traits will be related. The fit of this model was compared with the fit of alternative models. Secondly, the model fitting procedure was repeated on the data from the general population sample. Based on the model fitting results in these two samples, the most parsimonious best fitting model was chosen.
To evaluate model fit, several model fit statistics were inspected. The χ
2 test statistic and the Standardized Root Mean Square Residual (SRMR) were included as a measure of overall goodness of fit. Low χ
2 values compared with the number of degrees of freedom indicate that the model fits the data well. The SRMR is a standardized overall badness-of-fit measure and is based on the fitted residuals. An SRMR value of zero indicates no residuals and thus a perfect fit. As a rule of thumb, an SRMR of less than .05 indicates a good fit, whereas values smaller than .10 can be interpreted as acceptable fit (Schermelleh-Engel et al.
2003). To enable a comparison between different models, the Goodness of Fit Index (GFI), Parsimony Goodness of Fit Index (PGFI); and the Expected Cross Validation Index (ECVI) were included as well. The GFI measures the relative amount of the variances and covariances in the observed matrix that is predicted by the model. The GFI ranges between zero and one with higher values indicating a better fit. The PGFI is a modification of the GFI and takes model complexity into account. High PGFI values indicate good fit in a relatively parsimonious (and thus less complex) model. If choosing between alternative models, the highest PGFI indicates the superior model. The ECVI evaluates how well the model fitted in the sample under study would generalize to other samples and is thus a measure of cross validation. The model with the smallest ECVI indicates the model with the best fit.
Group differences in total AQ score and factor scores, and the validity and internal consistency of the scale were analyzed using SPSS. Group differences were tested using mixed model multivariate analysis of variance (MANOVA). The mixed linear model is an expansion of the general linear model and permits non-constant variability in the data. This way, it was possible to test for group differences in variables with unequal variances and unequal group sizes. In the student sample, overall sex differences and differences between students from different fields were explored. Effects of sex and age were studied in the general population sample. Furthermore, AQ score differences between patient groups and the general population sample were tested. Internal consistency was assessed in the student and general population using Cronbach’s alpha.
The test–retest reliability of AQ scores was assessed using AQ-scores collected in 18-year-old twins and their brothers and sisters (
n = 75). These participants are genetically related. In order to use all available data and to satisfy the independent observations assumption for statistical testing, the test–retest analysis was performed using structural equation modeling in the computer program Mx (Neale et al.
2006).
Discussion
The results of our study indicate that the Autism-Spectrum Quotient is a valid and reliable instrument to assess individual differences in autistic traits. The Dutch AQ was found to have satisfactory internal consistency and test–retest reliability. Moreover, high AQ scores were specific to ASC patients. Previously, an exploratory factor analysis performed in a British student population suggested three underlying factors within the AQ, encompassing Social skill, Details/patterns, and Communication/mind reading (Austin
2005). A recent study using the child version of the AQ found evidence for 4 of the 5 domains (Auyeung et al.
2008). Our confirmatory factor analyses in both a general population and a student sample indicated that 4 of the 5 domains of the AQ (Social skill, Communication, Attention switching, and Imagination) were highly correlated. Rather than four separate domains, we propose a hierarchical model allowing these domains to cluster together. This way, one broad band “Social interaction” factor (incorporating the 4 highly correlated domains) could be identified, together with a small second factor, consisting of items focusing on a preference for details and patterns (the domain “Attention to detail”).
It may seem surprising that the broad band scale also includes the domain Attention switching. As put forward by Courchesne and colleagues (
1994), difficulties in Attention switching make it harder to keep track of social information. Social interaction usually involves frequent and rapid changes in the source of information (visually or auditory information, change in objects or actions, etc.), and requires the ability to follow the flow of social cues (words, gestures, postures, background context, etc). The marked deficit in attention switching in ASC subjects may directly harm their social and communication abilities, which explains why this domain is included within the Social interaction factor.
The internal consistency and test-retest reliability of the total AQ score (α = .81/.71 and r = .78) and the broad band social interaction factor (α = .84/.77 and r = .79) were satisfactory. The internal consistency of the Attention to detail factor was moderate (α = .63/68) and its test–retest reliability was acceptable (r = .71). The internal consistencies of the four domains encompassing the broad band social interaction factor varied but were generally moderate (α between .39 and .76), the test–retest reliability varied between .60 and .81. Our results indicate that future studies using the AQ to assess autistic traits may benefit from a focus on the total AQ score and the 2 factor scores, rather than examining differences and similarities in the five original domains. The power to detect differences will be higher using the 2 factors; this could be of importance especially in linkage or association studies into autistic traits.
A small but significant correlation (
r = .19) was found between the two Social interaction factor and the Attention to detail factor. This result is in agreement with findings from a British twin project (Ronald et al.
2005) which assessed both social and non-social behaviors characteristic for autism using parent and teacher ratings. Social and non-social behaviors were weakly correlated with each other, both in the teacher (
r = .15) and the parent data (
r = .29).
Group differences in AQ scores were in line with previous studies using the AQ (Austin
2005; Baron-Cohen et al.
2001; Hoekstra et al.
2007; Wakabayashi et al.
2006). Males scored higher than females on the total AQ score and the Social interaction factor, although no sex difference on the Attention to detail factor was found. Similar to the findings reported by Baron-Cohen et al. (
2001) and Austin (
2005), science students obtained significantly higher scores than humanities and social science students. In our study, students enrolled in a social science degree in turn scored significantly lower than humanities students. Rather than a remarkable discrepancy with the British results (no differences between humanities and social sciences students), these findings are probably due to differences in the student sample. The majority of the social science students included in our study was enrolled in a psychology or education degree. Both these studies have a strong focus on human interaction on an individual level and are likely to attract students who enjoy social interaction. If social sciences focusing on a broader level (such as economics and political science) had been included, the difference in AQ score might not have been significant.
Subjects diagnosed with an ASC scored significantly higher on the total AQ than the general population and the other patient groups. This is a satisfying result, considering the potential symptom overlap between these patient groups and ASC. The overlap between ASC subjects and SAD patients entails problems with social interaction in both groups; the overlap between ASC and OCD subjects encompasses repetitive behaviors that occur in both groups. Relatives of autistic individuals are reported to have up to 10-fold higher rates of social phobia compared to control families (Smalley et al.
1995; Piven and Palmer
1999). Similarly, an increased incidence of OCD is found in autism relatives (Bolton et al.
1998), and the occurrence of obsessive–compulsive traits in parents of an autistic child is significantly more likely if the child displays strong repetitive behavior (Hollander et al.
2003). Recent research suggests that a common genetic pathway, the serotonin transporter gene, could explain a small part of the association between OCD and rigid compulsive behaviors in autism (Ozaki et al.
2003; Sutcliffe et al.
2005). Our study however shows that high AQ scores are specific to the ASC patients. ASC patients scored significantly higher than the other patient groups on both the Social interaction factor and the Attention to detail factor.
Moreover, all subjects with an AS/HFA diagnosis could be distinguished from the other samples, as no subjects without an ASC diagnosis obtained a score >145. The difference is less clear-cut for subjects with PDD-NOS, who obtained intermediate AQ scores. These findings could explain the discrepancy in previous studies using the AQ. Woodbury-Smith et al. (
2005) reported satisfying ability of the AQ to distinguish patients with an AS/HFA diagnosis from non-AS/HFA patients in a clinic for adults suspected of having AS or HFA. On the other hand, Ketelaars et al. (
2007) found little differences in AQ scores between mild ASC patients and patients referred to the general outpatient clinic. The latter study mainly included PDD-NOS patients and few patients with AS or HFA. PDD-NOS constitutes a broad diagnostic category with criteria less stringent than for autistic disorder (American Psychiatric Association
2000) and is often seen as a milder form of autism. A lower AQ score in PDD-NOS patients compared to AS/HFA patients, yet higher than in most subjects diagnosed with another psychiatric disorder or subjects from the general population, is compatible with the notion of a continuum of autistic traits.
This study had some limitations. Firstly, the data collection in the student and general population sample was based on anonymous reports. We could therefore not verify whether any subjects in these groups had an ASC diagnosis or should warrant a diagnosis. Furthermore, the sample sizes of the patient groups were small and findings from these groups should be interpreted with care. Future studies in clinical samples should explore the criterion validity of the AQ more extensively, and should especially focus on the differences between severely impaired ASC patients and mildly impaired PDD-NOS subjects.
In conclusion, this study shows that the AQ is a reliable instrument for examining variation in autistic traits. The AQ can be divided into two reliable sub factors, focusing on difficulties in social interaction and on marked interests and attention to details and patterns. Total AQ scores follow a continuous distribution in the general population, and patients with an ASC diagnosis fall in the upper end of this distribution. OCD and SAD patient obtain scores between the general population mean and scores typical for ASC patients. These findings indicate that the AQ is a valuable instrument to assess where an individual lies on the autism spectrum.