Why the factorial structure of the SCL-90-R is unstable: Comparing patient groups with different levels of psychological distress using Mokken Scale Analysis
Introduction
The Symptom Checklist-90-Revised (SCL-90-R) (Derogatis, 1994) was designed to cover nine different dimensions of psychological distress; the mean item score across all 90 items with theoretical values ranging from 0 through 4 is referred to as the Global Severity Index (GSI), which is widely used as a global index for psychological distress. Since the introduction of the SCL-90(-R), there has been a debate about the validity of the factorial structure, which was aptly expressed in the title of the paper ‘Factor structure of the SCL-90-R: is there one?’ (Cyr et al., 1985). More than two decades have passed since the publication of that paper; however, the debate has still not abated, as recent publications have demonstrated (Olsen et al., 2004, Arrindell et al., 2006, Elliott et al., 2006, Hafkenscheid et al., 2007). On one hand, there is a group of researchers that firmly believe in the multidimensionality of the instrument (Arrindell et al., 2004b, Arrindell et al., 2004a, Arrindell et al., 2006), whereas another group has pointed out that alternative models with only one or at most a few factors show an equally good or better fit (Hafkenscheid, 2004, Hafkenscheid et al., 2007). In a recent paper, Paap et al. (2011b) proposed a new scale solution of 7 scales based on a study involving patients referred for a personality disorder (PD); scales were built on two start items that reflected the content of the disorder that corresponded with the specific scale. The new solution included 60 of the 90 items clustered in seven scales: Depression, Agoraphobia, Physical Complaints, Obsessive-Compulsive, Hostility (unchanged), Distrust and Psychoticism. The authors found that most of the new scales discriminated reliably between patients with moderately low scores to moderately high scores. The items forming the GSI showed low scalability, and the authors concluded that their research findings lent support for a multidimensional model of the SCL-90-R. The authors speculated that the lack of agreement between studies might be due to several factors, such as difference in variance, the existence of structure generating factors, differences in the interpretation of the fit indices, and, finally, the chosen analytic strategy (Paap et al., 2011b).
In the current study, we investigate whether the findings in the study by Paap et al. can be generalized to other patient groups by comparing the dimensionality of the PD sample to that of a sample of persons with Gender Incongruence (GI) and a sample of depressed outpatients. The term ‘GI’ signifies the incongruence between one's gender identity on one hand, and one's assigned gender and/or one's congenital primary and secondary sex characteristics on the other hand (Kreukels et al., 2010, Meyer-Bahlburg, 2010).1 Following Kreukels et al., we use GI when referring to patients who have not yet been diagnosed with GID (APA, 1994) or transsexualism (WHO, 1992). We expect the reported level of psychological distress (estimated by the GSI) to be lower in the GI sample than in the depressed sample and PD sample. Haraldsen and Dahl (2000) showed that patients diagnosed with GID had slightly elevated GSI scores when compared to healthy adults, but did not reach the value of 1.0 which is the cut-off for clinically significant symptoms (GSIGID=0.6, GSIcontrols=0.4). In contrast, depressed outpatients have been found to exceed the cut-off (GSIDEP=1.4) (Leinonen and Niemi, 2007), and so have the patients in the PD sample used in the study by Paap et al. (GSIPD=1.5). Our main research questions are:
- (1)
Is the dimensionality of the SCL-90-R similar for patient groups that differ in level of reported psychological distress?
- (2)
Are the different factorial solutions found in the literature due to a difference in variance in reported psychological distress?
Following Paap et al. (2011b) and Meijer et al. (2011), Mokken Scale Analysis (MSA; Mokken, 1971) was used to analyze the data. MSA is a nonparametric Item Response Theory (IRT) approach that can be used to explore and test hypotheses about the dimensionality of a data-set, while at the same time resulting in scales adhering to a measurement model.
Section snippets
Personality disorder sample: PDlow and PDhigh
This sample consisted of 3078 patients admitted to 14 different day hospitals participating in the Norwegian Network of Personality-Focused Treatment Programs (Karterud et al., 1998), treated in the period from January 1993 through July 2007. This sample was also used in the study by Paap et al. (2011b). Sex ratio and age are depicted in Table 1. Seventy-nine percent were diagnosed with at least one personality disorder (PD). Of the PDs, Avoidant PD was most common (39%), followed by Borderline
Missing data: two-way imputation
Less than 1% of the data were missing in each of the data-sets. Following Paap et al. (2011b), we used Two-Way imputation (Bernaards and Sijtsma, 2000), which allows the user to transform an incomplete data-file into a complete one by using all available information about the proficiency of the respondent and the ‘difficulty’ of the item (Sijtsma and van der Ark, 2003). This method is easy to implement using SPSS (SPSS, 2007), using the syntax provided by van Ginkel and van der Ark (2005).
Description of the data
Table
Discussion
Studies reporting on the dimensionality of the SCL-90-R have had very diverse outcomes. To this day, the original 9-scale solution (Derogatis, 1994) remains controversial (Schwarzwald et al., 1991, Holi et al., 1998, Vassend and Skrondal, 1999, Schmitz et al., 2000, Olsen et al., 2004, Arrindell et al., 2006, Elliott et al., 2006, Hafkenscheid et al., 2007, Paap et al., 2011b). Here, we wanted to identify factors that could help explain the inconsistent findings in the literature. The main
Acknowledgments
We thank Jan van Bebber for imputating the missing data, Xi X. Zhao for preparing Fig. 1, and Mitzi Paap, Frøydis Hellem and Thomas Mengshoel for helpful discussions. We thank the patients and staff from the GID clinics in Amsterdam, Oslo, Hamburg and Ghent, as well as from the Department of Neuropsychiatry and Psychosomatic Medicine at Oslo University Hospital for their contribution to this study. Finally, we thank the patients and staff from the following treatment units in the Norwegian
References (42)
- et al.
Gender differences in the factor structure of posttraumatic stress disorder symptoms in war-exposed adolescents
Journal of Anxiety Disorders
(2011) - et al.
Assessing the utility of diagnostic criteria: a multisite study on gender identity disorder
Journal of Sexual Medicine
(2011) - et al.
Mokken scale analysis as time goes by: an update for scaling practitioners
Personality and Individual Differences
(2011) - et al.
The problem of structural indeterminacy in multidimensional symptom report instruments. The case of SCL-90-R.
Behaviour Research and Therapy
(1999) Diagnostic and Statistical Manual of Mental Disorders (3rd ed., revised) (DSM–III-R)
(1987)Diagnostic and Statistical Manual of Mental Disorders (4th ed.) (DSM–IV)
(1994)- et al.
Invariance of SCL-90-R dimensions of symptom distress in patients with peri partum pelvic pain (PPPP) syndrome
British Journal of Clinical Psychology
(2006) - et al.
Nog meer steun voor het multidimensionale karakter van de SCL-90-R [Even more support for the multidimensional nature of the SCL-90-R]
De Psycholoog
(2004) - et al.
Verdere steun voor het multidimensionale karakter van de SCL-90-R [Further support for the multidimensional nature of the SCL-90-R]
De Psycholoog
(2004) - et al.
Influence of simple imputation and EM methods on factor analysis when item nonresponse in questionnaire data is nonignorable
Multivariate Behavioral Research
(2000)
Factor structure of the SCL-90-R: is there one?
Journal of Personality Assessment
SCL-90-R: Administration, Scoring and Procedures Manual
Deconstructing therapy outcome measurement with rasch analysis of a measure of general clinical distress: the Symptom Checklist-90-revised
Psychological Assessment
Hoe multidimensionaal is de Symptom Checklist (SCL-90) nu eigenlijk? [How multidimensional is the Symptom Checklist (SCL-90) really?]
De Psycholoog
The dimensions of the Dutch SCL-90: more than one, but how many?
Netherlands Journal of Psychology
Symptom profiles of gender dysphoric patients of transsexual type compared to patients with personality disorders and healthy adults
Acta Psychiatrica Scandinavica
A Finnish validation study of the SCL-90
Acta Psychiatrica Scandinavica
Day treatment of patients with personality disorders: experiences from a Norwegian treatment research network
Journal of Personality Disorders
The Norwegian network of psychotherapeutic day hospitals
Therapeutic Communities
A European network for the investigation of gender incongruence: the ENIGI initiative
European Psychiatry
The influence of educational information on depressed outpatients treated with escitalopram: a semi-naturalistic study
Nordic Journal of Psychiatry
Cited by (28)
PROMIS Physical Function Short Forms Display Item- and Scale-Level Characteristics at Least as Good as the Roland Morris Disability Questionnaire in Patients With Chronic Low Back Pain
2020, Archives of Physical Medicine and RehabilitationEvaluating psychiatric symptoms in Parkinson's Disease by a clinimetric analysis of the Hopkins Symptom Checklist (SCL-90-R)
2018, Progress in Neuro-Psychopharmacology and Biological PsychiatryCitation Excerpt :According to the SCL-90-R manual (Derogatis, 1994) it is important to consider gender specific scorings but not age specific. This is in accordance both to Schmitz et al. (2000) and to Paap et al. (2012) who found no impact of age on the SCL-90-R subscales. Therefore, the two samples were gender matched.
SCL-90-R emotional distress ratings in substance use and impulse control disorders: One-factor, oblique first-order, higher-order, and bi-factor models compared
2017, Psychiatry ResearchCitation Excerpt :Paap et al. (2012) pointed to the fact that previously reported conflicting findings may be due to at least 2 sample characteristics: level of self-reported distress and biological sex. Paap et al. (2012) based their assertion on the outcome of analyses of SCL-90-R ratings yielded in three samples: severely psychiatrically disturbed patients, persons with gender incongruence, and depressed patients. Another determinant of factor structure was recently reported when SCL-90 ratings of adolescent psychiatric inpatients (at admission and discharge) and age- and gender-matched community youth were analyzed by Rytilä-Manninen et al. (2016).