Introduction

Adjustment after a spinal cord injury (SCI) presents a formidable challenge. While the primary emphasis on adjustment is physical function, psychological adjustment presents an ancillary challenge. Depression following SCI is common and is the most frequently cited psychological issue in SCI populations. Estimates of prevalence range from 11 to 60%,1, 2, 3 with the prevalence in women being approximately twice that of men.2 The occurrence of depression is not surprising and has even been considered inevitable4 and perhaps a natural and necessary part of the acceptance process. However, recent studies have shown that depression after SCI is neither inevitable nor beneficial, and moreover not everyone who sustains an SCI becomes depressed.5

Depression following SCI is concerning because of its negative influence on the rehabilitation process, increased length of hospitalization, decreased longevity, increased rates of suicide, restricted community participation and reduced health and daily functioning are all plausible consequences. Data obtained in earlier studies suggest that many newly injured persons, who meet the criteria for depression, remit within 3 months of onset.6 Because depression is treatable, early identification and treatment are essential to prevent further problems.

Diagnosing depression is not straight forward and various methodological challenges have constrained its study in the SCI population.7 One particular issue includes the difficulty defining and operationalizing depression as well as the inconsistent use of diagnostic criteria to identify this condition.8 The gold standard for identifying depression is through the use of structured diagnostic interviews based on the DSM-IV Axis I Disorders criteria. These interviews take considerable time and special expertise. As a result, such detailed interviews are not routinely adopted as a regular clinical practice outside of the context of research.9 Using a screening tool can facilitate case-finding and moreover provide evidence of outcome after intervention.

Several screening tools exist such as the Beck Depression Inventory and the Zung Self-Rating Depression scale. The 20-item Center for Epidemiological Studies Depression scale (CESD) has been used since the late 1970s.10 It is recognized for its good psychometric qualities, reliability and validity when used with general population and in primary care patients.11 It has also proven to be a useful indicator of depression with older adults. Like other self-report measures, it is convenient to use due to its ease and low cost of administration; however, to date, the measurement properties have not been well established for the SCI population. One clinically attractive alternative form of this tool is the 10-item CESD. The purpose of this study was to examine the internal consistency, 2-week retest reliability and the concurrent validity of both the 20- and 10-item CESD in a sample of individuals with traumatic SCI. We also provide normative data for a variety of sample characteristics such as age, sex, motor impairment level and duration of injury.

Materials and methods

Design/sample

A 2-week follow-up methodological study design was used to address the study objectives. Acknowledging that there is no ideal retest period, we chose a 2-week period as it would likely eliminate any potential effects of memory while minimizing the potential for change in symptoms of depression.12

Community living individuals more than 1-year after traumatic SCI were recruited using advertisements in a disability newsletter, locally distributed posters and from the GF Strong Rehabilitation Research database. The GF Strong Rehabilitation Centre is the primary provider of SCI rehabilitation in British Columbia. Inclusion criteria included age over 19; American Spinal Injury Association (ASIA) impairment level of A or B (indicating no motor function below the sacral levels 4 and 5) SCI; no mental health issues requiring active treatment; and residence within 50 km of the study centre. Individuals were excluded if they were unable to provide informed consent or read English; did not provide complete information; had ventilator-dependent tetraplegia; had active medical conditions such as anemia, hypothyroidism or sleep apnea as identified by physician screen; or experienced a change in health status (new medical conditions or medication changes), activity, caregiver and social situation over the retest interval. We excluded ventilator-dependent individuals with SCI as they make up a small proportion of our accessible population. Therefore, excluding this subgroup would keep the sample homogeneous.

Protocol

Upon providing consent, subjects were scheduled for two visits to the research laboratory approximately 2 weeks apart. During session one, subject demographics (age, education, marital and work status), SCI factors (date and cause of injury, mobility device use) and all standardized measures were collected. The ASIA assessment was performed by a physician trained in the protocol. The order of the self-report measures completed for the study was randomized. All but four subjects returned to the centre for the second session; those who did not return provided information by phone and e-mail. Subjects were screened for changes in health status (change in medication use, hospitalizations and new diagnoses), activity, caregiver and social situation. The study protocol was approved by the Clinical Research Ethics Review Board of the University of British Columbia.

Measurement

The 20-item CESD (CESD-20) was originally developed to screen for the frequency of symptoms of depression in the general population. The questions for this tool were generated by selecting items from previously validated tools.10 Its components cover elements related to depressed mood, feelings of guilt and worthlessness, helplessness and hopelessness, psychomotor retardation, appetite loss and sleep disturbance. Responses capture the frequency of feelings and behaviours over the past 7 days and are rated on a 4-point scale ranging from 0 (rarely or none of the time) to 3 (most or all of the time). Items number 4, 8, 12 and 16 are reverse scored and once corrected an overall score is calculated by summing the scores with higher scores suggesting greater levels of depressive symptoms. Scores of over 15 have been reported to indicate depression in the general population based on DSM III criteria; however, Kuptniratsaikul et al.13 suggested scores over 19 should be used for individuals with SCI. They reported high sensitivity (80%) and reasonable specificity (68%) using this cut score. In a separate study, the internal consistency among a diverse sample of individuals with SCI was alpha=0.88.14 The shortened CESD-10 was created by Andresen et al.15 Less has been reported about the psychometric properties of the shorter version, but Andresen et al. reported high agreement with respect to predictive accuracy (κ=0.97) between cut scores of >10 on the CESD-10 and scores of >15 on the CESD-20. Validity has been reported in the predicted direction with measures of poorer health status (r=0.37) and positive affect (r=−0.67) in a large sample of well-older adults. Retest correlations in the same sample were adequate (r=0.71).

Standardized measures other than the CESD used in this study were included to assess the validity. These tools consisted of a visual analogue scale for fatigue (VAS-F) and the Short Form-36 (SF-36). The VAS-F is a one-item self-report method of capturing global fatigue. The participant marks a point along a 100-mm line indicating the severity of his/her fatigue over the past week. The anchors for the scale are ‘absolutely no fatigue’ and ‘extremely severe fatigue’. It was included in this study given the moderate association between fatigue and depression.16

The SF-36 is the most widely used measure for assessing health status. It was designed for use in clinical practice and research17 to provide information about the individual's mental and physical health status. For the purposes of this study we used all eight of the subscales that make up the SF-36. These include the mental health, emotional role function, vitality, pain, physical health, physical role function, social role function and general health.

Data analysis

To determine the minimum number of subjects needed for the study, we based our sample size calculation on the work of Donner and Eliasziw.18 Given the relatively short retest period we hypothesized the retest reliability of the CESD using intraclass correlation coefficient (ICC) would be >0.80. We determined that given sufficient power (0.80) and an alpha of 0.05, 42 subjects would be needed.18 We over-sampled to compensate for missing data or failure to attend session two.

Descriptive statistics such as means, standard deviations and proportions were used to provide a summary of all variables. In addition, mean CESD values were derived relative to demographic and SCI-related factors. t-tests and one-way analysis of variance, for binary and multi-group (>2), respectively, were conducted to assess if statistically relevant group differences existed for these factors to provide normative data based on the sample.

Cronbach's alpha was derived to assess reliability related to internal consistency and one-way analysis of variance ICCs (1,1) were calculated to assess the 2-week reliability of the total score and the item-by-item scores of the CESD. The Bland–Altman or limits of agreement method was used to provide a visual assessment of within-test agreement between the baseline and follow-up ratings and to identify any bias that might exist.19 The difference in CESD scores between the two data collection sessions is plotted against the average for each subject. The mean and standard deviation of the mean difference as well as the true value of the mean using 95% confidence intervals (CIs) were calculated to further assess the existence of bias. Visual inspection of the plot is used to determine if bias is likely. Ideally the data points will be distributed evenly along the ‘zero line’ if there are no ceiling or floor effects, and an equal number of points should lie above and below if there is no pretest/post-test bias. Finally, the data points should fall within the 95% confidence intervals.

Analysis of validity was performed using Pearson's product moment correlation. We hypothesized the CESD would have a statistically significant positive correlation with the VAS-F and a negative correlation with each of the eight SF-36 subscales. We anticipated moderate or higher correlations (r>0.5) with scores that represented mental health attributes such as the vitality, mental health and emotional role limitation subscales of the SF-36 and the VAS-F. Weaker correlations were expected between the CESD scores and the physical and general health subscales of the SF-36. The level of statistical significance was set at P<0.05. All analyses were completed using SPSS version 14.1.

Results

Fifty-five individuals were recruited; however, five did not provide information on the CESD at session two, two reported a major change during the retest period and one subject exceeded the 2-week follow-up collection period by 2 weeks. The mean age of the remaining 47, primarily male (64%) subjects, was 40.6 (±12.6) years. The range in years from the time of lesion was 1–41 years, the majority had an ASI A classification (68%) and had their injury related to traumatic reasons (87%). Other sample demographic and SCI-related information is presented in Table 1.

Table 1 Sample descriptive statistics and mean CESD-20 and CESD-10 values by sample factors (N=47)

The mean CESD-20 score for the sample was 15.2 (range 0–42) with 39% of the sample scoring over 15 and 30% scoring over 19 and the mean CESD-10 score was 9.7 (range 0–25) with 38.3% scoring over 10. The only significant CESD mean difference based on demographic or clinical factors was for those who were working or attending school versus those who were not (Table 1). Although not statistically significant fewer symptoms of depression were evident the longer one had a SCI.

Internal consistency measured using Cronbach's alpha was 0.91 for the CESD-20 and 0.86 for the CESD-10. Item-wise deletion revealed that the alpha did not change by more than 0.05 with the exclusion of any item. The total score test–retest reliability was ICC=0.87 (95% CI 0.79–0.93) for the CESD-20 and ICC=0.85 for the CESD-10 (95% CI 0.75–0.92). The item-by-item reliability (Table 2) revealed that the lowest reliability was for item 4 (ICC=0.30) and highest for item 3 (ICC=0.73).

Table 2 Item wise and total ICC values for CESD

The Bland–Altman limits of agreement plots (Figures 1a and b) indicate that an equal number of subjects scored above and below zero, suggesting that there was minimal bias towards scoring higher or lower during the baseline or follow-up stages. The distribution of the points tended to cluster closer to the left-hand side of the plot. However <15% of the sample scored at either extreme of the scale suggesting there is likely no floor or ceiling effect.19, 20 Three of 47 data points were found to lie outside of the two standard deviations of the mean difference, suggesting that the error in reliability readings for the test–retest reliability may be statistically significant.19 A t-test of the mean values demonstrated there was no statistical difference for the CESD-20 (mean difference=−1.3; 95% CI=−2.76, 0.24; t=−1.70; P=0.10) or the CESD-10 (mean difference=−0.22; 95% CI=−1.21, 0.78; t=−0.44; P=0.66) at time 1 and time 2. The inclusion of 0 in the 95% CI suggests minimal bias between the measurement sessions.

Figure 1
figure 1

(a) Bland–Altman limits of agreement plot of CESD-20. (b) Bland–Altman limits of agreement plot of CESD-10.

The expected associations between the CESD and the other measures were observed in terms of the hypothesized direction and magnitudes of the correlations (Table 3). The strongest correlation was with the mental health subscale with approximately 50% shared variance followed by the vitality, fatigue, general health and role emotional subscale scores, all with approximately 21–36% shared variance. The weakest correlation was with the physical functioning and pain subscales.

Table 3 Pearson's correlations of the CESD-20 and CESD-10 with other measures

Discussion

The results of this study provide support for the reliability and validity of the original CESD-20 and the CESD-10. The internal consistency of the CESD-20 was marginally better than the CESD-10, which can be expected simply based on the difference in the number of items included; however, the retest reliability was virtually the same. Both the internal consistency and the retest reliability coefficients exceed the criteria Andresen et al.15 advocate as indicative of a top grade.

The Pearson's correlations between the validity measures and both versions of the CESD were in the same direction and of similar magnitude, although the coefficients were slightly larger for the CESD-10 except for the association with the SF-36 mental health score. The magnitude of the validity coefficients was similar to those reported in different populations. For instance, the mental health subscale score was related to (r=−0.76) the CESD in a sample of individuals with rheumatoid arthritis.21 Previous studies in cancer survivors,22 individuals with sarcoidosis23 and family caregivers of stroke survivors24 also reported stronger correlation of the mental health component scores to CESD than the physical health scores. The consistency of these findings with those of our study supports the notion that the CESD is valid for use with the SCI population.

Limitations of our study include the generalizability of our study results. Our sample did not include individuals in the first year after SCI or those with motor incomplete SCI. Moreover, the sample was well educated. A larger sample would have enabled us to assess the factor structure of the tool for the target population. Our retest period was 2 weeks and although we are confident that is long enough to reduce any influence of learning on the subjects’ responses,12 it is plausible that a longer period may have eliminated this potential source of bias. A number of variables were used to assess the construct validity of the CESD by determining how well it converged with a measure of fatigue and attributes of the SF-36. We recognize that using a single item measure of fatigue is less than ideal; however, we are unaware of any fatigue measures that have been validated with the SCI population. Therefore, when considered in combination with an effort to minimize burden on the sample, we elected to use the VAS-F. Finally, screening tools like the CESD should not be used to diagnose depression; however, they do provide valuable information that may alert clinicians to conduct further evaluation.

Conclusion

Depression after SCI can restrict participation in life and may predispose individuals to medical complications. Moreover on the basis of our clinical experience, depression may interfere with participation in inpatient and outpatient rehabilitation. The challenge for clinicians is to distinguish those normal emotional responses to SCI from clinically significant depression that requires treatment. The CESD-20 and CESD-10 are quick and easy to use and have the advantage of published normative data from studies of the general population. This study provides evidence in support of the reliability and validity of these tools as screening tools for depression and normative data for individuals with SCI.