The American Psychiatric Association (APA) has updated its Privacy Policy and Terms of Use, including with new information specifically addressed to individuals in the European Economic Area. As described in the Privacy Policy and Terms of Use, this website utilizes cookies, including for the purpose of offering an optimal online experience and services tailored to your preferences.

Please read the entire Privacy Policy and Terms of Use. By closing this message, browsing this website, continuing the navigation, or otherwise continuing to use the APA's websites, you confirm that you understand and accept the terms of the Privacy Policy and Terms of Use, including the utilization of cookies.

×
ArticlesFull Access

Divergent Outcomes in Cognitive-Behavioral Therapy and Pharmacotherapy for Adult Depression

Abstract

Objective:

Although the average depressed patient benefits moderately from cognitive-behavioral therapy (CBT) or pharmacotherapy, some experience divergent outcomes. The authors tested frequencies, predictors, and moderators of negative and unusually positive outcomes.

Method:

Sixteen randomized clinical trials comparing CBT and pharmacotherapy for unipolar depression in 1,700 patients provided individual pre- and posttreatment scores on the Hamilton Depression Rating Scale (HAM-D) and/or Beck Depression Inventory (BDI). The authors examined demographic and clinical predictors and treatment moderators of any deterioration (increase ≥1 HAM-D or BDI point), reliable deterioration (increase ≥8 HAM-D or ≥9 BDI points), extreme nonresponse (posttreatment HAM-D score ≥21 or BDI score ≥31), superior improvement (HAM-D or BDI decrease ≥95%), and superior response (posttreatment HAM-D or BDI score of 0) using multilevel models.

Results:

About 5%−7% of patients showed any deterioration, 1% reliable deterioration, 4%−5% extreme nonresponse, 6%−10% superior improvement, and 4%−5% superior response. Superior improvement on the HAM-D only (odds ratio=1.67) and attrition (odds ratio=1.67) were more frequent in pharmacotherapy than in CBT. Patients with deterioration or superior response had lower pretreatment symptom levels, whereas patients with extreme nonresponse or superior improvement had higher levels.

Conclusions:

Deterioration and extreme nonresponse and, similarly, superior improvement and superior response, both occur infrequently in randomized clinical trials comparing CBT and pharmacotherapy for depression. Pretreatment symptom levels help forecast negative and unusually positive outcomes but do not guide selection of CBT versus pharmacotherapy. Pharmacotherapy may produce clinician-rated superior improvement and attrition more frequently than does CBT.

Cognitive-behavioral therapy (CBT) and pharmacotherapy are, on average, equally and moderately efficacious acute-phase treatments for unipolar depression (1). For example, 50%−60% of patients receiving CBT or pharmacotherapy for depression respond, compared with 40% receiving pill placebo (2, 3), and the average CBT or pharmacotherapy patient has posttreatment symptom levels roughly 0.3 SD below those of patients receiving pill placebo (4, 5). However, these favorable averages may obscure some patients’ negative outcomes (6). Moreover, beyond conventionally defined response (≥50% symptom reduction) and remission (posttreatment scores ≤7 on the Hamilton Depression Rating Scale [HAM-D] [7]), only patients with unusually positive outcomes have no residual symptoms with attendant dysfunction (8).

In this study we focused on patient-level treatment outcomes diverging from the average. We analyzed two specific negative outcomes: deterioration (symptom severity increases from pre- to posttreatment) (9) and extreme nonresponse (severe depressive symptoms posttreatment) (10), although many others are possible (11). Complementarily, we defined two unusually positive outcomes: superior improvement (≥95% symptom reduction from pre- to posttreatment) and superior response (no depressive symptoms posttreatment). We estimated the frequency of these outcomes and tested antecedents in a large patient-level database (N=1,700) pooled from 16 randomized clinical trials comparing CBT to pharmacotherapy. Such analyses may clarify which patients are more likely to have negative or unusually positive outcomes, help match treatments to patients, and through such matching increase treatments’ overall benefits (12).

The frequency and causes of deterioration, extreme nonresponse, superior improvement, and superior response in CBT and pharmacotherapy for depression are unclear. Perhaps 5%−10% of psychotherapy patients show reliable deterioration (increases exceeding the symptom measure’s reliable change threshold), with evidence of higher risk among children than adults, in routine practice compared with randomized clinical trials, and in psychotherapy considered more broadly than CBT specifically (9, 13). Reliable deterioration may occur with similar frequency in pharmacotherapy for depression (14). The concept of extreme nonresponse was introduced in CBT research (10), but observed rates have varied widely in CBT patients, 6% (15) in one sample and 22% in another (10). Finally, based on distributions of symptom scores, superior response and superior improvement possibly occur in <10% of treated depressed patients (8, 16). The current large-sample analyses were designed to clarify these outcome frequencies in pharmacotherapy versus CBT for depression.

Patient characteristics may predict negative and unusually positive outcomes in CBT or pharmacotherapy for depression. Pretreatment markers of greater pathology (e.g., comorbidity, suicidality, unemployment, prior hospitalizations) have predicted less improvement in CBT (17, 18) and pharmacotherapy (19, 20). In addition, higher pretreatment levels of depressive symptoms, plus poorer pretreatment functioning and therapeutic alliance, have predicted extreme nonresponse in CBT (10, 15). Broadly, pretreatment symptom levels may relate directly to posttreatment symptom levels but inversely to the direction of symptom change (21). Thus, greater pretreatment severity may predict higher probabilities of extreme nonresponse and superior improvement but lower probabilities of superior response and deterioration. In contrast, demographic variables have been inconsistent predictors of CBT or pharmacotherapy outcomes (18, 22), perhaps because detecting modest effects without large individual-patient samples is difficult.

Whether treatments or illness trajectories cause particular outcomes is difficult to determine (6). Unipolar depression is often episodic, and mood reactivity within episodes is common (23). Thus, deterioration and extreme nonresponse could be adverse effects of treatment, unrelated to treatment, or even benefits of treatment (i.e., some patients would have been worse off untreated). Similarly, superior improvement and superior response may sometimes reflect “spontaneous” improvements due to biochemical, learning, or life events unrelated to treatment.

Pharmacotherapy and CBT likely share some active ingredients, including patients’ and clinicians’ expectancies of benefit and clinicians’ regular supportive interactions with patients. In addition, CBT may reduce negative emotion via top-down processes, including patients’ skill acquisition and use (24), whereas pharmacotherapy may act directly on neurochemical pathways to reduce negative emotion in bottom-up processes (25). Different hypothesized mechanisms highlight the possibility that CBT and pharmacotherapy may produce divergent outcomes for different patient subpopulations.

Data from randomized clinical trials support identification of potential causes of patient outcomes (6). Different frequencies of deterioration, extreme nonresponse, superior improvement, and superior response between patients randomly assigned to CBT or pharmacotherapy, for example, would suggest that these events may result from components of particular treatment protocols. Moreover, randomized clinical trials allow testing of moderators of outcomes. Moderators are interactions between treatments (e.g., CBT versus pharmacotherapy) and patient characteristics that provide information about individuals’ differential outcomes in one treatment versus another (18).

The goals of this study were to 1) estimate the proportions of patients with negative outcomes (deterioration, extreme nonresponse) and unusually positive outcomes (superior improvement, superior response) in CBT or pharmacotherapy for depression; 2) test predictors of these outcomes, including the type of treatment (CBT versus pharmacotherapy) and patient demographic and clinical characteristics; 3) test whether patient characteristics moderate the effects of treatment (CBT versus pharmacotherapy) on these outcomes; and 4) contrast pretreatment characteristics of patients with opposing outcomes (deterioration versus superior improvement; extreme nonresponse versus superior response).

Method

Identification and Selection of Studies

We analyzed individual patient data pooled from randomized clinical trials comparing CBT to pharmacotherapy for adults (≥18 years) with unipolar depression, defined by major depressive disorder, dysthymic disorder, or elevated ratings of depressive symptoms on standard measures. Psychological interventions with cognitive restructuring as a core component defined CBT for study selection (1).

We accessed an existing database of trials to identify potential studies for this meta-analysis (www.evidencebasedpsychotherapies.org). The database was updated through January 2014 for the current search, and 24 studies met the inclusion criteria. We contacted study authors by e-mail to invite their participation. We requested patient-level data including pre- and posttreatment depressive symptom scores, demographic variables, and clinical characteristics. Authors provided patient-level data for 16 trials, which are described in Table S1 in the data supplement that accompanies the online version of this article.

Participants

The analyzed sample included 1,700 adults from 16 randomized trials comparing CBT (N=794) with pharmacotherapy (N=906) for depression. As shown in Table 1, most patients were women, of majority ethnicity, educated beyond high school, and of middle age. Most patients (96.8%) had major depressive disorder, 1.4% had dysthymic but not major depressive disorder, and 1.8% had no depression diagnoses but elevated depressive symptom scores before treatment (HAM-D: mean=16.19, SD=4.78; BDI: mean=21.38, SD=9.36).

TABLE 1. Descriptive Statistics for Individual Patient Variables in Meta-Analysis of Divergent Outcomes in CBT and Pharmacotherapy for Depression

VariableTotal Number of StudiesTotal Number of PatientsMeanSD%
Demographic characteristics
 Female161,70069.4
 Age (years)141,62637.3811.64
 Minority ethnicity111,39014.0
 Education >12 years121,43965.5
 Employed full time111,28952.0
Clinical characteristics
 Major depressive disorder161,70096.8
 Dysthymic disorder791318.3
 Double depressiona791315.7
 Comorbid diagnosisb111,41732.1
Depressive symptom scores
 17-item Hamilton Depression Rating Scale
  Pretreatment with missing values141,46619.184.58
  Pretreatment with imputed values141,47119.184.58
  Posttreatment with missing values141,0378.606.49
  Posttreatment with imputed values141,4719.176.32
 Beck Depression Inventory-II
  Pretreatment with missing values141,40030.449.79
  Pretreatment with imputed values141,49130.349.81
  Posttreatment with missing values141,07411.169.84
  Posttreatment with imputed values141,49112.049.55

aDouble depression indicates diagnoses of both major depressive disorder and dysthymic disorder.

bComorbid diagnoses are those other than depressive disorders.

TABLE 1. Descriptive Statistics for Individual Patient Variables in Meta-Analysis of Divergent Outcomes in CBT and Pharmacotherapy for Depression

Enlarge table

Measures

Depressive symptom severity.

Fourteen studies used the clinician-report HAM-D, 14 studies used the patient-report BDI, and 12 used both measures. The 17-item HAM-D yields scores from 0 to 52. The 21-item BDI yields scores from 0 to 63. We converted first-edition BDI scores (26) to the slightly higher second-edition metric (27) before analysis. Higher HAM-D and BDI scores mark greater depressive symptoms.

Operational definitions.

“Any deterioration” was defined as an increase of ≥1 point on the HAM-D or BDI from pre- to posttreatment. “Reliable deterioration” was an increase meeting the p<0.05, two-tailed, change threshold of ≥8 points on the HAM-D (28) or ≥9 points on the BDI (29). The any-deterioration group contained all patients with deterioration, whether reliable or not.

Following previous standards (10, 15), we defined “extreme nonresponse” as posttreatment BDI scores ≥31. We converted this BDI threshold (30) to define extreme nonresponse as HAM-D scores ≥21.

“Superior improvement” was a pre- to posttreatment decrease in HAM-D or BDI score of ≥95%, marking improvement beyond traditional response thresholds (≥50%) and contrasting with deterioration.

“Superior response” was a posttreatment HAM-D or BDI score of 0, marking the absence of measured depressive symptoms and contrasting with extreme nonresponse.

Statistical Analyses

We multiply imputed missing BDI and HAM-D scores among the studies using these measures to allow intent-to-treat analyses. We used the Markov chain Monte Carlo method in PROC MI to generate 10 complete data sets, computed outcomes for each patient in the complete data sets, conducted multilevel analyses in each data set with PROC GLIMMIX, and pooled the results via PROC MIANALYZE in SAS 9.3 software (SAS Institute, Cary, N.C.). Because the patient data were nested within studies, multilevel analyses controlled for the random effect of study. Logistic regression models predicted binary outcomes (e.g., presence versus absence of deterioration) from the fixed effects of treatment (CBT versus pharmacotherapy), a predictor (e.g., age, gender), and the interaction of treatment and the predictor (i.e., the moderator effect). We contrasted dichotomous and continuous pretreatment characteristics of patients with opposing outcomes in logistic and normal linear multilevel models, respectively.

Previous Analyses of the Individual Patient Database

Whereas the current analyses focused on outcomes diverging from the mean, past studies using the individual patient database focused on averages and conventional outcomes (31, 32). Average treatment effects did not differ significantly between the eight excluded versus 16 included studies, suggesting that the included studies are representative (32). Among the included studies, pharmacotherapy relative to CBT produced statistically significantly lower average posttreatment HAM-D (but not BDI) scores, although the difference (0.88 HAM-D points) was small (32). Conventionally defined response (HAM-D decrease of ≥50%) and remission (posttreatment HAM-D ≤7), respectively, did not differ significantly between pharmacotherapy (63.4%, 51.0%) and CBT (57.5%, 47.0%) (32). Finally, gender (31) and symptom severity (32) did not moderate CBT versus pharmacotherapy’s effects on posttreatment symptom averages.

Results

Frequency and Treatment Differences in Outcomes

Table 1 displays descriptive statistics for the variables analyzed. With the random effect of study controlled for, attrition (defined as posttreatment HAM-D and BDI scores both missing) was more frequent in pharmacotherapy (18.5%) than in CBT (12.0%) (odds ratio=1.67, p<0.01) but did not correlate significantly with pretreatment HAM-D or BDI score (p>0.25). Patients began treatment with moderate to severe depressive symptoms and ended with mild symptoms, on average, according to their scores on the HAM-D and BDI.

Figure 1 shows scatterplots of HAM-D and BDI scores pre- and posttreatment with regions marking the analyzed outcomes (see Figure S1 in the online data supplement for plots separating CBT and pharmacotherapy). Visual analysis of the scatterplots suggested that patients with deterioration versus superior improvement, and extreme nonresponse versus superior response, marked opposite tails of the symptom-change and posttreatment symptom-level continua, respectively.

FIGURE 1.

FIGURE 1. Depressive Symptom Severity Before and After Treatment (CBT or Pharmacotherapy) for Depressiona

a Larger dots represent more patients. Plotted data include imputation of missing values. Pre, pretreatment score; Post, posttreatment score.

The correlation between continuous HAM-D and BDI scores was moderately high (r=0.85) for pooled pre- and posttreatment observations. Similarly, the percentage agreement between the HAM-D and BDI scores was moderately high in identifying patients with versus without deterioration (92.3%), reliable deterioration (98.2%), extreme nonresponse (94.3%), superior improvement (88.8%), and superior response (92.3%). Nonetheless, clinician ratings (HAM-D) and patient ratings (BDI) sometimes differed regarding the presence of the analyzed outcomes (Figure 2).

FIGURE 2.

FIGURE 2. Patients With Analyzed Outcomes According to Clinician Ratings (HAM-D) and Patient Ratings (BDI) in 12 Studies Using Both Measuresa

a Any deterioration and reliable deterioration, respectively, are increases of ≥1 and ≥8 points on the Hamilton Rating Scale for Depression (HAM-D) or ≥1 and ≥9 points on the Beck Depression Inventory-II (BDI) from pre- to posttreatment. Extreme nonresponse is a posttreatment HAM-D score ≥21 or BDI score ≥31. Superior improvement is a reduction of HAM-D or BDI score by ≥95%. Superior response is a posttreatment HAM-D or BDI score of 0.

Our first goal was to estimate the frequency of outcomes (see Table 2). Overall, 5%−7% of patients showed any deterioration (pre- to posttreatment increase ≥1 point on the HAM-D or BDI), 1% showed reliable deterioration (increase of ≥8 HAM-D or ≥9 BDI points), 4%−5% showed extreme nonresponse (posttreatment score on the HAM-D ≥21 or BDI ≥31), 6%−10% showed superior improvement (HAM-D or BDI decrease ≥95%), and 4%−5% showed superior response (posttreatment HAM-D or BDI=0). About 13% of patients had any negative outcome (deterioration or extreme nonresponse), and 15% had an unusually positive outcome (superior improvement or superior response) on the HAM-D or BDI.

TABLE 2. Estimated Proportions of Patients With Negative Outcomes and Unusually Positive Outcomes in Meta-Analysis of Divergent Outcomes in CBT and Pharmacotherapy for Depressiona

Estimated Proportion of Patients, Derived From Multilevel Logistic Regression Models Adjusting for Random Effect of Study
OverallCognitive-Behavioral TherapyPharmacotherapy
Outcomeb%95% CI%95% CI%95% CITreatment Difference (p)
Negative outcomes
Any deterioration
 HAM-D7.14.7, 10.77.74.8, 12.06.64.1, 10.50.48
 BDI5.23.4, 8.05.43.3, 8.85.03.0, 8.20.78
Reliable deterioration
 HAM-D0.90.4, 1.81.20.5, 2.60.60.2, 1.90.32
 BDI1.10.5, 2.50.80.3, 2.51.40.6, 3.40.37
Extreme nonresponse
 HAM-D5.43.7, 7.75.33.4, 8.15.53.6, 8.20.88
 BDI4.32.9, 6.44.62.9, 7.34.12.5, 6.50.62
Any negative outcome13.39.9, 17.614.110.1, 19.212.79.1, 17.60.54
Unusually positive outcomes
Superior improvement
 HAM-D6.44.4, 9.24.82.9, 7.77.75.2, 11.30.04
 BDI9.86.4, 14.78.95.6, 13.910.66.8, 16.10.32
Superior response
 HAM-D3.92.6, 5.93.21.8, 5.54.52.9, 7.00.23
 BDI5.43.0, 9.44.52.4, 8.36.23.4, 10.90.17
Any unusually positive outcome15.110.6, 21.012.88.5, 18.716.911.7, 23.70.06

aAnalyses included multiple imputation of missing scores on the HAM-D (Hamilton Depression Rating Scale) and BDI (Beck Depression Inventory-II). Fourteen studies each used the HAM-D and BDI, whereas 12 studies used both measures.

bAny deterioration and reliable deterioration, respectively, are increases of ≥1 and ≥8 points on the HAM-D and ≥1 and ≥9 points on the BDI from pre- to posttreatment. Extreme nonresponse is a posttreatment HAM-D score ≥21 or BDI score ≥31. Superior improvement is a reduction of HAM-D or BDI score by ≥95%. Superior response is a posttreatment HAM-D or BDI score of 0. “Any negative outcome” is deterioration and/or extreme nonresponse, and “any unusually positive outcome” is superior improvement and/or superior response, on either measure.

TABLE 2. Estimated Proportions of Patients With Negative Outcomes and Unusually Positive Outcomes in Meta-Analysis of Divergent Outcomes in CBT and Pharmacotherapy for Depressiona

Enlarge table

Frequencies of deterioration, extreme nonresponse, and superior response did not differ significantly between CBT and pharmacotherapy. Treatment with pharmacotherapy significantly increased patients’ chances, relative to CBT, of superior improvement on the HAM-D (odds ratio=1.67) but not on the BDI. (Among the five trials including pill placebo arms, deterioration, extreme nonresponse, superior improvement, and superior response did not differ significantly among CBT, pharmacotherapy, and placebo, perhaps owing to lower statistical power; see Table S2 in the online data supplement. However, the frequency of any negative outcome was lower with pharmacotherapy than with placebo [odds ratio=0.59].)

Prediction and Moderation of Outcomes

Our second and third goals were to test predictors and moderators of outcomes. We tested predictors (main effects) and moderators (interaction effects) in a series of multilevel logistic regression analyses (see Table S3 in the data supplement for regression coefficients). The patient characteristics listed in Table 1 did not significantly moderate differential treatment effects (CBT versus pharmacotherapy), but several statistically significant predictors were evident.

First, a lower pretreatment level of depressive symptoms predicted deterioration, but only within measures (HAM-D or BDI). Patients with lower pretreatment HAM-D scores had a higher chance of any (odds ratio=1.92) and reliable (odds ratio=2.46) deterioration on the HAM-D. (Odds ratios for continuous predictors reflect changes of 1 SD on the predictor. Table 1 shows the SDs.) Similarly, patients with lower pretreatment BDI scores had a higher chance of any (odds ratio=2.48) and reliable (odds ratio=2.19) deterioration on the BDI.

Second, higher pretreatment symptom severity predicted extreme nonresponse both within and between measures. Patients with higher pretreatment HAM-D scores had a higher chance of extreme nonresponse on the HAM-D (odds ratio=1.89) and BDI (odds ratio=1.48). Similarly, patients with higher pretreatment BDI scores had a higher chance of extreme nonresponse on the BDI (odds ratio=2.25) and HAM-D (odds ratio=1.65).

Third, younger age (odds ratio=1.36) and higher pretreatment HAM-D scores (odds ratio=1.33) predicted superior improvement on the HAM-D (odds ratios for continuous predictors, e.g., age, reflect changes of 1 SD on the predictor; Table 1 shows SDs). Finally, full-time employment (odds ratio=1.98) and lower pretreatment BDI scores (odds ratio=1.46) predicted superior response on the BDI.

We followed up the pretreatment symptom scores’ significant prediction of outcomes within the HAM-D by testing specific depressive symptoms captured by individual scale item scores (eight studies provided item-level HAM-D data). Lower pretreatment depressed mood (odds ratio=1.60), less guilt (odds ratio=1.37), and fewer general somatic symptoms (odds ratio=1.56) predicted any deterioration. (Odds ratios for HAM-D items reflect changes of 1 point on the item.) Individual HAM-D items did not predict reliable deterioration significantly. Initial insomnia (odds ratio=1.58), middle insomnia (odds ratio=1.57), and weight loss (odds ratio=1.70) predicted extreme nonresponse. Finally, gastrosomatic symptoms predicted superior improvement (odds ratio=1.51).

Contrasts of Outcome Groups

Our fourth goal was to contrast the pretreatment characteristics of patients reaching opposing outcomes, deterioration versus superior improvement and extreme nonresponse versus superior response. As shown in Table 3, patients with any deterioration on the HAM-D had lower pretreatment HAM-D scores than patients with superior improvement. Similarly, patients with any deterioration on the BDI had lower pretreatment BDI scores than did patients with superior improvement. Finally, as shown in Table 4, patients with extreme nonresponse on the HAM-D or BDI had higher pretreatment scores on both symptom measures than did patients with superior response.

TABLE 3. Pretreatment Characteristics of Patients With Any Deterioration Versus Superior Improvement After Treatment for Depressiona

Group With Any Deterioration Versus Group With Superior Improvement on HAM-DGroup With Any Deterioration Versus Group With Superior Improvement on BDI
Pretreatment VariableAny DeteriorationSuperior ImprovementAny DeteriorationSuperior Improvement
MeanSEMeanSEMeanSEMeanSE
Age (years)37.41.835.41.938.22.037.01.7
HAM-D score17.30.720.1b0.719.20.819.40.7
BDI score28.51.629.71.623.71.629.6b1.4
%SE%SE%SE%SE
Female68.48.276.07.170.57.276.45.0
Minority ethnicity15.06.99.74.86.23.911.04.6
Education >12 years78.28.470.69.271.410.179.27.3
Employed50.59.348.39.139.411.050.210.2
Double depression15.08.35.04.912.87.911.86.8
Comorbid diagnosis35.614.329.513.531.214.628.113.1

aThe means and percentages (standard errors) were estimated in multilevel models controlling for the random effect of study. HAM-D, Hamilton Depression Rating Scale; BDI, Beck Depression Inventory-II. Any deterioration was defined as an increase in score, and superior improvement was defined as a decrease of ≥95%.

bPairwise contrast p<0.05, two-tailed.

TABLE 3. Pretreatment Characteristics of Patients With Any Deterioration Versus Superior Improvement After Treatment for Depressiona

Enlarge table

TABLE 4. Pretreatment Characteristics of Patients With Extreme Nonresponse Versus Superior Response After Treatment for Depressiona

Group With Extreme Nonresponse Versus Group With Superior Response on HAM-DGroup With Extreme Nonresponse Versus Group With Superior Response on BDI
Pretreatment VariableExtreme NonresponseSuperior ResponseExtreme NonresponseSuperior Response
MeanSEMeanSEMeanSEMeanSE
Age (years)37.31.936.12.138.32.137.71.9
HAM-D score22.00.818.3b0.820.70.819.0b0.8
BDI score33.01.928.2b1.835.71.726.7b1.6
%SE%SE%SE%SE
Female72.58.172.28.462.97.774.75.9
Minority ethnicity12.56.48.65.75.83.713.15.7
Education (>12 years)68.210.170.210.168.710.180.57.7
Employed48.69.545.110.553.311.458.610.8
Double depression12.3c0.7c5.66.86.46.3
Comorbid diagnosis37.914.829.314.131.714.827.813.5

aThe means and percentages (standard errors) were estimated in multilevel models controlling for the random effect of study. HAM-D, Hamilton Depression Rating Scale; BDI=Beck Depression Inventory-II. Extreme nonresponse was defined as a posttreatment HAM-D score of ≥21 or BDI score of ≥31. Superior response was defined as a posttreatment HAM-D or BDI score of 0.

bPairwise contrast p<0.05, two-tailed.

cThe statistical model failed to converge because events were too rare in some cells. Inferential contrast was not computed; the estimates are from raw data.

TABLE 4. Pretreatment Characteristics of Patients With Extreme Nonresponse Versus Superior Response After Treatment for Depressiona

Enlarge table

Discussion

Among 1,700 depressed patients from 16 randomized clinical trials comparing CBT to pharmacotherapy, we found that 13% experienced negative and 15% experienced unusually positive symptomatic outcomes. The analyzed negative outcomes were any deterioration (5%−7% had increases of ≥1 HAM-D or BDI points), reliable deterioration (1% had increases ≥8 HAM-D or ≥9 BDI points), and extreme nonresponse (4%−5% had posttreatment HAM-D scores of ≥21 or BDI scores of ≥31). Negative outcomes may be more common in practice settings outside of research (12, 14) because frequent assessment of patients, feedback to clinicians about patients’ progress, and monitoring of treatment fidelity (e.g., clinicians’ competence and protocol adherence) in many randomized clinical trials parallel interventions shown to reduce deterioration (13). Patients’ unusually positive outcomes included superior improvement (6%−10% had HAM-D or BDI score decreases ≥95%) and superior response (4%−5% had posttreatment HAM-D or BDI scores of 0). Because few patients receiving CBT or pharmacotherapy reached absent (superior response) or minimal (superior improvement) residual depressive symptoms, our results highlight the potential value of continuing, sequencing, or augmenting treatments (19, 33).

Treatment with pharmacotherapy versus CBT increased patients’ odds of superior improvement from the clinician’s perspective (HAM-D: 7.7% versus 4.8%) but not from the patient’s (BDI). Pharmacotherapy also predicted greater attrition relative to CBT (18.5% versus 12.0%), and among the subset of studies with placebo arms, fewer negative outcomes overall relative to pill placebo (16.2% versus 24.8%). If replicated, more frequent superior improvement during pharmacotherapy than in CBT may signal operation of an unknown moderator (e.g., to be revealed by genotyping) among patients who tolerate treatment (e.g., do not drop out). In addition, younger age (among adults) predicted superior improvement on the HAM-D, whereas employment predicted superior response on the BDI. These latter findings fit broader patterns of better outcomes for patients with less (versus more) severe pathology (1720). Treatment modality did not change patients’ odds of deterioration, extreme nonresponse, or superior response on the HAM-D or BDI.

Pretreatment symptom levels predicted outcomes and varied significantly between patients with negative versus unusually positive outcomes. In general, pretreatment symptom levels related directly to posttreatment symptom levels and inversely to the direction of symptom change. More specifically, lower pretreatment symptom severity increased risk for deterioration, and patients who deteriorated had lower pretreatment symptom levels than did patients with superior improvement, within measures (HAM-D or BDI). In addition, higher pretreatment symptom levels increased the odds of superior improvement within the HAM-D only. Specific depressive symptoms (HAM-D items) most relevant to these predictions included less depressed mood, less guilt, and fewer general somatic symptoms (deterioration) and more gastrosomatic symptoms (superior improvement).

Psychotherapy may produce negative outcomes, or fail to evoke unusually positive outcomes, when psychotherapists are confrontational and over-interpret common experiences and patient characteristics as pathological (34). However, lack of generalization across measures in our analyses of deterioration and superior improvement also suggests methodological artifacts, such as regression to the mean within measures. The patient (BDI) and clinician (HAM-D) measures may also capture somewhat different information (35) and have finite reliability and validity (36), as do all measures. Finally, patients with lower pretreatment symptom scores may be entering treatment as a depressive episode is waxing, and their deterioration and lack of superior improvement during treatment might reflect the natural course of illness.

Higher pretreatment depressive symptom levels also predicted extreme nonresponse, and patients with extreme nonresponse had higher pretreatment symptom levels than did patients with superior response, both within and between the HAM-D and BDI. In addition, lower pretreatment symptom levels increased patients’ odds of superior response on the BDI. Specific depressive symptoms (HAM-D items) that predicted extreme nonresponse on the HAM-D included initial insomnia, middle insomnia, and weight loss. Although CBT and pharmacotherapy benefit many severely depressed patients (37), our analyses suggest that severe depression at the beginning of treatment adds risk for severe depression and reduces the likelihood of being symptom-free at the end of pharmacotherapy or CBT.

The pretreatment symptom levels and patient characteristics analyzed here (age, gender, education, ethnicity, pretreatment symptom severity, double depression, comorbidity) did not moderate treatment effects on deterioration, extreme nonresponse, superior improvement, or superior response. Consequently, the current results do not support differential treatment selection (CBT versus pharmacotherapy) from symptom levels and patient characteristics to prevent negative or potentiate unusually positive outcomes. Other patient characteristics or events outside of treatment may account for these outcomes and are potential targets for future research. Even so, empirical guidance for selecting CBT or pharmacotherapy for a given patient based on average posttreatment symptom scores is available (16, 38).

On the basis of the current findings, we recommend assessing symptom levels frequently and longitudinally (especially among patients with high pretreatment severity) and pursuing rapid corrective action (e.g., increased session frequency, switching or augmenting treatment) if patients do not progress adequately (12). However, other negative outcomes and adverse events, including discrete behaviors (e.g., suicide), psychosocial outcomes (e.g., divorce; domestic violence), dropping out of treatment, and medication side effects (e.g., sexual dysfunction) arguably are distinct from depressive symptom levels (11) and may require different preventive efforts. Similarly, positive outcomes distinct from depressive symptoms are also important therapeutic targets (e.g., social-interpersonal functioning) (39).

The current study has additional limitations that temper our conclusions. For example, although the sample was large, low base rates of divergent outcomes and attrition may have limited detection of predictors and moderators. In addition, many important patient characteristics (e.g., personality profiles, depressive cognitive content, therapeutic alliance, social support) were not analyzed and could be targets for future research on predictors and moderators of divergent outcomes (10, 40). Moreover, our results from randomized clinical trials may not generalize fully to routine clinical practice (12, 14). Similarly, generalization of our findings to other negative outcomes (e.g., adverse events, side effects) or positive outcomes (e.g., improvement in work or social functioning) not studied here is unknown. Finally, our analyses focused on trials of CBT versus pharmacotherapy and do not address combinations, sequences, dissemination, or the quality of implementation of treatments.

Preventing negative and potentiating unusually positive outcomes may improve the overall efficacy of CBT and pharmacotherapy for depression. In this effort, future research might profitably test replication and mechanisms of the current findings. For example, greater odds of clinician-rated superior improvement among younger (versus older) adults and those treated with pharmacotherapy (versus CBT) and greater odds of patient-rated superior response for patients with full-time (versus less) employment possibly reflect measurable intrapatient, environmental, or interacting mechanisms. Similarly, specific depressive symptoms linked with clinician-rated extreme nonresponse (insomnia, weight loss), deterioration (less depressed mood, less guilt, fewer general somatic symptoms), and superior improvement (gastrosomatic symptoms) may reflect sampling or measurement error but may also reveal patient subpopulations with different treatment-response trajectories.

The current results clarify expectations for outcomes in acute-phase CBT or pharmacotherapy for adult depression. First, whereas the majority of patients responded by conventional standards (32), we found that a few patients (13%) had negative outcomes and some (15%) had very positive outcomes. Second, pretreatment symptom levels help forecast negative and unusually positive outcomes: Patients with severe pretreatment symptoms are more likely to have large drops in symptoms (superior improvement) and unlikely to get worse (deteriorate), but they are also more likely to end treatment with severe symptoms (extreme nonresponse) and are unlikely to end treatment symptom-free (superior response). Conversely, patients with milder pretreatment symptoms are more likely to end treatment symptom-free (superior response) and unlikely to end treatment with severe symptoms (extreme nonresponse), but they are also more likely to get worse (deteriorate) and less likely to have large drops in symptoms (superior improvement). Finally, choosing pharmacotherapy versus CBT may increase patients’ odds of both discontinuing treatment and clinician-rated superior response.

From the Department of Psychology, Truman State University, Kirksville, Mo.; the Department of Psychiatry, University of Texas Southwestern Medical Center, Dallas; the Department of Clinical Psychology and the EMGO Institute for Health and Care Research, VU University Amsterdam, the Netherlands; the Department of Psychology, Vanderbilt University, Nashville, Tenn.; the Department of Clinical Psychology and Psychotherapy, Babes-Bolyai University, Cluj, Romania; the Department of Psychology, University of Pennsylvania, Philadelphia; the Department of Psychology and Neuroscience, University of Colorado, Boulder; the Department of Psychiatry and Behavioral Sciences, Emory University School of Medicine, Atlanta; the Fatemeh Zahra Infertility and Reproductive Health Research Center and the Department of Psychiatry, Babol University of Medical Sciences, Babol, Iran; the Department of Psychiatry and Psychotherapy, University of Leipzig, Leipzig, Germany; the Department of Psychiatry, Faculty of Medicine, University of Toronto; the Department of Psychology, University of Toronto–Scarborough; the Health Services Research Center, Neuropsychiatric Institute, University of California, Los Angeles; the Department of Preventive Medicine and the Center for Behavioral Intervention Technologies, Feinberg School of Medicine, Northwestern University, Chicago; the Duke–National University of Singapore Graduate Medical School, Singapore; and the Department of Psychology, University of Notre Dame, Notre Dame, Ind.
Address correspondence to Dr. Vittengl ().

Dr. Vittengl is a reviewer for UpToDate. Dr. Jarrett receives grants from NIMH, nonfinancial support from Parke Davis (during the conduct of the study), consulting fees from NIMH and NIH, and consulting fees from UpToDate, and her medical center receives fees for cognitive therapy she provides to patients. Dr. Hollon is supported by NIMH grants MH60713 and MH01697. Dr. DeRubeis is supported by NIMH grant MH60998. Dr. Dimidjian receives royalties from Guilford Press and is on the advisory board of MindfulNoggin, which is part of NogginLabs, a private company specializing in customized web-based learning. Dr. Dunlop has received research support from Assurex, Bristol-Myers Squibb, Forest, GlaxoSmithKline, Janssen, NIMH, Otsuka, Pfizer, and Takeda; he has served as a consultant to Bristol-Myers Squibb, Medavante, Pfizer, and Roche. Within the last 3 years, Prof. Hegerl was an advisory board member for Lilly, Lundbeck, Takeda, Servier, and Otsuka; a consultant for Bayer and Nycomed; and a speaker for Bristol-Myers Squibb, Medice Arzneimittel, Novartis, and Roche. Dr. Kennedy has received grant/research support from Brain Canada, Bristol-Myers Squibb, Canadian Depression Research and Intervention Network, Canadian Institutes of Health Research, Clera, Eli Lilly, GlaxoSmithKline, Janssen Ortho, Lundbeck, Ontario Brain Institute, and St. Jude Medical; he is a consultant to AstraZeneca, Boehringer Ingelheim, Bristol-Myers Squibb, Eli Lilly, Lundbeck, Pfizer, Servier, and St. Jude Medical. Dr. Mergl reports a consultancy agreement with Nycomed, a Takeda company. Dr. Mohr is supported by NIMH grants R01 MH100482, R01 MH095753, P20 MH090318, and R34 MH095907 and reports a consulting relationship with Otsuka Pharmaceuticals in the last 3 years. Dr. Rush has received consulting fees from Brain Resource Ltd., Duke–National University of Singapore, Eli Lilly, Emmes, Lundbeck, Medavante, Montana State University, the National Institute on Drug Abuse, Santium, Stanford University, Takeda, the University of Colorado, and the University of Texas Southwestern Medical Center at Dallas; speaking fees from the University of California at San Diego, Hershey Penn State Medical Center, New York State Psychiatric Institute, and the American Society for Clinical Psychopharmacology; royalties from Guilford Publications and the University of Texas Southwestern Medical Center; a travel grant from the International College of Neuropsychopharmacology; and research support from Duke–National University of Singapore; through the University of Texas Southwestern Medical Center, he has a potential financial interest in the Inventory of Depressive Symptomatology and several variations of it. Dr. Segal receives royalties from Guilford Press for books on mindfulness-based cognitive therapy and receives payments for training workshops and presentations related to this approach; he is also a member of the scientific advisory board for Mindful Noggin, which is part of NogginLabs, a private company specializing in customized web-based learning. Dr. Cuijpers has received royalties from Servier, Atheneum, and HB Publishers; speaking fees from the University of Trier, Vanderbilt University, the VGCt, and the NVGRT; and grant support from ZonMw, the European Commission, and the NutsOhra Foundation. The other authors report no financial relationships with commercial interests.

References

1 Cuijpers P, Berking M, Andersson G, et al.: A meta-analysis of cognitive-behavioural therapy for adult depression, alone and in comparison with other treatments. Can J Psychiatry 2013; 58:376–385Crossref, MedlineGoogle Scholar

2 Iovieno N, Tedeschini E, Ameral VE, et al.: Antidepressants for major depressive disorder in patients with a co-morbid axis-III disorder: a meta-analysis of patient characteristics and placebo response rates in randomized controlled trials. Int Clin Psychopharmacol 2011; 26:69–74Crossref, MedlineGoogle Scholar

3 Cuijpers P, Karyotaki E, Weitz E, et al.: The effects of psychotherapies for major depression in adults on remission, recovery and improvement: a meta-analysis. J Affect Disord 2014; 159:118–126Crossref, MedlineGoogle Scholar

4 Cuijpers P, Turner EH, Mohr DC, et al.: Comparison of psychotherapies for adult depression to pill placebo control groups: a meta-analysis. Psychol Med 2014; 44:685–695Crossref, MedlineGoogle Scholar

5 Dunlop BW, Thase ME, Wun CC, et al.: A meta-analysis of factors impacting detection of antidepressant efficacy in clinical trials: the importance of academic sites. Neuropsychopharmacology 2012; 37:2830–2836Crossref, MedlineGoogle Scholar

6 Dimidjian S, Hollon SD: How would we know if psychotherapy were harmful? Am Psychol 2010; 65:21–33Crossref, MedlineGoogle Scholar

7 Hamilton M: A rating scale for depression. J Neurol Neurosurg Psychiatry 1960; 23:56–62Crossref, MedlineGoogle Scholar

8 Zimmerman M, Posternak MA, Chelminski I: Is the cutoff to define remission on the Hamilton Rating Scale for Depression too high? J Nerv Ment Dis 2005; 193:170–175Crossref, MedlineGoogle Scholar

9 Ogles BM, Lambert MJ, Sawyer JD: Clinical significance of the National Institute of Mental Health Treatment of Depression Collaborative Research Program data. J Consult Clin Psychol 1995; 63:321–326Crossref, MedlineGoogle Scholar

10 Coffman SJ, Martell CR, Dimidjian S, et al.: Extreme nonresponse in cognitive therapy: can behavioral activation succeed where cognitive therapy fails? J Consult Clin Psychol 2007; 75:531–541Crossref, MedlineGoogle Scholar

11 Linden M, Schermuly-Haupt ML: Definition, assessment and rate of psychotherapy side effects. World Psychiatry 2014; 13:306–309Crossref, MedlineGoogle Scholar

12 Lambert MJ: Predicting negative treatment outcome: methods and estimates of accuracy, in Prevention of Treatment Failure: The Use of Measuring, Monitoring, and Feedback in Clinical practice. Washington, DC, American Psychological Association, 2010, pp 83–105CrossrefGoogle Scholar

13 Lambert MJ: The efficacy and effectiveness of psychotherapy, in Bergin and Garfield’s Handbook of Psychotherapy and Behavior Change, 6th ed. Edited by Lambert MJ. New York, Wiley, 2013, pp 169–218Google Scholar

14 Blais MA, Malone JC, Stein MB, et al.: Treatment as usual (TAU) for depression: a comparison of psychotherapy, pharmacotherapy, and combined treatment at a large academic medical center. Psychotherapy (Chic) 2013; 50:110–118Crossref, MedlineGoogle Scholar

15 Koenig AM, Jarrett RB, Gallop R, et al.: Extreme nonresponse to acute phase cognitive therapy for depression: an attempt to replicate and extend. Behav Ther 2014; 45:300–313Crossref, MedlineGoogle Scholar

16 DeRubeis RJ, Cohen ZD, Forand NR, et al.: The Personalized Advantage Index: translating research on prediction into individualized treatment recommendations: a demonstration. PLoS One 2014; 9:e83875Crossref, MedlineGoogle Scholar

17 Jarrett RB, Minhajuddin A, Kangas JL, et al.: Acute phase cognitive therapy for recurrent major depressive disorder: who drops out and how much do patient skills influence response? Behav Res Ther 2013; 51:221–230Crossref, MedlineGoogle Scholar

18 Driessen E, Hollon SD: Cognitive behavioral therapy for mood disorders: efficacy, moderators and mediators. Psychiatr Clin North Am 2010; 33:537–555Crossref, MedlineGoogle Scholar

19 Rush AJ, Wisniewski SR, Warden D, et al.: Selecting among second-step antidepressant medication monotherapies: predictive value of clinical, demographic, or first-step treatment features. Arch Gen Psychiatry 2008; 65:870–880Crossref, MedlineGoogle Scholar

20 Souery D, Oswald P, Massat I, et al.: Clinical factors associated with treatment resistance in major depressive disorder: results from a European multicenter study. J Clin Psychiatry 2007; 68:1062–1070Crossref, MedlineGoogle Scholar

21 Tedlow J, Fava M, Uebelacker L, et al.: Outcome definitions and predictors in depression. Psychother Psychosom 1998; 67:266–270Crossref, MedlineGoogle Scholar

22 Van HL, Schoevers RA, Dekker J: Predicting the outcome of antidepressants and psychotherapy for depression: a qualitative, systematic review. Harv Rev Psychiatry 2008; 16:225–234Crossref, MedlineGoogle Scholar

23 American Psychiatric Association: Diagnostic and Statistical Manual of Mental Disorders, 5th ed. Washington, DC, American Psychiatric Association, 2013CrossrefGoogle Scholar

24 Beck AT, Haigh EA: Advances in cognitive theory and therapy: the generic cognitive model. Annu Rev Clin Psychol 2014; 10:1–24Crossref, MedlineGoogle Scholar

25 Ma Y: Neuropsychological mechanism underlying antidepressant effect: a systematic meta-analysis. Mol Psychiatry 2015; 20:311–319Crossref, MedlineGoogle Scholar

26 Beck AT, Ward CH, Mendelson M, et al.: An inventory for measuring depression. Arch Gen Psychiatry 1961; 4:561–571Crossref, MedlineGoogle Scholar

27 Beck AT, Steer RA, Brown GK: Beck Depression Inventory, Second Edition Manual. San Antonio, Tex, Psychological Corp, 1996Google Scholar

28 Grundy CT, Lambert MJ, Grundy EM: Assessing clinical significance: application to the Hamilton Rating Scale for Depression. J Ment Health 1996; 5:25–33CrossrefGoogle Scholar

29 Seggar LB, Lambert MJ, Hansen NB: Assessing clinical significance: application to the Beck Depression Inventory. Behav Ther 2002; 33:253–269CrossrefGoogle Scholar

30 Vittengl JR, Clark LA, Thase ME, et al.: Nomothetic and idiographic symptom change trajectories in acute-phase cognitive therapy for recurrent depression. J Consult Clin Psychol 2013; 81:615–626Crossref, MedlineGoogle Scholar

31 Cuijpers P, Weitz E, Twisk J, et al.: Gender as predictor and moderator of outcome in cognitive behavior therapy and pharmacotherapy for adult depression: an “individual patient data” meta-analysis. Depress Anxiety 2014; 31:941–951Crossref, MedlineGoogle Scholar

32 Weitz ES, Hollon SD, Twisk J, et al.: Baseline depression severity as moderator of depression outcomes between cognitive behavioral therapy vs pharmacotherapy: an individual patient data meta-analysis. JAMA Psychiatry 2015; 72:1102–1109Crossref, MedlineGoogle Scholar

33 Jarrett RB, Minhajuddin A, Gershenfeld H, et al.: Preventing depressive relapse and recurrence in higher-risk cognitive therapy responders: a randomized trial of continuation phase cognitive therapy, fluoxetine, or matched pill placebo. JAMA Psychiatry 2013; 70:1152–1160Crossref, MedlineGoogle Scholar

34 Boisvert CM, Faust D: Iatrogenic symptoms in psychotherapy: a theoretical exploration of the potential impact of labels, language, and belief systems. Am J Psychother 2002; 56:244–259MedlineGoogle Scholar

35 Dunlop BW, McCabe B, Eudicone JM, et al.: How well do clinicians and patients agree on depression treatment outcomes? implications for personalized medicine. Hum Psychopharmacol 2014; 29:528–536Crossref, MedlineGoogle Scholar

36 Cusin C, Yang H, Yeung A, et al.: Rating scales for depression, in Handbook of Clinical Rating Scales and Assessment in Psychiatry and Mental Health. Edited by Baer L, Blais MA. Totowa, NJ, Humana, 2010, pp 7–35Google Scholar

37 Hollon SD: Cognitive and behavior therapy in the treatment and prevention of depression. Depress Anxiety 2011; 28:263–266Crossref, MedlineGoogle Scholar

38 Simon GE, Perlis RH: Personalized medicine for depression: can we match patients with treatments? Am J Psychiatry 2010; 167:1445–1455LinkGoogle Scholar

39 Renner F, Cuijpers P, Huibers MJ: The effect of psychotherapy for depression on improvements in social functioning: a meta-analysis. Psychol Med 2014; 44:2913–2926Crossref, MedlineGoogle Scholar

40 Mohr DC: Negative outcome in psychotherapy: a critical review. Clin Psychol Sci Pract 1995; 2:1–27CrossrefGoogle Scholar