Introduction

An increasing number of people are facing the burden of obesity, which is defined as a body mass index (BMI) of ≥30 kg/m2.1,2 This worldwide epidemic is a concern to health professionals, because obesity is closely linked to risk factors associated with impaired health, shortened life expectancy,3 and reduced health-related quality of life (HRQoL).4 Our meta-analysis focuses on the impact of obesity on HRQoL. HRQoL is of relevance as an outcome measure in obesity, when treatment options are evaluated in terms of risks and benefits with regard to the health, well-being, and general functioning of the patient. HRQoL may differ among subgroups of obese persons, who seek surgical or non-surgical treatment, or who do not seek treatment for their overweight. Some studies demonstrated greater impairment of HRQoL in people seeking treatment, especially treatment of greater intensity.58 The quantification of HRQoL in obese people seeking and not seeking treatment will indicate whether over and above other possible factors such as weight, age, and gender, the HRQoL differs among persons who seek a specific kind of treatment for obesity. In addition, such a quantification will provide reference data that are useful when evaluating the baseline status of obese individuals who apply for weight-reducing interventions.

The literature of the past 26 years was reviewed in order to examine differences in baseline HRQoL between seekers of surgical treatment, seekers of non-surgical treatment, and non-treatment seekers. We additionally investigated the role of weight, age, and gender in the associations between HRQoL and treatment status.

Materials and Methods

Selection of Studies

This meta-analysis comprises empirical studies in the English, French, German, or Dutch scientific literature. Included were reports of studies with adult, but not elderly or ill, populations, who were overweight (BMI ≥25 kg/m2), obese (BMI ≥30 kg/m2), or morbidly obese (BMI ≥40 kg/m2) and who were seeking or not-seeking treatment for their weight. Non-empirical studies (dissertations, reviews, and books) were excluded. To be included in the metaanalysis, pre-treatment descriptive statistics of the HRQoL (mean, SD) had to be available in the identified research reports or obtainable from the authors. We limited our search to the frequently used ‘Impact of Weight on Quality of Life-Lite’ (IWQOL-Lite) and ‘Short form-36’ (SF-36) questionnaires. The search strategy for identification of relevant literature was carried out in three phases. Figure A1 (see Appendix) presents the flow diagram. Eligibility was independently determined by two authors (AMAvN, EJMW).

Phase 1. The first phase determined which generic and obesity-specific instruments had been used to assess HRQoL in obese populations. The PubMed and PsycINFO databases were systematically searched from 1980 until April 2006 with the following key words (quality of life) AND (overweight OR obesity). This yielded 1,071 titles from PubMed and 170 titles from PsycINFO. After exclusion of studies with children, elderly, and disease groups as well as non-empirical studies, 432 titles resulted. The abstracts were evaluated to determine whether the article was about HRQoL as related to seeking or not-seeking treatment for overweight or obesity. The remaining 150 abstracts included a wide range of instruments. Only studies that applied the frequently used obesity-specific IWQOLLite questionnaire (18 articles) and the generic SF-36 questionnaire (47 articles) were selected; two articles used both instruments.

Phase 2. The second phase searched additional articles with IWQOL-Lite or SF-36 data for overweight, obese, or morbidly obese persons. The databases Web of Science, PubMed, and PsycINFO were searched until April 2006 with the following search strategy: (Impact of Weight on Quality of Life-Lite OR IWQOL-Lite OR Medical Outcome Survey Short-Form OR Short-Form 36 OR SF-36 OR Rand-36) AND (overweight OR obesity). The search resulted in one additional article that used the IWQOL-Lite and another 14 studies applying the SF-36.

Phase 3. The 82 full text articles of the abstracts identified in phase 2 were scrutinized. The aim of this third phase was to identify articles with the needed descriptive statistics (mean, SD). The authors of 24 studies have sent missing statistics upon request. When more articles of the same author(s) were found, the data were checked for duplications. In case of overlapping data sets, we asked the authors which data were the most recent and complete. This last phase left 54 articles (8 IWQOL-Lite, 44 SF-36, 2 IWQOL-Lite and SF-36) for metaanalysis. All articles were published after 1996.

Instruments

Impact of Weight on Quality of Life-Lite (IWQOLLite). The 31 items of the IWQOL-Lite assess the impact of weight on quality of life in five areas (physical function, self-esteem, sexual life, public distress, work) and additionally yields a total score.5 The five response categories range from “never true” to “always true”. The IWQOL-Lite has adequate psychometric properties: Cronbach’s alphas range from .90 to .94 for the scales and is .96 for the total scale.9 The test-retest stability coefficients range from .81 to .88 for the scales and is .94 for the total scale.10 The validity of the instrument is supported by findings such as sensitivity to weight loss11 and the results of confirmatory factor analysis. 9 The meta-analysis uses transformed scores ranging from 0 to 100, with 100 representing the best and 0 the worst quality of life. A change of 7.7–12 points (depending on baseline severity) on the IWQOL-Lite total transformed score represents a clinically meaningful change.12

Medical Outcomes Study SF-36 Health Status Survey (SF-36). The SF-36 is a 36-item generic questionnaire measuring subjective health status.13 It comprises eight domains of functioning: 1) physical functioning, 2) role limitations due to physical problems, 3) bodily pain, 4) general health, 5) vitality, 6) social functioning, 7) role limitations due to emotional problems, and 8) mental health. Transformed scores range from 0 (poor health) to 100 (good health). The SF-36 has adequate psychometric characteristics, including good construct validity, high internal consistency, and high test-retest stability.13 A population difference of ≥5 points on any scale is considered clinically significant.13

Data Extraction

Five populations were distinguished: 1) the general population, 2) general obese people, 3) non-treatmentseeking obese people, 4) conservative treatment-seeking obese patients, and 5) surgical treatment-seeking obese patients. All populations consisted of several groups, with exception of the non-treatment population, which comprised a single group in both the IWQOL-Lite7 and the SF-368 meta-analysis. Studies recruiting participants from a community sample were considered to belong to the ‘general population’; some of these studies identified groups who were obese or morbidly obese.1417 Participants who were recruited from the general population specifically because of their obesity were considered to be part of the ‘general obese population’. This ‘general obese population’ differs from the ‘non-treatment-seeking population’ in the sense that non-treatment-seeking persons are known to intentionally choose not to be treated for their obesity. Identified groups in the selected articles were assigned to a population following the recruitment criteria of the original study. The following data were extracted from the selected studies: BMI, age, gender, type of population, and means and standard deviations of the HRQoL variables.

Statistical Analysis

For each population the weighted means of BMI, age, the percentage of women, and HRQoL variables were computed. Inverse variance weights (the sample size divided by the variance) were used as weighing procedure. The METAF.SPS macro18 compared the weighted means of the populations by meta-analytic analog to analysis of variance. We specified the random effects model using the method-of-moments plug in estimate of the METAF.SPS macro for the between-study variance.

To examine the influence of person characteristics on HRQoL, weighted multiple regression using the METAREG.SPS macro (random effects model)18 examined differences between populations before and after adjustment for BMI, age, and gender, respectively. For each analysis, the dichotomized population indicators, i.e., belonging or not belonging to the general obese, non-treatment-seeking, conservative treatment-seeking, or surgical treatment-seeking population, were entered in regression analysis. The BMI, age, and the proportion of female participants were entered in separate analyses to examine the effects of these variables on quality of life. To graphically display the magnitude of differences between populations, effect sizes (statistic d) were computed. These statistics express the deviation from the norm group in standard deviation units.19 Effect size values between 0.2 and 0.5, between 0.5 and 0.8, and >0.8 reflect small, moderate, and large deviations, respectively.19

All statistical analyses were performed with SPSS version 11.0 (SPSS Inc, Chicago, IL).

Results

Impact of Weight on Quality of Life-Lite (IWQOL-Lite)

Comparison between Groups. The literature search yielded 27 groups from 11 studies, including over 6,000 individuals: five groups from the general population, 14,20 one non-treatment-seeking group,7 14 conservative treatment-seeking groups,5,14,2123 and seven surgical treatment-seeking groups.5,7,2426

No studies of the general obese population were found. Table A1 (see Appendix) shows frequencies and weighted means of the groups. The general, nontreatment, conservative treatment and surgical treatment populations differed significantly from each other with respect to BMI (P<.001) and age (P=.02), but not gender (P=.51). The weighted mean BMI was highest in the surgical treatment population (51 kg/m2), followed by the non-treatment-seeking population (44 kg/m2), the conservative treatment population (37 kg/m2), and the general population (29 kg/m2). The weighted mean age of the non-treatment population (49 yrs) was high as compared to the surgical treatment population (41 yrs), conservative treatment population (40 yrs) and the general population (37 yrs). All populations included considerably more women than men; the percentages of women varied between 68% for the general population and 83% for the surgical treatment population. The four populations differed significantly from each other on all IWQOL-Lite scales (P<.001).

Figure 1 shows the mean deviations from the norm in standard deviation units, effect size d.19 The HRQoL of the general population reflected a moderate to small (−0.8 < d < −0.3) deviation from the norm. The non-treatment and the conservative treatment populations had an intermediate position between the general population and the surgical treatment-seeking population. Mean deviations from the norm were large for the non-treatment-seeking population (−3.3 < d < −2.4) as well as for the conservative treatment population (−3.0 < d < −1.6). The surgical treatment-seeking population (−5.5 < d < −2.8) demonstrated very severely reduced HRQoL scores on all scales.

Figure 1
figure 1

Mean deviation from the norm group in standard deviation units on the IWQOL-Lite for four populations. PF: Physical function, SE: Self-esteem, SL: Sexual life, PD: Public distress, WO:Work, TT: Total.

Adjustment for Weight, Age, and Gender. Table 1 shows the unadjusted mean IWQOL-Lite scores of the populations as well as these scores after adjustment for BMI, age, and gender, respectively. The significance levels with the unadjusted means show that the non-treatment, conservative treatment, and surgical treatment populations have a significantly reduced HRQoL on all dimensions as compared to the other populations. The HRQoL differences between populations fully disappeared after adjustment for BMI. After this adjustment, the surgical treatment population even obtained the best score on public distress. This is probably due to over-correction as a consequence of the very high correlation between BMI and public distress (r = −.90) in this meta-analysis. Adjustment for age and gender did hardly influence the HRQoL scores.

Table 1 IWQOL-Lite quality of life scores of four populations: unadjusted weighted means and estimated weighted means after adjustment for BMI, age, and gender

Medical Outcomes Study SF-36 Health Status Survey (SF-36)

Comparison between Groups. For the SF-36, 88 groups from 46 studies were analyzed, involving nearly 88,000 individuals: 35 groups from the general population,1517,24,2734 7 groups from the general obese population,3538 1 non-treatment-seeking group,8 25 groups from the conservative treatmentseeking population,8,30,3552 and 20 groups from the surgical treatment-seeking population.24,25,5366

Table A2 (Appendix) shows frequencies and weighted means of the groups. The populations differed from each other with respect to BMI (P<.001) and gender (P=.05), but not age (P=.70). Concerning BMI, non-treatment (33 kg/m2), general obese (35 kg/m2), and conservative treatment (36 kg/m2) populations had an intermediate position between the morbidly obese surgical population (47 kg/m2) and the general population (28 kg/m2). The weighted mean age of the populations varied between 36 years for the non-treatment population and 44 years for the conservative treatment population. All populations included more women than men. The percentages of women varied between 54% for the general obese population and 86% for the surgical treatment population. The five populations differed significantly from each other on all SF-36 scales (P<.001).

Figure 2 shows the deviations from the norm13 in standard deviation units. HRQoL of the general population was about equal to the norm group (−0.1 < d < 0.1). The non-treatment-seeking population (−0.3 < d < 0.0), the general obese population (−0.5 < d < −0.2), and the conservative treatment population (−0.4 < d < −0.2) showed zero to moderate deviations from the norm. The surgical treatment population showed large to moderate deviations from the norm (−1.6 < d < −0.5).

Figure 2
figure 2

Mean deviation from the norm group in standard deviation units on the SF-36 for five populations. PF: Physical functioning, PR: Physical Role — role limitations due to physical problems, PA: Bodily pain, GH: General health, VI: Vitality, SF: Social functioning, ER: Emotional role — Role limitations due to emotional problems, MH: Mental health.

Adjustment for Weight, Age, and Gender. Table 2 represents the unadjusted mean SF-36 scores of the populations as well as these scores after adjustment for BMI, age, and gender, respectively. The significance levels with the unadjusted means demonstrate that the conservative treatment and surgical treatment populations report a highly significant reduced HRQoL as compared to the other populations. With a few exceptions, after adjustment for BMI, differences between populations disappeared for the non-treatment, conservative treatment, and general obese populations. The exceptions concern the two mental health scales: role limitations due to emotional problems (ER) and Mental Health (MH), which remained low after weight was taken into account. After adjustment for BMI, the surgical treatment population as compared to the other populations still demonstrated a relatively low HRQoL on 5 of the 8 scales. Adjustment for age and gender did not affect the SF-36 HRQoL scores to a large extent.

Table 2 SF-36 quality of life scores of five populations: unadjusted weighted means and estimated weighted means after adjustment for BMI, age and gender

Discussion

This meta-analysis is the first that summarizes and analyzes HRQoL in diverse obese populations. Studies in well-defined samples provided the data. The strengths of our study are the large sample sizes with the non-treatment-seeking population as the only exception, the geographical diversity of the groups with North and South American, European, Asian, and Australian studies included, and the use of two well-established and validated HRQoL measures. These strengths contribute to the generalizability of the findings. In agreement with several studies, 29,47,54,55 it was shown that obese persons experience a poorer HRQoL compared to the general population. In particular, those seeking surgical treatment reported by far the most severely reduced HRQoL. The results obtained with the two instruments are discussed separately, because of the different results after correction for body weight. IWQOL-Lite quality of life scores of obese populations deviated very much from scores of the norm group. However, these differences disappeared after adjustment for body weight, suggesting that body weight is a main determinant of HRQoL as assessed with this instrument. Our observations suggest that IWQOL-Lite scores will improve after successful weight reduction, as has been reported in two studies.11,20 These previous analyses and our findings indicate the usefulness of the IWQOL-Lite when one aims to explicate or evaluate weight-dependent HRQoL.

In contrast to the IWQOL-Lite, the SF-36 questionnaire suggested a less extreme deviation from the norm for obese populations. Not surprisingly,66 the surgical treatment-seeking obese population demonstrated a large deviation from the norm on virtually all aspects of HRQoL. The reduction in HRQoL for the other obese populations tended to be zero to moderate. Covariance analysis suggested that HRQoL as assessed with this generic instrument is only partly dependent on differences in body weight. In the surgical population, the reduced quality of life on five of the eight dimensions was not solely explained by weight. Also, the reduced scores on the two mental health scales in the general obese population and the conservative treatment population were not explained by weight alone. These findings, therefore, suggest that other factors than weight alone affect the quality of life of obese persons as measured by the SF-36. This result emphasizes the validity of the SF-36 as a partly weight-independent outcome measure for general quality of life.

The current SF-36 findings suggest that obese persons experience limitations in their daily life and work due to emotional problems that are not fully explained by the magnitude of excess weight. An implication of this finding is that weight reduction alone will not suffice when attempting to positively affect mental health of these individuals. A subgroup of obese persons may need specific attention for emotional problems.

In contrast to expectation,57 HRQoL as assessed by the IWQOL-lite failed to differ between the populations after adjustment for weight. Also, contrary to expectation, the general obese population and the conservative treatment-seeking population had virtually similar SF-36 scores. Only in the surgical treatment population, reduced physical and role functioning was observed that was not fully explained by weight. This may reflect the impact of co-morbid cardiovascular or joint problems that could be an additional reason to choose for or to be referred to surgical treatment. Overall, our analyses suggest that only in the surgical treatment population, reduced physical functioning is a reason for seeking treatment over and above weight and weight-related quality of life.

Besides weight, age and gender were included as variables in the meta-analysis. Previous studies suggested that obese persons who seek treatment have a higher weight, are older, and are more often female than obese persons not seeking treatment.7,8 Our results consistently confirmed that the weight of the surgical treatment population is higher than the weight of the other populations. No clear findings emerged with respect to age. Although epidemiological studies suggest that there is hardly a sex difference in obesity,1,67 all included populations, and most of all the surgical treatment-seeking population, included more women than men. Thus, especially females who are on average between 40–50 years old are more inclined to participate in research and to seek treatment for their obesity.

A weakness of the present study is that the general obese population is likely to be a rather heterogeneous population including not only persons that do or do not intend to seek treatment for their obesity, but also individuals who choose their own diets or alternative treatments. Another weakness of metaanalytic techniques is that rather homogeneous group means of age and gender are used, whereas multiple regression analysis uses the full range of these variables. Therefore, the possibility that age and gender affect HRQoL is not definitively refuted by our findings. A further major weakness of our study and this specific field in general is that only two studies have investigated the intentionally nontreatment-seeking population.7,8 The small sample size in these studies hampers the generalizability of these findings. Future studies should focus on the non-treatment population, including the large group of non-treatment seeking men, with the aim to examine and analyze the factors which account for their reluctance to seek professional help to reduce weight and to improve health and HRQoL.

In conclusion, both the IWQOL-Lite and the SF-36 findings demonstrate a reduced HRQoL for the obese population, especially for the morbidly obese population seeking surgical treatment. The IWQOL-Lite questionnaire predominantly reveals weight-related quality of life, whereas the SF-36 apparently assesses generic quality of life that is also determined by other factors than weight. Reductions in mental health could not be explained in terms of the magnitude of weight excess alone. This meta-analysis provides reference values that are useful when explaining or evaluating obesityspecific (IWQOL-Lite) and generic (SF-36) health-related quality of life, weight, and demographic characteristics of obese persons seeking or not seeking surgical or non-surgical treatment.