Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Developing the Breast Utility Instrument, a preference-based instrument to measure health-related quality of life in women with breast cancer: Confirmatory factor analysis of the EORTC QLQ-C30 and BR45 to establish dimensions

  • Teresa C. O. Tsui ,

    Roles Conceptualization, Data curation, Formal analysis, Funding acquisition, Investigation, Methodology, Project administration, Software, Validation, Visualization, Writing – original draft, Writing – review & editing

    teresa.tsui@utoronto.ca

    Affiliations Toronto Health Economics and Technology Assessment (THETA) Collaborative, University Health Network, Toronto, Ontario, Canada, Graduate Department of Pharmaceutical Sciences, Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, Ontario, Canada

  • Maureen Trudeau,

    Roles Conceptualization, Funding acquisition, Investigation, Resources, Supervision, Validation, Writing – review & editing

    Affiliation Odette Cancer Centre, Sunnybrook Health Sciences Centre, Toronto, Ontario, Canada

  • Nicholas Mitsakakis,

    Roles Conceptualization, Data curation, Methodology, Software, Supervision, Writing – review & editing

    Affiliations Children’s Hospital of Eastern Ontario, Ottawa, Ontario, Canada, Division of Biostatistics, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada

  • Sofia Torres,

    Roles Conceptualization, Data curation, Resources, Validation, Writing – review & editing

    Affiliation Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada

  • Karen E. Bremner,

    Roles Data curation, Methodology, Project administration, Writing – review & editing

    Affiliation Toronto Health Economics and Technology Assessment (THETA) Collaborative, University Health Network, Toronto, Ontario, Canada

  • Doyoung Kim,

    Roles Data curation, Project administration, Resources

    Affiliation Department of Pharmacology and Toxicology, Temerty Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada

  • Aileen M. Davis ,

    Roles Formal analysis, Investigation, Methodology, Supervision, Writing – review & editing

    ‡ AMD and MDK are joint senior authors on this work.

    Affiliation Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada

  • Murray D. Krahn

    Roles Conceptualization, Funding acquisition, Investigation, Methodology, Resources, Supervision, Writing – review & editing

    ‡ AMD and MDK are joint senior authors on this work.

    Affiliations Toronto Health Economics and Technology Assessment (THETA) Collaborative, University Health Network, Toronto, Ontario, Canada, Graduate Department of Pharmaceutical Sciences, Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, Ontario, Canada, Institute of Health Policy, Management and Evaluation, Dalla Lana School of Public Health, University of Toronto, Toronto, Ontario, Canada

Abstract

Objectives

Breast cancer (BrC) and its treatments impair health-related quality of life (HRQoL). Utility is a measure of HRQoL that includes preferences for health outcomes, used in treatment decision-making. Generic preference-based instruments lack BrC-specific concerns, indicating the need for a BrC-specific preference-based instrument. Our objective was to determine dimensions of the European Organisation for Research and Treatment of Cancer (EORTC) general cancer (QLQ-C30) and breast module (BR45) instruments, the first step in our development of the novel Breast Utility Instrument (BUI).

Methods

Patients (n = 408) attending outpatient BrC clinics at an urban cancer centre, and representing a spectrum of BrC health states, completed the QLQ-C30 and BR45. We performed confirmatory factor analysis of the combined QLQ-C30 and BR45 using mean-and variance-adjusted unweighted least squares estimation. The hypothesized factor model was based on clinical relevance, item distributions, missing data, item-importance, and internal reliability of dimensions. Models were evaluated based on global and item fit, local areas of strain, and likelihood ratio tests of nested models.

Results

Our final model had 10 dimensions: physical and role functioning, emotional functioning, social functioning, body image, pain, fatigue, systemic therapy side effects, sexual functioning and enjoyment, arm and breast symptoms, and endocrine therapy symptoms. Good overall model fit was achieved: χ2/df: 1.45, Tucker-Lewis index: 0.946, comparative fit index: 0.951, standardized root-mean-square residual: 0.069, root-mean-square error of approximation: 0.033 (0.030–0.037). All items had salient factor loadings (λ>0.4, p<0.001).

Conclusions

We identified important BrC HRQoL dimensions to develop the BUI, a BrC-specific preference-based instrument.

Introduction

Breast cancer (BrC) is the most common cancer, diagnosed in one in eight women during her lifetime [1], with one of the highest per-patient health system costs [2]. Treatments have increased progression-free and overall survival [35], however, health-related quality of life (HRQoL) is another important outcome in BrC [6]. Health utility is a preference-based measure of HRQoL, anchored at 0 (dead) and 1 (full health). Utility multiplied by length of life produces quality adjusted life years (QALYs), a key outcome in cost-utility analyses [7].

Existing methods to measure health utilities in BrC have limitations. Notably, generic (e.g. EQ-5D) [8], and general cancer (e.g., e.g., QLU-C10D, EORTC-8D) [9] utility instruments lack construct validity in key BrC-specific dimensions such as arm and breast symptoms, endocrine therapy symptoms, and endocrine sexual symptoms. A BrC-specific preference-based instrument may discriminate better among different BrC health states and be more responsive to mild disease-specific changes in BrC HRQoL than generic instruments [1012], allowing cost-utility analyses to integrate data derived from more comprehensive, and more valid, health utility measurement [11]. Therefore, our overall objective is to develop the novel Breast Utility Instrument (BUI), a BrC-specific preference-based instrument.

Our overall research program aims to develop and validate the EORTC-derived BrC-specific preference-based instrument, the Breast Utility Instrument (BUI). Novel preference-based instruments are frequently derived from existing psychometric instruments which contain key disease-specific dimensions [12, 13]. Building on Brazier et al’s stages of deriving a preference-based HRQoL instrument (12), we developed a 17-step framework, spanning four phases of instrument development: i) develop initial questionnaire items, ii) establish dimension structure, iii) reduce items per dimension, iv) value and model health state utilities (unpublished). The specific objective of this study was to identify dimensions that might be used in developing the BUI, a BrC-specific utility instrument by performing confirmatory factor analysis on the European Organisation for Research and Treatment of Cancer (EORTC) general cancer (QLQ-C30) and breast-specific module (BR45) instruments [14].

Methods

EORTC QLQ-C30 and BR45 instruments

The QLQ-C30 version 3 developed in 1993 [15] is a 30-item general cancer HRQoL patient-reported instrument with subscales representing functioning (physical, role, emotional, cognitive, social), symptoms (fatigue, nausea and vomiting, pain, dyspnea, insomnia, appetite loss, constipation, diarrhea, financial difficulties), and global health items. Each subscale has multiple items, except for six single-item symptom subscales. Each item has four response categories from 1 “not at all” to 4 “very much”. The global health items are rated from 1 “very poor” to 7 “excellent” [15].

The QLQ-C30 has demonstrated measurement properties in a range of cancers including breast cancer [16]. It has an established factor structure (construct validity) consistent with the original development population in lung cancer. Its internal consistency assessed by Cronbach’s alpha was >0.7 for all subscales except for role functioning and cognitive functioning where alpha was <0.70. Discrimination between local-regional and metastatic BrC was demonstrated in 6/9 subscales at pre-treatment (p<0.002) and in 4/9 subscales (p<0.002) 8 days after chemotherapy [16]. Comparing local-regional and metastatic BrC, subscales without significant difference in mean scores pre-treatment were: emotional functioning, cognitive functioning, and nausea / vomiting, and subscales without significant differences 8 days after chemotherapy were: emotional functioning, social functioning, cognitive functioning, nausea and vomiting, and fatigue [16]. The QLQ-C30 has established patient-observer agreement with a median kappa = 0.5 (range: 0.49–1.00) in patients with breast and gynecological cancers [17].

The BR45 is a BrC-specific module [14], updated in 2020 from the BR-23 originally developed in 1996 [18] with new (italicized) functioning and symptom scales to reflect current treatments [14]. The BR45 has five functioning sub-scales (body image, future perspective, sexual functioning, sexual enjoyment, breast satisfaction), and seven symptom subscales (arm, breast, endocrine therapy, skin mucositis, endocrine sexual symptoms, systemic therapy side effects, and upset by hair loss). It also has three open-ended items to capture additional symptoms or problems not addressed by the previous items. All BR45 items have the same four response options as the QLQ-C30 [14].

The developers of the BR45 pre-tested the breast module to evaluate the importance, comprehensibility, and acceptability of its questionnaire items (face validity and feasibility) [14]. The BR45 has also established preliminary psychometric properties, where all subscales have acceptable internal consistency (Cronbach’s alpha > 0.7), and the three new symptom subscales and new satisfaction subscale had no strong correlation with the existing BR23 subscales [14].

Participants and procedures

Patients.

Between September 2018 and August 2019, a cross-sectional sample of 1,536 patients diagnosed with invasive BrC were screened using appointment lists of six medical oncologists’ clinics and an electronic chart review at an urban hospital-based outpatient breast cancer centre. We identified 1,260 potentially eligible patients who were approached in clinic (S1 Fig). Of the 703 patients with BrC who provided written informed consent, 275 did not return QLQ-C30 and BR45 questionnaires after two reminders. Amongst the 428 patients who returned their questionnaires, seven were found to be ineligible. Thirteen patients who answered fewer than 50% of the questionnaire items were excluded. Thus, 408 patients were included in the study (S1 Fig).

Patients were excluded if they had non-invasive BrC, anther primary cancer within the prior five years, or did not understand English and did not have a translator.

Patients were stratified into one of five a priori mutually-exclusive health states, to ensure that our sample included patients from the spectrum of BrC [19, 20]:

I: first year after diagnosis of primary BrC;

R: first year after date of local recurrence, or new primary BrC;

II-V: second to fifth year after primary BrC or local recurrence treated with curative intent;

VI+: sixth and following years after a primary BrC or local recurrence treated with curative intent;

M: metastatic BrC.

Lidgren found that EQ-5D and TTO utility instruments differentiated between most health states except for between first year of local recurrence and second year and following years after primary or local recurrent BrC [19]. We adopted Torres et al.’s (unpublished) VI+ health state to account for recent guidelines recommending adjuvant endocrine therapy for up to 10 years for women with hormone receptor positive BrC [21].

A subset of patients with BrC (n = 81) rated the importance of all items in QLQ-C30 and BR45 on a five-point scale (0—not applicable, 5 –very important).

Clinicians.

Thirteen clinicians working with women with BrC rated the importance of QLC-C30 and BR45 as applicable to patients on a five-point scale (0—not applicable, 5 –very important) using a secure web-form. Demographic characteristics were not collected from clinicians to protect their anonymity.

Ethics

This study was approved by three research ethics boards (ID): Sunnybrook Health Sciences Centre (1796), University Health Network (18–5350), and the University of Toronto (36324). All participants provided written informed consent.

Statistical analysis—confirmatory factor analysis

Fig 1 shows our CFA process. We considered clinical relevance, item distributions, missing data, item-importance, and internal reliability of items within subscales of the QLQ-C30 and BR45 to create our a priori dimensions of our CFA. We started with King et al’s process of prioritizing dimensions in the QLU-C10D [9] with World Health Organization’s (WHO)’s core health dimensions [22] and cancer-specific dimensions. The WHO dimensions specific to the QLQ-C30 functioning dimensions were: physical, emotional, and social [22]. General cancer and BrC-specific dimensions were agreed by our multidisciplinary research team with expertise in patient outcome measurement, biostatistics, health economics, general internal medicine, and breast medical oncology. General cancer dimensions were: pain, fatigue (energy). BrC-dimensions were: breast and arm symptoms, sexuality, systemic therapy, endocrine therapy, and body image. The set of attributes used to develop a preference-based instrument should be both comprehensive (contain sufficient and clinically-relevant factors) and parsimonious (have a limited number of factors to minimize cognitive burden [23] for those estimating the multi-attribute utility function of the future BUI).

Since there were two sex-related dimensions, patient-rated item-importance ratings were ranked and prioritized over clinician-rated item-importance ratings to select the most important dimension to patients.

To construct the measurement model which consists of observed variables (items) and unobserved variables (dimensions), at least two items are needed to estimate each dimension, therefore, only multiple-item dimensions were included in the CFA [24]. We also removed global HRQoL items, because they are not related to particular dimensions of HRQoL [25, 26].

We conducted preliminary analyses prior to the CFA. First, the item response distributions of the QLQ-C30 and BR45 over BrC health states were visualized using stacked bar plots (S2 and S3 Figs). Next, correlations were inspected: inter-item, item-to-dimension (subscale), and inter-dimensional (S4 Fig), to ensure there was a sufficient association between items and dimensions to move forward with CFA. Polychoric correlations were used for ordinal response options of the items [27]. We assessed correlations using the following criteria:

  1. Inter-item correlations of 0.3–0.8, indicating high correlations [18],
  2. Item-to-dimension correlations of >0.4, suggesting convergent validity of items within the same subscale [18],
  3. Inter-dimension correlations of >0.4, supporting convergent validity.

Internal consistency, a measure of reliability, was evaluated using Cronbach’s alpha for items within each hypothesized dimension based on the QLQ-C30 and BR45 scoring manuals. A target value of Cronbach’s alpha >0.7 for group comparisons represented high internal consistency reliability [18, 28].

Preliminary item and dimensional analyses led to a priori dimension combinations or reallocation of an item (S4 Fig). We combined physical and role functioning and arm and breast symptoms based on conceptual overlap and high inter-dimension correlation (0.67 and 0.58, respectively) suggesting convergent validity. The sexual functioning and sexual enjoyment dimensions overlapped in content, and its items all had high inter-item correlations > 0.60, therefore we decided a priori to combine the sexual functioning and sexual enjoyment dimensions. Item 10 (needing to rest) overlapped in content with item 4 (needing to stay in bed or a chair), and both items were highly correlated (0.56), therefore we decided a priori to move item 10 from the Fatigue dimension to the Physical and Role functioning dimension. These combinations and item reallocations were validated by our clinician expert (MT).

Costa et al.’s factor analysis of the QLQ-C30 [25], also combined physical and role functioning due to high inter-dimensional correlation, and moved item 10 to the Physical and Role functioning dimension for the same reasons.

We performed CFA on an a priori 10-dimensional model to ensure a parsimonious set of attributes [23]. We used mean-and variance-adjusted unweighted least squares estimation (ULSMV) for ordinal response variables to obtain more robust model fit and standard errors and higher power, given our sample size [27, 29]. We fitted the CFA models with polychoric correlations because our item-level distributions departed from the normality assumption, and had fewer than five ordinally-scaled response variables [27, 30].

In the baseline factor model, items were specified to load onto one factor, the factors were allowed to correlate freely, and the residuals were uncorrelated, and a standardized solution was obtained.

We evaluated global model fit, saliency of parameters, and local areas of strain [31]. Nested models were compared using χ2 likelihood ratio tests. We also evaluated R2, the proportion of variance in the item response explained by the factor (>0.1) [31].

Global model fit is a descriptive indicator of how well the model reproduces the observed relationships between the indicators, represented by items, in the input matrix [31]. We used five tests to evaluate global model fit:

  • Sartorra Bentler (SB) scaled χ2 statistics were used if high kurtosis statistics suggested the items were not normally-distributed [32]. A non-significant SBχ2, where p> 0.05, is desired. Because the χ2 statistic is sensitive to sample size, a parsimony adjusted test statistic SBχ2/df <2 indicates a good fit [33].
  • Root mean square error of approximation (RMSEA) was used to estimate the discrepancy per degree of freedom between the model implied covariance matrix and the population covariance matrix [27]. The RMSEA includes an adjustment whereby more complex models with greater degrees of freedom are penalized. RMSEA cut-off values for fit are <0.08 (adequate), and <0.05 (good), with an upper limit of the 90% confidence interval <0.08 [34, 35].
  • Standardized root mean square residual (SRMR) is the mean absolute residual correlation, where <0.08 indicates acceptable fit, and <0.06 indicates good fit [34, 35].
  • Comparative fit index (CFI) and Tucker-Lewis Index (TLI) indicate improvement in fit comparing the researcher’s model to the baseline exact fit (χ2) model, where >0.90 is acceptable, and >0.95 is good [34, 35].

Saliency evaluates if the items are associated with the pre-specified factor. We inspected factor loadings (λ) and considered items with λ>0.3 with statistical significance at α = 0.01 (with Bonferroni correction) to be salient to a given factor [36].

Where there was poor global model fit, residuals and modification indices were inspected to identify local areas of strain. Residuals are the sources of unexplained variance in the model [37]. Correlated item residuals mean that there is a common variance that is not accounted for by the initial hypothesized factor structure [37]. This common variance can occur when item content overlaps, leading to suboptimal model fit [38, 39]. A modification index approximates the degree that a model’s χ2 statistic would decrease if a given fixed parameter became freely estimated, analogous to the χ2 difference (with a single degree of freedom) of nested models [31]. Therefore, well-fitted models have small modification indices. A model with local areas of strain would have item pairs in the same dimension with high residuals (>0.4), and high modification indices (>25) [40]. If there was substantial clinical rationale and overlapping item content, we re-specified models to correlate item residuals, and re-assessed residuals and modification indices.

Likelihood ratio tests were also performed to compare nested models for relative model fit. Our clinical expert (MT) checked the content validity and clinical meaningfulness of retained dimensions.

We followed the EORTC scoring manual to calculate subscale scores with missing data [41].

We performed CFA using R v1.2.5001 (http://cran.r-project.org/) using corrplot [42], psych [43], and lavaan packages [44].

Sample size

Patients.

Assuming six factor loadings to each factor, 400 patients were deemed to be sufficient to provide a high level of congruence (K>0.95) between the factors from the sample solution and the population solution [45].

Clinicians.

We aimed to recruit at least 10 clinicians from a range of professions to represent different perspectives.

Results

Patients

Table 1 shows the demographic and clinical characteristics of patients. Patients had a mean (SD) age of 59.1 (11.6) years. The majority (64.2%) were married or in a common-law relationship, with 80% completed at least college education. Most patients were diagnosed in pathological stage 1A (37%) or IIA (25.5%), which is comparable to the incidence of BrC stages in Ontario, Canada [46, 47]. The most common treatment intents were adjuvant (64.2%), palliative (22.5%), and neoadjuvant (6.4%). The most common treatment regimens were endocrine therapy (57.0%), chemotherapy (17.1%), and targeted therapy (16.7%). Most patients were in their second to fifth year health state (31.1%). A higher proportion of patients in our sample had metastatic disease than a development study of the BrC health states (25.2% vs 19.4%) [19].

thumbnail
Table 1. Participant characteristics and comparator population level characteristics.

https://doi.org/10.1371/journal.pone.0262635.t001

The subset of patients (n = 81) who rated item-importance were of comparable age, biomarker status, comorbidity status as all participants (Table 1). The item-importance sample consisted of a smaller percentage than the full sample with a graduate or professional degree (29.6% vs 37.3%), fewer in the metatstatic health state (16.0% vs 25.2%); and, a larger percentage were diagnosed with BrC from 5 to 9 years (34.6% vs 22.3%).

The 13 clinicians who completed importance ratings were five medical oncologists, one radiation oncologist, one surgical oncologist, two medical oncology fellows, two nurses, one physician assistant, and one social worker, predominantly representing the medical oncology clinical staff.

The item response distributions by subscale are shown in S2 and S3 Figs. Table 2 describes subscale scores on the QLQ-C30 and BR45. Our patients’ QLQ-C30 subscale scores were between the EORTC reference values of patients with early stage and metastatic BrC [49]. Reference values for BR45 scores are not yet available. Cronbach’s alpha for most subscales were greater than our cut-off of 0.70 (Table 2). Based on consultation with our clinical expert (MT), the factors with the lowest alpha were removed (e.g., nausea and vomiting, α = 0.41), or kept in the CFA model because of clinical significance (e.g., systemic therapy side effects, α = 0.69).

thumbnail
Table 2. QLQ-C30 and BR45 subscale and internal consistency scores.

https://doi.org/10.1371/journal.pone.0262635.t002

Missing responses, and removal of dimensions or items

Only raw scores were used in the CFA. If a patient completed at least 50% of the items in the dimension, the missing item(s) were imputed as the mean of the scale items the patient answered to calculate the QLQ-C30 and BR45 scores (Table 2) [41]. Most items had less than 2% missing, except for the sex-related items, which 17–51% of patients omitted (S1 Table).

Dimension and item importance rated by patients and clinicians

The mean importance ratings by patients and clinicians are shown in the Table 3. In 15/26 dimensions, clinician ratings were significantly higher than patients (p<0.05), otherwise, ratings were similar between the two groups. The sexual functioning dimension was rated significantly higher by patients than clinicians (4.22 vs 3.59, p 0.002). The three sex-related dimensions had mean patient-rated dimensional ratings of 4.22, 4.03, 3.94, for sexual functioning, sexual enjoyment, and endocrine sexual symptoms, respectively. These patient-rated importance ratings supported our a priori retention of the combined sexual functioning and sexual enjoyment dimension.

thumbnail
Table 3. Mean dimension importance rated by patients and clinicians.

https://doi.org/10.1371/journal.pone.0262635.t003

Given that 30% of items were rated scores 0 (not applicable), 1 (slight) and 2 (mild), we removed these scores prior to analysis to de-emphasize mild but frequent aspects of HRQoL [50].

Confirmatory factor analysis

Table 4 shows the a priori model. It includes items from the following dimensions of the QLQ-C30: physical and role functioning, pain, fatigue, emotional functioning, social functioning; and, from dimensions of the BR45: systemic therapy side effects, body image, sexual functioning and enjoyment, arm and breast symptoms, and endocrine therapy symptoms.

thumbnail
Table 4. A priori factor model–factors and item summary.

https://doi.org/10.1371/journal.pone.0262635.t004

The results from our CFA are presented in Table 5, showing a summary of robust fit indices after re-specifying the original model and after applying residual correlations. The baseline model had inadequate global model fit (Model A), and local areas of strain represented by highly correlated residuals (>0.3) and high modification indices (> 25). After re-specifications of the model, we obtained adequate global model fit (Model C). Modifications to the baseline model involved moving three items to different dimensions (Model B), and applying three pairs of residual correlations to reduce local areas of strain (Model C).

The modification indices supported high error covariances between item pairs and between several items and specific dimensions. The three largest and most significant modification indices (MI) between item pairs (> 25) [40] were consistently present in tested models. These same item pairs also exhibited high residual correlations (>0.3): PF2 (long walk) and PF3 (short walk); ET68 (gained weight) and ET69 (weight gain has been a problem); SYS37 (hot flushes) and ET54 (sweated excessively). These three item pairs with correlated residuals involved similar functional limitations and were within the same dimension. To reduce the high MIs between items and dimensions, SYS37 (hot flushes) was moved from the Systemic Therapy Side Effects dimension to the Endocrine Therapy dimension; ET55 (mood swings) was moved from the Endocrine Therapy Symptoms dimension to the Emotional Functioning dimension; and ET56 (dizziness) was moved from the Endocrine Therapy Symptoms dimension to the Fatigue dimension. These item re-assignments were approved by our clinical expert (MT). After re-assigning these items to the aforementioned dimensions, there were no high MIs.

Nested models were compared in likelihood ratio tests (model B vs A; model C vs B) (Table 5). The fit indices of the refined model with three correlated item residuals demonstrated significant improvements in fit compared with the a priori model. The re-specified model with SYS37 (hot flushes), ET55 (mood swings), and ET56 (dizziness) reassigned to Endocrine Therapy, Emotional Functioning and Fatigue dimensions, respectively (model B) demonstrated improved fit over the model with all items in their original dimensions (model A). Furthermore, model C performed significantly better than model B (ΔSBχ2 = 225.62, p <0.001).

Considering our goal of identifying the most parsimonious and best-fitting model, we chose model C as our final model. Fig 2 shows the final CFA of model C with standardized factor loadings. The factor loadings and proportions of variance explained by each item are presented in Table 6. This shows that all items have a factor loading >0.4 (all p<0.001). Items had an R2 ranging from 0.167 to 0.923, which are acceptable (>0.1).

thumbnail
Fig 2. Diagram of final ten-dimension CFA model (model C).

https://doi.org/10.1371/journal.pone.0262635.g002

thumbnail
Table 6. Final model factor loadings and proportion of variance of the item responses explained by the specific factor.

https://doi.org/10.1371/journal.pone.0262635.t006

For ease of visualization, the QLQ-C30 and BR45 dimensions are shown separately even though all ten dimensions were fitted in one CFA model. Ovals represent factors; boxes represent items; arrows with numbers represent factor loadings between factors, circles represent residuals (error term).

Residual correlations are shown within the Physical & Role Functioning dimension (between PF2 and PF3) and Endocrine Therapy dimension (between SYS37 and ET54, and ET68 and ET69).

Discussion

This is the first CFA model of the EORTC QLQ-C30 and its novel BrC module, BR45, also the first key step of developing the Breast Utility Instrument (BUI). Our contributions to the literature lie in our methodological approach to developing a novel BrC-specific preference-based instrument, the BUI. While a CFA begins with a strong hypothesis about the dimensional structure, methodologists still recommend testing competing models [32]. Similarly, since our a priori model did not have good model fit, after minor modification to the specification of three items to different dimensions, and applying three residual correlations, we were able to define a model with acceptable fit of BrC-related HRQoL.

The final model fit the a priori dimensions including the WHO’s conceptualization of health, and key BrC-specific dimensions from the EORTC QLQ-C30 and BR45 [14]. A similar approach was used to derive the general cancer utility instruments QLU-C10D from the QLQ-C30 [51] and the FACT-8D from the FACT-G [52].

All authors who performed CFA of the QLQ-C30, alone or with the previous BrC module, BR23, tested hypotheses based on the scoring manual [53], consulted patients’ and clinicians’ perspectives from literature searches [25, 54], or determined the core dimensions based on investigator consensus [51]. Other authors took an EFA approach, without starting from an a priori theoretical framework which specifies item alignment with latent variables [25, 26].

Despite not starting from the same a priori theoretical framework, our CFA model of the QLQ-C30 and BR45 is closely aligned with previous factor models of the BR23, while including updated treatment-related symptoms in the BR45: systemic therapy side effects, sexual functioning and enjoyment, and endocrine therapy symptoms.

Our patient characteristics suggest that the BUI may be more applicable to long-term survivors of BrC on adjuvant endocrine therapy or patients who have metastatic disease, than those with early-stage BrC or undergoing chemotherapy. Most of our patients were on adjuvant systemic therapy (64.2%), or endocrine therapy (57%), and 24% were on any chemotherapy (Table 1). In comparison, 71.9% of patients in the development study for BR45 were on taxane chemotherapy, and 64.7% and 62.1% were taking cyclophosphamide or anthracycline, respectively [14].

We balanced comprehensive coverage of relevant factors and items with adequate model fit, sacrificing comprehensiveness to achieve parsimony and global model fit. We removed single-item functioning (global QoL, future perspective) and symptom subscales (e.g., dyspnea, insomnia, upset by hair loss). When dimensions overlapped, we chose the dimension rated more important by patients, i.e., retained sexual functioning and enjoyment. We excluded the nausea and vomiting dimension because of low internal consistency (α = 0.4) and low patient-rated importance (3.50). Few patients reported nausea and vomiting, likely because 17.1% of patients were on chemotherapy.

We prioritized patient experience with the sex-related dimension a priori because patients have first-hand experience of the illness, are well informed about the burden of disease, and have experience undergoing treatment [55]. Patients may rate the importance of dimensions as lower than clinicians, because patients are known to adapt to their health conditions, including changing their internal standards and values [56, 57].

Our re-specified models considered clinical relevance when reducing local areas of strain. ET55 (mood swings), originally in the Endocrine Therapy dimension, had a high modification index with the Emotional Functioning dimension, so re-specifying the model with ET55 in the Emotional Functioning dimension was congruent. SYS37 (hot flushes) and ET54 (sweating excessively) had high item residuals and a high modification index. These climacteric symptoms could be more predominant in tamoxifen and ovarian function suppression [58], so were therefore moved to the Endocrine Therapy dimension. ET56 (dizziness), originally in the endocrine therapy dimension had a high modification index with the fatigue dimension. While the diagnoses of dizziness can generally be one of four types: vertigo, disequilibrium, pre-syncope, or light-headedness [59], a high MI between dizziness and the fatigue dimension suggested that our participants more often associated dizziness with fatigue.

All of the patients and clinicians in our study were accrued from one urban cancer centre. To mitigate this lack of generalizability in the development sample, future validation of the dimensional structure will ideally include responses from patients and clinicians from multiple hospital sites. Patients should represent a wider range of treatments spanning all five health states. Other developers of condition-specific preference-based instruments involved patients and clinicians to validate their a priori dimensions, namely, the QLU-C10D for general cancer [9], DUI for diabetes [60], and NQU for multiple sclerosis [61].

Conclusions

The results of this CFA established the dimensional structure and is a first step to developing the BUI, a BrC preference-based instrument. The next steps in developing the BUI will focus on selecting the core dimensions (attributes) and most representative items per dimension [12].

Overall, understanding the dimensional structure of a novel psychometric questionnaire contributes to the development of a novel condition-specific preference-based instrument. The BUI, derived from the EORTC QLQ-C30 and BR45, will incorporate patient preferences to improve clinical and policy decisions.

Supporting information

S2 Fig. Item response distributions of EORTC QLQ-C30 subscales by BrC health state.

https://doi.org/10.1371/journal.pone.0262635.s002

(PDF)

S3 Fig. Item response distributions on EORTC QLQ BR45 subscales by BrC health state.

https://doi.org/10.1371/journal.pone.0262635.s003

(PDF)

S4 Fig. Inter-subscale correlations proportional to colour intensity and dot size.

https://doi.org/10.1371/journal.pone.0262635.s004

(PDF)

S1 Table. QLQ-C30 and BR45 items, associated scale, percentage missing.

https://doi.org/10.1371/journal.pone.0262635.s005

(PDF)

Acknowledgments

Medical oncology clinicians who helped with patient recruitment: Andrea Eisen, Kataryna Jerzak, Rossanna Pezo, Sonal Gandhi, Ellen Warner, Danilo Giffoni, Lisa Verity, Elizabeth Matheson, Kim Nguyen, Neda Stjepanovic, and Sunnybrook nursing staff.

Biomatrix support: Kathy Pritchard, Nim Li, Cordelia He, Martin Yaffe.

THETA members who provided input on patient recruitment: Suzanne Chung, Josephine Wong, Chang-Ho Lee.

Patient chart data abstraction: Arcturus Phoon.

References

  1. 1. SEER, Female Breast Cancer: https://seer.cancer.gov/statfacts/html/breast.html: National Cancer Institute, Surveillance, Epidemiology, and End Results Program.; 2021 Available from: https://seer.cancer.gov/statfacts/html/breast.html.
  2. 2. Mittmann N, Liu N, Cheng SY, Seung SJ, Saxena FE, Look Hong NJ, et al. Health system costs for cancer medications and radiation treatment in Ontario for the 4 most common cancers: a retrospective cohort study. CMAJ Open. 2020;8(1):E191–E8. pmid:32184283
  3. 3. Tripathy D, Im SA, Colleoni M, Franke F, Bardia A, Harbeck N, et al. Ribociclib plus endocrine therapy for premenopausal women with hormone-receptor-positive, advanced breast cancer (MONALEESA-7): a randomised phase 3 trial. Lancet Oncol. 2018;19(7):904–15. pmid:29804902
  4. 4. Swain SM, Baselga J, Kim SB, Ro J, Semiglazov V, Campone M, et al. Pertuzumab, trastuzumab, and docetaxel in HER2-positive metastatic breast cancer. N Engl J Med. 2015;372(8):724–34. pmid:25693012
  5. 5. Michiels S, Pugliano L, Marguet S, Grun D, Barinoff J, Cameron D, et al. Progression-free survival as surrogate end point for overall survival in clinical trials of HER2-targeted agents in HER2-positive metastatic breast cancer. Ann Oncol. 2016;27(6):1029–34. pmid:26961151
  6. 6. Marschner N, Zacharias S, Lordick F, Hegewisch-Becker S, Martens U, Welt A, et al. Association of Disease Progression With Health-Related Quality of Life Among Adults With Breast, Lung, Pancreatic, and Colorectal Cancer. JAMA Netw Open. 2020;3(3):e200643. pmid:32154886
  7. 7. Drummond MF, Sculpher MJ, Claxton K, Stoddart GL, Torrance GW. Methods for the Economic Evaluation of Health Care Programmes. Fourth ed. Oxford, United Kingdom : Oxford University Press; 2015.
  8. 8. Rabin R, de Charro F. EQ-5D: a measure of health status from the EuroQol Group. Ann Med. 2001;33(5):337–43. pmid:11491192
  9. 9. King MT, Costa DSJ, Aaronson NK, Brazier JE, Cella DF, Fayers PM, et al. QLU-C10D: a health state classification system for a multi-attribute utility measure based on the EORTC QLQ-C30. Quality of Life Research. 2016;25(3):625–36. pmid:26790428
  10. 10. Longworth L, Yang Y, Young T, Mulhern B, Hernandez Alava M, Mukuria C, et al. Use of generic and condition-specific measures of health-related quality of life in NICE decision-making: a systematic review, statistical modelling and survey. Health Technol Assess. 2014;18(9):1–224. pmid:24524660
  11. 11. Rowen D, Brazier J, Ara R, Azzabi Zouraq I. The Role of Condition-Specific Preference-Based Measures in Health Technology Assessment. Pharmacoeconomics. 2017;35(Suppl 1):33–41. pmid:29052164
  12. 12. Brazier J, Rowen D, Mavranezouli I, Tsuchiya A, Young T, Yang Y, et al. Developing and testing methods for deriving preference-based measures of health from condition-specific measures (and other patient-based measures of outcome). Health Technology Assessment. 2012;16(32): pmid:22832015
  13. 13. Goodwin E, Green C. A Systematic Review of the Literature on the Development of Condition-Specific Preference-Based Measures of Health. Appl Health Econ Health Policy. 2016;14(2):161–83. pmid:26818198
  14. 14. Bjelic-Radisic V, Cardoso F, Cameron D, Brain E, Kuljanic K, da Costa RA, et al. An international update of the EORTC questionnaire for assessing quality of life in breast cancer patients: EORTC QLQ-BR45. Ann Oncol. 2020;31(2):283–8. pmid:31959345
  15. 15. Aaronson NK, Ahmedzai S, Bergman B, Bullinger M, Cull A, Duez NJ, et al. The European Organization for Research and Treatment of Cancer QLQ-C30: a quality-of-life instrument for use in international clinical trials in oncology. J Natl Cancer Inst. 1993;85(5):365–76. pmid:8433390
  16. 16. Osoba D, Zee B, Pater J, Warr D, Kaizer L, Latreille J. Psychometric properties and responsiveness of the EORTC quality of Life Questionnaire (QLQ-C30) in patients with breast, ovarian and lung cancer. Qual Life Res. 1994;3(5):353–64. pmid:7841968
  17. 17. Groenvold M, Klee MC, Sprangers MA, Aaronson NK. Validation of the EORTC QLQ-C30 quality of life questionnaire through combined qualitative and quantitative assessment of patient-observer agreement. J Clin Epidemiol. 1997;50(4):441–50. pmid:9179103
  18. 18. Sprangers MA, Groenvold M, Arraras JI, Franklin J, te Velde A, Muller M, et al. The European Organization for Research and Treatment of Cancer breast cancer-specific quality-of-life questionnaire module: first results from a three-country field study. J Clin Oncol. 1996;14(10):2756–68. pmid:8874337
  19. 19. Lidgren M, Wilking N, Jonsson B, Rehnberg C. Health related quality of life in different states of breast cancer. Qual Life Res. 2007;16(6):1073–81. pmid:17468943
  20. 20. Lidgren M, Wilking N, Jonsson B, Rehnberg C. Resource use and costs associated with different states of breast cancer. Int J Technol Assess Health Care. 2007;23(2):223–31. pmid:17493308
  21. 21. Burstein HJ, Lacchetti C, Anderson H, Buchholz TA, Davidson NE, Gelmon KA, et al. Adjuvant Endocrine Therapy for Women With Hormone Receptor-Positive Breast Cancer: ASCO Clinical Practice Guideline Focused Update. J Clin Oncol. 2019;37(5):423–38. pmid:30452337
  22. 22. Constitution of the World Health Organization: World Health Organization; 2006 Available from: https://www.who.int/governance/eb/who_constitution_en.pdf.
  23. 23. Miller GA. The magical number seven, plus or minus two: some limits on our capacity for processing information. 1956. Psychol Rev. 1994;101(2):343–52. pmid:8022966
  24. 24. O’Brien RM. Identification of Simple Measurement Models with Multiple Latent Variables and Correlated Errors. Sociological Methodology. 1994;24:137–70.
  25. 25. Costa DS, Aaronson NK, Fayers PM, Grimison PS, Janda M, Pallant JF, et al. Deriving a preference-based utility measure for cancer patients from the European Organisation for the Research and Treatment of Cancer’s Quality of Life Questionnaire C30: a confirmatory versus exploratory approach. Patient Related Outcome Measures. 2014;5:119–29. pmid:25395875
  26. 26. Rowen D, Brazier J, Young T, Gaugris S, Craig BM, King MT, et al. Deriving a preference-based measure for cancer using the EORTC QLQ-C30. Value in Health. 2011;14(5):721–31. pmid:21839411
  27. 27. Flora DB. Statistical Methods for the Social & Behavioural Sciences A Model-Based Approach: SAGE; 2018. 400 p.
  28. 28. Streiner DL, Norman GR. Health measurement scales: a practical guide to their development and use: Oxford University Press; 2008.
  29. 29. Li CH. The Performance of ML, DWLS, and ULS Estimation With Robust Corrections in Structural Equation Models With Ordinal Variables. Psychol Methods. 2016;21:369–87. pmid:27571021
  30. 30. Flora DB, Curran PJ. An empirical evaluation of alternative methods of estimation for confirmatory factor analysis with ordinal data. Psychol Methods. 2004;9(4):466–91. pmid:15598100
  31. 31. Brown TA. Confirmatory Factory Analysis for Applied Research. Kenny DA, editor. New York , London: The Guilford Press; 2006.
  32. 32. Flora DB, Flake JK. The purpose and practice of exploratory and confirmatory factor analysis in psychological research: decisions for scale development and validation. Canadian Journal of Behavioural Science. 2017;49(2):78–88.
  33. 33. Cangur S, Ercan I. Comparison of Model Fit Indices Used in Structural Equation Modeling Under Multivariate Normality. Journal of Modern Applied Statistical Methods. 2015;14(1):152–67.
  34. 34. Hu L, Bentler P. Fit indices in covariance structure modeling: sensitivity to underparameterized model misspecification. Psychol Methods. 1998;3:424–53.
  35. 35. Hu L, Bentler P. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Struct Equation Model. 1999(1):1–55.
  36. 36. Norman G, Streiner D. Chapter 19 Principal Components and Factor Analysis: Fooling Around with Factors. 3rd Edition Biostatistics: the Bare Essentials. Hamilton, ON: B.C. Decker, Inc.; 2008. p. 194–209.
  37. 37. Gerbing DW, Anderson JC. On the Meaning of within-Factor Correlated Measurement Errors. Journal of Consumer Research. 1984;11(1):572–80.
  38. 38. Byrne BM. Factor Analytic Models: Viewing the Structure of an Assessment Instrument From Three Perspectives. Journal of Personality Assessment. 2005;85:17–32. pmid:16083381
  39. 39. ten Klooster PM, Veehof MM, Taal E, van Riel PLCM, van de Laar MAFJ. Confirmatory Factor Analysis of the Arthritis Impact Measurement Scales 2 Short Form in Patients With Rheumatoid Arthritis. Arthritis & Rheumatism. 2008;59(5):692–8. pmid:18438904
  40. 40. Whittaker TA. Using the modification index and standardized expectation parameter change for model modification. The Journal of Experimental Education. 2012;80(1):26–44.
  41. 41. EORTC Quality of Life Questionnaires—Modules: EORTC; 2017. Available from: http://groups.eortc.be/qol/eortc-qlq-c30.
  42. 42. Wei T, Simko V, Levy M, Xie Y, Jin Y, Zemla J. corrplot: Visualization of a correlation matrix https://cran.r-project.org/web/packages/corrplot/index.html 2017.
  43. 43. Revelle W. psych: Procedures for Psychological, Psychometric, and Personality Research https://cran.r-project.org/web/packages/psych/index.html. 2.0.9 ed2018.
  44. 44. Rosseel Y. lavaan: An R Package for Structural Equation Modeling. Journal of Statistical Software. 2012;48:1–36.
  45. 45. MacCallum RC, Widaman KF, Zhang S, Hong S. Sample Size in Factor Analysis. Psychol Methods. 1999;4(1):84–99.
  46. 46. Seung SJ, Traore AN, Pourmirza B, Fathers KE, Coombes M, Jerzak KJ. A population-based analysis of breast cancer incidence and survival by subtype in Ontario women. Curr Oncol. 2020;27(2):e191–e8. pmid:32489268
  47. 47. Canadian Cancer Statistics Advisory Committee. Canadian Cancer Statistics Toronto, Ontario: Canadian Cancer Society; 2018. Available from: Available at: cancer.ca/Canadian-Cancer-Statistics-2018-EN.pdf.
  48. 48. Statistics Canada. Census Profile, 2016 Census. Catalogue no. 98-316-X2016001 Ottawa: Statistics Canada; 2017 [updated November 29, 2017]. Available from: https://www12.statcan.gc.ca/census-recensement/2016/dp-pd/prof/index.cfm?Lang=E.
  49. 49. Mierzynska J, Taye M, Pe M, Coens C, Martinelli F, Pogoda K, et al. Reference values for the EORTC QLQ-C30 in early and metastatic breast cancer. Eur J Cancer. 2020;125:69–82. pmid:31838407
  50. 50. Krahn M, Ritvo P, Irvine J, Tomlinson G, Bezjak A, Trachtenberg J, et al. Construction of the Patient-Oriented Prostate Utility Scale (PORPUS): a multiattribute health state classification system for prostate cancer. J Clin Epidemiol. 2000;53(9):920–30. pmid:11004418
  51. 51. King MT, Costa DS, Aaronson NK, Brazier JE, Cella DF, Fayers PM, et al. QLU-C10D: a health state classification system for a multi-attribute utility measure based on the EORTC QLQ-C30.[Erratum appears in Qual Life Res. 2016 Oct;25(10):2683; pmid:27060088]. Quality of Life Research. 2016;25(3):625–36.
  52. 52. King MT, Norman R, Mercieca-Bebber R, Costa DSJ, McTaggart-Cowan H, Peacock S, et al. The Functional Assessment of Cancer Therapy Eight Dimension (FACT-8D), a Multi-Attribute Utility Instrument Derived From the Cancer-Specific FACT-General (FACT-G) Quality of Life Questionnaire: Development and Australian Value Set. Value Health. 2021;24(6):862–73. pmid:34119085
  53. 53. Michels FA, Latorre Mdo R, Maciel Mdo S. Validity, reliability and understanding of the EORTC-C30 and EORTC-BR23, quality of life questionnaires specific for breast cancer. Rev Bras Epidemiol. 2013;16(2):352–63. pmid:24142007
  54. 54. Gundy CM, Fayers PM, Groenvold M, Petersen MA, Scott NW, Sprangers MA, et al. Comparing higher order models for the EORTC QLQ-C30. Qual Life Res. 2012;21(9):1607–17. pmid:22187352
  55. 55. Gandjour A. Theoretical foundation of patient v. population preferences in calculating QALYs. Med Decis Making. 2010;30(4):E57–63. pmid:20511562
  56. 56. Schwartz CE, Sprangers MA. Methodological approaches for assessing response shift in longitudinal health-related quality-of-life research. Soc Sci Med. 1999;48(11):1531–48. pmid:10400255
  57. 57. Menzel P, Dolan P, Richardson J, Olsen JA. The role of adaptation to disability and disease in health state valuation: a preliminary normative analysis. Soc Sci Med. 2002;55(12):2149–58. pmid:12409128
  58. 58. Chlebowski RT. Interpreting quality-of-life data from the SOFT and TEXT trials. Lancet Oncol. 2015;16(7):749–51. pmid:26092819
  59. 59. Reilly BM. Dizziness. In: Walker HK, Hall WD, Hurst JW, editors. Clinical Methods: The History, Physical, and Laboratory Examinations. 3rd edition ed. Boston, M.A.: Butterworths; 1990.
  60. 60. Sundaram M, Smith MJ, Revicki DA, Elswick B, Miller L-A. Rasch analysis informed the development of a classification system for a diabetes-specific preference-based measure of health. Journal of Clinical Epidemiology. 2009;62(8):845–56. pmid:19573741
  61. 61. Matza LS, Phillips G, Dewitt B, Stewart KD, Cella D, Feeny D, et al. A Scoring Algorithm for Deriving Utility Values from the Neuro-QoL for Patients with Multiple Sclerosis. Med Decis Making. 2020;40(7):897–911. pmid:33016238