Adapting preference-based utility measures to capture the impact of cancer treatment-related symptoms

Shah, Koonal K.; Bennett, Bryan; Lenny, Andrew; Longworth, Louise; Brazier, John E.; Oppe, Mark; Pickard, A. Simon; Shaw, James W.

doi:10.1007/s10198-021-01337-6

Adapting preference-based utility measures to capture the impact of cancer treatment-related symptoms

Original Paper
Open access
Published: 17 June 2021

Volume 22, pages 1301–1309, (2021)
Cite this article

Download PDF

You have full access to this open access article

The European Journal of Health Economics Aims and scope Submit manuscript

Adapting preference-based utility measures to capture the impact of cancer treatment-related symptoms

Download PDF

1603 Accesses
3 Citations
3 Altmetric
Explore all metrics

Abstract

It is important that patient-reported outcome (PRO) measures used to assess cancer therapies adequately capture the benefits and risks experienced by patients, particularly when adverse event profiles differ across therapies. This study explores the case for augmenting preference-based utility measures to capture the impact of cancer treatment-related symptoms. Additional cancer treatment-related items could be specific (e.g., rash) or global. While specific items are easier to describe and understand, their use may miss rarer symptoms and those that are currently unknown but will arise from future medical advancements. The appropriate number of additional items, the independence of those items, and their impact on the psychometric properties of the core instrument require consideration. Alternatively, a global item could encompass all potential treatment-related symptoms, of any treatments for any disease. However, such an item may not be well understood by general public respondents in valuation exercises. Further challenges include the decision about whether to generate de novo value sets for the modified instrument or to map to existing tariffs. The fluctuating and transient nature of treatment-related symptoms may be inconsistent with the methods used in conventional valuation exercises. Fluctuating symptoms could be missed by sub-optimal measure administration timing. The addition of items also poses double-counting risks. In summary, the addition of treatment-related symptom items could increase the sensitivity of existing utility measures to capture known and unknown treatment effects in oncology, while retaining the core domains. However, more research is needed to investigate the challenges, particularly regarding valuation.

Impact of Adverse Events on Health Utility and Health-Related Quality of Life in Patients Receiving First-Line Chemotherapy for Metastatic Breast Cancer: Results from the SELECT BC Study

Article Open access 17 October 2017

The FACT-8D, a new cancer-specific utility algorithm based on the Functional Assessment of Cancer Therapies-General (FACT-G): a Canadian valuation study

Article Open access 16 June 2022

Condition-specific or generic preference-based measures in oncology? A comparison of the EORTC-8D and the EQ-5D-3L

Article Open access 09 November 2016

Introduction

For many years, chemotherapy, surgery and radiotherapy have been the most common forms of cancer treatment available. More recently, dramatic improvements have been made in the field of immunotherapy, which has become an important therapeutic alternative and is now the first choice in many cases [1]. Immunotherapy enables the immune system to fight against cancer, infections, and other diseases. It has been shown to be effective in treating a range of advanced and metastatic cancers [2]. Recent successes have spurred a rapid increase in the number of immuno-oncology therapies being developed [3]. Traditional therapies for cancer, including chemotherapy and radiotherapy, are in general poorly tolerated, being associated with a plethora of (often severe) toxicities ranging from hair loss to bruising and bleeding. The advent of immuno-oncology has led to its widespread adoption as a new standard of care for multiple tumour types, thanks not only to its efficacy but also its tolerability relative to conventional treatments. For example, a recent meta-analysis of 22 randomized clinical trials involving 12,727 patients with solid organ malignancies concluded that patients receiving immunotherapy were less likely to develop severe treatment-related symptoms (also referred to as side effects, adverse effects, adverse events and treatment risks) than those receiving traditional chemotherapy [4]. Nevertheless, as experience with immuno-oncology has grown, concerns have arisen regarding certain treatment-related symptoms, including fatigue, diarrhoea, nausea and respiratory problems [5, 6]. Since candidate treatments tend to differ in terms of the severity of these effects and patients’ ability to tolerate them, it is important that any patient-reported outcome (PRO) measures used to assess the impact of treatments are able to adequately capture both their positive and negative effects [7].

The aim of this commentary paper is to examine the adequacy of existing generic and condition-specific preference-based measures for capturing important outcomes in cancer, and to explore the case for modifying or adding items to existing measures to capture the impact of treatment-related symptoms. This commentary paper provides a targeted overview and discussion of the literature and current issues, with the view to encouraging further discussion within the health economics and outcomes research field, and to informing future PRO development and refinement in the field of oncology.

The paper is structured as follows. First, the types of PRO measures used in cancer are described. The suitability of descriptive systems for capturing health effects of treatment is then discussed, with reflections on how existing generic measures could be adapted, and on how condition-specific measures have dealt with the issue. Next, challenges relating to valuation, capturing transient events, and modeling are explored. Finally, future steps towards addressing these challenges are summarized.

Generic and condition-specific preference-based measures used in cancer

PROs can be delineated in a variety of ways as measures of health/health-related quality of life, notably as generic or condition-specific measures, and as preference-based or non-preference-based measures (note that the term ‘preference-based measure’ has been criticized as these measures are also used in applications where social preference weights are not relevant; ‘preference-accompanied measure’ has been suggested as an alternative [8]). There has been tremendous interest in preference-based measures in recent years due to their relevance in economic evaluations, as they can facilitate the calculation of quality-adjusted life years [9].

The EQ-5D, Health Utilities Index Mark 3 (HUI3), and SF-6D are three of the most prominent generic preference-based measures. A review of the psychometric properties of these instruments by Longworth et al. [10] showed that of the three, EQ-5D was by far the most commonly used in oncology, with 71 of the 98 studies reviewed reporting EQ-5D utility data (compared to 24 and three studies reporting HUI3 and SF-6D data, respectively). While there is evidence that EQ-5D is valid and reliable in many cancers [10, 11], there are concerns that this and other such generic measures are not sensitive enough and inevitably miss domains that are important in capturing the benefits and risks of new cancer treatments [12, 13].

Condition-specific measures are, therefore, preferred in some situations because by focusing on the condition of interest, they cover important dimensions that generic measures may miss, and can be more sensitive for a given dimension [14]. It is important to note that generic and disease-targeted measures are often used for different purposes. In cancer, examples of condition-specific measures include the European Organization for Research and Treatment of Cancer Quality of Life Questionnaires (EORTC QLQ) [15], the MD Anderson Symptom Inventory (MDASI) suite of measures [16] and the Functional Assessment of Cancer Therapy (FACT) family of instruments [17]. Since these measures have been developed specifically for use in cancer, they tend to offer greater content validity than generic measures within the oncology setting [18], when used for the intended purposes. On the other hand, for the purpose of serving as a health status descriptor relevant for evaluation in the general population, the content validity of QLQ-C30/FACT-G (and, therefore, QLU/FACT-8D) can be questioned.

However, the use of condition-specific measures poses problems in achieving cross-program comparability [19]. Many cancer-specific measures are not preference based or amenable to valuation. This means that they cannot be used to calculate quality-adjusted life years (QALYs), thereby precluding their use in cost-utility analysis. As an alternative to developing an entirely new cancer-specific preference-based measure, researchers have developed mapping functions that allow the conversion of outcomes from a non-preference-based measure to the values for a preference-based measure [10, 20]. The ISPOR Good Practices Task Force Report on mapping to estimate health-state utility from non-preference-based outcome measures provides methodological recommendations to analysts undertaking such studies [21]. Further recommendations on best practices for reporting the results of utility mapping studies have been provided in the MAPS (MApping onto Preference-based measures reporting Standards) statement [22]. However, mapping is unsuitable in situations where there is little overlap in content between the two measures, and it should not be used when the target preference-based measure is considered inappropriate.

Another approach is to take existing non-preference-based measures and reduce them so as to make them amenable to valuation [23]. This usually involves using psychometric criteria to select a subset of items from the existing measure and to analyze the performance of the candidate items. Relevant methods include factor analysis (a technique for identifying structurally independent dimensions with low correlation between each other), Rasch analysis (a technique that uses logistic models to convert categorical responses to points on a continuous scale), and assessments of validity and responsiveness. For example, the EORTC Quality of Life Utility Measure (EORTC QLU-C10D) is a health state classification system based on the larger EORTC QLQ-Core 30 (C30) cancer-specific quality of life questionnaire [24]. The QLU-C10D, which succeeded the EORTC 8D [25], comprises 10 dimensions, linked to 13 items selected from the 30 items of the QLQ-C30. The QLU-C10D has been valued using discrete choice experiments, and several national value sets have been reported [26,27,28,29]. The QLU-C10D valuation studies included a duration attribute to enable the anchoring of values onto the QALY scale [30]. Similarly, the FACT-8D is an eight-dimension preference-based measure derived from the FACT–General (FACT-G) questionnaire [31]. However, this approach may not be sufficient as many existing non-preference-based based measures such as the EORTC QLQ-C30 and the FACT-G were developed when chemotherapy was the dominant treatment paradigm in oncology. Since this time, the treatment landscape has evolved significantly and, therefore, many items in these measures may not be fully valid.

Yet even preference-based condition-specific measures may be problematic as they involve naming the condition (which can lead to bias [32]), lack a common upper anchor, and often miss impact on possible co-morbidities [33]. Valuation study respondents may exaggerate the importance of problems associated with the condition underpinning the health states under evaluation compared to other conditions (thereby leading to relatively large utility decrements) due to the psychological tendency to focus on what is placed in front of them [19], though this finding has not always been observed in the QLU-C10D valuation studies [26,27,28,29]. These concerns may undermine consistency in making comparisons between QALYs calculated using different measures. While some measures include domains representing known treatment-related symptoms (the QLU-C10D includes fatigue, appetite and nausea dimensions, for example [24]), unknown and less common side effects tend to be missed. Looking to the future, the effects of emerging innovative oncology treatments may be different from those of the chemotherapeutic regimens of past eras or from current immunotherapy options, and the cancer-specific measures previously developed may no longer be well suited to capture the array of health impacts.

Health technology assessment (HTA) agencies requiring cost-utility analyses generally prefer generic measures over condition-specific measures to promote consistency and comparability across appraisals [34]. However, preference-based condition-specific measures are sometimes accepted by HTA agencies in cases where there is evidence that the use of a generic measure is inappropriate, e.g., due to poor psychometric performance in the relevant patient group [35]. For example, in a National Institute for Health and Care Excellence (NICE) appraisal of fluocinolone acetonide intravitreal implant for the treatment of chronic diabetic macular oedema, the manufacturer collected quality of life data using a vision-specific questionnaire, the NEI-VFQ-25 [36]. NICE’s appraisal committee accepted that a disease-specific instrument was likely to be more responsive to changes of relevance to patients than the Institute’s preferred generic measure, the EQ‑5D.

Adaptation of existing measures

While it is common to include both generic and condition-specific measures in a clinical trial data collection strategy (indeed, some generic measures have been designed to be used alongside other, more detailed measures) [37], it is often desirable to limit the number of instruments in a given study to reduce patient, investigator and operational burden. A potential compromise is to develop an adapted version of a generic measure for use in specific diseases. This notion has parallels with the extension of condition-specific measures for use in specific subtypes of the disease. For example, the FACT-G is considered appropriate for use in patients with any form of cancer, and is complemented by variants that include questions specific to particular sites/tumors (e.g., FACT-C for colorectal cancer) [38, 39]. The EORTC QLQ and MDASI instrument groups also have modules covering symptoms relevant for specific patient populations, intended to complement the core questionnaires or items.

One way of adapting a generic measure is by modifying the descriptive system to include additional dimensions of health. In the context of the EQ-5D, such dimensions have been described as ‘bolt-on’ items. Such an approach could improve the performance of the measure in certain settings, whilst retaining the general structure and conceptual framework underpinning the original measure and achieving better consistency with any utility values associated with the original measure. Existing research has examined the impact of expanding the EQ-5D to include bolt-on dimensions such as cognition [40], psoriasis (skin irritation and self-confidence) [41], sleep [42], vision, hearing, tiredness [43] and respiratory problems [44], amongst others. Beyond the EQ-5D, Brazier et al. have examined the impact of adding a pain and discomfort dimension to the AQL-5D, an asthma-specific preference-based measure [45]. Cancer has been a key area in which preference-based approaches have been applied to disease-specific measures, thereby offering some insight into opportunities for bolt-ons [46].

Figure 1 shows when the adaptation of an existing generic measure may be justified—namely, if the generic measures fail to pick up important aspects of health and show poor psychometric properties in the relevant patient populations, and if measures specific to the condition of interest either do not exist or are otherwise inadequate [47]. It should be noted that these are necessary but not sufficient conditions for adapting a measure. Ultimately, the adaptation should improve the psychometric properties of the measure, i.e., it should address existing concerns about its content or face validity amongst the relevant patient population, and should matter to people to the extent that it would make a difference to utility values (though there are challenges involved in assessing this; see below) and ultimately to cost-effectiveness estimates. Psychometric methods such as principal component analysis can be used to identify gaps and identify candidate bolt-on dimensions for measures [48]. Principal component analysis involves examining a matrix of item correlations to reduce the information into a smaller set of components, with high intercorrelations implying that items are measuring the same latent component. Components can then be selected based on their eigenvalues, which represent the relative share of total variance accounted for by each component [49].

Capturing treatment-related symptoms

The QLU-C10D comprises multiple concepts, including items relating to functioning (physical, role, social and emotional) and disease-related symptoms such as pain. It also includes items that capture common side effects of cancer treatments, such as nausea and bowel problems. However, it lacks a general (or ‘global’) treatment-related symptoms item. According to King et al. [24], this reflects the convention that attributes in utility instruments typically represent specific domains of health. By contrast, amongst the FACT measures, both the general and more specific questionnaires contain a global side effects item (FACIT GP5) which asks respondents to indicate the extent to which they are ‘bothered by side effects of treatment’ using a five-point scale. This is consistent with the US Food and Drug Administration recommendation that the adverse consequences of treatment are measured separately from treatment effectiveness [50].

The absence of a global side effects item means that measures such as the QLU-C10D may miss the full range of possible treatment-related symptoms, including for example, immune-related side effects such as breathing problems, rash and impacts on physical appearance [6] that do not correspond to any of the measure’s existing items (though shortness of breath is included in the larger QLQ-C30 questionnaire). It is practically difficult to identify an encompassing set of symptoms when using specific items rather than a global item [51]. Scientific understanding of immune-related side effects is evolving as novel classes of immuno-oncology therapies come to market. As experience with these agents grows, it is plausible that further important treatment-related symptoms may be identified in the future that are not well captured by existing items in these measures.

As noted above, an alternative to such cancer-specific measures would be to use a generic preference-based measure and to add items designed to improve its performance in oncology. Treatment-related symptoms could be captured via a global item or one or more specific items. A global item would allow the capturing of all possible side effects, including those that are less common or not currently known. This could allow researchers to more effectively compare new treatments versus standard of care by adding information on the severity of their respective side effect profiles. However, it may be difficult to frame a global treatment-related symptoms item in a way that reflects how patients themselves think about and describe their health and treatment. Patients may not use terms like ‘treatment-related symptoms’ (though phrases such as ‘bothered by the effects of your treatment’ may overcome this concern), and indeed may not know whether a particular health problem they are experiencing is a symptom of their disease or a consequence of their treatment. In other disease areas, single-item ratings of side effects have not been recommended due to concerns about their lack of reliability and sensitivity to change [52]. On the other hand, items describing specific side effects, such as breathing problems, are likely to be better understood, but adding only one or two items may be insufficient given the large variety of symptoms associated with cancer therapy in practice. Adding a large number of items may be undesirable as this introduces the risk that the brevity and core structure of the original instrument will be lost, i.e., the more dimensions that are added, the more likely it will be that the additional dimensions double-count the same construct (double-counting is also a concern for the global item approach due to overlap between the perceived adverse effects of treatment and impacts on core domains, particularly domains related to discomfort). A key challenge is to find a balance between the two competing options to describe side effects.

Valuation issues

Condition-specific measures can in principle be valued using stated preference methods, as demonstrated by the recently published suite of QLU-C10D value sets [26,27,28,29]. However, if a generic measure is preferred, then adding items to existing measures may overcome this problem by placing the condition-specific element within the context of a broader health status measure, thereby potentially lessening the impact of focusing effects. This would necessitate the generation of a new value set for the augmented measure [44]. Not only would this be a very expensive process, but the new value set could be discordant with the existing value sets, e.g., the rank order of existing parameters could change. While the possibility of such findings should not deter research in this area, it would undoubtedly introduce challenges for HTA agencies who may be faced with possible ‘gaming’ due to the choice of multiple value sets, each with different properties. A potential solution has been suggested by Yang et al. [53], who explored the feasibility of using parameters from existing EQ-5D-5L value sets to predict values when new items are added. These were used as fixed parameters in modeling the bolt-on data, with a scale parameter introduced to capture the effects of adding the bolt-on item. The new items are valued as an additive or multiplicative deviation from the existing tariff. However, the evidence base supporting this approach is limited, and complications may arise if the new items interact with and affect the relative weightings of the existing items.

Further, health state valuations are conventionally derived from the preferences of the general population [54], as opposed to current patients. It is not clear whether a global item describing treatment-related symptoms would be understood by such individuals, particularly if they have never before experienced an unexpected adverse effect of treatment. Lack of familiarity with treatment burden may have contributed to general population samples placing relatively low weighting on specific symptom items in the QLU-C10D. A vignette valuation approach may help to provide the necessary context, though this is associated with other limitations such as inflexibility and challenges in incorporating into economic models [14].

A further issue in valuation relates to possible bias and focusing effects when specifying that symptoms are related to treatment. Valuation survey respondents may place a different amount of emphasis on symptoms if they are told that these are caused by treatment rather than by the disease, even if the impact of the symptoms on patients’ health and lives is the same irrespective of their cause. This may be an argument for favoring specific items that are not framed as being treatment related.

Challenges of implementation: capturing transient events

Treatment side effects may be impactful, but are often short lived or variable over the course of treatment. Such fluctuations in health pose measurement challenges. Sanghera and Coast [55] note that when health fluctuates, standard measurement and analytic approaches may not be suitable due to recall periods and the timing of assessment. This is due to a phenomenon known as ‘recall bias’ in which the length of recall periods can introduce error or bias into clinical trials. For example, if the period is too long, it may lead to cognitive distortion in memory of an event (e.g., an event being perceived as less severe as when it was experienced); if too short, it may not allow enough time for an outcome to occur [56, 57].

The EQ-5D asks respondents to self-report their health status ‘today’, so the health state reported could differ depending on whether or not the symptoms of treatment are being experienced on the day of questionnaire completion (though this also applies to the core dimensions, and can be addressed by optimizing the timing of measurement; see below). Questionnaires with longer recall periods may run into other issues (such as the FACT-G which asks about the ‘past seven days’) since it is unclear whether respondents should consider their average health or worst health experiences over that period [17]. The QLQ-C30 mixes recall periods, with some items framed in the present tense (e.g., ‘Do you have any trouble […]’) and other items—including those covering common side effects—covering a one-week recall period [15].

Fixed-duration recall periods may be problematic in the context of health state valuation, particularly using techniques such as time trade-off which posit that the health state in question is experienced for a specified duration that differs from the measurement recall period (conventionally 10 years in many valuation protocols [58]). For this reason, when valuing the preference-based QLU-C10, all 10 dimensions are framed in the present tense, in contrast to the corresponding QLQ-C30 items. The use of fixed health state durations, like 10 years, in valuation may be problematic for side effects and other episodic or intermittent changes in health, irrespective of the recall period used in the measure. EQ-5D valuation studies, for example, require valuation survey respondents to imagine that they will experience the specified health problems (e.g., moderate problems in walking about) for 10 years, with no variation in the level of those problems throughout that period [58]. Although some respondents may question whether such a scenario is realistic, it is at least straightforward to specify and comprehend. It is less clear how a health state involving occasional or fluctuating levels of problems with treatment-related symptoms (or indeed fluctuating disease symptoms) would be described over a 10-year period. Some researchers have attempted to find solutions for valuing profiles in which health varies over time [59, 60]. An issue encountered is that respondents tend to neglect information about the amount of time spent in symptomatic states.

Related to recall period is the issue of timing of assessment. Patients may or may not be experiencing side effects at the point of questionnaire completion (which suggests that longer recall periods, more frequent collection, or event-driven questionnaire completion may be appropriate). The side effects of certain cancer treatments may be predictable. For example, if the adverse effects of chemotherapy typically occur during the first week of treatment and recede by the next administration of treatment, then measurement on the day of treatment would miss the impact of these side effects [55]. To capture fully the impact of side effects—whether via a bolt-on dimension or not—it is important to optimize the timing of measurement in clinical trials to reflect fluctuation patterns that are known and predictable [50]. Advances in the electronic collection of PRO data are expected to facilitate greater flexibility in this regard, allowing patients to self-report their health status at time points that are relevant, and not merely operationally convenient. If it is possible to capture PROs when symptoms occur, this would lessen the recall bias associated with retrospective data collection.

Challenges of implementation: modeling

If important side effects are omitted from a given PRO measure, and therefore from the health state utility values associated with that measure, analysts may adjust the utility data to capture the impact of these side effects in economic models. Indeed, the ISPOR Good Practices Task Force Report on the identification, review, and use of health state utilities in cost-effectiveness models [61] explicitly recommends assessing “the extent to which the utility effects of important adverse events are captured by the data used to estimate a model’s non-adverse-event HSUs [health state utilities]” (p.273).

In practice, disutility values relating to treatment-related symptoms are typically sought from the literature and applied by subtracting the disutility from the utility value associated with the health state of interest or multiplying a weighting associated with the adverse event with the value of the health state of interest. These approaches risk double-counting if the main utility values already partially reflect the impact of those symptoms because the measure used captures them implicitly (this kind of double-counting issues is likely to occur when using any measure that describes symptoms). Further, the disutility values are often sourced or synthesized from data from multiple studies, which may be of variable methodological quality that used different, non-comparable valuation methods, and may not all have examined exactly the same side effect as the one being incorporated in the model. In some cases, disutility values for side effects are omitted from the model due to the lack of relevant data [62].

The inclusion of specific treatment-related symptoms (core or additional) items could help mitigate these issues. Notwithstanding the valuation issues described above, the valuation of the treatment-related symptoms would be combined with the valuation of the other health outcomes, thereby ensuring consistency in the methods used to generate the utility data. However, in order for such an item to demonstrate useful psychometric properties, the framing of the items and the frequency and timing of the data collection would need to be optimized so that the (sometimes transient) symptoms are not systematically missed at the point of questionnaire completion. It would also need to be demonstrated that the incidence rates of these side effects are sufficiently high, and their expected impact on quality-adjusted life years is sufficiently great, so as to justify their inclusion in the measure.

Limitations

This commentary paper does not present any data or analyses that could be used to examine empirically some of the conjectures and discussion points presented. The points raised were drawn from the literature and the authors’ own knowledge and experiences, but no systematic review of the literature was undertaken. We are not aware of any existing reviews of studies to augment preference-based measures in general, but refer readers to a review of studies of bolt-ons specifically for the EQ-5D [63]. This commentary paper has focused on oncology, largely due to the importance of treatment-related symptoms when assessing and comparing immuno-oncology therapies. Some of the points raised may not be generalizable to other disease areas. However, the schematic shown in Fig. 1 is not specific to oncology and can be applied to any condition. Questions such as whether a global or specific treatment-related symptoms item is preferred are relevant in disease areas beyond oncology. For example, in systematic lupus erythematosus, researchers responsible for developing the LupusPRO opted to include items describing specific treatment-related symptoms as well as a general item capturing ‘bothersome side effects’ [64].

Conclusions

When a preference-based measure of health is required, an additional layer of complexity is cast upon the acknowledged strengths and limitations of generic and disease-specific measures. Adapting existing generic preference-based measures by adding treatment-related symptoms items potentially improves their sensitivity to health-related changes/differences in cancer patients, whilst retaining a degree of consistency with the original measures. This may be preferable to relying on cancer-specific preference-based measures, which are less useful for comparability across appraisals, and do not themselves always capture these symptoms satisfactorily. It may also be preferable to continually developing new measures to address the shortcomings of existing ones. Such an approach would facilitate a more complete assessment of competing treatments with adverse event profiles that may differ in important ways. It could also reduce the sometimes problematic need for separate adjustment for adverse events in economic models (though some aspects of these events, such as survival outcomes and costs, would still need to be modeled separately from the health-related quality of life data).

Several challenges and research questions remain. While a global treatment-related symptoms item could encapsulate a range of symptoms for a host of current and future treatments (and could even cover symptoms associated with treatments for conditions other than cancer), it is not clear how well such an item would be understood by patients, particularly if they cannot distinguish between the symptoms of their condition and the side effects of treatment. In addition, it is unclear whether general public respondents in a study seeking to obtain utility values for the adapted measure would be able to comprehend valuing a global item that does not refer to specific side effects. Both issues warrant further research prior to adding global treatment-related symptoms items to existing preference-based measures.

It is also unclear what the appropriate recall period would be for treatment-related symptoms, many of which are transient or fluctuate in way that differ from the symptoms of the disease. These issues need investigating in further research in order to assess the case for adding treatment-related symptoms items to an established measure such as the EQ-5D.

Further research is also required into the optimal approach for valuing these additional items. Methods that avoid the need for newly developed value sets bespoke to each new item (and associated measure) are desirable on efficiency grounds and from the perspective of HTA agencies who require a degree of consistency in their methods of assessment and decision-making. Research to date has suggested that adding an item may affect the valuation of the core items of the instrument, so the additional impact on utility may not be simply additive [10]. Further testing of the approach proposed by Yang et al. [53], and alternative methods such as the use of discrete choice experiments to assess preferences for the additional items relative to the core dimensions of the instrument, would be beneficial.

Any adaptation of an existing measure, including development of new treatment-related symptoms items, would require a full assessment of psychometric properties to assess if the adaptation offers an improvement to the status quo. Further, the adapted instrument should also have an impact on associated utility values to offer an improvement. This may not always be the case, as demonstrated by Yang et al. who found that the inclusion of a sleep item did not have a significant impact on utilities derived for EQ-5D-3L health states [42].

The era of immuno-oncology increasingly reveals that current approaches to measuring the impact of cancer treatment-related symptoms on utility values are sub-optimal. This commentary paper has outlined alternative approaches that could be adopted to better capture these impacts for current and future treatments. Further research is needed to test the feasibility of these approaches and assess their impact on decision-making.

References

Arruebo, M., Vilaboa, N., Sáez-Gutierrez, B., Lambea, J., Tres, A., Valladares, M., et al.: Assessment of the evolution of cancer treatment therapies. Cancers. 3(3), 3279–3330 (2011)
Article CAS PubMed PubMed Central Google Scholar
Farkona, S., Diamandis, E.P., Blasutig, I.M.: Cancer immunotherapy: the beginning of the end of cancer? BMC. Med. 14(1), 73 (2016)
Article PubMed PubMed Central CAS Google Scholar
Cesano, A., Warren, S.: Bringing the next generation of Immuno-Oncology biomarkers to the clinic. Biomedicines. 6(1), 14 (2018)
Article PubMed Central CAS Google Scholar
Magee, D.E., Hird, A.E., Klaassen, Z., Sridhar, S.S., Nam, R.K., Wallis, C.J.D., et al.: Adverse event profile for immunotherapy agents compared with chemotherapy in solid organ tumors: a systematic review and meta-analysis of randomized clinical trials. Ann. Oncol. 31(1), 50–60 (2020)
Article CAS PubMed Google Scholar
Oiseth, S.J., Aziz, M.S.: Cancer immunotherapy: a brief review of the history, possibilities, and challenges ahead. J. Cancer. Metastasis. Treat. 3(10), 250–261 (2017)
Article CAS Google Scholar
Kroschinsky, F., Stölzel, F., von Bonin, S., Beutel, G., Kochanek, M., Kiehl, M., et al.: New drugs, new toxicities: severe side effects of modern targeted and immunotherapy of cancer and their management. Crit. Care. 21(1), 89 (2017)
Article PubMed PubMed Central Google Scholar
Kluetz, P.G., Kanapuru, B., Lemery, S., Johnson, L.L., Fiero, M.H., Arscott, K., et al.: Informing the tolerability of cancer treatments using patient-reported outcome measures: summary of an FDA and critical path institute workshop. Value. Health. 21(6), 742–747 (2018)
Article PubMed Google Scholar
Devlin N. The Academic Health Economists' Blog. 2020. Available from: https://aheblog.com/2020/08/12/preference-based-measure-is-misleading-can-we-agree-on-something-better/
Feeny, D.: Preference-based measures: utility and quality-adjusted life years. Assessing quality of life in clinical trials. 2, 405–431 (2005)
Google Scholar
Longworth, L., Yang, Y., Young, T., Mulhern, B., Hernandez Alava, M., Mukuria, C., et al.: Use of generic and condition-specific measures of health-related quality of life in NICE decision-making: a systematic review, statistical modelling and survey. Health. Technol. Assess. 18(9), 1–224 (2014)
Article PubMed PubMed Central Google Scholar
Pickard, A.S., Wilke, C.T., Lin, H.-W., Lloyd, A.: Health utilities using the EQ-5D in studies of cancer. Pharmacoeconomics 25(5), 365–384 (2007)
Article PubMed Google Scholar
Devlin, N.J., Lorgelly, P.K.: QALYs as a measure of value in cancer. J. Cancer. Policy. 11, 19–25 (2017)
Article Google Scholar
Shah, K.K., Mulhern, B., Longworth, L., Janssen, M.: Views of the UK general public on important aspects of health not captured by EQ-5D. Patient. 10(6), 701–709 (2017)
PubMed Google Scholar
Brazier, J., Ratcliffe, J., Tsuchiya, A., Soloman, J.: Measuring and valuing health benefits for economic evaluation, 2nd edn. Oxford University Press, Oxford (2017)
Google Scholar
EORTC. n.d. [Quality of Life Group Website]. Available from: https://www.eortc.org/
MD Anderson. The MD anderson symptom inventory n.d. [Available from: https://www.mdanderson.org/research/departments-labs-institutes/departments-divisions/symptom-research/symptom-assessment-tools/md-anderson-symptom-inventory.html
FACIT. n.d. [Questionnaires]. Available from: https://www.facit.org/
van Roij, J., Fransen, H., van de Poll-Franse, L., Zijlstra, M., Raijmakers, N.: Measuring health-related quality of life in patients with advanced cancer: a systematic review of self-administered measurement instruments. Qual. Life. Res. 27(8), 1937–1955 (2018)
Article PubMed Google Scholar
Brazier, J., Tsuchiya, A.: Preference-based condition-specific measures of health: what happens to cross programme comparability? Health. Econ. 19(2), 125–129 (2010)
Article PubMed Google Scholar
Young, T.A., Mukuria, C., Rowen, D., Brazier, J.E., Longworth, L.: Mapping Functions in health-related quality of life: mapping from two cancer-specific health-related quality-of-life instruments to EQ-5D-3L. Med. Decis. Making. 35(7), 912–926 (2015)
Article PubMed PubMed Central Google Scholar
Wailoo, A.J., Hernandez-Alava, M., Manca, A., Mejia, A., Ray, J., Crawford, B., et al.: Mapping to estimate health-state utility from non-preference-based outcome measures: an ISPOR good practices for outcomes research task force report. Value. Health. 20(1), 18–27 (2017)
Article PubMed Google Scholar
Petrou, S., Rivero-Arias, O., Dakin, H., Longworth, L., Oppe, M., Froud, R., et al.: Preferred reporting items for studies mapping onto preference-based outcome measures: the MAPS statement. Pharmacoeconomics 33(10), 985–991 (2015)
Article PubMed PubMed Central Google Scholar
Brazier, J., Rowen, D., Mavranezouli, I., Tsuchiya, A., Young, T., Yang, Y., et al.: Developing and testing methods for deriving preference-based measures of health from condition-specific measures (and other patient-based measures of outcome). Health. Technol. Assess. 16(32), 1–114 (2012)
Article CAS PubMed Google Scholar
King, M.T., Costa, D.S., Aaronson, N.K., Brazier, J.E., Cella, D.F., Fayers, P.M., et al.: QLU-C10D: a health state classification system for a multi-attribute utility measure based on the EORTC QLQ-C30. Qual. Life. Res. 25(3), 625–636 (2016)
Article CAS PubMed Google Scholar
Rowen, D., Brazier, J., Young, T., Gaugris, S., Craig, B.M., King, M.T., et al.: Deriving a preference-based measure for cancer using the EORTC QLQ-C30. Value. Health. 14(5), 721–731 (2011)
Article PubMed Google Scholar
King, M.T., Viney, R., Simon Pickard, A., Rowen, D., Aaronson, N.K., Brazier, J.E., et al.: Australian utility weights for the EORTC QLU-C10D, a multi-attribute utility instrument derived from the cancer-specific quality of life questionnaire, EORTC QLQ-C30. Pharmacoeconomics. 36(2), 225–238 (2018)
Article PubMed Google Scholar
Norman, R., Mercieca-Bebber, R., Rowen, D., Brazier, J.E., Cella, D., Pickard, A.S., et al.: UK utility weights for the EORTC QLU-C10D. Health. Econ. 28(12), 1385–401 (2019)
Article PubMed Google Scholar
Kemmler, G., Gamper, E., Nerich, V., Norman, R., Viney, R., Holzner, B., et al.: German value sets for the EORTC QLU-C10D, a cancer-specific utility instrument based on the EORTC QLQ-C30. Qual. Life. Res. 28(12), 3197–3211 (2019)
Article PubMed PubMed Central Google Scholar
McTaggart-Cowan, H., King, M.T., Norman, R., Costa, D.S.J., Pickard, A.S., Regier, D.A., et al.: The EORTC QLU-C10D: the Canadian valuation study and algorithm to derive cancer-specific utilities from the EORTC QLQ-C30. MDM Policy Pract. 4(1), 2381468319842532 (2019)
PubMed PubMed Central Google Scholar
Norman, R., Mulhern, B., Viney, R.: The impact of different DCE-based approaches when anchoring utility scores. Pharmacoeconomics 34(8), 805–814 (2016)
Article PubMed Google Scholar
King, M., Norman, R., Viney, R., Costa, D., Brazier, J., Cella, D., et al.: Two new cancer-specific multi-attribute utility instruments: EORTC QLU-C10D and FACT-8D. Value. Health. 19(7), A807 (2016)
Article Google Scholar
McTaggart-Cowan, H., Regier, D.A., Peacock, S.J.: Exploring the role of disease labels on general population preferences. Presentation at the ISOQOL 22nd Annual Conference. Vancouver. 21–24 October (2015)
Versteegh, M.M., Leunis, A., Uyl-de Groot, C.A., Stolk, E.A.: Condition-specific preference-based measures: benefit or burden? Value. Health. 15(3), 504–513 (2012)
Article PubMed Google Scholar
Rowen, D., Zouraq, I.A., Chevrou-Severac, H., van Hout, B.: International regulations and recommendations for utility data for health technology assessment. Pharmacoeconomics 35(1), 11–19 (2017)
Article PubMed Google Scholar
Rowen, D., Brazier, J., Ara, R., Zouraq, I.A.: The role of condition-specific preference-based measures in health technology assessment. Pharmacoeconomics 35(1), 33–41 (2017)
Article PubMed Google Scholar
NICE. Fluocinolone acetonide intravitreal implant for treating chronic diabetic macular oedema in phakic eyes after an inadequate response to previous therapy. 2019
Kind, P., Brooks, R., Rabin, R.: EQ-5D concepts and methods. A developmental history. Springer, Dordrecht (2005)
Google Scholar
FACIT. Questionnaires n.d. [Available from: https://www.facit.org/FACITOrg/Questionnaires
Ward, W.L., Hahn, E.A., Mo, F., Hernandez, L., Tulsky, D.S., Cella, D.: Reliability and validity of the Functional Assessment of Cancer Therapy-Colorectal (FACT-C) quality of life instrument. Qual. Life. Res. 8(3), 181–195 (1999)
Article CAS PubMed Google Scholar
Krabbe, P.F., Stouthard, M.E., Essink-Bot, M.-L., Bonsel, G.J.: The effect of adding a cognitive dimension to the EuroQol multiattribute health-status classification system. J. Clin. Epidemiol. 52(4), 293–301 (1999)
Article CAS PubMed Google Scholar
Swinburn, P., Lloyd, A., Boye, K., Edson-Heredia, E., Bowman, L., Janssen, B.: Development of a disease-specific version of the EQ-5D-5L for use in patients suffering from psoriasis: lessons learned from a feasibility study in the UK. Value. Health. 16(8), 1156–1162 (2013)
Article PubMed Google Scholar
Yang, Y., Brazier, J., Tsuchiya, A.: Effect of adding a sleep dimension to the EQ-5D descriptive system: a “bolt-on” experiment. Med. Decis. Making. 34(1), 42–53 (2014)
Article PubMed Google Scholar
Yang, Y., Rowen, D., Brazier, J., Tsuchiya, A., Young, T., Longworth, L.: An exploratory study to test the impact on three “bolt-on” items to the EQ-5D. Value. Health. 18(1), 52–60 (2015)
Article PubMed PubMed Central Google Scholar
Hoogendoorn, M., Oppe, M., Boland, M.R.S., Goossens, L.M.A., Stolk, E.A., Rutten-van, M.M.: Exploring the impact of adding a respiratory dimension to the EQ-5D-5L. Med. Decis. Making. 39(4), 393–404 (2019)
Article PubMed PubMed Central Google Scholar
Brazier, J., Rowen, D., Tsuchiya, A., Yang, Y., Young, T.A.: The impact of adding an extra dimension to a preference-based measure. Soc. Sci. Med. 73(2), 245–253 (2011)
Article PubMed PubMed Central Google Scholar
Lin, F.-J., Longworth, L., Pickard, A.S.: Evaluation of content on EQ-5D as compared to disease-specific utility measures. Qual. Life Res. 22(4), 853–874 (2013)
Article PubMed Google Scholar
Finch, A.P., Brazier, J.E., Mukuria, C.: What is the evidence for the performance of generic preference-based measures? A systematic overview of reviews. Eur. J. Health. Econ. 19(4), 557–570 (2018)
Article PubMed Google Scholar
Finch, A.P., Brazier, J.E., Mukuria, C., Bjorner, J.B.: An exploratory study on using principal-component analysis and confirmatory factor analysis to identify bolt-on dimensions: the EQ-5D case study. Value. Health. 20(10), 1362–1375 (2017)
Article PubMed Google Scholar
Finch, A.P.: An investigation of methods for identifying and selecting bolt-on dimensions: the EQ-5D-5L case study. The University of Sheffield, White Rose eTheses Online (2017)
Google Scholar
FDA. Guidance for industry: patient-reported outcome measures: use in medical product development to support labelling claims. 2009. Contract No.: 235
Speck, R.M., Lenderking, W.R., Shaw, J.W.: Integrating the patient voice with clinician reports to identify a hepatocellular carcinoma-specific subset of treatment-related symptomatic adverse events. J. Patient. Rep. Outcomes. 2(1), 35 (2018)
Article PubMed PubMed Central Google Scholar
Kimel, M., Hsieh, R., McCormack, J., Burch, S.P., Revicki, D.A.: Validation of the revised Patient Perception of Migraine Questionnaire (PPMQ-R): measuring satisfaction with acute migraine treatment in clinical trials. Cephalalgia 28(5), 510–523 (2008)
Article CAS PubMed Google Scholar
Yang, Z., Rand, K.J.B., Luo, N, editors. Modelling TTO values of vision bolt-on and self-care bolt-off health states: can bolt-on and bolt-off value sets be built upon EQ-5D value set? Paper presented at the EuroQol Plenary Meeting, Brussels (2019)
Neumann, P.J., Sanders, G.D., Russell, L.B., Siegel, J.E., Ganiats, T.G.: Cost-effectiveness in health and medicine. Oxford University Press, New York (2016)
Book Google Scholar
Sanghera, S., Coast, J.: Measuring quality-adjusted life-years when health fluctuates. Value Health 23(3), 343–350 (2020)
Article PubMed Google Scholar
Tversky, A., Kahneman, D.: Availability: a heuristic for judging frequency and probability. Cogn. Psychol. 5(2), 207–232 (1973)
Article Google Scholar
Stull, D.E., Leidy, N.K., Parasuraman, B., Chassany, O.: Optimal recall periods for patient-reported outcomes: challenges and potential solutions. Curr. Med. Res. Opin. 25(4), 929–942 (2009)
Article PubMed Google Scholar
Oppe, M., Rand-Hendriksen, K., Shah, K., Ramos-Goni, J.M., Luo, N.: EuroQol protocols for time trade-off valuation of health outcomes. Pharmacoeconomics 34(10), 993–1004 (2016)
Article PubMed PubMed Central Google Scholar
Janssen, M.F., Birnie, E., Bonsel, G.: Feasibility and reliability of the annual profile method for deriving QALYs for short-term health conditions. Med. Decis. Making. 28(4), 500–510 (2008)
Article PubMed Google Scholar
Brazier, J., Dolan, P., Karampela, K., Towers, I.: Does the whole equal the sum of the parts? Patient-assigned utility scores for IBS-related health states and profiles. Health. Econ. 15(6), 543–551 (2006)
Article PubMed Google Scholar
Brazier, J., Ara, R., Azzabi, I., Busschbach, J., Chevrou-Séverac, H., Crawford, B., et al.: Identification, review, and use of health state utilities in cost-effectiveness models: an ISPOR good practices for outcomes research task force report. Value. Health. 22(3), 267–275 (2019)
Article PubMed Google Scholar
CRD/CHE Technology Assessment Group: Efalizumab and Etanercept for the Treatment of Psoriasis: Technology Assessment Report commissioned by the HTA Programme on behalf of The National Institute for Clinical Excellence. University of York, York (2005)
Google Scholar
Geraerds, A.J.L.M., Bonsel, G.J., Janssen, M.F., Finch, A.P., Polinder, S., Haagsma, J.A.: Methods used to identify, test, and assess impact on preferences of bolt-ons: a systematic review. Value Health 24(6), 901–916 (2021)
Article PubMed Google Scholar
Jolly, M., Pickard, A.S., Block, J.A., Kumar, R.B., Mikolaitis, R.A., Wilke, C.T., et al.: Disease-specific patient reported outcome tools for systemic lupus erythematosus. Semin. Arthritis. Rheum. 42(1), 56–65 (2012)
Article PubMed Google Scholar

Download references

Acknowledgements

The authors are grateful to David Mott and participants at the EuroQol Virtual Descriptive Systems Meeting for their comments on an earlier draft version of the paper.

Funding

Financial support was received from Bristol-Myers Squibb.

Author information

Authors and Affiliations

PHMR, London, UK
Koonal K. Shah, Andrew Lenny & Louise Longworth
Bristol-Myers Squibb, Uxbridge, UK
Bryan Bennett
School of Health and Related Research, University of Sheffield, Sheffield, UK
Koonal K. Shah & John E. Brazier
Axentiva Solutions, Tenerife, Spain
Mark Oppe
University of Illinois at Chicago, Chicago, IL, USA
A. Simon Pickard
Bristol-Myers Squibb, Lawrenceville, NJ, USA
James W. Shaw

Authors

Koonal K. Shah
View author publications
You can also search for this author in PubMed Google Scholar
Bryan Bennett
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Lenny
View author publications
You can also search for this author in PubMed Google Scholar
Louise Longworth
View author publications
You can also search for this author in PubMed Google Scholar
John E. Brazier
View author publications
You can also search for this author in PubMed Google Scholar
Mark Oppe
View author publications
You can also search for this author in PubMed Google Scholar
A. Simon Pickard
View author publications
You can also search for this author in PubMed Google Scholar
James W. Shaw
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the following activities: design, analysis, reviewing, editing and approval. The study was conceived by JWS, BB and LL. The first draft of the manuscript was prepared by KKS and AL.

Corresponding author

Correspondence to Koonal K. Shah.

Ethics declarations

Conflict of interest

KKS, AL and LL are employed by PHMR, a consultancy that receives income from pharmaceutical industry clients. BB and JWS are employees and shareholders of Bristol-Myers Squibb. MO was an employee of Axentiva Solutions, a consultancy that receives income from pharmaceutical industry clients. ASP is a partner in Second City Outcomes Research, a consultancy that receives income from pharmaceutical industry clients. KKS, LL, JEB, MO, ASP and JWS are members of the EuroQol Group.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shah, K.K., Bennett, B., Lenny, A. et al. Adapting preference-based utility measures to capture the impact of cancer treatment-related symptoms. Eur J Health Econ 22, 1301–1309 (2021). https://doi.org/10.1007/s10198-021-01337-6

Download citation

Received: 11 November 2020
Accepted: 08 June 2021
Published: 17 June 2021
Issue Date: November 2021
DOI: https://doi.org/10.1007/s10198-021-01337-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Adapting preference-based utility measures to capture the impact of cancer treatment-related symptoms

Abstract

Similar content being viewed by others

Impact of Adverse Events on Health Utility and Health-Related Quality of Life in Patients Receiving First-Line Chemotherapy for Metastatic Breast Cancer: Results from the SELECT BC Study

The FACT-8D, a new cancer-specific utility algorithm based on the Functional Assessment of Cancer Therapies-General (FACT-G): a Canadian valuation study

Condition-specific or generic preference-based measures in oncology? A comparison of the EORTC-8D and the EQ-5D-3L

Introduction

Generic and condition-specific preference-based measures used in cancer

Adaptation of existing measures

Capturing treatment-related symptoms

Valuation issues

Challenges of implementation: capturing transient events

Challenges of implementation: modeling

Limitations

Conclusions

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Adapting preference-based utility measures to capture the impact of cancer treatment-related symptoms

Abstract

Similar content being viewed by others

Impact of Adverse Events on Health Utility and Health-Related Quality of Life in Patients Receiving First-Line Chemotherapy for Metastatic Breast Cancer: Results from the SELECT BC Study

The FACT-8D, a new cancer-specific utility algorithm based on the Functional Assessment of Cancer Therapies-General (FACT-G): a Canadian valuation study

Condition-specific or generic preference-based measures in oncology? A comparison of the EORTC-8D and the EQ-5D-3L

Introduction

Generic and condition-specific preference-based measures used in cancer

Adaptation of existing measures

Capturing treatment-related symptoms

Valuation issues

Challenges of implementation: capturing transient events

Challenges of implementation: modeling

Limitations

Conclusions

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation