Background
Name and description | Detection of response shift | Effect size metrics as reported in the included studies |
---|---|---|
1. Design-based methods This family of methods requires changes or extensions to common study designs to enable the detection of response shift [3]. | ||
1.1 Then-test The then-test is an additional measurement at follow-up occasion. Respondents complete the same measure as they did at baseline and follow-up, but now with the instruction to re-evaluate their level of baseline functioning [8, 32]. This includes: 1.1.1 Then-test: original Conventional use of the then-test as described above 1.1.2 Then-test: derivative Study-specific adaptations of the then-test. For example, when applied to valuation of health states, respondents at follow-up are asked to evaluate their own health state of that moment and are asked to recall their valuation of their health state at the previous interview [33]. | Then-test: original Recalibration: Comparison of the mean difference between baseline and then-test. Explanation: For example, when chemotherapy induces fatigue, patients may adapt to this higher fatigue level. As a consequence, they may recalibrate the response scale for fatigue. This is indicated when respondents retrospectively (at the then-test) report less fatigue than they did at baseline. The comparison of the mean difference between baseline and then-test is then indicative of recalibration. Then-test: derivative Recalibration: Unique for each study. | Then-test: original Standardized mean differences (SMDs) between baseline (\({\overline{X} }_{2})\) and then-test (\({\overline{X} }_{1})\) scores were calculated based on available information.a We used reported SMDs, if provided, when insufficient information was available to calculate the SMDs. Then-test: derivative Same as Then-test original. |
1.2 Individualized methods This family of methods either have respondent-generated content in terms of the domains or the scale anchors are respondent generated or defined. A necessary component of individualized approaches is some respondent-generated content (e.g., items, scale anchors). | ||
1.2.1 Schedule for the Evaluation of Individual Quality of Life (SEIQoL) Respondents nominate the five most relevant domains (also called cues) to their HRQoL and assess their current functioning for each domain using a visual analogue scale (VAS) ranging from best to worst possible functioning. Patients then rank the relative importance of each domain by allocating 100 points to the five domains, using a pie chart disk (judgment analysis can also be used). The SEIQoL generates an overall index score, which is the sum of all five domain products (multiplication of each domain’s weight by its corresponding level) [34]. If the SEIQoL is administered at two points in time, response shift can be assessed [8]. | Recalibration: Cannot be detected (only in combination with another method, e.g., the then-test) (e.g., [34]). Reprioritization: Statistical test of change in the domain weights. This may entail the difference in intra-class correlation coefficients between domain weights over time [35]; or subtraction of weights at follow-up from weights at baseline within Reconceptualization: Statistical test of change in the number or type of the nominated domains over time. This may entail a mere count of domains mentioned at follow-up but not at baseline [36, 38], or the number (or percentage of) participants who changed at least one domain over time [37] or the most important domain over time [39]. A qualitative review of change in domain content is part of the procedure [8]. | Recalibration, reprioritization, and reconceptualization: Effect sizes were not reported and could not be calculated based on information reported in the included studies, except for one study where the then-test method was applied to calculate SMDs based on the SEIQoL [34]. |
1.2.2 Patient-Generated Index (PGI) Respondents nominate up to five areas in relation to their disease that impacted their QOL and one additional area not related to their disease. Respondents are then asked to rate the severity of the nominated areas on a scale of 0–10 (e.g., with 0 being severe or worst they can imagine) and 10 being mild or as they would like to be). Finally, respondents are asked to distribute 12 tokens among the nominated areas, at least one to each area, and give more tokens to the areas that are in most need of improvement. The total score is calculated by multiplying the severity score by the proportion of the 12 tokens assigned to each area and then summing this across the six areas (five disease related and one non-disease related) [40]. An area-weighted score can also be calculated [41]. If the PGI is administered at two points in time, response shift can be assessed [42]. | Recalibration: Cannot be detected (only in combination with another method, e.g., the then-test). Reprioritization: Statistical test of change in the domain weights (change in number of tokens) over time. This may be combined with qualitative interviews to results (e.g., [41, 43]). Reconceptualization: | Recalibration, Reprioritization, and Reconceptualization: Effect sizes were not reported and could not be calculated based on information reported in the included studies. |
1.2.3 Cantril’s ladder and/or changes in anchors Respondents are asked to describe their best and worst imaginable life satisfaction as anchors for the ladder. They then rate their current life satisfaction on that ladder with the lowest rung being the worst and the highest rung being the best. If Cantril’s ladder is administered at two points in time, response shift can be assessed [3]. In some studies, patients are then invited to locate the pre-test anchors on the post-test scale with the post-test anchors indicating numbers 1 and 10. This rating scale is extended at both extremes allowing the anchor descriptions of the first assessment to be worse, better, or correspond with those of the second assessment [44, 45]. | Recalibration: Statistical test of the difference between baseline and transformed baseline. The transformed scores are a function of the baseline scores and the position of the best and worse anchors in Cantril’s ladder at follow-up [44]. | Recalibration: |
1.3 Other design-based methods | ||
1.3.1 Ideal-scale approach Respondents are asked to complete a questionnaire twice: First in reference to their actual status (e.g., how they perceive their current QOL) and second to their ideal status, e.g., how they would like their QOL to be or how they expect their QoL to change [46]. These two questionnaires are administered subsequently at the same assessment point. Administration of these two questionnaires is repeated over time [3]. | Recalibration: The response scale of what ideal entails may undergo recalibration, which is indicated by a statistical test of mean changes in ideal scores over time [3]. Alternatively, change in internal standards can be captured by comparing actual and ideal status (e.g., QoL expectancies) between baseline and follow-up [46]. | Recalibration: Effect sizes were not reported and could not be calculated based on information reported in the included studies. |
1.3.2 Appraisal | Direct response shift effects: How much changes in appraisal explain the discrepancy between expected and observed QoL (e.g., residuals in a regression model reflecting unexplained variance). Moderated response shift effects: Interaction effects between appraisal change scores*catalyst [8]. No distinction is made between response shift type. | Direct and moderated response shift effects: Effect sizes were not reported and could not be calculated based on information reported in the included study. |
1.3.3 Change in importance ratings Respondents are asked to indicate the importance of QoL domains over time, using response scales per domain (e.g., from very unimportant to very important) or by ranking the domains according to importance [48]. | Reprioritization: If the relative importance of (the QoL) domains changes over time, this is indicated by statistical tests of mean change in importance ratings over time. | Reprioritization: SMDs of importance ratings, based on the same formulas as for the then-test [49]. |
1.3.4 Preference-based methods using vignettes These methods assess the importance and value a patient explicitly places on a health state or quality-of-life dimension [3]. Patients are asked to rate (e.g., from poor to excellent) one or more anchoring vignettes, describing a particular (hypothetical) health state at different points in time [8]. | Reprioritization: Statistical test of mean change in vignette ratings over time [8]. | Reprioritization: |
2. Latent-variable approaches | ||
2.1 Structural equation models (SEMs) Latent-variable SEMs are used to test whether measurement model parameters that define the relationships between PROM indicators and their corresponding latent factors are consistent (or invariant) over time. Measurement indicators can be at the item level (where the SEM specifies the relationships between PROM items and latent factors) or subscale level (where the SEM specifies the relationships between PROM subscales and latent factors). Response shift is inferred when results indicate a lack of longitudinal measurement invariance [8, 9]. Includes: 2.1.1 Oort’s SEM method 2.1.2 Schmitt’s SEM method 2.1.3 Other SEM method | Oort’s SEM method Recalibration: This is reflected in changes in intercepts (uniform recalibration) or residual variances (non-uniform recalibration). Explanation: If respondents interpret the response-scale options differently at follow-up than at baseline, then domain-level mean will change even when the mean of the overall construct (i.e., the latent factor) remains constant. The corresponding intercept will change if this difference is the same for all scores of the latent factor. The domain-level residual variances will change if the difference is contingent on the latent factor score. Reprioritization: This is reflected in change in values of latent factor loadings for one or more of the domains. Explanation: If the importance of component domains constituting the target construct changes over time, then the relative contribution of each domain to the measurement of the overall construct (i.e., latent factor) will change. Reconceptualization: This is reflected in latent factor loadings for one or more of the domains having a value zero at one or more of the time points. Explanation: If respondents conceptualize the response scale differently over time, some constituting domains are absent at one time point and present at another. Schmitt’s SEM method Recalibration/‟beta-change”: Change in the latent factor co-variances, variances, or loadings.c Reconceptualization: Changes in the pattern of latent factor loadings. Of note, change in residual variances are assumed to represent change in random error over time. Other SEM method | Estimated SMDs for recalibration, reprioritization, and conceptualization response shift are based on models that adjust for a lack of longitudinal measurement invariance. SDMs were calculated in the same way for all SEM methods based on reported model parameter estimates using the formulas provided by Verdam et al. [23]).b We used reported SDMs, if provided, when insufficient information was available to calculate the standardized means differences. |
2.2 Item response theory (IRT) or Rasch models Latent factor models based on IRT or Rasch measurement theory are used to test whether measurement model parameters that define the relationships between PROM items and their corresponding latent factors are invariant over time. Response shift is inferred by a lack of invariance in discrimination power and difficulty parameters [8, 9]. | Recalibration: Change in item difficulty parameters estimates or thresholds. Reprioritization (applies only to IRT): Change in item discrimination parameter estimates. | Effect sizes were not reported and could not be calculated based on information reported in the included studies |
3 Regression methods Statistical methods that rely on regression analysis (excluding latent-variable models). | ||
3.1 Regression methods with classification Use of regression models to classify people as having had response shift or not. | ||
3.1.1 Classification and Regression Tree (CART) A non-parametric method that involves recursive partitioning of the longitudinal data into homogeneous subgroups (nodes) with respect to the change in the PROM scores and corresponding explanatory clinical status variables. Response shift is inferred when there is a discrepancy between clinical status and change in PROM scores or change in the relative importance of PROM domains (Sebille et al. 2021). | Recalibration: Inconsistent changes in PROM scores and clinical statu.s Reprioritization: Change in the order of importance of each domain over time. | Classification is based on the percentage of respondents identified as having had recalibration and/or reprioritization response shifts. |
3.1.2 Random forest regression Evaluates changes in the relative contribution of PROM domains to the prediction of an outcome over time in each group. The relative importance of each domain is assessed using the average variable importance (AVI), which is the relative contribution of a domain to the prediction of an outcome in a CART averaged across several bootstrap samples. The change in the AVI for each component domain in predicting a global PROM score over time for each group is examined. Response shift is indicated by crossing curves [8]. | Reprioritization: Interaction between change in AVI for different domains. | Classification is based on the percentage of respondents identified as having had reprioritization response shift. |
3.1.3 Mixed Models and Growth Mixture Models Mixed (random effects) models are used to obtain residuals of observed minus predicted PROM scores, after which growth mixture models are used to identify latent classes of the centered residuals’ growth trajectories. Response shift is inferred when there is change in centered residuals over time [8]. | General response shift effect: Discrepancy between observed and predicted scores (centered residuals having a pattern of fluctuation over time deviating from zero). Reprioritization: Effects of domain scores on global PROM scores that vary with time (i.e., interaction with time). | Classification is based on the percentage of respondents identified as having had (general or reprioritization) response shifts. |
3.2 Regression methods without classification Use of regression models that do not allow for classification | ||
3.2.1 Relative importance analysis Application of logistic regression or discriminant analysis to rank PROM domains based on their relative importance in discriminating between groups. Response shift is inferred based on changes in relative importance or rank ordering of the PROM domains [8]. | Reprioritization: Change in relative importance (logistic regression or discriminant analysis coefficients) of PROM domains over time. | Effect sizes were not reported and could not be calculated based on information reported in the included studies. |
3.2.2 Other regression methods without classification A variety of study-specific applications of regression models to test for response shift effects as defined by the researchers. | Unique for each study. | |
4 Other study-specific methods Methods that are unique to a particular study (and have not been applied in other studies). This includes various combinations of other design-based methods and other statistical methods. | Unique for each study. | Effect sizes were not reported and could not be calculated based on information reported in the included studies. |
Methods
Search strategy and eligibility criteria
Data extraction
Study-level results | Effect-level results | |||||||
---|---|---|---|---|---|---|---|---|
Total effects | Total effects | Recalibration | Reprioritization and/or Reconceptualization, or Unknowna | |||||
N | % RS detected | N | % RS detected | N | % RS detected | N | % RS detected | |
Design-based methods | ||||||||
Then-test | 82 | 86.6 | 1004 | 39.2 | 1004 | 39.2 | n/a | n/a |
Individualized methods | 12 | 100 | 31 | 74.2 | 9 | 44.4 | 22 | 86.4 |
Other methods | 11 | 72.7 | 214 | 10.7 | 12 | 50.0 | 202 | 8.4 |
Latent-variable models | ||||||||
SEM | 44 | 79.5 | 3139 | 7.7 | 986 | 16.4 | 2153 | 3.7 |
IRT/Rasch | 3 | 100 | 81 | 25.9 | 61 | 13.1 | 20 | 65.0 |
Regression methods | ||||||||
With classification | 11 | 81.8 | 44 | 81.8 | 8 | 100.0 | 36 | 77.8 |
Without classification | 13 | 76.9 | 351 | 14.5 | 5 | 40.0 | 346 | 14.2 |
Other study-specific methods | 4 | 50.0 | 4 | 50.0 | n/a | n/a | 4 | 50.0 |
Total | 150 | 86.7 | 4868 | 16.3 | 2085 | 28.0 | 2783 | 7.5 |
RS metric and method | Study-level results | Effect-level results | ||||||||
---|---|---|---|---|---|---|---|---|---|---|
Total effects | Total effects | Recalibration effects | Reprioritization and/or Reconceptualization effects, or unknowna | |||||||
Prevalence | Prevalence | Prevalence | Magnitude | Prevalence | Magnitude | |||||
N | % RS detected | N | % RS detected | N | % RS detected | ES | N | % RS detected | ES | |
Cohen’s db | 91 | 89.0 | 1062 | 46.1 | Median (IQR) | Median (IQR) | ||||
Design-based methods: then-test | 72 | 87.5 | 929 | 40.4 | 929 | 40.4 | 0.22 (0.10–0.38) | n/a | n/a | n/a |
Design-based methods: individualizedc | 1 | 100.0 | 7 | 28.6 | 7 | 28.6 | 0.03 (0.03–0.11) | n/a | n/a | n/a |
Design-based methods: other | 3 | 66.7 | 15 | 46.7 | 4 | 50.0 | 0.28 (0.09–0.45) | 11 | 45.5 | 0.09 (0.05–0.17) |
Latent-variable models: SEM | 19 | 100.0 | 111 | 95.5 | 84 | 98.8 | 0.22 (0.14–0.35) | 27 | 85.2 | 0.10 (0.10–0.14) |
R-squared: median (min–max) | 2 | 100.0 | 27 | 25.9 | median (IQR) | median (min–max) | ||||
Regression without classification | 2 | 100.0 | 27 | 25.9 | n/a | n/a | n/a | 27 | 25.9 | 0.01 (0.00–0.02) |
Classification: % respondents with RS | 17 | 100.0 | 27 | 100.0 | % sample with RS | % sample with RS | ||||
Design-based methods: individualized | 8 | 100.0 | 13 | 100.0 | n/a | n/a | n/a | 13 | 100.0 | 68.2 |
Design-based methods: other | 1 | 100.0 | 1 | 100.0 | n/a | n/a | n/a | 1 | 100.0 | 48.6 |
Regression with classification | 6 | 100.0 | 11 | 100.0 | 2 | 100.0 | 30.3 | 9 | 100.0 | 15.2 |
Other study-specific methods | 2 | 100.0 | 2 | 100.0 | n/a | n/a | n/a | 2 | 100.0 | 13.3 |
Other effect size metric | 3 | 66.7 | 14 | 92.9 | ||||||
Design-based methods: Individualized | 1 | 100.0 | 1 | 100.0 | 1 | 100.0 | n/a | 0 | n/a | n/a |
Latent-variable models: SEM | 1 | 100.0 | 12 | 100.0 | 7 | 100.0 | n/a | 5 | 100.0 | n/a |
Other study-specific methods | 1 | 0.0 | 1 | 0.0 | n/a | n/a | n/a | 1 | 0.0 | n/a |
Population characteristics | Study-level results | Effect-level results | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Total effects | Total effects | Recalibration effects | Reprioritization and/or Reconceptualization effects, or unknowna | |||||||||
Prevalenceb | Prevalenceb | Prevalenceb | Magnitudec | Prevalenceb | Magnitudec | |||||||
N | % RS detected | N | % RS detected | N | % RS detected | N | Median ES (IQR) | N | % RS detected | N | Median ES (IQR) | |
Sex | ||||||||||||
Mixed | 121 | 85.1 | 3734 | 14.5 | 1428 | 26.1 | 522 | 0.23 (0.10–0.43) | 2306 | 7.2 | 30 | 0.09 (0.01–0.17) |
Only female | 13 | 84.6 | 724 | 23.3 | 473 | 32.1 | 381 | 0.17 (0.09–0.28) | 251 | 6.8 | n/a | n/a |
Only male | 12 | 66.7 | 206 | 25.7 | 135 | 34.1 | 101 | 0.31 (0.22–0.42) | 71 | 9.9 | 6 | 0.12 (0.05–0.25) |
Other/unknown | 8 | 100.0 | 204 | 14.7 | 49 | 26.5 | 20 | 0.36 (0.19–0.57) | 155 | 11.0 | 2 | 0.12 (0.10–0.14) |
Age | ||||||||||||
Mostly adults | 100 | 84.0 | 3172 | 17.7 | 1410 | 27.6 | 754 | 0.21 (0.10–0.36) | 1762 | 9.7 | 17 | 0.14 (0.10–0.22) |
Mostly older adults | 34 | 88.2 | 825 | 16.6 | 342 | 30.7 | 184 | 0.24 (0.10–0.44) | 483 | 6.6 | 16 | 0.05 (0.02–0.12) |
Mostly children/adolescents | 8 | 87.5 | 227 | 12.3 | 93 | 30.1 | 28 | 0.04 (0.00–0.14) | 134 | 0.0 | n/a | n/a |
Other/unknown | 11 | 90.9 | 644 | 10.4 | 240 | 25.8 | 58 | 0.28 (0.20–0.53) | 404 | 1.2 | 5 | 0.03 (0.01–0.06) |
Medical condition | ||||||||||||
No | 9 | 88.9 | 201 | 12.4 | 47 | 38.3 | 25 | 0.23 (0.12–0.41) | 154 | 4.5 | 7 | 0.05 (0.04–0.09) |
Yes: cancer | 45 | 93.3 | 1961 | 18.6 | 1054 | 30.3 | 665 | 0.22 (0.10–0.35) | 907 | 5.1 | 14 | 0.08 (0.01–0.14) |
Yes: orthopedic | 9 | 77.8 | 78 | 52.6 | 77 | 51.9 | 77 | 0.43 (0.23–0.82) | 1 | 100.0 | n/a | n/a |
Yes: stroke | 11 | 90.9 | 520 | 9.2 | 152 | 13.2 | 21 | 0.28 (0.18–0.55) | 368 | 7.6 | 4 | 0.08 (0.05–0.12) |
Yes: mental health | 9 | 77.8 | 472 | 5.9 | 159 | 10.7 | 1 | 0.49 (0.49–0.49) | 313 | 3.5 | n/a | n/a |
Yes: other | 67 | 83.6 | 1636 | 17.4 | 596 | 28.5 | 235 | 0.18 (0.09–0.31) | 1040 | 11.1 | 13 | 0.17 (0.08–0.25) |
Intervention | ||||||||||||
No/unclear | 48 | 83.3 | 1837 | 10.6 | 605 | 18.5 | 217 | 0.21 (0.10–0.37) | 1232 | 6.7 | 13 | 0.09 (0.05–0.14) |
Yes: medical | 69 | 88.4 | 2315 | 21.0 | 1176 | 33.3 | 689 | 0.22 (0.11–0.38) | 1139 | 8.3 | 17 | 0.03 (0.00–0.11) |
Yes: psychological | 20 | 90.0 | 567 | 10.9 | 222 | 19.8 | 65 | 0.23 (0.11–0.41) | 345 | 5.2 | n/a | n/a |
Yes: other/unspecified | 13 | 84.6 | 149 | 32.9 | 82 | 43.9 | 53 | 0.16 (0.09–0.31) | 67 | 19.4 | 8 | 0.20 (0.10–0.39) |
Study design characteristics | Study-level results | Effect-level results | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Total effects | Recalibration effects | Reprioritization and/or Reconceptualization effects, or unknowna | ||||||||||
Prevalenceb | Prevalenceb | Prevalenceb | Magnitudec | Prevalenceb | Magnitudec | |||||||
N | % RS detected | N | % RS detected | N | % RS detected | N | Median ES (IQR) | N | % RS detected | N | Median ES (IQR) | |
Design | ||||||||||||
Observational | 122 | 88.5 | 3978 | 16.2 | 1724 | 28.1 | 879 | 0.21 (0.10–0.36) | 2254 | 7.1 | 33 | 0.10 (0.04–0.17) |
Experimental | 28 | 78.6 | 890 | 16.5 | 361 | 27.7 | 145 | 0.26 (0.16–0.41) | 529 | 8.9 | 5 | 0.03 (0.01–0.06) |
Data analysis | ||||||||||||
Primary analysis | 90 | 87.8 | 1590 | 30.2 | 1058 | 39.7 | 872 | 0.22 (0.11–0.38) | 532 | 11.3 | 15 | 0.14 (0.08–0.25) |
Secondary analysis | 59 | 84.7 | 3268 | 9.5 | 1027 | 16.0 | 152 | 0.19 (0.09–0.32) | 2241 | 6.6 | 23 | 0.05 (0.02–0.10) |
Unknown | 1 | 100.0 | 10 | 10.0 | n/a | n/a | n/a | n/a | 10 | 10.0 | n/a | n/a |
Sample sizes (Binned) | ||||||||||||
Q1 (< 57) | 59 | 76.3 | 1121 | 16.1 | 584 | 23.3 | 391 | 0.26 (0.13–0.45) | 537 | 8.4 | 5 | 0.03 (0.01–0.06) |
Q2 (57–254) | 79 | 82.3 | 1353 | 23.4 | 668 | 35.5 | 393 | 0.22 (0.10–0.38) | 685 | 11.5 | 13 | 0.14 (0.10–0.20) |
Q3 (255–410) | 34 | 85.3 | 1262 | 15.8 | 472 | 32.0 | 194 | 0.15 (0.09–0.24) | 790 | 6.2 | 8 | 0.10 (0.07–0.35) |
Q4 (> 411) | 27 | 77.8 | 1132 | 8.4 | 361 | 16.6 | 46 | 0.20 (0.12–0.35) | 771 | 4.5 | 12 | 0.05 (0.02–0.13) |
Time period classification | ||||||||||||
< 1 month | 15 | 80.0 | 488 | 13.1 | 205 | 24.9 | 95 | 0.23 (0.09–0.32) | 283 | 4.6 | 7 | 0.01 (0.00–0.03) |
1–6 months | 23 | 91.3 | 427 | 19.9 | 110 | 43.6 | 49 | 0.40 (0.22–0.71) | 317 | 11.7 | 11 | 0.05 (0.04–0.10) |
> 6–12 months | 36 | 72.2 | 905 | 12.3 | 347 | 21.0 | 78 | 0.18 (0.09–0.39) | 558 | 6.8 | n/a | n/a |
> 12 months | 90 | 87.8 | 2604 | 18.2 | 1211 | 29.7 | 733 | 0.22 (0.11–0.38) | 1393 | 8.1 | 16 | 0.13 (0.09–0.28) |
Not reported | 12 | 83.3 | 444 | 13.3 | 212 | 24.5 | 69 | 0.16 (0.10–0.28) | 232 | 3.0 | 4 | 0.18 (0.13–0.24) |
PROM characteristics | Study-level results | Effect-level results | |||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Total effects | Recalibration effects | Reprioritization and/or Reconceptualization effects, or Unknowna | |||||||||||
Prevalenceb | Prevalenceb | Prevalenceb | Magnitudec | Prevalenceb | Magnitudec | ||||||||
N | % RS detected | N | % RS detected | N | % RS detected | N | ES (IQR) | N | % RS detected | N | ES (IQR) | ||
PROM types | |||||||||||||
Generic PROMsd | 76 | 84.2 | 1971 | 14.7 | 769 | 23.0 | 324 | 0.23 (0.10–0.41) | 1202 | 9.3 | 18 | 0.10 (0.01–0.14) | |
#1 SF familye | 47 | 83.0 | 1248 | 16.3 | 428 | 25.2 | 182 | 0.23 (0.11–0.44) | 820 | 11.6 | 18 | 0.10 (0.01–0.14) | |
#2 EQ 5De | 14 | 78.6 | 112 | 19.6 | 76 | 18.4 | 64 | 0.20 (0.08–0.32) | 36 | 22.2 | n/a | n/a | |
#3 Other | 24 | 75.0 | 611 | 10.5 | 265 | 20.8 | 78 | 0.26 (0.11–0.41) | 346 | 2.6 | n/a | n/a | |
Disease-specific PROMsd | 57 | 80.7 | 1431 | 20.8 | 755 | 33.8 | 457 | 0.19 (0.10–0.33) | 676 | 6.4 | 4 | 0.17 (0.09–0.47) | |
#1 EORTC familye | 17 | 88.2 | 616 | 25.2 | 404 | 35.4 | 315 | 0.17 (0.09–0.28) | 212 | 5.7 | 4 | 0.17 (0.09–0.47) | |
#2 Oral impact profilee | 4 | 75.0 | 83 | 33.7 | 48 | 43.8 | 27 | 0.21 (0.17–0.29) | 35 | 20.0 | n/a | n/a | |
#3 Other | 37 | 78.4 | 732 | 15.7 | 303 | 30.0 | 115 | 0.29 (0.12–0.48) | 429 | 5.6 | n/a | n/a | |
Individualized PROM | 10 | 90.0 | 30 | 86.7 | 8 | 87.5 | 6 | 0.23 (0.20–0.30) | 22 | 86.4 | n/a | n/a | |
Other type of PROM | 46 | 69.6 | 1436 | 12.5 | 553 | 26.2 | 275 | 0.25 (0.13–0.37) | 883 | 3.9 | 16 | 0.07 (0.03–0.14) | |
PROM domains | |||||||||||||
General health/QOL | 95 | 68.4 | 561 | 28.0 | 295 | 32.2 | 182 | 0.23 (0.09–0.41) | 266 | 23.3 | 3 | 0.14 (0.14–0.20) | |
Physical | 96 | 70.8 | 1823 | 15.5 | 799 | 27.5 | 429 | 0.20 (0.10–0.35) | 1024 | 6.1 | 16 | 0.08 (0.02–0.28) | |
Psychological: depression | 8 | 62.5 | 153 | 11.1 | 58 | 27.6 | 12 | 0.27 (0.21–0.33) | 95 | 1.1 | n/a | n/a | |
Psychological: other | 85 | 62.4 | 1121 | 13.5 | 440 | 25.0 | 199 | 0.23 (0.11–0.33) | 681 | 6.0 | 12 | 0.09 (0.03–0.13) | |
Social | 57 | 59.6 | 534 | 13.7 | 203 | 27.1 | 78 | 0.23 (0.13–0.35) | 331 | 5.4 | 5 | 0.10 (0.05–0.11) | |
Pain | 48 | 64.6 | 236 | 28.0 | 104 | 55.8 | 76 | 0.33 (0.16–0.54) | 132 | 6.1 | n/a | n/a | |
Other | 30 | 56.7 | 440 | 10.5 | 186 | 16.1 | 86 | 0.15 (0.07–0.27) | 254 | 6.3 | 2 | 0.05 (0.00–0.09) |