Introduction
Juvenile idiopathic arthritis (JIA) encompasses all forms of arthritis with unknown causes starting before 16 years old [
1,
2]. It is the most common chronic rheumatic disease in children [
1,
2], affecting approximately 1 per 1,000 Canadian children and 0.07–4.01 per 1,000 children worldwide [
3,
4]. Different subtypes vary in clinical presentation, pathogenesis, and prognosis; however, they have common symptoms, such as morning joint stiffness, joint pain, and joint swelling [
5]. If untreated, JIA can lead to joint damage, functional limitation, and severe disability [
6]. There is no cure, and treatment mainly manages pain and inflammation through a combination of pharmacological, physical, and psychosocial therapies to control symptoms and avoid joint damage [
4,
7]. Among pharmaceutical treatments, the relatively-new biologic disease-modifying antirheumatic drugs, commonly referred to as biologics, have shown high efficacy for disease remission and preventing long-term disability, however, they have side effects, high costs, and unknown long-term safety [
7,
8].
Patient-reported outcome measures (PROMs) are used to assess functional status, pain, and health-related quality of life (HRQL) for JIA patients in clinical settings. They provide an inclusive picture of health status and help clinicians understand the impact of JIA on children’s everyday life and the life course [
9‐
11]. Measuring PROMs reflects whether clinical care makes patients feel better [
12,
13]. The commonly used JIA-specific PROMs, such as the Childhood Health Assessment Questionnaire (CHAQ), incorporate outcomes of patients’ direct interest, and can provide an effective measurement of health status and predict patient outcomes [
12,
14,
15].
Several generic pediatric HRQL measures including the EQ-5D-Y-3L have been validated and used among children with JIA [
16]. Valuation studies on the EQ-5D-Y-3L, published or underway, support the use of EQ-5D-Y-3L in future economic evaluation in JIA, which is important given the high costs of JIA treatments [
16,
17]. The EQ-5D-Y-3L has only 3 levels in each dimension, potentially leading to large ceiling effects [
18]. The EQ-5D-Y-5L, still experimental, is a 5-level version revised from the EQ-5D-Y-3L that aims to reduce ceiling effects and enhance sensitivity [
19]. It has been validated in several disease areas and the general population, mainly among children aged 8 years old or more, via the self-report or the caregiver-proxy (reported by a caregiver on behalf of the children) version, and found to have less ceiling effects, better responsiveness and discriminate power (versus the EQ-5D-Y-3L), good test-retest reliability, convergent validity, and known-group validity [
20‐
27]. Among studies exploring the psychometric properties of the caregiver-proxy version, good agreement between the proxy and the self-report version was identified [
25,
26]. There remains a knowledge gap in the performance of EQ-5D-Y-5L among children under 8 years old, and the psychometric properties of the EQ-5D-Y-5L among patients with JIA of all ages.
As young children have difficulty completing the PROM by themselves, a proxy version is required. For both the CHAQ and EQ-5D-Y, the age threshold for mandatory proxy-version is 8 years [
28,
29]. Many JIA patients have the disease onset at a very young age (2–4 years) [
30]. Valid tools are necessary to assess their HRQL before age 8.
This paper focuses on assessing the psychometric properties of the parent-proxy version of the EQ-5D-Y-5L among patients with JIA in terms of its ceiling effect, construct validity (convergent validity, divergent validity, and known-group validity), and informativity.
Discussion
This is the first study exploring the psychometric properties of the EQ-5D-Y-5L parent-proxy version among JIA patients. We used a heterogenous sample and found that the EQ-5D-Y-5L demonstrated a good convergent and divergent validity versus the CHAQ parent-proxy version and the ability to discriminate known groups defined by disease characteristics, functional ability, pain, and wellbeing, suggesting the future use of EQ-5D-Y-5L to assess HRQL for JIA patients.
We had a robust sample size representing a diverse range of patients. The base-case analytical sample included data from 467 visits representing 407 patients, which is more than previous EQ-5D-Y-5L validation studies [
20‐
26]. The sample size for the correlation analysis was 370 visits, and ranged from360-467 for the known-group analyses. There was mild to moderate disease burden among the study population, based on various disease activity measures, and the functional ability, pain, and well-being assessed by the CHAQ. This is consistent with findings from other studies: the advancement of JIA management in recent years has led to a decrease in symptoms and disease activity and better overall assessment [
16]. The HRQL measured by the EQ-5D-Y-5L reflects a similar pattern. The estimated mean and median of EQ-5D-Y-5L level summary score were around 9, which represents 4 dimensions with mild problems, 2 dimensions with moderate problems, or 1 dimension with severe problems (with the rest of the dimensions having no problems).
We examined the convergent and divergent validity of the EQ-5D-Y-5L by comparing its dimensions with the CHAQ dimensions. Each CHAQ dimension assesses the capability of a patient to perform tasks in that functional area. Only the “eating” dimension did not have a directly corresponding dimension in the EQ-5D-Y-5L that measured similar or the same constructs. For other functional areas, although some seem not semantically linked to any EQ-5D-Y-5L dimensions, e.g., “reach”, these dimensions had a wide range of tasks defining the functional ability, and it turns out that each dimension has one or more tasks reflecting similar constructs as the EQ-5D-Y-5L dimensions. In the “reach” dimension of the CHAQ, four tasks were included: (1) “Reach and get down a heavy object…”; (2) “Bend down to pick up…”; (3) Pull on a sweater over his/her head; (4) Turn neck to look back over shoulder. Based on the wording of these tasks, the “reach” dimension theoretically relates to the EQ-5D-Y-5L “looking after myself” and “doing usual activities” dimensions. Therefore, we were able to thoroughly analyze the convergent and divergent validity of the EQ-5D-Y-5L among JIA patients. The EQ-5D-Y-5L demonstrated good convergent and divergent validity. This reflected that though the EQ-5D-Y-5L, a generic measure, could not directly assess the JIA-specific functional ability, worse (or better) EQ-5D-Y-5L dimensional responses likely indicate that patients experience more (or less) difficulty in the corresponding functional areas.
We performed extensive known-group analyses to assess whether the EQ-5D-Y-5L could differentiate patients with known differences in HRQL, and the EQ-5D-Y-5L performed well in the specified known groups. 88% of the statistical tests were in accordance with the pre-specified hypotheses, surpassing the 75% threshold thus supporting that known-group validity is established [
38]. In post-hoc pairwise comparisons where the known group contains three or more categories, Bonferroni correction was used, which reduced the chance of Type I error. Furthermore, the differences in HRQL across categories were significant at
p < 0.001 level in most of the known groups, together with a moderate to large effect size.
Validation studies on the EQ-5D-Y-5L (not in the JIA population) made head-to-head comparison with the EQ-5D-Y-3L and show that the EQ-5D-Y-5L can decrease the ceiling effect and increase the informativity, compared with the EQ-5D-Y-3L [
20‐
24,
26]. In our study, as the data collection is still ongoing and there is a limited sample with both instruments, we have not compared the 5L instrument with the 3L instrument, however, this can be explored in the next steps of this research. Using the 15% threshold [
38], we found the EQ-5D-Y-5L has no ceiling/floor effect when measuring HRQL among patients with JIA. In terms of informativity, the reported Shannon’s evenness index in other EQ-5D-Y-5L validation studies ranged from 0.10 to 0.73 [
20,
21,
24], and our study population reported 0.52–0.88, indicating more even distributions. Based on the ceiling/floor effect and the informativity statistics, the EQ-5D-Y-5L parent-proxy version has the informational richness to assess JIA patients.
JIA can be diagnosed at a very young age, and our study did recruit some patients younger than 4 years old. For these patients, EQ-5D-Y-5L is not recommended to use as some dimensions might not be appropriate for infants and toddlers. As such, we excluded children aged 1–3 years old in the base case and included them in the sensitivity analysis. We did not observe many differences in the psychometric properties between the two samples. Patients under 4 years old accounted for 14% of the sample, so this subpopulation possibly did not affect the overall results in a significant way. According to another study that solely analyzed the performance of the EQ-5D-Y-3L proxy version among young children (aged 3–5), HRQL expressed by the summary scores of the EQ-5D-Y-3L worked well and the known-group validity can be demonstrated in this young population. However, the “looking after myself” dimension is problematic for children aged 3 years old due to age-related difficulties [
56]. This is consistent with our findings, as we also identified more problems reported in the “looking after myself” dimension in the sensitivity analysis. The EuroQol Research Foundation is developing EQ-TIPs [
57], an instrument for infants aged 0–3 years old, to better measure HRQL among this population. The EQ-TIPS has an “eating” dimension, as eating is an important aspect of an infant and toddler’s life and a large focus of attention for parents [
58]. Exploring the validity of EQ-TIPS on JIA patients is warranted considering the age characteristics of this population.
This study has strengths and limitations. The study sample was collected from national JIA cohorts in Canada and the Netherlands, which include many pediatric rheumatology clinics in these two countries. Recruited patients had various disease subtypes, were at different disease progression stages, and had diverse treatment experiences. There is also a great proportion of patients with relatively severe disease activities and disabilities. This suggests good generalizability of our study to JIA patients in western countries. However, we did not have enough data from patients aged 4–7 years old to analyze purely on this population. Patients aged 8 years and older dominated the study sample, and the psychometric performance of EQ-5D-Y-5L would be more determined by the older children. Therefore, the psychometric properties we observed might not provide the true picture among younger children. Also, the CHAQ does not have a dimension regarding mental health. We were unable to justify whether the “feeling worried, sad, or unhappy” dimension of the EQ-5D-Y-5L is valid to measure mental health conditions of the JIA population, although this dimension had a good divergent validity to all CHAQ dimensions.
Some clinical variables and the CHAQ used in the validation study had some missing values. The cJADAS10 disease activity score (23% missing) and the three CHAQ index (disability index (21%), health status (21%), and pain index (21%)) were the variables that had the most missing values. We used a complete dataset when we performed analyses with regard to those specific variable, and the sample size still ranged from 360 to 467.
The parent proxy is a recommended and common type of surrogate to rate the functional disability and disease activity among JIA population, especially for children younger than 8 years old [
29]. When comparing with the CHAQ parent-proxy version, we found the EQ-5D-Y-5L parent-proxy version is a valid instrument to measure HRQL of the JIA population. However, it is unclear whether the EQ-5D-Y-5L parent-proxy version performs better or worse compared to the self-reported version. In other pediatric disease areas, findings were diverse in terms of the agreement between the parent-proxy version and self-reported version of a PROM [
59,
60]. It is important to examine the parent-patient agreement of the EQ-5D-Y-5L among JIA population in future research.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.