A Systematic Review of the Use and Quality of Qualitative Methods in Concept Elicitation for Measures with Children and Young People

Husbands, Samantha; Mitchell, Paul Mark; Coast, Joanna

doi:10.1007/s40271-020-00414-x

A Systematic Review of the Use and Quality of Qualitative Methods in Concept Elicitation for Measures with Children and Young People

Systematic Review
Open access
Published: 29 April 2020

Volume 13, pages 257–288, (2020)
Cite this article

Download PDF

You have full access to this open access article

The Patient - Patient-Centered Outcomes Research Aims and scope Submit manuscript

A Systematic Review of the Use and Quality of Qualitative Methods in Concept Elicitation for Measures with Children and Young People

Download PDF

Samantha Husbands¹,
Paul Mark Mitchell¹ &
Joanna Coast¹

5508 Accesses
8 Citations
4 Altmetric
Explore all metrics

Abstract

Background

Qualitative research is recommended in concept elicitation for patient-reported outcome measures to ensure item content validity, and those developing measures are encouraged to report qualitative methods in detail. However, in measure development for children and young people, direct research can be challenging due to problems with engagement and communication.

Objectives

The aim of this systematic review was to (i) explore the qualitative and adapted data collection techniques that research teams have used with children and young people to generate items in existing measures and (ii) assess the quality of qualitative reporting.

Methods

Three electronic databases were searched with forward citation and reference list searching of key papers. Papers included in the review were empirical studies documenting qualitative concept elicitation with children and young people. Data on qualitative methods were extracted, and all studies were checked against a qualitative reporting checklist.

Results

A total of 37 studies were included. The quality of reporting of qualitative approaches for item generation was low, with information missing on sampling, data analysis and the research team, all of which are key to facilitating judgements around measure content validity. Few papers reported adapting methods to be more suitable for children and young people, potentially missing opportunities to more meaningfully engage children in concept elicitation work.

Conclusions

Research teams should ensure that they are documenting detailed and transparent processes for concept elicitation. Guidelines are currently lacking in the development and reporting of item generation for children, with this being an important area for future research.

Qualitative Research: Ethical Considerations

How to use and assess qualitative research methods

Article Open access 27 May 2020

Doing Reflexive Thematic Analysis

FormalPara Key Points for Decision Makers

The use of qualitative research for concept elicitation is important to ensuring the content validity of patient-reported outcome measures.
The quality of the reporting of qualitative concept elicitation for child and young person measures was generally poor, making judgements around the content validity of measure items challenging.
Few measures reported adapting their data collection techniques to be more suitable for children and young people, potentially missing opportunities to more meaningfully engage this population in item development, particularly younger children.
Those developing measures for children and young people would benefit from clear guidelines on how to undertake and report qualitative methods for concept elicitation.

1 Introduction

The process of healthcare decision making, specifically measuring and comparing the clinical and cost effectiveness of healthcare technologies, interventions or services, can be facilitated through the development and use of patient-reported outcome measures (PROMs). PROMs are questionnaires designed to capture the clinical and broader outcomes of treatments from the perspectives of patients [1]. They comprise items that should be designed to represent the concepts and outcomes most important to the population in which a measure will be used. Empirical work to develop measure items will be referred to here as ‘concept elicitation’ [2] but can also be known as conceptual attribute development [3,4,5]. Patients are asked to complete PROMs before and after receiving an intervention to record any differences in their outcomes as a result. The focus of a measure’s items will vary according to whether a measure has been developed for use in a specific disease area (condition-specific) or for generic use, with the latter facilitating the comparison of patient outcomes across a broad range of health and social care conditions [1].

An important consideration for all PROMs is to ensure that the contained items are relevant and sensitive to changes in aspects such as the health or well-being of that population [6]. Guidance on PROM development from the US Food and Drug Administration (FDA) [5] and the International Society for Pharmacoeconomics and Outcomes Research (ISPOR) task force [7] suggests that qualitative, empirical research with the target population is essential to establishing a measure’s content validity, that is, whether it adequately captures the items of interest [8]. The goal of qualitative research is typically to understand a phenomenon from the perspectives of those who are knowledgeable, experienced or involved [9], and qualitative data are most commonly generated through listening to the views and experiences of participants. The FDA emphasise the importance of reaching data saturation for items, that is, ensuring that they achieve full coverage of all aspects important to a population and decision-making context. The importance of clear reporting of the qualitative development of these measures is also emphasised (e.g., [5, 10]) to allow users (i.e., clinicians, researchers, decision makers etc.) to decide on a measure’s content validity and how suitable it is for use.

The FDA give specific advice on PROM development in children and adolescents, centred around content validity and ensuring that measures can be understood and completed by children and young people (CYP) [10]. However, direct research with CYP can prove challenging for PROMs development [11]. This is because traditional qualitative methods are typically very adult-orientated and less appropriate for use with children, particularly with young children and those not able to articulate their opinions using formal or language-based methods [12,13,14]. Arbuckle and Abetz-Webb [11] suggest that further challenges include engaging children in research activities and finding methods that are appropriate to meet the different age and developmental abilities of CYP. Rowen and colleagues (2020) note similar issues with asking CYP to provide values for items for preference-based measures, with concerns around their understanding and ability to address the complexity of elicitation tasks [15]. This raises questions around whether and how researchers are developing items for PROMs with the CYP population, including how they are overcoming issues with involving CYP in direct research and how they are ensuring the generation of sensitive and valid measures.

This paper presents a systematic review of empirical studies documenting the development of measures using qualitative methods with CYP. The review has two aims: (i) to explore the qualitative methods that research teams have used with CYP to develop measure items, and whether methods have been adapted to suit the age and developmental needs of the population; and (ii) to explore the quality of the reporting of these methods. The discussion section of the paper synthesises the main findings from the retrieved studies and makes comparisons between what is being carried out in practice and the limited guidance available on CYP PROM development, as well as reporting standards in qualitative research generally.

2 Methods

2.1 Search Strategies for Studies

With a focus on exploring the qualitative approaches taken with CYP for concept elicitation, the search was designed to retrieve a breadth of papers, including condition-specific and generic measures. The search combined electronic database searching, reference list and forward citation searching of key papers and using existing systematic reviews of CYP measures to identify whether any of the measures featured had reported the use of qualitative methods in item development [16,17,18].

Three relevant electronic databases were searched: PubMed (includes MEDLINE), EMBASE and EconLit, with no limits on dates. The search was updated in November 2019. Search terms were developed in PubMed and adapted slightly to maximise sensitivity within each database. The terms used combined the population of interest (children and young people) with variations on the possible focus and outcomes of the developed measures (i.e., an economic, quality-of-life or well-being focus), with alternative terms for the methodological approach taken to measure development, centred around the language used in the FDA PROM development guidance (i.e., qualitative, qualitative research). The search terms developed for use in the electronic databases are detailed in Appendix 1 (see electronic supplementary material [ESM]). The ‘find citing articles’ feature of electronic journals was used to identify other studies that had cited key papers. Key papers for forward citation and reference list searching were studies that included a higher level of detail on the qualitative methods for item development, in anticipation that other papers may have followed and cited their work [19,20,21,22,23,24].

2.2 Selection Method

The lead reviewer (SH) screened the title and abstracts of each paper identified through the search. If the abstract did not contain enough information to make a judgement on its relevance, the full-text version of the paper was downloaded. All duplicate articles were excluded. An independent reviewer (PMM) screened a proportion (5%) of all paper abstracts in one electronic database (PubMed) against the inclusion and exclusion study criteria to ensure agreement and consistency in the papers included. The independent screening of the abstracts encouraged the authors to clarify which studies were and were not considered relevant against the inclusion and exclusion criteria.

2.3 Study Inclusion and Exclusion Criteria

Studies were included in the review if they were (i) empirical studies documenting the development of the items of a measure using qualitative research with CYP and (ii) were developing a measure for use with CYP aged between 0 and 18 years. Excluded studies included non-English language articles, review articles, methodological guidelines and research protocols. Studies were excluded if they only reported using qualitative methods for validation of items (rather than development) or if they only briefly cited or discussed linked and already existing/published item development work—although any linked articles were then searched (via Google Scholar) for possible inclusion in the review. Excluded studies extended to those that were found to be superseded by papers with more detail available on the qualitative concept elicitation work, if existing papers focused on the development of the same measure and no information important to the review was sacrificed. Finally, studies were excluded if they also involved those over the age of 18 years or if the qualitative research was undertaken with parents/guardians or families only, that is, no CYP were directly involved in the concept elicitation.

2.4 Data Extraction and Quality of Reporting of Qualitative Methods

Data were extracted from each article into a data extraction form (see Appendix 2 [in ESM]) to ensure that the same information was captured for all studies [25]. Details recorded for all articles were the author(s) and paper characteristics (i.e., year, title and paper objective). Information was also recorded on the measure name, the type of measure (i.e., condition-specific, generic), the age of the CYP the measure was developed for and whether parents/guardians had been involved in development work. Information was documented on the qualitative methods used and studies were assessed for quality using principles from the 32-item ‘Consolidated criteria for reporting qualitative research (COREQ)’ tool [26], which focuses on the adequacy of reporting provided on the research team and reflexivity (i.e., reflections on how a researcher’s personal and professional biases may affect research processes and outcomes [27]), study design and the analysis of findings. Details on the qualitative research in the data extraction form was collected under the following headings: information available on sampling, qualitative methods used, approach to analysis and positive and negative reflections on the methods (both the authors’ and the reviewer’s [SH]). The form also collected details on whether any other methods were used (aside from qualitative) to develop the items. Data extraction was completed independently by a second author for 20% of publications, as was the quality check through the COREQ checklist (PMM).

2.5 Synthesis of Results

Microsoft Excel was used to tabulate the extracted data. The data were then summarised and collated into a narrative report to describe the findings. After a summary of the paper characteristics, information from the articles were synthesised under two themes: (i) an overview of the qualitative approach used in CYP concept elicitation and (ii) the quality of reporting in concept elicitation for CYP.

3 Results

3.1 Search Results

The search strategy retrieved 5072 papers; nine duplicates were removed. After screening article abstracts and titles and full-text versions of the 70 articles retrieved, a total of 37 studies met the inclusion criteria and were included in the review. Of these, 29 were identified through electronic databases and eight through other means. One study retrieved in the review [28] was found to have a ‘sister’ paper that contained additional detail on the qualitative item development work but predated any specific CYP measure development [29]. Information from both studies were used to inform the review, but for clarity, were treated as one record [28]. The search process is documented in Fig. 1 and the full paper characteristics for the included papers are in Table 1. The result of the independent review of a proportion of all abstracts screened (n = 251) by two reviewers was an agreement of 99.6% abstracts to include/exclude (kappa statistic inter-rater agreement of 0.67, rated as ‘good’ [30]). There was no disagreement between SH and PMM regarding the accuracy and completeness of data extracted in the selected proportion of papers, including completion of the COREQ checklists.

Table 1 Retrieved paper characteristics

Full size table

3.1.1 Characteristics of Included Studies

All included studies had a similar aim: to document the development of a measure for children and/or young people. However, the studies differed in terms of how much of a focus there was on reporting the methods for, and results of, the development of the items. Two thirds of the papers discussed the quantitative psychometric validation and development of items, although this was in varying detail, and only seven focused solely on item development. Most studies aimed to develop a condition-specific measure (31/37), with many for use with specific diseases but some also designed for use generically across disease areas, for example, chronic conditions [31, 32]. Six studies reported on the development of generic measures for quality of life or health-related quality of life of CYP [19, 33,34,35,36,37]. Although most studies focused on measuring quality of life in CYP, others also aimed for the measure to be suitable for use in cost-effectiveness analyses and as a preference-based measure [19,20,21, 32].

Almost two-thirds of the studies used other approaches in addition to qualitative methods to develop items. These studies mostly used literature searches, searches for existing relevant measures and consultations with experts. The exceptions were two studies that used the experience of the research team/authors to decide on the factors important to include [38, 39]. Five of the 22 studies suggested that the findings of these other methods were used to inform the direction of questioning or analysis framework for the qualitative inquiry. However, in most studies these additional methods appeared to be used alongside qualitative methods to either support or add information to the developed items, although it was often not clear how this synthesis of information worked. Two of the 15 studies using qualitative methods only suggested that they thought it optimal for the items to be informed solely by direct research with CYP [19, 23].

Most of the measures reported in the papers had been developed for adolescents (11/37), with the next most common being those developed for CYP aged 0–18 years (6/37) or older primary school-aged children to adolescents (i.e., those aged 8–18 years) (7/37). The remaining measures were developed for primary school-aged CYP aged 5–12 years (4/37), secondary school-aged CYP aged 10–15 years (3/37), all school-aged children aged 5–15 years (n = 1) or for use across childhood but excluding very young children aged 0–4 years (3/37). Two papers [33, 39] included unclear information on the age of CYP that their measures had been developed with and for, stating their population as ‘high school students’ and ‘adolescents’ respectively.

Most papers explicitly specified that their measures should only be used with the population that the items had been developed with through empirical work. However, six studies implied that the developed measures could potentially be useful in age groups outside of this. As an example, Varni et al. [34], Ronen et al. [23], McMillan et al. [40] and Gilchrist et al. [28] did not involve any CYP from the upper range of their stated age groups in item development, and Khadra et al. [41] had very little representation from CYP at the lower end. Graham et al. [36] suggested that their measure could potentially be suitable for completion by children (or parent proxies) as young as 5 years, despite the youngest child in their concept elicitation sample being 9 years old. This raises questions around how representative the items in these measures might be for these ‘missing’ age groups, although this is likely to depend on the context and focus of each measure.

Nineteen of the measures involved CYP’s parents/guardians or carers in item development either alongside CYP in paired interviews or focus groups, or in separate data collection. Four papers gave justification for involving parents or guardians, stating that their perspectives can offer additional valuable and valid insight into CYP’s quality of life [24, 42,43,44]. Others also mentioned practical reasons for involving them—to act as proxies in instances where CYP are not able to participate [20, 43, 45]. One third of the 19 measures involved CYP and parents/guardians separately in data collection where possible, with authors suggesting that this was important to allow CYPs’ individual opinions to emerge [23, 24, 43, 46, 47].

3.2 Overview of Qualitative Approach Used in Children and Young People (CYP) Concept Elicitation

3.2.1 Data Collection Methods

The majority (n = 21) of included studies used either in-depth or semi-structured qualitative interviews. Eight studies used focus group methods, and six used a combination of interviews and focus groups. One paper used the nominal group technique, where the aim was for participants to present ideas to the group relevant to the factors important to the quality of life of CYP with heart disease [48]. Participants were asked to rank the shared ideas in order of importance. This method differs from focus groups because members do not discuss (the importance of) research themes between themselves, but instead make judgements independently [49]. In the remaining study [33], the methods for data collection were not explicitly stated; however, it was implied that a qualitative approach (most likely focus groups) was used, as the authors described undertaking ‘group meetings’ with high school pupils for instrument development.

Several papers offered justification for their choice of method. Oluboyede et al. [21] discussed using interviews with adolescents to gather individual perspectives on how being obese/overweight affected their quality of life, with the authors suggesting that adolescents felt more confident discussing this on a one-to-one basis. A further four papers suggested that they selected interviews because it either allowed CYP a more comfortable environment to discuss issues, or because it encouraged them to reflect on how their own lives were affected by their condition [19, 24, 35, 36, 38]. Markham et al. [22] and Ronen et al. [23], however, suggested that they used focus groups with CYP because they provided a supportive and social setting that encouraged CYP to share ideas and experiences.

3.2.2 The Use of Adapted Data Collection Techniques with CYP

Only five of the 37 papers reported adapting data collection methods to make them more suited to CYP, which for all involved using traditional qualitative methods alongside other techniques designed to involve/engage CYP in research. In the case of Stevens [19], this was setting up a warm-up activity for the children, asking them to decorate name badges to help them to relax prior to being interviewed. The author decided against using props or activities during interviews as they thought it would distract from data collection. However, the remaining four papers used adapted techniques during data collection, including the use of pre-set picture cards [22], drawings [21] and statements [47] aimed at prompting discussion about aspects potentially relevant to CYP’s quality of life. For example, Oluboyede et al. [21] used body shape drawings with adolescent focus groups to encourage participants to consider how individuals with bigger body shapes might be affected by their size.

Two of the papers reported using creative/participatory methods with CYP, asking them to use modelling clay [23] and ‘life maps’ [47] to express ways in which their quality of life is affected by their conditions. In the latter study, CYP were asked to create a character who had a foot or ankle problem and think about and map how that character’s life would be affected by their condition at different times of the day (morning, school, home, weekends). Two studies discussed adapting techniques to the different age groups of CYP [22, 47], with younger CYP in the former study drawing rather than writing about their experiences, and younger children in the latter study taking part in games to select topics for discussion, rather than choosing topics at random as with the older children.

There was suggestion from the studies that those using creative and participatory methods were able to engage their relative CYP population for a longer time period. For example, Markham et al. [22], Morris et al. [47] and Ronen et al. [23] undertook focus groups with those aged as young as 6 years old that lasted from 45 up to 90 min. In contrast, focus groups with 5- to 13-year olds in the study by Gilchrist et al. [28] lasted only 12–14 min. In studies using interviews, Gilchrist et al. [28] carried out interviews lasting 6–16 min, Khadra et al. [41] did interviews with adolescents lasting 18 min on average and Stevens [19]—who used warm up activities with CYP but avoided creative methods during data collection—undertook interviews with 7- to 11-year olds lasting from 4 to 26 min. A summary of the qualitative methods and perceived quality of retrieved papers is in Table 2.

Table 2 Details on the qualitative methods and quality of retrieved papers

Full size table

3.3 The Quality of Reporting in Concept Elicitation for CYP

The retrieved papers varied in terms of the number of COREQ checklist criteria met; however, almost half of the papers reported on none or very few of the 32 quality indicators.

3.3.1 Reporting on Data Analysis

Papers tended to miss reporting information on data analysis, with 15/37 not including any information on the approach to qualitative analysis used. An additional four papers included only very brief information on analysis, including the technique used (e.g., content analysis or constant comparison) but with little or no information on the process of data analysis, that is, how codes were developed and applied to the data and how themes were identified. In terms of findings, only eight of the 37 papers included quotations from the data to support the themes that had informed the items of their measures.

3.3.2 Reporting on Sampling

Seven papers included no information on sampling at all. A further seven studies included very basic information on either the sampling strategy (e.g., convenience or purposive sampling) or where participants were identified. The papers generally lacked information on the methods for initially contacting participants (e.g., though face-to-face consultation or postal invite) and information on those who had declined to participate. Two papers also lacked basic information on the age of the CYP included in their study [33, 39].

3.3.3 Reporting on Data Collection

More information was generally available on data collection, with all but one paper [33] making clear which data collection method they had used. Just under one third of the papers gave an indication of the average duration of focus groups or interviews, and a similar number mentioned reaching saturation of the themes identified to inform items. However, only nine papers included an interview/focus group topic guide or examples of the questions that were asked to participants. The papers also tended not to include information on where data collection took place and who was present.

3.3.4 Reporting on Research Team and Reflexivity

The most common area in which information was lacking was on research team and reflexivity, with only eight [22, 28, 38, 41, 43, 46, 50, 51] of the 37 papers including any sort of background information on the researchers (including gender and academic background). Of these seven papers, only two provided reflections on how the backgrounds of the authors may have influenced data collection or the nature of research findings. For example, Gilchrist et al. [28] commented on the potential impact of the researcher’s role as a dentist when exploring the consequences of dental caries on children’s quality of life. The authors reflected that due to the researcher not being the children’s personal dentist, it would have been unlikely to have inhibited children’s interview responses—and further, because the researcher was not aware of the children’s dental history until after interviews had been undertaken and transcripts analysed, it was unlikely to have affected the nature of this researcher’s questioning or analysis. In contrast, Davis et al. [50] reported that the comprehensiveness of their findings on the impact of cerebral palsy on adolescents may have been impacted by both the researchers being female, with the possibility that male adolescent participants may not have felt comfortable discussing more sensitive issues (such as relationships) with female researchers. Markham et al. [22] acknowledged that his professional and academic background would have potentially biased data collection and analysis but suggested that this potential had been “mitigated by the facilitator’s reflexivity, whereby a priori preconceptions were consciously noted and attempted to be bracketed from the study” [p. 753]. However, the author gave no indication of what these biases might have been, and how they had been avoided.

3.3.5 Strengths in Reporting

Despite many of the papers meeting limited quality criteria on the COREQ checklist, there were strengths to some of the studies reviewed. Eleven met 15 or more of the 32 checklist criteria, including greater coverage of information on sampling, data collection and analysis than other papers. Four studies (three of these being those identified as meeting a high number of criteria on the COREQ) reported following FDA guidelines for measure development [19, 21, 46, 52] and a further study (also highly detailed) mentioned following the COREQ guidelines for reporting [20]. Twenty of the 37 papers stated that they had ethical approval for the qualitative study, with twelve mentioning gaining informed consent (or assent) from research participants. It is important for researchers to show that they have thought about ethical issues, particularly when conducting research with CYP who may be vulnerable to pressure to take part in studies or who may not fully understand what they are being invited to participate in [53, 54]. However, despite the acknowledgement of ethical procedures within many of the papers, only two of these mentioned developing study information sheets specifically for CYP’s understanding, which if not developed, may have limited CYP’s ability to give informed assent for their participation in research [14].

4 Discussion

The review retrieved a total of 37 papers, featuring condition-specific and generic measures to record changes in the quality of life of CYP. Most studies had developed measures for adolescent populations and had used either interviews or focus groups for item generation, with those choosing interviews seemingly because the method provided a more comfortable environment for CYP to discuss individual and potentially sensitive issues. This fits with previous recommendations made for PROM development in paediatric populations, which suggest that focus groups might lead to social desirability bias, as CYP could feel inhibited to express their own opinions and more likely to agree with previously raised themes in group situations [11]. Therefore, the use of focus groups in this context could potentially cause problems around the representation of all CYP’s views in item generation. However, similar issues could conceivably arise in interviews, in situations where CYP might feel compelled to answer questions in a manner that they think will be viewed favourably by the interviewer.

A relatively low number of studies discussed adapting methods to be more suitable for the CYP population, with only four using creative and participatory methods alongside interviews and focus groups. Several PROM guidance papers recommend the use of such approaches with CYP to keep their attention [11] and to help overcome anxiety and encourage discussion [55]. Further, studies in the child methodology literature recommend these methods to allow CYP more time and freedom to express themselves, and to address power imbalances between CYP and adult researchers, by giving CYP more control over the topic and direction of research [12, 13, 56, 57]. Those using creative and participatory methods in the studies collected here appeared to engage their CYP population for a longer period, and although length of data collection is not necessarily an indication of quality, relatively short data collection periods might suggest that aspects important to a population may not have been discussed fully or in depth. The suggestion from the literature and this review therefore is that participatory and creative methods can be beneficial in helping CYP to engage in concept elicitation work in a more meaningful way, potentially helping to enhance the coverage and validity of included items.

However, the literature suggests that these methods are particularly relevant for engaging and keeping the attention of younger age groups [11, 55], with Arbuckle and Abetz-Webb recommending the use of creative approaches in research with 6- to 11-year olds, with traditional qualitative methods becoming more appropriate in adolescents aged 12 years and over [11]. Indeed, several studies in this review appeared to carry out successful concept elicitation work with very young children (as young as 6 years), and the increased use of such methods in this area may help with the development of further measures for younger children, which at the moment are less common than those for adolescents.

In terms of reporting quality, although there were strengths, none of the 37 papers met all criteria outlined on the COREQ checklist for qualitative research, and almost half of the papers met two, one or zero. Further, many of those meeting criteria did so in very little detail. Detail was most lacking on qualitative data analysis, sampling and the research team, with these missing details making it difficult for the reader/user to make judgements about content validity and whether the items in the measures had achieved full coverage. For example, evidence of a robust sampling strategy is crucial in ensuring that important characteristics of a population have been captured (i.e., purposive sampling) [58] and, in several of the studies retrieved in the review, there was no representation in the empirical work from specific age groups within their stated population. This is particularly important in light of guidance from the FDA and others [5, 11], which state that measures should be developed and saturation of items achieved in narrow age groupings of CYP, due to the rapid changes that take place in their developmental and cognitive abilities during childhood and into adulthood [59].

Details on the processes of qualitative data analysis and the research team are important to allow judgements around the robustness of the authors’ interpretations of collected data. Reflexivity regarding the authors’ acknowledgement of how their own personal characteristics and assumptions may have influenced findings is essential to judgements around validity [9, 60] and this review found that only a small number of papers had disclosed and discussed this information. Qualitative quality guidance states that researchers should be explicit about how final themes and concepts are developed from data and provide evidence in quotations from participants to support these [27]. This review has demonstrated that very few studies had a high level of detail on the analysis process, and under a quarter of the retrieved studies included any quotations to support the items generated, leaving measure content without a clear evidence base.

Many studies used other methods with qualitative data collection to inform measure items, such as literature reviews, expert opinion and even the expertise of the authors. Although these are potentially valuable sources of information [7], it is ambiguous in many of these papers as to how far final measure content was informed by CYP’s own opinions and experiences of what is important. An important quality indicator is transparency in the reporting of research processes and how research conclusions are generated [61] and this review has indicated that reporting of qualitative concept elicitation for CYP measures appears to be generally lacking in this respect. This mirrors findings of a systematic review of condition-specific preference-based measures (PBMs) by Brazier et al. [62], who found that measures using qualitative analysis in item development had reported their methods in very little detail, with the authors describing this as a ‘barrier’ to this aspect of measure development being better understood and becoming more scientifically rigorous (p. 26–8).

To the authors’ knowledge, this is the first review to summarise and critically analyse the qualitative methods used for concept elicitation for measures for children and young people. Existing reviews of generic paediatric measures have tended to summarise and critically analyse the items contained within the measures (e.g. [17, 18]) or review the usage of the measures in practice (e.g. [16]), with condition-specific measure reviews tending to summarise the measures available in particular disease areas. The strength of this review is that it has focused on how researchers have reported concept elicitation with CYP [5, 7], and has importantly highlighted where more transparency is needed to allow judgements around content validity. Although research teams are clearly recognising the value of having direct input from CYP into item development, the poor quality of reporting in these studies raises questions around how far the content of these measures is truly sensitive to what is important to these populations.

Despite this review critiquing the quality of reporting for concept elicitation in CYP measures, it is important to note that it is not necessarily that researchers have not followed robust research processes, but that this has not been made clear and described in a high level of detail. For example, some of the research teams also went on to perform further validation tests with CYP on the developed items, which may have strengthened content validity (i.e., using qualitative cognitive interviews with the relevant population to check their coverage). It is also important to acknowledge that these studies have followed recommendations to use qualitative methods in item generation. Given that the focus of this review has only been to retrieve studies using qualitative methods for concept elicitation, we are unable to calculate the number of studies not using qualitative research, but we know that in economics, for example, the vast majority of PBMs for child economic evaluation have not included CYP in item development [16]. The measures included here have therefore been successful in facilitating the inclusion of the ‘patient voice’ in content development, which is particularly important given that children and young people have often been excluded from research [63].

This review only searched for papers in peer-reviewed journals and it is possible that further papers may have been retrieved if the grey literature had also been searched. Further, a few more relevant papers may have been picked up if the search terms had been expanded slightly—for example, to include ‘health measures’ in the ‘focus and outcomes of developed measures’ criterion of the search. However, the authors used additional techniques such as searching in relevant systematic reviews and forward citation and reference list searching to encourage a more comprehensive and targeted search. It is unlikely that the inclusion of additional studies would have changed the overall message of this review, as the reporting quality was low or lacking in most included studies. It is possible that the authors of this review could have contacted the authors of the retrieved studies for further information on concept elicitation, but in practice this would not be helpful to the users of measures who need to make judgements around content validity using the (published) information that is readily available to them. Having said this, it is also important to note that authors are often restricted by manuscript length limits and the need to report other aspects of measure development. The development of detailed guidelines on how to undertake qualitative concept elicitation work with CYP [7], and particularly on what to prioritise when reporting measure development, may help to overcome issues around poor reporting and content validity, and therefore should be considered an important area for future research.

5 Conclusion

This systematic review has summarised the qualitative methods and, where relevant, the adapted data collection techniques used to develop the conceptual items in measures for children and young people. We found that very few of the retrieved studies had used creative and participatory methods for item development, despite these approaches being potentially beneficial for engaging children and generating more meaningful data for concept elicitation, particularly with younger populations. The review identified important gaps in terms of the quality and transparency of reporting for item generation, with many studies not reporting information central to establishing content validity. This review recommends that research teams report concept elicitation work with children and young people in greater detail, with the development of methodological and reporting guidelines in this area being key to facilitating this.

Data Availability Statement

Data sharing is not applicable to this article as no datasets were generated or analysed during the current study.

References

Kingsley C, Patel S. Patient-reported outcome measures and patient-reported experience measures. BJA Educ. 2017;17(4):137–44.
Article Google Scholar
Patrick DL, et al. Content validity—establishing and reporting the evidence in newly developed patient-reported outcomes (PRO) instruments for medical product evaluation: ISPOR PRO good research practices task force report: part 1—eliciting concepts for a new PRO instrument. Value Health. 2011;14(8):967–77.
Article PubMed Google Scholar
Grewal I, et al. Developing attributes for a generic quality of life measure for older people: Preferences or capabilities? Soc Sci Med. 2006;62(8):1891–901.
Article PubMed Google Scholar
Al-Janabi H, Flynn TN, Coast J. Development of a self-report measure of capability wellbeing for adults: the ICECAP-A. Qual Life Res. 2012;21(1):167–76.
Article PubMed Google Scholar
(FDA), U.S.D.o.H.a.H.S.F.a.D.A. Patient-reported outcome measures: use in medical product development to support labeling claims: guidance for industry. 2009.
Stevens K, Palfreyman S. The use of qualitative methods in developing the descriptive systems of preference-based measures of health-related quality of life for use in economic evaluation. Value Health. 2012;15(8):991–8.
Article PubMed Google Scholar
Matza LS, et al. Pediatric patient-reported outcome instruments for research to support medical product labeling: report of the ISPOR PRO good research practices for the assessment of children and adolescents task force. Value Health. 2013;16(4):461–79.
Article PubMed Google Scholar
Fitzpatrick R, et al. Evaluating patient-based outcome measures for use in clinical trials. Health Technol Assess. 1998;2(14):1–74.
Article PubMed CAS Google Scholar
Mays N, Pope C. Qualitative research: rigour and qualitative research. BMJ. 1995;311(6997):109–12.
Article PubMed PubMed Central CAS Google Scholar
Terwee CB, et al. COSMIN methodology for evaluating the content validity of patient-reported outcome measures: a Delphi study. Qual Life Res. 2018;27(5):1159–70.
Article PubMed PubMed Central CAS Google Scholar
Arbuckle R, Abetz-Webb L. “Not just little adults”: qualitative methods to support the development of pediatric patient-reported outcomes. Patient. 2013;6(3):143–59.
Article PubMed Google Scholar
Punch S. Research with children: the same or different from research with adults? Childhood. 2002;9(3):321–41.
Google Scholar
Whale K. The use of Skype and telephone interviews in sensitive qualitative research with young people: experiences from the ROCCA continence study. Qual Methods Psychol Bull 2017;23.
Shaw C, Brady LM, Davey C. Guidelines for research with children and young people. London: N.C.s.B.N.R. Centre; 2011.
Google Scholar
Rowen D, Rivero-Arias O, Devlin N, et al. Review of valuation methods of preference-based measures of health for economic evaluation in child and adolescent populations: where are we now and where are we going? PharmacoEconomics. 2020;38:325–40. https://doi.org/10.1007/s40273-019-00873-7.
Article PubMed Google Scholar
Wolstenholme JL, et al. Preference-based measures to obtain health state utility values for use in economic evaluations with child-based populations: a review and UK-based focus group assessment of patient and parent choices. Qual Life Res. 2018;27(7):1769–80.
Article PubMed PubMed Central Google Scholar
Chen G, Ratcliffe J. A review of the development and application of generic multi-attribute utility instruments for paediatric populations. Pharmacoeconomics. 2015;33(10):1013–28.
Article PubMed Google Scholar
Janssens A, et al. A systematic review of generic multidimensional patient-reported outcome measures for children, part I: descriptive characteristics. Value Health. 2015;18(2):315–33.
Article PubMed Google Scholar
Stevens KJ. Working with children to develop dimensions for a preference-based, generic, pediatric, health-related quality-of-life measure. Qual Health Res. 2010;20(3):340–51.
Article PubMed Google Scholar
Bray N, et al. Defining health-related quality of life for young wheelchair users: a qualitative health economics study. PLoS One. 2017;12(6):e0179269.
Article PubMed PubMed Central CAS Google Scholar
Oluboyede Y, Hulme C, Hill A. Development and refinement of the WAItE: a new obesity-specific quality of life measure for adolescents. Qual Life Res. 2017;26(8):2025–39.
Article PubMed Google Scholar
Markham C, et al. Children with speech, language and communication needs: their perceptions of their quality of life. Int J Lang Commun Disord. 2009;44(5):748–68.
Article PubMed Google Scholar
Ronen GM, et al. Health-related quality of life in childhood epilepsy: the results of children’s participation in identifying the components. Dev Med Child Neurol. 1999;41(8):554–9.
Article PubMed CAS Google Scholar
Hareendran A, et al. Evaluating functional outcomes in adolescents with attention-deficit/hyperactivity disorder: development and initial testing of a self-report instrument. Health Qual Life Outcomes. 2015;13:133.
Article PubMed PubMed Central Google Scholar
Centre for Reviews and Dissemination. Systematic reviews: CRD’s guidance for undertaking reviews in health care. York: CRD, University of York. 2009. http://www.york.ac.uk/media/crd/SystematicReviews.pdf.
Tong A, Sainsbury P, Craig J. Consolidated criteria for reporting qualitative research (COREQ): a 32-item checklist for interviews and focus groups. Int J Qual Health Care. 2007;19(6):349–57.
Article PubMed Google Scholar
Mays N, Pope C. Qualitative research in health care. Assessing quality in qualitative research. BMJ. 2000;320(7226):50–2.
Article PubMed PubMed Central CAS Google Scholar
Gilchrist F, et al. Development and evaluation of CARIES-QC: a caries-specific measure of quality of life for children. BMC Oral Health. 2018;18(1):202.
Article PubMed PubMed Central Google Scholar
Gilchrist F, et al. The impact of dental caries on children and young people: what they have to say? Int J Paediatr Dent. 2015;25(5):327–38.
Article PubMed Google Scholar
Altman DG. Practical statistics for medical research. London: Chapman and Hall; 1991.
Google Scholar
Petersen C, et al. Development and pilot-testing of a health-related quality of life chronic generic module for children and adolescents with chronic health conditions: a European perspective. Qual Life Res. 2005;14(4):1065–77.
Article PubMed Google Scholar
Beusterien KM, et al. Development of the multi-attribute Adolescent Health Utility Measure (AHUM). Health Qual Life Outcomes. 2012;10:102.
Article PubMed PubMed Central Google Scholar
Raphael D, et al. The quality of life profile—Adolescent version: background, description, and initial validation. J Adolesc Health. 1996;19(5):366–75.
Article PubMed CAS Google Scholar
Varni JW, et al. The pediatric cancer quality of life inventory (PCQL) I Instrument development, descriptive statistics, and cross-informant variance. J Behav Med. 1998;21(2):179–204.
Article PubMed CAS Google Scholar
Simeoni MC, et al. Validation of a French health-related quality of life instrument for adolescents: the VSP-A. Qual Life Res. 2000;9(4):393–403.
Article PubMed CAS Google Scholar
Graham P, Stevenson J, Flynn D. A new measure of health-related quality of life for children: preliminary findings. Psychol Health. 1997;12(5):655–65.
Article Google Scholar
Ravens-Sieberer U, et al. KIDSCREEN-52 quality-of-life measure for children and adolescents. Expert Rev Pharmacoecon Outcomes Res. 2005;5(3):353–64.
Article PubMed Google Scholar
Franciosi JP, et al. Quality of life in paediatric eosinophilic oesophagitis: what is important to patients? Child Care Health Dev. 2012;38(4):477–83.
Article PubMed CAS Google Scholar
Resnick ES, et al. Development of a questionnaire to measure quality of life in adolescents with food allergy: the FAQL-teen. Ann Allergy Asthma Immunol. 2010;105(5):364–8.
Article PubMed Google Scholar
McMillan CV, et al. The development of a new measure of quality of life for young people with diabetes mellitus: the ADDQoL-Teen. Health Qual Life Outcomes. 2004;2:61.
Article PubMed PubMed Central Google Scholar
Khadra C, et al. Development of the adolescent cancer suffering scale. Pain Res Manag. 2015;20(4):213–9.
Article PubMed PubMed Central Google Scholar
Bruce AA, et al. Development and preliminary evaluation of the KIDCLOT PAC QL: a new health-related quality of life measure for pediatric long-term anticoagulation therapy. Thromb Res. 2010;126(2):e116–21.
Article PubMed CAS Google Scholar
Follansbee-Junger KW, et al. Development of the PedsQL epilepsy module: focus group and cognitive interviews. Epilepsy Behav. 2016;62:115–20.
Article PubMed Google Scholar
Fiume A, et al. Development and validation of the pediatric stroke quality of life measure. Dev Med Child Neurol. 2018;60(6):587–95.
Article PubMed Google Scholar
Waters E, et al. Development of a condition-specific measure of quality of life for children with cerebral palsy: empirical thematic data reported by parents and children. Child Care Health Dev. 2005;31(2):127–35.
Article PubMed CAS Google Scholar
Panepinto JA, Torres S, Varni JW. Development of the PedsQL sickle cell disease module items: qualitative methods. Qual Life Res. 2012;21(2):341–57.
Article PubMed Google Scholar
Morris C, et al. Development of the Oxford ankle foot questionnaire: finding out how children are affected by foot and ankle problems. Child Care Health Dev. 2007;33(5):559–68.
Article PubMed CAS Google Scholar
Marino BS, et al. The development of the pediatric cardiac quality of life inventory: a quality of life measure for children and adolescents with heart disease. Qual Life Res. 2008;17(4):613–26.
Article PubMed Google Scholar
Gallagher M, et al. The nominal group technique: a research tool for general practice? Fam Pract. 1993;10(1):76–81.
Article PubMed CAS Google Scholar
Davis E, et al. Quality of life of adolescents with cerebral palsy: perspectives of adolescents and parents. Dev Med Child Neurol. 2008;51(3):193–9.
Article Google Scholar
Hilliard ME, et al. Assessing health-related quality of life in children and adolescents with diabetes: development and psychometrics of the type 1 diabetes and life (T1DAL) measures. J Pediatr Psychol. 2020;45(3):328–39.
Article PubMed Google Scholar
Hoffman MF, Cejas I, Quittner AL. Health-related quality of life instruments for children with cochlear implants: development of child and parent-proxy measures. Ear Hear. 2019;40(3):592–604.
Article PubMed PubMed Central Google Scholar
Alderson P, Morrow V. The ethics of research with children and young people: a practical handbook. 2nd ed. London: SAGE; 2011.
Book Google Scholar
Harcourt D, Perry B, Waller T, editors. Researching young children’s perspectives: debating the ethics and dilemmas of educational research with children. New York: Routledge; 2011.
Google Scholar
Patel N, et al. Development of the Malocclusion Impact Questionnaire (MIQ) to measure the oral health-related quality of life of young people with malocclusion: part 1—qualitative inquiry. J Orthod. 2016;43(1):7–13.
Article PubMed PubMed Central Google Scholar
Barker J, Weller S. “Is it fun?” Developing children centred research methods. Int J Sociol Soc Policy. 2003;23:33–58.
Article Google Scholar
Angell R, Angell C. More than Just “Snap, Crackle, and Pop”: “Draw, Write, and Tell”: an innovative research method with young children. J Advert Res. 2013;53(4):377.
Article Google Scholar
Collingridge DS, Gantt EE. The quality of qualitative research. Am J Med Qual. 2008;23(5):389–95.
Article PubMed Google Scholar
Griebsch I, Coast J, Brown J. Quality-adjusted life-years lack quality in pediatric care: a critical review of published cost-utility studies in child health. Pediatrics. 2005;115(5):e600–14.
Article PubMed Google Scholar
Kitto SC, Chesters J, Grbich C. Quality in qualitative research. Med J Aust. 2008;188(4):243–6.
Article PubMed Google Scholar
Meyrick J. What is good qualitative research? A first step towards a comprehensive approach to judging rigour/quality. J Health Psychol. 2006;11(5):799–808.
Article PubMed Google Scholar
Brazier JE, et al. Developing and testing methods for deriving preference-based measures of health from condition-specific measures (and other patient-based measures of outcome). Health Technol Assess. 2012;16(32):1–114.
Article PubMed CAS Google Scholar
Kirk S. Methodological and ethical issues in conducting qualitative research with children and young people: a literature review. Int J Nurs Stud. 2007;44(7):1250–60.
Article PubMed Google Scholar
Moher D, et al. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. PLOS Med. 2009;6(7):e1000097.
Article PubMed PubMed Central Google Scholar
Angeles-Han ST, et al. Development of a vision-related quality of life instrument for children ages 8-18 years for use in juvenile idiopathic arthritis-associated uveitis. Arthritis Care Res (Hoboken). 2011;63(9):1254–61.
Article Google Scholar
Basra MKA, et al. Conceptualization, development and validation of T-QoL((c)) (Teenagers’ Quality of Life): a patient-focused measure to assess quality of life of adolescents with skin diseases. Br J Dermatol. 2018;178(1):161–75.
Article PubMed CAS Google Scholar
Das A, et al. Formation and psychometric evaluation of a health-related quality of life instrument for children living with HIV in India. J Health Psychol. 2018;23(4):577–87.
Article PubMed Google Scholar
Flokstra-de Blok BM, et al. Development and validation of the self-administered Food Allergy Quality of Life Questionnaire for adolescents. J Allergy Clin Immunol. 2008;122(1):139–44.
Article PubMed Google Scholar
Geister TL, et al. Qualitative development of the ‘Questionnaire on Pain caused by Spasticity (QPS)’, a pediatric patient-reported outcome for spasticity-related pain in cerebral palsy. Qual Life Res. 2014;23(3):887–96.
Article PubMed Google Scholar
Hartmaier SL, et al. Development of a brief 24-hour adolescent migraine functioning questionnaire. Headache. 2001;41(2):150–6.
Article PubMed CAS Google Scholar
Rutishauser C, et al. Development and validation of the Adolescent Asthma Quality of Life Questionnaire (AAQOL). Eur Respir J. 2001;17(1):52–8.
Article PubMed CAS Google Scholar
Tadic V, et al. Development of the functional vision questionnaire for children and young people with visual impairment: the FVQ_CYP. Ophthalmology. 2013;120(12):2725–32.
Article PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the Wellcome Trust [205384/Z/16/Z].

Author information

Authors and Affiliations

Health Economics Bristol, Population Health Sciences, Bristol Medical School, University of Bristol, 1-5 Whiteladies Road, Bristol, BS8 1NU, UK
Samantha Husbands, Paul Mark Mitchell & Joanna Coast

Authors

Samantha Husbands
View author publications
You can also search for this author in PubMed Google Scholar
Paul Mark Mitchell
View author publications
You can also search for this author in PubMed Google Scholar
Joanna Coast
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the study conception and design. Review preparation, data extraction and interpretation were performed by SH, PMM and JC. The first draft of the manuscript was written by Samantha Husbands and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Samantha Husbands.

Ethics declarations

Funding

This work was supported by the Wellcome Trust [205384/Z/16/Z].

Conflict of interest

Samantha Husbands, Paul Mark Mitchell and Joanna Coast declare that they have no conflict of interest.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOCX 11 kb)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Husbands, S., Mitchell, P.M. & Coast, J. A Systematic Review of the Use and Quality of Qualitative Methods in Concept Elicitation for Measures with Children and Young People. Patient 13, 257–288 (2020). https://doi.org/10.1007/s40271-020-00414-x

Download citation

Published: 29 April 2020
Issue Date: June 2020
DOI: https://doi.org/10.1007/s40271-020-00414-x

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A Systematic Review of the Use and Quality of Qualitative Methods in Concept Elicitation for Measures with Children and Young People

Abstract

Background

Objectives

Methods

Results

Conclusions

Similar content being viewed by others

Qualitative Research: Ethical Considerations

How to use and assess qualitative research methods

Doing Reflexive Thematic Analysis

1 Introduction

2 Methods

2.1 Search Strategies for Studies

2.2 Selection Method

2.3 Study Inclusion and Exclusion Criteria

2.4 Data Extraction and Quality of Reporting of Qualitative Methods

2.5 Synthesis of Results

3 Results

3.1 Search Results

3.1.1 Characteristics of Included Studies

3.2 Overview of Qualitative Approach Used in Children and Young People (CYP) Concept Elicitation

3.2.1 Data Collection Methods

3.2.2 The Use of Adapted Data Collection Techniques with CYP

3.3 The Quality of Reporting in Concept Elicitation for CYP

3.3.1 Reporting on Data Analysis

3.3.2 Reporting on Sampling

3.3.3 Reporting on Data Collection

3.3.4 Reporting on Research Team and Reflexivity

3.3.5 Strengths in Reporting

4 Discussion

5 Conclusion

Data Availability Statement

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Funding

Conflict of interest

Electronic supplementary material

Supplementary material 1 (DOCX 11 kb)

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation