The independent mapping results showed consensus across all three mappers for 67% (24/36) of the items in the SF-36v2. All three mappers agreed that 22 items each clearly addressed at least one concept extracted from the literature and therefore did not require further discussion; all three agreed that 2 items required further discussion. The remaining 12 items were deemed relevant by one or two of the mappers, but not all three, and therefore required discussion. For example, for 2 items, all three mappers agreed on relevance to one concept, while one mapper considered a second concept also potentially relevant.
The first consensus meeting resulted in the 14 items in need of discussion being recategorized: 5 were classified as relevant, while the remaining 9 were classified as not relevant (Table
1). Next, the full study team met to review the results. All team members agreed with the decisions made by the three independent mappers. The final set of results showed that 27/36 items (75%) addressed the concepts extracted from the literature: 10 items were relevant to the physical impacts, 2 to the social impacts, 6 to the emotional impacts, 7 to role participation, and 2 to pain. The 6 general health items and 3 vitality items did not map to any specific concepts related to KO.
Discussion/Conclusion
The results of COA concept mapping are valuable for comparing measures, identifying gaps in concept coverage (both within and among measures), and making decisions as to which COA(s) are most appropriate for a particular context of use. Each of these benefits is a key step in the process of evaluating the content validity of a COA and developing a COA measurement strategy. The results from COA concept mapping can help researchers determine whether a measure is worth further investigation through patient cognitive interviews by generating objective data on how thoroughly it addresses the key concepts and the balance of relevant versus nonrelevant items.
The case study above provides a useful example of how these results can help with decision making. In terms of concepts, the results show that the SF-36v2 addresses all of the domains of the key concepts related to KO; if key concepts were missing, a second measure might be needed to fill those gaps. In terms of items, the results show that 75% of the items in the SF-36v2 are relevant to KO; but when deciding whether this is an appropriate measure to pursue for a KO-specific study, it is important to balance the usefulness of the relevant items against the 9 general health and vitality items that were not related to key concepts (e.g., would those 9 items be burdensome for patients and researchers to complete or analyze? Do those items provide other useful information? And if irrelevant items were to be dropped, how would such a change impact the ability to score the measure?). The comprehensiveness of the SF-36v2’s content coverage and minimal waste (i.e., items that are not directly mapped to concepts) suggests that in the case of KO, the SF-36v2 is worth pursuing for additional content validation in this condition. It is also important to note that the SF-36v2 lacks attribution to a specific disease, so, while useful, a disease-specific instrument would likely demonstrate even stronger mapping results.
This case study demonstrates just one use of the framework for COA concept mapping. Because COA concept mapping can be performed for different purposes, and use a variety of data sources, the process can differ slightly each time. Thus, the framework has both structure and flexibility: the structure allows for standardization and continuity (supporting reproducibility), and the flexibility allows for its use across studies, regardless of purpose, data source, or even study resources/limitations. In the case study presented here, we applied the framework to map concepts identified in a systematic literature review to an existing COA instrument, the SF-36v2, but it can just as easily be applied to the development of a short form from a large item bank, such as those within the PROMIS
®, Neuro-QOL™, and ASCQ-Me measurement systems. Similarly, this case study used concepts extracted from a literature review as its data source; while this is a common data source in health outcomes research, the framework can accommodate data derived from concept elicitation interviews or focus groups, as well as health records and online discussion boards, among others (and any combination thereof). Indeed, including results from concept elicitation interviews is an effective way to include the patient voice in this task. This study also used three team members to conduct the initial mapping independently. While this is ideal from a triangulation perspective, the framework allows for alternate approaches to help confirm mapping results. This structured but flexible approach mirrors the linking rules established by Cieza et al. (2005, 2019) for mapping to the ICF codes. [
14,
15]
This framework is intended to serve as a roadmap for researchers engaging in COA concept mapping as part of content validity evaluation process and can play an important role in helping researchers refine a list of candidate COAs early in the measurement strategy development process. Indeed, investigators often turn to existing instruments when the time and resources required to develop a new, condition-specific COA may be greater than those already allocated to a clinical study or clinical trial. In these circumstances, the results from COA concept mapping can help researchers determine if existing COAs are sufficient for measuring a condition, or if a new measure needs to be created to effectively cover the key concepts that define that condition. The framework presented here helps to promote consistent, reliable, and reproducible results. While it is possible to evaluate content validity without conducting COA concept mapping, such an approach risks the selection of a COA (or multiple COAs) that ask about irrelevant concepts or are missing key concepts. Concept mapping also minimizes the risk of conducting burdensome cognitive debriefing interviews in which patients will be tasked with completing and discussing COAs that may not have been fully vetted as appropriate. Including mapping as an interim step (i.e., after concept elicitation interviews or a literature review and before cognitive debriefing interviews or psychometric analysis) can help to streamline the COA selection and evaluation process by only advancing measures that are well-suited to a condition or by identifying early on that modifications to a measure, or even a new measure altogether, may be needed. Together, the framework and case study presented here demonstrate how a systematic assessment of the comprehensiveness and relevance of a COA can be achieved by establishing clear concept definitions and guiding principles before mapping and following a structured process throughout.