Skip to main content
main-content
Top

Tip

Swipe om te navigeren naar een ander artikel

Gepubliceerd in: Quality of Life Research 3/2013

01-04-2013

Content-balancing strategy in bifactor computerized adaptive patient-reported outcome measurement

Auteurs: Yi Zheng, Chih-Hung Chang, Hua-Hua Chang

Gepubliceerd in: Quality of Life Research | Uitgave 3/2013

Log in om toegang te krijgen
share
DELEN

Deel dit onderdeel of sectie (kopieer de link)

  • Optie A:
    Klik op de rechtermuisknop op de link en selecteer de optie “linkadres kopiëren”
  • Optie B:
    Deel de link per e-mail

Abstract

Purpose

Most multidimensional patient-reported outcomes (PRO) measures are lengthy to complete. Computerized adaptive testing (CAT) that selects the most informative items can potentially reduce respondent burden without sacrificing measurement accuracy. The commonly used maximum Fisher information item selection method has been reported to lead to highly unbalanced item bank usage and potentially imprecise trait estimation. This study employs the content-balancing strategy in a bifactor-modeled CAT item selection and examines its impact on measurement accuracy and item bank usage.

Methods

Item responses from a population-based SF-36 survey were first calibrated using the bifactor graded response model. Four post hoc CATs using items and responses from the SF-36 data set were then created. The content-balancing strategy was adopted in the item selection procedure of the bifactor-modeled CAT. The measurement accuracy and usage of items of the CAT were compared between the tests with and without the content-balancing strategy.

Results

The results indicate that the CAT implemented with the content-balancing strategy offers a better overall measurement accuracy of both the general health status and the two health domains (physical and mental) of the SF-36.

Conclusions

The content-balancing strategy helps the CAT–PRO to balance the selection of items and achieve improved measurement accuracy. Its implementation in real-time CAT administration to measure multidimensional PRO traits merits further studies.
Literatuur
1.
go back to reference Chang, C-H. (2007). Patient-reported outcomes measurement and management with innovative methodologies and technologies. Quality of Life Research, 16(Supplement I), 157–166. PubMedCrossRef Chang, C-H. (2007). Patient-reported outcomes measurement and management with innovative methodologies and technologies. Quality of Life Research, 16(Supplement I), 157–166. PubMedCrossRef
2.
go back to reference Chang, C-H., & Reeve, B. B. (2005). Item response theory and its applications to Patient-Reported Outcomes Measurement. Evaluation & the Health Professions, 28(3), 264–282. CrossRef Chang, C-H., & Reeve, B. B. (2005). Item response theory and its applications to Patient-Reported Outcomes Measurement. Evaluation & the Health Professions, 28(3), 264–282. CrossRef
3.
go back to reference Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item response theory. CA: Sage Publications. Hambleton, R. K., Swaminathan, H., & Rogers, H. J. (1991). Fundamentals of item response theory. CA: Sage Publications.
4.
go back to reference Chang, H.-H. (2004). Understanding computerized adaptive testing: From Robbins-Monro to Lord and beyond. In D. Kaplan (Ed.), The Sage handbook of quantitative methodology for the social sciences (pp. 117–133). Thousand Oaks, CA: Sage. Chang, H.-H. (2004). Understanding computerized adaptive testing: From Robbins-Monro to Lord and beyond. In D. Kaplan (Ed.), The Sage handbook of quantitative methodology for the social sciences (pp. 117–133). Thousand Oaks, CA: Sage.
5.
go back to reference Fayers, P. M. (2007). Applying item response theory and computer adaptive testing: The challenges for health outcomes assessment. Quality of Life Research, 16(Supplement 1), 187–194. PubMedCrossRef Fayers, P. M. (2007). Applying item response theory and computer adaptive testing: The challenges for health outcomes assessment. Quality of Life Research, 16(Supplement 1), 187–194. PubMedCrossRef
6.
go back to reference Ware, J. E., Jr, & Kosinski, M. (2003). Applications of CAT to the assessment of headache impact. Quality of Life Research, 12, 935–952. PubMedCrossRef Ware, J. E., Jr, & Kosinski, M. (2003). Applications of CAT to the assessment of headache impact. Quality of Life Research, 12, 935–952. PubMedCrossRef
7.
go back to reference Walter, O. B., Becker, J., Bjorner, J. B., Fliege, H., Klapp, B. F., & Rose, M. (2007). Development and evaluation of a computer adaptive test for ‘Anxiety’ (Axiety-CAT). Quality of Life Research, 16(Supplement I), 143–155. PubMedCrossRef Walter, O. B., Becker, J., Bjorner, J. B., Fliege, H., Klapp, B. F., & Rose, M. (2007). Development and evaluation of a computer adaptive test for ‘Anxiety’ (Axiety-CAT). Quality of Life Research, 16(Supplement I), 143–155. PubMedCrossRef
8.
go back to reference Choi, S. W., & Swartz, R. J. (2009). Comparison of CAT item selection criteria for polytomous items. Applied Psychological Measurement, 33(6), 419–440. PubMedCrossRef Choi, S. W., & Swartz, R. J. (2009). Comparison of CAT item selection criteria for polytomous items. Applied Psychological Measurement, 33(6), 419–440. PubMedCrossRef
9.
go back to reference Petersen, M. Aa., Groenvold, M., Aaronson, N., Fayers, P. M., Sprangers, M. A., & Bjorner, J. B. (2006). Multidimensional computerized adaptive testing of the EORTC QLQ-C30: Basic developments and evaluation. Quality of Life Research, 15, 315–329. PubMedCrossRef Petersen, M. Aa., Groenvold, M., Aaronson, N., Fayers, P. M., Sprangers, M. A., & Bjorner, J. B. (2006). Multidimensional computerized adaptive testing of the EORTC QLQ-C30: Basic developments and evaluation. Quality of Life Research, 15, 315–329. PubMedCrossRef
10.
go back to reference Haley, S. M., Ni, P. S., Ludlow, L. H., & Fragala-Pinkham, M. A. (2006). Measurement precision and efficiency of multidimensional computer adaptive testing of physical functioning using the pediatric evaluation of disability inventory. Archives of Physical Medicine and Rehabilitation, 87, 1223–1229. PubMedCrossRef Haley, S. M., Ni, P. S., Ludlow, L. H., & Fragala-Pinkham, M. A. (2006). Measurement precision and efficiency of multidimensional computer adaptive testing of physical functioning using the pediatric evaluation of disability inventory. Archives of Physical Medicine and Rehabilitation, 87, 1223–1229. PubMedCrossRef
11.
go back to reference Gibbons, R., & Hedeker, D. (1992). Full-information item bifactor analysis. Psychometrika, 57(3), 423–436. CrossRef Gibbons, R., & Hedeker, D. (1992). Full-information item bifactor analysis. Psychometrika, 57(3), 423–436. CrossRef
12.
go back to reference Gibbons, R. D., Bock, R., Hedeker, D., Weiss, D. J., Segawa, E., Bhaumik, D. K., Kupfer, D. J., Frank, E., Grochocinski, V. J., & Stover, A. (2007). Full-information item bifactor analysis of graded response data. Applied Psychological Measurement, 31(1), 4–19. Gibbons, R. D., Bock, R., Hedeker, D., Weiss, D. J., Segawa, E., Bhaumik, D. K., Kupfer, D. J., Frank, E., Grochocinski, V. J., & Stover, A. (2007). Full-information item bifactor analysis of graded response data. Applied Psychological Measurement, 31(1), 4–19.
13.
go back to reference Reise, S. P., Morizot, J., & Hays, R. D. (2007). The role of the bifactor model in resolving dimensionality issues in health outcomes measures. Quality of Life Research, 16, 19–31. PubMedCrossRef Reise, S. P., Morizot, J., & Hays, R. D. (2007). The role of the bifactor model in resolving dimensionality issues in health outcomes measures. Quality of Life Research, 16, 19–31. PubMedCrossRef
14.
go back to reference Haley, S. M., Ni, P., Dumas, H. M., Fragala-Pinkham, M. A., Hambleton, R. K., Montpetit, K., Bilodeau, N., Gorton, G. E., Watson, K., & Tucker, C. A. (2009). Measuring global physical health in children with cerebral palsy: Illustration of a multidimensional bi-factor model and computerized adaptive testing. Quality of Life Research, 18, 359–370. Haley, S. M., Ni, P., Dumas, H. M., Fragala-Pinkham, M. A., Hambleton, R. K., Montpetit, K., Bilodeau, N., Gorton, G. E., Watson, K., & Tucker, C. A. (2009). Measuring global physical health in children with cerebral palsy: Illustration of a multidimensional bi-factor model and computerized adaptive testing. Quality of Life Research, 18, 359–370.
15.
go back to reference Immekus, J. C., Gibbons, R. D., & Rush, A. J. (2007). Patient-reported outcomes measurement and computerized adaptive testing: An application of post-hoc simulation to a diagnostic screening instrument. In D. J. Weiss (Ed.). Proceedings of the 2007 GMAC conference on computerized adaptive testing. Immekus, J. C., Gibbons, R. D., & Rush, A. J. (2007). Patient-reported outcomes measurement and computerized adaptive testing: An application of post-hoc simulation to a diagnostic screening instrument. In D. J. Weiss (Ed.). Proceedings of the 2007 GMAC conference on computerized adaptive testing.
16.
go back to reference Weiss, D. J., & Gibbons, R. D. (2007). Computerized adaptive testing with the bifactor model. In D. J. Weiss (Ed.). Proceedings of the 2007 GMAC conference on computerized adaptive testing. Weiss, D. J., & Gibbons, R. D. (2007). Computerized adaptive testing with the bifactor model. In D. J. Weiss (Ed.). Proceedings of the 2007 GMAC conference on computerized adaptive testing.
17.
go back to reference Cheng, Y., Chang, H-H., Douglas, J., & Guo, F. (2009). Constraint-weighted a-stratification for computerized adaptive testing with nonstatistical constraints. Educational and Psychological Measurement, 69(1), 35–49. CrossRef Cheng, Y., Chang, H-H., Douglas, J., & Guo, F. (2009). Constraint-weighted a-stratification for computerized adaptive testing with nonstatistical constraints. Educational and Psychological Measurement, 69(1), 35–49. CrossRef
18.
go back to reference Leung, C-K., Chang, H.-H., & Hau, K.-T. (2003). Computerized adaptive testing: A comparison of three content balancing methods. Journal of Technology, Learning, and Assessment, 2( 5). Available from http://​www.​jtla.​org. Leung, C-K., Chang, H.-H., & Hau, K.-T. (2003). Computerized adaptive testing: A comparison of three content balancing methods. Journal of Technology, Learning, and Assessment, 2( 5). Available from http://​www.​jtla.​org.
19.
go back to reference Kingsbury, G. G., & Zara, A. R. (1989). Procedures for selecting items for computerized adaptive tests. Applied Measurement in Education, 2, 359–375. CrossRef Kingsbury, G. G., & Zara, A. R. (1989). Procedures for selecting items for computerized adaptive tests. Applied Measurement in Education, 2, 359–375. CrossRef
20.
go back to reference Chen, S., Ankenmann, R. D., & Spray, J. A. (1999, April). Exploring the relationship between item exposure rate and test overlap rate in computerized adaptive testing. Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada. Chen, S., Ankenmann, R. D., & Spray, J. A. (1999, April). Exploring the relationship between item exposure rate and test overlap rate in computerized adaptive testing. Paper presented at the annual meeting of the National Council on Measurement in Education, Montreal, Canada.
21.
go back to reference Leung, C-K., Chang, H.-H., & Hau, K.-T. (2000, April). Content- balancing in stratified computerized adaptive testing designs. Paper presented at the Annual Meeting of the American Educational Research Association, New Orleans, LA. Leung, C-K., Chang, H.-H., & Hau, K.-T. (2000, April). Content- balancing in stratified computerized adaptive testing designs. Paper presented at the Annual Meeting of the American Educational Research Association, New Orleans, LA.
22.
go back to reference Hays, R. D., Sherbourne, C. D., & Mazel, R. M. (1993). The RAND 36-item health survey 1.0. Health Economics, 2(3), 217–227. PubMedCrossRef Hays, R. D., Sherbourne, C. D., & Mazel, R. M. (1993). The RAND 36-item health survey 1.0. Health Economics, 2(3), 217–227. PubMedCrossRef
23.
go back to reference Ware, J. E. Jr., & Sherbourne, C. D. (1992). The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Medical Care, 30(6), 473–483. PubMedCrossRef Ware, J. E. Jr., & Sherbourne, C. D. (1992). The MOS 36-item short-form health survey (SF-36). I. Conceptual framework and item selection. Medical Care, 30(6), 473–483. PubMedCrossRef
24.
go back to reference Ware, J. E., Kosinski, M., & Keller, S. D. (1994). SF-36 Physical and mental summary scale: A user’s manual. Boston, MA: The Health Institute. Ware, J. E., Kosinski, M., & Keller, S. D. (1994). SF-36 Physical and mental summary scale: A user’s manual. Boston, MA: The Health Institute.
25.
go back to reference Chang, C.-H., Wright, B. D., Cella, D., & Hays, R. D. (2007). The SF-36 physical and mental health factors were confirmed in cancer and HIV_AIDS patients. Journal of Clinical Epidemiology, 60(1), 68–72. PubMedCrossRef Chang, C.-H., Wright, B. D., Cella, D., & Hays, R. D. (2007). The SF-36 physical and mental health factors were confirmed in cancer and HIV_AIDS patients. Journal of Clinical Epidemiology, 60(1), 68–72. PubMedCrossRef
26.
go back to reference Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologist. NJ: Lawrence Erlbaum Associates. Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologist. NJ: Lawrence Erlbaum Associates.
27.
go back to reference Ware, J. E., Kosinski, M., & Keller, S. D. (1996). A 12-item short-form health survey: Construction of scales and preliminary tests of reliability and validity. Medical Care, 34(3), 220–233. PubMedCrossRef Ware, J. E., Kosinski, M., & Keller, S. D. (1996). A 12-item short-form health survey: Construction of scales and preliminary tests of reliability and validity. Medical Care, 34(3), 220–233. PubMedCrossRef
28.
go back to reference Ware, J. E. (2002). User’s manual for the SF-12v2 health survey (with a supplement documenting SF-12 health survey). Lincoln, RI: QualityMetric Inc. Ware, J. E. (2002). User’s manual for the SF-12v2 health survey (with a supplement documenting SF-12 health survey). Lincoln, RI: QualityMetric Inc.
29.
go back to reference Chang, H., & Ying, Z. (1999). A-stratified multistage computer adaptive testing. Applied Psychological Measurement, 23(3), 211–222. CrossRef Chang, H., & Ying, Z. (1999). A-stratified multistage computer adaptive testing. Applied Psychological Measurement, 23(3), 211–222. CrossRef
30.
go back to reference Chang, H., Qian, J., & Ying, Z. (2001). A-stratified multistage computer adaptive testing with b blocking. Applied Psychological Measurement, 25(4), 333–341. CrossRef Chang, H., Qian, J., & Ying, Z. (2001). A-stratified multistage computer adaptive testing with b blocking. Applied Psychological Measurement, 25(4), 333–341. CrossRef
31.
go back to reference Stewart, A. L., Sherbourne, C. D., Hays, R. D., Wells, K. B., Nelson, E. C., Kamberg, C., Rogers, W. H., Berry, S. H., Ware, J. E. (1992). Summary and discussion of MOS measures. In A. L. Stewart & J. E. Ware (Eds.), Measuring functioning and well-being: The medical outcomes study approach (pp. 345–371). Durham, NC: Duke University Press. Stewart, A. L., Sherbourne, C. D., Hays, R. D., Wells, K. B., Nelson, E. C., Kamberg, C., Rogers, W. H., Berry, S. H., Ware, J. E. (1992). Summary and discussion of MOS measures. In A. L. Stewart & J. E. Ware (Eds.), Measuring functioning and well-being: The medical outcomes study approach (pp. 345–371). Durham, NC: Duke University Press.
Metagegevens
Titel
Content-balancing strategy in bifactor computerized adaptive patient-reported outcome measurement
Auteurs
Yi Zheng
Chih-Hung Chang
Hua-Hua Chang
Publicatiedatum
01-04-2013
Uitgeverij
Springer Netherlands
Gepubliceerd in
Quality of Life Research / Uitgave 3/2013
Print ISSN: 0962-9343
Elektronisch ISSN: 1573-2649
DOI
https://doi.org/10.1007/s11136-012-0179-6