Missing Data Imputation in Quality-of-Life Assessment

Lin, Ting Hsiang

doi:10.2165/00019053-200624090-00008

Imputation for WHOQOL-BREF

Original Research Article
Published: 23 December 2012

Volume 24, pages 917–925, (2006)
PharmacoEconomics

Ting Hsiang Lin¹

Abstract

Introduction: This study investigated the effects of imputing missing data in the WHO Quality of Life Abbreviated Questionnaire (WHOQOL-BREF). Imputation results at the item and domain levels were compared, and the impact of the missing data rate and of the number of items included for imputation was examined.

Methods: An empirical analysis and a simulation study were used to examine the effects of the missing data rate and the number of items used for imputation on the accuracy of imputation. In the empirical analysis, both item-level and domain-level imputations were performed, and the missing values were imputed using different amounts of data. In the simulation study, sets of 2%, 5% and 10% of the data were drawn randomly and replaced with missing values. Twenty datasets were generated for each situation. The data were then imputed and the accuracy of the imputation was reported.

Results: In the empirical study, the number of items used for imputation had only a small impact on the accuracy of imputation. Furthermore, in the simulation study, the accuracy rates of imputation did not change significantly as the proportion of missing data increased. However, the number of items used for imputation did affect the imputed values to some extent. Extreme responses were imputed least well and had the lowest accuracy rates.

Conclusion: It is recommended that as many items as possible be included for imputation within the same domain. However, it is not particularly helpful to use items from different domains for imputation. Researchers should exercise extra caution in interpreting the imputed values of extreme responses.
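
The simulation design described in the Methods can be sketched roughly as follows. This is only an illustration under stated assumptions, not the author's procedure: the abstract does not name the imputation method, so a rounded within-domain respondent mean is used as a stand-in; the data are synthetic five-point responses for one hypothetical seven-item domain; and all function names are made up for this example.

```python
import numpy as np


def make_synthetic_domain(n_respondents=500, n_items=7, seed=0):
    """Synthetic 1-5 item responses sharing one latent trait (assumed data, not WHOQOL-BREF)."""
    rng = np.random.default_rng(seed)
    trait = rng.normal(0.0, 1.0, size=(n_respondents, 1))
    noise = rng.normal(0.0, 0.7, size=(n_respondents, n_items))
    return np.clip(np.rint(3 + trait + noise), 1, 5)


def mask_at_random(data, rate, rng):
    """Replace exactly round(rate * n_cells) randomly drawn cells with NaN."""
    n_missing = int(round(rate * data.size))
    idx = rng.choice(data.size, size=n_missing, replace=False)
    mask = np.zeros(data.shape, dtype=bool)
    mask.flat[idx] = True
    masked = data.astype(float)  # astype returns a copy
    masked[mask] = np.nan
    return masked, mask


def impute_row_mean(masked, low=1, high=5):
    """Stand-in imputation: rounded mean of the respondent's observed items."""
    filled = masked.copy()
    for row in filled:  # each row is a view, so assignment updates `filled`
        missing = np.isnan(row)
        if missing.any() and not missing.all():
            row[missing] = np.clip(np.rint(row[~missing].mean()), low, high)
    return filled


def accuracy(true, filled, mask):
    """Share of masked cells whose imputed value equals the original response."""
    return float(np.mean(filled[mask] == true[mask]))


def run_simulation(data, rates=(0.02, 0.05, 0.10), n_datasets=20, seed=0):
    """Mean imputation accuracy over n_datasets masked copies per missing rate."""
    rng = np.random.default_rng(seed)
    results = {}
    for rate in rates:
        scores = []
        for _ in range(n_datasets):
            masked, mask = mask_at_random(data, rate, rng)
            scores.append(accuracy(data, impute_row_mean(masked), mask))
        results[rate] = float(np.mean(scores))
    return results


if __name__ == "__main__":
    domain = make_synthetic_domain()
    for rate, acc in run_simulation(domain).items():
        print(f"missing rate {rate:.0%}: mean accuracy {acc:.3f}")
```

Accuracy here means exact agreement between the imputed and the original response category. Restricting `accuracy` to masked cells whose true value is 1 or 5 would be one way to look at extreme responses separately.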

Notes

¹ Consistency is a statistical property. A consistent estimator is an estimator that converges in probability to the quantity being estimated as the sample size grows.[9–11]
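
In symbols, using the standard textbook formulation rather than notation taken from the article (the symbols below are assumptions), an estimator \hat{\theta}_n of a quantity \theta based on n observations is consistent if:

```latex
\lim_{n \to \infty} P\left( \left| \hat{\theta}_n - \theta \right| > \varepsilon \right) = 0
\quad \text{for every } \varepsilon > 0 .
```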

References

1. Olschewski M, Schilgen G, Schumacher M, et al. Quality of life assessment in clinical cancer research. Br J Cancer 1994; 70: 1–5
2. Curran D, Fayers P, Molenberghs G, et al. Analysis of incomplete quality-of-life data in clinical trials. In: Staquet MJ, Hays RD, Fayers PM, editors. Quality of life assessment in clinical trials: methods and practice. New York: Oxford University Press, 1998
3. Fayers PM, Machin D. Quality of life: assessment, analysis, and interpretation. New York: Wiley, 2000
4. Fayers PM, Curran D, Machin D. Aspects of incomplete quality of life data in randomized trials: I. Missing items. Stat Med 1998; 17: 679–96
5. Curran D, Molenberghs G, Fayers P, et al. Aspects of incomplete quality of life data in randomized trials: II. Missing forms. Stat Med 1998; 17: 697–709
6. Little R, Rubin D. Statistical analysis with missing data. New York: Wiley, 1987
7. Muthén B, Kaplan D, Hollis M. On structural equation modeling with data that are not missing completely at random. Psychometrika 1987; 52: 431–462
8. Myrtveit I, Stensrud E, Olsson U. Analyzing data sets with missing data: an empirical evaluation of imputation methods and likelihood based methods. IEEE Transactions on Software Engineering 2001; 27: 999–1013
9. Arbuckle JL. Full information estimation in the presence of incomplete data. In: Marcoulides GA, Schumacker RE, editors. Advanced structural equation modeling. Mahwah (NJ): Lawrence Erlbaum Associates, 1996
10. Brown RL. Efficacy of the indirect approach for estimating structural equation models with missing data: a comparison of five methods. Structural Equation Modeling 1994; 1: 287–316
11. Wothke W. Longitudinal and multigroup modeling with missing data. In: Little TD, Schnabel KU, Baumert J, editors. Modeling longitudinal and multilevel data: practical issues, applied approaches, and specific examples. Mahwah (NJ): Lawrence Erlbaum Associates, 2000
12. Glasser M. Linear regression analysis with missing observations among the independent variables. J Am Stat Assoc 1964; 59: 834–844
13. Haitovsky Y. Missing data in regression analysis. J R Stat Soc Ser B 1968; 30: 67–82
14. Kim JO, Curry J. The treatment of missing data in multivariate analysis. Sociol Methods Res 1977; 6: 215–240
15. Fayers PM, Aaronson NK, Bjordal K, et al. EORTC QLQ-C30 scoring manual. Brussels: European Organisation for the Research and Treatment of Cancer (EORTC), 1995
16. Anderson TW. Maximum likelihood estimates for multivariate normal distribution when some observations are missing. J Am Stat Assoc 1957; 52: 200–203
17. Brown CH. Asymptotic comparison of missing data procedures for estimating factor loadings. Psychometrika 1983; 48: 269–291
18. Little RJA, Rubin DB. The analysis of social science data with missing values. Sociol Methods Res 1989; 18: 292–326
19. Neale MC. Mx: statistical modeling. 3rd ed. Richmond (VA): Department of Psychiatry, Medical College of Virginia, Virginia Commonwealth University, 1995
20. Graham JW, Hofer SM, MacKinnon DP. Maximizing the usefulness of data obtained with planned missing value patterns: an application of maximum likelihood procedures. Multivariate Behav Res 1996; 31: 197–218
21. Graham JW, Hofer SM. Multiple imputation in multivariate research. In: Little TD, Schnabel KU, Baumert J, editors. Modeling longitudinal and multilevel data: practical issues, applied approaches, and specific examples. Mahwah (NJ): Lawrence Erlbaum Associates, 2000
22. Rovine MJ. Latent variable models and missing data analysis. In: von Eye A, Clogg CC, editors. Latent variable analysis: applications for developmental research. Thousand Oaks (CA): Sage Publications, 1994
23. Verleye G. Missing at random data problems and maximum likelihood structural equation modelling [reprint]. Interuniversity paper in demography. Gent: Universiteit Gent, 1997. Working paper no.: 1997-3
24. Jöreskog KG, Sörbom D. LISREL 8 user’s reference guide. Chicago (IL): Scientific Software International, 1996
25. Jöreskog KG, Sörbom D. PRELIS 2 user’s reference guide [computer software]. Chicago (IL): Scientific Software International, 1993
26. World Health Organization. International classification of impairments, disabilities and handicaps. Geneva: World Health Organization, 1980
27. Lin TH, Chang HY, Weng WS, et al. The National Health Interview Survey Information System: an overview. J Taiwan Public Health 2003; 22: 431–440
28. Enders CK, Bandalos DL. The relative performance of full information maximum likelihood estimation for missing data in structural equation models. Structural Equation Modeling 2001; 8: 430–457
29. van Buuren S, van Rijckevorsel JLA. Imputation of missing categorical data by maximizing internal consistency. Psychometrika 1992; 57: 567–580

Acknowledgements

No sources of funding were used to assist in the preparation of this article. The author has no conflicts of interest that are directly relevant to the content of this article. The author thanks the Bureau of Health Promotion, Department of Health and National Health Research Institute in Taiwan for providing the data.

Author information

Authors and Affiliations

Department of Statistics, National Taipei University, 67, Section 3, Min-Sheng East Road, Taipei, Taiwan, ROC
Ting Hsiang Lin

Corresponding author

Correspondence to Ting Hsiang Lin.

About this article

Cite this article

Lin, T.H. Missing Data Imputation in Quality-of-Life Assessment. Pharmacoeconomics 24, 917–925 (2006). https://doi.org/10.2165/00019053-200624090-00008

Published: 23 December 2012
Issue Date: September 2006
DOI: https://doi.org/10.2165/00019053-200624090-00008
