Original Article

On the Inappropriateness of Using Items to Calculate Total Scale Score Reliability via Coefficient Alpha for Multidimensional Scales

Gilles E. Gignac

School of Psychology, University of Western Australia, Crawley, WA, Australia

Published Online:January 01, 2014https://doi.org/10.1027/1015-5759/a000181

Abstract

Researchers have the implicit option of calculating internal consistency reliability (coefficient α) for total scale scores derived from multidimensional inventories based on either the inter-item correlation matrix (item unit-level) or the inter-subscale correlation matrix (subscale unit-level). It is demonstrated that item unit-level and subscale unit-level reliability estimates often diverge substantially in practice. Specifically, the item unit-level reliability estimation is often larger than the corresponding subscale unit-level estimate. It is recommended that if researchers calculate total scale score reliability at the item unit-level, then a model-based approach to the estimation of internal consistency reliability (i.e., omega hierarchical) should be applied, when the underlying model is multidimensional. If omega hierarchical cannot be applied for any particular reason, it is recommended that total scale score reliabilities be calculated at the subscale unit-level of analysis, not the item unit-level.

References

Beck, A. T. , Steer, R. A. , Brown, G. K. (1996). Beck Depression Inventory manual (2nd edn.). San Antonio, TX: Psychological Corporation. First citation in article Google Scholar
Bendig, A. W. (1952). Inter-judge vs. intra-judge reliability in the order of merit method. The American Journal of Psychology, 65, 84–88. First citation in article Crossref, Google Scholar
Bollen, K. A. (1989). Structural equations with latent variables. New York, NY: Wiley & Sons. First citation in article Crossref, Google Scholar
Brunner, M. , Süß, H.-M. (2005). Analyzing the reliability of multidimensional measures: An example from intelligence research. Educational and Psychological Measurement, 65, 227–240. First citation in article Crossref, Google Scholar
Cortina, J. M. (1993). What is coefficient alpha? An examination of theory and applications. Psychological Bulletin, 78, 98–104. First citation in article Google Scholar
Crocker, L. M. , Algina, J. (1986). Introduction to classical and modern test theory. New York, NY: Holt, Rinehart & Winston. First citation in article Google Scholar
Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika, 16, 297–334. First citation in article Crossref, Google Scholar
Davidson, M. M. , Gervais, S. J. , Canivez, G. L. , Cole, B. P. (2013). A psychometric examination of the interpersonal sexual objectification scale among college men. Journal of Counseling Psychology, 60, 239–250. First citation in article Crossref, Google Scholar
Dozois, D. J. , Dobson, K. S. , Ahnberg, J. L. (1998). A psychometric evaluation of the Beck Depression Inventory – II. Psychological Assessment, 10, 83–89. First citation in article Crossref, Google Scholar
Fan, X. (2003). Using commonly available software for bootstrapping in both substantive and measurement analyses. Educational and Psychological Measurement, 63, 24–50. First citation in article Crossref, Google Scholar
Gignac, G. E. (2007). Working memory and fluid intelligence are both identical to g?! Reanalyses and critical evaluation. Psychology Science, 49, 187–207. First citation in article Google Scholar
Gignac, G. E. (2008). Higher-order models versus direct hierarchical models: g as superordinate or breadth factor? Psychology Science, 50, 21–43. First citation in article Google Scholar
Gignac, G. E. (2013). Modeling the Balanced Inventory of desirable responding: Evidence in favor of a revised model of socially desirable responding. Journal of Personality Assessment. doi: 10.1080/00223891.2013.816717 First citation in article Google Scholar
Gignac, G. E. , Bates, T. C. , Jang, K. (2007). Implications relevant to CFA model misfit, reliability, and the Five Factor Model as measured by the NEO–FFI. Personality and Individual Differences, 43, 1051–1062. First citation in article Crossref, Google Scholar
Gignac, G. E. , Palmer, B. , & Stough, C. (2007). A confirmatory factor analytic investigation of the TAS-20: Corroboration of a five-factor model and suggestions for improvement. Journal of Personality Assessment, 89, 247–257. First citation in article Crossref, Google Scholar
Gignac, G. E. , Watkins, M. W. (in press). Bifactor modeling and the estimation of model-based reliability in the WAIS-IV. Multivariate Behavioral Research. First citation in article Google Scholar
Graham, J. M. (2006). Congeneric and (essentially) tau-equivalent estimates of score reliability: What they are and how to use them. Educational and Psychological Measurement, 66, 930–944. First citation in article Crossref, Google Scholar
Holzinger, K. J. , Swineford, R. (1937). The bifactor method. Psychometrika, 2, 41–54. First citation in article Crossref, Google Scholar
Kóbor, A. , Takács, Á. , Urbán, R. (2013). The bifactor model of the strengths and difficulties questionnaire. European Journal of Psychological Assessment. doi: 10.1027/1015-5759/a000160 First citation in article Google Scholar
Lord, F. M. , Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley. First citation in article Google Scholar
Mayer, J. D. , Salovey, P. , Caruso, D. R. (2002). Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT)–User’s manual. North Tonawanda, NY: Multi-Health Systems. First citation in article Google Scholar
Mayer, J. D. , Salovey, P. , Caruso, D. R. , Sitarenios, G. (2003). Measuring emotional intelligence with the MSCEIT V2.0. Emotion, 3, 97–105. First citation in article Crossref, Google Scholar
McDonald, R. P. (1978). Generalizability in factorable domains: “Domain validity and generalizability”. Educational and Psychological Measurement, 38, 75–79. First citation in article Crossref, Google Scholar
McDonald, R. P. (1985). Factor analysis and related methods. Hillsdale, NJ: Erlbaum. First citation in article Google Scholar
McDonald, R. P. (1999). Test theory: A unified treatment. Mahwah, NJ: Erlbaum. First citation in article Google Scholar
Miller, M. B. (1995). Coefficient alpha: A basic introduction from the perspectives of classical test theory and structural equation modeling. Structural Equation Modeling, 2, 255–273. First citation in article Crossref, Google Scholar
Nevitt, J. , Hancock, G. R. (2001). Performance of bootstrapping approaches to model test statistics and parameter standard error estimation in structural equation modeling. Structural Equation Modeling, 8, 353–377. First citation in article Crossref, Google Scholar
Nunnally, J. C. , Bernstein, I. H. (1994). Psychometric theory. New York, NY: McGraw-Hill. First citation in article Google Scholar
Patton, J. H. , Stanford, M. S. , Barratt, E. S. (1995). Factor structure of the Barratt impulsiveness scale. Journal of Clinical Psychology, 6, 768–774. First citation in article Crossref, Google Scholar
R Development Core Team . (2013). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing Retrieved from www.R-project.org/ First citation in article Google Scholar
Raykov, T. (1997). Estimation of composite reliability for congeneric measures. Applied Psychological Measurement, 22, 173–184. First citation in article Crossref, Google Scholar
Reise, S. P. (2012). The rediscovery of bifactor measurement models. Multivariate Behavioral Research, 47, 667–696. First citation in article Crossref, Google Scholar
Reise, S. P. , Bonifay, W. E. , Haviland, M. G. (2012). Scoring and modeling psychological measures in the presence of multidimensionality. Journal of Personality Assessment, 95, 129–140. doi: 10.1080/00223891.2012.725437 First citation in article Crossref, Google Scholar
Rekart, K. N. , Mineka, S. , & Zinbarg, R. E. (2006). Autobiographical memory in dysphoric and non-dysphoric college students using a computerised version of the AMT. Cognition & Emotion, 20, 506–515. First citation in article Crossref, Google Scholar
Reuterberg, S. E. , Gustafsson, J.-E. (1992). Confirmatory factor analysis and reliability: Testing measurement model assumptions. Educational and Psychological Measurement, 52, 795–811. First citation in article Crossref, Google Scholar
Revelle, W. (1979). Hierarchical cluster analysis and the internal structure of tests. Multivariate Behavioral Research, 14, 57–74. First citation in article Crossref, Google Scholar
Revelle, W. (2013). psych: Procedures for personality and psychological research [Computer software manual]. Retrieved from cran.r-project.org/web/packages/psych/ (R package version 1.3.2). First citation in article Google Scholar
Revelle, W. , Zinbarg, R. E. (2009). Coefficients alpha, beta, omega, and the glb: Comments on Sijtsma. Psychometrika, 74, 145–154. First citation in article Crossref, Google Scholar
Rindskopf, D. , Rose, T. (1988). Some theory and applications of confirmatory second-order factor analysis. Multivariate Behavioral Research, 23, 51–67. First citation in article Crossref, Google Scholar
Schroeders, U. , Wilhelm, O. (2010). Testing reasoning ability with handheld computers, notebooks, and paper and pencil. European Journal of Psychological Assessment, 26, 284–293. doi: 10.1027/1015-5759/a000038 First citation in article Link, Google Scholar
Schweizer, K. , Altmeyer, M. , Reiß, S. , Schreiner, M. (2010). The c-bifactor model as a tool for the construction of semi-homogenous upper-level measures. Psychological Test and Assessment Modeling, 3, 298–312. First citation in article Google Scholar
Sijtsma, K. (2009). On the use, misuse, and the very limited usefulness of Cronbach’s alpha. Psychometrika, 74, 107–120. First citation in article Crossref, Google Scholar
Stanford, M. S. , Mathias, C. W. , Dougherty, D. M. , Lake, S. L. , Anderson, N. E. , Patton, J. H. (2009). Fifty years of the Barratt Impulsiveness Scale: An update and review. Personality and Individual Differences, 47, 385–395. First citation in article Crossref, Google Scholar
Streiner, D. L. (2003). Starting at the beginning: An introduction to coefficient alpha and internal consistency. Journal of Personality Assessment, 80, 99–103. First citation in article Crossref, Google Scholar
Zinbarg, R. E. , Revelle, W. , Yovel, I. , Li, W. (2005). Cronbach’s α, Revelle’s β, and McDonald’s ω _h: Their relations with each other and two alternative conceptualizations of reliability. Psychometrika, 70, 123–133. First citation in article Crossref, Google Scholar

Volume 30Issue 2May 2014

ISSN: 1015-5759eISSN: 2151-2426

History

AcceptedAugust 27, 2013

Licenses & Copyright

Keywords

Acknowledgments:

Special thanks to William Revelle for providing insightful reviewer comments and supplying the R command lines to estimate omega hierarchical.

PDF download

Verify Phone

Congrats!

On the Inappropriateness of Using Items to Calculate Total Scale Score Reliability via Coefficient Alpha for Multidimensional Scales

Abstract

References

History

Licenses & Copyright

Acknowledgments:

Support & Contact

Support & Contact

Legal information

Legal information

More offers

More offers

Our partners

Our partners

Change Password

Your password must have 8 characters or more and contain 3 of the following:

Password Changed Successfully

Create a new account

Request Username

Verify Phone

Congrats!

On the Inappropriateness of Using Items to Calculate Total Scale Score Reliability via Coefficient Alpha for Multidimensional Scales

Abstract

References

History

Licenses & Copyright

Acknowledgments:

Support & Contact

Support & Contact

Legal information

Legal information

More offers

More offers

Our partners

Our partners