Abstract
The objectives of this study were (a) to investigate whether items of the Chinese version of Beck Depression Inventory II (BDI-II-C; Chinese Behavioral Science Corporation in Manual for the Beck Depression Inventory-II [in Chinese]. The Chinese Behavioral Science Corporation, Taiwan, 2000) exhibited DIF across adolescent gender groups, in addition to exploring meaningful patterns of item content by gender and (b) to methodologically show how detecting DIF can be done by utilizing a well-known factor-analysis method—a multi-group confirmatory factor analysis with mean and covariance structure (MACS; Sörbom in Br J Math Stat Psychol 27: 229–239, 1974; Sörbom in Structural equation models with structured means. North Holland, Amsterdam, pp 183–195, 1982). Two samples composed of 1,344 adolescent males and 1,578 adolescent females were analyzed. One nonuniform DIF item and seven uniform DIF items were identified across gender groups. The effects of DIF were inconsequential on the raw scores but significant on the latent mean. In regard to the patterns of item content by gender, the results have found that females were relatively more likely to endorse the item contents reflecting negative self-evaluation (item 7: self-dislike), emotional vulnerability (item 9: suicidal wishes; item 10: crying) and irritation (item 17); whereas males were relatively more likely to endorse the item contents associated with frustration (item 3: failure), moodiness (item 4: loss of pleasure) and somatic habits (item 16: sleep pattern). Also, the universal and culturally specific influences on DIF items across gender groups were suspected.
Similar content being viewed by others
References
Arnau, R. C., Meagher, M. W., Norris, M. P., & Bramson, R. (2001). Psychometric evaluation of the Beck Depression Inventory-II with primary care medical patients. Health Psychology, 20, 112–119. doi:10.1037/0278-6133.20.2.112.
Azocar, F., Areán, P., Miranda, J., & Muñoz, R. F. (2001). Differential item functioning in a Spanish translation of the Beck Depression Inventory. Journal of Clinical Psychology, 57, 355–365. doi:10.1002/jclp.1017.
Beck, A. T., Steer, R. A., & Brown, G. K. (1996). BDI-II, Beck Depression Inventory: Manual (2nd ed.). Boston: Harcour, Brace, & Company.
Bentler, P. M., & Wu, E. J. C. (2006). EQS for Windows structural equations program manual. Encino, CA: Multivariate Software.
Bollen, K. A. (1989). Structural equations with latent variables. New York: Wiley.
Borsboom, D. (2006). When does measurement invariance matter? Medical Care, 44, S176–S181. doi:10.1097/01.mlr.0000245143.08679.cc.
Borsboom, D., Mellenbergh, G. J., & van Heerden, J. (2002). Different Kinds of DIF: A distinction between absolute and relative forms of measurement invariance and bias. Applied Psychological Measurement, 26, 533–540. doi:10.1177/014662102237798.
Browne, M. W., & Cudeck, R. (1993). Alternative ways of assessing model fit. In K. A. Bollen & J. S. Long (Eds.), Testing structural equation models (pp. 445–455). Newbury Park, CA: Sage.
Byrne, B. M., Stewart, S. M., Kennard, B. D., & Lee, P. W. H. (2007). The Beck Depression Inventory-II: Testing for measurement equivalence and factor mean differences across Hong Kong and American adolescents. International Journal of Testing, 7, 293–309. doi:10.1080/15305050701438058.
Chan, D. (2000). Detection of differential item functioning on the Kirton Adaption-Innovation Inventory using multiple-group mean and covariance structure analyses. Multivariate Behavioral Research, 35, 169–199. doi:10.1207/S15327906MBR3502_2.
Cheung, P. C., & Rensvold, R. B. (1999). Testing factorial invariance across groups: A reconceptualization and proposed new method. Journal of Management, 25, 1–27. doi:10.1177/014920639902500101.
Chinese Behavioral Science Corporation. (2000). Manual for the Beck Depression Inventory- II [in Chinese]. Taiwan: The Chinese Behavioral Science Corporation.
Cooke, D. J., Kosson, D. S., & Michie, C. (2001). Psychopathy and ethnicity: Structural, item, and test generalizability of the psychopathy checklist-revised (PCL-R) in Caucasian and African American participants. Psychological Assessment, 13, 531–542. doi:10.1037/1040-3590.13.4.531.
Dozois, D. A., Dobson, K. S., & Ahnberg, J. L. (1998). A psychometric evaluation of the Beck Depression Inventory-II. Psychological Assessment, 10, 83–89. doi:10.1037/1040-3590.10.2.83.
Ferrando, P. (1996). Calibration of invariant item parameters in a continuous item response model using the extended Lisrel measurement submodel. Multivariate Behavioral Research, 31, 419–439. doi:10.1207/s15327906mbr3104_2.
González-Romá, V., Hernándea, A., & Gómez-Benito, J. (2006). Power and type I error of the mean and covariance structure analysis model for detecting differential item functioning in graded response items. Multivariate Behavioral Research, 41, 29–53. doi:10.1207/s15327906mbr4101_3.
González-Romá, V., Tomás, I., & Ferreres, D. (2005). Do items that measure self-perceived physical appearance function differentially across gender groups? An application of the MACS model. Structural Equation Modeling, 12, 148–162. doi:10.1207/s15328007sem1201_8.
Green, S. B., & Babyak, M. A. (1997). Control of Type I errors with multiple tests of constraints in structural equation modeling. Multivariate Behavioral Research, 32, 39–51. doi:10.1207/s15327906mbr3201_2.
Hernández, A., & González-Romá, V. (2003). Evaluating the multiple-group mean and covariance structure analysis model for the detection of differential item functioning in polytomous ordered items. Psichotema, 15, 322–327.
Holland, P. W., & Wainer, H. (1993). Differential item functioning. Hillsdale, NJ: Erlbaum.
Hu, L.-T., & Bentler, P. M. (1995). Evaluating model fit. In R. Hoyle (Ed.), Structural equation modeling: Issue, concepts and applications (pp. 76–99). Newbury Park, CA: Sage.
Hu, L.-T., & Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Structural Equation Modeling, 6, 1–55.
Hu, L.-T., Bentler, P. M., & Kano, Y. (1992). Can test statistics in covariance structure analysis be trusted? Psychological Bulletin, 112, 351–362. doi:10.1037/0033-2909.112.2.351.
Jöreskog, K. G., & Sörbom, D. (1993). LISREL 8: Structural equation modeling with the SIMPLIS command language. Chicago, IL: Scientific Software International.
Kim, Y., Pilkonis, P. A., Frank, E., Thase, M. E., & Reynolds, C. F. (2002). Differential functioning of the Beck Depression Inventory in late-life patients: Use of item response theory. Psychology and Aging, 17, 379–391. doi:10.1037/0882-7974.17.3.379.
MacCallum, R. C., Browne, M. W., & Sugawara, H. M. (1996). Power analysis and determination of sample size for covariance structure modeling. Psychological Methods, 1, 130–149. doi:10.1037/1082-989X.1.2.130.
McDonald, R. P. (2000). A basis for multidimensional item response theory. Applied Psychological Measurement, 24, 99–114. doi:10.1177/01466210022031552.
Mead, A. W., & Lautenschlager, G. J. (2004). A comparison of item response theory and confirmatory factor analytic methodologies for establishing measurement equivalence/invariance. Organizational Research Methods, 7, 361–388. doi:10.1177/1094428104268027.
Meredith, W. (1993). Measurement invariance, factor analysis and factorial invariance. Psychometria, 58(4), 525–543. doi:10.1007/BF02294825.
Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational measurement (3rd ed., pp. 13–103). New York: Macmillan.
Oort, F. J. (1998). Simulation study of item bias detection with restricted factor analysis. Structural Equation Modeling, 5, 107–124.
Osman, A., Downs, W. R., Barrios, F. X., Kopper, B. A., Gutierrez, P. M., & Chiros, C. E. (1997). Factor structure and psychometric characteristics of the Beck Depression Inventory-II. Journal of Psychopathology and Behavioral Assessment, 19, 359–376. doi:10.1007/BF02229026.
Raju, N. S., Laffitte, L. J., & Byrne, B. M. (2002). Measurement equivalence: A comparison of methods based on confirmatory factor analysis and item response theory. The Journal of Applied Psychology, 87, 517–529. doi:10.1037/0021-9010.87.3.517.
Reise, S. P., Smith, L., & Furr, M. R. (2001). Invariance on the NEO PI-R neuroticism scale. Multivariate Behavioral Research, 36, 83–110. doi:10.1207/S15327906MBR3601_04.
Reise, S. P., Widaman, K. F., & Pugh, R. H. (1993). Confirmatory factor analysis and item response theory: Two approaches for exploring measurement invariance. Psychological Bulletin, 114, 552–566. doi:10.1037/0033-2909.114.3.552.
Rensvold, R. B., & Cheung, G. W. (2001). Testing for metric invariance using structural equation models: Solving the standardization problem. In C. A. Schriesheim & L. L. Neider (Eds.), Research in management: Vol. 1 Equivalence in measurement (pp. 21–50). Greenwich, CT: Information Age.
Roussos, L. A., & Stout, W. F. (1996). A multidimensionality-based DIF analysis paradigm. Applied Psychological Measurement, 20, 355–371. doi:10.1177/014662169602000404.
Santor, D. A., Ramsay, J. O., & Zuroff, D. C. (1994). Nonparametric item analyses of the Beck Depression Inventory: Evaluating gender item bias and response option weights. Psychological Assessment, 6, 255–270. doi:10.1037/1040-3590.6.3.255.
Satorra, A., & Bentler, P. M. (1988). Scaling corrections for chi-square statistics in covariance structure analysis. In Proceedings of the business and economics sections (pp. 308–313). Alexandria, VA: American Statistical Association.
Satorra, A., & Bentler, P. M. (2001). A scaled difference chi-square test statistic for moment structure analysis. Psychometrika, 66, 507–514. doi:10.1007/BF02296192.
Shealy, R., & Stout, W. (1993). An item response theory model for test bias. In P. W. Holland & H. Wainer (Eds.), Differential item functioning (pp. 197–239). Hillsdale, NJ: Lawrence Erlbaum Associates.
Smith, L. L., & Reise, S. P. (1998). Gender differences on negative affectivity: An IRT study of differential item functioning on the multidimensional personality questionnaire stress reaction scale. Journal of Personality and Social Psychology, 75, 1350–1362. doi:10.1037/0022-3514.75.5.1350.
Sörbom, D. (1974). A general method for studying differences in factor means and factor structures between groups. The British Journal of Mathematical and Statistical Psychology, 27, 229–239.
Sörbom, D. (1982). Structural equation models with structured means. In K. G. Jöreskog & H. Wold (Eds.), Systems under indirect observation (pp. 183–195). Amsterdam: North Holland.
Stark, S., Chernyshenko, O. S., & Drasgow, F. (2006). Detecting differential item functioning with confirmatory factor analysis and item response theory: Toward a unified strategy. The Journal of Applied Psychology, 91, 1292–1306. doi:10.1037/0021-9010.91.6.1292.
Steer, R. A., & Clark, D. A. (1997). Psychometric characteristics of the Beck Depression Inventory-II with college students. Measurement & Evaluation in Counseling & Development, 30, 128–136.
Steer, R. A., Kumar, G., Ranieri, W., & Beck, A. T. (1998). Use of the Beck Depression Inventory-II with adolescent psychiatric outpatients. Journal of Psychopathology and Behavioral Assessment, 20, 1998. doi:10.1023/A:1023091529735.
Steer, R. A., Rissmiller, D. J., & Beck, A. T. (2000). Use of the Beck Depression Inventory-II with depressed geriatric inpatients. Behaviour Research and Therapy, 38, 311–318. doi:10.1016/S0005-7967(99)00068-6.
Su, Y.-L., & Chen, L.-C. (2007). The effect of sociometric status, cooperative learning, and traditional learning among junior high students on English academic performance, social anxiety, achievement motivation, and attribution. Bulletin of Educational Psychology, 39, 111–127. in Chinese.
Waller, N. G., Compas, B. E., Hollon, S. D., & Beckjord, E. (2005). Measurement of depressive symptoms in women with breast cancer and women with clinical depression: A differential item functioning analysis. Journal of Clinical Psychology in Medical Settings, 12, 127–141. doi:10.1007/s10880-005-3273-x.
Waller, N. G., Thompson, J. S., & Wenk, E. (2000). Using IRT to separate measurement bias from true group difference on homogenous and heterogeneous scales: An illustration with the MMPI. Psychological Methods, 5, 125–146. doi:10.1037/1082-989X.5.1.125.
Wasti, S. A., Bergman, M. E., Glomb, T. M., & Drasgow, F. (2000). Test of the cross-cultural generalizability of a model of sexual harassment. The Journal of Applied Psychology, 85, 766–778. doi:10.1037/0021-9010.85.5.766.
Whisman, M. A., Perez, J. E., & Ramel, W. (2000). Factor structure of the Beck Depression Inventory-second edition (BDI-II) in a student sample. Journal of Clinical Psychology, 56, 545–551. doi:10.1002/(SICI)1097-4679(200004)56:4≤545::AID-JCLP7≥3.0.CO;2-U.
Wu, P.-C., & Chang, L. (2008). Psychometric properties of the Chinese version of Beck Depression Inventory II (BDI-II-C) using Rasch model. Measurement & Evaluation in Counseling & Development, 41, 13–31.
Yang, M.-L. (2005). The value of educational achievement and adolescents’ mental health. Formosa Journal of Mental Health, 18, 75–99.
Acknowledgments
I am grateful for Dr. Gretchen Guiton for her helpful comments on the earlier version of manuscript.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Wu, PC. Differential Functioning of the Chinese Version of Beck Depression Inventory-II in Adolescent Gender Groups: Use of a Multiple-Group Mean and Covariance Structure Model. Soc Indic Res 96, 535–550 (2010). https://doi.org/10.1007/s11205-009-9491-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11205-009-9491-0