Testing for local dependency in dichotomous and polytomous item response models

Ip, Edward Hak-sing

doi:10.1007/BF02295736

Published: March 2001

Psychometrika, Volume 66, pages 109–132 (2001)

Edward Hak-sing Ip¹

Abstract

Researchers studying item response models are often interested in how local dependency affects the validity of conclusions drawn from statistical inference. This paper focuses on the detection of local dependency. We provide a framework for viewing local dependency within dichotomous and polytomous items that are clustered by design, and present a testing procedure that allows researchers to identify individual item pairs that exhibit local dependency while controlling the false positive rate. Simulation results indicate that the proposed method is effective. A discussion of its relation to other existing methods is also provided.

Author information

Authors and Affiliations

Information and Operations Management Department, Marshall School of Business, University of Southern California, Los Angeles, CA 90089-1421
Edward Hak-sing Ip

Corresponding author

Correspondence to Edward Hak-sing Ip.

Additional information

The research was supported under the National Assessment of Educational Progress (Grant No. R902B990007) administered by the National Center for Education Statistics, U.S. Department of Education. This work was started when the author was at the Division of Statistics and Psychometrics at the Educational Testing Service. I thank Juliet Shaffer for her comments on the multiple testing procedure. I also thank three anonymous referees and the Associate Editor for suggestions that greatly improved the presentation of the manuscript.

About this article

Cite this article

Ip, E.Hs. Testing for local dependency in dichotomous and polytomous item response models. Psychometrika 66, 109–132 (2001). https://doi.org/10.1007/BF02295736

Received: 21 August 1996
Revised: 28 February 2000
Issue Date: March 2001
DOI: https://doi.org/10.1007/BF02295736
