Gain Scores Revisited Under an IRT Perspective

Fischer, Gerhard H.

doi:10.1007/978-1-4613-0169-1_3

Gerhard H. Fischer⁹

Part of the book series: Lecture Notes in Statistics ((LNS,volume 157))

739 Accesses
7 Citations

Abstract

For the measurement and statistical assessment of individual gain scores based on item sets that satisfy the assumptions of the Rasch, Rating Scale, or Partial Credit Models, a conditional maximum likelihood estimator, Clopper-Pearson confidence intervals, uniformly most accurate confidence intervals, and uniformly most powerful unbiased tests of the hypothesis of no change are presented. All methods are grounded on the exact conditional distribution of the gain score, given the total score for both time points, so that no asymptotic approximations are required. Typical applications of the methods are mentioned.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Power Analysis for the Wald, LR, Score, and Gradient Tests in a Marginal Maximum Likelihood Framework: Applications in IRT

Article Open access 27 August 2022

Some Adventures in Reliability Estimation

Maximum Marginal Likelihood Estimation of a Monotonic Polynomial Generalized Partial Credit Model with Applications to Multiple Group Analysis

Article 09 December 2014

References

Andersen, E.B. (1972). The numerical solution of a set of conditional estimation equations. Journal of the Royal Statistical Society, Series B, 34, 42–54.
MATH Google Scholar
Andersen, E.B. (1990). The statistical analysis of categorical data. Heidelberg: Springer-Verlag.
Book MATH Google Scholar
Andersen, E.B. (1995). Polytomous Rasch models and their estimation. In G.H. Fischer & I.W. Molenaar (Eds.), Rasch models: Foundations, recent developments, and applications (pp. 271–291). New York: Springer-Verlag.
Google Scholar
Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43, 561–573.
Article MATH Google Scholar
Bereiter, C. (1963). Some persisting dilemmas in the measurement of change. In C.W. Harris (Ed.), Problems in measuring change (pp. 3–20). Madison, WI: University of Wisconsin Press.
Google Scholar
Cronbach, L.J., & Furby, L. (1970). How should we measure change, or should we? Psychological Bulletin, 74, 68–80.
Article Google Scholar
Embretson, S.E. (1991). A multidimensional latent trait model for measuring learning and change. Psychometrika, 56, 495–515.
Article MATH Google Scholar
Fischer, G.H. (1987). Applying the principles of specific objectivity and generalizability to the measurement of change. Psychometrika, 52, 565–578.
Article MathSciNet MATH Google Scholar
Fischer, G.H. (1995). Some neglected problems in IRT. Psychometrika, 60, 459–487.
Article MATH Google Scholar
Fischer, G.H., & Ponocny, I. (1995). Extended rating scale and partial credit models for assessing change. In G.H. Fischer & I.W. Molenaar (Eds.), Rasch models: Foundations, recent developments, and applications (pp. 351–370). New York: Springer-Verlag.
Google Scholar
Fischer, G.H., & Ponocny-Seliger, E. (1998). Structural Rasch modeling: Handbook of the usage of LPCM-WIN 1.0 [Software manual]. Groningen: ProGAMMA.
Google Scholar
Guttmann, G., & Etlinger, S.C. (1991). Susceptibility to stress and anxiety in relation to performance, emotion, and personality: The ergopsychometric approach. In Ch. Spielberger, I.G. Sarason, J. Strelau, & J.M.T. Brebner (Eds.), Stress and anxiety (pp. 23–52). New York: Hemisphere.
Google Scholar
Hoijtink, H., & Boomsma, A. (1996). Statistical inference based on latent ability estimates. Psychometrika, 61, 313–330.
Article MATH Google Scholar
Holtzman, W.H. (1963). Statistical models for the study of change in the single case. In C.W. Harris (Ed.), Problems in measuring change (pp. 199–211). Madison, WI: The University of Wisconsin Press.
Google Scholar
Huber H. (1977). Zur Planung und Auswertung von Einzelfalluntersuchungen [On the planning and analysis of single case studies]. In L.J. Pongratz (Ed.), Handbuch der Psychologie: Vol. 8. Klinische Psychologie (pp. 1153–1199). Göttingen: Hogrefe.
Google Scholar
Klauer, K.C. (1991a). An exact and optimal standardized person test for assessing consistency with the Rasch model. Psychometrika, 56, 213–228.
Article MathSciNet Google Scholar
Klauer, K.C. (1991b). Exact and best confidence intervals for the ability parameter of the Rasch Model. Psychometrika, 56, 535–547.
Article MathSciNet MATH Google Scholar
Klauer, K.C. (1995). The assessment of person fit. In G.H. Fischer & I.W. Molenaar (Eds.), Rasch Models: Foundations, recent developments, and applications (pp. 97–110). New York: Springer-Verlag.
Google Scholar
Liou, M. (1993). Exact person tests for assessing model-data fit in the Rasch model. Applied Psychological Measurement, 17, 187–195.
Google Scholar
Liou, M., & Chang, C.-H. (1992). Constructing the exact significance level for a person fit statistic. Psychometrika, 47, 169–181.
Article Google Scholar
Masters, G.N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, 149–174.
Article MATH Google Scholar
Meijer, R.R., & Sijtsma, K. (in press). A review of methods for evaluating the fit of item score patterns on a test. Applied Psychological Measurement.
Google Scholar
Mellenbergh, G.J., & Van den Brink, W.P. (1998). The measurement of individual change. Psychological Methods, 3, 470–485.
Article Google Scholar
Molenaar, I.W., & Hoijtink, H. (1990). The many null distributions of person fit indices. Psychometrika, 55, 75–106.
Article MathSciNet Google Scholar
Mood, A.M., Graybill, F.A., & Boes, D.C. (1974). Introduction to the theory of statistics. Singapore: McGraw-Hill.
MATH Google Scholar
Ponocny, I. (2000). Exact person fit indexes for the Rasch model for arbitrary alternatives. Psychometrika, 65, 29–42.
Article MathSciNet Google Scholar
Ponocny, I., & Ponocny-Seliger, E. (1999). T-Rasch 1.0 [Software program]. Groningen: ProGAMMA.
Google Scholar
Prieler, J. (2000). Evaluation eines Ausleseverfahrens für Unteroffiziere, beim Österreichischen Bundesheer [Evaluation of a selection procedure for noncommissioned officers in the Austrian army]. Unpublished dissertation, University of Vienna, Department of Psychology.
Google Scholar
Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: The Danish Institute of Educational Research. (Expanded edition, 1980. Chicago: University of Chicago Press.)
Google Scholar
Rasch, G. (1965). Målingsmodellerne og deres principielle baggrund [Models for measurements and their fundamental background]. (Notes taken by J. Stene at the statistical seminar.) Copenhagen: Department of Statistics, University of Copenhagen.
Google Scholar
Santner, T.J., & Duffy, D.E. (1989). The statistical analysis of discrete data. New York: Springer-Verlag.
Book MATH Google Scholar
Willett, J.B. (1989). Some results on reliability for the longitudinal measurement of change: Implications for the design of studies of individual growth. Educational and Psychological Measurement, 49, 587–602.
Article Google Scholar
Williams, R.H., & Zimmerman, D.W. (1996). Are simple gain scores obsolete? Applied Psychological Measurement, 20, 59–69.
Article Google Scholar
Witting, H. (1985). Mathematische Statistik I [Mathematical statistics I]. Stuttgart: Teubner.
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Psychology, University of Vienna, Liebiggasse 5, A-1010, Wien, Austria
Gerhard H. Fischer

Authors

Gerhard H. Fischer
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Statistics and Measurement Theory, University of Groningen, Grote Kruisstraat 2/1, 9712 TS, Groningen, The Netherlands
Anne Boomsma , Marijtje A. J. van Duijn & Tom A. B. Snijders , &

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Fischer, G.H. (2001). Gain Scores Revisited Under an IRT Perspective. In: Boomsma, A., van Duijn, M.A.J., Snijders, T.A.B. (eds) Essays on Item Response Theory. Lecture Notes in Statistics, vol 157. Springer, New York, NY. https://doi.org/10.1007/978-1-4613-0169-1_3

Download citation

DOI: https://doi.org/10.1007/978-1-4613-0169-1_3
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-95147-8
Online ISBN: 978-1-4613-0169-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics

Gain Scores Revisited Under an IRT Perspective

Abstract

Access this chapter

Preview

Similar content being viewed by others

Power Analysis for the Wald, LR, Score, and Gradient Tests in a Marginal Maximum Likelihood Framework: Applications in IRT

Some Adventures in Reliability Estimation

Maximum Marginal Likelihood Estimation of a Monotonic Polynomial Generalized Partial Credit Model with Applications to Multiple Group Analysis

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Gain Scores Revisited Under an IRT Perspective

Abstract

Access this chapter

Preview

Similar content being viewed by others

Power Analysis for the Wald, LR, Score, and Gradient Tests in a Marginal Maximum Likelihood Framework: Applications in IRT

Some Adventures in Reliability Estimation

Maximum Marginal Likelihood Estimation of a Monotonic Polynomial Generalized Partial Credit Model with Applications to Multiple Group Analysis

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation