Reliability and Validity in Classical Test Theory

Andrich, David; Marais, Ida

doi:10.1007/978-981-13-7496-8_4

David Andrich³ &
Ida Marais³

Part of the book series: Springer Texts in Education ((SPTE))

66k Accesses
3 Citations

Abstract

In CTT, reliability is defined as the proportion of true score variance to total variance. It is most often estimated using the coefficient \( \alpha \). This index assumes the instrument is unidimensional and is not a test of unidimensionality. Construct validation addresses the substantive dimension of the variable assessed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Alder, K. (2002). The measure of all things: The seven-year odyssey and hidden error that transformed the world. New York: Free Press.
Google Scholar
Andrich, D. (2014). A structure of index and causal variables. Rasch Measurement Transactions,28(3), 1475–1477.
Google Scholar
Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika,16, 297–334.
Article Google Scholar
Frisbie, D. A. (1988). Reliability of scores from teacher-made tests. Educational Measurement: Issues and Practices. National Council on Measurement in Education,7(1), 25–35.
Article Google Scholar
Guttman, L. (1945). A basis for analyzing test-retest reliability. Psychometrika,10, 255–282.
Article Google Scholar
Kane, M. (2011). The error of our ways. Journal of Educational Measurement,48(1), 12–30.
Article Google Scholar
Kuder, G. F., & Richardson, M. W. (1973). The theory of the estimation of test reliability. Psychometrika,2, 151–160.
Article Google Scholar
Mehrens, W. A., & Lehman, I. J. (1991). Measurement and evaluation in education and psychology (4th ed.). New York: Harcourt Brace.
Google Scholar
Messick, S. (1989). Meaning and values in test validation: The science and ethics of assessment. Educational Researcher,18(2), 5–11.
Article Google Scholar
Stenner, A. J., Stone, M. H., & Burdick, D. S. (2009). Indexing versus measuring. Rasch Measurement Transactions,22(4), 1176–1177.
Google Scholar
Tesio, L. (2014). Causing and being caused: Items in a questionnaire may play a different role, depending on the complexity of the variable. Rasch Measurement Transactions,28(1), 1454–1456.
Google Scholar
Traub, R. E., & Rowley, G. L. (1991). Understanding reliability. Educational Measurement: Issues and Practices. National Council on Measurement Education,10(1), 37–45.
Article Google Scholar

Author information

Authors and Affiliations

Graduate School of Education, The University of Western Australia, Crawley, WA, Australia
David Andrich & Ida Marais

Authors

David Andrich
View author publications
You can also search for this author in PubMed Google Scholar
Ida Marais
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to David Andrich .

Exercises

In the Exercises of Chap. 3, you were given a table of person–item responses.

1.
Calculate the variance of each of the eight items in the test and the total score and summarize them as below:
2.
Calculate the reliability of this test according to coefficient \( \alpha \). Show your working. Use the variances of the eight items and the variance of the total score that you calculated in question 1.
3.
Comment on the size of the reliability.
4.
Consider a test or examination with which you are familiar with. Describe the test and its purposes first, then comment on the reliability of the examination and the validity in terms of the various functions the examination is supposed to serve. How might these be investigated?

For further exercises, see Exercise 1: Interpretation of RUMM2030 printout in Appendix C.

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Andrich, D., Marais, I. (2019). Reliability and Validity in Classical Test Theory. In: A Course in Rasch Measurement Theory. Springer Texts in Education. Springer, Singapore. https://doi.org/10.1007/978-981-13-7496-8_4

Download citation

DOI: https://doi.org/10.1007/978-981-13-7496-8_4
Published: 16 July 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-7495-1
Online ISBN: 978-981-13-7496-8
eBook Packages: EducationEducation (R0)

Publish with us

Policies and ethics

Reliability and Validity in Classical Test Theory

Abstract

Access this chapter

References

Further Reading

Author information

Authors and Affiliations

Corresponding author

Exercises

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Publish with us

Navigation

Reliability and Validity in Classical Test Theory

Abstract

Access this chapter

References

Further Reading

Author information

Authors and Affiliations

Corresponding author

Exercises

Exercises

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation