Skip to main content

Reliability and Validity in Classical Test Theory

  • Chapter
  • First Online:
A Course in Rasch Measurement Theory

Part of the book series: Springer Texts in Education ((SPTE))

Abstract

In CTT, reliability is defined as the proportion of true score variance to total variance. It is most often estimated using the coefficient \( \alpha \). This index assumes the instrument is unidimensional and is not a test of unidimensionality. Construct validation addresses the substantive dimension of the variable assessed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 79.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 99.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Alder, K. (2002). The measure of all things: The seven-year odyssey and hidden error that transformed the world. New York: Free Press.

    Google Scholar 

  • Andrich, D. (2014). A structure of index and causal variables. Rasch Measurement Transactions,28(3), 1475–1477.

    Google Scholar 

  • Cronbach, L. J. (1951). Coefficient alpha and the internal structure of tests. Psychometrika,16, 297–334.

    Article  Google Scholar 

  • Frisbie, D. A. (1988). Reliability of scores from teacher-made tests. Educational Measurement: Issues and Practices. National Council on Measurement in Education,7(1), 25–35.

    Article  Google Scholar 

  • Guttman, L. (1945). A basis for analyzing test-retest reliability. Psychometrika,10, 255–282.

    Article  Google Scholar 

  • Kane, M. (2011). The error of our ways. Journal of Educational Measurement,48(1), 12–30.

    Article  Google Scholar 

  • Kuder, G. F., & Richardson, M. W. (1973). The theory of the estimation of test reliability. Psychometrika,2, 151–160.

    Article  Google Scholar 

  • Mehrens, W. A., & Lehman, I. J. (1991). Measurement and evaluation in education and psychology (4th ed.). New York: Harcourt Brace.

    Google Scholar 

  • Messick, S. (1989). Meaning and values in test validation: The science and ethics of assessment. Educational Researcher,18(2), 5–11.

    Article  Google Scholar 

  • Stenner, A. J., Stone, M. H., & Burdick, D. S. (2009). Indexing versus measuring. Rasch Measurement Transactions,22(4), 1176–1177.

    Google Scholar 

  • Tesio, L. (2014). Causing and being caused: Items in a questionnaire may play a different role, depending on the complexity of the variable. Rasch Measurement Transactions,28(1), 1454–1456.

    Google Scholar 

  • Traub, R. E., & Rowley, G. L. (1991). Understanding reliability. Educational Measurement: Issues and Practices. National Council on Measurement Education,10(1), 37–45.

    Article  Google Scholar 

Further Reading

  • Andrich, D. (1988). Rasch models for measurement (pp. 84–86). Newbury Park, CA: Sage.

    Book  Google Scholar 

  • Andrich, D. (2016). Components of variance of scales with a bi-factor structure from two calculations of coefficient alpha. Educational Measurement: Issues and Practice,35(4), 25–30.

    Article  Google Scholar 

  • Roscoe, J. T. (1975). Fundamental research statistics for the behavioral sciences (2nd ed.). New York: Holt, Reinhart and Winston.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to David Andrich .

Exercises

Exercises

In the Exercises of Chap. 3, you were given a table of person–item responses.

  1. 1.

    Calculate the variance of each of the eight items in the test and the total score and summarize them as below:

    A table has 9 columns and 2 rows. The upper row entries are as follows: s superscript 2 with subscripts from 1 through 8 and y. The lower row is empty.
  2. 2.

    Calculate the reliability of this test according to coefficient \( \alpha \). Show your working. Use the variances of the eight items and the variance of the total score that you calculated in question 1.

  3. 3.

    Comment on the size of the reliability.

  4. 4.

    Consider a test or examination with which you are familiar with. Describe the test and its purposes first, then comment on the reliability of the examination and the validity in terms of the various functions the examination is supposed to serve. How might these be investigated?

For further exercises, see Exercise 1: Interpretation of RUMM2030 printout in Appendix C.

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Andrich, D., Marais, I. (2019). Reliability and Validity in Classical Test Theory. In: A Course in Rasch Measurement Theory. Springer Texts in Education. Springer, Singapore. https://doi.org/10.1007/978-981-13-7496-8_4

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-7496-8_4

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-7495-1

  • Online ISBN: 978-981-13-7496-8

  • eBook Packages: EducationEducation (R0)

Publish with us

Policies and ethics