Hostname: page-component-745bb68f8f-grxwn Total loading time: 0 Render date: 2025-01-07T18:56:39.438Z Has data issue: false hasContentIssue false

Accumulative Equating Error after a Chain of Linear Equatings

Published online by Cambridge University Press:  01 January 2025

Hongwen Guo*
Affiliation:
Educational Testing Service
*
Requests for reprints should be sent to Hongwen Guo, Educational Testing Service, Princeton, NJ, USA. E-mail: hguo@ets.org

Abstract

After many equatings have been conducted in a testing program, equating errors can accumulate to a degree that is not negligible compared to the standard error of measurement. In this paper, the author investigates the asymptotic accumulative standard error of equating (ASEE) for linear equating methods, including chained linear, Tucker, and Levine, under the nonequivalent groups with anchor test (NEAT) design. A recursive formula for the ASEE is provided for a series of equatings that makes use of only historical summary statistics. This formula can serve as a new tool to measure the magnitude of equating errors that have accumulated over a series of equatings, and to help monitor and design testing programs.

Type
Original Paper
Copyright
Copyright © 2010 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Braun, H., Holland, P. (1982). Observed-score test equating: A mathematical analysis of some ETS equating procedures. In Holland, P., Rubin, B. (Eds.), Test equating (pp. 949). New York: Academic Press.Google Scholar
Haberman, S., Guo, H., Liu, J., & Dorans, N. (2008). Consistency of SAT reasoning score conversions (ETS RR-08-67). Educational Testing Service, Princeton, NJ.Google Scholar
Haertel, E. (2006). Reliability. In Brennan, R.L. (Eds.), Educational measurement (pp. 65110). (4th ed.). Westport: American Council on Education/Praeger.Google Scholar
Kendall, M., Stuart, A. (1977). The advanced theory of statistics, (4th ed.). New York: Macmillan.Google Scholar
Kolen, M. (1985). Standard errors of Tucker equating. Applied Psychological Measurement, 9, 209223.CrossRefGoogle Scholar
Kolen, M., Brennan, R. (2004). Test equating, scaling, and linking, (2nd ed.). New York: Springer.CrossRefGoogle Scholar
Lord, F.M. (1950). Notes on comparable scales for test scores (ETS RB-50-48). Educational Testing Service, Princeton, NJ.Google Scholar
Lord, F.M. (1981). Standard error of equating by item response theory (ETS Tech. Rep. No. RR-81-49). Educational Testing Service, Princeton, NJ.Google Scholar
Zeng, L., Hanson, B., Kolen, M. (1994). Standard errors of a chain of linear equatings. Applied Psychological Measurement, 18, 369378.CrossRefGoogle Scholar