Asymptotic Standard Errors of IRT Observed-Score Equating Methods

Haruhiko Ogasawara

doi:10.1007/BF02294797

Published online by Cambridge University Press: 01 January 2025

Haruhiko Ogasawara*
Affiliation: Otaru University of Commerce
* Requests for reprints should be sent to Haruhiko Ogasawara, Department of Information and Management Science, Otaru University of Commerce, 3-5-21, Midori, Otaru 047-8501 JAPAN. E-Mail: hogasa@res.otaru-uc.ac.jp

Abstract

An IRT observed-score equating method that uses chain equating through a third test, without equating coefficients, is presented under the three-parameter logistic model. The asymptotic standard errors of the equated scores obtained by this method are derived using the results of M. Liou and P.E. Cheng. The asymptotic standard errors of the IRT observed-score equating method that uses a synthetic examinee group with equating coefficients, a method currently in use, are also provided. Numerical examples show that the standard errors of these observed-score equating methods are similar to those of the corresponding true-score equating methods, except in the range of low scores.
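
As background for readers, the observed-score methods named above work with the number-correct score distribution implied by the item response model. The sketch below is not taken from the article. It shows one common way to obtain that distribution under the three-parameter logistic model, using the recursion usually attributed to Lord and Wingersky (1984). The item parameters, the ability grid, and the scaling constant D = 1.7 are hypothetical choices made for illustration only.

```python
import math

def p3pl(theta, a, b, c, D=1.7):
    """Three-parameter logistic probability of a correct response.
    D = 1.7 is a conventional scaling constant, assumed here."""
    return c + (1.0 - c) / (1.0 + math.exp(-D * a * (theta - b)))

def score_dist_given_theta(theta, items):
    """Number-correct score distribution at a fixed ability,
    built up one item at a time (Lord-Wingersky style recursion)."""
    dist = [1.0]  # with zero items, score 0 has probability 1
    for a, b, c in items:
        p = p3pl(theta, a, b, c)
        new = [0.0] * (len(dist) + 1)
        for x, f in enumerate(dist):
            new[x] += f * (1.0 - p)  # item answered incorrectly
            new[x + 1] += f * p      # item answered correctly
        dist = new
    return dist

def marginal_score_dist(items, nodes, weights):
    """Average the conditional distributions over a discrete ability grid."""
    total = sum(weights)
    out = [0.0] * (len(items) + 1)
    for theta, w in zip(nodes, weights):
        cond = score_dist_given_theta(theta, items)
        for x, f in enumerate(cond):
            out[x] += (w / total) * f
    return out

# Hypothetical (a, b, c) item parameters and a coarse ability grid.
items = [(1.0, -0.5, 0.2), (1.2, 0.0, 0.2), (0.8, 0.7, 0.25)]
nodes = [-2.0, -1.0, 0.0, 1.0, 2.0]
weights = [math.exp(-0.5 * t * t) for t in nodes]  # unnormalized normal density
dist = marginal_score_dist(items, nodes, weights)
```

Here `dist[x]` is the model-implied probability of number-correct score x. An equipercentile step would then match the cumulative distributions of two tests, and the standard errors discussed in the abstract concern the sampling variability of the resulting equated scores.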

Keywords

IRT observed-score equating; equipercentile equating; chain equating; asymptotic standard errors

Information

Type: Articles
Information: Psychometrika, Volume 68, Issue 2, June 2003, pp. 193–211

DOI: https://doi.org/10.1007/BF02294797
Copyright: Copyright © 2003 The Psychometric Society

Footnotes

The author is indebted to Michael J. Kolen for access to the real data used in this article and anonymous reviewers for their corrections and suggestions on this work.

References

Angoff, W.H. (1971). Scales, norms, and equivalent scores. In Thorndike, R.L. (Ed.), Educational measurement (2nd ed., pp. 508–600). Washington DC: American Council on Education.

Bahadur, R.R. (1966). A note on quantiles in large samples. Annals of Mathematical Statistics, 37, 577–580.

Bentler, P.M., Dudgeon, P. (1996). Covariance structure analysis: Statistical practice, theory, and directions. Annual Review of Psychology, 47, 563–592.

Bock, R.D., Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: Application of an EM algorithm. Psychometrika, 46, 443–459.

Bock, R.D., Lieberman, M. (1970). Fitting a response model for n dichotomously scored items. Psychometrika, 35, 179–197.

Braun, H.I., Holland, P.W. (1982). Observed-score test equating: A mathematical analysis of some ETS equating procedures. In Holland, P.W., Rubin, D.B. (Eds.), Test equating (pp. 9–49). New York, NY: Academic Press.

Cox, D.R. (1961). Tests of separate families of hypotheses. Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability, 1, 105–123.

Ghosh, J.K. (1971). A new proof of the Bahadur representation of quantiles and an application. Annals of Mathematical Statistics, 42, 1957–1961.

Han, T., Kolen, M.J., Pohlmann, J. (1997). A comparison among IRT true- and observed score equatings and traditional equipercentile equating. Applied Measurement in Education, 10, 105–121.

Kolen, M.J. (1981). Comparison of traditional and item response theory methods for equating tests. Journal of Educational Measurement, 18, 1–11.

Kolen, M.J., Brennan, R.L. (1995). Test equating: Methods and practices. New York, NY: Springer.

Liou, M., Cheng, P.E. (1995). Asymptotic standard error of equipercentile equating. Journal of Educational and Behavioral Statistics, 20, 259–286.

Liou, M., Cheng, P.E., Johnson, E. (1997). Standard errors of the kernel equating methods under the common-item design. Applied Psychological Measurement, 21, 349–369.

Lord, F.M. (1977). Practical applications of item characteristic curve theory. Journal of Educational Measurement, 14, 117–138.

Lord, F.M. (1980). Applications of item response theory to practical testing problems. Hillsdale, NJ: Erlbaum.

Lord, F.M. (1982). Item response theory and equating: A technical summary. In Holland, P.W., Rubin, D.B. (Eds.), Test equating (pp. 141–148). New York, NY: Academic Press.

Lord, F.M. (1982). Standard errors of an equating by item response theory. Applied Psychological Measurement, 6, 463–472.

Lord, F.M. (1982). The standard error of equipercentile equating. Journal of Educational Statistics, 7, 165–174.

Lord, F.M., Wingersky, M.S. (1984). Comparison of IRT true-score and equipercentile observed-score “equatings”. Applied Psychological Measurement, 8, 453–461.

Loyd, B.H., Hoover, H.D. (1980). Vertical equating using the Rasch model. Journal of Educational Measurement, 17, 179–193.

Ogasawara, H. (2000). Asymptotic standard errors of IRT equating coefficients using moments. Economic Review (Otaru University of Commerce), 51(1), 1–23.

Ogasawara, H. (2001). Standard errors of item response theory equating/linking by response function methods. Applied Psychological Measurement, 25, 53–67.

Ogasawara, H. (2001). Item response theory true score equatings and their standard errors. Journal of Educational and Behavioral Statistics, 26, 31–50.

Rubin, D.B. (1982). Discussion of “Observed-score test equating: A mathematical analysis of some ETS equating procedures”. In Holland, P.W., Rubin, D.B. (Eds.), Test equating (pp. 51–54). New York, NY: Academic Press.

Stocking, M.L., Lord, F.M. (1983). Developing a common metric in item response theory. Applied Psychological Measurement, 7, 201–210.

Tsai, T.-H., Hanson, B.A., Kolen, M.J., Forsyth, R.A. (2001). A comparison of bootstrap standard errors of IRT equating methods for the common item nonequivalent groups design. Applied Measurement in Education, 14, 17–30.

van der Linden, W.J. (2000). A test-theoretic approach to observed-score equating. Psychometrika, 65, 437–456.

van der Linden, W.J., Luecht, R.M. (1998). Observed-score equating as a test assembly problem. Psychometrika, 63, 401–418.

Zeng, L., Kolen, M.J. (1995). An alternative approach for IRT observed-score equating of number-correct scores. Applied Psychological Measurement, 19, 231–241.
