On the Asymptotic Distribution of Pearson’s X2 in Cross-Validation Samples

Harry Joe; Albert Maydeu-Olivares

doi:10.1007/s11336-005-1284-z

On the Asymptotic Distribution of Pearson’s X2 in Cross-Validation Samples

Published online by Cambridge University Press: 01 January 2025

Harry Joe and

Albert Maydeu-Olivares

Show author details

Harry Joe: Affiliation:
University of British Columbia
Albert Maydeu-Olivares*: Affiliation:
University of Barcelona and Instituto de Empresa
*: Requests for reprints should be sent to Albert Maydeu-Olivares, Faculty of Psychology, University of Barcelona, P. Valle de Hebrón, 171, 0835 Barcelona, Spain. E-mail: amaydeu@ub.edu.

Article contents

Abstract
Footnotes
References

Get access

Rights & Permissions

Abstract

In categorical data analysis, two-sample cross-validation is used not only for model selection but also to obtain a realistic impression of the overall predictive effectiveness of the model. The latter is of particular importance in the case of highly parametrized models capable of capturing every idiosyncracy of the calibrating sample. We show that for maximum likelihood estimators or other asymptotically efficient estimators Pearson’s X2 is not asymptotically chi-square in the two-sample cross-validation framework due to extra variability induced by using different samples for estimation and goodness-of-fit testing. We propose an alternative test statistic, X2xval, obtained as a modification of X2 which is asymptotically chi-square with C - 1 degrees of freedom in cross-validation samples. Stochastically, X2xval≤ X2. Furthermore, the use of X2 instead of X2xval with a χ2C - 1 reference distribution may provide an unduly poor impression of fit of the model in the cross-validation sample.

Keywords

contingency tables item response theory modeling latent class analysis quadratic form statistics goodness-of-fit

Type: Original Paper
Information: Psychometrika , Volume 71 , Issue 3 , September 2006 , pp. 587 - 592

DOI: https://doi.org/10.1007/s11336-005-1284-z [Opens in a new window]
Copyright: Copyright © 2006 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

This paper is dedicated to the memory of Michael V. Levine.

References

Agresti, A. (2002). Categorical data dnalysis, (2nd ed.). Dordrecht: Wiley.CrossRef Google Scholar

Bishop, Y.M.M., Fienberg, S.E., Holland, P.W. (1975). Discrete multivariate analysis, Cambridge, MA: MIT Press.Google Scholar

Bock, R.D., Lieberman, M. (1970). Fitting a response model for n dichotomously scored items. Psychometrika, 35, 179–197.CrossRef Google Scholar

Browne, M.W. (2000). Cross-validation methods. Journal of Mathematical Psychology, 44, 108–132.CrossRef Google Scholar PubMed

Chernyshenko, O.S., Stark, S., Chan, K.-Y., Drasgow, F., Williams, B. (2001). Fitting item response theory models to two personality inventories: Issues and insights. Multivariate Behavioral Research, 36, 523–562.CrossRef Google Scholar PubMed

Collins, L.M., Graham, J.W., Long, J.D., Hansen, W.B. (1994). Crossvalidation of latent class models of early substance use onset. Multivariate Behavioral Research, 29, 165–183.CrossRef Google Scholar PubMed

Drasgow, F., Levine, M.V., Tsien, S., Williams, B., Mead, A. (1995). Fitting polytomous item response theory models to multiple-choice tests. Applied Psychological Measurement, 19, 143–165.CrossRef Google Scholar

Du Toit, M. (2003). IRT from SSI, Lincolnwood, IL: Scientific Software International.Google Scholar

Koehler, K., Larntz, K. (1980). An empirical investigation of goodness-of-fit statistics for sparse multinomials. Journal of the American Statistical Association, 75, 336–344.CrossRef Google Scholar

Levine, M.V. (1984). An introduction to multilinear formula score theory. Measurement series 84-4. Champaign, IL: Model Based Measurement Laboratory.Google Scholar

Lord, F.M., Novick, M.R. (1968). Statistical theories of mental test scores, Reading, MA: Addison-Wesley.Google Scholar

Maydeu-Olivares, A. (2005). Further empirical results on parametric vs. non-parametric IRT modeling of Likert-type personality data. Multivariate Behavioral Research, 40, 275–293.CrossRef Google Scholar

Thissen, D., Chen, W.-H., Bock, R.D. (2003). Multilog (version 7) [Computer software], Lincolnwood, IL: Scientific Software International.Google Scholar

Zucchini, W. (2000). An introduction to model selection. Journal of Mathematical Psychology, 44, 41–61.CrossRef Google Scholar PubMed

Article contents

On the Asymptotic Distribution of Pearson’s X2 in Cross-Validation Samples

Abstract

Keywords

Access options

Footnotes

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests