Paradoxical Results and Item Bundles

Giles Hooker; Matthew Finkelman

doi:10.1007/s11336-009-9143-y

Published online by Cambridge University Press: 01 January 2025

Giles Hooker*: Cornell University
Matthew Finkelman: Tufts School of Dental Medicine
* Requests for reprints should be sent to Giles Hooker, Cornell University, Ithaca, NY, USA. E-mail: giles.hooker@cornell.edu

Abstract

Hooker, Finkelman, and Schwartzman (Psychometrika, 2009, in press) defined a paradoxical result as the attainment of a higher test score by changing answers from correct to incorrect and demonstrated that such results are unavoidable for maximum likelihood estimates in multidimensional item response theory. The potential for these results to occur leads to the undesirable possibility of a subject’s best answer being detrimental to them. This paper considers the existence of paradoxical results in tests composed of item bundles when compensatory models are used. We demonstrate that paradoxical results can occur when bundle effects are modeled as nuisance parameters for each subject. However, when these nuisance parameters are modeled as random effects, or used in a Bayesian analysis, it is possible to design tests composed of many short bundles that avoid paradoxical results, and we provide an algorithm for doing so. We also examine alternative models for handling dependence between item bundles and show that using fixed dependency effects is always guaranteed to avoid paradoxical results.
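
The definition of a paradoxical result can be made concrete with a small numerical sketch. The example below is not taken from the paper and does not model item bundles: it assumes a two-dimensional compensatory logistic model, P(correct) = logistic(a·θ − b), and every item parameter and response pattern in it is invented for illustration. The helper names `log_lik` and `mle` are likewise hypothetical. It computes the maximum likelihood estimate of θ twice, once with the last item answered correctly and once with it answered incorrectly, and compares the estimates on the first dimension.

```python
import numpy as np

def log_lik(theta, a, b, y):
    """Log-likelihood of responses y under a compensatory logistic model."""
    z = a @ theta - b
    return float(np.sum(y * z - np.logaddexp(0.0, z)))

def mle(a, b, y, iters=100, tol=1e-10):
    """Newton's method with step halving for the ability estimate."""
    theta = np.zeros(a.shape[1])
    for _ in range(iters):
        p = 1.0 / (1.0 + np.exp(-(a @ theta - b)))
        grad = a.T @ (y - p)                        # score vector
        info = (a * (p * (1 - p))[:, None]).T @ a   # Fisher information
        step = np.linalg.solve(info, grad)
        t = 1.0
        # Halve the step if it would lower the log-likelihood.
        while t > 1e-8 and log_lik(theta + t * step, a, b, y) < log_lik(theta, a, b, y):
            t /= 2
        theta = theta + t * step
        if np.max(np.abs(t * step)) < tol:
            break
    return theta

# Hypothetical test: four items loading equally on both abilities,
# two items loading on ability 1 only, and a final item loading
# mostly on ability 2.
a = np.array([[1, 1]] * 4 + [[1, 0]] * 2 + [[0.2, 1.5]], dtype=float)
b = np.array([-1.0, -0.5, 0.5, 1.0, -0.5, 0.5, 0.0])

y_right = np.array([1, 1, 0, 0, 1, 0, 1], dtype=float)  # last item correct
y_wrong = y_right.copy()
y_wrong[-1] = 0.0                                       # last item incorrect

theta_right = mle(a, b, y_right)
theta_wrong = mle(a, b, y_wrong)

print("last item correct:  ", theta_right)
print("last item incorrect:", theta_wrong)
print("ability 1 rises after the wrong answer:", theta_wrong[0] > theta_right[0])
```

In this toy test, answering the last item incorrectly lowers the estimate of the second ability, and the estimate of the first ability rises to keep the model consistent with the responses on the items that load on both. That increase on one dimension after a correct answer is changed to incorrect is the kind of result the abstract calls paradoxical.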

Keywords

item response theory; multidimensional item response theory; likelihood; paradoxical results; item bundle; random effects; fixed dependency effects

Information

Type: Theory and Methods
Information: Psychometrika, Volume 75, Issue 2, June 2010, pp. 249–271

DOI: https://doi.org/10.1007/s11336-009-9143-y
Copyright: Copyright © 2010 The Psychometric Society

Footnotes

The authors would like to thank an anonymous referee of Hooker et al. (2009) for suggesting the problem of item bundles.

References

Ackerman, T. (1996). Graphical representation of multidimensional item response theory analyses. Applied Psychological Measurement, 20(4), 311–329.

Bock, R., Gibbons, R., & Muraki, E. (1988). Full-information item factor analysis. Applied Psychological Measurement, 12, 261–280.

Craven, B.D. (1988). Fractional programming. Berlin: Heldermann.

Douglas, J.A., Roussos, L.A., & Stout, W. (1996). Item-bundle DIF hypothesis testing: Identifying suspect bundles and assessing their differential functioning. Journal of Educational Measurement, 33, 465–484.

Finkelman, M., Hooker, G., & Wang, J. (2009). Technical Report BU-1768-M, Department of Biological Statistics and Computational Biology, Cornell University.

Hooker, G., Finkelman, M., & Schwartzman, A. (2009). Paradoxical results in multidimensional item response theory. Psychometrika, 74(3), 419–442.

Hoskens, M., & de Boeck, P. (1997). A parametric model for local dependence among test items. Psychological Methods, 2, 261–277.

Kelderman, H. (1984). Loglinear Rasch model tests. Psychometrika, 49, 223–245.

Li, Y., Bolt, D.M., & Fu, J. (2006). A comparison of alternative models for testlets. Applied Psychological Measurement, 30(1), 3–21.

McCullagh, P., & Nelder, J.A. (1989). Generalized linear models. London: Chapman and Hall/CRC.

Reckase, M. (1985). The difficulty of test items that measure more than one ability. Applied Psychological Measurement, 9, 401–412.

Rijmen, F., Tuerlinckx, F., de Boeck, P., & Kuppens, P. (2003). A nonlinear mixed model framework for item response theory. Psychological Methods, 8(2), 185–205.

Rosenbaum, P.R. (1988). Item bundles. Psychometrika, 53, 349–359.

Veldkamp, B.P. (2002). Multidimensional constrained test assembly. Applied Psychological Measurement, 26(2), 133–146.

Wang, W., & Wilson, M. (2005). The Rasch testlet model. Applied Psychological Measurement, 29(2), 126–149.

Wang, X., Bradlow, E.T., & Wainer, H. (2002). A general Bayesian model for testlets: Theory and applications. Applied Psychological Measurement, 26(1), 109–128.

Wilson, M., & Adams, R.J. (1995). Rasch models for item bundles. Psychometrika, 60, 181–198.
