Hostname: page-component-5f745c7db-tvc9f Total loading time: 0 Render date: 2025-01-06T07:30:13.544Z Has data issue: true hasContentIssue false

Joint Maximum Likelihood Estimation for Diagnostic Classification Models

Published online by Cambridge University Press:  01 January 2025

Chia-Yi Chiu*
Affiliation:
Rutgers, The State University of New Jersey
Hans-Friedrich Köhn
Affiliation:
University of Illinois at Urbana-Champaign
Yi Zheng
Affiliation:
Arizona State University
Robert Henson
Affiliation:
University of North Carolina
*
Correspondence should be made to Chia-Yi Chiu, Rutgers, The State University of New Jersey, New Brunswick, NJ, USA. Email: chia-yi.chiu@gse.rutgers.edu

Abstract

Joint maximum likelihood estimation (JMLE) is developed for diagnostic classification models (DCMs). JMLE has been barely used in Psychometrics because JMLE parameter estimators typically lack statistical consistency. The JMLE procedure presented here resolves the consistency issue by incorporating an external, statistically consistent estimator of examinees’ proficiency class membership into the joint likelihood function, which subsequently allows for the construction of item parameter estimators that also have the consistency property. Consistency of the JMLE parameter estimators is established within the framework of general DCMs: The JMLE parameter estimators are derived for the Loglinear Cognitive Diagnosis Model (LCDM). Two consistency theorems are proven for the LCDM. Using the framework of general DCMs makes the results and proofs also applicable to DCMs that can be expressed as submodels of the LCDM. Simulation studies are reported for evaluating the performance of JMLE when used with tests of varying length and different numbers of attributes. As a practical application, JMLE is also used with “real world” educational data collected with a language proficiency test.

Type
Article
Copyright
Copyright © 2016 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Baker, F. B., Kim, S.- H. Item response theory: Parameter estimation techniques, 2New York: Marcel Dekker.CrossRefGoogle Scholar
Birnbaum, A., Load, F. M., Novick, M. R. (1968). Some latent trait models and their use in inferring an examinee’s ability. Statistical theories of mental test scores, Reading, MA: Addison-Wesley. 397479.Google Scholar
Buck, G., Tatsuoka, K. K. (1998). Application of the rule-space procedure to language testing: Examining attributes of a free response listening test. Language Testing, 15, 119157.CrossRefGoogle Scholar
Chiu, C.- Y., Douglas, J. A. (2013). A nonparametric approach to cognitive diagnosis by proximity to ideal response profiles. Journal of Classification, 30, 225250.CrossRefGoogle Scholar
Chiu, C.- Y., Douglas, J. A., Li, X. (2009). Cluster analysis for cognitive diagnosis: Theory and applications. Psychometrika, 74, 633665.CrossRefGoogle Scholar
Chiu, C.- Y., Köhn, H.- F. (2015). Consistency of cluster analysis for cognitive diagnosis: The DINO model and the DINA model revisited. Applied Psychological Measurement, 39, 465479.CrossRefGoogle ScholarPubMed
de la Torre, J. (2011). (2000). The generalized DINA model framework. Psychometrika, 76, 179199.CrossRefGoogle Scholar
de la Torre, J., & Chiu, C.-Y. (2010, April). A general empirical method of Q-matrix validation. Paper presented at the annual meeting of the National Council on Measurement in Education, Denver, CO.Google Scholar
DiBello, L. V., Roussos, L. A., & Stout, W. F. (2007). Review of cognitively diagnostic assessment and a summary of psychometric models. In C. R. Rao & S. Sinharay (Eds.), Handbook of Statistics. Psychometrics (Vol. 26, pp. 979–1030). Amsterdam: Elsevier.Google Scholar
ECPE 2013 report. (2013). Retrieved March 10, 2013, from http://www.cambridgemichigan.org/resources/ecpe/reportsGoogle Scholar
Embretson, S. E., Reise, S. P. Item response theory for psychologists, Mahwah, NJ: Erlbaum.Google Scholar
Feng, Y., Habing, B. T., Huebner, A. (2014). Parameter estimation of the Reduced RUM using the EM algorithm. Applied Psychological Measurement, 38, 137150.CrossRefGoogle Scholar
Haberman, S. J. (2004, May). (2005, September). Joint and conditional maximum likelihood estimation for the Rasch model for binary responses. Research report no. RR-04-20. Princeton, NJ: Educational Testing Service.CrossRefGoogle Scholar
Haberman, S. J., & von Davier, M. (2007). Some notes on models for cognitively based skill diagnosis. In C. R. Rao & S. Sinharay (Eds.), Handbook of statistics. Psychometrics (Vol. 26, pp. 1031–1038). Amsterdam: Elsevier.Google Scholar
Hartz, S. M. (2002). A Bayesian framework for the Unified Model for assessing cognitive abilities: Blending theory with practicality. Doctoral dissertation. Available from ProQuest Dissertations and Theses database. UMI No. 3044108.Google Scholar
Hartz, S. M., & Roussos, L. A. (October 2008). The Fusion Model for skill diagnosis: Blending theory with practicality. Research report no. RR-08-71. Princeton, NJ: Educational Testing Service.CrossRefGoogle Scholar
Henson, R. A., & Templin, J. (2007, April). Large-scale language assessment using cognitive diagnosis models. Paper presented at the annual meeting of the National Council on Measurement in Education, Chicago, IL.Google Scholar
Henson, R. A., Templin, J. L., Willse, J. T. (2009). Defining a family of cognitive diagnosis models using log-linear models with latent variables. Psychometrika, 74, 191210.CrossRefGoogle Scholar
Hoeffding, W. (1963). Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58 (301), 1330.CrossRefGoogle Scholar
Junker, B. W. (1991). Essential independence and likelihood-based ability estimation for polytomous items. Psychometrika, 56, 255278.CrossRefGoogle Scholar
Junker, B. W., Sijtsma, K. (2001). (2007). Cognitive assessment models with few assumptions, and connections with nonparametric item response theory. Applied Psychological Measurement, 25, 258272.CrossRefGoogle Scholar
Leighton, J., Gierl, M. Cognitive diagnostic assessment for education: Theory and applications, Cambridge, UK: Cambridge University Press.CrossRefGoogle Scholar
Leighton, J. P., Gierl, M. J., Hunka, S. (2004). The attribute hierarchy model: An approach for integrating cognitive theory with assessment practice. Journal of Educational Measurement, 41, 205236.CrossRefGoogle Scholar
Liu, Y., Douglas, J. A., Henson, R. A. (2009). Testing person fit in cognitive diagnosis. Applied Psychological Measurement, 33, 579598.CrossRefGoogle Scholar
Lunn, D., Spiegelhalter, D., Thomas, A., Best, N. (2009). The BUGS project: Evolution, critique, and future directions. Statistics in Medicine, 28, 30493067.CrossRefGoogle ScholarPubMed
Macready, G. B., Dayton, C. M. (1977). The use of probabilistic models in the assessment of mastery. Journal of Educational Statistics, 33, 379416.Google Scholar
Muthén, L. K., & Muthén, B. O. (1998–2012). Mplususer’s guide (7th edn.) Los Angeles: Muthén & Muthén.Google Scholar
Neyman, J., Scott, E. L. (1948). (2010). Consistent estimates based on partially consistent observations. Econometrica, 16, 132.CrossRefGoogle Scholar
Robitzsch, A., Kiefer, T., George, A. C., & Uenlue, A. (2015). CDM: Cognitive diagnosis modeling. R package version 3.1-14. Retrieved from the Comprehensive R Archive Network [CRAN] websiteGoogle Scholar
Rupp, A. A., Templin, J. L., Henson, R. A. Diagnostic measurement: Theory, methods, and applications, New York: Guilford.Google Scholar
Tatsuoka, K. (1985). A probabilistic model for diagnosing misconception in the pattern classification approach. Journal of Educational Statistics, 12, 5573.CrossRefGoogle Scholar
Templin, J. L., Henson, R. A. (2006). Measurement of psychological disorders using cognitive diagnosis models. Psychological Methods, 11, 287305.CrossRefGoogle ScholarPubMed
Templin, J., Bradshaw, L. (2014). Hierarchical diagnostic classification models: A family of models for estimating and testing attribute hierarchies. Psychometrika, 79, 317339.CrossRefGoogle ScholarPubMed
Templin, J., Hoffman, L. (2013). (2000). Obtaining diagnostic classification model estimates using Mplus. Educational Measurement: Issues and Practice, 32, 3750.CrossRefGoogle Scholar
Vermunt, J. K., Magidson, J. Latent GOLD’s users’s guide, Boston: Statistical Innovations Inc.Google Scholar
von Davier, M. (2005, September). A general diagnostic model applied to language testing data. Research report no. RR-05-16. Princeton, NJ: Educational Testing Service.CrossRefGoogle Scholar
von Davier, M. (2008). A general diagnostic model applied to language testing data. British Journal of Mathematical and Statistical Psychology, 61, 287301.CrossRefGoogle ScholarPubMed
Wang, S., Douglas, J. (2015). Consistency of nonparametric classification in cognitive diagnosis. Psychometrika, 80, 85100.CrossRefGoogle ScholarPubMed
Zheng, Y., & Chiu, C.-Y. (2014). NPCD: Nonparametric methods for cognitive diagnosis. R package version 1.0-5. http://CRAN.R-project.org/package=NPCDGoogle Scholar