A Mixed Stochastic Approximation EM (MSAEM) Algorithm for the Estimation of the Four-Parameter Normal Ogive Model

Xiangbin Meng; Gongjun Xu

doi:10.1007/s11336-022-09870-w

Akaike, H. (1998). Information theory and an extension of the maximum likelihood principle. In Selected papers of hirotugu akaike (pp. 199–213). Springer.CrossRef Google Scholar

Allassonnière, S., Kuhn, E., Trouvé, A., et al. Construction of Bayesian deformable models via a stochastic approximation algorithm: A convergence study Bernoulli. (2010 16(3), 641–678.CrossRef Google Scholar

Baker, F. B., & Kim, S.-H. (2004). Item response theory: Parameter estimation techniques. Boca Raton: CRC Press.CrossRef Google Scholar

Barton, M. A., & Lord, F. M. (1981). An upper asymptote for the three-parameter logistic item-response model. ETS Research Report Series, 1981(1), i–8.CrossRef Google Scholar

Battauz, M. (2020). Regularized estimation of the four-parameter logistic model. Psych, 2(4), 269–278.CrossRef Google Scholar

Béguin, A. A., & Glas, C. A. (2001). MCMC estimation and some model-fit analysis of multidimensional IRT models. Psychometrika, 66(4), 541–561.CrossRef Google Scholar

Berger, J. O. (1990). Robust Bayesian analysis: Sensitivity to the prior. Journal of Statistical Planning and Inference, 25(3), 303–328.CrossRef Google Scholar

Camilli, G., & Fox, J.-P. (2015). An aggregate IRT procedure for exploratory factor analysis. Journal of Educational and Behavioral Statistics, 40(4), 377–401.CrossRef Google Scholar

Camilli, G., & Geis, E. (2019). Stochastic approximation EM for large-scale exploratory IRT factor analysis. Statistics in Medicine, 38(21), 3997–4012.CrossRef Google Scholar PubMed

Celeux, G., Hurn, M., & Robert, C. P. (2000). Computational and inferential difficulties with mixture posterior distributions. Journal of the American Statistical Association, 95(451), 957–970.CrossRef Google Scholar

Culpepper, S. A. (2016). Revisiting the 4-parameter item response model: Bayesian estimation and application. Psychometrika, 81(4), 1142–1163.CrossRef Google Scholar PubMed

Culpepper, S. A. (2017). The prevalence and implications of slipping on low-stakes, large-scale assessments. Journal of Educational and Behavioral Statistics, 42(6), 706–725.CrossRef Google Scholar

Delyon, B., Lavielle, M., Moulines, E., et al. Convergence of a stochastic approximation version of the EM algorithm The Annals of Statistics. (1999 27(1), 94–128.CrossRef Google Scholar

DeMars, C. E. (2012). A comparison of limited-information and full-information methods in M plus for estimating item response theory parameters for nonnormal populations. Structural Equation Modeling: A Multidisciplinary Journal, 19(4), 610–632.CrossRef Google Scholar

Feuerstahler, L. M., & Waller, N. G. (2014). Estimation of the 4-parameter model with marginal maximum likelihood. Multivariate Behavioral Research, 49(3), 285.CrossRef Google Scholar PubMed

Fox, J.-P. (2003). Stochastic EM for estimating the parameters of a multilevel IRT model. British Journal of Mathematical and Statistical Psychology, 56(1), 65–81.CrossRef Google Scholar PubMed

Galarza, C. E., Lachos, V. H., & Bandyopadhyay, D. (2017). Quantile regression in linear mixed models: A stochastic approximation EM approach. Statistics and its Interface, 10(3), 471.CrossRef Google Scholar PubMed

Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., & Rubin, D. B. (2013). Bayesian data analysis. Boca Raton: CRC Press.CrossRef Google Scholar

Gu, M. G., & Zhu, H.-T. (2001). Maximum likelihood estimation for spatial models by Markov chain Monte Carlo stochastic approximation. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 63(2), 339–355.CrossRef Google Scholar

Guo, S., & Zheng, C. (2019). The Bayesian expectation–maximization–maximization for the 3plm. Frontiers in Psychology, 10 1175.CrossRef Google Scholar PubMed

Jank, W. (2006). Implementing and diagnosing the stochastic approximation EM algorithm. Journal of Computational and Graphical Statistics, 15(4), 803–829.CrossRef Google Scholar

Kern, J. L., & Culpepper, S. A. (2020). A restricted four-parameter IRT model: The dyad four-parameter normal ogive (Dyad-4PNO) model. Psychometrika, 85(3), 575–599.CrossRef Google Scholar PubMed

Kuhn, E., & Lavielle, M. (2004). Coupling a stochastic approximation version of EM with an MCMC procedure. ESAIM: Probability and Statistics, 8, 115–131.CrossRef Google Scholar

Lavielle, M., & Mbogning, C. (2014). An improved SAEM algorithm for maximum likelihood estimation in mixtures of non linear mixed effects models. Statistics and Computing, 24(5), 693–707.CrossRef Google Scholar

Liao, W.-W., Ho, R.-G., Yen, Y.-C., & Cheng, H.-C. (2012). The four-parameter logistic item response theory model as a robust method of estimating ability despite aberrant responses. Social Behavior and Personality: An International Journal, 40(10), 1679–1694.CrossRef Google Scholar

Loken, E., & Rulison, K. L. (2010). Estimation of a four-parameter item response theory model. British Journal of Mathematical and Statistical Psychology, 63(3), 509–525.CrossRef Google Scholar PubMed

McKinley, R. L., & Mills, C. N. (1985). A comparison of several goodness-of-fit statistics. Applied Psychological Measurement, 9(1), 49–57.CrossRef Google Scholar

Meng, X.-L., & Schilling, S. (1996). Fitting full-information item factor models and an empirical investigation of bridge sampling. Journal of the American Statistical Association, 91(435), 1254–1267.CrossRef Google Scholar

Meng, X., Xu, G., Zhang, J., & Tao, J. (2020). Marginalized maximum a posteriori estimation for the four-parameter logistic model under a mixture modelling framework. British Journal of Mathematical and Statistical Psychology, 73, 51–82.CrossRef Google Scholar

Orlando, M., & Thissen, D. (2000). Likelihood-based item-fit indices for dichotomous item response theory models. Applied Psychological Measurement, 24(1), 50–64.CrossRef Google Scholar

Orlando, M., & Thissen, D. (2003). Further investigation of the performance of s-x2: An item fit index for use with dichotomous item response theory models. Applied Psychological Measurement, 27(4), 289–298.CrossRef Google Scholar

Patsula, L. (1995). A comparison of item parameter estimates and ICCs produced with TESTGRAF and BILOG under different test lengths and sample sizes. University of Ottawa (Canada).Google Scholar

Reise, S. P., & Waller, N. G. (2003). How many IRT parameters does it take to model psychopathology items?. Psychological Methods, 8(2), 164–184.CrossRef Google Scholar PubMed

Robbins, H., & Monro, S. (1951). A stochastic approximation method. The Annals of Mathematical Statistics, 22, 400–407.CrossRef Google Scholar

Rulison, K. L., & Loken, E. (2009). I’ve fallen and I can’t get up: Can high-ability students recover from early mistakes in CAT?. Applied Psychological Measurement, 33(2), 83–101.CrossRef Google Scholar PubMed

Svetina, D., Valdivia, A., Underhill, S., Dai, S., & Wang, X. (2017). Parameter recovery in multidimensional item response theory models under complexity and nonnormality. Applied Psychological Measurement, 41(7), 530–544.CrossRef Google Scholar PubMed

Tang, K. L., Way, W. D., & Carey, P. A. (1993). The effect of small calibration sample sizes on TOEFL IRT-based equating. ETS Research Report Series, 1993(2), 1–38.CrossRef Google Scholar

Tao, J., Shi, N.-Z., & Chang, H.-H. (2012). Item-weighted likelihood method for ability estimation in tests composed of both dichotomous and polytomous items. Journal of Educational and Behavioral Statistics, 37(2), 298–315.CrossRef Google Scholar

Thissen, D. (1982). Marginal maximum likelihood estimation for the one-parameter logistic model. Psychometrika, 47(2), 175–186.CrossRef Google Scholar

von Davier, M. (2009). Is there need for the 3pl model? Guess what?. Measurement: Interdisciplinary Research and Perspectives, 7(2), 110–114.Google Scholar

Waller, N. G., & Feuerstahler, L. (2017). Bayesian modal estimation of the four-parameter item response model in real, realistic, and idealized data sets. Multivariate Behavioral Research, 52(3), 350–370.CrossRef Google Scholar PubMed

Wang, C., Su, S., & Weiss, D. J. (2018). Robustness of parameter estimation to assumptions of normality in the multidimensional graded response model. Multivariate Behavioral Research, 53(3), 403–418.CrossRef Google Scholar PubMed

Wei, G. C., & Tanner, M. A. (1990). A Monte Carlo implementation of the EM algorithm and the poor man’s data augmentation algorithms. Journal of the American Statistical Association, 85(411), 699–704.CrossRef Google Scholar

Wollack, J. A., Bolt, D. M., Cohen, A. S., & Lee, Y.-S. (2002). Recovery of item parameters in the nominal response model: A comparison of marginal maximum likelihood estimation and Markov Chain Monte Carlo estimation. Applied Psychological Measurement, 26(3), 339–352.CrossRef Google Scholar

Yen, W. M. (1981). Using simulation results to choose a latent trait model. Applied Psychological Measurement, 5(2), 245–262.CrossRef Google Scholar

Yen, W. M. (1987). A comparison of the efficiency and accuracy of bilog and logist. Psychometrika, 52(2), 275–291.CrossRef Google Scholar

Yoes, M. (1995). An updated comparison of micro-computer based item parameter estimation procedures used with the 3-parameter IRT model. St. Paul, MN: Assessment Systems Corporation Google Scholar

Zhang, J., Du, H., Zhang, Z., & Tao, J. (2020). Gibbs-slice sampling algorithm for estimating the four-parameter logistic model. Frontiers in Psychology, 11 2121.CrossRef Google Scholar PubMed

Zhang, S., Chen, Y., & Liu, Y. (2020). An improved stochastic EM algorithm for large-scale full-information item factor analysis. British Journal of Mathematical and Statistical Psychology, 73(1), 44–71.CrossRef Google Scholar PubMed

Zhang, X., Wang, C., Weiss, D. J., & Tao, J. (2020c). Bayesian inference for IRT models with non-normal latent trait distributions. Multivariate Behavioral Research 1–21.CrossRef Google Scholar