A Nonparametric Test of Missing Completely at Random for Incomplete Multivariate Data

Jun Li; Yao Yu

doi:10.1007/s11336-014-9410-4

Abstract

Missing data occur in many real world studies. Knowing the type of missing mechanisms is important for adopting appropriate statistical analysis procedure. Many statistical methods assume missing completely at random (MCAR) due to its simplicity. Therefore, it is necessary to test whether this assumption is satisfied before applying those procedures. In the literature, most of the procedures for testing MCAR were developed under normality assumption which is sometimes difficult to justify in practice. In this paper, we propose a nonparametric test of MCAR for incomplete multivariate data which does not require distributional assumptions. The proposed test is carried out by comparing the distributions of the observed data across different missing-pattern groups. We prove that the proposed test is consistent against any distributional differences in the observed data. Simulation shows that the proposed procedure has the Type I error well controlled at the nominal level for testing MCAR and also has good power against a variety of non-MCAR alternatives.

Information

References

Chen, H. Y., & Little, R., (1999). A test of missing completely at random from generalised estimating equation with missing data. Biometrika, 86, 1–13.CrossRef Google Scholar

Davison, A. C., & Hinkley, D. V., (1997). Bootstrap methods and their application. Oxford: Cambridge University Press.CrossRef Google Scholar

Efron, B., & Tibshirani, R. (1993). An introduction to bootstrap. London: Chapman & Hall.CrossRef Google Scholar

Fuchs, C. (1982). Maximum likelihood estimation and model selection in contingency tables with missing data. Journal of the American Statistical Association, 77, 270–278.CrossRef Google Scholar

Jamshidian, M. & Jalal, S. (2010). Tests of homoscedasticity, normality and missing completely at random for incomplete multivariate data. Psychometrika, 75, 649–674.CrossRef Google Scholar PubMed

Kim, K. H. & Bentler, P. M., (2002). Tests of homogeneity of means and covariance matrices for multivariate incomplete data. Psychometrika, 67, 609–624.CrossRef Google Scholar

Little, R.J.A. (1988). A test of missing completely at random for multivariate data with missing values. Journal of the American Statistical Association, 83, 1198–1202.CrossRef Google Scholar

Little, R. J. A., & Rubin, D. B., (2002). Statistical analysis with missing data (2nd ed.). New York: Wiley.CrossRef Google Scholar

Qu, A., & Song, P.X.K. (2002). Testing ignorable missingness in estimating equation approaches for longitudinal data. Biometrika, 89, 841–850.CrossRef Google Scholar

Rizzo, M. L., & Székely, G. J., (2010). DISCO analysis: A nonparametric extension of analysis of variance. The Annals of Applied Statistics, 4, 1034–1055.CrossRef Google Scholar

Rubin, D.B. (1976). Inference and missing data. Biometrika, 63, 581–592.CrossRef Google Scholar

Székely, G. J., & Rizzo, M. L. (2005). A new test for multivariate normality. Journal of Multivariate Analysis, 93, 58–80.CrossRef Google Scholar

Crossref Citations

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Lee, Shen-Ming Hwang, Wen-Han and de Dieu Tapsoba, Jean 2016. Estimation in Closed Capture–Recapture Models when Covariates Are Missing at Random. Biometrics, Vol. 72, Issue. 4, p. 1294.

HERTWIG, RALPH 2017. When to consider boosting: some rules for policy-makers. Behavioural Public Policy, Vol. 1, Issue. 2, p. 143.

Shutoh, Nobumichi Nishiyama, Takahiro and Hyodo, Masashi 2017. Bartlett correction to the likelihood ratio test for MCAR with two‐step monotone sample. Statistica Neerlandica, Vol. 71, Issue. 3, p. 184.

Yuan, Ke-Hai Jamshidian, Mortaza and Kano, Yutaka 2018. Missing Data Mechanisms and Homogeneity of Means and Variances–Covariances. Psychometrika, Vol. 83, Issue. 2, p. 425.

Zhang, Shixiao Han, Peisong and Wu, Changbao 2019. A unified empirical likelihood approach for testing MCAR and subsequent estimation. Scandinavian Journal of Statistics, Vol. 46, Issue. 1, p. 272.

Mittelman, Mary S. O’Connor, Maureen K. Donley, Tiffany Epstein-Smith, Cynthia Nguyen, Andrew Nicholson, Roscoe Salant, Rebecca Shirk, Steven D. and Stevenson, Elizabeth 2021. Longitudinal study: understanding the lived experience of couples across the trajectory of dementia. BMC Geriatrics, Vol. 21, Issue. 1,

Shutoh, Nobumichi 2021. Effect of nonnormality on tests for a mean vector with missing data under an elliptically contoured pattern-mixture model. Communications in Statistics - Theory and Methods, Vol. 50, Issue. 19, p. 4448.

Yue, Tingyan and Zhang, Tao 2021. Bayesian network-based missing mechanism identification (BN-MMI) method in medical research. BMC Medical Informatics and Decision Making, Vol. 21, Issue. 1,

2022.

CrossRef

Sun, Xiaoyue and Chen, Mengtong 2022. Associations between perceived material deprivation, social support and violent victimization among Chinese children. Child Abuse & Neglect, Vol. 127, Issue. , p. 105583.

Mukherjee, Kumar Gunsoy, Necdet B. Kristy, Rita M. Cappelleri, Joseph C. Roydhouse, Jessica Stephenson, Judith J. Vanness, David J. Ramachandran, Sujith Onwudiwe, Nneka C. Pentakota, Sri Ram Karcher, Helene and Di Tanna, Gian Luca 2023. Handling Missing Data in Health Economics and Outcomes Research (HEOR): A Systematic Review and Practical Recommendations. PharmacoEconomics, Vol. 41, Issue. 12, p. 1589.

Wang, Hairu Lu, Zhiping and Liu, Yukun 2023. Score Test for Missing at Random or Not under Logistic Missingness Models. Biometrics, Vol. 79, Issue. 2, p. 1268.

Näf, Jeffrey Spohn, Meta-Lina Michel, Loris and Meinshausen, Nicolai 2023. Imputation scores. The Annals of Applied Statistics, Vol. 17, Issue. 3,

Berrett, Thomas B. and Samworth, Richard J. 2023. Optimal nonparametric testing of Missing Completely At Random and its connections to compatibility. The Annals of Statistics, Vol. 51, Issue. 5,

Huang, Linhui Chen, Yuanyuan Zhu, Jianjun and Zhang, Wei 2024. The role of fathers’ and mothers’ acceptance in adaptive emotion regulation in Chinese preadolescents: Distinguishing between- and within-person effects. Children and Youth Services Review, Vol. 166, Issue. , p. 107901.

Cascella, Marco Di Gennaro, Piergiacomo Crispo, Anna Vittori, Alessandro Petrucci, Emiliano Sciorio, Francesco Marinangeli, Franco Ponsiglione, Alfonso Maria Romano, Maria Ovetta, Concetta Ottaiano, Alessandro Sabbatino, Francesco Perri, Francesco Piazza, Ornella and Coluccia, Sergio 2024. Advancing the integration of biosignal-based automated pain assessment methods into a comprehensive model for addressing cancer pain. BMC Palliative Care, Vol. 23, Issue. 1,

Huang, Linhui Chen, Yuanyuan Zhu, Jianjun and Zhang, Wei 2024. Longitudinal associations between deviant peer affiliation and externalizing behavior in Chinese preadolescence: Differentiating between‐person effects from within‐person effects. Journal of Research on Adolescence, Vol. 34, Issue. 4, p. 1529.

Chen, Ruizhe Chung, Yu-Che Basu, Sanjib and Shi, Qian 2024. Diagnostic Test for Realized Missingness in Mixed-type Data. Sankhya B, Vol. 86, Issue. 1, p. 109.

Aleksić, Danijel 2024. A novel test of missing completely at random: U -statistics-based approach . Statistics, Vol. 58, Issue. 4, p. 1004.

Article contents

A Nonparametric Test of Missing Completely at Random for Incomplete Multivariate Data

Abstract

Keywords

Information

Access options

Article purchase

Temporarily unavailable

References

This article has been cited by the following publications. This list is generated based on data provided by Crossref.

Article contents

A Nonparametric Test of Missing Completely at Random for Incomplete Multivariate Data

Abstract

Keywords

Information

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests