EFFICIENCY IN ESTIMATION UNDER MONOTONIC ATTRITION

Jean-Louis Barnwell; Saraswata Chaudhuri

doi:10.1017/S0266466624000203

EFFICIENCY IN ESTIMATION UNDER MONOTONIC ATTRITION

Published online by Cambridge University Press: 16 September 2024

Jean-Louis Barnwell and

Saraswata Chaudhuri

Show author details

Jean-Louis Barnwell: Affiliation:
Analysis Group
Saraswata Chaudhuri*: Affiliation:
McGill University and CIREQ
*: Address correspondence to Saraswata Chaudhuri, Department of Economics, McGill University and CIREQ, Montreal, QC, Canada; e-mail: saraswata.chaudhuri@mcgill.ca.

Article contents

Abstract
Footnotes
References

Get access

Rights & Permissions

Abstract

Attrition is monotonic when agents leaving multi-period studies do not return. Under a general missing at random (MAR) assumption, we study efficiency in estimation of parameters defined by moment restrictions on the distributions of the counterfactuals that were unrealized due to monotonic attrition. We discuss novel issues related to overidentification, usability of sample units, and the information content of various MAR assumptions for estimation of such parameters. We propose a standard doubly robust estimator for these parameters by equating to zero the sample analog of their respective efficient influence functions. Our proposed estimator performs well and vastly outperforms other estimators in our simulation experiment and empirical illustration.

Information

Type: ARTICLES
Information: Econometric Theory , First View , pp. 1 - 34

DOI: https://doi.org/10.1017/S0266466624000203 [Opens in a new window]
Copyright: © The Author(s), 2024. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

Footnotes

We are very grateful to the Editor (Peter C. B. Phillips), the Co-Editor (Patrik Guggenberger), two anonymous referees, and Whitney Newey for their help in improving the paper. We also thank Francesco Amodio, Marine Carrasco, Daniel Farewell, Bryan Graham, Fabian Lange, Steven Lehrer, Thierry Magnac, Erica Moodie, Chris Muris, Tom Parker, Geert Ridder, Youngki Shin, and various conference and seminar participants for helpful comments. Earlier versions of the paper were circulated under different names; e.g., “A note on efficiency in estimation with monotonically missing at random data.” The views presented in this work do not reflect those of Analysis Group. Analysis Group provided no financial support for this work.

References

REFERENCES

Abowd, J. M., Crepon, B., & Kramarz, F. (2001). Moment estimation with attrition: An application to economic models. Journal of the American Statistical Association , 96, 1223–1231.10.1198/016214501753381878CrossRef Google Scholar

Abrevaya, J., & Donald, S. G. (2017). A GMM approach for dealing with missing data on regressors and instruments. Review of Economics and Statistics , 99, 657–662.10.1162/REST_a_00645CrossRef Google Scholar

Achilles, C., Bain, H. P., Bellott, F., Boyd-Zaharias, J., Finn, J., Folger, J., Johnston, J., & Word, E. (2008). Tennessee’s Student Teacher Achievement Ratio (STAR) project.Google Scholar

Ackerberg, D., Chen, X., & Hahn, J. (2012). A practical asymptotic variance estimator for two-step semiparametric estimators. The Review of Economics and Statistics , 94, 481–498.CrossRef Google Scholar

Ackerberg, D., Chen, X., Hahn, J., & Liao, Z. (2014). Asymptotic efficiency of semiparametric two-step GMM. Review of Economic Studies , 81, 919–943.10.1093/restud/rdu011CrossRef Google Scholar

Bang, H., & Robins, J. M. (2005). Doubly robust estimation in missing data and causal inference models. Biometrics , 61, 962–972.10.1111/j.1541-0420.2005.00377.xCrossRef Google Scholar PubMed

Brown, B., & Newey, W. (1998). Efficient semiparametric estimation of expectations. Econometrica , 66, 453–464.10.2307/2998566CrossRef Google Scholar

Cao, W., Tsiatis, A., & Davidian, M. (2009). Improving efficiency and robustness of the doubly robust estimator for a population mean with incomplete data. Biometrika , 96, 723–734.10.1093/biomet/asp033CrossRef Google Scholar PubMed

Chaudhuri, S. (2020). On efficiency gains from multiple incomplete subsamples. Econometric Theory , 36, 488–525.CrossRef Google Scholar

Chen, X., Hong, H., & Tarozzi, A. (2008). Semiparametric efficiency in GMM models with auxiliary data. Annals of Statistics , 36, 808–843.10.1214/009053607000000947CrossRef Google Scholar

Chen, X., Linton, O., & van Keilegom, I. (2003). Estimation of semiparametric models when the criteria function is not smooth. Econometrica , 71, 1591–1608.CrossRef Google Scholar

Chen, X., & Santos, A. (2018). Overidentification in regular models. Econometrica , 86, 1771–1817.10.3982/ECTA13559CrossRef Google Scholar

Chernozhukov, V., Escanciano, J.-C., Ichimura, H., Newey, W., & Robins, J. (2022). Locally robust semiparametric estimation. Econometrica , 90, 1501–1535.10.3982/ECTA16294CrossRef Google Scholar

Chetty, R., Friedman, J. N., Hilger, N., Saez, E., Schanzenbach, D. W., & Yagan, D. (2011). How does your kindergarten classroom affect your earnings? Evidence from Project STAR. The Quarterly Journal of Economics , 126, 1593–1660.CrossRef Google Scholar PubMed

Dardanoni, V., Modica, S., & Peracchi, F. (2011). Regression with imputed covariates: A generalized missing-indicator approach. Journal of Econometrics , 162, 362–368.CrossRef Google Scholar

Ding, W., & Lehrer, S. F. (2010). Estimating treatment effects from contaminated multiperiod education experiments: The dynamic impacts of class size reductions. The Review of Economics and Statistics , 92, 31–42.CrossRef Google Scholar

Fitzgerald, J., Gottschalk, P., & Moffitt, R. (1996). An analysis of sample attrition in panel data: The Michigan Panel Study of Income Dynamics [NBER Working paper].Google Scholar

Gill, R. D., van der Laan, M. J., & Robins, J. M. (1997). Coarsening at random: Characterizations, conjectures and counterexamples. In Lin, D. Y., & Fleming, T. R. (Eds.), Proceedings of the first Seattle symposium in biostatistics: Survival analysis . Lecture Notes in Statistics (pp. 255–294). Springer.10.1007/978-1-4684-6316-3_14CrossRef Google Scholar

Graham, B. S. (2011). Efficiency bounds for missing data models with semiparametric restrictions. Econometrica , 79, 437–452.Google Scholar

Hahn, J. (1998). On the role of the propensity score in efficient semiparametric estimation of average treatment effects. Econometrica , 66, 315–331.10.2307/2998560CrossRef Google Scholar

Hajek, J. (1971). Comment on a paper by D. Basu. In Godambe, V. R., & Sprott, D. A. (Eds.), Foundations of statistical inference (p. 236). Holt, Rinehert and Winston.Google Scholar

Hall, A. R., & Inoue, A. (2003). The large sample behaviour of the generalized method of moments estimator in misspecified models. Journal of Econometrics , 114, 361–394.10.1016/S0304-4076(03)00089-7CrossRef Google Scholar

Hanushek, E. A. (1999). Some findings from an independent investigation of the Tennessee STAR experiment and from other investigations of class size effects. Educational Evaluation and Policy Analysis , 21, 143–163.CrossRef Google Scholar

Hirano, K., Imbens, G., & Ridder, G. (2003). Efficient estimation of average treatment effects using the estimated propensity scores. Econometrica , 71, 1161–1189.10.1111/1468-0262.00442CrossRef Google Scholar

Holcroft, C., Rotnitzky, A., & Robins, J. M. (1997). Efficient estimation of regression parameters from multistage studies with validation of outcome and covariates. Journal of Statistical Planning and Inference , 65, 349–374.10.1016/S0378-3758(97)81749-1CrossRef Google Scholar

Hoonhout, P., & Ridder, G. (2019). Nonignorable attrition in multi-period panels with refreshment samples. Journal of Business and Economic Statistics , 37, 377–390.CrossRef Google Scholar

Horvitz, D., & Thompson, D. (1952). A generalization of sampling without replacement from a finite universe. Journal of American Statistical Association , 47, 663–685.CrossRef Google Scholar

Khan, S., & Tamer, E. (2010). Irregular identification, support conditions, and inverse weight estimation. Econometrica , 78, 2021–2042.Google Scholar

Krueger, A. B. (1999). Experimental estimates of education production functions. Quarterly Journal of Economics , 114, 497–532.10.1162/003355399556052CrossRef Google Scholar

Krueger, A. B., & Whitmore, D. M. (2001). The effect of attending a small class in the early grades on college-test taking and middle school test results: Evidence from Project STAR. The Economic Journal , 111, 1–28.CrossRef Google Scholar

Muris, C. (2020). Efficient GMM estimation with incomplete data. Review of Economics and Statistics , 102, 518–530.10.1162/rest_a_00836CrossRef Google Scholar

Narain, R. D. (1951). On sampling without replacement with varying probabilities. Journal of Indian Soc. Agricultural Statistics , 3, 169–174.Google Scholar

Newey, W. (1994). The asymptotic variance of semiparametric estimators. Econometrica , 62, 1349–1382.CrossRef Google Scholar

Newey, W. K. (1990). Semiparametric efficiency bounds. Journal of Applied Econometrics , 5, 99–135.CrossRef Google Scholar

Nicoletti, C. (2006). Nonresponse in dynamic panel data models. Journal of Econometrics , 132, 461–489.10.1016/j.jeconom.2005.02.008CrossRef Google Scholar

Robins, J. M., & Gill, R. (1997). Non-response models for the analysis of non-monotone ignorable missing data. Statistics in Medicine , 16, 39–56.10.1002/(SICI)1097-0258(19970115)16:1<39::AID-SIM535>3.0.CO;2-D3.0.CO;2-D>CrossRef Google Scholar PubMed

Robins, J. M., & Ritov, Y. (1997). Toward a curse of dimensionality appropriate (CODA) asymptotic theory for semi-parametric models. Statistics in Medicine , 16, 285–319.10.1002/(SICI)1097-0258(19970215)16:3<285::AID-SIM535>3.0.CO;2-#3.0.CO;2-#>CrossRef Google Scholar

Robins, J. M., & Rotnitzky, A. (1992). Recovery of information and adjustment for dependent censoring using surrogate markers. In Jewell, N., Dietz, K., & Farewell, V. T. (Eds.), AIDS epidemiology: Methodological issues (pp. 297–331). Birkhliuser.10.1007/978-1-4757-1229-2_14CrossRef Google Scholar

Robins, J. M., & Rotnitzky, A. (1995). Semiparametric efficiency in multivariate regression models with missing data. Journal of American Statistical Association , 90, 122–129.10.1080/01621459.1995.10476494CrossRef Google Scholar

Robins, J. M., Rotnitzky, A., & Zhao, L. (1994). Estimation of regression coefficients when some regressors are not always observed. Journal of American Statistical Association , 427, 846–866.CrossRef Google Scholar

Robins, J. M., Rotnitzky, A., & Zhao, L. (1995). Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Journal of American Statistical Association , 429, 106–121.CrossRef Google Scholar

Rothe, C., & Firpo, S. (2019). Properties of doubly robust estimators when nuisance functions are estimated nonparametrically. Econometric Theory , 35, 1048–1087.CrossRef Google Scholar

Rotnitzky, A., & Robins, J. M. (1995). Semiparametric regression estimation in the presence of dependent censoring. Biometrika , 82, 805–820.CrossRef Google Scholar

Rubin, D. (1976). Inference and missing data. Biometrika , 63, 581–592.10.1093/biomet/63.3.581CrossRef Google Scholar

Scharfstein, D. O., Rotnitzky, A., & Robins, J. M. (1999). Adjusting for nonignorable drop-out using semiparametric nonresponse models. Journal of the American Statistical Association , 94, 1096–1146.CrossRef Google Scholar

Tan, Z. (2007). Comment: Understanding OR, PS and DR. Statistical Science , 22, 560–568.CrossRef Google Scholar

Tsiatis, A. A. (2006). Semiparametric theory and missing data . Springer.Google Scholar

Vansteelandt, S., Rotnitzky, A., & Robins, J. M. (2007). Estimation of regression models for mean of repeated outcomes under nonignorable nonmonotone nonresponse. Biometrika , 94, 841–860.CrossRef Google Scholar PubMed

Wooldridge, J. M. (2002). Inverse probability weighted M-estimation for sample selection, attrition, and stratification. Portuguese Economic Journal , 1, 117–139.10.1007/s10258-002-0008-xCrossRef Google Scholar

Wooldridge, J. M. (2010). Econometric analysis of cross section & panel data . MIT Press.Google Scholar

Barnwell and Chaudhuri supplementary material

File 429.2 KB

Article contents

EFFICIENCY IN ESTIMATION UNDER MONOTONIC ATTRITION

Abstract

Information

Access options

Article purchase

Temporarily unavailable

Footnotes

References

REFERENCES

Barnwell and Chaudhuri supplementary material

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests