Hostname: page-component-78c5997874-v9fdk Total loading time: 0 Render date: 2024-11-10T16:46:37.488Z Has data issue: false hasContentIssue false

Linear Models with Outliers: Choosing between Conditional-Mean and Conditional-Median Methods

Published online by Cambridge University Press:  25 January 2021

Jeffrey J. Harden*
Affiliation:
University of North Carolina at Chapel Hill, USA
Bruce A. Desmarais
Affiliation:
University of Massachusetts–Amherst, USA
*
Jeffrey J. Harden, University of North Carolina at Chapel Hill, Department of Political Science, 312 Hamilton Hall, CB #3265, Chapel Hill, NC 27599 Email: jjharden@unc.edu

Abstract

State politics researchers commonly employ ordinary least squares (OLS) regression or one of its variants to test linear hypotheses. However, OLS is easily influenced by outliers and thus can produce misleading results when the error term distribution has heavy tails. Here we demonstrate that median regression (MR), an alternative to OLS that conditions the median of the dependent variable (rather than the mean) on the independent variables, can be a solution to this problem. Then we propose and validate a hypothesis test that applied researchers can use to select between OLS and MR in a given sample of data. Finally, we present two examples from state politics research in which (1) the test selects MR over OLS and (2) differences in results between the two methods could lead to different substantive inferences. We conclude that MR and the test we propose can improve linear models in state politics research.

Type
Research Article
Copyright
Copyright © The Author(s) 2011

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Bassett, Gilbert, and Koenker, Roger. 1978. “Asymptotic Theory of Least Absolute Error Regression.” Journal of the American Statistical Association 73:618–22.CrossRefGoogle Scholar
Berkman, Michael B., and Reenock, Christopher. 2004. “Incremental Consolidation and Comprehensive Reorganization of American State Executive Branches.” American Journal of Political Science 48:796812.CrossRefGoogle Scholar
Cook, R. Dennis. 1977. “Detection of Influential Observation in Linear Regression.” Technometrics 19:1518.Google Scholar
Desmarais, Bruce A., and Harden, Jeffrey J.. N.d. “An Unbiased Model Comparison Test Using Cross-Validation.” http://ssrn.com/abstract=1765845.Google Scholar
Downs, Anthony. 1957. An Economic Theory of Democracy. New York: Harper & Row.Google Scholar
Gujarati, Damodar N. 2003. Basic Econometrics. International ed. New York: McGraw-Hill.Google Scholar
Hao, Lingxin, and Naiman, Daniel Q.. 2007. Quantile Regression. Quantitative Applications in the Social Sciences. Thousand Oaks, CA: Sage.Google Scholar
Hastie, Trevor, and Tibshirani, Robert. 1986. “Generalized Additive Models.” Statistical Science 1:297310.CrossRefGoogle Scholar
Hogan, Robert E. 2008. “Policy Responsiveness and Incumbent Reelection in State Legislatures.” American Journal of Political Science 52:858–73.CrossRefGoogle Scholar
Huber, John D., Shipan, Charles R., and Pfahler, Madelaine. 2001. “Legislatures and Statutory Control of Bureaucracy.” American Journal of Political Science 45:330–45.CrossRefGoogle Scholar
Huber, Peter J. 1964. “Robust Estimation of a Location Parameter.” Annals of Mathematical Statistics 35:73101.CrossRefGoogle Scholar
Johnson, Norman J. 1978. “Modified t Tests and Confidence Intervals for Asymmetrical Populations.” Journal of the American Statistical Association 73:536–44.Google Scholar
Jung, Sin-Ho. 1996. “Quasi-Likelihood for Median Regression Models.” Journal of the American Statistical Association 91:251–57.CrossRefGoogle Scholar
King, Gary. 1986. “How Not to Lie with Statistics: Avoiding Common Mistakes in Quantitative Political Science.” American Journal of Political Science 30:666–87.CrossRefGoogle ScholarPubMed
Koenker, Roger. 2005. Quantile Regression. New York: Cambridge University Press.CrossRefGoogle Scholar
Koenker, Roger, and Bassett, Gilbert. 1978. “Regression Quantiles.” Econometrica 46:3350.CrossRefGoogle Scholar
Krueger, James S., and Lewis-Beck, Michael S.. 2008. “Is OLS Dead?” The Political Methodologist 15(2): 24.Google Scholar
Lai, P. Y., and Lee, Stephen M. S.. 2005. “An Overview of Asymptotic Properties of Lp Regression under General Classes of Error Distributions.” Journal of the American Statistical Association 100:446–58.CrossRefGoogle Scholar
Lange, Kenneth L., Little, Roderick J. A., and Taylor, Jeremy M. G.. 1989. “Robust Statistical Modeling Using the t Distribution.” Journal of the American Statistical Association 84:881–96.Google Scholar
Legendre, Adrien M. 1805. Nouvelles Metermination des Orbites des Comethodes pour la Detes. Paris: F. Didot.Google Scholar
Leighton, Wayne A., and Lopez, Edward J.. 2002. “Committee Assignments and the Cost of Party Loyalty.” Political Research Quarterly 55:5990.CrossRefGoogle Scholar
Lowery, David, and Gray, Virginia. 1995. “The Population Ecology of Gucci Gulch, or the Natural Regulation of Interest Group Numbers in the American States.” American Journal of Political Science 39:129.CrossRefGoogle Scholar
Maire, Christopher, and Boscovich, Ruggero Giuseppe. 1755. De Litteraria Expeditione per Pontificiam Ditionem ad Dimetiendos duos Meridiani Gradus: et Corrigendam Mappam Geographicam, iussu et auspiciis Benedicti XIV Pont. Max. In Typographio Palladis excudebant N. et M. Palearini.Google Scholar
Mineo, Angelo M., and Ruggieri, Mariantonietta. 2005. “A Software Tool for the Exponential Power Distribution: The normalp Package.” Journal of Statistical Software 12(4): 124.CrossRefGoogle Scholar
Morgenthaler, Stephan. 1994. “Small Sample Efficiency and Exact Fit for Cauchy Regression Models.” Statistics and Probability Letters 19:381–85.CrossRefGoogle Scholar
Nadaraya, E. A. 1964. “On Estimating Regression.” Theory of Probability and Its Applications 9:141–42.CrossRefGoogle Scholar
Paul, David M., and Brown, Clyde. 2006. “The Dynamics of Elite Endorsements in Professional Sports Facility Referendums.” State Politics & Policy Quarterly 6:272–99.CrossRefGoogle Scholar
Royston, Patrick, and Altman, Douglas. 1997. “Approximating Statistical Functions by Using Fractional Polynomial Regression.” Statistician 46:411–22.Google Scholar
Smyth, Padhraic. 2000. “Model Selection for Probabilistic Clustering Using Cross-Validated Likelihood.” Statistics and Computing 10:6372.CrossRefGoogle Scholar
Western, Bruce. 1995. “Concepts and Suggestions for Robust Regression Analysis.” American Journal of Political Science 39:786817.CrossRefGoogle Scholar
Wolak, Jennifer, Lowery, David, and Gray, Virginia. 2001. “California Dreaming: Replicating the ESA Model, Unusual Cases, and Comparative State Political Analysis.” State Politics & Policy Quarterly 1:255–72.CrossRefGoogle Scholar