The Geometry of Enhancement in Multiple Regression

Niels G. Waller

doi:10.1007/s11336-011-9220-x

The Geometry of Enhancement in Multiple Regression

Published online by Cambridge University Press: 01 January 2025

Niels G. Waller

Show author details

Niels G. Waller*: Affiliation:
University of Minnesota
*: Requests for reprints should be sent to Niels G. Waller, Department of Psychology, University of Minnesota, N657 Elliott Hall, Minneapolis, MN, 55455, USA. E-mail: nwaller@umn.edu

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

In linear multiple regression, “enhancement” is said to occur when R2=b′r>r′r, where b is a p×1 vector of standardized regression coefficients and r is a p×1 vector of correlations between a criterion y and a set of standardized regressors, x. When p=1 then b≡r and enhancement cannot occur. When p=2, for all full-rank Rxx≠I, Rxx=E[xx′]=VΛV′ (where VΛV′ denotes the eigen decomposition of Rxx; λ1>λ2), the set \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$\boldsymbol{B}_{1}:=\{\boldsymbol{b}_{i}:R^{2}=\boldsymbol{b}_{i}'\boldsymbol{r}_{i}=\boldsymbol{r}_{i}'\boldsymbol{r}_{i};0 \ltR^{2}\le1\}$\end{document} contains four vectors; the set \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$\boldsymbol{B}_{2}:=\{\boldsymbol{b}_{i}: R^{2}=\boldsymbol{b}_{i}'\boldsymbol{r}_{i}\gt\boldsymbol{r}_{i}'\boldsymbol{r}_{i}$\end{document}; \documentclass[12pt]{minimal}\usepackage{amsmath}\usepackage{wasysym}\usepackage{amsfonts}\usepackage{amssymb}\usepackage{amsbsy}\usepackage{mathrsfs}\usepackage{upgreek}\setlength{\oddsidemargin}{-69pt}\begin{document}$0\lt R^{2}\le1;R^{2}\lambda_{p}\leq\boldsymbol{r}_{i}'\boldsymbol{r}_{i}\lt R^{2}\}$\end{document} contains an infinite number of vectors. When p≥3 (and λ1>λ2>⋯>λp), both sets contain an uncountably infinite number of vectors. Geometrical arguments demonstrate that B1 occurs at the intersection of two hyper-ellipsoids in ℝp. Equations are provided for populating the sets B1 and B2 and for demonstrating that maximum enhancement occurs when b is collinear with the eigenvector that is associated with λp (the smallest eigenvalue of the predictor correlation matrix). These equations are used to illustrate the logic and the underlying geometry of enhancement in population, multiple-regression models. R code for simulating population regression models that exhibit enhancement of any degree and any number of predictors is included in Appendices A and B.

Keywords

suppression suppressor variable multiple regression

Information

Type: Original Paper
Information: Psychometrika , Volume 76 , Issue 4 , October 2011 , pp. 634 - 649

DOI: https://doi.org/10.1007/s11336-011-9220-x [Opens in a new window]
Copyright: Copyright © 2011 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Bertrand, P.V. (1998). Constructing explained and explanatory variables with strange statistical analysis results. The Statistician, 47, 377–383.CrossRef Google Scholar

Bertrand, P.V., Holder, R.L. (1988). A quirk in multiple-regression: the whole regression can be greater than the sum of its parts. The Statistician, 37(4), 371–374.CrossRef Google Scholar

Briel, J.B., O’Neill, K., Scheunernan, J.D. (1993). GRE technical manual, Princeton: Educational Testing Service.Google Scholar

Bryant, P. (1982). Geometry, statistics, probability: variations on a common theme. The American Statistician, 38(1), 38–48.CrossRef Google Scholar

Conger, A.J. (1974). A revised definition for suppressor variables: a guide to their identification and interpretation. Educational and Psychological Measurement, 34(1), 35–46.CrossRef Google Scholar

Cox, D.R. (1968). Notes on some aspects of regression analysis. Journal of the Royal Statistical Society, Series A, 131, 265–279.CrossRef Google Scholar

Crocker, L., Algina, J. (1986). Introduction to classical and modern test theory, New York: Holt, Rinehart, and Winston.Google Scholar

Cuadras, C.M. (1993). Interpreting an inequality in multiple regression. The American Statistician, 47(4), 256–258.CrossRef Google Scholar

Currie, I., Korabinski, A. (1984). Some comments on bivariate regression. The Statistician, 33(3), 283–292.CrossRef Google Scholar

Dayton, C.M. (1972). A method for constructing data which illustrate a suppressor variable. The American Statistician, 26(5), 36.CrossRef Google Scholar

Derksen, S., Keselman, H.J. (1992). Backward, forward and stepwise automated subset selection algorithms: frequency of obtaining authentic and noise variables. British Journal of Mathematical and Statistical Psychology, 45(2), 265–282.CrossRef Google Scholar

Dicken, C. (1963). Good impression, social desirability, and acquiescence as suppressor variables. Educational and Psychological Measurement, 23(4), 699–720.CrossRef Google Scholar

Ferguson, C.C. (1979). Intersections of ellipsoids and planes of arbitrary orientation and position. Mathematical Geology, 11(3), 329–333.CrossRef Google Scholar

Freund, R.J. (1988). When \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$R^{2}\gt r_{yx_{1}}^{2}+r_{yx_{2}}^{2}$\end{document}

(revisited). The American Statistician, 42(1), 89–90.Google Scholar

Friedman, L., Wall, M. (2005). Graphical views of suppression and multicollinearity in multiple linear regression. The American Statistician, 59(2), 127–136.CrossRef Google Scholar

Graybill, F.A. (1983). Matrices with applications in statistics, (2nd ed.). London: Thomson Learning.Google Scholar

Gromping, U. (2007). Estimators of relative importance in linear regression based on variance decomposition. The American Statistician, 61(2), 139–147.CrossRef Google Scholar

Hadi, A.S. (1996). Matrix algebra as a tool, Belmont: Duxbury Press.Google Scholar

Hamilton, D. (1987). Sometimes \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$R^{2}\gt r_{yx_{1}}^{2}+r_{yx_{2}}^{2}$\end{document}

: correlated variables are not always redundant. The American Statistician, 41(2), 129–132.Google Scholar

Hamilton, D. (1988). Reply. The American Statistician, 42, 90–91.Google Scholar

Hawkins, D.M., Fatti, L.P. (1984). Exploring multivariate data using the minor principal components. The Statistician, 33(4), 325–338.CrossRef Google Scholar

Holling, H. (1983). Suppressor structures in the general linear model. Educational and Psychological Measurement, 43(1), 1–9.CrossRef Google Scholar

Horst, P. (1941). The prediction of personal adjustment (Bulletin No. 48), New York: Social Science Research Council.Google Scholar

Hotelling, H. (1957). The relationship of the newer statistical multivariate statistical methods to factor analysis. British Journal of Statistical Psychology, 10(2), 69–79.CrossRef Google Scholar

Kendall, M.G., Stuart, A. (1973). The advanced theory of statistics, (3rd ed.). London: Charles Griffin and Company.Google Scholar

Kuncel, N.R., Hezlett, S.A., Ones, D.S. (2001). A comprehensive meta-analysis of the predictive validity of the graduate record examinations: implications for graduate student selection and performance. Psychological Bulletin, 127(1), 162–181.CrossRef Google Scholar

Lewis, J.W., Escobar, L.A. (1986). Suppression and enhancement in bivariate regression. The Statistician, 35(1), 17–26.CrossRef Google Scholar

Lipovetsky, S., Conklin, M. (2004). Enhance-synergism and suppression effects in multiple regression. International Journal of Mathematics Education in Science and Technology, 35(3), 391–402.CrossRef Google Scholar

Magnus, J.R., Neudecker, H. (1999). Matrix differential calculus with applications in statistics and econometrics, New York: Wiley.Google Scholar

Marsaglia, G., Olkin, I. (1984). Generating correlation-matrices. SIAM Journal on Scientific and Statistical Computing, 5(2), 470–475.CrossRef Google Scholar

Maassen, G.H., Bakker, A.B. (2001). Suppressor variables in path models: definitions and interpretations. Sociological Methods & Research, 30(2), 241–270.CrossRef Google Scholar

Meehl, P.E. (1945). A simple algebraic development of Horst’s suppressor variables. The American Journal of Psychology, 58(4), 550–554.CrossRef Google Scholar

Mitra, S. (1988). The relationship between the multiple and the zero-order correlation coefficients. The American Statistician, 42(1), 89.Google Scholar

Nickerson, C. (2008). Mutual suppression: comment on Paulhus et al. (2004). Multivariate Behavioral Research, 43(4), 556–563.CrossRef Google Scholar

Paulhus, D.L., Robins, R.W., Trzesniewski, K.H., Tracy, J.L. (2004). Two replicable suppressor situations in personality research. Multivariate Behavioral Research, 39(2), 301–326.CrossRef Google Scholar PubMed

R Development Core Team (2011). R: A language and environment for statistical computing, Vienna: R Foundation for Statistical Computing http://www.R-project.org/.Google Scholar

Rao, C., Mitra, S. (1971). Generalized inverse of the matrix and its applications, New York: Wiley.Google Scholar

Routledge, R.D. (1990). When stepwise regression fails: correlated variables some of which are redundant. International Journal of Mathematical Education in Science and Technology, 21(3), 403–410.CrossRef Google Scholar

Saville, D.J., Wood, G.R. (1986). A method for teaching statistics using N-dimensional geometry. The American Statistician, 40, 205–214.Google Scholar

Schey, H.M. (1993). The relationship between the magnitude of SSR(x2) and SSR(x2|x1): a geometric description. The American Statistician, 47(1), 26–30.Google Scholar

Schmidt, F., Hunter, J.E. (1996). Measurement error in psychological research: lessons from 26 research scenarios. Psychological Methods, 1(2), 199–223.CrossRef Google Scholar

Sharpe, N., Roberts, R. (1997). The relationship among sums of squares, correlation coefficients, and suppression. The American Statistician, 51(1), 46–48.CrossRef Google Scholar

Shieh, G. (2001). The inequality between the coefficient of determination and the sum of squared simple correlation coefficients. The American Statistician, 55(2), 121–124.CrossRef Google Scholar

Taleb, N.N. (2007). The black swan: the impact of the highly improbable, New York: Random House.Google Scholar

Tzelgov, J., Henik, A. (1985). A definition of suppression situations for the general linear model: a regression weights approach. Educational and Psychological Measurement, 45(2), 281–284.CrossRef Google Scholar

Tzelgov, J., Henik, A. (1991). Suppression situations in psychological research: definitions, implications, and applications. Psychological Bulletin, 109(3), 524–536.CrossRef Google Scholar

Waller, N., Jones, J. (2010). Correlation weights in multiple regression. Psychometrika, 75(1), 58–69.CrossRef Google Scholar

Wickens, T. (1995). The geometry of multivariate statistics, Mahwah: Lawrence Erlbaum Associates.Google Scholar

Wiggins, J.S. (1973). Personality and prediction: principles of personality assessment, New York: Addison-Wesley.Google Scholar

Article contents

The Geometry of Enhancement in Multiple Regression

Abstract

Keywords

Information

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests