Hostname: page-component-5f745c7db-q8b2h Total loading time: 0 Render date: 2025-01-06T07:50:29.466Z Has data issue: true hasContentIssue false

Assessing the Stability of Principal Components Using Regression

Published online by Cambridge University Press:  01 January 2025

Atanu R. Sinha*
Affiliation:
John E. Anderson Graduate School of Management, University of California, Los Angeles
Bruce S. Buchanan
Affiliation:
Leonard N. Stern School of Business, New York University, New York
*
Send requests for reprints to Atanu R. Sinha, B418 Gold Hall, 110 Westwood Plaza, Los Angeles, CA 90095.

Abstract

This paper presents an analysis, based on simulation, of the stability of principal components. Stability is measured by the expectation of the absolute inner product of the sample principal component with the corresponding population component. A multiple regression model to predict stability is devised, calibrated, and tested using simulated Normal data. Results show that the model can provide useful predictions of individual principal component stability when working with correlation matrices. Further, the predictive validity of the model is tested against data simulated from three non-Normal distributions. The model predicted very well even when the data departed from normality, thus giving robustness to the proposed measure. Used in conjunction with other existing rules this measure will help the user in determining interpretability of principal components.

Type
Original Paper
Copyright
Copyright © 1995 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

The authors would like to thank the four anonymous reviewers and the two editors for their valuable comments. Atanu R. Sinha gratefully acknowledges the research support received from the Marketing Studies Center, AGSM, UCLA.

References

Anderson, T. W. (1951). The asymptotic distribution of certain characteristic roots and vectors. Proceedings of the Second Berkeley Symposium on Mathematical Statistics and Probability (pp. 103130). Berkeley and Los Angeles: University of California Press.CrossRefGoogle Scholar
Anderson, T. W. (1963). Asymptotic theory for principal component analysis. Annals of Mathematical Statistics, 34, 122148.CrossRefGoogle Scholar
Daudin, J. J., Duby, C., Trecourt, P. (1988). Stability of principal component analysis studied by the bootstrap method. Statistics, 19, 241258.CrossRefGoogle Scholar
Diaconis, P., Efron, B. (1983). Computer intensive methods in statistics. Scientific American, 248(5), 116130.CrossRefGoogle Scholar
Girschik, M. A. (1936). Principal components. Journal of the American Statistical Association, 31, 519528.CrossRefGoogle Scholar
Girschik, M. A. (1939). On the sampling theory of roots of determinantal equations. Annals of Mathematical Statistics, 10, 203224.CrossRefGoogle Scholar
Green, B. F. (1977). Parameter sensitivity in multivariate methods. Journal of Multivariate Behavioral Research, 12, 263287.CrossRefGoogle ScholarPubMed
Guadagnoli, E., Velicer, W. F. (1988). Relation of sample size to the stability of component patterns. Psychological Bulletin, 103(2), 265275.CrossRefGoogle Scholar
Jackson, J. E. (1991). A user's guide to principal components, New York: Wiley.CrossRefGoogle Scholar
Krzanowski, W. J. (1984). Sensitivity of principal components. Journal of the Royal Statistical Society, Series B, 46(3), 558563.CrossRefGoogle Scholar
Lambert, Z. V., Wildt, A. R., Durand, R. M. (1991). Approximating confidence intervals for factor loadings. Journal of Multivariate Behavioral Research, 26(3), 421434.CrossRefGoogle ScholarPubMed
Morrison, D. F. (1976). Multivariate Statistical Methods 2nd ed.,, New York: McGraw-Hill.Google Scholar
Pack, P., Jolliffe, I. T., Morgan, B. J. T. (1988). Influential observations in principal components analysis: A case study. Journal of Applied Statistics, 15(1), 3952.CrossRefGoogle Scholar
Srivastava, M. S., Khatri, C. G. (1979). An Introduction to Multivariate Statistics, New York: North Holland.Google Scholar
Zwick, W. R., Velicer, W. F. (1986). Comparison of five rules for determining the number of components to retain. Psychological Bulletin, 99(3), 432442.CrossRefGoogle Scholar