Hostname: page-component-5f745c7db-f9j5r Total loading time: 0 Render date: 2025-01-06T22:05:37.907Z Has data issue: true hasContentIssue false

Correspondence Analysis and Optimal Structural Representations

Published online by Cambridge University Press:  01 January 2025

Lawrence Hubert*
Affiliation:
University of Illinois, Champaign
Phipps Arabie
Affiliation:
Graduate School of Management, Rutgers University, Newark
*
Requests for reprints should be sent to Lawrence Hubert, Department of Psychology, University of Illinois, 603 East Daniel Street, Champaign, IL 6t820.

Abstract

Many well-known measures for the comparison of distinct partitions of the same set of n objects are based on the structure of class overlap presented in the form of a contingency table (e.g., Pearson's chi-square statistic, Rand's measure, or Goodman-Kruskal's τb), but they all can be rephrased through the use of a simple cross-product index defined between the corresponding entries from two n ×n proximity matrices that provide particular a priori (numerical) codings of the within- and between-class relationships for each of the partitions. We consider the task of optimally constructing the proximity matrices characterizing the partitions (under suitable restriction) so as to maximize the cross-product measure, or equivalently, the Pearson correlation between their entries. The major result presented states that within the broad classes of matrices that are either symmetric, skew-symmetric, or completely arbitrary, optimal representations are already derivable from what is given by a simple one-dimensional correspondence analysis solution. Besides severely limiting the type of structures that might be of interest to consider for representing the proximity matrices, this result also implies that correspondence analysis beyond one dimension must always be justified from logical bases other than the optimization of a single correlational relationship between the matrices representing the two partitions.

Type
Original Paper
Copyright
Copyright © 1992 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

This research was supported in part by a grant from American Telephone and Telegraph (AT&T) to the Industrial Affiliates Program of the University of Illinois. The acting Editor for this manuscript was Shizuhiko Nishisato.

References

Benzécri, J. P. (1973). L'analyse des données, Volume 1: La taxinomie, Volume 2: L'analyse des correspondances [The analysis of data, Volume 1: Taxonomy, Volume 2: Correspondence analysis], Paris: Dunod.Google Scholar
Goodman, L. A., Kruskal, W. H. (1954). Measures of association for cross-classifications. Journal of the American Statistical Association, 49, 732764.Google Scholar
Greenacre, M. (1984). Theory and applications of correspondence analysis, London: Academic Press.Google Scholar
Hayashi, C. (1950). On the quantification of qualitative data from the mathematico-statistical point of view. Annals of the Institute of Statistical Mathematics, 2, 3547.CrossRefGoogle Scholar
Heiser, W. J. (1981). Unfolding analysis of proximity data, Leiden: University of Leiden, Department of Data Theory.Google Scholar
Heiser, W. J., Meulman, J. (1983). Analyzing rectangular tables by joint and constrained multidimensional scaling. Journal of Econometrics, 22, 139167.CrossRefGoogle Scholar
Hill, M. O. (1973). Reciprocal averaging: An eigenvector method of ordination. Journal of Ecology, 61, 237251.CrossRefGoogle Scholar
Hill, M. O. (1974). Correspondence analysis: A neglected multivariate method. Journal of the Royal Statistical Society, Series C, Applied Statistics, 23, 340354.Google Scholar
Hill, M. O. (1982). Correspondence analysis. In Klotz, S., Johnson, N. L., Read, C. B. (Eds.), Encyclopedia of statistical sciences, Volume 2 (pp. 204210). New York: Wiley.Google Scholar
Hubert, L. J. (1987). Assignment methods in combinatorial data analysis, New York: Marcel Dekker.Google Scholar
Hubert, L. J., Arabie, P. (1985). Comparing partitions. Journal of Classification, 2, 193218.CrossRefGoogle Scholar
Hubert, L. J., Arabie, P. (1986). Unidimensional scaling and combinatorial optimization. In de Leeuw, J., Heiser, W., Meulman, J., Critchley, F. (Eds.), Multidimensional data analysis (pp. 181196). Leiden: DSWO Press.Google Scholar
Lebart, L., Morineau, A., Warwick, K. M. (1984). Multivariate descriptive statistical analysis: Correspondence analysis and related techniques for large matrices, New York: Wiley.Google Scholar
Milligan, G. W., Cooper, M. C. (1986). A study of the comparability of external criteria for hierarchical cluster analysis. Multivariate Behavioral Research, 21, 441458.CrossRefGoogle ScholarPubMed
Mirkin, B. G. (1979). In Fishburn, P. C. (Eds.), Group choice, Washington, DC: V. H. Winston.Google Scholar
Nishisato, S. (1980). Analysis of categorical data: Dual scaling and its applications, Toronto: University of Toronto Press.CrossRefGoogle Scholar
Nishisato, S. (1986). Quantification of categorical data: A bibliography 1975–1986, Toronto: Microstats.Google Scholar
Torgerson, W. S. (1958). Theory and methods of scaling, New York: Wiley.Google Scholar