Hostname: page-component-5f745c7db-j9pcf Total loading time: 0 Render date: 2025-01-06T21:17:19.457Z Has data issue: true hasContentIssue false

On the Partitioning of Squared Euclidean Distance and its Applications in Cluster Analysis

Published online by Cambridge University Press:  01 January 2025

Randy L. Carter*
Affiliation:
Department of Statistics, Division of Biostatistics J. Hillis Miller Health Center, University of Florida
Robin Morris
Affiliation:
Department of Psychology, Georgia State University
Roger K. Blashfield
Affiliation:
Department of Psychiatry, J. Hillis Miller Health Center, University of Florida
*
Requests for reprints should be sent to Randy L. Carter, Box J-212, J. Hillis Miller Health Center, Gainesville, FL 32610.

Abstract

The partitioning of squared Eucliean distance between two vectors in M-dimensional space into the sum of squared lengths of vectors in mutually orthogonal subspaces is discussed and applications given to specific cluster analysis problems. Examples of how the partitioning idea can be used to help describe and interpret derived clusters, derive similarity measures for use in cluster analysis, and to design Monte Carlo studies with carefully specified types and magnitudes of differences between the underlying population mean vectors are presented. Most of the example applications presented in this paper involve the clustering of longitudinal data, but their use in cluster analysis need not be limited to this arena.

Type
Original Paper
Copyright
Copyright © 1989 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

The authors wish to thank Dr. Michael Resnick of the Department of Pediatrics at the University of Florida, Dr. Mario Ariet of the Department of Medicine, University of Florida and Children's Medical Services of the State of Florida for permission to use their child development data in our example. We also wish to express our appreciation to the referees and editors whose thorough reviews and detailed comments resulted in significant improvements to an earlier manuscript.

References

Anderberg, M. R. (1973). Cluster analysis for applications, New York: Academic Press.Google Scholar
Blashfield, R. K. (1984). Classification of psychopathology, New York: Plenum.CrossRefGoogle Scholar
Cormack, R. M. (1971). A review of classification. The Journal of the Royal Statistical Society, 134, 321367.CrossRefGoogle Scholar
Cronbach, L. J., Gleser, G. C. (1953). Assessing similarity between profiles. Psychological Bulletin, 50, 456473.CrossRefGoogle ScholarPubMed
Graham, J. R. (1977). The MMPI: A practical guide, New York: Oxford University Press.Google Scholar
Milligan, G. W. (1981). review of Monte Carlo tests of cluster analysis. Multivariate Behavioral Research, 16, 379407.CrossRefGoogle ScholarPubMed
Rand, W. M. (1971). Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association, 66, 846850.CrossRefGoogle Scholar
Rice, C. E., Mattsson, N. B. (1966). Types based on repeated measurement. In Lorr, M. (Eds.), Explorations in typing psychotics (pp. 151191). Oxford: Pergamon Press.Google Scholar
Scheffé, H. (1959). The analysis of variance, New York: Wiley.Google Scholar
Skinner, H. A. (1978). Differentiating the contribution of elevation, scatter and shape in profile similarity. Educational and Psychological measurement, 38, 297308.CrossRefGoogle Scholar
Skinner, H. A. (1979). A model of psychopathology based on the MMPI. In Newmark, C. S. (Eds.), MMPI: Clinical and research trends (pp. 276305). New York: Praeger.Google Scholar
Ward, J. H. Jr. (1963). Hierarchical grouping to optimize an objective function. Journal of the American Statistical Association, 58, 236244.CrossRefGoogle Scholar