Hostname: page-component-745bb68f8f-d8cs5 Total loading time: 0 Render date: 2025-01-07T17:40:30.489Z Has data issue: false hasContentIssue false

Representation of Structure in Similarity Data: Problems and Prospects

Published online by Cambridge University Press:  01 January 2025

Roger N. Shepard*
Affiliation:
Stanford University

Conclusion

After struggling with the problem of representing structure in similarity data for over 20 years, I find that a number of challenging problems still remain to be overcome—even in the simplest case of the analysis of a single symmetric matrix of similarity estimates. At the same time, I am more optimistic than ever that efforts directed toward surmounting the remaining difficulties will reap both methodological and substantive benefits. The methodological benefits that I forsee include both an improved efficiency and a deeper understanding of “discovery” methods of data analysis. And the substantive benefits should follow, through the greater leverage that such methods will provide for the study of complex empirical phenomena—perhaps particularly those characteristic of the human mind.

Type
Original Paper
Copyright
Copyright © 1974 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

1

Presidential address delivered before the annual meeting of the Psychometric Society at Stanford University on March 28, 1974 (in a somewhat revised form necessarily, for purposes of publication, with more words and fewer pictures). Much of the work surveyed in this paper was supported by the Bell Telephone Laboratories, during my eight years as a member of the technical staff there, and by grants (principally, GS-1302, GS-2283, and GB-31971X) from the National Science Foundation, during my subsequent eight years at Harvard and at Stanford. I wish to acknowledge, too, the important contributions to this work made by my many students and associates during these 16 years including, particularly, Doug Carroll, Jih-Jie Chang, Steve Johnson, and Joe Kruskal (at the Bell Laboratories) and Phipps Arabie, Glen Crawford, and Jim Cunningham (at Stanford). Finally, I am indebted to Sherry Huntsberger for suggestions and extensive help in connection with the preparation of the paper itself, and to Bert Green for useful editorial comments.

References

Abelson, R. P., and Tukey, J. W. Efficient conversion of nonmetric information into metric information. Proceedings of the American Statistical Association Meetings, Social Statistics Section, 1959, 226230.Google Scholar
Anderson, J. R., and Bower, G. H. Human associative memory, 1973, Washington, D. C.: V. H. Winston & Sons.Google Scholar
Arabie, P. Concerning Monte Carlo evaluations of nonmetric multidimensional scaling algorithms. Psychometrika, 1973, 38, 607608.CrossRefGoogle Scholar
Arabie, P. and Boorman, S. A. Multidimensional scaling of measures of distance between partitions. Journal of Mathematical Psychology, 1973, 10, 148203.CrossRefGoogle Scholar
Arabie, P. and Shepard, R. N. Representation of similarities as additive combinations of discrete overlapping properties. Presented at the Mathematical Psychology Meeting in Montreal, August, 1973.Google Scholar
Arnold, J. B. A multidimensional scaling study of semantic distance. Journal of Experimental Psychology Monograph, 1971, 90, 349372.CrossRefGoogle Scholar
Attneave, F. Dimensions of similarity. American Journal of Psychology, 1950, 63, 516556.CrossRefGoogle ScholarPubMed
Beals, R., Krantz, D. H., and Tversky, A.. Foundations of multidimensional scaling. Psychological Review, 1968, 75, 127142.CrossRefGoogle ScholarPubMed
Bennett, R. S. The intrinsic dimensionality of signal collections. Doctoral dissertation, The Johns Hopkins University, 1965. University Microfilms, Inc., Ann Arbor, Michigan.Google Scholar
Bennett, R. S. The intrinsic dimensionality of signal collections. IEEE Transactions on Information Theory, 1969, IT-15(5), 517525.CrossRefGoogle Scholar
Blank, A. A. Axiomatics of binocular vision: The foundations of metric geometry in relation to space perception. Journal of the Optical Society of America, 1958, 48, 328334.CrossRefGoogle ScholarPubMed
Blank, A. A. The Luneburg theory of binocular space perception. In Koch, S. (Eds.), Psychology: A study of a science. New York: McGraw-Hill. 1959, 395426.Google Scholar
Blumenthal, L. M.. Theory and applications of distance geometry, 1953, Oxford: Clarendon Press.Google Scholar
Boyd, J. P. Information distance for discrete structures. In Shepard, R. N., Romney, A. K., & Nerlove, S. B. (Eds.), Multidimensional scaling: Theory and applications in the behavioral sciences, Vol. I.. New York: Seminar Press. 1972, 213223.Google Scholar
Busemann, H. The geometry of geodesics, 1955, New York: Academic Press.Google Scholar
Carroll, J. D. Individual differences and multidimensional scaling. In Shepard, R. N., Romney, A. K., & Nerlove, S. B. (Eds.), Multidimensional scaling: Theory and applications in the behavioral sciences, Vol. I. New York: Seminar Press. 1972, 105155 (a).Google Scholar
Carroll, J. D. Algorithms for rotation and interpretation of dimensions and for configuration matching. Bell Telephone Laboratories, 1972. (b) (Multilithed handout prepared for the workshop on multidimensional scaling, held at the University of Pennsylvania in June, 1972.).Google Scholar
Carroll, J. D., and Chang, J.-J. Analysis of individual differences in multidimensional scaling via an N-way generalization of “Eckart-Young” decomposition. Psychometrika, 1970, 35, 283319.CrossRefGoogle Scholar
Chang, J.-J., and Shepard, R. N. Exponential fitting in the proximity analysis of confusion matrices. Presented at the annual meeting of the Eastern Psychological Association, New York, April 14, 1966.Google Scholar
Clark, H. H. Linguistic processes in deductive reasoning. Psychological Review, 1969, 76, 387404.CrossRefGoogle Scholar
Clark, H. H. Word associations and linguistic theory. In Lyons, J. (Eds.), New horizons in linguistics. Baltimore, Maryland: Penguin Books. 1970, 271286.Google Scholar
Coombs, C. A theory of data, 1964, New York: Wiley.Google Scholar
Cross, D. V. Metric properties of multidimensional stimulus generalization. In Mostofsky, D. I. (Eds.), Stimulus generalization. Stanford, Calif.: Stanford University Press. 1965, 7293.Google Scholar
Cunningham, J. P. On finding an optimal tree realization of a proximity matrix. Presented at the mathematical psychology meeting, Ann Arbor, Michigan, August 29, 1974.Google Scholar
Cunningham, J. P., and Shepard, R. N. Monotone mapping of similarities into a general metric space. Journal of Mathematical Psychology, 1974, 11, (in press).CrossRefGoogle Scholar
Degerman, R. L. Multidimensional analysis of complex structure: Mixtures of class and quantitative variation. Psychometrika, 1970, 35, 475491.CrossRefGoogle Scholar
Ekman, G. Dimensions of color vision. Journal of Psychology, 1954, 38, 467474.CrossRefGoogle Scholar
Fillenbaum, S., and Rapoport, A. Structures in the subjective lexicon, 1971, New York: Academic Press.Google Scholar
Foley, J. M. The size-distance relation and the intrinsic geometry of visual space. Vision Research, 1972, 12, 323332.CrossRefGoogle ScholarPubMed
Greenberg, J. H. Language universals, 1966, The Hague: Mouton.Google Scholar
Guttman, L. A new approach to factor analysis: The radex. In Lazarsfeld, P. F. (Eds.), Mathematical thinking in the social sciences. Glencoe, Illinois: Free Press. 1954, 258348.Google Scholar
Guttman, L. A generalized simplex for factor analysis. Psychometrika, 1955, 20, 173192.CrossRefGoogle Scholar
Guttman, N., and Kalish, H. I. Discriminability and stimulus generalization. Journal of Experimental Psychology, 1956, 51, 7988.CrossRefGoogle ScholarPubMed
Halle, M. On the bases of phonology. In Fodor, J. A., Katz, J. J. (Eds.), The structure of language. Englewood Cliffs, N. J.: Prentice-Hall. 1964, 324333.Google Scholar
Harshman, R. A. Foundations of the parafac procedure: Models and conditions for an explanatory multi-modal factor analysis. Unpublished doctoral dissertation, University of California at Los Angeles, 1970.Google Scholar
Haviland, S. E., and Clark, E. V. ‘This man's father is my father's son’: A study of the acquisition of English kin terms. Journal of Child Language, 1974, 1, 2347.CrossRefGoogle Scholar
Helm, C. E. Multidimensional ratio scaling analysis of perceived color relations. Journal of the Optical Society of America, 1964, 54, 256262.CrossRefGoogle ScholarPubMed
Hyman, R., and Well, A. Judgments of similarity and spatial models. Perception and Psychophysics, 1967, 2, 233248.CrossRefGoogle Scholar
Hyman, R., and Well, A. Perceptual separability and spatial models. Perception and Psychophysics, 1968, 3, 161165.CrossRefGoogle Scholar
Indow, T. Applications of multidimensional scaling in perception. In Carterette, E. C. and Friedman, M. P. (Eds.), Handbook of perception, Vol. 2. New York: Academic Press, in press.Google Scholar
Jardine, N., Sibson, R. Mathematical taxonomy, 1971, New York: Wiley.Google Scholar
Johnson, S. C. Hierarchical clustering schemes. Psychometrika, 1967, 32, 241254.CrossRefGoogle ScholarPubMed
Klahr, D. A Monte Carlo investigation of the statistical significance of Kruskal's nonmetric scaling procedure. Psychometrika, 1969, 34, 319330.CrossRefGoogle Scholar
Koopman, R. F., and Cooper, M. Some problems with Minkowski distance models in multidimensional scaling. Presented at the annual meeting of the Psychometric Society, Stanford University, March 28, 1974.Google Scholar
Kruskal, J. B.. Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis. Psychometrika, 1964, 29, 127 (a).CrossRefGoogle Scholar
Kruskal, J. B. Nonmetric multidimensional scaling: A numerical method. Psychometrika, 1964, 29, 2842 (b).CrossRefGoogle Scholar
Kruskal, J. B. Linear transformation of multivariate data to reveal clustering. In Shepard, R. N., Romney, A. K., and Nerlove, S. B. (Eds.), Multidimensional scaling: Theory and applications in the behavioral sciences. Vol I. New York: Seminar Press. 1972, 179191.Google Scholar
Kruskal, J. B. Multidimensional scaling and other methods for discovering structure. In Ralston, A. J., Wilf, H. S., and Enslein, K., (Eds.), Statistical methods for digital computers. New York: Wiley, in press.Google Scholar
Levelt, W. J. M., Van de Geer, J. P., and Plomp, R. Triadic comparisons of musical intervals. British Journal of Mathematical and Statistical Psychology, 1966, 19, 163179.CrossRefGoogle ScholarPubMed
Lévi-Strauss, C. The savage mind, 1967, Chicago: University of Chicago Press.Google Scholar
Liberman, A. M., Cooper, F. S., Shankweiler, D. P., and Studdert-Kennedy, M. Perception of the speech code. Psychological Review, 1967, 74, 431461.CrossRefGoogle ScholarPubMed
Lingoes, J. C. A general survey of the Guttman-Lingoes nonmetric program series. In Shepard, R. N., Romney, A. K., & Nerlove, S. B. (Eds.), Multidimensional scaling: Theory and applications in the behavioral sciences. Vol. I, 1972, New York: Seminar Press.Google Scholar
Luneburg, R. K. Mathematical analysis of binocular vision, 1947, Princeton, N. J.: Princeton University Press.Google Scholar
Luneburg, R. K. The metric of binocular visual space. Journal of Optical Society of America, 1950, 40, 637642.CrossRefGoogle Scholar
Miller, G., and Nicely, P. E.. An analysis of perceptual confusions among some English consonants. Journal of the Acoustical Society of America, 1955, 27, 338352.CrossRefGoogle Scholar
Osgood, C. E., Suci, G. J., and Tannenbaum, P. H.. The measurement of meaning, 1957, Urbana, Ill.: University of Illinois Press.Google Scholar
Peters, R. W. Dimensions of perception of consonants. Journal of the Acoustical Society of America, 1963, 35, 19851989.CrossRefGoogle Scholar
Quillian, M. R. Semantic memory. In Minsky, M. (Eds.), Semantic information processing, 1968, Cambridge, Mass.: M.I.T. Press.Google Scholar
Rips, L. J., Shoben, E. J., and Smith, E. E.. Semantic distance and the verification of semantic relations. Journal of Verbal Learning and Verbal Behavior, 1973, 12, 120.CrossRefGoogle Scholar
Romney, A. K., D'Andrade, R. G.. Cognitive aspects of English kinship terms. American Anthropologist, 1964, 66, 146170.CrossRefGoogle Scholar
Rumelhart, D. E., and Abrahamson, A. A.. A model for analogical reasoning. Cognitive Psychology, 1973, 5, 128.CrossRefGoogle Scholar
Rumelhart, D. E., Lindsay, P. H., and Norman, D. A. A process model for long-term memory. In Tulving, E., Donaldson, W. (Eds.), Organization of memory, 1972, New York: Academic Press.Google Scholar
Rund, H. The differential geometry of Finsler space, 1959, Berlin: Springer-Verlag.CrossRefGoogle Scholar
Shepard, R. N. Multidimensional scaling of concepts based upon sequences of restricted associative responses. American Psychologist, 1957, 12, 440441.Google Scholar
Shepard, R. N. Stimulus and response generalization: A stochastic model relating generalization to distance in psychological space. Psychometrika, 1957, 22, 325345.CrossRefGoogle Scholar
Shepard, R. N. Stimulus and response generalization: Deduction of the generalization gradient from a trace model. Psychological Review, 1958, 65, 242256.CrossRefGoogle ScholarPubMed
Shepard, R. N. Stimulus and response generalization: Tests of a model relating generalization to distance in psychological space. Journal of Experimental Psychology, 1958, 55, 509523.CrossRefGoogle Scholar
Shepard, R. N. Similarity of stimuli and metric properties of behavioral data. In Gulliksen, H., Messick, S. (Eds.), Psychological scaling: Theory and applications. New York: Wiley. 1960, 3343.Google Scholar
Shepard, R. N. The analysis of proximities: Multidimensional scaling with an unknown distance function. I. Psychometrika, 1962, 27, 125140.CrossRefGoogle Scholar
Shepard, R. N. The analysis of proximities: Multidimensional scaling with an unknown distance function. II. Psychometrika, 1962, 27, 219246.CrossRefGoogle Scholar
Shepard, R. N. Analysis of proximities as a technique for the study of information processing in man. Human Factors, 1963, 5, 3348.CrossRefGoogle Scholar
Shepard, R. N. Comments on Professor Underwood's paper “Stimulus selection in verbal learning.”. In Cofer, C. N. and Musgrave, B. S. (Eds.), Verbal behavior and learning: Problems and processes. New York: McGraw-Hill. 1963, 4870.Google Scholar
Shepard, R. N. Attention and the metric structure of the stimulus space. Journal of Mathematical Psychology, 1964, 1, 5487.CrossRefGoogle Scholar
Shepard, R. N. Polynomial fitting in the analysis of proximities. Proceedings of the XVIIth international congress of psychology. Amsterdam: North-Holland Publisher. 1964, 345346.Google Scholar
Shepard, R. N. Approximation to uniform gradients of generalization by monotone transformations of scale. In Mostofsky, D. I. (Eds.), Stimulus Generalization. Stanford, Calif.: Stanford University Press. 1965, 94110.Google Scholar
Shepard, R. N. Computer explorations of psychological space. Invited research address presented at the annual meeting of the American Psychological Association, New York, September 3, 1966. (a).Google Scholar
Shepard, R. N. Metric structures in ordinal data. Journal of Mathematical Psychology, 1966, 3, 287315.CrossRefGoogle Scholar
Shepard, R. N. Continuity versus the triangle inequality as a central principle for the spatial analysis of similarity data. Presented in a symposium on alternative models for the geometric representation of psychological data, at the mathematical psychology meetings, Stanford University, August 28, 1968.Google Scholar
Shepard, R. N. Introduction to Volume I. In Shepard, R. N., Romney, A. K. and Nerlove, S. B. (Eds.), Multidimensional scaling: Theory and applications in the behavioral sciences, Vol. I. New York: Seminar Press. 1972, 120.Google Scholar
Shepard, R. N. A taxonomy of some principal types of data and of multidimensional methods for their analysis. In Shepard, R. N., Romney, A. K. and Nerlove, S. B. (Eds.), Multidimensional scaling: Theory and applications in the behavioral sciences, Vol. I. New York: Seminar Press. 1972, 2147.Google Scholar
Shepard, R. N. Psychological representation of speech sounds. In David, E. E. and Denes, P. B. (Eds.), Human communication: A unified view. New York: McGraw-Hill. 1972, 67113.Google Scholar
Shepard, R. N., and Carroll, J. D. Parametric representation of nonlinear data structures. In Krishnaiah, P. R. (Eds.), Multivariate analysis: Proceedings of an international symposium. New York: Academic Press. 1966, 561592.Google Scholar
Shepard, R. N., and Cermak, G. W. Perceptual-cognitive explorations of a toroidal set of free-form stimuli. Cognitive Psychology, 1973, 4, 351377.CrossRefGoogle Scholar
Shepard, R. N., Hovland, C. I., and Jenkins, H. M. Learning and memorization of classifications. Psychological Monographs, 1961, 75, (13, whole no. 517).CrossRefGoogle Scholar
Shepard, R. N., Kilpatric, D. W., and Cunningham, J. P. The internal representation of numbers. Cognitive Psychology, in press.Google Scholar
Silberstein, L. Investigations of the intrinsic properties of the color domain. Journal of the Optical Society of America, 1938, 28, 6385.CrossRefGoogle Scholar
Silberstein, L. and MacAdam, D. L. The distribution of color matchings around a color center. Journal of the Optical Society of America, 1945, 35, 3239.CrossRefGoogle Scholar
Sokal, R. R., and Sneath, P. H. A. Principles of numerical taxonomy, 1963, San Francisco: W. H. Freeman.Google Scholar
Stenson, H. H., and Knoll, R. L. Goodness of fit for random rankings in Kruskal's nonmetric scaling procedure. Psychological Bulletin, 1969, 71, 122126.CrossRefGoogle Scholar
Thomas, H. Spatial models and multidimensional scaling of random shapes. American Journal of Psychology, 1968, 81, 551558.CrossRefGoogle ScholarPubMed
Torgerson, W. S. Multidimensional scaling: I. theory and method. Psychometrika, 1952, 17, 401419.CrossRefGoogle Scholar
Torgerson, W. S. Theory and methods of scaling, 1958, New York: Wiley.Google Scholar
Torgerson, W. S. Multidimensional scaling of similarity. Psychometrika, 1965, 30, 379393.CrossRefGoogle ScholarPubMed
Tucker, L. R. Relations between multidimensional scaling and three-mode factor analysis. Psychometrika, 1972, 37, 327.CrossRefGoogle Scholar
Wish, M., and Carroll, J. D. Applications of “INDSCAL” to studies of human perception and judgment. In Carterette, E. C. and Friedman, M. P. (Eds.), Handbook of perception, Vol. 2. New York: Academic Press, in press.Google Scholar
Young, F. W. Nonmetric multidimensional scaling: Recovery of metric information. Psychometrika, 1970, 35, 455473.CrossRefGoogle Scholar
Young, F. W., and Torgerson, W. S. TORSCA, A FORTRAN IV program for Shepard-Kruskal multidimensional scaling analysis. Behavioral Science, 1967, 12, 498498.Google Scholar