Hostname: page-component-5f745c7db-q8b2h Total loading time: 0 Render date: 2025-01-06T07:41:41.047Z Has data issue: true hasContentIssue false

A Probability Model for Errors of Classification. I. General Considerations

Published online by Cambridge University Press:  01 January 2025

J. P. Sutcliffe*
Affiliation:
University of Sydney

Abstract

This paper seeks to meet the need for a general treatment of the problem of error in classification. Within an m-attribute classificatory system, an object's typical subclass is that subclass to which it is most often allocated under repeated experimentally independent applications of the classificatory criteria. In these terms, an error of classification is an atypical subclass allocation. This leads to definition of probabilities O of occasional subclass membership, probabilities T of typical subclass membership, and probabilities E of error or, more generally, occasional subclass membership conditional upon typical subclass membership. In the relationship f: (O, T, E) the relative incidence of independent O, T, and E values is such that generally one can specify O values given T and E, but one cannot generally specify T and E values given O. Under the restrictions of homogeneity of E values for all members of a given typical subclass, mutual stochastic independence of errors of classification, and suitable conditions of replication, one can find particular systems O = f (T, E) which are solvable for T and E given O. A minimum of three replications of occasional classification is necessary for a solution of systems for marginal attributes, and a minimum of two replications is needed with any cross-classification. Although for such systems one can always specify T and E values given O values, the solution is unique for dichotomous systems only.

Type
Original Paper
Copyright
Copyright © 1965 Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

*

With grateful acknowledgement to the Rockefeller Foundation; and to the United States Department of Health, Education, and Welfare, Public Health Service, for N. I. M. H. Grant M-3950.

References

Airy, G. B. On the algebraical and numerical theory of errors of observation and the combination of observations, Cambridge and London: McMillan, 1861.Google Scholar
Boring, E. G. The logic of the normal law of error in mental measurement. Amer. J. Psychol., 1920, 31, 133.CrossRefGoogle Scholar
Feller, W. An introduction to probability theory and its applications (2nd. ed.), New York: Wiley, 1957.Google Scholar
Gulliksen, H. Theory of mental tests, New York: Wiley, 1950.CrossRefGoogle Scholar
Guttman, L. The test-retest reliability of qualitative data. Psychometrika, 1946, 11, 8195.CrossRefGoogle ScholarPubMed
Lazarsfeld, P. F. Latent structure analysis. In Koch, S. (Eds.), Psychology: a study of science (vol. 3), New York: McGraw-Hill, 1959.Google Scholar
Scheffé, H. The analysis of variance, New York: Wiley, 1959.Google Scholar
Solomon, H. Studies in item analysis and prediction, Stanford: Stanford Univ. Press, 1961.Google Scholar
Spearman, C. Correlation calculated from faulty data. Brit. J. Psychol., 1910, 3, 271295.Google Scholar
Yule, G. U. and Kendall, M. G. An introduction to the theory of statistics (11th ed.), London: Griffin, 1937.Google Scholar