Hostname: page-component-745bb68f8f-s22k5 Total loading time: 0 Render date: 2025-01-07T19:15:39.036Z Has data issue: false hasContentIssue false

Prediction in Future Samples Studied in Terms of the Gain from Selection

Published online by Cambridge University Press:  01 January 2025

Alan L. Gross*
Affiliation:
City University of New York

Abstract

The gain from selection (GS) is defined as the standardized average performance of a group of subjects selected in a future sample using a regression equation derived on an earlier sample. Expressions for the expected value, density, and distribution function (DF) of GS are derived and studied in terms of sample size, number of predictors, and the prior distribution assigned to the population multiple correlation. The DF of GS is further used to determine how large sample sizes must be so that with probability .90 (.95), the expected GS will be within 90 percent of its maximum possible value. An approximately unbiased estimator of the expected GS is also derived.

Type
Original Paper
Copyright
Copyright © 1973 The Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Browne, M. W. Precision of prediction, 1969, Princeton, New Jersey: Educational Testing ServiceGoogle Scholar
Burkett, G. R. A Study of reduced rank models for multiple prediction. Psychometric Monographs, 1964, No. 12.Google Scholar
Cochran, W. G. Improvement by means of selection. Proceedings of the Second Berkely Symposium on Mathematical Statistics and Probability. 1951, 449470.CrossRefGoogle Scholar
Cronbach, L. J., Gleser, G. C. Psychological tests and personnel decisions, 1957, Urbana: University of Illinois PressGoogle Scholar
Fisher, R. A. The influence of rainfall on the yield of wheat at Rothamstead. Philosophical Transactions of the Royal Society of London, 1924, 213, 89142Google Scholar
Harter, H. L. Expected values of normal order statistics. Biometrika, 1961, 48, 151165CrossRefGoogle Scholar
Herzberg, P. A. The parameters of cross validation. Psychometric Monographs, 1969, No. 16.Google Scholar
Kendall, M. G., and Stuart, A. The advanced theory of statistics, Volumes I and II, 1967, London: GriffenGoogle Scholar
Olkin, I., and Pratt, J. W. Unbiased estimation of certain correlation coefficients. Annals of Mathematical Statistics, 1958, 29, 201211CrossRefGoogle Scholar
Park, C. N. A cross validation approach to the sample size problem in regression models. Unpublished doctoral dissertation, Purdue University, 1970.Google Scholar
Rao, C. R. Linear statistical inference and its applications, 1965, New York: WileyGoogle Scholar
Sarhan, A. E., and Greenberg, B. G. Estimation of location and scale parameters by order statistics. Annals of Mathematical Statistics, 1956, 27, 427451CrossRefGoogle Scholar
Waterson, G. A. Linear estimation in censored samples from multivariate normal populations. Annals of Mathematical Statistics, 1959, 30, 814824CrossRefGoogle Scholar