Hostname: page-component-745bb68f8f-v2bm5 Total loading time: 0 Render date: 2025-01-07T19:32:46.909Z Has data issue: false hasContentIssue false

Theory of Learning with Constant, Variable, or Contingent Probabilities of Reinforcement

Published online by Cambridge University Press:  01 January 2025

W. K. Estes*
Affiliation:
Indiana University

Abstract

The methods used in recent probabilistic learning models to generate mean curves of learning under random reinforcement are extended to the general case in which probability of reinforcement may vary in any specified manner as a function of trials and to cases in which probability of reinforcement on a given trial is contingent upon responses or outcomes of preceding trials.

Type
Original Paper
Copyright
Copyright © 1957 Psychometric Society

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Footnotes

*

This paper was prepared while the writer was in residence at the Center for Advanced Study in the Behavioral Sciences, Stanford, California. The research on which it is based was supported by a faculty research grant from the Social Science Research Council.

References

Burke, C. J. and Estes, W. K. A component model for stimulus variables in discrimination learning. Psychometrika, 1957, 22, 133145CrossRefGoogle Scholar
Bush, R. R. and Mosteller, F. Stochastic models for learning, New York: Wiley, 1955CrossRefGoogle Scholar
Detambel, M. H. A test of a model for multiple-choice behavior. J. exp. Psychol., 1955, 49, 97104CrossRefGoogle Scholar
Estes, W. K. Toward a statistical theory of learning. Psychol. Rev., 1950, 57, 94107CrossRefGoogle Scholar
Estes, W. K. Individual behavior in uncertain situations. In Thrall, R. M., Coombs, C. H., Davis, R. L. (Eds.), Decision processes (pp. 127137). New York: Wiley, 1954Google Scholar
Estes, W. K. and Burke, C. J. A theory of stimulus variability in learning. Psychol. Rev., 1953, 60, 276286CrossRefGoogle ScholarPubMed
Estes, W. K. and Burke, C. J. Application of a statistical model to simple discrimination learning in human subjects. J. exp. Psychol., 1955, 50, 8188CrossRefGoogle ScholarPubMed
Estes, W. K. and Straughan, J. H. Analysis of a verbal conditioning situation in terms of statistical learning theory. J. exp. Psychol., 1954, 47, 225234CrossRefGoogle Scholar
Grant, D. A., Hake, H. W., and Hornseth, J. P. Acquisition and extinction of a verbal conditioned response with differing percentages of reinforcement. J. exp. Psychol., 1951, 42, 15CrossRefGoogle ScholarPubMed
Hake, H. W. and Hyman, R. Perception of the statistical structure of a random series of binary symbols. J. exp. Psychol., 1953, 45, 6474CrossRefGoogle ScholarPubMed
Humphreys, L. G. Acquisition and extinction of verbal expectations in a situation analogous to conditioning. J. exp. Psychol., 1939, 25, 294301CrossRefGoogle Scholar
Jordan, C. Calculus of finite differences, New York: Chelsea, 1950Google Scholar
Neimark, E. D. Effects of type of non-reinforcement and number of alternative responses in two verbal conditioning situations. J. exp. Psychol., 1956, 52, 209220CrossRefGoogle Scholar
Richardson, C. H. An introduction to the calculus of finite differences, New York: Van Nostrand, 1954Google Scholar