Estimation and control in Markov chains

P. Mandl

doi:10.2307/1426206

Estimation and control in Markov chains

Published online by Cambridge University Press: 01 July 2016

P. Mandl

Show author details

P. Mandl*: Affiliation:
Institute of Information Theory and Automation, Czechoslovak Academy of Sciences

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

We consider a finite controlled Markov chain, the description of which depends on an unknown parameter a, and investigate the following control policy. To each a an optimal stationary control is associated. a is estimated recurrently from the trajectory by the minimum contrast method, and the optimal stationary control corresponding to the estimate is used. We present asymptotic properties of the estimate and of the criterion function. They follow from the law of large numbers and from the central limit theorem for controlled Markov chains derived with the aid of martingales.

Keywords

CONTROLLED MARKOV CHAINS ASYMPTOTIC BEHAVIOUR UNKNOWN PARAMETERS MINIMUM CONTRAST ESTIMATES CONTROLS BASED ON ESTIMATES

Information

Type: Research Article
Information: Advances in Applied Probability , Volume 6 , Issue 1 , March 1974 , pp. 40 - 60

DOI: https://doi.org/10.2307/1426206 [Opens in a new window]
Copyright: Copyright © Applied Probability Trust 1974

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable

References

Bellman, R. (1957) A Markovian decision process. J. Math. and Mech. 6, 679–684.Google Scholar

Billingsley, P. (1961) Statistical Inference for Markov Processes. University of Chicago Press, Chicago.Google Scholar

Billingsley, P. (1961) The Lindeberg-Lévy theorem for martingales. Proc. Amer. Math. Soc. 12, 788–792.Google Scholar

Brown, B. M. (1971) Martingale central limit theorems. Ann. Math. Statist. 42, 59–66.Google Scholar

Brown, B. M. and Eagleson, G. K. (1971) Martingale convergence to infinitely divisible laws with finite variances. Trans. Amer. Math. Soc. 162, 449–453.Google Scholar

Gänssler, P. (1972) Note on minimum contrast estimates for Markov processes. Metrika 19, 115–130.Google Scholar

Howard, R. A. (1960) Dynamic Programming and Markov Processes. Technology Press and John Wiley, New York.Google Scholar

Loève, M. (1960) Probability Theory. D. van Nostrand, Princeton, N. J.Google Scholar

Mandl, P. (1971a) On the variance in controlled Markov chains. Kybernetika (Prague) 7, 1–12.Google Scholar

Mandl, P. (1971b) On the control of a Markov chain in the presence of unknown parameters. Trans. Sixth Prague Conf. on Inf. Theory, Random Proc., Statist. Decision Functions, 601–612. Academia, Prague.Google Scholar

Mandl, P. (1972) An application of Itô's formula to stochastic control systems. Stability of Stoch. Dynamical Systems, Proc. Int. Symposium, 8–13. Springer-Verlag, Heidelberg.Google Scholar

Mandl, P. (1973a) On the adaptive control of finite state Markov processes. Z. Wahrscheinlichkeitsth. 27, 263–276.Google Scholar

Mandl, P. (1973b) A connection between controlled Markov chains and martingales. Kybernetika (Prague) 9, 237–241.Google Scholar

Pfanzagl, J. (1969) On the measurability and consistency of minimum contrast estimates. Metrika 14, 249–272.Google Scholar

Article contents

Estimation and control in Markov chains

Abstract

Keywords

Information

Access options

Article purchase

Temporarily unavailable

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests