Hostname: page-component-78c5997874-s2hrs Total loading time: 0 Render date: 2024-11-11T00:36:15.657Z Has data issue: false hasContentIssue false

A finite exact algorithm to solve a dice game

Published online by Cambridge University Press:  24 March 2016

Ernesto Mordecki
Affiliation:
Centro de Matemática, Facultad de Ciencias, Universidad de la República, Iguá 4225, CP 11400 Montevideo, Uruguay.

Abstract

We provide an algorithm to find the value and an optimal strategy of the Ten Thousand dice game solitaire variant in the framework of Markov control processes. Once an optimal critical threshold is found, the set of nonstopping states of the game becomes finite and the solution is found by a backwards algorithm that gives the values for each one of these states of the game. The algorithm is finite and exact. The strategy to find the critical threshold comes from the continuous pasting condition used in optimal stopping problems for continuous-time processes with jumps.

Type
Research Papers
Copyright
Copyright © Applied Probability Trust 2016 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

[1]Blackwell, D. (1967). Positive dynamic programming. In Proc. Fifth Berkeley Symp. Math. Statist. Prob. (Berkeley, CA, 1965/66), Vol. I, Statistics, University of California Press, pp. 415418. Google Scholar
[2]Filar, J. and Vrieze, K. (1997). Competitive Markov Decision Processes. Springer, New York. Google Scholar
[3]Haigh, J. and Roters, M. (2000). Optimal strategy in a dice game. J. Appl. Prob. 37, 11101116. Google Scholar
[4]Hald, A. (1990). A History of Probability and Statistics and Their Applications Before 1750. John Wiley, New York. Google Scholar
[5]Hernández-Lerma, O., Carrasco, G. and Pérez-Hernández, R. (1999). Markov control processes with the expected total cost criterion: optimality, stability, and transient models. Acta Appl. Math. 59, 229269. CrossRefGoogle Scholar
[6]Mordecki, E. (1999). Optimal stopping for a diffusion with jumps. Finance Stoch. 3, 227236. Google Scholar
[7]Neller, T. W. and Presser, C. G. M. (2004). Optimal play of the dice game pig. The UMAP J. 25, 2547. Google Scholar
[8]Pliska, S. R. (1978). On the transient case for Markov decision chains with general state spaces. In Dynamic Programming and Its Applications (Proc. Conf., University of British Columbia, Vancouver, 1977), Academic Press, New York, pp. 335349. Google Scholar
[9]Puterman, M. L. (1994). Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley, New York. Google Scholar
[10]Roters, M. (1998). Optimal stopping in a dice game. J. Appl. Prob. 35, 229235. Google Scholar
[11]Tijms, H. (2007). Dice games and stochastic dynamic programming. Morfismos 11, 114. Google Scholar
[12]Tijms, H. and van der Wal, J. (2006). A real-world stochastic two-person game. Prob. Eng. Inf. Sci. 20, 599608. CrossRefGoogle Scholar