Abstract
This paper presents the first performance results for Ballard’s *-Minimax algorithms applied to a real–world domain: backgammon. It is shown that with effective move ordering and probing the Star2 algorithm considerably outperforms Expectimax. Star2 allows strong backgammon programs to conduct depth-5 full-width searches (up from 3) under tournament conditions on regular hardware without using risky forward-pruning techniques. We also present empirical evidence that with today’s sophisticated evaluation functions good checker play in backgammon does not require deep searches.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
GNU Backgammon (backgammon software), http://www.gnubg.org/
Jellyfish (backgammon software), http://jelly.effect.no/
Snowie (backgammon software), http://www.bgsnowie.com/
Ballard, B.W.: The *-Minimax Search Procedure for Trees Containing Chance Nodes. Artificial Intelligence 21(3), 327–350 (1983)
Berliner, H.: Backgammon Computer Program Beats World Champion. Artificial Intelligence 14, 205–220 (1980)
Hauk, T.: Search in Trees with Chance Nodes. M.Sc. Thesis, Computing Science Department, University of Alberta (2004)
Hauk, T., Buro, M., Schaeffer, J.: Rediscovering *-Minimax. In: Cai, Y., Abascal, J. (eds.) Ambient Intelligence in Everyday Life. LNCS (LNAI), vol. 3864, pp. 35–50. Springer, Heidelberg (2006)
Howard, M.: The Duel (August 5, 2003) [Online], http://www.rec.games.backgammon.org
Michie, D.: Game-playing and game-learning automata. In: Fox, L. (ed.) Advances in Programming and Non-Numerical Computation, pp. 183–200. Pergamon, New York (1966)
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
Tesauro, G.: Neurogammon: A neural-network backgammon learning program. In: Levy, D.N., Beal, D.F. (eds.) Heuristic Programming in Artificial Intelligence, pp. 78–80. Ellis Horwood (1989)
Tesauro, G.: Temporal Difference Learning and TD-Gammon. Communications of the ACM 38(3) (March 1995)
Tesauro, G.: Programming Backgammon Using Self-Teaching Neural Nets. Artificial Intelligence 134(1-2), 181–199 (2002)
Tesauro, G., Galperin, G.R.: On-line policy improvement using Monte Carlo search. In: Proc. of NIPS, pp. 1068–1074 (1996)
Thyssen, J.: Personal Communication (2003)
Zobrist, A.L.: A New Hashing Method with Applications for Game Playing. ICCA Journal 13(2), 69–73 (1990)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hauk, T., Buro, M., Schaeffer, J. (2006). *-Minimax Performance in Backgammon. In: van den Herik, H.J., Björnsson, Y., Netanyahu, N.S. (eds) Computers and Games. CG 2004. Lecture Notes in Computer Science, vol 3846. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11674399_4
Download citation
DOI: https://doi.org/10.1007/11674399_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32488-1
Online ISBN: 978-3-540-32489-8
eBook Packages: Computer ScienceComputer Science (R0)