skip to main content
research-article
Free Access

Heads-up limit hold'em poker is solved

Published:24 October 2017Publication History
Skip Abstract Section

Abstract

Poker is a family of games that exhibit imperfect information, where players do not have full knowledge of past events. While many perfect information games have been solved (e.g., Connect-Four and checkers), no nontrivial imperfect information game played competitively by humans has previously been solved. In this paper, we announce that the smallest variant of poker in-play, heads-up limit Texas hold'em, is now essentially weakly solved. Furthermore, this computation formally proves the common wisdom that the dealer in the game holds a significant advantage. This result was enabled by a new algorithm, CFR+, which is capable of solving extensive-form games three orders of magnitude larger than previously possible. This paper is an extended version of the original 2015 Science article, with additional results showing Cepheus' in-game performance against computer and human opponents.

References

  1. Poker: A big deal. The Economist. Economist Newspaper Ltd., London. December 22, 31--38, 2007.Google ScholarGoogle Scholar
  2. Allis, V.L. A Knowledge-Based Approach to Connect-Four. The Game is Solved: White Wins. Master's thesis, Vrije Universiteit, Amsterdam, The Netherlands, 1988.Google ScholarGoogle Scholar
  3. Allis, V.L. Searching for Solutions in Games and Artificial Intelligence. PhD thesis, Vrije Universiteit Amsterdam, The Netherlands, 1994.Google ScholarGoogle Scholar
  4. Babbage, C. Passages from the Life of a Philosopher. Longman, Green, Longman, Roberts, and Green, London, 1864. Chapter 34.Google ScholarGoogle Scholar
  5. Billings, D., Burch, N., Davidson, A., Holte, R.C., Schaeffer, J., Schauenberg, T., Szafron, D. Approximating game-theoretic optimal strategies for full-scale poker. IJCAI, (2003), 661--668. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Billings, D., Davidson, A., Schaeffer, J., Szafron, D. The challenge of poker. Artificial Intelligence 134, 1--2 (2002), 201--240. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Borel, E., Ville, J. Applications de la théorie des probabilités aux jeux de hasard. Gauthier-Villars, 1938.Google ScholarGoogle Scholar
  8. Bowling, M., Burch, N., Johanson, M., Tammelin, O. The cepheus website, 2015. http://poker.srv.ualberta.ca.Google ScholarGoogle Scholar
  9. Bowling, M., Burch, N., Johanson, M., Tammelin, O. Heads-up limit hold'em poker is solved. Science 347, 6218 (Jan. 2015), 145--149.Google ScholarGoogle ScholarCross RefCross Ref
  10. Bowling, M., Burch, N., Johanson, M., Tammelin, O. Heads-up limit hold'em poker is solved: Supplementary online material, January 2015.Google ScholarGoogle Scholar
  11. Bowling, M., Johanson, M., Burch, N., Szafron, D. Strategy evaluation in extensive games with importance sampling. ICML, (2008), 72--79. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Bronowski, J. Ascent of man. Documentary, 1973. Episode 13.Google ScholarGoogle Scholar
  13. Buro, M. Solving the oshi-zumo game. Adv. in Comp. Games 135, (2004) 361--366.Google ScholarGoogle ScholarCross RefCross Ref
  14. Campbell, M., Hoane, A.J. Jr., Hsu, F. Deep blue. Artificial Intelligence 134, (Jan. 2002), 57--83. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Chen, k., Bowling, M. Tractable objectives for robust policy optimization. Adv. Neural Inf. Process. Syst. (NIPS) 25, (2012), 2078--2086. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Craig, M. The Professor, the Banker, and the Suicide King: Inside the Richest Poker Game of All Time. Grand Central Publishing, New York, NY, 2006.Google ScholarGoogle Scholar
  17. Ferrucci, D. Introduction to "this is watson." IBM J. Res. Dev 56, 3.4 (May 2012) 1:1--1:15. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Gilpin, A., Hoda, S., Peña, J., Sandholm, T. Gradient-based algorithms for finding nash equilibria in extensive form games. WINE, (2007), 57--69. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Gilpin, A., Sandholm, T. Lossless abstraction of imperfect information games. J. ACM 54, 5 (2007). Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Jackson, E. Slumbot: An implementation of counterfactual regret minimization on commodity hardware. In Proceedings of the 2012 Computer Poker Symposium. (2012).Google ScholarGoogle Scholar
  21. Johanson, M., Bard, N. Burch, N., Bowling, M. Finding optimal abstract strategies in extensive form games. AAAI, (2012), 1371--1379. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Johanson, M., Bard, N., Lanctot, M., Gibson, R., Bowling, M. Efficient Nash equilibrium approximation through Monte Carlo counterfactual regret minimization. AAMAS (2012). Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Johanson, M., Waugh, K., Bowling, M., Zinkevich, M. Accelerating best response calculation in large extensive games. IJCAI (2011), 258--265. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Johanson, M.B. Robust Strategies and Counter-Strategies: From Superhuman to Optimal Play. PhD thesis, University of Alberta, Edmonton, Alberta, Canada, 2016.Google ScholarGoogle Scholar
  25. Karmarkar, N. A new polynomial-time algorithm for linear programming. In Proceedings of the Sixteenth Annual ACM Symposium on Theory of Computing (1984), ACM, New York, NY, 302--311. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Koller, D., Megiddo, N. The complexity of two-person zero-sum games in extensive form. Games Econ. Behav 4, 4 (1992), 528--552.Google ScholarGoogle ScholarCross RefCross Ref
  27. Koller, D., Megiddo, N., von Stengel, B. Efficient computation of equilibria for extensive two-person games. Games Econ. Behav 14, 2 (1996).Google ScholarGoogle ScholarCross RefCross Ref
  28. Koller, D., Pfeffer, A. Representations and solutions for game-theoretic problems. Artificial Intelligence 94, (1997), 167--215. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Kuhn, H. Simplified two-person poker. In Contributions to the Theory of Games, volume 1 of Annals of mathematics studies. H. Kuhn and A. Tucker, eds. Princeton University Press, Princeton, New Jersey, 1950, 97--103.Google ScholarGoogle Scholar
  30. Mirowski, P. What were von neumann and morgenstern trying to accomplish? In Toward a History of Game Theory. Weintraub, ed. Duke University Press, 1992, 113--147. Mirowski cites Turing as author of the paragraph containing this remark. The paragraph appeared in {46}, in a chapter with Turing listed as one of three contributors. Which parts of the chapter are the work of which contributor, particularly the introductory material containing this quote, is not made explicit.Google ScholarGoogle Scholar
  31. Morgenstern, O. The cold war is cold poker. N. Y. Times Mag. (Feb. 5 1961) pages 21--22.Google ScholarGoogle Scholar
  32. Nash, J.F., Shapley, L.S. A simple 3-person poker game. In Contributions to the Theory of Games I. Princeton University Press, Princeton, New Jersey, 1950, 105--116.Google ScholarGoogle Scholar
  33. Nesterov, Y. Excessive gap technique in nonsmooth convex minimization. SIAM Journal on Optimization 16, 1 (2005), 233--249. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Rehmeyer, J., Fox, N., Rico, R. Ante up, human: The adventures of polaris the poker-playing robot. Wired 16, 12 (Dec. 2008), 186--191.Google ScholarGoogle Scholar
  35. Romanovskii, I.V. Reduction of a game with complete memory to a matrix game. Soviet Mathematics 3, (1962), 678--681.Google ScholarGoogle Scholar
  36. Rubin, J., Watson, I. Computer poker: a review. Artificial Intelligence 175,(2011), 958--987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Sandholm, T. The state of solving large incomplete-information games, and application to poker. AI Mag. 31, 4 (2010), 13--32.Google ScholarGoogle Scholar
  38. Schaeffer, J., Lake, R., Lu, P., Bryant, M. Chinook the world man-machine checkers champion. AI Mag. 17, 1 (1996), 21--29.Google ScholarGoogle Scholar
  39. Schaeffer, J., Neil Burch, Y.B., Kishimoto, A., Müller, M., Lake, R., Lu, P. Sutphen, S. Checkers is solved. Science 317, 5844 (2007), 1518--1522.Google ScholarGoogle ScholarCross RefCross Ref
  40. Shannon, C.E. Programming a computer for playing chess. Philosophical Magazine, Series 7, 41, 314 (Mar. 1950), 256--275.Google ScholarGoogle Scholar
  41. Shi, J., Littman, M.L. Abstraction methods for game theoretic poker. In Comp. Games, (2000), 333--345. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. Southey, F., Bowling, M., Larson, B., Piccione, C., Burch, N., Billings, D., Rayner, D.C. Bayes' bluff: Opponent modelling in poker. UAI, (2005) 550--558. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. Tambe, M. Security and Game Theory: Algorithms, Deployed Systems, Lessons Learned. Cambridge University Press, Cambridge, England, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. Tammelin, O. Cfr+. CoRR, abs/1407.5042, 2014.Google ScholarGoogle Scholar
  45. Tammelin, O., Burch, N., Johanson, M., Bowling, M. Solving heads-up limit texas hold'em. In Proceedings of the 24th International Joint Conference on Artificial Intelligence, 2015, 645--652. Google ScholarGoogle ScholarDigital LibraryDigital Library
  46. Turing, A. Digital computers applied to games. In Faster Than Thought. B.V. Bowden, ed. Chapter 25. Pitman, 1976.Google ScholarGoogle Scholar
  47. von Neumann, J. Zur theorie der gesellschaftsspiele. Mathematische Annalen 100, 1 (1928), 295--320.Google ScholarGoogle Scholar
  48. von Neumann, J., Morgenstern, O. Theory of Games and Economic Behavior. Princeton University Press, Princeton, Second Edition, 1947.Google ScholarGoogle Scholar
  49. Waugh, K., Bagnell, J.A. A unified view of large-scale zero-sum equilibrium computation. In AAAI Workshop on Computer Poker and Imperfect Information, 2015.Google ScholarGoogle Scholar
  50. Waugh, K., Bard, N., Bowling, M. Strategy grafting in extensive games. In Advances in Neural Information Processing Systems 22 (NIPS-09), 2009. http://webdocs.cs.ualberta.ca/~games/poker/publications/NIPS09-graft.pdf. Google ScholarGoogle ScholarDigital LibraryDigital Library
  51. Zermelo, E. Über eine Anwendung der Mengenlehre auf die Theorie des Schachspiels. In Proceedings of the Fifth International Congress Mathematics. Cambridge University Press, Cambridge, 1913, 501--504.Google ScholarGoogle Scholar
  52. Zinkevich, M., Johanson, M., Bowling, M., Piccione, C. Regret minimization in games with incomplete information. NIPS (2008), 905--912. Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. Zinkevich, M., Littman, M. The AAAI computer poker competition. J. Inter. Comp. Games Association 29, (2006), News item.Google ScholarGoogle Scholar

Index Terms

  1. Heads-up limit hold'em poker is solved

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image Communications of the ACM
          Communications of the ACM  Volume 60, Issue 11
          November 2017
          95 pages
          ISSN:0001-0782
          EISSN:1557-7317
          DOI:10.1145/3154816
          Issue’s Table of Contents

          Copyright © 2017 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 24 October 2017

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        HTML Format

        View this article in HTML Format .

        View HTML Format