Loading [a11y]/accessibility-menu.js
MDou: Accelerating DouDiZhu Self-Play Learning Using Monte-Carlo Method With Minimum Split Pruning and a Single Q-Network | IEEE Journals & Magazine | IEEE Xplore

MDou: Accelerating DouDiZhu Self-Play Learning Using Monte-Carlo Method With Minimum Split Pruning and a Single Q-Network


Abstract:

Artificial intelligence (AI) has demonstrated outstanding performance in some perfect- and imperfect-information games, such as Go, Atari, and Texas Hold'em. Even though ...Show More

Abstract:

Artificial intelligence (AI) has demonstrated outstanding performance in some perfect- and imperfect-information games, such as Go, Atari, and Texas Hold'em. Even though AI is successful in these games with small action spaces, it does not play well in large-scale multiplayer, imperfect-information games like DouDiZhu. DouZero, a DouDizhu AI system, has recently been proposed and beaten all the existing DouDizhu AI programs. This article introduces minimum split pruning (MSP) and a single Q-network to accelerate the training of DouZero, called MDou. Our experiments show that MDou improved through self-play using limited computational ability (only a 4-core CPU and 1 GPU) and less learning time (30 days), while achieving comparable performance to DouZero.
Published in: IEEE Transactions on Games ( Volume: 16, Issue: 1, March 2024)
Page(s): 90 - 101
Date of Publication: 23 November 2022

ISSN Information:

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.