Abstract:
Artificial intelligence (AI) has demonstrated outstanding performance in some perfect- and imperfect-information games, such as Go, Atari, and Texas Hold'em. Even though ...Show MoreMetadata
Abstract:
Artificial intelligence (AI) has demonstrated outstanding performance in some perfect- and imperfect-information games, such as Go, Atari, and Texas Hold'em. Even though AI is successful in these games with small action spaces, it does not play well in large-scale multiplayer, imperfect-information games like DouDiZhu. DouZero, a DouDizhu AI system, has recently been proposed and beaten all the existing DouDizhu AI programs. This article introduces minimum split pruning (MSP) and a single Q-network to accelerate the training of DouZero, called MDou. Our experiments show that MDou improved through self-play using limited computational ability (only a 4-core CPU and 1 GPU) and less learning time (30 days), while achieving comparable performance to DouZero.
Published in: IEEE Transactions on Games ( Volume: 16, Issue: 1, March 2024)