Journals & Magazines >IEEE Transactions on Games >Volume: 16 Issue: 1

MDou: Accelerating DouDiZhu Self-Play Learning Using Monte-Carlo Method With Minimum Split Pruning and a Single Q-Network

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

Artificial intelligence (AI) has demonstrated outstanding performance in some perfect- and imperfect-information games, such as Go, Atari, and Texas Hold'em. Even though ...Show More

Metadata

Abstract:

Artificial intelligence (AI) has demonstrated outstanding performance in some perfect- and imperfect-information games, such as Go, Atari, and Texas Hold'em. Even though AI is successful in these games with small action spaces, it does not play well in large-scale multiplayer, imperfect-information games like DouDiZhu. DouZero, a DouDizhu AI system, has recently been proposed and beaten all the existing DouDizhu AI programs. This article introduces minimum split pruning (MSP) and a single Q-network to accelerate the training of DouZero, called MDou. Our experiments show that MDou improved through self-play using limited computational ability (only a 4-core CPU and 1 GPU) and less learning time (30 days), while achieving comparable performance to DouZero.

Published in: IEEE Transactions on Games ( Volume: 16, Issue: 1, March 2024)

Page(s): 90 - 101

Date of Publication: 23 November 2022

ISSN Information:

DOI: 10.1109/TG.2022.3223926

Funding Agency:

Contents

References is not available for this document.

MDou: Accelerating DouDiZhu Self-Play Learning Using Monte-Carlo Method With Minimum Split Pruning and a Single Q-Network

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

MDou: Accelerating DouDiZhu Self-Play Learning Using Monte-Carlo Method With Minimum Split Pruning and a Single Q-Network

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?