Stability and Admissibility Analysis for Zero-Sum Games Under General Value Iteration Formulation | IEEE Journals & Magazine | IEEE Xplore

Stability and Admissibility Analysis for Zero-Sum Games Under General Value Iteration Formulation


Abstract:

In this article, the general value iteration (GVI) algorithm for discrete-time zero-sum games is investigated. The theoretical analysis focuses on stability properties of...Show More

Abstract:

In this article, the general value iteration (GVI) algorithm for discrete-time zero-sum games is investigated. The theoretical analysis focuses on stability properties of the systems and also the admissibility properties of the iterative policy pair. A new criterion is established to determine the admissibility of the current policy pair. Besides, based on the admissibility criterion, the improved GVI algorithm toward zero-sum games is developed to guarantee that all iterative policy pairs are admissible if the current policy pair satisfies the criterion. On the basis of the attraction domain, we demonstrate that the state trajectory will stay in the region using the fixed or the evolving policy pair if the initial state belongs to the domain. It is emphasized that the evolving policy pair can stabilize the controlled system. These theoretical results are applied to linear and nonlinear systems via offline and online critic control design.
Published in: IEEE Transactions on Neural Networks and Learning Systems ( Volume: 34, Issue: 11, November 2023)
Page(s): 8707 - 8718
Date of Publication: 03 March 2022

ISSN Information:

PubMed ID: 35239493

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.