An online value iteration method for linear-quadratic mean field social control with unknown dynamics

Wang, Bing-Chang; Li, Shumei; Cao, Ying

doi:10.1007/s11432-023-3962-0

Letter
Published: 27 March 2024

Volume 67, article number 140203 (2024)

Science China Information Sciences

Bing-Chang Wang¹,
Shumei Li¹ &
Ying Cao¹

Acknowledgements

This work was supported by the National Natural Science Foundation of China (Grant No. 62122043).

Author information

Authors and Affiliations

School of Control Science and Engineering, Shandong University, Jinan, 250000, China
Bing-Chang Wang, Shumei Li & Ying Cao

Corresponding author

Correspondence to Shumei Li.

Additional information

Supporting information Appendixes A–E. The supporting information is available online at info.scichina.com and link.springer.com. The supporting materials are published as submitted, without typesetting or editing. The responsibility for scientific accuracy and content remains entirely with the authors.
