Adaptive Decision-Making Strategies in the Game with Environment

Kravets, Petro; Vysotska, Victoria; Lytvyn, Vasyl; Chyrun, Lyubomyr

doi:10.1007/978-3-031-16203-9_17

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 149))

Included in the following conference series:

International Scientific Conference “Intellectual Systems of Decision Making and Problem of Computational Intelligence”

481 Accesses

Abstract

The problems that occurred during decision-making and supported by the stochastic game model with the environment are investigated and analysed. A method of adaptive choice of solutions based on stimulatory learning and application of Boltzmann distribution is developed. The algorithm and software of the decision-making system in the games with the environment are developed. The computer simulation results of stochastic choice of solutions are analysed and described. The developed decision-making method provides stochastic minimisation of the agent’s mean loss function and is based on adaptive parametric identification of the environment using Q-learning and defining a mixed strategy on the Boltzmann distribution. With proper parameter selection, the method provides close to 1 order of asymptotic convergence, the limit value for a given class of recurrent methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Babichev, S., Lytvynenko, V., Osypenko, V.: Implementation of the objective clustering inductive technology based on DBSCAN clustering algorithm. In: Proceedings of the 12th International Scientific and Technical Conference on Computer Sciences and Information Technologies, CSIT 2017, vol. 1, pp. 479–484 (2017). https://doi.org/10.1109/STC-CSIT.2017.8098832
Babichev, S.A., Kornelyuk, A.I., Lytvynenko, V.I., Osypenko, V.V.: Computational analysis of microarray gene expression profiles of lung cancer. Biopol. Cell 32(1), 70–79 (2016). https://doi.org/10.7124/bc.00090F
Article Google Scholar
Bowles, J., Silvina, A.: Model checking cancer automata. In: IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), pp. 376–379 (2016). https://doi.org/10.1109/BHI.2016.7455913
Flieger, S.: Implementing the patient-centered medical home in complex adaptive systems: becoming a relationship-centered patient-centered medical home. Health Care Manage. Rev. 42(2), 112–121 (2017). https://doi.org/10.1097/HMR.0000000000000100
Article Google Scholar
Fricke, G., Letendre, K., Moses, M., Cannon, J.: Persistence and adaptation in immunity: T cells balance the extent and thoroughness of search. PLoS Comput. Biol. 12(3) (2016). https://doi.org/10.1371/journal.pcbi.1004818
Huang, K., Zheng, X., Cheng, Y., Yang, Y.: Behavior-based cellular automaton model for pedestrian dynamics. Appl. Math. Comput. 292, 417–424 (2017). https://doi.org/10.1016/j.amc.2016.07.002
Article MATH Google Scholar
Kravets, P., Lytvyn, V., Dobrotvor, I., Sachenko, O., Vysotska, V., Sachenko, A.: Matrix stochastic game with q-learning for multi-agent systems. Lecture Notes on Data Engineering and Communications Technologies 83, 304–314 (2021). https://doi.org/10.1007/978-3-030-80472-5_26
Marasanov, V., Stepanchikov, D., Sharko, A., Sharko, O.: Technology for determining the residual life of metal structures under conditions of combined loading according to acoustic emission measurements. Commun. Comput. Inf. Sci. 1158 (2020). https://doi.org/10.1007/978-3-030-61656-4_13
Mohamed, W., Hamza, A.: Medical image registration using stochastic optimisation. Opt. Lasers Eng. 48(12), 1213–1223 (2010). https://doi.org/10.1016/j.optlaseng.2010.06.011
Article Google Scholar
Narendra, K.S., Thathachar, M.A.L.: Learning automata - a survey. IEEE Trans. Syst. Man Cybern. 4, 323–334 (1974)
Google Scholar
Ning, C., You, F.: Data-driven adaptive nested robust optimisation: general modeling framework and efficient computational algorithm for decision making under uncertainty. AIChE J. 63(9), 3790–3817 (2017). https://doi.org/10.1002/aic.15717
Article Google Scholar
Ozcift, A., Gulten, A.: Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms. Comput. Meth. Program. Biomed. 104(3), 443–451 (2011). https://doi.org/10.1016/j.cmpb.2011.03.018
Article Google Scholar
Robbins, H.: Some aspects of the sequential design of experiments. Bull. Am. Math. Soc. 58(5), 527–535 (1952). https://doi.org/10.1016/j.ins.2020.06.069
Article MathSciNet MATH Google Scholar
Stewart, A., Bosch, N., D’mello, S.: Generalizability of face-based mind wandering detection across task contexts. In: The 10th International Conference on Educational Data Mining Society, pp. 88–95 (2017)
Google Scholar
Sutton, R., Barto, A.: Reinforcement learning: an introduction (2017). http://incompleteideas.net/book/bookdraft2017nov5.pdf
Tsetlin, M.L.: Automaton Theory and Modeling of Biological Systems. Academic Press Inc., New York (1973)
Google Scholar
Wooldridge, M.: An Introduction to Multi-Agent Systems. John Wiley and Sons, Hoboken, New Jersey, U.S. (2009)
Google Scholar
Yurtkuran, A., Emel, E.: An adaptive artificial bee colony algorithm for global optimisation. Appl. Math. Comput. 271, 1004–1023 (2015). https://doi.org/10.1016/j.amc.2015.09.064
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Lviv Polytechnic National University, Lviv, Ukraine
Petro Kravets, Victoria Vysotska & Vasyl Lytvyn
Osnabrück University, Osnabrück, Germany
Victoria Vysotska
Ivan Franko National University of Lviv, Lviv, Ukraine
Lyubomyr Chyrun

Authors

Petro Kravets
View author publications
You can also search for this author in PubMed Google Scholar
Victoria Vysotska
View author publications
You can also search for this author in PubMed Google Scholar
Vasyl Lytvyn
View author publications
You can also search for this author in PubMed Google Scholar
Lyubomyr Chyrun
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Victoria Vysotska .

Editor information

Editors and Affiliations

Jan Evangelista Purkyně University in Ústi nad Labem, Ústi nad Labem, Czech Republic
Sergii Babichev
Kherson National Technical University, Kherson, Ukraine
Volodymyr Lytvynenko

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kravets, P., Vysotska, V., Lytvyn, V., Chyrun, L. (2023). Adaptive Decision-Making Strategies in the Game with Environment. In: Babichev, S., Lytvynenko, V. (eds) Lecture Notes in Data Engineering, Computational Intelligence, and Decision Making. ISDMCI 2022. Lecture Notes on Data Engineering and Communications Technologies, vol 149. Springer, Cham. https://doi.org/10.1007/978-3-031-16203-9_17

Download citation

DOI: https://doi.org/10.1007/978-3-031-16203-9_17
Published: 14 September 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-16202-2
Online ISBN: 978-3-031-16203-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Adaptive Decision-Making Strategies in the Game with Environment