Abstract
Fuzzy theory is a discipline that has recently appeared in the mathematical literature. It generalizes classic situations. Therefore, its success continues to increase and to keep going up from time to time. In this work, we consider the model of Markov decision processes where the information on the costs includes imprecision. The fuzzy cost is represented by the fuzzy number set and the infinite horizon discounted cost is minimized from any stationary policy. This paper presents in the first part the notion of fuzzy sets and some axiomatic basis and relevant concepts with fuzzy theory in short. In second part, we propose a new definition of total discounted fuzzy cost in infinite planning horizon. We will compute an optimal stationary policy that minimizes the total fuzzy discounted cost by a new approach based on some standard algorithms of the dynamic programming using the ranking function concept. The last adapted criterion has many applications in several areas such that the forest management, the management of energy consumption, the finance, the communication system (mobile networks).
Similar content being viewed by others
References
Archibald, T. W., & Possani, E. (2019). Investment and operational decisions for start-up companies: A game theory and Markov decision process approach. Annals of Operations Research,. https://doi.org/10.1007/s10479-019-03426-5.
Bahri, O., & Talbi, E. G. (2020). Robustness-based approach for fuzzy multi-objective problems. Annals of Operations Research,. https://doi.org/10.1007/s10479-020-03567-y.
Balbus, L., Jaśkiewicz, A., & Nowak, A. S. (2018). Markov perfect equilibria in a dynamic decision model with quasi-hyperbolic discounting. Annals of Operations Research, 287, 573–591.
Bellman, R. E., & Zadeh, L. A. (1970). Decision making in a fuzzy environment. Management Science, 17, 141–164.
Bertsekas, D. P., & Shreve, S. E. (1978). Stochastic optimal control. New York: Academic Press.
Bhulai, S., Blok, H., & Spieksma, F. M. (2019). K competing queues with customer abandonment: optimality of a generalised \(\text{ c }\mu \)-rule by the Smoothed Rate Truncation method. Annals of Operations Research,. https://doi.org/10.1007/10479-019-03131-3.
Canbolat, P. G., & Rothblum, U. G. (2013). (Approximate) iterated successive approximations algorithm for sequential decision processes. Annals of Operations Research, 208, 309–320.
Derman, C. (1970). Finte state Markovian decision processes. New York: Academic Press.
Diamond, P., & Kloeden, P. (1994). Metric spaces of fuzzy sets, theory and applications. Singapore: World Scientific.
Dubois, D., & Prade, H. (1983). Ranking of fuzzy numbers in the setting of possibility theory. Information Sciences, 30(3), 183–224.
Dubois, D., & Prade, H. (2000). Fundamentals of fuzzy sets. Boston: Kluwer Academic Publishers.
Howard, R. A. (1960). Dynamic programming and Markov processes. Cambridge: MIT Press.
Buckley, J. J. (2005). Fuzzy probabilities. Berlin: Springer.
Klir, G., & Yuan, B. (1995). Fuzzy sets and fuzzy logic, theory and applications. Upper Saddle River: Prentice Hall.
Kurano, M., Nakagami, J., & Yoshida, Y. (2003). Markov decision processes with fuzzy rewards. Journal of Nonlinear Analysis and Convex Analysis, 4(1), 105–115.
Mahdavi-Amiri, N., & Nasseri, S. H. (2006). Duality in fuzzy number linear programming by use of a certain linear ranking function. Applied Mathematics and Computation, 180, 206–216.
Mohammed, A. (2019). Towards a sustainable assessment of suppliers: An integrated fuzzy TOPSIS-possibilistic multi-objective approach. Annals of Operations Research,. https://doi.org/10.1007/s10479-019-03167-5.
Piunovskiy, A. B. (2013). Examples in Markov decision processes (Vol. 2). London: World Scientific.
Powell, W. B. (2012). Perspectives of approximate dynamic programming. Annals of Operations Research, 13(2), 1–38.
Puri, M. L., & Ralesca, D. A. (1986). Fuzzy random variable. Journal of Mathematical Analysis and Applications, 114, 402–422.
Puterman, M. L. (2014). Markov decision processes: Discrete stochastic dynamic programming. New York: Wiley.
Roy, J., Pamuc̆ar, D., & Kar, S. (2019). Evaluation and selection of third party logistics provider under sustainability perspectives: An interval valued fuzzy-rough approach. Annals of Operations Research,. https://doi.org/10.1007/s10479-019-03501-x.
Yager, R. R. (1981). A procedure for ordering fuzzy subsets of the unit interval. Information Sciences, 24, 143–161.
Zadeh, L. A. (1965). Fuzzy sets. Information and Control, 8, 338–353.
Acknowledgements
The authors would like to thank the Editor and the anonymous referees for their constructive comments, careful reading of the manuscript, valuable suggestions and of a number of helpful remarks which significantly improved the presentation of this paper. We also wish to express our sincere thanks to the following people. Firstly, Professor Dr. S. Melliani of Sultan Moulay Slimane University, Beni Mellal, Morocco for his help in fuzzy theory and encouragement during the period of research. Secondly, Mr. Lekbir Tansaoui, ELT teacher, co-author and textbook designer in Mokhtar Essoussi High School, Oued Zem, Morocco for proofreading this paper.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Semmouri, A., Jourhmane, M. & Belhallaj, Z. Discounted Markov decision processes with fuzzy costs. Ann Oper Res 295, 769–786 (2020). https://doi.org/10.1007/s10479-020-03783-6
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-020-03783-6