FCM-Type Co-clustering Transfer Reinforcement Learning for Non-Markov Processes

Notsu, Akira; Ueno, Takanori; Hattori, Yuichi; Ubukata, Seiki; Honda, Katsuhiro

doi:10.1007/978-3-319-25135-6_21

Akira Notsu¹⁶,
Takanori Ueno¹⁶,
Yuichi Hattori¹⁶,
Seiki Ubukata¹⁶ &
…
Katsuhiro Honda¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 9376))

Included in the following conference series:

International Symposium on Integrated Uncertainty in Knowledge Modelling and Decision Making

1059 Accesses

Abstract

In applying reinforcement learning to continuous space problems, discretization or redefinition of the learning space can be a promising approach. Several methods and algorithms have been introduced to learning agents to respond to this problem. In our previous study, we introduced an FCCM clustering technique into Q-learning (called QL-FCCM) and its transfer learning in the Markov process. Since we could not respond to complicated environments like a non-Markov process, in this study, we propose a method in which an agent updates his Q-table by changing the trade-off ratio, Q-learning and QL-FCCM, based on the damping ratio. We conducted numerical experiments of the single pendulum standing problem and our model resulted in a smooth learning process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Genetic Algorithm-Optimized Fuzzy Lyapunov Reinforcement Learning for Nonlinear Systems

Article 19 September 2019

Multi-Agent Reward-Iteration Fuzzy Q-Learning

Article 13 April 2021

Smooth Q-Learning: An Algorithm for Independent Learners in Stochastic Cooperative Markov Games

Article 18 July 2023

References

Sutton, R.S., Bart, A.G.: Generalization in Reinforcement Learning-An Introduction. The MIT Press (1998)
Google Scholar
Notsu, A., Honda, H., Ichihashi, H., Wada, H.: Contraction algorithm in state and action space for Q-learning. In: 10th International Symposium on Advanced Intelligent Systems, pp. 93–96 (2009)
Google Scholar
Komori, Y., Notsu, A., Honda, K., Ichihashi, H.: Determination of the change timing of space segmentation using PCA for reinforcement learning. In: The 6th International Conference on Soft Computing and Intelligent Systems The 13th International Symposium on Advanced Intelligent Systems, pp. 2287–2290 (2012)
Google Scholar
Kosko, B.: Neural Networks and Fuzzy Systems: A Dynamical Systems Approach to Machine Intelligence. Prentice Hall, Englewood Cliffs (1992)
MATH Google Scholar
Hammell, R.J., Sudkamp, T.: Learning Fuzzy Rules from Data. http://ftp.rta.nato.int/public/pubfulltext/rto/mp/rto-mp-003/mp-003-08.pdf
Komori, Y., Notsu, A., Honda, K., Ichihashi, H.: Automatic Adaptive Space Segmentation for Reinforcement Learning. International Journal of Fuzzy Logic and Intelligent Systems 12(1), 36–41 (2012)
Article Google Scholar
Notsu, A., Honda, K., Ichihashi, H., Komori, Y.: Simple reinforcement learning for small-memory agent. In: 10th International Conference on Machine Learning and Applications, vol. 1, pp. 458–461 (2011)
Google Scholar
Ueno, T., Notsu, A., Honda, K.: Application of FCM-type co-clustering to an agent in reinforcement learning. In: 1st IIAI International Conference on Advanced Information Technologies, vol. 12, pp. 1–5 (2013)
Google Scholar
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press (1981)
Google Scholar
Oh, C.H., Honda, K., Ichihashi, H.: Fuzzy clustering for categorical multivariate data. In: Joint 9th IFSA World Congress and 20th NAFIPS International Conference, pp. 2154–2159 (2001)
Google Scholar
Tsuda, K., Minoh, M., Ikeda, K.: Extracting straight lines by sequential fuzzy clustering. Pattern Recognition Letters. 17, 643–649 (1996)
Article Google Scholar
Matsumoto, Y., Honda, K., Notsu, A., Ichihashi, H.: Exclusive Partition in FCM-type Co-clustering and Its Application to Collaborative Filtering. International Journal of Computer Science and Network Security 12(12), 52–58 (2012)
Google Scholar
Honda, K., Notsu, A., Ichihashi, H.: Collaborative Filtering by Sequential User-Item Co-cluster Extraction from Rectangular Relational Data. International Journal of Knowledge Engineering and Soft Data Paradigms(IJKESDP) 2(4), 312–327 (2010)
Article Google Scholar
Hathaway, R.J., Davenport, J.W., Bezdek, J.C.: Relational duals of the $c$-means clustering algorithms. Pattern Recognition 22(2), 205–212 (1989)
Article MathSciNet MATH Google Scholar
Watkins, C., Dayan, P.: Technical note: Q-learning. Machine Learning 3(8), 279–292 (1992)
MATH Google Scholar
Rummery, G.A., Niranjan, M: On-line Q-learning using connectionist systems, Technical Report CUED/F-INFENG/TR 166. Engineering Department, Cambridge University (1994)
Google Scholar
Jaakkola, T., Shingh, S.P., Jordan, M.: I: Reinforcement Learning Algorithm for Partially Observable Markov Decision Process. Advances in Neural Information Processing System 7, 345–352 (1994)
Google Scholar
Miyamoto, S., Ichihashi, H., Honda, K.: Algorithms for fuzzy clustering. Springer (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

Graduate School of Engineering, Osaka Prefecture University, Gakuen 1-1, Naka, Sakai, Osaka, 599-8531, Japan
Akira Notsu, Takanori Ueno, Yuichi Hattori, Seiki Ubukata & Katsuhiro Honda

Authors

Akira Notsu
View author publications
You can also search for this author in PubMed Google Scholar
Takanori Ueno
View author publications
You can also search for this author in PubMed Google Scholar
Yuichi Hattori
View author publications
You can also search for this author in PubMed Google Scholar
Seiki Ubukata
View author publications
You can also search for this author in PubMed Google Scholar
Katsuhiro Honda
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Akira Notsu .

Editor information

Editors and Affiliations

Science and Technology, Japan Advanced Institute of, Nomi, Japan
Van-Nam Huynh
Science, Dept of Systems Innovation, Osaka University,Graduate School of, Osaka, Japan
Masahiro Inuiguchi
Université de Technologie de Compiègne, Compiègne, France
Thierry Demoeux

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Notsu, A., Ueno, T., Hattori, Y., Ubukata, S., Honda, K. (2015). FCM-Type Co-clustering Transfer Reinforcement Learning for Non-Markov Processes. In: Huynh, VN., Inuiguchi, M., Demoeux, T. (eds) Integrated Uncertainty in Knowledge Modelling and Decision Making. IUKM 2015. Lecture Notes in Computer Science(), vol 9376. Springer, Cham. https://doi.org/10.1007/978-3-319-25135-6_21

Download citation

DOI: https://doi.org/10.1007/978-3-319-25135-6_21
Published: 01 January 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25134-9
Online ISBN: 978-3-319-25135-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

FCM-Type Co-clustering Transfer Reinforcement Learning for Non-Markov Processes

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Genetic Algorithm-Optimized Fuzzy Lyapunov Reinforcement Learning for Nonlinear Systems

Multi-Agent Reward-Iteration Fuzzy Q-Learning

Smooth Q-Learning: An Algorithm for Independent Learners in Stochastic Cooperative Markov Games

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

FCM-Type Co-clustering Transfer Reinforcement Learning for Non-Markov Processes

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Genetic Algorithm-Optimized Fuzzy Lyapunov Reinforcement Learning for Nonlinear Systems

Multi-Agent Reward-Iteration Fuzzy Q-Learning

Smooth Q-Learning: An Algorithm for Independent Learners in Stochastic Cooperative Markov Games

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation