Curriculum-Oriented Multi-goal Agent for Adaptive Learning

Ma, Jieyue; Li, Xiaoli; Zhang, Xin; Liu, Tingting; Du, Yuefeng; Li, Tie

doi:10.1007/978-981-16-0479-9_9

Jieyue Ma⁷,
Xiaoli Li⁷,
Xin Zhang⁷,
Tingting Liu⁷,
Yuefeng Du⁷ &
…
Tie Li⁸

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 1373))

Included in the following conference series:

Asia-Pacific Web (APWeb) and Web-Age Information Management (WAIM) Joint International Conference on Web and Big Data

363 Accesses

Abstract

Adaptive learning is an important part of Intelligent Tutoring System (ITS). Given that students have different learning targets and knowledge concepts proficiency, a smart intelligent tutor should be able to provide personalized learning materials to them, and help students master target knowledge and skills with learning materials as less as possible. Reinforcement Learning (RL) algorithms are good at solving sequence decision problems, so they are widely used in learning material recommendation. However, the existing intelligent tutoring systems based on reinforcement learning usually consider only one learning target. Moreover, the agent needs to learn in the case of sparse rewards, resulting in inefficient learning. To this end, we propose a curriculum-oriented multi-goal reinforcement learning method, which combines an off-policy RL algorithm with Hindsight Experience Replay (HER) to enable the agent to learn from past failed experiences to alleviate the problem of sparse rewards. Besides, our method is applicable to the case of multi-goal learning, and the agent learns specific strategy for each goal. Additionally, according to different learning stages of the agent, we set different learning pseudo goals adaptively for it to accelerate learning speed.

Supported by the National Natural Science Foundation of China (U1811261).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Alkhatlan, A., Kalita, J.: Intelligent tutoring systems: a comprehensive historical survey with recent developments. arXiv preprint arXiv:1812.09628 (2018)
Zhang, S., Chang, H.-H.: From smart testing to smart learning: how testing technology can assist the new generation of education. Int. J. Smart Technol. Learn. 1(1), 67–92 (2016)
Article Google Scholar
Chen, Y., et al.: Recommendation system for adaptive learning. Appl. Psychol. Measur. 42(1), 24–41 (2018)
Article Google Scholar
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (2018)
MATH Google Scholar
Han, R., Chen, K., Tan, C.: Curiosity-driven recommendation strategy for adaptive learning via deep reinforcement learning. Br. J. Math. Stat. Psychol. 73(3), 522–540 (2020)
Article Google Scholar
Zou, L., et al.: Reinforcement learning to optimize long-term user engagement in recommender systems. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (2019)
Google Scholar
Portelas, R., et al.: Automatic curriculum learning for deep RL: a short survey. arXiv preprint arXiv:2003.04664 (2020)
Florensa, C., et al.: Automatic goal generation for reinforcement learning agents. In: International Conference on Machine Learning (2018)
Google Scholar
Colas, C., et al.: CURIOUS: intrinsically motivated modular multi-goal reinforcement learning. In: International Conference on Machine Learning (2019)
Google Scholar
Narvekar, S., et al.: Curriculum learning for reinforcement learning domains: a framework and survey. arXiv preprint arXiv:2003.04960 (2020)
von Davier, M.: The DINA model as a constrained general diagnostic model: two variants of a model equivalency. Br. J. Math. Stat. Psychol. 67(1), 49–71 (2014)
Article MathSciNet Google Scholar
Embretson, S.E., Reise, S.P.: Item Response Theory. Psychology Press (2013)
Google Scholar
Wang, F., et al.: Neural cognitive diagnosis for intelligent education systems. arXiv preprint arXiv:1908.08733 (2019)
Schaul, T., et al.: Universal value function approximators. In: International Conference on Machine Learning (2015)
Google Scholar
Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)
Article Google Scholar
Florensa, C., et al.: Reverse curriculum generation for reinforcement learning. arXiv preprint arXiv:1707.05300 (2017)

Download references

Acknowledgement

This research was supported by the Joint Funds of the National Natural Science Foundation of China under Grant No. U1811261, the Project of Liaoning Provincial Public Opinion and Network Security Big Data System Engineering Laboratory.

Author information

Authors and Affiliations

School of Information, Liaoning University, Shenyang, 110036, China
Jieyue Ma, Xiaoli Li, Xin Zhang, Tingting Liu & Yuefeng Du
Shenyang AeroTech Co. Ltd., Shenyang, China
Tie Li

Authors

Jieyue Ma
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoli Li
View author publications
You can also search for this author in PubMed Google Scholar
Xin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Tingting Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yuefeng Du
View author publications
You can also search for this author in PubMed Google Scholar
Tie Li
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Northwestern Polytechnical University, Xi’an, China
Qun Chen
Deakin University, Geelong, VIC, Australia
Jianxin Li

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ma, J., Li, X., Zhang, X., Liu, T., Du, Y., Li, T. (2021). Curriculum-Oriented Multi-goal Agent for Adaptive Learning. In: Chen, Q., Li, J. (eds) Web and Big Data. APWeb-WAIM 2020 International Workshops. APWeb-WAIM 2020. Communications in Computer and Information Science, vol 1373. Springer, Singapore. https://doi.org/10.1007/978-981-16-0479-9_9

Download citation

DOI: https://doi.org/10.1007/978-981-16-0479-9_9
Published: 01 April 2021
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-0478-2
Online ISBN: 978-981-16-0479-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics