Abstract
Adaptive learning is an important part of Intelligent Tutoring System (ITS). Given that students have different learning targets and knowledge concepts proficiency, a smart intelligent tutor should be able to provide personalized learning materials to them, and help students master target knowledge and skills with learning materials as less as possible. Reinforcement Learning (RL) algorithms are good at solving sequence decision problems, so they are widely used in learning material recommendation. However, the existing intelligent tutoring systems based on reinforcement learning usually consider only one learning target. Moreover, the agent needs to learn in the case of sparse rewards, resulting in inefficient learning. To this end, we propose a curriculum-oriented multi-goal reinforcement learning method, which combines an off-policy RL algorithm with Hindsight Experience Replay (HER) to enable the agent to learn from past failed experiences to alleviate the problem of sparse rewards. Besides, our method is applicable to the case of multi-goal learning, and the agent learns specific strategy for each goal. Additionally, according to different learning stages of the agent, we set different learning pseudo goals adaptively for it to accelerate learning speed.
Supported by the National Natural Science Foundation of China (U1811261).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alkhatlan, A., Kalita, J.: Intelligent tutoring systems: a comprehensive historical survey with recent developments. arXiv preprint arXiv:1812.09628 (2018)
Zhang, S., Chang, H.-H.: From smart testing to smart learning: how testing technology can assist the new generation of education. Int. J. Smart Technol. Learn. 1(1), 67–92 (2016)
Chen, Y., et al.: Recommendation system for adaptive learning. Appl. Psychol. Measur. 42(1), 24–41 (2018)
Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (2018)
Han, R., Chen, K., Tan, C.: Curiosity-driven recommendation strategy for adaptive learning via deep reinforcement learning. Br. J. Math. Stat. Psychol. 73(3), 522–540 (2020)
Zou, L., et al.: Reinforcement learning to optimize long-term user engagement in recommender systems. In: Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (2019)
Portelas, R., et al.: Automatic curriculum learning for deep RL: a short survey. arXiv preprint arXiv:2003.04664 (2020)
Florensa, C., et al.: Automatic goal generation for reinforcement learning agents. In: International Conference on Machine Learning (2018)
Colas, C., et al.: CURIOUS: intrinsically motivated modular multi-goal reinforcement learning. In: International Conference on Machine Learning (2019)
Narvekar, S., et al.: Curriculum learning for reinforcement learning domains: a framework and survey. arXiv preprint arXiv:2003.04960 (2020)
von Davier, M.: The DINA model as a constrained general diagnostic model: two variants of a model equivalency. Br. J. Math. Stat. Psychol. 67(1), 49–71 (2014)
Embretson, S.E., Reise, S.P.: Item Response Theory. Psychology Press (2013)
Wang, F., et al.: Neural cognitive diagnosis for intelligent education systems. arXiv preprint arXiv:1908.08733 (2019)
Schaul, T., et al.: Universal value function approximators. In: International Conference on Machine Learning (2015)
Mnih, V., et al.: Human-level control through deep reinforcement learning. Nature 518(7540), 529–533 (2015)
Florensa, C., et al.: Reverse curriculum generation for reinforcement learning. arXiv preprint arXiv:1707.05300 (2017)
Acknowledgement
This research was supported by the Joint Funds of the National Natural Science Foundation of China under Grant No. U1811261, the Project of Liaoning Provincial Public Opinion and Network Security Big Data System Engineering Laboratory.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Ma, J., Li, X., Zhang, X., Liu, T., Du, Y., Li, T. (2021). Curriculum-Oriented Multi-goal Agent for Adaptive Learning. In: Chen, Q., Li, J. (eds) Web and Big Data. APWeb-WAIM 2020 International Workshops. APWeb-WAIM 2020. Communications in Computer and Information Science, vol 1373. Springer, Singapore. https://doi.org/10.1007/978-981-16-0479-9_9
Download citation
DOI: https://doi.org/10.1007/978-981-16-0479-9_9
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-0478-2
Online ISBN: 978-981-16-0479-9
eBook Packages: Computer ScienceComputer Science (R0)