Combining Learner Model and Reinforcement Learning for Adaptive Sequencing of Learning Activities

Yessad, Amel

doi:10.1007/978-3-031-20617-7_13

Amel Yessad ORCID: orcid.org/0000-0001-7575-6433¹⁹

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 580))

Included in the following conference series:

International Conference in Methodologies and intelligent Systems for Techhnology Enhanced Learning

354 Accesses

Abstract

In this paper, we present an approach for adapting the sequencing of learning activities that relies on the Q-learning, a reinforcement learning algorithm. The Q-learning learns a sequencing policy to select learning activities that improves the knowledge states of students.

In this research, we rely on the student knowledge state inferred by the Bayesian Knowledge Tracing (BKT) at every testing activity to calculate the reward of the Q-Learning. The more the Q-Learning decision improves the student knowledge state the greater the reward received by the Q-Learning. In addition, we propose a 3-step method aiming to ensure that the use of the Q-Learning is education domain compliant. It consists on training the Q-Learning first on simulated students to answer the “cold start” problem of the Q-Learning.

We present empirical results showing that the sequencing policy resulting from the 3-step method provides the ITS with an efficient strategy to improve the students’ knowledge states.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 149.00; Price excludes VAT (USA)

Softcover Book: USD 199.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Aleven, V., et al.: Instruction based on adaptive learning technologies. Handbook of Research on Learning and Instruction, pp. 522–560 (2016)
Google Scholar
Bassen, J., et al.: Reinforcement learning for the adaptive scheduling of educational activities. In: Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems (2020)
Google Scholar
Corbett, A.T., Anderson, J.R.: Knowledge tracing: modeling the acquisition of procedural knowledge. User Model. User-Adap. Inter. 4(4), 253–278 (1994). https://doi.org/10.1007/BF01099821
Article Google Scholar
Doroudi, S., et al.: Sequence matters but how exactly? a method for evaluating activity sequences from data. In: Grantee Submission (2016)
Google Scholar
Doroudi, S., Aleven, V., Brunskill, E.: Where’s the reward? Int. J. Artif. Intell. Educ. 29(4), 568–620 (2019). https://doi.org/10.1007/s40593-019-00187-x
Article Google Scholar
Efremov, A., Ghosh, A., Singla, A.: Zero-shot learning of hint policy via reinforcement learning and program synthesis. In: International Educational Data Mining Society (2020)
Google Scholar
Mandel, T., et al.: Offline policy evaluation across representations with applications to educational games. In: AAMAS, vol. 1077 (2014)
Google Scholar
Watkins, C.J.C.H.: Learning from delayed rewards (1989)
Google Scholar

Download references

Author information

Authors and Affiliations

Sorbonne Université, CNRS, LIP6, 4 Place Jussieu, 75252, Paris Cedex 05, France
Amel Yessad

Authors

Amel Yessad
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Amel Yessad .

Editor information

Editors and Affiliations

Sapienza University of Rome, Rome, Roma, Italy
Marco Temperini
University of Salerno, Montoro - Avellino, Italy
Vittorio Scarano
Leibniz University of Hanover, Hanover, Germany
Ivana Marenzi
DFKI GmbH, Berlin, Germany
Milos Kravcik
Computer and Information Technology Department, University of Craiova, Craiova, Romania
Elvira Popescu
University of Bari Aldo Moro, Bari, Italy
Rosa Lanzilotti
Faculty of Computer Science, Free University of Bozen-Bolzano, Bolzano, Italy
Rosella Gennari
University of Salamanca, Salamanca, Spain
Fernando De La Prieta
DISIM, University of L’Aquila, L’Aquila, L’Aquila, Italy
Tania Di Mascio
University of L’Aquila, L’Aquila, Italy
Pierpaolo Vittorini

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yessad, A. (2023). Combining Learner Model and Reinforcement Learning for Adaptive Sequencing of Learning Activities. In: Temperini, M., et al. Methodologies and Intelligent Systems for Technology Enhanced Learning, 12th International Conference. MIS4TEL 2022. Lecture Notes in Networks and Systems, vol 580. Springer, Cham. https://doi.org/10.1007/978-3-031-20617-7_13

Download citation

DOI: https://doi.org/10.1007/978-3-031-20617-7_13
Published: 23 November 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20616-0
Online ISBN: 978-3-031-20617-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics

Combining Learner Model and Reinforcement Learning for Adaptive Sequencing of Learning Activities