
MAN: Memory-augmented Attentive Networks for Deep Learning-based Knowledge Tracing

Published: 18 August 2023

Abstract

Knowledge Tracing (KT) is the task of modeling a learner’s knowledge state from past performance in order to predict future performance in e-learning systems. Deep learning-based knowledge tracing (DLKT) methods, such as recurrent neural networks, memory-augmented neural networks, and attention-based neural networks, have recently been applied to KT and have demonstrated excellent performance in capturing the latent dependencies of a learner’s knowledge state on recent exercises. However, these methods struggle with the so-called Skill Switching Phenomenon (SSP): as learners respond to exercises in an e-learning system, the latent skills underlying those exercises typically switch irregularly. SSP degrades the ability of deep learning-based approaches to model the learner’s knowledge state during skill switching, particularly when the association between the switched-to skills and previously learned skills is weak. To address this problem, we propose the Memory-augmented Attentive Network (MAN), which combines the advantages of memory-augmented neural networks and attention-based neural networks. Specifically, MAN uses memory-augmented neural networks to model learners’ longer-term knowledge and attention-based neural networks to model their recent knowledge. In addition, we design a context-aware attention mechanism that automatically weighs the tradeoff between these two types of knowledge. With extensive experiments on several e-learning datasets, we show that MAN effectively improves the predictive accuracy of existing state-of-the-art DLKT methods.
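The abstract describes fusing a long-term knowledge representation (from a memory network) with a recent knowledge representation (from an attention network) via a context-aware gate. The paper’s exact formulation is not given on this page, so the sketch below is only a generic, hypothetical illustration of such gated blending; all names (`h_mem`, `h_att`, `ctx`, `W_g`) and the scalar sigmoid gate are assumptions, not the authors’ method.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8  # hidden size of each knowledge representation (hypothetical)

h_mem = rng.standard_normal(d)  # long-term knowledge state, e.g., a memory-network read
h_att = rng.standard_normal(d)  # recent knowledge state, e.g., self-attention over recent exercises
ctx = rng.standard_normal(d)    # context embedding of the current exercise

# Gate parameters; randomly initialized here, learned in a real model.
W_g = rng.standard_normal((1, 3 * d))
b_g = 0.0

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Context-aware gate: a scalar in (0, 1) weighing long-term vs. recent knowledge.
g = sigmoid((W_g @ np.concatenate([h_mem, h_att, ctx]) + b_g).item())

# Blended knowledge state that a prediction head would consume.
h = g * h_mem + (1.0 - g) * h_att
```

When the gate learns that the current exercise’s skill is weakly related to recently practiced skills (the SSP case), a formulation like this can shift weight toward the long-term memory component instead of the recent attention component.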


Cited By

  • Transfer Learning-Driven Cattle Instance Segmentation Using Deep Learning Models. Agriculture 14, 12 (2024), 2282. DOI:10.3390/agriculture14122282
  • ELAKT: Enhancing Locality for Attentive Knowledge Tracing. ACM Transactions on Information Systems 42, 4 (2024), 1–27. DOI:10.1145/3652601
  • Customized Adversarial Training Enhances the Performance of Knowledge Tracing Tasks. In Proceedings of the IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA). 808–815. DOI:10.1109/ISPA63168.2024.00108
  • Deep Knowledge Tracking Model Integrating Multiple Feature Personalization Factors. In Proceedings of the IEEE Cyber Science and Technology Congress (CyberSciTech). 394–399. DOI:10.1109/CyberSciTech64112.2024.00068
  • Knowledge Ontology Enhanced Model for Explainable Knowledge Tracing. Journal of King Saud University - Computer and Information Sciences 36, 5 (2024). DOI:10.1016/j.jksuci.2024.102065


Published In

ACM Transactions on Information Systems  Volume 42, Issue 1
January 2024
924 pages
EISSN:1558-2868
DOI:10.1145/3613513

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 18 August 2023
Accepted: 12 March 2023
Revised: 26 January 2023
Received: 30 August 2022
Published in TOIS Volume 42, Issue 1

Author Tags

  1. E-learning
  2. knowledge tracing
  3. deep learning
  4. multi-head attention mechanism
  5. memory-augmented neural network

Qualifiers

  • Research-article

Funding Sources

  • National Key Research and Development Project of China
  • National Natural Science Foundation of China

