Abstract
Educational Cognitive Diagnosis (CD) aims to provide students’ mastery levels on different concepts. One common observation is that students often conduct many exercises but engage with a small subset of concepts, leading to a sparsity barrier. Current CD models mostly adopt mastery levels on all concepts as student modeling, overlooking the sparsity barrier. If a student does not interact with all concepts, we can not ensure that each dimension of mastery levels on concepts can be well-trained. In this paper, we propose a novel Enhancing Student Representations in Cognitive Diagnosis (ESR-CD), which combines application abilities and comprehension degrees for mastery levels on concepts. To model application ability, we propose a sparsity-based mask module that solely depends on the dense student-concept entries. Simultaneously, to further enhance comprehension degrees, we propose two layers: a matrix factorization layer and a relation refinement layer. Extensive experiments on two real-world datasets demonstrate the effectiveness of ESR-CD.
Similar content being viewed by others
References
Chen X, Wu L, Liu F, Chen L, Zhang K, Hong R, Wang M. Disentangling cognitive diagnosis with limited exercise labels. In: Proceedings of the 37th International Conference on Neural Information Processing Systems. 2023, 792
Liu Y, Zhang T, Wang X, Yu G, Li T. New development of cognitive diagnosis models. Frontiers of Computer Science, 2023, 17(1): 171604
Shen J, Qian H, Zhang W, Zhou A. Symbolic cognitive diagnosis via hybrid optimization for intelligent education systems. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 14928–14936
Gao W, Wang H, Liu Q, Wang F, Lin X, Yue L, Zhang Z, Lv R, Wang S. Leveraging transferable knowledge concept graph embedding for cold-start cognitive diagnosis. In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2023, 983–992
Liu H, Zhang T, Li F, Yu M, Yu G. A probabilistic generative model for tracking multi-knowledge concept mastery probability. Frontiers of Computer Science, 2024, 18(3): 183602
Dai H, Zhang Y, Yun Y, An R, Zhang W, Shang X. Adaptive metaknowledge dictionary learning for incremental knowledge tracing. Engineering Applications of Artificial Intelligence, 2024, 132: 107969
Wang F, Liu Q, Chen E, Huang Z, Chen Y, Yin Y, Huang Z, Wang S. Neural cognitive diagnosis for intelligent education systems. In: Proceedings of the 34th AAAI Conference on Artificial Intelligence. 2020, 6153–6161
Liu S, Qian H, Li M, Zhou A. QCCDM: a Q-augmented causal cognitive diagnosis model for student learning. In: Proceedings of the 26th European Conference on Artificial Intelligence. 2023, 1536–1543
Wang F, Gao W, Liu Q, Li J, Zhao G, Zhang Z, Huang Z, Zhu M, Wang S, Tong W, Chen E. A survey of models for cognitive diagnosis: new developments and future directions. 2024, arXiv preprint arXiv: 2407.05458
Liu Q. Towards a new generation of cognitive diagnosis. In: Proceedings of the 30th International Joint Conference on Artificial Intelligence. 2021, 4961–4964
Gao W, Liu Q, Huang Z, Yin Y, Bi H, Wang M C, Ma J, Wang S, Su Y. RCD: relation map driven cognitive diagnosis for intelligent education systems. In: Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2021, 501–510
Li J, Wang F, Liu Q, Zhu M, Huang W, Huang Z, Chen E, Su Y, Wang S. HierCDF: a Bayesian network-based hierarchical cognitive diagnosis framework. In: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2022, 904–913
Wang F, Liu Q, Chen E, Huang Z, Yin Y, Wang S, Su Y. NeuralCD: a general framework for cognitive diagnosis. IEEE Transactions on Knowledge and Data Engineering, 2023, 35(8): 8312–8327
Burke K. The Mindful School: How To Assess Thoughtful Outcomes: K-College. Palatine: IRI Skylight Publishing, 1993
Embretson S E, Reise S P. Item Response Theory for Psychologists. New York: Psychology Press, 2013
Ma H, Wang C, Zhu H, Yang S, Zhang X, Zhang X. Enhancing cognitive diagnosis using un-interacted exercises: a collaboration-aware mixed sampling approach. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 8877–8885
Gao W, Liu Q, Wang H, Yue L, Bi H, Gu Y, Yao F, Zhang Z, Li X, He Y. Zero-1-to-3: domain-level zero-shot cognitive diagnosis via one batch of early-bird students towards three diagnostic objectives. In: Proceedings of the 38th AAAI Conference on Artificial Intelligence. 2024, 8417–8426
Li M, Qian H, Lv J, He M, Zhang W, Zhou A. Foundation model enhanced derivative-free cognitive diagnosis. Frontiers of Computer Science, 2025, 19(1): 191318
Zhang Z, Liu Q, Jiang H, Wang F, Zhuang Y, Wu L, Gao W, Chen E. FairLISA: fair user modeling with limited sensitive attributes information. In: Proceedings of the 37th Conference on Neural Information Processing Systems. 2023, 1796
Zhang Y, Dai H, Yun Y, Liu S, Lan A, Shang X. Meta-knowledge dictionary learning on 1-bit response data for student knowledge diagnosis. Knowledge-Based Systems, 2020, 205: 106290
De La Torre J. Dina model and parameter estimation: a didactic. Journal of educational and behavioral statistics, 2009, 34(1): 115–130
Wang S, Zeng Z, Yang X, Zhang X. Self-supervised graph learning for long-tailed cognitive diagnosis. In: Proceedings of the 37th AAAI Conference on Artificial Intelligence. 2023, 110–118
Qian H, Liu S, Li M, Li B, Liu Z, Zhou A. ORCDF: an oversmoothing-resistant cognitive diagnosis framework for student learning in online education systems. In: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024, 2455–2466
Wu L, Chen X, Liu F, Xie J, Xia C, Tan Z, Tian M, Li J, Zhang K, Lian D, Hong R, Wang M. EduStudio: towards a unified library for student cognitive modeling. Frontiers of Computer Science, 2025, 19(8): 198342
Liu S, Shen J, Qian H, Zhou A. Inductive cognitive diagnosis for fast student learning in web-based intelligent education systems. In: Proceedings of the ACM Web Conference 2024. 2024, 4260–4271
Zhang Y, Qin C, Shen D, Ma H, Zhang L, Zhang X, Zhu H. ReliCD: a reliable cognitive diagnosis framework with confidence awareness. In: Proceedings of 2023 IEEE International Conference on Data Mining (ICDM). 2023, 858–867
Li J, Liu Q, Wang F, Liu J, Huang Z, Yao F, Zhu L, Su Y. Towards the identifiability and explainability for personalized learner modeling: an inductive paradigm. In: Proceedings of the ACM Web Conference 2024. 2024, 3420–3431
Zhang Z, Wu L, Liu Q, Liu J, Huang Z, Yin Y, Zhuang Y, Gao W, Chen E. Understanding and improving fairness in cognitive diagnosis. Science China Information Sciences, 2024, 67(5): 152106
Zhang D, Zhang K, Wu L, Tian M, Hong R, Wang M. Path-Specific causal reasoning for fairness-aware cognitive diagnosis. In: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024, 4143–4154
Zhang Z, Liu Q, Hu Z, Zhan Y, Huang Z, Gao W, Mao Q. Enhancing fairness in meta-learned user modeling via adaptive sampling. In: Proceedings of the ACM Web Conference 2024. 2024, 3241–3252
Chen L, Wu L, Zhang K, Hong R, Lian D, Zhang Z, Zhou J, Wang M. Improving recommendation fairness via data augmentation. In: Proceedings of the ACM Web Conference 2023. 2023, 1012–1020
Jiang Y, Ma H, Zhang X, Li Z, Chang L. Incorporating metapath interaction on heterogeneous information network for social recommendation. Frontiers of Computer Science, 2024, 18(1): 181302
Gao C, Wang S, Li S, Chen J, He X, Lei W, Li B, Zhang Y, Jiang P. CIRS: bursting filter bubbles by counterfactual interactive recommender system. ACM Transactions on Information Systems, 2023, 42(1): 14
Cai M, Hou M, Chen L, Wu L, Bai H, Li Y, Wang M. Mitigating recommendation biases via group-alignment and global-uniformity in representation learning. ACM Transactions on Intelligent Systems and Technology, 2024, 15(5): 101
Cai M, Chen L, Wang Y, Bai H, Sun P, Wu L, Zhang M, Wang M. Popularity-aware alignment and contrast for mitigating popularity bias. In: Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024, 187–198
Huang L, Ma H, He X, Chang L. Multi-affect(ed): improving recommendation with similarity-enhanced user reliability and influence propagation. Frontiers of Computer Science, 2021, 15(5): 155331
Shuai J, Zhang K, Wu L, Sun P, Hong R, Wang M, Li Y. A review-aware graph contrastive learning framework for recommendation. In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2022, 1283–1293
Zhang J, Zhu Y, Liu Q, Wu S, Wang S, Wang L. Mining latent structures for multimedia recommendation. In: Proceedings of the 29th ACM International Conference on Multimedia. 2021, 3872–3880
He X, Deng K, Wang X, Li Y, Zhang Y, Wang M. LightGCN: simplifying and powering graph convolution network for recommendation. In: Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval. 2020, 639–648
Zhuang Y, Liu Q, Zhao G, Huang Z, Huang W, Pardos Z A, Chen E, Wu J, Li X. A bounded ability estimation for computerized adaptive testing. In: Proceedings of the 37th International Conference on Neural Information Processing Systems. 2023, 111
Hu Y, Koren Y, Volinsky C. Collaborative filtering for implicit feedback datasets. In: Proceedings of the 8th IEEE International Conference on Data Mining. 2008, 263–272
He X, Liao L, Zhang H, Nie L, Hu X, Chua T S. Neural collaborative filtering. In: Proceedings of the 26th International Conference on World Wide Web. 2017, 173–182
Salakhutdinov R, Mnih A. Probabilistic matrix factorization. In: Proceedings of the 21st International Conference on Neural Information Processing Systems. 2007, 1257–1264
Wu C, Wang X, Lian D, Xie X, Chen E. A causality inspired framework for model interpretation. In: Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2023, 2731–2741
Yu J, Lu M, Zhong Q, Yao Z, Tu S, Liao Z, Li X, Li M, Hou L, Zheng H T, Li J, Tang J. MoocRadar: a fine-grained and multi-aspect knowledge repository for improving cognitive student modeling in MOOCs. In: Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2023, 2924–2934
Reckase M D. The past and future of multidimensional item response theory. Applied Psychological Measurement, 1997, 21(1): 25–36
Li S, Guan Q, Fang L, Xiao F, He Z, He Y, Luo W. Cognitive diagnosis focusing on knowledge concepts. In: Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 2022, 3272–3281
Ma H, Li M, Wu L, Zhang H, Cao Y, Zhang X, Zhao X. Knowledge-sensed cognitive diagnosis for intelligent education platforms. In: Proceedings of the 31st ACM International Conference on Information & Knowledge Management. 2022, 1451–1460
Acknowledgements
This work has been supported in part by grants from the National Science and Technology Major Project (2021ZD0111802), the New Cornerstone Science Foundation through the XPLORER PRIZE, the National Natural Science Foundation of China (Grant Nos. 72188101, 62376086), and the Joint Funds of the National Natural Science Foundation of China (U22A2094).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Competing interests The authors declare that they have no competing interests or financial conflicts to disclose.
Additional information
Pengyang SHAO is currently pursuing a PhD degree at Hefei University of Technology (HFUT), China. He received his Bachelor’s degree in 2019 from the same university. His research interest lies on data mining, and large language models. He has published several papers in leading conferences and journals, including KDD, WWW, ACM TOIS and SCIS.
Kun ZHANG received a PhD degree in computer science and technology from the University of Science and Technology of China, China in 2019. He is currently a faculty member at the Hefei University of Technology (HFUT), China. His research interests include Natural Language Understanding and Recommender Systems. He has published several papers in referred journals and conferences, such as IEEE TSMC:S, IEEE TKDE, ACM TKDD, AAAI, KDD, ACL, and ICDM. He received the KDD 2018 Best Student Paper Award.
Chen GAO is now a Faculty Member (Research-track AP) of BNRist, Tsinghua University, China. He obtained his PhD degree (advised by Prof. Yong Li and Prof. Depeng Jin) and bachelor’s degree from the Department of Electronic Engineering, Tsinghua University in 2021 and 2016, respectively. His research primarily focuses on data mining (recommender system and spatio-temporal data mining), large language model, embodied agent, etc., with over 60 papers in top-tier venues (50+ CCF-A), attracting over 4,000 citations.
Lei CHEN is currently a postdoctoral researcher at Tsinghua University, China. He received his PhD from Hefei University of Technology, China in 2022. His research primarily focuses on fairness-aware recommender systems and large language model applications. He has published several papers in leading conferences and journals, including WWW, SIGIR, IEEE TKDE, and ACM TOIS.
Miaomiao CAI is a PhD candidate at Hefei University of Technology, China, where she also earned her Bachelor’s degree in Engineering in 2020. Her research primarily focuses on debiasing techniques for recommender systems. She has published several papers in leading conferences and journals, including ACM KDD, ACM MM, and ACM TIST.
Le WU is currently a professor at Hefei University of Technology (HFUT), China. She received her PhD degree from the University of Science and Technology of China (USTC), China. Her general area of research interests are data mining, recommender systems, and responsible user modeling. She has published more than 60 papers in referred journals and conferences, such as IEEE TKDE, NIPS, SIGIR, WWW, and AAAI. Dr. Le Wu is the recipient of the Best of SDM 2015 Award, and the Distinguished Dissertation Award from the China Association for Artificial Intelligence (CAAI) 2017.
Yong LI received the BS degree from Huazhong University of Science and Technology, China in 2007, and the MS and the PhD degrees in electrical engineering from Tsinghua University, China in 2009 and 2012, respectively. Currently, he is a Faculty Member of the Department of Electronic Engineering, Tsinghua University, China. His research interests are in the areas of wireless networking, mobile computing and urban computing. Dr. Li has served as General Chair, TPC Chair, TPC Member for several international workshops and conferences, and he is on the editorial board of four international journals.
Meng WANG is a professor at Hefei University of Technology, China. He received his BE degree and PhD degrees in the Special Class for the Gifted Young and the Department of Electronic Engineering and Information Science from the University of Science and Technology of China (USTC), China in 2003 and 2008, respectively. His current research interests include multimedia content analysis, computer vision, and pattern recognition. He is an associate editor of IEEE Transactions on Knowledge and Data Engineering (IEEE TKDE), IEEE Transactions on Circuits and Systems for Video Technology (IEEE TCSVT), IEEE Transactions on Multimedia (IEEE TMM), and IEEE Transactions on Neural Networks and Learning Systems (IEEE TNNLS).
Electronic Supplementary Material
Rights and permissions
About this article
Cite this article
Shao, P., Zhang, K., Gao, C. et al. Breaking student-concept sparsity barrier for cognitive diagnosis. Front. Comput. Sci. 19, 1911363 (2025). https://doi.org/10.1007/s11704-025-40591-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11704-025-40591-2