Abstract
When students use e-learning systems such as learning management systems and e-book systems, the operation logs are stored and analyzed to understand student learning behaviors. For implementing some applications, such as dashboard systems and at-risk student detection, the operation logs are mainly transformed into features designed by researchers. Such hand-crafted features, like the number of operations, are easily interpretable. However, the power of the hand-craft features may be limited for the recent large-scale educational dataset. In machine learning research, data-driven features are demonstrated to be a better representation than hand-crafted features. However, there are few discussions in the educational data due to a need for many operation logs. In this study, we collect reading logs of an e-book system. We propose a representation learning method for the reading logs based on contrastive learning. Our proposed method transforms time-series reading logs into reading behavior feature vectors directly without hand-crafted features. In our experiments, we demonstrate that the power of our feature representation is better than a traditional count-based hand-crafted feature representation in the at-risk student detection task. In addition, we investigate the characteristics of the feature space learned by our proposed method.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Chen, T., Kornblith, S., Norouzi, M., Hinton, G.: A simple framework for contrastive learning of visual representations. In: Proceedings of the 37th International Conference on Machine Learning. Proceedings of Machine Learning Research, vol. 119, pp. 1597–1607 (2020)
Choi, Y., et al.: EdNet: a large-scale hierarchical dataset in education. In: Bittencourt, I.I., Cukurova, M., Muldner, K., Luckin, R., Millán, E. (eds.) AIED 2020. LNCS (LNAI), vol. 12164, pp. 69–73. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-52240-7_13
Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Workshop on Statistical Learning in Computer Vision, ECCV, pp. 59–74. Prague (2004)
Ding, M., Yang, K., Yeung, D.Y., Pong, T.C.: Effective feature learning with unsupervised learning for improving the predictive models in massive open online courses. In: Proceedings of the 9th International Conference on Learning Analytics & Knowledge, pp. 135–144 (2019)
Gao, T., Yao, X., Chen, D.: Simcse: simple contrastive learning of sentence embeddings. In: Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp. 6894–6910 (2021)
Ke, G., He, D., Liu, T.Y.: Rethinking positional encoding in language pre-training. In: International Conference on Learning Representations (2020)
Kenton, J.D.M.W.C., Toutanova, L.K.: Bert: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of NAACL-HLT, pp. 4171–4186 (2019)
Lopez Zapata, E., Minematsu, T., Taniguchi, Y., Okubo, F., Shimada, A.: Encoding students reading characteristics to improve low academic performance predictive models. In: Companion Proceedings 12th International Conference on Learning Analytics & Knowledge (LAK22), pp. 36–38, March 2022
Loshchilov, I., Hutter, F.: Decoupled weight decay regularization. In: International Conference on Learning Representations (2018)
Van der Maaten, L., Hinton, G.: Visualizing data using t-SNE. J. Mach. Learn. Res. 9(86), 2579–2605 (2008)
Marras, M., Vignoud, J.T.T., Kaser, T.: Can feature predictive power generalize? Benchmarking early predictors of student success across flipped and online courses. In: 14th International Conference on Educational Data Mining, pp. 150–160 (2021)
Northcutt, C.G., Ho, A.D., Chuang, I.L.: Detecting and preventing “multiple-account” cheating in massive open online courses. Comput. Educ. 100, 71–80 (2016)
Ogata, H., et al.: Learning analytics for E-Book-based educational big data in higher education. In: Yasuura, H., Kyung, C.-M., Liu, Y., Lin, Y.-L. (eds.) Smart Sensors at the IoT Frontier, pp. 327–350. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-55345-0_13
Okubo, F., Yamashita, T., Shimada, A., Ogata, H.: A neural network approach for students’ performance prediction. In: Proceedings of the Seventh International Learning Analytics & Knowledge Conference, pp. 598–599 (2017)
Park, J., Denaro, K., Rodriguez, F., Smyth, P., Warschauer, M.: Detecting changes in student behavior from clickstream data. In: Proceedings of the Seventh International Learning Analytics & Knowledge Conference, pp. 21–30 (2017)
Raffel, C., et al.: Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res. 21(140), 1–67 (2020)
Vaswani, A., et al.: Attention is all you need. Adv. Neural Inf. Process. Syst. 30, 6000–6010 (2017)
Wang, F., Liu, H.: Understanding the behaviour of contrastive loss. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2495–2504 (2021)
Acknowledgements
This work was supported by JSPS KAKENHI Grant Number JP21K17864.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Minematsu, T., Taniguchi, Y., Shimada, A. (2023). Contrastive Learning for Reading Behavior Embedding in E-book System. In: Wang, N., Rebolledo-Mendez, G., Matsuda, N., Santos, O.C., Dimitrova, V. (eds) Artificial Intelligence in Education. AIED 2023. Lecture Notes in Computer Science(), vol 13916. Springer, Cham. https://doi.org/10.1007/978-3-031-36272-9_35
Download citation
DOI: https://doi.org/10.1007/978-3-031-36272-9_35
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-36271-2
Online ISBN: 978-3-031-36272-9
eBook Packages: Computer ScienceComputer Science (R0)