Abstract
Urgent intervention in learner forum posts have recently occupied a very important role in research in Massive Open Online Course (MOOC) environments. Intervening in time may make the difference between a learner dropping out or staying on a course. However, due to the typical extremely high learner-to-instructor ratio in MOOCs, it is very challenging – if not sometimes impossible - for the instructor to monitor all the existing posts and identify which need immediate intervention, to encourage retention. Current approaches are based on shallow machine learning and deep learning. Whilst deep learning methods have been shown to be most accurate in many domains, the exact architecture can be very domain-dependent. In spite of their sheer size and representation power, deep neural networks are known to perform better when a problem is divided into the right sub-problems. These sub-problems can be further assembled together, to answer to the original problem, in what we intuitively call a ‘plug & play’-like fashion, similarly to puzzles – via hybrid (deep) neural networks. Hence, in this paper, we address this problem by proposing a classification model for identifying when a given post needs intervention from an instructor, based on hybrid neural networks. We represent words using two different methods; word2vec: that capture the word's semantic and syntactic characteristics; and transformer model (BERT): which represents each word according to its context. Then we construct different architectures, integrating various deep neural networks (DNNs) -‘word-based’ or ‘word-character based’, as we expected that adding additional character-sequence information may increase performance. For word-based, we apply convolutional neural network (CNN) and/or different types of recurrent neural networks (RNN); in some scenarios we added attention. This is to present a comprehensive answer to the character-sequence question in particular, as well as to the urgency of intervention need prediction in MOOC forums, in general. Experimental results demonstrate that using BERT rather than word2vec as a word embedding enhances performance in different models (the optimal result is the CNN + LSTM + Attention model based on BERT at word-level). Interestingly, adding word-character input does not improve the performance, as it does for word2vec.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Arguello, J., Shaffer, K.: Predicting speech acts in MOOC forum posts. In: Ninth International AAAI Conference on Web and Social Media (2015)
Chaturvedi, S., Goldwasser, D., Daumé III, H.: Predicting instructor's intervention in MOOC forums. In: Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, vol. 1: Long Papers (2014)
Yang, T.-Y., et al.: Behavior-based grade prediction for MOOCs via time series neural networks. IEEE J. Sel. Topics Signal Process. 11(5), 716–728 (2017)
Wise, A.F., Cui, Y., Vytasek, J.: Bringing order to chaos in MOOC discussion forums with content-related thread identification. In: Proceedings of the Sixth International Conference on Learning Analytics & Knowledge. ACM (2016)
Yang, D., et al.: Exploring the effect of confusion in discussion forums of massive open online courses. In: Proceedings of the Second (2015) ACM Conference on Learning@ Scale. ACM (2015)
Crossley, S., et al.: Combining click-stream data with NLP tools to better understand MOOC completion. In: Proceedings of the Sixth International Conference on Learning Analytics & Knowledge. ACM (2016)
Kizilcec, R.F., Halawa, S.: Attrition and achievement gaps in online learning. In: Proceedings of the Second (2015) ACM Conference on Learning@ Scale (2015)
Chandrasekaran, M.K., et al.: Using discourse signals for robust instructor intervention prediction. In: Thirty-First AAAI Conference on Artificial Intelligence (2017)
Minaee, S., et al.: Deep learning–based text classification: a comprehensive review. ACM Comput. Surv. (CSUR) 54(3), 1–40 (2021)
Rani, S., Kumar, P.: Deep learning based sentiment analysis using convolution neural network. Arab. J. Sci. Eng. 44(4), 3305–3314 (2019)
Devlin, J., et al.: Bert: pre-training of deep bidirectional transformers for language understanding. arXiv preprint arXiv:1810.04805 (2018)
Mazari, A.C., Boudoukhani, N., Djeffal, A.: BERT-based ensemble learning for multi-aspect hate speech detection. Cluster Comput., 1–15 (2023)
Khodeir, N.A.: Bi-GRU Urgent classification for MOOC discussion forums based on BERT. IEEE Access 9, 58243–58255 (2021)
Yin, W., et al.: Comparative study of cnn and rnn for natural language processing. arXiv preprint arXiv:1702.01923 (2017)
Mikolov, T., et al.: Distributed representations of words and phrases and their compositionality. In Advances in Neural Information Processing Systems (2013)
Chandrasekaran, M.K., et al.: Learning instructor intervention from mooc forums: early results and issues. arXiv preprint arXiv:1504.07206 (2015)
Agrawal, A., et al.: YouEDU: addressing confusion in MOOC discussion forums by recommending instructional video clips. In: The 8th International Conference on Educational Data Mining (2015)
Bakharia, A.: Towards cross-domain mooc forum post classification. In: Proceedings of the Third (2016) ACM Conference on Learning@ Scale. ACM (2016)
Wei, X., et al.: A convolution-LSTM-based deep neural network for cross-domain MOOC forum post classification. Information 8(3), 92 (2017)
Almatrafi, O., Johri, A., Rangwala, H.: Needle in a haystack: Identifying learner posts that require urgent response in MOOC discussion forums. Comput. Educ. 118, 1–9 (2018)
Sun, X., et al.: Identification of urgent posts in MOOC discussion forums using an improved RCNN. In: 2019 IEEE World Conference on Engineering Education (EDUNINE). IEEE (2019)
Alrajhi, L., Alharbi, K., Cristea, A.I.: A Multidimensional deep learner model of urgent instructor intervention need in MOOC forum posts. In: Kumar, V., Troussas, C. (eds.) ITS 2020. LNCS, vol. 12149, pp. 226–236. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-49663-0_27
Guo, S.X., et al.: Attention-based character-word hybrid neural networks with semantic and structural information for identifying of urgent posts in MOOC discussion forums. IEEE Access 7, 120522–120532 (2019)
Kim, Y.: Convolutional neural networks for sentence classification. arXiv preprint arXiv:1408.5882 (2014)
Elman, J.L.: Finding structure in time. Cogn. Sci. 14(2), 179–211 (1990)
Hochreiter, S., Schmidhuber, J., Elvezia, C.: Long short-term memory. Neural Compu. 9(8), 1735–1780 (1997)
Chung, J., et al.: Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv preprint arXiv:1412.3555 (2014)
Schuster, M., Paliwal, K.K.: Bidirectional recurrent neural networks. IEEE Trans. Signal Process. 45(11), 2673–2681 (1997)
Zhao, C., Han, J.G., Xu, X.: CNN and RNN based neural networks for action recognition. In: Journal of Physics: Conference Series. IOP Publishing (2018)
Ullah, A., et al.: Action recognition in video sequences using deep bi-directional LSTM with CNN features. IEEE Access 6, 1155–1166 (2017)
Tsironi, E., et al.: An analysis of convolutional long short-term memory recurrent neural networks for gesture recognition. Neurocomputing 268, 76–86 (2017)
Wang, X., Jiang, W., Luo, Z.: Combination of convolutional and recurrent neural network for sentiment analysis of short texts. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical papers (2016)
Lai, S., et al.: Recurrent convolutional neural networks for text classification. In: Twenty-Ninth AAAI Conference on Artificial Intelligence (2015)
Zhang, Z., Robinson, D., Tepper, J.: Detecting hate speech on twitter using a convolution-gru based deep neural network. In: Gangemi, A., et al. (eds.) The Semantic Web, pp. 745–760. Springer International Publishing, Cham (2018). https://doi.org/10.1007/978-3-319-93417-4_48
Liang, D., Xu, W., Zhao, Y.: Combining word-level and character-level representations for relation classification of informal text. In: Proceedings of the 2nd Workshop on Representation Learning for NLP (2017)
Yenigalla, P., Kar, S., Singh, C., Nagar, A., Mathur, G.: Addressing unseen word problem in text classification. In: Silberztein, M., Atigui, F., Kornyshova, E., Métais, E., Meziane, F. (eds.) Natural Language Processing and Information Systems, pp. 339–351. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-91947-8_36
Wise, A.F., et al.: Mining for gold: identifying content-related MOOC discussion threads across domains through linguistic modeling. Internet High. Educ. 32, 11–28 (2017)
Clark, K., et al.: What does bert look at? an analysis of bert's attention. arXiv preprint arXiv:1906.04341 (2019)
Yang, Z., et al.: Hierarchical attention networks for document classification. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (2016)
Zhang, X., Zhao, J., LeCun, Y.: Character-level convolutional networks for text classification. In: Advances in Neural Information Processing Systems (2015)
Alamri, A., et al.: MOOC next week dropout prediction: weekly assessing time and learning patterns (2021)
Boukkouri, H.E., et al.: CharacterBERT: reconciling ELMo and BERT for word-level open-vocabulary representations from characters. arXiv preprint arXiv:2010.10392 (2020)
Ma, W., et al.: CharBERT: character-aware pre-trained language model. arXiv preprint arXiv:2011.01513 (2020)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Alrajhi, L., Cristea, A.I. (2023). Plug & Play with Deep Neural Networks: Classifying Posts that Need Urgent Intervention in MOOCs. In: Frasson, C., Mylonas, P., Troussas, C. (eds) Augmented Intelligence and Intelligent Tutoring Systems. ITS 2023. Lecture Notes in Computer Science, vol 13891. Springer, Cham. https://doi.org/10.1007/978-3-031-32883-1_57
Download citation
DOI: https://doi.org/10.1007/978-3-031-32883-1_57
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-32882-4
Online ISBN: 978-3-031-32883-1
eBook Packages: Computer ScienceComputer Science (R0)