Abstract
In this paper we present the rationale, and some initial results, of an automated system for classroom analysis which is based on speaker diarization techniques and non-verbal discourse features extracted from audio recordings. We have employed several Machine Learning algorithms and audio processing methods with classroom recordings related to several undergraduate courses. After determining the identity of the speaker in a recorded class, we can distinguish whether the speaker is a teacher, a student, there are multiple speakers at the same time, or silence. An important contribution of our work is that, from that information, we derive several non-verbal features that can be used to describe patterns. Our preliminary results show that it is possible to extract valuable information using data visualization. As we show, different teachers and teaching methods generate identifiable patterns, that might be used to analyze, for example, which methodologies and teaching styles provide higher levels of interaction or participation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bhattacharya, I., et al.: A multimodal-sensor-enabled room for unobtrusive group meeting analysis. In: Proceedings of the 20th ACM International Conference on Multimodal Interaction, pp. 347–355 (2018)
Donnelly, P.J., et al.: Automatic teacher modeling from live classroom audio. In: Proceedings of the 2016 Conference on User Modeling Adaptation and Personalization, pp. 45–53 (2016)
James, A., et al.: Automated classification of classroom climate by audio analysis. In: D’Haro, L., Banchs, R., Li, H. (eds.) 9th International Workshop on Spoken Dialogue System Technology. LNEE, vol. 579, pp. 41–49. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-9443-0_4
Lai, C., Carletta, J., Renals, S.: Modelling participant affect in meetings with turn-taking features. In: Workshop on Affective Social Speech Signals (2013)
Li, H., et al.: Multimodal learning for classroom activity detection. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 9234–9238 (2020)
Logan, B.: Mel frequency cepstral coefficients for music modeling. In: Proceedings of International Symposium on Music Information Retrieval (2000)
Nguyen, T.D., Cannata, M., Miller, J.: Understanding student behavioral engagement: importance of student interaction with peers and teachers. J. Educ. Res. 111(2), 163–174 (2018)
Owens, M.T., et al.: Classroom sound can be used to classify teaching practices in college science courses. Proc. Natl. Acad. Sci. U.S.A. 114(12), 3085–3090 (2017)
Park, T.J., Kanda, N., Dimitriadis, D., Han, K.J., Watanabe, S., Narayanan, S.: A review of speaker diarization: recent advances with deep learning. Comput. Speech Lang. 72, 101317 (2022)
Ramakrishnan, A., Zylich, B., Ottmar, E., LoCasale-Crouch, J., Whitehill, J.: Toward automated classroom observation: multimodal machine learning to estimate class positive climate and negative climate. IEEE Trans. Affect. Comput. (2021)
Rymes, B.: Classroom Discourse Analysis: A Tool for Critical Reflection. Routledge, New York (2015)
Schlotterbeck, D., Uribe, P., Araya, R., Jimenez, A., Caballero, D.: What classroom audio tells about teaching: a cost-effective approach for detection of teaching practices using spectral audio features. In: LAK21: 11th International Learning Analytics and Knowledge Conference, pp. 132–140 (2021)
Su, H., Dzodzo, B., Wu, X., Liu, X., Meng, H.: Unsupervised methods for audio classification from lecture discussion recordings. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, pp. 3347–3351 (2019)
Wang, Z., Pan, X., Miller, K.F., Cortina, K.S.: Automatic classification of activities in classroom discourse. Comput. Educ. 78, 115–123 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Canovas, O., Garcia, F.J. (2023). Analysis of Classroom Interaction Using Speaker Diarization and Discourse Features from Audio Recordings. In: Auer, M.E., Pachatz, W., Rüütmann, T. (eds) Learning in the Age of Digital and Green Transition. ICL 2022. Lecture Notes in Networks and Systems, vol 634. Springer, Cham. https://doi.org/10.1007/978-3-031-26190-9_7
Download citation
DOI: https://doi.org/10.1007/978-3-031-26190-9_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-26189-3
Online ISBN: 978-3-031-26190-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)