Analysis of Classroom Interaction Using Speaker Diarization and Discourse Features from Audio Recordings

Canovas, Oscar; Garcia, Felix J.

doi:10.1007/978-3-031-26190-9_7

Part of the book series: Lecture Notes in Networks and Systems (LNNS, volume 634)

Included in the following conference series:

International Conference on Interactive Collaborative Learning

Abstract

In this paper we present the rationale for, and some initial results of, an automated system for classroom analysis based on speaker diarization techniques and non-verbal discourse features extracted from audio recordings. We applied several machine learning algorithms and audio processing methods to classroom recordings from several undergraduate courses. After determining the identity of the speaker in a recorded class, we can distinguish whether a segment contains teacher speech, student speech, multiple simultaneous speakers, or silence. An important contribution of our work is that, from this information, we derive several non-verbal features that can be used to describe patterns. Our preliminary results show that it is possible to extract valuable information using data visualization. As we show, different teachers and teaching methods generate identifiable patterns that might be used to analyze, for example, which methodologies and teaching styles provide higher levels of interaction or participation.
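
As a rough illustration of the kind of processing the abstract describes, the sketch below turns already-labelled diarization segments into a few simple non-verbal features. It is not the authors' implementation: the abstract does not say which features are derived, so the talk-time ratios and turn counts used here are assumptions, as are the `Segment` type, the `discourse_features` function, and the four label strings.

```python
from collections import defaultdict
from typing import NamedTuple


class Segment(NamedTuple):
    """One labelled span of a recording (hypothetical structure)."""
    start: float  # seconds from the beginning of the recording
    end: float    # seconds from the beginning of the recording
    label: str    # assumed labels: "teacher", "student", "multiple", "silence"


def discourse_features(segments):
    """Summarise labelled segments into illustrative non-verbal features."""
    # Total time spent in each label.
    durations = defaultdict(float)
    for seg in segments:
        durations[seg.label] += seg.end - seg.start
    total = sum(durations.values())

    # Count a new turn whenever the label changes; silence is not a turn.
    turns = defaultdict(int)
    previous = None
    for seg in sorted(segments, key=lambda s: s.start):
        if seg.label != previous and seg.label != "silence":
            turns[seg.label] += 1
        previous = seg.label

    features = {}
    if total > 0:
        for label, duration in durations.items():
            features[f"{label}_ratio"] = duration / total
    features["teacher_turns"] = turns["teacher"]
    features["student_turns"] = turns["student"]
    return features


# Made-up example data, not taken from the paper.
example = [
    Segment(0, 60, "teacher"),
    Segment(60, 70, "silence"),
    Segment(70, 90, "student"),
    Segment(90, 100, "teacher"),
]
print(discourse_features(example))
```

Features of this kind could then be plotted per class session to compare teachers or teaching methods, which is the use the abstract suggests for data visualization.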

Author information

Authors and Affiliations

Department of Computer Engineering and Technology, University of Murcia, Murcia, 30100, Spain
Oscar Canovas & Felix J. Garcia

Authors

Oscar Canovas
Felix J. Garcia

Corresponding author

Correspondence to Oscar Canovas.

Editor information

Editors and Affiliations

CTI Global, Frankfurt/Main, Germany
Michael E. Auer
Federal Ministry of Education, Science and Research, Vienna, Austria
Wolfgang Pachatz
Tallinn University of Technology, Tallinn, Estonia
Tiia Rüütmann

About this paper

Cite this paper

Canovas, O., Garcia, F.J. (2023). Analysis of Classroom Interaction Using Speaker Diarization and Discourse Features from Audio Recordings. In: Auer, M.E., Pachatz, W., Rüütmann, T. (eds) Learning in the Age of Digital and Green Transition. ICL 2022. Lecture Notes in Networks and Systems, vol 634. Springer, Cham. https://doi.org/10.1007/978-3-031-26190-9_7

DOI: https://doi.org/10.1007/978-3-031-26190-9_7
Published: 23 February 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-26189-3
Online ISBN: 978-3-031-26190-9
eBook Packages: Intelligent Technologies and Robotics (R0)
