Abstract
Classroom instruction behavior analysis is one of the effective methods to optimize methods and designs of classroom teaching, improving the quality and efficiency of classroom instruction, and enriching the practical knowledge of teachers. With the continuous breakthroughs in artificial intelligence technology, there is a growing body of research on instructional behavior analysis. Instruction behavior recognition algorithms face great challenges due to the lack of appropriate datasets as data support. Therefore, in this paper, we propose a dataset CIBR for classroom instructional behavior recognition. The dataset is based on real classroom teaching videos from primary and secondary schools, covering 14 instructional behaviors, with a total of 2, 380 video samples. We evaluated our CIBR dataset on five commonly used behavior recognition models, namely 3D-RESnet 50, I3D, R (2 + 1) D-RGB, S3D and Timesformer, and compared it with the mainstream behavior recognition datasets UCF101 and HMDB51. The results show that the CIBR dataset is reliable, feasible, and effective. The creation of the CIBR dataset provides a data base for AI techniques to automatically identify and analyze the behavior of teachers and students in primary and secondary schools.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bertasius, G., wang, H., Torresani, L.: Is space-time attention all you need for video understanding? In: International Conference on Machine Learning (2021)
Carreira, J., Zisserman, A.: Quo vadis, action recognition? A new model and the kinetics dataset. In: 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 4724–4733 (2017). https://doi.org/10.1109/CVPR.2017.502
Gu, C., et al.: Ava: a video dataset of patio-temporally localized atomic visual actions. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6047–6056 (2018). https://doi.org/10.1109/CVPR.2018.00633
Hara, k., Kataoka, H., Satoh, Y.: Can spatiotemporal 3d CNNs retrace the history of 2d CNNs and ImageNet? In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 6546–6555 (2018). https://doi.org/10.1109/CVPR.2018.00685
Hassan, S.A., Akbar, S., Rehman, A., Saba, T., koliva, H., Bahaj, S.A.: Recent developments in detection of central serous retinopathy through imaging and artificial intelligence techniques-a review. IEEE Access 9, 168731–168748 (2021). https://doi.org/10.1109/ACCESS.2021.3108395
Heilbron, F.C., Escorcia, V., Ghanem, B., Niebles, J.C.: Activitynet: a large- scale video benchmark for human activity understanding. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 961–970 (2015). https://doi.org/10.1109/CVPR.2015.7298698
kay, w., et al.: The kinetics human action video dataset. Arxiv abs/1705.06950 (2017)
Kim, w., Choi, H.k., Jang, B.T., Lim, J.: Driver distraction detection using single convolutional neural network. In: 2017 International Conference on Information and Communication Technology Convergence (ICTC), pp. 1203–1205 (2017). https://doi.org/10.1109/ICTC.2017.8190898
Kuehne, H., Jhuang, H., Garrote, E., Poggio, T., Serre, T.: HMDB: a large video database for human motion recognition. In: 2011 International Conference on Computer Vision, pp. 2556–2563 (2011). https://doi.org/10.1109/IccV.2011.6126543
Li, Y., Chen, L., He, R., wang, Z., Wu, G., wang, L.: Multisports: a multi- person video dataset of spatio-temporally localized sports actions. In: 2021 IEEE/cVF International Conference on Computer Vision (ICCV), pp. 13516–13525 (2021)
Lin, Z., Xu, L.: Recognition of the research on teachers, behavior. Teach. Educ. Res. 2, 23–26 (2006)
Soomrom, K., Zamir, A.R., Shah, M.: ucf101: a dataset of 101 human actions classes from videos in the wild. arxiv https://arxiv.org/abs/1212.0402 (2012)
Xie, S., Sun, C., Huang, J., Tu, Z., Murphy, K.P.: Rethinking spatiotemporal feature learning: speed-accuracy tradeoffs in video classification. In: European Conference on Computer Vision (2017)
Xue, X.: Application of s-t analysis method in teaching. Jiangsu Educ. Res. 29, 4–8 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Xu, M., Shen, Y., Wang, S., Yuan, X. (2024). Oriented Classroom Instructional Behavior Recognition Benchmark. In: Kubincová, Z., et al. Emerging Technologies for Education. SETE 2023. Lecture Notes in Computer Science, vol 14606. Springer, Singapore. https://doi.org/10.1007/978-981-97-4243-1_11
Download citation
DOI: https://doi.org/10.1007/978-981-97-4243-1_11
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-4242-4
Online ISBN: 978-981-97-4243-1
eBook Packages: Computer ScienceComputer Science (R0)