Skip to main content
Log in

Abstractive video lecture summarization: applications and future prospects

  • Published:
Education and Information Technologies Aims and scope Submit manuscript

Abstract

Modern educational technology systems allow learners to access large amounts of learning materials such as educational videos, learning notes, and teaching books. Automated summarization techniques simplify the access and exploration of complex data collections by producing synthetic versions of the original content. This paper addresses the problem of video lecture summarization by means of abstractive techniques. To enhance the accessibility of the video lecture content in challenging contexts or while coping with learners with special needs it produces a synthetic textual summary condensing the key concepts mentioned in the lecture’s speech. Unlike prior works based on extractive methods, the proposed method can produce more readable and actionable summaries, not necessarily composed of existing portions of speech content. To compensate the lack of annotated data, it also opportunistically reuses the pretrained models available for meeting summarization. The experimental results achieved on a benchmark dataset show that the proposed method generates more fluent and actionable summaries than prior approaches simply relying on content extraction. Finally, we also envision further applications of summarization techniques to learning content. The future prospects of use of summarization techniques in education have shown to go well beyond video summarization.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

Data Availability

The datasets analyzed during and/or analysed during the current study are available at https://ocw.mit.edu/

Notes

  1. https://ocw.mit.edu/about/ Latest access: August 2022

  2. http://uis.unesco.org/en/topic/international-standard-classification-education-isced Latest access: May 2022

  3. https://cloud.google.com/speech-to-text Latest access: May 2022

  4. https://pypi.org/project/fastpunct/ Latest access: May 2022

  5. https://pypi.org/project/pytextrank/ Latest access: June 2022

  6. https://pypi.org/project/sentence-transformers/0.3.0/ Latest access: June 2022

  7. https://huggingface.co/docs/transformers/ Latest access: September 2022

  8. https://ocw.mit.edu/about/ Latest access: August 2022

References

  • Abhilash, R. K., Anurag, C., & Avinash, V. (2021). Lecture video summarization using subtitles. In A. Haldorai, A. Ramu, & S. Mohanram (Eds.), 2nd EAI International Conference on Big Data Innovation for Sustainable Cognitive Computing (pp. 83–92). Cham: Springer International Publishing.

    Chapter  Google Scholar 

  • Alam, T., Khan, A., & Alam, F. (2020). Punctuation restoration using transformer models for high-and low-resource languages. In: Proceedings of the Sixth Workshop on Noisy User-generated Text (W-NUT 2020). Association for Computational Linguistics, Online (pp.132–142). https://doi.org/10.18653/v1/2020.wnut-1.18. https://aclanthology.org/2020.wnut-1.18

  • Atapattu, T., & Falkner, K. (2018). Impact of lecturer’s discourse for students’ video engagement: Video learning analytics case study of moocs. J Learn Anal, 5(3). https://doi.org/10.18608/jla.2018.53.12

  • Baralis, E., & Cagliero, L. (2016). Learning from summaries: Supporting e-learning activities by means of document summarization. IEEE Transactions on Emerging Topics in Computing, 4(3), 416–428. https://doi.org/10.1109/TETC.2015.2493338

    Article  Google Scholar 

  • Baralis, E., & Cagliero, L. (2018). Highlighter: Automatic highlighting of electronic learning documents. IEEE Trans Emerg Top Comput, 6(1), 7–19. https://doi.org/10.1109/TETC.2017.2681655

    Article  Google Scholar 

  • Benedetto, I., Canale, L., Farinetti, L., Cagliero, L., & Quatra, M. (2022). Leveraging summarization techniques in educational technology systems. 46th IEEE Annual Computers, Software, And Applications Conferenc, COMPSAC 2022, Los Alamitos, CA, USA, June 27 - July 1, 2022 (pp. 415-416). https://doi.org/10.1109/COMPSAC54236.2022.00068

  • Borsos, Z., Marinier, R., & Vincent, D. (2022). Audiolm: a language modeling approach to audio generation. https://doi.org/10.48550/arXiv.2209.03143. arXiv:2209.03143

  • Cagliero, L., Farinetti, L., & Baralis, E. (2019). Recommending personalized summaries of teaching materials. IEEE Access, 7:22,729–22,739. https://doi.org/10.1109/ACCESS.2019.2899655

  • Carletta, J., Ashby, S., & Bourban, S. (2005). The ami meeting corpus: A pre-announcement.https://doi.org/10.1007/11677482_3

    Article  Google Scholar 

  • Chandrasekaran, D., & Mago, V. (2021). Evolution of semantic similarity-a survey. ACM Computing Surveys, 54(2), 1–37. https://doi.org/10.1145/3440755

    Article  Google Scholar 

  • Choudary, C., & Liu, T. (2007). Summarization of visual content in instructional videos. IEEE Transactions on Multimedia, 9(7), 1443–1455. https://doi.org/10.1109/TMM.2007.906602

    Article  Google Scholar 

  • Devlin, J., Chang, M.W., & Lee, K. (2019). Bert: Pre-training of deep bidirectional transformers for language understanding. arXiv:1810.04805

  • El-Kassas, W. S., Salama, C. R., & Rafea, A. A. (2021). Automatic text summarization: A comprehensive survey. Expert Systems with Applications, 165(113), 679. https://doi.org/10.1016/j.eswa.2020.113679

    Article  Google Scholar 

  • Fujii, Y., Yamamoto, K., & Kitaoka, N. (2008) Class lecture summarization taking into account consecutiveness of important sentences. 2438–2441

  • Garg, S. (2017). Automatic text summarization of video lectures using subtitles. In: Patnaik, S., & Popentiu-Vladicescu, F. (Eds.) Recent Developments in Intelligent Computing, Communication and Devices (pp. 45–52). Springer Singapore, Singapore

  • Gliwa, B., Mochol, I., & Biesek, M. (2019) Samsum corpus: A human-annotated dialogue dataset for abstractive summarization. arXiv:1911.12237

  • Gottipati, S., Shankararaman, V., & Ramesh, R. (2019). TopicSummary: A Tool for Analyzing Class Discussion Forums using Topic Based Summarizations. In: 2019 IEEE Frontiers in Education Conference (FIE) (pp. 1–9). IEEE, Covington, KY, USA. https://doi.org/10.1109/FIE43999.2019.9028526. https://ieeexplore.ieee.org/document/9028526/

  • Goularte, F. B., Nassar, S. M., & Fileto, R. (2019). A text summarization method based on fuzzy rules and applicable to automated assessment. Expert Systems with Applications, 115, 264–275. https://doi.org/10.1016/j.eswa.2018.07.047, . https://www.linkinghub.elsevier.com/retrieve/pii/S0957417418304743

  • Hermann, K.M., Kocisky, T., & Grefenstette, E. (2015). Teaching machines to read and comprehend. In: NIPS

  • Janin, A., Baron, D., & Edwards, J. (2003). The icsi meeting corpus. I–364. https://doi.org/10.1109/ICASSP.2003.1198793

  • Khalil, M., Prinsloo, P., & Slade, S. (2022). A comparison of learning analytics frameworks: A systematic review. In: LAK22: 12th International Learning Analytics and Knowledge Conference (pp. 152-163). Association for Computing Machinery, New York, NY, USA, LAK22. https://doi.org/10.1145/3506860.3506878

  • Lee, H., Liu, M., & Riaz, H. (2021). Attention based video summaries of live online zoom classes. https://dblp.org/rec/journals/corr/abs-2101-06328.bib

  • Lewis, M., Liu, Y., & Goyal, N. (2019) Bart: Denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. https://aclanthology.org/2020.acl-main.703/

  • Lin, C.Y. (2004). Rouge: A package for automatic evaluation of summaries. p 10

  • Litvak, M., & Vanetik, N. (2017). Query-based summarization using MDL principle. In: Proceedings of the MultiLing 2017 Workshop on Summarization and Summary Evaluation Across Source Types and Genres (pp. 22–31). Association for Computational Linguistics, Valencia, Spain. https://doi.org/10.18653/v1/W17-1004. https://aclanthology.org/W17-1004

  • Lv, T., Cui, L., & Vasilijevic, M. (2021). Vt-ssum: A benchmark dataset for video transcript segmentation and summarization. arXiv:2106.05606

  • Mihalcea, R., & Tarau, P. (2004). TextRank: Bringing order into text. In:Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (pp. 404–411). Association for Computational Linguistics, Barcelona, Spain. https://aclanthology.org/W04-3252

  • Miller, D. (2019). Leveraging BERT for extractive text summarization on lectures. arXiv:1906.04165

  • Mitchell, A., Petter, S., & Harris, A. (2017). Learning by doing: Twenty successful active learning exercises for information systems courses. Journal of Information Technology Education : Innovations in Practice, 16:21–46. https://doi.org/10.28945/3643

  • Page, L., Brin, S., & Motwani, R. (1999). The pagerank citation ranking: Bringing order to the web. Stanford InfoLab: Tech. rep.

    Google Scholar 

  • Parmanto, B., Ferrydiansyah, R., & Saptono, A. (2005) Access: Accessibility through simplification and summarization. In: Proceedings of the 2005 International Cross-Disciplinary Workshop on Web Accessibility (W4A) (pp. 18–25). Association for Computing Machinery, New York, NY, USA, W4A ’05. https://doi.org/10.1145/1061811.1061815

  • Pedrotti, M., & Nistor, N. (2014). Online lecture videos in higher education: Acceptance and motivation effects on students’ system use. In: IEEE 14th International Conference on Advanced Learning Technologies (pp. 477–479). ICALT 2014, Athens, Greece, July 7-10, 2014. IEEE Computer Society. https://doi.org/10.1109/ICALT.2014.141

  • Pi, Z., Zhang, Y., & Xu, K. (2022). Does an outline of contents promote learning from videos? a study on learning performance and engagement. Education and Information Technologies, 28, 3493–3511. https://doi.org/10.1007/s10639-022-11361-5

    Article  Google Scholar 

  • Pramudianto, F., Chhabra, T., & Gehringer, E. (2016). Assessing the quality of automatic summarization for peer review in education. In: EDM

  • Rahman, M. R., Shah, S., & Subhlok, J. (2020). Visual summarization of lecture video segments for enhanced navigation.https://doi.org/10.1109/ISM.2020.00033

    Article  Google Scholar 

  • Romero, C., & Ventura, S. (2020). Educational data mining and learning analytics: An updated survey. WIREs Data Mining and Knowledge Discovery 10(3):e1355. https://doi.org/10.1002/widm.1355. https://onlinelibrary.wiley.com/doi/abs/10.1002/widm.1355. arXiv:10.1002/widm.1355

  • Saini, M., Arora, V., & Singh, M. (2022). Artificial intelligence inspired multilanguage framework for note-taking and qualitative content-based analysis of lectures. Education and Information Technologies, 1–23

  • Shimada, A., Okubo, F., & Yin, C. (2018). Automatic summarization of lecture slides for enhanced student preview-technical report and user study. IEEE Transactions on Learning Technologies 11(2):165–178. https://doi.org/10.1109/TLT.2017.2682086, funding Information: This research was partially supported by ”PRESTO”, Japan Science and Technology Agency (JST) Japan, and ”Research and Development on Fundamental and Utilization Technologies for Social Big Data” (178A03), the Commissioned Research of the National Institute of Information and Communications Technology (NICT) Japan. Publisher Copyright: 2008-2011 IEEE.

  • Tan, B., Qin, L., & Xing, E.P. (2020). Summarizing text on any aspects: A knowledge-informed weakly-supervised approach. arXiv:2010.06792

  • Tilk, O., & Alumäe, T. (2016). Bidirectional recurrent neural network with attention mechanism for punctuation restoration. In: INTERSPEECH

  • Wang, F., & Chen, Z. (2018). Self-attention based network for punctuation restoration. In: 2018 24th International Conference on Pattern Recognition (ICPR), 2803–2808. https://doi.org/10.1109/ICPR.2018.8545470

  • Yoo, T., Jeong, H., & Lee, D. (2021). Lectys: A system for summarizing lecture videos on youtube. In: 26th International Conference on Intelligent User Interfaces - Companion (p 90–92). Association for Computing Machinery, New York, NY, USA, IUI ’21 Companion. https://doi.org/10.1145/3397482.3450722

  • Zhang, J., Zhao, Y., & Saleh M. (2019a). PEGASUS: pre-training with extracted gap-sentences for abstractive summarization. arXiv:1912.08777

  • Zhang, T., Kishore V., & Wu, F., (2019b). Bertscore: Evaluating text generation with BERT. arXiv:1904.09675

  • Zhu, C., Xu, R., & Zeng, M. (2020). A hierarchical network for abstractive meeting summarization with cross-domain pretraining. Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing. https://www.microsoft.com/en-us/research/publication/end-to-end-abstractive-summarization-for-meetings/

  • Zhu, J., Li, H., & Liu, T. (2018). MSMO: Multimodal summarization with multimodal output. In: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing (pp. 4154–4164). Association for Computational Linguistics, Brussels, Belgium. https://doi.org/10.18653/v1/D18-1448. https://aclanthology.org/D18-1448

Download references

Funding

The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed equally to this research work. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Irene Benedetto.

Ethics declarations

Informed consent

The paper is an extended version of the preliminary work presented in Benedetto et al. (2022). Unlike the prior work, the current manuscript contains \(\bullet \) An overview of the existing benchmark datasets for video lecture summarization (see Section 2.1 of the current manuscript). \(\bullet \) A more thorough description of the presented methodology (see Section 2.2). \(\bullet \) A validation of the summaries generated from the open-source video lectures available in the MIT OpenCourseWare repository (see Sections 2.3, 2.4, and 2.5). \(\bullet \) A more extensive overview of the related works on summarization in education (See Section 3). \(\bullet \) A discussion of the future prospects of use of summarization techniques in education (See Section 4).

Competing interests

The authors have no relevant financial or non-financial interests to disclose.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Benedetto, I., La Quatra, M., Cagliero, L. et al. Abstractive video lecture summarization: applications and future prospects. Educ Inf Technol 29, 2951–2971 (2024). https://doi.org/10.1007/s10639-023-11855-w

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10639-023-11855-w

Keyword

Navigation