Lecture Video Summarization Using Deep Learning

Khetarpaul, Sonia; Jain, Lakshay; Goyal, Kush; Tej, P. Vishnu

doi:10.1007/978-981-97-5934-7_9

Sonia Khetarpaul ORCID: orcid.org/0000-0001-6058-7235¹¹,
Lakshay Jain¹¹,
Kush Goyal¹¹ &
…
P. Vishnu Tej¹¹

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 2145))

Included in the following conference series:

Asian Conference on Intelligent Information and Database Systems

161 Accesses

Abstract

In today’s rapidly evolving digital era, educational institutions and professors are increasingly sharing video lectures online to support students. While these lectures effectively teach topics from scratch, they pose challenges for quick revisions. Viewers struggle to control the pace, often interrupting playback to navigate the content. Additionally, finding specific information within the video and skimming the unstructured transcript for relevant content can be cumbersome, hindering efficient learning. The objective of this paper is to provide a framework for summarizing video lectures to facilitate quick revisions. We further aim to help students with topic-wise preparation by summarizing specific lecture sections relevant to them, saving them the extra effort of manually searching through the entire lectures. We employ shot detection, speech transcription, transcript summarization, and ontology tree to generate topic-wise slideshow summaries. The summarization models: BART, T5, GPT3, Extractive, and Seq2Seq are evaluated on the VT-Ssum dataset, and their ROUGE scores are compared to select the best model.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 69.99; Price excludes VAT (USA)

Softcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Abstractive Summarization of Lectures and Lecture Segments Transcripts with BART

Abstractive video lecture summarization: applications and future prospects

Article 16 June 2023

Lecture Video Summarization Using Subtitles

References

Andra, M.B., Usagawa, T.: Automatic lecture video content summarization with attention-based recurrent neural network. In: 2019 International Conference of Artificial Intelligence and Information Technology (ICAIIT), pp. 54–59. IEEE (2019)
Google Scholar
Brown, T., et al.: Language models are few-shot learners. Adv. Neural Inf. Process. Syst. 33, 1877–1901 (2020)
Google Scholar
Che, X., Yang, H., Meinel, C.: Lecture video segmentation by automatically analyzing the synchronized slides. In: Proceedings of the 21st ACM International Conference on Multimedia, pp. 345–348 (2013)
Google Scholar
Chen, Y.N., Huang, Y., Yeh, C.F., Lee, L.S.: Spoken lecture summarization by random walk over a graph constructed with automatically extracted key terms. In: Twelfth Annual Conference of the International Speech Communication Association (2011)
Google Scholar
Chopra, S., Auli, M., Rush, A.M.: Abstractive sentence summarization with attentive recurrent neural networks. In: Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 93–98 (2016)
Google Scholar
Lewis, M., et al.: BART: denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In: Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pp. 7871–7880 (2020)
Google Scholar
Lv, T., Cui, L., Vasilijevic, M., Wei, F.: VT-SSum: a benchmark dataset for video transcript segmentation and summarization. arXiv preprint arXiv:2106.05606 (2021)
Ma, D., Agam, G.: Lecture video segmentation and indexing. In: Document Recognition and Retrieval XIX, vol. 8297, pp. 238–245. SPIE (2012)
Google Scholar
Nallapati, R., Zhou, B., Gu?lçehre, C., Xiang, B.: Abstractive text summarization using sequence-to-sequence RNNs and beyond. In: Proceedings of the 20th SIGNLL Conference on Computational Natural Language Learning, pp. 280–290 (2016)
Google Scholar
Paulus, R., Xiong, C., Socher, R.: A deep reinforced model for abstractive summarization. In: 6th International Conference on Learning Representations, ICLR 2018, Conference Track Proceedings (2018)
Google Scholar
Raffel, C., et al.: Exploring the limits of transfer learning with a unified text-to-text transformer. J. Mach. Learn. Res. 21(1), 5485–5551 (2020)
Google Scholar
Rush, A.M., Chopra, S., Weston, J.: A neural attention model for abstractive sentence summarization. In: Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, pp. 379–389 (2015)
Google Scholar
Salatino, A.A., Thanapalasingam, T., Mannocci, A., Birukou, A., Osborne, F., Motta, E.: The computer science ontology: a comprehensive automatically-generated taxonomy of research areas. Data Intell. 2(3), 379–416 (2020)
Google Scholar
See, A., Liu, P.J., Manning, C.D.: Get to the point: summarization with pointer-generator networks. In: Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, Volume 1: Long Papers), pp. 1073–1083 (2017)
Google Scholar
Shi, T., Keneshloo, Y., Ramakrishnan, N., Reddy, C.K.: Neural abstractive text summarization with sequence-to-sequence models. ACM Trans. Data Sci. 2(1), 1–37 (2021)
Google Scholar

Download references

Author information

Authors and Affiliations

Shiv Nadar Institution of Eminence, Delhi-NCR, India
Sonia Khetarpaul, Lakshay Jain, Kush Goyal & P. Vishnu Tej

Authors

Sonia Khetarpaul
View author publications
You can also search for this author in PubMed Google Scholar
Lakshay Jain
View author publications
You can also search for this author in PubMed Google Scholar
Kush Goyal
View author publications
You can also search for this author in PubMed Google Scholar
P. Vishnu Tej
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sonia Khetarpaul .

Editor information

Editors and Affiliations

Wroclaw University of Science and Technology, Wrocław, Poland
Ngoc Thanh Nguyen
University of Pau and Adour Countries, Pau, France
Richard Chbeir
Open University of Cyprus, Latsia, Cyprus
Yannis Manolopoulos
Iwate Prefectural University, Takizawa, Japan
Hamido Fujita
National University of Kaohsiung, Kaohsiung, Taiwan
Tzung-Pei Hong
Japan Advanced Institute of Science and Technology, Nomi, Japan
Le Minh Nguyen
Wrocław University of Science and Technology, Wrocław, Poland
Krystian Wojtkiewicz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Khetarpaul, S., Jain, L., Goyal, K., Tej, P.V. (2024). Lecture Video Summarization Using Deep Learning. In: Nguyen, N.T., et al. Recent Challenges in Intelligent Information and Database Systems. ACIIDS 2024. Communications in Computer and Information Science, vol 2145. Springer, Singapore. https://doi.org/10.1007/978-981-97-5934-7_9

Download citation

DOI: https://doi.org/10.1007/978-981-97-5934-7_9
Published: 13 August 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-5933-0
Online ISBN: 978-981-97-5934-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics