Lecture Video Browsing Using Multimodal Information Resources

Yang, Haojin; Grünewald, Franka; Bauer, Matthias; Meinel, Christoph

doi:10.1007/978-3-642-41175-5_21

Lecture Video Browsing Using Multimodal Information Resources

Haojin Yang¹⁸,
Franka Grünewald¹⁸,
Matthias Bauer¹⁸ &
…
Christoph Meinel¹⁸

Conference paper

1864 Accesses
5 Citations

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8167))

Abstract

In the last decade e-lecturing has become more and more popular. The amount of lecture video data on the World Wide Web (WWW) is growing rapidly. Therefore, a more efficient method for video retrieval in WWW or within large lecture video archives is urgently needed. This paper presents an approach for automated video indexing and video search in large lecture video archives. First of all, we apply automatic video segmentation and key-frame detection to offer a visual guideline for the video content navigation. Subsequently, we extract textual metadata by applying video Optical Character Recognition (OCR) technology on key-frames and by performing Automatic Speech Recognition (ASR) on lecture audio tracks. The OCR and ASR transcript as well as detected slide text line types are adopted for keyword extraction, by which both video- and segment-level keywords are extracted respectively. Furthermore, we developed a content-based video search function and conducted a user study for evaluating the performance and the effectiveness of proposed indexing methods in our lecture video archive.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Adcock, J., Cooper, M., Denoue, L., Pirsiavash, H.: Talkminer: A lecture webcast search engine. In: Proc. of the ACM International conference on Multimedia, MM 2010, pp. 241–250. ACM, Firenze (2010)
Google Scholar
Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.G.: DBpedia: A nucleus for a web of open data. In: Aberer, K., Choi, K.-S., Noy, N., Allemang, D., Lee, K.-I., Nixon, L.J.B., Golbeck, J., Mika, P., Maynard, D., Mizoguchi, R., Schreiber, G., Cudré-Mauroux, P. (eds.) ASWC 2007 and ISWC 2007. LNCS, vol. 4825, pp. 722–735. Springer, Heidelberg (2007)
Chapter Google Scholar
Wang, T.-C.P.F., Ngo, C.-W.: Structuring low-quality videotaped lectures for cross-reference browsing by video text analysis. Journal of Pattern Recognition 41(10), 3257–3269 (2008)
Article Google Scholar
Glass, J., Hazen, T.J., Hetherington, L., Wang, C.: Analysis and processing of lecture audio data: Preliminary investigations. In: Proc. of the HLT-NAACL Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval (2004)
Google Scholar
Haubold, A., Kender, J.R.: Augmented segmentation and visualization for presentation videos. In: Proc. of the 13th Annual ACM International Conference on Multimedia, pp. 51–60. ACM (2005)
Google Scholar
Lee, D., Lee, G.G.: A korean spoken document retrieval system for lecture search. In: Proc. of the SSCS Speech Search Workshop at SIGIR (2008)
Google Scholar
Leeuwis, E., Federico, M., Cettolo, M.: Language modeling and transcription of the ted corpus lectures. In: Proc. of the IEEE ICASSP, pp. 232–235. IEEE (2003)
Google Scholar
Nandzik, J., Litz, B., Flores-Herr, N., Löhden, A., Konya, I., Baum, D., Bergholz, A., Schönfu, D., Fey, C., Osterhoff, J., Waitelonis, J., Sack, H., Köhler, R., Ndjiki-Nya, P.: Contentus—technologies for next generation multimedia libraries. In: Multimedia Tools and Applications, pp. 1–43 (2012)
Google Scholar
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. Information Processing and Management, 513–523 (1988)
Google Scholar
Toutanova, K., Klein, D., Manning, C., Singer, Y.: Feature-rich part-of-speech tagging with a cyclic dependency network. In: Proc. of Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (HLT-NAACL 2003), pp. 252–259 (2003)
Google Scholar
Waitelonis, J., Sack, H.: Towards exploratory video search using linked data. Multimedia Tools and Applications 59, 645–672 (2012)
Article Google Scholar
Yang, H., Oehlke, C., Meinel, C.: An automated analysis and indexing framework for lecture video portal. In: Popescu, E., Li, Q., Klamma, R., Leung, H., Specht, M. (eds.) ICWL 2012. LNCS, vol. 7558, pp. 285–294. Springer, Heidelberg (2012)
Chapter Google Scholar
Yang, H., Sack, H., Meinel, C.: Lecture video indexing and analysis using video ocr technology. International Journal of Multimedia Processing and Technologies (JMPT) 2(4), 176–196 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Hasso-Plattner-Institute (HPI), University of Potsdam, Germany
Haojin Yang, Franka Grünewald, Matthias Bauer & Christoph Meinel

Authors

Haojin Yang
View author publications
You can also search for this author in PubMed Google Scholar
Franka Grünewald
View author publications
You can also search for this author in PubMed Google Scholar
Matthias Bauer
View author publications
You can also search for this author in PubMed Google Scholar
Christoph Meinel
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Digital Arts and Multimedia Design, Tajen University, No. 20, Weixin Road, 90741, Yanpu Township, Pingtung County, Taiwan
Jhing-Fa Wang
Department of Computer Science, City University of Hong Kong, Tat Chee Avenue, Kowloon, Hong Kong, China
Rynson Lau

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yang, H., Grünewald, F., Bauer, M., Meinel, C. (2013). Lecture Video Browsing Using Multimodal Information Resources. In: Wang, JF., Lau, R. (eds) Advances in Web-Based Learning – ICWL 2013. ICWL 2013. Lecture Notes in Computer Science, vol 8167. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41175-5_21

Download citation

DOI: https://doi.org/10.1007/978-3-642-41175-5_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41174-8
Online ISBN: 978-3-642-41175-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics