Abstract
In the last decade, e-lecturing has become increasingly popular, and the amount of lecture video data on the World Wide Web (WWW) is growing rapidly. A more efficient method for video retrieval on the WWW or within large lecture video archives is therefore urgently needed. This paper presents an approach for automated video indexing and video search in large lecture video archives. First, we apply automatic video segmentation and key-frame detection to offer a visual guideline for navigating the video content. Subsequently, we extract textual metadata by applying video Optical Character Recognition (OCR) to key-frames and Automatic Speech Recognition (ASR) to lecture audio tracks. The OCR and ASR transcripts, together with the detected slide text line types, are used for keyword extraction, which yields both video-level and segment-level keywords. Furthermore, we developed a content-based video search function and conducted a user study to evaluate the performance and effectiveness of the proposed indexing methods in our lecture video archive.
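To make the segment-level keyword extraction step more concrete, the following is a minimal sketch and not the authors' implementation. It assumes a TF-IDF style ranking in which terms from different text sources (slide titles, slide body lines, ASR transcript) receive different weights. The source names, the weight values in `SOURCE_WEIGHTS`, the tokenizer, and the function `segment_keywords` are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter

# Hypothetical weights per text source; the abstract gives no concrete values.
SOURCE_WEIGHTS = {"slide_title": 3.0, "slide_body": 1.5, "asr": 1.0}


def tokenize(text):
    """Lowercase, split on whitespace, keep alphabetic tokens longer than 2 chars."""
    return [t for t in text.lower().split() if t.isalpha() and len(t) > 2]


def segment_keywords(segments, top_k=3):
    """Rank keywords per segment with a source-weighted TF-IDF score.

    segments: list of dicts mapping a source name to its text.
    Returns one list of up to top_k keywords per segment.
    """
    # Weighted term frequency: each occurrence counts with its source weight.
    weighted_tf = []
    for seg in segments:
        tf = Counter()
        for source, text in seg.items():
            for tok in tokenize(text):
                tf[tok] += SOURCE_WEIGHTS.get(source, 1.0)
        weighted_tf.append(tf)

    # Document frequency: in how many segments each term appears.
    df = Counter()
    for tf in weighted_tf:
        df.update(tf.keys())

    n = len(segments)
    results = []
    for tf in weighted_tf:
        # Smoothed inverse document frequency (an assumption of this sketch).
        scores = {t: w * (math.log((1 + n) / (1 + df[t])) + 1.0) for t, w in tf.items()}
        # Sort by descending score, then alphabetically for deterministic ties.
        ranked = sorted(scores, key=lambda t: (-scores[t], t))
        results.append(ranked[:top_k])
    return results


segments = [
    {"slide_title": "video segmentation", "asr": "we detect slide transitions in the video"},
    {"slide_title": "speech recognition", "asr": "the audio track of the video is transcribed"},
]
keywords = segment_keywords(segments, top_k=2)
```

In this sketch, terms that occur in slide titles and are specific to one segment rank highest, while terms shared across all segments are down-weighted. Video-level keywords could be obtained in the same way by treating each whole video as one unit instead of each segment.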
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yang, H., Grünewald, F., Bauer, M., Meinel, C. (2013). Lecture Video Browsing Using Multimodal Information Resources. In: Wang, JF., Lau, R. (eds) Advances in Web-Based Learning – ICWL 2013. ICWL 2013. Lecture Notes in Computer Science, vol 8167. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41175-5_21
DOI: https://doi.org/10.1007/978-3-642-41175-5_21
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41174-8
Online ISBN: 978-3-642-41175-5
eBook Packages: Computer Science (R0)