Lecture Video Browsing Using Multimodal Information Resources

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8167))

Abstract

Over the last decade, e-lecturing has become increasingly popular, and the amount of lecture video data on the World Wide Web (WWW) is growing rapidly. A more efficient method for video retrieval on the WWW and within large lecture video archives is therefore urgently needed. This paper presents an approach for automated video indexing and video search in large lecture video archives. First, we apply automatic video segmentation and key-frame detection to offer a visual guideline for navigating the video content. Next, we extract textual metadata by applying video Optical Character Recognition (OCR) to key-frames and Automatic Speech Recognition (ASR) to lecture audio tracks. The OCR and ASR transcripts, together with the detected slide text line types, are then used for keyword extraction at both the video level and the segment level. Finally, we developed a content-based video search function and conducted a user study to evaluate the performance and effectiveness of the proposed indexing methods in our lecture video archive.
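To make the segment-level keyword extraction step more concrete, the following is a minimal sketch and not the authors' implementation. It assumes that each video segment is given as a list of text snippets tagged with their origin (slide title line, slide body line, or ASR transcript) and that keywords are ranked by a TF-IDF score in which the term frequency is weighted by the snippet's origin. The weights in `SOURCE_WEIGHTS`, the stopword list, and the function names are all hypothetical and chosen only for illustration; the abstract does not specify the weighting scheme.

```python
import math
import re
from collections import Counter

# Hypothetical weights (assumption): a term on a slide title line counts
# more than one in the slide body or in the speech transcript.
SOURCE_WEIGHTS = {"title": 3.0, "body": 1.5, "asr": 1.0}

# Tiny illustrative stopword list (assumption, not from the paper).
STOPWORDS = {"the", "a", "an", "of", "and", "to", "in", "is", "for", "we"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stopwords."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS]


def weighted_tf(segment):
    """Sum source-weighted term counts over (source_type, text) pairs."""
    tf = Counter()
    for source, text in segment:
        weight = SOURCE_WEIGHTS[source]
        for token in tokenize(text):
            tf[token] += weight
    return tf


def segment_keywords(segments, top_k=3):
    """Return the top_k terms per segment, ranked by weighted TF-IDF."""
    tfs = [weighted_tf(segment) for segment in segments]
    n = len(segments)
    # Document frequency: in how many segments each term occurs.
    df = Counter()
    for tf in tfs:
        df.update(tf.keys())
    keywords = []
    for tf in tfs:
        scores = {
            term: freq * (math.log((1 + n) / (1 + df[term])) + 1.0)
            for term, freq in tf.items()
        }
        # Sort by descending score, then alphabetically for stable ties.
        ranked = sorted(scores, key=lambda term: (-scores[term], term))
        keywords.append(ranked[:top_k])
    return keywords


# Two toy segments with OCR text (title/body) and ASR text.
segments = [
    [("title", "Sorting algorithms"),
     ("body", "quicksort partition pivot"),
     ("asr", "today we look at quicksort and the pivot element")],
    [("title", "Graph algorithms"),
     ("body", "breadth first search queue"),
     ("asr", "a graph search uses a queue")],
]
print(segment_keywords(segments))
```

In this toy input, the word "algorithms" appears in both slide titles, so its inverse document frequency is lower and it falls behind terms that are specific to one segment. Video-level keywords could be obtained in the same way by treating each whole video as one document.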

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yang, H., Grünewald, F., Bauer, M., Meinel, C. (2013). Lecture Video Browsing Using Multimodal Information Resources. In: Wang, JF., Lau, R. (eds) Advances in Web-Based Learning – ICWL 2013. ICWL 2013. Lecture Notes in Computer Science, vol 8167. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41175-5_21

  • DOI: https://doi.org/10.1007/978-3-642-41175-5_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-41174-8

  • Online ISBN: 978-3-642-41175-5

  • eBook Packages: Computer Science, Computer Science (R0)
