Skip to main content

Multimodal Video Database Modeling, Querying and Browsing

  • Conference paper
  • 2612 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3733))

Abstract

In this paper, a multimodal video indexing and retrieval system, MMVIRS, is presented. MMVIRS models the auditory, visual, and textual sources of video collections from a semantic perspective. Besides multimodality, our model is constituted on semantic hierarchies that enable us to access the video from different semantic levels. MMVIRS has been implemented with data annotation, querying and browsing parts. In the annotation part, metadata information and video semantics are extracted in hierarchical ways. In the querying part, semantic queries, spatial queries, regional queries, spatio-temporal queries, and temporal queries have been processed over video collections using the proposed model. In the browsing parts, video collections are navigated using category information, visual and auditory hierarchies.

This work is supported in part by Turkish State Planning Organization (DPT) under grant number 2004K120720.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Moncrieff, S., Dorai, C., Venkatesh, S.: Detecting indexical signs in film audio for scene interpretation. In: IEEE ICME 2001, Tokyo, Japan, pp. 1192–1195 (2001)

    Google Scholar 

  2. Nam, J., Alghoniemy, M., Tewfik, A.H.: Audio-visual content-based violent scene characterization. In: IEEE Int. Conf. on Image Processing, vol. 1, pp. 353–357 (1998)

    Google Scholar 

  3. Petkovic, M., Mihajlovic, V., Jonker, W.: Multi-Modal Extraction of Highlights from TV Formula 1 Programs. In: IEEE ICME, Lausanne, Switzerland, pp. 817–820 (2002)

    Google Scholar 

  4. Petkovic, M., Jonker, W.: A framework for video modeling. In: Eighteenth IASTED Int. Conf. Applied Informatics, Innsbruck, Austria (2000)

    Google Scholar 

  5. Informedia-II Digital Video Library, http://www.informedia.cs.cmu.edu/

  6. Agius, H.W., Angelides, M.C.: Modeling content for semantic-level querying of multimedia. Multimedia Tools and Applications 15(1), 5–37 (2001)

    Article  MATH  Google Scholar 

  7. Huang, Q., Puri, A., Liu, Z.: Multimedia search and retrieval: new concepts, system implementation, and application. IEEE Trans.on Circ. and Syst. for Video Tech. 10(5), 679–692 (2000)

    Article  Google Scholar 

  8. Gibbon, D., Bejeja, L., Liu, Z., Renger, B., Shahraray, B.: Creating Personalized Video Presentations using Multimodal Processing. In: Furht, B. (ed.) Handbook of Multimedia Databases, pp. 1107–1131. CRC Press, Boca Raton (2003)

    Google Scholar 

  9. Snoek, C.G.M., Worring, M.: Multimodal video indexing: A review of the state-of-the-art. Multimedia Tools and Applications, 5–35 (2005)

    Google Scholar 

  10. Adalı, S., Candan, K.S., Chen, S., Erol, K., Subrahmanian, V.S.: The advanced video information system: data structures and query processing. Mult. Syst. 4, 172–186 (1996)

    Article  Google Scholar 

  11. Dönderler, M.E., Saykol, E., Arslan, U., Ulusoy, O.: BilVideo: Design and Implementation of a Video Database Management System. Multimedia Tools and Applications (to appear)

    Google Scholar 

  12. Köprülü, M., Cicekli, N.K., Yazici, A.: Spatio-temporal querying in video databases. Inf. Sci. 160(1-4), 131–152 (2004)

    Article  Google Scholar 

  13. Ekin, A., Tekalp, A.M., Mehrotra, R.: Integrated semantic-syntactic video modeling for search and browsing. IEEE Trans. on Multimedia 6(6), 839–851 (2004)

    Article  Google Scholar 

  14. Li, J.Z., Özsu, M.T., Szafron, D.: Modeling of moving objects in a video database. In: Proc. of IEEE Int. Conf. on Multimedia Computing and Systems, Ottawa, Canada, pp. 336–343 (1997)

    Google Scholar 

  15. Durak, N., Yazıcı, A.: Semantic Video Modeling And Retrieval with Visual, Auditory, Textual Sources, MS. Thesis, Metu, Ankara (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Durak, N., Yazici, A. (2005). Multimodal Video Database Modeling, Querying and Browsing. In: Yolum, p., Güngör, T., Gürgen, F., Özturan, C. (eds) Computer and Information Sciences - ISCIS 2005. ISCIS 2005. Lecture Notes in Computer Science, vol 3733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11569596_82

Download citation

  • DOI: https://doi.org/10.1007/11569596_82

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29414-6

  • Online ISBN: 978-3-540-32085-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics