Multimodal Video Database Modeling, Querying and Browsing

Durak, Nurcan; Yazici, Adnan

doi:10.1007/11569596_82

Nurcan Durak¹⁹ &
Adnan Yazici¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3733))

Included in the following conference series:

International Symposium on Computer and Information Sciences

2726 Accesses

Abstract

In this paper, a multimodal video indexing and retrieval system, MMVIRS, is presented. MMVIRS models the auditory, visual, and textual sources of video collections from a semantic perspective. Besides multimodality, our model is constituted on semantic hierarchies that enable us to access the video from different semantic levels. MMVIRS has been implemented with data annotation, querying and browsing parts. In the annotation part, metadata information and video semantics are extracted in hierarchical ways. In the querying part, semantic queries, spatial queries, regional queries, spatio-temporal queries, and temporal queries have been processed over video collections using the proposed model. In the browsing parts, video collections are navigated using category information, visual and auditory hierarchies.

This work is supported in part by Turkish State Planning Organization (DPT) under grant number 2004K120720.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

An intelligent multimedia information system for multimodal content extraction and querying

Article 31 January 2017

METU-MMDS: An Intelligent Multimedia Database System for Multimodal Content Extraction and Querying

VERGE: A Multimodal Interactive Search Engine for Video Browsing and Retrieval

References

Moncrieff, S., Dorai, C., Venkatesh, S.: Detecting indexical signs in film audio for scene interpretation. In: IEEE ICME 2001, Tokyo, Japan, pp. 1192–1195 (2001)
Google Scholar
Nam, J., Alghoniemy, M., Tewfik, A.H.: Audio-visual content-based violent scene characterization. In: IEEE Int. Conf. on Image Processing, vol. 1, pp. 353–357 (1998)
Google Scholar
Petkovic, M., Mihajlovic, V., Jonker, W.: Multi-Modal Extraction of Highlights from TV Formula 1 Programs. In: IEEE ICME, Lausanne, Switzerland, pp. 817–820 (2002)
Google Scholar
Petkovic, M., Jonker, W.: A framework for video modeling. In: Eighteenth IASTED Int. Conf. Applied Informatics, Innsbruck, Austria (2000)
Google Scholar
Informedia-II Digital Video Library, http://www.informedia.cs.cmu.edu/
Agius, H.W., Angelides, M.C.: Modeling content for semantic-level querying of multimedia. Multimedia Tools and Applications 15(1), 5–37 (2001)
Article MATH Google Scholar
Huang, Q., Puri, A., Liu, Z.: Multimedia search and retrieval: new concepts, system implementation, and application. IEEE Trans.on Circ. and Syst. for Video Tech. 10(5), 679–692 (2000)
Article Google Scholar
Gibbon, D., Bejeja, L., Liu, Z., Renger, B., Shahraray, B.: Creating Personalized Video Presentations using Multimodal Processing. In: Furht, B. (ed.) Handbook of Multimedia Databases, pp. 1107–1131. CRC Press, Boca Raton (2003)
Google Scholar
Snoek, C.G.M., Worring, M.: Multimodal video indexing: A review of the state-of-the-art. Multimedia Tools and Applications, 5–35 (2005)
Google Scholar
Adalı, S., Candan, K.S., Chen, S., Erol, K., Subrahmanian, V.S.: The advanced video information system: data structures and query processing. Mult. Syst. 4, 172–186 (1996)
Article Google Scholar
Dönderler, M.E., Saykol, E., Arslan, U., Ulusoy, O.: BilVideo: Design and Implementation of a Video Database Management System. Multimedia Tools and Applications (to appear)
Google Scholar
Köprülü, M., Cicekli, N.K., Yazici, A.: Spatio-temporal querying in video databases. Inf. Sci. 160(1-4), 131–152 (2004)
Article Google Scholar
Ekin, A., Tekalp, A.M., Mehrotra, R.: Integrated semantic-syntactic video modeling for search and browsing. IEEE Trans. on Multimedia 6(6), 839–851 (2004)
Article Google Scholar
Li, J.Z., Özsu, M.T., Szafron, D.: Modeling of moving objects in a video database. In: Proc. of IEEE Int. Conf. on Multimedia Computing and Systems, Ottawa, Canada, pp. 336–343 (1997)
Google Scholar
Durak, N., Yazıcı, A.: Semantic Video Modeling And Retrieval with Visual, Auditory, Textual Sources, MS. Thesis, Metu, Ankara (2004)
Google Scholar

Download references

Author information

Authors and Affiliations

Dept. of Computer Engineering, Middle East Technical University, Ankara, Turkey
Nurcan Durak & Adnan Yazici

Authors

Nurcan Durak
View author publications
You can also search for this author in PubMed Google Scholar
Adnan Yazici
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Engineering, Boğaziçi University, 34342, Bebek, Istanbul, Turkey
pInar Yolum & Can Özturan &
Computer Engineering Department, Boğaziçi University, 34342, Bebek, İstanbul, Turkey
Tunga Güngör
Computer Engineering Department, Bogazici University, 80815, Bebek, Istanbul, Turkey
Fikret Gürgen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Durak, N., Yazici, A. (2005). Multimodal Video Database Modeling, Querying and Browsing. In: Yolum, p., Güngör, T., Gürgen, F., Özturan, C. (eds) Computer and Information Sciences - ISCIS 2005. ISCIS 2005. Lecture Notes in Computer Science, vol 3733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11569596_82

Download citation

DOI: https://doi.org/10.1007/11569596_82
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29414-6
Online ISBN: 978-3-540-32085-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics