Abstract
In this paper, a multimodal video indexing and retrieval system, MMVIRS, is presented. MMVIRS models the auditory, visual, and textual sources of video collections from a semantic perspective. Besides multimodality, our model is constituted on semantic hierarchies that enable us to access the video from different semantic levels. MMVIRS has been implemented with data annotation, querying and browsing parts. In the annotation part, metadata information and video semantics are extracted in hierarchical ways. In the querying part, semantic queries, spatial queries, regional queries, spatio-temporal queries, and temporal queries have been processed over video collections using the proposed model. In the browsing parts, video collections are navigated using category information, visual and auditory hierarchies.
This work is supported in part by Turkish State Planning Organization (DPT) under grant number 2004K120720.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Moncrieff, S., Dorai, C., Venkatesh, S.: Detecting indexical signs in film audio for scene interpretation. In: IEEE ICME 2001, Tokyo, Japan, pp. 1192–1195 (2001)
Nam, J., Alghoniemy, M., Tewfik, A.H.: Audio-visual content-based violent scene characterization. In: IEEE Int. Conf. on Image Processing, vol. 1, pp. 353–357 (1998)
Petkovic, M., Mihajlovic, V., Jonker, W.: Multi-Modal Extraction of Highlights from TV Formula 1 Programs. In: IEEE ICME, Lausanne, Switzerland, pp. 817–820 (2002)
Petkovic, M., Jonker, W.: A framework for video modeling. In: Eighteenth IASTED Int. Conf. Applied Informatics, Innsbruck, Austria (2000)
Informedia-II Digital Video Library, http://www.informedia.cs.cmu.edu/
Agius, H.W., Angelides, M.C.: Modeling content for semantic-level querying of multimedia. Multimedia Tools and Applications 15(1), 5–37 (2001)
Huang, Q., Puri, A., Liu, Z.: Multimedia search and retrieval: new concepts, system implementation, and application. IEEE Trans.on Circ. and Syst. for Video Tech. 10(5), 679–692 (2000)
Gibbon, D., Bejeja, L., Liu, Z., Renger, B., Shahraray, B.: Creating Personalized Video Presentations using Multimodal Processing. In: Furht, B. (ed.) Handbook of Multimedia Databases, pp. 1107–1131. CRC Press, Boca Raton (2003)
Snoek, C.G.M., Worring, M.: Multimodal video indexing: A review of the state-of-the-art. Multimedia Tools and Applications, 5–35 (2005)
Adalı, S., Candan, K.S., Chen, S., Erol, K., Subrahmanian, V.S.: The advanced video information system: data structures and query processing. Mult. Syst. 4, 172–186 (1996)
Dönderler, M.E., Saykol, E., Arslan, U., Ulusoy, O.: BilVideo: Design and Implementation of a Video Database Management System. Multimedia Tools and Applications (to appear)
Köprülü, M., Cicekli, N.K., Yazici, A.: Spatio-temporal querying in video databases. Inf. Sci. 160(1-4), 131–152 (2004)
Ekin, A., Tekalp, A.M., Mehrotra, R.: Integrated semantic-syntactic video modeling for search and browsing. IEEE Trans. on Multimedia 6(6), 839–851 (2004)
Li, J.Z., Özsu, M.T., Szafron, D.: Modeling of moving objects in a video database. In: Proc. of IEEE Int. Conf. on Multimedia Computing and Systems, Ottawa, Canada, pp. 336–343 (1997)
Durak, N., Yazıcı, A.: Semantic Video Modeling And Retrieval with Visual, Auditory, Textual Sources, MS. Thesis, Metu, Ankara (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Durak, N., Yazici, A. (2005). Multimodal Video Database Modeling, Querying and Browsing. In: Yolum, p., Güngör, T., Gürgen, F., Özturan, C. (eds) Computer and Information Sciences - ISCIS 2005. ISCIS 2005. Lecture Notes in Computer Science, vol 3733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11569596_82
Download citation
DOI: https://doi.org/10.1007/11569596_82
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29414-6
Online ISBN: 978-3-540-32085-2
eBook Packages: Computer ScienceComputer Science (R0)