Abstract
A central problem in handling multimedia databases is navigating and searching their content. The problem arises from the difference between the textual description (annotation) that can be given for the database content and its visual appearance. Overcoming this so-called semantic gap has been a focus of research for some time. This paper presents a new system for similarity-based browsing of multimedia databases. The system aims to narrow the semantic gap by using a tree structure built on balanced hierarchical clustering. With this approach, operators are provided with an intuitive and easy-to-use browsing tool. An important objective of this paper is not only to describe the database organization and retrieval structure, but also to show how the illustrated techniques might be integrated into a single system.
Our main contribution is the direct use of a balanced tree structure for navigating the database of keyframes, paired with an easy-to-use interface that offers a coarse-to-fine, similarity-based view of the grouped database content.
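To make the idea of a balanced tree over keyframes concrete, the following is a minimal sketch and not the system's actual method. The abstract does not specify the clustering algorithm, so the sketch substitutes a simple median split along the highest-variance feature dimension (similar to a k-d tree), which guarantees that the two subtrees of every node differ in size by at most one item. The names `Node`, `build_balanced_tree`, `depth`, the `leaf_size` parameter, and the choice of the middle item as a node's representative are all hypothetical and used for illustration only.

```python
from dataclasses import dataclass, field
from statistics import pvariance


@dataclass
class Node:
    """One tree node: the keyframe ids below it and its child nodes."""
    items: list
    children: list = field(default_factory=list)

    @property
    def representative(self):
        # Hypothetical choice: preview a node by its middle item.
        return self.items[len(self.items) // 2]


def build_balanced_tree(features, ids=None, leaf_size=2):
    """Recursively halve the items so sibling subtrees differ by at most one.

    features: dict mapping keyframe id -> feature vector (list of floats).
    This is a stand-in for balanced hierarchical clustering, not the
    clustering used by the paper.
    """
    if ids is None:
        ids = sorted(features)
    if len(ids) <= leaf_size:
        return Node(items=list(ids))
    dims = range(len(features[ids[0]]))
    # Split along the dimension in which the items vary most.
    axis = max(dims, key=lambda d: pvariance([features[i][d] for i in ids]))
    ordered = sorted(ids, key=lambda i: features[i][axis])
    mid = len(ordered) // 2
    node = Node(items=ordered)
    node.children = [
        build_balanced_tree(features, ordered[:mid], leaf_size),
        build_balanced_tree(features, ordered[mid:], leaf_size),
    ]
    return node


def depth(node):
    """Number of levels from this node down to its deepest leaf."""
    return 1 + max((depth(c) for c in node.children), default=0)
```

In such a structure, a coarse-to-fine view would show the representative of each child of the current node and descend into whichever child the operator selects. Because the tree is balanced, the number of steps to reach any keyframe grows only logarithmically with the size of the database.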
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Borth, D., Schulze, C., Ulges, A., Breuel, T.M. (2008). Navidgator - Similarity Based Browsing for Image and Video Databases. In: Dengel, A.R., Berns, K., Breuel, T.M., Bomarius, F., Roth-Berghofer, T.R. (eds) KI 2008: Advances in Artificial Intelligence. KI 2008. Lecture Notes in Computer Science, vol 5243. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85845-4_3
DOI: https://doi.org/10.1007/978-3-540-85845-4_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85844-7
Online ISBN: 978-3-540-85845-4
eBook Packages: Computer Science (R0)