Abstract
The performance of object-based image retrieval systems remains unsatisfactory, as it relies highly on visual similarity and regularity among images of same semantic class. In order to retrieve images beyond their visual appearances, we propose a novel image presentation, i.e. bag of visual synset. A visual synset is defined as a probabilistic relevance-consistent cluster of visual words (quantized vectors of region descriptors such as SIFT), in which the member visual words w induce similar semantic inference P(c|w) towards the image class c. The visual synset can be obtained by finding an optimal distributional clustering of visual words, based on Information Bottleneck principle. The testing on Caltech-256 datasets shows that by fusing the visual words in a relevance consistent way, the visual synset can partially bridge visual differences of images of same class and deliver satisfactory retrieval of relevant images with different visual appearances.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Jing, F., Li, M., Zhang, L., Zhang, H.-J., Zhang, B.: Learning in region-based image retrieval. In: Proceedings of Conference on Image and Video Retrieval, pp. 206–215 (2003)
Carson, C., Belongie, S., Greenspan, H., Malik, J.: Blobworld: Image Segmentation Using Expectation-Maximization and Its Application to Image Querying. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(8), 1026–1038 (2002)
Zheng, Q.-F., Wang, W.-Q., Gao, W.: Effective and efficient object-based image retrieval using visual phrases. In: Proceedings of ACM international conference on Multimedia, Santa Barbara, CA, USA, pp. 77–80 (2006)
Faloutsos, C., Barber, R., Flickner, M., Hafner, J., Niblack, W., Petkovic, D., Equitz, W.: Efficient and effective querying by image content. Journal of Intelligent Information Systems 3(3-4), 231–262 (1994)
Gupta, A.H., Jain, R.: Visual information retrieval. Communications of the ACM 40(5), 70–79 (1997)
Smith, J.R., Chang, S.-F.: VisualSEEk: a fully automated content-based image query system. In: Proceedings of ACM conference on Multimedia, Boston, U.S, pp. 87–98 (November 1996)
Wang, J.Z., Li, J., Wiederhold, G.: SIMPLIcity: Semantics-Sensitive Integrated Matching for Picture Libraries. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(9), 947–963 (2001)
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 20, 91–110 (2003)
Bekkerman, R., El-Yaniv, R., Tishby, N., Winter, Y.: Distributional word clusters vs. words for text categorization. Journal of Machine Learning Research g 3, 1183–1208 (2003)
Squire, D., Muller, W., Muller, H., Pun, T.: Content-based visual query of image databases: inspirations from text retrieval. Pattern Recognition Letters 21, 1193–1198 (2000)
Kadir, T., Brady, M.: Saliency, scale and image description. International Journal of Computer Vision 45(2), 83–105 (2001)
Bekkerman, R., El-Yaniv, R., Tishby, N., Winter, Y.: Distributional word clusters vs. words for text categorization. Journal of Machine Learning Research
Slonim, N., Friedman, N., Tishby, N.: Agglomerative multivariate information bottleneck. In: Advances in Neural Information Processing Systems (NIPS) (2001)
Yang, Y., Pedersen, J.O.: A comparative study on feature selection in text categorization. In: Proceedings of ICML, Nashville, US, pp. 412–420 (1997)
Liu, Y., Zhang, D., Lu, G., Ma, W.-Y.: A survey of content-based image retrieval with high-level semantics. Pattern Recognition 40(1), 262–282 (2007)
Witten, I.H., Moffat, A., Bell, T.C.: Managing gigabytes: compressing and indexing documents and images. Morgan Kaufmann Publishers Inc, San Francisco (1999)
Griffin, G., Holub, A., Perona, P.: The Caltech-256, Caltech Technical Report
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zheng, YT., Neo, SY., Chua, TS., Tian, Q. (2008). Object-Based Image Retrieval Beyond Visual Appearances. In: Satoh, S., Nack, F., Etoh, M. (eds) Advances in Multimedia Modeling. MMM 2008. Lecture Notes in Computer Science, vol 4903. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77409-9_2
Download citation
DOI: https://doi.org/10.1007/978-3-540-77409-9_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77407-5
Online ISBN: 978-3-540-77409-9
eBook Packages: Computer ScienceComputer Science (R0)