Object-Based Image Retrieval Beyond Visual Appearances

Zheng, Yan-Tao; Neo, Shi-Yong; Chua, Tat-Seng; Tian, Qi

doi:10.1007/978-3-540-77409-9_2


Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4903)

Included in the following conference series:

International Conference on Multimedia Modeling

Abstract

The performance of object-based image retrieval systems remains unsatisfactory, as it relies heavily on visual similarity and regularity among images of the same semantic class. To retrieve images beyond their visual appearances, we propose a novel image representation, the bag of visual synsets. A visual synset is defined as a probabilistic relevance-consistent cluster of visual words (quantized vectors of region descriptors such as SIFT), in which the member visual words w induce similar semantic inference P(c|w) towards the image class c. Visual synsets can be obtained by finding an optimal distributional clustering of visual words, based on the Information Bottleneck principle. Tests on the Caltech-256 dataset show that, by fusing visual words in a relevance-consistent way, the visual synset can partially bridge the visual differences among images of the same class and deliver satisfactory retrieval of relevant images with different visual appearances.
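
The clustering idea in the abstract can be illustrated with a minimal sketch. It assumes a greedy agglomerative form of Information Bottleneck clustering, in which the two clusters whose merger loses the least information about the image class are fused at each step. The abstract does not state which procedure the authors use, so this is one plausible reading rather than their method. The function names, the word-by-class count matrix `counts`, and the target number of synsets are all hypothetical, and every visual word is assumed to occur at least once.

```python
import numpy as np


def kl(p, q):
    """KL divergence D(p || q) in nats; terms where p == 0 contribute 0."""
    mask = p > 0
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))


def merge_cost(p_i, p_j, c_i, c_j):
    """Class information lost by merging two clusters.

    p_i, p_j are the cluster priors; c_i, c_j are their P(c | cluster) vectors.
    The cost is the prior-weighted Jensen-Shannon divergence between them.
    """
    total = p_i + p_j
    w_i, w_j = p_i / total, p_j / total
    mix = w_i * c_i + w_j * c_j
    return total * (w_i * kl(c_i, mix) + w_j * kl(c_j, mix))


def visual_synsets(counts, n_synsets):
    """Group visual words into n_synsets clusters with similar P(c | w).

    counts[w, c] is the (hypothetical) number of times visual word w
    occurs in training images of class c; each row must be non-zero.
    """
    counts = np.asarray(counts, dtype=float)
    p_w = counts.sum(axis=1) / counts.sum()
    p_c_given_w = counts / counts.sum(axis=1, keepdims=True)

    # Each cluster is (member word ids, prior, class-conditional vector).
    clusters = [([w], p_w[w], p_c_given_w[w]) for w in range(len(p_w))]

    while len(clusters) > n_synsets:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                cost = merge_cost(clusters[i][1], clusters[j][1],
                                  clusters[i][2], clusters[j][2])
                if best is None or cost < best[0]:
                    best = (cost, i, j)
        _, i, j = best
        m_i, p_i, c_i = clusters[i]
        m_j, p_j, c_j = clusters[j]
        merged = (m_i + m_j, p_i + p_j, (p_i * c_i + p_j * c_j) / (p_i + p_j))
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)

    return [sorted(members) for members, _, _ in clusters]
```

For instance, calling `visual_synsets([[9, 1], [8, 2], [1, 9], [2, 8]], 2)` on four toy visual words and two classes groups words 0 and 1 together and words 2 and 3 together, since each pair points towards the same class with similar strength even though nothing about their visual appearance is compared. The pairwise search is quadratic per merge, which is acceptable for a sketch but would need a cached cost matrix for a realistic vocabulary size.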

Author information

Authors and Affiliations

National University of Singapore, 3 Science Dr, 117543, Singapore
Yan-Tao Zheng, Shi-Yong Neo & Tat-Seng Chua
Institute for Infocomm Research (I2R), 21 Heng Mui Keng Terrace, 119613, Singapore
Qi Tian

Editor information

Shin’ichi Satoh, Frank Nack, Minoru Etoh

About this paper

Cite this paper

Zheng, YT., Neo, SY., Chua, TS., Tian, Q. (2008). Object-Based Image Retrieval Beyond Visual Appearances. In: Satoh, S., Nack, F., Etoh, M. (eds) Advances in Multimedia Modeling. MMM 2008. Lecture Notes in Computer Science, vol 4903. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77409-9_2

DOI: https://doi.org/10.1007/978-3-540-77409-9_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77407-5
Online ISBN: 978-3-540-77409-9
eBook Packages: Computer Science, Computer Science (R0)
