Abstract
Multimedia information retrieval suffers from the semantic gap, a difference between human perception and machine representation of images. In order to reduce the gap, a quantum theory inspired theoretical framework for integration of text and visual features has been proposed. This article is a follow-up work on this model. Previously, two relatively straightforward statistical approaches for making associations between dimensions of both feature spaces were employed, but with unsatisfactory results. In this paper, we propose to alleviate the problem regarding unannotated images by projecting them onto subspaces representing visual context and by incorporating a quantum-like measurement. The proposed principled approach extends the traditional vector space model (VSM) and seamlessly integrates with the tensor-based framework. Here, we experimentally test the novel association methods in a small-scale experiment.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Duygulu, P., Barnard, K., de Freitas, J.F.G., Forsyth, D.: Object Recognition as Machine Translation: Learning a Lexicon for a Fixed Image Vocabulary. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 349–354. Springer, Heidelberg (2002)
Yang, J., Jiang, Y.G., Hauptmann, A.G., Ngo, C.W.: Evaluating Bag-of-Visual-Words Representations in Scene Classification. In: Proc. of the Int. Workshop on Multimedia IR, vol. 206 (2007)
Jamieson, M., Dickinson, S., Stevenson, S., Wachsmuth, S.: Using Language to Drive the Perceptual Grouping of Local Image Features. In: IEEE Comp. Society Conference on Comp. Vision and Pattern Rec., vol. 2, pp. 2102–2109 (2006)
Li, J., Wang, J.Z.: Real-Time Computerized Annotation of Pictures. IEEE Tran. on Pattern Anal. and Machine Int. 30, 985–1002 (2008)
Yanai, K.: Generic Image Classification Using Visual Knowledge on the Web. In: Proc. of the 11-th ACM Int. Conf. on Multimedia, pp. 167–176 (2003)
Tjondronegoro, D., Zhang, J., Gu, J., Nguyen, A., Geva, S.: Integrating Text Retrieval and Image Retrieval in XML Document Searching. In: Advances in XML Inf. Retr. and Evaluation (2005)
Rahman, M.M., Bhattacharya, P., Desai, B.C.: A Unified Image Retrieval Framework on Local Visual and Semantic Concept-Based Feature Spaces. J. Visual Communication and Image Representation 20, 450–462 (2009)
Simpson, M., Rahaman, M.M.: Text and Content Based Approaches to Image Retrieval for the ImageClef2009 Medical Retrieval Track. In: Working Notes for the CLEF 2009 Workshop (2009)
Min, P., Kazhdan, M., Funkhouser, T.: A comparison of text and shape matching for retrieval of online 3D models. In: Heery, R., Lyon, L. (eds.) ECDL 2004. LNCS, vol. 3232, pp. 209–220. Springer, Heidelberg (2004)
van Rijsbergen, C.J.: The Geometry of Information Retrieval. Cambridge University Press, Cambridge (2004)
Griffiths, R.B.: Consistent Quantum Theory. Cambridge University Press, Cambridge (2003)
Melucci, M.: Context Modeling and Discovery Using Vector Space Bases. In: Proc. of the ACM Conf. on Inf. and Knowledge Management, pp. 808–815 (2005)
Di Buccio, E., Melucci, M., Song, D.: Towards Predicting Relevance Using a Quantum-Like Framework. In: The 33rd European Conference on IR, pp. 19–21 (2011)
Biancalana, C., Lapolla, A., Micarelli, A.: Personalized web search using correlation matrix for query expansion. In: Cordeiro, J., Hammoudi, S., Filipe, J. (eds.) Web Information Systems and Technologies. LNBIP, vol. 18, pp. 186–198. Springer, Heidelberg (2009)
Aharonov, Y., Albert, D.Z., Au, C.K.: New Interpretation of the Scalar Product in Hilbert Space. Phys. Rev. Lett. 47, 1029–1031 (1981)
Wang, J., Song, D., Kaliciak, L.: Tensor Product of Correlated Text and Visual Features: A Quantum Theory Inspired Image Retrieval Framework. In: AAAI-Fall 2010 Symp. on Quant. Inf. for Cognitive, Social, and Semantic Processes, pp. 109–116 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kaliciak, L., Wang, J., Song, D., Zhang, P., Hou, Y. (2011). Contextual Image Annotation via Projection and Quantum Theory Inspired Measurement for Integration of Text and Visual Features. In: Song, D., Melucci, M., Frommholz, I., Zhang, P., Wang, L., Arafat, S. (eds) Quantum Interaction. QI 2011. Lecture Notes in Computer Science, vol 7052. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24971-6_23
Download citation
DOI: https://doi.org/10.1007/978-3-642-24971-6_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24970-9
Online ISBN: 978-3-642-24971-6
eBook Packages: Computer ScienceComputer Science (R0)