Contextual Image Annotation via Projection and Quantum Theory Inspired Measurement for Integration of Text and Visual Features

Kaliciak, Leszek; Wang, Jun; Song, Dawei; Zhang, Peng; Hou, Yuexian

doi:10.1007/978-3-642-24971-6_23

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 7052)

Included in the following conference series:

International Symposium on Quantum Interaction

Abstract

Multimedia information retrieval suffers from the semantic gap: the difference between how humans perceive images and how machines represent them. To reduce this gap, a quantum theory inspired theoretical framework for integrating text and visual features has been proposed, and this paper is a follow-up to that model. Previously, two relatively straightforward statistical approaches for associating the dimensions of the two feature spaces were employed, but with unsatisfactory results. Here, we propose to alleviate the problem of unannotated images by projecting them onto subspaces representing visual context and by incorporating a quantum-like measurement. The proposed principled approach extends the traditional vector space model (VSM) and integrates seamlessly with the tensor-based framework. We test the novel association methods in a small-scale experiment.
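
The abstract gives no formulas, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes that the visual context of a term is the subspace spanned by the visual feature vectors of images already annotated with that term, and that the quantum-like measurement is the squared length of an unannotated image's projection onto that subspace. All names (`orthonormal_basis`, `projection_probability`, `context`) and the toy vectors are hypothetical.

```python
import math

def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    """Scale v to unit length."""
    norm = math.sqrt(dot(v, v))
    if norm == 0:
        raise ValueError("cannot normalize the zero vector")
    return [a / norm for a in v]

def orthonormal_basis(vectors, tol=1e-12):
    """Gram-Schmidt: orthonormal basis for the span of `vectors`."""
    basis = []
    for v in vectors:
        w = list(v)
        for b in basis:
            coeff = dot(w, b)
            w = [wi - coeff * bi for wi, bi in zip(w, b)]
        # Skip vectors that are (numerically) already in the span.
        if math.sqrt(dot(w, w)) > tol:
            basis.append(normalize(w))
    return basis

def projection_probability(state, basis):
    """Squared length of the projection of the normalized `state`
    onto the subspace spanned by the orthonormal `basis`."""
    unit = normalize(state)
    return sum(dot(unit, b) ** 2 for b in basis)

# Hypothetical toy data: visual feature vectors of images
# that already carry each annotation term.
context = {
    "beach": [[0.9, 0.1, 0.0], [0.8, 0.2, 0.1]],
    "forest": [[0.0, 0.2, 0.9]],
}
subspaces = {term: orthonormal_basis(vecs) for term, vecs in context.items()}

# Score an unannotated image against every context subspace.
unannotated = [1.0, 0.0, 0.0]
scores = {
    term: projection_probability(unannotated, basis)
    for term, basis in subspaces.items()
}
best_term = max(scores, key=scores.get)
print(best_term, scores)
```

Under these assumptions each score lies between 0 and 1 and can be read as the probability that the image belongs to the term's visual context. The highest-scoring term is then a candidate annotation for the image.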

Author information

Authors and Affiliations

The Robert Gordon University, Aberdeen, UK
Leszek Kaliciak, Jun Wang, Dawei Song & Peng Zhang
Tianjin University, Tianjin, China
Yuexian Hou

Editor information

Editors and Affiliations

School of Computing, The Robert Gordon University, St. Andrew Street, AB25 1HG, Aberdeen, UK
Dawei Song
Department of Information Engineering, University of Padua, Via Gradenigo 6/B, 35131, Padova, Italy
Massimo Melucci
Department of Computer Science and Technology, University of Bedfordshire, Park Square, LU1 3JU, Luton, UK
Ingo Frommholz
School of Computing, The Robert Gordon University, St. Andrew Street, AB25 1HG, Aberdeen, UK
Peng Zhang & Lei Wang
School of Computing, University of Glasgow, 18 Lilybank Gardens, G12 8QQ, Glasgow, UK
Sachi Arafat

About this paper

Cite this paper

Kaliciak, L., Wang, J., Song, D., Zhang, P., Hou, Y. (2011). Contextual Image Annotation via Projection and Quantum Theory Inspired Measurement for Integration of Text and Visual Features. In: Song, D., Melucci, M., Frommholz, I., Zhang, P., Wang, L., Arafat, S. (eds) Quantum Interaction. QI 2011. Lecture Notes in Computer Science, vol 7052. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24971-6_23

DOI: https://doi.org/10.1007/978-3-642-24971-6_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24970-9
Online ISBN: 978-3-642-24971-6
eBook Packages: Computer Science, Computer Science (R0)
