Exploring the Semantics behind a Collection to Improve Automated Image Annotation

Llorente, Ainhoa; Motta, Enrico; Rüger, Stefan

doi:10.1007/978-3-642-15751-6_40

Ainhoa Llorente²³,
Enrico Motta²³ &
Stefan Rüger²³

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6242))

Included in the following conference series:

Workshop of the Cross-Language Evaluation Forum for European Languages

488 Accesses
3 Citations

Abstract

The goal of this research is to explore several semantic relatedness measures that help to refine annotations generated by a baseline non-parametric density estimation algorithm. Thus, we analyse the benefits of performing a statistical correlation using the training set or using the World Wide Web versus approaches based on a thesaurus like WordNet or Wikipedia (considered as a hyperlink structure). Experiments are carried out using the dataset provided by the 2009 edition of the ImageCLEF competition, a subset of the MIR-Flickr 25k collection. Best results correspond to approaches based on statistical correlation as they do not depend on a prior disambiguation phase like WordNet and Wikipedia. Further work needs to be done to assess whether proper disambiguation schemas might improve their performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Miller, G.A., Charles, W.G.: Contextual correlates of semantic similarity. Journal of Language and Cognitive Processes 6, 1–28 (1991)
Article Google Scholar
Llorente, A., Rüger, S.: Using second order statistics to enhance automated image annotation. In: Proceedings of the 31st European Conference on Information Retrieval, vol. 5478, pp. 570–577 (2009)
Google Scholar
Nowak, S., Dunker, P.: Overview of the CLEF 2009 Large Scale – Visual Concenpt Detection and Annotation Task. In: Peters, C., et al. (eds.) CLEF 2009 Workshop, Part II. LNCS, vol. 6242, pp. 94–109. Springer, Heidelberg (2010)
Google Scholar
Yavlinsky, A., Schofield, E., Rüger, S.: Automated image annotation using global features and robust nonparametric density estimation. In: Proceedings of the International ACM Conference on Image and Video Retrieval, pp. 507–517 (2005)
Google Scholar
Gracia, J., Mena, E.: Web-based measure of semantic relatedness. In: Bailey, J., Maier, D., Schewe, K.-D., Thalheim, B., Wang, X.S. (eds.) WISE 2008. LNCS, vol. 5175, pp. 136–150. Springer, Heidelberg (2008)
Chapter Google Scholar
Cilibrasi, R., Vitanyi, P.: The Google similarity distance. IEEE Transactions on Knowledge and Data Engineering 19(3), 370–383 (2007)
Article Google Scholar
Budanitsky, A., Hirst, G.: Evaluating WordNet-based Measures of Lexical Semantic Relatedness. Computational Linguistics 32(1), 13–47 (2006)
Article Google Scholar
Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. In: Proceedings of International Conference Research on Computational Linguistics (1997)
Google Scholar
Hirst, G., St-Onge, D.: Lexical chains as representations of context for the detection and correction of malapropisms. In: WordNet: A Lexical Database for English, pp. 305–332. The MIT Press, Cambridge (1998)
Google Scholar
Resnik, P.: Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the 14th International Joint Conference on Artificial Intelligence, pp. 448–453 (1995)
Google Scholar
Banerjee, S., Pedersen, T.: Extended gloss overlaps as a measure of semantic relatedness. In: Proceedings of the Eighteenth International Conference on Artificial Intelligence (2003)
Google Scholar
Medelyan, O., Milne, D., Legg, C., Witten, I.H.: Mining meaning from wikipedia. International Journal of Human-Computer Studies 67(9), 716–754 (2009)
Article Google Scholar
Ponzetto, S., Strube, M.: Knowledge derived from wikipedia for computing semantic relatedness. Journal of Artificial Intelligence Research (JAIR) 30, 181–212 (2007)
MATH Google Scholar
Gabrilovich, E., Markovitch, S.: Computing semantic relatedness using wikipedia-based explicit semantic analysis. In: Proceedings of the 20th International Joint Conference for Artificial Intelligence, pp. 1606–1611 (2007)
Google Scholar
Milne, D., Witten, I.: An effective, low-cost measure of semantic relatedness obtained from wikipedia links. In: Proceedings of the first AAAI Workshop on Wikipedia and Artifical Intellegence (2008)
Google Scholar
Agirre, E., Alfonseca, E., Hall, K., Kravalova, J., Pasca, M., Soroa, A.: A study on similarity and relatedness using distributional and wordnet-based approaches. In: Proceedings of NAACL-HLT (2009)
Google Scholar
Nowak, S., Lukashevich, H.: Multilabel classification evaluation using ontology information. In: Proceedings of ESWC Workshop on Inductive Reasoning and Machine Learning on the Semantic Web (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Knowledge Media Institute, The Open University, Walton Hall, Milton Keynes, MK7 6AA, United Kingdom
Ainhoa Llorente, Enrico Motta & Stefan Rüger

Authors

Ainhoa Llorente
View author publications
You can also search for this author in PubMed Google Scholar
Enrico Motta
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Rüger
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

ISTI-CNR, Area Ricerca CNR, Via Moruzzi, 1, 56124, Pisa, Italy
Carol Peters
Idiap Research Institute, Rue Marconi 19, 1920, Martigny, Switzerland
Barbara Caputo
LSI-UNED, Juan del Rosal, 16, 28040, Madrid, Spain
Julio Gonzalo
Centre for Digital Video Processing, School of Computing, Dublin City University, Dublin 9, Ireland
Gareth J. F. Jones
Oregon Health and Science University, 3181 SW Sam Jackson Park Road, 97239-3098, Portland, OR, USA
Jayashree Kalpathy-Cramer
University of Applied Sciences Western Switzerland, TechnoArk 3, 3960, Sierre, Switzerland
Henning Müller
Centrum Wiskunde and Infoormatica, Science Park 123, 1098, Amsterdam, XG, The Netherlands
Theodora Tsikrika

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Llorente, A., Motta, E., Rüger, S. (2010). Exploring the Semantics behind a Collection to Improve Automated Image Annotation. In: Peters, C., et al. Multilingual Information Access Evaluation II. Multimedia Experiments. CLEF 2009. Lecture Notes in Computer Science, vol 6242. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15751-6_40

Download citation

DOI: https://doi.org/10.1007/978-3-642-15751-6_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15750-9
Online ISBN: 978-3-642-15751-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics