Using Visual Cues for the Extraction of Web Image Semantic Information

Tryfou, Georgina; Tsapatsoulis, Nicolas

doi:10.1007/978-3-642-33290-6_42

Georgina Tryfou¹⁹ &
Nicolas Tsapatsoulis¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7489))

Included in the following conference series:

International Conference on Theory and Practice of Digital Libraries

2251 Accesses
1 Citations

Abstract

Mining information for the images that currently exist in huge amounts on the web, has been a main scientific interest during the past years. Several methods have been exploited and web image information is extracted from textual sources such as image file names, anchor texts, existing keywords and, of course, surrounding text. However, the systems that attempt to mine information for images using surrounding text suffer from several problems, such as the inability to correctly assign all relevant text to an image and discard the irrelevant text as well. A novel method for extracting web image information is discussed in the present paper. The proposed system uses visual cues in order to cluster a web page into several regions and assign to each hosted image the text that most possibly refers to it. Three different approaches to the problem of text to image assignment are discussed and evaluated. The evaluation procedure indicates the advantages of using visual cues and two dimensional euclidean measures for extracting information for web images.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ortega-Binderberger, M., Mexico, A.: Webmars: A multimedia search engine for the world wide web (1999)
Google Scholar
Alexandre, L., Pereira, M., Madeira, S., Cordeiro, J., Dias, G.: Web image indexing: Combining image analysis with text processing. In: Proceedings of the 5th International Workshop on Image Analysis for Multimedia Interactive Services, WIAMIS 2004 (2004)
Google Scholar
Alcic, S., Conrad, S.: A clustering-based approach to web image context extraction. In: MMEDIA 2011 (2011)
Google Scholar
Cai, D., Yu, S., Wen, J.R., Ma, W.Y.: Vips: a vision based page segmentation algorithm. Technical report, Microsoft Research (2003)
Google Scholar
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Commun. ACM 18(11), 613–620 (1975)
Article MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Communication and Internet Studies, Cyprus University of Technology, Limassol, Cyprus
Georgina Tryfou & Nicolas Tsapatsoulis

Authors

Georgina Tryfou
View author publications
You can also search for this author in PubMed Google Scholar
Nicolas Tsapatsoulis
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Multimedia and Graphic Arts, Cyprus University of Technology, 3036, Limassol, Cyprus
Panayiotis Zaphiris & Fernando Loizides &
School of Informatics, City University of London, Northampton Square, EC1V 0HB, London, UK
George Buchanan
School of Library, Archival and Information Studies, Irving K. Barber Learning Centre, The University of British Columbia, V6T 1Z3, Vancouver, BC, Canada
Edie Rasmussen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tryfou, G., Tsapatsoulis, N. (2012). Using Visual Cues for the Extraction of Web Image Semantic Information. In: Zaphiris, P., Buchanan, G., Rasmussen, E., Loizides, F. (eds) Theory and Practice of Digital Libraries. TPDL 2012. Lecture Notes in Computer Science, vol 7489. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33290-6_42

Download citation

DOI: https://doi.org/10.1007/978-3-642-33290-6_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33289-0
Online ISBN: 978-3-642-33290-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics