ABSTRACT
The widespread adoption of Web 2.0 applications has produced vast amounts of user-generated multimedia content, motivating its use as training data. However, because the associated annotations are global (image-level) rather than region-level, and because the accompanying information is noisy and often ambiguous, such examples are not directly suitable as learning samples. Nevertheless, the sheer volume of data currently hosted in social networks affords us the luxury of discarding a substantial number of candidate learning examples, provided we can devise a gauging mechanism that filters out ambiguous or noisy samples. Our objective in this work is to define a measure of visual ambiguity, which arises from the visual similarity of semantically dissimilar concepts, in order to support the selection of positive training regions from user-tagged images. This is achieved by limiting the search space to the images most likely to contain the desired regions, while excluding visually ambiguous objects that could confuse the selection algorithm. Experimental results show that employing visual ambiguity yields better separation between the targeted true positive regions and the undesired negative ones.
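The filtering mechanism described above can be sketched as follows: score how ambiguous each tagged image is for the target concept, then prune the candidate pool before region selection. This is a minimal illustration rather than the paper's implementation; the visual_similarity and semantic_similarity functions, the image record layout, and the threshold value are assumptions, standing in for concept-level scores that could come, for example, from classifier confusion statistics and a WordNet-based relatedness measure.

# Minimal sketch (not the authors' implementation) of pruning user-tagged
# images by a hypothetical visual-ambiguity score before region selection.
# visual_similarity(a, b) and semantic_similarity(a, b) are assumed to be
# caller-supplied functions returning scores in [0, 1].

def ambiguity(target, other, visual_similarity, semantic_similarity):
    # Concepts that look alike but mean different things are the
    # confusing ones: high visual similarity, low semantic similarity.
    return visual_similarity(target, other) * (1.0 - semantic_similarity(target, other))

def low_ambiguity_images(images, target, visual_similarity,
                         semantic_similarity, threshold=0.3):
    # Keep images tagged with the target concept whose co-occurring tags
    # are unlikely to confuse the positive-region selection step.
    selected = []
    for img in images:  # each img is a dict: {"id": ..., "tags": [...]}
        if target not in img["tags"]:
            continue
        others = [t for t in img["tags"] if t != target]
        if not others:
            selected.append(img)  # no co-occurring tags, nothing to confuse
            continue
        worst = max(ambiguity(target, t, visual_similarity,
                              semantic_similarity) for t in others)
        if worst <= threshold:
            selected.append(img)
    return selected

Under these assumptions, an image tagged {"sea", "sky"} would be rejected for the target "sea" if "sky" scores as visually similar yet semantically distant, whereas an image tagged {"sea", "boat"} would be retained; the surviving images form the reduced search space from which positive regions are then selected.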