Dynamic Visual Search Using Inner-Scene Similarity: Algorithms and Inherent Limitations

Avraham, Tamar; Lindenbaum, Michael

doi:10.1007/978-3-540-24671-8_5

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3022)

Included in the following conference series: European Conference on Computer Vision

Abstract

A dynamic visual search framework based mainly on inner-scene similarity is proposed. Algorithms, as well as measures quantifying the difficulty of search tasks, are suggested. Given a number of candidates (e.g., sub-images), our basic hypothesis is that more visually similar candidates are more likely to have the same identity. Both deterministic and stochastic approaches, relying on this hypothesis, are used to quantify this intuition. Under the deterministic approach, we suggest a measure similar to Kolmogorov's ε-covering that quantifies the difficulty of a search task and bounds the performance of all search algorithms. We also suggest a simple algorithm that meets this bound. Under the stochastic approach, we model the identities of the candidates as correlated random variables and characterize the task using its second-order statistics. We derive a search procedure based on minimum MSE linear estimation. Simple extensions enable the algorithm to use top-down and/or bottom-up information, when available.
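
The deterministic idea, that visually similar candidates tend to share an identity, can be illustrated with a farthest-first selection rule: after each non-target is examined, the next candidate chosen is the one farthest from everything examined so far, since it is the least likely to share their identity. The following is a minimal sketch under stated assumptions and not necessarily the algorithm of the paper. The function name `farthest_first_search`, the Euclidean distance on feature vectors, and the `is_target` oracle (standing in for an expensive recognizer) are all hypothetical choices made for illustration.

```python
import math


def farthest_first_search(features, is_target, start=0):
    """Examine candidates one at a time until a target is found.

    features  : list of equal-length numeric tuples (one per candidate)
    is_target : callable taking a candidate index, returning True/False
                (an assumed stand-in for a costly recognition step)
    Returns the list of candidate indices in the order they were examined.
    """
    n = len(features)
    examined = set()
    # Distance from each candidate to its nearest examined candidate.
    nearest = [math.inf] * n
    order = []
    current = start
    while True:
        order.append(current)
        examined.add(current)
        # Stop on success, or when every candidate has been examined.
        if is_target(current) or len(examined) == n:
            return order
        # The examined candidate was a non-target: update nearest distances.
        for j in range(n):
            if j not in examined:
                d = math.dist(features[current], features[j])
                nearest[j] = min(nearest[j], d)
        # Pick the unexamined candidate farthest from all examined ones.
        current = max(
            (j for j in range(n) if j not in examined),
            key=lambda j: nearest[j],
        )


# Four mutually similar candidates and one outlier (index 3) as the target.
features = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (0.1, 0.1)]
order = farthest_first_search(features, lambda i: i == 3)
print(order)  # [0, 3]
```

In this toy scene the search skips the three candidates that resemble the first rejected one and reaches the outlier on the second query. When many non-targets are mutually dissimilar, more queries are needed, which is the kind of task difficulty the covering-based measure in the abstract is meant to capture.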

References

Avraham, T., Lindenbaum, M.: A Probabilistic Estimation Approach for Dynamic Visual Search. In: Proceedings of International Workshop on Attention and Performance in Computer Vision (WAPCV), pp. 1–8 (2003)
Avraham, T., Lindenbaum, M.: CIS Report #CIS-2003-02. Technion - Israel Institute of Technology, Haifa 32000, Israel (2003)
Duncan, J., Humphreys, G.W.: Visual search and stimulus similarity. Psychological Review 96, 433–458 (1989)
Gonzalez, T.F.: Clustering to minimize the maximum intercluster distance. Theoretical Computer Science 38(2-3), 293–306 (1985)
Humphreys, G.W., Muller, H.J.: Search via recursive rejection (SERR): A connectionist model of visual search. Cognitive Psychology 25, 43–110 (1993)
Itti, L.: Models of bottom-up and top-down visual attention. Thesis (January 2000)
Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE PAMI 20(11), 1254–1259 (1998)
Koch, C., Ullman, S.: Shifts in selective visual attention: towards the underlying neural circuitry. Human Neurobiology 4, 219–227 (1985)
Kolmogorov, A.N., Tikhomirov, V.M.: Epsilon-entropy and epsilon-capacity of sets in functional spaces. AMS Translations. Series 2 17, 277–364 (1961)
Martin, D., Fowlkes, C., Tal, D., Malik, J.: A database of human segmented natural images and its application to evaluating segmentation algorithms and measuring ecological statistics. In: Proc. 8th ICCV, July 2001, vol. 2, pp. 416–423 (2001)
Neisser, U.: Cognitive Psychology. Appleton-Century-Crofts, New York (1967)
Nene, S., Nayar, S., Murase, H.: Columbia object image library (COIL-100). Technical Report CUCS-006-96, Department of Computer Science, Columbia University (February 1996)
Papoulis, A., Pillai, S.U.: Probability, Random Variables, and Stochastic Processes, 4th edn. McGraw-Hill, New York (2002)
Rao, R.P.N., Ballard, D.H.: An active vision architecture based on iconic representations. Artificial Intelligence 78(1-2), 461–505 (1995)
Rimey, R.D., Brown, C.M.: Control of selective perception using Bayes nets and decision theory. International Journal of Computer Vision 12, 173–207 (1994)
Swain, M.J., Ballard, D.H.: Color indexing. IJCV 7, 11–32 (1991)
Tagare, H., Toyama, K., Wang, J.G.: A maximum-likelihood strategy for directing attention during visual search. IEEE PAMI 23(5), 490–500 (2001)
Torralba, A., Sinha, P.: Statistical context priming for object detection. In: Proceedings of the 8th ICCV, pp. 763–770 (2001)
Treisman, A., Gelade, G.: A feature integration theory of attention. Cognitive Psychology 12, 97–136 (1980)
Tsotsos, J.K.: On the relative complexity of active versus passive visual search. IJCV 7(2), 127–141 (1992)
Tsotsos, J.K., Culhane, S.M., Wai, W.Y.K., Lai, Y., Davis, N., Nuflo, F.J.: Modeling visual attention via selective tuning. Artificial Intelligence 78(1-2), 507–545 (1995)
Wixson, L.E., Ballard, D.H.: Using intermediate objects to improve the efficiency of visual-search. IJCV 12(2-3), 209–230 (1994)
Wolfe, J.M.: Guided search 2.0: A revised model of visual search. Psychonomic Bulletin and Review 1(2), 202–238 (1994)
Yarbus, A.L.: Eye Movements and Vision. Plenum Press, New York (1967)

Author information

Authors and Affiliations

Computer Science Department, Technion, Haifa, 32000, Israel
Tamar Avraham & Michael Lindenbaum

Editor information

Editors and Affiliations

Center for Machine Perception, Department of Cybernetics, Faculty of Electrical Engineering, Czech Technical University, Prague 6, Czech Republic
Tomás Pajdla
Center for Machine Perception, Dept. of Cybernetics, Faculty of Elec. Eng., Czech Technical University in Prague, Karlovo nám. 13, 121 35, Prague, Czech Rep.
Jiří Matas

About this paper

Cite this paper

Avraham, T., Lindenbaum, M. (2004). Dynamic Visual Search Using Inner-Scene Similarity: Algorithms and Inherent Limitations. In: Pajdla, T., Matas, J. (eds) Computer Vision - ECCV 2004. ECCV 2004. Lecture Notes in Computer Science, vol 3022. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24671-8_5

DOI: https://doi.org/10.1007/978-3-540-24671-8_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21983-5
Online ISBN: 978-3-540-24671-8
eBook Packages: Springer Book Archive
