Abstract
Saliency is an attribute that is not included in an object itself, but arises from complex relations to the scene. Common belief in neuroscience is that objects are eye-catching if they exhibit an anomaly in some basic feature of human perception. This enables detection of object-like structures without prior knowledge. In this paper, we introduce an approach that models these object-to-scene relations based on probability theory. We rely on the conventional structure of cognitive visual attention systems, measuring saliency by local center to surround differences on several basic feature cues and multiple scales, but innovate how to model appearance and to quantify differences. Therefore, we propose an efficient procedure to compute ML-estimates for (multivariate) normal distributions of local feature statistics. Reducing feature statistics to Gaussians facilitates a closed-form solution for the W 2-distance (Wasserstein metric based on the Euclidean norm) between a center and a surround distribution. On a widely used benchmark for salient object detection, our approach, named CoDi-Saliency (for Continuous Distributions), outperformed nine state-of-the-art saliency detectors in terms of precision and recall.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Achanta, R., Hemami, S., Estrada, F., Süsstrunk, S.: Frequency-tuned salient region detection. In: CVPR (2009)
Achanta, R., Süsstrunk, S.: Saliency detection using maximum symmetric surround. In: Proc. of Int’l Conf. on Image Processing, ICIP (2010)
Bruce, N., Tsotsos, J.: Saliency, attention, and visual search: An information theoretic approach. Journal of Vision 9(3), 1–24 (2009)
Frintrop, S.: VOCUS: A Visual Attention System for Object Detection and Goal-Directed Search. LNCS (LNAI), vol. 3899. Springer, Heidelberg (2006)
Frintrop, S., Rome, E., Christensen, H.I.: Computational visual attention systems and their cognitive foundation: A survey. ACM Trans. on Applied Perception 7(1) (2010)
Gao, D., Vasconcelos, N.: Bottom-up saliency is a discriminant process. In: Proc. of ICCV (2007)
Givens, C.R., Shortt, R.M.: A class of wasserstein metrics for probability distributions. Michigan Math. J. 31(2) (1984)
Hou, X., Zhang, L.: Saliency detection: a spectral residual approach. In: Proc. of CVPR (2007)
Hou, X., Zhang, L.: Dynamic visual attention: Searching for coding length increments. In: Advances in Neural Information Processing Systems (2008)
Hurvich, L., Jameson, D.: An opponent-process theory of color vision. Psychological Review 64(6) (1957)
Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. on PAMI 20(11) (1998)
Klein, D.A., Frintrop, S.: Center-surround divergence of feature statistics for salient object detection. In: Proc. of Int’l Conf. on Computer Vision, ICCV (2011)
Kovesi, P.: Arbitrary gaussian filtering with 25 additions and 5 multiplications per pixel. Tech. Rep. UWA-CSSE-09-002, School of Computer Science, University of Western Australia (2009)
Lin, Y., Fang, B., Tang, Y.: A computational model for saliency maps by using local entropy. In: Proc. of AAAI (2010)
Liu, T., Yuan, Z., Sun, J., Wang, J., Zheng, N., Tang, X., Shum, H.Y.: Learning to detect a salient object. IEEE Trans. on PAMI (2009)
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int’l J. of Computer Vision (IJCV) 60(2), 91–110 (2004)
Ma, Y.F., Zhang, H.J.: Contrast-based image attention analysis by using fuzzy growing. In: ACM Int’l Conf. on Multimedia (2003)
Ruesch, J., Lopes, M., Bernardino, A., Hörnstein, J., Santos-Victor, J., Pfeifer, R.: Multimodal saliency-based bottom-up attention: A framework for the humanoid robot icub. In: Proc. of Int’l Conf. on Robotics and Automation, ICRA (2008)
Schauerte, B., Kühn, B., Kroschel, K., Stiefelhagen, R.: Multimodal saliency-based attention for object-based scene analysis. In: Proc. of Int’l Conf. on Intelligent Robots and Systems, IROS (2011)
Treisman, A.M., Gelade, G.: A feature integration theory of attention. Cognitive Psychology 12, 97–136 (1980)
Walther, D., Koch, C.: Modeling attention to salient proto-objects. Neural Networks (2006)
Wolfe, J.M.: Guided search 2.0: A revised model of visual search. Psychonomic Bulletin and Review 1(2) (1994)
Wolfe, J.M., Horowitz, T.S.: What attributes guide the deployment of visual attention and how do they do it? Nature Reviews Neuroscience 5, 1–7 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Klein, D.A., Frintrop, S. (2012). Salient Pattern Detection Using W 2 on Multivariate Normal Distributions. In: Pinz, A., Pock, T., Bischof, H., Leberl, F. (eds) Pattern Recognition. DAGM/OAGM 2012. Lecture Notes in Computer Science, vol 7476. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32717-9_25
Download citation
DOI: https://doi.org/10.1007/978-3-642-32717-9_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32716-2
Online ISBN: 978-3-642-32717-9
eBook Packages: Computer ScienceComputer Science (R0)