Abstract
We provide a biological justification for the success of spectral domain models of visual attention and propose a refined spectral domain based spatiotemporal saliency map model including a more biologically plausible method for motion saliency generation. We base our approach on the idea of spectral whitening (SW), and show that this whitening process is an estimation of divisive normalization, a model of lateral surround inhibition. Experimental results reveal that SW is a better performer at predicating eye fixation locations than other state-of-the-art spatial domain models for color images, achieving a 92% consistency with human behavior in urban environments. In addition, the model is simple and fast, capable of generating saliency maps in real-time.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Itti, L., Koch, C., Niebur, E.: A Model of Saliency-Based Visual Attention for Rapid Scene Analysis. IEEE Trans. PAMI 20, 1254–1259 (1998)
Bruce, N.D., Tsotsos, J.K.: Saliency Based on Information Maximization. In: Proc. NIPS (2005)
Itti, L., Baldi, P.: Bayesian Surprise Attracts Human Attention. In: Proc. NIPS (2005)
Harel, J., Koch, C., Perona, P.: Graph-Based Visual Saliency. In: Proc. NIPS (2006)
Simoncelli, E.P., Schwartz, O.: Modeling Surround Suppression in V1 Neurons with a Statistically-Derived Normalization Model. In: Proc. NIPS (1998)
Cavanaugh, J.R., Bair, W., Movshon, J.A.: Orientation-Selective Setting of Contrast Gain by the Surrounds of Macaque Striate Cortex Neurons. Neuroscience Abstracts 23, 227.2 (1997)
Hou, X., Zhang, L.: Saliency Detection: A Spectral Residual Approach. In: Proc. CVPR (2007)
Guo, C.L., Ma, Q., Zhang, L.M.: Spatio-Temporal Saliency Detection Using Phase Spectrum of Quaternion Fourier Transform. In: Proc. CVPR (2008)
Li, Z., Dayan, P.: Pre-Attentive Visual Selection. Neural Networks 19, 143–1439 (2006)
Ell, T.A., Sangwine, S.J.: Hypercomplex Fourier Transforms of Color Images. IEEE Trans. Image Processing 16, 22–35 (2007)
Cao, A., Schiller, P.H.: Neural Responses to Relative Speed in the Primary Visual Cortex of Rhesus Monkey. Visual Neuroscience 20, 77–84 (2003)
De Castro, E., Morandi, C.: Registration of Translated and Rotated Images Using Finite Fourier Transforms. IEEE Trans. PAMI 9, 700–703 (1987)
Tatler, B.W., Baddeley, R.J., Gilchrist, I.D.: Visual Correlates of Fixation Selection: Effects of Scale and Time. Vision Research 45, 643–659 (2005)
Gao, D., Mahadevan, V., Vasconcelos, N.: The Discriminant Center-Surround Hypothesis for Bottom-Up Saliency. In: Proc. NIPS (2007)
Walther, D., Koch, C.: Modeling attention to salient proto-objects. Neural Networks 19, 139–1407 (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bian, P., Zhang, L. (2009). Biological Plausibility of Spectral Domain Approach for Spatiotemporal Visual Saliency. In: Köppen, M., Kasabov, N., Coghill, G. (eds) Advances in Neuro-Information Processing. ICONIP 2008. Lecture Notes in Computer Science, vol 5506. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02490-0_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-02490-0_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02489-4
Online ISBN: 978-3-642-02490-0
eBook Packages: Computer ScienceComputer Science (R0)