Abstract
We present a novel visual saliency detection method using covariance matrices on a Riemannian manifold. After over-segmentation, superpixels are generated and featured by the region covariance matrix. The superpixels on image boundary are regarded as possible background cues and are used to build the background dictionary. A sparse model is then constructed based on the background dictionary, where a kernel method, embedding Riemannian manifolds into reproducing kernel Hilbert space, is used. For each superpixel, we compute sparse reconstruction errors as a saliency measurement, which are then weighted based on the local context and global context information. Finally, multi-scale reconstruction errors are integrated to reduce the effect of the scale problem, and an object-biased Gaussian model is adopted to refine the saliency map. The main contribution of this paper is using a kernel sparse representation of the region covariance descriptors for saliency detection. Experiments with public benchmark dataset show that the proposed algorithm outperforms the state-of-the-art methods in terms of precision, recall, and mean absolute error, which demonstrate that our method is more effective in uniformly highlighting salient objects and is robust to background noise.







Similar content being viewed by others
References
R. Achanta, A. Shaji, K. Smith, A. Lucchi, P. Fua, S. Susstrunk, Slic superpixels. EPFL, Lausssanne, Switzerland, Technical Report. 149300 (2010)
V. Arsigny, P. Fillard, X. Pennec, N. Ayache, Geometric means in a novel vector space structure on symmetric positive-definite matrices. SIAM J. Matrix Anal. Appl. 29(1), 328–347 (2007)
R. Achanta, S. Hemami, F.J. Estrada, S. Süsstrunk, Frequency-tuned salient region detection, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2009), pp. 1597–1604
S. Avidan, A. Shamir, Seam carving for content-aware image resizing. ACM Trans. Graph. 26(3), 10 (2007)
R. Achanta, F.J. Estrada, P. Wils, S. Süsstrunk, Salient region detection and segmentation, in Computer Vision Systems (Springer, Berlin, 2008), pp. 66–75
B. Alexe, T. Deselaers, V. Ferrari, Measuring the objectness of image windows. IEEE Trans. Pattern Anal. Mach. Intell. 34(11), 2189–2202 (2012)
A. Borji, L. Itti, Exploiting local and global patch rarities for saliency detection, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2012), pp. 478–485
C. Christopoulos, A. Skodras, T. Ebrahimi, The JPEG2000 still image coding system:an overview. IEEE Trans. Consum. Electron. 46(4), 1103–1127 (2000)
M.-M. Cheng, G.-X. Zhang, N. J. Mitra, X. Huang, S.-M. Hu, Global contrast based salient region detection, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2011), pp. 409–416
L. Duan, C. Wu, J. Miao, L. Qing, Y. Fu, Visual saliency detection by spatially weighted dissimilarity, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2011), pp. 473–480
Y.M. Fang, Z.Z. Chen, W.S. Lin, C.W. Lin, Saliency detection in the compressed domain for adaptive image retargeting. IEEE Trans. Image Process. 21(9), 3888–3901 (2012)
J. Feng, Y.C. Wei, L.T. Tao, C. Zhang, J. Sun, Salient object detection by composition,in Proceedings of the IEEE International Conference on Computer Vision, (2011), pp. 1028–1035
K. Guo, P. Ishwar, J. Konrad, Action recognition using sparse representation on covariance manifolds of optical flow, in The 7th IEEE International Conference on Advanced Video and Signal Based Surveillance, (2010), pp. 188–195
S. Gao, I.W.H. Tsang, L.T. Chia, Sparse representation with kernels. IEEE Trans. Image Process. 22(2), 423–434 (2013)
S. Goferman, L. Zelnik-Manor, A. Tal, Context-aware saliency detection, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2010), pp. 2376–2383
P.S. Hiremath, J. Pujari, Content based image retrieval using color boosted salient points and shape features of an image. Int. J. Image Process. 2(1), 10–17 (2008)
J. Harel, C. Koch, P. Perona, Graph-based visual saliency, in Advances in Neural Processing Systems, (2006), pp. 545–552
X.D. Hou, L.Q. Zhang, Saliency detection: a spectral residual approach, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2007), pp. 1–8
M.T. Harandi, C. Sanderson, R. Hartley, B. C. Lovell, Sparse coding and dictionary learning for symmetric positive definite matrices: a kernel approach, in Proceedings of the11th European Conference on Computer Vision, (2012), pp. 216–229
L. Itti, C. Koch, E. Niebur, A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell. 20(11), 1254–1259 (1998)
M.W. Jian, J.Y. Dong, J. Ma, Image retrieval using wavelet-based salient regions. Imaging Sci. J. 59(4), 219–231 (2011)
T. Judd, K. Ehinger, F. Durand, A. Torralba, Learning to predict where humans look, in Proceedings of the IEEE International Conference on Computer Vision, (2009), pp. 2106–2113
A.D. Klein, S. Frintrop, Center-surround divergence of feature statistics for salient object detection, in Proceedings of the IEEE International Conference on Computer Vision, (2011), pp. 2214–2219
T. Liu, Z. Yuan, J. Sun, J. Wang, N. Zheng, X. Tang, H.-Y. Shum, Learning to detect a salient object. IEEE Trans. Pattern Anal. Mach. Intell. 33(2), 353–367 (2011)
P. Li, Q. Wang, W. Zuo, L. Zhang, Log-Euclidean Kernels for sparse representation and dictionary learning, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2013), pp. 1601–1608
X. Li, H. Lu, L. Zhang, X. Ruan, M.-H. Yang, Saliency detection via dense and sparse reconstruction, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2013), pp. 2976–2983
Y. Ma, B. Miller, I. Cohen, Video sequence querying using clustering of objects’ appearance models, in Advances in Visual Computing, (2007), pp. 328–329
Y.-F. Ma, H.J. Zhang, Contrast-based image attention analysis by using fuzzy growing, in Proceedings of the 11th ACM International Conference on Multimedia, (2003), pp. 374–381
F. Perazz, P. Krahenbuhl, Y. Pritch, A. Hornung, Saliency filters: contrast based filtering for salient region detection, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2012), pp. 733–740
F. Porikli, O. Tuzel, P. Meer, Covariance tracking using model update based on lie algebra, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2006), pp. 728–735
Y. Pang, Y. Yuan, X. Li, Gabor-based region covariance matrices for face recognition. IEEE Trans. Circuits Syst. Video Technol. 18(7), 989–993 (2008)
X. Pennec, P. Fillard, N. Ayache, Riemannian framework for tensor computing. Int. J. Comput. Vis. 66(1), 41–66 (2006)
U. Rutishause, D. Walther, C. Koch, P. Perona, Is bottom-up attention useful for object recognition?, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2012), pp. 2–37
E. Rahtu, J. Kannala, M. Salo, J. Heikkilä, Segmenting salient objects from images and videos, in Proceedings of the11th European Conference on Computer Vision, (2010), pp. 366–379
G. Sharma, F. Jurie, C. Schmid, Discriminative spatial saliency for image classification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2012), pp. 3506–3513
X. Shen, Y. Wu, A unified approach to salient object detection via low rank matrix recovery, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2012), pp. 853–860
R. Sivalingam, D. Boley, V. Morellas, V.N. Papanikolopoulos, Tensor sparse coding for region covariances, in Proceedings of the11th European Conference on Computer Vision, (2010), pp. 722–735
A. Treisman, G. Gelade, A feature-integration theory of attention. Cognit Psychol 12(1), 97–136 (1980)
O. Tuzel, F. Porikl, P. Meer, Region covariance: a fast descriptor for detection and classification, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2006), pp. 589–600
O. Tuzel, F. Porikl, P. Meer, Pedestrian detection via classification on Riemannian manifolds. IEEE Trans. Pattern Anal. Mach. Intell. 30(10), 1713–1727 (2008)
O. Tuze, F. Porikli, P. Meer, Human detection via classification on Riemannian manifolds, in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, (2007), pp. 1–8
H.W. Tian, Y.M. Fang, Y. Zhao, W.S. Lin, R.R. Ni, Z.F. Zhu, Salient region detection by fusing bottom-up and top-down features extracted from a single image. IEEE Trans. Image Process. 23(10), 4389–4398 (2014)
Y. Wei, F. Wen, W. Zhu, J. Sun, Geodesic saliency using background priors, in Proceedings of the11th European Conference on Computer Vision, (2012), pp. 29–42
Y.L. Xie, H.C. Lu, Visual saliency detection based on Bayesian model, in 18th IEEE International Conference on Image Processing, (2011), pp. 645–648
Y. Xie, H. Lu, M.-H. Yang, Bayesian saliency via low and mid level cues. IEEE Trans. Image Process. 22(5), 1689–1698 (2013)
Y. Xie, B. C. Vemuri, J. Ho, Statistical analysis of tensor fields, in Medical Image Computing and Computer-Assisted Intervention, (2010), pp. 682–689
Y. Zhai, M. Shah, Visual attention detection in video sequences using spatiotemporal cues, in Proceedings of the 14th Annual ACM International Conference on Multimedia, (2006), pp. 815–824
Z.-F. Zhu, Q. Chen, Y. Zhao, Ensemble dictionary learning for saliency detection. Image Vis. Comput. 32(3), 180–188 (2014)
Acknowledgments
The work described in this paper is supported by the Science and Technology Foundation of Henan Province of China (Grant No. 142102310562).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Zhang, YY., Wang, ZP. & Lv, XD. Saliency Detection via Sparse Reconstruction Errors of Covariance Descriptors on Riemannian Manifolds. Circuits Syst Signal Process 35, 4372–4389 (2016). https://doi.org/10.1007/s00034-016-0267-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00034-016-0267-x