Skip to main content
Log in

Extraction of salient objects based on image clustering and saliency

  • Industrial and Commercial Application
  • Published:
Pattern Analysis and Applications Aims and scope Submit manuscript

Abstract

Over the past decades, numerous methods have been proposed on salient object detection. However, most of these methods need users’ interactions as a prerequisite to control their progress. In this paper, we propose a novel method for extraction of salient objects based on image clustering and saliency map from natural scene images. This method is a combination of image clustering, saliency map generation and automatic initialization. First, a graph based clustering method is applied to split the input image into regions. Second, a saliency map of the input image is generated using the contrast among split regions. From the split regions and generated saliency map, an adaptive threshold is defined, which classify the split regions into foreground and background. After that, the initial mask for object detection is determined using the classified foreground and background clusters and saliency values. A grab-cut with our initial mask is applied to extract the objects of interest, and the experimental results have shown that our proposed method is able to replace manual labeling of initialization in object detection.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Fussenegger M, Opelt A, Pinz A, Auer P (2004) Object recognition using segmentation for feature detection. In: Proc. IEEE int. conf. pattern recognition, pp 41–44

  2. Hirata K, Kasutani E, Hara Y (2002) On image segmentation for object-based image retrieval. In: Proc, IEEE int. conf. pattern recognition, pp 1031–1034

  3. Barrett WA, Cheney AS (2002) Object-based image editing. ACM Trans Graph 21(3):777–784

    Article  Google Scholar 

  4. Agrawal AK, Chellappa R (2005) Moving object segmentation and dynamic scene reconstruction using two frames. ICASSP 2:705–708

    Google Scholar 

  5. Mortensen EN, Barrett WA (1995) Intelligent scissors for image composition. In: Proc. of ACM SIGGRAPH, pp 191–198

  6. Chuang YY, Curless B, Salesin DH, Szeliski R (2001) A bayesian approach to digital matting. In: Proc. of IEEE international conference on computer vision and pattern recognition, pp 264–271

  7. Meyer F (1994) Topographic distance and watershed lines. Sig Process 38:113–125

    Article  MATH  Google Scholar 

  8. Boykov Y, Jolly MP (2001) Interactive graph cuts for optimal boundary and region segmentation of objects in N-D images. In: Proc. IEEE int. conf. computer vision, pp 105–112

  9. Boykov Y, Funka-Lea G (2006) Graph cuts and efficient N-D image segmentation. Int J Comput Vis 70(2):109–131

    Article  Google Scholar 

  10. Rother C, Kolmogorov V, Blake A (2004) Grabcut-interactive foreground extraction using iterated graph cuts. In: Proc. ACM SIGGRAPH, pp 309–314

  11. Kass M, Witkin A, Terzopoulous D (1987) Snakes: active contour models. Int J Comput Vis 1:321–331

    Article  Google Scholar 

  12. Achanta R, Hemami S, Estrada F, Susstrunk S (2009) Frequency-tuned salient region detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Miami, pp 1597–1604

  13. Cheng MM, Zhang GX, Mitra NJ, Huang X, Hu SM (2011) Global contrast based salient region detection. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Providence, pp 409–416

  14. Anh NTL, Kim YC, Lee GS (2012) Morphological gradient applied to new active contour model for color image segmentation. In: Proceedings of the 6th international conference on ubiquitous information management and communication Malaysia, (CD-pub)

  15. Tsotsos JK, Culhane SM, Wai WYK, Lai Y, Davis N, Nuflo F (1995) Modeling visual attention via selective tuning. Artif Intell 78(1–2):507–545

    Article  Google Scholar 

  16. Olshausen B, Anderson C, Van Essen D (1993) A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information. J Neurosci 13:4700–4719

    Google Scholar 

  17. Itti L, Koch C, Niebur E (1998) A model of saliency-based visual attention for rapid scene analysis. IEEE Trans Pattern Anal Mach Intell 20(11):1254–1259

    Article  Google Scholar 

  18. Ma YF, Zhang HJ (2003) Contrast-based image attention analysis by using fuzzy growing. ACM international conference on multimedia

  19. Kadir T, Zisserman A, Brady M (2004) An affine invariant salient region detector. European conference on computer vision

  20. Itti L, Baldi PF (2005) Bayesian surprise attracts human attention. Adv Neural Inf Process Syst 19:547–554

    Google Scholar 

  21. Eihhauser W, Konig P (2003) Does luminance-contrast contribute to a saliency map for overt visual attention? Eur J Neurosci 17:1089–1097

    Article  Google Scholar 

  22. Felzenszwalb P, Huttenlocher D (2004) Efficient graph-based image segmentation. Int J Comput Vis 59(2):167–181

    Article  Google Scholar 

  23. Parvati K, Prakasa Rao BS, Mariya Das M (2008) Image segmentation using gray-scale morphology and marker-controlled watershed transformation. Discret Dyn Nat Soc J 1–8

  24. Chan AB, Vasconcelos N (2008) Modeling, clustering, and segmenting video with mixtures of dynamic textures. IEEE Trans Pattern Anal Mach Intell 30(5):909–926

    Article  MATH  Google Scholar 

  25. Chen J, Zhao G, Salo M, Rahtu E, Pietikäinen M (2013) Automatic dynamic texture segmentation using local descriptors and optical flow. IEEE Trans Image Process 22(1):326–339

    Article  MathSciNet  Google Scholar 

  26. Xie Y, Lu H, Yang MH (2013) Bayesian saliency via low and mid level cues. IEEE Trans Image Process 22(5):1689–1698

    Article  MathSciNet  Google Scholar 

  27. Mishray A, Aloimonosy Y, Fah CL (2009) Active segmentation with fixation. In: Computer vision, 2009 IEEE 12th international conference, pp 468–475

  28. Liu W, Tao D (2013) Multiview hessian regularization for image annotation. IEEE Trans Image Process 22(7):2676–2687

    Article  MathSciNet  Google Scholar 

  29. Liu W, Tao D, Cheng J, Tang Y (2014) Multiview hessian discriminative sparse coding for image annotation. Comput Vis Image Underst 118:50–60

    Article  Google Scholar 

  30. Koch C, Ullman S (1985) Shifts in selective visual attention: towards the underlying neural circuitry. Human Neurobiol 4(4):219–227

    Google Scholar 

  31. Frintrop S, Klodt M, Rome E (2007) A real-time visual attention system using integral images. International conference on computer vision systems

  32. Hu Y, Xie X, Ma WY, Chia LT, Rajan D (2004) Salient region detection using weighted feature maps based on the human visual attention model. Pacific Rim conference on multimedia

  33. Gao D, Vasconcelos N (2007) Bottom-up saliency is a discriminant process. IEEE conference on computer vision

  34. Hou X, Zhang L (2007) Saliency detection: a spectral residual approach. IEEE conference on computer vision and pattern recognition

  35. Harel J, Koch C, Perona P (2007) Graph-based visual saliency. Adv Neural Inf Process Syst 19:545–552

    Google Scholar 

  36. Bruce N, Tsotsos J (2007) Attention based on information maximization. J Vis 7(9):950

    Article  Google Scholar 

Download references

Acknowledgments

This research was supported by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education, Science and Technology (2013-022495).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Soo Hyung Kim.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Na, I.S., Le, H., Kim, S.H. et al. Extraction of salient objects based on image clustering and saliency. Pattern Anal Applic 18, 667–675 (2015). https://doi.org/10.1007/s10044-015-0459-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10044-015-0459-1

Keywords

Navigation