Image aesthetics enhancement using composition-based saliency detection

Zhao, Handong; Chen, Jingjing; Han, Yahong; Cao, Xiaochun

doi:10.1007/s00530-014-0373-1

Image aesthetics enhancement using composition-based saliency detection

Special Issue Paper
Published: 03 April 2014

Volume 21, pages 159–168, (2015)
Cite this article

Multimedia Systems Aims and scope Submit manuscript

Handong Zhao¹,
Jingjing Chen¹,
Yahong Han^1,3 &
…
Xiaochun Cao²

519 Accesses
6 Citations
3 Altmetric
Explore all metrics

Abstract

Visual saliency detection and segmentation are widely used in many applications in image processing and computer vision. However, existing saliency detection methods have not fully taken the spatial information of salient regions into account. Inspired by the basic photographic composition rules, we present a novel saliency detection method, which utilizes the knowledge of photographic composition as priors to improve the saliency detection results. Moreover, an online parameter selection method is proposed when utilizing GrabCut to achieve the saliency segmentation result. Besides, to test the applicability of our method, we present a novel post-processing framework for the photographs to be more artistic. The salient region and depth map are firstly computed. The salient region keeps its sharpness, while other parts in the photograph get blurred based on the depth map. To our best knowledge, this is a novel image-based attempt to enhance aesthetics by post-processing a photograph via realistic blurring. We test our method on the 1,000 benchmark test images and dataset MSRA. Extensive experimental results show the applicability and effectiveness of our method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Aggregating complementary boundary contrast with smoothing for salient region detection

Article 02 June 2016

Image aesthetics assessment using composite features from transformer and CNN

Article 01 August 2023

Saliency Map Improvement Using Edge-Aware Filtering

References

Achanta, R., Estrada, F.J., Wils, P., Süsstrunk, S.: Salient region detection and segmentation. In: International conference on computer vision, system, pp. 66–75 (2008)
Achanta, R., Hemami, S.S., Estrada, F.J., Süsstrunk, S.: Frequency-tuned salient region detection. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1597–1604 (2009)
Bae, S., Durand, F.: Defocus magnification. Comput. Graph. Forum 26, 571–579 (2007)
Article Google Scholar
Chen, J., Zhao, H., Han, Y., Cao, X.: Visual saliency detection based on photographic composition. In: International conference on internet multimedia computing and service, pp. 13–16 (2013)
Cheng, M.M., Mitra, N.J., Huang, X., Hu, S.M.: Salient shape: group saliency in image collections. Vis. Comput. 30(4), 443–453 (2014)
Google Scholar
Cheng, M.M., Zhang, G.X., Mitra, N.J., Huang, X., Hu, S.M.: Global contrast based salient region detection. In: CVPR, pp. 409–416 (2011)
Daly, S.: The visible differences predictor: an algorithm for the assessment of image fidelity. In: SPIE/IS&T 1992 Symposium on Electronic Imaging: Science and Technology, pp 2–15 (1992)
Das, S., Ahuja, N.: Performance analysis of stereo, vergence, and focus as depth cues for active vision. IEEE. Trans. Pattern Anal. Mach. Intell.17(12), 1213–1219 (1995)
Article Google Scholar
Datta, R., Joshi, D., Li, J., Wang, J.Z.: Studying aesthetics in photographic images using a computational approach. In: ECCV, pp. 288–301 (2006)
Datta, R., Li, J., Wang, J.Z.: Learning the consensus on visual quality for next generation image management. In: ACM multimedia, pp. 533–536 (2007)
Datta, R., Li, J., Wang, J.Z.: Algorithmic inferencing of aesthetics and emotion in natural images: An exposition. In: ICIP, special session on image aesthetics: mood and emotion, pp. 105–108 (2008)
Davies, E.R.: Machine vision: theory, algorithms and practicalities. In: pp. 42–44. Academic Press, London (1990)
Google Scholar
Eltoukhy, H.A., Kavusi, S.: Computationally efficient algorithm for multifocus image reconstruction. In: Sensors and camera systems for scientific, industrial, and digital photography applications, pp. 332–341 (2003)
Forsyth, D.A., Ponce, J.: Computer vision: a modern approach. Prentice Hall Professional Technical Reference (2002)
Goferman, S., Manor, L.Z., Tal, A.: Context-aware saliency detection. In: CVPR, pp. 2376–2383 (2010)
Harel, J., Koch, C., Perona, P.: Graph-based visual saliency. In: Advances in Neural Information Processing Systems (NIPS), pp. 545–552 (2006)
Hong, R., Wang, M., Xu, M., Yan, S., Chua, T.S.: Dynamic captioning: Video accessibility enhancement for hearing impairment. In: ACM multimedia, pp. 421–430 (2010)
Hong, R., Wang, M., Yuan, X.T., Xu, M., Jiang, J., Yan, S., Chua, T.S.: Video accessibility enhancement for hearing impaired users. ACM. Trans. Multimed. Comput.7S, 24–42 (2011)
Google Scholar
Hou, X., Zhang, L.: Saliency detection: a spectral residual approach. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–8 (2007)
Huhle, B., Schairer, T., Jenke, P., Straßer, W.: Realistic depth blur for images with range data. Dynamic 3D, imaging pp. 84–95 (2009)
Krages, B.: Photography: the art of composition, 1st edn. Allworth Press, New York (2005)
Google Scholar
Liu, L., Chen, R., Wolf, L., Cohen-Or, D.: Optimizing photo composition. Comput. Graph. Forum 29, 469–478 (2010)
Article Google Scholar
Liu, T., Sun, J., Zheng, N.N., Tang, X., Shum, H.Y.: Learning to detect a salient object. In: CVPR, pp. 1–8 (2007)
Ma, Y.F., Zhang, H.: Contrast-based image attention analysis by using fuzzy growing. In: ACM multimedia, pp. 374–381 (2003)
Mahmoud, T.A., Marshall, S.: Threshold decomposition driven adaptive morphological filter for image sharpening. In: VISAPP, pp. 40–45 (2007)
Maki, A., Watanabe, M., Geotensity, C.W.: Combining motion and lighting for 3D surface reconstruction. Int. J. Comput. Vis.48(2), 75–90 (2002)
Article MATH Google Scholar
Malik, J., Rosenholtz, R.: Computing local surface orientation and shape from texture for curved surfaces. Int. J. Comput. Vis. 23(2), 149–168 (1997)
Article Google Scholar
McGuire, M., Matusik, W., Pfister, H., Hughes, J.F., Durand, F.: Defocus video matting. ACM Trans. Graph. 24(3), 567–576 (2005)
Google Scholar
Moutoussis, K., Zeki, S.: A direct demonstration of perceptual asynchrony in vision. In: Proceedings of the Royal Society of London. Series B: Biological Sciences, pp. 393–399 (1997)
Nagai, T., Ikehara, M., Kurematsu, A.: Hmm-based surface reconstruction from single images. Syst. Comput. Jpn. 38(11), 80–89 (2007)
Article Google Scholar
Peng, B., Veksler, O.: Parameter selection for graph cut based image segmentation. In: BMVC, pp. 332–341 (2008)
Peters, G.: Aesthetic primitives of images for visualization. In: IEEE international conference on information visualization, pp. 316–325 (2007)
Rother, C., Kolmogorov, V., Blake, A.: Grabcut: interactive foreground extraction using iterated graph cuts. ACM Trans. Graph 23, 309–314 (2004)
Article Google Scholar
Saxena, A., Chung, S.H., Ng, A.Y.: Learning depth from single monocular images. In: Advances in Neural Information Processing Systems (NIPS) (2005)
Saxena, A., Chung, S.H., Ng, A.Y.: 3-d depth reconstruction from a single still image. Int. J. Comput. Vis. 76, 53–69 (2008)
Article Google Scholar
Saxena, A., Sun, M., Ng, A.Y.: Make3d: learning 3d scene structure from a single still image. IEEE Trans. Pattern Anal. Mach. Intell. 31(5), 824–840 (2009)
Article Google Scholar
Scharstein, D., Szeliski, R.: A taxonomy and evaluation of dense two-frame stereo correspondence algorithms. Int. J. Comput. Vis. 47, 1–35 (2002)
Article Google Scholar
Schavemaker, J.G.M., Reinders, M.J.T., Gerbrands, J.J., Backer, E.: Image sharpening by morphological filtering. Pattern Recogn. 33(6), 997–1012 (2000)
Google Scholar
Subbarao, M., Wei, T.C., Surya, G.: Focused image recovery from two defocused images recorded with different camera settings. IEEE Trans. Image Process. 4(12), 1613–1628 (1995)
Google Scholar
Tatler, B.W.: The central fixation bias in scene viewing: selecting an optimal viewing position independently of motor biases and image feature distributions. J. Vis. pp. 1–17 (2007)
Valenti, R., Jaimes, A., Sebe, N.: Sonify your face: Facial expressions for sound generation. In: ACM multimedia, pp. 1363–1372 (2010)
Valenti, R., Sebe, N., Gevers, T.: Facial expression recognition: a fully integrated approach. In: International conference on image analysis and processing workshops, pp. 125–130 (2007)
Wang, M., Hong, R., Yuan, X.T., Yan, S., Chua, T.S.: Movie2comics: towards a lively video content presentation. Trans. Multimed.14, 858–870 (2012)
Article Google Scholar
Watson, A.B.: Toward a perceptual video quality metric. In: SPIE, pp. 139–147 (1998)
Zhai, Y., Shah, M.: Visual attention detection in video sequences using spatiotemporal cues. In: ACM multimedia, pp. 815–824 (2006)
Zhang, M., Zhang, L., Sun, Y., Feng, L., Ma, W.Y.: Auto cropping for digital photographs. In: ICME, pp. 438–441 (2005)

Download references

Acknowledgment

This work is partly supported by National Program on Key Basic Research Project (973 Program) under Grant 2013CB329301, National High-tech R&D Program of China (2013AA01A601), 100 Talents Program of The Chinese Academy of Sciences, the NSFC (under Grant 61202166), and Doctoral Fund of Ministry of Education of China (under Grant 20120032120042).

Author information

Authors and Affiliations

School of Computer Science and Technology, Tianjin University, Tianjin, 300072, China
Handong Zhao, Jingjing Chen & Yahong Han
State Key Laboratory of Information Security, Institute of Information Engineering, Chinese Academy of Sciences, Beijing, 100093, China
Xiaochun Cao
Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin University, Tianjin, China
Yahong Han

Authors

Handong Zhao
View author publications
You can also search for this author inPubMed Google Scholar
Jingjing Chen
View author publications
You can also search for this author inPubMed Google Scholar
Yahong Han
View author publications
You can also search for this author inPubMed Google Scholar
Xiaochun Cao
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Yahong Han.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zhao, H., Chen, J., Han, Y. et al. Image aesthetics enhancement using composition-based saliency detection. Multimedia Systems 21, 159–168 (2015). https://doi.org/10.1007/s00530-014-0373-1

Download citation

Published: 03 April 2014
Issue Date: March 2015
DOI: https://doi.org/10.1007/s00530-014-0373-1

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Image aesthetics enhancement using composition-based saliency detection

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Aggregating complementary boundary contrast with smoothing for salient region detection

Image aesthetics assessment using composite features from transformer and CNN

Saliency Map Improvement Using Edge-Aware Filtering

References

Acknowledgment

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now