skip to main content
10.1145/3126686.3126712acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article
Public Access

Beyond Saliency: Assessing Visual Balance with High-level Cues

Authors Info & Claims
Published:23 October 2017Publication History

ABSTRACT

Automatic composition optimization is a vital technique for computational photography systems. Balance in composition is one of the agreed-upon principles of aesthetics and is commonly employed as a visual feature in many computational aesthetics studies. It refers to an equilibrium of visual weights within composition. Existing composition optimization and aesthetic quality assessment systems utilize the saliency map to represent balance. However, saliency map methods fail to account for high-level visual features that are important for compositional balance. Our work establishes a framework for the purpose of evaluating the relationship between visual features and compositional balance. This provides a better understanding of compositional balance and help improve composition optimization performance. A dataset based on a human subject study was created with photos representing main balance concepts such as symmetric, dynamic balance, and imbalance. We take the visual center given by human subjects as the dependent variable and the center-of-mass for each type of visual features as the predictor variable. Based on a linear regression model, we can assess how much each type of visual features contributes to the prediction of the visual center. Our findings show that high-level visual elements can help increase prediction accuracy with significance on top of saliency maps. Specifically, extra information provided through human and dominant vanishing point detection is statistically significant for assessing balance in the composition.

References

  1. Radhakrishna Achanta, Sheila Hemami, Francisco Estrada, and Sabine Susstrunk. 2009. Frequency-tuned salient region detection. In Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE, 1597--1604.Google ScholarGoogle ScholarCross RefCross Ref
  2. Rudolf Arnheim. 1954. Art and Visual Perception: A Psychology of the Creative Eye. Univ. of California Press.Google ScholarGoogle Scholar
  3. Subhabrata Bhattacharya, Rahul Sukthankar, and Mubarak Shah. 2010. A framework for photo-quality assessment and enhancement based on visual aesthetics. In Proceedings of International Conference on Multimedia. ACM, 271--280. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Ali Borji and Laurent Itti. 2013. State-of-the-art in visual attention modeling. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 1 (2013), 185--207. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Ritendra Datta, Dhiraj Joshi, Jia Li, and James Z.Wang. 2006. Studying aesthetics in photographic images using a computational approach. Proceedings of the European Conference on Computer Vision (2006), 288--301. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Lijuan Duan, Chunpeng Wu, Jun Miao, Laiyun Qing, and Yu Fu. 2011. Visual saliency detection by spatially weighted dissimilarity. In Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE, 473--480. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Erkut Erdem and Aykut Erdem. 2013. Visual saliency estimation by nonlinearly integrating features using region covariances. Journal of Vision 13, 4 (2013), 11--11.Google ScholarGoogle ScholarCross RefCross Ref
  8. Sharon Gershoni and Shaul Hochstein. 2011. Measuring pictorial balance perception at first glance using Japanese calligraphy. i-Perception 2, 6 (2011), 508--527.Google ScholarGoogle Scholar
  9. Stas Goferman, Lihi Zelnik-Manor, and Ayellet Tal. 2012. Context-aware saliency detection. IEEE Transactions on Pattern Analysis and Machine Intelligence 34, 10 (2012), 1915--1926. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Y.W. Guo, M. Liu, T.T. Gu, and W.P. Wang. 2012. Improving photo composition elegantly: Considering image similarity during composition optimization. In Proceedings of Computer Graphics Forum, Vol. 31. Wiley Online Library, 2193-- 2202. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Jonathan Harel, Christof Koch, and Pietro Perona. 2007. Graph-based visual saliency. In Advances in neural information processing systems. 545--552. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Xiaodi Hou, Jonathan Harel, and Christof Koch. 2012. Image signature: Highlighting sparse salient regions. IEEE Transactions on Pattern Analysis and Machine Intelligence 34, 1 (2012), 194--201. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Ali Jahanian, Jerry Liu, Qian Lin, Daniel Tretter, Eamonn O'Brien-Strain, Seungyon Claire Lee, Nic Lyons, and Jan Allebach. 2013. Recommendation system for automatic design of magazine covers. In Proceedings of International Conference on Intelligent User Interfaces. ACM, 95--106. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Huaizu Jiang, Jingdong Wang, Zejian Yuan, Yang Wu, Nanning Zheng, and Shipeng Li. 2013. Salient object detection: A discriminative regional feature integration approach. In Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE, 2083--2090. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Dhiraj Joshi, Ritendra Datta, Elena Fedorovskaya, Quang-Tuan Luong, James Z Wang, Jia Li, and Jiebo Luo. 2011. Aesthetics and emotions in images. IEEE Signal Processing Magazine 28, 5 (2011), 94--115.Google ScholarGoogle ScholarCross RefCross Ref
  16. Tilke Judd, Krista Ehinger, Frédo Durand, and Antonio Torralba. 2009. Learning to predict where humans look. In Proceedings of International Conference on Computer Vision. IEEE, 2106--2113.Google ScholarGoogle ScholarCross RefCross Ref
  17. John Kahrs, Sharon Calahan, Dave Carson, and Stephen Poster. 1996. Pixel cinematography: a lighting approach for computer graphics. ACM SIGGRAPH Course Notes (1996), 433--42.Google ScholarGoogle Scholar
  18. David A. Lauer and Stephen Pentak. 2011. Design Basics. Cengage Learning.Google ScholarGoogle Scholar
  19. Guanbin Li and Yizhou Yu. 2015. Visual saliency based on multiscale deep features. In Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE, 5455--5463.Google ScholarGoogle Scholar
  20. Ligang Liu, Renjie Chen, Lior Wolf, and Daniel Cohen-Or. 2010. Optimizing photo composition. In Proceedings of Computer Graphics Forum, Vol. 29. Wiley Online Library, 469--478.Google ScholarGoogle ScholarCross RefCross Ref
  21. Wei Luo, Xiaogang Wang, and Xiaoou Tang. 2011. Content-based photo quality assessment. In Proceedings of International Conference on Computer Vision. IEEE, 2206--2213. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. I.C. McManus, D. Edmondson, and J. Rodger. 1985. Balance in pictures. British Journal of Psychology 76, 3 (1985), 311--324.Google ScholarGoogle ScholarCross RefCross Ref
  23. I.C. McManus, Katharina Stöver, and Do Kim. 2011. Arnheim's Gestalt theory of visual balance: Examining the compositional structure of art photographs and abstract images. i-Perception 2, 6 (2011), 615--647.Google ScholarGoogle Scholar
  24. Robert H. Morriss and William P. Dunlap. 1988. Influence of chroma and hue on spatial balance of color pairs. Color Research & Application 13, 6 (1988), 385--388.Google ScholarGoogle ScholarCross RefCross Ref
  25. Naila Murray, Maria Vanrell, Xavier Otazu, and C. Alejandro Parraga. 2011. Saliency estimation using a non-parametric low-level vision model. In Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE, 433--440. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Tam V. Nguyen, Bingbing Ni, Hairong Liu,Wei Xia, Jiebo Luo, Mohan Kankanhalli, and Shuicheng Yan. 2013. Image re-attentionizing. IEEE Transactions on 15, 8 (2013), 1910--1919. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Aude Oliva, Antonio Torralba, Monica S. Castelhano, and John M. Henderson. 2003. Top-down control of visual attention in object detection. In Proceedings of The International Conference on Image Processing, Vol. 1. IEEE, I--253.Google ScholarGoogle Scholar
  28. Junting Pan, Elisa Sayrol, Xavier Giro-i Nieto, Kevin McGuinness, and Noel E. O'Connor. 2016. Shallowand deep convolutional networks for saliency prediction. In Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE, 598--606.Google ScholarGoogle Scholar
  29. Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In Proceedings of the Conference on Computer Vision and Pattern Recognition. IEEE, 779--788.Google ScholarGoogle ScholarCross RefCross Ref
  30. Wirawit Rueopas, Sangsan Leelhapantu, and Thanarat H. Chalidabhongse. 2016. A corner-based saliency model. In Proceedings of International Joint Conference on Computer Science and Software Engineering. IEEE, 1--6.Google ScholarGoogle Scholar
  31. Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, et al. 2015. Imagenet large scale visual recognition challenge. International Journal of Computer Vision 115, 3 (2015), 211--252. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. Hamed R. Tavakoli and Jorma Laaksonen. 2016. Bottom-Up Fixation Prediction Using Unsupervised Hierarchical Models. In Proceedings of Asian Conference on Computer Vision. Springer, 287--302.Google ScholarGoogle Scholar
  33. Hamed R. Tavakoli, Esa Rahtu, and Janne Heikkilä. 2011. Fast and efficient saliency detection using sparse sampling and kernel density estimation. In Proceedings of Scandinavian Conference on Image Analysis. Springer, 666--675. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Zhi Tian, Weilin Huang, Tong He, Pan He, and Yu Qiao. 2016. Detecting text in natural image with connectionist text proposal network. In Proceedings of the European Conference on Computer Vision. Springer, 56--72.Google ScholarGoogle ScholarCross RefCross Ref
  35. Anne Treisman. 1985. Preattentive processing in vision. Computer vision, graphics, and image processing 31, 2 (1985), 156--177. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Xianjun Sam Zheng, Ishani Chakraborty, James Jeng-Weei Lin, and Robert Rauschenberger. 2009. Correlating low-level image statistics with users-rapid aesthetic and affective judgments of web pages. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. ACM, 1--10. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. Zihan Zhou, Farshid Farhat, and James Z. Wang. 2017. Detecting dominant vanishing points In natural scenes with application to composition-sensitive image retrieval. IEEE Transactions on Multimedia (2017).Google ScholarGoogle Scholar

Index Terms

  1. Beyond Saliency: Assessing Visual Balance with High-level Cues

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          Thematic Workshops '17: Proceedings of the on Thematic Workshops of ACM Multimedia 2017
          October 2017
          558 pages
          ISBN:9781450354165
          DOI:10.1145/3126686

          Copyright © 2017 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 23 October 2017

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Upcoming Conference

          MM '24
          MM '24: The 32nd ACM International Conference on Multimedia
          October 28 - November 1, 2024
          Melbourne , VIC , Australia

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader