Abstract
Human visual fixations play a vital role in many domains, from advertising design to human-computer interaction. Modeling saliency in images therefore brings significant benefits to Computer Vision tasks that deal with human perception. Several classification models have been developed to incorporate various feature levels and estimate free-viewing eye gaze. However, for real-time applications (here, real-time applications are those that are time- and often resource-constrained and require speedy results; the term does not imply on-line data analysis), deep convolutional neural networks are either difficult to deploy given current hardware limitations, or the proposed classifiers cannot effectively combine image semantics with low-level attributes. In this paper, we propose a novel neural network approach to predicting human fixations, aimed specifically at advertisements, where such analysis significantly impacts brand value and assists in audience measurement. Using a dataset of 400 print ads across 21 successful brands, we evaluate the effectiveness of advertisements and their associated fixations with the proposed saliency prediction model.
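To make the evaluation setting concrete, the sketch below shows one common way that predicted saliency maps are compared against recorded fixations: raw gaze points are blurred into a continuous fixation density map, and a prediction is scored with Normalized Scanpath Saliency (NSS), the mean z-scored saliency value at fixated pixels. This is a generic illustration of standard saliency-evaluation practice, not the paper's own pipeline; the function names, the `sigma` blur parameter, and the choice of NSS as the metric are all assumptions for the sketch.

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def fixation_map(points, shape, sigma=15):
    """Build a continuous fixation density map from raw gaze points.

    points: iterable of (row, col) fixation coordinates
    shape:  (height, width) of the stimulus image
    sigma:  Gaussian blur in pixels (roughly one degree of visual angle)
    """
    fmap = np.zeros(shape, dtype=float)
    for r, c in points:
        if 0 <= r < shape[0] and 0 <= c < shape[1]:
            fmap[int(r), int(c)] += 1.0
    fmap = gaussian_filter(fmap, sigma)
    if fmap.max() > 0:
        fmap /= fmap.max()  # normalize to [0, 1]
    return fmap

def nss(saliency, points):
    """Normalized Scanpath Saliency: mean z-scored saliency at fixated pixels."""
    s = (saliency - saliency.mean()) / (saliency.std() + 1e-12)
    return float(np.mean([s[int(r), int(c)] for r, c in points]))
```

A prediction that concentrates mass near the true fixations yields a positive NSS, while a uniform or misplaced map scores near or below zero, which is why NSS is a common sanity check alongside AUC-style metrics.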
Notes
1. Available online at http://people.csail.mit.edu/tjudd/WherePeopleLook.
2. The salient points shown by actual fixations, i.e., raw (non-normalized) user inputs, reflect wavering user attention and thus do not completely portray the salient locations in the image; they are instead used as a rough baseline for comparison.
3. By effectiveness, we mean that the advertisement highlights the product, company, etc., and immediately captures consumer attention, sparking interest.
© 2017 Springer International Publishing AG
Cite this paper
Jain, S., Sowmya Kamath, S. (2017). Saliency Prediction for Visual Regions of Interest with Applications in Advertising. In: Nasrollahi, K., et al. (eds.) Video Analytics. Face and Facial Expression Recognition and Audience Measurement. VAAM/FFER 2016. Lecture Notes in Computer Science, vol. 10165. Springer, Cham. https://doi.org/10.1007/978-3-319-56687-0_5
Print ISBN: 978-3-319-56686-3
Online ISBN: 978-3-319-56687-0