Skip to main content

Weakly Supervised Learning of Image Emotion Analysis Based on Cross-spatial Pooling

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 11818))

Abstract

Convolutional neural networks (CNNs) simulate the structure and function of the nervous system based on biological characteristics. CNNs have been used to understand the emotions that images convey. Most existing studies of emotion analysis have focused only on image emotion classification, and few studies have paid attention to relevant regions evoking emotions. In this paper, we solve the issues of image emotion classification and emotional region localization based on weakly supervised deep learning in a unified framework. We train a fully convolutional network, followed by our proposed cross-spatial pooling strategy, to generate an emotional activation map (EAM), which represents the relevant region that could evoke emotion in an image and is only labelled with an image-level annotation. Extensive experiments demonstrate that our proposed method has the best performance in the accuracy of classification and emotional region localization.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   59.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   79.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Peng, K.C., Chen, K.C., Sadovnik, A., Gallagher, A.: A mixed bag of emotions: model, predict, and transfer emotion distributions. In: The 2015 IEEE Conference on Computer Vision and Pattern Recognition, Boston, pp. 860–868. IEEE Press (2015)

    Google Scholar 

  2. You, Q.Z., Luo, J.B., Jin, H.L., Yang, J.C.: Building a large scale dataset for image emotion recognition: the fine print and the benchmark. In: The 30th Conference on Artificial Intelligence, Arizona, pp. 308–314. IEEE Press (2016)

    Google Scholar 

  3. You, Q.Z., Luo, J.B., Jin, H.L., Yang, J.C.: Robust image sentiment analysis using progressively trained and domain transferred deep networks. In: The 29th Conference on Artificial Intelligence, Austin, pp. 381–388. IEEE Press (2015)

    Google Scholar 

  4. Victor, C., Brendan, J., Giró-i-Nieto, X.: From pixels to sentiment: fine-tuning CNNs for visual sentiment prediction. Image Vis. Comput. 65, 15–22 (2017)

    Article  Google Scholar 

  5. Peng, K.C., Sadovnik, A., Gallagher, A., Chen, T.: Where do emotions come from? Predicting the emotion stimuli map. In: The 2016 IEEE International Conference on Image Processing, Phoenix, pp. 614–618. IEEE Press (2016)

    Google Scholar 

  6. Yang, J.F., She, D.Y., Lai, Y.k., Yang, M.H.: Retrieving and classifying affective images via deep metric learning. In: the 30th innovative Applications of Artificial Intelligence, New Orleans, pp. 491–498. IEEE Press (2018)

    Google Scholar 

  7. Borth, D., Ji, R.R., Chen, T., et al.: Large-scale visual sentiment ontology and detectors using adjective noun pairs. In: The 2013 ACM Multimedia Conference, Barcelona, pp. 223–232. ACM Press (2013)

    Google Scholar 

  8. Chen, T., Borth, D., Darrell, T, et al.: DeepSentibank: visual sentiment concept classification with deep convolutional neural networks. arXiv preprint arXiv:1410.8586 (2014)

  9. Ali, A.R., Shahid, U., Ali, M., et al.: High-level concepts for affective understanding of images. In: The 2017 IEEE Winter Conference on Applications of Computer Vision, Santa Rosa, pp. 678–687. IEEE Press (2017)

    Google Scholar 

  10. Kosti, R., Alvarez, J.M., Recasens A., et al.: Emotion recognition in context. In: The 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, pp. 1960–1968. IEEE Press (2017)

    Google Scholar 

  11. Bilen, H., Vedaldi, A.: Weakly supervised deep detection networks. In: The 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, pp. 2846–2854. IEEE Press (2016)

    Google Scholar 

  12. Cinbis, R.G., Verbeek, J., Schmid, C.: Weakly supervised object localization with multi-fold multiple instance learning. IEEE Trans. Pattern Anal. Mach. Intell. 39(1), 189–203 (2016)

    Article  Google Scholar 

  13. Zhou, B.L, Aditya, K., Agata, L., Aude, O., Antonio, T.: Learning deep features for discriminative localization. In: The 2016 IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, pp. 2921–2929. IEEE Press (2016)

    Google Scholar 

  14. Selvaraju, R.R, Cogswell, M., Das, A., et al.: Grad-CAM: visual explanations from deep networks via gradient-based localization. In: The 2017 IEEE International Conference on Computer Vision, Venice, pp. 618–626. IEEE Press (2017)

    Google Scholar 

  15. Durand, T., Mordan, T., Thome, N., et al.: WILDCAT: weakly supervised learning of deep convnets for image classification, pointwise localization and segmentation. In: The 2017 IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, pp. 5957–5966. IEEE Press (2017)

    Google Scholar 

  16. Zhu, Y., Zhou, Y., Ye Q., et al.: Soft proposal networks for weakly supervised object localization. In: The 2017 IEEE International Conference on Computer Vision, Italy, pp. 1859–1868. IEEE Press (2017)

    Google Scholar 

  17. Yang, J.F., She. D.Y., Lai, Y.K., Rosin, P.L.: Weakly supervised coupled networks for visual sentiment analysis. In: The 2018 IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, pp. 7584–7592. IEEE Press (2018)

    Google Scholar 

  18. Fan, S.J., Shen, Z.Q., Jiang, M., Koening L., et al.: Emotional attention: a study of image sentiment and visual attention. In: The 2018 IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, pp. 7521–7531. IEEE Press (2018)

    Google Scholar 

  19. Paszke, A., Gross, S., Chintala, S., et al.: Pytorch. https://pytorch.org/ (2017)

  20. Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)

    Article  Google Scholar 

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China under Grant Nos. 61163019 and No. 61540062, the Yunnan Applied Basic Research Key Project under Grant No. 2014FA021, and the Yunnan Provincial Education Department’s Scientific Research Fund Industrialization Project under Grant No. 2016CYH03.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Guoqin Peng or Dan Xu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Peng, G., Xu, D. (2019). Weakly Supervised Learning of Image Emotion Analysis Based on Cross-spatial Pooling. In: Sun, Z., He, R., Feng, J., Shan, S., Guo, Z. (eds) Biometric Recognition. CCBR 2019. Lecture Notes in Computer Science(), vol 11818. Springer, Cham. https://doi.org/10.1007/978-3-030-31456-9_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-31456-9_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-31455-2

  • Online ISBN: 978-3-030-31456-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics