skip to main content
research-article

A New Foreground-Background based Method for Behavior-Oriented Social Media Image Classification

Published: 12 November 2021 Publication History

Abstract

Due to various applications, research on personal traits using information on social media has become an important area. In this paper, a new method for the classification of behavior-oriented social images uploaded on various social media platforms is presented. The proposed method introduces a multimodality concept using skin of different parts of human body and background information, such as indoor and outdoor environments. For each image, the proposed method detects skin candidate components based on R, G, B color spaces and entropy features. The iterative mutual nearest neighbor approach is proposed to detect accurate skin candidate components, which result in foreground components. Next, the proposed method detects the remaining part (other than skin components) as background components based on structure tensor of R, G, B color spaces, and Maximally Stable Extremal Regions (MSER) concept in the wavelet domain. We then explore Hanman Transform for extracting context features from foreground and background components through clustering and fusion operation. These features are then fed to an SVM classifier for the classification of behavior-oriented images. Comprehensive experiments on 10-class datasets of Normal Behavior-Oriented Social media Image (NBSI) and Abnormal Behavior-Oriented Social media Image (ABSI) show that the proposed method is effective and outperforms the existing methods in terms of average classification rate. Also, the results on the benchmark dataset of five classes of personality traits and two classes of emotions of different facial expressions (FERPlus dataset) demonstrated the robustness of the proposed method over the existing methods.

References

[1]
M. Cheung, J. She, and Z. Jie. 2015. Connection discovery using big data of user shared images in social media. IEEE Trans, MM 17, (2015), 1417–1428.
[2]
Y. Hu, L. Manikonda, and S. Kambhampati. 2014. What we Instagram: A first analysis of Instagram photo content and user types. In Proc. AAAI, 595–598.
[3]
L. Liu, D. P. Pietro, Z. R. Samani, M. E. Moghadadam, and L. Ungar. 2016. Analyzing personality through social media profile picture choice. In Proc. ICWSM, 2016.
[4]
A. Tiwari, C. V. D. Weth, and M. Kankanhalli. 2018. Multimodal multiplatform social media event summarization. ACM. TOMM 14, 38 (2018).
[5]
T. Wang and B. Li. 2017. Sentiment analysis for social media images. In Proc. ICDAMW, 1584–1591.
[6]
D. Wang, C. Otto, and A. K. Jain. 2017. Face search at scale. IEEE Trans. PAMI 39, 1122–1136, 2017.
[7]
A. Farzindar and D. Inkpen. 2015. Natural language processing for social media. Synthesis Lectures on Human Language Techniques, Morgan and Claypool Publishers, 2015.
[8]
C. Shen, Z. Jin, W. Chu, R. Jiang, Y. Chen, G. J. Qi, and X. S. Hua. 2019. Multi-level similarity perception network for person re-identification. ACM. TOMM 15, 32 (2019).
[9]
H. Han, C. Otto, X. Liu, and A. K. Jain. 2015. Demographic estimation from face images: Human vs. machine performance. IEEE Trans. PAMI 37, (2015), 1148–1161.
[10]
A. B. Mabrouk and E. Zagrouba. 2018. Abnormal behavior recognition for intelligent video surveillance systems: A review. Expert Systems with Applications 91, (2018), 480–491.
[11]
R. Dwivedi and S. Dey. 2018. Score level fusion for cancelable multi-biometric verification. Pattern Recognition Letters, 2018.
[12]
S. Kumano, K. Otsuka, R. Ishii, and J. Yamato. 2017. Collective first person for automatic gaze analysis in multiparty conversations. IEEE. Trans. MM 1, 107–122.
[13]
X. Huang, A. Dhall, R. Goecke, M. Pietikainen, and G. Zhao. 2017. Multi-modal framework for analyzing the affect of a group of people. IEEE. Trans. MM 8, 1–16, 2017.
[14]
S. Jiang, G. Chen, X. Song, and L. Liu. 2019. Deep patch representation with shared codebook for scene classification. ACM. TOMM 15, 5 (2019).
[15]
Z. Chen, S. Al, and C. Jia. 2019. Structure-aware deep learning for product image classification. ACM. TOMM 15, 4, (2019).
[16]
A. Wang, J. Cai, J. Lu, and T. J. Cham. 2018. Structure-aware multimodal features fusion for RGB-D scene classification and beyond. ACM. TOMM 14, 39 (2018).
[17]
A. G. Howard, M. Zhu, B. Chen, D. Kalenichenko, W. Wang, T. Weyand, M. Andreetto, and H. Adam. 2017. Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017).
[18]
Y. Benezeth, P. M. Jodoin, V. Saligrama, and C. Rosenberger. 2009. Abnormal event detection based on spatio-temporal co-occurences. In Proc. CVPR, 2458–2465, 2009.
[19]
K. V. Beeck, K. V. Engeland, J. Vennekens, and T. Goedeme. 2017. Abnormal behavior detection in LWIR surveillance of railway platforms. In Proc. AVSS 2017.
[20]
C. Rougier, J. Meunier, A. S. Arnaud, and J. Rousseau. 2011. Robust video surveillance for fall detection based on human shape deformation. IEEE Trans. CSVT 21, (2011), 611–622.
[21]
S. Cosar, G. Donatiello, V. Bogorny, C. Garate, L. Otavio, and F. Bremond. 2017. Toward abnormal trajectory and event detection in video surveillance. IEEE Trans. CSVT 27, (2017), 683–695.
[22]
C. W. Liu, H. T. Chen, K. H. Lo, C. J. Wang, and J. H. Chuang. 2017. Accelerating vanishing point based line sampling scheme for real time people localization. IEEE Trans. CSVT 27, (2017), 409–420.
[23]
Y. Zhang, L. Qin, R. Ji, H. Yao, and Q. Huang. 2015. Social attribute aware force model: Exploiting richness of interaction for abnormal crowd detection. IEEE Trans. CSVT 25, (2015), 1231–1245.
[24]
T. Wang and H. Snoussi. 2014. Detection of abnormal visual events via global optical flow orientation histogram. IEEE Trans. IFS 9, (2014), 988–998.
[25]
C. Tiwari, M. Hanmandlu, and S. Vasikarla. 2015. Suspicious face detection based on eye and other facial features movement monitoring. In Proc. AIPR, 1–8.
[26]
S. C. Hsu, C. H. Chuang, C. L. Huang, P. R. Teng, and M. J. Lin. 2018. A video based abnormal human behavior detection for psychiatric patient monitoring. In Proc. IWAIT, 1–4, 2018.
[27]
E. Barsoum, C. Zhang, C. C. Ferrer, and Z. Zhang. 2016. Training deep networks for facial expression recognition with crowd-sourced label distribution. In Proc. ACMMM, 279–283, 2016.
[28]
M. Sharama, A. S. Jalal, and A. Khan. 2019. Emotion recognition using facial expression by using key points descriptor and texture features. Multimedia Tools and Applications 78, (2019), 16195–16219.
[29]
D. Mungra, A. Agrawal, P. Sharma, S. Tanwar, and M. S. Obaidat. 2020. PRATIT: A CNN-based emotion recognition system using histogram equalization and data augmentation. Multimedia Tools and Applications, 2285–2307.
[30]
D. Krishnani, P. Shivakumara, T. Lu, U. Pal, and R. Ramachandra. 2019. Structure function based transform features for behavior-oriented social media image classification. In Proc. ACPR, 594–608.
[31]
J. Grover and M. Hanmandlu. 2018. The fusion of multispectral palmprints using the information set based features and classifier. Engineering Applications of Artificial Intelligence 67, (2018), 111–125.
[32]
A. Sarkar, A. L. Abbott, and Z. Doerzaph. 2017. Universal skin detection without color information. In Proc. WCACV, 20–28.
[33]
K. S. Raghunandan, P. Shivakumara, S. Roy, G. H. Kumar, U. Pal, and T. Lu. 2018. Multi-script-oriented text detection and recognition in video/scene/born digital images. IEEE Trans. CSVT, 1–16.
[34]
J. C. Platt. 1999. Fast training of support vector machines using sequential minimal optimization. Advances in Kernel Methods, 185–208.
[35]
K. Zhang, Z. Zhang, Z. Li, and Y. Qiao. 2016. Joint face detection and alignment using multi-task cascaded convolutional networks. IEEE Signal Processing Letters 23, (2016), 1499–1503.
[36]
H. Lee and C. Kim. 2014. Blurred image region detection and segmentation. In Proc. ICIP 4427–4431, 2014.
[37]
X. Xu, Y. Wang, and S. Chen. 2016. Medical image fusion using discrete fractional wavelet transform. Biomedical Signal Processing and Control 27, 103–111, 2016.
[38]
A. Krizhevsky, I. Sutskever, and G. E. Hinton. 2012. ImageNet classification with deep convolutional neural networks. In Proc. NIPS, 1–9, 2012.
[39]
K. Smonyan and A. Zisserman. 2015. Very deep convolutional networks for large scale image recognition. In Proc. ICLR, 1–14.
[40]
C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erthan, V. Vanhoucke, and A. Rabinovich. 2015. Going deeper with convolutions. In Proc. CVPR, 1–9.
[41]
K. He, X. Zhang, S. Ren, and J. Sun. 2016. Deep residual learning for image recognition. In Proc. CVPR, 770–778, 2016.

Cited By

View all
  • (2024)InteractNet: Social Interaction Recognition for Semantic-rich VideosACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366366820:8(1-21)Online publication date: 12-Jun-2024
  • (2022)Discriminative feature selection with directional outliers correcting for data classificationPattern Recognition10.1016/j.patcog.2022.108541126:COnline publication date: 1-Jun-2022

Index Terms

  1. A New Foreground-Background based Method for Behavior-Oriented Social Media Image Classification

        Recommendations

        Comments

        Information & Contributors

        Information

        Published In

        cover image ACM Transactions on Multimedia Computing, Communications, and Applications
        ACM Transactions on Multimedia Computing, Communications, and Applications  Volume 17, Issue 4
        November 2021
        529 pages
        ISSN:1551-6857
        EISSN:1551-6865
        DOI:10.1145/3492437
        Issue’s Table of Contents

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        Published: 12 November 2021
        Accepted: 01 March 2021
        Revised: 01 August 2020
        Received: 01 July 2019
        Published in TOMM Volume 17, Issue 4

        Permissions

        Request permissions for this article.

        Check for updates

        Author Tags

        1. Social media
        2. person traits
        3. person behavior
        4. hanman transform
        5. person identification
        6. behavior oriented social media image classification

        Qualifiers

        • Research-article
        • Refereed

        Funding Sources

        • Natural Science Foundation of China
        • Faculty

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • Downloads (Last 12 months)45
        • Downloads (Last 6 weeks)1
        Reflects downloads up to 20 Jan 2025

        Other Metrics

        Citations

        Cited By

        View all
        • (2024)InteractNet: Social Interaction Recognition for Semantic-rich VideosACM Transactions on Multimedia Computing, Communications, and Applications10.1145/366366820:8(1-21)Online publication date: 12-Jun-2024
        • (2022)Discriminative feature selection with directional outliers correcting for data classificationPattern Recognition10.1016/j.patcog.2022.108541126:COnline publication date: 1-Jun-2022

        View Options

        Login options

        Full Access

        View options

        PDF

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader

        Full Text

        View this article in Full Text.

        Full Text

        HTML Format

        View this article in HTML Format.

        HTML Format

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media