skip to main content
10.1145/3606042.3616462acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Intelligent Classification of Multimedia Images Based on Class Information Mining

Published:29 October 2023Publication History

ABSTRACT

The classification research of multimedia images has always been of great concern, and related technologies are constantly being improved to increase accuracy. Adequate evaluation and mining of sample information is an important direction, but it is always a challenge. In this article, we propose a method for constructing deep learning training datasets, which fully considers the intra class and inter class features of the samples. The intra class dispersion of the sample is evaluated by the distance from the sample features to the prototype, while inter class confusion between classes is evaluated by combining the distance of the prototype in the feature space with intra class dispersion. Based on the intra class and inter class features of samples, determine the proportion of imbalanced construction to achieve the construction of imbalanced datasets. This method has the potential to be applied to different multimedia visual tasks.

References

  1. Mateusz Buda, Atsuto Maki, and Maciej A Mazurowski. 2018. A systematic study of the class imbalance problem in convolutional neural networks. Neural networks , Vol. 106 (2018), 249--259.Google ScholarGoogle Scholar
  2. Yin Cui, Menglin Jia, Tsung-Yi Lin, Yang Song, and Serge Belongie. 2019. Class-balanced loss based on effective number of samples. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 9268--9277.Google ScholarGoogle ScholarCross RefCross Ref
  3. Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770--778.Google ScholarGoogle ScholarCross RefCross Ref
  4. Youngkyu Hong, Seungju Han, Kwanghee Choi, Seokjun Seo, Beomsu Kim, and Buru Chang. 2021. Disentangling label distribution for long-tailed visual recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 6626--6636.Google ScholarGoogle ScholarCross RefCross Ref
  5. Haeyong Kang, Thang Vu, and Chang D Yoo. 2021. Learning imbalanced datasets with maximum margin loss. In 2021 IEEE International Conference on Image Processing (ICIP). IEEE, 1269--1273.Google ScholarGoogle ScholarCross RefCross Ref
  6. Guipeng Lan, Shuai Xiao, Jiabao Wen, Desheng Chen, and Yong Zhu. 2022. Data-Driven Deepfake Forensics Model Based on Large-Scale Frequency and Noise Features. IEEE Intelligent Systems (2022).Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Yang Li and Xuewei Chao. 2022. Distance-entropy: an effective indicator for selecting informative data. Frontiers in Plant Science , Vol. 12 (2022), 818895.Google ScholarGoogle ScholarCross RefCross Ref
  8. Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision. 2980--2988.Google ScholarGoogle ScholarCross RefCross Ref
  9. Ziwei Liu, Zhongqi Miao, Xiaohang Zhan, Jiayun Wang, Boqing Gong, and Stella X Yu. 2019. Large-scale long-tailed recognition in an open world. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 2537--2546.Google ScholarGoogle ScholarCross RefCross Ref
  10. Jiawei Ren, Cunjun Yu, Xiao Ma, Haiyu Zhao, Shuai Yi, et al. 2020. Balanced meta-softmax for long-tailed visual recognition. Advances in neural information processing systems , Vol. 33 (2020), 4175--4186.Google ScholarGoogle Scholar
  11. Jiawei Ren, Mingyuan Zhang, Cunjun Yu, and Ziwei Liu. 2022. Balanced mse for imbalanced visual regression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 7926--7935.Google ScholarGoogle ScholarCross RefCross Ref
  12. Samarth Sinha, Sayna Ebrahimi, and Trevor Darrell. 2019. Variational adversarial active learning. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 5972--5981.Google ScholarGoogle ScholarCross RefCross Ref
  13. Oriol Vinyals, Charles Blundell, Timothy Lillicrap, Daan Wierstra, et al. 2016. Matching networks for one shot learning. Advances in neural information processing systems , Vol. 29 (2016).Google ScholarGoogle Scholar
  14. Shuai Xiao, Guipeng Lan, Jiachen Yang, Yang Li, and Jiabao Wen. 2022. Securing the socio-cyber world: multiorder attribute node association classification for manipulated media. IEEE Transactions on Computational Social Systems 99 (2022), 1--10.Google ScholarGoogle ScholarCross RefCross Ref
  15. Shuai Xiao, Guipeng Lan, Jiachen Yang, Wen Lu, Qinggang Meng, and Xinbo Gao. 2023. MCS-GAN: A Different Understanding for Generalization of Deep Forgery Detection. IEEE Transactions on Multimedia (2023).Google ScholarGoogle Scholar
  16. Jiachen Yang, Guipeng Lan, Yang Li, Yicheng Gong, Zhuo Zhang, and Sezai Ercisli. 2022. Data quality assessment and analysis for pest identification in smart agriculture. Computers and Electrical Engineering , Vol. 103 (2022), 108322.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Donggeun Yoo and In So Kweon. 2019. Learning loss for active learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 93--102.Google ScholarGoogle ScholarCross RefCross Ref
  18. Weiping Yu, Taojiannan Yang, and Chen Chen. 2021. Towards resolving the challenge of long-tail distribution in UAV images for object detection. In Proceedings of the IEEE/CVF winter conference on applications of computer vision. 3258--3267.Google ScholarGoogle ScholarCross RefCross Ref
  19. Boyan Zhou, Quan Cui, Xiu-Shen Wei, and Zhao-Min Chen. 2020. Bbn: Bilateral-branch network with cumulative learning for long-tailed visual recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition. 9719--9728.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Intelligent Classification of Multimedia Images Based on Class Information Mining

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        AMC-SME '23: Proceedings of the 2023 Workshop on Advanced Multimedia Computing for Smart Manufacturing and Engineering
        October 2023
        83 pages
        ISBN:9798400702730
        DOI:10.1145/3606042

        Copyright © 2023 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 29 October 2023

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Upcoming Conference

        MM '24
        MM '24: The 32nd ACM International Conference on Multimedia
        October 28 - November 1, 2024
        Melbourne , VIC , Australia
      • Article Metrics

        • Downloads (Last 12 months)26
        • Downloads (Last 6 weeks)6

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader