Solving Class Imbalance Problem in Target Detection with a Squared Cross Entropy Based Method

Chen, Guanyu; Wang, Quanyu; Li, Qi; Hu, Jun; Liu, Jingyi

doi:10.1007/978-981-99-4742-3_10

Guanyu Chen¹³,
Quanyu Wang^13,14,
Qi Li¹⁵,
Jun Hu¹³ &
…
Jingyi Liu¹³

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14087))

Included in the following conference series:

International Conference on Intelligent Computing

902 Accesses

Abstract

The foreground-background class imbalance in target detection is inevitable, which is caused by the training data set. Specifically, the number of targets contained in any image of the training data set is generally very small, that is, the number of positive examples is small, while the number of the negative examples from the background is large. Therefore, the ability of the algorithm to detect the negatives is stronger than that of positive examples. The Focal Loss algorithm solves this problem by improving the classification loss function. However, Focal Loss brings additional hyper-parameters, which remains to be further adjusted. This paper refers to the idea of Focal Loss from the classification loss function, and proposes new a classification loss function SCE that is similar to Focal Loss but does not contain any extra hyper-parameters. Experiments in the paper prove that SCE can obtain performance equivalent to Focal Loss without introducing hyper-parameters.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zou, Z., Shi, Z., Guo, Y., Ye, J.: Object Detection in 20 Years: A Survey. arXiv Prepr arXiv:1905.05055v2 (2019)
Wu, X., Sahoo, D., Hoi, S.: Recent advances in deep learning for object detection. Neurocomputing 396, 39–64 (2020). https://doi.org/10.1016/j.neucom.2020.01.085
Article Google Scholar
Hariharan, B., Arbeláez, P., Girshick, R., Malik, J.: Simultaneous detection and segmentation. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8695, pp. 297–312. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-10584-0_20
Chapter Google Scholar
Hariharan, B., Arbelaez, P., Girshick, R., Malik, J.: Hypercolumns for object segmentation and fine-grained localization. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, pp. 447–456 (2015). https://doi.org/10.1109/CVPR.2015.7298642
Dai, J., He, K., Sun, J.: Instance-aware semantic segmentation via multi-task network cascades. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, pp. 3150–3158 (2016). https://doi.org/10.1109/CVPR.2016.343
He, K., Gkioxari, G., Dollar, P., Girshick, R.: Mask R-CNN. In: 2017 IEEE International Conference on Computer Vision (ICCV), Venice, pp. 2980–2988 (2017). https://doi.org/10.1109/ICCV.2017.322
Karpathy, A., Fei-Fei, L.: Deep visual-semantic alignments for generating image descriptions. IEEE Trans. Pattern Anal. Mach. Intell. 39(4), 664–676 (2017). https://doi.org/10.1109/TPAMI.2016.2598339
Article Google Scholar
Kang, K., et al.: T-CNN: tubelets with convolutional neural networks for object detection from videos. IEEE Trans. Circuits Syst. Video Technol. 28(10), 2896–2907 (2018). https://doi.org/10.1109/TCSVT.2017.2736553
Article Google Scholar
LeCun, Y., Bengio, Y., Hinton, G.: Deep learning. Nature 521(7553), 436–444 (2015). https://doi.org/10.1038/nature14539
Article Google Scholar
Chen, G., Cai, Z., Li, X.: Recognition and classification of high-resolution remote sensing image based on convolutional neural. Int. J. Performability Eng. 14(11), 2852–2863 (2018)
Google Scholar
Chen, G., Cai, Z., Li, X.: Classification of remote sensing images based on distributed convolutional neural network model. Int. J. Performability Eng. 15(6), 1508–1517 (2019)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.: Imagenet classification with deep convolutional neural networks. In: NIPS, vol. 25. Curran Associates Inc. (2012)
Google Scholar
Girshick, R., Donahue, J., Darrell, T., Malik, J.: Region-based convolutional networks for accurate object detection and segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 38(1), 142–158 (2016). https://doi.org/10.1109/TPAMI.2015.2437384
Article Google Scholar
Lin, T., Goyal, P., Girshick, R., He, K., Dollar, P.: Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. 42(2), 318–327 (2020). https://doi.org/10.1109/TPAMI.2018.2858826
Article Google Scholar
Zhang, Z., Qiao, S., Xie, C., Shen, W., Wang, B., Yuille, A.: Single-shot object detection with enriched semantics. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, pp. 5813–5821 (2018). https://doi.org/10.1109/CVPR.2018.00609
Lyu, S., Cai, X., Feng, R.: YOLOv3 network based on improved loss function. Comput. Syst. Appl. 28(2), 1–7 (2019). https://doi.org/10.15888/j.cnki.csa.006772
Article Google Scholar
Li, Y., Hou, L., Wang, C.: Moving objects detection in automatic driving based on YOLOv3. Comput. Eng. Des. 40(4) (2019)
Google Scholar
Jin, Y., Luo, N.: Improved YOLOv2 vehicle real-time detection algorithm combined with multi-scale features. Comput. Eng. Des. 40(05) (2019)
Google Scholar
Oksuz, K., Cam, B., Kalkan, S., Akbas, E.: Imbalance problems in object detection: a review. IEEE Trans. Pattern Anal. Mach. Intell. (2021). https://doi.org/10.1109/TPAMI.2020.2981890
Article Google Scholar

Download references

Author information

Authors and Affiliations

Informatization Office, China University of Geosciences, Wuhan, 430074, China
Guanyu Chen, Quanyu Wang, Jun Hu & Jingyi Liu
School of Computer Science, China University of Geosciences, Wuhan, 430074, China
Quanyu Wang
Informatization Office, China University of Geosciences, Wuhan, 430074, China
Qi Li

Authors

Guanyu Chen
View author publications
You can also search for this author in PubMed Google Scholar
Quanyu Wang
View author publications
You can also search for this author in PubMed Google Scholar
Qi Li
View author publications
You can also search for this author in PubMed Google Scholar
Jun Hu
View author publications
You can also search for this author in PubMed Google Scholar
Jingyi Liu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Quanyu Wang or Qi Li .

Editor information

Editors and Affiliations

Department of Computer Science, Eastern Institute of Technology, Zhejiang, China
De-Shuang Huang
University of Wollongong, North Wollongong, NSW, Australia
Prashan Premaratne
Zhengzhou University of Light Industry, Zhengzhou, China
Baohua Jin
Zhong Yuan University of Technology, Zhengzhou, China
Boyang Qu
University of Ulsan, Ulsan, Korea (Republic of)
Kang-Hyun Jo
Department of Computer Science, Liverpool John Moores University, Liverpool, UK
Abir Hussain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chen, G., Wang, Q., Li, Q., Hu, J., Liu, J. (2023). Solving Class Imbalance Problem in Target Detection with a Squared Cross Entropy Based Method. In: Huang, DS., Premaratne, P., Jin, B., Qu, B., Jo, KH., Hussain, A. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2023. Lecture Notes in Computer Science, vol 14087. Springer, Singapore. https://doi.org/10.1007/978-981-99-4742-3_10

Download citation

DOI: https://doi.org/10.1007/978-981-99-4742-3_10
Published: 30 July 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-4741-6
Online ISBN: 978-981-99-4742-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics