skip to main content
10.1145/3488933.3488952acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiprConference Proceedingsconference-collections
research-article

CSCP-Net: Cuboid-Shaped Child-Pyramid Augmentation

Authors Info & Claims
Published:25 February 2022Publication History

ABSTRACT

Feature fusion is an important trick to improve the performance of object detectors at present. When extracting features of a certain scale, we must consider the semantics of all feature points in a large neighborhood centered on this feature. With the expansion of the reception field, the information entropy of feature points decreases and becomes easier to learn. Therefore, we propose a new feature fusion method – Look Around, which is different from the previous FPN, PAFPN, BiFPN, etc. Our feature fusion will make full use of the relationship between these feature points and supplement the semantic information of target feature points by neighbors. After extensive experiments on the PASCAL VOC dataset, the result shows that the Look Around Fusion method improves mAP by 3.5%, which is better than FPN and FSSD.

References

  1. B. Li, J. Yan, W. Wu, Z. Zhu, and X. Hu, “High Performance Visual Tracking with Siamese Region Proposal Network,” Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 8971–8980, 2018, doi: 10.1109/CVPR.2018.00935.Google ScholarGoogle ScholarCross RefCross Ref
  2. K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” 3rd Int. Conf. Learn. Represent. ICLR 2015 - Conf. Track Proc., pp. 1–14, 2015.Google ScholarGoogle Scholar
  3. S. Ren, K. He, R. Girshick, and J. Sun, “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 39, no. 6, pp. 1137–1149, 2017, doi: 10.1109/TPAMI.2016.2577031.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Tan, R. Pang, and Q. V Le, “EfficientDet: Scalable and Efficient Object Detection.”Google ScholarGoogle Scholar
  5. K. He, G. Gkioxari, P. Dollár, and R. Girshick, “Mask R-CNN,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 42, no. 2, pp. 386–397, 2020, doi: 10.1109/TPAMI.2018.2844175.Google ScholarGoogle ScholarCross RefCross Ref
  6. X. Li, T. Lai, S. Wang, Q. Chen, C. Yang, and R. Chen, “Weighted feature pyramid networks for object detection,” Proc. - 2019 IEEE Intl Conf Parallel Distrib. Process. with Appl. Big Data Cloud Comput. Sustain. Comput. Commun. Soc. Comput. Networking, ISPA/BDCloud/SustainCom/SocialCom 2019, pp. 1500–1504, 2019, doi: 10.1109/ISPA-BDCloud-SustainCom-SocialCom48970.2019.00217.Google ScholarGoogle ScholarCross RefCross Ref
  7. S. Liu, L. Qi, H. Qin, J. Shi, and J. Jia, “Path Aggregation Network for Instance Segmentation,” Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 8759–8768, 2018, doi: 10.1109/CVPR.2018.00913.Google ScholarGoogle ScholarCross RefCross Ref
  8. C. Guo, B. Fan, Q. Zhang, S. Xiang, and C. Pan, “AUGFPN: Improving multi-scale feature learning for object detection,” Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 12592–12601, 2020, doi: 10.1109/CVPR42600.2020.01261.Google ScholarGoogle ScholarCross RefCross Ref
  9. G. Ghiasi, T. Y. Lin, and Q. V. Le, “NAS-FPN: Learning scalable feature pyramid architecture for object detection,” Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., vol. 2019-June, pp. 7029–7038, 2019, doi: 10.1109/CVPR.2019.00720.Google ScholarGoogle ScholarCross RefCross Ref
  10. Z. X. Li and F. Q. Zhou, “FSSD: Feature fusion single shot multibox detector,” arXiv, vol. 1, 2017.Google ScholarGoogle Scholar
  11. Y. Luo , “CE-FPN: Enhancing Channel Information for Object Detection,” vol. 14, no. 8, pp. 1–9, 2015.Google ScholarGoogle Scholar
  12. R. Girshick, J. Donahue, T. Darrell, and J. Malik, “Rich feature hierarchies for accurate object detection and semantic segmentation,” Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., pp. 580–587, 2014, doi: 10.1109/CVPR.2014.81.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. R. Girshick, “Fast R-CNN,” Proc. IEEE Int. Conf. Comput. Vis., vol. 2015 Inter, pp. 1440–1448, 2015, doi: 10.1109/ICCV.2015.169.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You only look once: Unified, real-time object detection,” Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., vol. 2016-Decem, pp. 779–788, 2016, doi: 10.1109/CVPR.2016.91.Google ScholarGoogle ScholarCross RefCross Ref
  15. J. Redmon and A. Farhadi, “YOLO9000: Better, faster, stronger,” Proc. - 30th IEEE Conf. Comput. Vis. Pattern Recognition, CVPR 2017, vol. 2017-Janua, pp. 6517–6525, 2017, doi: 10.1109/CVPR.2017.690.Google ScholarGoogle ScholarCross RefCross Ref
  16. J. Redmon and A. Farhadi, “YOLO v.3,” Tech Rep., pp. 1–6, 2018, [Online]. Available: https://pjreddie.com/media/files/papers/YOLOv3.pdf.Google ScholarGoogle Scholar
  17. A. Bochkovskiy, C. Y. Wang, and H. Y. M. Liao, “YOLOv4: Optimal Speed and Accuracy of Object Detection,” arXiv, 2020.Google ScholarGoogle Scholar
  18. W. Liu , “SSD: Single shot multibox detector,” Lect. Notes Comput. Sci. (including Subser. Lect. Notes Artif. Intell. Lect. Notes Bioinformatics), vol. 9905 LNCS, pp. 21–37, 2016, doi: 10.1007/978-3-319-46448-0_2.Google ScholarGoogle Scholar
  19. C. V Mar, “CornerNet: Detecting Objects as Paired Keypoints,” 2017.Google ScholarGoogle Scholar
  20. Q. Zhao , “M2det: A single-shot object detector based on multi-level feature pyramid network,” 33rd AAAI Conf. Artif. Intell. AAAI 2019, 31st Innov. Appl. Artif. Intell. Conf. IAAI 2019 9th AAAI Symp. Educ. Adv. Artif. Intell. EAAI 2019, pp. 9259–9266, 2019, doi: 10.1609/aaai.v33i01.33019259.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. T. Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollar, “Focal Loss for Dense Object Detection,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 42, no. 2, pp. 318–327, 2020, doi: 10.1109/TPAMI.2018.2858826.Google ScholarGoogle ScholarCross RefCross Ref
  22. H. Zhao, J. Shi, X. Qi, X. Wang, J. Jia, and S. G. Limited, “Pyramid Scene Parsing Network.”Google ScholarGoogle Scholar
  23. F. Yu and V. Koltun, “Multi-scale context aggregation by dilated convolutions,” 4th Int. Conf. Learn. Represent. ICLR 2016 - Conf. Track Proc., 2016.Google ScholarGoogle Scholar
  24. K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” Proc. IEEE Comput. Soc. Conf. Comput. Vis. Pattern Recognit., vol. 2016-Decem, pp. 770–778, 2016, doi: 10.1109/CVPR.2016.90.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. CSCP-Net: Cuboid-Shaped Child-Pyramid Augmentation
      Index terms have been assigned to the content through auto-classification.

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        AIPR '21: Proceedings of the 2021 4th International Conference on Artificial Intelligence and Pattern Recognition
        September 2021
        715 pages
        ISBN:9781450384087
        DOI:10.1145/3488933

        Copyright © 2021 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 25 February 2022

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed limited
      • Article Metrics

        • Downloads (Last 12 months)7
        • Downloads (Last 6 weeks)0

        Other Metrics

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      HTML Format

      View this article in HTML Format .

      View HTML Format