DOI: 10.1145/3595916.3628349
research-article

One-Epoch Training for Object Detection in Fisheye Images

Published: 01 January 2024

ABSTRACT

The challenge has two stages: a qualification round and a final competition. Participants are given regular (non-fisheye) image data for training but must perform detection on images with a fisheye effect. Our approach therefore starts by transforming the original images to mimic fisheye images for training. The challenge also limits computational resources, so balancing accuracy and speed is crucial. In this paper, we show that our approach can achieve high performance with just one epoch of training. We ranked first among 24 participating teams in the qualification competition and fourth among the 11 teams with successful submissions in the final competition. The corresponding source code will be available at: One-Epoch Training for Object Detection in Fisheye Images.
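The abstract does not say which transformation is used to mimic the fisheye effect. As a minimal sketch only, assuming an equidistant fisheye model (r_d = f·θ) applied to an ordinary perspective image (r_u = f·tan θ), the warp and the matching label-coordinate mapping could look like the following. The function names `fisheye_warp` and `warp_point` and the `focal` parameter are hypothetical and are not taken from the paper or its code.

```python
import math
import numpy as np


def fisheye_warp(image: np.ndarray, focal: float) -> np.ndarray:
    """Warp a perspective image into a synthetic equidistant-fisheye view.

    Assumption: r_d = focal * theta for the fisheye image and
    r_u = focal * tan(theta) for the source image; nearest-neighbour sampling.
    """
    h, w = image.shape[:2]
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    ys, xs = np.mgrid[0:h, 0:w].astype(np.float64)
    dx, dy = xs - cx, ys - cy
    r_d = np.hypot(dx, dy)                      # radius in the output (fisheye) image
    theta = r_d / focal                         # incidence angle under the equidistant model
    valid = theta < (np.pi / 2 - 1e-6)          # tan() is only meaningful below 90 degrees
    r_u = np.where(valid, focal * np.tan(np.where(valid, theta, 0.0)), 0.0)
    scale = np.where(r_d > 0, r_u / np.maximum(r_d, 1e-12), 1.0)
    src_x = np.rint(cx + dx * scale).astype(np.int64)
    src_y = np.rint(cy + dy * scale).astype(np.int64)
    inside = valid & (src_x >= 0) & (src_x < w) & (src_y >= 0) & (src_y < h)
    out = np.zeros_like(image)                  # pixels with no source stay black
    out[inside] = image[src_y[inside], src_x[inside]]
    return out


def warp_point(x: float, y: float, shape: tuple, focal: float) -> tuple:
    """Map a source-image point (e.g. a box corner) into the fisheye image."""
    h, w = shape[:2]
    cy, cx = (h - 1) / 2.0, (w - 1) / 2.0
    dx, dy = x - cx, y - cy
    r_u = math.hypot(dx, dy)
    if r_u == 0.0:
        return (cx, cy)
    r_d = focal * math.atan(r_u / focal)        # inverse of r_u = focal * tan(r_d / focal)
    s = r_d / r_u
    return (cx + dx * s, cy + dy * s)
```

A smaller `focal` gives a stronger distortion. Bounding-box corners would need to be passed through `warp_point` and re-enclosed in an axis-aligned box so that the labels stay consistent with the warped pixels.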

Published in

MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in Asia
December 2023
745 pages
ISBN: 9798400702051
DOI: 10.1145/3595916

Copyright © 2023 ACM

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.

Publisher

Association for Computing Machinery, New York, NY, United States


Qualifiers

• research-article
• Research
• Refereed limited

Acceptance Rates

Overall Acceptance Rate: 59 of 204 submissions, 29%
