research-article

One-Epoch Training for Object Detection in Fisheye Images

Author:

Yu-Hsi ChenAuthors Info & Claims

MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in Asia

Article No.: 110, Pages 1 - 5

https://doi.org/10.1145/3595916.3628349

Published: 01 January 2024 Publication History

Abstract

This challenge is divided into two stages: qualification and final competition. We will acquire regular image data and need to perform detection on images with a fisheye effect. The approach described in this context begins by taking the original images and transforming them to mimic fisheye effect images for training. Furthermore, this challenge imposes limitations on computational resources, so striking a balance between accuracy and speed is a crucial aspect. In this paper, we asserted that our approach for this competition can achieve high performance with just one epoch of training. In summary, we achieved the top position among 24 participating teams in the qualification competition and secured the fourth position among the 11 successful submitted teams in the final competition. The corresponding source code will be available at: One-Epoch Training for Object Detection in Fisheye Images.

References

[1]

Chun-Hao Chao, Pin-Lun Hsu, Hung-Yi Lee, and Yu-Chiang Frank Wang. 2020. Self-supervised deep learning for fisheye image rectification. In ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2248–2252.

[2]

Xun Cheng and Jianbo Yu. 2020. RetinaNet with difference channel attention and adaptively spatial feature fusion for steel surface defect detection. IEEE Transactions on Instrumentation and Measurement 70 (2020), 1–11.

[3]

Ekin D Cubuk, Barret Zoph, Dandelion Mane, Vijay Vasudevan, and Quoc V Le. 2018. Autoaugment: Learning augmentation policies from data. arXiv preprint arXiv:1805.09501 (2018).

[4]

Zhihao Duan, Ozan Tezcan, Hayato Nakamura, Prakash Ishwar, and Janusz Konrad. 2020. Rapid: rotation-aware people detection in overhead fisheye images. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops. 636–637.

[5]

Siqi Fan. 2018. Conversion Tool for FishEye Dataset. https://github.com/leofansq/Tools_KITTI2FishEye

[6]

Munkhjargal Gochoo, Munkh-Erdene Otgonbold, Erkhembayar Ganbold, Jun-Wei Hsieh, Ming-Ching Chang, Ping-Yang Chen, Byambaa Dorj, Hamad Al Jassmi, Ganzorig Batnasan, Fady Alnajjar, 2023. FishEye8K: A Benchmark and Dataset for Fisheye Camera Object Detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 5304–5312.

[7]

Aran Komatsuzaki. 2019. One epoch is all you need. arXiv preprint arXiv:1906.06669 (2019).

[8]

Brett Koonce and Brett Koonce. 2021. EfficientNet. Convolutional Neural Networks with Swift for Tensorflow: Image Recognition and Dataset Categorization (2021), 109–123.

[9]

Oded Krams and Nahum Kiryati. 2017. People detection in top-view fisheye imaging. In 2017 14th IEEE international conference on advanced video and signal based surveillance (AVSS). IEEE, 1–6.

[10]

Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. 2012. Imagenet classification with deep convolutional neural networks. Advances in neural information processing systems 25 (2012).

[11]

Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13. Springer, 740–755.

[12]

Zhiqiu Lin, Jin Sun, Abe Davis, and Noah Snavely. 2020. Visual chirality. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 12295–12303.

[13]

Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. 2016. Ssd: Single shot multibox detector. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part I 14. Springer, 21–37.

[14]

I-chan Lo, Kuang-tsu Shih, and Homer H Chen. 2018. Image stitching for dual fisheye cameras. In 2018 25th IEEE International Conference on Image Processing (ICIP). IEEE, 3164–3168.

[15]

Yen-Sok Poon, Chih-Chun Lin, Yu-Hsuan Liu, and Chih-Peng Fan. 2022. YOLO-based deep learning design for in-cabin monitoring system with fisheye-lens camera. In 2022 IEEE International Conference on Consumer Electronics (ICCE). IEEE, 1–4.

[16]

Hazem Rashed, Eslam Mohamed, Ganesh Sistu, Varun Ravi Kumar, Ciarán Eising, Ahmad El-Sallab, and SK Yogamani. 2020. FisheyeYOLO: Object detection on fisheye cameras for autonomous driving. In Proceedings of the Machine Learning for Autonomous Driving NeurIPS 2020 Virtual Workshop, Virtual, Vol. 11.

[17]

Hazem Rashed, Eslam Mohamed, Ganesh Sistu, Varun Ravi Kumar, Ciaran Eising, Ahmad El-Sallab, and Senthil Yogamani. 2021. Generalized object detection on fisheye cameras for autonomous driving: Dataset, representations and baseline. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2272–2280.

[18]

Mingxing Tan and Quoc Le. 2019. Efficientnet: Rethinking model scaling for convolutional neural networks. In International conference on machine learning. PMLR, 6105–6114.

[19]

Shengbang Tong, Yubei Chen, Yi Ma, and Yann Lecun. 2023. EMP-SSL: Towards Self-Supervised Learning in One Training Epoch. arXiv preprint arXiv:2304.03977 (2023).

[20]

wufish | NYCU PAIR Labs. 2022. iVS-Dataset. https://pairlabs.nycu.edu.tw:52959/?p=28

[21]

Senthil Yogamani, Ciaran Hughes, Jonathan Horgan, Ganesh Sistu, Padraig Varley, Derek O’Dea, Michal Uricar, Stefan Milz, Martin Simon, Karl Amende, Christian Witt, Hazem Rashed, Sumanth Chennupati, Sanjaya Nayak, Saquib Mansoor, Xavier Perrotton, and Patrick Perez. 2019. WoodScape: A Multi-Task, Multi-Camera Fisheye Dataset for Autonomous Driving. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV). https://doi.org/10.1109/ICCV.2019.00940

Cited By

Index Terms

One-Epoch Training for Object Detection in Fisheye Images

Recommendations

Training object detectors from few weakly-labeled and many unlabeled images
Highlights
- A novel method to train detector by few weakly-labeled images and lots of unlabeled images.
Abstract
Weakly-supervised object detection attempts to limit the amount of supervision by dispensing the need for bounding boxes, but still assumes image-level labels on the entire training set. In this work, we study the problem of training ...
Improved YOLOv7 models based on modulated deformable convolution and swin transformer for object detection in fisheye images
Abstract
Thanks to the wide view field, the fisheye camera can get much more visual information. Thus, it is widely used in the field of computer vision. However, projection is often required for fisheye images to be used for object detection. Meanwhile, ...
Highlights
- Modulated Deformable Convolution is introduced into YOLOv7 to effectively detect distorted objects in fisheye images.
- Swin Transformer Block is introduced into YOLOv7 to effectively recognize small objects in fisheye images.
- A SMDC-...
Generation of Panoramic View from 360 Degree Fisheye Images Based on Angular Fisheye Projection
DCABES '11: Proceedings of the 2011 10th International Symposium on Distributed Computing and Applications to Business, Engineering and Science

this paper proposes a algorithm to generate the panoramic view from 360 ãfisheye images based on angular fisheye projection model. Firstly, the image plane will be mapped to the view plane, the sphere plane, to obtain the coordinates of spherical ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MMAsia '23: Proceedings of the 5th ACM International Conference on Multimedia in Asia

December 2023

745 pages

ISBN:9798400702051

DOI:10.1145/3595916

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 01 January 2024

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

MMAsia '23

Sponsor:

SIGMM

MMAsia '23: ACM Multimedia Asia

December 6 - 8, 2023

Tainan, Taiwan

Acceptance Rates

Overall Acceptance Rate 59 of 204 submissions, 29%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

0
Total Citations
79
Total Downloads

Downloads (Last 12 months)46
Downloads (Last 6 weeks)5

Reflects downloads up to 28 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten