ABSTRACT
In this paper we propose ÖWF-OD, a new dataset for object detection in archival film content. The dataset enables the evaluation of object detection methods on image data with very different image qualities. 1,000 selected keyframes from 100 hours of video material have been annotated, 4,480 bounding boxes are labeled. In addition to these annotations, image quality measures were calculated for the keyframes in order to assess the influence of image quality on object detection. We evaluate different versions of the YOLO object detector to provide a baseline of object detection results for this dataset. The annotation data and the code to extract the keyframes are provided at https://github.com/TailoredMediaProject/OEWF_ObjectDetection.
- Hannes Fassold, Stefanie Wechtitsch, Albert Hofmann, Werner Bailer, Peter Schallauer, Roberto Borgotallo, Alberto Messina, Mohan Liu, Patrick Ndjiki-Nya, and Peter Altendorf. 2012. Automated visual quality analysis for media production. In 2012 IEEE International Symposium on Multimedia. IEEE, 394–400.Google ScholarDigital Library
- Harri Kiiskinen and Tommi Römpötti. 2021. MoMaF Movie metadata for Finnish Fiction Films, technical metadata dataset 1. https://doi.org/10.5281/zenodo.4923146 Updated data for film lengths and durations..Google ScholarCross Ref
- Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Dollár, and C Lawrence Zitnick. 2014. Microsoft coco: Common objects in context. In Computer Vision–ECCV 2014: 13th European Conference, Zurich, Switzerland, September 6-12, 2014, Proceedings, Part V 13. Springer, 740–755.Google ScholarCross Ref
- Markus Mühling, Manja Meister, Nikolaus Korfhage, Jörg Wehling, Angelika Hörth, Ralph Ewerth, and Bernd Freisleben. 2016. Content-based video retrieval in historical collections of the German Broadcasting Archive. International Journal on Digital Libraries 20 (2016), 167–183.Google ScholarDigital Library
- J. Redmon, S. Divvala, R. Girshick, and A. Farhadi. 2016. You Only Look Once: Unified, Real-Time Object Detection. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society, Los Alamitos, CA, USA, 779–788. https://doi.org/10.1109/CVPR.2016.91Google ScholarCross Ref
- Joseph Redmon and Ali Farhadi. 2018. YOLOv3: An Incremental Improvement. CoRR abs/1804.02767 (2018). arXiv:1804.02767http://arxiv.org/abs/1804.02767Google Scholar
- Xiaofeng Ren. 2008. Finding people in archive films through tracking. In 2008 IEEE Conference on Computer Vision and Pattern Recognition. IEEE, 1–8.Google Scholar
- Artem Reshetnikov, Maria-Cristina Marinescu, and Joaquim More Lopez. 2023. DEArt: Dataset of European Art. In Computer Vision–ECCV 2022 Workshops: Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part I. Springer, 218–233.Google Scholar
- Markus Seidl, Matthias Zeppelzauer, Dalibor Mitrović, and Christian Breiteneder. 2011. Gradual Transition Detection in Historic Film Material—a Systematic Study. Journal on Computing and Cultural Heritage 4, 3, Article 10 (dec 2011), 18 pages. https://doi.org/10.1145/2069276.2069279Google ScholarDigital Library
- Georg Thallinger and Werner Bailer. 2021. Automatic Analysis of Amateur Film and Video Collections. In 2021 International Conference on Content-Based Multimedia Indexing (CBMI). IEEE, 1–4.Google Scholar
- C. Wang, A. Bochkovskiy, and H. Liao. 2021. Scaled-YOLOv4: Scaling Cross Stage Partial Network. In 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE Computer Society, Los Alamitos, CA, USA, 13024–13033. https://doi.org/10.1109/CVPR46437.2021.01283Google ScholarCross Ref
- Chien-Yao Wang, Alexey Bochkovskiy, and Hong-Yuan Mark Liao. 2022. YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors. arxiv:2207.02696 [cs.CV]Google Scholar
- Kin-Yiu Wong. 2020. Pytorch_YOLOv4. https://github.com/WongKinYiu/PyTorch_YOLOv4/tree/u5Google Scholar
- Wen Xiang. 2022. Object Detection in Finnish Movies. Master’s thesis. Aalto University. School of Science. http://urn.fi/URN:NBN:fi:aalto-202208285192Google Scholar
- Matthias Zeppelzauer, Dalibor Mitrovic, and Christian Breiteneder. 2008. Analysis of Historical Artistic Documentaries. In 2008 Ninth International Workshop on Image Analysis for Multimedia Interactive Services. 201–206. https://doi.org/10.1109/WIAMIS.2008.11Google ScholarDigital Library
Index Terms
- ÖWF-OD: A Dataset for Object Detection in Archival Film Content
Recommendations
Tools, techniques, datasets and application areas for object detection in an image: a review
AbstractObject detection is one of the most fundamental and challenging tasks to locate objects in images and videos. Over the past, it has gained much attention to do more research on computer vision tasks such as object classification, counting of ...
DEArt: Dataset of European Art
Computer Vision – ECCV 2022 WorkshopsAbstractLarge datasets that were made publicly available to the research community over the last 20 years have been a key enabling factor for the advances in deep learning algorithms for NLP or computer vision. These datasets are generally pairs of ...
Dataset and Benchmarking of Real-Time Embedded Object Detection for RoboCup SSL
RoboCup 2021: Robot World Cup XXIVAbstractWhen producing a model to object detection in a specific context, the first obstacle is to have a dataset labeling the desired classes. In RoboCup, some leagues already have more than one dataset to train and evaluate a model. However, in the ...
Comments