research-article

Prioritizing Testing Instances to Enhance the Robustness of Object Detection Systems

Authors:

Jia LiuAuthors Info & Claims

Internetware '23: Proceedings of the 14th Asia-Pacific Symposium on Internetware

Pages 194 - 204

https://doi.org/10.1145/3609437.3609446

Published: 05 October 2023 Publication History

Abstract

Object detection models have been widely deployed in military and life-related intelligent software systems. However, along with the outstanding success of object detection, it may exhibit abnormal behavior and lead to severe accidents and losses. During the development and evaluation process, training and evaluating an object detection model are computationally intensive, while preparing annotated tests requires extremely heavy manual labor. Therefore, reducing the annotation budget of test data collection becomes a challenging and necessary task. Although many test prioritization approaches for DNN-based systems have been proposed, the large differences between classification and object detection make them difficult to apply to testing object detection models.

In this paper, we propose DeepView, a novel instance-level test prioritization tool for object detection models to reduce data annotation costs. DeepView first splits the object detection results into instances, and then computes the localization and classification capabilities of the instances, respectively. Next, we design a test prioritization tool that enables testers to improve model performance by focusing on instances that may cause model errors from a large unlabeled dataset. To evaluate DeepView, we conduct extensive experiments on two kinds of object detection model architectures and two commonly used datasets. The experimental results show that DeepView outperforms existing test prioritization approaches regarding effectiveness and diversity. Also, we observe that using DeepView can effectively improve the accuracy and robustness of object detection models.

References

[1]

2012. Pascal VOC. http://host.robots.ox.ac.uk/pascal/VOC/.

[2]

2022. Python 3.9.15. https://www.python.org/downloads/release/python-3915/.

[3]

2022. Pytorch 1.12.0. https://pytorch.org/get-started/previous-versions/.

[4]

2022. Pytorch Faster-RCNN on COCO. https://pytorch.org/vision/stable/models/generated/torchvision.models.detection.fasterrcnn_resnet50_fpn_v2.html.

[5]

2022. Pytorch SSD on COCO. https://pytorch.org/vision/stable/models/generated/torchvision.models.detection.ssd300_vgg16.html.

[6]

Clemens-Alexander Brust, Christoph Käding, and Joachim Denzler. 2018. Active learning for deep object detection. arXiv preprint arXiv:1809.09875 (2018).

[7]

Tsong Y Chen, Shing C Cheung, and Shiu Ming Yiu. 2020. Metamorphic testing: a new approach for generating next test cases. arXiv preprint arXiv:2002.12543 (2020).

[8]

Tsong Yueh Chen, Hing Leung, and Ieng Kei Mak. 2005. Adaptive random testing. In Advances in Computer Science-ASIAN 2004. Higher-Level Decision Making: 9th Asian Computing Science Conference. Dedicated to Jean-Louis Lassez on the Occasion of His 5th Birthday. Chiang Mai, Thailand, December 8-10, 2004. Proceedings 9. Springer, 320–329.

[9]

Google Cloud. 2022. AI Platform Data Labeling Service pricing. https://cloud.google.com/ai-platform/data-labeling/pricing.

[10]

Jifeng Dai, Yi Li, Kaiming He, and Jian Sun. 2016. R-fcn: Object detection via region-based fully convolutional networks. Advances in neural information processing systems 29 (2016).

[11]

Yang Feng, Qingkai Shi, Xinyu Gao, Jun Wan, Chunrong Fang, and Zhenyu Chen. 2020. DeepGini: prioritizing massive tests to enhance the robustness of deep neural networks. In ISSTA ’20: 29th ACM SIGSOFT International Symposium on Software Testing and Analysis, Virtual Event, USA, July 18-22, 2020, Sarfraz Khurshid and Corina S. Pasareanu (Eds.). ACM, 177–188. https://doi.org/10.1145/3395363.3397357

Digital Library

[12]

Zhanpeng Feng, Shiliang Zhang, Rinyoichi Takezoe, Wenze Hu, Manmohan Chandraker, Li-Jia Li, Vijay K Narayanan, and Xiaoyu Wang. 2022. ALBench: A Framework for Evaluating Active Learning in Object Detection. arXiv preprint arXiv:2207.13339 (2022).

[13]

Xinyu Gao, Yang Feng, Yining Yin, Zixi Liu, Zhenyu Chen, and Baowen Xu. 2022. Adaptive test selection for deep neural networks. In Proceedings of the 44th International Conference on Software Engineering. 73–85.

Digital Library

[14]

Yuan Gao, Jiayi Ma, and Alan L Yuille. 2017. Semi-supervised sparse representation based classification for face recognition with insufficient labeled samples. IEEE Transactions on Image Processing 26, 5 (2017), 2545–2560.

Digital Library

[15]

Elmar Haussmann, Michele Fenzi, Kashyap Chitta, Jan Ivanecky, Hanson Xu, Donna Roy, Akshita Mittel, Nicolas Koumchatzky, Clement Farabet, and Jose M Alvarez. 2020. Scalable active learning for object detection. In 2020 IEEE intelligent vehicles symposium (iv). IEEE, 1430–1435.

[16]

Andrew J. Hawkins. 2019. Tesla’s Autopilot was engaged when Model 3 crashed into truck, report states. https://www.theverge.com/2019/5/16/18627766/tesla-autopilot-fatal-crash-delray-florida-ntsb-model-3.

[17]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2015. Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE transactions on pattern analysis and machine intelligence 37, 9 (2015), 1904–1916.

Digital Library

[18]

Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.

[19]

Chieh-Chi Kao, Teng-Yok Lee, Pradeep Sen, and Ming-Yu Liu. 2018. Localization-aware active learning for object detection. In Asian Conference on Computer Vision. Springer, 506–522.

[20]

Jinhan Kim, Robert Feldt, and Shin Yoo. 2019. Guiding deep learning system testing using surprise adequacy. In Proceedings of the 41st International Conference on Software Engineering, ICSE 2019, Montreal, QC, Canada, May 25-31, 2019, Joanne M. Atlee, Tevfik Bultan, and Jon Whittle (Eds.). IEEE / ACM, 1039–1049. https://doi.org/10.1109/ICSE.2019.00108

Digital Library

[21]

Jinhan Kim, Jeongil Ju, Robert Feldt, and Shin Yoo. 2020. Reducing DNN labelling cost using surprise adequacy: an industrial case study for autonomous driving. In ESEC/FSE ’20: 28th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Virtual Event, USA, November 8-13, 2020, Prem Devanbu, Myra B. Cohen, and Thomas Zimmermann (Eds.). ACM, 1466–1476. https://doi.org/10.1145/3368089.3417065

Digital Library

[22]

Yann LeCun, Yoshua Bengio, and Geoffrey E. Hinton. 2015. Deep learning. Nature (2015).

[23]

Timothée Lesort, Massimo Caccia, and Irina Rish. 2021. Understanding continual learning settings with data distribution drift analysis. arXiv preprint arXiv:2104.01678 (2021).

[24]

Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, and Piotr Dollár. 2017. Focal loss for dense object detection. In Proceedings of the IEEE international conference on computer vision. 2980–2988.

[25]

Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, and Alexander C Berg. 2016. Ssd: Single shot multibox detector. In European conference on computer vision. Springer, 21–37.

[26]

Zixi Liu, Yang Feng, Yining Yin, and Zhenyu Chen. 2022. DeepState: Selecting Test Suites to Enhance the Robustness of Recurrent Neural Networks. In 44th IEEE/ACM 44th International Conference on Software Engineering, ICSE 2022, Pittsburgh, PA, USA, May 25-27, 2022. ACM, 598–609. https://doi.org/10.1145/3510003.3510231

Digital Library

[27]

Matt McFarland. 2020. Uber self-driving car operator charged in pedestrian death. https://www.cnn.com/2020/09/18/cars/uber-vasquez-charged/index.html.

[28]

Microsoft. 2017. MS COCO. https://cocodataset.org/.

[29]

Dim P. Papadopoulos, Jasper Uijlings, Frank Keller, and Vittorio Ferrari. 2016. We don’t need no bounding-boxes: Training object class detectors using only human verification. computer vision and pattern recognition (2016).

[30]

Joseph Redmon, Santosh Divvala, Ross Girshick, and Ali Farhadi. 2016. You only look once: Unified, real-time object detection. In Proceedings of the IEEE conference on computer vision and pattern recognition. 779–788.

[31]

Shaoqing Ren, Kaiming He, Ross Girshick, and Jian Sun. 2015. Faster r-cnn: Towards real-time object detection with region proposal networks. Advances in neural information processing systems 28 (2015).

[32]

Gregg Rothermel, Roland H. Untch, Chengyun Chu, and Mary Jean Harrold. 2001. Prioritizing test cases for regression testing. IEEE Transactions on software engineering 27, 10 (2001), 929–948.

Digital Library

[33]

Soumya Roy, Asim Unmesh, and Vinay P Namboodiri. 2018. Deep active learning for object detection. In BMVC. 91.

[34]

Pierre Sermanet, David Eigen, Xiang Zhang, Michaël Mathieu, Rob Fergus, and Yann LeCun. 2013. Overfeat: Integrated recognition, localization and detection using convolutional networks. arXiv preprint arXiv:1312.6229 (2013).

[35]

Karen Simonyan and Andrew Zisserman. 2014. Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014).

[36]

Hao Su, Jia Deng, and Li Fei-Fei. 2012. Crowdsourcing annotations for visual object detection. In Workshops at the Twenty-Sixth AAAI Conference on Artificial Intelligence.

[37]

Manika Tyagi and Sona Malhotra. 2014. Test case prioritization using multi objective particle swarm optimizer. In 2014 International Conference on Signal Propagation and Computer Technology (ICSPCT 2014). IEEE, 390–395.

[38]

Zan Wang, Hanmo You, Junjie Chen, Yingyi Zhang, Xuyuan Dong, and Wenbin Zhang. 2021. Prioritizing Test Inputs for Deep Neural Networks via Mutation Analysis. In 43rd IEEE/ACM International Conference on Software Engineering, ICSE 2021, Madrid, Spain, 22-30 May 2021. IEEE, 397–409. https://doi.org/10.1109/ICSE43902.2021.00046

Digital Library

[39]

Jiaxi Wu, Jiaxin Chen, and Di Huang. 2022. Entropy-based Active Learning for Object Detection with Progressive Diversity Constraint. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 9397–9406.

[40]

Weiping Yu, Sijie Zhu, Taojiannan Yang, and Chen Chen. 2022. Consistency-based active learning for object detection. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3951–3960.

[41]

Ekim Yurtsever, Jacob Lambert, Alexander Carballo, and Kazuya Takeda. 2020. A Survey of Autonomous Driving: Common Practices and Emerging Technologies. IEEE Access 8 (2020), 58443–58469. https://doi.org/10.1109/ACCESS.2020.2983149

[42]

Jie M. Zhang, Mark Harman, Lei Ma, and Yang Liu. 2022. Machine Learning Testing: Survey, Landscapes and Horizons. IEEE Trans. Software Eng. 48, 2 (2022), 1–36. https://doi.org/10.1109/TSE.2019.2962027

Digital Library

[43]

W. Zhao, R. Chellappa, P. J. Phillips, and A. Rosenfeld. 2003. Face Recognition: A Literature Survey. ACM Comput. Surv. 35, 4 (dec 2003), 399–458. https://doi.org/10.1145/954339.954342

Digital Library

[44]

Zhengxia Zou, Zhenwei Shi, Yuhong Guo, and Jieping Ye. 2019. Object Detection in 20 Years: A Survey. arXiv: Computer Vision and Pattern Recognition (2019).

Cited By

Wang SLi DLi HZhao MWong W(2024)A Survey on Test Input Selection and Prioritization for Deep Neural Networks2024 10th International Symposium on System Security, Safety, and Reliability (ISSSR)10.1109/ISSSR61934.2024.00035(232-243)Online publication date: 16-Mar-2024
https://doi.org/10.1109/ISSSR61934.2024.00035
Weng SFeng YYin YDai YLiu JZhao Z(2024)Seeing the invisible: test prioritization for object detection systemEmpirical Software Engineering10.1007/s10664-024-10539-429:6Online publication date: 23-Sep-2024
https://doi.org/10.1007/s10664-024-10539-4

Index Terms

Prioritizing Testing Instances to Enhance the Robustness of Object Detection Systems
1. Software and its engineering
  1. Software creation and management
    1. Software verification and validation
      1. Software defect analysis
        Software testing and debugging

Recommendations

Seeing the invisible: test prioritization for object detection system
Abstract
Object detection models have been deployed in various safety-critical software systems. However, an inadequately tested object detection system may exhibit aberrant behavior in applications, potentially leading to immeasurable losses to users. The ...
Faster mutation testing inspired by test prioritization and reduction
ISSTA 2013: Proceedings of the 2013 International Symposium on Software Testing and Analysis

Mutation testing is a well-known but costly approach for determining test adequacy. The central idea behind the approach is to generate mutants, which are small syntactic transformations of the program under test, and then to measure for a given test ...
Prioritizing State-Based Aspect Tests
ICST '10: Proceedings of the 2010 Third International Conference on Software Testing, Verification and Validation

In aspect-oriented programming, aspects are essentially incremental modifications to their base classes. Therefore aspect-oriented programs can be tested in an incremental fashion – we can first test the base classes and then test the base classes and ...

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences

Internetware '23: Proceedings of the 14th Asia-Pacific Symposium on Internetware

August 2023

332 pages

ISBN:9798400708947

DOI:10.1145/3609437

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 05 October 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article
Research
Refereed limited

Conference

Internetware 2023

Internetware 2023: 14th Asia-Pacific Symposium on Internetware

August 4 - 6, 2023

Hangzhou, China

Acceptance Rates

Overall Acceptance Rate 55 of 111 submissions, 50%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

2
Total Citations
View Citations
116
Total Downloads

Downloads (Last 12 months)82
Downloads (Last 6 weeks)11

Reflects downloads up to 20 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Wang SLi DLi HZhao MWong W(2024)A Survey on Test Input Selection and Prioritization for Deep Neural Networks2024 10th International Symposium on System Security, Safety, and Reliability (ISSSR)10.1109/ISSSR61934.2024.00035(232-243)Online publication date: 16-Mar-2024
https://doi.org/10.1109/ISSSR61934.2024.00035
Weng SFeng YYin YDai YLiu JZhao Z(2024)Seeing the invisible: test prioritization for object detection systemEmpirical Software Engineering10.1007/s10664-024-10539-429:6Online publication date: 23-Sep-2024
https://doi.org/10.1007/s10664-024-10539-4

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

HTML Format

View this article in HTML Format.

Figures

Tables

Media

View Table of Conten