Abstract
Adversarial attacks have emerged in visual object tracking as a means of misleading a tracker and causing it to fail. Black-box attacks in particular have attracted increasing attention owing to their relevance to real-world applications. In the paradigm of decision-based black-box attacks, the perturbation magnitude is gradually amplified, while the optimisation direction is defined by an initial adversarial sample. Given the pivotal role the initial adversarial sample plays in determining the success of an attack, we utilise the noise generated by the reverse process of a diffusion model to provide a stronger attack direction. On the one hand, the diffusion model produces Gaussian noise that induces global information interaction, exerting a comprehensive impact on Transformer-based trackers. On the other hand, the diffusion model attends more closely to the target region during the reverse process, yielding a more powerful perturbation of the target object. Our method is widely applicable and has been validated on a range of trackers using several benchmark datasets, where it delivers greater tracking performance degradation than other state-of-the-art methods. We also investigate alternative ways of generating the initial adversarial sample, confirming the effectiveness and rationale of the proposed diffusion initialisation.
This work is supported in part by the National Natural Science Foundation of China (Grant Nos. 62106089 and 62020106012).
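The following is a minimal, illustrative sketch of the general idea described in the abstract, not the authors' implementation: a clean search-region image is forward-diffused with Gaussian noise and then denoised for one reverse step by a pretrained noise-prediction network, and the resulting estimate is used as the initial adversarial sample for a decision-based attack. The function and parameter names (diffusion_init, denoiser, alpha_bar) are hypothetical and assume standard DDPM/DDIM notation.

```python
import torch

def diffusion_init(x, denoiser, t, alpha_bar):
    """Sketch of a diffusion-based initialisation for a decision-based attack.

    x         : clean search-region image in [0, 1], shape (1, 3, H, W)
    denoiser  : assumed pretrained noise-prediction network eps_theta(x_t, t)
    t         : diffusion timestep (int)
    alpha_bar : cumulative noise schedule, tensor of shape (T,)
    """
    a_bar = alpha_bar[t]
    # Gaussian noise provides global information interaction across the image.
    noise = torch.randn_like(x)
    # Forward-diffuse the clean image to timestep t (standard DDPM forward process).
    x_t = a_bar.sqrt() * x + (1 - a_bar).sqrt() * noise
    # One reverse step: the predicted noise tends to concentrate on the target region.
    eps_hat = denoiser(x_t, torch.tensor([t]))
    # DDIM-style estimate of x0 from x_t and the predicted noise.
    x0_hat = (x_t - (1 - a_bar).sqrt() * eps_hat) / a_bar.sqrt()
    return x0_hat.clamp(0, 1)

# A decision-based attack (e.g. a HopSkipJump- or IoU-style search) would then start
# from diffusion_init(x, ...) rather than from a uniformly random adversarial example.
```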
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
Cite this paper
Wang, R., Xu, T., Zhao, S., Wu, XJ., Kittler, J. (2023). Diffusion Init: Stronger Initialisation of Decision-Based Black-Box Attacks for Visual Object Tracking. In: Lu, H., Blumenstein, M., Cho, SB., Liu, CL., Yagi, Y., Kamiya, T. (eds) Pattern Recognition. ACPR 2023. Lecture Notes in Computer Science, vol 14407. Springer, Cham. https://doi.org/10.1007/978-3-031-47637-2_28