Applications of Quantum Embedding in Computer Vision

Zhang, Juntao; Zhou, Jun; Wang, Hailong; Lei, Yang; Cheng, Peng; Li, Zehan; Wu, Hao; Yu, Kun; An, Wenbo

doi:10.1007/978-981-99-8145-8_14

Juntao Zhang¹⁰ (ORCID: orcid.org/0000-0001-8174-5378),
Jun Zhou¹⁰,
Hailong Wang¹⁰,
Yang Lei¹⁰,
Peng Cheng¹¹,
Zehan Li¹²,
Hao Wu¹⁰,
Kun Yu¹⁰,¹² &
Wenbo An¹⁰

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1965)

Included in the following conference series:

International Conference on Neural Information Processing

Abstract

Deep Neural Networks (DNNs) are fundamental to many vision tasks, including large-scale visual recognition. Because the primary goal of DNNs is to characterize the complex boundaries between thousands of classes in a high-dimensional space, learning higher-order representations is critical for enhancing their nonlinear modeling capability. Recently, a method called Quantum-State-based Mapping (QSM) was proposed to improve the feature calibration ability of existing attention modules in transfer learning tasks. QSM uses the wave function, which describes the state of microscopic particles, to map a feature vector into probability space. In essence, QSM introduces a novel higher-order representation that improves the nonlinear capability of the network. In this paper, we extend QSM to Quantum Embedding (QE) and use it to design new attention modules and Self-Organizing Maps, a class of unsupervised learning methods. We also conduct experiments to validate the effectiveness of QE.
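
The abstract does not give the QSM or QE formula, so the following is only a minimal sketch of the general idea of mapping a feature vector into probability space. It assumes, purely for illustration, that each feature is read as an unnormalized wave-function amplitude and converted to a probability with the Born rule (squared magnitude divided by the total squared norm). The function name `born_rule_probabilities` is hypothetical, and the mapping used in the chapter may differ.

```python
from typing import List, Sequence


def born_rule_probabilities(features: Sequence[float]) -> List[float]:
    """Map a real feature vector to a probability vector.

    ASSUMPTION (not from the chapter): each feature is treated as an
    unnormalized amplitude, and its probability is the squared magnitude
    divided by the total squared norm (the Born rule).
    """
    if len(features) == 0:
        raise ValueError("features must not be empty")

    # Total squared norm of the amplitude vector.
    norm_sq = sum(x * x for x in features)
    if norm_sq == 0.0:
        # A zero vector carries no preference; fall back to a uniform distribution.
        return [1.0 / len(features)] * len(features)

    # Squared amplitudes are second-order terms of the input features.
    return [x * x / norm_sq for x in features]


if __name__ == "__main__":
    probs = born_rule_probabilities([3.0, -4.0, 0.0])
    print(probs, sum(probs))
```

Under this assumption the output is non-negative and sums to one, and each entry is a quadratic function of the input. That is one simple way a probability mapping can introduce a higher-order term, but it should not be read as the authors' method.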

J. Zhou—Co-first author.

Author information

Authors and Affiliations

Institute of System Engineering, AMS, Beijing, China
Juntao Zhang, Jun Zhou, Hailong Wang, Yang Lei, Hao Wu, Kun Yu & Wenbo An
Coolanyp L.L.C., Wuxi, China
Peng Cheng
University of Electronic Science and Technology of China, Chengdu, China
Zehan Li & Kun Yu

Corresponding author

Correspondence to Yang Lei.

Editor information

Editors and Affiliations

School of Automation, Central South University, Changsha, China
Biao Luo
Institute of Automation, Chinese Academy of Sciences, Beijing, China
Long Cheng
Institute of Cyber-Systems and Control, Zhejiang University, Hangzhou, China
Zheng-Guang Wu
School of Automation, Guangdong University of Technology, Guangzhou, China
Hongyi Li
School of Electrical Engineering and Telecommunications, UNSW Sydney, Sydney, NSW, Australia
Chaojie Li

About this paper

Cite this paper

Zhang, J. et al. (2024). Applications of Quantum Embedding in Computer Vision. In: Luo, B., Cheng, L., Wu, ZG., Li, H., Li, C. (eds) Neural Information Processing. ICONIP 2023. Communications in Computer and Information Science, vol 1965. Springer, Singapore. https://doi.org/10.1007/978-981-99-8145-8_14

DOI: https://doi.org/10.1007/978-981-99-8145-8_14
Published: 27 November 2023
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8144-1
Online ISBN: 978-981-99-8145-8
eBook Packages: Computer Science, Computer Science (R0)
