PAAM (Parameter-free Attentional Aggregation Model)

Qi, Xuan-Hao; Zhi, Min; Mi, Zeng; Hu, Wei; Yin, Yan-Jun; Zhang, Yue-Ning; Duan, Wen-Tao; Lian, Zhe

doi:10.1007/978-981-97-5600-1_12

Xuan-Hao Qi¹⁰,
Min Zhi¹⁰,
Zeng Mi¹⁰,
Wei Hu¹⁰,
Yan-Jun Yin¹⁰,
Yue-Ning Zhang¹⁰,
Wen-Tao Duan¹⁰ &
…
Zhe Lian¹⁰

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 14868))

Included in the following conference series:

International Conference on Intelligent Computing

432 Accesses

Abstract

The channel attention mechanism and spatial attention mechanism are crucial in enhancing the performance of convolutional neural networks. However, most existing methods focus on developing more intricate attention modules to improve performance, which inevitably increases the number of model parameters. To address the trade-off between performance and parameter count, this paper introduces an efficient Parameter-free Attention Aggregation Model (PAAM) plug-and-play module. The module first creates a Local Feature Enhancement Module (LFEM) using adaptive pooling. Firstly, the local feature enhancement module (LFEM) is constructed through adaptive pooling to enhance the expression of local features; secondly, the local-global feature interaction module (L-GFIM) is used to realize the mutual compensation between local and global features, which effectively extends the coverage of local-global interaction. The experimental results indicate that PAAM outperforms the SOTA model in ImageNet-1K, Cifar-10, and Cifar-100 image classification datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 64.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

AFS: Attention Using First and Second Order Information to Enrich Features

SPEM: Self-adaptive Pooling Enhanced Attention Module for Image Recognition

TripleFormer: improving transformer-based image classification method using multiple self-attention inputs

Article 01 March 2024

References

Chen, Y.: Convolutional neural network for sentence classification. Master’s thesis, University of Waterloo (2015)
Google Scholar
Krizhevsky, A., Sutskever, I., Hinton, G.E.: ImageNet classification with deep convolutional neural networks. Commun. ACM 60(6), 84–90 (2017)
Article Google Scholar
Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
Goodfellow, I., et al.: Generative adversarial nets. Adv. Neural Inf. Process. Syst. 27 (2014)
Google Scholar
He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)
Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-net: Convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Chapter Google Scholar
Howard, A.G., et al.: MobileNets: Efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017)
Zhang, X., Zhou, X., Lin, M., Sun, J.: ShuffleNet: an extremely efficient convolutional neural network for mobile devices. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 6848–6856 (2018)
Google Scholar
Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708 (2017)
Google Scholar
Yin, Y., et al.: Artificial neural networks for finger vein recognition: a survey. arXiv preprint arXiv:2208.13341 (2022)
Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
Google Scholar
Xie, S., Girshick, R., Doll´ar, P., Tu, Z., He, K.: Aggregated residual transformations for deep neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1492–1500 (2017)
Google Scholar
Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., Hu, Q.: ECA-Net: efficient channel attention for deep convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11534–11542 (2020)
Google Scholar
Yang, Z., Zhu, L., Wu, Y., Yang, Y.: Gated channel transformation for visual recognition. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 11794–11803 (2020)
Google Scholar
Li, X., Wang, W., Hu, X., Yang, J.: Selective kernel networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 510–519 (2019)
Google Scholar
Wu, T., Tang, S., Zhang, R., Cao, J., Zhang, Y.: CGNet: a light-weight context guided network for semantic segmentation. IEEE Trans. Image Process. 30, 1169–1179 (2020)
Article Google Scholar
Woo, S., Park, J., Lee, J.Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 3–19 (2018)
Google Scholar
Park, J., Woo, S., Lee, J.Y., Kweon, I.S.: BAM: Bottleneck attention module. arXiv preprint arXiv:1807.06514 (2018)
Zhao, H., et al.: PSANet: Pointwise spatial attention network for scene parsing. In: Proceedings of the European Conference on Computer Vision (ECCV), pp. 267–283 (2018)
Google Scholar
Xu, K., Wang, Z., Shi, J., Li, H., Zhang, Q.C.: A2-net: Molecular structure estimation from cryo-EM density volumes. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 33, pp. 1230–1237 (2019)
Google Scholar

Download references

Acknowledgments

This work was supported by Natural Science Foundation Program of Inner Mongolia (No. 2023MS06009).

Author information

Authors and Affiliations

College of Computer Science and Technology, Inner Mongolia Normal University, Hohhot, 010022, China
Xuan-Hao Qi, Min Zhi, Zeng Mi, Wei Hu, Yan-Jun Yin, Yue-Ning Zhang, Wen-Tao Duan & Zhe Lian

Authors

Xuan-Hao Qi
View author publications
You can also search for this author in PubMed Google Scholar
Min Zhi
View author publications
You can also search for this author in PubMed Google Scholar
Zeng Mi
View author publications
You can also search for this author in PubMed Google Scholar
Wei Hu
View author publications
You can also search for this author in PubMed Google Scholar
Yan-Jun Yin
View author publications
You can also search for this author in PubMed Google Scholar
Yue-Ning Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Wen-Tao Duan
View author publications
You can also search for this author in PubMed Google Scholar
Zhe Lian
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Min Zhi .

Editor information

Editors and Affiliations

Eastern Institute of Technology, Ningbo, China
De-Shuang Huang
Tianjin University of Science and Technology, Tianjin, China
Chuanlei Zhang
Eastern Institute of Technology, Ningbo, China
Qinhu Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Qi, XH. et al. (2024). PAAM (Parameter-free Attentional Aggregation Model). In: Huang, DS., Zhang, C., Zhang, Q. (eds) Advanced Intelligent Computing Technology and Applications. ICIC 2024. Lecture Notes in Computer Science, vol 14868. Springer, Singapore. https://doi.org/10.1007/978-981-97-5600-1_12

Download citation

DOI: https://doi.org/10.1007/978-981-97-5600-1_12
Published: 30 July 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-5599-8
Online ISBN: 978-981-97-5600-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

PAAM (Parameter-free Attentional Aggregation Model)

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

AFS: Attention Using First and Second Order Information to Enrich Features

SPEM: Self-adaptive Pooling Enhanced Attention Module for Image Recognition

TripleFormer: improving transformer-based image classification method using multiple self-attention inputs

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

PAAM (Parameter-free Attentional Aggregation Model)

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

AFS: Attention Using First and Second Order Information to Enrich Features

SPEM: Self-adaptive Pooling Enhanced Attention Module for Image Recognition

TripleFormer: improving transformer-based image classification method using multiple self-attention inputs

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation