A maximum-entropy-attention-based convolutional neural network for image perception

Chen, Qili; Zhang, Ancai; Pan, Guangyuan

doi:10.1007/s00521-022-07564-z

A maximum-entropy-attention-based convolutional neural network for image perception

S.I.: AI based Techniques and Applications for Intelligent IoT Systems
Published: 23 July 2022

Volume 35, pages 8647–8655, (2023)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

420 Accesses
Explore all metrics

Abstract

In recent years, image perception such as enhancement, classification and object detection with deep learning has achieved significant successes. However, in real world under extreme conditions, the training of a deep learning model often yields low accuracy, low efficiency in feature extraction and generalizability, due to the inner uncourteous and uninterpretable characteristics. In this paper, a maximal-entropy-attention-based convolutional neural network (MEA-CNN) framework is proposed. A maximum entropy algorithm is first used for image feature pre-extraction. An attention mechanism is then proposed by combining the extracted features on original images. By applying the mechanism, the key areas of an image are enhanced, and noised area can be ignored. Afterward, the processed images are transferred into region convolutional neural network, which is a well-known pre-trained CNN model, for further feature learning and extraction. Finally, two real-world experiments on traffic sign recognition and road surface condition monitoring are designed. The results show that the proposed framework has high testing accuracy, with improvements of 17% and 2.9%, compared with some other existing methods. In addition, the features extracted by the model are more easily interpretable.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

AFS: Attention Using First and Second Order Information to Enrich Features

An Efficient Deep Convolutional Neural Network for Visual Image Classification

A feature-wise attention module based on the difference with surrounding features for convolutional neural networks

Article 21 January 2023

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Data availability

Data available on request from the authors.

References

Gu K, Zhang Y, Qiao J (2021) Ensemble meta-learning for few-shot soot density recognition. IEEE Trans Industr Inf 17(3):2261–2270
Article Google Scholar
Zhu M, Ge D (2020) Image quality assessment based on deep learning with FPGA implementation. Signal Process: Image Commun 1(83):115780
Google Scholar
Han G, Cheng Q, Sun X, Li L, Di W (2019) A biological mechanism based structure self-adaptive algorithm for feedforward neural network and its engineering applications. IEEE Access 7:25111–25122
Article Google Scholar
Han H, Liu H, Li J, Qiao J (2021) Cooperative fuzzy-neural control for wastewater treatment process. IEEE Trans Industr Inf 17(9):5971–5981
Article Google Scholar
Han G, Li L, Di W, Sun X, Bu T, Lin T (2020) Multiscale convolutional generative adversarial network for anchorage grout defect detection. IEEE Trans Instrum Meas 70:1–10
Google Scholar
Yang L, Wang L, Su Y, Gao Y (2021) Bag of shape descriptor using unsupervised deep learning for non-rigid shape recognition. Signal Process: Image Commun 1(96):116297
Google Scholar
Yin P, Yuan R, Cheng Y, Wu Q (2020) Deep guidance network for biomedical image segmentation. IEEE Access 8:116106–116116
Article Google Scholar
Wong A, Famuori M, Shafiee MJ, Li F, Chwyl B, Chung J (2019) YOLO Nano: a highly compact you only look once convolutional neural network for object detection. Fifth Workshop on Energy Efficient Machine Learning and Cognitive Computing - NeurIPS Edition, 22–25.
Oktay O, Ferrante E, Kamnitsas K (2018) Anatomically constrained neural networks (ACNNs): application to cardiac image enhancement and segmentation. IEEE Trans Med Imaging 37(2):384–395
Article Google Scholar
Pan G, Fu L, Thakali L (2017) Development of a global road safety performance function using deep neural networks. Int J Transp Sci Technol 6(3):159–173
Article Google Scholar
Samek W, Binder A, Montavon G, Lapuschkin S, Müller KR (2017) Evaluating the visualization of what a deep neural network has learned. IEEE Trans Neural Netw Learn Syst 28(11):2660–2673
Article MathSciNet Google Scholar
Pan G, Fu L, Chen Q, Yu M, Muresan M (2020) Road safety performance function analysis with visual feature importance of deep neural nets. IEEE/CAA J Automatica Sinica 7(3):735–744
Article Google Scholar
Chen Q, Pan G, Chen W, Wu P (2021) A novel explainable deep belief network framework and its application for feature importance analysis. IEEE Sens J 21:25001–25009
Article Google Scholar
Gu K, Tao D, Qiao J, Lin W (2018) Learning a no-reference quality assessment model of enhanced images with big data. IEEE Trans Neural Netw Learn Syst 29(4):1301–1313
Article Google Scholar
Gu K, Zhang Y, Qiao J (2020) Vision-based monitoring of flare soot. IEEE Trans Instrum Meas 69(9):7136–7145
Article Google Scholar
Liu H, Chu W, Wang H (2020) Automatic segmentation algorithm of ultrasound heart image based on convolutional neural network and image saliency. IEEE Access 8:104445–104457
Article Google Scholar
Chen W, Gu K, Zhao T, Jiang G, Callet PL (2021) Semi-reference sonar image quality assessment based on task and visual perception. IEEE Trans Multimedia 23:1008–1020
Article Google Scholar
Zhu X, Zhang X, Zhang T, Zhu P, Tang X, Li C (2020) Discriminative feature pyramid network for object detection in remote sensing images. International Joint Conference on Neural Networks (IJCNN), 1–7.
Shi X, Qiu G, Yin C, Huang X, Chen K, Cheng Y, Zhong S (2021) An improved bearing fault diagnosis scheme based on hierarchical fuzzy entropy and Alexnet network. IEEE Access 9:61710–61720
Article Google Scholar
Avula SB, Badri SJ, Reddy G (2020) A novel forest fire detection system using fuzzy entropy optimized thresholding and STN-based CNN. IEEE International Conference on Communication Systems & Networks, 750–755.
Tian Y, Pan G (2020) An unsupervised regularization and dropout based deep neural network and its application for thermal error prediction. Appl Sci 10(8):2870
Article Google Scholar
Chen Q, Pan G (2021) A structure-self-organizing DBN for image recognition. Neural Comput Appl 33(7553):877–886
Article Google Scholar
Pan G, Fu L, Yu R, Muresan M, Evaluation of alternative pre-trained convolutional neural networks for winter road surface condition monitoring. IEEE International Conference on Transportation Information and Safety, (2019), 614–620.
Gaus YFA, Bhowmik N, Akçay S, Guillén-Garcia PM, Barker JW, Breckon TP (2019) Evaluation of a dual convolutional neural network architecture for object-wise anomaly detection in cluttered X-ray security imagery. International Joint Conference on Neural Networks (IJCNN), 1–8.
Nie D, Wang L, Adeli E, Lao C, Lin W, Shen D (2018) 3-D fully convolutional networks for multimodal isointense infant brain image segmentation. IEEE Trans Cybernetics 49(3):1123–1136
Article Google Scholar
Zhang M, Jiao L, Shang R, Zhang X, Li L (2020) Unsupervised EA-based fuzzy clustering for image segmentation. IEEE Access 8:8627–8647. https://doi.org/10.1109/ACCESS.2019.2963363
Article Google Scholar
Lei T, Liu P, Jia X, Zhang X, Meng H, Nandi AK (2020) Automatic fuzzy clustering framework for image segmentation. IEEE Trans Fuzzy Syst 28(9):2078–2092
Article Google Scholar
Bazaluk O, Kotenko S, Nitsenko V (2021) Entropy as an objective function of optimization multimodal transportations. Entropy 23:946
Article MathSciNet Google Scholar
Li L, He H, Li J (2020) Entropy-based sampling approaches for multi-class imbalanced problems. IEEE Trans Knowl Data Eng 32(11):2159–2170
Article Google Scholar
Hussain L, Aziz W, Alshdadi AA, Ahmed Nadeem MS, Khan IR, Chaudhry Q (2019) Analyzing the dynamics of lung cancer imaging data using refined fuzzy entropy methods by extracting different features. IEEE Access 7:64704–64721
Article Google Scholar
Chakraborty DB, Pal SK (2018) Neighborhood rough filter and intuitionistic entropy in unsupervised tracking. IEEE Trans Fuzzy Syst 26(4):2188–2200
Article Google Scholar
Alwerfali HN, Al-qaness MAA, Elaziz MAbd, Ewees A, Oliva D, Songfeng L (2020) Multi-level image thresholding based on modified spherical search optimizer and fuzzy entropy. Entropy 22(3):328. https://doi.org/10.3390/e22030328
Article MathSciNet Google Scholar
Gu K, Zhang Y, Qiao J (2020) Random forest ensemble for river turbidity measurement from space remote sensing data. IEEE Trans Instrum Meas 69(11):9028–9036
Article Google Scholar
Ye M, Yan X, Jia M (2021) Rolling bearing fault diagnosis based on VMD-MPE and PSO-SVM. Entropy 23:762
Article Google Scholar
Jalal A, Ahmed A, Rafique AA, Kim K (2021) Scene semantic recognition based on modified fuzzy c-mean and maximum entropy using object-to-object relations. IEEE Access 9:27758–27772
Article Google Scholar
Bera A, Wharton Z, Liu Y, Bessis N, Behera A (2021) Attend and guide (AG-Net): A keypoints-driven attention-based deep network for image recognition. IEEE Trans Image Process 30:3691–3704. https://doi.org/10.1109/TIP.2021.3064256
Article Google Scholar
Gu R et al (2021) CA-Net: comprehensive attention convolutional neural networks for explainable medical image segmentation. IEEE Trans Med Imaging 40(2):699–711. https://doi.org/10.1109/TMI.2020.3035253
Article MathSciNet Google Scholar
World Health Organization (WHO), Global status report on road safety, WHO Press, World Health Organization, Geneva, Switzerland, 2018.

Download references

Acknowledgements

This research was funded by the National Natural Science Foundation of China under Grant no. 62103056 and 62103177, the Development Plan of Youth Innovation Team of the University in Shandong Province under Grant no. 2019KJN007, and the Natural Science Foundation Program of Shandong Province under Grant no. ZR2019YQ28.

Author information

Authors and Affiliations

School of Automation, Beijing Information Science and Technology University, Beijing, 100192, China
Qili Chen
School of Automation and Electrical Engineering, Linyi University, Linyi, 276000, China
Ancai Zhang & Guangyuan Pan

Authors

Qili Chen
View author publications
You can also search for this author inPubMed Google Scholar
Ancai Zhang
View author publications
You can also search for this author inPubMed Google Scholar
Guangyuan Pan
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to Guangyuan Pan.

Ethics declarations

Conflicts of interest

All the authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, Q., Zhang, A. & Pan, G. A maximum-entropy-attention-based convolutional neural network for image perception. Neural Comput & Applic 35, 8647–8655 (2023). https://doi.org/10.1007/s00521-022-07564-z

Download citation

Received: 29 March 2022
Accepted: 27 June 2022
Published: 23 July 2022
Issue Date: April 2023
DOI: https://doi.org/10.1007/s00521-022-07564-z

Keywords

Part of a collection:

Special Issue on Artificial Intelligence based Techniques and Applications for Intelligent IoT Systems (AI-TAIoT)

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A maximum-entropy-attention-based convolutional neural network for image perception

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

AFS: Attention Using First and Second Order Information to Enrich Features

An Efficient Deep Convolutional Neural Network for Visual Image Classification

A feature-wise attention module based on the difference with surrounding features for convolutional neural networks

Explore related subjects

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflicts of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now