Abstract
In recent years, image perception such as enhancement, classification and object detection with deep learning has achieved significant successes. However, in real world under extreme conditions, the training of a deep learning model often yields low accuracy, low efficiency in feature extraction and generalizability, due to the inner uncourteous and uninterpretable characteristics. In this paper, a maximal-entropy-attention-based convolutional neural network (MEA-CNN) framework is proposed. A maximum entropy algorithm is first used for image feature pre-extraction. An attention mechanism is then proposed by combining the extracted features on original images. By applying the mechanism, the key areas of an image are enhanced, and noised area can be ignored. Afterward, the processed images are transferred into region convolutional neural network, which is a well-known pre-trained CNN model, for further feature learning and extraction. Finally, two real-world experiments on traffic sign recognition and road surface condition monitoring are designed. The results show that the proposed framework has high testing accuracy, with improvements of 17% and 2.9%, compared with some other existing methods. In addition, the features extracted by the model are more easily interpretable.



Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Data availability
Data available on request from the authors.
References
Gu K, Zhang Y, Qiao J (2021) Ensemble meta-learning for few-shot soot density recognition. IEEE Trans Industr Inf 17(3):2261–2270
Zhu M, Ge D (2020) Image quality assessment based on deep learning with FPGA implementation. Signal Process: Image Commun 1(83):115780
Han G, Cheng Q, Sun X, Li L, Di W (2019) A biological mechanism based structure self-adaptive algorithm for feedforward neural network and its engineering applications. IEEE Access 7:25111–25122
Han H, Liu H, Li J, Qiao J (2021) Cooperative fuzzy-neural control for wastewater treatment process. IEEE Trans Industr Inf 17(9):5971–5981
Han G, Li L, Di W, Sun X, Bu T, Lin T (2020) Multiscale convolutional generative adversarial network for anchorage grout defect detection. IEEE Trans Instrum Meas 70:1–10
Yang L, Wang L, Su Y, Gao Y (2021) Bag of shape descriptor using unsupervised deep learning for non-rigid shape recognition. Signal Process: Image Commun 1(96):116297
Yin P, Yuan R, Cheng Y, Wu Q (2020) Deep guidance network for biomedical image segmentation. IEEE Access 8:116106–116116
Wong A, Famuori M, Shafiee MJ, Li F, Chwyl B, Chung J (2019) YOLO Nano: a highly compact you only look once convolutional neural network for object detection. Fifth Workshop on Energy Efficient Machine Learning and Cognitive Computing - NeurIPS Edition, 22–25.
Oktay O, Ferrante E, Kamnitsas K (2018) Anatomically constrained neural networks (ACNNs): application to cardiac image enhancement and segmentation. IEEE Trans Med Imaging 37(2):384–395
Pan G, Fu L, Thakali L (2017) Development of a global road safety performance function using deep neural networks. Int J Transp Sci Technol 6(3):159–173
Samek W, Binder A, Montavon G, Lapuschkin S, Müller KR (2017) Evaluating the visualization of what a deep neural network has learned. IEEE Trans Neural Netw Learn Syst 28(11):2660–2673
Pan G, Fu L, Chen Q, Yu M, Muresan M (2020) Road safety performance function analysis with visual feature importance of deep neural nets. IEEE/CAA J Automatica Sinica 7(3):735–744
Chen Q, Pan G, Chen W, Wu P (2021) A novel explainable deep belief network framework and its application for feature importance analysis. IEEE Sens J 21:25001–25009
Gu K, Tao D, Qiao J, Lin W (2018) Learning a no-reference quality assessment model of enhanced images with big data. IEEE Trans Neural Netw Learn Syst 29(4):1301–1313
Gu K, Zhang Y, Qiao J (2020) Vision-based monitoring of flare soot. IEEE Trans Instrum Meas 69(9):7136–7145
Liu H, Chu W, Wang H (2020) Automatic segmentation algorithm of ultrasound heart image based on convolutional neural network and image saliency. IEEE Access 8:104445–104457
Chen W, Gu K, Zhao T, Jiang G, Callet PL (2021) Semi-reference sonar image quality assessment based on task and visual perception. IEEE Trans Multimedia 23:1008–1020
Zhu X, Zhang X, Zhang T, Zhu P, Tang X, Li C (2020) Discriminative feature pyramid network for object detection in remote sensing images. International Joint Conference on Neural Networks (IJCNN), 1–7.
Shi X, Qiu G, Yin C, Huang X, Chen K, Cheng Y, Zhong S (2021) An improved bearing fault diagnosis scheme based on hierarchical fuzzy entropy and Alexnet network. IEEE Access 9:61710–61720
Avula SB, Badri SJ, Reddy G (2020) A novel forest fire detection system using fuzzy entropy optimized thresholding and STN-based CNN. IEEE International Conference on Communication Systems & Networks, 750–755.
Tian Y, Pan G (2020) An unsupervised regularization and dropout based deep neural network and its application for thermal error prediction. Appl Sci 10(8):2870
Chen Q, Pan G (2021) A structure-self-organizing DBN for image recognition. Neural Comput Appl 33(7553):877–886
Pan G, Fu L, Yu R, Muresan M, Evaluation of alternative pre-trained convolutional neural networks for winter road surface condition monitoring. IEEE International Conference on Transportation Information and Safety, (2019), 614–620.
Gaus YFA, Bhowmik N, Akçay S, Guillén-Garcia PM, Barker JW, Breckon TP (2019) Evaluation of a dual convolutional neural network architecture for object-wise anomaly detection in cluttered X-ray security imagery. International Joint Conference on Neural Networks (IJCNN), 1–8.
Nie D, Wang L, Adeli E, Lao C, Lin W, Shen D (2018) 3-D fully convolutional networks for multimodal isointense infant brain image segmentation. IEEE Trans Cybernetics 49(3):1123–1136
Zhang M, Jiao L, Shang R, Zhang X, Li L (2020) Unsupervised EA-based fuzzy clustering for image segmentation. IEEE Access 8:8627–8647. https://doi.org/10.1109/ACCESS.2019.2963363
Lei T, Liu P, Jia X, Zhang X, Meng H, Nandi AK (2020) Automatic fuzzy clustering framework for image segmentation. IEEE Trans Fuzzy Syst 28(9):2078–2092
Bazaluk O, Kotenko S, Nitsenko V (2021) Entropy as an objective function of optimization multimodal transportations. Entropy 23:946
Li L, He H, Li J (2020) Entropy-based sampling approaches for multi-class imbalanced problems. IEEE Trans Knowl Data Eng 32(11):2159–2170
Hussain L, Aziz W, Alshdadi AA, Ahmed Nadeem MS, Khan IR, Chaudhry Q (2019) Analyzing the dynamics of lung cancer imaging data using refined fuzzy entropy methods by extracting different features. IEEE Access 7:64704–64721
Chakraborty DB, Pal SK (2018) Neighborhood rough filter and intuitionistic entropy in unsupervised tracking. IEEE Trans Fuzzy Syst 26(4):2188–2200
Alwerfali HN, Al-qaness MAA, Elaziz MAbd, Ewees A, Oliva D, Songfeng L (2020) Multi-level image thresholding based on modified spherical search optimizer and fuzzy entropy. Entropy 22(3):328. https://doi.org/10.3390/e22030328
Gu K, Zhang Y, Qiao J (2020) Random forest ensemble for river turbidity measurement from space remote sensing data. IEEE Trans Instrum Meas 69(11):9028–9036
Ye M, Yan X, Jia M (2021) Rolling bearing fault diagnosis based on VMD-MPE and PSO-SVM. Entropy 23:762
Jalal A, Ahmed A, Rafique AA, Kim K (2021) Scene semantic recognition based on modified fuzzy c-mean and maximum entropy using object-to-object relations. IEEE Access 9:27758–27772
Bera A, Wharton Z, Liu Y, Bessis N, Behera A (2021) Attend and guide (AG-Net): A keypoints-driven attention-based deep network for image recognition. IEEE Trans Image Process 30:3691–3704. https://doi.org/10.1109/TIP.2021.3064256
Gu R et al (2021) CA-Net: comprehensive attention convolutional neural networks for explainable medical image segmentation. IEEE Trans Med Imaging 40(2):699–711. https://doi.org/10.1109/TMI.2020.3035253
World Health Organization (WHO), Global status report on road safety, WHO Press, World Health Organization, Geneva, Switzerland, 2018.
Acknowledgements
This research was funded by the National Natural Science Foundation of China under Grant no. 62103056 and 62103177, the Development Plan of Youth Innovation Team of the University in Shandong Province under Grant no. 2019KJN007, and the Natural Science Foundation Program of Shandong Province under Grant no. ZR2019YQ28.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
All the authors declare no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chen, Q., Zhang, A. & Pan, G. A maximum-entropy-attention-based convolutional neural network for image perception. Neural Comput & Applic 35, 8647–8655 (2023). https://doi.org/10.1007/s00521-022-07564-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00521-022-07564-z