EHDC: enhanced dilated convolution framework for underwater blurred target recognition

Lei Cai; Xiaochen Qin; Tao Xu

doi:10.1017/S0263574722001059

EHDC: enhanced dilated convolution framework for underwater blurred target recognition

Published online by Cambridge University Press: 26 July 2022

Lei Cai

Xiaochen Qin

and

Tao Xu

Show author details

Lei Cai*: Affiliation:
School of Artificial Intelligence, Henan Institute of Science and Technology, Xinxiang, China
Xiaochen Qin: Affiliation:
School of Information Engineering, Henan Institute of Science and Technology, Xinxiang, China
Tao Xu: Affiliation:
School of Artificial Intelligence, Henan Institute of Science and Technology, Xinxiang, China
*: *Corresponding author. E-mail: cailei2014@126.com

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

The autonomous underwater vehicle (AUV) has a problem with feature loss when recognizing small targets underwater. At present, algorithms usually use multi-scale feature extraction to solve the problem, but this method increases the computational effort of the algorithm. In addition, low underwater light and turbid water result in incomplete information on target features. This paper proposes an enhanced dilated convolution framework (EHDC) for underwater blurred target recognition. Firstly, this paper extracts small target features through hybrid dilated convolution networks, increasing the perceptive field of the algorithm without increasing the computational power of the algorithm. Secondly, the proposed algorithm learns spatial semantic features through an adaptive correlation matrix and compensates for the missing features of the target. Finally, this paper fuses spatial semantic features and visual features for the recognition of small underwater blurred targets. Experiments show that the proposed method improves the recognition accuracy by 1.04% compared to existing methods when recognizing small underwater blurred targets.

Keywords

blurred small target hybrid dilated convolution spatial semantic features low light conditions target recognition

Type: Research Article
Information: Robotica , Volume 41 , Issue 3 , March 2023 , pp. 900 - 911

DOI: https://doi.org/10.1017/S0263574722001059 [Opens in a new window]
Copyright: © The Author(s), 2022. Published by Cambridge University Press

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

Lu, L., Li, H., Ding, Z. and Guo, Q., “An improved target detection method based on multiscale features fusion,” Microw. Opt. Technol. Lett. 62(9), 1451–1460 (2020).CrossRef Google Scholar

Wang, P., Chen, P., Yuan, Y., Liu, D., Huang, Z., Hou, X. and Cotrell, G., “Understanding Convolution for Semantic Segmentation,” In: 2018 18th IEEE Winter Conference on Applications of Computer Vision (WACV), Lake Tahoe, NV, USA (2018).Google Scholar

Sun, Q. and Cai, L., “Multi-AUV Target Recognition Method Based on GAN-meta Learning,” In: 2020 5th International Conference On Advanced Robotics and Mechatronics (ICARM 2020), Shenzhen, China (2020) pp. 374–379.Google Scholar

Cai, L., Chen, C. and Chai, H., “Underwater distortion target recognition network (UDTRNet) via enhanced image features,” Comput. Intell. Neurosci. 1(9), 1–10 (2021).Google Scholar

Jian, M., Qi, Q., Dong, J., Yin, Y. and Lam, K. M., “Integrating QDWD with pattern distinctness and local contrast for underwater saliency detection,” J. Vis. Commun. Image Represent. 53, 31–41 (2018).CrossRef Google Scholar

Kong, W., Hong, J., Jia, M., Yao, J., Cong, W., Hu, H. and Zhang, H., “YOLOv3-DPFIN: A dual-path feature fusion neural network for robust real-time sonar target detection,” IEEE Sens. J. 20(7), 3745–3756 (2019).CrossRef Google Scholar

Wu, Q., An, Z., Chen, H., Qian, X. and Sun, L., “Small target recognition method on weak features,” Multimed. Tools Appl. 80(3), 4183–4201 (2021).CrossRef Google Scholar

Gongor, F. and Tutsoy, O., “Design and implementation of a facial character analysis algorithm for humanoid robots,” Robotica 37(11), 1835–1849 (2019).CrossRef Google Scholar

Li, J., Zhang, F., Xiang, Y., Pan, S., “Towards small target recognition with photonics-based high resolution radar range profiles,” Opt. Express 29(20), 31574–31581 (2021).CrossRef Google Scholar PubMed

Cao, C., Hou, Q., Gulliver, T. A. and Lan, Q., “A passive detection algorithm for low-altitude small target based on a wavelet neural network,” Soft Comput. 24(14), 10693–10703 (2020).CrossRef Google Scholar

Shuang-Chen, W. and Zheng-Rong, Z., “Small target detection in infrared images using deep convolutional neural networks,” J. Infrared Millim. Waves 38(3), 371 (2019).Google Scholar

He, Y., Zhang, C., Mu, T., Yan, T., Wang, Y. and Chen, Z., “Multiscale local gray dynamic range method for infrared small-target detection,” IEEE Geosci. Remote Sens. Lett. 18(10), 1846–1850 (2020).CrossRef Google Scholar

Kannappan, P. and Tanner, H. G., “Distance-based global descriptors for multi-view object recognition,” Robotica 38(1), 106–117 (2020).CrossRef Google Scholar

Deng, H., Sun, X. and Zhou, X., “A multiscale fuzzy metric for detecting small infrared targets against chaotic cloudy/sea-sky backgrounds,” IEEE Trans. Cybern. 49(5), 1694–1707 (2018).CrossRef Google Scholar PubMed

Cheng, L. B., Jiang, Z. H., H.Li, B. W. and Huang, Q., “Target-tools recognition method based on an image feature library for space station cabin service robots,” Robotica 34(4), 925–941 (2016).CrossRef Google Scholar

Li, W., Zhang, X., Peng, Y. and Dong, M., “DMNet: A network architecture using dilated convolution and multiscale mechanisms for spatiotemporal fusion of remote sensing images,” IEEE Sens. J. 20(20), 12190–12202 (2020).CrossRef Google Scholar

Wang, Y., Hu, S., Wang, G., Chen, C. and Pan, Z., “Pan “Multi-scale dilated convolution of convolutional neural network for crowd counting,” Multimed. Tools Appl. 79(1), 1057–1073 (2020).CrossRef Google Scholar

Jian, M., Liu, X., Luo, H., Lu, X., Yu, H. and Dong, J., “Underwater image processing and analysis: A review,” Signal Process. Image Commun. 91, 116088 (2021).CrossRef Google Scholar

Jian, M., Qi, Q., Yu, H., Dong, J., Cui, C., Nie, X., Zhang, H., Yin, Y. and Lam, K. M., “The extended marine underwater environment database and baseline evaluations,” Appl. Soft. Comput. 80, 425–437 (2019).CrossRef Google Scholar

Fang, J. and Liu, G., “Visual object tracking based on mutual learning between cohort multiscale feature-fusion networks with weighted loss,” IEEE Trans. Circuits Syst. Video Technol. 31(3), 1055–1065 (2020).CrossRef Google Scholar

Shen, C., Zhao, X., Fan, X., Lian, X., Zhang, F., Kreidieh, A. R. and Liu, Z., “Multi-receptive field graph convolutional neural networks for pedestrian detection,” IET Intell. Transp. Syst. 13(9), 1319–1328 (2019).CrossRef Google Scholar

Gama, F., Isufi, E., Leus, G. and Ribeiro, A., “Graphs, convolutions, and neural networks: From graph filters to graph neural networks,” IEEE Signal Process. Mag. 37(6), 128–138 (2020).CrossRef Google Scholar

Fu, B., Fu, S., Wang, L., Dong, Y. and Ren, Y., “Deep residual split directed graph convolutional neural networks for action recognition,” IEEE Multimed. 27(4), 9–17 (2020).CrossRef Google Scholar

Lu, Y., Chen, Y., Zhao, D., Liu, B., Lai, Z. and Chen, J., “CNN-G: Convolutional neural network combined with graph for image segmentation with theoretical analysis,” IEEE Trans. Cogn. Dev. Syst. 13(3), 631–644 (2020).CrossRef Google Scholar

Zhang, J., Jin, X., Sun, J., Wang, J., Sangaiah, A. K., “Spatial and semantic convolutional features for robust visual object tracking,” Multimed. Tools Appl. 79(21), 15095–15115 (2020).CrossRef Google Scholar

Zhang, P. and Zhang, J. X., “Deep learning analysis based on multi-sensor fusion data for hemiplegia rehabilitation training system for stoke patients,” Robotica 40(3), 780–797 (2022).CrossRef Google Scholar

Tian, S., Kang, L., Xing, X., Li, Z., Zhao, L., Fan, C. and Zhang, Y., “Siamese graph embedding network for object detection in remote sensing images,” IEEE Geosci. Remote Sens. Lett. 2(4), 602–606 (2020).Google Scholar

Li, H., Qiu, K., Chen, L., Mei, X., Hong, L., Tao, C., “SCAttNet: Semantic segmentation network with spatial and channel attention mechanism for high-resolution remote sensing images,” IEEE Geosci. Remote Sens. Lett. 18(5), 905–909 (2020).CrossRef Google Scholar

Yin, L. and Hu, H., “Enhanced global attention upsample decoder based on enhanced spatial attention and feature aggregation module for semantic segmentation,” Electron. Lett. 56(13), 659–661 (2020).CrossRef Google Scholar

Wang, S., Lan, L., Zhang, X. and Luo, Z., “GateCap: Gated spatial and semantic attention model for image captioning,” Multimed. Tools Appl. 79(17), 11531–11549 (2020).CrossRef Google Scholar

Jian, M., Wang, J., Yu, H. and Wang, G. G., “Integrating object proposal with attention networks for video saliency detection,” Inf. Sci. 576, 819–830 (2021).CrossRef Google Scholar

Avelin, B. and Nyström, K., “Neural ODEs as the deep limit of ResNets with constant weights,” Anal. Appl. 19(3), 397–437 (2021).CrossRef Google Scholar

Zhang, X., Chen, Z., Wu, Q. J., Cai, L., Lu, D. and Li, X., “Fast semantic segmentation for scene perception,” IEEE Trans. Ind. Inform. 15(2), 1183–1192 (2018).CrossRef Google Scholar

Yang, T., Wei, Y., Tu, Z., Zeng, H., Kinsy, M. A., Zheng, N. and Ren, P., “Design space exploration of neural network activation function circuits,” IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 38(10), 1974–1978 (2018).CrossRef Google Scholar

Li, Q., Peng, X., Qiao, Y. and Peng, Q., “Learning label correlations for multi-label image recognition with graph networks,” Pattern Recognit. Lett. 138(1), 378–384 (2020).CrossRef Google Scholar

Li, Y., Zhang, X. and Chen, D., “Csrnet: Dilated Convolutional Neural Networks for Understanding the Highly Congested Scenes,” In: 31st IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, pp. 1091–1100.Google Scholar

Jiang, J., Lyu, C., Liu, S., He, Y. and Hao, X., “RWSNet: A semantic segmentation network based on SegNet combined with random walk for remote sensing,” Int. J. Remote Sens. 41(2), 487–505 (2020).CrossRef Google Scholar

Tian, H., Zheng, Y. and Jin, Z., “MobileNet-SSD MicroScope Using Adaptive Error Correction Algorithm: Real-Time Detection of License Plates on Mobile Devices,” In: 6th International Conference on Energy, Environment and Materials Science (EEMS), Hulun Buir, China (2020) pp. 1091–1100.Google Scholar

Hu, X., Li, H., Li, X. and Wang, C., “MobileNet-SSD MicroScope using adaptive error correction algorithm: Real-time detection of license plates on mobile devices,” IET Intell. 14(2), 110–118 (2020).Google Scholar

Article contents

EHDC: enhanced dilated convolution framework for underwater blurred target recognition

Abstract

Keywords

Access options

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests