A deep Q network assisted method for underwater gliders standoff tracking to the static target

Zang, Wenchuan; Yao, Peng; Lv, Kunling; Song, Dalei

doi:10.1007/s00521-022-07408-w

A deep Q network assisted method for underwater gliders standoff tracking to the static target

Original Article
Published: 07 August 2022

Volume 34, pages 20575–20587, (2022)
Cite this article

Neural Computing and Applications Aims and scope Submit manuscript

Wenchuan Zang¹,
Peng Yao²,
Kunling Lv² &
…
Dalei Song ORCID: orcid.org/0000-0001-5407-5989²

297 Accesses
1 Altmetric
Explore all metrics

Abstract

Underwater gliders lack the necessary navigation equipment and have low control performance, which deteriorate the autonomy and efficiency of the sampling. The underwater gliders standoff tracking based on the Lyapunov guidance vector fields is introduced in this work to enhance the autonomy of gliders in observing the potential static targets. To avoid designing complex control processes, we convert the standoff tracking into a Markovian decision process and introduce reinforcement learning methods to solve the task. Also, to trade-off the fast training and achieving acceptable results, we design a control framework that integrates classical controller and reinforcement learning. The simulations show that the proposed framework outperform than the comparison method. This work can provide a new pattern for the sampling control of gliders. The proposed method combining reinforcement learning with classical controller can provide a reference for other applications of reinforcement learning.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Adaptive Path Planning for Plume Detection with an Underwater Glider

USV Path Planning Based on Adaptive Fuzzy Reward

Path Planning of Maritime Autonomous Surface Ships in Unknown Environment with Reinforcement Learning

Data Availability Statement

Data sharing not applicable to this article as no datasets were generated or analyzed during the current study.

References

Petritoli E, Cagnetti M, Leccese F (2020) Simulation of autonomous underwater vehicles (auvs) swarm diffusion. Sensors 20(17):4950
Article Google Scholar
Petritoli E, Leccese F, Cagnetti M (2020) Underwater gliders: mission profiles and utilisation strategies in the mediterranean sea. In: 2019 IMEKO TC19 international workshop on metrology for the sea: learning to measure sea health parameters. MetroSea 2019, pp 254–258
Zhang Y, Zhang Z, Quan Z, Liu G-L (2021) Hydrodynamic performance and calculation of lift-drag ratio on underwater glider. J Mar Sci Technol 26(1):16–23
Article Google Scholar
Zang W, Yao P, Song D (2022) Standoff tracking control of underwater glider to moving target. Appl Math Modell 102:1–20
Article MathSciNet Google Scholar
Du X, Zhang X (2020) Influence of ocean currents on the stability of underwater glider self-mooring motion with a cable. Nonlinear Dyn 99(3):2291–2317
Article Google Scholar
Benoit-Bird KJ, Patrick Welch T, Waluk CM et al (2018) Equipping an underwater glider with a new echosounder to explore ocean ecosystems. Limnol. Oceanogr. Methods 16(11):734–749
Article Google Scholar
Chen X, Kong Y, Fang X, Wu Q (2013) A fast two-stage aco algorithm for robotic path planning. Neural Comput Appl 22(2):313–319
Article Google Scholar
Petritoli E, Leccese F (2018) High accuracy attitude and navigation system for an autonomous underwater vehicle (auv). Acta Imeko 7(2):3–9
Article Google Scholar
Wu H, Niu W, Wang S, Yan S (2021) An optimization method for control parameters of underwater gliders considering energy consumption and motion accuracy. Appl Math Modell 90:1099–1119
Article MathSciNet MATH Google Scholar
Shu Y, Chen J, Li S, Wang Q, Yu J, Wang D (2019) Field-observation for an anticyclonic mesoscale eddy consisted of twelve gliders and sixty-two expendable probes in the northern south china sea during summer 2017. Sci China Earth Sci 62(2):451–458
Article Google Scholar
Qiu C, Mao H, Liu H, Xie Q, Yu J, Su D, Ouyang J, Lian S (2019) Deformation of a warm eddy in the northern south china sea. J Geophys Res Oceans 124(8):5551–5564
Article Google Scholar
Yao P, Wang H, Su Z (2016) Cooperative path planning with applications to target tracking and obstacle avoidance for multi-uavs. Aerosp Sci Technol 54:10–22
Article Google Scholar
Liblik T, Karstensen J, Testor P, Alenius P, Hayes D, Ruiz S, Heywood K, Pouliquen S, Mortier L, Mauri E (2016) Potential for an underwater glider component as part of the global ocean observing system. Methods Oceanogr 17:50–82
Article Google Scholar
Lawrence D (2003) Lyapunov vector fields for uav flock coordination. In: 2nd AIAA “Unmanned Unlimited” Conf. and Workshop & Exhibit, p 6575
Wei X, Yao P, Xie Z (2020) Comprehensive optimization of energy storage and standoff tracking for solarpowered UAV. IEEE Syst J 14(4):5133–5143
Article Google Scholar
Liang Y, Jia Y, Du J, Zhang J (2015) Vector field guidance for three-dimensional curved path following with fixed-wing uavs. In: 2015 American control conference (ACC). IEEE, pp 1187–1192
Zhang S, Yu J, Zhang A, Zhang F (2013) Spiraling motion of underwater gliders: modeling, analysis, and experimental results. Ocean Eng 60:1–13
Article Google Scholar
Fu J, Zhang J, She Z, Ovur SE, Li W, Qi W, Su H, Ferrigno G, De Momi E (2021) Whole-body spatial teleoperation control of a hexapod robot in unstructured environment. In: 2021 6th IEEE international conference on advanced robotics and mechatronics (ICARM). IEEE, pp 93–98
Cao F (2020) Pid controller optimized by genetic algorithm for direct-drive servo system. Neural Comput Appl 32(1):23–30
Article Google Scholar
Sang H, Zhou Y, Sun X, Yang S (2018) Heading tracking control with an adaptive hybrid control for under actuated underwater glider. ISA Trans 80:554–563
Article Google Scholar
Zhou S, Zhou Y, Xu Z, Chang W, Cheng Y (2019) The landing safety prediction model by integrating pattern recognition and markov chain with flight data. Neural Comput Appl 31(1):147–159
Article Google Scholar
Sutton RS, Barto AG (2018) Reinforcement learning: an introduction. MIT Press, Cambridge
MATH Google Scholar
Sun Y, Ran X, Zhang G, Xu H, Wang X (2020) Auv 3d path planning based on the improved hierarchical deep q network. J Mar Sci Eng 8(2):145
Article Google Scholar
Zang W, Nie Y, Song D, Guo T, Li K (2019) Research on constraining strategies of underwater glider’s trajectory under the influence of ocean currents based on dqn algorithm. In: OCEANS 2019 MTS/IEEE SEATTLE. IEEE, pp 1–5
Mnih V, Kavukcuoglu K, Silver D, Graves A, Antonoglou I, Wierstra D, Riedmiller M (2013) Playing atari with deep reinforcement learning. arXiv:1312.5602
Lv L, Zhang S, Ding D, Wang Y (2019) Path planning via an improved dqn-based learning policy. IEEE Access 7:67319–67330
Article Google Scholar
Moon JH, Jee SC, Lee HJ (2016) Output-feedback control of underwater gliders by buoyancy and pitching moment control: feedback linearization approach. Int J Control Automat Syst 14(1):255–262
Article Google Scholar
Zhang F, Tan X, Khalil HK (2012) Passivity-based controller design for stablization of underwater gliders. In: 2012 American control conference (ACC). IEEE, pp 5408–5413
Zhang S-W, Yu J-C, Zhang A-Q (2012) Optimal control for underwater gliders in the vertical plane. Control Theory Appl 29(1):19–26
Google Scholar

Download references

Acknowledgements

This work is supported by the National Natural Science Foundation of China (Grant No. 51909252), and the Fundamental Research Funds for the Central Universities (Grant No. 202061004). This work is also partly supported by the China Scholarship Council.

Author information

Authors and Affiliations

The College of Information Science and Engineering, Ocean University of China, Songling Road, Qingdao, 266000, Shandong, China
Wenchuan Zang
The College of Engineering, Ocean University of China, Songling Road, Qingdao, 266000, Shandong, China
Peng Yao, Kunling Lv & Dalei Song

Authors

Wenchuan Zang
View author publications
You can also search for this author in PubMed Google Scholar
Peng Yao
View author publications
You can also search for this author in PubMed Google Scholar
Kunling Lv
View author publications
You can also search for this author in PubMed Google Scholar
Dalei Song
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Wenchuan Zang, software, conceptualization, manuscript preparation-writing. Peng Yao, methodology, conceptualization. Kunling Lv, methodology, manuscript preparation-editing. Dalei Song, manuscript preparation-editing.

Corresponding author

Correspondence to Dalei Song.

Ethics declarations

Conflict of interest

The authors have no conflicts of interest to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Zang, W., Yao, P., Lv, K. et al. A deep Q network assisted method for underwater gliders standoff tracking to the static target. Neural Comput & Applic 34, 20575–20587 (2022). https://doi.org/10.1007/s00521-022-07408-w

Download citation

Received: 19 October 2021
Accepted: 06 May 2022
Published: 07 August 2022
Issue Date: December 2022
DOI: https://doi.org/10.1007/s00521-022-07408-w

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A deep Q network assisted method for underwater gliders standoff tracking to the static target

Abstract

Access this article

Similar content being viewed by others

Adaptive Path Planning for Plume Detection with an Underwater Glider

USV Path Planning Based on Adaptive Fuzzy Reward

Path Planning of Maritime Autonomous Surface Ships in Unknown Environment with Reinforcement Learning

Data Availability Statement

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A deep Q network assisted method for underwater gliders standoff tracking to the static target

Abstract

Access this article

Similar content being viewed by others

Adaptive Path Planning for Plume Detection with an Underwater Glider

USV Path Planning Based on Adaptive Fuzzy Reward

Path Planning of Maritime Autonomous Surface Ships in Unknown Environment with Reinforcement Learning

Data Availability Statement

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation