research-article

Evolutionary Multi-Objective Deep Reinforcement Learning for Autonomous UAV Navigation in Large-Scale Complex Environments

Authors:

Hisao IshibuchiAuthors Info & Claims

GECCO '23: Proceedings of the Genetic and Evolutionary Computation Conference

Pages 633 - 641

https://doi.org/10.1145/3583131.3590446

Published: 12 July 2023 Publication History

Abstract

Autonomous navigation of Unmanned Aerial Vehicles (UAVs) in large-scale complex environments presents a significant challenge in modern aerospace engineering, as it requires effective decision-making in an environment with limited sensing capacity, dynamic changes, and dense obstacles. Reinforcement Learning (RL) has been applied in sequential control problems, but the manual setting of hyperparameters, including reward functions, often results in suboptimal solutions and inadequate training. To address these limitations, we propose a framework that combines Multi-Objective Evolutionary Algorithms (MOEAs) with RL algorithms. The proposed framework generates a set of non-dominating parameters for the reward function using MOEAs, leading to diverse decision-making preferences, efficient convergence, and improved performance. The framework was tested on the autonomous navigation of UAVs and demonstrated significant improvement compared to traditional RL methods. This work offers a novel perspective on the problem of autonomous UAV navigation in large-scale complex environments and highlights the potential for further improvement through the integration of RL and MOEAs.

Supplementary Material

ZIP File (p633-an-suppl.zip)

Supplemental material.

Download
27.50 KB

References

[1]

Julian Blank and Kalyanmoy Deb. 2020. Pymoo: Multi-objective optimization in python. IEEE Access 8 (2020), 89497--89509.

[2]

Bruno N Coelho, Vitor N Coelho, Igor M Coelho, Luiz S Ochi, Roozbeh Haghnazar, Demetrius Zuidema, Milton SF Lima, and Adilson R da Costa. 2017. A multiobjective green UAV routing problem. Computers & Operations Research 88 (2017), 306--315.

Digital Library

[3]

Kalyanmoy Deb, Ram Bhushan Agrawal, et al. 1995. Simulated binary crossover for continuous search space. Complex systems 9, 2 (1995), 115--148.

[4]

Kalyanmoy Deb, Mayank Goyal, et al. 1996. A combined genetic adaptive search (GeneAS) for engineering design. Computer Science and informatics 26 (1996), 30--45.

[5]

Kalyanmoy Deb, Amrit Pratap, Sameer Agarwal, and TAMT Meyarivan. 2002. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE transactions on evolutionary computation 6, 2 (2002), 182--197.

[6]

Taha Elmokadem and Andrey V Savkin. 2021. Towards fully autonomous UAVs: A survey. Sensors 21, 18 (2021), 6223.

[7]

Bangkui Fan, Yun Li, Ruiyu Zhang, and Qiqi Fu. 2020. Review on the technological development and application of UAV systems. Chinese Journal of Electronics 29, 2 (2020), 199--207.

[8]

Youngjib Ham, Kevin K Han, Jacob J Lin, and Mani Golparvar-Fard. 2016. Visual monitoring of civil infrastructure systems via camera-equipped Unmanned Aerial Vehicles (UAVs): a review of related works. Visualization in Engineering 4, 1 (2016), 1--8.

[9]

Samira Hayat, Evşen Yanmaz, Timothy X Brown, and Christian Bettstetter. 2017. Multi-objective UAV path planning for search and rescue. In 2017 IEEE international conference on robotics and automation (ICRA). IEEE, 5569--5574.

Digital Library

[10]

Nikolas Hohmann, Mariusz Bujny, Jurgen Adamy, and Markus Olhofer. 2022. Multi-objective 3D Path Planning for UAVs in Large-Scale Urban Scenarios. Evolutionary Computation (CEC) (2022), 1--8.

[11]

Gary B Lamont, James N Slear, and Kenneth Melendez. 2007. UAV swarm mission planning and routing using multi-objective evolutionary algorithms. In 2007 IEEE Symposium on Computational Intelligence in Multi-Criteria Decision-Making. IEEE, 10--20.

[12]

Thomas Lee, Susan Mckeever, and Jane Courtney. 2021. Flying free: A research overview of deep learning in drone navigation autonomy. Drones 5, 2 (2021), 52.

[13]

Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971 (2015).

[14]

Xiaofeng Liu, Limei Gao, Zhiwei Guan, Yuqing Song, and Rui Zhang. 2016. A multi-objective optimization model for planning unmanned aerial vehicle cruise route. International Journal of Advanced Robotic Systems 13, 3 (2016), 116.

[15]

Abdul Majeed and Seong Oun Hwang. 2021. A multi-objective coverage path planning algorithm for UAVs to cover spatially distributed regions in urban environments. Aerospace 8, 11 (2021), 343.

[16]

Eric J Michaud, Adam Gleave, and Stuart Russell. 2020. Understanding learned reward functions. arXiv preprint arXiv:2012.05862 (2020).

[17]

Francesco Nex and Fabio Remondino. 2014. UAV for 3D mapping applications: a review. Applied geomatics 6 (2014), 1--15.

[18]

Markus Peschl, Arkady Zgonnikov, Frans A Oliehoek, and Luciano C Siebert. 2021. MORAL: Aligning AI with human norms through multi-objective reinforced active learning. arXiv preprint arXiv:2201.00012 (2021).

[19]

Amir Ramezani Dooraki and Deok-Jin Lee. 2022. A Multi-Objective Reinforcement Learning Based Controller for Autonomous Navigation in Challenging Environments. Machines 10, 7 (2022), 500.

[20]

Tao Ren, Jianwei Niu, Jiahe Cui, Zhenchao Ouyang, and Xuefeng Liu. 2021. An application of multi-objective reinforcement learning for efficient model-free control of canals deployed with IoT networks. Journal of Network and Computer Applications 182 (2021), 103049.

[21]

Mithra Sivakumar and Naga Malleswari TYJ. 2021. A literature survey of unmanned aerial vehicle usage for civil applications. Journal of Aerospace Technology and Management 13 (2021).

[22]

Teodor Tomic, Korbinian Schmid, Philipp Lutz, Andreas Domel, Michael Kassecker, Elmar Mair, Iris Lynne Grixa, Felix Ruess, Michael Suppa, and Darius Burschka. 2012. Toward a fully autonomous UAV: Research platform for indoor and outdoor urban search and rescue. IEEE robotics & automation magazine 19, 3 (2012), 46--56.

[23]

Chao Wang, Jian Wang, Yuan Shen, and Xudong Zhang. 2019. Autonomous navigation of UAVs in large-scale complex environments: A deep reinforcement learning approach. IEEE Transactions on Vehicular Technology 68, 3 (2019), 2124--2136.

[24]

Jie Xu, Yunsheng Tian, Pingchuan Ma, Daniela Rus, Shinjiro Sueda, and Wojciech Matusik. 2020. Prediction-guided multi-objective reinforcement learning for continuous robot control. In International conference on machine learning. PMLR, 10607--10616.

[25]

Chunhua Zhang and John M Kovacs. 2012. The application of small unmanned aerial systems for precision agriculture: a review. Precision agriculture 13 (2012), 693--712.

[26]

Qi-dan Zhu and Yu Ma. 2020. A design of T-foil and trim tab for fast catamaran based on NSGA-II. Journal of Hydrodynamics 32, 1 (2020), 161--174.

Cited By

Tuncal AErol U(2024)Integrating Unmanned Aerial Vehicles in Airspace: A Systematic ReviewJournal of Aviation Research10.51785/jar.13932716:1(89-115)Online publication date: 28-Feb-2024
https://doi.org/10.51785/jar.1393271

Recommendations

Reinforcement Learning for Autonomous UAV Navigation Using Function Approximation
2018 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR)
Ahstract- Unmanned aerial vehicles (UAV) are commonly used for search and rescue missions in unknown environments, where an exact mathematical model of the environment may not be available. This paper proposes a framework for the UAV to locate a missing ...
Autonomous UAV Navigation in Wilderness Search-and-Rescue Operations Using Deep Reinforcement Learning
AI 2022: Advances in Artificial Intelligence
Abstract
Wilderness Search and Rescue (WiSAR) operations require navigating large unknown environments and locating missing victims with high precision and in a timely manner. Several studies used deep reinforcement learning (DRL) to allow for the ...
Autonomous navigation of UAV in complex environment : a deep reinforcement learning method based on temporal attention: Autonomous navigation of UAV in complex environment...
Abstract
With the increasing demand for Unmanned Aerial Vehicles (UAVs) in both military and civil applications, the ability for UAVs to automatically avoid obstacles and navigate to specific destinations has been receiving growing attention. However, most ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

GECCO '23: Proceedings of the Genetic and Evolutionary Computation Conference

July 2023

1667 pages

ISBN:9798400701191

DOI:10.1145/3583131

Chair:
Sara Silva,
Program Chair:
Luís Paquete

Copyright © 2023 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGEVO: ACM Special Interest Group on Genetic and Evolutionary Computation

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 July 2023

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Qualifiers

Research-article

Conference

GECCO '23

Sponsor:

SIGEVO

GECCO '23: Genetic and Evolutionary Computation Conference

July 15 - 19, 2023

Lisbon, Portugal

Acceptance Rates

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
322
Total Downloads

Downloads (Last 12 months)160
Downloads (Last 6 weeks)13

Reflects downloads up to 28 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Tuncal AErol U(2024)Integrating Unmanned Aerial Vehicles in Airspace: A Systematic ReviewJournal of Aviation Research10.51785/jar.13932716:1(89-115)Online publication date: 28-Feb-2024
https://doi.org/10.51785/jar.1393271

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten