skip to main content
10.1145/3583131.3590446acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
research-article

Evolutionary Multi-Objective Deep Reinforcement Learning for Autonomous UAV Navigation in Large-Scale Complex Environments

Published: 12 July 2023 Publication History

Abstract

Autonomous navigation of Unmanned Aerial Vehicles (UAVs) in large-scale complex environments presents a significant challenge in modern aerospace engineering, as it requires effective decision-making in an environment with limited sensing capacity, dynamic changes, and dense obstacles. Reinforcement Learning (RL) has been applied in sequential control problems, but the manual setting of hyperparameters, including reward functions, often results in suboptimal solutions and inadequate training. To address these limitations, we propose a framework that combines Multi-Objective Evolutionary Algorithms (MOEAs) with RL algorithms. The proposed framework generates a set of non-dominating parameters for the reward function using MOEAs, leading to diverse decision-making preferences, efficient convergence, and improved performance. The framework was tested on the autonomous navigation of UAVs and demonstrated significant improvement compared to traditional RL methods. This work offers a novel perspective on the problem of autonomous UAV navigation in large-scale complex environments and highlights the potential for further improvement through the integration of RL and MOEAs.

Supplementary Material

ZIP File (p633-an-suppl.zip)
Supplemental material.

References

[1]
Julian Blank and Kalyanmoy Deb. 2020. Pymoo: Multi-objective optimization in python. IEEE Access 8 (2020), 89497--89509.
[2]
Bruno N Coelho, Vitor N Coelho, Igor M Coelho, Luiz S Ochi, Roozbeh Haghnazar, Demetrius Zuidema, Milton SF Lima, and Adilson R da Costa. 2017. A multiobjective green UAV routing problem. Computers & Operations Research 88 (2017), 306--315.
[3]
Kalyanmoy Deb, Ram Bhushan Agrawal, et al. 1995. Simulated binary crossover for continuous search space. Complex systems 9, 2 (1995), 115--148.
[4]
Kalyanmoy Deb, Mayank Goyal, et al. 1996. A combined genetic adaptive search (GeneAS) for engineering design. Computer Science and informatics 26 (1996), 30--45.
[5]
Kalyanmoy Deb, Amrit Pratap, Sameer Agarwal, and TAMT Meyarivan. 2002. A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE transactions on evolutionary computation 6, 2 (2002), 182--197.
[6]
Taha Elmokadem and Andrey V Savkin. 2021. Towards fully autonomous UAVs: A survey. Sensors 21, 18 (2021), 6223.
[7]
Bangkui Fan, Yun Li, Ruiyu Zhang, and Qiqi Fu. 2020. Review on the technological development and application of UAV systems. Chinese Journal of Electronics 29, 2 (2020), 199--207.
[8]
Youngjib Ham, Kevin K Han, Jacob J Lin, and Mani Golparvar-Fard. 2016. Visual monitoring of civil infrastructure systems via camera-equipped Unmanned Aerial Vehicles (UAVs): a review of related works. Visualization in Engineering 4, 1 (2016), 1--8.
[9]
Samira Hayat, Evşen Yanmaz, Timothy X Brown, and Christian Bettstetter. 2017. Multi-objective UAV path planning for search and rescue. In 2017 IEEE international conference on robotics and automation (ICRA). IEEE, 5569--5574.
[10]
Nikolas Hohmann, Mariusz Bujny, Jurgen Adamy, and Markus Olhofer. 2022. Multi-objective 3D Path Planning for UAVs in Large-Scale Urban Scenarios. Evolutionary Computation (CEC) (2022), 1--8.
[11]
Gary B Lamont, James N Slear, and Kenneth Melendez. 2007. UAV swarm mission planning and routing using multi-objective evolutionary algorithms. In 2007 IEEE Symposium on Computational Intelligence in Multi-Criteria Decision-Making. IEEE, 10--20.
[12]
Thomas Lee, Susan Mckeever, and Jane Courtney. 2021. Flying free: A research overview of deep learning in drone navigation autonomy. Drones 5, 2 (2021), 52.
[13]
Timothy P Lillicrap, Jonathan J Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, and Daan Wierstra. 2015. Continuous control with deep reinforcement learning. arXiv preprint arXiv:1509.02971 (2015).
[14]
Xiaofeng Liu, Limei Gao, Zhiwei Guan, Yuqing Song, and Rui Zhang. 2016. A multi-objective optimization model for planning unmanned aerial vehicle cruise route. International Journal of Advanced Robotic Systems 13, 3 (2016), 116.
[15]
Abdul Majeed and Seong Oun Hwang. 2021. A multi-objective coverage path planning algorithm for UAVs to cover spatially distributed regions in urban environments. Aerospace 8, 11 (2021), 343.
[16]
Eric J Michaud, Adam Gleave, and Stuart Russell. 2020. Understanding learned reward functions. arXiv preprint arXiv:2012.05862 (2020).
[17]
Francesco Nex and Fabio Remondino. 2014. UAV for 3D mapping applications: a review. Applied geomatics 6 (2014), 1--15.
[18]
Markus Peschl, Arkady Zgonnikov, Frans A Oliehoek, and Luciano C Siebert. 2021. MORAL: Aligning AI with human norms through multi-objective reinforced active learning. arXiv preprint arXiv:2201.00012 (2021).
[19]
Amir Ramezani Dooraki and Deok-Jin Lee. 2022. A Multi-Objective Reinforcement Learning Based Controller for Autonomous Navigation in Challenging Environments. Machines 10, 7 (2022), 500.
[20]
Tao Ren, Jianwei Niu, Jiahe Cui, Zhenchao Ouyang, and Xuefeng Liu. 2021. An application of multi-objective reinforcement learning for efficient model-free control of canals deployed with IoT networks. Journal of Network and Computer Applications 182 (2021), 103049.
[21]
Mithra Sivakumar and Naga Malleswari TYJ. 2021. A literature survey of unmanned aerial vehicle usage for civil applications. Journal of Aerospace Technology and Management 13 (2021).
[22]
Teodor Tomic, Korbinian Schmid, Philipp Lutz, Andreas Domel, Michael Kassecker, Elmar Mair, Iris Lynne Grixa, Felix Ruess, Michael Suppa, and Darius Burschka. 2012. Toward a fully autonomous UAV: Research platform for indoor and outdoor urban search and rescue. IEEE robotics & automation magazine 19, 3 (2012), 46--56.
[23]
Chao Wang, Jian Wang, Yuan Shen, and Xudong Zhang. 2019. Autonomous navigation of UAVs in large-scale complex environments: A deep reinforcement learning approach. IEEE Transactions on Vehicular Technology 68, 3 (2019), 2124--2136.
[24]
Jie Xu, Yunsheng Tian, Pingchuan Ma, Daniela Rus, Shinjiro Sueda, and Wojciech Matusik. 2020. Prediction-guided multi-objective reinforcement learning for continuous robot control. In International conference on machine learning. PMLR, 10607--10616.
[25]
Chunhua Zhang and John M Kovacs. 2012. The application of small unmanned aerial systems for precision agriculture: a review. Precision agriculture 13 (2012), 693--712.
[26]
Qi-dan Zhu and Yu Ma. 2020. A design of T-foil and trim tab for fast catamaran based on NSGA-II. Journal of Hydrodynamics 32, 1 (2020), 161--174.

Cited By

View all
  • (2024)Integrating Unmanned Aerial Vehicles in Airspace: A Systematic ReviewJournal of Aviation Research10.51785/jar.13932716:1(89-115)Online publication date: 28-Feb-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
GECCO '23: Proceedings of the Genetic and Evolutionary Computation Conference
July 2023
1667 pages
ISBN:9798400701191
DOI:10.1145/3583131
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 July 2023

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

GECCO '23
Sponsor:

Acceptance Rates

Overall Acceptance Rate 1,669 of 4,410 submissions, 38%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)160
  • Downloads (Last 6 weeks)13
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Integrating Unmanned Aerial Vehicles in Airspace: A Systematic ReviewJournal of Aviation Research10.51785/jar.13932716:1(89-115)Online publication date: 28-Feb-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media