Loading [MathJax]/extensions/MathZoom.js
A Multi-UAV Monitoring and Search Strategy Based on Multi-Agent Reinforcement Learning | IEEE Conference Publication | IEEE Xplore

A Multi-UAV Monitoring and Search Strategy Based on Multi-Agent Reinforcement Learning


Abstract:

Multiple UAVs have been widely used for targets search and monitoring. Nonetheless, in practice, this problem is challenging as targets usually move randomly, but the tra...Show More

Abstract:

Multiple UAVs have been widely used for targets search and monitoring. Nonetheless, in practice, this problem is challenging as targets usually move randomly, but the trajectories of the targets cannot be predicted in advance by UAV swarms. Moreover, the efficiency of exploration, which indirectly impacts monitoring, is low when the number of targets in the environment is unknown. A Search Extended Deep Deterministic Policy Gradient (SEDDPG) method is proposed by utilizing distributed training and information sharing among multiple agents. Specifically, a f ramework o f d ecentralized p artially observable Markov decision processes is established to express this multiobjective optimization problem. Through distributed training and information sharing among multiple agents, a Search Extended Deep Deterministic Policy Gradient (SEDDPG) method is proposed to synchronize exploration and monitoring tasks. This helps multiple UAVs to monitor targets in the environment as extensively as possible. Simulation results demonstrate that the framework is converged, and outperforms in terms of monitoring ability and search efficiency.
Date of Conference: 18-20 October 2024
Date Added to IEEE Xplore: 22 January 2025
ISBN Information:

ISSN Information:

Conference Location: Nanjing, China

Funding Agency:


I. Introduction

Unmanned Aerial Vehicles (UAVs) play a crucial role in monitoring targets within defined areas due to their versatility, agility, and cost-effectiveness. They have been employed in diverse fields, such as environmental monitoring [1], reconnaissance and search [2], agriculture [3] and security surveillance [4]. Compared to a single UAV, multiple UAVs can cover larger areas more efficiently [5], reducing the time required for mission completion. Furthermore, they improve the robustness and reliability of the monitoring operation. For instance, UAV swarms can maintain continuous observation even if one unit fails, ensuring the integrity of the data collected. This collaborative approach enhances the overall effectiveness of the mission and enables more complex tasks to be performed with higher precision [6].

Contact IEEE to Subscribe

References

References is not available for this document.