A Decentralized Multi-Agent Reinforcement Learning Framework for Cooperative UAV Search: Navigating Non-Stationarity via Reward Shaping

doi:10.63944/e1p7kr68

A Decentralized Multi-Agent Reinforcement Learning Framework for Cooperative UAV Search: Navigating Non-Stationarity via Reward Shaping

Authors

DOI:

Keywords:

Multi Agent Reinforcement Learning, Cooperative Search, Decentralized Partially Observable Markov Decision Process, Reward Shaping, Non-stationarity

Abstract

Due to the complex coupling between high-dimensional state spaces and the stringent constraints of local perception, collaborative search by multi UAV swarms in unknown environments poses a formidable challenge within the field of autonomous systems. Although early algorithmic attempts often assumed stationary targets or perfect communication networks, the inherent non-stationarity of real-world dynamic environments renders traditional independent learning paradigms highly inefficient. Sometimes even leading to complete divergence during training. In an effort to overcome these analytical bottlenecks, this paper explores a Decentralized Partially Observable Markov Decision Process framework and introduces specific Multi Agent Reinforcement Learning methods under a Centralized Training with Decentralized Execution architecture. To fully validate and elucidate these emergent collaborative behaviors, extensive verification in real-world physical environments, as well as further research specifically addressing communication-constrained settings.

Downloads

Published

2026-04-02

Issue

Vol. 1 No. 03 (2026):

Section

Research Articles

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors retain copyright and grant the journal the right of first publication. This work is licensed under the Creative Commons Attribution 4.0 International License (CC BY 4.0). Provided that appropriate credit is given and the original publication in this journal is properly cited, others may copy, distribute, transmit, and adapt the work, including for commercial purposes. Authors may also deposit the published version in institutional repositories or on personal websites with a full citation to the final published article and a link to the journal page.

How to Cite

A Decentralized Multi-Agent Reinforcement Learning Framework for Cooperative UAV Search: Navigating Non-Stationarity via Reward Shaping. (2026). International Journal of Computer Science and Engineering, 1(03), 101-109. https://doi.org/10.63944/e1p7kr68

Download Citation

A Decentralized Multi-Agent Reinforcement Learning Framework for Cooperative UAV Search: Navigating Non-Stationarity via Reward Shaping

Authors

DOI:

Keywords:

Abstract

Downloads

Published

Issue

Section

Categories

License

How to Cite

Information