pp. 1619-1630
S&M3275 Research Paper of Special Issue https://doi.org/10.18494/SAM4305 Published: May 22, 2023 Multi-objective Game Learning Algorithm Based on Multi-armed Bandit in Underwater Acoustic Communication Networks [PDF] Hui Wang and Liejun Yang (Received January 7, 2023; Accepted April 25, 2023) Keywords: underwater acoustic communication, reinforcement learning, power allocation, multi-armed bandit
To address the challenges of interference in underwater multi-node communication and enhance the efficiency of underwater acoustic communication, we propose a multi-objective game learning algorithm based on the multi-armed bandit framework. Firstly, the multi-objective optimization problem is constructed as a multi-node multi-armed bandit (MAB) game model. Secondly, we incorporate the overall network interference level and nodes’ power cost in the utility function to achieve the desired optimization objectives. Thirdly, we establish the existence and uniqueness of the Nash equilibrium point of the game model and introduce an improved greedy strategy MAB learning algorithm to determine the equilibrium solution. Finally, our simulation results demonstrate that the proposed algorithm effectively optimizes interference management while enhancing the nodes’ adaptive capabilities.
Corresponding author: Hui WangThis work is licensed under a Creative Commons Attribution 4.0 International License. Cite this article Hui Wang and Liejun Yang, Multi-objective Game Learning Algorithm Based on Multi-armed Bandit in Underwater Acoustic Communication Networks, Sens. Mater., Vol. 35, No. 5, 2023, p. 1619-1630. |