Chinese Journal of Ship Research, Vol. 20, Issue 1, 172 (2025)
Autonomous decision-making method of unmanned ship based on improved DDPG algorithm
To enhance the safety and efficiency of maritime traffic, this paper proposes an autonomous collision avoidance decision-making method for unmanned ships based on an enhanced Deep Deterministic Policy Gradient (DDPG) algorithm.
To address the low data utilization and poor convergence of the traditional DDPG algorithm, Prioritized Experience Replay (PER) is employed to dynamically adjust experience priorities and reduce sample correlation, and a Long Short-Term Memory (LSTM) network is used to improve convergence. Based on the domain knowledge of ships and in accordance with the International Regulations for Preventing Collisions at Sea (COLREGs), a model for determining encounter situations is introduced, together with a novel set of reward functions that account for urgent scenarios in which other ships fail to comply with the COLREGs. Generalization experiments involving two-ship and multi-ship encounters are conducted to validate the effectiveness of the proposed method.
The experimental results show that, compared with the traditional DDPG algorithm, the improved approach increases convergence speed by approximately 28.8%.
The trained model enables autonomous decision-making and navigation while ensuring compliance with the COLREGs, thereby providing valuable insights for intelligent decision-making in the field of maritime transportation.
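For readers unfamiliar with the replay mechanism named in the abstract, the following is a minimal sketch of proportional prioritized experience replay in its commonly described form. It is not the authors' implementation: the class name `PrioritizedReplayBuffer`, the parameters `alpha`, `beta`, and `eps`, their default values, and the simple list-based storage (rather than a sum tree) are all assumptions made for illustration.

```python
import random


class PrioritizedReplayBuffer:
    """Proportional prioritized replay (hypothetical, list-based sketch)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-5):
        self.capacity = capacity
        self.alpha = alpha      # assumed: how strongly priorities skew sampling
        self.eps = eps          # assumed: keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0            # next slot to overwrite once the buffer is full

    def add(self, transition):
        # New transitions get the current maximum priority so that each one
        # is likely to be replayed at least once.
        max_p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(max_p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = max_p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4):
        # Sampling probability is proportional to priority ** alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        idxs = random.choices(range(len(self.data)), weights=probs, k=batch_size)
        # Importance-sampling weights correct the bias of non-uniform sampling;
        # they are normalized by the largest weight in the batch.
        n = len(self.data)
        weights = [(n * probs[i]) ** (-beta) for i in idxs]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        return idxs, [self.data[i] for i in idxs], weights

    def update_priorities(self, idxs, td_errors):
        # After a learning step, priority tracks the absolute TD error.
        for i, err in zip(idxs, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

In a DDPG-style training loop, the critic's TD errors for a sampled batch would be passed back through `update_priorities`, and the returned weights would scale the critic loss. Sampling here costs O(n) per batch; a sum tree is the usual choice when the buffer is large.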
Wei GUAN, Shuhui HAO, Zhewen CUI, Miaomiao WANG. Autonomous decision-making method of unmanned ship based on improved DDPG algorithm[J]. Chinese Journal of Ship Research, 2025, 20(1): 172
Category: Planning and Decision-making
Received: May 14, 2024
Accepted: --
Published Online: Mar. 13, 2025