Chinese Journal of Ship Research, Vol. 20, Issue 1, 172 (2025)
Autonomous decision-making method of unmanned ship based on improved DDPG algorithm
[19] ŚMIERZCHALSKI R. Ships' domains as collision risk at sea in the evolutionary method of trajectory planning[M]//SAEED K, PEJAŚ J. Information Processing and Security Systems. Boston: Springer, 2005: 411-422.
[21] CHRISTIANO P F, LEIKE J, BROWN T B, et al. Deep reinforcement learning from human preferences[C]//Proceedings of the 31st International Conference on Neural Information Processing Systems. Long Beach: Curran Associates Inc., 2017: 4299-4307.
[22] ZHENG Z Y, OH J, SINGH S. On learning intrinsic rewards for policy gradient methods[C]//Proceedings of the 32nd International Conference on Neural Information Processing Systems. Montréal: Curran Associates Inc., 2018: 4644-4654.
[23] ZHENG Z Y, OH J, HESSEL M, et al. What can learned intrinsic rewards capture[C]//Proceedings of the 37th International Conference on Machine Learning. PMLR, 2019: 1060.
Wei GUAN, Shuhui HAO, Zhewen CUI, Miaomiao WANG. Autonomous decision-making method of unmanned ship based on improved DDPG algorithm[J]. Chinese Journal of Ship Research, 2025, 20(1): 172
Category: Planning and Decision-making
Received: May. 14, 2024
Published Online: Mar. 13, 2025