Chinese Journal of Ship Research, Volume. 19, Issue 1, 137(2024)

Adaptive control of unmanned surface vehicle based on improved DDPG algorithm

Lifei SONG1,2, Chuanyi XU1,2, Le HAO1,2, Rong GUO1,2, and Wei CHAI1,2
Author Affiliations
  • 1Key Laboratory of High Performance Ship Technology of Ministry of Education, Wuhan University of Technology, Wuhan 430063, China
  • 2School of Naval Architecture, Ocean and Energy Power Engineering, Wuhan University of Technology, Wuhan 430063, China
  • show less

    Objective

    In order to tackle the issue of the poor navigation stability of unmanned surface vehicles (USVs) under interference conditions, an intelligent control parameter adjustment strategy based on the deep reinforcement learning (DRL) method is proposed.

    Method

    A dynamic model of a USV combining the line-of-sight (LOS) method and PID navigation controller is established to conduct its navigation control tasks. In view of the time-varying characteristics of PID parameters for course control under interference conditions, the DRL theory is introduced. The environmental state, action and reward functions of the intelligent agent are designed to adjust the PID parameters online. An improved deep deterministic policy gradient (DDPG) algorithm is proposed to increase the convergence speed and address the issue of the occurrence of local optima during the training process. Specifically, the original experience pool is separated into success and failure experience pools, and an adaptive sampling mechanism is designed to optimize the experience pool playback structure.

    Results

    The simulation results show that the improved algorithm converges rapidly with a slightly improved average return in the later stages of training. Under interference conditions, the lateral errors and heading angle deviations of the controller based on the improved DDPG algorithm are reduced significantly. Path tracking can be maintained more steadily after fitting the desired path faster.

    Conclusion

    The improved algorithm greatly reduces the cost of training time, enhances the steady-state performance of the agent in the later stages of training and achieves more accurate path tracking.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Lifei SONG, Chuanyi XU, Le HAO, Rong GUO, Wei CHAI. Adaptive control of unmanned surface vehicle based on improved DDPG algorithm[J]. Chinese Journal of Ship Research, 2024, 19(1): 137

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Oct. 11, 2022

    Accepted: --

    Published Online: Mar. 18, 2025

    The Author Email:

    DOI:10.19693/j.issn.1673-3185.03122

    Topics