Chinese Journal of Ship Research, Volume. 19, Issue 1, 256(2024)

Unmanned surface vehicle escape strategy based on hybrid sampling deep Q-network

Yuanpeng YANG1,2, Lifei SONG2, Jiaqi MAO2, Yi LI2, and Houjing CHEN2,3
Author Affiliations
  • 1Systems Engineering Research Institute, CSSC, Beijing 100094, China
  • 2Key Laboratory of High Performance Ship Technology of Ministry of Education, Wuhan University of Technology, Wuhan 430063, China
  • 3China Ship Development and Design Center, Wuhan 430064, China
  • show less

    Objective

    Aiming at the encirclement tactics adopted by enemy ships, this study focuses on the problem of planning an escape strategy when an unmanned surface vehicle (USV) is surrounded by enemy ships.

    Methods

    A hybrid sampling deep Q-network (HS-DQN) reinforcement learning algorithm is proposed which gradually increases the playback frequency of important samples and retains a certain level of exploration to prevent it from falling into local optimization. The state space, action space and reward function are designed to obtain the USV's optimal escape strategy, and its performance is compared with that of the deep Q-network (DQN) algorithm in terms of reward and escape success rate.

    Results

    The simulation results show that using the HS-DQN algorithm for training increases the escape success rate by 2% and the convergence speed by 20%.

    Conclusions

    The HS-DQN algorithm can reduce the number of useless explorations and speed up the convergence of the algorithm. The simulation results verify the effectiveness of the USV escape strategy.

    Keywords
    Tools

    Get Citation

    Copy Citation Text

    Yuanpeng YANG, Lifei SONG, Jiaqi MAO, Yi LI, Houjing CHEN. Unmanned surface vehicle escape strategy based on hybrid sampling deep Q-network[J]. Chinese Journal of Ship Research, 2024, 19(1): 256

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Sep. 27, 2022

    Accepted: --

    Published Online: Mar. 18, 2025

    The Author Email:

    DOI:10.19693/j.issn.1673-3185.03105

    Topics