Optics and Precision Engineering, Volume. 33, Issue 6, 979(2025)
Autonomous decision-making for spacecraft close approaches in the Earth-Moon environment
Aiming at the autonomous decision-making problem of spacecraft approaching closely in the Earth-Moon environment, a decision-making method based on an improved Proximal Policy Optimization (PPO) algorithm was proposed to enable the tracking spacecraft to reach the state required for docking with the target spacecraft within a specified time. First, an LSTM network was introduced into the strategic network structure of the PPO algorithm to handle state inputs and increase the robustness of the algorithm in learning tasks with random parameters. Secondly, a state-based internal reward exploration mechanism was proposed to improve the algorithm's exploration ability by linearly superimposing it with the algorithm's basic reward. In addition, an importance sampling ratio constraint was designed and introduced into the strategy loss function to prevent high variance objective estimation from endangering the optimization of the objective function. Finally, the effectiveness of the proposed method was verified by comparing the learning reward and task execution results with other learning algorithms. The simulation results show that the learning reward value of the improved PPO algorithm is increased by 15%, the fuel consumption of performing close tasks is reduced by 57%, and the mission success rate is increased by 1% when there is unmodelled interference. This method can significantly improve the spacecraft's autonomous decision-making capabilities when performing close missions.
Get Citation
Copy Citation Text
Cheng HUANG, Zhicong QIU, Jiazhong XU. Autonomous decision-making for spacecraft close approaches in the Earth-Moon environment[J]. Optics and Precision Engineering, 2025, 33(6): 979
Category:
Received: Aug. 14, 2024
Accepted: --
Published Online: Jun. 16, 2025
The Author Email: Cheng HUANG (huangchengsunxi@163.com)