Journal of Electronic Science and Technology, Volume. 23, Issue 2, 100303(2025)

Multi-UAV path planning for multiple emergency payloads delivery in natural disaster scenarios

Zarina Kutpanova1, Mustafa Kadhim2, Xu Zheng2, and Nurkhat Zhakiyev3
Author Affiliations
  • 1Department of Computer Engineering, Astana IT University, Astana, 010000, Kazakhstan
  • 2School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 610054, China
  • 3Department of Computational and Data Science, Astana IT University, Astana, 010000, Kazakhstan
  • show less
    Figures & Tables(14)
    Illustration of UAVs’ payloads delivery in natural disaster environment.
    Illustration of RL for UAVs. In this illustration, π(s) indicates the action function π(·) with the current state s as input; action i-j, reward i-j, and state i-j refer to the movement, the received rewards, and the new state for UAV i in j-th round, respectively.
    Scenes of input maps that are in two ways: (a) centered or (b) non-centered, where the location of UAV is shown by the star symbol and the point where the dashed lines connect.
    Illustration of the reward rate estimation calculation when UAV moves to a specific grid (like the green grid). The estimation considers the steps moving to the grid, the distance among the grid, each delivery point (DP, orange grid), the optimal landing area (blue grid), the urgent landing area (gray grid), as well as the remaining delivery points. In the left figure, b(i) indicates the battery level of UAVi and indicates the size of payloads to be delivered at this grid. In the right one, and are the estimated return rates when UAVi flies to each of these grids.
    Architecture of DQN in the proposed model. The convolutional Q-network will encode various information that exists in the region. Here fcenter, flocal, and fglobal indicate the functions deriving central, local, and global features, respectively; π(a|s) indicates the probability of each action a given current state s; π(s) refers to strategy functions of action.
    Example trajectories for the Manhattan32 map: Baseline with (a) 2 and (b) 3 agents; our proposed method with (c) 2 and (d) 3 agents.
    Example trajectories for the Urban50 map: Baseline with (a) 2 and (b) 3 agents; our proposed method with (c) 2 and (d) 3 agents.
    Delivery points coverage situations: (a) Manhattan32 and (b) Urban50.
    Impact of the number of UAVs deployed: (a) Manhattan32 and (b) Urban50.
    Impact of the number of delivery points: (a) Manhattan32 and (b) Urban50.
    Influence of maximum battery capacity: (a) Manhattan32 and (b) Urban50.
    • Table 1. [in Chinese]

      View table
      View in Article

      Table 1. [in Chinese]

      Algorithm 1 Multi-UAV path planning for multiple emergency payloads delivery in natural disaster environments
      1: Input: Grid world $ \mathcal{M} $, delivery points $ K $, utilized UAVs $ \mathcal{I} $, a central server $ {\text {Base}} $, and total time period $ T $
      2: Output: Action series $ {\text {Action}} $ for all UAVs
      3: for each time slot $ t $ in $ T $do
      4: for each UAV $ U_i $ in $ \mathcal{I} $do
      5: $ U_i $ requests $ k $ consecutive actions from $ \text {Base} $
      6: for$ j $ from $ 1 $ to $ k $do
      7: Run the proposed model
      8: $ {\text{Action}}_i \leftarrow \underset{{\boldsymbol a}\in \mathcal{A}}{ {{\mathrm{arg}}\, {\mathrm{max}} }}\,{{Q}_{\theta }} \left(s {\mathrm{,}}\; {\boldsymbol a} \right) $
      9: end for
      10: $ \text {Base} $ returns $ {\text{Action}}_i $ to $ U_i $
      11: end for
      12: All UAVs execute $ k $ actions
      13: $ t = t +k $
      14: end for
    • Table 1. Simulated results for Manhatttan32.

      View table
      View in Article

      Table 1. Simulated results for Manhatttan32.

      Manhattan32BaselineOur proposed method
      RSR (optimal)91.52%51.69%
      RSR (urgent landing)/47.69%
      RSR (total)91.52%99.38%
      Delivery ratio84.32%88.40%
      Delivery ratio and landed77.17%87.86%
    • Table 2. Simulated results for Urban50.

      View table
      View in Article

      Table 2. Simulated results for Urban50.

      Urban50BaselineOur proposed method
      RSR (optimal)89.52%62.42%
      RSR (urgent landing)/34.21%
      RSR (total)89.52%96.63%
      Delivery ratio82.36%87.01%
      Delivery ratio and landed73.72%84.07%
    Tools

    Get Citation

    Copy Citation Text

    Zarina Kutpanova, Mustafa Kadhim, Xu Zheng, Nurkhat Zhakiyev. Multi-UAV path planning for multiple emergency payloads delivery in natural disaster scenarios[J]. Journal of Electronic Science and Technology, 2025, 23(2): 100303

    Download Citation

    EndNote(RIS)BibTexPlain Text
    Save article for my favorites
    Paper Information

    Category:

    Received: Sep. 12, 2024

    Accepted: Feb. 24, 2025

    Published Online: Jun. 16, 2025

    The Author Email:

    DOI:10.1016/j.jnlest.2025.100303

    Topics