Multi-UAV path planning for multiple emergency payloads delivery in natural disaster scenarios

Zarina Kutpanova; Mustafa Kadhim; Xu Zheng; Nurkhat Zhakiyev

doi:10.1016/j.jnlest.2025.100303

Journal of Electronic Science and Technology, Volume. 23, Issue 2, 100303(2025)

Multi-UAV path planning for multiple emergency payloads delivery in natural disaster scenarios

Zarina Kutpanova¹, Mustafa Kadhim², Xu Zheng², and Nurkhat Zhakiyev³

Author Affiliations

¹Department of Computer Engineering, Astana IT University, Astana, 010000, Kazakhstan

²School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu, 610054, China

³Department of Computational and Data Science, Astana IT University, Astana, 010000, Kazakhstan

show less

Figures & Tables(14)

Fig. 1. Illustration of UAVs’ payloads delivery in natural disaster environment.

Download full size

View in Article

Fig. 2. Illustration of RL for UAVs. In this illustration, π(s) indicates the action function π(·) with the current state s as input; action i-j, reward i-j, and state i-j refer to the movement, the received rewards, and the new state for UAV i in j-th round, respectively.

Download full size

View in Article

Fig. 3. Scenes of input maps that are in two ways: (a) centered or (b) non-centered, where the location of UAV is shown by the star symbol and the point where the dashed lines connect.

Download full size

View in Article

Fig. 4. Illustration of the reward rate estimation calculation when UAV moves to a specific grid (like the green grid). The estimation considers the steps moving to the grid, the distance among the grid, each delivery point (DP, orange grid), the optimal landing area (blue grid), the urgent landing area (gray grid), as well as the remaining delivery points. In the left figure, b(i) indicates the battery level of UAV_i and indicates the size of payloads to be delivered at this grid. In the right one, and are the estimated return rates when UAV_i flies to each of these grids.

Download full size

View in Article

Fig. 5. Architecture of DQN in the proposed model. The convolutional Q-network will encode various information that exists in the region. Here f_center, f_local, and f_global indicate the functions deriving central, local, and global features, respectively; π(a|s) indicates the probability of each action a given current state s; π(s) refers to strategy functions of action.

Download full size

View in Article

Fig. 6. Example trajectories for the Manhattan32 map: Baseline with (a) 2 and (b) 3 agents; our proposed method with (c) 2 and (d) 3 agents.

Download full size

View in Article

Fig. 7. Example trajectories for the Urban50 map: Baseline with (a) 2 and (b) 3 agents; our proposed method with (c) 2 and (d) 3 agents.

Download full size

View in Article

Fig. 8. Delivery points coverage situations: (a) Manhattan32 and (b) Urban50.

Download full size

View in Article

Fig. 9. Impact of the number of UAVs deployed: (a) Manhattan32 and (b) Urban50.

Download full size

View in Article

Fig. 10. Impact of the number of delivery points: (a) Manhattan32 and (b) Urban50.

Download full size

View in Article

Fig. 11. Influence of maximum battery capacity: (a) Manhattan32 and (b) Urban50.

Download full size

View in Article

Table 1. [in Chinese]

View table

View in Article

Table 1. [in Chinese]

Algorithm 1 Multi-UAV path planning for multiple emergency payloads delivery in natural disaster environments

1: Input: Grid world $ \mathcal{M} $, delivery points $ K $, utilized UAVs $ \mathcal{I} $, a central server $ {\text {Base}} $, and total time period $ T $

2: Output: Action series $ {\text {Action}} $ for all UAVs

3: for each time slot $ t $ in $ T $do

4: for each UAV $ U_i $ in $ \mathcal{I} $do

5: $ U_i $ requests $ k $ consecutive actions from $ \text {Base} $

6: for$ j $ from $ 1 $ to $ k $do

7: Run the proposed model

8: $ {\text{Action}}_i \leftarrow \underset{{\boldsymbol a}\in \mathcal{A}}{ {{\mathrm{arg}}\, {\mathrm{max}} }}\,{{Q}_{\theta }} \left(s {\mathrm{,}}\; {\boldsymbol a} \right) $

9: end for

10: $ \text {Base} $ returns $ {\text{Action}}_i $ to $ U_i $

11: end for

12: All UAVs execute $ k $ actions

13: $ t = t +k $

14: end for

Table 1. Simulated results for Manhatttan32.
View table
View in Article
Table 1. Simulated results for Manhatttan32.
Manhattan32 Baseline Our proposed method
RSR (optimal) 91.52% 51.69%
RSR (urgent landing) / 47.69%
RSR (total) 91.52% 99.38%
Delivery ratio 84.32% 88.40%
Delivery ratio and landed 77.17% 87.86%

Table 2. Simulated results for Urban50.
View table
View in Article
Table 2. Simulated results for Urban50.
Urban50 Baseline Our proposed method
RSR (optimal) 89.52% 62.42%
RSR (urgent landing) / 34.21%
RSR (total) 89.52% 96.63%
Delivery ratio 82.36% 87.01%
Delivery ratio and landed 73.72% 84.07%

Tools

Get Citation

Copy Citation Text

Zarina Kutpanova, Mustafa Kadhim, Xu Zheng, Nurkhat Zhakiyev. Multi-UAV path planning for multiple emergency payloads delivery in natural disaster scenarios[J]. Journal of Electronic Science and Technology, 2025, 23(2): 100303

Download Citation

EndNote(RIS)BibTex Plain Text

Set citation alerts for article

Save article for my favorites