Cooperative Search Method for Multiple UAVs Based on Deep Reinforcement Learning

. 2022 Sep 6;22(18):6737. doi: 10.3390/s22186737

Parameter	Definition
s	state of UAV
a	action of UAV
R	reward function
d	euclidean distance
ρ	switch for R₃
k	discount factor
π	strategy of the agent
p	action choice probability
y_i	winning score list
z_i	winning agent list
V	state value function
Q	action value function
P	state transition matrix
e	revenue function of the searched area
z	time
ε	search capability of UAV
α	learning rate
b_i	bundle of agent
c_ij[b_i]	score function
L_t	maximum assigned task number