A Shortest Distance Priority UAV Path Planning Algorithm for Precision Agriculture

2024 Nov 25;24(23):7514. doi: 10.3390/s24237514

Algorithm 1 Our proposed Q-learning algorithm for the UAV path planning problem.

Input: Source location, destination location, and solution space
Output: Optimal path for UAV from source to destination
1: Initialize $Q(\gamma, \lambda) \leftarrow 0$ for all states $\gamma \in \Gamma$ and actions $\lambda \in \Lambda$;
2: for each episode do
3:     Set $\gamma_t \leftarrow$ a random state from the state set $\Gamma$;
4:     while $\gamma_t \neq \text{target}$ do
5:         for each $\lambda_t^i \in \Lambda$, where $i \in [\text{up}, \text{down}, \text{left}, \text{right}]$, do
6:             Determine the location $\mathrm{loc}_{\lambda_t^i}$ of the agent after taking action $\lambda_t^i$
7:             Calculate the distance $\mathrm{dist}_t^i \in \mathrm{dist}_t$ from $\mathrm{loc}_{\lambda_t^i}$ to the target location
8:             Choose the $\mathrm{loc}_{\lambda_t^i}$ that corresponds to the smallest $\mathrm{dist}_t^i$ in $\mathrm{dist}_t$
9:             Choose the $\lambda_t^i$ that corresponds to this $\mathrm{loc}_{\lambda_t^i}$, which moves the agent closer to the target location
10:         end
11:         Perform action $\lambda_t^i$ and receive a penalty or reward
12:         Update $Q(\gamma_t, \lambda_t)$
13:     end
14: end
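
The steps of Algorithm 1 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the solution space is assumed to be an obstacle-free rectangular grid, the distance is assumed to be Euclidean, a move off the grid is assumed to leave the agent in place, and the reward values (+100 at the target, -1 otherwise) are placeholders. Because step 12 does not spell out the update rule, the standard Q-learning update is assumed. The names `step`, `closest_action`, `train`, `greedy_path`, `alpha`, and `discount` are hypothetical.

```python
import math
import random

# Action set (Lambda): up, down, left, right as (row, column) offsets.
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def step(state, action, rows, cols):
    """Step 6: location reached by `action`; the agent stays put at the grid edge (assumed)."""
    dr, dc = ACTIONS[action]
    r, c = state[0] + dr, state[1] + dc
    if 0 <= r < rows and 0 <= c < cols:
        return (r, c)
    return state


def closest_action(state, target, rows, cols):
    """Steps 5-10: pick the action whose resulting location is nearest to the target."""
    return min(ACTIONS, key=lambda a: math.dist(step(state, a, rows, cols), target))


def train(rows, cols, target, episodes=200, alpha=0.5, discount=0.9, seed=0):
    """Run the episode loop of Algorithm 1 and return the Q-table as a dict."""
    rng = random.Random(seed)
    states = [(r, c) for r in range(rows) for c in range(cols)]
    # Step 1: Q(state, action) <- 0 for every state and action.
    q = {(s, a): 0.0 for s in states for a in ACTIONS}
    for _ in range(episodes):                       # Step 2
        state = rng.choice(states)                  # Step 3: random start state
        while state != target:                      # Step 4
            action = closest_action(state, target, rows, cols)
            nxt = step(state, action, rows, cols)   # Step 11: perform the action
            reward = 100.0 if nxt == target else -1.0  # assumed reward scheme
            # Step 12: standard Q-learning update (assumed, not given in the listing).
            best_next = max(q[(nxt, a)] for a in ACTIONS)
            q[(state, action)] += alpha * (reward + discount * best_next - q[(state, action)])
            state = nxt
    return q


def greedy_path(source, target, rows, cols):
    """Follow the shortest-distance-priority rule from source to target."""
    path = [source]
    while path[-1] != target:
        action = closest_action(path[-1], target, rows, cols)
        path.append(step(path[-1], action, rows, cols))
    return path


if __name__ == "__main__":
    q_table = train(rows=4, cols=5, target=(2, 3))
    print(greedy_path((0, 0), (2, 3), rows=4, cols=5))
```

On an obstacle-free grid, every non-target cell has at least one in-grid move that strictly reduces the distance to the target, so the inner loop in this sketch always terminates. Handling obstacles would need an additional rule that the listing above does not describe.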