Skip to main content
. 2022 Aug 22;24(8):1168. doi: 10.3390/e24081168
Algorithm 1. Generating the training data set
Input: domain C=c1,c2,,cn,
  set A=,,,,,,,, of possible actions,
  probability pTA of true alarms (Equation (3)),
  rate α of false alarms and their probability pFA=αpTA (Equation (4)),
  sensor sensitivity λ,
  range ξ1,ξ2 of possible numbers 0<ξ1<ξ2n1 of targets,
  length L0, of the agent’s trajectory,
  number N0, of agent trajectories,
  initial probability map P0 on the domain C.
Output: data set that is an L×N table of pairs c,P of agent positions c and corresponding probability maps P.
1. Create the L×N data table.
2. For each agent trajectory j=1,,N do:
3. Choose a number ξξ1,ξ2 of targets according to a uniform distribution on the interval ξ1,ξ2.
4. Choose the target locations c1, c2,,cξC randomly according to the uniform distribution on the domain C.
5. Choose the initial agent position c0C randomly according to the uniform distribution on the domain C.
6. For l=0,, L1 do:
7. Save the pair cl,Pl as the jth element of the data table.
8. Choose an action alA randomly according to the uniform distribution on the set A.
9. Apply the chosen action and set the next position cl+1=acl of the agent.
10. Calculate the next probability map Pl+1 with Equations (20) and (21).
11. End for
12. End for
13. Return the data table.