Algorithm 1. Generating the training data set |
Input: domain , |
set of possible actions, |
probability of true alarms (Equation (3)), |
rate of false alarms and their probability (Equation (4)), |
sensor sensitivity , |
range of possible numbers of targets, |
length of the agent’s trajectory, |
number of agent trajectories, |
initial probability map on the domain . |
Output: data set that is an table of pairs of agent positions and corresponding probability maps . |
1. Create the data table. |
2. For each agent trajectory do: |
3. Choose a number of targets according to a uniform distribution on the interval . |
4. Choose the target locations randomly according to the uniform distribution on the domain . |
5. Choose the initial agent position randomly according to the uniform distribution on the domain . |
6. For do: |
7. Save the pair as the th element of the data table. |
8. Choose an action randomly according to the uniform distribution on the set . |
9. Apply the chosen action and set the next position of the agent. |
10. Calculate the next probability map with Equations (20) and (21). |
11. End for |
12. End for |
13. Return the data table. |