| Algorithm 2: Actor Network Forward Pass for Joint Action Generation |
| … |
| … |
| 1: Phase A: Select Offloading Target via Attention |
| 2: … |
| 3: … |
| 4: for … do |
| 5: …) |
| 6: end for |
| 7: Mask scores for invalid/unavailable targets |
| 8: … |
| 9: …) |
| 10: …] |
| 11: Phase B: Determine Continuous Ratios |
| 12: if … = 0 then |
| 13: … ← 1 |
| 14: else |
| 15: … + 1 |
| 16: … |
| 17: … + 1 |
| 18: … |
| 19: end if |
| 20: Calculate total log probability log π from the distributions |
| 21: … |
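
The two-phase structure of Algorithm 2 can be illustrated with a minimal Python sketch. It is not the authors' implementation: the expressions in most steps are not recoverable from the listing, so several details are assumptions made only for illustration. Specifically, the sketch assumes scaled dot-product attention scores, a categorical distribution over masked scores, that target index 0 is the branch where the ratio is fixed to 1, and that the two `+ 1` steps produce the parameters of a Beta distribution via a softplus. It also models a single continuous ratio, and the names `actor_forward`, `masked_softmax`, `ratio_logits`, and `beta_log_pdf` are hypothetical.

```python
import math
import random

NEG_INF = float("-inf")


def masked_softmax(scores, valid):
    """Softmax over scores; invalid entries receive probability 0."""
    masked = [s if ok else NEG_INF for s, ok in zip(scores, valid)]
    top = max(masked)
    if top == NEG_INF:
        raise ValueError("at least one target must be valid")
    exps = [math.exp(s - top) for s in masked]  # exp(-inf) evaluates to 0.0
    total = sum(exps)
    return [e / total for e in exps]


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return math.log1p(math.exp(-abs(x))) + max(x, 0.0)


def beta_log_pdf(x, a, b):
    """Log density of Beta(a, b) at x, for 0 < x < 1."""
    log_norm = math.lgamma(a) + math.lgamma(b) - math.lgamma(a + b)
    return (a - 1) * math.log(x) + (b - 1) * math.log(1 - x) - log_norm


def actor_forward(query, keys, valid, ratio_logits, rng):
    # Phase A: score every candidate target, mask invalid ones, sample one.
    # Assumption: scaled dot-product attention between a query and per-target keys.
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    probs = masked_softmax(scores, valid)
    target = rng.choices(range(len(probs)), weights=probs, k=1)[0]
    log_pi = math.log(probs[target])

    # Phase B: continuous ratio, conditioned on the discrete choice.
    if target == 0:
        # Assumption: index 0 is the case where the ratio is fixed to 1,
        # so it contributes nothing to the log probability.
        ratio = 1.0
    else:
        # Assumption: softplus(.) + 1 keeps both Beta parameters above 1.
        a = softplus(ratio_logits[0]) + 1.0
        b = softplus(ratio_logits[1]) + 1.0
        ratio = rng.betavariate(a, b)
        ratio = min(max(ratio, 1e-6), 1.0 - 1e-6)  # keep log() finite
        log_pi += beta_log_pdf(ratio, a, b)

    # Total log probability is the sum of the discrete and continuous parts.
    return target, ratio, log_pi


rng = random.Random(0)
keys = [[0.1, 0.2], [0.4, -0.3], [0.0, 0.5]]
target, ratio, log_pi = actor_forward([1.0, 0.5], keys, [True, True, False], [0.3, -0.2], rng)
```

Under these assumptions, summing the categorical and Beta log densities gives the joint log probability of the hybrid action, which is the quantity a policy-gradient update would consume. A masked target can never be sampled because its probability is exactly zero.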