Skip to main content
. 2024 Jun 24;24(13):4104. doi: 10.3390/s24134104
Algorithm 1. Dynamic variance sampling algorithm flow.
for t=1,2,,T do
        for each node i=1,,N, sample θi(t) independently from the Nμ^it,1kit+1 distribution
        select node: i(t)=argmaxiθi(t)
        observe reward: ri(t)
        update selected times: ki(t)t+1=ki(t)t+1
        update mean benefit: μit(t+1)=μitt×ki(t)t+ri(t)ki(t)+1
end for