0
|
Requie: a Stepsize
|
1
|
Exponential decay rates for the moment estimates
|
2
|
: Initial parameter vector
|
3
|
← 0 (Initial 1st moment vector)
|
4
|
← 0 (Initial 2nd moment vector)
|
5
|
t ←0 : (Initial time step)
|
6
|
not converged do
|
7
|
t ← t+1
|
8
|
Feature_vector ← BackboneNetwork(x) (Extract feature vector using CNN )
|
9
|
DropoutFeatureVector ← Dropout(Feature_vector) (Dropout layer)
|
10
|
← fullyConnectedNetwork1(DropoutFeatureVector) (Predict Coarse Classification)
|
11
|
← fullyConnectedNetwork2(DropoutFeatureVector) (Predict Fine Classificatoin)
|
12
|
(Compute Loss)
|
13
|
←
(Get gradients w.r.t. stochastic objective at timestep t)
|
14
|
←
(Update biased first moment estimate)
|
15
|
←
(Update biased second raw moment estimate)
|
16
|
←
(Compute bias-correced first moment estimate)
|
17
|
←(Update parameters)
|
18
|
end while
|
19
|
(Resulting parameters)
|