|
Algorithm 2 DL training procedure. |
|
Input: Data set , training data number , training batch size B, total training epoch number and learning rate . |
|
Output: A neural network with . |
| 1: initialization: Set the first training epoch number and a neural network Q with random weight . |
| 2: Randomly select data from as and the rest to testing data set . |
| 3: while
do
|
| 4: Randomly select B data from , and compute the sum of the cross entropy loss. |
| 5: Compute the gradients of . |
| 6: Perform Adaptive Moment Estimation (Adam) gradient descent training to update using the obtained gradients. |
| 7: Calculate the accuracy of neural networks replicating human behavior in training data set . |
| 8: Perform
|
| 9: end while
|
| 10: Calculate the accuracy of neural networks replicating human behavior in training data set . |
| 11: return the trained neural network with . |