Table 1.
Notations and terminologies
| Notation/tern | Description |
|---|---|
| Number of devices | |
| Number of global epochs | |
|
Number of local iterations in the epoch on the th devices |
|
| Global model in the epoch on sever | |
| The entry of | |
| Model initialized from , updated in the th local iteration, on the th device | |
| the model before is updated, on the th devices | |
| A hyper-parameter | |
| A random number | |
| A small constant | |
| A magnitude coefficient | |
| Dataset on the th device | |
| Data(minibatch) sampled from | |
| Learning rate | |
| Server | The place where the training data are placed |
| Worker | One worker on each device, process that trains the model |