Skip to main content
. 2022 Oct 17;14:70. doi: 10.1186/s13321-022-00652-1

Fig. 4.

Fig. 4

Overview of DDP training. Each process independently loads and processes a batch of data and synchronizes local gradients with others through a gradient aggregation process which requires global communications