Skip to main content
. Author manuscript; available in PMC: 2023 Dec 20.
Published in final edited form as: Proc IEEE Int Conf Clust Comput. 2022 Oct 18;2022:230–242. doi: 10.1109/cluster51413.2022.00036

Fig. 2.

Fig. 2.

Diagrammatic depiction of the overall algorithm for one bulk time step. Operations have been batched into a single category for simplicity. The shown structure reflects optimizations to best overlap compute-intensive simulation time steps on the GPU/window with the CPU/bulk. The only synchronization points between the window and bulk parts of the APR model are the bulk-to-window, window-to-bulk communication routines, and checking/re-initialization if the window moves.