FIG. 8.
Performance comparison of cubewalkers, cana, and booleannet on a high-performance computer. Cell Collective models were run using each tool using synchronous update. Timings were generated using a workstation with two AMD EPYC 7542 CPUs (32 cores and 64 threads each) at and two 10 752 CUDA-core NVIDIA A6000 GPUs with 48GB of GDDR6 memory (only one GPU was used for the benchmarks). For the cubewalkers and cana tests, 2500 time steps and 2500 walkers (initial conditions) were used; for booleannet, 100 time steps and 100 initial conditions were used. For cana and booleannet, initial conditions were simulated in 128 parallel threads. On specialized hardware taking full advantage of parallelism, we see that the performance gap between cubewalkers and the other methods is narrowed compared to the performance gap on consumer hardware. Nevertheless, the gap remains considerable.