Skip to main content
. 2020 Jul 31;27(Pt 5):1297–1306. doi: 10.1107/S1600577520008140

Table 3. GPU processing times on NVIDIA QUADRO P6000 of the proposed online algorithms, as shown using a NVIDIA profiler.

They are compatible with the available 1 ms time slot of the foreseen SSX experiment. For comparison with real-world use cases, we provide numbers for the yet to be published peak_finder algorithm from the GPU pyFAI suite. This algorithm implements the peakfinder8 algorithm from CrystFEL on GPUs.

  Kernel execution time (in nanoseconds for one image of the given size)
Data batch (image + pedestal) JUNGFRAU raw-data pre-processing Pre-processing + ‘simplistic’ Bragg’s peaks count Pre-processing + CSR matrix compression Peak_finder (pyFAI)§
No. of image × size        
 10 × 500K pixels 28 31 147 N/A
 10 × 4M pixels 219 240 469 2167
 10 × 2070 × 2167 (JUNGFRAU) 212 240 492 N/A

Performed on a stack of images.

Performed sequentially.

§

OpenCL implementation, for reference.