eLife. 2016 Nov 15;5:e18722. doi: 10.7554/eLife.18722

© 2016, Kimanius et al

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Appendix 1—figure 1. The kernel is launched with $\lceil P / P_{0} \rceil$ thread-blocks and $N$ threads, where $P$ is the total number of projections. The workflow of a thread-block in each iteration $i$ is divided into two stages. In stage A, the $N$ pixels of $P_{0}$ reference slices are fetched through texture memory, interpolated, and stored in shared memory. This data is then reused exhaustively in stage B, where groups of threads compute the differences to the corresponding translated image components. Individual threads within a group work with different image components, $n$, of each reference slice, $p$. Collectively, all threads iterate through the $N$ components of each reference slice, for a total of $N \times P_{0}$ components in each iteration $i$. The final result is reduced back into shared memory through atomic reduction operations. All image components are covered as $i$ goes from 1 to $\lceil C / N \rceil$, where $C$ is the total number of Fourier components. A reduced sum of differences for each pair of orientation and translation is written to global memory before the kernel exits.
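
The tiling described in the caption can be illustrated with a minimal CPU sketch in Python. It is not the authors' kernel, only an emulation of the loop structure under stated assumptions: components are real numbers rather than complex Fourier components, the reference slices are already interpolated (so the texture fetch of stage A becomes a plain copy), the difference is taken to be a squared difference, and the translated images are supplied precomputed. The name `diff_kernel_sketch` and its parameters are hypothetical.

```python
import math


def diff_kernel_sketch(refs, images, P0, N):
    """Emulate the two-stage tiling on the CPU (illustrative only).

    refs:   P reference slices, each a list of C components (assumed pre-interpolated).
    images: T translated copies of one image, each a list of C components.
    Returns diff2[p][t], the summed squared difference for each
    (orientation, translation) pair.
    """
    P, C, T = len(refs), len(refs[0]), len(images)
    n_blocks = math.ceil(P / P0)  # one thread-block per tile of P0 projections
    n_iters = math.ceil(C / N)    # iterations needed to cover all C components
    diff2 = [[0.0] * T for _ in range(P)]  # stands in for global memory

    for block in range(n_blocks):
        p_start = block * P0
        p_count = min(P0, P - p_start)  # the last block may hold a partial tile
        # Per-block accumulator, standing in for the shared-memory reduction.
        acc = [[0.0] * T for _ in range(p_count)]

        for i in range(n_iters):  # 0-based here; the caption counts from 1
            c_start = i * N
            c_count = min(N, C - c_start)

            # Stage A: stage up to N components of each of the P0 slices.
            shared = [refs[p_start + p][c_start:c_start + c_count]
                      for p in range(p_count)]

            # Stage B: reuse the staged tile for every translation.
            for t in range(T):
                for p in range(p_count):
                    for n in range(c_count):
                        d = shared[p][n] - images[t][c_start + n]
                        acc[p][t] += d * d  # an atomic add on the GPU

        # Write the reduced sums out before the "kernel" exits.
        for p in range(p_count):
            diff2[p_start + p] = acc[p]
    return diff2


# Example: P = 3 projections, C = 5 components, T = 2 translations.
refs = [[float(p + c) for c in range(5)] for p in range(3)]
images = [[0.0] * 5, [1.0] * 5]
result = diff_kernel_sketch(refs, images, P0=2, N=4)
```

With `P0=2` and `N=4`, the example runs two thread-blocks (the second holding a single slice) and two iterations per block (the second covering one leftover component), which exercises both ceiling expressions from the caption. The point of the tiling is that each staged reference tile is read once from slow memory and then reused for all `T` translations.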

DOI: http://dx.doi.org/10.7554/eLife.18722.012