Author manuscript; available in PMC: 2014 Aug 15.

Published in final edited form as: J Comput Chem. 2013 Jun 4;34(22):1949–1960. doi: 10.1002/jcc.23340

Table 3.

A detailed description for implementing duplicated calculations at extended boundary grids

Given N_CPU available processors and N_midpt mid-points, we assume N_midpt ≫ N_CPU.

Step 1
All atoms are sorted according to their z-coordinates either by the regular (sequential) Quicksort method on the master processor, or by a parallel Quicksort method on all processors.
Step 2
Many mid-points may share the same z-coordinate. Count the number of distinct z-coordinates among the mid-points (say, p in total), and save the number of mid-points at each distinct z-coordinate in an array named counts (say, counts = {k₁, k₂, …, k_p}, such that $\sum_{i = 1}^{p} k_{i} = N_{midpt}$).
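The counting in Step 2 can be sketched in a few lines. This is only an illustration under stated assumptions: the helper name `count_z_levels` is hypothetical, and z-coordinates are assumed to be exact grid values (for example integer grid indices) so that equal coordinates compare equal.

```python
from collections import Counter


def count_z_levels(z_coords):
    """Return (levels, counts) for a list of mid-point z-coordinates.

    levels: the distinct z-coordinates in ascending order.
    counts: counts[i] is the number of mid-points on levels[i].
    Hypothetical helper; assumes coordinates compare exactly.
    """
    tally = Counter(z_coords)
    levels = sorted(tally)
    counts = [tally[z] for z in levels]
    return levels, counts


levels, counts = count_z_levels([0, 0, 1, 2, 2, 2, 3])
# The counts always add up to the total number of mid-points.
assert sum(counts) == 7
```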
Step 3
To balance the workload, each processor should handle as nearly equal a number of mid-points as possible. First, the smallest number q₁ satisfying $\sum_{i = 1}^{q_{1}} k_{i} \geq N_{midpt} / N_{CPU}$ is found. Then all mid-points whose z-coordinates belong to the subset $\{k_{1}, k_{2}, \ldots, k_{q_{1}}, k_{q_{1}+1}\}$ are given to the 1st processor for parallel computing (here $k_{i}$ is also used to label the i-th distinct z-coordinate). Notice that all mid-points with z-coordinates equal to $k_{q_{1}+1}$ serve as the “extended right boundary grids”, especially in the case of $k_{q_{1}+1} = k_{q_{1}} + 1$. Results on them are calculated because they may affect the calculations on the remaining points. However, these results are not collected by the master processor for the final assemblage.

Similarly, starting from $k_{q_{1}+1}$, the smallest number q₂ satisfying $\sum_{i = q_{1}+1}^{q_{2}} k_{i} \geq N_{midpt} / N_{CPU}$ is found. Then all mid-points whose z-coordinates belong to the subset $\{k_{q_{1}}, k_{q_{1}+1}, \ldots, k_{q_{2}}, k_{q_{2}+1}\}$ are given to the 2nd processor for parallel computing. All mid-points with z-coordinates equal to $k_{q_{1}}$ or $k_{q_{2}+1}$ serve as the “extended left/right boundary grids”, respectively, and the results obtained on them are not collected.

Repeat the same procedure until every mid-point has been assigned to a processor. Some processors may be left without any mid-points. In that case, these processors are marked “idle” and take no part in the parallel computing of the next step.
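The greedy assignment of Step 3 can be sketched as follows. This is a minimal illustration rather than the authors' implementation: the function name `partition_levels` and the half-open index ranges over the `counts` array are assumptions made for the example, and indices are 0-based rather than 1-based as in the text.

```python
def partition_levels(counts, n_cpu):
    """Greedily split z-levels among processors (hypothetical helper).

    counts[i] is the number of mid-points on the i-th distinct z-level.
    Returns one (owned, extended) pair of index ranges per busy processor;
    `extended` adds one boundary level on each side where one exists.
    """
    n_midpt = sum(counts)
    p = len(counts)
    blocks = []
    start = 0
    while start < p:
        # Every block takes at least one level, so the loop always advances.
        total = counts[start]
        end = start + 1
        # Extend the block until it holds at least N_midpt / N_CPU mid-points
        # (written with integers: total * n_cpu >= n_midpt).
        while end < p and total * n_cpu < n_midpt:
            total += counts[end]
            end += 1
        owned = range(start, end)
        extended = range(max(start - 1, 0), min(end + 1, p))
        blocks.append((owned, extended))
        start = end
    return blocks


counts = [2, 2, 2, 2, 2, 2]          # 12 mid-points on 6 z-levels
blocks = partition_levels(counts, n_cpu=4)
idle = 4 - len(blocks)               # processors left without work
for rank, (owned, extended) in enumerate(blocks):
    print(rank, list(owned), list(extended))
```

With these inputs the per-processor target is 3 mid-points, so three processors each own two z-levels and the fourth is idle. Because every block except possibly the last holds at least N_midpt / N_CPU mid-points, the procedure never needs more than N_CPU processors.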
Step 4
Each non-idle processor independently performs the surface-construction calculations on its own mid-points. The results are then sent back to the master processor for the final assemblage.
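The collection rule, where results on the extended boundary grids are computed but not collected, can be emulated serially as below. This is a sketch under stated assumptions: `assemble` and `toy_surface` are hypothetical names, the loop over blocks stands in for processors that would run concurrently, and the per-level "result" is a placeholder rather than a real surface calculation.

```python
def assemble(blocks, compute_level):
    """Serial stand-in for Step 4 (hypothetical helper).

    blocks: (owned, extended) index ranges, one pair per busy processor.
    compute_level: placeholder for the per-level surface calculation.
    """
    final = {}
    for owned, extended in blocks:
        # Each processor works on its extended range, so boundary levels
        # are computed more than once across processors.
        local = {level: compute_level(level) for level in extended}
        # The master keeps only results from the levels the processor owns.
        for level in owned:
            final[level] = local[level]
    return final


calls = []


def toy_surface(level):
    calls.append(level)          # record every evaluation, duplicates included
    return level * level         # placeholder result, not a real surface


blocks = [(range(0, 2), range(0, 3)),
          (range(2, 4), range(1, 5)),
          (range(4, 6), range(3, 6))]
result = assemble(blocks, toy_surface)
```

Here six levels are owned in total, but the placeholder function is evaluated ten times. The four extra evaluations are the duplicated calculations on the extended boundary grids.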