Fast multipole methods on graphics processors
Title | Fast multipole methods on graphics processors |
Publication Type | Journal Articles |
Year of Publication | 2008 |
Authors | Gumerov NA, Duraiswami R |
Journal | Journal of Computational Physics |
Volume | 227 |
Issue | 18 |
Pagination | 8290 - 8313 |
Date Published | 2008/09/10/ |
ISBN Number | 0021-9991 |
Abstract | The fast multipole method allows the rapid approximate evaluation of sums of radial basis functions. For a specified accuracy, ϵ , the method scales as O ( N ) in both time and memory compared to the direct method with complexity O ( N 2 ) , which allows the solution of larger problems with given resources. Graphical processing units (GPU) are now increasingly viewed as data parallel compute coprocessors that can provide significant computational performance at low price. We describe acceleration of the FMM using the data parallel GPU architecture.The FMM has a complex hierarchical (adaptive) structure, which is not easily implemented on data-parallel processors. We described strategies for parallelization of all components of the FMM, develop a model to explain the performance of the algorithm on the GPU architecture; and determined optimal settings for the FMM on the GPU. These optimal settings are different from those on usual CPUs. Some innovations in the FMM algorithm, including the use of modified stencils, real polynomial basis functions for the Laplace kernel, and decompositions of the translation operators, are also described. |
URL | http://www.sciencedirect.com/science/article/pii/S0021999108002921 |
DOI | 10.1016/j.jcp.2008.05.023 |