Vanderbauwhede, W. and Takemi, T. (2015) Twinned Buffering: A simple and Highly Effective Scheme for Parallelization of Successive Over-Relaxation on GPUs and Other Accelerators. In: International Conference on High Performance Computing & Simulation (HPCS), Amsterdam, The Netherlands, 20-24 Jul 2015, pp. 436-443. ISBN 9781467378123 (doi: 10.1109/HPCSim.2015.7237073)
Full text not currently available from Enlighten.
Abstract
In this paper we present a new scheme for parallelization of the Successive Over-Relaxation method for solving the Poisson equation over a 3-D volume. Our new scheme is both simple and effective, outperforming the conventional Red-Black scheme by a factor of 16 on an NVIDIA GeForce GTX 590 GPU, a factor of 11 on an NVIDIA GeForce TITAN Black GPU and a factor of 5 on an Intel Xeon Phi. The speed-up compared to the fully optimised reference implementation running on an Intel Xeon CPU is 16 times on the GTX 590, 22 times on the TITAN and 5 times on the Xeon Phi. We explain the rationale and the implementation in OpenCL and present the performance evaluation results.
Item Type: | Conference Proceedings |
---|---|
Status: | Published |
Refereed: | Yes |
Glasgow Author(s) Enlighten ID: | Vanderbauwhede, Professor Wim |
Authors: | Vanderbauwhede, W., and Takemi, T. |
College/School: | College of Science and Engineering > School of Computing Science |
ISBN: | 9781467378123 |
Related URLs: |
University Staff: Request a correction | Enlighten Editors: Update this record