Twinned Buffering: A simple and Highly Effective Scheme for Parallelization of Successive Over-Relaxation on GPUs and Other Accelerators

Vanderbauwhede, W. and Takemi, T. (2015) Twinned Buffering: A simple and Highly Effective Scheme for Parallelization of Successive Over-Relaxation on GPUs and Other Accelerators. In: International Conference on High Performance Computing & Simulation (HPCS), Amsterdam, The Netherlands, 20-24 Jul 2015, pp. 436-443. ISBN 9781467378123 (doi: 10.1109/HPCSim.2015.7237073)

Full text not currently available from Enlighten.

Abstract

In this paper we present a new scheme for parallelization of the Successive Over-Relaxation method for solving the Poisson equation over a 3-D volume. Our new scheme is both simple and effective, outperforming the conventional Red-Black scheme by a factor of 16 on an NVIDIA GeForce GTX 590 GPU, a factor of 11 on an NVIDIA GeForce TITAN Black GPU and a factor of 5 on an Intel Xeon Phi. The speed-up compared to the fully optimised reference implementation running on an Intel Xeon CPU is 16 times on the GTX 590, 22 times on the TITAN and 5 times on the Xeon Phi. We explain the rationale and the implementation in OpenCL and present the performance evaluation results.

Item Type:Conference Proceedings
Status:Published
Refereed:Yes
Glasgow Author(s) Enlighten ID:Vanderbauwhede, Professor Wim
Authors: Vanderbauwhede, W., and Takemi, T.
College/School:College of Science and Engineering > School of Computing Science
ISBN:9781467378123
Related URLs:

University Staff: Request a correction | Enlighten Editors: Update this record