An experimental study on performance portability of OpenCL kernels

Abstract

Accelerator processors allow energy-efficient computation at high performance, especially for computationintensive applications. There exists a plethora of different accelerator architectures, such as GPUs and the Cell Broadband Engine. Each accelerator has its own programming language, but the recently introduced OpenCL language unifies accelerator programming languages. Hereby, OpenCL achieves functional protability, allowing to reduce the development time of kernels. Functional portability however has limited value without performance portability: the possibility to re-use optimized kernels with good performance. This paper investigates the specificity of code optimizations to accelerator architecture and the severity of lack of performance portability.

BibTeX key: Rul2010-si
entry type: inproceedings
booktitle: 2010 Symposium on Application Accelerators in High Performance Computing (SAAHPC '10)
year: 2010
publisher: biblio.ugent.be

BibSonomy

An experimental study on performance portability of OpenCL kernels

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on