Incollection

Expression Templates and OpenCL

Parallel Processing and Applied Mathematics, volume 7204 of Lecture Notes in Computer Science, Springer Berlin Heidelberg (2012)

Abstract

In this paper we discuss the interaction of expression templates with OpenCL devices. We show how the expression tree of expression templates can be used to generate problem-specific OpenCL kernels. In a second approach we use expression templates to optimize the data transfer between the host and the device, which leads to a measurable performance increase in a domain-specific language approach. We tested the functionality, correctness and performance of both implementations in a case study for vector and matrix operations.
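The first approach in the abstract, generating a kernel from the expression tree, can be illustrated with a minimal C++ sketch. It is not the paper's implementation: the names `ExprTag`, `Vec`, `BinExpr`, `make_kernel` and `example_kernel` are hypothetical, only element-wise `+` and `*` on `float` vectors are assumed, and the sketch only builds the OpenCL C source string without calling any OpenCL host API.

```cpp
#include <algorithm>
#include <cassert>
#include <string>
#include <type_traits>
#include <utility>
#include <vector>

// Marker base so the overloaded operators only apply to our expression types.
struct ExprTag {};

// Leaf node: a named vector (hypothetical; no device buffer is allocated here).
struct Vec : ExprTag {
    std::string name;
    explicit Vec(std::string n) : name(std::move(n)) {}
    std::string code() const { return name + "[i]"; }
    void args(std::vector<std::string>& out) const {
        // Record each buffer name once, in order of first appearance.
        if (std::find(out.begin(), out.end(), name) == out.end()) out.push_back(name);
    }
};

// Inner node: a binary operation whose operand types encode the tree shape.
template <class L, class R>
struct BinExpr : ExprTag {
    L lhs;
    R rhs;
    char op;
    BinExpr(L l, R r, char o) : lhs(std::move(l)), rhs(std::move(r)), op(o) {}
    std::string code() const {
        return "(" + lhs.code() + " " + op + " " + rhs.code() + ")";
    }
    void args(std::vector<std::string>& out) const {
        lhs.args(out);
        rhs.args(out);
    }
};

template <class T>
using EnableIfExpr = typename std::enable_if<std::is_base_of<ExprTag, T>::value>::type;

// Operators build the tree instead of computing a result.
template <class L, class R, class = EnableIfExpr<L>, class = EnableIfExpr<R>>
BinExpr<L, R> operator+(const L& l, const R& r) { return BinExpr<L, R>(l, r, '+'); }

template <class L, class R, class = EnableIfExpr<L>, class = EnableIfExpr<R>>
BinExpr<L, R> operator*(const L& l, const R& r) { return BinExpr<L, R>(l, r, '*'); }

// Walk the tree once and emit OpenCL C source for one fused element-wise kernel.
template <class E>
std::string make_kernel(const std::string& kernel_name, const Vec& result, const E& expr) {
    assert(!result.name.empty());
    std::vector<std::string> names;
    expr.args(names);
    std::string src = "__kernel void " + kernel_name + "(__global float* " + result.name;
    for (const auto& n : names)
        if (n != result.name) src += ", __global const float* " + n;
    src += ", const unsigned int n) {\n";
    src += "    size_t i = get_global_id(0);\n";
    src += "    if (i < n) " + result.name + "[i] = " + expr.code() + ";\n";
    src += "}\n";
    return src;
}

// Usage: kernel source for r = a * b + c.
std::string example_kernel() {
    Vec a("a"), b("b"), c("c"), r("r");
    return make_kernel("fused", r, a * b + c);
}
```

In this sketch the whole right-hand side becomes a single kernel, so no temporary vector is needed for `a * b`. In a real program the returned string would be passed to the OpenCL runtime for compilation; that step is omitted here.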
