Automatic OpenCL Device Characterization: Guiding Optimized Kernel Design

P. Thoman, K. Kofler, H. Studt, J. Thomson, and T. Fahringer. Euro-Par 2011 Parallel Processing, Springer Berlin Heidelberg (2011)

Abstract

The OpenCL standard allows targeting a large variety of CPU, GPU and accelerator architectures using a single unified programming interface and language. While the standard guarantees portability of functionality for complying applications and platforms, performance portability on such a diverse set of hardware is limited. Devices may vary significantly in memory architecture as well as type, number and complexity of computational units. To characterize and compare the OpenCL performance of existing and future devices we propose a suite of microbenchmarks, uCLbench. We present measurements for eight hardware architectures (four GPUs, three CPUs and one accelerator) and illustrate how the results accurately reflect unique characteristics of the respective platform. In addition to measuring quantities traditionally benchmarked on CPUs like arithmetic throughput or the bandwidth and latency of various address spaces, the suite also includes code designed to determine parameters unique to OpenCL like the dynamic branching penalties prevalent on GPUs. We demonstrate how our results can be used to guide algorithm design and optimization for any given platform on an example kernel that represents the key computation of a linear multigrid solver. Guided manual optimization of this kernel results in an average improvement of 61% across the eight platforms tested.

Links and resources

BibTeX key: Thoman2011-zy
Entry type: incollection
Book title: Euro-Par 2011 Parallel Processing
Year: 2011
Pages: 438-452
Publisher: Springer Berlin Heidelberg
Series: Lecture Notes in Computer Science

Cite this publication

%0 Book Section
%1 Thoman2011-zy
%A Thoman, Peter
%A Kofler, Klaus
%A Studt, Heiko
%A Thomson, John
%A Fahringer, Thomas
%B Euro-Par 2011 Parallel Processing
%D 2011
%I Springer Berlin Heidelberg
%K Expose OpenCL
%P 438-452
%T Automatic OpenCL Device Characterization: Guiding Optimized Kernel Design
