Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Performance-Portable Autotuning of OpenCL Kernels for Convolutional Layers of Deep Neural Networks.

Y. Tsai, P. Luszczek, J. Kurzak, and J. Dongarra. MLHPC@SC, page 9-18. IEEE Computer Society, (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Schi-chien Tsai

Yuan-feen Tsai

Wei-Ding Tsai

Ming-Cheng Tsai

Other publications of authors with the same name

Mixed-Precision Algorithm for Finding Selected Eigenvalues and Eigenvectors of Symmetric and Hermitian Matrices1.Y. Tsai, P. Luszczek, and J. Dongarra. ScalAH@SC, page 43-50. IEEE, (2022)Adaptive block size for dense QR factorization in hybrid CPU-GPU systems via statistical modeling.R. Chen, Y. Tsai, and W. Wang. Parallel Comput., 40 (5-6): 70-85 (2014)Autotuning Numerical Dense Linear Algebra for Batched Computation With GPU Hardware Accelerators.J. Dongarra, M. Gates, J. Kurzak, P. Luszczek, and Y. Tsai. Proc. IEEE, 106 (11): 2040-2055 (2018)A survey of numerical linear algebra methods utilizing mixed-precision arithmetic.A. Abdelfattah, H. Anzt, E. Boman, E. Carson, T. Cojean, J. Dongarra, A. Fox, M. Gates, N. Higham, X. Li and 11 other author(s). Int. J. High Perform. Comput. Appl., (2021)A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic.A. Abdelfattah, H. Anzt, E. Boman, E. Carson, T. Cojean, J. Dongarra, M. Gates, T. Grützmacher, N. Higham, X. Li and 15 other author(s). CoRR, (2020)Massively Parallel Automated Software Tuning.J. Kurzak, Y. Tsai, M. Gates, A. Abdelfattah, and J. Dongarra. ICPP, page 92:1-92:10. ACM, (2019)Scalable Data Generation for Evaluating Mixed-Precision Solvers.P. Luszczek, Y. Tsai, N. Lindquist, H. Anzt, and J. Dongarra. HPEC, page 1-6. IEEE, (2020)Tuning Block Size for QR Factorization on CPU-GPU Hybrid Systems.Y. Tsai, W. Wang, and R. Chen. MCSoC, page 205-211. IEEE Computer Society, (2012)Performance-Portable Autotuning of OpenCL Kernels for Convolutional Layers of Deep Neural Networks.Y. Tsai, P. Luszczek, J. Kurzak, and J. Dongarra. MLHPC@SC, page 9-18. IEEE Computer Society, (2016)

BibSonomy

Disambiguation of "Tsai, Yaohung M."

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Performance-Portable Autotuning of OpenCL Kernels for Convolutional Layers of Deep Neural Networks.

Please choose a person to relate this publication to

Schi-chien Tsai

Yuan-feen Tsai

Wei-Ding Tsai

Wei-Ding Tsai

Ming-Cheng Tsai

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Tsai, Yaohung M."

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Performance-Portable Autotuning of OpenCL Kernels for Convolutional Layers of Deep Neural Networks.

Please choose a person to relate this publication to

Schi-chien Tsai

Yuan-feen Tsai

Wei-Ding Tsai

Wei-Ding Tsai

Ming-Cheng Tsai

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Performance-Portable Autotuning of OpenCL Kernels for Convolutional Layers of Deep Neural Networks.