Author of the publication

Performance engineering for real and complex tall & skinny matrix multiplication kernels on GPUs.

Int. J. High Perform. Comput. Appl., (2021)

Other publications of authors with the same name

K-way p-spectral clustering on Grassmann manifolds. CoRR, (2020)

LIKWID: Lightweight Performance Tools. CHPC, page 165-175. Springer, (2010)

A Recursive Algebraic Coloring Technique for Hardware-Efficient Symmetric Sparse Matrix-Vector Multiplication. CoRR, (2019)

Delay Propagation and Overlapping Mechanisms on Clusters: A Case Study of Idle Periods based on Workload, Communication, and Delay Granularity. CoRR, (2019)

Exact Numerical Treatment of Finite Quantum Systems Using Leading-Edge Supercomputers. HPSC, page 165-177. Springer, (2003)

Validation of hardware events for successful performance pattern identification in High Performance Computing. CoRR, (2017)

Asynchronous MPI for the Masses. CoRR, (2013)

Analytic Performance Modeling and Analysis of Detailed Neuron Simulations. CoRR, (2019)

Algebraic Temporal Blocking for Sparse Iterative Solvers on Multi-Core CPUs. CoRR, (2023)

ECM modeling and performance tuning of SpMV and Lattice QCD on A64FX. CoRR, (2021)