Author of the publication

Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units.

Concurr. Comput. Pract. Exp., (2022)

Other publications of authors with the same name

Compression and load balancing for efficient sparse matrix-vector product on multicore processors and graphics processing units. Concurr. Comput. Pract. Exp., (2022)

Compressed Basis GMRES on High Performance GPUs. CoRR, (2020)

Look-ahead in the two-sided reduction to compact band forms for symmetric eigenvalue problems and the SVD. Numer. Algorithms, 80 (2): 635-660 (2019)

High performance and energy efficient inference for deep learning on multicore ARM processors using general optimization techniques and BLIS. J. Syst. Archit., (2022)

Balanced and Compressed Coordinate Layout for the Sparse Matrix-Vector Product on GPUs. Euro-Par Workshops, volume 12480 of Lecture Notes in Computer Science, page 83-95. Springer, (2020)

Residual Replacement in Mixed-Precision Iterative Refinement for Sparse Linear Systems. ISC Workshops, volume 11203 of Lecture Notes in Computer Science, page 554-561. Springer, (2018)

Performance-energy trade-offs of deep learning convolution algorithms on ARM processors. J. Supercomput., 79 (9): 9819-9836 (June 2023)

BestOf: an online implementation selector for the training and inference of deep neural networks. J. Supercomput., 78 (16): 17543-17558 (2022)

Two-Sided Reduction to Compact Band Forms with Look-Ahead. CoRR, (2017)

Fast Truncated SVD of Sparse and Dense Matrices on Graphics Processors. CoRR, (2024)