Author of the publication

JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix Multiplication.

, , and . CGO, page 448-459. IEEE, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix Multiplication., , and . CGO, page 448-459. IEEE, (2024)Compiler Optimization for Irregular Memory Access Patterns in PGAS Programs., , and . LCPC, volume 13829 of Lecture Notes in Computer Science, page 3-21. Springer, (2022)Performance Evaluation of Parallel Sparse Tensor Decomposition Implementations., , and . IA3@SC, page 54-57. IEEE Computer Society, (2016)Performance challenges for heterogeneous distributed tensor decompositions., , and . HPEC, page 1-7. IEEE, (2017)Performance Strategies for Parallel Bitonic Sort on a Migratory Thread Architecture., , and . HPEC, page 1-7. IEEE, (2020)Compiler Optimizations for Irregular Memory Access Patterns in the PGAS Programming Model.. University of Maryland, College Park, MD, USA, (2023)base-search.net (ftunivmaryland:oai:drum.lib.umd.edu:1903/30764).JITSPMM: Just-in-Time Instruction Generation for Accelerated Sparse Matrix-Matrix Multiplication., , and . CoRR, (2023)Impact of Traditional Sparse Optimizations on a Migratory Thread Architecture., and . IA3@SC, page 45-52. IEEE, (2018)Exploring Parallel Bitonic Sort on a Migratory Thread Architecture., , , and . HPEC, page 1-7. IEEE, (2018)Performance considerations for scalable parallel tensor decomposition., , and . J. Parallel Distributed Comput., (2019)