Author of the publication

CRSD: Application Specific Auto-tuning of SpMV for Diagonal Sparse Matrices.

, , , , , and . Euro-Par (2), volume 6853 of Lecture Notes in Computer Science, page 316-327. Springer, (2011)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

An Insightful Program Performance Tuning Chain for GPU Computing., , , and . ICA3PP (1), volume 7439 of Lecture Notes in Computer Science, page 502-516. Springer, (2012)MPFFT: An Auto-Tuning FFT Library for OpenCL GPUs., , , , and . J. Comput. Sci. Technol., 28 (1): 90-105 (2013)GPURoofline: A Model for Guiding Performance Optimizations on GPUs., , , , , and . Euro-Par, volume 7484 of Lecture Notes in Computer Science, page 920-932. Springer, (2012)Highly Optimized Code Generation for Stencil Codes with Computation Reuse for GPUs., , and . J. Comput. Sci. Technol., 31 (6): 1262-1274 (2016)Efficient Pipeline Planning for Expedited Distributed DNN Training., , , , , , and . INFOCOM, page 340-349. IEEE, (2022)Optimizing distributed training deployment in heterogeneous GPU clusters., , , , , , , , and . CoNEXT, page 93-107. ACM, (2020)Memristors for neural branch prediction: a case study in strict latency and write endurance challenges., , , , , , , and . Conf. Computing Frontiers, page 26:1-26:10. ACM, (2013)Learning beyond Predefined Label Space via Bayesian Nonparametric Topic Modelling., , , , and . CoRR, (2019)CLSIFT: An Optimization Study of the Scale Invariance Feature Transform on GPUs., , , , and . HPCC/EUC, page 93-100. IEEE, (2013)Minimal Multi-threading: Finding and Removing Redundant Instructions in Multi-threaded Processors., , , , , , and . MICRO, page 337-348. IEEE Computer Society, (2010)