Author of the publication

Optimizing Memory-Bound SYMV Kernel on GPU Hardware Accelerators.

, , , and . VECPAR, volume 7851 of Lecture Notes in Computer Science, page 72-79. Springer, (2012)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Optimizing Memory-Bound SYMV Kernel on GPU Hardware Accelerators., , , and . VECPAR, volume 7851 of Lecture Notes in Computer Science, page 72-79. Springer, (2012)Parallelization of an Object-Oriented Unstructured Aeroacoustics Solver., , , and . PPSC, SIAM, (1999)Parallel Algorithms for PDE-Constrained Optimization., , , , , and . Parallel Processing for Scientific Computing, volume 20 of Software, Environments, Tools, SIAM, (2006)A Scalable Community Detection Algorithm for Large Graphs Using Stochastic Block Models., , , , and . IJCAI, page 2090-2096. AAAI Press, (2015)A scalable community detection algorithm for large graphs using stochastic block models., , , , and . Intell. Data Anal., (2018)Redesigning Triangular Dense Matrix Computations on GPUs., , and . Euro-Par, volume 9833 of Lecture Notes in Computer Science, page 477-489. Springer, (2016)Exploiting Data Sparsity for Large-Scale Matrix Computations., , , , , and . Euro-Par, volume 11014 of Lecture Notes in Computer Science, page 721-734. Springer, (2018)Unstructured computational aerodynamics on many integrated core architecture., , and . Parallel Comput., (2016)Multidimensional Intratile Parallelization for Memory-Starved Stencil Computations., , , and . ACM Trans. Parallel Comput., 4 (3): 12:1-12:32 (2018)A Quasi-algebraic Multigrid Approach to Fracture Problems Based on Extended Finite Elements., , , , and . SIAM J. Sci. Comput., (2012)