Author of the publication

Analysis and tuning of libtensor framework on multicore architectures.

, , , and . HiPC, page 1-10. IEEE Computer Society, (2014)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

CSPACER: A Reduced API Set Runtime for the Space Consistency Model.. HPC Asia, page 58-68. ACM, (2021)Performance Trade-offs in GPU Communication: A Study of Host and Device-initiated Approaches., , , , , , , and . PMBS@SC, page 126-137. IEEE, (2020)Fine-grained parallelization of lattice QCD kernel routine on GPUs., , and . J. Parallel Distributed Comput., 68 (10): 1350-1359 (2008)Optimized pre-copy live migration for memory intensive applications., , , and . SC, page 40:1-40:11. ACM, (2011)Exploiting communication concurrency on high performance computing systems., , , and . PMAM@PPoPP, page 132-143. ACM, (2015)Concurrent Phase Classification for Accelerating MPSoC Simulation., , and . ARCS Workshops, volume P-200 of LNI, page 421-432. GI, (2012)On the Exploitation of Value Predication and Producer Identification to Reduce Barrier Synchronization Time., and . IPDPS, page 43. IEEE Computer Society, (2001)Characterizing the Performance of Parallel Applications on Multi-socket Virtual Machines., , and . CCGRID, page 1-12. IEEE Computer Society, (2011)Architectural Requirements for Deep Learning Workloads in HPC Environments., , , , , , , , and . PMBS, page 7-17. IEEE, (2021)Poster: Advances in Gyrokinetic Particle in Cell Simulation for Fusion Plasmas to Extreme Scale., , , , , , , and . SC Companion, page 1441. IEEE Computer Society, (2012)