Author of the publication

XcalableACC: extension of XcalableMP PGAS language using OpenACC for accelerator clusters.

WACCPD@SC, page 27-36. IEEE Computer Society, (2014)

Other publications of authors with the same name

A Source-to-Source OpenACC Compiler for CUDA. Euro-Par Workshops, volume 8374 of Lecture Notes in Computer Science, page 178-187. Springer, (2013)
An Efficient Technique for Large Mini-batch Challenge of DNNs Training on Large Scale Cluster. HPDC, page 203-207. ACM, (2020)
Evaluation of XcalableACC with tightly coupled accelerators/InfiniBand hybrid communication on accelerated cluster. Int. J. High Perform. Comput. Appl., (2019)
Yet Another Accelerated SGD: ResNet-50 Training on ImageNet in 74.7 seconds. CoRR, (2019)
Estimation of Shor's Circuit for 2048-bit Integers based on Quantum Simulator. IACR Cryptol. ePrint Arch., (2023)
The 16,384-node Parallelism of 3D-CNN Training on An Arm CPU based Supercomputer. HiPC, page 152-161. IEEE, (2021)
Experiments and Resource Analysis of Shor's Factorization Using a Quantum Simulator. ICISC (1), volume 14561 of Lecture Notes in Computer Science, page 119-139. Springer, (2023)
mpiQulacs: A Scalable Distributed Quantum Computer Simulator for ARM-based Clusters. QCE, page 959-969. IEEE, (2023)
mpiQulacs: A Distributed Quantum Computer Simulator for A64FX-based Cluster Systems. CoRR, (2022)
Implementation and Evaluation of One-sided PGAS Communication in XcalableACC for Accelerated Clusters. CCGrid, page 625-634. IEEE Computer Society / ACM, (2017)