High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach.

SC, page 69:1-69:11. ACM, (2011)

Other publications of authors with the same name

On Scale-out Deep Learning Training for Cloud and HPC. CoRR, (2018)

High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach. Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, page 69:1-69:11. New York, NY, USA, ACM, (2011)

Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers. IPDPS, page 1-6. IEEE, (2007)

Benefits of I/O Acceleration Technology (I/OAT) in Clusters. ISPASS, page 220-229. IEEE Computer Society, (2007)

Designing Efficient Cooperative Caching Schemes for Multi-Tier Data-Centers over RDMA-enabled Networks. CCGRID, page 401-408. IEEE Computer Society, (2006)

Distributed Deep Learning Using Synchronous Stochastic Gradient Descent. CoRR, (2016)

DiST: A Scalable, Efficient P2P Lookup Protocol. AP2PC, volume 3601 of Lecture Notes in Computer Science, page 40-53. Springer, (2004)

Improving concurrency and asynchrony in multithreaded MPI applications using software offloading. SC, page 30:1-30:12. ACM, (2015)

Improving Communication Performance and Scalability of Native Applications on Intel Xeon Phi Coprocessor Clusters. IPDPS, page 1083-1092. IEEE Computer Society, (2014)

Advanced RDMA-Based Admission Control for Modern Data-Centers. CCGRID, page 384-391. IEEE Computer Society, (2008)