Author of the publication

Please choose the person to associate with this publication.

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some of the publications already assigned to that person.

 

Other publications of authors with the same name

At the Locus of Performance: Quantifying the Effects of Copious 3D-Stacked Cache on HPC Workloads. ACM Trans. Archit. Code Optim., 20 (4): 57:1-57:26 (December 2023)

Poster: fast GPU read alignment with burrows wheeler transform based index. SC Companion, page 21-22. ACM, (2011)

Scaling distributed deep learning workloads beyond the memory capacity with KARMA. SC, page 19. IEEE/ACM, (2020)

Sequence Alignment on Massively Parallel Heterogeneous Systems. IPDPS Workshops, page 2498-2501. IEEE Computer Society, (2012)

Intrinsic Evaluations of Word Embeddings: What Can We Do Better? RepEval@ACL, page 36-42. Association for Computational Linguistics, (2016)

Large-scale distributed sorting for GPU-based heterogeneous supercomputers. IEEE BigData, page 510-518. IEEE Computer Society, (2014)

Why Globally Re-shuffle? Revisiting Data Shuffling in Large Scale Deep Learning. IPDPS, page 1085-1096. IEEE, (2022)

A Multi GPU Read Alignment Algorithm with Model-Based Performance Optimization. VECPAR, volume 7851 of Lecture Notes in Computer Science, page 270-277. Springer, (2012)

Learning Neural Representations for Predicting GPU Performance. ISC, volume 11501 of Lecture Notes in Computer Science, page 40-58. Springer, (2019)

Outlier Dimensions that Disrupt Transformers are Driven by Frequency. EMNLP (Findings), page 1286-1304. Association for Computational Linguistics, (2022)