Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Analyzing and Leveraging Decoupled L1 Caches in GPUs., , , , and . HPCA, page 467-478. IEEE, (2021)Efficient Cache Utilization via Model-aware Data Placement for Recommendation Models., , and . MEMSYS, page 2:1-2:11. ACM, (2021)Architectural Support for Efficient Large-Scale Automata Processing., , , , and . MICRO, page 908-920. IEEE Computer Society, (2018)Balanced Data Placement for GEMV Acceleration with Processing-In-Memory., , and . CoRR, (2024)Just-in-time Quantization with Processing-In-Memory for Efficient ML Training., , , , and . CoRR, (2023)Inclusive-PIM: Hardware-Software Co-design for Broad Acceleration on Commercial PIM Architectures., , , , , and . CoRR, (2023)Analyzing and Leveraging Shared L1 Caches in GPUs., , , , and . PACT, page 161-173. ACM, (2020)Controlled Kernel Launch for Dynamic Parallelism in GPUs., , , , , , , , and . HPCA, page 649-660. IEEE Computer Society, (2017)Collaborative Acceleration for FFT on Commercial Processing-In-Memory Architectures., and . CoRR, (2023)Analyzing and Leveraging Remote-Core Bandwidth for Enhanced Performance in GPUs., , , and . PACT, page 258-271. IEEE, (2019)