Author of the publication

LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs.

, , , and . HPCA, page 221-234. IEEE Computer Society, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

DORA: Optimizing Smartphone Energy Efficiency and Web Browser Performance under Interference., , , , and . ISPASS, page 64-75. IEEE Computer Society, (2018)Understanding the Future of Energy Efficiency in Multi-Module GPUs., , , and . HPCA, page 519-532. IEEE, (2019)ID-cache: instruction and memory divergence based cache management for GPUs., , and . IISWC, page 158-167. IEEE Computer Society, (2016)Characterization and Throttling-Based Mitigation of Memory Interference for Heterogeneous Smartphones., , and . IISWC, page 22-33. IEEE Computer Society, (2015)MCM-GPU: Multi-Chip-Module GPUs for Continued Performance Scalability., , , , , , , , and . ISCA, page 320-332. ACM, (2017)Estimating correlation for a real-time measure of connectivity., , , , and . EMBC, page 5190-5193. IEEE, (2012)Keyformer: KV Cache Reduction through Key Tokens Selection for Efficient Generative Inference., , , , , and . CoRR, (2024)LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs., , , and . HPCA, page 221-234. IEEE Computer Society, (2018)CAWA: coordinated warp scheduling and cache prioritization for critical warp acceleration of GPGPU workloads., , and . ISCA, page 515-527. ACM, (2015)Beyond the socket: NUMA-aware GPUs., , , , , , , and . MICRO, page 123-135. ACM, (2017)