Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs.

A. Arunkumar, S. Lee, V. Soundararajan, and C. Wu. HPCA, page 221-234. IEEE Computer Society, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Arunkumar Ramaswamy

Arunkumar Dhayalan

Akhil Chandra

Arunkumar Subramanian

Arunkumar Mitra

Other publications of authors with the same name

DORA: Optimizing Smartphone Energy Efficiency and Web Browser Performance under Interference.D. Shingari, A. Arunkumar, B. Gaudette, S. Vrudhula, and C. Wu. ISPASS, page 64-75. IEEE Computer Society, (2018)Understanding the Future of Energy Efficiency in Multi-Module GPUs.A. Arunkumar, E. Bolotin, D. Nellans, and C. Wu. HPCA, page 519-532. IEEE, (2019)ID-cache: instruction and memory divergence based cache management for GPUs.A. Arunkumar, S. Lee, and C. Wu. IISWC, page 158-167. IEEE Computer Society, (2016)Characterization and Throttling-Based Mitigation of Memory Interference for Heterogeneous Smartphones.D. Shingari, A. Arunkumar, and C. Wu. IISWC, page 22-33. IEEE Computer Society, (2015)MCM-GPU: Multi-Chip-Module GPUs for Continued Performance Scalability.A. Arunkumar, E. Bolotin, B. Cho, U. Milic, E. Ebrahimi, O. Villa, A. Jaleel, C. Wu, and D. Nellans. ISCA, page 320-332. ACM, (2017)Estimating correlation for a real-time measure of connectivity.A. Arunkumar, A. Panday, B. Joshi, A. Ravindran, and H. Zaveri. EMBC, page 5190-5193. IEEE, (2012)Keyformer: KV Cache Reduction through Key Tokens Selection for Efficient Generative Inference.M. Adnan, A. Arunkumar, G. Jain, P. Nair, I. Soloveychik, and P. Kamath. CoRR, (2024)LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs.A. Arunkumar, S. Lee, V. Soundararajan, and C. Wu. HPCA, page 221-234. IEEE Computer Society, (2018)CAWA: coordinated warp scheduling and cache prioritization for critical warp acceleration of GPGPU workloads.S. Lee, A. Arunkumar, and C. Wu. ISCA, page 515-527. ACM, (2015)Beyond the socket: NUMA-aware GPUs.U. Milic, O. Villa, E. Bolotin, A. Arunkumar, E. Ebrahimi, A. Jaleel, A. Ramírez, and D. Nellans. MICRO, page 123-135. ACM, (2017)

BibSonomy

Disambiguation of "Arunkumar, Akhil"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs.

Please choose a person to relate this publication to

Arunkumar Ramaswamy

Arunkumar Dhayalan

Akhil Chandra

Arunkumar Subramanian

Arunkumar Mitra

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Arunkumar, Akhil"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs.

Please choose a person to relate this publication to

Arunkumar Ramaswamy

Arunkumar Dhayalan

Akhil Chandra

Arunkumar Subramanian

Arunkumar Mitra

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs.