copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition.

L. Ye, Z. Tao, Y. Huang, and Y. Li. CoRR, (2024)

Links and resources

BibTeX key: journals/corr/abs-2402-15220
entry type: article
year: 2024
journal: CoRR
volume: abs/2402.15220
ee: https://doi.org/10.48550/arXiv.2402.15220
url: http://dblp.uni-trier.de/db/journals/corr/corr2402.html#abs-2402-15220

Tags

Cite this publication

search on

Meta data

Last update 21 days ago
Created a month ago

Comments and Reviews
(0)

There is no review or comment yet. You can write one!