Inproceedings

SpotServe: Serving Generative Large Language Models on Preemptible Instances.

ASPLOS (2), pages 1112-1127. ACM, (2024)
