Inproceedings,

μLayer: Low Latency On-Device Inference Using Cooperative Single-Layer Acceleration and Processor-Friendly Quantization.

, , , , and .
EuroSys, page 45:1-45:15. ACM, (2019)

Meta data

Tags

Users

  • @dblp

Comments and Reviews