Author of the publication

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity.

, , , , , , , , and . CoRR, (2023)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to that person.

 

Other publications of authors with the same name

Auto-Parallelizing Large Models with Rhino: A Systematic Approach on Production AI Platform., , , , , , , , , and . CoRR, (2023)

DREW: Efficient Winograd CNN Inference with Deep Reuse., , , , , and . WWW, page 1807-1816. ACM, (2022)

Higher-order Weyl superconductors with anisotropic Weyl-point connectivity, , , , , , and . Phys. Rev. B, 103 (18): 184510 (May 19, 2021)

DISC: A Dynamic Shape Compiler for Machine Learning Workloads., , , , , , , , , and . EuroMLSys@EuroSys, page 89-95. ACM, (2021)

Design of Combined Receiving Lens for Panoramic Laser Fuze Detection., , , and . ICIA, page 281-285. IEEE, (2018)

HiWayLib: A Software Framework for Enabling High Performance Communications for Heterogeneous Pipeline Computations., , , , , and . ASPLOS, page 153-166. ACM, (2019)

Optimizing distributed training deployment in heterogeneous GPU clusters., , , , , , , , and . CoNEXT, page 93-107. ACM, (2020)

Illumination Variation in Images in Independent Component Analysis and Principal Component Analysis Subspaces., and . ISDA (2), page 724-729. IEEE Computer Society, (2006)

MIMO channel estimation based on distributed compressed sensing for LTE-advanced., , , , and . ICICS, page 1-5. IEEE, (2013)

Fast Robust Point Cloud Registration Based on Compatibility Graph and Accelerated Guided Sampling., , , and . Remote. Sens., 16 (15): 2789 (August 2024)