Author of the publication

Searching Optimal Floating-Point Format for Sub-8-Bit Large Language Model Inference.

, , , , and . ICEIC, page 1-4. IEEE, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Approximate Computing Techniques for Deep Neural Networks., and . Approximate Circuits, Springer, (2019)A 7-nm Four-Core Mixed-Precision AI Chip With 26.2-TFLOPS Hybrid-FP8 Training, 104.9-TOPS INT4 Inference, and Workload-Aware Throttling., , , , , , , , , and 34 other author(s). IEEE J. Solid State Circuits, 57 (1): 182-197 (2022)Robust Machine Learning Systems: Challenges, Current Trends, Perspectives, and the Road Ahead., , , , , , and . CoRR, (2021)PillarAcc: Sparse PointPillars Accelerator for Real-Time Point Cloud 3D Object Detection on Edge Devices., , , , , , , and . CoRR, (2023)DeepTools: Compiler and Execution Runtime Extensions for RaPiD AI Accelerator., , , , , , , , , and 3 other author(s). IEEE Micro, 39 (5): 102-111 (2019)A Time Synchronization Protocol for Barrage Relay Networks., , , , and . Sensors, 23 (5): 2447 (March 2023)Transmission Power Control with the Guaranteed Communication Reliability in WSN., , , , and . IJDSN, (2015)Internal Task-Aware Command Scheduling to Improve Read Performance of Embedded Flash Storage Systems., , , , , , and . IEEE Access, (2021)Hardware and Software Co-optimization for the Initialization Failure of the ReRAM-based Cross-bar Array., , , , and . ACM J. Emerg. Technol. Comput. Syst., 16 (4): 36:1-36:19 (2020)Lightweight Error Correction for In-Storage Acceleration of Large Language Model Inference., , , and . ICEIC, page 1-4. IEEE, (2024)