Author of the publication

Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators.

ASPLOS (3), pages 314-328. ACM, (2023)

Other publications of authors with the same name

Cambricon-D: Full-Network Differential Acceleration for Diffusion Models. ISCA, pages 903-914. IEEE, (2024)

Cambricon-P: A Bitflow Architecture for Arbitrary Precision Computing. MICRO, pages 57-72. IEEE, (2022)

MSCU: Accelerating CNN Inference with Multiple Sizes of Compute Unit on FPGAs. MCSoC, pages 106-113. IEEE, (2021)

BabelTower: Learning to Auto-parallelized Program Translation. ICML, volume 162 of Proceedings of Machine Learning Research, pages 23685-23700. PMLR, (2022)

Performance Analysis of GPU-Based Convolutional Neural Networks. ICPP, pages 67-76. IEEE Computer Society, (2016)

Cambricon-U: A Systolic Random Increment Memory Architecture for Unary Computing. MICRO, pages 424-437. ACM, (2023)

BALTO: fast tensor program optimization with diversity-based active learning. ICLR, OpenReview.net, (2023)

Heron: Automatically Constrained High-Performance Library Generation for Deep Learning Accelerators. ASPLOS (3), pages 314-328. ACM, (2023)

TensorTEE: Unifying Heterogeneous TEE Granularity for Efficient Secure Collaborative Tensor Computing. CoRR, (2024)

A new asynchronous parallel load flow calculation algorithm. RAM, pages 1027-1031. IEEE, (2008)