Author of the publication

LIBSHALOM: optimizing small and irregular-shaped matrix multiplications on ARMv8 multi-cores.

SC, page 72. ACM, (2021)


Other publications of authors with the same name

NoC power optimization using combined routing algorithms. ICIS, pages 299-304. IEEE Computer Society, (2017)
LIBSHALOM: optimizing small and irregular-shaped matrix multiplications on ARMv8 multi-cores. SC, page 72. ACM, (2021)
LARE: A Linear Approximate Reinforcement Learning Based Adaptive Routing for Network-on-Chips. ISCAS, pages 1-5. IEEE, (2023)
Efficiently Running SpMV on Multi-core DSPs for Banded Matrix. ICA3PP (5), volume 14491 of Lecture Notes in Computer Science, pages 201-220. Springer, (2023)
DNNEmu: A Lightweight Performance Emulator for Distributed DNN Training. ICA3PP, volume 13777 of Lecture Notes in Computer Science, pages 722-736. Springer, (2022)
Performance Evaluation of Memory-Centric ARMv8 Many-Core Architectures: A Case Study with Phytium 2000+. J. Comput. Sci. Technol., 36 (1): 33-43 (2021)
A survey of machine learning for Network-on-Chips. J. Parallel Distributed Comput., (April 2024)
CIB-HIER: Centralized Input Buffer Design in Hierarchical High-radix Routers. ACM Trans. Archit. Code Optim., 18 (4): 50:1-50:21 (2021)
FLYER: Fine-grained landmark based greedy geographic routing under uncertain locations. ICC, pages 166-171. IEEE, (2014)
CRSP: Network Congestion Control through Credit Reservation. ISPA/IUCC/BDCloud/SocialCom/SustainCom, pages 692-699. IEEE, (2018)