Author of the publication

Adaptable Butterfly Accelerator for Attention-based NNs via Hardware and Algorithm Co-design.

, , , , , , , and . MICRO, page 599-615. IEEE, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Fully-Pipelined Hardware Design for Gaussian Mixture Models., , , , and . IEEE Trans. Computers, 66 (11): 1837-1850 (2017)Dynamic scheduling Monte-Carlo framework for multi-accelerator heterogeneous clusters., , , and . FPT, page 233-240. IEEE, (2010)Have GPUs Made FPGAs Redundant in the Field of Video Processing?, , , and . FPT, page 111-118. IEEE, (2005)A comparison of FPGAs, GPUS and CPUS for Smith-Waterman algorithm (abstract only)., , and . FPGA, page 281. ACM, (2011)ADAM: Automated Design Analysis and Merging for Speeding up FPGA Development., , and . FPGA, page 189-198. ACM, (2018)Axel: a heterogeneous cluster with FPGAs and GPUs., and . FPGA, page 115-124. ACM, (2010)Reconfigurable Hardware Acceleration of Canonical Graph Labelling., , and . ARC, volume 4419 of Lecture Notes in Computer Science, page 302-313. Springer, (2007)Reducing Underflow in Mixed Precision Training by Gradient Scaling., , , and . IJCAI, page 2922-2928. ijcai.org, (2020)Scheduled for July 2020, Yokohama, Japan, postponed due to the Corona pandemic..Efficient reconfigurable design for pricing asian options., , , and . SIGARCH Comput. Archit. News, 38 (4): 14-20 (2010)Efficient Weight Reuse for Large LSTMs., , , , , , and . ASAP, page 17-24. IEEE, (2019)