Author of the publication

Reformulating the direct convolution for high-performance deep learning inference on ARM processors.

, , , , , , , and . J. Syst. Archit., (February 2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Concurrent and Accurate RNA Sequencing on Multicore Platforms, , , , , , and . CoRR, (2013)High performance and energy efficient inference for deep learning on multicore ARM processors using general optimization techniques and BLIS., , , , , and . J. Syst. Archit., (2022)The UJIpenchars Database: a Pen-Based Database of Isolated Handwritten Characters., , , , , , , , , and 6 other author(s). LREC, European Language Resources Association, (2008)Speeding Up the Computation of the Edit Distance for Cyclic Strings., and . ICPR, page 2891-2894. IEEE Computer Society, (2000)Concurrent and Accurate Short Read Mapping on Multicore Processors., , , , , , and . IEEE ACM Trans. Comput. Biol. Bioinform., 12 (5): 995-1007 (2015)Parallel solution of large-scale algebraic Bernoulli equations with the matrix sign function method., , and . Int. J. Comput. Sci. Eng., 4 (2): 88-93 (2009)A Flexible Research-Oriented Framework for Distributed Training of Deep Neural Networks., , , , and . IPDPS Workshops, page 730-739. IEEE, (2021)FaST-LMM for Two-Way Epistasis Tests on High-Performance Clusters., , , , , , and . J. Comput. Biol., 25 (8): 862-870 (2018)Some approaches to statistical and finite-state speech-to-speech translation., , , , , , , , , and . Comput. Speech Lang., 18 (1): 25-47 (2004)Parallel Solution of Large-Scale Algebraic Bernoulli Equations with the Matrix Sign Function Method., , and . ICPP Workshops, page 189-193. IEEE Computer Society, (2005)