Author of the publication

Efficient cuDNN-Compatible Convolution-Pooling on the GPU.

PPAM (2), volume 12044 of Lecture Notes in Computer Science, page 46-58. Springer, (2019)

Other publications of authors with the same name

An Optimal Parallel Algorithm for Computing the Summed Area Table on the GPU. IPDPS Workshops, page 763-772. IEEE Computer Society, (2018)
GPU-Accelerated Bulk Computation of the Eigenvalue Problem for Many Small Real Non-symmetric Matrices. CANDAR, page 490-496. IEEE Computer Society, (2016)
Tile art image generation using parallel greedy algorithm on the GPU and its approximation with machine learning. Concurr. Comput. Pract. Exp., (2021)
A Square Pointillism Image Generation, and Its GPU Acceleration. CANDAR, page 38-47. IEEE Computer Society, (2017)
Efficient Triangular Matrix Vector Multiplication on the GPU. PPAM (1), volume 12043 of Lecture Notes in Computer Science, page 493-504. Springer, (2019)
Efficient convolution pooling on the GPU. J. Parallel Distributed Comput., (2020)
Tile Art Image Generation Using Conditional Generative Adversarial Networks. CANDAR Workshops, page 209-215. IEEE Computer Society, (2018)
Almost optimal column-wise prefix-sum computation on the GPU. J. Supercomput., 74 (4): 1510-1521 (2018)
An Efficient GPU Implementation of Bulk Computation of the Eigenvalue Problem for Many Small Real Non-symmetric Matrices. Int. J. Netw. Comput., 7 (2): 227-247 (2017)