Author of the publication

GOPipe: A Granularity-Oblivious Programming Framework for Pipelined Stencil Executions on GPU.

, , , , and . PACT, page 43-54. ACM, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Hyperparameter Optimization for Effort Estimation., , , , , and . CoRR, (2018)TOP: A Framework for Enabling Algorithmic Optimizations for Distance-Related Problems., , , and . Proc. VLDB Endow., 8 (10): 1046-1057 (2015)CoCoPIE: Making Mobile AI Sweet As PIE -Compression-Compilation Co-Design Goes a Long Way., , , and . CoRR, (2020)Deep reuse: streamline CNN inference on the fly via coarse-grained computation reuse., and . ICS, page 438-448. ACM, (2019)Optimizing Data Placement on GPU Memory: A Portable Approach., , , and . IEEE Trans. Computers, 66 (3): 473-487 (2017)Special Issue: Graph Computing., , , and . Concurr. Comput. Pract. Exp., (2020)GLORE: generalized loop redundancy elimination upon LER-notation., and . Proc. ACM Program. Lang., 1 (OOPSLA): 74:1-74:28 (2017)An Infrastructure for Tackling Input-Sensitivity of GPU Program Optimizations., , , and . Int. J. Parallel Program., 41 (6): 855-869 (2013)Predicting locality phases for dynamic memory optimization., , and . J. Parallel Distributed Comput., 67 (7): 783-796 (2007)Expanding the Edge: Enabling Efficient Winograd CNN Inference With Deep Reuse on Edge Device., , , , , , , and . IEEE Trans. Knowl. Data Eng., 35 (10): 10181-10196 (October 2023)