Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

CoopStreaming: A Novel Peer-to-Peer System for Fast Live Media Streaming., , , and . WAIM, volume 3739 of Lecture Notes in Computer Science, page 882-887. Springer, (2005)Towards Efficient Large-Scale Graph Neural Network Computing., , , , , , and . CoRR, (2018)BitNet: Scaling 1-bit Transformers for Large Language Models., , , , , , , , , and . CoRR, (2023)Garaph: Efficient GPU-accelerated Graph Processing on a Single Machine with Balanced Replication., , , , and . USENIX Annual Technical Conference, page 195-207. USENIX Association, (2017)PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation., , , , , , , , , and 1 other author(s). SOSP, page 331-347. ACM, (2023)Accelerating GNN training with locality-aware partial execution., , , , , , , and . APSys, page 34-41. ACM, (2021)ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores., , , , , , , , , and . PPoPP, page 333-347. ACM, (2024)CuWide: Towards Efficient Flow-based Training for Sparse Wide Models on GPUs (Extended Abstract)., , , , , , and . ICDE, page 2330-2331. IEEE, (2021)NeuGraph: Parallel Deep Neural Network Computation on Large Graphs., , , , , , and . USENIX Annual Technical Conference, page 443-458. USENIX Association, (2019)FlexMoE: Scaling Large-scale Sparse Pre-trained Model Training via Dynamic Device Placement., , , , , , , and . Proc. ACM Manag. Data, 1 (1): 110:1-110:19 (2023)