From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

SLIDE : In Defense of Smart Algorithms over Hardware Acceleration for Large-Scale Deep Learning Systems, , , , , и . (2019)cite arxiv:1903.03129Comment: Published at MLSys 2020.Lsh-sampling Breaks the Computation Chicken-and-egg Loop in Adaptive Stochastic Gradient Estimation., , и . CoRR, (2019)Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models., , , , , , и . CoRR, (2021)Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding., , , , , , и . CoRR, (2024)Laughing Hyena Distillery: Extracting Compact Recurrences From Convolutions., , , , , , , , , и 4 other автор(ы). CoRR, (2023)Learn To be Efficient: Build Structured Sparsity in Large Language Models., , , , и . CoRR, (2024)Efficient Streaming Language Models with Attention Sinks., , , , и . CoRR, (2023)Scatterbrain: Unifying Sparse and Low-rank Attention Approximation., , , , , и . CoRR, (2021)Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer., , , и . CoRR, (2023)SOLAR: Sparse Orthogonal Learned and Random Embeddings., , и . ICLR, OpenReview.net, (2021)