Inproceedings,

Towards A Unified View of Sparse Feed-Forward Network in Pretraining Large Language Model.

, , , , and .
EMNLP, page 15038-15061. Association for Computational Linguistics, (2023)

Meta data

Tags

Users

  • @dblp

Comments and Reviews