Article,

GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding.

, , , , , , , , and .
CoRR, (2020)

Meta data

Tags

Users

  • @dblp

Comments and Reviews