Author of the publication

Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer.

, , , , , , and . ICLR (Poster), OpenReview.net, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

No persons found for author name Mirhoseini, Azalia
add a person with the name Mirhoseini, Azalia
 

Other publications of authors with the same name

RankMap: A Platform-Aware Framework for Distributed Learning from Dense Datasets., , , , and . CoRR, (2015)GAP: Generalizable Approximate Graph Partitioning Framework., , , , and . CoRR, (2019)A Single-Shot Generalized Device Placement for Large Dataflow Graphs., , , , , , , and . IEEE Micro, 40 (5): 26-36 (2020)A Full-stack Accelerator Search Technique for Vision Applications., , , , , and . CoRR, (2021)Deep Mixture of Experts via Shallow Embedding., , , , , , and . CoRR, (2018)Learning to Design Accurate Deep Learning Accelerators with Inaccurate Multipliers., , , , , and . DATE, page 184-189. IEEE, (2022)Delving into Macro Placement with Reinforcement Learning., , , , , , , and . MLCAD, page 1-3. IEEE, (2021)HypoEnergy. Hybrid supercapacitor-battery power-supply optimization for Energy efficiency., and . DATE, page 887-890. IEEE, (2011)Deep3: Leveraging Three Levels of Parallelism for Efficient Deep Learning., , and . DAC, page 61:1-61:6. ACM, (2017)DeLight: Adding Energy Dimension To Deep Neural Networks., , and . ISLPED, page 112-117. ACM, (2016)