Jeff Dean

Papers

2017

  • Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer

2022

  • ST-MoE: Designing Stable and Transferable Sparse Expert Models
