romanohu
Home About Page Search
Home  »  Pages  »  Research  »  Book_src  »  03_research  »  Authors  »  Overseas  »  William Fedus

William Fedus

論文

2022

  • Switch Transformers: Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
  • ST-MoE: Designing Stable and Transferable Sparse Expert Models
@2026 romanohu | links |