romanohu
Maxim Krikun
Papers
2020
GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding