Dehao Chen

論文

2020

  • GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding

./ ../pages

Type to search.