Zero-3 Offload: Scale DL models to trillion parameters without code changes

3 years ago
Anonymous $hYN7Hy7o7J

Zero-3 Offload: Scale DL models to trillion parameters without code changes

Mar 13, 2021, 4:23pm UTC
https://www.deepspeed.ai/news/2021/03/07/zero3-offload.html