Zero-3 Offload: Scale DL models to trillion parameters without code changes
Mar 13, 2021, 4:23pm UTC
https://www.deepspeed.ai/news/2021/03/07/zero3-offload.html