DeepSpeed: Extreme-scale model training for everyone
DeepSpeed: Extreme-scale model training for everyone – Microsoft Research
DeepSpeed continues to innovate, making its tools more powerful while broadening its reach. Learn how it now powers 10x bigger model training on one GPU, 10x longer input sequences, 5x less communication volume, & scales to train trillion-parameter models.
Source: www.microsoft.com/en-us/research/blog/deepspeed-extreme-scale-model-training-for-everyone/
September 14, 2020
Subscribe
Login
Please login to comment
0 Comments