DeepSpeed: Extreme-scale model training for everyone

DeepSpeed: Extreme-scale model training for everyone – Microsoft Research

DeepSpeed continues to innovate, making its tools more powerful while broadening its reach. Learn how it now powers 10x bigger model training on a single GPU, 10x longer input sequences, and 5x less communication volume, and how it scales to train trillion-parameter models.

Source: www.microsoft.com/en-us/research/blog/deepspeed-extreme-scale-model-training-for-everyone/

September 14, 2020