Microsoft Open-Sources the Deep Learning Training Optimization Library
Received 19 stars in the past 6 days ✨
Free•Apache-2.0
Stars: 41.7k
Chinese: No
Language: Python
Active: Yes
Contributors: 473
Issues: 1k
Organization: Yes
Latest: 0.18.6
Forks: 5k
License: Apache-2.0
Training large language models (LLMs) is notoriously time-consuming and costly. This project reduces training time and cost by leveraging ZeRO++ technology, which partitions the model state across the GPUs instead of keeping a full copy on each one, thereby increasing throughput.
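To make the partitioning idea concrete, here is a rough back-of-the-envelope sketch of per-GPU memory for the model state. It is plain arithmetic and does not call the library. The byte counts (2 bytes per parameter for fp16 weights, 2 for fp16 gradients, 12 for fp32 Adam optimizer state) are assumptions taken from the usual mixed-precision Adam accounting for ZeRO; the function name `model_state_bytes_per_gpu` is hypothetical, and activations, buffers, and the communication optimizations specific to ZeRO++ are not modeled.

```python
# Illustrative arithmetic only; this does not use or configure the library.
# Assumed accounting for mixed-precision Adam (per parameter):
BYTES_PARAMS = 2   # fp16 parameters
BYTES_GRADS = 2    # fp16 gradients
BYTES_OPTIM = 12   # fp32 master weights + Adam momentum + Adam variance


def model_state_bytes_per_gpu(num_params: int, num_gpus: int, stage: int) -> float:
    """Estimate model-state bytes held by each GPU (hypothetical helper).

    stage 0: everything replicated on every GPU
    stage 1: optimizer state partitioned across GPUs
    stage 2: optimizer state and gradients partitioned
    stage 3: optimizer state, gradients, and parameters partitioned
    """
    if stage not in (0, 1, 2, 3):
        raise ValueError("stage must be 0, 1, 2, or 3")
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")

    params = float(BYTES_PARAMS * num_params)
    grads = float(BYTES_GRADS * num_params)
    optim = float(BYTES_OPTIM * num_params)

    # Each partitioned component is split evenly over the GPUs.
    if stage >= 1:
        optim /= num_gpus
    if stage >= 2:
        grads /= num_gpus
    if stage >= 3:
        params /= num_gpus
    return params + grads + optim


if __name__ == "__main__":
    # Example values (assumed): a 7.5B-parameter model on 64 GPUs.
    for stage in range(4):
        gb = model_state_bytes_per_gpu(7_500_000_000, 64, stage) / 1e9
        print(f"stage {stage}: {gb:.1f} GB per GPU")
```

With these assumed numbers, the estimate falls from 120 GB per GPU with full replication to about 1.9 GB when all three components are partitioned, which is why partitioning lets larger models and batches fit on the same hardware.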