Microsoft Open-Sources the Deep Learning Training Optimization Library
Received 56 stars in the past 7 days ✨
Free•Apache-2.0
Stars: 39.9k
Chinese: No
Language: Python
Active: Yes
Contributors: 445
Issues: 1k
Organization: Yes
Latest: 0.17.5
Forks: 5k
License: Apache-2.0
Training large language models (LLMs) is time-consuming and costly. This project reduces the time and cost of training by using ZeRO++ technology, which partitions the model state across GPUs so that each GPU holds only a shard of it rather than a full copy, thereby increasing throughput.
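To make the partitioning idea concrete, here is a rough back-of-the-envelope sketch in plain Python. It does not use the library itself. It assumes mixed-precision training with Adam, where the model state costs about 16 bytes per parameter (2 for fp16 weights, 2 for fp16 gradients, 12 for fp32 optimizer state), and it assumes the whole model state is split evenly across GPUs. The function name and constant are hypothetical, chosen only for illustration, and the sketch ignores activations, buffers, and the communication optimizations that ZeRO++ adds on top.

```python
# Assumption: 2 bytes (fp16 weights) + 2 bytes (fp16 gradients)
# + 12 bytes (fp32 Adam state: master weights, momentum, variance).
BYTES_PER_PARAM = 16


def model_state_bytes_per_gpu(num_params: int, num_gpus: int, partitioned: bool) -> float:
    """Estimate the model-state memory each GPU must hold.

    Hypothetical helper for illustration only; not part of any library.
    """
    if num_params < 0:
        raise ValueError("num_params must be non-negative")
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    total = num_params * BYTES_PER_PARAM
    # Without partitioning every GPU keeps a full replica of the model state;
    # with partitioning each GPU keeps an equal shard.
    return total / num_gpus if partitioned else float(total)


if __name__ == "__main__":
    params = 7_500_000_000  # illustrative model size, not taken from the source
    for gpus in (1, 8, 64):
        replicated = model_state_bytes_per_gpu(params, gpus, partitioned=False)
        sharded = model_state_bytes_per_gpu(params, gpus, partitioned=True)
        print(f"{gpus:>3} GPUs: replicated {replicated / 1e9:.1f} GB, "
              f"partitioned {sharded / 1e9:.1f} GB per GPU")
```

Under these assumptions the per-GPU footprint shrinks in proportion to the number of GPUs, which is what frees memory for larger models or larger batches.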