Microsoft Open-Sources the Deep Learning Training Optimization Library
Past 7 days Received 63 stars ✨
Free•Apache-2.0
Claim
Discuss
Collect
Share
40.2k
Stars
No
Chinese
Python
Language
Yes
Active
452
Contributors
1k
Issues
Yes
Organization
0.17.6
Latest
5k
Forks
Apache-2.0
License
More
As everyone knows, training large language models (LLM) is a 'time-consuming and costly' task. This project reduces the time and cost required for training by leveraging ZeRO++ technology, which segments the model state onto each GPU, thereby increasing throughput.