Microsoft Open-Sources the Deep Learning Training Optimization Library
Past 7 days Received 80 stars ✨
Free•Apache-2.0
Claim
Discuss
Collect
Share
35.9k
Stars
No
Chinese
Python
Language
Yes
Active
394
Contributors
1k
Issues
Yes
Organization
0.16.2
Latest
4k
Forks
Apache-2.0
License
More
As everyone knows, training large language models (LLM) is a 'time-consuming and costly' task. This project reduces the time and cost required for training by leveraging ZeRO++ technology, which segments the model state onto each GPU, thereby increasing throughput.