Microsoft Open-Sources the Deep Learning Training Optimization Library
Past 7 days Received 77 stars ✨
Free•Apache-2.0
Claim
Discuss
Collect
Share
35.3k
Stars
No
Chinese
Python
Language
Yes
Active
376
Contributors
1k
Issues
Yes
Organization
0.15.3
Latest
4k
Forks
Apache-2.0
License
More
As everyone knows, training large language models (LLM) is a 'time-consuming and costly' task. This project reduces the time and cost required for training by leveraging ZeRO++ technology, which segments the model state onto each GPU, thereby increasing throughput.