DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

DeepSpeed

Training large language models (LLMs) is notoriously time-consuming and expensive. This project uses ZeRO++ to partition the model states across GPUs during training, which raises throughput and in turn reduces the time and cost of training.
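As a rough illustration of how this partitioning is usually switched on, the sketch below builds a DeepSpeed-style JSON configuration in plain Python. The helper name `build_zeropp_config`, the batch size, and the choice of 8 GPUs per node are assumptions made for this example. The ZeRO++ key names (`zero_quantized_weights`, `zero_hpz_partition_size`, `zero_quantized_gradients`) follow the DeepSpeed ZeRO++ tutorial and should be checked against the documentation for the installed version.

```python
import json


def build_zeropp_config(gpus_per_node: int) -> dict:
    """Return an illustrative DeepSpeed config dict with ZeRO stage 3 and ZeRO++ options.

    Hypothetical helper for this example; it is not part of DeepSpeed itself.
    """
    return {
        # Illustrative value, tune for the actual model and hardware.
        "train_micro_batch_size_per_gpu": 1,
        "bf16": {"enabled": True},
        "zero_optimization": {
            # Stage 3 partitions parameters, gradients, and optimizer states across GPUs.
            "stage": 3,
            # ZeRO++ options (names as given in the DeepSpeed ZeRO++ tutorial):
            "zero_quantized_weights": True,            # quantize weights before all-gather
            "zero_hpz_partition_size": gpus_per_node,  # secondary weight partition within a node
            "zero_quantized_gradients": True,          # quantize gradients during reduction
        },
    }


config = build_zeropp_config(gpus_per_node=8)

# Write the config to a file that could then be passed to a DeepSpeed launch.
with open("ds_config.json", "w") as f:
    json.dump(config, f, indent=2)
```

The resulting `ds_config.json` is the kind of file normally handed to a DeepSpeed training script. Whether these settings actually improve throughput depends on the model size and the cluster interconnect.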
