Repository Details
Shared by


HelloGitHub Rating
0 ratings
Free•MIT
Claim
Discuss
Collect
Share
79.6k
Stars
Yes
Chinese
Other
Language
Yes
Active
8
Contributors
279
Issues
Yes
Organization
None
Latest
1w
Forks
MIT
License
More

This project creatively builds upon the DeepSeek V3 base model and employs large-scale reinforcement learning techniques to successfully train an inference model entirely enhanced by reinforcement learning. It matches the intelligence level of OpenAI's o1 official version while boasting extremely low training costs. The model weights are open-sourced, and the training methods and techniques are publicly disclosed.
Comments
Rating:
No comments yet