Repository Details
Shared by


HelloGitHub Rating
0 ratings
Free•MIT
Claim
Discuss
Collect
Share
91k
Stars
Yes
Chinese
Other
Language
Yes
Active
13
Contributors
61
Issues
Yes
Organization
1.0.0
Latest
1w
Forks
MIT
License
More

This project creatively builds upon the DeepSeek V3 base model and employs large-scale reinforcement learning techniques to successfully train an inference model entirely enhanced by reinforcement learning. It matches the intelligence level of OpenAI's o1 official version while boasting extremely low training costs. The model weights are open-sourced, and the training methods and techniques are publicly disclosed.
Comments
Rating:
No comments yet