Repository Details
An Inference Engine Dedicated to DeepSeek, Developed by the Creator of Redis
Received 758 stars in the past 6 days ✨
Free•MIT
Stars: 12.1k
Chinese: No
Language: C
Active: Yes
Contributors: 29
Issues: 122
Organization: No
Latest: None
Forks: 1k
License: MIT

This project is a lightweight local inference engine written in C by the creator of Redis and designed specifically for the DeepSeek-V4-Flash model. It is more than a simple GGUF runner: it is a fully functional, standalone inference engine that supports Metal and CUDA hardware acceleration, 2-bit quantization, a persistent on-disk KV cache, an HTTP API service, and coding agents, among other features.