Repository Details
Shared by
HelloGitHub Rating
0 ratings
Free•MIT
Claim
Discuss
Collect
Share
7.6k
Stars
No
Chinese
Python
Language
Yes
Active
12
Contributors
1
Issues
No
Organization
None
Latest
1k
Forks
MIT
License
More

This project is a hands-on tutorial for training large language models from scratch. It doesn't just call transformers to run an example, but implements the complete process from the ground up using PyTorch, including Transformer architecture, pre-training, supervised fine-tuning, reward modeling, and evaluation.
Comments
Rating:
No comments yet