下拉刷新
Repository Details
Shared bynavbar_avatar
repo_avatar
HelloGitHub Rating
0 ratings
Training Large Language Models from Scratch
FreeMIT
Claim
Collect
Share
7.6k
Stars
No
Chinese
Python
Language
Yes
Active
12
Contributors
1
Issues
No
Organization
None
Latest
1k
Forks
MIT
License
More
train-llm-from-scratch image
This project is a hands-on tutorial for training large language models from scratch. It doesn't just call transformers to run an example, but implements the complete process from the ground up using PyTorch, including Transformer architecture, pre-training, supervised fine-tuning, reward modeling, and evaluation.
Included in:
Vol.123
Tags:
Tutorial
AI
LLM

Comments

Rating:
No comments yet