Repository Details
Stars: 13.6k
Chinese: No
Language: Jupyter Notebook
Active: No
Contributors: 1
Issues: 20
Organization: No
Latest: None
Forks: 1k
License: MIT
This project helps readers understand in depth how Large Language Models (LLMs) work by building Llama 3 layer by layer. Using PyTorch, the author loads the model weights, tokenizes text, sets up the model configuration, and implements the key components of the Transformer step by step.
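To give a sense of what implementing one component by hand can look like, here is a minimal sketch of RMS normalization, a normalization layer used in Llama-style Transformers. It is not taken from the repository: the project uses PyTorch tensors, while this sketch uses plain Python lists to stay dependency-free, and the function name `rms_norm`, the sample values, and the `eps` default are illustrative assumptions.

```python
import math

def rms_norm(x, weight, eps=1e-5):
    """Divide a vector by its root mean square, then apply a per-dimension gain.

    `eps` is an assumed small constant that guards against division by zero.
    """
    mean_sq = sum(v * v for v in x) / len(x)
    inv_rms = 1.0 / math.sqrt(mean_sq + eps)
    return [v * inv_rms * w for v, w in zip(x, weight)]

# Hypothetical 4-dimensional hidden state with a neutral (all-ones) gain.
hidden = [1.0, -2.0, 3.0, -4.0]
gain = [1.0] * len(hidden)
normed = rms_norm(hidden, gain)

# With a neutral gain, the mean square of the output is close to 1.
mean_sq_out = sum(v * v for v in normed) / len(normed)
```

In a full model the gain vector would be a learned parameter loaded with the rest of the weights, and the same operation would be applied to every token's hidden state.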