ArtificialZeng/transformers-Explained
官方transformers源码解析。AI大模型时代,pytorch、transformer是新操作系统,其他都是运行在其上面的软件。
This project offers detailed explanations of the Hugging Face Transformers library's source code, focusing on models like LLaMA and Baichuan2. It helps AI/ML engineers and researchers understand the inner workings of large language models, providing insights into model architecture, training, and deployment. The input is the existing Transformers code, and the output is a deeper conceptual understanding for the practitioner.
No commits in the last 6 months.
Use this if you are an AI/ML engineer or researcher who wants to understand the underlying code of large language models powered by the Hugging Face Transformers library.
Not ideal if you are looking for a user-friendly tool to directly apply large language models without delving into their technical implementation.
Stars
16
Forks
1
Language
Python
License
Apache-2.0
Category
Last pushed
Sep 25, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ArtificialZeng/transformers-Explained"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
lucidrains/x-transformers
A concise but complete full-attention transformer with a set of promising experimental features...
kanishkamisra/minicons
Utility for behavioral and representational analyses of Language Models
lucidrains/simple-hierarchical-transformer
Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT
lucidrains/dreamer4
Implementation of Danijar's latest iteration for his Dreamer line of work
Nicolepcx/Transformers-in-Action
This is the corresponding code for the book Transformers in Action