ArtificialZeng/transformers-Explained

官方transformers源码解析。AI大模型时代，pytorch、transformer是新操作系统，其他都是运行在其上面的软件。

/ 100

Experimental

This project offers detailed explanations of the Hugging Face Transformers library's source code, focusing on models like LLaMA and Baichuan2. It helps AI/ML engineers and researchers understand the inner workings of large language models, providing insights into model architecture, training, and deployment. The input is the existing Transformers code, and the output is a deeper conceptual understanding for the practitioner.

No commits in the last 6 months.

Use this if you are an AI/ML engineer or researcher who wants to understand the underlying code of large language models powered by the Hugging Face Transformers library.

Not ideal if you are looking for a user-friendly tool to directly apply large language models without delving into their technical implementation.

Large Language Models AI/ML Engineering Deep Learning Research Model Architecture Natural Language Processing

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 6 / 25

Maturity 16 / 25

Community 5 / 25

How are scores calculated?

Stars

Forks

Language

Python

License

Apache-2.0

Higher-rated alternatives

lucidrains/x-transformers

A concise but complete full-attention transformer with a set of promising experimental features...

kanishkamisra/minicons

Utility for behavioral and representational analyses of Language Models

lucidrains/simple-hierarchical-transformer

Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT

lucidrains/dreamer4

Implementation of Danijar's latest iteration for his Dreamer line of work

Nicolepcx/Transformers-in-Action

This is the corresponding code for the book Transformers in Action

Explore Transformer Models

All categories Trending Transformer directory Insights