CogitatorTech/zigformer
An educational transformer-based LLM in pure Zig
This project helps software developers understand how large language models (LLMs) work. It takes raw text and question-answer datasets as input and outputs a trained LLM, which developers can then use to generate text, answer questions, or integrate into their applications. It is aimed at developers and students who want to learn the underlying mechanics of modern AI language models.
Use this if you are a developer who wants to learn the fundamental architecture and implementation of a transformer-based large language model from scratch, without heavy external dependencies like PyTorch.
Not ideal if you need a production-ready, highly optimized LLM for commercial applications or if you are not comfortable working with the Zig programming language.
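The core mechanism behind any transformer-based model like this one is scaled dot-product attention. The repository itself is written in Zig; as an illustrative sketch only (not code from zigformer), here is that computation for a single query vector in plain Python:

```python
import math

def softmax(xs):
    # Subtract the max before exponentiating for numerical stability.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(q, keys, values):
    """Scaled dot-product attention for one query vector.

    q: query vector of length d_k
    keys, values: lists of vectors (one key and one value per position)
    Returns a weighted mix of the value vectors.
    """
    d_k = len(q)
    # Similarity of the query to each key, scaled by sqrt(d_k).
    scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys]
    weights = softmax(scores)  # weights sum to 1
    # Weighted sum of value vectors.
    return [sum(w * v[i] for w, v in zip(weights, values))
            for i in range(len(values[0]))]

# Toy example: the query matches the first key more strongly,
# so the output leans toward the first value vector.
out = attention([1.0, 0.0], [[1.0, 0.0], [0.0, 1.0]], [[1.0, 0.0], [0.0, 1.0]])
```

In a full transformer this is applied per attention head across all positions at once, with learned projections producing the queries, keys, and values; the sketch above only shows the arithmetic at the heart of it.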
Stars
44
Forks
1
Language
Zig
License
MIT
Category
Last pushed
Nov 27, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/CogitatorTech/zigformer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
huggingface/text-generation-inference
Large Language Model Text Generation Inference
OpenMachine-ai/transformer-tricks
A collection of tricks and tools to speed up transformer models
poloclub/transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
IBM/TabFormer
Code & Data for "Tabular Transformers for Modeling Multivariate Time Series" (ICASSP, 2021)
tensorgi/TPA
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6)...