hao-ai-lab/JacobiForcing
Jacobi Forcing: Fast and Accurate Diffusion-style Decoding
This project helps anyone working with Large Language Models (LLMs) who needs faster text generation. It takes an existing LLM, trains it with a new technique called Jacobi Forcing, and produces a significantly faster model. The end user is typically a developer or researcher deploying or fine-tuning LLMs for applications where speed is critical, such as chatbots or coding assistants.
Use this if you want to accelerate the text generation speed of your causal LLMs, especially for tasks like coding or mathematics, without sacrificing output quality.
Not ideal if your workload does not involve LLMs, or if you prioritize maximum generation quality over speed.
Stars: 143
Forks: 6
Language: Python
License: Apache-2.0
Category:
Last pushed: Feb 20, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/hao-ai-lab/JacobiForcing"
Open to everyone: 100 requests/day with no key required; a free key raises the limit to 1,000 requests/day.
Higher-rated alternatives
sgl-project/SpecForge
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
structuredllm/syncode
Efficient and general syntactical decoding for Large Language Models
SafeAILab/EAGLE
Official Implementation of EAGLE-1 (ICML'24), EAGLE-2 (EMNLP'24), and EAGLE-3 (NeurIPS'25).
romsto/Speculative-Decoding
Implementation of the paper Fast Inference from Transformers via Speculative Decoding, Leviathan...
kssteven418/BigLittleDecoder
[NeurIPS'23] Speculative Decoding with Big Little Decoder