Speculative Decoding Algorithms LLM Tools

Implementations, frameworks, and optimization techniques for speculative decoding that accelerate LLM inference through draft model speculation and verification. Does NOT include general LLM inference optimization, quantization methods, or non-speculative decoding strategies.
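To make the draft-and-verify idea concrete, here is a minimal toy sketch of greedy speculative decoding. It is not taken from any project listed on this page. The names `speculative_step`, `generate`, `draft_next`, and `target_next` are hypothetical, and both "models" are assumed to be plain functions that map a token prefix to the next token. Real systems verify all drafted positions in one batched forward pass and often use probabilistic acceptance; this sketch checks positions one at a time for clarity.

```python
from typing import Callable, List

# Hypothetical interface: a "model" maps a token prefix to its greedy next token.
NextToken = Callable[[List[int]], int]


def speculative_step(prefix: List[int], draft_next: NextToken,
                     target_next: NextToken, k: int = 4) -> List[int]:
    """Run one draft-then-verify round and return the newly accepted tokens."""
    # 1. Draft: the cheap model proposes k tokens autoregressively.
    proposal: List[int] = []
    for _ in range(k):
        proposal.append(draft_next(prefix + proposal))

    # 2. Verify: accept drafted tokens while they match the target's choice.
    accepted: List[int] = []
    for tok in proposal:
        expected = target_next(prefix + accepted)
        if tok != expected:
            # First mismatch: keep the target's token and discard the rest.
            accepted.append(expected)
            return accepted
        accepted.append(tok)

    # All k drafts accepted: the target contributes one extra token.
    accepted.append(target_next(prefix + accepted))
    return accepted


def generate(prefix: List[int], draft_next: NextToken,
             target_next: NextToken, num_tokens: int, k: int = 4) -> List[int]:
    """Generate num_tokens tokens after prefix using repeated speculative steps."""
    out = list(prefix)
    while len(out) - len(prefix) < num_tokens:
        out.extend(speculative_step(out, draft_next, target_next, k))
    return out[: len(prefix) + num_tokens]


# Toy models (assumptions for illustration only).
def target_next(seq: List[int]) -> int:
    return (seq[-1] + 1) % 10


def draft_next(seq: List[int]) -> int:
    # Agrees with the target except after token 2, where it guesses wrong.
    return 0 if seq[-1] == 2 else (seq[-1] + 1) % 10
```

In this greedy variant the output always equals what the target model would produce on its own; the draft model only changes how many target checks are needed per accepted token.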

4 speculative decoding algorithm tools are tracked. The highest-rated is vitali87/speculant-graph, which scores 35/100 and has 9 stars.

Get all 4 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=speculative-decoding-algorithms&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
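The same request can be made from Python. The sketch below uses only the standard library and the endpoint and query parameters shown in the curl command; the helper names `build_url` and `fetch_projects` are hypothetical, and the shape of the JSON response is not documented here, so the result should be inspected before relying on any particular field.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain: str = "llm-tools",
              subcategory: str = "speculative-decoding-algorithms",
              limit: int = 20) -> str:
    """Build the query URL with the same parameters as the curl example."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url: str, timeout: float = 30.0):
    """Fetch the URL and decode the body as JSON.

    The response schema is an unknown here; the decoded value is returned
    as-is rather than assuming any keys.
    """
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_projects(build_url())` performs the network request and counts against the daily request limit mentioned above.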

| # | Tool | Description | Score (/100) | Tier |
|---|------|-------------|--------------|------|
| 1 | vitali87/speculant-graph | Graph drafts, LLM verifies: a novel speculative decoding framework | 35 | Emerging |
| 2 | hsj576/GRIFFIN | Official Implementation of "GRIFFIN: Effective Token Alignment for Faster... | 33 | Emerging |
| 3 | Hambaobao/HCP-Coder | Hierarchical Context Pruning (HCP): A strategy to optimize real-world code... | 31 | Emerging |
| 4 | Geralt-Targaryen/Awesome-Speculative-Decoding | Reading notes on Speculative Decoding papers | 25 | Experimental |