Speculative Decoding Algorithms LLM Tools
Implementations, frameworks, and optimization techniques for speculative decoding that accelerate LLM inference through draft model speculation and verification. Does NOT include general LLM inference optimization, quantization methods, or non-speculative decoding strategies.
There are 4 speculative decoding algorithms tools tracked. The highest-rated is vitali87/speculant-graph at 35/100 with 9 stars.
Get all 4 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=speculative-decoding-algorithms&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 |
vitali87/speculant-graph
Graph drafts, LLM verifies: a novel speculative decoding framework |
|
Emerging |
| 2 |
hsj576/GRIFFIN
Official Implementation of "GRIFFIN: Effective Token Alignment for Faster... |
|
Emerging |
| 3 |
Hambaobao/HCP-Coder
Hierarchical Context Pruning (HCP): A strategy to optimize real-world code... |
|
Emerging |
| 4 |
Geralt-Targaryen/Awesome-Speculative-Decoding
Reading notes on Speculative Decoding papers |
|
Experimental |