Manchery/awesome-visual-tokenizer
[WIP🚧] 2025 up-to-date list of resources on visual tokenizers (primarily for visual generation). Give it a star 🌟 if you find it useful.
This is a curated collection of research papers and open-source projects focused on 'visual tokenizers,' which are crucial for advanced image and video generation. It helps researchers and engineers stay up-to-date with the latest techniques and implementations. You'll find academic papers on quantization methods and links to relevant open-source codebases, offering insights into how visual data is broken down and reconstructed for generative AI.
No commits in the last 6 months.
Use this if you are a researcher or engineer working on generative AI models and need a quick overview of key advancements and implementations in visual tokenization.
Not ideal if you are a beginner looking for an introduction to generative AI concepts or if you need a hands-on tutorial for building a visual tokenizer from scratch.
Stars
20
Forks
—
Language
—
License
—
Category
Last pushed
Jan 05, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Manchery/awesome-visual-tokenizer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
limix-ldm-ai/LimiX
LimiX: Unleashing Structured-Data Modeling Capability for Generalist Intelligence...
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
google-research/plur
PLUR (Programming-Language Understanding and Repair) is a collection of source code datasets...
YalaLab/pillar-finetune
Finetuning framework for Pillar medical imaging models.
Cloud-CV/diverse-beam-search
:mag: :shipit: Decoding Diverse Solutions from Neural Sequence Models