howard-hou/VisualRWKV
VisualRWKV is the visual-enhanced version of the RWKV language model, enabling RWKV to handle various visual tasks.
VisualRWKV helps developers integrate visual understanding into language models. It takes images and text as input and produces text responses that interpret the visual information, much like how a human would describe or analyze a picture. This is for machine learning engineers and researchers working on multimodal AI applications.
244 stars.
Use this if you are a machine learning engineer or researcher building a visual language model and want to explore the RWKV architecture for handling image-based tasks.
Not ideal if you are an end-user looking for a ready-to-use application, as this is a foundational model for developers to build upon.
Stars
244
Forks
20
Language
Python
License
Apache-2.0
Category
Last pushed
Jan 13, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/howard-hou/VisualRWKV"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
thu-pacman/chitu
High-performance inference framework for large language models, focusing on efficiency,...
sophgo/LLM-TPU
Run generative AI models in sophgo BM1684X/BM1688
NotPunchnox/rkllama
Ollama alternative for Rockchip NPU: An efficient solution for running AI and Deep learning...
Deep-Spark/DeepSparkHub
DeepSparkHub selects hundreds of application algorithms and models, covering various fields of...
bentoml/llm-inference-handbook
Everything you need to know about LLM inference