Hoar012/RAP-MLLM
[CVPR 2025] RAP: Retrieval-Augmented Personalization
This project helps anyone working with image-based AI models to customize their models with specific concepts, objects, or styles. You provide images and descriptions of unique concepts, and the system learns to recognize and generate content related to them. This is ideal for content creators, designers, or anyone needing to personalize an AI's visual understanding.
Use this if you need an AI to understand and generate content about specific, niche visual concepts that aren't widely known.
Not ideal if you're looking for a simple, off-the-shelf image recognition tool for common objects and concepts.
Stars
81
Forks
4
Language
Python
License
—
Category
Last pushed
Nov 23, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/rag/Hoar012/RAP-MLLM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
illuin-tech/colpali
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
AnswerDotAI/byaldi
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
jolibrain/colette
Multimodal RAG to search and interact locally with technical documents of any kind
nannib/nbmultirag
Un framework in Italiano ed Inglese, che permette di chattare con i propri documenti in RAG,...
OpenBMB/VisRAG
Parsing-free RAG supported by VLMs