eric-ai-lab/MiniGPT-5
Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"
MiniGPT-5 helps you create stories or conversational responses that seamlessly blend both images and text. You provide either text, an image, or a sequence of both, and it generates coherent, contextually relevant images and text together. This is ideal for content creators, marketers, or educators who need to produce engaging multimodal narratives.
863 stars. No commits in the last 6 months.
Use this if you need to generate images and descriptive text in an interleaved fashion for storytelling, content creation, or interactive multimodal conversations.
Not ideal if you primarily need to generate only text or only images without the need for their tight, interleaved integration.
Stars
863
Forks
52
Language
Python
License
Apache-2.0
Category
Last pushed
May 08, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/eric-ai-lab/MiniGPT-5"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
tabularis-ai/be_great
A novel approach for synthesizing tabular data using pretrained large language models
EleutherAI/gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron...
shibing624/textgen
TextGen: Implementation of Text Generation models, include LLaMA, BLOOM, GPT2, BART, T5, SongNet...
ai-forever/ru-gpts
Russian GPT3 models.
AdityaNG/kan-gpt
The PyTorch implementation of Generative Pre-trained Transformers (GPTs) using Kolmogorov-Arnold...