Multimodal Streamlit Apps Generative AI Tools
Streamlit applications that integrate multimodal AI capabilities (text, image, vision analysis) with APIs like Gemini, Groq, or Perplexity. Does NOT include standalone image generation, single-modality chatbots, or non-Streamlit implementations.
51 multimodal Streamlit app tools are tracked. The highest-rated is stavrostheocharis/auto-streamlit-studio, at 39/100 with 18 stars.
Get all 51 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=generative-ai&subcategory=multimodal-streamlit-apps&limit=20"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
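The same request can be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_url` and `fetch_projects` are hypothetical, the query parameters are copied from the curl example above, and the shape of the returned JSON is not documented here, so the code just decodes whatever the endpoint sends back. Whether the endpoint accepts a `limit` above 20 or supports paging is an assumption left untested.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl example; everything else here is illustrative.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="generative-ai",
              subcategory="multimodal-streamlit-apps",
              limit=20):
    """Build the dataset URL with the same query parameters as the curl call."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(limit=20, timeout=30):
    """Fetch the dataset without an API key and return the decoded JSON payload."""
    with urlopen(build_url(limit=limit), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one keyless request, which counts against the 100 requests/day allowance, so caching the result locally is a sensible default.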
| # | Tool | Score | Tier |
|---|---|---|---|
| 1 | stavrostheocharis/auto-streamlit-studio - AutoStreamlit Studio is an intelligent assistant designed to streamline the... | | Emerging |
| 2 | dhineshaps/fetquest-genai - The FET Quest Model Portfolio Project created to explore the Generative AI... | | Emerging |
| 3 | sitammeur/streamlit-app-builder - A Streamlit-based AI assistant generates custom Streamlit app code from... | | Emerging |
| 4 | grandelli/dfcx-geminiprovision - A Dialogflow CX implementation of a purely deterministic agent (intent-based)... | | Emerging |
| 5 | sitammeur/PicQ - PicQ: Demo for MiniCPM-o 2.6 to answer questions about images using natural language. | | Emerging |
| 6 | bnarasimha21/audio-vision-assistant - Multimodal AI assistant combining audio and vision capabilities for accessibility | | Emerging |
| 7 | Pavansomisetty21/Visual-Question-Answering-using-Gemini-LLM - Explores visual question answering using Gemini LLM and... | | Emerging |
| 8 | arjunprabhulal/function-calling-gemma3 - Demo project showcasing Gemma3 function calling capabilities using Ollama.... | | Emerging |
| 9 | AdritPal08/End-To-End-Project-Using-Gemini-Gemify - Create stunning content with LLM, the app that uses Google Gemini’s... | | Emerging |
| 10 | TABREZ-96/Grammer_Guruji - Grammar Guruji is an interactive web application powered by Streamlit,... | | Experimental |
| 11 | codingaslu/Voice-Vision-Assistant-for-Blind - Voice & Vision Assistant for the Blind is an AI-powered assistant that helps... | | Experimental |
| 12 | ThaiMinhLam/SymbolicResoning - Participating in Explainable AI for Educational Question-Answering with... | | Experimental |
| 13 | mananp-2730/AI-VA - Multimodal SaaS Voice Assistant for Spatial BI. Analyzes raw CSVs and... | | Experimental |
| 14 | AILucifer99/Gemini-GenAI-Studio - An implementation of an end-to-end application that will automate multiple... | | Experimental |
| 15 | Eatosin/Retina-UX-Auditor - A Physics-Informed UI/UX Audit Engine. Uses Computer Vision (OpenCV... | | Experimental |
| 16 | Krish-afk-bot/ai-visibility-tracker - Track brand visibility inside AI-generated recommendations from LLMs like... | | Experimental |
| 17 | awakening-ai/OmniResponse - OmniResponse: Online Multimodal Conversational Response Generation in Dyadic... | | Experimental |
| 18 | abubakarsayem/Amazon-KDP-Book-Metadata-Generator-with-LLM - A Streamlit web app that generates Amazon KDP book titles and descriptions... | | Experimental |
| 19 | smaranjitghose/ObjectSightAI - A powerful and intuitive image analysis interface powered by Google's Gemini... | | Experimental |
| 20 | Mukku27/Inventory-Management-Using-GenAI - An intelligent, LLM-powered inventory management system leveraging Google's... | | Experimental |
| 21 | fork123aniket/Multi-Round-VLM-powered-Multimodal-Conversational-AI-Navigation-Bot - Streamlit App Combining Vision, Language, and Audio AI Models | | Experimental |
| 22 | KaiTheRedNinja/GUI-Dog - A digital "guide dog" for the visually impaired | | Experimental |
| 23 | sitammeur/VidiQA - VidiQA: Demo for MiniCPM-V 2.6 to answer questions about videos using... | | Experimental |
| 24 | munas-git/GenAITopicModeling-ResearchTool-2 - Enhanced automated topic classification & modeling tool leveraging Google’s... | | Experimental |
| 25 | erroralex/Metadata-Viewer - A JavaFX desktop application for extracting and managing AI image generation... | | Experimental |
| 26 | CarlosKhoury/Golf-Swing-Analyzer - A computer vision web app that analyzes golf swings using MediaPipe and... | | Experimental |
| 27 | loresico/gemma3-vision-demo - Multimodal Q&A demo using Google DeepMind's Gemma 3 | | Experimental |
| 28 | ethank2222/TrinityAI - Combines the three most popular LLMs on the market into one Generative AI... | | Experimental |
| 29 | MRamya-sri/Q-A_System-and-Image_Interpretation-using-GEMINI_LLM - Project demonstrates Q/A System and Image Interpretation using GEMINI LLM. | | Experimental |
| 30 | Rahilshah01/multimodal-vision-ai-chat - A high-performance Multimodal AI Chatbot using Gemini 2.0 Flash to perform... | | Experimental |
| 31 | Shishir420-GIT/Automation-Generator - This application allows users to upload an SOP based pdf, which lets them... | | Experimental |
| 32 | Korosh-Rajaei/Marley-and-Me-pet-adoption-software - Pet description generator and translator software using Dash, Flask,... | | Experimental |
| 33 | guille123giles-cloud/ai-note-digitizer - AI Note Digitizer Pro: Streamlit application that uses Google Gemini... | | Experimental |
| 34 | abderrewakbendaoud/pokeroast - 🎮 Analyze Pokemon teams with AI to expose weaknesses and improve strategies,... | | Experimental |
| 35 | Ahmed-Yusuf-1/Vision - A React Native mobile application designed to provide users with an... | | Experimental |
| 36 | ankur-mali/tax-law-reasoning-generator - A Python-based system for generating synthetic tax law cases to evaluate... | | Experimental |
| 37 | bdgaskins27889/cvi-ai-assistant - Generative AI for Community Violence Intervention: Trauma-Informed,... | | Experimental |
| 38 | JocelynVelarde/hack4her-genai-app - Build your first GenAI App using MongoDB, Gemini API and Streamlit | | Experimental |
| 39 | psiba15/gemini-image-storyteller - AI-powered Streamlit app that generates stories and narrated audio from... | | Experimental |
| 40 | Mohshaikh23/Gemini-Pro-LLM-App - An LLM app using Gemini Pro API | | Experimental |
| 41 | mariamashraf731/VisionPal-Assistive-AI - An AI-powered assistive assistant for the visually impaired. Leverages Llama... | | Experimental |
| 42 | mubasshirahmxd/Gemini-Multimodal-App - Multimodal Gemini AI app with text chat, streaming responses, and... | | Experimental |
| 43 | cunhanina/pokeroast - An AI-powered "Cyber-Bullying" Dashboard that uses GenAI & Data Science to... | | Experimental |
| 44 | BuddyLim/iuys - Intelligently Understanding Your Screenshots | | Experimental |
| 45 | ArchismwanChatterjee/SightSync-Harmony - This is my Buildspace S5 N&W project | | Experimental |
| 46 | kikugo/ISITADINOSAUR - A Streamlit app using Google's Gemini models to analyze images. | | Experimental |
| 47 | malleswarigelli/QA_Information_Retrival_Application - Build Generative AI, custom Question/Answer or Information Retrieval... | | Experimental |
| 48 | scionoftech/functiongemma-finetuning-e-commerce - A comprehensive, production-ready tutorial for fine-tuning Google's... | | Experimental |
| 49 | rugveddarwhekar/reddit_title_generation - This project is a Streamlit-based web application that leverages the Gemini... | | Experimental |
| 50 | kelwynOliveira/Loki-ai-content-generator - AI-powered content assistant built with Streamlit + Gemini. Generate... | | Experimental |
| 51 | sanskaryo/github-roast-app - Github roast app with roast levels | | Experimental |