Multimodal Streamlit Apps Generative AI Tools

Streamlit applications that integrate multimodal AI capabilities (text, image, vision analysis) with APIs like Gemini, Groq, or Perplexity. Does NOT include standalone image generation, single-modality chatbots, or non-Streamlit implementations.

There are 51 multimodal Streamlit app tools tracked. The highest-rated is stavrostheocharis/auto-streamlit-studio at 39/100 with 18 stars.

Get all 51 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=generative-ai&subcategory=multimodal-streamlit-apps&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
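The same request can be made from a script. The following is a minimal Python sketch, assuming only the endpoint and query parameters shown in the curl command above. The helper names `build_url` and `fetch_projects` are hypothetical, and because the response layout is not described here, the code returns the decoded JSON without assuming any field names. Note that the command as written passes `limit=20`.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint copied from the curl command in this listing.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="generative-ai",
              subcategory="multimodal-streamlit-apps",
              limit=20):
    """Assemble the same query string that the curl command uses.

    Hypothetical helper: only the three parameters seen in the curl
    command are used; no other parameters are assumed to exist.
    """
    query = urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(timeout=30):
    """Request the dataset and return the decoded JSON unchanged.

    Hypothetical helper: the response schema is not documented in this
    listing, so the caller should inspect the result before relying on
    any particular keys.
    """
    with urlopen(build_url(), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects()` performs one request against the keyless quota, so a script that polls the endpoint should stay within the 100 requests/day limit stated above.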

# Tool Score Tier
1 stavrostheocharis/auto-streamlit-studio

AutoStreamlit Studio is an intelligent assistant designed to streamline the...

39
Emerging
2 dhineshaps/fetquest-genai

The FET Quest Model Portfolio Project created to explore the Generative AI...

36
Emerging
3 sitammeur/streamlit-app-builder

A Streamlit-based AI assistant generates custom Streamlit app code from...

35
Emerging
4 grandelli/dfcx-geminiprovision

A Dialogflow CX implementation of a purely deterministic agent (intent-based)...

32
Emerging
5 sitammeur/PicQ

PicQ: Demo for MiniCPM-o 2.6 to answer questions about images using natural language.

31
Emerging
6 bnarasimha21/audio-vision-assistant

Multimodal AI assistant combining audio and vision capabilities for accessibility

30
Emerging
7 Pavansomisetty21/Visual-Question-Answering-using-Gemini-LLM

This project explores visual question answering using Gemini LLM and...

30
Emerging
8 arjunprabhulal/function-calling-gemma3

Demo project showcasing Gemma3 function calling capabilities using Ollama....

30
Emerging
9 AdritPal08/End-To-End-Project-Using-Gemini-Gemify

Create stunning content with LLM, the app that uses Google Gemini’s...

30
Emerging
10 TABREZ-96/Grammer_Guruji

Grammar Guruji is an interactive web application powered by Streamlit,...

29
Experimental
11 codingaslu/Voice-Vision-Assistant-for-Blind

Voice & Vision Assistant for the Blind is an AI-powered assistant that helps...

27
Experimental
12 ThaiMinhLam/SymbolicResoning

Participating in Explainable AI for Educational Question-Answering with...

27
Experimental
13 mananp-2730/AI-VA

Multimodal SaaS Voice Assistant for Spatial BI. Analyzes raw CSVs and...

25
Experimental
14 AILucifer99/Gemini-GenAI-Studio

An implementation of an end-to-end application that will automate multiple...

25
Experimental
15 Eatosin/Retina-UX-Auditor

A Physics-Informed UI/UX Audit Engine. Uses Computer Vision (OpenCV...

24
Experimental
16 Krish-afk-bot/ai-visibility-tracker

Track brand visibility inside AI-generated recommendations from LLMs like...

22
Experimental
17 awakening-ai/OmniResponse

OmniResponse: Online Multimodal Conversational Response Generation in Dyadic...

22
Experimental
18 abubakarsayem/Amazon-KDP-Book-Metadata-Generator-with-LLM

A Streamlit web app that generates Amazon KDP book titles and descriptions...

21
Experimental
19 smaranjitghose/ObjectSightAI

A powerful and intuitive image analysis interface powered by Google's Gemini...

20
Experimental
20 Mukku27/Inventory-Management-Using-GenAI

An intelligent, LLM-powered inventory management system leveraging Google's...

20
Experimental
21 fork123aniket/Multi-Round-VLM-powered-Multimodal-Conversational-AI-Navigation-Bot

Streamlit App Combining Vision, Language, and Audio AI Models

19
Experimental
22 KaiTheRedNinja/GUI-Dog

A digital "guide dog" for the visually impaired

19
Experimental
23 sitammeur/VidiQA

VidiQA: Demo for MiniCPM-V 2.6 to answer questions about videos using...

18
Experimental
24 munas-git/GenAITopicModeling-ResearchTool-2

Enhanced automated topic classification & modeling tool leveraging Google’s...

18
Experimental
25 erroralex/Metadata-Viewer

A JavaFX desktop application for extracting and managing AI image generation...

18
Experimental
26 CarlosKhoury/Golf-Swing-Analyzer

A computer vision web app that analyzes golf swings using MediaPipe and...

17
Experimental
27 loresico/gemma3-vision-demo

Multimodal Q&A demo using Google DeepMind's Gemma 3

17
Experimental
28 ethank2222/TrinityAI

Combines the three most popular LLMs on the market into one Generative AI...

17
Experimental
29 MRamya-sri/Q-A_System-and-Image_Interpretation-using-GEMINI_LLM

Project demonstrates Q/A System and Image Interpretation using GEMINI LLM.

17
Experimental
30 Rahilshah01/multimodal-vision-ai-chat

A high-performance Multimodal AI Chatbot using Gemini 2.0 Flash to perform...

16
Experimental
31 Shishir420-GIT/Automation-Generator

This application allows users to upload an SOP-based PDF, which lets them...

15
Experimental
32 Korosh-Rajaei/Marley-and-Me-pet-adoption-software

Pet description generator and translator software using Dash, Flask,...

14
Experimental
33 guille123giles-cloud/ai-note-digitizer

AI Note Digitizer Pro | Streamlit application that uses Google Gemini...

13
Experimental
34 abderrewakbendaoud/pokeroast

🎮 Analyze Pokemon teams with AI to expose weaknesses and improve strategies,...

13
Experimental
35 Ahmed-Yusuf-1/Vision

A React Native mobile application designed to provide users with an...

13
Experimental
36 ankur-mali/tax-law-reasoning-generator

A Python-based system for generating synthetic tax law cases to evaluate...

13
Experimental
37 bdgaskins27889/cvi-ai-assistant

Generative AI for Community Violence Intervention — Trauma-Informed,...

13
Experimental
38 JocelynVelarde/hack4her-genai-app

Build your first GenAI App using MongoDB, Gemini API and Streamlit

13
Experimental
39 psiba15/gemini-image-storyteller

AI-powered Streamlit app that generates stories and narrated audio from...

13
Experimental
40 Mohshaikh23/Gemini-Pro-LLM-App

An LLM app using the Gemini Pro API

13
Experimental
41 mariamashraf731/VisionPal-Assistive-AI

An AI-powered assistive assistant for the visually impaired. Leverages Llama...

12
Experimental
42 mubasshirahmxd/Gemini-Multimodal-App

Multimodal Gemini AI app with text chat, streaming responses, and...

12
Experimental
43 cunhanina/pokeroast

An AI-powered "Cyber-Bullying" Dashboard that uses GenAI & Data Science to...

12
Experimental
44 BuddyLim/iuys

Intelligently Understanding Your Screenshots

11
Experimental
45 ArchismwanChatterjee/SightSync-Harmony

This is my Buildspace S5 N&W project

11
Experimental
46 kikugo/ISITADINOSAUR

A Streamlit app using Google's Gemini models to analyze images.

11
Experimental
47 malleswarigelli/QA_Information_Retrival_Application

Build Generative AI, custom Question/Answer or Information Retrieval...

11
Experimental
48 scionoftech/functiongemma-finetuning-e-commerce

A comprehensive, production-ready tutorial for fine-tuning Google's...

11
Experimental
49 rugveddarwhekar/reddit_title_generation

This project is a Streamlit-based web application that leverages the Gemini...

11
Experimental
50 kelwynOliveira/Loki-ai-content-generator

AI-powered content assistant built with Streamlit + Gemini. Generate...

10
Experimental
51 sanskaryo/github-roast-app

GitHub roast app with roast levels

10
Experimental