youkpan/gemini-assistant

Google Gemini Voice/Vision Assistant using the gemini-1.5-pro / gemini-1.5-flash models. #Gemini 1.5 Flash #Gemini 1.5 Pro

21 / 100
Experimental

This tool transforms your spoken questions and camera or screen-shared images into intelligent, human-like responses. It's like having an AI assistant that can see and hear what you do, and then talk back to you. Anyone who needs quick, context-aware information or analysis based on visual and verbal input would find this useful.

No commits in the last 6 months.

Use this if you need an interactive AI assistant that understands both what you say and what it sees through your camera or shared screen, and that responds with spoken explanations in real time.

Not ideal if you primarily work with text-only data or need an AI for behind-the-scenes data processing without direct voice and vision interaction.

visual-search voice-assistant interactive-AI real-time-information multimodal-analysis
Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 5 / 25
Maturity 16 / 25
Community 0 / 25


Stars: 11
Forks:
Language: TypeScript
License: GPL-2.0
Last pushed: May 18, 2024
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/youkpan/gemini-assistant"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
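
The same request can be made from TypeScript. The following is a minimal sketch, not an official client: the helper names `buildQualityUrl` and `fetchQuality` are hypothetical, only the base URL and path come from the curl command above, and it assumes the endpoint returns JSON. The response schema is not described on this page, so the body is returned untyped. It also assumes a runtime with a global `fetch` (for example Node 18 or later, or a browser).

```typescript
// Base path taken from the curl example; everything else here is illustrative.
const BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools";

// Build the endpoint URL for an owner/repo pair (hypothetical helper).
function buildQualityUrl(owner: string, repo: string): string {
  return `${BASE_URL}/${encodeURIComponent(owner)}/${encodeURIComponent(repo)}`;
}

// Fetch the quality record (hypothetical helper).
// Assumption: the endpoint responds with JSON. Its fields are not documented
// here, so the parsed body is returned as `unknown` for the caller to inspect.
async function fetchQuality(owner: string, repo: string): Promise<unknown> {
  const response = await fetch(buildQualityUrl(owner, repo));
  if (!response.ok) {
    throw new Error(`Request failed with HTTP ${response.status}`);
  }
  return response.json();
}
```

Calling `fetchQuality("youkpan", "gemini-assistant").then(console.log)` would request the same URL as the curl command. Unauthenticated calls count against the 100 requests/day limit, so caching the result is worth considering for anything that runs repeatedly.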