Ejb503/multimodal-mcp-client
A multimodal MCP client for voice-powered agentic workflows
This is a tool for developers who want to build advanced, voice-controlled AI applications. It lets you create AI systems that understand speech, text, and visual input, and then respond by voice. Use it to drive complex, multi-step AI workflows through natural speech commands, combining several AI capabilities in one interface.
210 stars. No commits in the last 6 months.
Use this if you are a developer looking to build innovative AI applications that can be controlled entirely through natural voice commands and process diverse inputs like speech and images.
Not ideal if you are an end-user looking for a ready-to-use voice assistant; this is a toolkit for developers to build such systems.
Stars: 210
Forks: 35
Language: TypeScript
License: MIT
Category:
Last pushed: Feb 03, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/mcp/Ejb503/multimodal-mcp-client"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
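The same request can be made from TypeScript. The sketch below is a minimal example, assuming a runtime with a global `fetch` (Node 18+ or a browser). The helper names `qualityUrl` and `fetchQuality` are hypothetical and not part of any published client. The response schema is not documented on this page, so the result is left typed as `unknown`.

```typescript
// Base URL taken from the curl example above.
const API_BASE = "https://pt-edge.onrender.com/api/v1/quality/mcp";

// Hypothetical helper: build the endpoint URL for a repository given as "owner/name".
function qualityUrl(repo: string): string {
  const [owner, name, ...rest] = repo.split("/");
  if (!owner || !name || rest.length > 0) {
    throw new Error(`expected "owner/name", got "${repo}"`);
  }
  return `${API_BASE}/${encodeURIComponent(owner)}/${encodeURIComponent(name)}`;
}

// Hypothetical helper: fetch the record for a repository.
// The response shape is an unknown here, so callers must inspect it themselves.
async function fetchQuality(repo: string): Promise<unknown> {
  const res = await fetch(qualityUrl(repo));
  if (!res.ok) {
    throw new Error(`request failed: HTTP ${res.status}`);
  }
  return res.json();
}
```

Calling `fetchQuality("Ejb503/multimodal-mcp-client")` issues the same GET request as the curl command. Each call counts against the daily request limit stated above, so caching the result is worth considering for anything that runs repeatedly.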
Higher-rated alternatives
DMontgomery40/deepseek-mcp-server
Model Context Protocol server for DeepSeek's advanced language models
upstash/context7
Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors
graphlit/graphlit-mcp-server
Model Context Protocol (MCP) Server for Graphlit Platform
dvcrn/mcp-server-siri-shortcuts
MCP for calling Siri Shortcuts from LLMs
rawveg/ollama-mcp
An MCP Server for Ollama