Ejb503/multimodal-mcp-client

A Multi-modal MCP client for voice powered agentic workflows

/ 100

Emerging

This is a tool for developers who want to build advanced, voice-controlled AI applications. It allows you to create AI systems that understand spoken language, text, and visual information, and then respond vocally. You would use this to power complex, multi-step AI workflows through natural speech commands, integrating various AI capabilities into one seamless experience.

210 stars. No commits in the last 6 months.

Use this if you are a developer looking to build innovative AI applications that can be controlled entirely through natural voice commands and process diverse inputs like speech and images.

Not ideal if you are an end-user looking for a ready-to-use voice assistant; this is a toolkit for developers to build such systems.

AI-application-development voice-user-interface multimodal-AI AI-workflow-automation developer-tools

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 19 / 25

How are scores calculated?

Stars

210

Forks

Language

TypeScript

License

MIT

Higher-rated alternatives

DMontgomery40/deepseek-mcp-server

Model Context Protocol server for DeepSeek's advanced language models

upstash/context7

Context7 Platform -- Up-to-date code documentation for LLMs and AI code editors

graphlit/graphlit-mcp-server

Model Context Protocol (MCP) Server for Graphlit Platform

dvcrn/mcp-server-siri-shortcuts

MCP for calling Siri Shorcuts from LLMs

rawveg/ollama-mcp

An MCP Server for Ollama

Explore MCP Servers

All categories Trending MCP Server directory Insights