derek-byte/multilingual-voice-assistant-llm
cohere labs - aya expedition 2025: integrating speech & audio into aya vision (all in one toolkit for integrating tts/stt) ⚙️
This toolkit helps you integrate speech and audio processing into AI systems like Aya Vision. It takes spoken language (audio) and converts it into text, and also converts text into spoken language. This allows AI to understand voice commands and respond verbally, making interactions more natural and accessible. It's designed for AI developers working on multimodal applications.
No commits in the last 6 months.
Use this if you are developing AI applications that need to process voice input (Speech-to-Text) and generate spoken responses (Text-to-Speech) to enhance user interaction and accessibility.
Not ideal if you are looking for a standalone end-user application for speech translation or voice assistance, as this is a toolkit for developers.
Stars
8
Forks
1
Language
Python
License
Apache-2.0
Category
Last pushed
May 15, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/derek-byte/multilingual-voice-assistant-llm"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
asiff00/On-Device-Speech-to-Speech-Conversational-AI
This is an on-CPU real-time conversational system for two-way speech communication with AI...
VideotronicMaker/LM-Studio-Voice-Conversation
Python app for LM Studio-enhanced voice conversations with local LLMs. Uses Whisper for...
syntithenai/hermod
voice services stack from audio hardware through hotword, ASR, NLU, AI routing and TTS bound by...
bold-ronin/lira
A Voice-First AI Companion
voice-engine/make-a-smart-speaker
A collection of resources to make a smart speaker